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Introduction and Preface 



As we enter the new millennium, the prospects for the field of Biomedical Engineering are bright. 
Individuals interested in pursuing careers in this field continue to increase and the fruits of medical 
innovation continue to yield both monetary rewards and patient well being. These trends are reflected 
in this second edition of the Biomedical Engineering Handbook. When compared to the first edition 
published in 1995, this new two-volume set includes new sections on "Transport Phenomena and 
Biomimetic Systems" and "Ethical Issues Associated with Medical Technology". In addition, over 60% of 
the chapters has been completely revised, incorporating the latest developments in the field, therefore, 
this second edition is truly an updated version of the "state-of-the-field of biomedical engineering". As 
such, it can serve as an excellent reference for individuals interested not only in a review of fundamental 
physiology, but also in quickly being brought up to speed in certain areas of biomedical engineering 
research. It can serve as an excellent textbook for students in areas where traditional textbooks have not 
yet been developed, and serve as an excellent review of the major areas of activity in each biomedical 
engineering subdiscipline, such as biomechanics biomaterials, clinical engineering, artificial intelligence, 
etc., and finally it can serve as the "bible" for practicing biomedical engineering professionals by covering 
such topics as a "Historical Perspective of Medical Technology, the Role of Professional Societies and the 
Ethical Issues Associated with Medical Technology". 

Biomedical Engineering is no longer an emerging discipline; it has become an important vital inter- 
disciplinary field. Biomedical engineers are involved in many medical ventures. They are involved in the 
design, development and utilization of materials, devices (such as pacemakers, lithotripsy, etc.) and 
techniques (such as signal processing, artificial intelligence, etc.) for clinical research and use; and serve 
as members of the health care delivery team (clinical engineering, medical informatics, rehabilitation 
engineering, etc.) seeking new solutions for difficult heath care problems confronting our society. To 
meet the needs of this diverse body of biomedical engineers, this handbook provides a central core of 
knowledge in those fields encompassed by the discipline of biomedical engineering as we enter the 21st 
century. Before presenting this detailed information, however, it is important to provide a sense of the 
evolution of the modern health care system and identify the diverse activities biomedical engineers 
perform to assist in the diagnosis and treatment of patients. 

Evolution of the Modern Health Care System 

Before 1900, medicine had little to offer the average citizen, since its resources consisted mainly of the 
physician, his education, and his "little black bag." In general, physicians seemed to be in short supply, 
but the shortage had rather different causes than the current crisis in the availability of health care 
professionals. Although the costs of obtaining medical training were relatively low, the demand for 
doctors' services also was very small, since many of the services provided by the physician also could be 
obtained from experienced amateurs in the community. The home was typically the site for treatment 
and recuperation, and relatives and neighbors constituted an able and willing nursing staff. Babies were 
delivered by midwives, and those illnesses not cured by home remedies were left to run their natural, 
albeit frequently fatal, course. The contrast with contemporary health care practices, in which specialized 
physicians and nurses located within the hospital provide critical diagnostic and treatment services, is 
dramatic. 
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The changes that have occurred within medical science originated in the rapid developments that took 
place in the applied sciences (chemistry, physics, engineering, microbiology, physiology, pharmacology, 
etc.) at the turn of the century. This process of development was characterized by intense interdisciplinary 
cross-fertilization, which provided an environment in which medical research was able to take giant 
strides in developing techniques for the diagnosis and treatment of disease. For example, in 1903, Willem 
Einthoven, the Dutch physiologist, devised the first electrocardiograph to measure the electrical activity 
of the heart. In applying discoveries in the physical sciences to the analysis of biologic process, he initiated 
a new age in both cardiovascular medicine and electrical measurement techniques. 

New discoveries in medical sciences followed one another like intermediates in a chain reaction. 
However, the most significant innovation for clinical medicine was the development of x-rays. These 
"new kinds of rays," as their discoverer W. K. Roentgen described them in 1895, opened the "inner man" 
to medical inspection. Initially, x-rays were used to diagnose bone fractures and dislocations, and in the 
process, x-ray machines became commonplace in most urban hospitals. Separate departments of radi- 
ology were established, and their influence spread to other departments throughout the hospital. By the 
1930s, x-ray visualization of practically all organ systems of the body had been made possible through 
the use of barium salts and a wide variety of radiopaque materials. 

X-ray technology gave physicians a powerful tool that, for the first time, permitted accurate diagnosis 
of a wide variety of diseases and injuries. Moreover, since x-ray machines were too cumbersome and 
expensive for local doctors and clinics, they had to be placed in health care centers or hospitals. Once 
there, x-ray technology essentially triggered the transformation of the hospital from a passive receptacle 
for the sick to an active curative institution for all members of society. 

For economic reasons, the centralization of health care services became essential because of many 
other important technological innovations appearing on the medical scene. However, hospitals remained 
institutions to dread, and it was not until the introduction of sulfanilamide in the mid- 1 930s and penicillin 
in the early 1940s that the main danger of hospitalization, i.e., cross-infection among patients, was 
significantly reduced. With these new drugs in their arsenals, surgeons were able to perform their 
operations without prohibitive morbidity and mortality due to infection. Furthermore, even though the 
different blood groups and their incompatibility were discovered in 1900 and sodium citrate was used 
in 1913 to prevent clotting, full development of blood banks was not practical until the 1930s, when 
technology provided adequate refrigeration. Until that time, "fresh" donors were bled and the blood 
transfused while it was still warm. 

Once these surgical suites were established, the employment of specifically designed pieces of medical 
technology assisted in further advancing the development of complex surgical procedures. For example, 
the Drinker respirator was introduced in 1927 and the first heart-lung bypass in 1939. By the 1940s, 
medical procedures heavily dependent on medical technology, such as cardiac catheterization and angio- 
graphy (the use of a cannula threaded through an arm vein and into the heart with the injection of 
radiopaque dye for the x-ray visualization of lung and heart vessels and valves), were developed. As a 
result, accurate diagnosis of congenital and acquired heart disease (mainly valve disorders due to rheu- 
matic fever) became possible, and a new era of cardiac and vascular surgery was established. 

Following World War II, technological advances were spurred on by efforts to develop superior weapon 
systems and establish habitats in space and on the ocean floor. As a by-product of these efforts, the 
development of medical devices accelerated and the medical profession benefited greatly from this rapid 
surge of "technological finds." Consider the following examples: 

1. Advances in solid-state electronics made it possible to map the subtle behavior of the fundamental 
unit of the central nervous system — the neuron — as well as to monitor various physiologic 
parameters, such as the electrocardiogram, of patients in intensive care units. 

2. New prosthetic devices became a goal of engineers involved in providing the disabled with tools 
to improve their quality of life. 
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3. Nuclear medicine — an outgrowth of the atomic age — emerged as a powerful and effective 
approach in detecting and treating specific physiologic abnormalities. 

4. Diagnostic ultrasound based on sonar technology became so widely accepted that ultrasonic 
studies are now part of the routine diagnostic workup in many medical specialties. 

5. "Spare parts" surgery also became commonplace. Technologists were encouraged to provide car- 
diac assist devices, such as artificial heart valves and artificial blood vessels, and the artificial heart 
program was launched to develop a replacement for a defective or diseased human heart. 

6. Advances in materials have made the development of disposable medical devices, such as needles 
and thermometers, as well as implantable drug delivery systems, a reality. 

7. Computers similar to those developed to control the flight plans of the Apollo capsule were used 
to store, process, and cross-check medical records, to monitor patient status in intensive care units, 
and to provide sophisticated statistical diagnoses of potential diseases correlated with specific sets 
of patient symptoms. 

8. Development of the first computer-based medical instrument, the computerized axial tomography 
scanner, revolutionized clinical approaches to noninvasive diagnostic imaging procedures, which 
now include magnetic resonance imaging and positron emission tomography as well. 

The impact of these discoveries and many others has been profound. The health care system consisting 
primarily of the "horse and buggy" physician is gone forever, replaced by a technologically sophisticated 
clinical staff operating primarily in "modern" hospitals designed to accommodate the new medical 
technology. This evolutionary process continues, with advances in biotechnology and tissue engineering 
altering the very nature of the health care delivery system itself. 

The Field of Biomedical Engineering 

Today, many of the problems confronting health professionals are of extreme interest to engineers because 
they involve the design and practical application of medical devices and systems — processes that are 
fundamental to engineering practice. These medically related design problems can range from very 
complex large-scale constructs, such as the design and implementation of automated clinical laboratories, 
multiphasic screening facilities (i.e., centers that permit many clinical tests to be conducted), and hospital 
information systems, to the creation of relatively small and "simple" devices, such as recording electrodes 
and biosensors, that may be used to monitor the activity of specific physiologic processes in either a 
research or clinical setting. They encompass the many complexities of remote monitoring and telemetry, 
including the requirements of emergency vehicles, operating rooms, and intensive care units. The Amer- 
ican health care system, therefore, encompasses many problems that represent challenges to certain 
members of the engineering profession called biomedical engineers. 

Biomedical Engineering: A Definition 

Although what is included in the field of biomedical engineering is considered by many to be quite clear, 
there are some disagreements about its definition. For example, consider the terms biomedical engineering, 
bioengineering, and clinical (or medical) engineering which have been defined in Pacela's Bioengineering 
Education Directory [Quest Publishing Co., 1990]. While Pacela defines bioengineering as the broad 
umbrella term used to describe this entire field, bioengineering is usually defined as a basic research-ori- 
ented activity closely related to biotechnology and genetic engineering, i.e., the modification of animal 
or plant cells, or parts of cells, to improve plants or animals or to develop new microorganisms for 
beneficial ends. In the food industry, for example, this has meant the improvement of strains of yeast 
for fermentation. In agriculture, bioengineers may be concerned with the improvement of crop yields 
by treatment of plants with organisms to reduce frost damage. It is clear that bioengineers of the future 



© 2000 by CRC Press LLC 



will have a tremendous impact on the quality of human life. The potential of this specialty is difficult to 
imagine. Consider the following activities of bioengineers: 

• Development of improved species of plants and animals for food production 

• Invention of new medical diagnostic tests for diseases 

• Production of synthetic vaccines from clone cells 

• Bioenvironmental engineering to protect human, animal, and plant life from toxicants and pol- 
lutants 

• Study of protein-surface interactions 

• Modeling of the growth kinetics of yeast and hybridoma cells 

• Research in immobilized enzyme technology 

• Development of therapeutic proteins and monoclonal antibodies 

In reviewing the above-mentioned terms, however, biomedical engineering appears to have the most 
comprehensive meaning. Biomedical engineers apply electrical, mechanical, chemical, optical, and other 
engineering principles to understand, modify, or control biologic (i.e., human and animal) systems, as 
well as design and manufacture products that can monitor physiologic functions and assist in the 
diagnosis and treatment of patients. When biomedical engineers work within a hospital or clinic, they 
are more properly called clinical engineers. 

Activities of Biomedical Engineers 

The breadth of activity of biomedical engineers is significant. The field has moved significantly from 
being concerned primarily with the development of medical devices in the 1950s and 1960s to include 
a more wide-ranging set of activities. As illustrated below, the field of biomedical engineering now 
includes many new career areas, each of which is presented in this Handbook. These areas include: 

• Application of engineering system analysis (physiologic modeling, simulation, and control) to 
biologic problems 

• Detection, measurement, and monitoring of physiologic signals (i.e., biosensors and biomedical 
instrumentation) 

• Diagnostic interpretation via signal-processing techniques of bioelectric data 

• Therapeutic and rehabilitation procedures and devices (rehabilitation engineering) 

• Devices for replacement or augmentation of bodily functions {artificial organs) 

• Computer analysis of patient-related data and clinical decision making (i.e., medical informatics 
and artificial intelligence) 

• Medical imaging, i.e., the graphic display of anatomic detail or physiologic function 

• The creation of new biologic products (i.e., biotechnology and tissue engineering) 

Typical pursuits of biomedical engineers, therefore, include: 

• Research in new materials for implanted artificial organs 

• Development of new diagnostic instruments for blood analysis 

• Computer modeling of the function of the human heart 

• Writing software for analysis of medical research data 

• Analysis of medical device hazards for safety and efficacy 

• Development of new diagnostic imaging systems 

• Design of telemetry systems for patient monitoring 

• Design of biomedical sensors for measurement of human physiologic systems variables 
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• Development of expert systems for diagnosis of disease 

• Design of closed-loop control systems for drug administration 

• Modeling of the physiologic systems of the human body 

• Design of instrumentation for sports medicine 

• Development of new dental materials 

• Design of communication aids for the handicapped 

• Study of pulmonary fluid dynamics 

• Study of the biomechanics of the human body 

• Development of material to be used as replacement for human skin 

Biomedical engineering, then, is an interdisciplinary branch of engineering that ranges from theoret- 
ical, nonexperimental undertakings to state-of-the-art applications. It can encompass research, develop- 
ment, implementation, and operation. Accordingly, like medical practice itself, it is unlikely that any 
single person can acquire expertise that encompasses the entire field. Yet, because of the interdisciplinary 
nature of this activity, there is considerable interplay and overlapping of interest and effort between them. 
For example, biomedical engineers engaged in the development of biosensors may interact with those 
interested in prosthetic devices to develop a means to detect and use the same bioelectric signal to power 
a prosthetic device. Those engaged in automating the clinical chemistry laboratory may collaborate with 
those developing expert systems to assist clinicians in making decisions based on specific laboratory data. 
The possibilities are endless. 

Perhaps a greater potential benefit occurring from the use of biomedical engineering is identification 
of the problems and needs of our present health care system that can be solved using existing engineering 
technology and systems methodology. Consequently, the field of biomedical engineering offers hope in 
the continuing battle to provide high-quality health care at a reasonable cost; if properly directed toward 
solving problems related to preventive medical approaches, ambulatory care services, and the like, bio- 
medical engineers can provide the tools and techniques to make our health care system more effective 
and efficient. 



Joseph D. Bronzino 

Editor-in-Chief 
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Editor-in-Chief 



Joseph D. Bronzino, Ph.D., P.E., Vernon Roosa Professor of Applied Science at Trinity College, 
Hartford, Connecticut, and director of the Biomedical Engineering Alliance for Connecticut (BEACON), 
teaches graduate and undergraduate courses in biomedical engineering in the fields of clinical engineer- 
ing, electrophysiology, signal analysis, and computer applications in medicine. He earned his B.S. in 
electrical engineering from Worcester Polytechnic Institute, M.S. in electrical engineering from the Naval 
Postgraduate School, and Ph.D. in electrical engineering also from Worcester Polytechnic Institute. Deeply 
concerned with the discipline of biomedical engineering, as well as with ethical and economic issues 
related to the application of technology to the delivery of health care, Dr. Bronzino has written and 
lectured internationally. He is the author of over 200 articles and 8 books: Technology for Patient Care 
(C.V. Mosby, 1977), Computer Applications for Patient Care (Addison-Wesley, 1982), Biomedical Engi- 
neering: Basic Concepts and Instrumentation (PWS Publishing Co., 1986), Expert Systems: Basic Concepts 
and Applications (Research Foundation of the State University of New York, 1989), Medical Technology 
and Society: An Interdisciplinary Perspective (MIT Press, 1990), Management of Medical Technology (But- 
terworth/Heinemann, 1992), The Biomedical Engineering Handbook (CRC Press, 1995), The Introduction 
to Biomedical Engineering (Academic Press, 1999), and The Biomedical Engineering Handbook, 2nd Edition 
(CRC Press, 2000). Dr. Bronzino is a fellow of both the Institute of Electrical and Electronic Engineers 
(IEEE) and the American Institute of Medical and Biological Engineering (AIMBE), a past president of 
the IEEE Engineering in Medicine and Biology Society (EMBS), a past chairman of the Biomedical 
Engineering Division of the American Society for Engineering Education, and a charter member of the 
American College of Clinical Engineering (ACCE) and the Connecticut Academy of Science and Engi- 
neering (CASE). Dr. Bronzino has extensive experience in the formulation of public policy regarding the 
utilization and regulation of medical technology. He has served as a past chairman of both the IEEE 
Health Care Engineering Policy Committee (HCEPC) and the IEEE Technical Policy Council in Wash- 
ington, D.C. 



© 2000 by CRC Press LLC 



Advisory Board 



Jean Louis Coatrieux 

Universite de Rennes I 
Rennes, France 

Dov Jaron 

Drexel University 
Philadelphia, Pennsylvania 

Swamy Laxminarayan 

New Jersey Institute of Technology 
Newark, New Jersey 

Karen M. Mudry 

Formerly of The Whitaker Foundation 
Washington, D.C. 

Michael R. Neuman 

Joint Program in Biomedical Engineering 
The University of Memphis and 

University of Tennessee 
Memphis, Tennessee 



Banu Onaral 

Drexel University 
Philadelphia, Pennsylvania 

Robert Plonsey 

Duke University 
Durham, North Carolina 

Alfred R. Potvin 

MEECO 
Sarasota, Florida 

Charles J. Robinson 

Louisiana Tech University 
Ruston, Louisiana 

Daniel J. Schneck 

Virginia Polytechnic Institute and 

State University 
Blacksburg, Virginia 

John Webster 

University of Wisconsin 
Madison, Wisconsin 



© 2000 by CRC Press LLC 



Contributors 



Joseph Adam 

Premise Development Corporation 
Hartford, Connecticut 

Kai-Nan An 

Biomechanics Laboratory 
Mayo Clinic 
Rochester, Minnesota 

Gary J. Baker 

Stanford University 
Stanford, California 

D. C. Barber 

University of Sheffield 
Sheffield, United Kingdom 

Berj L. Bardakjian 

University of Toronto 
Toronto, Canada 

Roger C. Barr 

Duke University 
Durham, North Carolina 

Khosrow Behbehani 

The University of Texas at Arlington 
and The University of Texas, 
Southwestern Medical Center at 
Dallas 

Arlington, Texas 

Edward J. Berbari 

Indiana University/Purdue 

University at Indianapolis 
Indianapolis, Indiana 

Anna M. Bianchi 

St. Raffaele Hospital 
Milan, Italy 

W. G. Billotte 

University of Dayton 
Dayton, Ohio 

Bruce R. Bowman 

EdenTec Corporation 
Eden Prairie, Minnesota 



G. Faye Boudreaux-Bartels 

University of Rhode Island 
Kingston, Rhode Island 

Joseph D. Bronzino 

Trinity College/The Biomedical 

Engineering Alliance for 

Connecticut (BEACON) 
Hartford, Connecticut 

Richard R Buck 

University of North Carolina 
Chapel Hill, North Carolina 

Thomas R Budinger 

University of California 
Berkeley, California 

K. J. L. Burg 

Carolinas Medical Center 
Charlotte, North Carolina 

Thomas J. Burkholder 

The Georgia Institute of Technology 
Atlanta, Georgia 

Robert D. Butterfield 

IVAC Corporation 
San Diego, California 

Joseph P. Cammarota 

Naval Air Warfare Center, Aircraft 

Division 
Warminster, Pennsylvania 

Thomas R. Canfield 

Argonne National Laboratory 
Argonne, Illinois 

Ewart R. Carson 

City University 

London, United Kingdom 

Sergio Cerutti 

Polytechnic University 
Milan, Italy 



A. Enis Cetin 

Bilkent University 
Ankara, Turkey 

K. B. Chandran 

University of Iowa 
Iowa City, Iowa 

Wei Chen 

Center for Magnetic Resonance 
Research and the University of 
Minnesota Medical School 

Minneapolis, Minnesota 

David A. Chesler 

Massachusetts General Hospital and 
Harvard University Medical 
School 

Boston, Massachusetts 

C. K. Chou 

Motorola, Inc. 
Plantation, Florida 

Chih-Chang Chu 

Cornell University 
Ithaca, New York 

Ben M. Clopton 

University of Washington 
Seattle, Washington 

Robin Coger 

Massachusetts General Hospital, 
Harvard University Medical 
School, and the Shriners Burns 
Institutes 

Cambridge, Massachusetts 

Arnon Cohen 

Ben-Gurion University 
Beer Sheva, Israel 

Steven Conolly 

Stanford University 
Stanford, California 



© 2000 by CRC Press LLC 



Derek G. Cramp 

City University 

London, United Kingdom 

Cristina Cristalli 

Instrumentation Laboratory 
Milano, Italy 

Barbara Y. Croft 

National Institutes of Health 
Kensington, Maryland 

Ian A. Cunningham 

Victoria Hospital, the John P. 
Robarts Research Institute, and 
the University of Western Ontario 

London, Canada 

Roy B. Davis 

Motion Analysis Laboratory 
Shriners Hospitals for Children 
Greenville, South Carolina 

Peter A. DeLuea 

Gait Analysis Laboratory 
Connecticut Children's Medical 

Center 
Hartford, Connecticut 

Philip B. Dobrin 

Hines VA Hospital and Loyola 

University Medical Center 
Hines, Illinois 

Cathryn R. Dooly 

University of Maryland 
College Park, Maryland 

Gary Drzewiecki 

Rutgers State University 
Piscataway, New Jersey 

Edwin G. Duffin 

Medtronic, Inc. 
Minneapolis, Minnesota 

Dominique M. Durand 

Case Western Reserve University 
Cleveland, Ohio 

Jeffrey L. Eggleston 

Valleylab, Inc. 
Boulder, Colorado 

Jeffrey T. Ellis 

Georgia Institute of Technology 
Atlanta, Georgia 



Edward Elson 

Walter Reed Army Institute of 

Research 
Washington, D.C. 

K. Whittaker Ferrara 

Riverside Research Institute 
New York, New York 

Ross Flewelling 

Nellcor Incorporation 
Pleasant, California 

Michael Forde 

Medtronic, Inc. 
Minneapolis, Minnesota 

Kenneth R. Foster 

University of Pennsylvania 
Philadelphia, Pennsylvania 

Michael J. Furey 

Virginia Polytechnic Institute and 

State University 
Blacksburg, Virginia 

Leslie A. Geddes 

Purdue University 
West Lafayette, Indiana 

V. K. Goel 

University of Iowa 
Iowa City, Iowa 

Richard L. Goldberg 

University of North Carolina 
Chapel Hill, North Carolina 

Boris Gramatikov 

Johns Hopkins School of Medicine 
Baltimore, Maryland 

Wallace Grant 

Virginia Polytechnic Institute and 

State University 
Blacksburg, Virginia 

Walter Greenleaf 

Greenleaf Medical 
Palo Alto, California 

Alan R. Hargen 

University of California, San Diego 
and NASA Ames Research Center 
San Diego, California 

Kaj-Age Henneberg 

University of Montreal 
Montreal, Canada 



Craig S. Henriquez 

Duke University 
Durham, North Carolina 

Robert M. Hochmuth 

Duke University 
Durham, North Carolina 

Xiaoping Hu 

Center for Magnetic Resonance 
Research and the University of 
Minnesota Medical School 

Minneapolis, Minnesota 

Bernard F. Hurley 

University of Maryland 
College Park, Maryland 

Arthur T. Johnson 

University of Maryland 
College Park, Maryland 

Christopher R. Johnson 

University of Utah 
Salt Lake City, Utah 

G. Allan Johnson 

Duke University Medical Center 
Durham, North Carolina 

Millard M. Judy 

Baylor Research Institute 
Dallas, Texas 

Philip F. Judy 

Brigham and Women's Hospital and 

Harvard Medical School 
Boston, Massachusetts 

J. Lawrence Katz 

Case Western Reserve University 
Cleveland, Ohio 

Kenton R. Kaufman 

Biomechanics Laboratory, Mayo 

Clinic 
Rochester, Minnesota 

J. C. Keller 

University of Iowa 
Iowa City, Iowa 

Gils on Khang 

Chonbuk National University 
Chonju, Korea 

Albert I. King 

Wayne State University 
Detroit, Michigan 



© 2000 by CRC Press LLC 



Young Kon Kim 

Inje University 
Kyungnam, Korea 

Hayrettin Koymen 

Bilkent University 
Ankara, Turkey 

Kenneth K. Kwong 

Massachusetts General Hospital and 
Harvard University Medical 
School 

Boston, Massachusetts 

Roderic Lakes 

University of Wisconsin-Madison 
Madison, Wisconsin 

Christopher G. Lausted 

University of Maryland 
College Park, Maryland 

Hai Bang Lee 

Korea Research Institute of Chemical 

Technology 
Chungnam, Korea 

Jin Ho Lee 

Hannam University 
Taejon, Korea 

Jack D. Lemmon 

Georgia Institute of Technology 
Atlanta, Georgia 

Shu-Tung Li 

Collagen Matrix, Inc. 

Baruch B. Lieber 

State University of New York at 

Buffalo 
Buffalo, New York 

Richard L. Lieber 

University of California and Veterans 
Administration Medical Centers 
San Diego, California 

Chung-Chiun Liu 

Electronics Design Center and 

Edison sensor Technology Center 
Case Western Reserve University 
Cleveland, Ohio 

A. Llinas 

Pontificia Universidad Javeriana 
Bogota, Colombia 



Albert Macovski 

Stanford University 
Stanford, California 



Luca T. Mainardi 

Polytechnic University 
Milan, Italy 

Jaakko Malmivuo 

Ragnar Granit Institute, Tampere 

University of Technology 
Tampere, Finland 

Angelo Manzoni 

Instrumentation Laboratory 
Milano, Italy 

Andrew D. McCulloch 

University of California 
San Diego, California 

Yitzhak Mendelson 

Worcester Polytechnic Institute 
Worcester, Massachusetts 

Evangelia Micheli- 
Tzanakou 

Rutgers University 
Piscataway, New Jersey 

Jack G. Mottley 

University of Rochester 
Rochester, New York 

Karen M. Mudry 

Formerly of The Whitaker 

Foundation 
Washington, D.C. 

Robin Murray 

University of Rhode Island 
Kingston, Rhode Island 

Joachim H. Nagel 

University of Stuttgart 
Stuttgart, Germany 

Michael R. Neuman 

Joint Program in Biomedical 

Engineering 
The University of Memphis and 

University of Tennessee 
Memphis, Tennessee 

Banu Onaral 

Drexel University 
Philadelphia, Pennsylvania 



Sylvia Ounpuu 

Gait Analysis Laboratory 
Connecticut Children's Medical 

Center 
Hartford, Connecticut 

Joon B. Park 

University of Iowa 
Iowa City, Iowa 

S.-H. Park 

University of Southern California 
Los Angeles, California 

Maqbool Patel 

Center for Magnetic Resonance 
Research and the University of 
Minnesota Medical School 

Minneapolis, Minnesota 

Robert Patterson 

University of Minnesota 
Minneapolis, Minnesota 

A. William Paulsen 

Emory University 
Atlanta, Georgia 

John Pauly 

Stanford University 
Stanford, California 

P. Hunter Peckham 

Case Western Reserve University and 

Veterans Affairs Medical Center 
Cleveland, Ohio 

Athina P. Petropulu 

Drexel University 
Philadelphia, Pennsylvania 

Tom Piantanida 

Greenleaf Medical 
Palo Alto, California 

Roland N. Pittman 

Virginia Commonwealth University 
Richmond, Virginia 

Robert Plonsey 

Duke University 
Durham, North Carolina 

Charles Polk 

University of Rhode Island 
Kingston, Rhode Island 



© 2000 by CRC Press LLC 



Aleksander S. Popel 

Johns Hopkins University 
Baltimore, Maryland 

Rulong Ren 

City of Hope National Medical 

Center 
Duarte, California 

Pat Ridgely 

Medtronic, Inc. 
Minneapolis, Minnesota 

Richard L. Roa 

Baylor University Medical Center 
Dallas, Texas 

Eric Rosow 

Hartford Hospital and Premise 

Development Corporation 
Hartford, Connecticut 

Bradley J. Roth 

Oakland University 
Rochester, Michigan 

Carl F. Rothe 

Indiana University 
Indianapolis, Indiana 

John Schenck 

General Electric Corporate Research 

and Development Center 
Schenectady, New York 

Geert W. Schmid- 
Schonbein 

University of California 
San Diego, California 

Daniel J. Schneck 

Virginia Polytechnic Institute and 

State University 
Blacksburg, Virginia 

Edward Schuck 

EdenTec Corporation 
Eden Prairie, Minnesota 

S. W. Shalaby 

Poly-Med, Inc. 
Anderson, South Carolina 

David Sherman 

Johns Hopkins School of Medicine 
Baltimore, Maryland 



Artin A. Shoukas 

Johns Hopkins University 
Baltimore, Maryland 



Robert E. Shroy, Jr. 

Picker International 
Highland Heights, Ohio 

Stephen W. Smith 

Duke University 
Durham, North Carolina 

Francis A. Spelman 

University of Washington 
Seattle, Washington 

Charles R. Steele 

Stanford University 
Stanford, California 

George Stetten 

Duke University 
Durham, North Carolina 

Primoz Strojnik 

Case Western Reserve University 
Cleveland, Ohio 

Maria A. Stuchly 

University of Victoria 
Victoria, Canada 

Willis A. Tacker 

Purdue University 
West Lafayette, Indiana 

Nitish V. Thakor 

Johns Hopkins School of Medicine 
Baltimore, Maryland 

Jason A. Tolomeo 

Stanford University 
Stanford, California 

Mehmet Toner 

Massachusetts General Hospital, 
Harvard University Medical 
School, and the Shriners Burns 
Institutes 

Cambridge, Massachusetts 

Benjamin M.W. Tsui 

University of North Carolina 
Chapel Hill, North Carolina 



Kamil Ugurbil 

Center for Magnetic Resonance 
Research and the University of 
Minnesota Medical School 

Minneapolis, Minnesota 

Henry F. VanBrocklin 

University of California 
Berkeley, California 

Michael S. Van Lysel 

University of Wisconsin 
Madison, Wisconsin 

Anthony Varghese 

University of Minnesota 
Minneapolis, Minnesota 

David C. Viano 

Wayne State University 
Detroit, Michigan 

Wolf W. von Maltzahn 

Whitaker Foundation 
Washington, D.C. 

Gregory I. Voss 

IVAC Corporation 
San Diego, California 

Richard E. Waugh 

University of Rochester 
Rochester, New York 

James C. Weaver 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 

Martin J. Yaffe 

University of Toronto 
Toronto, Canada 

A jit P. Yoganathan 

Georgia Institute of Technology 
Atlanta, Georgia 

Deborah E. Zetes-Tolomeo 

Stanford University 
Stanford, California 

Xiaohong Zhou 

Duke University Medical Center 
Durham, North Carolina 



© 2000 by CRC Press LLC 



Contributors 



James J. Abbas 

University of Kentucky 
Lexington, Kentucky 

Joseph Adam 

Premise Development Corporation 
Hartford, Connecticut 

Patrick Aebischer 

Lausanne University Medical School 
Lausanne, Switzerland 

Robert C. Allen 

Emory University 
Atlanta, Georgia 

John G. Aunins 

Merck Research Laboratories 
Rahway, New Jersey 

Dennis D. Autio 

Dybonics, Inc. 
Portland, Oregon 

James W. Baish 

Bucknell University 
Lewisburg, Pennsylvania 

Pamela J. Hoyes Beehler 

University of Texas at Arlington 
Arlington, Texas 

Ravi Bellamkonda 

Lausanne University Medical School 
Lausanne, Switzerland 

Jan E. W. Beneken 

Eindhoven University of Technology 
Eindhoven, the Netherlands 

Francois Berthiaume 

Surgical Services, Massachusetts 
General Hospital, Harvard 
Medical School, and the Shriners 
Hospital for Children 

Cambridge, Massachusetts 

Jeffrey S. Blair 

IBM Health Care Solutions 
Atlanta, Georgia 



Joseph D. Bronzino 

Trinity College/Biomedical 
Engineering Alliance for 
Connecticut (BEACON) 

Hartford, Connecticut 

Susan V. Brooks 

University of Michigan 
Ann Arbor, Michigan 

Mark E. Bruley 

ECRI 

Plymouth Meeting, Pennsylvania 

Ewart R. Carson 

City University 

London, United Kingdom 

Andrea Caumo 

San Raffaele Scientific Institute 
Milan, Italy 

Joseph A. Chinn 

Sulzer Carbomedics 
Austin, Texas 

Vivian H. Coates 

ECRI 

Plymouth Meeting, Pennsylvania 

Claudio Cobelli 

University of Padova 
Padua, Italy 

Clark K. Colton 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 

Rory A. Cooper 

University of Pittsburgh, VA 

Pittsburgh Health Care System 
Pittsburgh, Pennsylvania 



Derek G. Cramp 

City University 

London, United Kingdom 



Yadin David 

Texas Children's Hospital 
Houston, Texas 

Benoit M. Dawant 

Vanderbilt University 
Nashville, Tennessee 

Paul D. Drumheller 

Gore Hybrid Technologies, Inc. 
Flagstaff, Arizona 

G. M. Drzewiecki 

Rutgers University 
Piscataway, New Jersey 

Karen A. Duca 

University of Wisconsin 
Madison, Wisconsin 

Graham A. Dunn 

King's College 

London, United Kingdom 

John Denis Enderle 

University of Connecticut 
Storrs, Connecticut 

John A. Faulkner 

University of Michigan 
Ann Arbor, Michigan 

Stanley M. Finkelstein 

University of Minnesota 
Minneapolis, Minnesota 

Robert J. Fisher 

University of Connecticut 
Storrs, Connecticut 

J. Michael Fitzmaurice 

Department of Health and Human 

Services 
Rockville, Maryland 

L.E. Freed 

Harvard University 
Cambridge, Massachusetts 



© 2000 by CRC Press LLC 



Catherine Garbay 

Laboratoire TIMC/IMAG 
Grenoble, France 

Leslie A. Geddes 

Purdue University 
West Lafayette, Indiana 

John W. Goethe 

The Institute of Living 
Hartford, Connecticut 

Michael L. Gullikson 

Texas Children's Hospital 
Houston, Texas 

Katya Hill 

Edinboro University of Pennsylvania 

and University of Pittsburgh 
Edinboro, Pennsylvania 

Douglas Hobson 

University of Pittsburgh 
Pittsburgh, Pennsylvania 

Jeffrey A. Hubbell 

California Institute of Technology 
Pasadena, California 

H. David Humes 

University of Michigan 
Ann Arbor, Michigan 

Sheik N. Imrhan 

University of Texas at Arlington 
Arlington, Texas 

Marcos Intaglietta 

University of California 
La Jolla, California 

Michel Jaffrin 

Universite de Technologie de 

Compiegne 
Compiegne, France 

Hugo O. Jauregui 

Rhode Island Hospital 
Providence, Rhode Island 

Stephen B. Johnson 

Columbia University 
New York, New York 

Richard D. Jones 

Christchurch Hospital and 

University of Otago 
Christchurch, New Zealand 



Craig T. Jordan 

Somatix Therapy Corp. 
Alameda, California 

Thomas M. Judd 

Kaiser Permanente 
Atlanta, Georgia 

Kurt A. Kaczmarek 

University of Wisconsin at Madison 
Madison, Wisconsin 

Robert Kaiser 

University of Washington 
Seattle, Washington 

Peter L.M. Kerkhof 

Medwise Working Group 
Maarssen, the Netherlands 

Tao Ho Kim 

Harvard University and Boston 

Children's Hospital 
Boston, Massachusetts 

Manfred R. Roller 

Oncosis 

San Diego, California 

George V. Kondraske 

University of Texas at Arlington, 
Human Performance Institute 
Arlington, Texas 

David N. Ku 

Georgia Institute of Technology 
Atlanta, Georgia 

Casimir Kulikowski 

Rutgers University 
Piscataway, New Jersey 

Luis G. Kun 

CIMIC/Rutgers University 
New Brunswick, New Jersey 

Robert S. Langer 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 

Douglas A. Lauffenburger 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 

Swamy Laxminarayan 

New Jersey Institute of Technology 
Newark, New Jersey 



Joseph M. Le Doux 

Center for Engineering in Medicine, 
and Surgical Services, 
Massachusetts General Hospital, 
Harvard Medical School, and the 
Shriners Burns Hospital 

Cambridge, Massachusetts 

Ann L. Lee 

Merck Research Laboratories 
Rahway, New Jersey 

John K-J. Li 

Rutgers University 
Piscataway, New Jersey 

E.N. Lightfoot 

University of Wisconsin 
Madison, Wisconsin 

Michael W. Long 

University of Michigan 
Ann Arbor, Michigan 

Marilyn Lord 

King's College 

London, United Kingdom 

Michael J. Lysaght 

Brown University 
Providence, Rhode Island 

Vasilis Z. Marmarelis 

University of Southern California 
Los Angeles, California 

Kenneth J. Maxwell 

Departure Technology 
Fort Worth, Texas 

Joseph P. McClain 

Walter Reed Army Medical Center 
Washington, D.C. 

Larry V. Mclntire 

Rice University 
Houston, Texas 

Evangelia Micheli- 
Tzanakou 

Rutgers University 
Piscataway, New Jersey 

David J. Mooney 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 



© 2000 by CRC Press LLC 



John Moran 

Vasca, Inc. 

Tewksbury, Massachusetts 

Jeffrey R. Morgan 

Center for Engineering in Medicine, 

and Surgical Services, 

Massachusetts General Hospital, 

Harvard Medical School, and the 

Shriners Burns Hospital 
Cambridge, Massachusetts 

Robert L. Morris 

Dybonics, Inc. 
Portland, Oregon 

Tatsuo Nakamura 

Kyoto University 
Kyoto, Japan 

Brian A. Naughton 

Advanced Tissue Sciences, Inc. 
La Jolla, California 

Robert M. Nerem 

Georgia Institute of Technology 
Atlanta, Georgia 

A. Noordergraaf 

University of Pennsylvania 
Philadelphia, Pennsylvania 

Patrick R. Norris 

Vanderbilt University 
Nashville, Tennessee 

Pirkko Nykanen 

VIT Information Technology 
Tampere, Finland 

Joseph L. Palladino 

Trinity College 
Hartford, Connecticut 

Bernhard 0. Palsson 

University of Michigan 
Ann Arbor, Michigan 

Mohamad Parnianpour 

Ohio State University 
Columbus, Ohio 

Charles W. Patrick, Jr. 

Rice University 
Houston, Texas 

Chi-Sang Poon 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 



Dejan B. Popovic 

University of Belgrade 
Belgrade, Yugoslavia 

T. Allan Pryor 

University of Utah 
Salt Lake City, Utah 

Yuchen Qiu 

The Pennsylvania State University 
University Park, Pennsylvania 

Gerard Reach 

Hopital Hotel-Dieu Paris 
Paris, France 

Charles J. Robinson 

Louisiana Tech University 

Ruston, Louisiana 

The Overton Brooks VA Medical 

Center 
Shreveport, Louisiana 

Barry Romich 

Prentke Romich Company 
Wooster, Ohio 

Gerson Rosenberg 

Pennsylvania State University 
Hershey, Pennsylvania 

Eric Rosow 

Hartford Hospital 
Hartford, Connecticut 

Charles M. Roth 

Center for Engineering in Medicine, 
Massachusetts General Hospital, 
Harvard Medical School, and the 
Shriners Burns Hospital 

Cambridge, Massachusetts 

Alan J. Russell 

University of Pittsburgh 
Pittsburgh, Pennsylvania 

Maria Pia Saccomani 

University of Padova 
Padua, Italy 

Pamela S. Saha 

Clemson University 
Clemson, South Carolina 

Subrata Saha 

Clemson University 
Clemson, South Carolina 

W. Mark Saltzman 

Cornell University 
Ithaca, New York 



Rangarajan Sampath 

Rice University 
Houston, Texas 

Niilo Saranummi 

VIT Information Technology 
Tampere, Finland 

Soumitra Sengupta 

Columbia University 
New York, New York 

Yasuhiko Shimizu 

Kyoto University 
Kyoto, Japan 

Michael L. Shuler 

Cornell University 
Ithaca, New York 

Srolamtj Simdara 

Rutgers University 
Piscataway, New Jersey 

Steven M. Slack 

The University of Memphis 
Memphis, Tennessee 

Susan S. Smith 

Texas Woman's University 
Dallas, Texas 

Giovanni Sparacino 

University of Padova 
Padua, Italy 

Ron Summers 

Loughborough University 
Leicestershire, United Kingdom 

Srikanth Sundaram 

Rutgers University 
Piscataway, New Jersey 

Karl Syndulko 

UCLA School of Medicine 
Los Angeles, California 

John M. Tarbell 

The Pennsylvania State University 
University Park, Pennsylvania 

William D. Timmons 

University of Akron 
Akron, Ohio 

Gianna Maria Toffolo 

University of Padova 
Padua, Italy 



© 2000 by CRC Press LLC 



Mehmet Toner 

Massachusetts General Hospital, 
Harvard Medical School, and the 
Shriners Burns Hospital 

Cambridge, Massachusetts 

Elaine Trefler 

University of Pittsburgh 
Pittsburgh, Pennsylvania 

Alan Turner-Smith 

King's College 

London, United Kingdom 

Shiro Usai 

Toyohashi University of Technology 
Toyohashi, Japan 

Joseph P. Vacanti 

Harvard University and Boston 

Children's Hospital 
Boston, Massachusetts 

Robert F. Valentini 

Brown University 
Providence, Rhode Island 

Gary Van Zant 

University of Kentucky Medical 

Center 
Lexington, Kentucky 

Gregg Vanderheiden 

University of Wisconsin 
Madison, Wisconsin 



Paul J. Vasta 

University of Texas at Arlington 
Arlington, Texas 

Chenzhao Vierheller 

University of Pittsburgh 
Pittsburgh, Pennsylvania 

David B. Volkin 

Merck Research Laboratories 
Rahway, New Jersey 

G. Vunjak-Novakovic 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 

Alvin Wald 

Columbia University 
New York, New York 

S. Patrick Walton 

Center for Engineering in Medicine, 
Massachusetts General Hospital, 
Harvard Medical School, and the 
Shriners Burns Hospital 

Cambridge, Massachusetts 

Robert M. Winslow 

SANGART, Inc. 
La Jolla, California 

Daniel E. Wueste 

Clemson University 
Clemson, South Carolina 



Ioannis V. Yannas 

Massachusetts Institute of 

Technology 
Cambridge, Massachusetts 

David M. Yarmush 

Rutgers University 
Piscataway, New Jersey 

Martin L. Yarmush 

Massachusetts General Hospital, 
Harvard Medical School, and the 
Shriners Burns Hospital 

Cambridge, Massachusetts 

A jit P. Yoganathan 

Georgia Institute of Technology 
Atlanta, Georgia 

Daniel A. Zahner 

Rutgers University 
Piscataway, New Jersey 

Craig Zupke 

Massachusetts General Hospital and 

the Shriners Burns Institute 
Cambridge, Massachusetts 

Andrew L. Zydney 

University of Delaware 
Newark, Delaware 



© 2000 by CRC Press LLC 



Contents 



SECTION I Physiologic Systems 



Introduction Robert Plonsey 

1 An Outline of Cardiovascular Structure and Function Daniel J. Schneck 

2 Endocrine System Derek G. Cramp, Ewart R. Carson 

3 Nervous System Evangelia Micheli-Tzanakou 

4 Vision System George Stetten 

5 Auditory System Ben M. Clapton, Francis A. Spelman 

6 The Gastrointestinal System Berj L. Bardakjian 

7 Respiratory System Arthur T. Johnson, Christopher G. Lausted, 
Joseph D. Bronzino 

Historical Perspectives 1: Cardiac Pacing — Historical Highlights 

Leslie A. Geddes 

SECTION II Bioelectric Phenomena 

Introduction Craig S. Henriquez 

8 Basic Electrophysiology Roger C. Barr 

9 Volume Conductor Theory Robert Plonsey 

10 The Electrical Conductivity of Tissues Bradley J. Roth 

1 1 Membrane Models Anthony Varghese 

12 Numerical Methods for Bioelectric Field Problems Christopher R. Johnson 

13 Principles of Electrocardiography Edward J. Berbari 

14 Principles of Electromyography Kaj-Age Henneberg 



» 2000 by CRC Press LLC 



15 Principles of Electroencephalography Joseph D. Bronzino 

16 Biomagnetism Jaakko Malmivuo 

17 Electric Stimulation of Excitable Tissue Dominique M. Durand 

Section III Biomechanics 

Introduction Daniel J. Schneck 

18 Mechanics of Hard Tissue /. Lawrence Katz 

19 Mechanics of Blood Vessels Thomas R. Canfield, Philip B. Dobrin 

20 Joint-Articulating Surface Motion Kenton R. Kaufman, Kai-Nan An 

21 Joint Lubrication Michael J. Furey 

22 Musculoskeletal Soft Tissue Mechanics Richard L. Lieber, 
Thomas J. Burkholder 

23 Mechanics of Head/Neck Albert I. King, David C. Viano 

24 Biomechanics of Chest and Abdomen Impact David C. Viano, 
Albert I. King, 

25 Analysis of Gait Roy B. Davis, Peter A. DeLuca, Sylvia Ounpuu 

26 Exercise Physiology Arthur T Johnson, Cathryn R. Dooly 

27 Factors Affecting Mechanical Work in Humans Arthur T Johnson, 
Bernard F. Hurley 

28 Cardiac Biodynamics Andrew D. McCulloch 

29 Heart Valve Dynamics Aijt P. Yoganathan, Jack D. Lemmon, 
Jeffrey T Ellis 

30 Arterial Macrocirculatory Hemodynamics Baruch B. Lieber 

31 Mechanics and Transport in the Microcirculation Aleksander S. Pope 
Roland N. Pittman 

32 Mechanics and Deformability of Hematocytes Richard E. Waugh, 
Robert M. Hochmuth 

33 The Venous System Artin A. Shoukas, Carl F. Rothe 



© 2000 by CRC Press LLC 



34 Mechanics of Tissue/Lymphatic Transport Alan R. Hargen, 
Geert W. Schmid-Schonbein 

35 Cochlear Mechanics Charles R. Steele, Gary J. Baker, Jason A. Tolomeo, 
Deborah E. Zetes-Tolomeo 

36 Vestibular Mechanics Wallace Grant 

Section IV Biomaterials 

Introduction Joon B. Park 

37 Metallic Biomaterials Joon B. Park, Young Kon Kim 

38 Ceramic Biomaterials W. G. Billotte 

39 Polymeric Biomaterials Hai Bang Lee, Gilson Khang, Jin Ho Lee 

40 Composite Biomaterials Roderic Lakes 

41 Biodegradable Polymeric Biomaterials: An Updated Overview 

Chih-Chang Chu 

42 Biologic Biomaterials: Tissue-Derived Biomaterials (Collagen) 
Shu-Tung Li 

43 Soft Tissue Replacements 

43.1 Blood Interfacing Implants K. B. Chandran 

43.2 Non-Blood-Interfacing Implants for Soft Tissues K.J.L Burg, S. W. Shalaby 

44 Hard Tissue Replacements 

44.1 Bone Repair and Joint Implants S-H. Park, A. Llinds, V. K. Goel 

44.2 Dental Implants: The Relationship of Materials Characteristics to Biologic Properties 
/. C. Keller 

45 Preservation Techniques for Biomaterials Robin Coger, Mehmet Toner 

46 Hip Joint Prosthesis Fixation — Problems and Possible Solutions 
Joon B. Park 

Section V Biomedical Sensors 

Introduction Michael R. Neuman 

47 Physical Measurements Michael R. Neuman 

48 Biopotential Electrodes Michael R. Neuman 



© 2000 by CRC Press LLC 



49 Electrochemical Sensors Chung-Chiun Liu 

50 Optical Sensors Yitzhak Mendelson 

51 Bioanalytic Sensors Richard P. Buck 

Historical Perspectives 2: The Elecrocardiograph Leslie A. Geddes 

Section VI Biomedical Signal Analysis 

Introduction Banu Onaral 

52 Biomedical Signals: Origin and Dynamic Characteristics; Frequency-Domain 
Analysis Arnon Cohen 

53 Digital Biomedical Signal Acquisition and Processing Luca T. Mainardi, 
Anna M. Bianchi, Sergio Cerutti 

54 Compression of Digital Biomedical Signals A. Enis Cetin, 
Hayrettin Koymen 

55 Time-Frequency Signal Representations for Biomedical Signals 

G. Faye Boudreaux-Bartels, Robin Murray 

56 Wavelet (Time-Scale) Analysis in Biomedical Signal Processing 
Nitish V. Thakor, Boris Gramatikov, David Sherman 

57 Higher-Order Spectral Analysis Athina P. Petropulu 

5 8 Neural Networks in Biomedical Signal Processing 
Evangelia Micheli-Tzanakou 

59 Complexity, Scaling, and Fractals in Biomedical Signals Banu Onaral, 
Joseph P. Cammarota 

60 Future Directions: Biomedical Signal Processing and Networked Multimedia 
Communications Banu Onaral 

Section VII Imaging 

Introduction Karen M. Mudry 

61 X-Ray 

61.1 X-Ray Equipment Robert E. Shroy, Jr. 

61.2 X-Ray Projection Angiography Michael S. Van Lysel 

61.3 Mammography Martin J . Yaffe 



© 2000 by CRC Press LLC 



62 Computed Tomography 

62.1 Instrumentation Ian A. Cunningham 

62.2 Reconstruction Principles Philip F. Judy 

63 Magnetic Resonance Imaging 

63.1 Acquisition and Processing Steven Conolly, Albert Macovski, John Pauly 

63.2 Hardware/Instrumentation John Schenck 

63.3 Functional MRI Kenneth K. Kwong, David A. Chesler 

63A Chemical-Shift Imaging: An Introduction to Its Theory and Practice Xiaoping Hu, Wei Chen, 
Maqbool Patel, Kamil Ugurbil 

64 Nuclear Medicine 

64.1 Instrumentation Barbara Y. Croft 

64.2 SPECT (Single-Photon Emission Computed Tomography) Benjamin M. W. Tsui 

65 Ultrasound 

65.1 Transducers Richard L. Goldberg, Stephen W. Smith 

65.2 Ultrasonic Imaging Jack G. Mottley 

65.3 Blood Flow Measurement Using Ultrasound K. Whittaker Perrara 

66 Magnetic Resonance Microscopy Xiaohong Zhou, G. Allan Johnson 

67 Positron-Emission Tomography (PET) 

67.1 Radiopharmaceuticals Thomas E. Budinger, Henry F. Van Brocklin 

67.2 Instrumentation Thomas E. Budinger 

68 Electrical Impedance Tomography D. C. Barber 

69 Medical Applications of Virtual Reality Technology Walter Greenleaf, 
Tom Piantanida 

Section VIII Medical Instruments and Devices 



Introduction Wolf W. von Maltzahn 

70 Biopotential Amplifiers Joachim H. Nagel 

71 Noninvasive Arterial Blood Pressure and Mechanics Gary Drzewiecki 

72 Cardiac Output Measurement Leslie A. Geddes 

73 Bioelectric Impedance Measurements Robert Patterson 

74 Respiration Leslie A. Geddes 

75 Clinical Laboratory: Separation and Spectral Methods Richard L. Roa 



© 2000 by CRC Press LLC 



76 Clinical Laboratory: Nonspectral Methods and Automation 
Richard L. Roa 

77 Implantable Cardiac Pacemakers Michael Forde, Pat Ridgely 

78 Implantable Stimulators for Neuromuscular Control Primoz Strojnik, 
P. Hunter Peckham, 

79 External Defibrillators Willis A. Packer 

80 Implantable Defibrillators Edwin G. Duffin 

81 Electrosurgical Devices Jeffrey L. Eggleston, WolfW. von Maltzahn 

82 Mechanical Ventilation Khosrow Behbehani 

83 Parenteral Infusion Devices Gregory I. Voss, Robert D. Butterfield 

84 Essentials of Anesthesia Delivery A. William Paulsen 

85 Biomedical Lasers Millard M. Judy 

86 Noninvasive Optical Monitoring Ross Flewelling 

87 Medical Instruments and Devices Used in the Home Bruce R. Bowman, 
Edward Schuck 

88 Virtual Instrumentation Eric Rosow, Joseph Adam 

Historical Perspectives 3: Recording of Action Potentials Leslie A. Geddes 

Section IX Biologic Effects of Nonionizing Electromagnetic Fields 

Introduction Charles Polk 

89 Dielectric Properties of Tissues Kenneth R. Foster 

90 Low-Frequency Magnetic Fields: Dosimetry, Cellular, and Animal Effects 
Maria A. Stuchly 

91 Therapeutic Applications of Low-Frequency Sinusoidal and Pulsed Electric 
and Magnetic Fields Charles Polk 

92 Biologic Effects of Radiofrequency and Microwave Fields: In Vivo and 
In Vitro Experimental Results Edward Elson 



© 2000 by CRC Press LLC 



93 Radio Frequency Hyperthermia in Cancer Therapy C. K. Chou, 
Rulong Ren 

94 Electroporation of Cells and Tissues James C. Weaver 

Appendix A Basics of Blood Gas Instrumentation Christina Cristalli, 
Angelo Manzoni 



© 2000 by CRC Press LLC 



Contents 



SECTION X Transport Phenomena and Biomimetic Systems 

Introduction Robert J. Fisher 

95 Biomimetic Systems Robert J. Fisher 

96 Diffusional Processes and Engineering Design E. N. Lightfoot 

97 Animal Surrogate Systems Michael L. Shuler 

98 Microvascular Heat Transfer James W. Baish. 

99 Interstitial Transport in the Brain: Principles for Local Drug Delivery 

W. Mark Saltzman 

100 Arterial Wall Mass Transport: The Possible Role of Blood Phase 
Resistance in the Localization of Arterial Disease John M. Tarbell, 
Yuchen Qiu 

SECTION XI Biotechnology 

Introduction Martin L. Yarmush, Mehmet Toner 

101 Protein Engineering Alan]. Russell, Chenzhao Vierheller 

102 Monoclonal Antibodies and Their Engineered Fragments 

Srikanth Sundaram, David M. Yarmush 

103 Antisense Technology S. Patrick Walton, Charles M. Roth, 
Martin L. Yarmush 

104 Tools for Genome Analysis Robert Kaiser 

105 Vaccine Production John G. Aunins, Ann L. Lee, David B. Volkin 

106 Gene Therapy Joseph M. Le Doux, Jeffrey R. Morgan, Martin L. Yarmush 

107 Cell Engineering Douglas A. Lauffenburger 

108 Metabolic Engineering Craig Zupke 



© 2000 by CRC Press LLC 



SECTION XII Tissue Engineering 



Introduction Bernhard 0. Palsson, Jeffrey A. Hubbell 

109 Tissue Engineering Francois Berthiaume, Martin L. Yarmush 

110 Surface Immobilization of Adhesion Ligands for Investigations of 
Cell-Substrate Interactions Paul D. Drumheller, Jeffrey A. Hubbell 

111 Biomaterials: Protein-Surface Interactions 

Joseph A. Chinn, Steven M. Slack 

112 Engineering Biomaterials for Tissue Engineering: The 10-100 Micron Size 
Scale David J. Mooney, Robert S. hanger 

113 Regeneration Templates Ioannis V. Yannas 

114 Fluid Shear Stress Effects on Cellular Function Charles W. Patrick, Jr., 
Rangarajan Sampath, Larry V. Mclntire 

115 The Roles of Mass Transfer in Tissue Function Edwin N. Lightfoot, 
Karen A. Duca 

116 The Biology of Stem Cells Craig T. Jordan, Gary Van Zant 

117 Cell Motility and Tissue Architecture Graham A. Dunn 

118 Tissue Microenvironments Michael W. Long 

119 The Importance of Stromal Cells Brian A. Naughton 

120 Tissue Engineering of Bone Marrow Manfred R. Roller, 
Bernhard 0. Palsson 

121 Tissue Engineering of the Liver Tae Ho Kim, Joseph P. Vacanti 

122 Tissue Engineering in the Nervous System Ravi Bellamkonda, 
Patrick Aebischer 

123 Tissue Engineering of Skeletal Muscle Susan V. Brooks, 
John A. Faulkner 

124 Tissue Engineering of Cartilage L.E. Freed, G. Vunjak-Novakovic 

125 Tissue Engineering of the Kidney H. David Humes 



© 2000 by CRC Press LLC 



SECTION XIII Prostheses and Artificial Organs 

Introduction Pierre M. Galletti, Robert M. Nerem 

126 Artificial Heart and Circulatory Assist Devices Gerson Rosenberg 

127 Cardiac Valve Prostheses Ajit P. Yoganathan 

128 Vascular Grafts David N. Ku, Robert C. Allen 

129 Artificial Lungs and Blood-Gas Exchange Devices Pierre M. Galletti, 
Clark K. Colton 

130 Artificial Kidney Pierre M. Galletti, Clark K. Colton, Michael f. Lysagh 

131 Peritoneal Dialysis Equipment Michael J. Lysaght, John Moran 

132 Therapeutic Apheresis and Blood Fractionation Andrew L. Zydney 

133 Liver Support Systems Pierre M. Galletti, Hugo O. fauregui 

134 Artificial Pancreas Pierre M. Galletti, Clark K. Colton, Michel Jaffrin, 
Gerard Reach. 

135 Nerve Guidance Channels Robert F. Valentini 

136 Tracheal, Laryngeal, and Esophageal Replacement Devices 

Tatsuo Nakamura, Yasuhiko Shimizu 

137 Artificial Blood Marcos Intaglietta, Robert M. Winslow 

138 Artificial Skin and Dermal Equivalents Ioannis V. Yannas 

SECTION XIV Rehabiliation Engineering 

Introduction Charles }. Robinson 

139 Rehabilitation Engineering, Science, and Technology 
Charles J. Robinson 

140 Orthopedic Prosthetics and Orthotics in Rehabilitation Marilyn Lord, 
Alan Turner-Smith 

141 Wheeled Mobility: Wheelchairs and Personal Transportation 

Rory A. Cooper 



© 2000 by CRC Press LLC 



142 Externally Powered and Controlled Orthotics and Prosthetics 
Dejan B. Popovic 

143 Sensory Augmentation and Substitution Kurt A. Kaczmarek 

144 Augmentative and Alternative Communication Barry Romich, 
Gregg Vanderheiden, Katya Hill 

145 Measurement Tools and Processes in Rehabilitation Engineering 
George V. Kondraske 

146 Rehabilitation Engineering Technologies: Principles of Application 

Douglas Hobson, Elaine Trefler 

Historical Perspectives 4: Electromyography Leslie A. Geddes 

SECTION XV Human Performance Engineering 

Introduction George V. Kondraske 

147 A Working Model for Human System-Task Interfaces 

George V. Kondraske 

148 Measurement of Neuromuscular Performance Capacities 

Susan S. Smith 

149 Measurement of Sensory-Motor Control Performance Capacities: 
Tracking Tasks Richard D. Jones 

150 Measurement of Information-Processing Performance Capacities 
George V. Kondraske, Paul J. Vasta 

151 High-Level Task Analysis: Mental Components Kenneth J. Maxwell 

152 Task Analysis and Decomposition: Physical Components 

Sheik N. Imrhan 

153 Human-Computer Interface Design Issues Kenneth J. Maxwell 

154 Applications of Human Performance Measurements to Clinical Trials to 
Determine Therapy Effectiveness and Safety Pamela J. Hoyes Beehler, 
Karl Syndulko 

155 Applications of Quantitative Assessment of Human Performance in 
Occupational Medicine Mohamad Pamianpour 

156 Human Performance Engineering: Computer-Based Design and Analysis 
Tools Paul J. Vasta, George V. Kondraske 



© 2000 by CRC Press LLC 



157 Human Performance Engineering: Challenges and Prospects for the Future 

George V. Kondraske 

SECTION XVI Physiological Modeling, Simulation, and Control 

Introduction Joseph L. Palladino 

158 Modeling Strategies in Physiology /. L. Palladino, G. M. Drzewiecki, 
A. Noordergraaf 

159 Compartmental Models of Physiologic Systems Claudia Cobelli, 
Giovanni Sparacino, Andrea Caumo, Maria Pia Saccomani, 

Gianna Maria Toffolo 

160 Cardiovascular Models and Control William D. Timmons 

161 Respiratory Models and Control Chi-Sang Poon 

162 Neural Networks for Physiological Control James J. Abbas 

163 Methods and Tools for Identification of Physiologic Systems 
Vasilis Z. Marmarelis 

164 Autoregulating Windkessel Dynamics May Cause Low Frequency 
Oscillations Gary Drzewiecki, John K-J. Li, Abraham Noordergraaf 

165 Control of Movements Dejan B. Popovic 

166 The Fast Eye Movement Control System John Denis Enderle 

SECTION XVII Clinical Engineering 

Introduction Yadin David 

167 Clinical Engineering: Evolution of a Discipline Joseph D. Bronzino 

168 Management and Assessment of Medical Technology Yadin David, 
Thomas M. Judd 

169 Risk Factors, Safety, and Management of Medical Equipment 

Michael L. Gullikson 

170 Clinical Engineering Program Indicators Dennis D. Autio, 
Robert L. Morris 

171 Quality of Improvement and Team Building Joseph P. McClain 



© 2000 by CRC Press LLC 



172 A Standards Primer for Clinical Engineers Alvin Wald 

173 Regulatory and Assessment Agencies Mark E. Bruley, Vivian H. Coates 

174 Applications of Virtual Instruments in Health Care Eric Rosow, 
Joseph Adam 

SECTION XVIII Medical Informatics 

Introduction Luis G. Kun 

175 Hospital Information Systems: Their Function and State 

X. Allan Pryor 

176 Computer-Based Patient Records J. Michael Fitzmaurice 

177 Computer Networks in Health Care Soumitra Sengupta 

178 Overview of Standards Related to the Emerging Health Care Information 
Infrastructure Jeffrey S. Blair 

179 Non-AI Decision Making Ron Summers, Derek G. Cramp, 
Ewart R. Carson 

180 Design Issues in Developing Clinical Decision Support and Monitoring 
Systems John W. Goethe, Joseph D. Bronzino 

SECTION XIX Artificial Intelligence 

Introduction Stanley M. Finkelstein 

181 Artificial Intelligence in Medical Decision Making: History, Evolution, and 
Prospects Casimir A. Kulikowski 

182 Artificial Neural Networks: Definitions, Methods, Applications 

Daniel A. Zahner, Evangelia Micheli-Tzanakou 

183 Clinical Decision Systems Pirkko Nykdnen, NHlo Saranummi 

184 Expert Systems: Methods and Tools Ron Summers, Derek Cramp, 
Ewart R. Carson 

185 Knowledge Acquisition and Representation Catherine Garbay 

186 Knowledge-Based Systems for Intelligent Patient Monitoring and 
Management in Critical Care Environments Benoit M. Dawant, 
Patrick R. Norris 



© 2000 by CRC Press LLC 



187 Medical Terminology and Diagnosis Using Knowledge Bases 

Peter L. M. Kerkhof 

188 Natural Language Processing in Biomedicine Stephen B. Johnson 
Historical Perspectives 5: Electroencephalography Leslie A. Geddes 

SECTION XX Ethical Issues Associated with the Use of 
Medical Technology 

Introduction Subrata Saha, Joseph D. Bronzino 

189 Professional Ethics in Biomedical Engineering Daniel E.Wues 



le 



190 Beneficence, Nonmaleficence, and Technological Progress 
Joseph D. Bronzino 

191 Ethical Issues of Animal and Human Experimentation in the Development 
of Medical Devices Subrata Saha, Pamela S. Saha 

192 Regulation of Medical Device Innovation Joseph D. Bronzino 

Appendix A The Role of Professional Societies in Biomedical Engineering 

Swamy Laxminarayan, Joseph D. Bronzino, Jan E.W. Beneken, Shiro Usai, 
Richard D. Jones 



© 2000 by CRC Press LLC 



Robert Plonsey. "Physiologic Systems." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



FRONTAL LOBE 



SUPERIOR TEMPORAL 
GYRUS 



CUTSURFACEOF 
INSULA 

ANTERIOR 

I 

TRANSVERSE 
TEMPORAL 

GYRUS 
(HESCHL'S) 

I 

POSTERIOR 




OCCIPITAL LOBE 



A view of the human cerebral cortex showing the underlying auditory cortex. 



© 2000 by CRC Press LLC 



Physiologic 
Systems 



Robert Plonsey 

Duke University 



1 An Outline of Cardiovascular Structure and Function Daniel J. Schneck 

The Working Fluid: Blood • The Pumping Station: The Heart • The Piping Network: Blood 
Vessels • Cardiovascular Control 

2 Endocrine System Derek G. Cramp, Ewart R. Carson 

Endocrine System: Hormones, Signals, and Communication Between Cells and 
Tissues • Hormone Action at the Cell Level: Signal Recognition, Signal Transduction, and 
Effecting a Physiological Response • Endocrine System: Some Other Aspects of Regulation 
and Control 

3 Nervous System Evangelia Micheli-Tzanakou 

Definitions • Functions of the Nervous System • Representation of Information in the 
Nervous System • Lateral Inhibition • Higher Functions of the Nervous System 

4 Vision System George Stetten 

Fundamentals of Vision Research • A Modular View of the Vision System 

5 Auditory System Ben M. Clopton, Francis A. Spelman 

Physical and Psychological Variables • The Peripheral Auditory System • The Central 
Auditory System • Pathologies • Models of Auditory Function 

6 The Gastrointestinal System Berj L. Bardakjian 

gastrointestinal Electrical Oscillations • A Historical Perspective • The Stomach • The 
Small Intestine • The Colon • Epilogue 

7 Respiratory System Arthur T. Johnson, Christopher G. Lausted, 
Joseph D. Bronzino 

Respiration Anatomy • Lung Volumes and Gas Exchange • Perfusion of the Lung • Gas 
Partial Pressures • Pulmonary Mechanics • Respiratory Control • The Pulmonary 
Function Laboratory 



THE CONTENT OF THIS HANDBOOK is devoted to the subject of biomedical engineering. We 
understand biomedical engineering to involve the application of engineering science and tech- 
nology to problems arising in medicine and biology. In principle, the intersection of each engi- 
neering discipline (i.e., electrical, mechanical, chemical, etc.) with each discipline in medicine (i.e., 
cardiology, pathology, neurology, etc.) or biology (i.e., biochemistry, pharmacology, molecular biology, 
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cell biology, etc.) is a potential area of biomedical engineering application. As such, the discipline of 
biomedical engineering is potentially very extensive. However, at least to date, only a few of the afore- 
mentioned "intersections" contain active areas of research and/or development. The most significant of 
these are described in this Handbook. 

While the application of engineering expertise to the life sciences requires an obvious knowledge of 
contemporary technical theory and its applications, it also demands an adequate knowledge and under- 
standing of relevant medicine and biology. It has been argued that the most challenging part of finding 
engineering solutions to problems lies in the formulation of the solution in engineering terms. In 
biomedical engineering, this usually demands a full understanding of the life science substrates as well 
as the quantitative methodologies. 

This section is devoted to an overview of the major physiologic systems of current interest to biomedical 
engineers, on which their work is based. The overview may contain useful definitions, tables of basic 
physiologic data, and an introduction to the literature. Obviously these chapters must be extremely brief. 
However, our goal is an introduction that may enable the reader to clarify some item of interest or to 
indicate a way to pursue further information. Possibly the reader will find the greatest value in the 
references to more extensive literature. 

This section contains seven chapters, and these describe each of the major organ systems of the human 
body. Thus we have chapters describing the cardiovascular, endocrine, nervous, visual, auditory, gas- 
trointestinal, and respiratory systems. While each author is writing at an introductory and tutorial level, 
the audience is assumed to have some technical expertise, and consequently, mathematical descriptions 
are not avoided. All authors are recognized as experts on the system which they describe, but all are also 
biomedical engineers. 

The authors in this section noted that they would have liked more space but recognized that the main 
focus of this Handbook is on "engineering." Hopefully, readers will find this introductory section helpful 
to their understanding of later chapters of this Handbook and, as noted above, to at least provide a starting 
point for further investigation into the life sciences. 
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and Function 



1.1 The Working Fluid: Blood 

Daniel J. Schneck L2 The Pumping Station: The Heart 

Virginia Polytechnic Institute and L3 Th e Piping Network: Blood Vessels 

State University 1.4 Cardiovascular Control 



Because not every cell in the human body is near enough to the environment to easily exchange with it 
mass (including nutrients, oxygen, carbon dioxide, and the waste products of metabolism), energy 
(including heat), and momentum, the physiologic system is endowed with a major highway net- 
work — organized to make available thousands of miles of access tubing for the transport to and from a 
different neighborhood (on the order of 10 urn or less) of any given cell whatever it needs to sustain life. 
This highway network, called the cardiovascular system, includes a pumping station, the heart; a working 
fluid, blood; a complex branching configuration of distributing and collecting pipes and channels, blood 
vessels; and a sophisticated means for both intrinsic (inherent) and extrinsic (autonomic and endocrine) 
control. 

1.1 The Working Fluid: Blood 

Accounting for about 8 ± 1% of total body weight, averaging 5200 ml, blood is a complex, heterogeneous 
suspension of formed elements — the blood cells, or hematocytes — suspended in a continuous, straw- 
colored fluid called plasma. Nominally, the composite fluid has a mass density of 1.057 ± 0.007 g/cm 3 , 
and it is three to six times as viscous as water. The hematocytes (Table 1.1) include three basic types of 
cells: red blood cells (erythrocytes, totaling nearly 95% of the formed elements), white blood cells 
(leukocytes, averaging <0.15% of all hematocytes), and platelets (thrombocytes, on the order of 5% of 
all blood cells). Hematocytes are all derived in the active ("red") bone marrow (about 1500 g) of adults 
from undifferentiated stem cells called hemocytoblasts, and all reach ultimate maturity via a process called 
hematocytopoiesis. 

The primary function of erythrocytes is to aid in the transport of blood gases — about 30 to 34% (by 
weight) of each cell consisting of the oxygen- and carbon dioxide-carrying protein hemoglobin (64,000 
< MW <68,000) and a small portion of the cell containing the enzyme carbonic anhydrase, which catalyzes 
the reversible formation of carbonic acid from carbon dioxide and water. The primary function of 
leukocytes is to endow the human body with the ability to identify and dispose of foreign substances 
such as infectious organisms) that do not belong there — agranulocytes (lymphocytes and monocytes) 
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essentially doing the "identifying" and granulocytes (neutrophils, basophils, and eosinophils) essentially 
doing the "disposing." The primary function of platelets is to participate in the blood clotting process. 

Removal of all hematocytes from blood centrifugation or other separating techniques leaves behind 
the aqueous (91% water by weight, 94.8% water by volume), saline (0.15 N) suspending medium called 
plasma — which has an average mass density of 1.035 ± 0.005 g/cm 3 and a viscosity IV2 to 2 times that 
of water. Some 6.5 to 8% by weight of plasma consists of the plasma proteins, of which there are three 
major types — albumin, the globulins, and fibrinogen — and several of lesser prominence (Table 1.2). 

TABLE 1.2 Plasma 





Concentration 


Typical 


Molecular 








Range 


Plasma Value 


Weight 


Typical 


Typical size 


Constituent 


(mg/dl plasma) 


(mg/dl) 


Range 


Value 


(nm) 


Total protein, 


6400-8300 


7245 


21,000-1,200,000 


— 


— 


7% by weight 












Albumin 


2800-5600 


4057 


66,500-69,000 


69,000 


15 x 4 


(56% TP) 












a^Globulin 


300-600 


400 


21,000-435,000 


60,000 


5-12 


(5.5% TP) 












a,-Globulin 


400-900 


542 


100,000-725,000 


200,000 


50-500 


(7.5% TP) 












^-Globulin 


500-1230 


942 


90,000-1,200,000 


100,000 


18-50 


(13% TP) 












y- Globulin 


500-1800 


869 


150,000-196,000 


150,000 


23 x 4 


(12% TP) 












Fibrinogen 


150-470 


290 


330,000-450,000 


390,000 


(50-60) x (3-8) 


(4% TP) 












Other 


70-210 


145 


70,000-1,000,000 


200,000 


(15-25) x (2-6) 


(2% TP) 












Inorganic ash, 


930-1140 


983 


20-100 


— 


— 


0.95% by weight 










(Radius) 


Sodium 


300-340 


325 


— 


22.98977 


0.102 (Na+) 


Potassium 


13-21 


17 


— 


39.09800 


0.138 (K+) 


Calcium 


8.4-11.0 


10 


— 


40.08000 


0.099 (Ca 2+ ) 


Magnesium 


1.5-3.0 


2 


— 


24,30500 


0.072 (Mg 2+ ) 


Chloride 


336-390 


369 


— 


35.45300 


0.181 (CL) 


Bicarbonate 


110-240 


175 


— 


61.01710 


0.163 (HCO3-) 


Phosphate 


2.7-4.5 


3.6 


— 


95.97926 


0.210 (HP0 4 2 i 


Sulfate 


0.5-1.5 


1.0 


— 


96.05760 


0.230 (S0 4 2 i 


Other 


0-100 


80.4 


20-100 


— 


0.1-0.3 


Lipids (fats), 


541-1000 


828 


44,000-3,200,000 


= Lipoproteins 


Up to 200 or more 


0.80% by weight 












Cholesterol 


12-105 "free" 


59 


386.67 


Contained mostly 


in intermediate to 


(34% TL) 


72-259 esterified, 


224 




LDL P-lipoprot 


eins; higher in 




84-364 "total" 


283 




women 




Phospholipid 


150-331 


292 


690-1010 


Contained mainly 


in HDL to VHDL 


(35% TL) 








ct r lipoproteins 




Triglyceride 


65-240 


215 


400-1370 


Contained mainly 


in VLDL a,-lipo- 


(26% TL) 








proteins and chylomicrons 


Other 


0-80 


38 


280-1500 


Fat-soluble vitamins, prostaglandins, 


(5% TL) 








fatty acids 




Extractives, 


200-500 


259 


— 


— 


— 


0.25% by weight 












Glucose 


60-120, fasting 


90 


— 


180.1572 


0.86 D 


Urea 


20-30 


25 


— 


60.0554 


0.36 D 


Carbohydrate 


60-105 


83 


180.16-342.3 


— 


0.74-0.108 D 


Other 


11-111 


61 


— 


— 


— 
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The primary functions of albumin are to help maintain the osmotic (oncotic) transmural pressure 
differential that ensures proper mass exchange between blood and interstitial fluid at the capillary level 
and to serve as a transport carrier molecule for several hormones and other small biochemical constituents 
(such as some metal ions). The primary function of the globulin class of proteins is to act as transport 
carrier molecules (mostly of the a and P class) for large biochemical substances, such as fats (lipoproteins) 
and certain carbohydrates (muco- and glycoproteins) and heavy metals (mineraloproteins), and to work 
together with leukocytes in the body's immune system. The latter function is primarily the responsibility 
of the y class of immunoglobulins, which have antibody activity. The primary function of fibrinogen is 
to work with thrombocytes in the formation of a blood clot — a process also aided by one of the most 
abundant of the lesser proteins, prothrombin (MW = 62,000). 

Of the remaining 2% or so (by weight) of plasma, just under half (0.95%, or 983 mg/dl plasma) 
consists of minerals (inorganic ash), trace elements, and electrolytes, mostly the cations sodium, potas- 
sium, calcium, and magnesium and the anions chlorine, bicarbonate, phosphate, and sulfate — the latter 
three helping as buffers to maintain the fluid at a slightly alkaline pH between 7.35 and 7.45 (average 
7.4). What is left, about 1087 mg of material per deciliter of plasma, includes: (1) mainly (0.8% by weight) 
three major types of fat, i.e., cholesterol (in a free and esterified form), phospholipid (a major ingredient 
of cell membranes), and triglyceride, with lesser amounts of the fat-soluble vitamins (A, D, E, and K), 
free fatty acids, and other lipids, and (2) "extractives" (0.25% by weight), of which about two-thirds 
includes glucose and other forms of carbohydrate, the remainder consisting of the water-soluble vitamins 
(B-complex and C), certain enzymes, nonnitrogenous and nitrogenous waste products of metabolism 
(including urea, creatine, and creatinine), and many smaller amounts of other biochemical constitu- 
ents — the list seeming virtually endless. 

Removal from blood of all hematocytes and the protein fibrinogen (by allowing the fluid to completely 
clot before centrifuging) leaves behind a clear fluid called serum, which has a density of about 1.018 ± 
0.003 g/cm 3 and a viscosity up to IV2 times that of water. A glimpse of Tables 1.1 and 1.2, together with 
the very brief summary presented above, nevertheless gives the reader an immediate appreciation for 
why blood is often referred to as the "river of life." This river is made to flow through the vascular piping 
network by two central pumping stations arranged in series: the left and right sides of the human heart. 

1.2 The Pumping Station: The Heart 

Barely the size of the clenched fist of the individual in whom it resides — an inverted, conically shaped, 
hollow muscular organ measuring 12 to 13 cm from base (top) to apex (bottom) and 7 to 8 cm at its 
widest point and weighing just under 0.75 lb (about 0.474% of the individual's body weight, or some 
325 g) — the human heart occupies a small region between the third and sixth ribs in the central portion 
of the thoracic cavity of the body. It rests on the diaphragm, between the lower part of the two lungs, 
its base-to-apex axis leaning mostly toward the left side of the body and slightly forward. The heart is 
divided by a tough muscular wall — the interatrial-interventricular septum — into a somewhat crescent- 
shaped right side and cylindrically shaped left side (Fig. 1.1), each being one self-contained pumping 
station, but the two being connected in series. The left side of the heart drives oxygen-rich blood through 
the aortic semilunar outlet valve into the systemic circulation, which carries the fluid to within a differential 
neighborhood of each cell in the body — from which it returns to the right side of the heart low in oxygen 
and rich in carbon dioxide. The right side of the heart then drives this oxygen-poor blood through the 
pulmonary semilunar (pulmonic) outlet valve into the pulmonary circulation, which carries the fluid to 
the lungs — where its oxygen supply is replenished and its carbon dioxide content is purged before it 
returns to the left side of the heart to begin the cycle all over again. Because of the anatomic proximity 
of the heart to the lungs, the right side of the heart does not have to work very hard to drive blood 
through the pulmonary circulation, so it functions as a low-pressure (P < 40 mmHg gauge) pump 
compared with the left side of the heart, which does most of its work at a high pressure (up to 140 mmHg 
gauge or more) to drive blood through the entire systemic circulation to the furthest extremes of the 
organism. 
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FIGURE 1.1. Anterior view of the human heart showing the four chambers, the inlet and outlet valves, the inlet 
and outlet major blood vessels, the wall separating the right side from the left side, and the two cardiac pacing 
centers — the sinoatrial node and the atrioventricular node. Boldface arrows show the direction of flow through the 
heart chambers, the valves, and the major vessels. 

Each cardiac (heart) pump is further divided into two chambers: a small upper receiving chamber, or 
atrium (auricle), separated by a one-way valve from a lower discharging chamber, or ventricle, which is 
about twice the size of its corresponding atrium. In order of size, the somewhat spherically shaped left 
atrium is the smallest chamber — holding about 45 ml of blood (at rest), operating at pressures on the 
order of to 25 mmHg gauge, and having a wall thickness of about 3 mm. The pouch-shaped right 
atrium is next (63 ml of blood, to 10 mmHg gauge of pressure, 2-mm wall thickness), followed by the 
conical/cylindrically shaped left ventricle (100 ml of blood, up to 140 mmHg gauge of pressure, variable 
wall thickness up to 12 mm) and the crescent-shaped right ventricle (about 130 ml of blood, up to 40 
mmHg gauge of pressure, and a wall thickness on the order of one-third that of the left ventricle, up to 
about 4 mm). All together, then, the heart chambers collectively have a capacity of some 325 to 350 ml, 
or about 6.5% of the total blood volume in a "typical" individual — but these values are nominal, since 
the organ alternately fills and expands, contracts, and then empties as it generates a cardiac output. 

During the 480-ms or so filling phase — diastole — of the average 750-ms cardiac cycle, the inlet valves 
of the two ventricles (3.8-cm-diameter tricuspid valve from right atrium to right ventricle; 3.1-cm- 
diameter bicuspid or mitral valve from left atrium to left ventricle) are open, and the outlet valves (2.4-cm- 
diameter pulmonary valve and 2.25-cm-diameter aortic semilunar valve, respectively) are closed — the 
heart ultimately expanding to its end-diastolic-volume (EDV), which is on the order of 140 ml of blood 
for the left ventricle. During the 270-ms emptying phase — systole — electrically induced vigorous con- 
traction of cardiac muscle drives the intraventricular pressure up, forcing the one-way inlet valves closed 
and the unidirectional outlet valves open as the heart contracts to its end-systolic-volume (ESV), which 
is typically on the order of 70 ml of blood for the left ventricle. Thus the ventricles normally empty about 
half their contained volume with each heart beat, the remainder being termed the cardiac reserve volume. 
More generally, the difference between the actual EDV and the actual ESV, called the stroke volume (SV), 
is the volume of blood expelled from the heart during each systolic interval, and the ratio of SV to EDV 
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is called the cardiac ejection fraction, or ejection ratio (0.5 to 0.75 is normal, 0.4 to 0.5 signifies mild 
cardiac damage, 0.25 to 0.40 implies moderate heart damage, and <0.25 warms of severe damage to the 
heart's pumping ability). If the stroke volume is multiplied by the number of systolic intervals per minute, 
or heart (HR), one obtains the total cardiac output (CO): 

co = hrx(edv-esv) (1.1) 

Dawson [1991] has suggested that the cardiac output (in milliliters per minute) is proportional to the 
weight W (in kilograms) of an individual according to the equation 

CO- 224 W 3/4 (1.2) 

and that "normal" heart rate obeys very closely the relation 

HR = 229W~ l/4 (1.3) 

For a "typical" 68. 7-kg individual (blood volume = 5200 ml), Equations (1.1), (1.2), and (1.3) yield CO = 
5345 ml/min, HR = 80 beats/min (cardiac cycle period = 754 ms) and SV = CO/HR = 224W 3/4 /229 W- m = 
0.978W= 67.2 ml/beat, which are very reasonable values. Furthermore, assuming this individual lives 
about 75 years, his or her heart will have cycled over 3.1536 billion times, pumping a total of 0.2107 
billion liters of blood (55.665 million gallons, or 8134 quarts per day) — all of it emptying into the 
circulatory pathways that constitute the vascular system. 

1.3 The Piping Network: Blood Vessels 

The vascular system is divided by a microscopic capillary network into an upstream, high-pressure, 
efferent arterial side (Table 1.3) — consisting of relatively thick-walled, viscoelastic tubes that carry blood 
away from the heart — and a downstream, low-pressure, afferent venous side (Table 1.4) — consisting of 
correspondingly thinner (but having a larger caliber) elastic conduits that return blood back to the heart. 
Except for their differences in thickness, the walls of the largest arteries and veins consist of the same 
three distinct, well-defined, and well-developed layers. From innermost to outermost, these layers are 
(1) the thinnest tunica intima, a continuous lining (the vascular endothelium) consisting of a single layer 
of simple squamous (thin, sheetlike) endothelial cells "glued" together by a polysaccharide (sugar) 
intercellular matrix, surrounded by a thin layer of subendothelial connective tissue interlaced with a 
number of circularly arranged elastic fibers to form the subendothelium, and separated from the next 
adjacent wall layer by a thick elastic band called the internal elastic lamina, (2) the thickest tunica media, 
composed of numerous circularly arranged elastic fibers, especially prevalent in the largest blood vessels 
on the arterial side (allowing them to expand during systole and to recoil passively during diastole), a 
significant amount of smooth muscle cells arranged in spiraling layers around the vessel wall, especially 
prevalent in medium-sized arteries and arterioles (allowing them to function as control points for blood 
distribution), and some interlacing collagenous connective tissue, elastic fibers, and intercellular muco- 
polysaccharide substance (extractives), all separated from the next adjacent wall layer by another thick 
elastic band called the external elastic lamina, and (3) the medium-sized tunica adventitia, an outer 
vascular sheath consisting entirely of connective tissue. 

The largest blood vessels, such as the aorta, the pulmonary artery, the pulmonary veins, and others, 
have such thick walls that they require a separate network of tiny blood vessels — the vasa vasorum — just 
to service the vascular tissue itself. As one moves toward the capillaries from the arterial side (see 
Table 1.3), the vascular wall keeps thinning, as if it were shedding 15-um-thick, onion-peel-like concentric 
layers, and while the percentage of water in the vessel wall stays relatively constant at 70% (by weight), 
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the ratio of elastin to collagen decreases (actually reverses) — from 3:2 in large arteries (9% elastin, 6% 
collagen, by weight) to 1:2 in small tributaries (5% elastin, 10% collagen) — and the amount of smooth 
muscle tissue increases from 7.5% by weight of large arteries (the remaining 7.5% consisting of various 
extractives) to 15% in small tributaries. By the time one reaches the capillaries, one encounters single- 
cell-thick endothelial tubes — devoid of any smooth muscle tissue, elastin, or collagen — downstream of 
which the vascular wall gradually "reassembles itself," layer-by-layer, as it directs blood back to the heart 
through the venous system (Table 1.4). 

Blood vessel structure is directly related to function. The thick-walled large arteries and main distrib- 
uting branches are designed to withstand the pulsating 80 to 130 mmHg blood pressures that they must 
endure. The smaller elastic conducting vessels need only operate under steadier blood pressures in the 
range 70 to 90 mmHg, but they must be thin enough to penetrate and course through organs without 
unduly disturbing the anatomic integrity of the mass involved. Controlling arterioles operate at blood 
pressures between 45 and 70 mmHg but are heavily endowed with smooth muscle tissue (hence their 
being referred to as muscular vessels) so that they may be actively shut down when flow to the capillary 
bed they service is to be restricted (for whatever reason), and the smallest capillary resistance vessels 
(which operate at blood pressures on the order of 10 to 45 mmHg) are designed to optimize conditions 
for transport to occur between blood and the surrounding interstitial fluid. Traveling back up the venous 
side, one encounters relatively steady blood pressures continuously decreasing from around 30 mmHg 
all the way down to near zero, so these vessels can be thin-walled without disease consequence. However, 
the low blood pressure, slower, steady (time-dependent) flow, thin walls, and larger caliber that charac- 
terize the venous system cause blood to tend to "pool" in veins, allowing them to act somewhat like 
reservoirs. It is not surprising, then, that at any given instant, one normally finds about two-thirds of 
the total human blood volume residing in the venous system, the remaining one-third being divided 
among the heart (6.5%), the microcirculation (7% in systemic and pulmonary capillaries), and the arterial 
system (19.5 to 20%). 

In a global sense, then, one can think of the human cardiovascular system — using an electrical 
analogy — as a voltage source (the heart), two capacitors (a large venous system and a smaller arterial 
system), and a resistor (the microcirculation taken as a whole). Blood flow and the dynamics of the 
system represent electrical inductance (inertia), and useful engineering approximations can be derived 
from such a simple model. The cardiovascular system is designed to bring blood to within a capillary 
size of each and every one of the more than 10 14 cells of the body — but which cells receive blood at any 
given time, how much blood they get, the composition of the fluid coursing by them, and related physi- 
ologic considerations are all matters that are not left up to chance. 

1.4 Cardiovascular Control 

Blood flows through organs and tissues either to nourish and sanitize them or to be itself processed in 
some sense — e.g., to be oxygenated (pulmonary circulation), stocked with nutrients (splanchnic circu- 
lation), dialyzed (renal circulation), cooled (cutaneous circulation), filtered of dilapidated red blood cells 
(splenic circulation), and so on. Thus any given vascular network normally receives blood according to 
the metabolic needs of the region it perfuses and/or the function of that region as a blood treatment 
plant and/or thermoregulatory pathway. However, it is not feasible to expect that our physiologic trans- 
port system can be "all things to all cells all of the time" — especially when resources are scarce and/or 
time is a factor. Thus the distribution of blood is further prioritized according to three basic criteria: 
(1) how essential the perfused region is to the maintenance of life itself (e.g., we can survive without an 
arm, a leg, a stomach, or even a large portion of our small intestine but not without a brain, a heart, and 
at least one functioning kidney and lung, (2) how essential the perfused region is in allowing the organism 
to respond to a life -threatening situation (e.g., digesting a meal is among the least of the body's concerns 
in a "fight or flight" circumstance), and (3) how well the perfused region can function and survive on a 
decreased supply of blood (e.g., some tissues — like striated skeletal and smooth muscle — have significant 
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anaerobic capability; others — like several forms of connective tissue — can function quite effectively at a 
significantly decreased metabolic rate when necessary; some organs — like the liver — are larger than they 
really need to be; and some anatomic structures — like the eyes, ears, and limbs — have duplicates, giving 
them a built-in redundancy). 

Within this generalized prioritization scheme, control of cardiovascular function is accomplished by 
mechanisms that are based either on the inherent physicochemical attributes of the tissues and organs 
themselves — so-called intrinsic control — or on responses that can be attributed to the effects on cardio- 
vascular tissues of other organ systems in the body (most notably the autonomic nervous system and 
the endocrine system) — so-called extrinsic control. For example, the accumulation of wastes and deple- 
tion of oxygen and nutrients that accompany the increased rate of metabolism in an active tissue both 
lead to an intrinsic relaxation of local precapillary sphincters (rings of muscle) — with a consequent 
widening of corresponding capillary entrances — which reduces the local resistance to flow and thereby 
allows more blood to perfuse the active region. On the other hand, the extrinsic innervation by the 
autonomic nervous system of smooth muscle tissues in the walls of arterioles allows the central nervous 
system to completely shut down the flow to entire vascular beds (such as the cutaneous circulation) when 
this becomes necessary (such as during exposure to extremely cold environments). 

In addition to prioritizing and controlling the distribution of blood, physiologic regulation of cardio- 
vascular function is directed mainly at four other variables: cardiac output, blood pressure, blood volume, 
and blood composition. From Equation (1.1) we see that cardiac output can be increased by increasing 
the heart rate (a chronotropic effect), increasing the end-diastolic volume (allowing the heart to fill longer 
by delaying the onset of systole), decreasing the end-systolic volume (an inotropic effect), or doing all 
three things at once. Indeed, under the extrinsic influence of the sympathetic nervous system and the 
adrenal glands, HR can triple — to some 240 beats/min if necessary — EDV can increase by as much as 
50% — to around 200 ml or more of blood — and ESV and decrease a comparable amount (the cardiac 
reserve) — to about 30 to 35 ml or less. The combined result of all three effects can lead to over a sevenfold 
increase in cardiac output — from the normal 5 to 5.5 liters/min to as much as 40 to 41 liters/min or 
more for very brief periods of strenuous exertion. 

The control of blood pressure is accomplished mainly by adjusting at the arteriolar level the down- 
stream resistance to flow — an increased resistance leading to a rise in arterial backpressure, and vice 
versa. This effect is conveniently quantified by a fluid-dynamic analogue to Ohm's famous E= IR law in 
electromagnetic theory, voltage drop E being equated to fluid pressure drop AP, electric current / corre- 
sponding to flow — cardiac output (CO) — and electric resistance R being associated with an analogous 
vascular "peripheral resistance" (PR). Thus one may write 



AP- 



= (C0)(PR) (1.4) 



Normally, the total systemic peripheral resistance is 15 to 20 mmHg/liter/min of flow but can increase 
significantly under the influence of the vasomotor center located in the medulla of the brain, which 
controls arteriolar muscle tone. 

The control of blood volume is accomplished mainly through the excretory function of the kidney. 
For example, antidiuretic hormone (ADH) secreted by the pituitary gland acts to prevent renal fluid loss 
(excretion via urination) and thus increases plasma volume, whereas perceived extracellular fluid over- 
loads such as those which result from the peripheral vasoconstriction response to cold stress lead to a 
sympathetic/adrenergic receptor-induced renal diuresis (urination) that tends to decrease plasma vol- 
ume — if not checked, to sometimes dangerously low dehydration levels. Blood composition, too, is 
maintained primarily through the activity of endocrine hormones and enzymes that enhance or repress 
specific biochemical pathways. Since these pathways are too numerous to itemize here, suffice it to say 
that in the body's quest for homeostasis and stability, virtually nothing is left to chance, and every 
biochemical end can be arrived at through a number of alternative means. In a broader sense, as the 
organism strives to maintain life, it coordinates a wide variety of different functions, and central to its 

© 2000 by CRC Press LLC 



ability to do just that is the role played by the cardiovascular system in transporting mass, energy, and 
momentum. 

Defining Terms 

Atrioventricular (AV) node: A highly specialized cluster of neuromuscular cells at the lower portion 

of the right atrium leading to the interventricular septum; the AV node delays sinoatrial, (SA) 

node-generated electrical impulses momentarily (allowing the atria to contract first) and then 

conducts the depolarization wave to the bundle of His and its bundle branches. 
Autonomic nervous system: The functional division of the nervous system that innervates most glands, 

the heart, and smooth muscle tissue in order to maintain the internal environment of the body. 
Cardiac muscle: Involuntary muscle possessing much of the anatomic attributes of skeletal voluntary 

muscle and some of the physiologic attributes of involuntary smooth muscle tissue; SA 

node-induced contraction of its interconnected network of fibers allows the heart to expel blood 

during systole. 
Chronotropic: Affecting the periodicity of a recurring action, such as the slowing (bradycardia) or 

speeding up (tachycardia) of the heartbeat that results from extrinsic control of the SA node. 
Endocrine system: The system of ductless glands and organs secreting substances directly into the 

blood to produce a specific response from another "target" organ or body part. 
Endothelium: Flat cells that line the innermost surfaces of blood and lymphatic vessels and the heart. 
Homeostasis: A tendency to uniformity or stability in an organism by maintaining within narrow limits 

certain variables that are critical to life. 
Inotropic: Affecting the contractility of muscular tissue, such as the increase in cardiac power that 

results from extrinsic control of the myocardial musculature. 
Precapillary sphincters: Rings of smooth muscle surrounding the entrance to capillaries where they 

branch off from upstream metarterioles. Contraction and relaxation of these sphincters close and 

open the access to downstream blood vessels, thus controlling the irrigation of different capillary 

networks. 
Sinoatrial (SA) node: Neuromuscular tissue in the right atrium near where the superior vena cava joins 

the posterior right atrium (the sinus venarum); the SA node generates electrical impulses that 

initiate the heartbeat, hence its nickname the cardiac "pacemaker." 
Stem cells: A generalized parent cell spawning descendants that become individually specialized. 
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2.1 Endocrine System: Hormones, Signals, and 
Communication Between Cells and Tissues 

2.2 Hormone Action at the Cell Level: Signal 
Recognition, Signal Transduction, and Effecting 
a Physiological Response 

Hormones Acting at the Cell Surface • Hormones Acting 

Derek G. Cramp Within the Cell 

City University, London 2.3 Endocrine System: Some Other Aspects of Regulation 

and Control 
Lwart K. Uarson Negative Feedback • Positive Feedback • Rhythmic 

City University, London Endocrine Control 

The body, if it is to achieve optimal performance, must possess mechanisms for sensing and responding 
appropriately to numerous biologic cues and signals in order to control and maintain its internal 
environment. This complex role is effected by the integrative action of the endocrine and neural systems. 
The endocrine contribution is achieved through a highly sophisticated set of communication and control 
systems involving signal generation, propagation, recognition, transduction, and response. The signal 
entities are chemical messengers or hormones that are distributed through the body by the blood 
circulatory system to their respective target organs to modify their activity in some fashion. 

Endocrinology has a comparatively long history, but real advances in the understanding of endocrine 
physiology and mechanisms of regulation and control only began in the late 1960s with the introduction 
of sensitive and relatively specific analytical methods; these enabled low concentrations of circulating 
hormones to be measured reliably, simply, and at relatively low cost. The breakthrough came with the 
development and widespread adoption of competitive protein binding and radioimmunoassays that 
superseded existing cumbersome bioassay methods. Since then, knowledge of the physiology of individual 
endocrine glands and of the neural control of the pituitary gland and the overall feedback control of the 
endocrine system has progressed and is growing rapidly. Much of this has been accomplished by applying 
to endocrinological research the methods developed in cellular and molecular biology and recombinant 
DNA technology. At the same time, theoretical and quantitative approaches using mathematical modeling 
complemented experimental studies have been of value in gaining a greater understanding of endocrine 
dynamics. 

2.1 Endocrine System: Hormones, Signals, and 
Communication Between Cells and Tissues 

Hormones are synthesized and secreted by specialized endocrine glands to act locally or at a distance, having 
been carried in the bloodstream (classic endocrine activity) or secreted into the gut lumen (lumocrine 
activity) to act on target cells that are distributed elsewhere in the body. Hormones are chemically diverse, 
physiologically potent molecules that are the primary vehicle for intercellular communication with the 
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FIGURE 2. 1 Representation of the forward pathways of pituitary and target gland hormone release and action: 
tropic hormones; tissue-affecting hormones. 

capacity to override the intrinsic mechanisms of normal cellular control. They can be classified broadly 
into three groups according to their physicochemical characteristics: (1) steroid hormones produced by 
chemical modification of cholesterol, (2) peptide and protein hormones, and (3) those derived from the 
aromatic amino acid tyrosine. The peptide and protein hormones are essentially hydrophilic and are 
therefore able to circulate in the blood in the free state; however, the more hydrophobic lipid-derived 
molecules have to be carried in the circulation bound to specific transport proteins. Figure 2.1 and Table 
2.1 show, in schematic and descriptive form respectively, details of the major endocrine glands of the 
body and the endocrine pathways. 
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TABLE 2.1 Main Endocrine Glands and the Hormones They Produce and Release 



Gland 


Hormone 


Chemical Characteristics 


Hypothalamus/median eminence 


Thyrotropin-releasing hormone (TRH) 
Somatostatin 


Peptides 




Gonadotropin-releasing hormone 


Amine 




Growth hormone-releasing hormone 






Corticotropin-releasing hormone 






Prolactin inhibitor factor 




Anterior pituitary 


Thyrotropin (TSH) 
Luteinizing hormone 


Glycoproteins 




Follicle-stimulating hormone (FSH) 


Proteins 




Growth hormone 






Prolactin 






Adrenocorticotropin (ACTH) 




Posterior pituitary 


Vasopressin (antidiuretic hormone, ADH) 






Oxytocin 


Peptides 


Thyroid 


Triidothyronine (T3) 
Thyroxine (T4) 


Tyrosine derivatives 


Parathyroid 


Parathyroid hormone (PTH) 


Peptide 


Adrenal cortex 


Cortisol 
Aldosterone 


Steroids 


Adrenal medulla 


Epinephrine 
Norepinephrine 


Catecolamines 


Pancreas 


Insulin 

Glucagon 

Somatostatin 


Proteins 


Gonads: Testes 


Testosterone 


Steroids 


Ovaries 


Oestrogen 
Progesterone 





The endocrine and nervous system are physically and functionally linked by a specific region of the 
brain called the hypothalamus, which lies immediately above the pituitary gland, to which it is connected 
by an extension called the pituitary stalk. The integrating function of the hypothalamus is mediated by 
cells that possess the properties of both nerve and processes that carry electrical impulses and on 
stimulation can release their signal molecules into the blood. Each of the hypothalamic neurosecretory 
cells can be stimulated by other nerve cells in higher regions of the brain to secrete specific peptide 
hormones or release factors into the adenohypophyseal portal vasculature. These hormones can then 
specifically stimulate or suppress the secretion of a second hormone from the anterior pituitary. 

The pituitary hormones in the circulation interact with their target tissues, which, if endocrine glands, 
are stimulated to secrete further (third) hormones that feed back to inhibit the release of the pituitary 
hormones. It will be seen from Fig. 2.1 and Table 2.1 that the main targets of the pituitary are the adrenal 
cortex, the thyroid, and the gonads. These axes provide good examples of the control of pituitary hormone 
release by negative-feedback inhibition; e.g., adrenocorticotropin (ACTH), luteinizing hormone (LH), 
and follicle-stimulating hormone (FSH) are selectively inhibited by different steroid hormones, as is 
thyrotropin (TSH) release by the thyroid hormones. 

In the case of growth hormone (GH) and prolactin, the target tissue is not an endocrine gland and 
thus does not produce a hormone; then the feedback control is mediated by inhibitors. Prolactin is under 
dopamine inhibitory control, whereas hypothalamic releasing and inhibitory factors control GH release. 
The two posterior pituitary (neurohypophyseal) hormones, oxytocin and vasopressin, are synthesized in 
the supraoptic and paraventricular nuclei and are stored in granules at the end of the nerve fibers in the 
posterior pituitary. Oxytocin is subsequently secreted in response to peripheral stimuli from the cervical 
stretch receptors or the suckling receptors of the breast. In a like manner, antidiuretic hormone (ADH, 
vasopressin) release is stimulated by the altered activity of hypothalamic osmoreceptors responding to 
changes in plasma solute concentrations. 
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It will be noted that the whole system is composed of several endocrine axes with the hypothalamus, 
pituitary, and other endocrine glands together forming a complex hierarchical regulatory system. There 
is no doubt that the anterior pituitary occupies a central position in the control of hormone secretion 
and, because of its important role, was often called the "conductor of the endocrine orchestra". However, 
the release of pituitary hormones is mediated by complex feedback control, so the pituitary should be 
regarded as having a permissive role rather than having the overall control of the endocrine system. 

2.2 Hormone Action at the Cell Level: 

Signal Recognition, Signal Transduction, 
and Effecting a Physiological Response 

The ability of target glands or tissues to respond to hormonal signals depends on the ability of the cells 
to recognize the signal. This function is mediated by specialized proteins or glycoproteins in or on the 
cell plasma membrane that are specific for a particular hormone, able to recognize it, bind it with high 
affinity, and react when very low concentrations are present. Recognition of the hormonal signal and 
activation of the cell surface receptors initiates a flow of information to the cell interior which triggers 
a chain of intracellular events in a pre-programmed fashion that produces a characteristic response. It 
is useful to classify the site of such action of hormones into two groups: those which act at the cell surface 
without, generally, traversing the cell membrane and those which actually enter the cell before effecting 
a response. In the study of this multi-step sequence, two important events can be readily studied, namely, 
the binding of the hormone to its receptor and activation of cytoplasmic effects. However, it is some of 
the steps between these events, such as receptor activation and signal generation, that are still relatively 
poorly defined. One method employed in an attempt to elucidate the intermediate steps has been to use 
ineffective mutant receptors, which when assayed are either defective in their hormone binding capabil- 
ities or in effector activation and thus unable to transduce a meaningful signal to the cell. But, the 
difficulty with these studies has been to distinguish receptor- activation and signal-generation defects 
from hormone binding and effector activation defects. 

Hormones Acting at the Cell Surface 

Most peptide and protein hormones are hydrophilic and thus unable to traverse the lipid-containing cell 
membrane and must therefore act through activation of receptor proteins on the cell surface. When these 
receptors are activated by the binding of an extracellular signal ligand, the ligand-receptor complex 
initiates a series of protein interactions within or adjacent to the inner surface of the plasma membrane, 
which in turn brings about changes in intracellular activity. This can happen in one of two ways. The 
first involves the so-called second messenger, by altering the activity of a plasma membrane-bound 
enzyme, which in turn increases (or sometimes decreases) the concentration of an intracellular mediator. 
The second involves activation of other types of cell surface receptors which leads to changes in the 
plasma membrane electrical potential and the membrane permeability, resulting in altered transmem- 
brane transport of ions or metabolites. If the hormone is thought of as the "first messenger", cyclic 
adenosine monophosphate (cAMP) can be regarded as the "second messenger"; capable of triggering a 
cascade of intracellular biochemical events that can lead either to a rapid secondary response such as 
altered ion transport, enhanced metabolic pathway flux, steroidogenesis or to a slower response such as 
DNA, RNA, and protein synthesis resulting in cell growth or cell division. 

The peptide and protein hormones circulate at very low concentrations relative to other proteins in 
the blood plasma. These low concentrations are reflected in the very high affinity and specificity of the 
receptor sites, which permits recognition of the relevant hormones amid the profusion of protein mol- 
ecules in the circulation. Adaptation to a high concentration of a signal ligand in a time-dependent 
reversible manner enables cells to respond to changes in the concentration of a ligand instead of to its 
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absolute concentration. The number of receptors in a cell is not constant; synthesis of receptors may be 
induced or repressed by other hormones or even by their own hormones. Adaptation can occur in several 
ways. Ligand binding can inactivate a cell surface receptor either by inducing its internalization and 
degradation or by causing the receptor to adopt an inactive conformation. Alternatively, it may result 
from the changes in one of the non-receptor proteins involved in signal transduction following receptor 
activation. Down regulation is the name given to the process whereby a cell decreases the number of 
receptors in response to intense or frequent stimulation and can occur by degradation or more tempo- 
rarily by phosphorylation and sequestration. Up regulation is the process of increasing receptor expression 
either by other hormones or in response to altered stimulation. 

The cell surface receptors for peptide hormones are linked functionally to a cell membrane-bound 
enzyme that acts as the catalytic unit. This receptor complex consists of three components: ( 1 ) the receptor 
itself which recognizes the hormone, (2) a regulatory protein called a G-protein that binds guanine 
nucleotides and is located on the cytosolic face of the membrane, and (3) adenylate cyclase which catalyzes 
the conversion of ATP to cyclic AMP. As the hormone binds at the receptor site, it is coupled through a 
regulatory protein, which acts as a transducer, to the enzyme adenyl cyclase, which catalyzes the formation 
of cAMP from adenosine triphosphate (ATP). The G-protein consists of 3 subunits, which in the unstim- 
ulated state form a heterotrimer to which a molecule of GDP is bound. Binding of the hormone to the 
receptor causes the subunit to exchange its GDP for a molecule of GTP (guanine triphosphate) which 
then dissociates from the subunits. This in turn decreases the affinity of the receptor for the hormone 
and leads to its dissociation. The GTP subunit not only activates adenylate cyclase, but also has intrinsic 
GTPase activity and slowly converts GTP back to GDP, thus allowing the subunits to reassociate and so 
regain their initial resting state. There are hormones, such as somatostatin, that possess the ability to 
inhibit AMP formation but still have similarly structured receptor complexes. The G-protein of inhibitory 
complexes consists of an inhibitory subunit complexed with a subunit thought to be identical to the 
subunits of the stimulatory G-protein. But, it appears that a single adenylate cyclase molecule can be 
simultaneously regulated by more than one G-protein enabling the system to integrate opposing inputs. 

The adenylate cyclase reaction is rapid, and the increased concentration of intracellular cAMP is short- 
lived, since it is rapidly hydrolyzed and destroyed by the enzyme cAMP phosphodiesterase which termi- 
nates the hormonal response. The continual and rapid removal of cAMP and free calcium ions from the 
cytosol makes for both the rapid increase and decrease of these intracellular mediators when the cells 
respond to signals. Rising cAMP concentrations affect cells by stimulating cAMP-dependent protein 
kinases to phosphorylate specific target proteins. Phosphorylation of proteins leads to conformational 
changes that enhance their catalytic activity, thus providing a signal amplification pathway from hormone 
to effector. These effects are reversible because phosphorylated proteins are rapidly dephosphorylated by 
protein phosphatases when the concentration of cAMP falls. A similar system involving cyclic GMP, 
although less common and less well studied, plays an analogous role to that of cAMP. The action of 
thyrotropin-releasing hormone (TRH), parathyroid hormone (PTH), and epinephrine is catalyzed by 
adenyl cyclase, and this can be regarded as the classic reaction. 

However, there are variant mechanisms. In the phosphatidylinositol-diacylglycerol (DAG)/inositol 
triphosphate (IP3) system, some surface receptors are coupled through another G-protein to the enzyme 
phospholipase C which cleaves the membrane phospholipid to form DAG and IP3 or phospholipase D 
which cleaves phosphatidyl choline to DAG via phosphatidic acid. DAG causes the calcium, phospholid- 
dependent protein kinase C to translocate to the the cell membrane from the cytosolic cell compartment 
becoming 20 times more active in the process. IP3 mobilizes calcium from storage sites associated with 
the plasma and intracellular membranes thereby contributing to the activation of protein kinase C as 
well as other calcium dependent processes. DAG is cleared from the cell either by conversion to phos- 
phatidic acid which may be recycled to phospholipid or it may be broken down to fatty acids and glycerol. 
The DAG derived from phosphatidylinositol usually contains arachidonic acid esterified to the middle 
carbon of glycerol. Arachidonic acid is the precursor of the prostaglandins and leukotrienes which are 
biologically active eicosanoids. 
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Thyrotropin and vasopressin modulate an activity of phospholipase C that catalyzes the conversion 
of phosphatidylinositol to diacylglycerol and inositol, 1, 4, 5-triphosphate, which act as the second 
messengers. They mobilize bound intracellular calcium and activate a protein kinase, which in turn alters 
the activity of other calcium-dependent enzymes within the cell. 

Increased concentrations of free calcium ions affect cellular events by binding to and altering the 
molecular conformation of calmodulin; the resulting calcium ion-calmodulin complex can activate many 
different target proteins, including calcium ion-dependent protein kinases. Each cell type has a charac- 
teristic set of target proteins that are so regulated by cAMP-dependent kinases and/or calmodulin that 
it will respond in a specific way to an alteration in cAMP or calcium ion concentrations. In this way, 
cAMP or calcium ions act as second messengers in such a way as to allow the extracellular signal not 
only to be greatly amplified but, just as importantly, also to be made specific for each cell type. 

The action of the important hormone insulin that regulates glucose metabolism depends on the 
activation of the enzyme tyrosine kinase catalyzing the phosphorylation of tyrosyl residues of proteins. 
This effects changes in the activity of calcium-sensitive enzymes, leading to enhanced movement of 
glucose and fatty acids across the cell membrane and modulating their intracellular metabolism. The 
binding of insulin to its receptor site has been studied extensively; the receptor complex has been isolated 
and characterized. It was such work that highlighted the interesting aspect of feedback control at the cell 
level, down-regulation: the ability of peptide hormones to regulate the concentration of cell surface 
receptors. After activation, the receptor population becomes desensitized, or "down-regulated"; leading 
to a decreased availability of receptors and thus a modulation of transmembrane events. 

Hormones Acting Within the Cell 

Steroid hormones are small hydrophobic molecules derived from cholesterol that are solubilized by 
binding reversibly to specify carrier proteins in the blood plasma. Once released from their carrier 
proteins, they readily pass through the plasma membrane of the target cell and bind, again reversibly, to 
steroid hormone receptor proteins in the cytosol. This is a relatively slow process when compared to 
protein hormones. The latter second messenger mediated phosphorylation-dephosphorylation reactions 
modify enzymatic processes rapidly with the physiological consequences becoming apparent in seconds 
or minutes and are as rapidly reversed. Nuclear-mediated responses, on the other hand, lead to tran- 
scription/translation dependent changes that are slow in onset and tend to persist since reversal is 
dependent on degradation of the induced proteins. The protein component of the steroid hormone- 
receptor complex has an affinity for DNA in the cell nucleus, where it binds to nuclear chromatin and 
initiates the transcription of a small number of genes. These gene products may, in turn, activate other 
genes and produce a secondary response, thereby amplifying the initial effect of the hormone. Each 
steroid hormone is recognized by a separate receptor protein, but this same receptor protein has the 
capacity to regulate several different genes in different target cells. This, of course, suggests that the 
nuclear chromatin of each cell type is organized so that only the appropriate genes are made available 
for regulation by the hormone-receptor complex. The thyroid hormone triiodothyronine (T3) also acts, 
though by a different mechanism than the steroids, at the cell nucleus level to initiate genomic transcrip- 
tion. The hormonal activities of GH and prolactin influence cellular gene transcription and translation 
of messenger RNA by complex mechanisms. 

2.3 Endocrine System: Some Other Aspects 
of Regulation and Control 

From the foregoing sections it is clear that the endocrine system exhibits complex molecular and meta- 
bolic dynamics which involve many levels of control and regulation. Hormones are chemical signals 
released from a hierarchy of endocrine glands and propagated through the circulation to a hierarchy of 
cell types. The integration of this system depends on a series of what systems engineers call "feedback 
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FIGURE 2.2 Illustration of the complexity of hormonal feedback control (+ indicates a positive or augmenting 
effect; - indicates a negative or inhibiting effect). 

loops"; feedback is a reflection of mutual dependence of the system variables: variable x affects variable y, 
and y affects x. Further, it is essentially a closed-loop system in which the feedback of information from 
the system output to the input has the capacity to maintain homeostasis. A diagrammatic representation 
of the ways in which hormone action is controlled is shown in Fig. 2.2. One example of this control 
structure arises in the context of the thyroid hormones. In this case, TRH, secreted by the hypothalamus, 
triggers the anterior pituitary into the production of TSH. The target gland is the thyroid, which produces 
T3 and thyroxine (T4). The complexity of control includes both direct and indirect feedback of T3 and 
T4, as outlined in Fig. 2.2, together with TSH feedback on to the hypothalamus. 

Negative Feedback 

If an increase in y causes a change in x which in turn tends to decrease y, feedback is said to be negative; 
in other words, the signal output induces a response that feeds back to the signal generator to decrease 
its output. This is the most common form of control in physiologic systems, and examples are many. 
For instance, as mentioned earlier, the anterior pituitary releases trophic or stimulating hormones that 
act on peripheral endocrine glands such as the adrenals or thyroid or to gonads to produce hormones 
that act back on the pituitary to decrease the secretion of the trophic hormones. These are examples of 
what is called long-loop feedback (see Fig. 2.2.). (Note: the adjectives long and short reflect the spatial 
distance or proximity of effector and target sites.) The trophic hormones of the pituitary are also regulated 
by feedback action at the level of their releasing factors. Ultrashort-loop feedback is also described. There 
are numerous examples of short-loop feedback as well, the best being the reciprocal relation between 
insulin and blood glucose concentrations, as depicted in Fig. 2.3. In this case, elevated glucose concen- 
tration (and positive rate of change, implying not only proportional but also derivative control) has a 
positive effect on the pancreas, which secretes insulin in response. This has an inhibiting effect on glucose 
metabolism, resulting in a reduction of blood glucose toward a normal concentration; in other words, 
classic negative-feedback control. 
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FIGURE 2.3 The interaction of insulin as an illustration of negative feedback within a hormonal control system. 

Positive Feedback 

If increase in y causes a change in x which tends to increase y, feedback is said to be positive; in other 
words, a further signal output is evoked by the response it induces or provokes. This is intrinsically an 
unstable system, but there are physiologic situations where such control is valuable. In the positive 
feedback situation, the signal output will continue until no further response is required. Suckling provides 
an example; stimulation of nipple receptors by the suckling child provokes an increased oxytocin release 
from the posterior pituitary with a corresponding increase in milk flow. Removal of the stimulus causes 
cessation of oxytocin release. 

Rhythmic Endocrine Control 

Many hormone functions exhibit rhythmically in the form of pulsatile release of hormones. The most 
common is the approximately 24-h cycle (circadian or diurnal rhythm). For instance, blood sampling at 
frequent intervals has shown that ACTH is secreted episodically, each secretory burst being followed 5 to 
10 min later by Cortisol secretion. These episodes are most frequent in the early morning, with plasma 
Cortisol concentrations highest around 7 to 8 am and lowest around midnight. ACTH and Cortisol 
secretion vary inversely, and the parallel circadian rhythm is probably due to a cyclic change in the 
sensitivity of the hypothalamic feedback center to circulating Cortisol. Longer cycles are also known, e.g., 
the infradian menstrual cycle. 

It is clear that such inherent rhythms are important in endocrine communication and control, sug- 
gesting that its physiologic organization is based not only on the structural components of the system 
but also on the dynamics of their interactions. The rhythmic, pulsatile nature of release of many hormones 
is a means whereby time-varying signals can be encoded, thus allowing large quantities of information 
to be transmitted and exchanged rapidly in a way that small, continuous changes in threshold levels 
would not allow. 
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The nervous system, unlike other organ systems, is concerned primarily with signals, information encod- 
ing and processing, and control rather than manipulation of energy. It acts like a communication device 
whose components use substances and energy in processing signals and in reorganizing them, choosing, 
and commanding, as well as in developing and learning. A central question that is often asked is how 
nervous systems work and what are the principles of their operation. In an attempt to answer this 
question, we will, at the same time, ignore other fundamental questions, such as those relating to anatomic 
or neurochemical and molecular aspects. We will concentrate rather on relations and transactions between 
neurons and their assemblages in the nervous system. We will deal with neural signals (encoding and 
decoding), the evaluation and weighting of incoming signals, and the formulation of outputs. A major 
part of this chapter is devoted to higher aspects of the nervous system, such as memory and learning, 
rather than individual systems, such as vision and audition, which are treated extensively elsewhere in 
this Handbook. 

3.1 Definitions 

Nervous systems can be defined as organized assemblies of nerve cells as well as nonnervous cells. Nerve 
cells, or neurons, are specialized in the generation, integration, and conduction of incoming signals from 
the outside world or from other neurons and deliver them to other excitable cells or to effectors such as 
muscle cells. Nervous systems are easily recognized in higher animals but not in the lower species, since 
the defining criteria are difficult to apply. 

A central nervous system (CNS) can be distinguished easily from a peripheral nervous system (PNS), 
since it contains most of the motor and nucleated parts of neurons that innervate muscles and other 
effectors. The PNS contains all the sensory nerve cell bodies, with some exceptions, plus local plexuses, 
local ganglia, and peripheral axons that make up the nerves. Most sensory axons go all the way into the 
CNS, while the remaining sensory axons relay in peripheral plexuses. Motor axons originating in the 
CNS innervate effector cells. 

The nervous system has two major roles: (1) to regulate, acting homeostatically in restoring some 
conditions of the organism after some external stimulus, and (2) to act to alter a preexisting condition 
by replacing it or modifying it. In both cases — regulation or initiation of a process — learning can be 
superimposed. In most species, learning is a more or less adaptive mechanism, combining and timing 
species-characteristic acts with a large degree of evolution toward perfection. 
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The nervous system is a complex structure for which realistic assumptions have led to irrelevant 
oversimplifications. One can break the nervous system down into four components: sensory transducers, 
neurons, axons, and muscle fibers. Each of these components gathers, processes, and transmits informa- 
tion impinging on it from the outside world, usually in the form of complex stimuli. The processing is 
carried out by exitable tissues — neurons, axons, sensory receptors, and muscle fibers. Neurons are the 
basic elements of the nervous system. If put in small assemblies or clusters, they form neuronal assemblies 
or neuronal networks communicating with each other either chemically via synaptic junctions or electri- 
cally via tight junctions. The main characteristics of a cell are the cell body, or soma, which contains the 
nucleus, and a number of processes originating from the cell body, called the dendrites, which reach out 
to surroundings to make contacts with other cells. These contacts serve as the incoming information to 
the cell, while the outgoing information follows a conduction path, the axon. The incoming information 
is integrated in the cell body and generates its action potential at the axon hillock. There are two types 
of outputs that can be generated and therefore two types of neurons: those which generate graded 
potentials that attenuate with distance and those which generated action potentials. The latter travel 
through the axon, a thin, long process that passively passes the action potential or rather a train of action 
potentials without any attenuation (all-or-none effect). A number of action potentials is often called a 
spike train. A threshold built into the hillock, depending on its level, allows or stops the generation of 
the spike train. Axons usually terminate on other neurons by means of synaptic terminals or boutons and 
have properties similar to those of an electric cable with varying diameters and speeds of signal trans- 
mission. Axons can be of two types: myelinated or unmyelinated. In the former case, the axon is sur- 
rounded by a thick fatty material, the myelin sheath, which is interrupted at regular intervals by gaps 
called the nodes ofRanvier. These nodes provide for the saltatory conduction of the signal along the axon. 
The axon makes functional connections with other neurons at synapses on either the cell body, the 
dendrites, or the axons. There exist two kinds of synapses: excitatory and inhibitory, and as the names 
imply, they either increase the firing frequency of the postsynaptic neurons or decrease it, respectively. 

Sensory receptors are specialized cells that, in response to an incoming stimulus, generate a corre- 
sponding electrical signal, a graded receptor potential. Although the mechanisms by which the sensory 
receptors generate receptor potentials are not known exactly, the most plausible scenario is that an external 
stimulus alters the membrane permeabilities. The receptor potential, then, is the change in intracellular 
potential relative to the resting potential. 

It is important to notice here that the term receptor is used in physiology to refer not only to sensory 
receptors but also, in a different sense, to proteins that bind neurotransmitters, hormones, and other 
substances with great affinity and specificity as a first step in starting up physiologic responses. This 
receptor is often associated with nonneural cells that surround it and form a sense organ. The forms of 
energy converted by the receptors include mechanical, thermal, electromagnetic, and chemical energy. 
The particular form of energy to which a receptor is most sensitive is called its adequate stimulus. The 
problem of how receptors convert energy into action potentials in the sensory nerves has been the subject 
of intensive study. In the complex sense organs such as those concerned with hearing and vision, there 
exist separate receptor cells and synaptic junctions between receptors and afferent nerves. In other cases, 
such as the cutaneous sense organs, the receptors are specialized. Where a stimulus of constant strength 
is applied to a receptor repeatedly, the frequency of the action potentials in its sensory nerve declines 
over a period of time. This phenomenon is known as adaptation; if the adaptation is very rapid, then 
the receptors are called phasic, otherwise, they are called tonic. 

Another important issue is the coding of sensory information. Action potentials are similar in all nerves, 
although there are variations in their speed of conduction and other characteristics. However, if the 
action potentials were the same in most cells, then what makes the visual cells sensitive to light and not 
to sound and the touch receptors sensitive to touch and not to smell? And how can we tell if these 
sensations are strong or not? These sensations depend on what is called the doctrine of specific nerve 
energies, which has been questioned over time by several researchers. No matter where a particular sensory 
pathway is stimulated along its course to the brain, the sensation produced is referred to the location of 
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the receptor. This is the law of projections. An example of this law is the "phantom limb," in which an 
amputee complains about an itching sensation in the amputated limb. 

3.2 Functions of the Nervous System 

The basic unit of integrated activity is the reflex arc. This arc consists of a sense organ, afferent neuron, 
one or more synapses in a central integrating station (or sympathetic ganglion), an efferent neuron, and 
an effector. The simplest reflex arc is the monosynaptic one, which has only one synapse between the 
afferent and efferent neurons. With more than one synapse, the reflex arc is called polysynaptic. In each 
of these cases, activity is modified by both spatial and temporal facilitation, occlusion, and other effects. 

In mammals, the concentration between afferent and efferent somatic neurons is found either in the 
brain or in the spinal cord. The Bell-Magendie law dictates that in the spinal cord the dorsal roots are 
sensory, while the ventral roots are motor. The action potential message that is carried by an axon is 
eventually fed to a muscle, to a secretory cell, or to the dendrite of another neuron. If an axon is carrying 
a graded potential, its output is too weak to stimulate a muscle, but it can terminate on a secretory cell 
or dendrite. The latter can have as many as 10,000 inputs. If the endpoint is a motor neuron, which has 
been found experimentally in the case of fibers from the primary endings, then there is a lag between 
the time when the stimulus was applied and when the response is obtained from the muscle. This time 
interval is called the reaction time and in humans is approximately 20 ms for a stretch reflex. The distance 
from the spinal cord can be measured, and since the conduction velocities of both the efferent and afferent 
fibers are known, another important quality can be calculated: the central delay. This delay is the portion 
of the reaction time that was spent for conduction to and from the spinal cord. It has been found that 
muscle spindles also make connections that cause muscle contraction via polysynaptic pathways, while 
the afferents from secondary endings make connections that excite extensor muscles. When a motor 
neuron sends a burst of action potentials to its skeletal muscle, the amount of contraction depends largely 
on the discharge frequency but also on many other factors, such as the history of the load on the muscle 
and the load itself. The stretch error can be calculated from the desired motion minus the actual stretch. 
If this error is then fed back to the motor neuron, its discharge frequency is modified appropriately. This 
corresponds to one of the three feedback loops that are available locally. Another loop corrects for 
overstretching beyond the point that the muscle or tendon may tear. Since a muscle can only contract, 
it must be paired with another muscle {antagonist) in order to effect the return motion. Generally 
speaking, a flexor muscle is paired with an extensor muscle that cannot be activated simultaneously. This 
means that the motor neurons that affect each one of these are not activated at the same time. Instead, 
when one set of motor neurons is activated, the other is inhibited, and vice versa. When movement 
involves two or more muscles that normally cooperate by contracting simultaneously, the excitation of 
one causes facilitation of the other synergistic members via cross-connections. All these networks form 
feedback loops. An engineer's interpretation of how these loops work would be to assume dynamic 
conditions, as is the case in all parts of the nervous system. This has little value in dealing with stationary 
conditions, but it provides for an ability to adjust to changing conditions. 

The nervous system, as mentioned earlier, is a control system of processes that adjust both internal 
and external operations. As humans, we have experiences that change our perceptions of events in our 
environment. The same is true for higher animals, which, besides having an internal environment the 
status of which is of major importance, also share an external environment of utmost richness and variety. 
Objects and conditions that have direct contact with the surface of an animal directly affect the future 
of the animal. Information about changes at some point provides a prediction of possible future status. 
The amount of information required to represent changing conditions increases as the required temporal 
resolution of detail increases. This creates a vast amount of data to be processed by any finite system. 
Considering the fact that the information reaching sensory receptors is too extensive and redundant, as 
well as modified by external interference (noise), the nervous system has a tremendously difficult task 
to accomplish. Enhanced responsiveness to a particular stimulus can be produced by structures that 
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either increase the energy converging on a receptor or increase the effectiveness of coupling of a specific 
type of stimulus with its receptor. Different species have sensory systems that respond to stimuli that are 
important to them for survival. Often one nervous system responds to conditions that are not sensed by 
another nervous system. The transduction, processing, and transmission of signals in any nervous system 
produce a survival mechanism for an organism but only after these signals have been further modified 
by effector organs. Although the nerve impulses that drive a muscle, as explained earlier, are discrete 
events, a muscle twitch takes much longer to happen, a fact that allows for their responses to overlap 
and produce a much smoother output. Neural control of motor activity of skeletal muscle is accomplished 
entirely by the modification of muscle excitation, which involves changes in velocity, length, stiffness, 
and heat production. The important of accurate timing of inputs and the maintenance of this timing 
across several synapses is obvious in sensory pathways of the nervous system. Cells are located next to 
other cells that have overlapping or adjacent receptor or motor fields. The dendrites provide important 
and complicated sites of interactions as well as channels of variable effectiveness for excitatory inputs, 
depending on their position relative to the cell body. Among the best examples are the cells of the medial 
superior olive in the auditory pathway. These cells have two major dendritic trees extending from opposite 
poles of the cell body. One receives synaptic inhibitory input from the ipsilateral cochlear nucleus, the 
other from the contralateral nucleus that normally is an excitatory input. These cells deal with the 
determination of the azimuth of a sound. When a sound is present on the contralateral side, most cells 
are excited, while ipsilateral sounds cause inhibition. It has been shown that the cells can go from complete 
excitation to full inhibition with a difference of only a few hundred milliseconds in arrival time of the 
two inputs. 

The question then arises: How does the nervous system put together the signals available to it so that 
a determination of output can take place? To arrive at an understanding of how the nervous system 
intergrades incoming information at a given moment of time, we must understand that the processes 
that take place depend both on cellular forms and a topologic architecture and on the physiologic 
properties that relate input to output. That is, we have to know the transfer functions or coupling functions. 
Integration depends on the weighting of inputs. One of the important factors determining weighting is 
the area of synaptic contact. The extensive dendrites are the primary integrating structures. Electronic 
spread is the means of mixing, smoothing, attenuating, delaying, and summing postsynaptic potentials. 
The spatial distribution of input is often not random but systematically restricted. Also, the wide variety 
of characteristic geometries of synapses is no doubt important not only for the weighting of different 
combinations of inputs. When repeated stimuli are presented at various intervals at different junctions, 
increasing synaptic potentials are generated if the intervals between them are not too short or too long. 
This increase is due to a phenomenon called facilitation. If the response lasts longer than the interval 
between impulses, such that the second response rises from the residue of the first, then it is temporal 
summation. If, in addition, the response increment due to the second stimulus is larger than the preceding 
one, then it is facilitation. Facilitation is an important function of the nervous system and is found in 
quite different forms and durations ranging from a few milliseconds to tenths of seconds. Facilitation 
may grade from forms of sensitization to learning, especially at long intervals. A special case is the so- 
called posttetanic potentiation that is the result of high-frequency stimulation for long periods of time 
(about 10 seconds). This is an interesting case, since no effects can be seen during stimulation, but 
afterwards, any test stimulus at various intervals creates a marked increase in response up to many times 
more than the "tetanic" stimulus. Antifacilitation is the phenomenon where a decrease of response from 
the neuron is observed at certain junctions due to successive impulses. Its mechanism is less understood 
than facilitation. Both facilitation and antifacilitation may be observed on the same neuron but in different 
functions of it. 

3.3 Representation of Information in the Nervous System 

Whenever information is transferred between different parts of the nervous system, some communication 
paths have to be established, and some parameters of impulse firing relevant to communication must be 
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set up. Since what is communicated is nothing more than impulses — spike trains — the only basic 
variables in a train of events are the number and intervals between spikes. With respect to this, the 
nervous system acts like a pulse-coded analog device, since the intervals are continuously graded. There 
exists a distribution of interval lengths between individual spikes, which in any sample can be expressed 
by the shape of the interval histogram. If one examines different examples, their distributions differ 
markedly. Some histograms look like Poisson distributions; some others exhibit gaussian or bimodal 
shapes. The coefficient of variation — expressed as the standard deviation over the mean — in some 
cases is constant, while in others it varies. Some other properties depend on the sequence of longer and 
shorter intervals than the mean. Some neurons show no linear dependence; some others show positive 
or negative correlations of successive intervals. If a stimulus is delivered and a discharge from the neuron 
is observed, a poststimulus time histogram can be used, employing the onset of the stimulus as a reference 
point and averaging many responses in order to reveal certain consistent features of temporal patterns. 
Coding of information can then be based on the average frequency, which can represent relevant grada- 
tions of the input. Mean frequency is the code in most cases, although no definition of it has been given 
with respect to measured quantities, such as averaging time, weighting functions, and forgetting functions. 
Characteristic transfer functions have been found, which suggests that there are several distinct coding 
principles in addition to the mean frequency. Each theoretically possible code becomes a candidate code 
as long as there exists some evidence that is readable by the system under investigation. Therefore, one 
has to first test for the availability of the code by imposing a stimulus that is considered "normal." After 
a response has been observed, the code is considered to be available. If the input is then changed to 
different levels of one parameter and changes are observed at the postsynaptic level, the code is called 
readable. However, only if both are formed in the same preparation and no other parameter is available 
and readable can the code be said to be the actual code employed. Some such parameters are 

1. Time of firing 

2. Temporal pattern 

3. Number of spikes in the train 

4. Variance of interspike intervals 

5. Spike delays or latencies 

6. Constellation code 

The latter is a very important parameter, especially when used in conjunction with the concept of 
receptive fields of units in the different sensory pathways. The unit receptors do not need to have highly 
specialized abilities to permit encoding of a large number of distinct stimuli. Receptive fields are topo- 
graphic and overlap extensively. Any given stimulus will excite a certain constellation of receptors and is 
therefore encoded in the particular set that is activated. A large degree of uncertainty prevails and requires 
the brain to operate probabilistically. In the nervous system there exists a large amount of redundancy, 
although neurons might have different thresholds. It is questionable, however, if these units are entirely 
equivalent, although they share parts of their receptive fields. The nonoverlapping parts might be of 
importance and critical to sensory function. On the other hand, redundancy does not necessarily mean 
unspecified or random connectivity. Rather, it allows for greater sensitivity and resolution, improvement 
of signal-to-noise ratio, while at the same time it provides stability of performance. 

Integration of large numbers of converging inputs to give a single output can be considered as an 
averaging or probabilistic operation. The "decisions" made by a unit depend on its inputs, or some 
intrinsic states, and reaching a certain threshold. This way every unit in the nervous system can make a 
decision when it changes from one state to a different one. A theoretical possibility also exists that a mass 
of randomly connected neurons may constitute a trigger unit and that activity with a sharp threshold 
can spread through such a mass redundancy. Each part of the nervous system, and in particular the 
receiving side, can be thought of as a filter. Higher-order neurons do not merely pass their information 
on, but instead they use convergence from different channels, as well as divergence of the same channels 
and other processes, in order to modify incoming signals. Depending on the structure and coupling 
functions of the network, what gets through is determined. Similar networks exist at the output side. 
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They also act as filters, but since they formulate decisions and commands with precise spatiotemporal 
properties, they can be thought of as pattern generators. 

3.4 Lateral Inhibition 

This discussion would be incomplete without a description of a very important phenomenon in the 
nervous system. This phenomenon, called lateral inhibition, used by the nervous system to improve spatial 
resolution and contrast. The effectiveness of this type of inhibition decreases with distance. In the retina, 
for example, lateral inhibition is used extensively in order to improve contrast. As the stimulus approaches 
a certain unit, it first excites neighbors of the recorded cell. Since these neighbors inhibit that unit, it 
responds by a decrease in firing frequency. If the stimulus is exactly over the recorded unit, this unit is 
excited and fires above its normal rate, and as the stimulus moves out again, the neighbors are excited, 
while the unit under consideration fires less. If we now examine the output of all the units as a whole 
and at one while half the considered array is stimulated and the other half is not, we will notice that at 
the point of discontinuity of the stimulus going from stimulation to nonstimulation, the firing frequencies 
of the two halves have been differentiated to the extreme at the stimulus edge, which has been enhanced. 
The neuronal circuits responsible for lateral shifts are relatively simple. Lateral inhibition can be consid- 
ered to give the negative of the second spatial derivative of the input stimulus. A second layer of neurons 
could be constructed to perform this spacial differentiation on the input signal to detect the edge only. 
It is probably lateral inhibition that explains the psychophysical illusion known as Mach bands. It is 
probably the same principle that operates widely in the nervous system to enhance the sensitivity to 
contrast in the visual system in particular and in all other modalities in general. Through the years, 
different models have been developed to describe lateral inhibition mathematically, and various methods 
of analysis have been employed. Such methods include 

Functional notations 
Graphic solutions 
Tabular solution 
Taylor's series expansions 
Artificial neural network modeling 

These models include both one-dimensional examination of the phenomenon and two-dimensional 
treatment, where a two-dimensional array is used as a stimulus. This two-dimensional treatment is 
justified because most of the sensory receptors of the body form two-dimensional maps (receptive fields). 
In principle, if a one-dimensional lateral inhibition system is linear, one can extend the analysis to two 
dimensions by means of superposition. The two-dimensional array can be thought of as a function f[x, y), 
and the lateral inhibition network itself is embodied in a separate N x N array, the central square of 
which has a positive value and can be thought of as a direct input from an incoming axon. The 
surrounding squares have negative values that are higher than the corner values, which are also negative. 
The method consists of multiplying the input signal values f[x, y) and their contiguous values by the 
lateral inhibitory network's weighting factors to get a corresponding g{x, y). Figure 3. 1 presents an example 
of such a process. The technique illustrated here is used in the contrast enhancement of photographs. 
The objective is the same as that of the nervous system: to improve image sharpness without introducing 
too much distortion. This technique requires storage of each picture element and lateral "inhibitory" 
interactions between adjacent elements. Since a picture may contain millions of elements, high-speed 
computers with large-scale memories are required. 

At a higher level, similar algorithms can be used to evaluate decision-making mechanisms. In this 
case, many inputs from different sensory systems are competing for attention. The brain evaluates each 
one of the inputs as a function of the remaining ones. One can picture a decision-making mechanism 
resembling a "locator" of stimulus peaks. The final output depends on what weights are used at the inputs 
of a push-pull mechanism. Thus a decision can be made depending on the weights as individual's brain 
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FIGURE 3.1 An example of two-dimensional lateral inhibition. On the left, the 3x3 array corresponds to the 
values of the synaptic junctions weighting coefficients. For simplicity, the corner weights are assumed to be zero. 
g(x, y) represents the output matrix after lateral inhibition has been applied to the input matrix. 

is applying to the incoming information about a situation under consideration. The most important 
information is heavily weighted, while the rest is either totally masked or weighted very lightly. 

3.5 Higher Functions of the Nervous System 
Pattern Recognition 

One way of understanding human perception is to study the mechanism of information processing in 
the brain. The recognition of patterns of sensory input is one of the functions of the brain, a task 
accomplished by neuronal circuits, the feature extractors. Although such neuronal information is more 
likely to be processed globally by a large number of neurons, in animals, single-unit recording is one of 
the most powerful tools in the hands of the physiologist. Most often, the concept of the receptive field is 
used as a method of understanding sensory information processing. In the case of the visual system, one 
could call the receptive field a well-defined region of the visual field which, when stimulated, will change 
the firing rate of a neuron in the visual pathway. The response of that neuron will usually depend on the 
distribution of light in the receptive field. Therefore, the information collected by the brain from the 
outside world is transformed into spatial as well as temporal patterns of neuronal activity. 

The question often asked is how do we perceive and recognize faces, objects, and scenes. Even in those 
cases where only noisy representations exist, we are still able to make some inference as to what the 
pattern represents. Unfortunately, in humans, single-unit recording, as mentioned above, is impossible. 
As a result, one has to use other kinds of measurements, such as evoked potentials (EPs). Although 
physiologic in nature, EPs are still far away from giving us information at the neuronal level. Yet EPs 
have been used extensively as a way of probing the human (and animal) brains because of their nonin- 
vasive character. EPs can be considered to be the result of integrations of the neuronal activity of many 
neurons some place in the brain. This gross potential can then be used as a measure of the response of 
the brain to sensory input. 

The question then becomes: Can we somehow use this response to influence the brain in producing 
patterns of activity that we want? None of the efforts of the past closed this loop. How do we explain 
then the phenomenon of selective attention by which we selectively direct our attention to something of 
interest and discard the rest? And what happens with the evolution of certain species that change 
appearance according to their everyday needs? All these questions tend to lead to the fact that somewhere 
in the brain there is a loop where previous knowledge or experience is used as a feedback to the brain 
itself. This feedback then modifies the ability of the brain to respond in a different way to the same 
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FIGURE 3.2 An ALOPEX system. The stimulus is presented on the CRT. The observer or any pattern-recognition 
device (PRD) faces the CRT; the subject's response is sent to the ALOPEX interface unit, where it is recorded and 
integrated, and the final response is sent to the computer. The computer calculates the values of the new pattern to 
be presented on the CRT according to the ALOPEX algorithm, and the process continues until the desired pattern 
appears on the CRT. At this point, the response is considered to be optimal and the process stops. 

stimulus the next time it is presented. In a way, then, the brain creates mental "images" independent of 
the stimulus which tend to modify the representation of the stimulus in the brain. 

This section describes some efforts in which different methods have been used in trying to address 
the difficult task of feedback loops in the brain. However, no attempt will be made to explain or even 
postulate where these feedback loops might be located. If one considers the brain as a huge set of neural 
nets, then one question has been debated for many years: What is the role of the individual neuron in 
the net, and what is the role of each network in the holistic process of the brain? More specifically, does 
the neuron act as an analyzer or a detector of specific features, or does it merely reflect the characteristic 
response of a population of cells of which it happens to be a member? What invariant relationships exist 
between sensory input and the response of a single neuron, and how much can be "read" about the 
stimulus parameters from the record of a single EP? In turn, then, how much feedback can one use from 
a single EP in order to influence the stimulus, and how successful can that influence be? Many physiol- 
ogists express doubts that simultaneous observations of large numbers of individual neuronal activities 
can be readily interpreted. The main question we are asking is: Can a feedback process influence and 
modulate the stimuli patterns so that they appear optimal? If this is proven to be true, it would mean 
that we can reverse the pattern-recognition process, and instead of recognizing a pattern, we would be 
able to create a pattern from a vast variety of possible patterns. It would be like creating a link between 
our brain and a computer; equivalent to a brain-computer system network. Figure 3.2 is a schematic 
representation of such a process involved in what we call the feedback loop of the system. The pattern- 
recognition device (PRD) is connected to an ALOPEX system (a computer algorithm and an image 
processor in this case) and faces a display monitor where different-intensity patterns can be shown. Thin 
arrows representing response information and heavy arrows representing detailed pattern information 
are generated by the computer and relayed by the ALOPEX system to the monitor. ALOPEX is a set of 
algorithms described in detail elsewhere in this handbook. If this kind of arrangement is used for the 
determination of visual receptive fields of neurons, then the PRD is nothing more than the brain of an 
experimental animal. This way the neuron under investigation does its own selection of the best stimulus 
or trigger feature and reverses the role of the neuron from being a feature extractor to becoming a feature 
generator, as mentioned earlier. The idea is to find the response of the neuron to a stimulus and use this 
response as a positive feedback in the directed evaluation of the initially random pattern. Thus the cell 
involved filters out the key trigger features from the stimulus and reinforces them with the feedback. 

As a generalization of this process, one might consider that a neuron N receives a visual input from 
a pattern P which is transmitted in a modified form P' to an analyzer neuron AN (or even a complex of 
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FIGURE 3.3 Diagammatic representation of the ALOPEX "inverse" pattern-recognition scheme. Each neuron 
represents a feature analyzer that responds to the stimulus with a scalar quantity R called the response. R is then fed 
back to the system, and the pattern is modified accordingly. This process continues until there is a close correlation 
between the desired output and the original pattern. 

neurons), as shown in Fig. 3.3. The analyzer responds with a scalar variable _R that is then fed back to 
the system, and the pattern is modified accordingly. The process continues in small steps until there is 
an almost perfect correlation between the original pattern (template) and the one that neuron N indirectly 
created. This integrator sends the response back to the original modifier. The integrator need not be a 
linear summator. It could take any nonlinear form, a fact that is a more realistic representation of the 
visual cortex. One can envision the input patterns as templates preexisting in the memory of the system, 
a situation that might come about with visual experience. For a "naive" system, any initial pattern will 
do. As experience is gained, the patterns become less random. If one starts with a pattern that has some 
resemblance to one of the preexisting patterns, evolution will take its course. In nature, there might exist 
a mechanism similar to that of ALOPEX. By filtering the characteristics most important for the survival 
of the species, changes would be triggered. Perception, therefore, could be considered to be an interaction 
between sensory inputs and past experience in the form of templates stored in the memory of the perceiver 
and specific to the perceiver's needs. These templates are modifiable with time and adjusted accordingly 
to the input stimuli. With this approach, the neural nets and ensembles of nets under observation generate 
patterns that describe their thinking and memory properties. The normal flow of information is reversed 
and controls the afferent systems. 

The perception processes as well as feature extraction or suppression of images or objects can be 
ascribed to specific neural mechanisms due to some sensory input or even due to some "wishful thinking" 
of the PRD. If it is true that the association cortex is affecting the sensitivity of the sensory cortex, then 
an ALOPEX mechanism is what one needs to close the loop for memory and learning. 

Memory and Learning 

If we try to define what memory is, we will face the fact that memory is not a single mental faculty but 
is rather composed of multiple abilities mediated by separate and distinct brain systems. Memory for a 
recent event can be expressed explicitly as a conscious recollection or implicitly as a facilitation of test 
performance without conscious recollection. The major distinction between these two memories is that 
explicit or declarative memory depends on limbic and diencephalic structures and provides the basis for 
recollection of events, while implicit or nondeclarative memory supports skills and habit learning, single 
conditioning, and the well-researched phenomenon of priming. 
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Declarative memory refers to memory of recent events and is usually assessed by tests of recall or 
recognition for specific single items. When the list of items becomes longer, a subject not only learns 
about each item on the list but also makes associations about what all these items have in common; i.e., 
the subject learns about the category that the items belong to. Learning leads to changes that increase or 
decrease the effectiveness of impulses arriving at the junctions between neurons, and the cumulative 
effect of these changes constitutes memory. Very often a particular pattern of neural activity leads to a 
result that occurs some time after the activity has ended. Learning then requires some means of relating 
the activity that is to be changed to the evaluation that can be made only by the delayed consequence. 
This phenomenon in physics is called hysteresis and refers to any modifications of future actions due to 
past actions. Learning then could be defined as change in any neuronal response resulting from previous 
experiences due to an external stimulus. Memory, in turn, would be the maintenance of these changes 
over time. The collection of neural changes representing memory is commonly known as the engram, 
and a major part of recent work has been to identify and locate engrams in the brain, since specific parts 
of the nervous system are capable of specific types of learning. The view of memory that has recently 
emerged is that information storage is tied to specific processing areas that are engaged during learning. 
The brain is organized so that separate regions of neocortex simultaneously carry out computations on 
specific features or characteristics of the external stimulus, no matter how complex that stimulus might 
be. If the brain learns specific properties or features of the stimulus, then we talk about the nonassociative 
memory. Associated with this type of learning is the phenomenon of habituation, in which if the same 
stimulus is presented repeatedly, the neurons respond less and less, while the introduction of a new 
stimulus increases the sensitization of the neuron. If the learning includes two related stimuli, then we 
talk about associative learning. This type of learning includes two types: classic conditioning and operant 
conditioning. The first deals with relationships among stimuli, while the latter deals with the relationship 
of the stimulus to the animal's own behavior. In humans, there exist two types of memory: short-term 
and long-term memories. The best way to study any physiologic process in humans, and especially memory, 
is to study its pathology. The study of amnesia has provided strong evidence distinguishing between these 
types of memory. Amnesic patients can keep a short list of numbers in mind for several minutes if they 
pay attention to the task. The difficulty comes when the list becomes longer, especially if the amount to 
be learned exceeds the brain capacity of what can be held in immediate memory. It could be that this 
happens because more systems have to be involved and that temporary information storage may occur 
within each brain area where stable changes in synaptic efficacy can eventually develop. Plasticity within 
existing pathways can account for most of the observations, and short-term memory occurs too quickly 
for it to require any major modifications of neuronal pathways. The capacity of long-term memory 
requires the integrity of the medial temporal and diencephalic regions in conjunction with neurons for 
storage of information. Within the domain of long-term memory, amnesic patients demonstrate intact 
learning and retention of certain motor, perceptual, and cognitive skills and intact priming effects. These 
patients do not exhibit any learning deficits but have no conscious awareness of prior study sessions or 
recognition of previously presented stimuli. 

Priming effects can be tested by presenting words and then providing either the first few letters of the 
word or the last part of the word for recognition by the patient. Normal subjects, as expected, perform 
better than amnesic subjects. However, if these patients are instructed to "read" the incomplete word 
instead of memorizing it, then they perform as well as the normal individuals. Also, these amnesic patients 
perform well if words are cued by category names. Thus priming effects seem to be independent of the 
processes of recall and recognition memory, which is also observed in normal subjects. All this evidence 
supports the notion that the brain has organized its memory functions around fundamentally different 
information storage systems. In perceiving a word, a preexisting array of neurons is activated that have 
concurrent activities that produce perception, and priming is one of these functions. 

Memory is not fixed immediately after learning but continues to grow toward stabilization over a 
period of time. This stabilization is called consolidation of memory. Memory consolidation is a dynamic 
feature of long-term memory, especially the declarative memory, but it is neither an automatic process 
with fixed lifetime nor is it determined at the time of learning. It is rather a process of reorganization of 
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stored information. As time passes, some not yet consolidated memories fade out by remodeling the 
neural circuitry that is responsible for the original representation or by establishing new representations, 
since the original one can be forgotten. 

The problems of learning and memory are studied continuously and with increased interest these 
days, especially because artificial systems such as neural networks can be used to mimic functions of the 
nervous system. 
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David Marr, an early pioneer in computer vision, defined vision as extracting "... from images of the 
external world, a description that is useful for the viewer and not cluttered with irrelevant information" 
[Marr, 1982]. Advances in computers and video technology in the past decades have created the expec- 
tation that artificial vision should be realizable. The nontriviality of the task is evidenced by the continuing 
proliferation of new and different approaches to computer vision without any observable application in 
our everyday lives. Actually, computer vision is already offering practical solutions in industrial assembly 
and inspection, as well as for military and medical applications, so it seems we are beginning to master 
some of the fundamentals. However, we have a long way to go to match the vision capabilities of a 4-year- 
old child. In this chapter we will explore what is known about how nature has succeeded at this formidable 
task — that of interpreting the visual world. 

4.1 Fundamentals of Vision Research 

Research into biologic vision systems has followed several distinct approaches. The oldest is psychophysics, 
in which human and animal subjects are presented with visual stimuli and their responses recorded. 
Important early insights also were garnered by correlating clinical observations of visual defects with known 
neuroanatomic injury. In the past 50 years, a more detailed approach to understanding the mechanisms of 
vision has been undertaken by inserting small electrodes deep within the living brain to monitor the electrical 
activity of individual neurons and by using dyes and biochemical markers to track the anatomic course of 
nerve tracts. This research has led to a detailed and coherent, if not complete, theory of a visual system 
capable of explaining the discrimination of form, color, motion, and depth. This theory has been confirmed 
by noninvasive radiologic techniques that have been used recently to study the physiologic responses of the 
visual system, including positron emission tomography [Zeki et al., 1991] and functional magnetic reso- 
nance imaging [Belliveau et al., 1992; Cohen and Bookheimer, 1994], although these noninvasive techniques 
provide far less spatial resolution and thus can only show general regions of activity in the brain. 

4.2 A Modular View of the Vision System 
The Eyes 

Movement of the eyes is essential to vision, not only allowing rapid location and tracking of objects but 
also preventing stationary images on the retina, which are essentially invisible. Continual movement of 
the image on the retina is essential to the visual system. 
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The eyeball is spherical and therefore free to turn in both the horizontal and vertical directions. Each 
eye is rotated by three pairs of mutually opposing muscles, innervated by the oculomotor nuclei in the 
brainstem. The eyes are coordinated as a pair in two useful ways: turning together to find and follow 
objects and turning inward to allow adjustment for parallax as objects become closer. The latter is called 
convergence. 

The optical portion of the eye, which puts an image on the retina, is closely analogous to a photographic 
or television camera. Light enters the eye, passing through a series of transparent layers — the cornea, 
the aqueous humor, the lens, and the vitreous body — to eventually project on the retina. 

The cornea, the protective outer layer of the eye, is heavily innervated with sensory neurons, triggering 
the blink reflex and tear duck secretion in response to irritation. The cornea is also an essential optical 
element, supplying two-thirds of the total refraction in the eye. Behind the cornea is a clear fluid, the 
aqueous humor, in which the central aperture of the iris, the pupil, is free to constrict or dilate. The two 
actions are accomplished by opposing sets of muscles. 

The lens, a flexible transparent object behind the iris, provides the remainder of refraction necessary 
to focus an image on the retina. The ciliary muscles surrounding the lens can increase the lens' curvature, 
thereby decreasing its focal length and bringing nearer objects into focus. This is called accommodation. 
When the ciliary muscles are at rest, distant objects are in focus. There are no contradictory muscles to 
flatten the lens. This depends simply on the elasticity of the lens, which decreases with age. Behind the 
lens is the vitreous humor, consisting of a semigelatinous material filling the volume between the lens 
and the retina. 

The Retina 

The retina coats the back of the eye and is therefore spherical, not flat, making optical magnification 
constant at 3.5 degrees of scan angle per millimeter. The retina is the neuronal front end of the visual 
system, the image sensor. In addition, it accomplishes the first steps in edge detection and color analysis 
before sending the processed information along the optic nerve to the brain. The retina contains five 
major classes of cells, roughly organized into layers. The dendrites of these cells each occupy no more 
than 1 to 2 mm 2 in the retina, limiting the extent of spatial integration from one layer of the retina to 
the next. 

First come the receptors, which number approximately 125 million in each eye and contain the light- 
sensitive pigments responsible for converting photons into chemical energy. Receptor cells are of two 
general varieties: rods and cones. The cones are responsible for the perception of color, and they function 
only in bright light. When the light is dim, only rods are sensitive enough to respond. Exposure to a 
single photon may result in a measurable increase in the membrane potential of a rod. This sensitivity 
is the result of a chemical cascade, similar in operation to the photo multiplier tube, in which a single 
photon generates a cascade of electrons. All rods use the same pigment, whereas three different pigments 
are found in three separate kinds of cones. 

Examination of the retina with an otoscope reveals its gross topography. The yellow circular area 
occupying the central 5 degrees of the retina is called the macula lutea, within which a small circular pit 
called the fovea may be seen. Detailed vision occurs only in the fovea, where a dense concentration of 
cones provides visual activity to the central 1 degree of the visual field. 

On the inner layer of the retina one finds a layer of ganglion cells, whose axons make up the optic 
nerve, the output of the retina. They number approximately 1 million, or less than 1% of the number 
of receptor cells. Clearly, some data compression has occurred in the space between the receptors and 
the ganglion cells. Traversing this space are the bipolar cells, which run from the receptors through the 
retina to the ganglion cells. Bipolar cells exhibit the first level of information processing in the visual 
system; namely, their response to light on the retina demonstrates "center/surround" receptive fields. By 
this I mean that a small dot on the retina elicits a response, while the area surrounding the spot elicits 
the opposite response. If both the center and the surround are illuminated, the net result is no response. 
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Thus bipolar cells respond only at the border between dark and light areas. Bipolar cells come in two 
varieties, on-center and off-center, with the center respectively brighter or darker than the surround. 

The center response of bipolar cells results from direct contact with the receptors. The surround 
response is supplied by the horizontal cells, which run parallel to the surface of the retina between the 
receptor layer and the bipolar layer, allowing the surrounding area to oppose the influence of the center. 
The amacrine cells, a final cell type, also run parallel to the surface but in a different layer, between the 
bipolar cells and the ganglion cells, and are possibly involved in the detection of motion. 

Ganglion cells, since they are triggered by bipolar cells, also have center/surround receptive fields and 
come in two types, on-center and off-center. On-center ganglion cells have a receptive field in which 
illumination of the center increases the firing rate and a surround where it decreases the rate. Off-center 
ganglion cells display the opposite behavior. Both types of ganglion cells produce little or no change in 
firing rate when the entire receptive field is illuminated, because the center and surround cancel each 
other. As in many other areas of the nervous system, the fibers of the optic nerve use frequency encoding 
to represent a scalar quantity. 

Multiple ganglion cells may receive output from the same receptor, since many receptive fields overlap. 
However, this does not limit overall spatial resolution, which is maximum in the fovea, where two points 
separated by 0.5 minutes of arc may be discriminated. This separation corresponds to a distance on the 
retina of 2.5 urn, which is approximately the center-to-center spacing between cones. Spatial resolution 
falls off as one moves away from the fovea into the peripheral vision, where resolution is as low as 1 degree 
of arc. 

Several aspects of this natural design deserve consideration. Why do we have center/surround receptive 
fields? The ganglion cells, whose axons make up the optic nerve, do not fire unless there is meaningful 
information, i.e., a border, falling within the receptive field. It is the edge of a shape we see rather than 
its interior. This represents a form of data compression. Center/surround receptive fields also allow for 
relative rather than absolute measurements of color and brightness. This is essential for analyzing the 
image independent of lighting conditions. Why do we have both on-center and off-center cells? Evidently, 
both light and dark are considered information. The same shape is detected whether it is lighter or darker 
than the background. 

Optic Chiasm 

The two optic nerves, from the left and right eyes, join at the optic chiasm, forming a hemidecussation, 
meaning that half the axons cross while the rest proceed uncrossed. The resulting two bundles of axons 
leaving the chiasm are called the optic tracts. The left optic tract contains only axons from the left half 
of each retina. Since the images are reversed by the lens, this represents light from the right side of the 
visual field. The division between the right and left optic tracts splits the retina down the middle, bisecting 
the fovea. The segregation of sensory information into the contralateral hemispheres corresponds to the 
general organization of sensory and motor centers in the brain. 

Each optic tract has two major destinations on its side of the brain: (1) the superior colliculus and 
(2) the lateral geniculate nucleus (LGN). Although topographic mapping from the retina is scrambled 
within the optic tract, it is reestablished in both major destinations so that right, left, up, and down in 
the image correspond to specific directions within those anatomic structures. 

Superior Colliculus 

The superior colliculus is a small pair of bumps on the dorsal surface of the midbrain. Another pair, the 
inferior colliculus, is found just below it. Stimulation of the superior colliculus results in contralateral eye 
movement. Anatomically, output tracts from the superior colliculus run to areas that control eye and 
neck movement. Both the inferior and superior colliculi are apparently involved in locating sound. In 
the bat, the inferior colliculus is enormous, crucial to that animal's remarkable echolocation abilities. 
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The superior colliculus processes information from the inferior colliculus, as well as from the retina, 
allowing the eyes to quickly find and follow targets based on visual and auditory cues. 

Different types of eye movements have been classified. The saccade (French, for "jolt") is a quick motion 
of the eyes over a significant distance. The saccade is how the eyes explore an image, jumping from 
landmark to landmark, rarely stopping in featureless areas. Nystagmus is the smooth pursuit of a moving 
image, usually with periodic backward saccades to lock onto subsequent points as the image moves by. 
Microsaccad.es are small movements, several times per second, over 1 to 2 minutes of arc in a seemingly 
random direction. Microsaccades are necessary for sight; their stabilization leads to effective blindness. 



LGN 

The thalamus is often called "the gateway to the 
cortex" because it processes much of the sensory 
information reaching the brain. Within the thala- 
mus, we find the lateral geniculate nucleus (LGN), a 
peanut-sized structure that contains a single synap- 
tic stage in the major pathway of visual information 
to higher centers. The LGN also receives informa- 
tion back from the cortex, so-called reentrant con- 
nections, as well as from the nuclei in the brainstem 
that control attention and arousal. 

The cells in the LGN are organized into three 
pairs of layers. Each pair contains two layers, one 
from each eye. The upper two pairs consist of par- 
vocellular cells (P cells) that respond with preference 
to different colors. The remaining lower pair con- 
sists of magnocellular cells (M cells) with no color 
preference (Fig. 4.1). The topographic mapping is 
identical for all six layers; i.e., passing through the 
layers at a given point yields synapses responding to 
a single area of the retina. Axons from the LGN 
proceed to the primary visual cortex in broad bands, 
the optic radiations, preserving this topographic 
mapping and displaying the same center/surround 
response as the ganglion cells. 



(1) COLOR 

LGN(P-cell) — ► VI (blobs) ► V4 

\ / 

V2 (thin) 

(2) MOTION 

LGN(M-cell) — ► VI (layer-4B) ► V5 

\ / 

V2 (thick) 

(3) FORM WITH COLOR 

LGN(P-cell) — ► VI (inter-blob) +-V4 

\ / 

V2 (pale) 

(4) FORM WITH MOTION 

LGN(M-cell) — ► VI (layer-4B) ^V3 

\ / 

V2 (thick) 

FIGURE 4.1 Visual pathways to cortical areas show- 
ing the separation of information by type. The lateral 
geniculate nucleus (LGN) and areas VI and V2 act as 
gateways to more specialized higher areas. 



Area VI 

The LGN contains approximately 1.5 million cells. By comparison, the primary visual cortex, or striate 
cortex, which receives the visual information from the LGN, contains 200 million cells. It consists of a 
thin (2-mm) layer of gray matter (neuronal cell bodies) over a thicker collection of white matter (myeli- 
nated axons) and occupies a few square inches of the occipital lobes. The primary visual cortex has been 
called area 17 from the days when the cortical areas were first differentiated by their cytoarchitectonics 
(the microscopic architecture of their layered neurons). In modern terminology, the primary visual cortex 
is often called visual area 1, or simply VI. 

Destroying any small piece of VI eliminates a small area in the visual field, resulting in scotoma, a local 
blind spot. Clinical evidence has long been available that a scotoma may result from injury, stroke, or 
tumor in a local part of VI. Between neighboring cells in Vl's gray matter, horizontal connections are 
at most 2 to 5 mm in length. Thus, at any given time, the image from the retina is analyzed piecemeal 
in VI. Topographic mapping from the retina is preserved in great detail. Such mapping is seen elsewhere 
in the brain, such as in the somatosensory cortex [Mountcastle, 1957]. Like all cortical surfaces, VI is a 
highly convoluted sheet, with much of its area hidden within its folds. If unfolded, VI would be roughly 
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pear-shaped, with the top of the pear processing information from the fovea and the bottom of the pear 
processing the peripheral vision. Circling the pear at a given latitude would correspond roughly to circling 
the fovea at a fixed radius. 

The primary visual cortex contains six layers, numbered 1 through 6. Distinct functional and anatomic 
types of cells are found in each layer. Layer 4 contains neurons that receive information from the LGN. 
Beyond the initial synapses, cells demonstrate progressively more complex responses. The outputs of VI 
project to an area known as visual area 2 (V2), which surrounds VI, and to higher visual areas in the 
occipital, temporal, and parietal lobes as well as to the superior colliculus. VI also sends reentrant 
projections back to the LGN. Reentrant projections are present at almost every level of the visual system 
[Felleman and Essen, 1991; Edelman, 1978]. 

Cells in VI have been studied extensively in animals by inserting small electrodes into the living brain 
(with surprisingly little damage) and monitoring the individual responses of neurons to visual stimuli. 
Various subpopulations of cortical cells have thus been identified. Some, termed simple cells, respond to 
illuminated edges or bars at specific locations and at specific angular orientations in the visual field. The 
angular orientation must be correct within 10 to 20 degrees for the particular cell to respond. All 
orientations are equally represented. Moving the electrode parallel to the surface yields a smooth rotation 
in the orientation of cell responses by about 10 degrees for each 50 um that the electrode is advanced. 
This rotation is subject to reversals in direction, as well as "fractures," or sudden jumps in orientation. 

Other cells, more common than simple cells, are termed complex cells. Complex cells respond to a set 
of closely spaced parallel edges within a particular receptive field. They may respond specifically to 
movement perpendicular to the orientation of the edge. Some prefer one direction of movement to the 
other. Some complex and simple cells are end-stopped, meaning they fire only if the illuminated bar or 
edge does not extend too far. Presumably, these cells detect corners, curves, or discontinuities in borders 
and lines. End-stopping takes place in layers 2 and 3 of the primary visual cortex. From the LGN through 
the simple cells and complex cells, there appears to be a sequential processing of the image. It is probable 
that simple cells combine the responses of adjacent LGN cells and that complex cells combine the 
responses of adjacent simple cells. 

A remarkable feature in the organization of VI is binocular convergence, in which a single neuron 
responds to identical receptive fields in both eyes, including location, orientation, and directional sensi- 
tivity to motion. It does not occur in the LGN, where axons from the left and right eyes are still segregated 
into different layers. Surprisingly, binocular connections to neurons are present in VI at birth. Some 
binocular neurons are equally weighted in terms of responsiveness to both eyes, while others are more 
sensitive to one eye than to the other. One finds columns containing the latter type of cells in which one 
eye dominates, called ocular dominance columns, in uniform bands approximately 0.5 mm wide every- 
where in VI. Ocular dominance columns occur in adjacent pairs, one for each eye, and are prominent 
in animals with forward-facing eyes, such as cats, chimpanzees, and humans. They are nearly absent in 
rodents and other animals whose eyes face outward. 

The topography of orientation-specific cells and of ocular dominance columns is remarkably uniform 
throughout VI, which is surprising because the receptive fields near the fovea are 10 to 30 times smaller 
than those at the periphery. This phenomenon is called magnification. The fovea maps to a greater relative 
distance on the surface of VI than does the peripheral retina, by as much as 36-fold [Daniel and 
Whitteridge, 1961]. In fact, the majority of VI processes only the central 10 degrees of the visual field. 
Both simple and complex cells in the foveal portion can resolve bars as narrow as 2 minutes of arc. 
Toward the periphery, the resolution falls off to 1 degree of arc. 

As an electrode is passed down through the cortex perpendicular to the surface, each layer demonstrates 
receptive fields of characteristic size, the smallest being at layer 4, the input layer. Receptive fields are 
larger in other layers due to lateral integration of information. Passing the electrode parallel to the surface 
of the cortex reveals another important uniformity to VI. For example, in layer 3, which sends output 
fibers to higher cortical centers, one must move the electrode approximately 2 mm to pass from one 
collection of receptive fields to another that does not overlap. An area approximately 2 mm across thus 
represents the smallest unit piece of VI, i.e., that which can completely process the visual information. 
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Indeed, it is just the right size to contain a complete set of orientations and more than enough to contain 
information from both eyes. It receives a few tens of thousands of fibers from the LGN, produces perhaps 
50,000 output fibers, and is fairly constant in cytoarchitectonics whether at the center of vision, where 
it processes approximately 30 minutes of arc, or at the far periphery, where it processes 7 to 8 degrees of arc. 

The topographic mapping of the visual field onto the cortex suffers an abrupt discontinuity between 
the left and right hemispheres, and yet our perception of the visual scene suffers no obvious rift in the 
midline. This is due to the corpus collousum, an enormous tract containing at least 200 million axons, 
that connects the two hemispheres. The posterior portion of the corpus collousum connects the two 
halves of VI, linking cells that have similar orientations and whose receptive fields overlap in the vertical 
midline. Thus a perceptually seamless merging of left and right visual fields is achieved. Higher levels of 
the visual system are likewise connected across the corpus collousum. This is demonstrated, for example, 
by the clinical observation that cutting the corpus collousum prevents a subject from verbally describing 
objects in the left field of view (the right hemisphere). Speech, which normally involves the left hemi- 
sphere, cannot process visual objects from the right hemisphere without the corpus collousum. 

By merging the information from both eyes, VI is capable of analyzing the distance to an object. Many 
cues for depth are available to the visual system, including occlusion, parallax (detected by the convergence 
of the eyes), optical focusing of the lens, rotation of objects, expected size of objects, shape based on 
perspective, and shadow casting. Stereopsis, which uses the slight difference between images due to the 
parallax between the two eyes, was first enunciated in 1838 by Sir Charles Wheatstone and its probably 
the most important cue [Wheatstone, 1838]. Fixating on an object causes it to fall on the two foveas. 
Other objects that are nearer become outwardly displaced on the two retinas, while objects that are farther 
away become inwardly displaced. About 2 degrees of horizontal disparity is tolerated, with fusion by the 
visual system into a single object. Greater horizontal disparity results in double vision. Almost no vertical 
displacement (a few minutes of arc) is tolerated. Physiologic experiments have revealed a particular class 
of complex cells in VI which are disparity tuned. They fall into three general classes. One class fires only 
when the object is at the fixation distance, another only when the object is nearer, and a third only when 
it is farther away [Poggio and Talbot, 1981]. Severing the corpus collousum leads to a loss of stereopsis 
in the vertical midline of the visual field. 

When the inputs to the two retinas cannot be combined, one or the other image is rejected. This 
phenomenon is known as retinal rivalry and can occur in a piecewise manner or can even lead to blindness 
in one eye. The general term amblyopia refers to the partial or complete loss of eyesight not caused by 
abnormalities in the eye. The most common form of amblyopia is caused by strabismus, in which the 
eyes are not aimed in a parallel direction but rather are turned inward (cross-eyed) or outward (wall- 
eyed). This condition leads to habitual suppression of vision from one of the eyes and sometimes to 
blindness in that eye or to alternation, in which the subject maintains vision in both eyes by using only 
one eye at a time. Cutting selected ocular muscles in kittens causes strabismus, and the kittens respond 
by alternation, preserving functional vision in both eyes. However, the number of cells in the cortex 
displaying binocular responses is greatly reduced. In humans with long-standing alternating strabismus, 
surgical repair making the eyes parallel again does not bring back a sense of depth. Permanent damage 
has been caused by the subtle condition of the images on the two retinas not coinciding. This may be 
explained by the Hebb model for associative learning, in which temporal association between inputs 
strengthens synaptic connections [Hebb, 1961]. 

Further evidence that successful development of the visual system depends on proper input comes 
from clinical experience with children who have cataracts at birth. Cataracts constitute a clouding of the 
lens, permitting light, but not images, to reach the retina. If surgery to remove the cataracts is delayed 
until the child is several years old, the child remains blind even though images are restored to the retina. 
Kittens and monkeys whose eyelids are sown shut during a critical period of early development stay blind 
even when the eyes are opened. Physiologic studies in these animals show very few cells responding in 
the visual cortex. Other experiments depriving more specific elements of an image, such as certain 
orientations or motion in a certain direction, yield a cortex without the corresponding cell type. 
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Color 

Cones, which dominate the fovea, can detect wavelengths between 400 and 700 nm. The population of 
cones in the retina can be divided into three categories, each containing a different pigment. This was 
established by direct microscopic illumination of the retina [Wald, 1974; Marks et al., 1964]. The pigments 
have a bandwidth on the order of 100 nm, with significant overlap, and with peak sensitivities at 560 nm 
(yellow-green), 530 nm (blue-green), and 430 nm (violet). These three cases are commonly known as 
red, green, and blue. Compared with the auditory system, whose array of cochlear sensors can discrim- 
inate thousands of different sonic frequencies, the visual system is relatively impoverished with only three 
frequency parameters. Instead, the retina expends most of its resolution on spatial information. Color 
vision is absent in many species, including cats, dogs, and some primates, as well as in most nocturnal 
animals, since cones are useless in low light. 

By having three types of cones at a given locality on the retina, a simplified spectrum can be sensed and 
represented by three independent variables, a concept known as trichromacy. This model was developed by 
Thomas Young and Hermann von Helmholtz in the 19th century before neurobiology existed and does 
quite well at explaining the retina [Young, 1802; Helmholtz, 1889]. The model is also the underlying basis 
for red-green-blue (RGB) video monitors and color television [Ennes, 1981]. Rods do not help in discrim- 
inating color, even though the pigment in rods does add a fourth independent sensitivity peak. 

Psychophysical experimentation yields a complex, redundant map between spectrum and perceived 
color, or hue, including not only the standard red, orange, yellow, green, and blue but hues such as pink, 
purple, brown, and olive green that are not themselves in the rainbow. Some of these may be achieved 
by introducing two more variables: saturation, which allows for mixing with white light, and intensity, 
which controls the level of color. Thus three variables are still involved: hue, saturation, and intensity. 

Another model for color vision was put forth in the 19th century by Ewald Hering [Hering, 1864]. This 
theory also adheres to the concept of trichromacy, espousing three independent variables. However, unlike 
the Young-Helmholtz model, these variables are signed; they can be positive, negative, or zero. The resulting 
three axes are red-green, yellow-blue, and black-white. The Hering model is supported by the physiologic 
evidence for the center/surround response, which allows for positive as well as negative information. In 
fact, two populations of cells, activated and suppressed along the red-green and yellow-blue axes, have 
been found in monkey LGN. Yellow is apparently detected by a combination of red and green cones. 

The Hering model explains, for example, the perception of the color brown, which results only when 
orange or yellow is surrounded by a brighter color. It also accounts for the phenomenon of color constancy, 
in which the perceived color of an object remains unchanged under differing ambient light conditions 
provided background colors are available for comparison. Research into color constancy was pioneered 
in the laboratory of Edwin Land [Land and McCann, 1971]. As David Hubel says, "We require color 
borders for color, just as we require luminance borders for black and white" [Hubel, 1988, p. 178]. As one 
might expect, when the corpus collousum is surgically severed, color constancy is absent across the midline. 

Color processing in VI is confined to small circular areas, known as blobs, in which double-opponent 
cells are found. They display a center/surround behavior based on the red-green and yellow-blue axes 
but lack orientation selectivity. The VI blobs were first identified by their uptake of certain enzymes, and 
only later was their role in color vision discovered [Livingstone and Hubel, 1984] . The blobs are especially 
prominent in layers 2 and 3, which receive input from the P cells of the LGN. 

Higher Cortical Centers 

How are the primitive elements of image processing so far discussed united into an understanding of the 
image? Beyond VI are many higher cortical centers for visual processing, at least 12 in the occipital lobe 
and others in the temporal and parietal lobes. Areas V2 receives axons from both the blob and interblob 
areas of VI and performs analytic functions such as filling in the missing segments of an edge. V2 contains 
three areas categorized by different kinds of stripes: thick stripes which process relative horizontal position 
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and stereopsis, thin stripes which process color without orientations, and pale stripes which extend the 
process of end-stopped orientation cells. 

Beyond V2, higher centers have been labeled V3, V4, V5, etc. Four parallel systems have been delineated 
[Zeki, 1992], each system responsible for a different attribute of vision, as shown in Fig. 4.1. This is 
obviously an oversimplification of a tremendously complex system. 

Corroborative clinical evidence supports this model. For example, lesions in V4 lead to achromatopsia, 
in which a patient can only see gray and cannot even recall colors. Conversely, a form of poisoning, 
carbon monoxide chromatopsia, results when the VI blobs and V2 thin stripes selectively survive exposure 
to carbon monoxide thanks to their rich vasculature, leaving the patient with a sense of color but not of 
shape. A lesion in V5 leads to akinetopsia, in which objects disappear. 

As depicted in Fig. 4.1, all visual information is processed through VI and V2, although discrete 
channels within these areas keep different types of information separate. A total lesion of VI results in 
the perception of total blindness. However, not all channels are shown in Fig. 4.1, and such a "totally 
blind" patient may perform better than randomly when forced to guess between colors or between motion 
in different directions. The patient with this condition, called blindsight, will deny being able to see 
anything [Weiskrantz, 1990]. 

Area VI preserves retinal topographic mapping and shows receptive fields, suggesting a piecewise 
analysis of the image, although a given area of VI receives sequential information from disparate areas 
of the visual environment as the eyes move. V2 and higher visual centers show progressively larger 
receptive fields and less defined topographic mapping but more specialized responses. In the extreme of 
specialization, neurobiologists joke about the "grandmother cell," which would respond only to a par- 
ticular face. No such cell has yet been found. However, cortical regions that respond to faces in general 
have been found in the temporal lobe. Rather than a "grandmother cell," it seems that face-selective 
neurons are members of ensembles for coding facts [Gross and Sergen, 1992]. 

Defining Terms 

Binocular convergence: The response of a single neuron to the same location in the visual field of each eye. 
Color constancy: The perception that the color of an object remains constant under different lighting 

conditions. Even though the spectrum reaching the eye from that object can be vastly different, 

other objects in the field of view are used to compare. 
Cytoarchitectonics: The organization of neuron types into layers as seen by various staining techniques 

under the microscope. Electrophysiologic responses of individual cells can be correlated with their 

individual layer. 
Magnification: The variation in amount of retinal area represented per unit area of VI from the fovea 

to the peripheral vision. Even though the fovea takes up an inordinate percentage of VI compared 

with the rest of the visual field, the scale of the cellular organization remains constant. Thus the 

image from the fovea is, in effect, magnified before processing. 
Receptive field: The area in the visual field that evokes a response in a neuron. Receptive fields may 

respond to specific stimuli such as illuminated bars or edges with particular directions of motion, etc. 
Stereopsis: The determination of distance to objects based on relative displacement on the two retinas 

because of parallax. 
Topographic mapping: The one-to-one correspondence between location on the retina and location 

within a structure in the brain. Topographic mapping further implies that contiguous areas on the 

retina map to contiguous areas in the particular brain structure. 
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The auditory system can be divided into two large subsystems, peripheral and central. The peripheral 
auditory system converts the condensations and rarefactions that produce sound into neural codes that 
are interpreted by the central auditory system as specific sound tokens that may affect behavior. 

The peripheral auditory system is subdivided into the external ear, the middle ear, and the inner ear 
(Fig. 5.1). The external ear collects sound energy as pressure waves which are converted to mechanical 
motion at the eardrum. This motion is transformed across the middle ear and transferred to the inner 
ear, where it is frequency analyzed and converted into neural codes that are carried by the eighth cranial 
nerve, or auditory nerve, to the central auditory system. 

Sound information, encoded as discharges in an array of thousands of auditory nerve fibers, is 
processed in nuclei that make up the central auditory system. The major centers include the cochlear 
nuclei (CN), the superior olivary complex (SOC), the nuclei of the lateral lemniscus (NLL), the inferior 
colliculi (IC), the medial geniculate body (MGB) of the thalamus, and the auditory cortex (AC). The CN, 
SOC, and NLL are brainstem nuclei; the IC is at the midbrain level; and the MGB and AC constitute the 
auditory thalamocortical system. 

While interesting data have been collected from groups other than mammals, this chapter will empha- 
size the mammalian auditory system. This chapter ignores the structure and function of the vestibular 
system. While a few specific references are included, most are general in order to provide a more 
introductory entry into topics. 

5.1 Physical and Psychological Variables 

Acoustics 

Sound is produced by time-varying motion of the particles in air. The motions can be defined by their 
pressure variations or by their volume velocities. Volume velocity is defined as the average particle velocity 
produced across a cross-sectional area and is the acoustic analogue of electric current. Pressure is the 
acoustic analogue of voltage. Acoustic intensity is the average rate of the flow of energy through a unit 
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Inner ear 




FIGURE 5.1 The peripheral auditory system showing the ear canal, tympanic membrane, middle ear and ossicles, 
and the inner ear consisting of the cochlea and semicircular canals of the vestibular system. Nerves communicating 
with the brain are also shown. 

area normal to the direction of the propagation of the sound wave. It is the product of the acoustic 
pressure and the volume velocity and is analogous to electric power. Acoustic impedance, the analogue 
of electrical impedance, is the complex ratio of acoustic pressure and volume velocity. Sound is often 
described in terms of either acoustic pressure or acoustic intensity [Kinsler and Frey, 1962]. 

The auditory system has a wide dynamic range, i.e., it responds to several decades of change in the 
magnitude of sound pressure. Because of this wide dynamic range, it is useful to describe the independent 
variables in terms of decibels, where acoustic intensity is described by dB = 10 log(Z/7 ), where 7 is the 
reference intensity, or equivalently for acoustic pressure, dB = 20 log(P/P ), where P is the reference 
pressure. 

Psychoacoustics 

Physical variables, such as frequency and intensity, may have correlated psychological variables, such as 
pitch and loudness. Relationships between acoustic and psychological variables, the subject of the field of 
psychoacoustics, are generally not linear and may be very complex, but measurements of human detection 
and discrimination can be made reliably. Humans without hearing loss detect tonal frequencies from 
20 Hz to 20 kHz. At 2 to 4 kHz their dynamic range, the span between threshold and pain, is approximately 
120 dB. The minimum threshold for sound occurs between 2 and 5 kHz and is about 20 uPa. At the low 
end of the auditory spectrum, threshold is 80 dB higher, while at the high end, it is 70 dB higher. Intensity 
differences of 1 dB can be detected, while frequency differences of 2 to 3 Hz can be detected at frequencies 
below about 3 kHz [Fay, 1988]. 



5.2 The Peripheral Auditory System 



The External Ear 

Ambient sounds are collected by the pinna, the visible portion of the external ear, and guided to the 
middle ear by the external auditory meatus, or ear canal. The pinna acquires sounds selectively due to its 
geometry and the sound shadowing effect produced by the head. In those species whose ears can be 
moved voluntarily through large angles, selective scanning of the auditory environment is possible. 

The ear canal serves as an acoustic waveguide that is open at one end and closed at the other. The 
open end at the pinna approximates a short circuit (large volume velocity and small pressure variation), 
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while that at the closed end is terminated by the tympanic membrane (eardrum). The tympanic membrane 
has a relatively high acoustic impedance compared with the characteristic impedance of the meatus and 
looks like an open circuit. Thus the ear canal can resonate at those frequencies for which its length is an 
odd number of quarter wavelengths. The first such frequency is at about 3 kHz in the human. The meatus 
is antiresonant for those frequencies for which its length is an integer number of half wavelengths. For 
a discussion of resonance and antiresonance in an acoustic waveguide, see a text on basic acoustics, e.g., 
Kinsler and Frey [1962]. 

The acoustic properties of the external ear produce differences between the sound pressure produced 
at the tympanic membrane and that at the opening of the ear canal. These differences are functions of 
frequency, with larger differences found at frequencies between 2 and 6 kHz than those below 2 kHz. 
These variations have an effect on the frequency selectivity of the overall auditory system. 

The Middle Ear 
Anatomy 

Tracing the acoustic signal, the boundaries of the middle ear include the tympanic membrane at the 
input and the oval window at the output. The middle ear bones, the ossicles, lie between. Pressure relief 
for the tympanic membrane is provided by the eustachian tube. The middle ear is an air-filled cavity. 

The Ossicles 

The three bones that transfer sound from the tympanic membrane to the oval window axe. called the 
malleus (hammer), incus (anvil), and stapes (stirrup). The acoustic impedance of the atmospheric source 
is much less than that of the aqueous medium of the load. The ratio is 3700 in an open medium, or 
36 dB [Kinsler and Frey, 1962]. The ossicles comprise an impedance transformer for sound, producing 
a mechanical advantage that allows the acoustic signal at the tympanic membrane to be transferred with 
low loss to the round window of the cochlea (inner ear). The air-based sound source produces an acoustic 
signal of low-pressure and high-volume velocity, while the mechanical properties of the inner ear demand 
a signal of high-pressure and low-volume velocity. 

The impedance transformation is produced in two ways: The area of the tympanic membrane is greater 
than that of the footplate of the stapes, and the lengths of the malleus and incus produce a lever whose 
length is greater on the side of the tympanic membrane than it is on the side of the oval window. In the 
human, the mechanical advantage is about 22:1 [Dobie and Rubel, 1989] and the impedance ratio of the 
transformer is 480, 27 dB, changing the mismatch from 3700:1 to about 8:1. 

This simplified discussion of the function of the ossicles holds at low frequencies, those below 2 kHz. 
First, the tympanic membrane does not behave as a piston at higher frequencies but can support modes 
of vibration. Second, the mass of the ossicles becomes significant. Third, the connections between the 
ossicles is not lossless, nor can the stiffness of these connections be ignored. Fourth, pressure variations 
in the middle ear cavity can change the stiffness of the tympanic membrane. Fifth, the cavity of the 
middle ear produces resonances at acoustic frequencies. 

Pressure Relief 

The eustachian tube is a bony channel that is lined with soft tissue. It extends from the middle ear to 
the nasopharynx and provides a means by which pressure can be equalized across the tympanic mem- 
brane. The function is clearly observed with changes in altitude or barometric pressure. A second function 
of the eustachian tube is to aerate the tissues of the middle ear. 

The Inner Ear 

The mammalian inner ear is a spiral structure, the cochlea (snail), consisting of three fluid-filled chambers, 
or scalae, the scala vestibuli, the scala media, and the scala tympani (Fig. 5.2). The stapes footplate 
introduces mechanical displacements into the scala vestibuli through the oval window at the base of the 
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FIGURE 5.2 Cross section of one turn of the cochlea showing the scala vestibuli, scala media, and scala tympani. 
Reissner's membrane separates the SM and SV, while the basilar membrane and organ of Corti separate the SM and ST. 

cochlea. At the other end of the spiral, the apex of the cochlea, the scala vestibuli and the scala tympani 
communicate by an opening, the helicotrema. Both are filled with an aqueous medium, the perilymph. 
The scala media spirals between them and is filled with endolymph, a medium that is high in K + and low 
in Na + . The scala media is separated from the scala vestibuli by Reissner's membrane, which is impermeable 
to ions, and the scala media is separated from the scala tympani by the basilar membrane (BM) and organ 
of Corti. The organ of Corti contains the hair cells that transduce acoustic signals into neural signals, the 
cells that support the hair cells, and the tectorial membrane to which the outer hair cells are attached. 
The BM provides the primary filter function of the inner ear and is permeable so that the cell bodies of 
the hair cells are bathed in perilymph. 

Fluid and tissue displacements travel from the footplate of the stapes along the cochlear spiral from 
base to apex. Pressure relief is provided for the incompressible fluids of the inner ear by the round window 
membrane; e.g., if a transient pressure increase at the stapes displaces its footplate inward, there will be 
a compensatory outward displacement of the round window membrane. 

The Basilar Membrane 

Physiology 

The BM supports the hair cells and their supporting cells (see Fig. 5.2). Sound decomposition into its 
frequency components is a major code of the BM. A transient sound, such as a click, initiates a traveling 
wave of displacement in the BM, and this motion has frequency-dependent characteristics which arise 
from properties of the membrane and its surrounding structures [Bekesy, I960]. The membrane's width 
varies as it traverses the cochlear duct: It is narrower at its basal end than at its apical end. It is stiffer at 
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the base than at the apex, with stiffness varying by about two orders of magnitude [Dobie and Rubel, 
1989]. The membrane is a distributed structure, which acts as a delay line, as suggested by the nature of 
the traveling wave [Lyon and Mead, 1989] . The combination of mechanical properties of the BM produces 
a structure that demonstrates a distance-dependent displacement when the ear is excited sinusoidally. 
The distance from the apex to the maximum displacement is logarithmically related to the frequency of 
a sinusoidal tone [LePage, 1991]. 

Tuning is quite sharp for sinusoidal signals. The slope of the tuning curve is much greater at the high- 
frequency edge than at the low-frequency edge, with slopes of more than 100 dB per octave at the high 
edge and about half that at the low edge [Lyon and Mead, 1989]. The filter is sharp, with a 10-dB 
bandwidth of 10 to 25% of the center frequency. 

The auditory system includes both passive and active properties. The outer hair cells (see below) receive 
efferent output from the brain and actively modify the characteristics of the auditory system. The result 
is to produce a "cochlear amplifier," which sharpens the tuning of the BM [Lyon and Mead, 1989], as 
well as adding nonlinear properties to the system [Geisler, 1992; Cooper and Rhode, 1992], along with 
otoacoustic emissions [LePage, 1991]. 

The Organ of Corti 

The organ of Corti is attached to the BM on the side of the aqueous fluid of the scala media. It is 
comprised of the supporting cells for the hair cells, the hair cells themselves, and the tectorial membrane. 
The cilia of the inner hair cells (IHCs) do not contact the tectorial membrane, while those of the outer 
hair cells (OHCs) do. Both IHCs and OHCs have precise patterns of stereocilia at one end which are held 
within the tectorial plate next to the overlying tectorial membrane. The IHCs synapse with spiral ganglion 
cells, the afferent neurons, while the OHCs synapse with efferent neurons. Both IHCs and OHCs are 
found along the length of the organ of Corti. The IHCs are found in a single line, numbering between 
about 3000 and 4000 in human. There are three lines of OHCs, numbering about 12,000 in human 
[Nadol, 1988]. 

Inner Hair Cells 

The IHCs' stereocilia are of graded, decreasing length from one side of the cell where a kinocilium is 
positioned early in ontogeny. If the cilia are deflected in a direction toward this position, membrane 
channels are further opened to allow potassium to enter and depolarize the cell [Hudspeth, 1987]. 
Displacement in the other direction reduces channel opening and produces a relative hyperpolarization 
[Hudspeth and Corey, 1977]. These changes in intracellular potential modulate transmitter release at the 
base of the IHCs. 

The IHCs are not attached to the tectorial membrane, so their response to motion of the membrane 
is proportional to the velocity of displacement rather than to displacement itself, since the cilia of the 
hair cells are bathed in endolymph. When the membrane vibrates selectively in response to a pure tone, 
the stereocilia are bent atop a small number of hair cells, which depolarize in response to the mechanical 
event. Thus, the tonotopic organization of the BM is transferred to the hair cells and to the rest of the 
auditory system. The auditory system is organized tonotopically, i.e., in order of frequency, because the 
frequency ordering of the cochlea is mapped through successive levels of the system. While this organi- 
zation is preserved throughout the system, it is much more complex than a huge set of finely tuned filters. 

Hair cells in some species exhibit frequency tuning when isolated [Crawford and Fettiplace, 1985], 
but mammalian hair cells exhibit no tuning characteristics. The tuning of the mammalian auditory system 
depends on the mechanical characteristics of the BM as modified by the activity of the OHCs. 

Outer Hair Cells 

The OHCs have cilia that are attached to the tectorial membrane. Since their innervation is overwhelm- 
ingly efferent, they do not transfer information to the brain but are modulated in their mechanical action 
by the brain. There are several lines of evidence that lead to the conclusion that the OHCs play an active 
role in the processes of the inner ear. First, OHCs change their length in response to neurotransmitters 
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[Dobie and Rubel, 1989]. Second, observation of the Henson's cells, passive cells that are intimately 
connected to OHCs, shows that spontaneous vibrations are produced by the Henson's cells in mammals 
and likely in the OHCs as well. These vibrations exhibit spectral peaks that are appropriate in frequency 
to their locations on the BM [Khanna et al., 1993]. Third, action of the OHCs as amplifiers leads to 
spontaneous otoacoustic emissions [Kim, 1984; Lyon and Mead, 1989] and to changes in the response 
of the auditory system [Lyon and Mead, 1989; Geisler, 1992]. Fourth, AC excitation of the OHCs of 
mammals produces changes in length [Cooke, 1993]. 

The OHCs appear to affect the response of the auditory system in several ways. They enhance the 
tuning characteristics of the system to sinusoidal stimuli, decreasing thresholds and narrowing the filter's 
bandwidth [Dobie and Rubel, 1989] . They likely influence the damping of the BM dynamically by actively 
changing its stiffness. 

Spiral Ganglion Cells and the Auditory Nerve 
Anatomy 

The auditory nerve of the human contains about 30,000 fibers consisting of myelinated proximal processes 
of spiral ganglion cells (SGCs). The somas of spiral ganglion cells (SGCs) lie in Rosenthal's canal, which 
spirals medial to the three scalae of the cochlea. Most (93%) are large, heavily myelinated, type I SGCs 
whose distal processes synapse on IHCs. The rest are smaller type II SGCs, which are more lightly 
myelinated. Each IHC has, on average, a number of fibers that synapse with it, 8 in the human and 18 
in the cat, although some fibers contact more than one IHC. In contrast, each type II SGC contacts 
OHCs at a rate of about 10 to 60 cells per fiber. 

The auditory nerve collects in the center of the cochlea, its modiolus, as SGC fibers join it. Low- 
frequency fibers from the apex join first, and successively higher frequency fibers come to lie concentrically 
on the outer layers of the nerve in a spiraling fashion before it exits the modiolus to enter the internal 
auditory meatus of the temporal bone. A precise tonotopic organization is retrained in the concentrically 
wrapped fibers. 

Physiology 

Discharge spike patterns from neurons can be recorded extracellularly while repeating tone bursts are 
presented. A threshold level can be identified from the resulting rate-level function (RLF). In the absence 
of sound and at lower, subthreshold levels, a spontaneous rate of discharge is measured. In the nerve this 
ranges from 50 spikes per second to less than 10. As intensity is raised, the threshold level is encountered, 
where the evoked discharge rate significantly exceeds the spontaneous discharge rate. The plot of threshold 
levels as a function of frequency is the neuron's threshold tuning curve. The tuning curves for axons in 
the auditory nerve show a minimal threshold (maximum sensitivity) at a characteristic frequency (CF) 
with a narrow frequency range of responding for slightly more intense sounds. At high intensities, a large 
range of frequencies elicits spike discharges. RLFs for nerve fibers are monotonic (i.e., spike rate increases 
with stimulus intensity), and although a saturation rate is usually approached at high levels, the spike 
rate does not decline. Mechanical tuning curves for the BM and neural threshold tuning curves are highly 
similar (Fig. 5.3). Mechanical frequency analysis in the cochlea and the orderly projection of fibers 
through the nerve lead to correspondingly orderly maps for CFs in the nerve and the nuclei of the central 
pathways. 

Sachs and Young [ 1979] found that the frequency content of lower intensity vowel sounds is represented 
as corresponding tonotopic rate peaks in nerve activity, but for higher intensities this rate code is lost as 
fibers tend toward equal discharge rates. At high intensities spike synchrony to frequencies near CF 
continue to signal the relative spectral content of vowels, a temporal code. These results hold for high- 
spontaneous-rate fibers (over 15 spikes per second), which are numerous. Less common, low-spontaneous- 
rate fibers (less than 15 spikes per second) appear to maintain the rate code at higher intensities, suggesting 
different coding roles for these two fiber populations. 
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FIGURE 5.3 Mechanical and neural turning curves from the BM and auditory nerve, respectively. The two mechan- 
ical curves show the intensity and frequency combinations for tones required to obtain a criterion displacement or 
velocity, while the neural curve shows the combinations needed to increase neural discharge rates a small amount 
over spontaneous rate. 



5.3 The Central Auditory System 



Overview 

In ascending paths, obligatory synapses occur at the CN, IC, MGB, and AC, but a large number of 
alternative paths exist with ascending and descending internuclear paths and the shorter intranuclear 
connections between neighboring neurons and subdivisions within a major nuclear group. Each of the 
centers listed contains subpopulations of neurons that differ in aspects of their morphologies, discharge 
patterns to sounds, segregation in the nucleus, biochemistry, and synaptic connectivities. The arrange- 
ment of the major ascending auditory pathways is schematically illustrated in Fig. 5.4. For references, 
see Altschuler et al. [1991]. 

Neural Basis of Processing 
The Cochlear Nuclei 

Anatomy of the Cochlear Nuclei. The CN can be subdivided into at least three distinct regions, the 
anteroventral CN (AVCN), the posteroventral CN (PVCN), and the dorsal CN (DCN). Each subdivision 
has one or more distinctive neuron types and unique intra- and internuclear connections. The axon from 
each type I SGC in the nerve branches to reach each of the three divisions in an orderly manner so that 
tonotopic organization is maintained. Neurons with common morphologic classifications are found in 
all three divisions, especially granule cells, which tend to receive connections from type II spiral ganglion 
cells. 
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FIGURE 5.4 A schematic of major connections in the auditory brainstem discussed in the text. All structures and 
connections are bilaterally symmetrical, but connections have been shown on one side only for clarity. No cell types 
are indicated, but the subdivisions of origin are suggested in the CN. Note that the LSO and MSO receive inputs 
from both CN. 

Morphologic classification of neurons based on the shapes of their dendritic trees and somas show 
that the anterior part of the AVCN contains many spherical bushy cells, while in its posterior part both 
globular bushy cells and spherical bushy cells are found. Spherical bushy cells receive input from one type 
I ganglion cell through a large synapse formation containing end bulbs of Held, while the globular cells 
may receive inputs from a few afferent fibers. These endings cover a large part of the soma surface and 
parts of the proximal dendrite, especially in spherical bushy cells, and they have rounded vesicles pre- 
synaptically, indicating excitatory input to the bushy cells, while other synaptic endings of noncochlear 
origins tend to have flattened vesicles associated with inhibitory inputs. Stellate cells are found throughout 
the AVCN, as well as in the lower layers of the DCN. The AVCN is tonotopically organized, and neurons 
having similar CFs have been observed to lie in layers or laminae [Bourk et al., 1 98 1 ] . Isofrequency laminae 
also have been indicated in other auditory nuclei. 

The predominant neuron in the PVCN is the octopus cell, a label arising from its distinctive shape with 
asymmetrically placed dendrites. Octopus cells receive cochlear input from type I SGCs on their somas 
and proximal dendrites. Their dendrites cross the incoming array of cochlear fibers, and these often 
branch to follow the dendrite toward the soma. 

The DCN is structurally the most intricate of the CN. In many species, four or five layers are noticeable, 
giving it a "cortical" structure, and its local circuitry has been compared with that of the cerebellum. 
Fusiform cells are the most common morphologic type. Their somas lie in the deeper layers of the DCN, 
and their dendrites extend toward the surface of the nucleus and receive primarily noncochlear inputs. 
Cochlear fibers pass deeply in the DCN and turn toward the surface to innervate fusiform and giant cells 
that lie in the deepest layer of the DCN. The axons of fusiform and giant cells project out of the DCN 
to the contralateral IC. 

Intracellular recording in slice preparation is beginning to identify the membrane characteristics of 
neuronal types in the CN. The diversity of neuronal morphologic types, their participation in local 
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FIGURE 5.5 Peristimulus time histogram patterns obtained in the CN and nerve. Repeated presentations of a tone 
burst at CF are used to obtain these estimates of discharge rate during the stimulus. (Adapted from Young, 1984.) 

circuits, and the emerging knowledge of their membrane biophysics are motivating detailed compart- 
mental modeling [Arle and Kim, 1991]. 

Spike Discharge Patterns. Auditory nerve fibers and neurons in central nuclei may discharge only 
a few times during a brief tone burst, but if a histogram of spike events is synchronized to the onset of 
the tone burst, a peristimulus time histogram (PSTH) is obtained that is more representative of the neuron's 
response than any single one. The PSTH may be expressed in terms of spike counts, spike probability, 
or spike rate as a function of time, but all these retain the underlying temporal pattern of the response. 
PSTHs taken at the CF for a tone burst intensity roughly 40 dB above threshold have shapes that are 
distinctive to different nuclear subdivisions and even different morphologic types. They have been used 
for functionally classifying auditory neurons. 

Figure 5.5 illustrates some of the major pattern types obtained from the auditory nerve and CN. 
Auditory nerve fibers and spherical bushy cells in AVCN have primary-like patterns in their PSTHs, an 
elevated spike rate after tone onset, falling to a slowly adapting level until the tone burst ends. Globular 
bushy cells may have primary-like, pri-notch (primary-like with a brief notch after onset), or chopper 
patterns. Stellate cells have non-primary-like patterns. Onset response patterns, one or a few brief peaks 
of discharge at onset with little or no discharges afterwards, are observed in the PVCN from octopus 
cells. Chopper, pauser, and buildup patterns are observed in many cells of the DCN. For most neurons 
of the CN, these patterns are not necessarily stable over different stimulus intensities; a primary-like 
pattern may change to a pauser pattern and then to a chopper pattern as intensity is raised [Young, 1984] . 

Functional classification also has been based on the response map, a plot of a neuron's spike discharge 
rate as a function of tonal frequency and intensity. Fibers and neurons with primary-like PSTHs generally 
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have response maps with only an excitatory region of elevated rate. The lower edges of this region 
approximate the threshold tuning curve. Octopus cells often have very broad tuning curves and extended 
response maps, as suggested by their frequency-spanning dendritic trees. More complex response maps 
are observed for some neurons, such as those in the DCN. Inhibitory regions alone, a frequency-intensity 
area of suppressed spontaneous discharge rates, or combinations of excitatory regions and inhibitory 
regions have been observed. Some neurons are excited only within islands of frequency-intensity com- 
binations, demonstrating a CF but having no response to high-intensity sounds. In these cases, an RLF 
at CF would be nonmonotonic; i.e., spike rate decreases as the level is raised. Response maps in the DCN 
containing both excitatory and inhibitory regions have been shown to arise from a convergence of inputs 
from neurons with only excitatory or inhibitory regions in their maps [Young and Voigt, 1981]. 

Superior Olivary Complex (SOC) 

The SOC contains 10 or more subdivisions in some species. It is the first site at which connections from 
the two ears converge and is therefore a center for binaural processing that underlies sound localization. 
There are large differences in the subdivisions between mammalian groups such as bats, primates, 
cetaceans, and burrowing rodents that utilize vastly different binaural cues. Binaural cues to the locus of 
sounds include interaural level differences (ILDs), interaural time differences (ITDs), and detailed spectral 
differences for multispectral sounds due to head and pinna filtering characteristics. 

Neurons in the medial superior olive (MSO) and lateral superior olive (LSO) tend to process ITDs and 
ILDs, respectively. A neuron in the MSO receives projections from spherical bushy cells of the CN from 
both sides and thereby the precise timing and tuning cues of nerve fibers passed through the large synapses 
mentioned. The time accuracy of the pathways and the comparison precision of MSO neurons permit 
the discrimination of changes in ITD of a few tens of microseconds. MSO neurons project to the ipsilateral 
IC through the lateral lemniscus. Globular bushy cells of the CN project to the medial nucleus of the 
trapezoid body (MNTB) on the contralateral side, where they synapse on one and only one neuron in 
a large, excitatory synapse, the calyx of Held. MNTB neurons send inhibitory projections to neurons of 
the LSO on the same side, which also receives excitatory input from spherical bushy cells from the AVCN 
on the same side. Sounds reaching the ipsilateral side will excite discharges from an LSO neuron, while 
those reaching the contralateral side will inhibit its discharge. The relative balance of excitation and 
inhibition is a function of ILD over part of its physiological range, leading to this cue being encoded in 
discharge rate. 

One of the subdivisions of the SOC, the dorsomedial periolivary nucleus (DMPO), is a source of efferent 
fibers that reach the contralateral cochlea in the crossed olivocochlear bundle (COCB). Neurons of the 
DMPO receive inputs from collaterals of globular bushy cell axons of the contralateral ACVN that project 
to the MNTB and from octopus cells on both sides. The functional role of the feedback from the DMPO 
to the cochlea is not well understood. 

Nuclei of the Lateral Lemniscus (NLL) 

The lateral lemniscus consists of ascending axons from the CN and LSO. The NLL lie within this tract, 
and some, such as the dorsal nucleus (DNLL), are known to process binaural information, but less is 
known about these nuclei as a group than others, partially due to their relative inaccessibility. 

Inferior Colliculi (IC) 

The IC are paired structures lying on the dorsal surface of the rostral brainstem. Each colliculus has a 
large central nucleus (ICC), a surface cortex, and paracentral nuclei. Each colliculus receives afferents 
from a number of lower brainstem nuclei, projects to the MGB through the brachium, and communicates 
with the other colliculus through a commissure. The ICC is the major division and has distinctive laminae 
in much of its volume. The laminae are formed from disk-shaped cells and afferent fibers. The disk-shaped 
cells, which make up about 80% of the ICC's neuronal population, have flattened dendritic fields that 
lie in the laminar plane. The terminal endings of afferents form fibrous layers between laminae. The 
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remaining neurons in the ICC are stellate cells that have dendritic trees spanning laminae. Axons from 
these two cell types make up much of the ascending ICC output. 

Tonotropic organization is preserved in the ICC's laminae, each corresponding to an isofrequency 
lamina. Both monaural and binaural information converges at the IC through direct projections from 
the CN and from the SOC and NLL. Crossed CN inputs and those from the ipsilateral MSO are excitatory. 
Inhibitory synapses in the ICC arise from the DNLL, mediated by gamma-amino-butyric acid (GABA), 
and from the ipsilateral LSO, mediated by glycine. 

These connections provide an extensive base for identifying sound direction at this midbrain level, 
but due to their convergence, it is difficult to determine what binaural processing occurs at the IC as 
opposed to being passed from the SOC and NLL. Many neurons in the IC respond differently depending 
on binaural parameters. Varying ILDs for clicks or high-frequency tones often indicates that contralateral 
sound is excitatory. Ipsilateral sound may have no effect on responses to contralateral sound, classifying 
the cell as EO, or it may inhibit responses, in which case the neuron is classified as EI, or maximal excitation 
may occur for sound at both ears, classifying the neuron as EE. Neurons responding to lower frequencies 
are influenced by ITDs, specifically the phase difference between sinusoids at the ears. Spatial receptive 
fields for sounds are not well documented in the mammalian IC, but barn owls, who use the sounds of 
prey for hunting at night, have sound-based spatial maps in the homologous structure. The superior 
colliculus, situated just rostral to the IC, has spatial auditory receptive field maps for mammals and owl. 

Auditory Thalamocortical System 

Medial Geniculate Body (MGB). The MGB and AC form the auditory thalamocortical system. As with 
other sensory systems, extensive projections to and from the cortical region exist in this system. The 
MGB has three divisions, the ventral, dorsal, and medial. The ventral division is the largest and has the 
most precise tonotopic organization. Almost all its input is from the ipsilateral ICC through the brachium 
of the IC. Its large bushy cells have dendrites oriented so as to lie in isofrequency layers, and the axons 
of these neurons project to the AC, terminating in layers III and IV. 

Auditory Cortex. The auditory cortex (AC) consists of areas of the cerebral cortex that respond to 
sounds. In mammals, the AC is bilateral and has a primary area with surrounding secondary areas. In 
nonprimates, the AC is on the lateral surface of the cortex, but in most primates, it lies within the lateral 
fissure on the superior surface of the temporal lobe. Figure 5.6 reveals the area of the temporal lobe 
involved with auditory function in humans. Tonotopic mapping is usually evident in these areas as 
isofrequency surface lines. The primary AC responds most strongly and quickly to sounds. In echo- 
locating bats, the cortex has a large tonotopic area devoted to the frequency region of its emitted cries 
and cues related to its frequency modulation and returned Doppler shift [Aitkin, 1990]. 

The cytoarchitecture of the primary AC shows layers I (surface) through VI (next to white matter), 
with the largest neurons in layers V and VI. Columns with widths of 50 to 75 urn are evident from 
dendritic organization in layers III and IV, with fibers lying between the columns. A description of cell 
types is beyond this treatment. 

Discharge patterns in the AC for sound stimuli are mainly of the onset type. Continuous stimuli often 
evoke later spikes, after the onset discharge, in unanesthetized animals. About half the neurons in the 
primary AC have monotonic RLFs, but the proportion of nonmonotonic RLFs in secondary areas is 
much higher. A number of studies have used complex sounds to study cortical responses. Neural responses 
to species-specific cries, speech, and other important sounds have proven to be labile and to a great extent 
dependent on the arousal level and behavioral set of the animal. 

Cortical lesions in humans rarely produce deafness, although deficits in speech comprehension and 
generation may exist. Audiometric tests will generally indicate that sensitivity to tonal stimuli is retained. 
It has been known for some time that left-hemisphere lesions in the temporal region can disrupt 
comprehension (Wernicke's area) and in the region anterior to the precentral gyrus (Broca's area) can 
interfere with speech production. It is difficult to separate the effects of these areas because speech 
comprehension provides vital feedback for producing correct speech. 
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FIGURE 5.6 A view of the human cerebral cortex showing the auditory cortex on the superior surface of the left 
temporal lobe after removal of the overlying parietal cortex. 

5.4 Pathologies 

Hearing loss results from conductive and neural deficits. Conductive hearing loss due to attenuation in 
the outer or middle ear often can be alleviated by amplification provided by hearing aids and may be 
subject to surgical correction. Sensorineural loss due to the absence of IHCs results from genetic deficits, 
biochemical insult, exposure to intense sound, or aging (presbycusis). For some cases of sensorineural 
loss, partial hearing function can be restored with the cochlear prosthesis, electrical stimulation of 
remaining SGCs using small arrays of electrodes inserted into the scala tympani [Miller and Spelman, 
1990]. In a few patients having no auditory nerve, direct electrical stimulation of the CN has been used 
experimentally to provide auditory sensation. Lesions of the nerve and central structures occur due to 
trauma, tumor growth, and vascular accidents. These may be subject to surgical intervention to prevent 
further damage and promote functional recovery. 



5.5 Models of Auditory Function 



Hearing mechanisms have been modeled for many years at a phenomenologic level using psychophysical 
data. As physiologic and anatomic observations have provided detailed parameters for peripheral and 
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central processing, models of auditory encoding and processing have become more quantitative and 
physically based. Compartmental models of single neurons, especially SGCs and neurons in the CN, 
having accurate morphometric geometries, electrical properties, membrane biophysics, and local cir- 
cuitry are seeing increasing use. 
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The primary function of the gastrointestinal system (Fig. 6.1) is to supply the body with nutrients and 
water. The ingested food is moved along the alimentary canal at an appropriate rate for digestion, 
absorption, storage, and expulsion. To fulfill the various requirements of the system, each organ has 
adapted one or more functions. The esophagus acts as a conduit for the passage of food into the stomach 
for trituration and mixing. The ingested food is then emptied into the small intestine, which plays a 
major role in the digestion and absorption processes. The chyme is mixed thoroughly with secretions 
and it is propelled distally (1) to allow further gastric emptying, (2) to allow for uniform exposure to the 
absorptive mucosal surface of the small intestine, and (3) to empty into the colon. The vigor of mixing 
and the rate of propulsion depend on the required contact time of chyme with enzymes and the mucosal 
surface for efficient performance of digestion and absorption. The colon absorbs water and electrolytes 
from the chyme, concentrating and collecting waste products that are expelled from the system at 
appropriate times. All of these motor functions are performed by contractions of the muscle layers in 
the gastrointestinal wall. 

6.2 Gastrointestinal Electrical Oscillations 

Gastrointestinal motility is governed by myogenic, neural, and chemical control systems (Fig. 6.2). The 
myogenic control system is manifest by periodic depolarizations of the smooth muscle cells, which 
constitute autonomous electrical oscillations called the electrical control activity (ECA) or slow waves 
[Daniel and Chapman, 1963] . The properties of this myogenic system and its electrical oscillations dictate 
to a large extent the contraction patterns in the stomach, small intestine, and colon [Szurszewski, 1987]. 
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FIGURE 6.1 The gastrointestinal tract. 
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© 2000 by CRC Press LLC 



-ERA 



Electrical activity 




Voltage 



Contractile activity 



Force 



Time 



FIGURE 6.3 The relationships between ECA, ERA and muscular contractions. The ERA occurs in the depolarized 
phase of the ECA. Muscular contractions are associated with the ERA, and their amplitude depends on the frequency 
of response potentials within an ERA burst. 

The ECA controls the contractile excitability of smooth muscle cells since the cells may contract only 
when depolarization of the membrane voltage exceeds an excitation threshold. The normal spontaneous 
amplitude of ECA depolarization does not exceed this excitation threshold except when neural or chemical 
excitation is present. The myogenic system affects the frequency, direction, and velocity of the contrac- 
tions. It also affects the coordination or lack of coordination between adjacent segments of the gut wall. 
Hence, the electrical activities in the gut wall provide an electrical basis for gastrointestinal motility. 

In the distal stomach, small intestine, and colon, there are intermittent bursts of rapid electrical 
oscillations, called the electrical response activity (ERA) or spike bursts. The ERA occurs during the 
depolarization plateaus of the ECA if a cholinergic stimulus is present, and it is associated with muscular 
contractions (Fig. 6.3). Thus, neural and chemical control systems determine whether contractions will 
occur or not, but when contractions are occurring, the myogenic control system (Fig. 6.4) determines 
the spatial and temporal patterns of contractions. 

There is also a cyclical pattern of of distally propagating ERA that appears in the small intestine during 
the fasted state [Szurszewski, 1969], called the Migrating Motility Complex (MMC). This pattern consists 
of four phases [Code and Marlett, 1975]: Phase I has little or no ERA, phase II consists of irregular ERA 
bursts, phase III consists of intense repetitive ERA bursts where there is an ERA burst on each ECA cycle, 
and phase IV consists of irregular ERA bursts but is usually much shorter than phase II and may not be 
always present. The initiation and propagation of the MMC is controlled by enteric cholinergic neurons 
in the intestinal wall (Fig. 6.2). The propagation of the MMC may be modulated by inputs from extrinsic 
nerves or circulating hormones [Sarna et al., 1981]. The MMC keeps the small intestine clean of residual 
food, debris, and desquamated cells. 



6.3 A Historical Perspective 



Minute Rhythms 

Alvarez and Mahoney [1922] reported the presence of a rhythmic electrical activity (which they called 
"action currents") in the smooth muscle layers of the stomach, small intestine, and colon. Their data 
was acquired from cat (stomach, small intestine), dog (stomach, small intestine, colon), and rabbit (small 
intestine, colon). They also demonstrated the existence of frequency gradients in excised stomach and 
bowel. Puestow [1933] confirmed the presence of a rhythmic electrical activity (which he called "waves 
of altered electrical potential") and a frequency gradient in isolated canine small intestinal segments. He 
also demonstrated the presence of an electrical spiking activity (associated with muscular contractions) 
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FIGURE 6.4 The Gastrointestinal ECA and ERA, recorded in a conscious dog from electrode sets implanted 
subserosally on stomach (S), duodenum (D), jejunum (J), proximal ascending colon (A), distal ascending colon 
(AC), transverse colon (TC), and descending colon (DC), respectively. Each trace is of 2 min duration. 

superimposed on the rhythmic electrical activity. He implied that the rhythmic electrical activity persisted 
at all times, whereas the electrical spike activity was of an intermittent nature. Bozler [1938, 1939, 1941] 
confirmed the occurrence of an electrical spiking activity associated with muscular contractions both 
in vitro in isolated longitudinal muscle strips from guinea pig (colon, small intestine) and rabbit (small 
intestine), and in situ in exposed loops of small intestine of anesthetized cat, dog, and rabbit as well as 
in cat stomach. He also suggested that the strength of a spontaneous muscular contraction is proportional 
to the frequency and duration of the spikes associated with it. 

The presence of two types of electrical activity in the smooth muscle layers of the gastrointestinal tract 
in several species had been established [Milton and Smith, 1956; Bulbring et al., 1958; Burnstock et al., 
1963; Daniel and Chapman, 1963; Bass, 1965; Gillespie, 1962; Duthie, 1974; Christensen, 1975; Daniel, 
1975; Sarna, 1975a]. The autonomous electrical rhythmic activity is an omnipresent myogenic activity 
[Burnstock et al., 1963] whose function is to control the appearance in time and space of the electrical 
spiking activity (an intermittent activity associated with muscular contractions) when neural and chem- 
ical factors are appropriate [Daniel and Chapman, 1963] . Neural and chemical factors determine whether 
or not contractions will occur, but when contractions are occurring, the myogenic control system 
determines the spatial and temporal patterns of contractions. 

Isolation of a distal segment of canine small intestine from a proximal segment (using surgical 
transection or clamping) had been reported to produce a decrease in the frequency of both the rhythmic 
muscular contractions [Douglas, 1949; Milton and Smith, 1956] and the electrical rhythmic activity 
[Milton and Smith, 1956] of the distal segment, suggesting frequency entrainment or pulling of the distal 
segment by the proximal one. It was demonstrated [Milton and Smith, 1956] that the repetition of the 
electrical spiking activity changed in the same manner as that of the electrical rhythmic activity, thus 
confirming a one-to-one temporal relationship between the frequency of the electrical rhythmic activity, 
the repetition rate of the electrical spiking activity, and the frequency of the muscular contractions (when 
all are present at any one site). Nelson and Becker [1968] suggested that the electrical rhythmic activity 
of the small intestine behaves like a system of coupled relaxation oscillators. They used two forward 
coupled relaxation oscillators, having different intrinsic frequencies, to demonstrate frequency entrain- 
ment of the two coupled oscillators. Uncoupling of the two oscillators caused a decrease in the frequency 
of the distal oscillator simulating the effect of transection of the canine small intestine. 
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The electrical rhythmic activity in canine stomach [Sarna et al., 1972], canine small intestine [Nelson 
and Becker, 1968; Diamant et al., 1970; Sarna et al., 1971], human small intestine [Robertson-Dunn and 
Linkens, 1974], human colon [Bardakjian and Sarna, 1980], and human rectosigmoid [Linkens etal., 
1976] has been modeled by populations of coupled nonlinear oscillators. The interaction between coupled 
nonlinear oscillators is governed by both intrinsic oscillator properties and coupling mechanisms. 

Hour Rhythms 

The existence of periodic gastric activity in the fasted state in both dog [Morat, 1882] and man [Morat, 
1893] has been reported. The occurrence of a periodic pattern of motor activity, comprising bursts of 
contractions alternating with "intervals of repose", in the gastrointestinal tracts of fasted animals was 
noted early in the twentieth century by Boldireff [1905]. He observed that (1) the bursts recurred with 
a periodicity of about 1.5 to 2.5 h, (2) the amplitude of the gastric contractions during the bursts were 
larger than those seen postprandially, (3) the small bowel was also involved, and (4) with longer fasting 
periods, the bursts occurred less frequently and had a shorter duration. Periodic bursts of activity were 
also observed in (1) the lower esphageal sphincter [Cannon and Washburn, 1912], and (2) the pylorus 
[Wheelon and Thomas, 1921]. Further investigation of the fasting contractile activity in the upper small 
intestine was undertaken in the early 1920s with particular emphasis on the coordination between the 
stomach and duodenum [Wheelon and Thomas, 1922; Alvarez and Mahoney, 1923]. More recently, 
evidence was obtained [Itoh et al., 1978] that the cyclical activity in the lower esophageal sphincter noted 
by Cannon and Washburn [1912] was also coordinated with that of the stomach and small intestine. 

With the use of implanted strain gauges, it was possible to observe contractile activity over long periods 
of time and it was demonstrated that the cyclical fasting pattern in the duodenum was altered by feeding 
[Jacoby etal., 1963]. The types of contractions observed during fasting and feeding were divided into 
four groups [Reinke et al., 1967; Carlson et al., 1972]. Three types of contractile patterns were observed 
in fasted animals: (1) quiescent interval, (2) a shorter interval of increasing activity, and (3) an interval 
of maximal activity. The fourth type was in fed animals and it consisted of randomly occurring contrac- 
tions of varying amplitudes. With the use of implanted electrodes in the small intestine of fasted dogs, 
Szurszewski [1969] demonstrated that the cyclical appearance of electrical spiking activity at each elec- 
trode site was due to the migration of the cyclical pattern of quiescence, increasing activity and maximal 
electrical activity down the small intestine from the duodenum to the terminal ileum. He called this 
electrical pattern the migrating myoelectric complex (MMC). Grivel and Ruckebusch [1972] demonstrated 
that the mechanical correlate of this electrical pattern, which they called the migrating motor complex, 
occurs in other species such as sheep and rabbits. They also observed that the velocity of propagation of 
the maximal contractile activity was proportional to the length of the small intestine. Code and Marlett 
[1975] observed the electrical correlate of the cyclical activity in dog stomach that was reported by Morat 
[1882; 1893], and they demonstrated that the stomach MMC was coordinated with the duodenal MMC. 

The MMC pattern has been demonstrated in other mammalian species [Ruckebusch and Fioramonti, 
1975; Ruckebusch and Bueno, 1976], including humans. Bursts of distally propagating contractions has 
been noted in the gastrointestinal tract of man [Beck et al., 1965], and their cyclical nature was reported 
by Stanciu and Bennet [1975]. The MMC has been described in both normal volunteers [Vantrappen 
etal., 1977; Fleckenstein, 1978; Thompson etal, 1980; Kerlin and Phillips, 1982; Rees etal, 1982] and 
in patients [Vantrappen et al. 1977; Thompson et al., 1982; Summers et al., 1982]. 

Terminology 

A nomenclature to describe the gastrointestinal electrical activities has been proposed to describe the 
minute rhythm [Sarna, 1975b] and the hour rhythm [Carlson et al., 1972; Code and Marlett, 1975]. 

Control cycle is one depolarization and repolarization of the transmembrane voltage. Control wave {or 
slow wave) is the continuing rhythmic electrical activity recorded at any one site. It was assumed to be 
generated by the smooth muscle cells behaving like a relaxation oscillator at that site. However, recent 
evidence [Hara et al., 1986; Suzuki et al. 1986; Barajas-Lopez et al., 1989; Serio et al., 1991] indicates that 
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it is generated by a system of interstitial cells of Cajal (ICC) and smooth muscle cells at that site. Electrical 
Control Activity (ECA) is the totality of the control waves recorded at one or several sites. Response 
Potentials (or spikes) are the rapid oscillations of transmembrane voltage in the depolarized state of 
smooth muscle cells. They are associated with muscular contraction and their occurrence is assumed to 
be in response to a control cycle when acetylcholine is present. Electrical Response Activity (ERA) is the 
totality of the groups of response potentials at one or several sites. 

Migrating Motility Complex (MMC) is the entire cycle which is composed of four phases. Initially, the 
electrical and mechanical patterns were referred to as the migrating myoelectric complex and the migrat- 
ing motor complex, respectively. Phase I is the interval during which fewer than 5% of ECA have associated 
ERA, and no or very few contractions are present. Phase II is the interval when 5 to 95% of the ECA has 
associated ERA, and intermittent contractions are present. Phase III is the interval when more than 95% 
of ECA have associated ERA, and large cyclical contractions are present. Phase IV is a short and waning 
interval of intermittent ERA and contractions. Phases II and IV are not always present and are difficult 
to characterize, whereas phases I and III are always present. MMC Cycle Time is the interval from the 
end of one phase III to the end of a subsequent phase III at any one site. Migration Time is the time 
taken for the MMC to migrate from the upper duodenum to the terminal ileum. 

6.4 The Stomach 
Anatomical Features 

The stomach is somewhat pyriform in shape with its large end directed upward at the lower esophageal 
sphincter and its small end bent to the right at the pylorus. It has two curvatures, the greater curvature 
which is four to five times as long as the lesser curvature, and it consists of three regions: the fundus, 
corpus (or body), and antrum, respectively. It has three smooth muscle layers. The outermost layer is 
the longitudinal muscle layer, the middle is the circular muscle layer, and the innermost is the oblique 
muscle layer. These layers thicken gradually in the distal stomach towards the pylorus, which is consistent 
with stomach function since trituration occurs in the distal antrum. The size of the stomach varies 
considerably among subjects. In an adult male, its greatest length when distended is about 25 to 30 cm 
and its widest diameter is about 10 to 12 cm [Pick and Howden, 1977]. 

The structural relationships of nerve, muscle, and interstitial cells of Cajal in the canine corpus indicated 
a high density of gap junctions indicating very tight coupling between cells. Nerves in the corpus are not 
located close to circular muscle cells but are found exterior to the muscle bundles, whereas ICCs have gap 
junction contact with smooth muscle cells and are closely innervated [Daniel and Sakai, 1984]. 

Gastric ECA 

In the canine stomach, the fundus does not usually exhibit spontaneous electrical oscillations, but the 
corpus and antrum do exhibit such oscillations. In the intact stomach, the ECA is entrained to a frequency 
of about 5 cpm (about 3 cpm in humans) throughout the electrically active region with phase lags in 
both the longitudinal and circumferential directions [Sarna et al„ 1972]. The phase lags decrease distally 
from corpus to antrum. 

There is a marked intrinsic frequency gradient along the axis of the stomach and a slight intrinsic 
frequency gradient along the circumference. The intrinsic frequency of gastric ECA in isolated circular 
muscle of the orad and mid corpus is the highest (about 5 cpm) compared to about 3.5 cpm in the rest 
of the corpus, and about 0.5 cpm in the antrum. Also, there is an orad to aborad intrinsic gradient in 
resting membrane potential, with the terminal antrum having the most negative resting membrane 
potential, about 30 mV more negative than the fundal regions [Szurszewski, 1987]. The relatively depo- 
larized state of the fundal muscle may explain its electrical inactivity since the voltage-sensitive ionic 
channels may be kept in a state of inactivation. Hyperpolarization of the fundus to a transmembrane 
voltage of -60 mV produces fundal control waves similar to those recorded from mid and orad corpus. 
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The ECA in canine stomach was modeled [Sarna et al., 1972] using an array of 13 bidirectionally 
coupled relaxation oscillators. The model featured (1) an intrinsic frequency decline from corpus to the 
pylorus and from greater curvature to the lesser curvature, (2) entrainment of all coupled oscillators at 
a frequency close to the highest intrinsic frequency, and (3) distally decreasing phase lags between the 
entrained oscillators. A simulated circumferential transection caused the formation of another frequency 
plateau aboral to the transection. The frequency of the orad plateau remained unaffected while that of 
the aborad plateau was decreased. This is consistent with the observed experimental data. 

The Electrogastrogram 

In a similar manner to other electrophysiological measures such as the electrocardiogram (EKG) and the 
electroencephalogram (EEG), the electrogastrogram (EGG) was identified [Stern and Koch, 1985; Chen 
and McCallum, 1994]. The EGG is the signal obtained from cutaneous recording of the gastric myoelec- 
trical activity by using surface electrodes placed on the abdomen over the stomach. Although the first 
EGG was recorded in the early 1920s [Alvarez, 1922], progress vis-a-vis clinical applications has been 
relatively slow, in particular when compared to the progress made in EKG, which also started in the early 
1920s. Despite many attempts made over the decades, visual inspection of the EGG signal has not led to 
the identification of waveform characteristics that would help the clinician to diagnose functional or 
organic diseases of the stomach. Even the development of techniques such as time-frequency analysis 
[Qiao et al., 1998] and artificial neural network-based feature extraction [Liang et al., 1997; Wang et al., 
1999] for computer analysis of the EGG did not provide clinically relevant information about gastric 
motility disorders. It has been demonstrated that increased EGG frequencies (1) were seen in perfectly 
healthy subjects [Pffafenbach et al., 1995], and (2) did not always correspond to serosally recorded 
tachygastria in dogs [Mintchev and Bowes, 1997]. As yet, there is no effective method of detecting a 
change in the direction or velocity of propagation of gastric ECA from the EGG. 

6.5 The Small Intestine 
Anatomical Features 

The small intestine is a long hollow organ which consists of the duodenum, jejunum, and ileum, 
respectively. Its length is about 650 cm in humans and 300 cm in dogs. The duodenum extends from 
the pylorus to the ligament of Treitz (about 30 cm in humans and dogs). In humans, the duodenum 
forms a C-shaped pattern, with the ligament of Treitz near the corpus of the stomach. In dogs, the 
duodenum lies along the right side of the peritoneal cavity, with the ligament of Treitz in the pelvis. The 
duodenum receives pancreatic exocrine secretions and bile. In both humans and dogs, the jejunum 
consists of the next one-third whereas the ileum consists of the remaining two-thirds of the intestine. 
The major differences between the jejunum and ileum are functional in nature, relating to their absorption 
characteristics and motor control. The majority of sugars, amino acids, lipids, electrolytes, and water are 
absorbed in the jejunum and proximal ileum, whereas bile acids and vitamin B12 are absorbed in the 
terminal ileum. 

Small Intestinal ECA 

In the canine small intestine, the ECA is not entrained throughout the entire length [Diamant and Bortoff, 
1969a; Sarna et al., 1971]. However, the ECA exhibits a plateau of constant frequency in the proximal 
region whereby there is a distal increase in phase lag. The frequency plateau (of about 20 cpm) extends 
over the entire duodenum and part of the jejunum. There is a marked intrinsic frequency gradient in 
the longitudinal direction with the highest intrinsic frequency being less than the plateau frequency. 
When the small intestine was transected in vivo into small segments (15 cm long), the intrinsic frequency 
of the ECA in adjacent segments tended to decrease aborally in an exponential manner [Sarna et al., 
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1971]. A single transection of the duodenum caused the formation of another frequency plateau aboral 
to the transection. The ECA frequency in the orad plateau was generally unaffected, while that in the 
aborad plateau was decreased [Diamant and Bortoff, 1969b; Sarna et al., 1971]. The frequency of the 
aborad plateau was either higher than or equal to the highest intrinsic frequency distal to the transection, 
depending on whether the transection of the duodenum was either above or below the region of the bile 
duct [Diamant and Bortoff, 1969b]. 

The ECA in canine small intestine was modeled using a chain of 16 bidirectionally coupled relaxation 
oscillators [Sarna et al., 1971]. Coupling was not uniform along the chain, since the proximal oscillators 
were strongly coupled and the distal oscillators were weakly coupled. The model featured (1) an expo- 
nential intrinsic frequency decline along the chain, (2) a frequency plateau which is higher than the 
highest intrinsic frequency, and (3) a temporal variation of the frequencies distal to the frequency plateau 
region. A simulated transection in the frequency plateau region caused the formation of another frequency 
plateau aboral to the transection, such that the frequency of the orad plateau was unchanged whereas 
the frequency of the aborad plateau decreased. 

The ECA in human small intestine was modeled using a chain of 100 bidirectionally coupled relaxation 
oscillators [Robertson-Dunn and Linkens, 1976]. Coupling was nonuniform and asymmetrical. The 
model featured (1) a piecewise linear decline in intrinsic frequency along the chain, (2) a piecewise linear 
decline in coupling similar to that of the intrinsic frequency, (3) forward coupling which is stronger than 
backward coupling, and (4) a frequency plateau in the proximal region which is higher than the highest 
intrinsic frequency in the region. 

Small Intestinal MMC 

The MMCs in canine small intestine have been observed in intrinsically isolated segments [Sarna et al., 
1981; 1983], even after the isolated segment has been stripped of all extrinsic innervation [Sarr and Kelly, 
1981] or removed in continuity with the remaining gut as a Thiry Vella loop [Itoh et al., 1981]. This 
intrinsic mechanism is able to function independently of extrinsic innervation since vagotomy [ Weisbrodt 
et al., 1975; Ruckebusch and Bueno, 1977] does not hinder the initiation of the MMC. The initiation of 
the small intestinal MMC is controlled by integrative networks within the intrinsic plexuses utilizing 
nicotinic and muscarinic cholinergic receptors [Ormsbee et al., 1979; El-Sharkawy et al., 1982]. 

When the canine small intestine was transected into four equal strips [Sarna et al., 1981, 1983], it was 
found that each strip was capable of generating an independent MMC that would appear to propagate 
from the proximal to the distal part of each segment. This suggested that the MMC can be modeled by 
a chain of coupled relaxation oscillators. The average intrinsic periods of the MMC for the four segments 
were reported to be 106.2, 66.8, 83.1, and 94.8 min, respectively. The segment containing the duodenum 
had the longest period, while the subsequent segment containing the jejunum had the shortest period. 
However, in the intact small intestine, the MMC starts in the duodenum and not the jejunum. Bardakjian 
et al. [1981, 1984] have demonstrated that both the intrinsic frequency gradients and resting level 
gradients have major roles in the entrainment of a chain of coupled oscillators. In modeling the small 
intestinal MMC with a chain of four coupled oscillators, it was necessary to include a gradient in the 
intrinsic resting levels of the MMC oscillators (with the proximal oscillator having the lowest resting 
level) in order to entrain the oscillators and allow the proximal oscillator to behave as the leading oscillator 
[Bardakjian and Ahmed, 1992]. 

6.6 The Colon 
Anatomical Features 

In humans, the colon is about 100 cm in length. The ileum joins the colon approximately 5 cm from its 
end, forming the cecum which has a worm-like appendage, the appendix. The colon is sacculated, and 
the longitudinal smooth muscle is concentrated in three bands (the taeniae). It lies in front of the small 
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intestine against the abdominal wall and it consists of the ascending (on the right side), transverse (across 
the lower stomach), and descending (on the left side) colon. The descending colon becomes the sigmoid 
colon in the pelvis as it runs down and forward to the rectum. Major functions of the colon are (1) to 
absorb water, certain electrolytes, short chain fatty acids, and bacterial metabolites; (2) to slowly propel 
its luminal contents in the caudad direction; (3) to store the residual matter in the distal region; and 
(4) to rapidly move its contents in the caudad direction during mass movements [Sarna, 1991]. In dogs, 
the colon is about 45 cm in length and the cecum has no appendage. The colon is not sacculated, and 
the longitudinal smooth muscle coat is continuous around the circumference [Miller et al., 1968]. It lies 
posterior to the small intestine and it consists mainly of ascending and descending segments with a small 
transverse segment. However, functionally it is assumed to consist of three regions, each of about 15 cm 
in length, representing the ascending, transverse, and descending colon, respectively. 

Colonic ECA 

In the human colon, the ECA is almost completely phase-unlocked between adjacent sites as close as 1 to 

2 cm apart, and its frequency (about 3 to 15 cpm) and amplitude at each site vary with time [Sarna et al., 
1980]. This results in short duration contractions that are also disorganized in time and space. The 
disorganization of ECA and its associated contractions is consistent with the colonic function of extensive 
mixing, kneading, and slow net distal propulsion [Sarna, 1991]. In the canine colon, the reports about 
the intrinsic frequency gradient were conflicting [Vanasin et al., 1974; Shearin et al., 1978; El-Sharkawy, 
1983]. 

The human colonic ECA was modeled [Bardakjian and Sarna, 1980] using a tubular structure of 
99 bidirectionally coupled nonlinear oscillators arranged in 33 parallel rings where each ring contained 

3 oscillators. Coupling was nonuniform and it increased in the longitudinal direction. The model featured 
(1) no phase-locking in the longitudinal or circumferential directions, (2) temporal and spatial variation 
of the frequency profile with large variations in the proximal and distal regions and small variations in 
the middle region, and (3) waxing and waning of the amplitudes of the ECA which was more pronounced 
in the proximal and distal regions. The model demonstrated that the "silent periods" occurred because 
of the interaction between oscillators and they did not occur when the oscillators were uncoupled. The 
model was further refined [Bardakjian et al., 1990] such that when the ECA amplitude exceeded an 
excitation threshold, a burst of ERA was exhibited. The ERA bursts occurred in a seemingly random 
manner in adjacent sites because (1) the ECA was not phase-locked, and (2) the ECA amplitudes and 
waveshapes varied in a seemingly random manner. 

6.7 Epilogue 

The ECA in stomach, small intestine, and colon behaves like the outputs of a population of coupled 
nonlinear oscillators. The populations in the stomach and the proximal small intestine are entrained, 
whereas those in the distal small intestine and colon are not entrained. There are distinct intrinsic 
frequency gradients in the stomach and small intestine but their profile in the colon is ambiguous. 

The applicability of modeling of gastrointestinal ECA by coupled nonlinear oscillators has been 
reconfirmed [Daniel et al., 1994], and a novel nonlinear oscillator, the mapped clock oscillator, was 
proposed [Bardakjian and Diamant, 1994] for modeling the cellular ECA. The oscillator consists of two 
coupled components: a clock which represents the interstitial cells of Cajal, and a transformer which 
represents the smooth muscle transmembrane ionic transport mechanisms [Skinner and Bardakjian, 
1991]. Such a model accounts for the mounting evidence supporting the role of the interstitial cells of 
Cajal as a pacemaker for the smooth muscle transmembrane voltage oscillations [Hara et al., 1986; Suzuki 
etal., 1986; Barajas-Lopez et al, 1989; Serio et al., 1991; Sanders, 1996]. 

Modeling of the gastrointestinal ECA by populations of coupled nonlinear oscillators [Bardakjian, 
1987] suggests that gastrointestinal motility disorders associated with abnormal ECA can be effectively 
treated by (1) electronic pacemakers to coordinate the oscillators, (2) surgical interventions to remove 
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regional ectopic foci, and (3) pharmacotherapy to stimulate the oscillators. Electronic pacing has been 
demonstrated in canine stomach [Kelly and LaForce, 1972; Sarna and Daniel, 1973; Bellahsene et al., 
1992] and small intestine [Sarna and Daniel, 1975c; Becker et al., 1983]. Also, pharmacotherapy with 
prokinetic drugs such as Domperidone and Cisapride has demonstrated improvements in the coordina- 
tion of the gastric oscillators. 
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As functioning units, the lung and heart are usually considered a single complex organ, but because these 
organs contain essentially two compartments — one for blood and one for air — they are usually 
separated in terms of the tests conducted to evaluate heart or pulmonary function. This chapter focuses 
on some of the physiologic concepts responsible for normal function and specific measures of the lung's 
ability to supply tissue cells with enough oxygen while removing excess carbon dioxide. 

7.1 Respiration Anatomy 

The respiratory system consists of the lungs, conducting airways, pulmonary vasculature, respiratory 
muscles, and surrounding tissues and structures (Fig. 7.1). Each plays an important role in influencing 
respiratory responses. 

Lungs 

There are two lungs in the human chest; the right lung is composed of three incomplete divisions called 
lobes, and the left lung has two, leaving room for the heart. The right lung accounts for 55% of total gas 
volume and the left lung for 45%. Lung tissue is spongy because of the very small (200 to 300 x 10~ 6 m 
diameter in normal lungs at rest) gas-filled cavities called alveoli, which are the ultimate structures for 
gas exchange. There are 250 million to 350 million alveoli in the adult lung, with a total alveolar surface 
area of 50 to 100 m 2 depending on the degree of lung inflation [Johnson, 1991]. 

Conducting Airways 

Air is transported from the atmosphere to the alveoli beginning with the oral and nasal cavities, through 
the pharynx (in the throat), past the glottal opening, and into the trachea or windpipe. Conduction of 
air begins at the larynx, or voice box, at the entrance to the trachea, which is a fibromuscular tube 10 to 
12 cm in length and 1.4 to 2.0 cm in diameter [Kline, 1976]. At a location called the carina, the trachea 
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FIGURE 7.1 Schematic representation of the respiratory system. 

terminates and divides into the left and right bronchi. Each bronchus has a discontinuous cartilaginous 
support in its wall. Muscle fibers capable of controlling airway diameter are incorporated into the walls 
of the bronchi, as well as in those of air passages closer to the alveoli. Smooth muscle is present throughout 
the respiratory bronchiolus and alveolar ducts but is absent in the last alveolar duct, which terminates 
in one to several alveoli. The alveolar walls are shared by other alveoli and are composed of highly pliable 
and collapsible squamous epithelium cells. 

The bronchi subdivide into subbronchi, which further subdivide into bronchioli, which further sub- 
divide, and so on, until finally reaching the alveolar level. Table 7.1 provides a description and dimensions 
of the airways of adult humans. A model of the geometric arrangement of these air passages is presented 
in Fig. 7.2. It will be noted that each airway is considered to branch into two subairways. In the adult 
human there are considered to be 23 such branchings, or generations, beginning at the trachea and 
ending in the alveoli. 

Movement of gases in the respiratory airways occurs mainly by bulk flow (convection) throughout 
the region from the mouth to the nose to the fifteenth generation. Beyond the fifteenth generation, gas 
diffusion is relatively more important. With the low gas velocities that occur in diffusion, dimensions of 
the space over which diffusion occurs (alveolar space) must be small for adequate oxygen delivery into 
the walls; smaller alveoli are more efficient in the transfer of gas than are larger ones. Thus animals with 
high levels of oxygen consumption are found to have smaller-diameter alveoli compared with animals 
with low levels of oxygen consumption. 

Alveoli 

Alveoli are the structures through which gases diffuse to and from the body. To ensure gas exchange 
occurs efficiently, alveolar walls are extremely thin. For example, the total tissue thickness between the 
inside of the alveolus to pulmonary capillary blood plasma is only about 0.4 x 10~ 6 m. Consequently, the 
principal barrier to diffusion occurs at the plasma and red blood cell level, not at the alveolar membrane 
[Ruch and Patton, 1966]. 
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FIGURE 7.2 General architecture of conductive and transitory airways. (Used with permission from Weibel, 1963.) 
In the conductive zone air is conducted to and from the lungs while in the respiration zone, gas exchange occurs. 

Molecular diffusion within the alveolar volume is responsible for mixing of the enclosed gas. Due to 
small alveolar dimensions, complete mixing probably occurs in less than 10 ms, fast enough that alveolar 
mixing time does not limit gaseous diffusion to or from the blood [Astrand and Rodahl, 1970]. 

Of particular importance to proper alveolar operation is a thin surface coating of surfactant. Without 
this material, large alveoli would tend to enlarge and small alveoli would collapse. It is the present view 
that surfactant acts like a detergent, changing the stress-strain relationship of the alveolar wall and thereby 
stabilizing the lung [Johnson, 1991]. 

Pulmonary Circulation 

There is no true pulmonary analogue to the systemic arterioles, since the pulmonary circulation occurs 
under relatively low pressure [West, 1977]. Pulmonary blood vessels, especially capillaries and venules, 
are very thin walled and flexible. Unlike systemic capillaries, pulmonary capillaries increase in diameter, 
and pulmonary capillaries within alveolar walls separate adjacent alveoli with increases in blood pressure 
or decreases in alveolar pressure. Flow, therefore, is significantly influenced by elastic deformation. 
Although pulmonary circulation is largely unaffected by neural and chemical control, it does respond 
promptly to hypoxia. 

There is also a high-pressure systemic blood delivery system to the bronchi that is completely inde- 
pendent of the pulmonary low-pressure (-3330 N/m 2 ) circulation in healthy individuals. In diseased 
states, however, bronchial arteries are reported to enlarge when pulmonary blood flow is reduced, and 
some arteriovenous shunts become prominent [West, 1977]. 

Total pulmonary blood volume is approximately 300 to 500 cm 3 in normal adults, with about 60 to 
100 cm 3 in the pulmonary capillaries [Astrand and Rodahl, 1970]. This value, however, is quite variable, 
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depending on such things as posture, position, disease, and chemical composition of the blood [Kline, 
1976]. 

Since pulmonary arterial blood is oxygen-poor and carbon dioxide-rich, it exchanges excess carbon 
dioxide for oxygen in the pulmonary capillaries, which are in close contact with alveolar walls. At rest, 
the transit time for blood in the pulmonary capillaries is computed as 

t=v lv 

C I c 

where t = blood transmit time, s 

V c = capillary blood volume, m 3 

V c = total capillary blood flow = cardiac output, m 3 /s 

and is somewhat less than 1 s, while during exercise it may be only 500 ms or even less. 

Respiratory Muscles 

The lungs fill because of a rhythmic expansion of the chest wall. The action is indirect in that no muscle 
acts directly on the lung. The diaphragm, the muscular mass accounting for 75% of the expansion of 
the chest cavity, is attached around the bottom of the thoracic cage, arches over the liver, and moves 
downward like a piston when it contracts. The external intercostal muscles are positioned between the 
ribs and aid inspiration by moving the ribs up and forward. This, then, increases the volume of the 
thorax. Other muscles are important in the maintenance of thoracic shape during breathing. (For details, 
see Ruch and Patton [1966] and Johnson [1991]). 

Quiet expiration is usually considered to be passive; i.e., pressure to force air from the lungs comes 
from elastic expansion of the lungs and chest wall. During moderate to severe exercise, the abdominal 
and internal intercostal muscles are very important in forcing air from the lungs much more quickly 
than would otherwise occur. Inspiration requires intimate contact between lung tissues, pleural tissues 
(the pleura is the membrane surrounding the lungs), and chest wall and diaphragm. This is accomplished 
by reduced intrathoracic pressure (which tends toward negative values) during inspiration. 

Viewing the lungs as an entire unit, one can consider the lungs to be elastic sacs within an air-tight 
barrel — the thorax — which is bounded by the ribs and the diaphragm. Any movement of these two 
boundaries alters the volume of the lungs. The normal breathing cycle in humans is accomplished by 
the active contraction of the inspiratory muscles, which enlarges the thorax. This enlargement lowers 
intrathoracic and interpleural pressure even further, pulls on the lungs, and enlarges the alveoli, alveolar 
ducts, and bronchioli, expanding the alveolar gas and decreasing its pressure below atmospheric. As a 
result, air at atmospheric pressure flows easily into the nose, mouth, and trachea. 



7.2 Lung Volumes and Gas Exchange 



Of primary importance to lung functioning is the movement and mixing of gases within the respiratory 
system. Depending on the anatomic level under consideration, gas movement is determined mainly by 
diffusion or convection. 

Without the thoracic musculature and rib cage, as mentioned above, the barely inflated lungs would 
occupy a much smaller space than they occupy in situ. However, the thoracic cage holds them open. 
Conversely, the lungs exert an influence on the thorax, holding it smaller than should be the case without 
the lungs. Because the lungs and thorax are connected by tissue, the volume occupied by both together 
is between the extremes represented by relaxed lungs alone and thoracic cavity alone. The resting volume 
V R , then, is that volume occupied by the lungs with glottis open and muscles relaxed. 

Lung volumes greater than resting volume are achieved during inspiration. Maximum inspiration is 
represented by inspiratory reserve volume (IRV). IRV is the maximum additional volume that can be 
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FIGURE 7.3 Lung capacities and lung volumes. 

accommodated by the lung at the end of inspiration. Lung volumes less than resting volume do not 
normally occur at rest but do occur during exhalation while exercising (when exhalation is active). 
Maximum additional expiration, as measured from lung volume at the end of expiration, is called 
expiratory reserve volume (ERV). Residual volume is the amount of gas remaining in the lungs at the end 
of maximal expiration. 

Tidal volume V T is normally considered to be the volume of air entering the nose and mouth with 
each breath. Alveolar ventilation volume, the volume of fresh air that enters the alveoli during each breath, 
is always less than tidal volume. The extent of this difference in volume depends primarily on the anatomic 
dead space, the 150- to 160-ml internal volume of the conducting airway passages. The term dead is quite 
appropriate, since it represents wasted respiratory effort; i.e., no significant gas exchange occurs across 
the thick walls of the trachea, bronchi, and bronchiolus. Since normal tidal volume at rest is usually 
about 500 ml of air per breath, one can easily calculate that because of the presence of this dead space, 
about 340 to 350 ml of fresh air actually penetrates the alveoli and becomes involved in the gas exchange 
process. An additional 150 to 160 ml of stale air exhaled during the previous breath is also drawn into 
the alveoli. 

The term volume is used for elemental differences of lung volume, whereas the term capacity is used 
for combination of lung volumes. Figure 7.3 illustrates the interrelationship between each of the following 
lung volumes and capacities: 

1. Total lung capacity (TLC): The amount of gas contained in the lung at the end of maximal 
inspiration. 

2. Forced vital capacity (FVC): The maximal volume of gas that can be forcefully expelled after 
maximal inspiration. 

3. Inspiratory capacity (IC): The maximal volume of gas that can be inspired from the resting 
expiratory level. 

4. Functional residual capacity (FRC): The volume of gas remaining after normal expiration. It will 
be noted that functional residual capacity (FRC) is the same as the resting volume. There is a 
small difference, however, between resting volume and FRC because FRC is measured while the 
patient breathes, whereas resting volume is measured with no breathing. FRC is properly defined 
only at end-expiration at rest and not during exercise. 
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TABLE 7.2 Typical Lung Volumes for Normal, Healthy Males 

Lung Volume 

Total lung capacity (TLC) 
Residual volume (RV) 
Vital capacity (VC) 

Inspiratory reserve volume (IRV) 

Expiratory reserve volume (ERV) 
Functional residual capacity (FRC) 
Anatomic dead volume (V D ) 

Upper airways volume 

Lower airways volume 
Physiologic dead volume (V D ) 
Minute volume (VJ at rest 
Respiratory period ( T) at rest 
Tidal volume (V T ) at rest 
Alveolar ventilation volume (V A ) at rest 
Minute volume during heavy exercise 
Respiratory period during heavy exercise 
Tidal volume during heavy exercise 
Alveolar ventilation volume during exercise 

Source: Adapted and used with permission from Forster et al [1986], 

These volumes and specific capacities, represented in Fig. 7.3, have led to the development of specific 
tests (that will be discussed below) to quantify the status of the pulmonary system. Typical values for 
these volumes and capacities are provided in Table 7.2. 





Normal Values 


6.0 x 10- 


m 3 


(6,000 cm 3 ) 


1.2 x 10- 


m 3 


(1,200 cm 3 ) 


4.8 x 10- 


m 3 


(4,800 cm 3 ) 


3.6 x 10- 


m 3 


(3,600 cm 3 ) 


1.2 x 10- 


m 3 


(1,200 cm 3 ) 


2.4 x 10- 


m 3 


(2,400 cm 3 ) 


1.5 x 10- 


m 3 


(150 cm 3 ) 


8.0 x 10- 


m 3 


(80 cm 3 ) 


7.0 x 10- 


m 3 


(70 cm 3 ) 


1.8 x 10- 


m 3 


(180 cm 3 ) 


1.0 x 10- 


m 3 /s 


(6,000 cm 3 /min) 


4s 






4.0 x 10- 


m 3 


(400 cm 3 ) 


2.5 x 10- 


m 3 


(250 cm 3 ) 


1.7 x 10- 


m 3 /s 


(10,000 cm 3 /min) 


1.2 s 






2.0 x 10- 


m 3 


(2,000 cm 3 ) 


1.8 x 10- 


m 3 


(1,820 cm 3 ) 



7.3 Perfusion of the Lung 



For gas exchange to occur properly in the lung, air must be delivered to the alveoli via the conducting 
airways, gas must diffuse from the alveoli to the capillaries through extremely thin walls, and the same 
gas must be removed to the cardiac atrium by blood flow. This three-step process involves (1) alveolar 
ventilation, (2) the process of diffusion, and (3) ventilatory perfusion, which involves pulmonary blood 
flow. Obviously, an alveolus that is ventilated but not perfused cannot exchange gas. Similarly, a perfused 
alveolus that is not properly ventilated cannot exchange gas. The most efficient gas exchange occurs when 
ventilation and perfusion are matched. 

There is a wide range of ventilation-to-perfusion ratios that naturally occur in various regions of the 
lung [Johnson, 1991]. Blood flow is somewhat affected by posture because of the effects of gravity. In 
the upright position, there is a general reduction in the volume of blood in the thorax, allowing for larger 
lung volume. Gravity also influences the distribution of blood, such that the perfusion of equal lung 
volumes is about five times greater at the base compared with the top of the lung [Astrand and Rodahl, 
1970]. There is no corresponding distribution of ventilation; hence the ventilation-to-perfusion ratio is 
nearly five times smaller at the top of the lung (Table 7.3). A more uniform ventilation-to-perfusion ratio 
is found in the supine position and during exercise [Jones, 1984b]. 

Blood flow through the capillaries is not steady. Rather, blood flows in a halting manner and may even 
be stopped if intraalveolar pressure exceeds intracapillary blood pressure during diastole. Mean blood 
flow is not affected by heart rate [West, 1977], but the highly distensible pulmonary blood vessels admit 
more blood when blood pressure and cardiac output increase. During exercise, higher pulmonary blood 
pressures allow more blood to flow through the capillaries. Even mild exercise favors more uniform 
perfusion of the lungs [Astrand and Rodahl, 1970]. Pulmonary artery systolic pressures increases from 
2670 N/m 2 (20 mmHg) at rest to 4670 N/m 2 (35 mmHg) during moderate exercise to 6670 N/m 2 
(50 mmHg) at maximal work [Astrand and Rodahl, 1970]. 
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TABLE 7.3 Ventilation-to-Perfusion Ratios from the Top to Bottom 
of the Lung of Normal Man in the Sitting Position 



Percent 


Alveolar 




Perfusion 




Lung Volume, 


Ventilation Rate, 


Rate, 


Ventilation-to- 


% 


cm 3 /s 




cm'/s 


Perfusion Ratio 






Top 






7 


4.0 




1.2 


3.3 


8 


5.5 




3.2 


1.8 


10 


7.0 




5.5 


1.3 


11 


8.7 




8.3 


1.0 


12 


9.8 




11.0 


0.90 


13 


11.2 




13.8 


0.80 


13 


12.0 




16.3 


0.73 


13 


13.0 


Bottom 


19.2 


0.68 


13 


13.7 




21.5 


0.63 


100 


84.9 




100.0 





Source: Used with permission from West [1962]. 

7.4 Gas Partial Pressure 

The primary purpose of the respiratory system is gas exchange. In the gas-exchange process, gas must 
diffuse through the alveolar space, across tissue, and through plasma into the red blood cell, where it 
finally chemically joins to hemoglobin. A similar process occurs for carbon dioxide elimination. 

As long as intermolecular interactions are small, most gases of physiologic significance can be consid- 
ered to obey the ideal gas law: 

pV = nRT 

where p = pressure, N/m 2 

V = volume of gas, m 3 

n = number of moles, mol 

R = gas constant, (N x m)/(mol x K) 

T = absolute temperature, K 

The ideal gas law can be applied without error up to atmospheric pressure; it can be applied to a 
mixture of gases, such as air, or to its constituents, such as oxygen or nitrogen. All individual gases in a 
mixture are considered to fill the total volume and have the same temperature but reduced pressures. 
The pressure exerted by each individual gas is called the partial pressure of the gas. 

Dalton's law states that the total pressure is the sum of the partial pressures of the constituents of a 
mixture: 

N 



where p t = partial pressure of the rth constituent, N/m 2 
N = total number of constituents 



Dividing the ideal gas law for a constituent by that for the mixture gives 
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TABLE 7.4 Molecular Masses, Gas Constants, and Volume 
Fractions for Air and Constituents 





Molecular 




Volume Fraction 




Mass 


Gas Constant, 


in Air, 


Constituent 


kg/mol 


N-m/(mol-K) 


mVm 3 


Aii- 


29.0 


286.7 


1.0000 


Ammonia 


17.0 


489.1 


0.0000 


Argon 


39.9 


208.4 


0.0093 


Carbon dioxide 


44.0 


189.0 


0.0003 


Carbon monoxide 


28.0 


296.9 


0.0000 


Helium 


4.0 


2078.6 


0.0000 


Hydrogen 


2.0 


4157.2 


0.0000 


Nitrogen 


28.0 


296.9 


0.7808 


Oxygen 


32.0 


259.8 


0.2095 



Note: Universal gas constant is 8314.43 N-m/kg-mol-K. 



PV n.RT 



PV nRT 
so that 

p. nR 
p nR 

which states that the partial pressure of a gas may be found if the total pressure, mole fraction, and ratio 
of gas constants are known. For most respiratory calculations, p will be considered to be the pressure of 
1 atmosphere, 101 kN/m 2 . Avogadro's principle states that different gases at the same temperature and 
pressure contain equal numbers of molecules: 

V 2 nR 2 R 2 
Thus 

P v 

where V-J Vis the volume fraction of a constituent in air and is therefore dimensionless. Table 7.4 provides 
individual gas constants, as well as volume fractions, of constituent gases of air. 

Gas pressures and volumes can be measured for many different temperature and humidity conditions. 
Three of these are body temperature and pressure, saturated (BTPS); ambient temperature and pressure 
(ATP); and standard temperature and pressure, dry (STPD). To calculate constituent partial pressures at 
STPD, total pressure is taken as barometric pressure minus vapor pressure of water in the atmosphere: 



p, = (^>)(p-p h 2 o) 



where p = total pressure, kN/m 2 

pH 2 = vapor pressure of water in atmosphere, kN/m 2 

and Vj/Vas a ratio does not change in the conversion process. 
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TABLE 7.5 Gas Partial Pressures (kN/m 2 ) Throughout 
the Respiratory and Circulatory Systems 











Mixed 








Inspired 


Alveolar 


Expired 


Venous 


Arterial 


Muscle 


Gas 


Air* 


Air 


Air 


Blood 


Blood 


Tissue 


H 2 


— 


6.3 


6.3 


6.3 


6.3 


6.3 


co 2 


0.04 


5.3 


4.2 


6.1 


5.3 


6.7 


o, 


21.2 


14.0 


15.5 


5.3 


13.3 


4.0 


N,t 


80.1 


75.7 


75.3 


76.4 


76.4 


76.4 


Total 


101.3 


101.3 


101.3 


94.1 


101.3 


93.4 



^Inspired air considered dry for convenience. 

♦Includes all other inert components. 

Source: Used with permission from Astrand and Rodahl [1970]. 

Gas volume at STPD is converted from ambient condition volume as 



V. = V 



273/(273 + 0) (p-pH 2 O)/l01.3 



where V,- = volume of gas i corrected to STPD, m 3 

V amb = volume of gas i at ambient temperature and pressure, m 3 

= ambient temperature, °C 

p = ambient total pressure, kN/m 2 

pH 2 = vapor pressure of water in the air, kN/m 2 

Partial pressures and gas volumes may be expressed in BTPS conditions. In this case, gas partial 
pressures are usually known from other measurements. Gas volumes are converted from ambient con- 
ditions by 



V. = V 



31o/(273 + 0) (p-pH 2 o)/p-6.28 



Table 7.5 provides gas partial pressure throughout the respiratory and circulatory systems. 

7.5 Pulmonary Mechanics 

The respiratory system exhibits properties of resistance, compliance, and inertance analogous to the 
electrical properties of resistance, capacitance, and inductance. Of these, inertance is generally considered 
to be of less importance than the other two properties. 
Resistance is the ratio of pressure to flow: 

R=p/V 

where R = resistance, N x s/m 5 
P = pressure, N/m 2 
V = volume flow rate, m 3 /s 

Resistance can be found in the conducting airways, in the lung tissue, and in the tissues of the chest wall. 
Airways exhalation resistance is usually higher than airways inhalation resistance because the surrounding 
lung tissue pulls the smaller, more distensible airways open when the lung is being inflated. Thus airways 
inhalation resistance is somewhat dependent on lung volume, and airways exhalation resistance can be 
very lung volume-dependent [Johnson, 1991]. Respiratory tissue resistance varies with frequency, lung 
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volume, and volume history. Tissue resistance is relatively small at high frequencies but increases greatly 
at low frequencies, nearly proportional to 1/f. Tissue resistance often exceeds airway resistance below 
2 Hz. Lung tissue resistance also increases with decreasing volume amplitude [Stamenovic et al., 1990]. 
Compliance is the ratio of lung volume to lung pressure: 

c=v/ P 

where C = compliance, m 5 /N, 
V = lung volume/m 3 
P = pressure, N/m 2 

As the lung is stretched, it acts as an expanded balloon that tends to push air out and return to its normal 
size. The static pressure-volume relationship is nonlinear, exhibiting decreased static compliance at the 
extremes of lung volume [Johnson, 1991]. As with tissue resistance, dynamic tissue compliance does not 
remain constant during breathing. Dynamic compliance tends to increase with increasing volume and 
decrease with increasing frequency [Stamenovic et al., 1990]. 

Two separate approaches can be used to model lung tissue mechanics. The traditional approach places 
a linear viscoelastic system in parallel with a plastoelastic system. A linear viscoelastic system consists of 
ideal resistive and compliant elements and can exhibit the frequency-dependence of respiratory tissue. 
A plastoelastic system consists of dry-friction elements and compliant elements and can exhibit the 
volume dependence of respiratory tissue [Hildebrandt, 1970]. An alternate approach is to utilize a 
nonlinear viscoelastic system that can characterize both the frequency dependence and the volume 
dependence of respiratory tissue [Suki and Bates, 1991]. 

Lung tissue hysteresivity relates resistance and compliance: 

where ft) = frequency, radians/s 

R = resistance, N x s/m 5 

r\ = hysteresivity, unitless 

C d = dynamic compliance, m 5 /n 

Hysteresivity, analogous to the structural damping coefficient used in solid mechanics, is an empirical 
parameter arising from the assumption that resistance and compliance are related at the microstructural 
level. Hysteresivity is independent of frequency and volume. Typical values range from 0. 1 to 0.3 [Fredberg 
and Stamenovic, 1989]. 



7.6 Respiratory Control 



Control of respiration occurs in many different cerebral structures [Johnson, 1991] and regulates many 
things [Hornbein, 1981]. Respiration must be controlled to produce the respiratory rhythm, ensure 
adequate gas exchange, protect against inhalation of poisonous substances, assist in maintenance of body 
pH, remove irritations, and minimize energy cost. Respiratory control is more complex than cardiac 
control for at least three reasons: 

1. Airways airflow occurs in both directions. 

2. The respiratory system interfaces directly with the environment outside the body. 

3. Parts of the respiratory system are used for other functions, such as swallowing and speaking. 

As a result, respiratory muscular action must be exquisitely coordinated; it must be prepared to protect 
itself against environmental onslaught, and breathing must be temporarily suspended on demand. 
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FIGURE 7.4 General scheme of respiratory control. 

All control systems require sensors, controllers, and effectors. Figure 7.4 presents the general scheme 
for respiratory control. There are mechanoreceptors throughout the respiratory system. For example, 
nasal receptors are important in sneezing, apnea (cessation of breathing), bronchodilation, bronchocon- 
striction, and the secretion of mucus. Laryngeal receptors are important in coughing, apnea, swallowing, 
bronchoconstriction, airway mucus secretion, and laryngeal constriction. Tracheobronchial receptors are 
important in coughing, pulmonary hypertension, bronchoconstriction, laryngeal constriction, and 
mucus production. Other mechanoreceptors are important in the generation of the respiratory pattern 
and are involved with respiratory sensation. 

Respiratory chemoreceptors exist peripherally in the aortic arch and carotic bodies and centrally in 
the ventral medulla oblongata of the brain. These receptors are sensitive to partial pressures of C0 2 and 
2 and to blood pH. 

The respiratory controller is located in several places in the brain. Each location appears to have its 
own function. Unlike the heart, the basic respiratory rhythm is not generated within the lungs but rather 
in the brain and is transmitted to the respiratory muscles by the phrenic nerve. 

Effector organs are mainly the respiratory muscles, as described previously. Other effectors are muscles 
located in the airways and tissues for mucus secretion. Control of respiration appears to be based on two 
criteria: (1) removal of excess C0 2 and (2) minimization of energy expenditure. It is not the lack of 
oxygen that stimulates respiration but increased C0 2 partial pressure that acts as a powerful respiratory 
stimulus. Because of the buffering action of blood bicarbonate, blood pH usually falls as more C0 2 is 
produced in the working muscles. Lower blood pH also stimulates respiration. 

A number of respiratory adjustments are made to reduce energy expenditure during exercise: Respi- 
ration rate increases, the ratio of inhalation time to exhalation time decreases, respiratory flow waveshapes 
become more trapezoidal, and expiratory reserve volume decreases. Other adjustments to reduce energy 
expenditure have been theorized but not proven [Johnson, 1991]. 

7.7 The Pulmonary Function Laboratory 

The purpose of a pulmonary function laboratory is to obtain clinically useful data from patients with 
respiratory dysfunction. The pulmonary function tests (PFTs) within this laboratory fulfill a variety of 
functions. They permit (1) quantification of a patient's breathing deficiency, (2) diagnosis of different 
types of pulmonary diseases, (3) evaluation of a patient's response to therapy, and (4) preoperative 
screening to determine whether the presence of lung disease increases the risk of surgery. 
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FIGURE 7.5 Typical spirometry tracing obtained during testing; inspiratory capacity (IC), tidal volume (TV), 
forced vital capacity (FVC), forced expiratory volume (FEV), and forced expiratory flows. Dashed line represents a 
patient with obstructive lung disease; solid line represents a normal, healthy individual. 

Although PFTs can provide important information about a patient's condition, the limitations of these 
tests must be considered. First, they are nonspecific in that they cannot determine which portion of the 
lungs is diseased, only that the disease is present. Second, PFTs must be considered along with the medical 
history, physical examination, x-ray examination, and other diagnostic procedures to permit a complete 
evaluation. Finally, the major drawback to some PFTs is that they require a full patient cooperation and 
for this reason cannot be conducted on critically ill patients. Consider some of the most widely used 
PFTs: spirometry, body plethysmography, and diffusing capacity. 

Spirometry 

The simplest PFT is the spirometry maneuver. In this test, the patient inhales to total lung capacity (TLC) 
and exhales forcefully to residual volume. The patient exhales into a displacement bell chamber that sits 
on a water seal. As the bell rises, a pen coupled to the bell chamber inscribes a tracing on a rotating 
drum. The spirometer offers very little resistance to breathing; therefore, the shape of the spirometry 
curve (Fig. 7.5) is purely a function of the patient's lung compliance, chest compliance, and airway 
resistance. At high lung volumes, a rise in intrapleural pressure results in greater expiratory flows. 
However, at intermediate and low lung volumes, the expiratory flow is independent of effort after a 
certain intrapleural pressure is reached. 

Measurements made from the spirometry curve can determine the degree of a patient's ventilatory 
obstruction. Forced vital capacity (FVC), forced expiratory volumes (FEV), and forced expiratory flows 
(FEF) can be determined. The FEV indicates the volume that has been exhaled from TLC for a particular 
time interval. For example, FEV 5 is the volume exhaled during the first half-second of expiration, and 
FEVj is the volume exhaled during the first second of expiration; these are graphically represented in 
Fig. 7.5. Note that the more severe the ventilatory obstruction, the lower are the timed volumes (FEV 05 
and FEV 10 ). The FEF is a measure of the average flow (volume/time) over specified portions of the 
spirometry curve and is represented by the slope of a straight line drawn between volume levels. The 
average flow over the first quarter of the forced expiration is the FEF _ 25% , whereas the average flow over 
the middle 50% of the FVC is the FEF 25 _ 75% . These values are obtained directly from the spirometry 
curves. The less steep curves of obstructed patients would result in lower values of FEF _ 25% and FEF 25 _ 75% 
compared with normal values, which are predicted on the basis of the patient's sex, age, and height. 
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FIGURE 7.6 Flow-volume curve obtained from a spirometry maneuver. Solid line is a normal curve; dashed line 
represents a patient with obstructive lung disease. 

Equations for normal values are available from statistical analysis of data obtained from a normal 
population. Test results are then interpreted as a percentage of normal. 

Another way of presenting a spirometry curve is as a flow- volume curve. Figure 7.6 represents a typical 
flow-volume curve. The expiratory flow is plotted against the exhaled volume, indicating the maximum 
flow that may be reached at each degree of lung inflation. Since there is no time axis, a time must mark 
the FEV 5 and FEVj on the tracing. To obtain these flow-volume curves in the laboratory, the patient 
usually exhales through a pneumotach. The most widely used pneumotach measures a pressure drop 
across a flow-resistive element. The resistance to flow is constant over the measuring range of the device; 
therefore, the pressure drop is proportional to the flow through the tube. This signal, which is indicative 
of flow, is then integrated to determine the volume of gas that has passed through the tube. 

Another type of pneumotach is the heated-element type. In this device, a small heated mass responds 
to airflow by cooling. As the element cools, a greater current is necessary to maintain a constant tem- 
perature. This current is proportional to the airflow through the tube. Again, to determine the volume 
that has passed through the tube, the flow signal is integrated. 

The flow-volume loop in Fig. 7.7 is a dramatic representation displaying inspiratory and expiratory 
curves for both normal breathing and maximal breathing. The result is a graphic representation of the 
patients reserve capacity in relation to normal breathing. For example, the normal patient's tidal breathing 
loop is small compared with the patient's maximum breathing loop. During these times of stress, this 
tidal breathing loop can be increased to the boundaries of the outer ventilatory loop. This increase in 
ventilation provides the greater gas exchange needed during the stressful situation. Compare this condi- 
tion with that of the patient with obstructive lung disease. Not only is the tidal breathing loop larger 
than normal, but the maximal breathing loop is smaller than normal. The result is a decreased ventilatory 
reserve, limiting the individual's ability to move air in and out of the lungs. As the disease progresses, 
the outer loop becomes smaller, and the inner loop becomes larger. 

The primary use of spirometry is in detection of obstructive lung disease that results from increased 
resistance to flow through the airways. This can occur in several ways: 

1. Deterioration of the structure of the smaller airways that results in early airways closure. 

2. Decreased airway diameters caused by bronchospasm or the presence of secretions increases the 
airway's resistance to airflow. 

3. Partial blockage of a large airway by a tumor decreases airway diameter and causes turbulent flow. 
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FIGURE 7.7 Typical flow-volume loops, (a) Normal flow-volume loop, (b) Flow-volume loop of patient with 
obstructive lung disease. 

Spirometry has its limitations, however. It can measure only ventilated volumes. It cannot measure 
lung capacities that contain the residual volume. Measurements of TLC, FRC, and RV have diagnostic 
value in defining lung overdistension or restrictive pulmonary disease; the body plethysmograph can 
determine these absolute lung volumes. 

Body Plethysmography 

In a typical plethysmograph, the patient is put in an airtight enclosure and breathes through a pneumo- 
tach. The flow signal through the pneumotach is integrated and recorded as tidal breathing. At the end 
of a normal expiration (at FRC), an electronically operated shutter occludes the tube through which the 
patient is breathing. At this time the patient pants lightly against the occluded airway. Since there is no 
flow, pressure measured at the mouth must equal alveolar pressure. But movements of the chest that 
compress gas in the lung simultaneously rarify the air in the plethysmograph, and vice versa. The pressure 
change in the plethysmograph can be used to calculate the volume change in the plethysmograph, which 
is the same as the volume change in the chest. This leads directly to determination of FRC. 

At the same time, alveolar pressure can be correlated to plethysmographic pressure. Therefore, when 
the shutter is again opened and flow rate is measured, airway resistance can be obtained as the ratio of 
alveolar pressure (obtainable from plethysmographic pressure) to flow rate [Carr and Brown, 1993]. 
Airway resistance is usually measured during panting, at a nominal lung volume of FRC and flow rate 
of ±1 liter/s. 

Airway resistance during inspiration is increased in patients with asthma, bronchitis, and upper respi- 
ratory tract infections. Expiratory resistance is elevated in patients with emphysema, since the causes of 
increased expiratory airway resistance are decreased driving pressures and the airway collapse. Airway 
resistance also maybe used to determine the response of obstructed patients to bronchodilator medications. 

Diffusing Capacity 

So far the mechanical components of airflow through the lungs have been discussed. Another important 
parameter is the diffusing capacity of the lung, the rate at which oxygen or carbon dioxide travel from 
the alveoli to the blood (or vice versa for carbon dioxide) in the pulmonary capillaries. Diffusion of gas 
across a barrier is directly related to the surface area of the barrier and inversely related to the thickness. 
Also, diffusion is directly proportional to the solubility of the gas in the barrier material and inversely 
related to the molecular weight of the gas. 
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FIGURE 7.8 Typical system configuration for the measurement of rebreathing pulmonary diffusing capacity. 

Lung diffusing capacity (Dl) is usually determined for carbon monoxide but can be related to oxygen 
diffusion. The popular method of measuring carbon monoxide diffusion utilizes a rebreathing technique 
in which the patient rebreathes rapidly in and out of a bag for approximately 30 s. Figure 7.8 illustrates 
the test apparatus. The patient begins breathing from a bag containing a known volume of gas consisting 
of 0.3% to 0.5 carbon monoxide made with heavy oxygen, 0.3% to 0.5% acetylene, 5% helium, 21% 
oxygen, and a balance of nitrogen. As the patient rebreathes the gas mixture in the bag, a modified mass 
spectrometer continuously analyzes it during both inspiration and expiration. During this rebreathing 
procedure, the carbon monoxide disappears from the patient-bag system; the rate at which this occurs 
is a function of the lung diffusing capacity. 

The helium is inert and insoluble in lung tissue and blood and equilibrates quickly in unobstructed 
patients, indicating the dilution level of the test gas. Acetylene, on the other hand, is soluble in blood 
and is used to determine the blood flow through the pulmonary capillaries. Carbon monoxide is bound 
very tightly to hemoglobin and is used to obtain diffusing capacity at a constant pressure gradient across 
the alveolar-capillary membrane. 

Decreased lung diffusing capacity can occur from the thickening of the alveolar membrane or the 
capillary membrane as well as the presence of interstitial fluid from edema. All these abnormalities 
increase the barrier thickness and cause a decrease in diffusing capacity. In addition, a characteristic of 
specific lung diseases is impaired lung diffusing capacity. For example, fibrotic lung tissue exhibits a 
decreased permeability to gas transfer, whereas pulmonary emphysema results in the loss of diffusion 
surface area. 

Defining Terms 

Alveoli: Respiratory airway terminals where most gas exchange with the pulmonary circulation takes 

place. 
Diffusion: The process whereby a material moves from a region of higher concentration to a region 

of lower concentration. 
BTPS: Body temperature (37°C) and standard pressure (1 atm), saturated (6.28 kN/m 2 ). 
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Chemoreceptors: Neural receptors sensitive to chemicals such as gas partial pressures. 

Dead space: The portion of the respiratory system that does not take part in gas exchange with the 

blood. 
Expiration: The breathing process whereby air is expelled from the mouth and nose. Also called 

exhalation. 
Functional residual capacity: The lung volume at rest without breathing. 

Inspiration: The breathing process whereby air is taken into the mouth and noise. Also called inhalation. 
Mass spectrometer: A device that identifies relative concentrations of gases by means of mass-to-charge 

ratios of gas ions. 
Mechanoreceptors: Neural receptors sensitive to mechanical inputs such as stretch, pressure, irritants, etc. 
Partial pressure: The pressure that a gas would exert if it were the only constituent. 
Perfusion: Blood flow to the lungs. 

Plethysmography: Any measuring technique that depends on a volume change. 
Pleura: The membrane surrounding the lung. 
Pneumotach: A measuring device for airflow. 
Pulmonary circulation: Blood flow from the right cardiac ventricle that perfuses the lung and is in 

intimate contact with alveolar membranes for effective gas exchange. 
STPD: Standard temperature (0°C) and pressure (1 atm), dry (moisture removed). 
Ventilation: Airflow to the lungs. 
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Historical Perspectives 1 

Cardiac Pacing — 
Historical Highlights 



Cardiac Arrest During Anesthesia 

Closed Chest Pacing for Stokes-Adams Disease 

Dawn of the Modern Era 

Implanted Pacemaker 

Pacemaker Batteries 

Pacing Without a Thoracotomy 
Leslie A. Geddes Demand Pacemaker 

Purdue University Activity-Responsive Pacemakers 

The origin of modern cardiac pacing is accepted by many as the time when the first pacemaker was 
implanted without the need for opening the chest. This even was described by Parsonnet et al. [1962], 
who combined the implantable pacemaker created by Greatbatch and Chardack [1959] with the right- 
ventricular catheter electrode described by Furman et al. [1959]. However, the need for cardiac pacing 
was recognized much earlier, and many interesting investigations were reported. It is the objective of this 
short historical note to describe these studies and show how they provided the background knowledge 
for modern cardiac pacing. 

Cardiac Arrest During Anesthesia 

Gaseous anesthesia with ether entered medicine about 1850. Although a good anesthetic, induction was 
difficult because ether irritates the mucous membranes of the airway. In the search for other agents, 
chloroform was popularized in the late 1860s by Simpson in Scotland. However, when chloroform was 
used, occasional respiratory and cardiac arrest occurred. Although respirators were available, there was 
no way of restarting the heart. 

Although Steiner [1871] in Germany investigated electrical cardiac pacing on animals, the chloroform- 
arrested heart in human subjects was paced noninvasively by Green in the United Kingdom [1872]. 
Pointing out that sudden disappearance of the pulse was an occasional complication of chloroform 
anesthesia, he resuscitated five of seven cases of cardiac arrest by intermittently applying the output of 
a 300-V battery using hand-held electrodes applied to the base of the neck and to the lower left chest. It 
is interesting to observe that the electrode applied to the lower left chest stimulated the ventricles. The 
other electrode applied to the base of the neck delivered current to the phrenic nerve and twitched the 
diaphragm, causing a brisk inspiratory motion. Thus Green applied electrical cardiopulmonary resusci- 
tation and successfully recovered five of seven patients. He advocated that every operating room should 
have a galvanic battery "at the ready" at all times for such emergencies. 

Ziemssen [1822] in Germany had the opportunity to apply cardiac pacing to Catherina Serafin, a 
42 -year-old woman (Fig. HPl.ln) who had a large defect in the anterior left chest wall following resection 
of an enchondroma. The heart was only covered by skin, on which Ziemssen placed electrodes; he also 
recorded radial artery pulses. Using induction-coil shocks, he paced the heart using a stimulus frequency 
higher than that of the normal heart rate, i.e., he achieved capture. Figure HPl.lfo illustrates ventricular 
pacing (between the arrows) at 140, 120, and 180 beats per minute. 
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FIGURE HP1.1 Catherina Sarafin (a), a 42-year-old woman whose heart was paced by Ziemssen [1882] using 
induction-coil shocks, (b) The radial pulse, and between the arrows, pacing was carried out at the rates shown. 

After performing many studies on animals with hearts arrested by chloroform, McWilliam in the 
United Kingdom proposed the routine use of closed-chest pacing. Writing in the British Medical Journal 
[1889], he recommended: 

In order that such excitation [of the heart] should be as effective as possible, it is probably best to send 
the stimulating shocks through the whole heart, so that the auricles may come directly under their 
influence as well as the ventricles. In order to do this in man one electrode should be applied in front 
over the area of cardiac impulse, and the other over the region of the fourth dorsal vertebra behind, 
so that the induction shocks may traverse the organ. The electrodes should be of considerable extent 
(for example, large sponge electrodes), and they and the skin should be well moistened with salt- 
solution. The shocks employed should be strong, sufficient to excite powerful contraction in the 
voluntary muscles. 
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Such a method, it seems to me, is the only rational and effective one for stimulating by direct means 
the action of a heart which has been suddenly enfeebled or arrested in diastole by causes of a temporary 
and transient character. Of course, at the same time the expedient of artificial respiration must by no 
means be neglected, but on the contrary, most sedulously attended to. 

Considerable time passed before the next cardiac pacing studies were reported. During that time, 
clinicians dealt with cardiac arrest by thrusting a hypodermic needle through the chest wall into the left 
ventricle and injecting epinephrine. Intentional cardiac pacing was not resumed until 1932, when Hyman 
[1932] in the United States pointed out that cardiac pacing could be practical clinically. He stated that 
Gould in Australia had stimulated a baby's heart using a chest electrode and a needle electrode thrust 
into the heart. Hyman stated that cardiac arrest was a common clinical event and that "random and 
badly executed procedures are used within the last minute in the hope of resuscitating the stopped heart." 
He called attention to the dubious value of injecting epinephrine directly into the left ventricle via a 
needle thrust into the thorax. He stated quite correctly ( 1 ) that it was the needle prick that evoked a 
ventricular contraction rather than the epinephrine, which obviously requires cardiac pumping to dis- 
tribute it, and (2) the considerable danger of ventricular fibrillation from the needle stimulus and 
epinephrine, particularly in hypoxic hearts. Accordingly, he offered two solutions: (1) inject the epineph- 
rine into an atrium and (2) apply rhythmic electrical stimuli to the needle thrust into an atrium. With 
the atrial injection of epinephrine, the risk of ventricular fibrillation is eliminated. It is unimportant if 
atrial fibrillation occurs, because the ventricles would be driven at a rapid rate and the epinephrine would 
be pumped through the circulatory system. The atrial fibrillation could be treated later with drugs. At 
that time there was no way of arresting ventricular fibrillation. 

Hyman made a batteryless pacemaker for delivery of induction-shock stimuli (60 to 120 per minute) 
to the atria. His pacemaker (Fig. HP1.2) was powered by a hand-wound, spring-driven generator that 
provided 6 minutes of pacemaking without rewinding. He demonstrated the success of his portable, 
manually energized pacemaker by pacing guinea pig, rabbit, and (one) dog hearts that had been arrested 
by asphyxiation. He then used the pacemaker on a few human subjects. Unfortunately, there are no details 
of his clinical success, although he stated that the method of electrical cardiac pacemaking was gaining 
acceptance. However, this was a hope rather than a fact. 

Closed-Chest Pacing for Stokes-Adams Disease 

Interruption of the excitation from the atria to the ventricles (the main pumping chambers of the heart) 
is designated atrioventricular (A-V) block; Stokes-Adams disease cause A-V block. Although the ventricles 
have a low intrinsic rate, the A-V block does not develop immediately, and consciousness is usually lost 
at the onset of an attack. Even if the ventricles start to beat with their low intrinsic rate, cardiac output 
is seldom adequate. The application of rhythmic electrical stimuli to the ventricles provides adequate 
cardiac output. 

Closed-chest ventricular pacing was introduced in the United States in 1952 when Zoll and colleagues 
[1952] reported successful cardiac pacemaking in 10 patients with A-V block. They first used needle 
electrodes inserted subcutaneously. A thyratron (capacitor-discharge) stimulator was employed which 
delivered 2-ms pulses. Later, Zoll [1955] used electrodes that were 3 in in diameter covered with a 
conducting paste, and these were found to be effective in a variety of electrode locations. The criterion 
for optimal location was pacing with the lower voltage. In one configuration, the negative electrode was 
located over the apex beat area and the other was on the right chest. Another location that appeared to 
produce less muscular twitching employed the negative electrode in the V 4 -V 6 site for electrocardio- 
graphy; the positive electrode was in the V 2 -V 4 region. With either location it was recommended that 
the spacing between the electrodes exceed 3 in. The stimulus voltage was to 150 V, and the repetition 
rate was controllable from 30 to 180 per minute. 

Zoll [1956] also introduced closed-chest ventricular defibrillation. Some of the early commercially 
available defibrillators also contained a pacemaker because following defibrillation the ventricles did not 
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FIGURE HP1.2 Hyman's method of pacing the atria (a) using needle electrodes (b).(c) His hand-wound pacemaker, 
which could provide 6 minutes of pacing. (From Hyman AA. 1932. Arch Intern Med 50:283-305, by permission of 

AMA.) 

always start to beat. However, closed-chest pacing did not attract the attention of those who wished to 
pace conscious patients for prolonged periods. The muscle twitching and pain due to the strong, short- 
duration pacing stimuli made the method intolerable to many patients, albeit lifesaving. However, Zoll 
and his son, a physicist, found a way [1981] to reduce the pain considerably by using a long-duration 
pulse (40 ms) and high-impedance electrodes that employed a pad of a high-resistivity gel between the 
metal electrode and the skin. Use of this technique reduced the high current density under the perimeter 
of an electrode when placed directly on the skin. The muscle twitching was reduced by strategically 
locating the pacing electrodes on the chest. This technique is in use today. 

Dawn of the Modern Era 

In the early 1950s, cardiovascular surgery was becoming widespread because of the availability of the 
heart-lung machine and hypothermia. In some cases of heart surgery, an undesirable side effect A-V 
block. To solve this problem, Weirich et al. [1958] demonstrated that direct heart stimulation in closed- 
chest patients could be achieved with surgically implanted slender-wire electrodes. In a series of exper- 
iments on animals, single and double electrodes were implanted in the ventricular myocardium at the 
time of surgery, and the leads were brought out through the chest wall. Complete control of ventricular 
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FIGURE HP1.3 (a) Method used in 1959 by Furman and Schwedel, who were the first to apply transvenous pacing 
to a human subject, (b) Record of the ECG and blood pressure of one of the first two patients showing complete 
A-V block with an idioventricular rate of 28/min. (c) The ECG and blood pressure of the same subject paced at 
60/min {left) and 40/min {right). (Courtesy of S. Furman, M.D.) 

rate (above the normal rate) was achieved. On January 30, 1957, the method was applied to the first 
human subjects, and Weirich et al. [1958] reported successful ventricular pacing in 18 patients with 
complete A-V block. One patient was paced for 21 days before the pacemaking could be discontinued. 
In these patients, a single myocardial electrode was implanted at the time of surgery. A chest electrode 
was used to complete the circuit. When pacing was to be discontinued, the myocardial electrode was 
withdrawn by a gentle pull. Such electrodes were frequently installed during cardiac surgery. 

Aware of Zoll's [1952] precordial pacing and the use of myocardial wire electrodes by Weirich et al. 
[1958], but unaware of the catheter electrode used to stimulate the right atrium in dogs by Callaghan 
and Bigelow [1951], Furman and Schwedel [1959] used a monopolar catheter electrode for ventricular 
pacing in humans. The intracardiac electrode was mounted at the tip of a no. 6F catheter that was passed 
into the right ventricle via the median basilic vein. The indifferent electrode was a silver-plated copper 
wire implanted subcutaneously in the chest wall (Fig. HP1.3o). The electrocardiogram and arterial 
pressure of one of these paced patients is shown in Fig. HP 1 .3 b, c. The first patient was paced for 2 hours, 
and when pacing was stopped, asystole occurred and pacing had to be reinstated. Later, these authors 
found that it was possible to wean the patient from the pacemaker by reducing the pacing rate slowly 
from 60 beats per minute, being the rate adopted for pacing, following which the heart resumed its 
ventricular rhythm. 

Furman and Schwedel's second patient was paced for more than 13 weeks. After about 2 weeks, they 
used the extracorporeal Electrodyne pacemaker, which incorporated the standby technique; i.e., the 
pacemaker turned on if more than a 5-second period of asystole was identified in the ECG being recorded 
on the patient monitor. The patient was ambulatory, and the energy to drive the pacemaker/monitor was 
derived from a long cord connected to the domestic power outlet. Thus, when the patient walked around 
the ward, he pushed his pacemaker equipment with him on a small cart. 

Furman and Schwedel clearly demonstrated that ventricular pacing could be instituted on an emer- 
gency basis by insertion of the catheter electrode into the right ventricle via a superficial vein. The 
threshold for pacing was low, amounting to only about 1.5 V at 0.75 mA. No muscular twitching or 
sensation was perceived by the patients, a fact that took some time for many to accept. 



•) 2000 by CRC Press LLC 




+7.2 
O 



:10 



> J l- -P *£. o 

•^lOK 



HEART 
-O 



+7.2 



VO 



B 



FIGURE HP 1.4 Location for the first implanted pacemaker (a) and the circuit diagram (&) of the Greatbatch 
pacemaker. (Redrawn from Ann NY Acad Sci 1964, 111:1075.) 



Implanted Pacemaker 



It soon became obvious that portability was needed, and a leap in progress was made in 1959 when 
Greatbatch (a biomedical engineer) and Chardack (a surgeon) [1959] developed the first implantable 
pacemaker (Fig. HP1.4), which was first tested in dogs. Then Chardack and his team implanted it in 
human subjects. At that time, considerable surgery was needed for suturing the electrodes to the heart 
and implanting the pacemaker in the abdomen (Fig. HP1.4a). The implant (Fig. HP1.4fo) consisted of a 
transistor oscillator and an amplifier energized by 10 mercury-zinc cells. The 10 cells and electronic 
circuitry were potted in epoxy and covered by a double shell of Silastic. 

The Chardack-Greatbatch implantable pacemaker used the Hunter-Roth [1959, 1973] electrode 
(Fig. HP1.5a), which was about the size of a postage stamp. Implantation of the pacemaker and electrodes 





FIGURE HP1.5 Early pacing electrodes, (a) Hunter-Roth [1959]. (6) Chardack [1965]. (c) Catheter. 
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was a major surgical procedure. Breakage of electrode wires, short battery life, and the need for surgery 
for pacemaker and lead implantation were disadvantages in the early days of pacemakers. Because the 
heart beats over 35 million times each year, flexural strength of the lead wires was a major concern. A 
durable electrode wire made from the alloy that is used in the escapement springs of watches was described 
by Chardack [1961] and is shown in Fig. HP1.5fo. Like its predecessor, it was sutured to the epicardium, 
and a thoracotomy was required. 

Within a year, Lillehei et al. [1960] reported the use of right ventricular catheter electrodes with an 
external pacemaker to pace 66 patients. The pacemaker was built by Earl Bakken, a biomedical engineer 
who had founded Medtronic, Inc., in 1949, which soon became the pioneer pacemaker company that 
manufactured the Greatbatch-Chardack pacemaker. 

Pacemaker Batteries 

Many were concerned about the short life of pacemaker batteries, as well as the gas produced by them. 
To solve the first problem, Elmqvist and Senning [1959] described an implantable pacemaker in which 
the batteries could be recharged by delivering the energy via a coil placed on the chest surface. 

A few investigators described batteryless implanted pacemakers. The first of these was reported by 
Glenn et al. [1958], who developed a passive implanted receiver that was connected to ventricular 
electrodes. A body-surface transmitter broadcast the stimuli to the implant. A biomedical engineer 
(Hickman), working for the author, developed a similar pacer in 1961 and proved its effectiveness in 
dogs. Schuder, another biomedical engineer, and Stoeckle [1962] described an ingenious small passive 
implant that had protruding electrodes that allowed attachment of the tiny device directly to the ven- 
tricular surface. Bursts of radiofrequency energy were delivered through the chest wall into the implant 
to pace the ventricles. 

Ingenious as the foregoing techniques were, the need for a chest-surface device to either recharge the 
batteries in the implanted pacemaker or to excite a passive implant proved to be unacceptable to clinicians 
and patients. The battery problem was solved in a different way. 

Meanwhile, introduction of two different techniques conspired to extend pacemaker battery life. The 
first was employment of a short-duration pulse for the stimulus; the second was the use of small-area 
electrodes for stimulating. Both advances were introduced by Furman and colleagues [1966, 1971]. 

Pacing Without a Thoracotomy 

Elimination of thoracotomy resulted from combining the implantable pacemaker with a catheter elec- 
trode that could be passed into the right ventricle via a superficial chest vein (Fig. HP1.6). Use of this 
technique was reported by Parsonnet et al. [1962]. 

Although monopolar and bipolar catheter electrodes (see Fig. HP1.5) had been around long before 
pacing, it was the new alloys that made them durable. When these new catheter electrodes were used, it 
was noted that the threshold for stimulation increased markedly if the electrode moved the slightest 
distance from the myocardium. The race soon started for electrode fixation techniques. Barbs, hooks, 
tynes, loops, and screens at the catheter tip all afforded a means for anchoring and providing a stable 
pacing threshold for a prolonged period. 

Demand Pacemaker 

In the early days of transchest and direct-ventricular pacing, there was a growing awareness that some 
of the unexpected sudden deaths in paced patients may have been due to ventricular fibrillation resulting 
from competitive pacing, a condition in which a pacemaker stimulus falls in the vulnerable period of a 
normally conducted beat or ventricular ectopic beat. In such a situation, fibrillation can result. The 
vulnerable period spans the T wave of the ECG, and if a single stimulus of adequate strength is delivered 
during this interval, ventricular fibrillation occurs immediately, and cardiac output falls to zero. 
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FIGURE HP1.6 Combination of the implantable pacemaker and catheter electrode by Parsonnet et al. [1962] 
eliminated the need for a thoracotomy. 

In the normal heart, a suprathreshold stimulus is required to precipitate ventricular fibrillation by 
vulnerable-period stimulation. However, in the hypoxic heart, a stimulus only slightly above threshold 
can induce ventricular fibrillation if delivered in the vulnerable period. Three circumstances conspired 
to set the stage for this disastrous event in pacing: (1) setting the intensity of the pacing pulse suprath- 
reshold so that it will be effective when the threshold rises due to natural tissue encapsulation around 
the electrode, (2) an ectopic beat occurring during pacing, and (3) normally conducted beats breaking 
through the A-V node. To eliminate the danger of such competitive pacing, Lemberg and his team [ 1965] , 
which included Berkovits, a biomedical engineer, introduced the demand concept first using a transchest 
pacemaker. In such a device, the ECG is monitored to determine if there is any ventricular electrical 
activity present; if so, the pacemaker is inhibited for a preset time. If A-V conduction returns, or if 
ventricular ectopic beats arise, the pacemaker is inhibited. If no such beats occur, the pacemaker operates 
in the fixed-rate mode. 

The demand pacemaker made its point very well, for Parsonnet and his team, which included Myers, 
a biomedical engineer, soon reported clinical trials with the first implanted R-wave inhibited pacemaker 
[1966]. By this time, the pacing electrodes began to be used for R-wave sensing. This type of pacemaker 
is one of the most popular in use today because its solves the competitive pacing problem. Because it is 
only activated when pacing stimuli are needed, battery life is prolonged. 

Activity-Responsive Pacemakers 

In the early days of pacing, it was recognized that with a fixed-rate pacemaker, a subject would have a 
limited ability to tolerate exercise. Cardiac output does increase slightly when a subject with a fixed-rate 
pacemaker exercises. The increase in cardiac output is due to the exercise-induced increase in venous 
return to the ventricles, thereby increasing the preload, which increases stroke volume via Starling's law. 
In addition, the increased sympathetic nervous drive to the ventricles contributes to increasing stroke 
volume. However, in a normal exercising subject, the same factors operate, but there is also an increase 
in heart rate; therefore, research was devoted to identifying strategies that would increase the rate of a 
pacemaker automatically with exercise. 

In some situations of total A-V block, the atria are still beating; therefore, it was apparent that the 
P-wave of the ECG could be used to develop a stimulus that could be delivered to ventricular electrodes. 
Even before pacing was popular in humans, Folkman and Watkins [1957] used a transistor amplifier to 
enlarge the atrial P waves and deliver them to the ventricles of dogs to evoke ventricular contractions. 
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TBLE HP 1.1 Activity Response Pacemakers 



Sensed Quantity 

Atrial excitation (P-wave) 

Respiration rate 

Respiration rate 

Respiration rate 

Respiration rate 

pH 

pH 

Venous 2 saturation 

Arterial O, saturation 

Ventricular cycle 

Core temperature 

Venous blood temperature 

Right-ventricular blood temperature 

Body motion 



Sensing Method 
Apply delay and stimulate the right ventricle 

Impedance pneumograph 

Diaphragm motion (by piezoelectric detector) 

Negative intrathoracic pressure transducer 

Impedance RV to case 

RA venous blood with In-AgCl electrode 

In pacemaker pocket 

Catheter oximeter 

Electrodes on Hering's nerve 

Stimulus T-wave duration 

Temperature-sensitive capacitor in pacer 

Thermistor RA, RV, PA 

Thermistor 

Accelerometer 



Principal Investigator and Year 

Karlof (1975), Kapenberger 

(1981), Krause (1982) 
Krasner (1966, 1971) 
Ionescu (1980) 
Funkeetal. (1975) 
Rossi etal. (1983) 
CammiUi (1979, 1980, 1981) 
Contini (1981) 
Wirtzfield (1980) 
Bozal-Gonzalez (1980) 
Rickards (1981) 
Fischell (1975) 
Griffin (1981) 
Jolgren (1983) 
Anderson et al. (1984) 



Source: From Geddes LA. 1984. Cardiovascular Devices. New York, Wiley. 

The unit was carried in a backpack and was proven in dogs with surgically induced A- V block. Stephenson 
et al. [1959] created a P-wave-triggered stimulator with a built-in delay to simulate the A-V conduction 
time, and the unit was used successfully in dogs with A-V block. A similar pacemaker was reported by 
Kahn et al. [1960] and Nathan et al. [1963], whose team included Keller, a biomedical engineer. 

In A-V block, use of the P-wave of the ECG does indeed provide automatic control of the pacing rate 
based on bodily activity. However, in diseased hearts, there may be no P-wave, or the atria may flutter 
or be in fibrillation. These cases must be handled differently. For example, if the P-wave disappears, the 
pacemaker can revert to the fixed-rate mode. With flutter or fibrillation, it is necessary to prevent the 
pacemaker from following the electrical activity of the atria. Algorithms have been developed to limit 
the rate of the ventricular pacemaker in these situations. 

The lack of a dependable P-wave to trigger a ventricular pacemaker caused many to identify physiologic 
events that signal an increase in bodily activity; Table 1 presents a summary of most of them. Many of 
the methods shown in Table 1 have been embodied in commercially available pacemakers. Which method 
will turn out to be the best can only be identified by future clinical experience. 

References 

Anderson K, Humen D, Klein GJ, et al. 1983. A rate variable pacemaker which automatically adjusts for 

physical activity. PACE 6:A-12. U.S. patent 4,428,378, Jan 31, 1984. 
Bozal-Gonzalez JL. 1980. Cardiac pacemaker. U.S. patent 4,201,219, May 4, 1980. 
Callaghan JC, Bigelow WG. 1951. An electrical artificial pacemaker for standstill of the heart. Ann Surg 

134:8. 
CammiUi L. 1979. PH-triggered pacemaker. PACE 2,A6. 
CammiUi L, Green GD, Ricci D, Risani R. 1980. PH-triggered pacemaker: Design and clinical results. In 

Proceedings of the 6th World Symposium of Cardiac Pacing, chap 19-8. 
CammiUi L, Alcidi L, Bisi G. 1981. Results, problems and perspectives in the auto-regulating pacemaker. 

2nd European Symposium on Cardiac Pacing, May-June 1981, PACE 4:A-36. 
Chardack WM, Gage AA, Greatbatch W. 1960. A transistorized, self-contained, implantable pacemaker 

for long-term correction of complete heart block. Surgery 48:643. 
Chardack WM, Gage AA, Greatbatch W. 1961. Corrections of complete heart block by a self-contained 

and subcutaneously implanted pacemaker. J Thorac Cardiovasc Surg 42:814. 
Contini C, Papeschi G, Ricci D, et al. 1981. pH changes in chronic pacemaker pocket: A new means of 

increasing rate during exercise: preliminary results. PACE 2:366. 



) 2000 by CRC Press LLC 



Elmqvist R, Senning A. 1960. Implantable pacemaker for the heart. In CN Smyth (ed), Medical Elec- 
tronics: Proceedings of the Second International Conference on Medical Electronics, Paris, June 
1959. London, Iliffe & Sons. 

Fishell RR. 1975. Fixed-rate rechargeable cardiac pacemaker. U.S. patent 3,867,950. 

Folkman MJ, Watkins E. 1957. An electrical conduction system for the management of experimental, 
complete heart block. Clin Cong Am Coll Surg 8:331. 

Funke HD. 1975. Einherzschrittmacher mit beinstungsabhangiger frequenz-regulation. Biomed Tech 
20:225. 

Furman S, Schwedel JB. 1959. An intracardiac pacemaker for Stokes-Adams seizures. N Engl J Med 26:943. 

Furman S, Denize A, Escher D, Schwedel JB. 1966. Energy considerations for cardiac stimulations a 
function of pulse duration. J Surg Res 8:441. 

Furman S, Parker B, Escher D. 1971. Decreasing electrode size and increasing efficiency of cardiac 
stimulation. J Surg Res 11:105. 

Glenn WWL, Mauro A, Longo E, et al. 1959. Remote stimulation of the heart by radiofrequency trans- 
mission. N Engl J Med 261(18):948. 

Greatbatch W, Chardack WM. 1959. Transistorized implantable pacemaker. Proc NEREM 1:8. 

Green T 1872. On death from chloroform; its prevention by galvanism. Br Med J 1:551. 

Griffin JC, Tutzy KR, Claude JP, et al. 1981. Nonelectrographic indices of pacemaker rate. Proc. AEMB 
34:202. 

Hickman DM, Geddes LA, Hoff HE, et al. 1961. A portable miniature transistorized radio-frequency 
coupled cardiac pacemaker. IRE PGBME Trans BME-8(4):258. 

Hunter SW, Boldue L, Long V, Quattelbaum FW. 1973. A new myocardial pacemaker lead. Chest 63:430. 

Hunter SW, Roth NA, Bernardez D, Noble JD. 1959. A bipolar myocardial electrode for complete heart 
block. Lancet 79:506 

Hyman AA. 1932. Resuscitation of the stopped heart by intracardial therapy. Arch Intern Med 50:283. 

Ionsecu VL. 1980. An "on demand" pacemaker responsive to respiratory rate. PACE 3:375. 

Jolgren D, Fearnot NE, Geddes LA. 1983. A rate-responsive pacemaker controlled by right ventricular 
blood temperature, Proceedings of the 32nd Annual Science Session. Am Coll Cardiol J Ace 
1(1):720. 

Kahn M, Senderoff E, Shapiro J, et al. 1960. Bridging of interrupted A-V conduction in experimental 
heart block by electronic means. Am Heart J 59:548. 

Kappenberger L, Turina M, Babotal J. 1981. Differentiated pacemaking therapy. Schweiz Med Wochenschr 
111:45. 

Karlof I. 1975. Hemodynamic effects of atrial-triggered versus fixed rate pacing at rest and during exercise 
in complete heart block. Acta Med Scand 197:195. 

Krasner JL, Voukydis PC, Nardella PC. 1966. A physiologically controlled cardiac pacemaker. JAAMI 20. 

Krasner JL, Nardella P. 1971. Physiologically controlled cardiac pacer. U.S. Patent 3,593,718. 

Krause I, Amman K, Conradson T, Ryden L. 1982. A comparison of the acute and long-term hemody- 
namic effects of ventricular inhibited and atrial synchronous-ventricular inhibited pacing. Circu- 
lation 65:846. 

Lemberg L, Castellanos A, Berkovits BV. 1965. Pacemaking on demand in A-V block. JAMA 191:12. 

Lillehei CW, Gott VL, Hodges PC, et al. 1960. Transistor pacemaker for treatment of complete atrioven- 
tricular dissociation. JAMA 172:2006. 

McWilliam JA. 1889. Electrical stimulation of the heart in man. Br Med J 1:348. 

Nathan DA, Center S, Wu CY, Keller W. 1963. An implantable synchronous pacemaker for the long-term 
correction of complete heart block. Circulation 27:682. 

Parsonnet V, Zucker IR, Asa M. 1962. Preliminary investigation of a permanent implantable pacemaker 
utilizing an intracardiac dipolar catheter. Clin Res 10:391 (see also Am J Cardiol 10:261). 

Parsonnet V, Zucker IR, Myers SH. 1966. Clinical use of an implanted standby pacemaker. JAMA 198:784. 

Rickards AF. 1981. Physiologically adaptive cardiac pacemaker. U.S. patent 4,228,803. 



3 2000 by CRC Press LLC 



Rickards AF, Norman J, Thalen A, et al. 1981. The use of stimulus-T interval to determine cardiac pacing 

rate. PACE 4:A68. 
Rickards AF, Norman J. 1981. Relation between Q-T interval and heart rate. Br Heart J 45:56. 
Rossi P, Plicchi G, Canducci GC, et al. 1983. The implantable respiratory dependent pacemaker. PACE 

3:A-13. 
Schuder JC, Stoeckle H. 1982. A micromodule pacemaker receiver for direct attachment to the ventricle. 

Trans Am Soc Artif Intern Organs 8:344. 
Steiner F. 1871. Ueber die Electropunctur des Herzens als Wiederbelebungsmittel in der Chloroformsyn- 

cope. Arch Klin Chir 12:748. 
Stephenson SE, Edwards WH, Jolly, et al. 1959. Physiologic P-wave cardiac stimulator. J Thorac Cardiovasc 

Surg 38:804. 
Weirich WL, Paneth M, Gott VL, Lillehei CW. 1958. Control of complete heart block by use of an artificial 

pacemaker and a myocardial electrode. Circ Res 6:410. 
Wirtzfield A. 1980. Cardiac pacemaker. U.S. patent 4,202,339. 
Ziemssen H. 1882. Studien ubr die Bewegungsvorgange am mehschlichen Herzen. Deutch Arch Klin Med 

30:270. 
Zoll PM. 1952. Resuscitation of heart in ventricular standstill by external electric stimulation. N Engl J 

Med 247:768. 
Zoll PM, Linenthal AJ, Gibson W, et al. 1956. Termination of ventricular fibrillation in man by externally 

applied electric countershock. N Engl J Med 254:727. 
Zoll PM, Zoll RM, Belgard AH. 1981. External non-invasive electric stimulation of the heart. Crit Care 

Med 9:393. 



© 2000 by CRC Press LLC 



Henriquez, C. S. "Bioelectric Phenomena." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



Magnetometer coil r = 10 mm 
1 A/s , 

40 20 Oil 20 40 




100 



1150 



h [mm] 



J LM [pA/m ] 




Bioelectric 
Phenomena 



Craig S. Henhquez 

Duke University 

8 Basic Electrophysiology Roger C. Barr 

Membranes • Bioelectric Current Loops • Membrane Polarization • Action Potentials • 
Initiation of Action Potentials • Propagation • Extracellular Waveforms • Stimulation • 
Biomagnetism 

9 Volume Conductor Theory Robert Plonsey 

Basic Relations in the Idealized Homogeneous Volume Conductor • Monopole and Dipole 
Fields in the Uniform Volume of Infinite Extent • Volume Conductor Properties of Passive 
Tissue • Effects of Volume Conductor Inhomogeneities: Secondary Sources and Images 

10 The Electrical Conductivity of Tissues Bradley J. Roth 
Cell Suspensions • Fiber Suspensions • Syncytia 

1 1 Membrane Models Anthony Varghese 

The Action Potential • Patch Clamp Data • General Formulations of Membrane 
Currents • Nerve Cells • Skeletal Muscle Cells • Endocrine Cells • Cardiac Cells • 
Epithelial Cells • Smooth Muscle • Plant Cells • Simplified Models 

12 Numerical Methods for Bioelectric Field Problems Christopher R. Johnson 
Problem Formulation • Model Construction and Mesh Generation • Numerical 
Methods • Adaptive Methods 

13 Principles of Electrocardiography Edward J. Berbari 
Physiology • Instrumentation • Applications • Conclusions 

14 Principles of Electromyography Kaj-Age Henneberg 

The Structure and Function of Muscle • The Origin of Electromyograms • 
Electromyographic Recordings 

15 Principles of Electroencephalography Joseph D. Bronzino 

Historical Perspective • EEG Recording Techniques • Use of Amplitude Histographs to 
Quantify the EEG • Frequency Analysis of the EEG • Nonlinear Analysis of the EEG 

16 Biomagnetism Jaakko Malmivuo 

Theoretical Background • Sensitivity Distribution of Dipolar Electric and Magnetic 
Leads • Magnetocardiography (MCG) • Magnetoencephalography (MEG) 

17 Electric Stimulation of Excitable Tissue Dominique M. Durand 

Electric Stimulation of Neural Tissue • Physiology of Excitation • Electric Fields in Volume 
Conductors • Electric Field Interactions with Excitable Tissue • Electrode-Tissue 
Interface • Conclusion 



© 2000 by CRC Press LLC 



THE STUDY OF EXCITABLE CELLS and bioelectric processes in the body has had a significant 
impact on biomedical engineering. Clinical methods such as electroneurography, electromyogra- 
phy, electrocardiography, and electroencephalography all involve recording and interpreting bio- 
electric signals that arise from the propagation of action potentials, i.e., transient changes in potential 
across the cell membrane. The nature of the recorded signals depends on a number of factors, including 
the number of contributing cells, the geometry and electrical properties of the media through which the 
currents flow, and the distance of the recording electrode from the excitable tissue. All these factors must 
be accounted for when designing recording equipment and making proper interpretations. 

The principles of bioelectricity also have resulted in the development and manufacture of a number 
of clinical devices capable of delivering currents to the body for the treatment or therapy of a variety of 
physical anomalies. Pacemakers and stimulators help to maintain or approximate proper electrical func- 
tion of the heart, muscle, and nerve. Defibrillators are used to terminate potentially fatal arrhythmias. 
These devices have become increasingly sophisticated, with microprocessors capable of processing the 
bioelectric signals to help determine the appropriate action. 

The study and use of bioelectric phenomena in biomedical technologies have advanced, in part, 
through careful experimentation and the development of increasingly accurate biophysical models and 
theoretical concepts. Continued progress in the techniques for diagnosis and treatment of derangements 
of electrical processes in the body will require that biomedical engineers are knowledgeable of the 
fundamentals of bioelectricity. This section focuses on these fundamentals and some of the clinical 
applications. 

Chapter 8 presents a quantitative and qualitative overview of basic electrophysiology and membrane 
biophysics. Chapter 9 discusses how currents are supported in the body and how these currents set up 
potentials that can be measured with external electrodes. Chapter 10 reviews the electrical properties of 
various tissues. The ionic currents that give rise to action potentials of various tissues are currently being 
studied using modern patch-clamp techniques born out of the Nobel Prize-winning work of Hodgkin 
and Huxley. Descriptions of the techniques and some of the mathematical models that result from the 
experiments are discussed in Chapter 11. The theoretical advancements in bioelectric phenomena have 
led to a number of biophysical models that are used to interpret experimental and clinical observations. 
Some of the models are of such complexity that they can only be managed with the aid of a computer. 
The common numerical solution methods for bioelectric field problems are discussed in Chapter 12. 
The most common clinical applications of bioelectric phenomena, electromyography, electroencephel- 
ography, and electrocardiology, are discussed in Chapters 13 through 15. The flow of currents in the 
body also can give rise to magnetic fields that can be detected outside the body. The nature of biomagnetic 
fields and their clinical applications are discussed in Chapter 16. Finally, a number of devices have been 
developed to deliver current to the body. The fundamentals of electrical stimulation are presented in 
Chapter 17. 
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This chapter serves as an overview of some widely accepted concepts of electrophysiology. Subsequent 
chapters in this section provide more specialized information for many of the particular topics mentioned 
here. 



8.1 Membranes 



Bioelectricity has its origin in the voltage differences present between the inside and outside of cells. 
These potentials arise from the specialized properties of the cell membrane, which separates the intrac- 
ellular from the extracellular volume. Much of the membrane surface is made of a phospholipid bilayer, 
an electrically inert material [Byrne and Schultz, 1988]. Because the membrane is thin (about 75 A), it 
has a high capacitance, about 1 uF/cm 2 . (Membrane capacitance is less in nerve, except at nodes, because 
most of the membrane is myelinated, which makes it much thicker.) 

Electrically active membrane also includes a number of integral proteins of different kinds. Integral 
proteins are compact but complex structures extending across the membrane. Each integral protein is 
composed of a large number of amino acids, often in a single long polypeptide chain, which folds into 
multiple domains. Each domain may span the membrane several times. The multiple crossings may build 
the functional structure, e.g., a channel, through which electrically charged ions may flow. The structure 
of integral proteins is loosely analogous to that of a length of thread passing back and forth across a 
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fabric to form a buttonhole. As a buttonhole allows movement of a button from one side of the fabric 
to the other, an integral protein may allow passage of ions from the exterior of a cell to the interior, or 
vice versa. In contrast to a buttonhole, an integral protein has active properties, e.g., the ability to open 
or close. Excellent drawings of channel structure and function are given by Watson et al. [1992]. 

Cell membrane possesses a number of active characteristics of marked importance to its bioelectric 
behavior, including these: (1) Some integral proteins function as pumps. These pumps use energy to 
transport ions across membrane, working against a concentration gradient, a voltage gradient, or both. 
The most important pump moves sodium ions out of the intracellular volume and potassium ions in. 

(2) Other integral proteins function as channels, i.e., openings through the membrane that open and 
close over time. These channels can function selectively so that, for a particular kind of channel, only 
sodium ions may pass through. Another kind of channel may allow only potassium ions or calcium ions. 

(3) The activity of the membrane's integral proteins is modulated by signals specific to its particular 
function. For example, some channels open or close in response to photons or to odorants; thus they 
function as sensors for light or smell. Pumps respond to the concentrations of the ions they move. Rapid 
electrical impulse transmission in nerve and muscle is made possible by changes that respond to the 
transmembrane potential itself, forming a feedback mechanism. These active mechanisms provide ion- 
selective means of current crossing the membrane, both against the concentration or voltage gradients 
(pumps) or in response to them (channels). While the pumps build up the concentration differences 
(and thereby the potential energy) that allow events to occur, channels utilize this energy actively to create 
the fast changes in voltage and small intense current loops that constitute nerve signal transmission, 
initiate muscle contraction, and participate in other essential bioelectric phenomena. 



8.2 Bioelectric Current Loops 



Bioelectricity normally flows in current loops, as illustrated in Fig. 8.1. ' A loop includes four segments: 
an outward traversal of the cell membrane (current I m at position z= z,), an extracellular segment (current 
I e , along a path outside a cell or cells), an inward traversal of the cell membrane (current I m at z = z 2 ), 
and an intracellular segment (current I { , along a segment inside the cell membrane). Each of the segments 
has aspects that give it a unique importance to the current loop. Intense current loops often are contained 
within a millimeter or less, although loops of weaker intensity may extend throughout the whole body 
volume. The energy that supports the current flow arises from one or both of the segments of the loop 
that cross the membrane. Current loops involve potential differences of about 100 mV between extremes. 
The charge carriers are mobile ions. Especially important are ions of sodium, potassium, calcium, and 
chloride because some membranes allow one or more of these ion species to move across when others 
cannot. 

Membrane Currents 

Total membrane current I m may be given mathematically as 

I =1 +1. (8.1) 

m c ion v ' 

where each of these currents often is given per unit area, e.g., milliamperes per square centimeter. 
The first component, I c , corresponds to the charging or discharging of the membrane capacitance. Thus 

I =C dV Idt (8.2) 



'For clarity, this chapter uses equations and examples describing flow in one dimension. Usually analogous 
equations based on the same principles have been developed for two and three dimensions. 
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FIGURE 8.1 Bioelectric current loop, concept drawing. Current enters the membrane at z = z v flows intracellularly 
and emerges at z 2 . Flow in the extracellular volume maybe along the membrane (solid) or throughout the surrounding 
volume conductor (dashed). 

where C m is the membrane capacitance per unit area, V m is the transmembrane voltage, and t is time. 

The second component, 7 ion> corresponds to the current through the membrane carried by ions such 
as those of sodium, potassium, or calcium. It normally is found by summing each of the individual 
components, e.g., 



I = L + 7 M + I r 

ion K Na Ca 



(8.3) 



In turn, each of the component currents is often written in a form such as 



h(v m -E*) 



(8.4) 



where E K is the equilibrium potential for K (see below), a function of K concentrations, and g K is the 
conductivity of the membrane to K ions. The conductivity g K is not constant but rather varies markedly 
as a function of transmembrane voltage and time. More positive transmembrane voltages normally are 
associated with higher conductivities, at least transiently. 

When a more detailed knowledge of channel structure is included, as in the DiFrancesco-Noble model 
for cardiac membrane, each of the individual ionic components, e.g., I K is found in turn as a sum of the 
currents through each kind of channel or pump through which that ion moves. 

The total transmembrane current I m contains the summed contributions from I c and J ion , as given by 
Eq. (8.1), but the effects of these components are quite different. At many membrane sites these com- 
ponents differ both in magnitude and in direction. Thinking in terms of the current loops that exist 
during membrane excitation, one often finds that current at the site of peak outward current is dominated 
by I c . At the site of peak inward current, the total current I m usually dominated by J ion (see Propagation, 
below). 

Conduction Along an Intracellular Path 

Flow of electricity along a nerve or within a muscle occurs passively through the conducting medium 
inside the cell. The nature of the intracellular current path is closely related to the function of the current 
within that kind of cell. Nerve cells may have lengths of a meter or more, so intracellular currents can 
flow unimpeded and quickly throughout this length. Other cells, such as the muscle cells of the heart, 
are much smaller (about 100 um in length). In cardiac cells, current flows intracellularly from cell to cell 
through specialized passages, called junctions, that occur in regions where the cell membranes fuse 
together. Thus intracellular regions may be connected over lengths of centimeters or more, even though 
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each cell is much smaller. Mathematically, the intracellular current in a one-dimensional cable may be 
given as [Plonsey and Barr, 1988, p 109] 

L = -^ (8.5) 

r. dz 

where (j), is the intracellular potential, z is the axial coordinate, r l is the intracellular resistance per unit 
length, and 7, is the longitudinal axial current per unit cross-sectional area. 

Conduction Along an Extracellular Path 

Current flow outside of cells is not constrained to a particular path but rather flows throughout the 
whole surrounding conducting volume (the volume conductor), which may be the whole body. Current 
intensity is highest near the places where current enters or leaves cells through the membrane. (These 
sites are called sinks and sources, respectively, in view of their relation to the extracellular volume.) Current 
flow from bioelectric sources through the volume conductor generates small potential differences, usually 
with a magnitude of a millivolt or less, between different sites within the volume or on the body surface. 
(Current flow from artificial sources, such as stimulators or defibrillators, may generate much higher 
voltages.) The naturally occurring potential differences between points in the volume conductor, e.g., 
between the arms or across the head, are those commonly measured in the study of the heart (electro- 
cardiography), the nerves and muscles (electromyography), and the brain (electroencephalography). 

Mathematically, the extracellular current is easy to specify only in the special case where the extracel- 
lular current is confined to a small space surrounding a single fiber or nerve, as shown diagramatically 
by the solid I c line in Fig. 8.1. In this special case, the extracellular current is equal in magnitude (but 
opposite in direction) to the intracellular current, that is, I e = -I,. 

In most living systems, however, extracellular currents extend throughout a more extensive volume 
conductor, as suggested by the dashed lines for I e in Fig. 8.1. Their direction and magnitude must be 
found as a solution to Poisson's equation in the extracellular medium, with the sources and sinks around 
the active medium used as boundary conditions. In this context, Poisson's equation becomes [Plonsey 
and Barr, 1988, p 22] 

V 2 f=-i = -^ (8.6) 

where I v is the volume density of the sources (sinks being negative sources), a c is the conductivity of the 
extracellular medium, and /is the current density. The fact that V-J takes on nonzero values appears at 
first to be physically unrealistic because it seems to violate the principle of conservation of charge. In 
fact, it only reflects the practice of solving for potentials and currents in the extracellular volume separately 
from those in the intracellular volume. Thus the nonzero divergence represents the movement of currents 
from the intracellular volume across the membrane and into the extracellular volume, where they appear 
from sources or disappear into sinks. 

Duality 

Note the duality Eq. (8.6) and the form of Poisson's equation commonly used in problems in electrostatics. 
There, V 2 (|) \ -p/e, where p is the charge density, and e is the permittivity. Recognition of this duality is 
useful not only in locating solution methods for bioelectric problems, since the mathematics is the same, 
but also in avoiding confusion between electrostatics and problems of extracellular current flow through 
a volume conductor. The problems obviously are physically quite different (e.g., permittivity e is not 
conductivity a e ). 
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8.3 Membrane Polarization 

Ionic pumps within membranes operate, over time, to produce markedly different concentrations of ions 
in the intracellular and extracellular volumes around cells. In cardiac cells, the sodium-potassium pump 
produces concentrations 2 (millimoles per liter) for Purkinje cells as shown in Table 8.1. 

A transmembrane voltage, or polarization, develops across the membrane because of the differences 
in concentrations across the membrane. In the steady state, the transmembrane potential for a two-ion 
system is, according to Goldman's equation [Plonsey and Barr, 1988, p. 51], 



t 



^M. + ^jNaj. 
P K [K], + P Na [Na], y 



(8.7) 



where R is the gas content, T is the temperature, and F is Faraday's constant (RT/F a 25 mV). P K and 
P Na are the permeabilities to potassium and sodium ions, respectively. [K] and [Na] are the concentrations 
of these ions, and the subscripts i and e indicate intracellular and extracellular, respectively. V m is the 
transmembrane voltage at steady state. 

Polarized State 

At rest, P Na is much smaller than P K . Using Eq. (8.6), approximating P Na = P K /100, and using the 
concentrations in Table 8.1, 



V =25mVxln 



4 + (o.01xl4o) 
(o.Ol x 8J 



140 + 



= -82mV (8.8) 



Cardiac Purkinje cells show a polarization at rest of about this amount. The negative sign indicates that 
the interior of the cell is negative with respect to the exterior. 

One way to think about why this voltage exists at the steady state is as follows: In the steady state, 
diffusion tends to move K + from inside because of the much higher concentration of K + intracellularly. 
This flow is offset, however, by a flow from outside to inside due to the potential gradient, with a higher 
potential outside, since K + is a positively charged ion. With the concentrations of Table 8.1, the effects 
of the diffusion gradient and potential gradient are almost equal and opposite when the potential 
intracellularly is about -82 mV with respect to the extracellular potential. The result is a steady state. 
Rather than a true equilibrium, this state has a small but nonzero flow of sodium ions, offset electrically 
by a sustained flow of potassium ions. Thus the membrane potential would run down over time were 
there no Na + -K + pumps maintaining the transmembrane concentrations. (In the absence of such pumps, 
concentration differences diminish over a period of hours.) 



TABLE 8.1 Ionic Concentrations (mmol/liter) 



Na- 



Intracellular 8 140 

Extracellular 140 4 



2 Values taken from the program Heart, by Noble et al. [1994], as restated in Cabo and Barr [1992]. 
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Depolarized State 

Suppose the permeabilities to sodium and potassium of Eq. (8.7) are changed from the values used for 
Eq. (8.8), which approximated membrane at rest (polarized). Instead, to approximate excited membrane, 
suppose that P Na rises in comparison with P K so that P K 



V =25mVxln 



4 + ^2 xl4o) 
140 + (2x8J 



= 15mV 



(8.9) 



Here the positive sign for V m indicates that the interior of the membrane is positive with respect to the 
exterior. Because V m is here more nearly zero in magnitude, this excited state is called depolarized. 

8.4 Action Potentials 

Membranes create action potentials by actively changing their permeabilities to ions such as sodium and 
potassium. A comparison of the results of Eqs. (8.8) and (8.9) shows a swing of about 100 mV between 
the resting and excited states (polarized to depolarized). The change of permeability from resting to 
excited values and then back again allows the membrane to generate an action potential. 

Cardiac Action Potentials 

Action potentials that characterize actual tissue have the roughly rectangular shape that is suggested by 
the preceding calculation, but only as an approximation. An action potential for the cardiac conduction 
system simulated with the DiFrancesco-Noble model is shown in Fig. 8.2. Initially, V m has a baseline 
voltage (B) near -80 mV. During excitation (£), the membrane permeabilities change, and V m rises 
abruptly. After the peak overshoot at about +20 mV, the potential maintains a plateau voltage (P) near 
-20 mV for nearly 300 ms and then recovers (P) rapidly to a baseline phase. The overall action potential 
duration is about 400 ms. 
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FIGURE 8.2 Cardiac action potential. Computed with the DiFrancesco-Noble membrane model for the cardiac 
conduction system (membrane patch). B, baseline; E, excitation; R, recovery (repolarization). 
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FIGURE 8.3 Nerve action potential. Computed with the Hodgkin-Huxley model for nerve membrane (membrane 
patch). B, baseline; E, excitation; P, plateau; R, recovery; A, afterpotential. 

Nerve Action Potentials 

For comparison, an action potential simulated with the Hodgkin-Huxley model for nerve is shown in 
Fig. 8.3. Corresponding baseline, excitation, plateau, and recovery phases may be identified, and the 
voltage change between resting and excited states is again about 100 mV. The plateau phase is so short 
as to be virtually nonexistent, and the overall action potential duration is much shorter than that of 
Fig. 8.2, only about 4 ms. 

Wave Shape and Function 

All excitable membranes are characterized by their ability to change membrane permeabilities and to do 
so selectively. Both the cardiac action potential of Fig. 8.2 and the nerve action potential of Fig. 8.3 
demonstrated rapid depolarization and a subsequent slower repolarization associated with membrane 
permeability changes and changes of about 100 mV in V m amplitude. Conversely, the differences also 
have marked importance. Cardiac action potentials have a long duration that limits the shortest interval 
between heartbeats. This interval must be several hundred milliseconds long to allow time for movement 
of blood. The extended plateau of the cardiac action potential is associated with the movement of calcium 
ions, which is associated with muscle contraction. In contrast, nerve action potentials perform a signaling 
function. This function is supported by a shorter action potential duration, which allows more variability 
in the number of action potentials propagated per second, an important signaling variable. Conversely, 
nerves had no need for contraction and no need for a plateau period for calcium ion movement. 

8.5 Initiation of Action Potentials 

Action potentials of tissue with a specialized sensing function (e.g., for sight or smell) have membrane 
channels linked to receptors for the initiating agent [Watson et al., 1992, p 320]. In contrast, a large 
number of membrane channels are controlled by changes in transmembrane voltage. In particular, when 
the potential rises from its baseline value to a threshold level, the membrane initiates the sequence of 
permeability changes that create an action potential. The change of transmembrane voltage initiating 
the action potential normally comes from currents originating in adjacent tissue (see Propagation, below). 
Such currents also may come from artificial sources, such as stimulators. 
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FIGURE 8.4 Variation in stimulus magnitude. In different trials, three stimuli (SI, S2, S3) of decreasing magnitude 
were applied. Action potentials followed the first two, but not the third. 

Examples 

Responses of a DiFrancesco-Noble membrane to transmembrane stimuli of three magnitudes are shown 
in Fig. 8.4. All three stimuli were 1 ms in duration. Stimulus SI, the largest, caused the transmembrane 
potential to rise from its baseline value to about -65 mV and initiated an action potential (apl), which 
followed a few milliseconds after the stimulus. Stimulus S2 had only 80% of the intensity of SI, and the 
stimulus itself caused a transmembrane voltage change that was proportionally smaller. As did SI, 
stimulus S2 initiated an action potential (ap2), although there was a delay of about 200 ms before 
excitation occurred, and the action potential itself was somewhat diminished in amplitude. Stimulus S3 
had 80% of the magnitude of S2 and caused a transmembrane voltage change that again was propor- 
tionally smaller. S3, however, produced no subsequent action potential. 

Threshold 

An important general result demonstrated in Fig. 8.4 is that action potential initiation has a threshold 
behavior. That is, stimuli producing transmembrane voltages above a threshold value initiate action 
potentials, while those below do not. This response is called "all or none," although (as Fig. 8.4 shows) 
there is a variation in response for near-threshold stimuli. The specific threshold value varies depending 
on factors such as the stimulus duration, the amount of membrane affected, and the intracellular potential 
gradient, which affects the rate of stimulus decay. 

8.6 Propagation 

Initiation of excitation at one end of a excitable fiber leads to an action potential there. Subsequently, 
action potentials may be observed at sites progressively further away. 

Numerical Model 

Use of a numerical model is extremely helpful in investigating the events of propagation, because a 
consistent picture can be obtained of temporal events at different sites and of spatial events at different 
times. For these reasons, such a model is used here. Consider the cylindrical fiber shown in Fig. 8.5, 
having radius a = 75 urn and a length of 10 mm. The fiber is surrounded by an extracellular volume 
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FIGURE 8.5 Geometry for numerical model. The model represented a fiber 10 mm in length. The cross section 
was a series of concentric bands. Nodes were spaced along the axial (z) dimension every 100 um. These nodes are 
portrayed by the columns of circles drawn at four axial positions, although actually there were 101 columns. The 
innermost ( lower) row of nodes, row 0, were sites where intracellular potentials were determined; four surrounding 
rows (rows 1 to 4), at increasing distances, were sites for extracellular potentials. The outermost boundary {dashed 
line) was assigned <t> e = 0. Separation between all rows was the same as the fiber radius, a = 75 um. Transmembrane 
stimuli were applied across the membrane at z = (nodes enclosed). 

extending 3a beyond the membrane. To establish a symmetric reference potential, the outside edge of 
the extracellular region was connected via resistances to a junction where (j) e = 0. Intracellular specific 
resistance R t was 250 fi-cm, and R e was one-third of _R ; . Solutions for intracellular, extracellular, and 
transmembrane potentials were obtained through a process of numerical simulation. The structure of 
this example follows that of Spach et al. [1973], where cardiac Purkinje fibers were studied. Most of the 
examples below use the DiFrancesco-Noble model for Purkinje membrane [Noble et al., 1994]. 

Sequence of Action Potentials 

Action potentials simulated for the cylindrical fiber are shown in Fig. 8.6. Waveforms are shown following 
a transmembrane stimulus at z = 0, where the first action potential originates, and also for sites 2, 5, and 
10 mm away. All the action potentials have similar wave shapes, differing mainly in their timing. Although 
these are cardiac waveforms simulated with the DiFrancesco-Nobel model, note the markedly different 
impression of the wave shape seen in this figure as compared with Fig. 8.2. Note also that the waveform 
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FIGURE 8.6 Action potentials for a 10-mm fiber. Transmembrane potentials V m (t) arising at z positions of 0, 2, 5, 
and 10 mm. The waveform at z = shows a stimulus artefact S. V m ( f) at z = 10 has a larger peak-to-peak rise because 
excitation terminates there, at the end of the fiber. 
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at z = has progressed through excitation, but not into the plateau phase, by the time of excitation at 
z = 1 cm. Secondary changes of action potential wave shape as a function of z, other than the stimulus 
artefact, can be discerned mainly in the baseline as it rises into the upstroke, a region called the foot. 
Differences in the duration of the foot reflect the time that other tissue has been excited, prior to the 
start of excitation at a particular site; i.e., the foot is longer at sites further down the fiber. 

Biophysical Basis 

Although propagation in nerves and muscles often is compared to wave propagation of sound or light, 
to waves in the ocean, or to electrical waves in cables, bioelectric propagation does not have an analogous 
physical basis. In the former categories, energy used at the source creates a disturbance that propagates 
to the observer through an intervening passive medium. Propagation velocity depends on the properties 
of that medium. In bioelectric propagation, what is meant is that the source itself is moving down an 
excitable medium, the membrane, in a process more similar to setting off a chain of firecrackers. The 
velocities associated with bioelectric propagation relate to the magnitudes of transmembrane current 
passing through one site on the membrane and flowing to another and how fast the active responses of 
the membrane occur at the downstream sites. Although short intervals are required for passive propa- 
gation of the consequences of the active events through the surrounding tissue, these intervals are 
minuscule in comparison with the times required for the active changes. Consequently, most bioelectric 
events are analyzed as a sequence of "quasi-static" stages. 

Velocity of Propagation 

It is difficult to predict the velocity of propagation for a particular tissue structure prior to its measure- 
ment. Reported experimental velocities of propagation range from less than 0.01 m/s to more than 10 m/s, 
depending on the particular membrane and structure involved and its environment. Within a particular 
nerve or muscle, however, velocities vary little under normal conditions. Changes in the velocity of 
propagation as a result of changes in fiber diameter a or specific resistivity .R, can be approximated by 
the equation 

9= \** (8.10) 

Here K is a constant determined by the membrane properties, a is the fiber radius, and i?, is the specific 
resistance of the intracellular medium [Plonsey and Barr, 1988, p. 117], and the equation assumes 
relatively low extracellular resistance. 

An important relationship shown by Eq. (8.10) is the proportionality of the velocity to the square root 
of the fiber's radius. Living systems take advantage of this relationship by having larger-diameter fibers 
where higher velocity is essential. For example, the giant axon of the squid carries the signal by which 
the squid responds to predators. 

Transmembrane Current 

The mathematical analysis of propagation is greatly aided by using the principle of conservation of charge 
to develop an equation for membrane current I m that depends on the local distribution of intracellular 
potential rather than on currents through the membrane, as in Eq. (8.1). Conservation of current requires 
that current through the membrane at an axial position z be the difference between current coming to 
that site and current leaving that site (on one side, e.g., intracellularly). Because intracellular currents 
follow the first derivative of (]); (Eq. 8.5), then for one-dimensional flow [Plonsey and Barr, 1988, p. 112], 

I = ^-*±. (8.11) 

znar Sz 
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FIGURE 8.7 Propagation of a current loop, concept drawing. Panels a and b draw a hypothetical current loop as 
it might exist at time t v At this time, outward current is causing the potential to rise at z = z v as indicated by the 
upward arrow there, in panel b. The result is a shift in the distribution of intracellular potential <|) ; . Thus the current 
loop moves to the position shown in panel c. 

Here r t is the axial resistance per unit length, a is the fiber radius, and §- t is the intracellular potential. 
This result is used below, as well as in many mathematical analyses of propagation, because it provides 
a means (sometimes the sole means) of linking the membrane current at one site to electrical events at 
neighboring sites. 

Movement of the Local Current Loop 

Propagation occurs when a local current loop initiates an action potential in an adjacent region. The 
concept of how this occurs is illustrated in Fig. 8.7. At time t v a local current loop exists, as shown in 
Fig. 8.7a. This local loop produces outward current at site z = z 1 . That there must be outward current 
at z l is shown by Fig. 8.7fc, which plots <(>(z) for time t v The gradient of (J), between z 2 and z l is shown in 
Fig. 8.7fo. This gradient produces intracellular current into z l from the left but not current out on the 
right (because the gradient is zero to the right). Current must therefore leave through the membrane at 
Zj. This positive membrane current is indicated by the upward arrow at z x in Fig. 8.7k The positive 
membrane current at z x discharges the membrane capacitance there, so the potential at z l rises, producing 
a new voltage distribution. The change in the intracellular voltage distribution produces a shift in the 
site of the current loop; i.e., the loop propages to the right, as shown in Fig. 8.7c. 

More mathematically, at z x at time t 1 the value of d 1 § i ldz 1 is markedly positive, because <(>, changes 
slope there from negative to zero. Therefore, by Eq. (8.11), I m is large and positive at z = z v Additionally, 
I ioB is relatively small at z l because the intracellular voltage is at its resting value, so (by Eq. 8.1) I c must 
be large. By Eq. (8.2), if I c is large and positive, then dVJdt is large and positive. Thus the transmembrane 
potential at z l rises rapidly, and soon the current loop shown in Fig. 8.7c is achieved. 

For comparison with the schematic drawings in Fig. 8.7, the distribution of voltages and currents in 
the 1-cm simulated fiber is shown in Fig. 8.8. The figure shows the transmembrane potential V m and the 
total membrane current I m . Both are shown as a function of axial distance z along the strand at time t = 
3 ms after the stimulus. (This spatial plot came from the same simulation as the set of waveforms versus 
time shown in Fig. 8.6) Because V m = (|>, - § e , and because in this example the extracellular potentials § e 
are relatively small in magnitude (see below), V m in Fig. 8.8 is a close approximation to § t in Fig. 8.7. 
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FIGURE 8.8 Potential and current distribution in a 10-mm fiber at t = 3 ms following a transmembrane stimulus 
applied at z = 0. The vertical scale is in millivolts for the V„ plot and in microamperes per square centimeter for the 
current plots. Results are from a numerical simulation using the geometry of Fig. 8.6. Transmembrane potential V m at 
this moment is shown by the dashed line. Note the inward and outward peaks of the transmembrane current l m near 
z= 4 and 4.8 mm. Plots for l c and I ion , the components of I m , also are shown. They demonstrate that on the leading 
edge of the waveform (higher values of z), current I c dominates. Compare the results with the concept drawing of Fig. 8.7. 

In Fig. 8.8, the total membrane current /,„ is triphasic, with the outward current on the leading edge 
of the propagating waveform. Propagation is to the right, consistent with the direction of downward 
slope of V m and consistent with the direction of movement of Fig. 8.7. The inward and outward currents 
of the distributed pattern in Fig. 8.8 are consistent with the concept of Fig. 8.7, where the distribution 
is lumped. (One might place z x ~ 4.8 mm in Fig. 8.8 and Zj ~ 4.0 mm.) Figure 8.8 shows that the local 
current loop is more accurately described as a local current distribution, extending over a distance of a 
few millimeters (about 0.8 mm between the maximum and minimum of the I m curve). 

The composition of I„,(z) and be evaluated by comparing l m with its components, the ionic current 
4, n and the membrane capacitative current I c . At the outward current peak (the maximum of I m ), the 
magnitude of the capacitive component I c is greater than the magnitude of the ionic component I ion . At 
the inward peak (the minimum of I m ), |j ion | > \I C \, although both magnitudes are significant. That is, the 
dominant components of I m in Fig. 8.8 are consistent with the drawings in Fig. 8.7. In contrast to Fig. 8.7, 
however, the peak outward current occurs near where V m ~ -40 mV rather than near the baseline value 
(~ -85 mV). Consistent with Fig. 8.7, Fig. 8.8 shows that /,„ is dominated by I c at the peak outward I m 
and increasingly more so as one moves forward on the leading edge. 

8.7 Extracellular Waveforms 

An understanding of the origin of extracellular waveforms is important because most electrophysiologic 
recordings, including those of most clinical studies, are observations of extracellular events rather than 
underlying transmembrane or intracellular events. The extracellular-intracellular relationship is not a 
simple one, however, because it involves distance from the membrane sources, and because a given 
extracellular site usually is affected by currents from many different excitable fibers. 

Although extracellular waveforms (t> t ,(t) reflect the same current loops as the intracellular or trans- 
membrane waveforms, their magnitude, wave shape, and timing usually are entirely different. (The 
greatest similarity occurs with a highly restricted extracellular volume.) With an extensive extracellular 
volume, the normal situation, waveforms such as those in Fig. 8.9 are seen. The three waveforms come 
from "electrodes" just outside the fiber at z = 0, z = 5, and z = 10 mm, or in other words at the site of 
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FIGURE 8.9 Extracellular waveforms from a 10-mm fiber. Extracellular potentials fy e (f) are plotted for positions 
just outside the membrane at z = 0, 5, and 10 mm. The transmembrane stimulus at z = affected the waveform 
there, as seen by stimulus artefact S. For reference, V m (t) at z = 5 mm also is drawn, reduced to 2% of its true amplitude. 

the stimulus, the middle of the strand, and the end where excitation terminates. (For reference, the 
intracellular potential from the middle of the fiber also is shown, reduced to 2% of its true magnitude.) 
All three extracellular waveforms shown are small in comparison with transmembrane waveforms, less 
than 1 mV peak to peak. Their wave shapes differ: Near the stimulus (0), the wave shape has a negative 
deflection from the stimulus and continues to be negative thereafter. In the middle, the wave shape is 
biphasic. A positive deflection occurs as excitation approaches, and a negative deflection occurs once it 
goes past (compare the timing of the extracellular and intracellular waveforms, which are from the same 
axial position). At the terminating end (10), the extracellular wave shape is predominantly positive, 
showing also a long initial rise as excitation approaches. 

It is interesting that these extracellular waveforms show such large changes in wave shape as a function 
of position, whereas their underlying transmembrane waveforms, at the same z values, show no such 
changes, as can be seen by comparison with the transmembrane waveforms shown in Fig. 8.6. The relative 
timing of the extracellular as compared with the transmembrane waveforms also is noteworthy: Close to 
the membrane, most of the major deflections of the extracellular waveforms are confined to the short time 
period when the transmembrane potential is undergoing excitation. During the hundreds of milliseconds 
thereafter, the intracellular potential is large but changing slowly (e.g., Fig. 8.2), so the transmembrane 
currents are small. These smaller currents are distributed and produce proportionally small deflections. As 
the recording site moves away from the membrane, however, the distributed recovery currents do not decline 
as rapidly with distance, so the potentials they produce become larger in relation to potentials produced by 
the currents of excitation. On the body surface, the voltages produced during recovery may have similar 
magnitudes and play a large role in waveform interpretation (e.g., the T wave of the electrocardiogram). 

Biphasic waveforms are observed at most extracellular sites, because most sites are neither the origin 
or termination of excitation. Further, most real extracellular waveforms are more complicated and last 
longer than those shown in Fig. 8.9, because they show the composite effect from many different fibers 
rather than a single one in isolation. 



Spatial Relation to Intracellular 

Extracellular waveforms are created by the flow of current through the volume conductor surrounding 
the active membrane. An expression for the extracellular potential <() e , outside a cylindrical fiber with a 
large conducting volume was given by Spach et al. [1973] as 
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where z is the axial coordinate of the fiber, t is the time when the spatial distribution was obtained, (^ 
is the intracellular potential, a is the fiber radius, and d is the perpendicular distance from the fiber 
membrane to the electrode. Conductivities a, and a c apply to the intracellular and extracellular conduct- 
ing media, respectively. The integration limits of ±<x> recognize that the integration should be over the 
entire fiber. Note that t„ must be chosen in a succession of values to generate a temporal waveform at 
position z and that the right-hand side of Eq. (8.12) has to be evaluated separately for each t choice. 

A number of examples of extracellular waveforms, as compared with their underlying intracellular 
ones, are given by Spach et al. [1973], together with an evaluation of the accuracy of Eq. (8.12). A central 
feature of the equation is that the extracellular waveform is a weighted summation of the second spatial 
derivative of the intracellular potential. This relationship holds, to a good approximation, for all the 
extracellular waveforms shown in Fig. 8.9. The different weighting for different waveforms (variation in 
z" - z for different choices of z) produces the varying shapes of the extracellular waveforms at different sites. 

Temporal Relation to Intracellular 

If an action potential is assumed to be propagating with constant velocity 8 on a long fiber, then the 
spatial derivative in the preceding equation can be converted to a temporal derivative using z = 8f so that 
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where t is time, z is the axial coordinate of the position where the intracellular and extracellular 
"electrodes" are located, and is the velocity with which the action potential is moving along the z axis. 
The integration limits of ±oo recognize that only in long fibers does the velocity remain approximately 
constant over a significant period of time. 

Some of the limitations of Eq. (8.13) are apparent when considering the waveforms of Fig. 8.9. The 
equation would be suitable only for the extracellular waveform in the middle of the fiber (z = 5 mm). 
At either end, the equation would be unsuitable, because velocity is not constant through initiation or 
through termination of propagation. 

8.8 Stimulation 

The waveforms of Fig. 8.6 show that action potentials can be initiated and controlled by (artificially 
injected) transmembrane currents. Such transmembrane currents can be injected through use of pene- 
trating microelectrodes. Some experimental studies use this method of initiating action potentials because 
the site of current injection is precisely localized. Most stimulation for clinical purposes and most 
experimental studies use extracellular electrodes, however, because positioning the electrodes outside the 
active tissue can be done more simply, usually without tissue damage. 

Suppose two stimulating electrodes are placed just outside the 1-cm fiber, as identified in Fig. 8.10. 
Note that the anode is placed at z = 10 mm (encircled +), and the cathode is placed at z = (encircled -). 
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FIGURE 8.10 Extracellular stimulus currents, concept drawing. A stimulus is applied between and extracellular 
anode, at z = 10, and an extracellular cathode, at z = 0. Most current will flow in the extracellular volume (thick 
line). Some current will enter the intracellular volume (thin line). Current entering or leaving the membrane causes 
that portion of the membrane to become hyperpolarized (high z region) or depolarized (low z region). 

Both electrodes are extracellular. When current is injected through these electrodes, most of the current 
will flow along the extracellular path (thick flow line). Some current also will cross the resistive barrier 
formed by the membrane and flow intracellularly (thin flow line) . Where the intracellular current emerges 
from the membrane, it will tend to depolarize the membrane. (Note the effects of the stimulus on potential 
differences: The region depolarized is the region where the intracellular potential becomes higher as 
compared with the extracellular potential directly across the membrane. In contrast, the region depolar- 
ized has a lower potential near the cathode as compared with the potential at the other end of the fiber, 
near the anode, in both the intracellular and extracellular volumes.) If the magnitude of the depolarization 
is large enough in relation to the size of the depolarized region, then the stimulus will initiate action 
potentials and propagation. 

Transmembrane potential distributions following the extracellular stimulus are shown in Fig. 8.11. At 
time t = 0, the transmembrane potential (as a function of z) is flat, indicating that the membrane is 
uniformly in the resting state. Then a stimulus of 0.05 mA amplitude and 0.5 ms duration is applied to 
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FIGURE 8.11 Transmembrane potentials following an extracellular stimulus. The simulation used the geometry 
of Fig. 8.5, except that the stimulus was applied to the row 2 nodes at z= and z= 10 (similar to Fig. 8.10). Successive 
plots show the transmembrane potentials as a function of z at time (just before the stimulus), 0.5 ms (end of the 
stimulus), 3.0 ms (propagation underway), and 5.0 ms. The potential distribution at 0.5 ms is consistent in its major 
features with the concept drawing of Fig. 8.10. 
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the stimulus electrodes, with the source-sink positioning that of Fig. 8.10. Figure 8.11 shows the distri- 
bution of transmembrane potentials at t = 0.5 ms, the time of the end of the stimulus. At f = 0.5 ms, 
the membrane is depolarized near the cathode (near z = 0) and is hyperpolarized near the anode (near 
z = 10). Interestingly, there is a small region of hyperpolarization beside the depolarized end (near z = 
2 mm) and depolarization near the hyperpolarized end (near z = 8 mm), a predictable effect [Plonsey 
and Barr, 1995]. The transmembrane potentials along the central part of the fiber are much less affected 
by the stimulus than are the ends, even though the same magnitude of stimulus current flows along the 
entire length. This stimulus was large enough to produce action potentials, as can be seen by inspecting 
the transmembrane potentials for 3 ms. Figure 8.11 shows that the excitation wave that began near z = 
has, by 3 ms, progressed about a third of the way down the fiber. Propagation to a further point is seen 
at 5.0 ms. 

Many devices for medical stimulation have now come into routine use, e.g., cardiac pacemakers. Large 
artificial currents also are used to terminate action potentials in some cases, e.g., cardiac defibrillators. 

8.9 Biomagnetism 

Panofsky and Phillips [1962, p. 160] show the connection between the electric field E to the magnetic 
field B and current j tme by giving two (of the four) Maxwell's equations as 

VxH = -i, +— (8.14) 

' ,rue dt 

and 

VxE = -— (8.15) 

dt 

together with the constituitive relations H = B/(j, and D = K£ E, where u, k, and e are constants for a 
given medium. 

Both currents j lme and changing electric fields (E and thereby D) are present naturally in and around 
excitable membranes, so Eq. (8.14) shows that corresponding magnetic fields are to be expected. These 
magnetic fields are small, however, even in comparison with the earth's magnetic field. They can none- 
theless be precisely measured if careful attention is paid to the design of equipment and the surroundings, 
as demonstrated by Wikswo and Egeraat [1991]. At present, magnetic fields are not routinely measured 
for clinical purposes, but in principle, they offer some important advantages, including the absence of 
required electrode contact. 

In a complementary way, rapidly changing magnetic fields induce electric fields (and thereby currents) 
in living tissue, as indicated by Eq. (8.15). Stimulation of nerves or muscles is thereby possible, either by 
design (see, e.g., Roth and Basser [1990]) or as a side effect of magnetic resonance imaging (MRI). 
Extensive consideration of biomagnetism has been presented by Malmivuo and Plonsey [1994]. 

Defining Terms 

Action potential: The cycle of changes of transmembrane potential, negative to positive to negative 

again, that characterizes excitable tissue. This cycle also is described as resting to excited and 

returning to rest. 
Excitation: The change of the membrane from resting to excited states, characterized by the movement 

of the transmembrane potential across a threshold. 
Extracellular potentials: Potentials generated between two sites outside the membrane (external to 

active cells), e.g., between two sites on the skin, due to current flow in the volume conductor. 
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Intracellular (extracellular): Inside (outside) cells. 

Membrane polarization: The sustained (and approximately constant) transmembrane potential of a 

cell at rest that arises due to different intracellular and extracellular ionic concentrations. 
Propagation: Excitation of one region of tissue as a result of an action potential in an adjacent region. 
Stimulation: Change in the transmembrane potential at a site due to an influence from another site. 

The term is used most frequently when current is applied through wires from an artificial current 

source. 
Transmembrane potential: The potential inside a cell membrane minus the potential just outside the 

membrane. 
Volume conductor: The electrically conductive interior region of the body surrounding electrically 

active membrane. 
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This chapter considers the properties of the volume conductor as it pertains to the evaluation of electric 
and magnetic fields arising therein. The sources of the aforementioned fields are described by /', a 
function of position and time, which has the dimensions of current per unit area or dipole moment per 
unit volume. Such sources may arise from active endogenous electrophysiologic processes such as prop- 
agating action potentials, generator potentials, synaptic potentials, etc. Sources also may be established 
exogenously, as exemplified by electric or magnetic field stimulation. Details on how one may quantita- 
tively evaluate a source function from an electrophysiologic process are found in other chapters. For our 
purposes here, we assume that such a source function /' is known and, furthermore, that it has well- 
behaved mathematical properties. Given such a source, we focus attention here on a description of the 
volume conductor as it affects the electric and magnetic fields that are established in it. As a loose 
definition, we consider the volume conductor to be the contiguous passive conducting medium that 
surrounds the region occupied by the source /'. (This may include a portion of the excitable tissue itself 
that is sufficiently far from /' to be described passively.) 

9.1 Basic Relations in the Idealized Homogeneous 
Volume Conductor 

Excitable tissue, when activated, will be found to generate currents both within itself and also in all 
surrounding conducting media. The latter passive region is characterized as a volume conductor. The 
adjective volume emphasizes that current flow is three-dimensional, in contrast to the confined one- 
dimensional flow within insulated wires. The volume conductor is usually assumed to be a monodomain 
(whose meaning will be amplified later), isotropic, resistive, and (frequently) homogeneous. These are 
simply assumptions, as will be discussed subsequently. The permeability of biologic tissues is important 
when examining magnetic fields and is usually assumed to be that of free space. The permittivity is a 
more complicated property, but outside cell membranes (which have a high lipid content) it is also usually 
considered to be that of free space. 
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A general, mathematical description of a current source is specified by a function ]'(x, y, z, f), namely, 
a vector field of current density in say milliamperes per square centimeter that varies both in space and 
time. A study of sources of physiologic origin shows that their temporal behavior lies in a low-frequency 
range. For example, currents generated by the heart have a power density spectrum that lies mainly under 
1 kHz (in fact, clinical ECG instruments have upper frequency limits of 100 Hz), while most other 
electrophysiologic sources of interest (i.e., those underlying the EEG, EMG, EOG, etc.) are of even lower 
frequency. Examination of electromagnetic fields in regions with typical physiologic conductivities, with 
dimensions of under 1 m and frequencies less than 1 kHz, shows that quasi-static conditions apply. That 
is, at a given instant in time, source-field relationships correspond to those found under static conditions. 1 
Thus, in effect, we are examining direct current (dc) flow in physiologic volume conductors, and these 
can be maintained only by the presence of a supply of energy (a "battery"). In fact, we may expect that 
wherever a physiologic current source /' arises, we also can identify a (normally nonelectrical) energy 
source that generates this current. In electrophysiologic processes, the immediate repository of energy is 
the potential energy associated with the varying chemical compositions encountered (extracellular ionic 
concentrations that differ greatly from intracellular concentrations), but the long-term energy source is 
the adenosine triphosphate (ATP) that drives various pumps that create and maintain the aforementioned 
concentration gradients. 

Based on the aforementioned assumptions, we consider a uniformly conducting medium of conduc- 
tivity rj and of infinite extent within which a current source /' lies. This, in turn, establishes an electric 
field E and, based on Ohm's law, a conduction current density a E. The total current density / is the sum 
of the aforementioned currents, namely, 

J = ai+J i (9.1) 

Now, by virtue of the quasi-static conditions, the electric field may be derived from a scalar potential O 
[Plonsey and Heppner, 1967] so that 

E = -V<$> (9.2) 

Since quasi-steady-state conditions apply, / must be solenoidal, and consequently, substituting Eq. (9.2) 
into Eq. (9.1) and then setting the divergence of Eq. (9.1) to zero show that cD must satisfy Poisson's 
equation, namely, 

V 2 <& = (l/a)v-/' (9.3) 

An integral solution to Eq. (9.3) is [Plonsey and Collin, 1961] 

j_rv^ 

no Jv r 
where rinEq. (9.4) is the distance from a field point P(x\y',z') to an element of source at dv(x,y, z), that is, 



3> (*',/, z') = —\^-^dv (9.4) 

FV ; 4710 Jv r 



(x-x'J +(y-y'\ + (z-z'J (9.5) 



'Note that while, in effect, we consider relationships arising when d/dt = 0, all fields are actually assumed to vary 
in time synchronously with /'. Furthermore for the special case of magnetic field stimulation, the source of the 
primary electric field, 8 Aldt, where A is the magnetic vector potential, must be retained. 
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Equation (9.4) may be transformed to an alternate form by employing the vector identity 

V j(l/r)rWl/r)v-/' + v(l/r)-r (9.6) 

Based on Eq. (9.6), we may substitute for the integrand in Eq. (9.4) the sum V-[(l/r) /'] - V(l/r)- /', 
giving the following: 

*p(^/' z ') = - 4 ^{j V -[( 1 /'')/']^-Jv(l/r);^v} (9.7) 

The first term on the right-hand side may be transformed using the divergence theorem as follows: 

Jv-[(l/r)r]rfv=J(l/r)r-rfS = (9.8) 

The volume integral in Eq. (9.4) and (9.8) is defined simply to include all sources. Consequently, in 
Eq. (9.8),the surface S, which bounds V, necessarily lies away from /'. Since /' thus is equal to zero on S, 
the expression in Eq. (9.8) must likewise equal zero. The result is that Eq. (9.4) also may be written as 

* (x',/,z') = 1 —[^—!-dv= — h i -v(l/r)dv (9.9) 

M ; 4710 Jv r 47C0Jv y ' 

We will derive the mathematical expressions for monopole and dipole fields in the next section, but based 
on those results, we can give a physical interpretation of the source terms in each of the integrals on the 
right-hand side of Eq. (9.9). In the first, we note that -V- /' is a volume source density, akin to charge 
density in electrostatics. In the second integral of Eq. (9.9), /' behaves with the dimensions of dipole 
moment per unit volume. This confirms an assertion, above, that /' has a dual interpretation as a current 
density, as originally defined in Eq. (9.1), or a volume dipole density, as can be inferred from Eq. (9.9); 
in either case, its dimension are mA/cm 2 / = /mA- cm/cm 3 . 

9.2 Monopole and Dipole Fields in the Uniform 
Volume of Infinite Extent 

The monopole and dipole constitute the basic source elements in electrophysiology. We examine the 
fields produced by each in this section. 

If one imagines an infinitely thin wire insulated over its extent except at its tip to be introducing a 
current into a uniform volume conductor of infinite extent, then we illustrate an idealized point source. 
Assuming the total applied current to be J and located at the coordinate origin, then by symmetry the 
current density at a radius r must be given by the total current 7 divided by the area of the spherical 
surface, or 

1 = ^\A (9.10) 

and a r is a unit vector in the radial direction. This current source can be described by the nomenclature 
of the previous section as 

V./' = -/ 5(r) (9.11) 

where 8 denotes a volume delta function. 
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One can apply Ohm's law to Eq. (9.10) and obtain an expression for the electric field, and if Eq. (9.2) 
is also applied, we get 

£ = ^$ = -^5 ( 9 _ 12 ) 

4na r 2 

where a is the conductivity of the volume conductor. Since the right-hand side of Eq. (9.12) is a function 
of r only, we can integrate to find O, which comes out 

*-— ^— (9.13) 



4nor 

In obtaining Eq. (9.13), the constant of integration was set equal to zero so that the point at infinity has 
the usually chosen zero potential. 

The dipole source consists of two monopoles of equal magnitude and opposite sign whose spacing 
approaches zero and whose magnitude during the limiting process increases such that the product of 
spacing and magnitude is constant. If we start out with both component monopoles at the origin, then 
the total source and field are zero. However, if we now displace the positive source in an arbitrary 
direction d, then cancellation is no longer complete, and at a field point P we see simply the change in 
monopole field resulting from the displacement. For a very small displacement, this amounts to (i.e., we 
retain only the linear term in a Taylor series expansion) 



*, = - 

p dd 



v 47t0r y 



d (9.14) 



The partial derivative in Eq. (9.14) is called the directional derivative, and this can be evaluated by taking 
the dot product of the gradient of the expression enclosed in parentheses with the direction of d [that 
is, V( ) ■ a d , where a d is a unit vector in the d direction]. The result is 

*, = -^VV(lA) (9-15) 

By definition, the dipole moment m = I d in the limits as d — > 0; as noted, m remains finite. Thus, finally, 
the dipole field is given by [Plonsey, 1969] 



$ = m -V 



p 



4710 



7 (l/r). (9.16) 



9.3 Volume Conductor Properties of Passive Tissue 

If one were considering an active single isolated fiber lying in an extensive volume conductor (e.g., an 
in vitro preparation in a Ringer's bath), then there is a clear separation between the excitable tissue and 
the surrounding volume conductor. However, consider in contrast activation proceeding in the in vivo 
heart. In this case, the source currents lie in only a portion of the heart (nominally where VV,* 0). The 
volume conductor now includes the remaining (passive) cardiac fibers along with an inhomogeneous 
torso containing a number of contiguous organs (internal to the heart are blood-filled cavities, while 
external are pericardium, lungs, skeletal muscle, bone, fat, skin, air, etc.). 

The treatment of the surrounding multicellular cardiac tissue poses certain difficulties. A recently used 
and reasonable approximation is that the intracellular space, in view of the many intercellular junctions, 
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TABLE 9.1 Conductivity Values for 
Cardiac Bidomain 



S/mm 


Clerc [1976] 


Roberts [1982] 


Six 


1.74 x 10-' 


3.44 x 10-" 


Sly 


1.93 x 10- 5 


5.96 x 10- 5 


SaX 


6.25 x 10-" 


1.17 x 10-" 


Say 


2.36 x 10-" 


8.02 x 10- 5 



can be represented as a continuum. A similar treatment can be extended to the interstitial space. This 
results in two domains that can be regarded as occupying the same physical space; each domain is 
separated from the other by the membrane. This view underlies the bidomain model [Plonsey, 1989]. To 
reflect the underlying fiber geometry, each domain is necessarily anisotropic, with the high conductivity 
axes defined by the fiber direction and with an approximate cross-fiber isotropy. A further simplification 
may be possible in a uniform tissue region that is sufficiently far from the sources, since beyond a few 
space constants transmembrane currents may become quite small and the tissue would therefore behave 
as a single domain (a mono domain). Such a tissue also would be substantially resistive. On the other 
hand, if the membranes behave passively and there is some degree of transmembrane current flow, then 
the tissue may still be approximated as a uniform monodomain, but it may be necessary to include some 
of the reactive properties introduced via the highly capacitative cell membranes. A classic study by Schwan 
and Kay [1957] of the macroscopic (averaged) properties of many tissues showed that the displacement 
current was normally negligible compared with the conduction current. 

It is not always clear whether a bidomain model is appropriate to a particular tissue, and experimental 
measurements found in the literature are not always able to resolve this question. The problem is that if 
the experimenter believes the tissue under consideration to be, say, an isotropic monodomain, measure- 
ments are set up and interpreted that are consistent with this idea; the inherent inconsistencies may never 
come to light [Plonsey and Barr, 1986]. Thus one may find impedance data tabulated in the literature 
for a number of organs, but if the tissue is truly, say, an anisotropic bidomain, then the impedance tensor 
requires six numbers, and anything less is necessarily inadequate to some degree. For cardiac tissue, it is 
usually assumed that the impedance in the direction transverse to the fiber axis is isotropic. Consequently, 
only four numbers are needed. These values are given in Table 9.1 as obtained from, essentially, the only 
two experiments for which bidomain values were sought. 

9.4 Effects of Volume Conductor Inhomogeneities: 
Secondary Sources and Images 

In the preceding we have assumed that the volume conductor is homogeneous, and the evaluation of 
fields from the current sources given in Eq. (9.9) is based on this assumption. Consider what would 
happen if the volume conductor in which /' lies is bounded by air, and the source is suddenly introduced. 
Equation (9.9) predicts an initial current flow into the boundary, but no current can escape into the 
nonconducting surrounding region. We must, consequently, have a transient during which charge piles 
up at the boundary, a process that continues until the field from the accumulating charges brings the 
net normal component of electric field to zero at the boundary. To characterize a steady-state condition 
with no further increase in charge requires satisfaction of the boundary condition that d<$/dn = at the 
surface (within the tissue). The source that develops at the bounding surface is secondary to the initiation 
of the primary field; it is referred to as a secondary source. While the secondary source is essential for 
satisfaction of boundary conditions, it contributes to the total field everywhere else. 

The preceding illustration is for a region bounded by air, but the same phenomena would arise if the 
region were simply bounded by one of different conductivity. In this case, when the source is first "turned 
on," since the primary electric field E a is continuous across the interface between regions of different 
conductivity, the current flowing into such a boundary (e.g., a l fQ is unequal to the current flowing 
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away from that boundary (e.g., o 2 E). Again, this necessarily results in an accumulation of charge, and 
a secondary source will grow until the applied plus secondary field satisfies the required continuity of 
current density, namely, 

-a 1 d&Jd n = -a 2 tt>Jdn=J n (9.17) 

where the surface normal n is directed from region 1 to region 2. The accumulated single source density 
can be shown to be equal to the discontinuity of dO/dn in Eq. (9.17) [Plonsey, 1974], in particular, 

The magnitude of the steady-state secondary source also can be described as an equivalent double layer, 
the magnitude of which is [Plonsey, 1974] 

**=**K-°'*)" (9 - 18) 

where the condition at the fcth interface is described. In Eq. (9.18), the two abutting regions are designated 
with prime and double-prime superscripts, and « is directed from the primed to the double-primed 
region. Actually, Eq. (9.18) evaluates the double-layer source for the scalar function v|/ = Oct, its strength 
being given by the discontinuity in \\i at the interface [Plonsey, 1974] [the potential is necessarily 
continuous at the interface with the value (i> k called for in Eq. (9.18)]. The (secondary) potential field 
generated by K' k , since it constitutes a source for \\i with respect to which the medium is uniform and 
infinite in extent, can be found from Eq. (9.9) as 

Vp = <^P = ^£|^<-<)"- V ( 1 / r ) dS (9 - 19) 

k 

where the superscript S denotes the secondary source/field component (alone). Solving Eq. (9.19) for <1>, 
we get 



*'=i^-EJ**K-°'J s - v ( 1 /^ 



(9.20) 



where a p in Eqs. (9.19) and (9.20) takes on the conductivity at the field point. The total field is obtained 
from Eq. (9.20) by adding the primary field. Assuming that all applied currents lie in a region with 
conductivity a a , then we have 



H^'^vIhK-^'^)* (9 - 21) 



<&p 

4%0 J v ' ' 4710 „ 

a P k 

[if the primary currents lie in several conductivity compartments, then each will yield a term similar to 
the first integral in Eq. (9.21)]. Note that in Eqs. (9.20) and (9.21) the secondary source field is similar 
in form to the field in a homogeneous medium of infinite extent, except that a p is piecewise constant 
and consequently introduces interfacial discontinuities. With regard to the potential, these just cancel 
the discontinuity introduced by the double layer itself so that O k is appropriately continuous across each 
passive interface. 

The primary and secondary source currents that generate the electrical potential in Eq. (9.21) also set 
up a magnetic field. The primary source, for example, is the forcing function in the Poisson equation 
for the vector potential A [Plonsey, 1981], namely, 
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V Z A = -}' (9.22) 

From this it is not difficult to show that, due to Eq. (9.21) for <1>, we have the following expression for 
the magnetic field H [Plonsey, 1981]: 

fi= toP' XV ( 1//r ^ V + ^^l^( 0; '" O; '" XV ( 1/,r ^ S (9 ' 23) 

k 

A simple illustration of these ideas is found in the case of two semi-infinite regions of different 
conductivity, 1 and 2, with a unit point current source located in region 1 a distance h from the interface. 
Region 1, which we may think of as on the "left," has the conductivity a,, while region 2, on the "right," 
is at conductivity a 2 . The field in region 1 is that which arises from the actual point current source plus 
an image point source of magnitude (p 2 - P!)/(p 2 + Pi) located in region 2 at the mirror-image point 
[Schwan and Kay, 1957]. The field in region 2 arises from an equivalent point source located at the actual 
source point but of strength [1 + (p 2 - Pi)/(p 2 + pj. One can confirm this by noting that all fields satisfy 
Poisson's equation and that at the interface O is continuous while the normal component of current 
density is also continuous (i.e., CTjcXDj/Sn = a 2 d<f> 2 /dn). 

The potential on the interface is constant along a circular path whose origin is the foot of the 
perpendicular from the point source. Calling this radius rand applying Eq. (9.13), we have for the surface 
potential O s 



s f— - 

A%\lr + r 



1+ (V^Pl (9.24) 

V P2+P1J 



and consequently a secondary double-layer source iC 5 equals, according to Eq. (9.18), 

(c7 2 -CT,)H (9.25) 



( 
Pi 






1 + ^- 

P2+PJ 



where n is directed from region 1 to region 2. The field from K s in region 1 is exactly equal to that from 
a point source of strength (p 2 - Pj)/(p 2 + pj) at the mirror- image point, which can be verified by evaluating 
and showing the equality of the following: 

If- t I \ (p2-Pi)/(p2-Pi) 

47t0j J v ' 4710^ 

where _R in Eq. (9.26) is the distance from the mirror-image point to the field point P, and r in Eq. (9.26) 
is the distance from the surface integration point to the field point. 
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One of the most common problems in bioelectric theory is the calculation of the potential distribution, 
<J> ( V) , throughout a volume conductor. The calculation of O is important in impedance imaging, cardiac 
pacing and defibrillation, electrocardiogram and electroencephalogram analysis, and functional electrical 
stimulation. In bioelectric problems, (i> often changes slowly enough so that we can assume it to be 
quasistatic [Plonsey, 1969]; that is, we ignore capacitive and inductive effects and the finite speed of 
electromagnetic radiation. (For bioelectric phenomena, this approximation is usually valid for frequencies 
roughly below 100 kHz.) Under the quasistatic approximation, the continuity equation states that the 
divergence, V # , of the current density, J (A/m 2 ), is equal to the applied or endogenous source of electrical 
current, S (A/m 3 ): 



V • J = S. 



(10.1) 



In regions where there are no such sources, S is zero. In these cases, the divergenceless of J is equivalent 
to the law of conservation of current that is often invoked when analyzing electrical circuits. Another 
fundamental property of a volume conductor is that the current density and the electric field, E (V/m), 
are related linearly by Ohm's Law, 



J = gE, 



(10.2) 



where g is the electrical conductivity (S/m). Finally, the relationship between the electric field and the 
gradient, V, of the potential is 



-V 0>. 



(10.3) 



The purpose of this chapter is to characterize the electrical conductivity. This task is not easy, because 
g is generally a macroscopic parameter (an "effective conductivity") that represents the electrical prop- 
erties of the tissue averaged in space over many cells. The effective conductivity can be anisotropic, 
complex (containing real and imaginary parts), and can depend on both the temporal and spatial 
frequencies. 
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Before beginning our discussion of tissue conductivity, let us consider one of the simplest volume 
conductors: saline. The electrical conductivity of saline arises from the motion of free ions in response 
to a steady electric field, and is on the order of 1 S/m. Besides conductivity, another property of saline 
is its electrical permittivity, e (S s/m). Permittivity is related to the dielectric constant, k (dimensionless), 
by e = k e , where e is the permittivity of free space, 8.854 x 10~ 12 S s/m. (The permittivity is also related 
to the electric susceptibility, %, by e = e + %.) Dielectric properties arise from a bound charge that is 
displaced by the electric field, creating a dipole. Dielectric behavior also arises if the applied electric field 
aligns molecular dipoles (such as the dipole moments of water molecules) that normally are oriented 
randomly. The DC dielectric constant of saline is similar to that of water (about k = 80). 

The movement of free charge produces conductivity, whereas stationary dipoles produce permittivity. 
In steady state the distinction between the two is clear, but at higher frequencies the concepts merge. In 
such a case, we can combine the electrical properties into a complex conductivity, g': 



g' = g + 1COE, 



(10.4) 



where co (rad/s) is the angular frequency (co = 2ni, where f is the frequency in Hz) and i is J-L . The real 
part of g' accounts for the movement of charge that is in phase with the electric field; the imaginary part 
accounts for out-of-phase motion. Both the real and the imaginary parts of the complex conductivity 
may depend on the frequency. For many bioelectric phenomena, the first term in Eq. (10.4) is much 
larger than the second, so the tissue can be represented as purely conductive [Plonsey, 1969]. (The 
imaginary part of the complex conductivity represents a capacitive effect, and therefore technically violates 
our assumption of quasistationarity. This violation is the only exception we make to our rule of a 
quasistatic potential.) 

10.2 Cell Suspensions 

The earliest and simplest model describing the effective electrical conductivity of a biological tissue is a 
suspension of cells in a saline solution [Cole, 1968]. Let us consider a suspension of spherical cells, each 
of radius a (Fig. 10.1a). The saline surrounding the cells constitutes the interstitial space (conductivity 
a e ), while the conducting fluid inside the cells constitutes the intracellular space (conductivity aj. (We 



a) 



b) 






© 



FIGURE 10.1 (A) A schematic diagram of a suspension of spherical cells; the effective conductivity of the suspension 
is IL/VA. (B) An electric circuit equivalent of the effective conductivity of the suspension. 
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shall follow Henriquez [1993] in denoting macroscopic effective conductivities by g and microscopic 
conductivities by a.) The cell membrane separates the two spaces; a thin layer having conductivity per 
unit area G m (S/m 2 ) and capacitance per unit area C m (F/m 2 ). One additional parameter — the intracellular 
volume fraction, f (dimensionless) — indicates how tightly the cells pack together. The volume fraction 
can range from nearly zero (a dilute solution) to almost 1 (spherical cells cannot approach a volume 
fraction of 1, but tightly packed nonspherical cells can). For irregularly shaped cells, the cell "radius" can 
be difficult to define. In these cases, it is easier to specify the surface-to-volume ratio of the tissue (the 
ratio of the membrane surface area to tissue volume). For spherical cells, the surface-to-volume ratio is 
3f/a. 

We can define operationally the effective conductivity, g, of the cell suspension by the following process 
(Fig. 10.1a): Place the cell suspension in a cylindrical tube of length L and cross-sectional area A (be sure 
L and A are large enough so the volume contains many cells). Apply a DC potential difference V across 
the two ends of the cylinder (so that the electric field has strength V/L) and measure the total current, I, 
passing through the suspension. The effective conductivity is IL/VA. 

Deriving an expression for the effective conductivity of a suspension of spheres in terms of microscopic 
parameters is an old and interesting problem in electromagnetic theory [Cole, 1968]. For DC fields, the 
effective conductivity, g, of a suspension of insulating spheres placed in a saline solution of conductivity 



2(l-f) 

s=t^- (10 " 5) 

For most cells, G m is small enough so that the membrane behaves essentially as an insulator at DC, in 
which case the assumption of insulating spheres is applicable. The net effect of the cells within the saline 
is to decrease the conductivity of the medium (the decrease can be substantial for tightly packed cells). 
The cell membrane has a capacitance of about 0.01 F/m 2 (or, in traditional units, 1 uF/cm 2 ), which 
introduces a frequency dependence into the electrical conductivity. The suspension of cells can be 
represented by the electrical circuit in Fig. 10.1b: R e is the effective resistance to current passing entirely 
through the interstitial space; R; is the effective resistance to current passing into the intracellular space; 
and C is the effective membrane capacitance. (The membrane conductance is usually small enough so 
that it has little effect on the suspension behavior, regardless of the frequency.) At low frequencies, all of 
the current is restricted to the interstitial space, and the electrical conductivity is given approximately by 
Eq. (10.5). At large frequencies, C shunts current across the membrane, so that the effective conductivity 
of the tissue is again resistive: 

2(l-f)a + (l + 2f)<X 

g = -) y^t v^°e. (10-6) 

(2+f)a e + (l-f)c. 

At intermediate frequencies, the effective conductivity has both real and imaginary parts, because the 
membrane capacitance contributes significantly to the effective conductivity. In these cases, Eq. (10.6) 
still holds if 0; is replaced by a* , where 



O . Y a 

. 1 m 

O.+Y a' 



with Y =G +icoC . (10.7) 



Figure 10.2 shows the effective conductivity (magnitude and phase) as a function of frequency for a 
typical tissue. The increase in the phase at about 300 kHz is sometimes called the "beta dispersion". 
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FIGURE 10.2 The magnitude and phase of the effective conductivity as a function of frequency, for a suspension 
of spherical cells: f = 0.5; a = 20 pm; a € = 1 S/m; a, = 0.5 S/m; G m = 0; and C m = 0.01 F/m 2 . 



10.3 Fiber Suspensions 



Some of the most interesting electrically active tissues, such as nerve and skeletal muscle, are better 
approximated as a suspension of fibers rather than as a suspension of spheres. This difference has 
profound implications because it introduces anisotropy: The effective electrical conductivity depends on 
direction. Henceforth, we must speak of the longitudinal effective conductivity parallel to the fibers, g L , 
and the transverse effective conductivity perpendicular to the fibers, g P (In theory, the conductivity could 
be different in three directions; however, we consider only the case in which the electrical properties in 
the two directions perpendicular to the fibers are the same.) The conductivity is no longer a scalar 
quantity, but is a tensor instead, and therefore must be represented by a 3 x 3 symmetric matrix. If we 
choose our coordinate axes along the principle directions of this matrix (invariably, the directions parallel 
to and perpendicular to the fibers), then the off-diagonal terms of the matrix are zero. If, however, we 
choose our coordinate axes differently, or if the fibers curve so that the direction parallel to the fibers 
varies over space, we have to deal with tensor properties, including off-diagonal components. 

When the electric field is perpendicular to the fiber direction, a suspension of fibers is similar to the 
suspension of cells described above (in Fig. 10.1a, we must now imagine that the circles represent cross- 
sections of cylindrical fibers, rather than cross-sections of spherical cells). The expression for the effective 
transverse conductivity of a suspension of cylindrical cells, of radius a and intracellular conductivity cj;, 
placed in a saline solution of conductivity o" e , with intracellular volume fraction f, is [Cole, 1968] 
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FIGURE 10.3 An electrical circuit representing a one-dimensional nerve or muscle fiber: r t and r e are the intracel- 
lular and extracellular resistances per unit length (fl/m); r m is the membrane resistance times unit length (fl m); 
and c m is the membrane capacitance per unit length (F/m). 



(l-f)a. + (i+f)a; 

St = 7 TT °e > 

(l + f)a e + (l-f)a; 

where Eq. (10.7) defines a*. At DC (and assuming that G m = 0) Eq. (10.8) reduces to 



(10.8) 



1-f 
1 + f 



(10.9) 



When an electric field is parallel to the fiber direction, a new behavior arises that is fundamentally 
different from that observed for a suspension of spherical cells. Let us return for a moment to our 
operational definition of the effective conductivity. Surprisingly, the effective longitudinal conductivity 
of a suspension of fibers depends on the length L of the tissue sample used for the measurement. To 
understand this phenomenon, we must consider one-dimensional cable theory [Plonsey, 1969]. A single 
nerve or muscle fiber can be approximated by the circuit shown in Fig. 10.3. Adopting the traditional 
electrophysiology definitions, we denote the intracellular and extracellular resistances per unit length 
along the fiber by x i and r e (Q/m), the membrane resistance times unit length by r m (Q m), and the 
capacitance per unit length by c m (F/m). The cable equation governs the transmembrane potential, V m : 



, 2 3X =x av^ +v 
8x 2 at ni 



(10.10) 



where x is the time constant, r m c m , and "k is the length constant, *yr m /(rj + r e ). For a truncated fiber of 
length L (m) with sealed ends, and with a steady-state current I (A) injected into the extracellular space 
at one end and removed at the other, the solution to the cable equation is 



V =Ir X 



sinhlx/X) 
cosh(L/2A.) ' 



(10.11) 



where the origin of the x-axis is at the midpoint between electrodes. The extracellular potential, V e , 
consists of two terms: One is proportional to x, and the other is r e /(r;+r e ) times V m . We can evaluate V e 
at the two ends of the fiber to obtain the voltage drop between the electrodes, AV e 
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AV = -!-*- 
r. +r 



L + ^2A,tanh(L/2A,) 



(10.12) 
Two limiting cases are of interest. If L is very large compared to X , the extracellular voltage drop reduces to 



AV=-L^LI L»X. (10.13) 

Fj + r 

The leading factor is the parallel combination of the intracellular and extracellular resistances. If, on the 
other hand, L is very small compared to X, the extracellular voltage drop becomes 

AV e = r e L I L « X . (10.14) 

In this case, the leading factor is the extracellular resistance alone. Physically, there is a redistribution of 
current into the intracellular space that occurs over a distance on the order of a length constant. If the 
tissue length is much longer than a length constant, the current is redistributed completely between the 
intracellular and extracellular spaces. If the tissue length is much smaller than a length constant, the 
current does not enter the fiber, but instead is restricted to the extracellular space. If either of these two 
conditions is met, then the effective conductivity (IL/AAV e , where A is the cross-sectional area of the 
tissue strand) is independent of L. However, if L is comparable to X, the effective conductivity depends 
on the size of the sample. A complete model of the effective conductivity of a suspension of fibers must 
take this behavior into account. 

To avoid having the effective conductivity depend on the length of the sample used for the measure- 
ment, Roth et al. [1988] recast the expression for the effective longitudinal conductivity in terms of spatial 
frequency, k (rad/m). There are two advantages to this approach. First, the temporal and spatial behavior 
are both described using frequency analysis. Second, a parameter derived from a specific source geometry 
is not necessary: the spatial frequency dependence becomes a property of the tissue, not the source. The 
expression for the DC effective longitudinal conductivity is 



(l-f)c e +fc. 

fa 1 

1+ 7 S~- 



(10.15) 



1 + 



v Xk y 



To relate the effective longitudinal conductivity to Eqs. (10.13) and (10.14), note that 1/k plays the same 
role as L. If kA,«l, g L reduces to (l-f)a e + fo ; , which is equivalent to the parallel combination of 
resistances in Eq. (10.13). If kX»l, g L becomes (l-f)a e , implying that the current is restricted to the 
interstitial space, as in Eq. (10.14). Equation (10.15) can be generalized to all temporal frequencies by 
defining X in terms of Y m instead of G m [Roth et al., 1988]. Figure 10.4 shows the magnitude and phase 
of the longitudinal and transverse effective conductivities as functions of the temporal and spatial 
frequencies. 

The measurement of effective conductivities is complicated by the traditionally used electrode geom- 
etry. Most often, a uniform electric field is not (or cannot be) applied to the tissue. Typically, one uses 
a four-electrode technique [Steendijk et al., 1993], in which two electrodes inject current and two other 
electrodes measure the potential (Fig. 10.5). Gielen et al. [1984] used this method to measure the electrical 
properties of skeletal muscle and found that the effective conductivity depended on the interelectrode 
distance. Roth [1989] reanalyzed Gielen et al.'s data using the spatial frequency dependent model and 
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FIGURE 10.4 The magnitude (a, c) and phase (b, d) of the effective longitudinal (a, b) and transverse (c, d) effective 
conductivities, calculated using the spatial, k, and temporal, co, frequency model. The parameters used in this 
calculation are G m = 1 S/m 2 ; C m = 0.01 F/m 2 ; f = 0.9; a = 20 um; cj; = 0.55 S/m; and o e = 2 S/m. Source: Roth, B. J., 
Gielen, F. L. H., and Wikswo, J. P., Jr. 1988. Math. Biosci, 88:159-189. With permission. 
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FIGURE 10.5 A schematic diagram of the four-electrode technique for measuring tissue conductivities. Current, 
I, is passed through the outer two electrodes, and the potential, V, is measured between the inner two. The interelec- 
trode distance is s. 

found agreement with some of the more unexpected features of their data (Fig. 10.6). Table 10.1 contains 
typical values of skeletal muscle effective conductivities and microscopic tissue parameters. Table 10.2 
lists nerve effective conductivities. 
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FIGURE 10.6 The calculated (A) amplitude and (B) phase of g L (solid) and g T (dashed) as a function of frequency, 
for an interelectrode distance of 0.5 mm; (C) and (D) show the quantities for an interelectrode distance of 3.0 mm. 
Circles represent experimental data; g L (filled), g T (open). Source: Roth, B. J. 1989. Med. Biol. Eng. Comput, 27:491-495. 
With permission. 

TABLE 10.1 Skeletal Muscle 



Ref. 



gL gr 

Macroscopic Effective Conductivities (S/m) 



Note 



Gielenetal., 1984 






0.35 
0.20 




0.086 
0.092 


10 Hz, IED = 3 mm 
10 Hz, IED = 0.5 mm 


Epstein and Foster, MBEC, 


21:51 




0.52 
0.70 




0.076 

0.32 


20 Hz, IED = 17 mm 
100 kHz, IED = 17 mm 


Rush et al., Circ Res, 12:40 






0.67 




0.040 


0.1 sec pulse 






Microscopic Tissue Parameters 








a, (S/m) 


a e 


(S/m) 


f 


C m (F/m 2 ) G m (S/m 2 ) 


Gielen et al., MBEC, 24:34 




0.55 




2.4 


0.9 


0.01 1.0 



Note: (IED = interelectrode distance). 



TABLE 10.2 Nerve 



Ref. 



Macroscopic Effective Conductivities (S/m) 



Tasaki, J., Neurophysiol, 27:1199 
Ranck & BeMent, Exp Neurol, 11:451 



0.41 
0.57 



0.01 
0.083 



Microscopic Tissue Parameters 
o-,(S/m) o e (S/m) 



Note- 



Toad sciatic nerve 

Cat dorsal column, 10 Hz 



C m (F/m 2 ) G m (S/m 2 ) 



Roth & Altman, Med Biol Eng Comput, 30:103 



0.64 



1.54 



0.35 



0.44 



Note: Volume fraction of myelin = 0.27. G m is proportional to axon diameter; the above value is for an axon 
with outer diameter of 6.5 um. 
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FIGURE 10.7 A circuit representing a two-dimensional syncytium (i.e., a bidomain). The lower array of resistors 
represents the intracellular space, the upper array represents the interstitial space, and the parallel resistors and 
capacitors represent the membrane. 

10.4 Syncytia 

Cardiac tissue is different from the other tissues we have discussed in that it is an electrical syncytium: 
the cells are coupled through intercellular junctions. To the extent that we can justify a continuum model 
and thereby ignore the discrete cellular structure of the tissue, we can use the bidomain model to describe 
the electrical properties of cardiac muscle [Henriquez, 1993]. The bidomain model is essentially a two- 
or three-dimensional cable model that takes into account the resistance of both the intracellular and the 
interstitial spaces (Fig. 10.7). Thus, the concept of current redistribution, discussed above in the context 
of the longitudinal effective conductivity of a suspension of fibers, now applies in all directions. Further- 
more, cardiac muscle is markedly anisotropic. These properties make impedance measurements of cardiac 
muscle difficult to interpret [Plonsey and Barr, 1986]. The situation is complicated further because the 
intracellular space is more anisotropic than the interstitial space (in the jargon of bidomain modeling, 
this condition is known as "unequal anisotropy ratios") [Henriquez, 1993]. Consequently, an expression 
for a single effective conductivity for cardiac muscle is difficult, if not impossible, to derive. In general, 
one must solve a pair of coupled partial differential equations simultaneously for the intracellular and 
interstitial potentials. For unequal anisotropy ratios, this solution is invariably found numerically. 

The bidomain model characterizes the electrical properties of the tissue by four effective conductivities: 
8a.' SiT> SeL> an d g e T> where i and e denote the intracellular and interstitial spaces, and L and T denote the 
directions parallel to and perpendicular to the myocardial fibers. We can relate these parameters to the 
microscopic tissue properties by using an operational definition of an effective bidomain conductivity, 
similar to the operational definition given earlier. (The operational definition of a bidomain conductivity 
is most useful for thought experiments, since its actual implementation would be fraught with experi- 
mental difficulties.) To determine the interstitial conductivity, we first dissect a cylindrical tube of tissue 
of length L and cross-sectional area A (one must be sure that L and A are large enough so the volume 
contains many cells, and that the dissection damages a negligible amount of tissue). Next, we apply a 
drug to the tissue that makes the membrane essentially insulating (i.e., the length constant is much longer 
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than L). Finally, we apply a DC potential difference, V, across the two ends of the cylinder and measure 
the total current, I, passing through the tissue. The effective interstitial conductivity is then IL/VA. This 
procedure could be performed twice, once with the fibers parallel to the long axis of the cylinder, and 
once with the fibers perpendicular to it. To determine the effective intracellular conductivities, the above 
procedure should be followed, except that the voltage difference must be applied to the intracellular space 
instead of to the interstitial space. Although the procedure would be extraordinarily difficult in practice, 
we can imagine two arrays of microelectrodes that impale the cells at both ends of the cylinder, and that 
maintain the potential at each end constant. 

Expressions have been derived for the effective bidomain conductivities in terms of the microscopic 
tissue parameters [Roth, 1988; Henriquez, 1993; Neu and Krassowska, 1993]. The effective conductivities 
in the direction parallel to the fibers are simplest. Imagine that the tissue is composed of long, straight 
fibers (like skeletal muscle) and that the intracellular space of these fibers occupies a fraction f of the 
tissue cross-sectional area. If the conductivity of the interstitial fluid is CT e , then the effective interstitial 
conductivity parallel to the fibers, g eL , is simply 

geL=(l-f)° e . (10.16) 

If we neglect the resistance of the gap junctions, we obtain a similar expression for the effective intrac- 
ellular conductivity parallel to the fibers in terms of the myoplasmic conductivity, c^: g iL = f cj;. When 
the gap junctional resistance is not negligible compared to the myoplasmic resistance, the expression for 
g iL is more complicated: 

g iL = ^— f°i. (10-17) 

7la a 

1+ L 

bG 

where G is the junctional conductance between two cells (S), b is the cell length (m), and a is the cell 
radius (m). The effective interstitial conductivity perpendicular to the fibers is identical with the DC 
transverse effective conductivity for skeletal muscle given in Eq. (10.9) 

geT=^<T e . (10-18) 

The effective intracellular conductivity perpendicular to the fibers is the most difficult to model, but a 
reasonable expression for g iT is 

6r=— J— «!• (10-19) 

1 + ^ 



Table 10.3 contains measured values of the bidomain conductivities (see also Roth [ 1997] ). Typical values of 
the microscopic tissue parameters are given in Table 10.3, although some are quite uncertain (particularly G). 
In our analysis of skeletal muscle, we accounted for the redistribution of current between the intrac- 
ellular and interstitial spaces by the spatial and temporal frequency dependence of the effective conduc- 
tivity. In the bidomain model, however, the current redistribution is accounted for by the pair of coupled 
equations that govern the intracellular and extracellular potentials. Thus, the conductivities are simpler 
for cardiac muscle than for skeletal muscle (no temporal or spatial frequency dependence); but for cardiac 
muscle, we must solve two equations for the intracellular and extracellular potentials instead of one. 
Roth and Gielen [1987] showed that, to a good approximation, the frequency-dependent conductivity 
model presented in Eq. (10.15) is equivalent to the bidomain model in the limit when g iT = 0. 
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TABLE 10.3 Cardiac Muscle 



Ref. g, L g, T g eL 

Macroscopic Effective Conductivities (Ventricular Muscle) (S/m) 



Clerc, J Physiol, 255:335 
Roberts et al., Circ Res, 44:701 
Roberts & Scher, Circ Res, 50:342 



Roth, Ann Biomed Eng, 16:609 
Neu & Krassowska, 1993 



0.17 




0.019 




0.62 0.24 


0.28 




0.026 




0.22 0.13 


0.34 




0.060 




0.12 0.080 


Microscopic Tissue Parameters 






a { (S/m) tr ( 


. (S/m) 


f 


a (urr 


l) b (um) G (uS 


1 


1 


0.7 


100 


10 3 


0.4 


2 


0.85 


100 


7.5 0.05 



If the intercellular junctions contribute significantly to the intracellular resistance, the bidomain model 
will only approximate the tissue behavior [Neu and Krassowska, 1993]. For sufficiently large junctional 
resistance, the discrete cellular properties become important, and a continuum model no longer represents 
the tissue well. Interestingly, as the junctional resistance increases, cardiac tissue behaves less like a syncytium 
and more like a suspension of cells. Thus, we come full circle. We started by considering a suspension of 
cells, then examined suspensions of fibers, and finally generalized to syncytia. Yet, when the intercellular 
junctions in a syncytium are disrupted, we find ourselves again thinking of the tissue as a suspension of cells. 
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Defining Terms 

Anisotropic: Having different properties in different directions. 

Bidomain: A two- or three-dimensional cable model that takes into account the resistance of both the 

intracellular and the extracellular spaces. 
Cable theory: Representation of a cylindrical fiber as two parallel rows of resistors (one each for the 

intracellular and extracellular spaces) connected in a ladder network by a parallel combination of 

resistors and capacitors (the cell membrane). 
Conservation of current: A fundamental law of electrostatics, stating that there is no net current 

entering or leaving at any point in a volume conductor. 
Conductivity: A parameter (g) that measures how well a substance conducts electricity. The coefficient 

of proportionality between the electric field and the current density. The units of conductivity are 

Siemens per meter (S/m). A Siemens is an inverse ohm, sometimes called a "mho" in older literature. 
Interstitial space: The extracellular space between cells in a tissue. 
Ohm's Law: A linear relation between the electric field and current density vectors. 
Permittivity: A parameter (e) that measures the size of the dipole moment induced in a substance by 

an electric field. The units of permittivity are Siemens second per meter (S s/m). 
Quasistatic: A potential distribution that changes slowly enough so that we can accurately describe it 

by the equations of electrostatics (capacitive, inductive, and propagation effects are ignored). 
Spatial frequency: A parameter governing how rapidly a function changes in space; k = 1/(2tis), where 

s is the wavelength of a sinusoidally varying function. 
Syncytium (pi., syncytia): A tissue in which the intracellular spaces of adjacent cells are coupled through 

intercellular channels, so that current can pass between any two intracellular points without 

crossing the cell membrane. 
Volume conductor: A three-dimensional region of space containing a material that passively conducts 

electrical current. 
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Membrane Models 



11.1 The Action Potential 

11.2 Patch Clamp Data 

11.3 General Formulations of Membrane Currents 
Nernst-Planck Equation • Hodgkin-Huxley Resistor 
Battery Model • Goldman-Hodgkin-Katz Constant Field 
Formulation • GHK with Correction for Fixed Surface 
Charge • Eyring Rate Theory Models of Ionic Currents • 
Ion Pump • Ion Exchanges • Synapses • Calcium as a 
Second Messenger 

11.4 Nerve Cells 

Sensory Neurons • Efferent Neurons 

11.5 Skeletal Muscle Cells 

11.6 Endocrine Cells 

11.7 Cardiac Cells 

11.8 Epithelial Cells 

11.9 Smooth Muscle 
Anthony Varghese 11.10 Plant Cells 

University of Minnesota 11.11 Simplified Models 



The models discussed in this chapter involve the time behavior of electrochemical activity in excitable 
cells. These models are all systems of ordinary differential equations where the independent variable is 
time. While a good understanding of linear circuit theory is useful in order to understand the models 
presented in this chapter, most of the phenomena of interest involve nonlinear circuits with time-varying 
components. 

Electrical activity in plant and animal cells is caused by two main factors: first, there are differences 
in the concentrations of ions inside and outside the cell; and second, there are molecules embedded in 
the cell membrane that allow these ions to be transported across the membrane. The ion concentration 
differences and the presence of large membrane-impermeant anions inside the cell result in the existence 
of a polarity: the potential inside a cell is typically 50 to 100 mV lower than that in the external solution. 
It is important to note that almost all of this potential difference occurs across the membrane itself. The 
bulk solutions both inside and outside the cell are for the most part at a uniform potential. This 
transmembrane potential difference is, in turn, sensed by molecules in the membrane that control the 
flow of ions. 

The lipid bilayer, which constitutes the majority of the cell membrane, acts as a capacitor with a specific 
capacitance that is typically 1 uF/cm 2 . The rest of the membrane comprises large protein molecules that 
act as (1) ion channels, (2) ion pumps, or (3) ion exchangers. The flow of ions across the membrane 
causes changes in the transmembrane potential which is typically the main observable quantity in 
experiments. 



© 2000 by CRC Press LLC 



1 


\ 
















T~~~ 




> 












J3, 








Overshoot 




<v 












o 












S3 












<D 


















\_ ' 














5 
















d 








.2 




.2 




i ^ 




c 




la 




i a 




0} 




N 




\ <> 












\ o 




o 




13 




\ *< 




fc 








\ £■ 




CD 
IS 

f-4 




Q 




\ %• 

\ o 




Xi 








\ ^ 




a 












QJ 












a 












1 


Resting 










0) 


Potential 


1 


















O 








Hyperpolarization 

















Time 

FIGURE 11.1 Schematic representation of an action potential in an excitable cell. The abscissa represents time and 
the ordinate represents the transmembrane potential. 



11.1 The Action Potential 

The main behavior that will be examined in this chapter is the action potential. This is a term used to 
denote a temporal phenomenon exhibited by every electrically excitable cell. A schematic representation 
of an action potential is shown in Fig. 11.1. The transmembrane potential difference of most excitable 
cells usually stays at some negative potential called the resting potential. External current or voltage 
inputs can cause the potential of the cell to deviate in the positive direction and if the input is large 
enough the result is an action potential. An action potential is characterized by a depolarization which 
typically results in an overshoot beyond the mV level (see Fig. 11.1) followed by repolarization. Some 
cells may actually hyperpolarize before returning to the resting potential. 

A key concept in the modeling of excitable cells is the idea of ion channel selectivity. A particular 
type of ion channel will only allow certain ionic species to pass through; most types of ion channels are 
modeled as being permeant to a single ionic species. In excitable cells at rest, the membrane is most 
permeable to potassium. This is because only potassium channels (i.e., channels selective to potassium) 
are open at the resting potential. For a given stimulus to result in action potential, the cell has to be 
brought to threshold, i.e, the stimulus has to be larger than some critical size; smaller sub-threshold 
stimuli will result in an exponential decay to the resting potential. The upstroke, or fast initial depolar- 
ization of the action potential is caused by a large influx of sodium ions as sodium channels open (in 
some cells, entry of calcium ions though calcium channels is responsible for the upstroke) in response 
to a stimulus. This is followed by repolarization as potassium ions start flowing out of the cell in response 
to the new potential gradient. While responses of most cells to subthreshold inputs are usually linear and 
passive, the suprathreshold response — the action potential — is a nonlinear phenomenon. Unlike linear 
circuits where the principle of superposition holds, the nonlinear processes in cell membranes do not 
allow responses of two stimuli to be added. If an initial stimulus results in an action potential, a subsequent 
stimulus administered at the peak voltage will not produce an even larger action potential; instead it may 
have no effect at all. Following an action potential most cells have a refractory period, during which 
they are unable to respond to stimuli. Nonlinear features such as these make modeling of excitable cells 
a nontrivial task. In addition, the molecular behavior of channels is not completely well known and, 
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therefore, it is not possible to construct membrane models from first principles. The models in this 
chapter were all constructed using empirical data. 

In 1952, Alan Hodgkin and Andrew Huxley published a paper showing how a nonlinear empirical 
model of the membrane processes could be constructed [Hodgkin and Huxley, 1952]. In the five decades 
since their work, the Hodgkin-Huxley (abbreviated HH) paradigm of modeling cell membranes has been 
enormously successful. While the concept of ion channels was not established when they performed their 
work, one of their main contributions was the idea that ion-selective processes existed in the membrane. 
It is now known that most of the passive transport of ions across cell membranes is accomplished by 
ion-selective channels. In addition to constructing a nonlinear model, they also established a method to 
incorporate experimental data into a nonlinear mathematical membrane model. 

11.2 Patch Clamp Data 

The main source of experimental data used to construct models of cell electrophysiology is the patch 
clamp method [Hamill et al., 1981]. In the whole-cell patch clamp configuration, a glass pipette with a 
very fine tip, typically 1 um in diameter, containing a solution that is close in ionic composition to the 
intracellular fluid, is brought to the external surface of the cell. A small amount of suction is applied to 
form a tight seal between the tip of the glass pipette and the membrane and by applying additional 
suction, the cell membrane under the tip can be ruptured and fluid in the electrode is allowed to come 
in physical contact with the internal fluid of the cell. With electrical contact thus established between the 
fluid of the glass pipette and the interior of the cell, it becomes possible to monitor as well as control 
cell electrical activity using electronic instruments. 

The components of cellular electrophysiology can be summarized schematically in Fig. 11. 2A. Cell 
membranes may contain ion channels that allow ions to flow down their electrochemical gradient and 
ion pumps that move ions against their gradients. The various cell membrane channel, pump, and 
exchanger currents are summed in I ion = I ion ( V m y) which is a nonlinear function of V m and ~y, a vector 
representing the kinetics of the various channels, etc. The capacitance of the membrane, C m , responds 
to changes in the cell transmembrane potential difference, V m , with a capacitative current, I c = C m ^j L . 
Using Kirchoff 's Current Law from electrical circuit theory, 

I r + I. =0 (11.1) 

C ion y ' 

which yields the basic differential equation for the cell transmembrane potential difference, V m : 

C ^ = -1. (v -y) (11.2) 

m j wn\ m / I v ' 

The kinetics of y is usually defined by a system of equations of the form: 

Cell patch-clamping allows experimenters to either inject a current while observing the response in 
V m , (current-clamp mode, Fig. 11. 2B) or impose a waveform, V cl (t), on the cell transmembrane 
potential and observe membrane current, I m (voltage-clamp mode, Fig. 11.2). By choosing appropriate 
voltage-clamp waveforms, Vclamp(t), experimenters can make deductions about the structure of the 
nonlinear function F(V m ,y) in Eq. (11.3). The membrane current measured during voltage clamp exper- 
iments, I m (t) = C m '~'™' e + I ion (V clam ,y) is then painstakingly dissected into its essential components due 
to the various ion channels, pumps, and exchangers by established protocols as well as by trial and error. 
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A Cell Membrane Model and Circuit Equivalent 
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FIGURE 11.2 Schematic view of membrane currents and patch clamp techniques. Most cells have a low internal 
concentration of sodium (Na+) and a high internal concentration of potassium (K+) ions while the external concen- 
trations are reversed. Sodium and potassium channels allow these ions to run down their electrochemical gradients 
while ion pumps can pump these ions against their respective electrochemical gradients. This electrical activity can 
be represented by an equivalent circuit shown to the left. The cell membrane capacitance is represented by the capacitor 
Cm and the various ionic currents are added together into a single nonlinear element through which the current I ion 
passes. Thus, the equation of circuit equivalent is: C„ — - + ^ o „(V r m ,y) = and the time-dependent properties of the 
nonlinearites of l io „ are described by the vector equation -j- = H V m , y). In the current clamp mode of the whole cell 
patch clamp configuration, one electrode is used to inject a known current l damp (t), while the other electrode is used 
to sense the potential inside the cell. In voltage clamp mode, the potential inside the cell is compared to a known 
voltage supplied by the experimenter and an amplifier is used to supply whatever current is required to keep the cell 
potential at the specified voltage V ciamp (t). The resulting circuit equations are indicated below the circuit diagrams. 
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11.3 General Formulations of Membrane Currents 

In this section we examine the various components common to membrane models. We start with the 
Nernst-Planck formulation of ion flow across a membrane. Although it is seldom used now, this equation 
is needed to derive other, more practical models such as the resistor-battery model or the Gold- 
man-Hodgkin-Katz current model. Some preliminary remarks are in order before examining these 
models. At dilute concentrations, ions in aqueous solutions behave like gas molecules. This is why the 
gas constant, R, is ubiquitous in models of ion flow in cells. Similarly, the phenomenon of chemisorption 
of gas molecules is used as an analog to study the binding of ions (and drug molecules) to receptors in 
the cell membrane. In chemisorption under equilibrium conditions, the fraction of gas molecules bound 
to fixed reaction sites is given by an expression of the form: [C]'7(k + [C] n ) where [C] is the concentration 
of gas, k is a constant, and n is typically a small positive number. This expression is also derived from 
Michaelis-Menten type kinetic schemes. Similar terms arise in many membrane models cited in this 
chapter and they indicate that some fraction of an ionic species, C, is binding to receptor molecules on 
the cell surface. 

Nernst-Planck Equation 

One of the most general descriptions of ion flow across a membrane is given by the Nernst-Planck 
equation. This is a partial differential equation where the independent variables represent space (x) and 
time (t). The main dependent variable is the concentration of the ion (c(x,f)). The potential {u(x,t}) is 
usually a fixed function but can be made a dependent variable in which case an additional equation is 
required. The Nernst-Planck equation can be written as: 



dcix,t) g (i(x,tj 



dt 



RT 



dc[x,t) 



+ zc\ 



( x Ay x ( x A 



(11.4) 



where the symbols are defined thus: 



Variable 


Description 


Dimensions 


R 


Gas constant 


8314.41 mJoules/(mole°K) 


T 


Temperature 


310°K 


F 


Faraday's constant 


96485 Coulombs/mole 


u 


potential 


mV 


c 


ion concentration 


Moles liter 1 


z 


valence of the ion 




E 


electric field force 


mV m~' 


M 


ion mobility 


m 2 mV~V 



Note that u can vary with both space and time depending on the type of ion channel or pump and 
its gating properties. Due to the complexity of this formulation, it is of limited use when examining 
behavior at the cellular level but it can be used to derive simpler formulations under certain assumptions 
as shown below. 

Hodgkin-Huxley Resistor Battery Model 

This is the model that is most frequently employed to look at ions flowing through channels and is shown 
in terms of its circuit equivalent in Fig. 11.3. This models the passive flow of ions due to a transmembrane 
potential gradient, V m , and a concentration gradient (represented in the battery, E ; ) through a nonlinear 
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FIGURE 11.3 Circuit representation of membrane current. The con- 
ductance can be nonlinear as indicated by the powers p and q on the 
state variables m t and h iy respectively. These state variables are typically 
time-varying functions of the transmembrane potential difference, V. 
The battery, Ei, represents the electrochemical gradient of the ionic 
species responsible for the current. 



V 



G.m P h Q 
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membrane conductance. From the structure of the circuit, we see that the equation for the current for 
ionic species i has the form: 



:Gm p h q (v -E) 

, t i \ m ij 



(11.5) 



where G, represents the maximum value of the conductance of the membrane, and m l and h i are gating 
variables that vary in time and take values between and 1. p and q are integers that depend on the 
characteristics of the membrane channel. 

The above expression can be derived from the Nernst-Planck equations with the assumption that 
concentration does not vary with time. In addition the space dimension of Eq. (11.4) is reduced to a 
two-compartment — cell interior and exterior — model. Thus, the steady-state of Eq. (11.4) tells us that 
when there is no net current flow, the transmembrane potential will equal a quantity called the "Nernst" 
potential: E ; = 



RT 



In 



[C] 



— . z.-is the valence of the ion, [C ; L is the concentration of the ion on the outside 

v ' C li 1. . ... 

and [CJ is the concentration on the inside. Quantities such as ion mobility are effectively lumped into 
a nonlinear time-varying conductance (G,WJ/z'). 

First order differential equations are used to model the time behavior of m i and h t . For example, the 
equation for m, will have the form: 



dm 

dt 



- = a \\-m -B 



(11.6) 



where the a m and f5 m are empirically determined functions of the membrane potential. This equation is 
also written as: 4 s - = J 



where m im g " and x, = - ' . Voltage-clamp protocols are used by experi- 
menters to characterize m ia and x t empirically as functions of V m for a given cell. As a result, there is no 
unique derivation for the differential equations describing the time behavior of the gating variables. 
Selectivity is implicitly modeled by assuming that only one ionic species flows through a current branch. 
A number of such branches are connected in parallel to model the interaction of the various ion currents. 

Goldman-Hodgkin-Katz Constant Field Formulation 

From Eq. (11.4), assuming that potential varies linearly across the membrane and that the ion flux is 
constant, we obtain an expression for current flow of the form [Goldman, 1943]: 



I. = P.d p f q 



V 

m 

RT/zF 



1- 



RT/xF 



C 



c 



RT/zF 



(11.7) 



where P ; is the maximum permeability of the membrane and d and /are gating variables like m and h 
in Eq. (11.5). This equation is frequently used when large concentration gradients are present. 
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FIGURE 11.4 Schematic representation of energy barrier in a rate theory model of ionic current. A single energy 
barrier of height G b located at a fraction 8 of the membrane thickness is assumed in this example. 

GHK with Correction for Fixed Surface Charge 

When surface charge is present on the inside or outside of the cell membrane, the effective potential 
gradient sensed by the channel is modified by a correction factor, V surface , and the current then has the 
form [Frankenhaeuser, I960]: 



I. = P,d?f q 

1 I I J I 



v — V , 

m surface 

RT/zF 



1-e 



RTJzF 



c. - c 



RT/zF 



(11. 



Eyring Rate Theory Models of Ionic Currents 

An alternative to the Nernst-Planck models of charge diffusion is the rate-theory model which views ion 
permeation through a channel or pump from a statistical thermodynamics perspective. In this approach, 
one or more energy barriers are assumed to exist at fixed points in the channel and ions permeate the 
channel when the energy of the ions overcomes the energy barrier. The case of one barrier is shown 
schematically in Fig. 11.4. The forward and backward ion flux rates can be described using Boltzmann 
equations as shown below. 



K- 



- G,,-C„ -z8 



rt/f 



(11.9) 



'rt/f 



T1.10) 



where k x is the forward flux rate from the outside to the inside, and k j is the flux rate in the other 
direction; v is some nominal flux; G and G, are the energies at the external and internal surfaces of the 
cell membrane; G b is the height of the energy barrier; z is the valence of the permeating ion; 5 is the 
location of the membrane barrier as measured from the external side of the membrane; V m is the trans- 
membrane potential difference and is implicitly assumed to vary linearly through the thickness of the 
membrane; and R, T, and F are as defined earlier. Given these flux rates, we can write the flux equations as 



influx = C 



JO 



.11.11) 
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efflux : 



-[c]^ (11.12) 



If the net current is then assumed to be the difference between the efflux and the influx, it can be 
written as: 



L = v 



C e l ' l ' RT / F -\c\e RJ I F 



V 



(11.13) 



The equations for the case of multiple barriers can be derived in a similar way but are much more 
tedious to work through. 

Ion Pump 

Ion pumps are membrane molecules that, unlike channels, require energy to function. While the workings 
of ion pumps can get quite complex especially if the source of energy for these pumps needs to be modeled, 
most models in use assume a simplified form using Michaelis-Menten terms such as: 



m "[x] l + k[Yl + k 



p pmax [ 



In the above equation, \ pmax is the maximum pump current, [X] ; is the concentration of the ionic 
species on the inside being pumped out, and [Y] is the concentration of the ionic species on the outside 
being pumped in. k x and k represent sensitivities of the pump to these ion concentrations. 

Ion Exchangers 

Ion exchangers are similar to ion pumps but do not require energy directly. Instead exchangers use the 
energy stored in a concentration gradient of ion X to transport ion Y in the opposite direction. An 
example of an exchanger current is the cardiac Na-Ca exchanger which uses the concentration gradient 
of sodium (high outside, low inside) to pump one calcium ion out of the cell for every three sodium 
ions allowed into the cell. An additional difference between exchangers and pumps is that exchangers 
are more voltage-sensitive and are thus more readily operated in the reverse direction under physiological 
conditions. 

Synapses 

Yamada and Zucker [1992] have tested various kinetic mechanisms to model calcium control of presyn- 
amptic release of transmitters. The classic model of postsynaptic response to a synaptic input is the so- 
called alpha function due to Rail [1967] and has the form: 



G(tj= t —^-e p " k (11.15) 

peak 

where G is usually a conductance in a resistor-battery type branch circuit. Although the alpha function 
is the one that is most commonly used, more detailed schemes have been constructed using kinetic 
models [Destexhe etal., 1994]. 
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Calcium as a Second Messenger 

Calcium entry into a cell frequently has a number of secondary effects such as initiation of contraction, 
release of neurotransmitters, and modulation of membrane ion channels. This is usually accomplished 
by the binding of calcium ions to calcium receptors inside the cell. Michaelis-Menten kinetic schemes 
with steady-state assumptions are used to model this binding and therefore expressions of the form f = 
[Ca ++ ]"/ [Ca ++ ]" + K are frequently employed. Here f is the fraction of calcium that is bound to the 
receptor, K is the dissociation constant for the reaction, and n is the number of calcium ions that bind 
to each receptor molecule. 

11.4 Nerve Cells 

Nerve cells typically have complex geometries with axons, branching dendrites, spines, and synapses. 
While it was thought for a long time that dendrites could be modeled using linear resistor-capacitor 
circuits, it is becoming increasingly clear that nonlinear ionic currents are present in many dendritic 
trees. Due to technical difficulties in measuring currents in dendrites and synapses directly, most neuronal 
models in this chapter describe ionic currents of cell soma or axons. Since the HH equations constitute 
the basis for membrane modeling, the reader should be familiar with this model before examining the 
others. 

Squid Axon: Hodgkin and Huxley — Most models of excitable cells are descendents of this model. The 
basic circuit equivalent of this model comprises a linear capacitance in parallel with three resistor-battery 
sub-circuits (one each for sodium, potassium, and a nonspecific leakage channel). The HH equations in 
their original form were: 

C m d ^ = -G N m'h(v-E Na )-G K n 4 (v-E K ) = G l (v-E?} 



(11.16) 
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rate constants 


were given originally as: 












a m = 


(v+25)/l0 

g (v + 25)/l0_ 1 



(11.17) 



,V/18 



(3 =4e v/ls (11.18) 



cc ;i = 0.07e v ' /20 (11.19) 



(11.20) 



+ 1 
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i (v+io)/io 

a„ = — \_ w ; / — (11.21) 

■1 



10 > +10 )/ 10 



(v/so) 

(3 =- (11.22) 

H " 8 

The physical constants used in these equations are: G Na = 120 mS/cm 2 , G K = 36 mS/cm 2 , G ( = 0.3 mS/cm 2 , 
Cm = luF/cm 2 , E Nn = -115mV, E K = 12 mV, and E, = -10.613. 

The original equations as listed by Hodgkin and Huxley [1952] unfortunately used a sign convention 
that has not been followed since. Using the sign convention of today for the transmembrane potential, 
the rate constants can be rewritten by replacing all occurrences of V with — (V m + 60). 

The successes of this model include the ability to predict the velocity of action potential propagation 
when the spatial aspect is included, the continuous nature of the threshold of the nerve, and the repetitive 
firing behavior seen under the influence of a constant current. A couple of remarks should be made 
regarding this model that hold for other models as well. First, L'Hospital's Rule allows us to compute the 
value of expressions of the form ~T7 as in Eq. (11.17) at the point x = 0. Second, for temperatures other 
than 6.3°C, multiply each a and (3 and § = 3 <T_63)/10 where T is given in degrees Celsius; this is referred 
to as a Q 10 temperature dependence of rate constants. 

Toad Myelinated Neuron — Frankenhaeuser and Huxley [1964] modified the HH model to describe 
the fast sodium and delayed rectifier potassium currents with the GHK model rather than the resistor- 
battery formulation used by Hodgkin and Huxley. In addition, they added a small non-specific current 
to account for inward currents late in action potentials of myelinated neurons of the toad Xenopus laevis. 

Gastropod Neuron — The model of Connor and Stevens [1971] used an HH-type system of differential 
equations to model repetitive firing in isolated gastropod neurons. This was the first model to include 
an inactivating potassium current (the A- type current) that has since been used in a number of other cells. 

Aplysia Abdominal Ganglion R15 cell — The bursting behavior of Aplysia neurons has been extensively 
studied. The first model was that of Plant [1976], who extended the HH equations to include an 
inactivating potassium current, a slow potassium current, and a constant hyperpolarizing current due 
to the Na/K pump. Bertram [1993] modeled bursting in these cells by augmenting an HH model with 
a calcium current, a second delayed rectifier type potassium current, and serotonin activated inward- 
rectifying potassium current and a negative slope region calcium current. A detailed model including a 
fast inward sodium current, fast and slow calcium currents, a delayed rectifier and an inward rectifier 
potassium current, a sodium-potassium pump, a sodium-calcium exchanger, a calcium pump and a 
leakage current was described in Butera et al. [1995]. 

CA3 Hippocampal Pyramidal Neurons — Traub et al. [1991] have constructed a multicompartment- 
soma and dendrites-model of hippocampal neurons with up to six membrane currents in each compart- 
ment: (1) a fast sodium current, (2) a calcium current, (3) a delayed rectifier potassium current, (4) an 
inactivating (A-type) potassium current, (5) a long-duration calcium activated potassium current, and 
(6) a short-duration calcium activated potassium current. Furthermore, calcium concentration changes 
in a restricted space beneath the membrane is modeled using a simplified linear buffering scheme. Good 
and Murphy [1996] modified this model to include N-, L-, and T-type calcium currents along with an 
AMPA activated current and the effects of (3-amyloid block of the A-type current. A branching dendritic 
version of the model [Traub et al., 1991] with synaptic input is described in [Traub and Miles, 1995]. 

CA1 Hippocampal Pyramidal Neurons — A model that accurately predicts accommodation in CA1 
neurons was constructed by Warman et al. [1994]. Using the previous work of Traub et al. [1991] and 
taking experimental data into consideration, they constructed a 16-compartment model with the follow- 
ing membrane currents: (1) a fast sodium current, (2) a calcium current, (3) a delayed rectifier potassium 
current, (4) an inactivating (A-type) potassium current, (5) a long-duration calcium dependent potas- 
sium current, (6) a short-duration calcium and voltage activated potassium current, (7) a persistent 
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muscarinic potassium current, and (8) a leakage current. Unlike the Traub model, separate pools of 
calcium were used to modulate the potassium currents. A linear buffering scheme was assumed for both 
internal calcium pools but the decay times were assumed to be different in each pool. 

Stomatogastric Ganglion Neurons — Epstein and Marder [1990] constructed an HH-type model of 
the lobster stomatogastric ganglion neuron with a fast sodium current, a delayed rectifier potassium 
current, a voltage dependent calcium current, a calcium dependent potassium current, and a linear 
leakage current in order to study the mechanism of bursting oscillations in these cells. 

Chopper Units in the Anteroventral Cochlear Nucleus — Banks and Sachs [1991] constructed an 
equivalent cylinder model of chopper units in the anteroventral cochlear nucleus. The model of the soma 
membrane included a fast sodium current, a delayed rectifier current, a linear leakage current, and 
inhibitory and excitatory synaptic currents using the Rail a-wave model. 

Lamprey CNS Neurons — The model of Brodin et al. [ 1 99 1 ] consisted of voltage-gated sodium, potas- 
sium, and calcium currents, a calcium-activated potassium current, and an NMDA receptor channel. 
The voltage-dependent block by magnesium of the NMDA was also modeled in this paper. Two com- 
partments were used to model calcium concentration changes inside the cells. 

Thalamocortical Cells — Neurons in the thalamocortical systems show various kinds of oscillatory 
behavior during sleep and wake cycles. It has been shown in the last decade that these neurons interact 
to produce a wide range of oscillatory activity. Huguenard and McCormick [1992] constructed a model 
of thalamic relay neurons with: ( 1 ) a low-threshold transient calcium current, (2) an inactivating (A-type) 
potassium current, (3) a slowly inactivating potassium current, and (4) a hyperpolarization activated 
nonspecific current. McCormick and Huguenard [1992] extended the above model to simulate the 
behavior of thalamocortical relay neurons by adding: (1) a fast sodium current, (2) a persistent sodium 
current, (3) a high-threshold calcium current, (4) a calcium activated potassium current, and (5) linear 
leakage currents. Destexhe, McCormick, and Sejnowski [1993] and Destexhe, Babloyantz, and Sejnowski 
[1993] have examined similar models as well. 

Human Node of Ranvier — Schwartz et al. [ 1995] measured action potentials and separated membrane 
currents in nodes of Ranvier in human nerve trunks. Their model comprises a fast sodium current, fast 
and slow potassium currents, and a leakage current. 

Dopaminergic Neurons — A minimal model of the soma with a fast sodium current and a delayed 
rectifier potassium current and a dendritic compartment with a sodium-potassium pump, NMDA- 
activated current and leakage current was augmented with an L-type calcium current in the dendritic 
compartment and an A-type potassium current, T-type calcium current, and a calcium sensitive potas- 
sium current in the soma by Li et al. [1996]. 

Sensory Neurons 

Rabbit Sciatic Nerve Axons — A fast sodium current and a leak current were found to be sufficient to 
model single action potentials in the nodes of Ranvier of axons from rabbit sciatic nerve by Chiu et al. 
[1979]. It is likely that this model should be augmented to include potassium currents in order to model 
trains of action potentials accurately. 

Myelinated Auditory-Nerve Neuron — A detailed model of the morphology of myelination and ion 
channel distribution was formulated by Colombo and Parkins [1987] using the Frankenhaeuser-Huxley 
model as a base: it includes a fast sodium current, fast potassium current, a leakage current, and a slow 
sodium current. 

Retinal Ganglion Cells — Fohlmeister et al. [1990] modeled the retinal ganglion cells using a fast 
inactivating sodium current, a calcium current, a noninactivating (delayed-rectifier) potassium current, 
an inactivating potassium current, and a calcium-activated potassium current. An important feature in 
this model is that it models the calcium concentration inside the cell using a simplified buffering scheme 
and in this way they are able to model the modulation potassium currents by calcium. 

Retinal Horizontal Cells — The model of Winslow and Knapp [1991] consisted of a fast sodium 
current, an inactivating (A-type) potassium current, a noninactivating potassium current, an anomalous 
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rectifier potassium current, a calcium-inactivated calcium current, and a linear leakage current. In 
addition, calcium concentration changes in a restricted region beneath the membrane was also modeled. 

Rat Nodose Neurons — Schild et al. [1994] built a model comprising fast and slow sodium currents, 
an A-type potassium current, a delayed rectifier potassium current, T- and L-type calcium currents, a 
calcium-activated potassium current, a slowly inactivating potassium current, a sodium-calcium 
exchanger, a sodium-potassium pump, a calcium pump current, and background currents. This model 
was enhanced with descriptions of vesicular storage and release in Schild et al. [1995] and further tuned 
to fit myelianted and non-myelinated neurons in Schild and Kunze [1997]. 

Muscle Spindle Primary Endings — Otten et al. [1995] extended the Frankenhaeuser-Huxley model 
with modifications to reproduce repetitive firing in sensory endings: in addition to the fast sodium 
current, fast potassium current, and leakage currents of the Fh model, a slowly activating delayed rectifier 
current was required. 

Vertebrate Retinal Cone Photoreceptors — A model of vertebrate photorecptor cells that includes a 
light-sensitive current, an L-type calcium current, a delayed rectifier potassium current, calcium-sensitive 
potassium and chloride currents, a hyperpolarization-activated current, a sodium-potassium pump, and 
a leak current in addition to the response of the cell to a channel blocker was published by Usui et al. [1996] . 

Primary and Secondary Sensory Neurons of the Enteric Nervous System — A model of primary and 
secondary afferents with a fast sodium current, an N-type calcium current, a delayed rectifier potassium 
current, a calcium sensitive potassium current, and a chloride current was set up by Miftakhov and 
Wingate [1996]. 

Invertebrate Photoreceptor Neurons — The membrane properties of nerve terminals, axons, soma, 
and microvilli of Type-B photoreceptors of the invertebrate Hermissenda were modeled using a fast 
sodium current, an A-type potassium current, a calcium-dependent potassium current, a delayed rectifier 
potassium current, a non-inactivating calcium current, light-induced sodium and calcium currents, and 
a leakage current by Fost and Clark [1996]. 

Fly Optic Lobe Tangential Cells — Three families of tangential cells of the blowfly were modeled by 
Haag et al. [1997]. The centrifugal-horizontal cell has an inward calcium current and outward potassium 
currents. The other cells have a fast sodium current, a delayed rectifier potassium current, and a sodium- 
dependent potassium current. 

Rat Mesencephalic Trigeminal Neurons — Negro and Chandler [1997] have constructed a model of 
sensory neurons involved in brain stem control of jaw musculature. This model of the trigeminal neurons 
includes a fast sodium current, N- and T-type calcium currents, two A-type potassium currents, a 
sustained potassium current, a delayed rectifier potassium current, a calcium-dependent potassium 
current, a hyperpolarization-activated current, and a leakage current. 

Primary Afferents and Related Efferents — A model comprising a fast sodium current, a delayed 
rectifier potassium current, a calcium current, and a calcium-dependent potassium current was used to 
model primary afferents as well as interneurons and efferent neurons connected to the afferents by Saxena 
etal.[1997]. 

Myelinated I Primary Afferent Neurons — The intraspinal collateral of a myelinated primary afferent 
neuron was modeled with a myelination morphology by D'Incamps et al. [1998] and included a fast 
sodium current, a delayed rectifier potassium current, and a Rail synaptic current at a particular node 
of Ranvier in the network. 

Efferent Neurons 

Sympathetic Neurons of the Superior Cervical Ganglia — Superior cervical ganglion neurons were mod- 
eled by Belluzzi and Sacchi [1991] using the following membrane currents: (1) sodium currents with 
fast and slow components, (2) a GHK-type calcium current, (3) a delayed rectifier potassium current, 
(4) an inactivating (A-type) potassium current, and (5) a calcium activated potassium current. One 
drawback of this model is the calcium activation of the potassium channels was modeled using a fixed 
time delay rather than by allowing the internal calcium concentration to change. 
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Small Intestine Cholinergic and Adrenergic Neurons — Miftakhov and Wingate [ 1 994a] used the HH 
equations (fast sodium current, delayed rectifier potassium current, and a leak current attributed to 
chloride channels) and added a system of equations to model the release of the neurotransmitter acetyl- 
ocholine by the presynaptic terminal. Similarly, release of noradrenaline was modeled in Miftakhov and 
Wingate [1994b]. 

Pyloric Constrictor neurons of the Lobster — Models of the crab lateral pyloric neuron were modified 
to include a fast sodium current, an A-type potassium current, a delayed rectifier potassium current, a 
calcium current, a calcium-dependent potassium current, and a leak current by Harris- Warrick et al. [1995] 
to examine the effect of the neurotransmitter dopamine on the A-type current. This model was augmented 
by Harris- Warrick, Coniglio, Levini et al. [1995] to include a hyperpolarization activated current. 

Mammalian Spinal Motoneuron — Halter et al. [1995] set up a detailed model of myelinated moto- 
neurons with myelination morphology and ionic currents including: a fast sodium current, fast and slow 
delayed rectifier potassium currents, and a leak current. 

Leech Heartbeat Oscillator Interneuron — Interneurons controlling the leech heartbeat were modeled 
using: a fast sodium current, an A-type potassium current, an inward rectifier current, fast and slowly 
inactivating calcium currents, a persistent sodium current, a hyperpolarization activated current, slowly 
inactivating and persistent potassium currents, and synaptic chloride currents [Nadim et al., 1995]. 

Snail RPal Bursting Neuron — A model including a fast sodium current, a delayed rectifier potassium 
current, a voltage-dependent calcium current, a voltage- and calcium-dependent calcium current, and a 
leak current was constructed by Berezetskaya et al. [1996] to investigate the bursting behavior of pace- 
maker cells of the Helix pomatia. 

Vertebrate Motoneuron — Data from motonerouns from various species were put together to construct 
a generic model of vertebrate motoneurons by Booth et al. [1997]. This model contains a fast sodium 
current, a delayed rectifier potassium current, L- and N-type calcium currents, and a calcium dependent 
potassium current. 

Xenopus Central Pattern Generator Neuron — A model of Xenopus embryo swimming central pattern 
generator neurons with HH currents was modified by Tabak and Moore [1998] to include a voltage- and 
magnesium-sensitive NMDA channel current and a Rail alpha-wave mechanism for non-NMDA post- 
synaptic currents. 

11.5 Skeletal Muscle Cells 

Frog Sartorius Muscle Cell — Skeletal muscle cells have a significant amount of current flowing through 
the cell membrane in the T- tubules of the cells. For this reason, it becomes imperative to model the currents 
in the T-tubules along with the rest of the cell membrane. The Adrian and Peachery [1973] equations are 
a system of coupled HH-type circuits and thus have a significant spatial component. The equations describ- 
ing the velocity field have the same structure as the HH equations. Two models were suggested in the paper: 
if sodium and potassium channels are assumed to exist in the T-tubules, the usual HH formulation of the 
corresponding currents are used; otherwise, only a linear leakage current appears in the voltage equation. 
Barnacle Muscle — Morris and Lecar [1981] sought to model the oscillatory activity in current- 
clamped barnacle muscle fibers using just two non-inactivating currents: a fast calcium current and a 
potassium current. It should be kept in mind that their model does not take into account the presence 
of a calcium chelator (EGTA) inside the cells used to obtain experimental results. Their equations have 
the same general structure as the HH equations: 

C m d ^ = -G L (v-E L )-G c m(v-E Ca )=G K n(v-E K ) (11.23) 
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Typical values of the parameters are: C m = 20 uF/cm 2 , G L = 2 mS/cm 2 , G K = 8 mS/cm 2 , G Ca = 4 mS/cm 2 , 
E L = -50m V, E K = -70mV, E Ca = lOOmV, V; = 10m V, v 2 = 15mV, v, = lOmV, v 4 = 14.5mV, X> 0.1, and 
X = 1/15. An alternative model of the calcium current was also proposed. Instead of the HH resistor- 
battery type formulation, a GHK formulation was suggested: 
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where typical values of [Ca ++ ], and [Cfl ++ ] were 0.001 mM and 100 mM, respectively. 

11.6 Endocrine Cells 

Pancreatic (3-cells — An interesting electrophysiological phenomena occurs in the Islet of Langerhans of 
the pancreas: the release of insulin is controlled in these islets by trains of action potentials occurring in 
rapid bursts followed by periods of quiescence. This "bursting" behavior occurs only in intact islets: single 
cells do not display such bursting activity. Chay and Keizer [1983] was the first attempt to model this 
phenomenon quantitatively. Sherman et al. [1988] sought to explain the absence of bursting in single 
P-cells using the idea of "channel-sharing". Keizer [1988] modified this model by substituting an ATP- 
and ADP-dependent K channel instead of the Ca-dependent K channel. This model was then further 
improved by Keizer and Magnus [1989]. Sherman et al. [1990] constructed a domain model to examine 
the effect of Ca on Ca channel inactivation. Further refinements have been made by Keizer and Young 
[1983]. 

Pituitary Gonadotrophs and Corticotrophs — Li et al. [1995] have constructed a model of membrane 
fluxes and calcium release from the endoplasmic reticulum of rat pituitary gonadotrophs. The membrane 
currents modeled include an L- and T-type calcium current, delayed rectifier and calcium-sensitive 
potassium currents, and a leak current. A modification of this model was used by LeBeau et al. [1997] 
to model pituitary corticotroph cells. 
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11.7 Cardiac Cells 

Purkinje Fiber — Older models of the Purkinje fiber [Noble, 1960; 1962; Noble and Tsien, 1969; McAllister 
et al., 1975] failed to model the pacemaking currents correctly. Also, external and internal concentration 
changes were not modeled until the work of DiFrancesco and Noble [1985]. (The original paper contained 
a number of errors and a corrected listing of the model equations can be found in Varghese and Winslow 
[1993]. Despite the fact that the magnitude of the changes in calcium concentrations is at odds with 
recent experimental observations, this model has been very influential in directing modeling efforts. Most 
cardiac models being constructed now use the DiFrancesco-Noble structure. 

Sinoatrial Node — As was the case with Purkinje fibers, the pacemaking mechanism was not modeled 
correctly in older models [Bristow and Clark, 1982; Yanagihara et al., 1980] and it was only after the 
Purkinje fiber model of DiFrancesco and Noble [1985] that accurate models of the sinoatrial node models 
could be constructed. An excellent summary of these modeling efforts can be found in Wilders et al. 
[1991] complete with listings of the equations. The most detailed model of sinoatrial node cells to date 
is that of Demir et al. [1994]; this model includes a biophysically accurate description of internal calcium 
buffering and varying extracellular concentrations. A model similar to ones in Wilders et al. [1991] was 
augmented to model the effects of acetylcholine changes outside sinoatrial node cells by Dokos, et al. 
[1996]. 

Atrial Muscle — The first detailed model of the excitation-contraction coupling mechanism in heart 
cells was constructed by Hilgemann and Noble [1987] and a single cell model was completed by Earm 
and Noble [1990]. A similarly detailed model of rabbit atrial cells was constructed Lindblad et al. [1996] 
and modified to fit data from human atrial cells by Nygren et al. [ 1998] . Another model of human atrial 
cells based on the ventricular cell model of Luo and Rudy [1994] but with improved calcium handling 
equations was formulated by Courtemanch et al. [1998]. The main differences between the Nygren and 
Courtemanche models are that the latter uses a steady-state approximation for the calcium buffers and 
the relative sizes of the rapid and slow components of the delayed rectifier are reversed. 

Ventricular Muscle — The first model of ventricular cells [Beeler and Reuter, 1977] consisted of (1) a 
fast inward sodium current, (2) a slow inward calcium current, (3) an inward-rectifying potassium 
current, and (4) a voltage-dependent potassium current. A simple linear model of calcium buffering was 
also included. Drouhard and Roberge [1987] improved the Beeler-Reuter model by varying parameters 
and the equations for the rate constants for sodium activation and inactivation in order to match 
experimental results. Luo and Rudy [1991] brought further improvements by including a plateau potas- 
sium current and updating all the current characteristics to match data from whole cell and patch clamp 
experiments. A more detailed model with more accurate descriptions of internal calcium concentration 
changes has also been constructed [Luo and Rudy, 1994]. The main disadvantage of this effort is the 
formulation of the calcium release current: it depends on the time of the maximum rate of depolarization 
and activates with a time delay. This was remedied by the model of Jafri et al [1998] and also by Priebe 
and Beucklemann [1998] for human ventricular cells. 

Noble et al. [1991] constructed a model of the guinea-pig ventricular cell by varying parameters in 
equations for the atrial cell model of Earm and Noble [1990]. Further refinements including better 
handling of the fast and slow components of the delayed rectifier current and feedback of length and 
tension changes on the electrophysiology were incorporated in Nobel et al. [1998]. Nordin [1993] mod- 
ified the DiFrancesco and Noble [1985] equations to model ventricular cells. His model included mem- 
brane calcium and potassium pumps and subcompartments inside the cell for sodium and calcium 
concentrations. 

Amphibian Sinus Venosus and Atrial Cells — Rasmusson et al. [1990] have constructed a detailed 
model of the bullfrog sinus venosus pacemaker cells that includes: (1) a delayed rectifier potassium 
current, (2) a GHK-type calcium current, linear background, (3) sodium and (4) calcium currents, (5) an 
Na/K pump current, (6) an Na/Ca exchanger current, and (7) a calcium pump current. Calcium binding 
to troponin, troponin-Mg, and calmodulin was also included in the model. Furthermore, internal and 
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external sodium, calcium, and potassium were also modeled as time-varying quantities. In addition to 
the features of the sinus venosus cell, the bullfrog atrial cell [Rasmusson et al., 1990] includes a fast 
sodium current and an inward rectifying potassium current. 

11.8 Epithelial Cells 

A detailed model of the slow processes in principal cells of the cortical collecting tubule of the mammalian 
kidney was built by Tang and Othmer [1960] and includes: sodium, potassium, and chloride channels, 
a sodium-potassium pump, a calcium pump, a sodium-potassium-chloride cotransporter, and a sodium- 
calcium exchanger. 

11.9 Smooth Muscle 

Smooth muscle cells, like many other kinds of cells, have significant currents carried by pumps and 
exchangers. A simple model of smooth muscle cell membrane electrical activity was constructed by 
Gonzalez-Fernandez and Ermentrout [1994]. This model has the same structure as the Morris-Lecar 
model of barnacle muscle fibers. Besides model parameter differences, the gating variable m was set to 
its steady-state value of m a and internal calcium was modeled as a time-varying quantity with a simple 
linear model of internal buffering. A detailed model of electrical activity in smooth muscle of the small 
bowel with L- and T-type calcium currents, a calcium dependent potassium current, a delayed rectifier 
potassium current, and a background chloride current has been published by Miftakhov et al. [1996]. 

11.10 Plant Cells 

The electrophysiology of a number of plant cells has been studied and characterized. A plasmalemmal 
proton pump, a hydrogen-chloride symporter, inward and outward rectifying K + currents, and a chloride 
channel were modeled in the equations representing electrical activity in Egeria densa Planchon by 
Buschmann et al [1996]. 



11.11 Simplified Models 



There is a large number of papers investigating the dynamics of cell electrophysiology using simplifications 
of the models listed above. A good reason to use simplified models is that mathematical analysis of such 
models is tractable. 

Hill — The model constructed by Hill [1936] was one of the earliest differential equation models of 
nerve electrical activity. While the behavior of this model was compared with experimental observations 
in animal preparations, the model is a phenomenological one that only reproduced subthreshold 
responses. Hill's main focus was the modeling of accommodation and he modeled this using a time- 
varying "threshold", U. 

The basic model is a linear differential equation with time being the independent variable and voltage, 
V, and threshold voltage, U, being the dependent variables: 

v=^(v-v) + i(t) 

(11.31) 



u=—(u-u)+-(v-v) 



where V and U are some steady-state values of voltage and the threshold function and k, X, and [3 are 
relaxation time-constants (k is a few milliseconds and X and (5 are a few hundred milliseconds) and I(t) 
is some time-varying current source. The above equations can be solved using the variation of constants 
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method and Hill documented the responses of the model to various functions, I (t), and various parameter 
values. Such a model Is only of use if one is not interested in supra-threshold behavior. 

FitzHugh-Nagumo — The FitzHugh-Nagumo equations are also called the Bonhoeffer-Van der Pol 
equations and have been used as a generic system that shows excitability and oscillatory activity. FitzHugh 
[1969] showed that much of the behavior of the HH equations can be reproduced by a system of two 
differential equations: 

v=v-— — U+ lit) 

3 W (11.32) 



U=tylv-bU+a\ 



where a,b, and <|> are positive constants (typical values are: a = 0.7, b = 0.8, and <|> = 0.08. With no input 
current (I = 0) the system has a stable resting state and at I = 0.4 the system exhibits oscillatory activity. 
This model reproduces excitability, threshold phenomena, and repetitive firing. 

Hindmarsh-Rose Models — The first model of Hindmarsh and Rose [1982a] was an attempt to 
improve on the FitzHugh-Nagumo model without increasing the number of state variables. In most 
neurons undergoing repetitive firing, the time between action potentials is usually much greater than 
the duration of action potentials; however, the FHN model has an action potential duration that is 
roughly the same order of magnitude as the inter-spike interval. In addition, FHN does not yield a linear 
current-frequency relationship. These inadequacies were addressed by Hindmarsh and Rose and in their 
model the action potential duration and the inner-spike interval are closer to experimental recordings. 
Their model can be written as: 

V = -alf(v)-y-l\ 

(11.33) 
y=b[g(v)-y) 

where the nonlinearities are defined to be: f(V) = cV 3 +dV 2 + eV + h and g(V) = f(V) - qe rV +s. The 
constants used to model action potentials in snail visceral ganglion neurons were: a = 5400 Mfl/s, b = 
30s-l, c = 0.00017, d = 0.001, e = 0.01, h = 0.1, q = 0.024, r = 0.088, and s = 0.046. 

A second model [Hindmarsh and Rose, 1982b] sought to include the phenomenon of bursting by 
adding a third variable. The model had the following form: 

V = -aV i + bV 2 + y-z+I 

y=c -dV 2 -y (11.34) 



r^V-V,)- 



where a = 1, b = 3, c = 1, d = 5, r = 0.001, s = 1, and setting 1=1 for a short period triggers the bursting 
response. 

Defining Terms 

Ion channel: A protein molecule embedded in the cell membrane. It is thought to have the structure 
of a pipe with obstructions that gate the flow of ions into and out of the cell. 

Ion pump: Uses energy (in the form of ATP molecules) to pump ions against their electrochemical 
gradients. 

Ion exchanger: Uses the potential energy in the electrochemical gradients to pump one ionic species 
into the cell and another species out. 
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Transmembrane potential difference: The potential difference between the inside and the outside of 
a cell manifests itself very close to membrane. In many cases the potential of the external fluid is 
taken to be the reference or ground potential and the transmembrane potential difference is the 
same as the cell potential, otherwise called the membrane potential. 

Selectivity: A property of ion channels where a certain type of channel only allows a specific ionic 
species to pass through it. Some channels are less specific than others and may allow more than 
one ionic species to pass through it. 

Depolarization: A process that is said to occur whenever the transmembrane potential becomes more 
positive than some "resting" potential. 

Repolarization: A process that usually follows depolarization and causes the transmembrane potential 
to return to a polarized state. 

Hyperpolarization: A deviation of the transmembrane potential in the negative direction from a 
"resting" state. 

Action potential: A phenomenon involving temporal changes in the transmembrane potential. Typi- 
cally characterized by a fast depolarization followed by a slower repolarization and sometimes 
hyperpolarization as well. 

Threshold: There is no satisfactory definition or quantitative description of threshold that will work 
for all conditions. A working definition of threshold would be the state the cell has to reach in order 
to produce an action potential. It is usually characterized by the strength of an external stimulus 
required to bring the cell from some initial state to the threshold state. The problem with this 
definition is that the threshold stimulus will vary considerably depending on the initial state of 
the cell: the cell may be in a resting state or it may be undergoing repolarization or depolarization. 

Refractory period: A period of time after an action potential during which the cell is unable to undergo 
another action potential in response to a second stimulus. 

Voltage clamp: The experimental procedure of using active electronic circuits to hold the transmem- 
brane potential difference at a fixed value by pumping current into the cell. 
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Further Information 

The best introduction to ion channels and excitable behavior in cells can be found in Ionic Channels of 
Excitable Membranes by Bertil Hille (Sunderland, MA: Sinauer Associates; 1992). 

In the case of neurons, Principles of Neural Science by Kandel, Schwartz, and lessell (New York: Elsevier; 
1991) is the bible. Detailed discussions of cardiovascular cell membrane phenomena can be found in 
The Heart and Cardiovascular System edited by Fozzard et al. (New York: Raven Press; 1992). 

Mathematical analysis of cell models can be found in Guckenheimer et al. [1997], Bertram et al. [1995], 
and Koch, et al. [1995]. 

Numerical Methods: Most of the models presented in this chapter are systems of first order non- 
linear ordinary differential equations. While the solution of linear ordinary differential equations can be 
written down explicitly, this is usually impossible for nonlinear systems. The only option to investigate 
the behavior of these solutions is to compute approximate solutions using numerical time-integration. 
There are a number of numerical methods with varying degrees of accuracy and they fall into two main 
classes: explicit and implicit methods. The advantage of explicit methods is that they are simpler to 
implement, whereas implict methods require the use of a nonlinear solver which can be extremely 
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complicated. The disadvantage of explicit methods is that the errors of these methods are very sensitive 
to the time-step used and generally very small time steps are required to ensure accuracy; implicit methods 
generally allow much larger time steps. A general guideline would be to try a variable-step explicit method 
such as a Runge-Kutta method with tight tolerances to control the error; if this proves to be too slow, 
an implicit method such as the backward difference formula could also be used. A good source of 
integration packages is http://www.netlib.org. 
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Computer modeling and simulation continue to grow more important to the field of bioengineering. 
The reasons for this growing importance are manyfold. First, mathematical modeling has been shown 
to be a substantial tool for the investigation of complex biophysical phenomena. Second, since the level 
of complexity one can model parallels existing hardware configurations, advances in computer architec- 
ture have made it feasible to apply the computational paradigm to complex biophysical systems. Hence, 
while biologic complexity continues to outstrip the capabilities of even the largest computational systems, 
the computational methodology has taken hold in bioengineering and has been used successfully to 
suggest physiologically and clinically important scenarios and results. 

This chapter provides an overview of numerical techniques that can be applied to a class of bioelectric 
field problems. Bioelectric field problems are found in a wide variety of biomedical applications that 
range from single cells [1], to organs [2], up to models that incorporate partial to full human structures 
[3,4,5]. I describe some general modeling techniques that will be applicable, in part, to all the aforemen- 
tioned problems. I focus this study on a class of bioelectric volume conductor problems that arise in 
electrocardiography and electroencephalography. 

I begin by stating the mathematical formulation for a bioelectric volume conductor, continue by 
describing the model construction process, and follow with sections on numerical solutions and com- 
putational considerations. I concluded with a section on error analysis coupled with a brief introduction 
to adaptive methods. 
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12.1 Problem Formulation 

As noted in Chap. 9, most bioelectric field problems can be formulated in terms of either the Poisson or 
the Laplace equation for electrical conduction. Since Laplace's equation is the homogeneous counterpart 
of the Poisson equation, I will develop the treatment for a general three-dimensional Poisson problem 
and discuss simplifications and special cases when necessary. 

A typical bioelectric volume conductor can be posed as the following boundary value problem: 

V-0V4> = -/ V mD. (12.1) 

where Q is the electrostatic potential, a is the electrical conductivity tensor, and I v is the current per unit 
volume defined within the solution domain, Q. The associated boundary conditions depend on what 
type of problem one wishes to solve. There are generally considered to be two different types of direct 
and inverse volume conductor problems. 

One type of problem deals with the interplay between the description of the bioelectric volume source 
currents and the resulting volume currents and volume and surface voltages. Here, the problem statement 
would be to solve Eq. (12.1) for O with a known description of I v and the Neumann boundary condition: 

oV4>-n = inT T (12.2) 

which says that the normal component of the electric field is zero on the surface interfacing with air 
(here denoted by r r ). This problem can be used to solve two well-known problems in medicine, the 
direct EEG (electroencephalography) and ECG (electrocardiography) volume conductor problems. In 
the direct EEG problem, one usually discretizes the brain and surrounding tissue and skull. One then 
assumes a description of the bioelectric current source within the brain (this usually takes the form of 
dipoles or multipoles) and calculates the field within the brain and on the surface of the scalp. Similarly, 
in one version of the direct EEG problem, one utilizes descriptions of the current sources in the heart 
(either dipoles or membrane current source models such as the FitzHugh-Nagumo and Beeler-Reuter, 
among others) and calculates the currents and voltages within the volume conductor of the chest and 
voltages on the surface of the torso. The inverse problems associated with these direct problems involve 
estimating the current sources I v within the volume conductor from measurements of voltages on the 
surface of either the head or body. Thus one would solve Eq. (12.1) with the boundary conditions 

3> = 3> onEczr r (12.3) 

oV$-h = onT r (12.4) 

The first is the Dirichlet condition, which says that one has a set of discrete measurements of the voltage 
of a subset of the outer surface. The second is the natural Neumann condition. While it does not look 
much different than the formulation of the direct problem, the inverse formulations are ill-posed. The 
bioelectric inverse problem in terms of primary current sources does not have a unique solution, and 
the solution does not depend continuously on the data. Thus, to obtain useful solutions, one must try 
to restrict the solution domain (i.e., number of physiologically plausible solutions) [11] for the former 
case and apply so-called regularization techniques to attempt to restore the continuity of the solution on 
the data in the latter case. 

Another bioelectric direct/inverse formulation poses both the problems in terms of scalar values at 
the surfaces. For the EEG problem, one would take the surface of the brain (cortex) as one bounded 
surface and the surface of the scalp as the other surface. The direct problem would involve making 
measurements of voltage of the surface of the cortex at discrete locations and then calculating the voltages 
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on the surface of the scalp. Similarly, for the ECG problem, voltages could be measured on the surface 
of the heart and used to calculate the voltages at the surface of the torso, as well as within the volume 
conductor of the thorax. To formulate the inverse problems, one uses measurements on the surface of 
the scalp (torso) to calculate the voltages on the surface of the cortex (heart). Here, we solve Laplace's 
equation instead of Poisson's equation, because we are interested in the distributions of voltages on a 
surface instead of current sources within a volume. This leads to the following boundary value problem: 

V-0V<5 = O inQ. (12.5) 

<& = <&„ on£czr r (12.6) 

aV<5n = onr r (12.7) 

For this formulation, the solution to the inverse problem is unique [17]; however, there still exists the 
problem of continuity of the solution on the data. The linear algebra counterpart to the elliptic boundary 
value problem is often useful in discussing this problem of noncontinuity. The numerical solution to all 
elliptic boundary value problems (such as Poisson and Laplace problems) can be formulated in terms of 
a set of liner equations A<1> = b. For the solution of Laplace's equation, the system can be reformulated as 

A3>. =<& f (12.8) 

in out v ' 

where O in is the vector of data on the inner surface bounding the solution domain (the electrostatic 
potentials on the cortex or heart, for example), <l> out is the vector of data that bound the outer surface 
(the subset of voltage values on the surface of the scalp or torso, for example), and A is the transfer matrix 
between <J> out and <J> in , which usually contains the geometry and physical properties (conductivities, 
dielectric constants, etc.) of the volume conductor. The direct problem is then simply (well) posed as 
solving Eq. (12.8) for <D out given O in . Likewise, the inverse problem is to determine <D in given ® out . 

A characteristic of A for ill-posed problems is that it has a very large condition number. In other words, 
the ill-conditioned matrix A is very near to being singular. Briefly, the condition number is defined as 
k(A) = ||A||-||A _1 || or the ratio of maximum to minimum singular values measured in the L 2 norm. The 
ideal problem conditioning occurs for orthogonal matrices which have k(A) ~ 1, while an ill-conditioned 
matrix will have k(A) > 1. When one inverts a matrix that has a very large condition number, the 
inversion process is unstable and is highly susceptible to errors. The condition of a matrix is relative. It 
is related to the precision level of computations and is a function of the size of the problem. For example, 
if the condition number exceeds a linear growth rate with respect to the size of the problem, the problem 
will become increasingly ill-conditioned. See [36] for more about the condition number of matrices. 

A number of techniques have arisen to deal with ill-posed inverse problems. These techniques include 
truncated singular value decomposition (TSVD), generalized singular value decomposition (GSVD), 
maximum entropy, and a number of generalized least squares schemes, including Twomey and Tikhonov 
regularization methods. Since this chapter is concerned more with the numerical techniques for approx- 
imating bioelectric field problems, the reader is referred to [12-15] to further investigate the regularization 
of ill-posed problems. A particularly useful reference for discrete ill-posed problems is the Matlab package 
developed by Per Christian Hansen, which is available via netlib [16]. 

12.2 Model Construction and Mesh Generation 

Once we have stated or derived the mathematical equations that define the physics of the system, we 
must figure out how to solve these equations for the particular domain we are interested in. Most 
numerical methods for solving boundary value problems require that the continuous domain be broken 
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up into discrete elements, the so-called mesh or grid, which one can use to approximate the governing 
equation(s) using the particular numerical technique (finite element, boundary element, finite difference, 
or multigrid) best suited to the problem. 

Because of the complex geometries often associated with bioelectric field problems, construction of 
the polygonal mesh can become one of the most time-consuming aspects of the modeling process. After 
deciding on the particular approximation method to use (and the most appropriate type of element), 
one needs to construct a mesh of the solution domain that matches the number of degrees of freedom 
of the fundamental element. For the sake of simplicity, we will assume that we will use linear elements, 
either tetrahedrons, which are usually used for modeling irregular three-dimensional domains, or hexa- 
hedrons, which are used for modeling regular, uniform domains. 

There are several different strategies for discretizing the geometry into fundamental elements. For 
bioelectric field problems, two approaches to mesh generation have become standard: the divide and 
conquer (or subsequent subdivision) strategy and the Delaunay triangulation strategy. 

In using the divide and conquer strategy, one starts with a set of points that define the bounding 
surface(s) in three dimensions (contours in two dimensions). The volume (surface) is repeatedly divided 
into smaller regions until a satisfactory discretization level has been achieved. Usually, the domain is 
broken up into eight-node cubic elements, which can then be subdivided into five (minimally) or six 
tetrahedral elements if so desired. This methodology has the advantage of being fairly easy to program; 
furthermore, commercial mesh generators exist for the divide and conquer method. For use in solving 
bioelectric field problems, its main disadvantage is that it allows elements to overlap interior boundaries. 
A single element may span two different conductive regions, for example, when part of an element 
represents muscle tissue (which could be anisotropic) and the other part of the element falls into a region 
representing fat tissue. It then becomes very difficult to assign unique conductivity parameters to each 
element and at the same time accurately represent the geometry. 

A second method of mesh generation is the Delaunay triangulation strategy. Given a three-dimensional 
set of points that define the boundaries and interior regions of the domain to be modeled, one tessellates 
the point cloud into an optimal mesh of tetrahedra. For bioelectric field problems, the advantages and 
disadvantages tend to be exactly contrary to those arising from the divide and conquer strategy. The 
primary advantage is that one can create the mesh to fit any predefined geometry, including subsurfaces, 
by starting with points that define all the necessary surfaces and subsurfaces and then adding additional 
interior points to minimize the aspect ratio. For tetrahedra, the aspect ratio can be defined as 4 73/2 
(p k /h k ), where p k denotes the diameter of the sphere circumscribed about the tetrahedron, and h k is the 
maximum distance between two vertices. These formulations yield a value of 1 for an equilateral tetra- 
hedron and a value of for a degenerate (flat) element [18]. The closer to the value of 1, the better. The 
Delaunay criterion is a method for minimizing the occurrence of obtuse angles in the mesh, yielding 
elements that have aspect ratios as close to 1 as possible, given the available point set. While the ideas 
behind Delaunay triangulation are straightforward, the programming is nontrivial and is the primary 
drawback to this method. Fortunately, there exist public domain, two-dimensional versions, including 
one from netlib called sweep2.c from the directory Voronoi, as well as at least one three-dimensional 
package [41]. For more information on mesh generation and various aspects of biomedical modeling, 
see [19-27]. 

12.3 Numerical Methods 

Because of the geometric complexity of and numerous inhomogeneities inherent in anatomic structures 
in physiologic models, solutions of bioelectric field problems are usually tractable (except in the most 
simplified of models) only when one employs a numerical approximation method such as the finite 
difference (FD), the finite element (FE), boundary element (BE), or the multigrid (MG) method to solve 
the governing field equation(s). 
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Approximation Techniques: The Galerkin Method 

The problem posed in Eq. (12.1) can be solved using any of the aforementioned approximation schemes. 
One technique that addresses three of the previously mentioned techniques (FD, FE, and BE) can be 
derived by the Galerkin method. The Galerkin method is one of the most widely used methods for 
discretizing elliptic boundary value problems such as Eq. (12.1) and for treating the spatial portion of 
time-dependent parabolic problems, which are common in models of cardiac wave propagation. While 
the Galerkin technique is not essential to the application of any of the techniques, it provides for a 
unifying bridge between the various numerical methods. To express the problem in a Galerkin form, one 
begins by rewriting Eq. (12.1) as 

A<t> = -I v (12.9) 

where A is the differential operator, A = V-(aV). An equivalent statement of Eq. (12.9) is, find O such 
that (A® + I v , <D) = 0. Here, O is an arbitrary test function, which can be thought of physically as a virtual 
potential field, and the notation (§ lt <|> 2 ) = \ n § 1 (|) 2 dQ. denotes the inner products in L 2 (Q). Applying 
Green's theorem, one can equivalently write 

(ovs>,v<i>)-/— ,<i>\=-(i v ,<i>) (12.10) 

where the notation (§ lt (j) 2 ) = j s ^ <j) 2 dS denotes the inner product on the boundary S. When the Dirichlet, 
<J> = O , and Neumann, oVO-n = 0, boundary conditions are specified on S, one obtains the weak form 
ofEq. (12.1): 

(cvs>, v<i>) = -h, <i>) . (i2.il) 

It is understood that this equation must hold for all test functions O, which must vanish at the boundaries 
where O = O . The Galerkin approximation § to the weak- form solution O inEq. (12.11) can be expressed 

as 

N 

♦(*) = £♦,■¥«(*) ( 12 -12) 

The trial functions \|/,, i = 0, 1, ... , Nform a basis for an N + 1 dimensional space S. One can define 
the Galerkin approximation to be the element § e S that satisfies 



(aV^V ¥ .) = -(/ v)¥ .) (v Vj es) (12.13) 



Since our differential operator A is positive definite and self adjoint [i.e., (A®, O) > a(0, O) > for 
some nonzero positive constant a and (A®, O) = (O, A<1>), respectively], then we can define a space E 
with an inner product defined as (<1>, <!>)£ = (A®, O) = a(0, O) and norm (the so-called energy norm) 
equal to 



P : 
I, We 



f (Vf&J dQ.{ 2 =(®,<f>y (12.14) 
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The solution O of Eq. (12.9) satisfies 

(A*,i|f.) = -(/„, \|f.) (V\|/,.es) (12.15) 

and the approximate Galerkin solution obtained by solving Eq. (12.13) satisfies 

(A|>,Y.) = -(l v ,\|f.) (vy.es) (12.16) 

Subtracting Eq. (12.15) from Eq. (12.16) yields 

(ji((|>-*),Y.) = (<p-<p,\|f.) B = (vy.es) (12.17) 

The difference (|> - O denotes the error between the solution in the finite dimensional space V and the 
N + 1 dimensional space S. Equation (12.17) states that the error is orthogonal to all basis functions 
spanning the space of possible Galerkin solutions. Consequently, the error is orthogonal to all elements 
in S and must therefore be the minimum error. Thus the Galerkin approximation is an orthogonal 
projection of the true solution cD onto the given finite dimensional space of possible approximate 
solutions. Therefore, the Galerkin approximation is the best approximation in the energy space E. Since 
the operator is positive definite, the approximate solution is unique. Assume for a moment that there 
are two solutions, <)>! and (|) 2 , satisfying 

(a^ 1)Vi ) = -(/ v , Vi ) (A|>2» ¥*) = -(',>¥<) K-es) (12.18) 

respectively. Then the difference yields 

(a(^-^ 2 ) )¥ .) = (Vy.es) (12.19) 

The function arising from subtracting one member from another member in S also belongs in S; hence 
the difference function can be expressed by the set of A orthogonal basis functions spanning S: 

N 

^A4>.(A(y.,y.)) = (V\|/,.e) (12.20) 

When i # j, the terms vanish due to the basis functions being orthogonal with respect to A. Since A is 
positive definite, 

(a^.sOx) i = 0,...,N (12.21) 

Thus A(|); = 0, ! != 0, ... ,N, and by virtue of Eq. (12.20), 8(j) = 0, such that ^ = (|) 2 . The identity contradicts 
the assumption of two distinct Galerkin solutions. This proves the solution is unique [28]. 

The Finite-Difference Method 

Perhaps the most traditional way to solve Eq. (12.1) utilizes the finite-difference approach by discretizing 
the solution domain D. using a grid of uniform hexahedral elements. The coordinates of a typical grid 
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point are x = Ih, y = mh, z= nh(l, m, n = integers), and the value of 0(x, y, z) at a grid point is denoted 
by <!>;,„„. Taylor's theorem can then be used to provide the difference equations. For example, 



f 



3>, 



a* i , , a 2 <& i , , a 3 <& 



<t> + h + -h 



+ -W 



(12.22) 
with similar equations for 3> Mimj „, Q>i im+ i, n , $>i,m-\, «,...• The finite-difference representation of Eq. (12.1) is 



<*V, -2<5, + <&, , <5, ^ -2<5, + <&, , 

f+l,m,H l,m,n l—\,m,n _, l,m+l,n l,m,n l,m—\,n 

h 2 ~ h 2 

<5, J .,-2<I>, + <&, , / \ 

, l,m,n+\ l,m,n l,m,n—\ j i 1 

,2 ~ /,m,«^ j 



(12.23) 



or, equivalently, 



3>, 



i,n l-l 



m,w t,tn+l,n l,m—\,n l,m,n+\ l,m,n—\ 



-64>, 



-^ 2 /, 



,,„W ( 12 -24) 



If one defines the vector cD to be [O, 



O, 



O, 



^N-l,N-l,N-l 



] T to designate the (N - l) 3 



unknown grid values and pull out all the known information from (24), one can reformulate Eq. (12.1) 
by its finite-difference approximation in the form of the matrix equation A<D = b, where b is a vector 
that contains the sources and modifications due to the Dirichlet boundary condition. 

Unlike the traditional Taylor's series expansion method, the Galerkin approach utilizes basis functions, 
such as linear piecewise polynomials, to approximate the true solution. For example, the Galerkin 
approximation to sample problem (12.1) would require evaluating Eq. (12.13) for the specific grid 
formation and specific choice of basis function: 



1 



dx dx 



3d) d\u 

a — -— — + o z 
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-f 'vf, 
Ja 



dQ. 



(12.25) 



Difference quotients are then used to approximate the derivatives in Eq. (12.25). Note that if linear basis 
functions are utilized in Eq. (12.25), one obtains a formulation that corresponds exactly with the standard 
finite-difference operator. Regardless of the difference scheme or order of basis function, the approxima- 
tion results in a linear system of equations of the form AO = b, subject to the appropriate boundary 
conditions. 

The Finite-Element Method 

As seen earlier, in the classic numerical treatment for partial differential equation — the finite-difference 
method — the solution domain is approximated by a grid of uniformly spaced nodes. At each node, the 
governing differential equation is approximated by an algebraic expression that references adjacent grid 
points. A system of equations is obtained by evaluating the previous algebraic approximations for each 
node in the domain. Finally, the system is solved for each value of the dependent variable at each node. 
In the finite-element method, the solution domain can be discretized into a number of uniform or 
nonuniform finite elements that are connected via nodes. The change of the dependent variable with 
regard to location is approximated within each element by an interpolation function. The interpolation 
function is defined relative to the values of the variable at the nodes associated with each element. The 
original boundary value problem is then replaced with an equivalent integral formulation [such as 



e 2000 by CRC Press LLC 



Eq. (12.13)]. The interpolation functions are then substituted into the integral equation, integrated, and 
combined with the results from all other elements in the solution domain. The results of this procedure 
can be reformulated into a matrix equation of the form A® = b, which is subsequently solved for the 
unknown variables [20, 29]. 

The formulation of the finite-element approximation starts with the Galerkin approximation, (aV<t>, 
V<1>) = -(/„, <!>), where <D is our test function. Now one can use the finite-element method to turn the 
continuous problems into a discrete formulation. First, one discretizes the solution domain, 
f2 = Uj = if2 e , and defines a finite dimensional subspace V h a V = {<!>: <1> is continuous on Q, V® is piece- 
wise continuous on Q}. One usually defines parameters of the function ® e V h at node points a, = ®(x,), 
i = 0, 1, ... , N. If one now defines the basis functions \|/ ; e V h as linear continuous piecewise functions 
that take the value 1 at node points and zero at other node points, then one can represent the 
function ® e V h as 

N 

H x )=Y, d ' Wi ( x ) (12 - 26) 

such that each ® e V h can be written in a unique way as a linear combination of the basis functions 
VPj e V h . Now the finite-element approximation of the original boundary value problem can be stated as 

Find® h eV h such that (oV® h ,VG>) = -(l v ,®\ (12.27) 

Furthermore, if®,, e V h satisfies problem (12.27), then we have (ctV® a , VF;) = -(/„,¥,) [30]. Finally, 
since ® A itself can be expressed as the linear combination 

N 

*»=5^(*) $<=*»(*.■) (i2 - 28) 

j=0 

one can then write problem (12.27) as 

N 

£^(o f V»P. > V'P.) = -(l y ,«P.) j = 0,...,N (12.29) 

i=0 

subject to the Dirichlet boundary condition. Then the finite-element approximation of Eq. (12.1) can 
equivalently be expressed as a system of N equations with N unknowns £,-, ..., t, N (the electrostatic 
potentials, for example). In matrix form, the preceding system can be written as At, = b, where A = (a,y) 
is called the global stiffness matrix and has elements (a„) = (du VT;, V^F •), while b i = -(/„, *¥ t ) and is 
usually termed the load vector. 

For volume conductor problems, A contains all the geometry and conductivity information of the 
model. The matrix A is symmetric and positive definite; thus it is nonsingular and has a unique solution. 
Because the basis function differs from zero for only a few intervals, A is sparse (only a few of its entries 
are nonzero). 

Application of the FE Method for 3D Domains 

Now let us illustrate the concepts of the finite-element method by considering the solution of Eq. (12.1) 
using linear three-dimensional elements. One starts with a 3D domain D. that represents the geometry 
of our volume conductor and breaks it up into discrete elements to form a finite dimensional subspace 
D. h . For 3D domains, one has the choice of representing the function as either tetrahedra 
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4> = a 1 + a 2 x+a 3 y+oc 4 z, 



(12.30) 



or hexahedra 



4> = (Xj + a 2 x + 0C 3 y + oc 4 z + a 5 xy+ OL 6 yz + a 7 xz + a s xyz 



(12.31) 



Because of space limitations, let us restrict the development to tetrahedra, knowing that it is easy to 
modify the formulas for hexahedra. Take out a specific tetrahedra from the finite dimensional subspace 
and apply the previous formulations for the four vertices: 






1 x i 7i 

1 x 2 yi z i 

1 x 3 y 3 z, 
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«1 

a, 



a s 



yv a 4y 



(12.32) 



<& =Ca 



(12.33) 



which define the coordinate vertices, and 



ot = C l <5> 



(12.34) 



which defines the coefficients. From Eqs. (12.30) and (12.34) one can express O at any point within the 
tetrahedra, 



or, most succinctly, 



4> = [l, x, y, zlcc = Sa = SC -1 ^ 



(12.35) 



(12.36) 



(D ,is the solution value at node i, and N = SCr 1 is the local shape function or basis function. This can be 
expressed in a variety of ways in the literature (depending, usually, on whether you are reading engineering 
or mathematical treatments of finite element analysis): 



where 
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defines the volume of the tetrahedra V. 



(12.37) 



(12.38) 
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Now that a suitable set of basis functions is available, one can find the finite-element approximation 
to the 3D problem. The original problem can be formulated as 



aiu, v) = ll v , v) Vve£2 



where 



and 



alM, v)= I Vm- 



VvdQ. 



The finite-element approximation to the original boundary value problem is 

a \ u h' v ) = \ 1 v' v ) Vv£ ^ 

which has the equivalent form 






where 



4* J >*j)=«( , M N J ,, M N ')) 

which can be expressed by the matrix and vector elements. 



w-I 



dN. dN dN dN dN. dN 

1_ )_+ l L + L L 

n | dx dx dy dy dz dz 



dQ. 



and 



7 = f Nl 



I = I NX. dQ. 



(12.39) 



(12.40) 



(12.41) 



(12.42) 



(12.43) 



(12.44) 



(12.45) 



(12.46) 



Fortunately, these quantities are easy to evaluate for linear tetrahedra. As a matter of fact, there are closed- 
form solutions for the matrix elements (a,;): 



f N?N b 2 NlN d 4 dQ = 6V 1 a - Udd - . 

Ja h ' (a+b+c+d+3y. 



(12.47) 



Therefore, 



/ \ f bb - + c-c + dd 
1 vl k 6V 2 



h+cc + dd 
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(12.48) 
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and, for the right hand side, one has, assuming constant sources, 



f a. + b.x+ cy+ d.z 
~J 6V v 



cKl- 



VI 



(12.49) 



which have the compact forms 



«>.< = — I j»'W + eW + dW + d 1 " 



and 



(12.50) 



VI 



for constant sources 



(12.51) 



Now one adds up all the contributions from each element into a global matrix and global vector: 



mm 



(12.52) 



where N e] is equal to the total number of elements in the discretized solution domain, and i represents 
the node numbers (vertices). This yields a linear system of equations of the form A® = b, where cD is 
the solution vector of voltages, A represents the geometry and conductivity of the volume conductor, 
and b represents the contributions from the current sources and boundary conditions. 

For the finite-difference method, it turns out that the Dirichlet boundary condition is easy to apply, 
while the Neumann condition takes a little extra effort. For the finite-element method, it is just the 
opposite. The Neumann boundary condition 



V<5 • n = 



(12.53) 



is satisfied automatically within the Galerkin and variational formulations. This can be seen by using 
Green's divergence theorem, 



[v -Adx= f An <iS 
and applying it to the left-hand side of the Galerkin finite-element formulation: 



(12.54) 
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If one multiples the original differential equation, V 2 ® = -I v , by an arbitrary test function and integrates, 
one obtains 

(/,v) = -f (v 2 <$>)vdD. = -[ — vdS+[ V<S>VvdD. = a(<S>, v) (12.56) 

^ " ' Ja ^ ' Jr dn Ja v ' 

where the boundary integral term BO/dn vanishes, and one obtains the standard Galerkin finite-element 
formulation. 

To apply the Dirichlet condition, one has to work a bit harder. To apply the Dirichlet boundary 
condition directly, one usually modifies the (a, •) matrix and b i vector such that one can use standard 
linear system solvers. This is accomplished by implementing the following steps. Assuming that the rth 
value of m, is known, 

1. Subtract from the rth member of the right-hand side the product of a tj and the known value of 
CD; (call it (£>); this yields the new right-hand side, bi j = b t - a t S> . 

2. Zero the rth row and column of A: a^ = a , = 0. 

3. Assign d u = 1. 

4. Set the ;th member of the right-hand side equal to <!>;. 

5. Continue for each Dirichlet condition. 

6. Solve the augmented system A® = b v . 

The Boundary-Element Method 

The bioelectric field problems with isotropic domains (and few inhomogeneities), another technique, 
called the boundary -element method, may be used. This technique utilizes information only on the 
boundaries of interest and thus reduces the dimension of any field problem by one. For differential 
operators, the response at any given point to sources and boundary conditions depends only on the 
response at neighboring points. The FD and FE methods approximate differential operators defined on 
subregions (volume elements) in the domain; hence direct mutual influence (connectivity) exists only 
between neighboring elements, and the coefficient matrices generated by these methods have relatively 
few nonzero coefficients in any given matrix row. As is demonstrated by Maxwell's laws, equations in 
differential forms often can be replaced by equations in integral forms; e.g., the potential distribution in 
a domain is uniquely defined by the volume sources and the potential and current density on the 
boundary. The boundary-element method uses this fact by transforming the differential operators defined 
in the domain to integral operators defined on the boundary. In the boundary-element method [31-33], 
only the boundary is discretized; hence the mesh generation is considerably simpler for this method than 
for the volume methods. Boundary solutions are obtained directly by solving the set of linear equations; 
however, potentials and gradients in the domain can be evaluated only after the boundary solutions have 
been obtained. Since this method has a rich history in bioelectric field problems, the reader is referred 
to some of the classic references for further information regarding the application of the BE method to 
bioelectric field problems [6, 42-44]. 

Solution Methods and Computational Considerations 

Application of each of the previous approximation methods to Eq. (12.1) yields a system of linear 
equations of the form A<1> = b, which must be solved to obtain the final solution. There is a plethora of 
available techniques for the solutions of such systems. The solution techniques can be broadly categorized 
as direct and iterative solvers. Direct solvers include Gaussian elimination and LU decomposition, while 
iterative methods include Jacobi, Gauss-Seidel, successive overrelaxation (SOR), and conjugate gradient 
(CG) methods, among others. The choice of the particular solution method is highly dependent on the 
approximation technique employed to obtain the linear system, on the size of the resulting system, and 



© 2000 by CRC Press LLC 



on accessible computational resources. For example, the linear system resulting from the application of 
the FD or FE method will yield a matrix A that is symmetric, positive definite, and sparse. The matrix 
resulting from the FD method will have a specific band-diagonal structure that is dependent on the order 
of difference equations one uses to approximate the governing equation. The matrix resulting from the 
FE method will be exceedingly sparse and only a few of the off-diagonal elements will be nonzero. The 
application of the BE method, on the other hand, will yield a matrix A that is dense and nonsymmetric 
and thus requires a different choice of solver. 

The choice of the optimal solver is further complicated by the size of the system versus access to 
computational resources. Sequential direct methods are usually confined to single workstations, and thus 
the size of the system should fit in memory for optimal performance. Sequential iterative methods can 
be employed when the size of the system exceeds the memory of the machine; however, one pays a price 
in terms of performance, since direct methods are usually much faster than iterative methods. In many 
cases, the size of the system exceeds the computational capability of a single workstation, and one must 
resort to the use of clusters of workstations and/or parallel computers. 

While new and improved methods continue to appear in the numerical analysis literature, my studies 
comparing various solution techniques for direct and inverse bioelectric field problems have resulted in 
the conclusion that the preconditioned conjugate gradient methods and multigrid methods are the best 
overall performers for volume conductor problems computed on single workstations. Specifically, the 
incomplete Choleski conjugate gradient (ICCG) method works well for the FE method, 1 and the precon- 
ditioned biconjugate gradient (BCG) methods are often used for BE methods. When clusters of worksta- 
tions and/or parallel architectures are considered, the choice is less clear. For use with some high- 
performance architectures that contain large amounts of memory, parallel direct methods such as LU 
decomposition become attractive; however, preconditioned conjugate gradient methods still perform well. 

A discussion of parallel computing methods for the solution of biomedical field problems could fill 
an entire text. Thus the reader is directed to the following references on parallel scientific computing 
[45-47]. 

Comparison of Methods 

Since there is not enough space to give a detailed, quantitative description of each of the previously 
mentioned methods, an abbreviated summary is given of the applicability of each method in solving 
different types of bioelectric field problems. 

As outlined earlier, the FD, FE, and BE methods can all be used to approximate the boundary value 
problems that arise in biomedical research problems. The choice depends on the nature of the problem. 
The FE and FD methods are similar in that the entire solution domain must be discretized, while with 
the BE method only the bounding surfaces must be discretized. For regular domains, the FD method is 
generally the easiest method to code and implement, but the FD method usually requires special mod- 
ifications to define irregular boundaries, abrupt changes in material properties, and complex boundary 
conditions. While typically more difficult to implement, the BE and FE methods are preferred for 
problems with irregular, inhomogeneous domains and mixed boundary conditions. The FE method is 
superior to the BE method for representing nonlinearity and true anisotropy, while the BE method is 
superior to the FE method for problems where only the boundary solution is of interest or where solutions 
are wanted in a set of highly irregularly spaced points in the domain. Because the computational mesh 
is simpler for the BE method than for the FE method, the BE program requires less bookkeeping than 
an FE program. For this reason, BE programs are often considered easier to develop than FE programs; 
however, the difficulties associated with singular integrals in the BE method are often highly underesti- 
mated. In general, the FE method is preferred for problems where the domain is highly heterogeneous, 
whereas the BE method is preferred for highly homogeneous domains. 



'This is specifically for the FE method applied to elliptic problems. Such problems yield a matrix that is symmetric 
and positive definite. The Choleski decomposition only exists for symmetric, positive-definite matrices. 
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12.4 Adaptive Methods 

Thus far how one formulates the problem, discretizes the geometry, and finds an approximate solution 
have been discussed. Now one is faced with answering the difficult question pertaining to the accuracy 
of the solution. Without reference to experimental data, how can one judge the validity of the solutions? 
To give oneself an intuitive feel for the problem (and possible solution), consider the approximation of 
a two-dimensional region discretized into triangular elements. The finite-element method will be applied 
to solve Laplace's equation in the region. 

First, consider the approximation of the potential field 0(x, y) by a two-dimensional Taylor's series 
expansion about a point (x, y): 



<&\x+h, y+k\ = $>\x, y\ + 
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d 2 y 



(12.57) 



where h and k are the maximum x and y distances within an element. Using the first two terms (up to 
first-order terms) in the preceding Taylor's expansion, one can obtain the standard linear interpolation 
function for a triangle: 
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(12.58) 



where A is the area of the triangle. Likewise, one could calculate the interpolant for the other two nodes 
and discover that 



d*(*,-» y) = d<i, ( x p y] = d ®( x m > y m ) 

dx dx dx 



(12.59) 



is constant over the triangle (and thus so is the gradient in y as well). Thus one can derive the standard 
linear interpolation formulas on a triangle that represents the first two terms of the Taylor's series 
expansion. This means that the error due to discretization (from using linear elements) is proportional 
to the third term of the Taylor's expansion: 
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(12.60) 



where O is the exact solution. One can conjecture, then, that the error due to discretization for first- 
order linear elements is proportional to the second derivative. If <1> is a linear function over the element, 
then the first derivative is a constant and the second derivative is zero, and there is no error due to 
discretization. This implies that the gradient must be constant over each element. If the function is not 
linear or the gradient is not constant over an element, the second derivative will not be zero and is 
proportional to the error incurred due to "improper" discretization. Examining Eq. (12.60), one can 
easily see that one way to decrease the error is to decrease the size of h and k. As h and k go to zero, the 
error tends to zero as well. Thus decreasing the mesh size in places of high errors due to high gradients 
decreases the error. As an aside, note that if one divided Eq. (12.9) by hk, one can also express the error 
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in terms of the elemental aspect ratio h/k, which is a measure of the relative shape of the element. It is 
easy to see that one must be careful to maintain an aspect ratio as close to unity as possible. 

The problem with the preceding heuristic argument is that one has to know the exact solution a priori 
before one can estimate the error. This is certainly a drawback considering that one is trying to accurately 
approximate O. 

Convergence of a Sequence of Approximate Solutions 

Let's try to quantify the error a bit further. When one considers the preceding example, it seems to make 
sense that if one increases the number of degrees of freedom used to approximate the function, the accuracy 
must approach the true solution. That is, one would hope that the sequence of approximate solutions will 
converge to the exact solution as the number of degrees of freedom (DOF) increases indefinitely: 



<P 
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asn^M (12.61) 



This is a statement of pointwise convergence. It describes the approximate solution as approaching arbi- 
trarily close to the exact solution at each point in the domain as the number of DOF increases. 

Measures of convergence often depend on how the closeness of measuring the distance between 
functions is defined. Another common description of measuring convergence is uniform convergence, 
which requires that the maximum value of ||0(x) - ®„(x)|| in the domain vanish as N — > oo. This is 
stronger than pointwise convergence because it requires a uniform rate of convergence at every point in 
the domain. Two other commonly used measures are convergence in energy and convergence in mean, 
which involve measuring an average of a function of the pointwise error over the domain [40]. 

In general, proving pointwise convergence is very difficult except in the simplest cases, while proving 
the convergence of an averaged value, such as energy, is often easier. Of course, scientists and engineers 
are often much more interested in ensuring that their answers are accurate in a pointwise sense than in 
an energy sense because they typically want to know values of the solution <D(x) and gradients V<D(x) 
at specific places. 

One intermediate form of convergence is called the Cauchy convergence. Here, one requires the 
sequences of two different approximate solutions to approach arbitrarily close to each other: 



<P 



ix)-<&ix) ->0 asm,«^oo (12.62) 



While the pointwise convergence expression would imply the preceding equation, it is important to note 
that the Cauchy convergence does not imply pointwise convergence, since the functions could converge 
to an answer other than the true solution. 

While one cannot be assured of pointwise convergence of these functions for all but the simplest cases, 
there do exist theorems that ensure that a sequence of approximate solutions must converge to the exact 
solution (assuming no computational errors) if the basis functions satisfy certain conditions. The theo- 
rems can only ensure convergence in an average sense over the entire domain, but it is usually the case 
that if the solution converges in an average sense (energy, etc.), then it will converge in the pointwise 
sense as well. 

Energy Norms 

The error in energy, measured by the energy norm, is defined in general as [37-39] 



\ e T LedQ\ (12.63) 
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where e = <t>(x) — <b„(x), and L is the differential operator for the governing differential equation [i.e., it 
contains the derivatives operating on ©(x) and any function multiplying ©(x)]. For physical problems, 
this is often associated with the energy density. 

Another common measure of convergence utilizes the L 2 norm. This can be termed the average error 
and can be associated with errors in any quantity. The L 2 norm is defined as 



1/ 



edQ.\ (12.64) 



While the norms given above are defined on the whole domain, one can note that the square of each 
can be obtained by summing element contributions: 



M 

yyi (12.65) 

i=\ 



where i represents an element contribution and m the total element number. Often for an optimal finite- 
element mesh, one tries to make the contributions to this square of the norm equal for all elements. 

While the absolute values given by the energy or L 2 norms have little value, one can construct a relative 
percentage error that can be more readily interpreted: 



XlOO (12.66) 



This quantity, in effect, represents a weighted RMS error. The analysis can be determined for the whole 
domain or for element subdomains. One can use it in an adaptive algorithm by checking element errors 
against some predefined tolerance r\ a and increasing the DOF only of those areas above the predefined 
tolerance. 

Two other methods, the p and the hp methods, have been found, in most cases, to converge faster 
than the h method. The p method of refinement requires that one increase the order of the bias function 
that was used to represent the interpolation (i.e., linear to quadratic to cubic, etc.) The hp method is a 
combination of the h and p methods and has recently been shown to converge the fastest of the three 
methods (but, as you might imagine, it is the hardest to implement). To find out more about adaptive 
refinement methods, see [27, 30, 34, 35, 37, 40]. 
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The electrocardiogram (ECG) is the recording on the body surface of the electrical activity generated by 
heart. It was originally observed by Waller in 1889 [1] using his pet bulldog as the signal source and the 
capillary electrometer as the recording device. In 1903, Einthoven [2] enhanced the technology by 
employing the string galvanometer as the recording device and using human subjects with a variety of 
cardiac abnormalities. Einthoven is chiefly responsible for introducing some concepts still in use today, 
including the labeling of the various waves, defining some of the standard recording sites using the arms 
and legs, and developing the first theoretical construct whereby the heart is modeled as a single time- 
varying dipole. We also owe the EKG acronym to Einthoven's native Dutch language, where the root 
word cardio is spelled with a k. 

In order to record an ECG waveform, a differential recording between two points on the body are 
made. Traditionally, each differential recording is referred to as a lead. Einthoven defined three leads 
numbered with the Roman numerals I, II, and III. They are defined as 

ni=v u .-v IA 

where RA = right arm, LA = left arm, and LL = left leg. Because the body is assumed to be purely resistive 
at ECG frequencies, the four limbs can be thought of as wires attached to the torso. Hence lead I could 
be recorded from the respective shoulders without a loss of cardiac information. Note that these are not 
independent, and the following relationship holds: II = I + III. 

The evolution of the ECG proceeded for 30 years when F.N. Wilson added concepts of a "unipolar" 
recording [3] . He created a reference point by tying the three limbs together and averaging their potentials 
so that individual recording sites on the limbs or chest surface would be differentially recorded with the 
same reference point. Wilson extended the biophysical models to include the concept of the cardiac source 
enclosed within the volume conductor of the body. He erroneously thought that the central terminal 
was a true zero potential. However, from the mid- 1930s until today, the 12 leads composed of the 3 limb 
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FIGURE 13.1 The 12-lead ECG is formed by the 3 bipolar surface leads: I, II, and III; the augmented Wilson 
terminal referenced limb leads: aV R , aV L , and aV F ; and the Wilson terminal referenced chest leads: V p V 2 , V 3 , V 4 , V 5 , 
andV 6 . 

leads, 3 leads in which the limb potentials are referenced to a modified Wilson terminal (the augmented 
leads [4]), and 6 leads placed across the front of the chest and referenced to the Wilson terminal form 
the basis of the standard 12-lead ECG. Figure 13.1 summarizes the 12-lead set. These sites are historically 
based, have a built-in redundancy, and are not optimal for all cardiac events. The voltage difference from 
any two sites will record an ECG, but it is these standardized sites with the massive 90-year collection of 
empirical observations that have firmly established their role as the standard. Figure 13.2 is a typical or 
stylized ECG recording from lead II. Einthovin chose the letters of the alphabet from P to U to label the 
waves and to avoid conflict with other physiologic waves being studied at the turn of the century. The 
ECG signals are typically in the range of ±2 mV and require a recording bandwidth of 0.05 to 150 Hz. 
Full technical specification for ECG equipment has been proposed by both the American Heart Associ- 
ation [5] and the Association for the Advancement of Medical Instrumentation [6]. 

There have been several attempts to change the approach for recording the ECG. The vectorcardiogram 
used a weighted set of recording sites to form an orthogonal xyz lead set. The advantage here was 
minimum lead set, but in practice it gained only a moderate degree of enthusiasm among physicians. 



© 2000 by CRC Press LLC 




FIGURE 13.2 This is a stylized version of a normal lead II recording showing the P wave, QRS complex, and the 
T and U waves. The PR interval and the ST segment are significant time windows. The peak amplitude of the QRS 
is about 1 mV. The vertical scale is usually 1 mV/cm. The time scale is usually based on millimeters per second scales, 
with 25 mm/s being the standard form. The small boxes of the ECG are lxl mm. 

Body surface mapping refers to the use of many recording sites (>64) arranged on the body so that 
isopotential surfaces could be computed and analyzed over time. This approach still has a role in research 
investigations. Other subsets of the 12-lead ECG are used in limited-mode recording situations such as 
the tape-recorded ambulatory ECG (usually 2 leads) or in intensive care monitoring at the bedside 
(usually 1 or 2 leads) or telemetered within regions of the hospital from patients who are not confined 
to bed (1 lead). The recording electronics of these ECG systems have followed the typical evolution of 
modern instrumentation, e.g., vacuum tubes, transistors, integrated chips, and microprocessors. 

Application of computers to the ECG for machine interpretation was one of the earliest uses of 
computers in medicine [7]. Of primary interest in the computer-based systems was replacement of the 
human reader and elucidation of the standard waves and intervals. Originally this was performed by 
linking the ECG machine to a centralized computer via phone lines. The modern ECG machine is 
completely integrated with an analog front end, a 12- to 16-bit analog-to-digital (A/D) converter, a 
computational microprocessor, and dedicated input-output (I/O) processors. These systems compute a 
measurement matrix derived from the 12 lead signals and analyze this matrix with a set of rules to obtain 
the final set of interpretive statements [8]. Figure 13.3 shows the ECG of a heartbeat and the types of 
measurements that might be made on each of the component waves of the ECG and used for classifying 
each beat type and the subsequent cardiac rhythm. The depiction of the 12 analog signals and this set 
of interpretive statements form the final output, with an example shown in Fig. 13.4. The physician will 
overread each ECG and modify or correct those statements which are deemed inappropriate. The larger 
hospital-based system will record these corrections and maintain a large database of all ECGs accessible 
by any combination of parameters, e.g., all males, older than age 50, with an inferior myocardial infarction. 

There are hundreds of interpretive statements from which a specific diagnosis is made for each ECG, 
but there are only about five or six major classification groups for which the ECG is used. The first step 
in analyzing an ECG requires determination of the rate and rhythm for the atria and ventricles. Included 
here would be any conduction disturbances either in the relationship between the various chambers or 
within the chambers themselves. Then one would proceed to identify features that would relate to the 
presence or absence of scarring due to a myocardial infarction. There also may be evidence of acute 
events that would occur with ischemia or an evolving myocardial infarction. The ECG has been a primary 
tool for evaluating chamber size or enlargement, but one might argue that more accurate information 
in this area would be supplied by noninvasive imaging technologies. 
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FIGURE 13.3 The ECG depicts numerous measurements that can be made with computer-based algorithms. These 
are primarily durations, amplitudes, and areas. (Courtesy of Hewlett Packard Company, Palo Alto, Calif.) 
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FIGURE 13.4 This is an example of an interpreted 12-lead ECG. A 2-1/2-s recording is shown for each of the 
12 leads. The bottom trace is a continuous 10-s rhythm strip of lead II. Patient information is given in the top area, 
below which is printed the computerized interpretive statements {Courtesy of the Hewlett Packard Company, Palo 
Alto, Calif.) 
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More recently, the high-resolution ECG has been developed, whereby the digitized ECG is signal- 
averaged to reduce random noise [9,10]. This approach, coupled with postaveraging high-pass filtering, 
is used to detect and quantify low-level signals (-1.0 uV) not detectable with standard approaches. This 
computer-based approach has enabled the recording of events that are predictive of future life-threatening 
cardiac events [11,12]. 

13.1 Physiology 

The heart has four chambers; the upper two chambers are called the atria, and the lower two chambers 
are called the ventricles. The atria are thin-walled, low-pressure pumps that receive blood from the venous 
circulation. Located in the top right atrium are a group of cells that act as the primary pacemaker of the 
heart. Through a complex change of ionic concentration across the cell membranes (the current source), 
an extracellular potential field is established which then excites neighboring cells, and a cell-to-cell 
propagation of electrical events occurs. Because the body acts as a purely resistive medium, these potential 
fields extend to the body surface [13]. The character of the body surface waves depends on the amount 
of tissue activating at one time and the relative speed and direction of the activation wavefront. Therefore, 
the pacemaker potentials that are generated by a small tissue mass are not seen on the ECG. As the 
activation wavefront encounters the increased mass of atrial muscle, the initiation of electrical activity 
is observed on the body surface, and the first ECG wave of the cardiac cycle is seen. This is the P wave, 
and it represents activation of the atria. Conduction of the cardiac impulse proceeds from the atria 
through a series of specialized cardiac cells (the A-V node and the His-Purkinje system) which again are 
too small in total mass to generate a signal large enough to be seen on the standard ECG. There is a 
short, relatively isoelectric segment following the P wave. Once the large muscle mass of the ventricles 
is excited, a rapid and large deflection is seen on the body surface. The excitation of the ventricles causes 
them to contract and provides the main force for circulating blood to the organs of the body. This large 
wave appears to have several components. The initial downward deflection is the Q wave, the initial 
upward deflection is the R wave, and the terminal downward deflection is the S wave. The polarity and 
actual presence of these three components depend on the position of the leads on the body as well as a 
multitude of abnormalities that may exist. In general, the large ventricular waveform is generically called 
the QRS complex regardless of its makeup. Following the QRS complex is another short relatively 
isoelectric segment. After this short segment, the ventricles return to their electrical resting state, and a 
wave of repolarization is seen as a low-frequency signal known as the T wave. In some individuals, a 
small peak occurs at the end or after the T wave and is the U wave. Its origin has never been fully 
established, but it is believed to be a repolarization potential. 

13.2 Instrumentation 

The general instrumentation requirements for the ECG have been addressed by professional societies 
through the years [5,6] . Briefly, they recommend a system bandwidth between 0.05 and 150 Hz. Of great 
importance in ECG diagnosis is the low-frequency response of the system, because shifts in some of the 
low-frequency regions, e.g., the ST segment, have critical diagnosis value. While the heart rate may only 
have a 1-Hz fundamental frequency, the phase responses of typical analog high-pass filters are such that 
the system corner frequency must be much smaller than the 3-dB corner frequency where only the 
amplitude response is considered. The system gain depends on the total system design. The typical ECG 
amplitude is ±2 mV, and if A/D conversion is used in a digital system, the enough gain to span the full 
range of the A/D converter is appropriate. 

To first obtain an ECG the patient must be physically connected to the amplifier front end. The patient- 
amplifier interface is formed by a special bioelectrode that converts the ionic current flow of the body 
to the electron flow of the metallic wire. These electrodes typically rely on a chemical paste or gel with 
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a high ionic concentration. This acts as the transducer at the tissue-electrode interface. For short-term 
applications, silver-coated suction electrodes or "sticky" metallic foil electrodes are used. Long-term 
recordings, such as for the monitored patient, require a stable electrode-tissue interface, and special 
adhesive tape material surrounds the gel and an Ag + /Ag + Ch electrode. 

At any given time, the patient may be connected to a variety of devices, e.g., respirator, blood pressure 
monitor, temporary pacemaker, etc., some of which will invade the body and provide a low-resistance 
pathway to the heart. It is essential that the device not act as a current source and inject the patient with 
enough current to stimulate the heart and cause it to fibrillate. Some bias currents are unavoidable for 
the system input stage, and recommendations are that these leakage currents be less than 10 uA per 
device. This applies to the normal setting, but if a fault condition arises whereby the patient comes in 
contact with the high-voltage side of the alternating current (ac) power lines, then the isolation must be 
adequate to prevent 10 pA of fault current as well. This mandates that the ECG reference ground not be 
connected physically to the low side of the ac power line or its third-wire ground. For ECG machines, 
the solution has typically been to AM modulate a medium frequency carrier signal (~400 kHz) and use 
an isolation transformer with subsequent demodulation. Other methods of signal isolation can be used, 
but the primary reason for the isolation is to keep the patient from being part of the ac circuit in the 
case of a patient-to-power-line fault. In addition, with many devices connected in a patient monitoring 
situation, it is possible that ground loop currents will be generated. To obviate this potential hazard, a 
low- impedance ground buss is often installed in these rooms, and each device chassis will have an external 
ground wire connected to the buss. Another unique feature of these amplifiers is that they must be able 
to withstand the high-energy discharge of a cardiac defibrillator. 

Older-style ECG machines recorded one lead at a time and then evolved to three simultaneous leads. 
This necessitated the use of switching circuits as well as analog weighting circuits to generate the various 
12 leads. This is usually eliminated in modern digital systems by using an individual single-ended 
amplifier for each electrode on the body. Each potential signal is then digitally converted, and all the 
ECG leads can be formed mathematically in software. This would necessitate a 9-amplifier system. By 
performing some of the lead calculations with the analog differential amplifiers, this can be reduced to 
an 8-channel system. Thus only the individual chest leads V l through V 6 and any 2 of the limb leads, 
e.g., I and III, are needed to calculate the full 12-lead ECG. Figure 13.5 is a block diagram of a modern 
digital based ECG system. This system uses up to 13 single-ended amplifiers and a 16-bit A/D converter, 
all within a small lead wire manifold or amplifier lead stage. The digital signals are optically isolated and 
sent via a high-speed serial link to the main ECG instrument. Here the 32-bit CPU and DSP chip perform 
all the calculations, and a hard-copy report is generated (Fig. 13.4). Notice that each functional block 
has its own controller and that the system requires a real-time, multitasking operating system to coor- 
dinate all system functions. Concomitant with the data acquisition is the automatic interpretation of the 
ECG. These programs are quite sophisticated and are continually evolving. It is still a medical/legal 
requirement that these ECGs be overread by the physician. 

13.3 Applications 

Besides the standard 12-lead ECG, there are several other uses of ECG recording technology that rely on 
only a few leads. These applications have had a significant clinical and commercial impact. Following 
are brief descriptions of several ECG applications which are aimed at introducing the reader to some of 
the many uses of the ECG. 

The Ambulatory ECG 

The ambulatory or Holter ECG has an interesting history, and its evolution closely followed both technical 
and clinical progress. The original, analog, tape-based, portable ECG resembled a fully loaded backpack 
and was developed by Dr. Holter in the early 1960s [14]. It was soon followed by more compact devices 
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FIGURE 13.5 This is a block diagram of microprocessor-based ECG system. It includes all the elements of a personal 
computer class system, e.g., 80386 processor, 2 Mbytes of RAM, disk drive, 640 x 480 pixel LCD display, and battery 
operable. In addition, it includes a DSP56001 chip and multiple controllers which are managed with a real-time 
multitasking operating system. (Courtesy of the Hewlett Packard Company, Palo Alto, Calif.) 

that could be worn on the belt. The original large-scale clinical use of this technology was to identify 
patients who developed heart block transiently and could be treated by implanting a cardiac pacemaker. 
This required the secondary development of a device that could rapidly play back the 24 hours of tape- 
recorded ECG signals and present to the technician or physician a means of identifying periods of time 
where the patient's heart rate became abnormally low. The scanners had the circuitry not only to play 
back the ECG at speeds 30 to 60 times real time but also to detect the beats and display them in a 
superimposed mode on a CRT screen. In addition, an audible tachometer could be used to identify the 
periods of low heart rate. With this playback capability came numerous other observations, such as the 
identification of premature ventricular complexes (PVCs), which lead to the development of techniques 
to identify and quantify their number. Together with the development of antiarrhythmic drugs, a marriage 
was formed between pharmaceutical therapy and the diagnostic tool for quantifying PVCs. ECG tapes 
were recorded before and after drug administration, and drug efficacy was measured by the reduction 
of the number of PVCs. The scanner technology for detecting and quantifying these arrhythmias was 
originally implemented with analog hardware but soon advanced to computer technology as it became 
economically feasible. Very sophisticated algorithms were developed based on pattern-recognition tech- 
niques and were sometimes implemented with high-speed specialized numerical processors as the tape 
playback speeds became several hundred times real time [15]. Unfortunately, this approach using the 
ambulatory ECG for identifying and treating cardiac arrhythmias has been on the decline because the 
rationale of PVC suppression was found to be unsuccessful for improving cardiac mortality. However, 
the ambulatory ECG is still a widely used diagnostic tool, and modern units often have built-in micro- 
processors with considerable amounts of random access memory and even small disk drives with capac- 
ities greater than 400 Mbytes. Here the data can be analyzed on-line, with large segments of data selected 
for storage and later analysis with personal computer-based programs. 
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Patient Monitoring 

The techniques for monitoring the ECG in real time were developed in conjunction with the concept of 
the coronary care unit (CCU). Patients were placed in these specialized hospital units to carefully observe 
their progress during an acute illness such as a myocardial infarction or after complex surgical procedures. 
As the number of beds increased in these units, it became clear that the highly trained medical staff could 
not continually watch a monitor screen, and computerized techniques were added that monitored the 
patient's rhythm. These programs were not unlike those developed for the ambulatory ECG, and the 
high-speed numerical capability of the computer was not taxed by monitoring a single ECG. The typical 
CCU would have 8 to 16 beds, and hence the computing power was taken to its limit by monitoring 
multiple beds. The modern units have the CPU distributed within the ECG module at the bedside, along 
with modules for measuring many other physiologic parameters. Each bedside monitor would be inter- 
connected with a high-speed digital line, e.g., Ethernet, to a centralized computer used primarily to 
control communications and maintain a patient database. 

High-Resolution ECG 

High-resolution (HR) capability is now a standard feature on most digitally based ECG systems or as a 
stand-alone microprocessor-based unit [16]. The most common application of the HRECG is to record 
very low level (~1.0-liV) signals that occur after the QRS complex but are not evident on the standard 
ECG. These "late potentials" are generated from abnormal regions of the ventricles and have been strongly 
associated with the substrate responsible for a life-threatening rapid heart rate (ventricular tachycardia). 
The typical HRECG is derived from 3 bipolar leads configured in an anatomic xyz coordinate system. 
These 3 ECG signals are then digitized at a rate of 1000 to 2000 Hz per channel, time aligned via a real- 
time QRS correlator, and summated in the form of a signal average. Signal averaging will theoretically 
improve the signal-to-noise ratio by the square root of the number of beats averaged. The underlying 
assumptions are that the signals of interest do not vary, on a beat-to-beat basis, and that the noise is 
random. Figure 13.6 has four panels depicting the most common sequence for processing the HRECG 
to measure the late potentials. Panel a depicts a 3-second recording of the xyz leads close to normal 
resolution. Panel b was obtained after averaging 200 beats and with a sampling frequency of 10 times 
that shown in panel a. The gain is also 5 times greater. Panel c is the high-pass filtered signal using a 
partially time-reversed digital filter having a second-order Butterworth response and a 3-dB corner 
frequency of 40 Hz [12]. Note the appearance of the signals at the terminal portion of the QRS complex. 
A common method of analysis, but necessarily optimal, is to combine the filtered xyz leads into a vector 
magnitude, that is, (X 2 + Y 2 + Z 2 ) 1 ' 2 . This waveform is shown in panel d. From this waveform, several 
parameters have been derived such as total QRS duration (including late potentials), the RMS voltage 
value of the terminal 40 ms, and the low-amplitude signal (LAS) duration from the 40-u.V level to the 
end of the late potentials. Abnormal values for these parameters are used to identify patients at high risk 
of ventricular tachycardia following a heart attack. 

13.4 Conclusions 

The ECG is one of the oldest instrument-bound measurements in medicine. It has faithfully followed 
the progression of instrumentation technology. Its most recent evolutionary step, to the microprocessor- 
based system, has allowed patients to wear their computer monitor or has provided an enhanced, high- 
resolution ECG that has opened new vistas of ECG analysis and interpretation. 



© 2000 by CRC Press LLC 



19.250 

17.325 

15.400 

13.475 

11.550 

9.625 

7.700 

5.775 

3.850 

1.925 



■v- 



— r 



■v- 



J JL 



■^ «,* ^ ^ <tfP <*» tfj* ^ ^*f* 
Time, ms 



> 
E 




♦ £ <*> <V° S «P N # t>* ^ ^ 
Time, ms 



> 
E 




<£ # # .fl? <# N # ^ TfP ^ # 
ms 



0.256 






- 






0.231 








0.205 




■ 


Duration, 138 ms 


0.179 
> 0.154 






RMS, 20 nV 
LAS 54 ms 


0.128 








0.102 








0.077 
0.051 
0.025 


■ 


67 


/ 


\ 


205 


D 


# 


£ 


* N # s # K # ^ f tf r^ 

ms 



FIGURE 13.6 The signal-processing steps typically performed to obtain a high-resolution ECG are shown in panels 
A through D. See text for a full description. 
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Movement and position of limbs are controlled by electrical signals traveling back and forth between the 
muscles and the peripheral and central nervous system. When pathologic conditions arise in the motor 
system, whether in the spinal cord, the motor neurons, the muscle, or the neuromuscular junctions, the 
characteristics of the electrical signals in the muscle change. Careful registration and study of electrical 
signals in muscle (electromyograms) can thus be a valuable aid in discovering and diagnosing abnor- 
malities not only in the muscles but also in the motor system as a whole. Electromyography (EMG) is 
the registration and interpretation of these muscle action potentials. Until recently, electromyograms 
were recorded primarily for exploratory or diagnostic purposes; however, with the advancement of 
bioelectric technology, electromyograms also have become a fundamental tool in achieving artificial 
control of limb movement, i.e., functional electrical stimulation (FES) and rehabilitation. This chapter 
will focus on the diagnostic application of electromyograms, while FES will be discussed in Chapter 17. 

Since the rise of modern clinical EMG, the technical procedures used in recording and analyzing 
electromyograms have been dictated by the available technology. The concentric needle electrode intro- 
duced by Adrian and Bronk in 1929 provided an easy-to-use electrode with high mechanical qualities 
and stable, reproducible measurements. Replacement of galvanometers with high-gain amplifiers allowed 
smaller electrodes with higher impedances to be used and potentials of smaller amplitudes to be recorded. 
With these technical achievements, clinical EMG soon evolved into a highly specialized field where 
electromyographists with many years of experience read and interpreted long paper EMG records based 
on the visual appearance of the electromyograms. Slowly, a more quantitative approach emerged, where 
features such as potential duration, peak-to-peak amplitude, and number of phases were measured on 
the paper records and compared with a set of normal data gathered from healthy subjects of all ages. In 
the last decade, the general-purpose rack-mounted equipment of the past have been replaced by ergo- 
nomically designed EMG units with integrated computers. Electromyograms are digitized, processed, 
stored on removable media, and displayed on computer monitors with screen layouts that change in 
accordance with the type of recording and analysis chosen by the investigator. 

With this in mind, this chapter provides an introduction to the basic concepts of clinical EMG, a 
review of basic anatomy, the origin of the electromyogram, and some of the main recording procedures 
and signal-analysis techniques in use. 
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FIGURE 14.1 Schematic illustration of different types of muscles: (a) fusiform, (b) unipennate, and (c) bipennate. 

14.1 The Structure and Function of Muscle 

Muscles account for about 40% of the human mass, ranging from the small extraocular muscles that 
turn the eyeball in its socket to the large limb muscles that produce locomotion and control posture. 
The design of muscles varies depending on the range of motion and the force exerted (Fig. 14.1). In the 
most simple arrangement (fusiform), parallel fibers extend the full length of the muscle and attach to 
tendons at both ends. Muscles producing a large force have a more complicated structure in which many 
short muscle fibers attach to a flat tendon that extends over a large fraction of the muscle. This arrange- 
ment (unipennate) increases the cross-sectional area and thus the contractile force of the muscle. When 
muscle fibers fan out from both sides of the tendon, the muscle structure is referred to as bipennate. 

A lipid bilayer (sarcolemma) encloses the muscle fiber and separates the intracellular myoplasma from 
the interstitial fluid. Between neighboring fibers runs a layer of connective tissue, the endomysium, 
composed mainly of collagen and elastin. Bundles of fibers, fascicles, are held together by a thicker layer 
of connective-tissue called the perimysium. The whole muscle is wrapped in a layer of connective tissue 
called the epimysium. The connective tissue is continuous with the tendons attaching the muscle to the 
skeleton. 

In the myoplasma, thin and thick filaments interdigitate and form short, serially connected identical 
units called sarcomeres. Numerous sarcomeres connect end to end, thereby forming longitudinal strands 
of myofibrils that extend the entire length of the muscle fiber. The total shortening of a muscle during 
contraction is the net effect of all sarcomeres shortening in series simultaneously. The individual sar- 
comeres shorten by forming cross-bridges between the thick and thin filaments. The cross-bridges pull 
the filaments toward each other, thereby increasing the amount of longitudinal overlap between the thick 
and thin filaments. The dense matrix of myofibrils is held in place by a structural framework of inter- 
mediate filaments composed of desmin, vimetin, and synemin [Squire, 1986]. 

At the site of the neuromuscular junction, each motor neuron forms collateral sprouts (Fig. 14.2) and 
innervates several muscle fibers distributed almost evenly within an elliptical or circular region ranging 
from 2 to 10 mm in diameter. The motor neuron and the muscle fibers it innervates constitute a functional 
unit, the motor unit. The cross section of muscle occupied by a motor unit is called the motor unit 
territory (MUT). A typical muscle fiber is only innervated at a single point, located within a cross- 
sectional band referred to as the end-plate zone. While the width of the end-plate zone is only a few 
millimeters, the zone itself may extend over a significant part of the muscle. The number of muscle fibers 
per motor neuron (i.e., the innervation ratio) ranges from 3:1 in extrinsic eye muscles where fine-graded 
contraction is required to 120:1 in some limb muscles with coarse movement [Kimura, 1981]. The fibers 
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FIGURE 14.2 Innervation of muscle fibers, (a) Two normal motor units with intermingled muscle fibers. 
(6) Reinnervation of muscle fibers. The second and fourth muscle fibers have lost their motor neuron (2) and 
subsequently have become reinnervated by newly formed sprouts from the motor neuron (1) innervating adjacent 
muscle fibers. Not drawn to scale. 

of one motor unit are intermingled with fibers of other motor units; thus several motor units reside 
within a given cross section. The fibers of the same motor unit are thought to be randomly or evenly 
distributed within the motor unit territory; however, reinnervation of denervated fibers often results in 
the formation of fiber clusters (see Fig. 14.2). 

14.2 The Origin of Electromyograms 

Unlike the myocardium, skeletal muscles do not contain pacemaker cells from which excitations arise 
and spread. Electrical excitation of skeletal muscle is initiated and regulated by the central and peripheral 
nervous systems. Motor neurons carry nerve impulses from the anterior horn cells of the spinal cord to 
the nerve endings, where the axonal action potential triggers the release of the neurotransmitter acetyl- 
choline (Ach) into the narrow clefts separating the sarcolemma from the axon terminals. As Ach binds 
to the sarcolemma, Ach-sensitive sodium channels open, and miniature end-plate potentials arise in the 
sarcolemma. If sufficient Ach is released, the summation of miniature end-plate potentials, i.e., the end- 
plate potential, reaches the excitation threshold, and sarcolemma action potentials propagate in opposite 
directions toward the tendons. As excitation propagates down the fiber, it spreads into a highly branched 
transverse network of tubules (T system) which interpenetrate the myofibrils. The effective radial con- 
duction velocity (~4 cm/s) is about two orders of magnitude slower than the longitudinal conduction 
velocity (2 to 5 m/s). This is due to the fact that the main portion of the total membrane capacitance is 
located in the T system and that the lumen of the T system constitutes a higher electrical resistance than 
the myoplasma. 

The slower tubular conduction velocity implies an increasingly delayed onset of tubular action poten- 
tials toward the center of the fiber relative to that of the sarcolemmal action potential (Fig. 14. 3a). 
However, compared with the time course of the subsequent contraction, the spread of excitation along 
and within the muscle fiber is essentially instantaneous, thereby ensuring simultaneous release of calcium 
from the sarcoplasmic reticulum throughout the entire volume of the muscle. If calcium release were 
restricted to a small longitudinal section of the muscle fiber, only sarcomeres in this region would contract, 
and sarcomeres in the rest of the fiber would stretch accordingly. Similarly, experiments in detubulated 
muscle fibers, i.e., fibers in which the continuation between the sarcolemmal and the tubular membrane 
has been disrupted, have demonstrated that only a thin layer of superficial myofibrils contracts when 
tubular action potentials fail to trigger calcium release deep in the muscle fiber. 

It is well known that the shape of the skeletal muscle action potential differs from that of nerve action 
potentials with regard to the repolarization phase. In skeletal muscle, the falling phase of the action 
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FIGURE 14.3 Simulated sarcolemmal and tubular action potentials of frog sartorius muscle fiber, (a) Temporal 
membrane action potentials calculated in a transverse plane 5 mm from the end-plate zone of a fiber with radius 
fl= 50 )^m. Curve 1: Sarcolemmal action potential; curves 2 to 4: action potentials in tubular membrane patches at 
r = a (2), r = all (3), and r = a/20 (4). (b) Sarcolemmal action potentials for fibers with radius 70 (1), 50 (2), and 
30 (3) urn. The time axes have been expanded and truncated. 

potential is interrupted by a long, slowly decaying potential known as the afterpotential. This late potential 
is caused by two opposing forces, the repolarization force due to the efflux of potassium through the 
sarcolemma and a depolarization force due to an influx of current from the interstitial space into the 
tubular lumen. The latter current is drawn in through the tubular openings by the repolarizing tubular 
membrane. Large muscle fibers have a higher tubular-sarcolemma area ratio; hence the inward surge of 
tubular current increases with fiber size. Figure 14.3fo illustrates this by comparing sarcolemmal action 
potentials for fibers of increasing diameter. The small fiber has only a small amount of tubular membrane; 
hence the sarcolemmal potassium current has sufficient strength to rapidly repolarize the membrane. 
For the large fiber, inward tubular current actually depolarizes the sarcolemma slightly during repolar- 
ization of the tubular membrane system, thereby producing a small hump on the afterpotential. Since 
the hump on the afterpotential is influenced primarily by fiber size, this feature is more typical for large 
frog fibers than for smaller human fibers. Experiments have demonstrated that the sarcolemma action 
potential of detubulated fibers hyperpolarizes in a manner similar to that of a nerve action potential. 

In Fig. 14.4a, the time course of the sarcolemmal current density is compared with that of the current 
passing through the tubular mouth during the time course of the sarcolemmal action potential. The 
positive (outward) peak of the tubular current overlaps in time with the small capacitive sarcolemmal 



E 

I 

S 



-1 



-2 



tubular current 




1 net current source 



n sarcolemma current 



8 10 12 14 
time, ms 



mV 









-4 




1 r = a 


-2 


- 


A r = 2a 







\r = 5a 


■ "^ 




2 


"J 





8 10 12 14 
time, ms 



FIGURE 14.4 Simulated currents and extracellular potentials of frog sartorius muscle fiber (radius a = 50 urn), 
(a) The net fiber current density is the summation of the current density through the sarcolemma and that passing 
the tubular mouth, (b) Extracellular action potentials calculated at increasing radial distances (in units of fiber radius) 
using a bidomain volume conductor model and the net current source in panel a. The time axes have been expanded 
and truncated. 
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FIGURE 14.5 Schematic cross-sectional view of overlapping single-fiber action potential distributions in a normal 
(a) and reinnervated (b) motor unit. Muscle fibers are represented by filled black circles. Concentric rings represent 
axisymmetric isopotential lines of the individual single-fiber potentials at 2 (gray) and 5 radii from the fiber center, 
respectively. Compare with radial decline of extracellular potentials in Fig. 14.4. See text for discussion of simplifying 
assumptions. 

displacement current (initial positive peak) and the negative peak (inward sarcolemmal sodium current). 
As a result, the net current has a much larger positive peak than that of the sarcolemma alone, and the 
negative peak of the net current is only about half that of the sarcolemmal current. The outward 
sarcolemmal potassium current (late positive phase) is almost completely opposed by an antisymmetric 
inward tubular current, i.e., the current drawn into the tubular lumen by the repolarizing T system. The 
combined effect of the sarcolemmal and tubular interaction is a net current with an almost biphasic 
waveform and with similar amplitudes of the positive and negative peaks. 

As the net current source propagates toward the tendon, an extracellular potential field arises and 
follows the action potential complex. At a given field point, this phenomenon is observed as a temporal 
potential waveform (Fig. 14.4fo); however, a more complete picture of the phenomenon is obtained by 
considering the spatial distribution of potentials in the cross section of the motor unit. Figure 14.5 shows 
schematically how the concentric isopotential lines of individual fibers overlap with those of other fibers 
in the same motor unit. In a typical healthy motor unit (Fig. 14.5a), the mean interfiber distance is on 
the order of a few hundred microns. Taking into account the steep radial decline of the potential 
amplitudes illustrated in Fig. 14.4b, it is evident that single-fiber action potential (SFAP) overlapping 
occurs between low-magnitude isopotential lines. Figure 14. 5b illustrates how spatial overlapping between 
SFAPs might look like in a motor unit with extensive fiber grouping. In this case, higher-level isopotential 
lines overlap within the clusters, while regions with no fibers would appear as electrically silent. 

Several factors ignored in Fig. 14.5 need further discussion. All fibers are assumed to be of the same 
size and with identical, perfectly axisymmetric potential distributions. The net single-fiber current source 
is an increasing function of fiber size; thus the magnitude of the potential distribution will vary with 
varying fiber size. Fibers can to a good approximation be considered as constant current sources; hence 
if the resistivity of the muscle tissue increases, e.g., due to increased fiber packing density, the potential 
difference between an observation point and a reference point also will increase. It follows that local 
variations in fiber packing density in the region of an active fiber will destroy the axisymmetric appearance 
of its potential distribution. Muscle fibers are not perfect cylinders, and angular variation in the shape 
of the sarcolemma must be expected to create angular variations in the potential distribution. However, 
due to the relatively high conductivity of the volume conductor, it is plausible that such variations become 
increasingly insignificant as the distance to the fiber is increased. 
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A very important factor not considered in Fig. 14.5 concerns the degree of longitudinal alignment of 
SFAPs in the motor unit. As illustrated in Fig. 14.2, SFAPs are usually dispersed along the motor unit 
axis, and their potential distributions do not sum up in a simple manner. SFAPs can be misaligned for 
several reasons. The end plates are not aligned; hence some SFAPs originate ahead of others. Variations 
in fiber size and packing density cause variations in conduction velocity; thus the dispersion of SFAPs 
may actually increase with increasing distance from the end-plate zone. Neuropathologic conditions can 
affect the alignment of SFAPs. Complete or partial failure of collateral nerve sprouts to conduct nerve 
action potentials can abolish or delay the onset of action potentials, and immature sprouts formed during 
reinnervation may cause significant variability (jitter) in the onset of muscle action potentials. Denervated 
muscle fibers shrink; hence a newly reinnervated muscle fiber is likely to have a slower conduction velocity. 
An increased dispersion of SFAPs creates a very complex and irregular spatial potential distribution. The 
superimposed temporal potential waveform in a fixed observation point, i.e., the motor unit potential 
(MUP), is therefore comprised of several peaks rather than a simple bi- or triphasic waveform. MUPs 
with five or more peaks are classified as polyphasic. A satellite potential is an SFAP that is so misaligned 
from the main potential complex that its waveform is completely separated from the MUP. 

14.3 Electromyographic Recordings 

A considerable amount of information regarding the bioelectrical state of a muscle is hidden in the time- 
varying spatial distribution of potentials in the muscle. Unfortunately, it is not clinically feasible to obtain 
high-resolution three-dimensional samples of the spatial potential distribution, since this would require 
the insertion of hundreds of electrodes into the muscles. In order to minimize the discomfort of the 
patient, routine EMG procedures usually employ only a single electrode that is inserted into different 
regions of the muscle. As the SFAPs of an active motor unit pass by the electrode, only their summation, 
i.e., the MUP, will be registered by the electrode. The electrode is effectively integrating out the spatial 
information hidden in the passing potential complex, leaving only a time-variant potential waveform to 
be recorded and interpreted. It goes without saying that such a constraint on the recording procedure 
puts the electromyographist at a considerable disadvantage. To partially circumvent this setback, technical 
innovations and new procedures have continued to refine EMG examinations to such a level that some 
of the spatial information originally obscured by the electrode can be extracted intuitively from the 
temporal waveforms by an experienced electromyographist. With a detailed understanding of the bio- 
electric principles involved, e.g., the bioelectric sources, the volume conductor, and the recording prop- 
erties of the electrode, the electromyographist can quickly recognize and explain the waveform 
characteristics associated with various neuromuscular abnormalities. 

To increase the amount of diagnostic information, several sets of EMG investigations maybe performed 
using electrodes with different recording characteristics. Figure 14.6 illustrates three of the most popular 
EMG needle electrodes. The concentric and monopolar electrodes have an intermediate pickup range 
and are used in conventional recordings. The single-fiber electrode is a more recent innovation. It has a 
very small pickup range and is used to obtain recordings from only one or two muscle fibers. The macro 
electrode, which is the cannula of either the concentric or single-fiber electrode in combination with a 
remote reference electrode, picks up potentials throughout the motor unit territory. This section will 
review these EMG electrodes, the waveforms they produce, and the signal analysis performed in each case. 

Concentric Electrode EMG 

Adrian and Bronk developed the concentric electrode (see Fig. 14.6) in order to obtain a pickup range 
that is smaller than that of wire electrodes. The modern version of the concentric electrode consists of 
a platinum or stainless steel wire located inside the lumen of a stainless steel cannula with an outer 
diameter of about 0.5 mm. The tip is beveled at 15 to 20 degrees, thereby exposing the central wire as 
an oblique elliptical surface of about 150 x 580 urn. The central wire is insulated from the cannula with 
araldite or epoxy. 
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FIGURE 14.6 Needle electrodes for subcutaneous EMG recordings. For the single-fiber and concentric electrodes, 
the cannula of the hypodermic needle acts as reference electrode. The monopolar electrode is used with a remote 
reference electrode. 

The concentric electrode is connected to a differential amplifier; thus common-mode signals are 
effectively rejected, and a relatively stable baseline is achieved. The cannula cannot be regarded as an 
indifferent reference electrode because it is located within the potential distribution and thus will pick 
up potentials from active fibers. Simulation studies [Henneberg and Plonsey, 1993] have shown that the 
cannula shields the central wire from picking up potentials from fibers located behind the tip. The 
sensitivity of the concentric electrode is therefore largest in the hemisphere facing the oblique elliptical 
surface. Due to the asymmetric sensitivity function, the waveshape of the recorded potentials will vary 
if the electrode is rotated about its axis. This problem is not observed with the axisymmetric monopolar 
electrode; however, this electrode has a more unstable baseline due to the remote location of the reference 
electrode. Both the concentric and monopolar electrodes (see Fig. 14.6) are used in conventional EMG. 
Because of the differences in recording characteristics, however, concentric and monopolar recordings 
cannot be compared easily. A particular EMG laboratory therefore tends to use the one and not the other. 

During the concentric needle examination, the investigator searches for abnormal insertional activity, 
spontaneous activity in relaxed muscles, and motor unit potentials with abnormal appearance. The 
waveshape of motor unit potentials is assessed on the basis of the quantitative waveform features defined 
in Fig. 14.7: 

• Amplitude is determined by the presence of active fibers within the immediate vicinity of the 
electrode tip. Low-pass filtering by the volume conductor attenuates the high-frequency spikes of 
remote SFAPs; hence the MUP amplitude does not increase for a larger motor unit. However, 
MUP amplitude will increase if the tip of the electrode is located near a cluster of reinnervated 
fibers. Large MUP amplitudes are frequently observed in neurogenic diseases. 

• Rise time is an increasing function of the distance between the electrode and the closest active 
muscle fiber. A short rise time in combination with a small MUP amplitude might therefore 
indicate that the amplitude is reduced due to fiber atrophy rather than to a large distance between 
the electrode and the closest fiber. 

• Number of phases indicates the complexity of the MUP and the degree of misalignment between 
SFAPs. In neurogenic diseases, polyphasic MUPs arise due to slow conduction velocity in immature 
nerve sprouts or slow conduction velocity in reinnervated but still atrophied muscle fibers. Vari- 
ation in muscle fiber size also causes polyphasic MUPs in myopathic diseases. To prevent noisy 
baseline fluctuations from affecting the count of MUP phases, a valid baseline crossing must exceed 
a minimum absolute amplitude criterion. 
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MUP duration 

FIGURE 14.7 Definition of quantitative waveform features of MUPs recorded with a concentric EMG electrode. 
MUP area is defined as the rectified area under the curve in the interval between the onset and end. The number of 
MUP phases is defined as the number of baseline crossings (□) plus one. MUP turns are marked by a 0. MUP rise 
time is the time interval between the 10% and 90% deflection (O) of the main negative-going slope. As by convention, 
negative voltage is displayed upward. 

• Duration is the time interval between the first and last occurrence of the waveform exceeding a 
predefined amplitude threshold, e.g., 5 uV. The MUP onset and end are the summation of low- 
frequency components of SFAPs scattered over the entire pickup range of the electrode. As a result, 
the MUP duration provides information about the number of active fibers within the pickup 
range. However, since the motor unit territory can be larger than the pickup range of the electrode, 
MUP duration does not provide information about the total size of a large motor unit. MUP 
duration will increase if a motor unit has an increased number of fibers due to reinnervation. 
MUP duration is affected to a lesser degree by SFAP misalignment. 

• Area indicates the number of fibers adjacent to the electrode; however, unlike MUP amplitude, 
MUP area depends on MUP duration and is therefore influenced by fibers in a larger region 
compared with that of MUP amplitude. 

• Turns is a measure of the complexity of the MUP, much like the number of phases; however, since 
a valid turn does not require a baseline crossing like a valid phase, the number of turns is more 
sensitive to changes in the MUP waveshape. In order to distinguish valid turns from signal noise, 
successive turns must be offset by a minimum amplitude difference. 

Based on the complimentary information contained in the MUP features defined above, it is possible 
to infer about the number and density of fibers in a motor unit as well as the synchronicity of the SFAPs. 
However, the concentric electrode is not sufficiently selective to study individual fibers, nor is it sufficiently 
sensitive to measure the total size of a motor unit. The following two techniques were designed with 
these objectives in mind. 

Single-Fiber EMG 

The positive lead of the single-fiber electrode (see Fig. 14.6) is the end cap of a 25-Ltm wire exposed 
through a side port on the cannula of a steel needle. Due to the small size of the positive lead, bioelectric 
sources, which are located more than about 300 urn from the side port, will appear as common-mode 
signals and be suppressed by the differential amplifier. To further enhance the selectivity, the recorded 
signal is high-pass filtered at 500 Hz to remove low-frequency background activity from distant fibers. 
Due to its very small pickup range, the single-fiber electrode rarely records potentials from more than 
one or two fibers from the same motor unit. Because of the close proximity of the fibers, potentials are 
of large amplitudes and with small rise times. When two potentials from the same motor unit are picked 
up, the slight variation in their interpotential interval (IPI) can be measured (Fig. 14.8). The mean IPI 
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FIGURE 14.8 Measurement of interpotential interval (IPI) between single-fiber potentials recorded simultaneously 
from two fibers of the same motor unit. 

(jitter) is normally 5 to 50 (j,s but increases when neuromuscular transmission is disturbed. When the 
single-fiber electrode records potentials from an increased number of fibers, it usually indicates that the 
side port is close to either a cluster of fibers (reinnervation) or that the positive lead is close to fibers in 
the process of splitting. 

Macro EMG 

For this electrode, the cannula of a single-fiber or concentric electrode is used as the positive lead, while 
the reference electrode can be either a remote subcutaneous or remote surface electrode. Due to the large 
lead surface, this electrode picks up both near- and far-field activity. However, the signal has very small 
amplitude, and the macro electrode must therefore be coupled to an electronic averager. To ensure that 
only one and the same MUP is being averaged, the averager is triggered by a SFAP picked up from that 
motor unit by the side port wire of the single-fiber electrode or by the central wire of the concentric 
electrode. Since other MUPs are not time-locked to the triggering SFAP, they will appear as random 
background activity and become suppressed in the averaging procedure. Quantitative features of the 
macro MUP include the peak-to-peak amplitude, the rectified area under the curve, and the number of 
phases. 

Defining Terms 

Concentric electrode EMG: Registration and interpretation of motor unit potentials recorded with a 

concentric needle electrode. 
Electromyograms (EMGs): Bioelectric potentials recorded in muscles. 
Jitter: Mean variation in interpotential interval between single-fiber action potentials of the same motor 

unit. 
Macro EMG: The registration of motor unit potentials from the entire motor unit using the cannula 

of the single-fiber or concentric electrode. 
Motor unit: The functional unit of an anterior horn cell, its axon, the neuromuscular junctions, and 

the muscle fibers innervated by the motor neuron. 
Motor unit potential (MUP): Spatial and temporal summation of all single-fiber potentials innervated 

by the same motor neuron. Also referred to as the motor unit action potential (MUAP). 
Motor unit territory (MUT): Cross-sectional region of muscle containing all fibers innervated by a 

single motor neuron. 
Satellite potential: An isolated single-fiber action potential that is time-locked with the main MUP. 
Single-fiber action potential (SFAP): Extracellular potential generated by the extracellular current flow 

associated with the action potential of a single muscle fiber. 
Single-fiber EMG (SFEMG): Recording and analysis of single-fiber potentials with the single-fiber 

electrode. 
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Electroencephalograms (EEGs) are recordings of the minute (generally less that 300 uV) electrical 
potentials produced by the brain. Since 1924, when Hans Berger reported the recording of rhythmic 
electrical activity from the human scalp, analysis of EEG activity has been conducted primarily in clinical 
settings to detect gross pathologies and epilepsies and in research facilities to quantify the central effects 
of new pharmacologic agents. As a result of these efforts, cortical EEG patterns have been shown to be 
modified by a wide range of variables, including biochemical, metabolic, circulatory, hormonal, neuro- 
electric, and behavioral factors. In the past, interpretation of the EEG was limited to visual inspection 
by an electroencephalographer, an individual trained to qualitatively distinguish normal EEG activity 
from localized or generalized abnormalities contained within relatively long EEG records. This approach 
left clinicians and researchers alike buried in a sea of EEG paper records. The advent of computers and 
the technologies associated with them has made it possible to effectively apply a host of methods to 
quantify EEG changes. With this in mind, this chapter provides a brief historical perspective followed by 
some insights regarding EEG recording procedures and an in-depth discussion of the quantitative tech- 
niques used to analyze alterations in the EEG. 



15.1 Historical Perspective 



In 1875, Richard Caton published the first account documenting the recording of spontaneous brain 
electrical activity from the cerebral cortex of an experimental animal. The amplitude of these electrical 
oscillations was so low (i.e., in the microvolt range) that Caton's discovery is all the more amazing because 
it was made 50 years before suitable electronic amplifiers became available. In 1924, Hans Berger, of the 
University of Jena in Austria, carried out the first human EEG recordings using metal strips pasted to 
the scalps of his subjects as electrodes and a sensitive galvanometer as the recording instrument. Berger 
was able to measure the irregular, relatively small electrical potentials (i.e., 50 to 100 U.V) coming from 
the brain. By studying the successive positions of the moving element of the galvanometer recorded on 
a continuous roll of paper, he was able to observe the resultant patterns in these brain waves as they 
varied with time. From 1924 to 1938, Berger laid the foundation for many of the present applications of 
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electroencephalography. He was the first to use the word electroencephalogram in describing these brain 
potentials in humans. Berger also noted that these brain waves were not entirely random but instead 
displayed certain periodicities and regularities. For example, he observed that although these brain waves 
were slow (i.e., exhibited a synchronized pattern of high amplitude and low frequency, <3 Hz) during 
sleep, they were faster (i.e., exhibited a desynchronized pattern of low amplitude and higher frequency, 
15 to 25 Hz) during waking behaviors. He suggested, quite correctly, that the brain's activity changed in 
a consistent and recognizable fashion when the general status of the subject changed, as from relaxation 
to alertness. Berger also concluded that these brain waves could be greatly affected by certain pathologic 
conditions after noting a marked increase in the amplitude of these brain waves recorded during con- 
vulsive seizures. However, despite the insights provided by these studies, Berger 's original paper, published 
in 1929, did not excite much attention. In essence, the efforts of this remarkable pioneer were largely 
ignored until similar investigations were carried out and verified by British investigators. 

It was not until 1934, however, when Adrian and Matthews published their classic paper verifying 
Berger 's findings, that the concept of "human brain waves" was truly accepted and the study of EEG 
activity was placed on a firm foundation. One of their primary contributions was the identification of 
certain rhythms in the EEG, e.g., a regular oscillation at approximately 10 to 12 Hz recorded from the 
occipital lobes of the cerebral cortex, which they termed the "alpha rhythm." This alpha rhythm was 
found to disappear when a subject displayed any type of attention or alertness or focused on objects in 
the visual field. The physiologic basis for these results, the "arousing influence" of external stimuli on 
the cortex, was not formulated until 1949, when Moruzzi and Magoun demonstrated the existence of 
pathways widely distributed through the central reticular core of the brainstem that were capable of 
exerting a diffuse activating influence on the cerebral cortex. This "reticular activating system" has been 
called the brain's response selector because it alerts the cortex to focus on certain pieces of incoming 
information while ignoring others. It is for this reason that a sleeping mother will immediately be 
awakened by her crying baby or the smell of smoke and yet ignore the traffic outside her window or the 
television playing in the next room. (Note: For the interested reader, an excellent historical review of this 
early era in brain research is provided in a fascinating text by Brazier [1968].) 

15.2 EEG Recording Techniques 

Scalp recordings of spontaneous neuronal activity in the brain, identified as the EEG, allow measurement 
of potential changes over time between a signal electrode and a reference electrode [Kondraski, 1986]. 
Compared with other biopotentials, such as the electrocardiogram, the EEG is extremely difficult for an 
untrained observer to interpret, partially as a result of the spatial mapping of functions onto different regions 
of the brain and electrode placement. Recognizing that some standardization was necessary, the Interna- 
tional Federation in Electroencephalography and Clinical Neurophysiology adopted the 10-20 electrode 
placement system. In addition to the standard 10-20 scalp array, electrodes to monitor eye movement, ECG, 
and muscle activity are essential for discrimination of different vigilance or behavioral states. 

Any EEG system consists of electrodes, amplifiers (with appropriate filters), and a recording device. 
Instrumentation required for recording EEG activity can be simple or elaborate. (Note: Although the 
discussion presented in this section is for a single-channel system, it can be extended to simultaneous 
multichannel recordings simply by multiplying the hardware by the number of channels required. In 
cases that do not require true simultaneous recordings, special electrode selector panels can minimize 
hardware requirements.) 

Commonly used scale electrodes consist of Ag-AgCl disks, 1 to 3 mm in diameter, with long flexible 
leads that can be plugged into an amplifier. Although a low-impedance contact is desirable at the 
electrode-skin interface (<10 kfi), this objective is confounded by hair and the difficulty of mechanically 
stabilizing the electrode. Conductive electrode paste helps obtain low impedance and keep the electrodes 
in place. Often a contact cement (collodion) is used to fix small patches of gauze over the electrodes for 
mechanical stability, and leads are usually taped to the subject to provide some strain relief. Slight abrasion 
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of the skin is sometimes used to obtain lower electrode impedance, but this can cause slight irritation 
and sometimes infection (as well as pain in sensitive subjects). 

For long-term recordings, as in seizure monitoring, electrodes present major problems. Needle elec- 
trodes, which must be inserted into the tissue between the surface of the scalp and the skull, are sometimes 
useful. However, the danger of infection increases significantly. Electrodes with self-contained miniature 
amplifiers are somewhat more tolerant because they provide a low-impedance source to interconnecting 
leads, but they are expensive. Despite numerous attempts to simplify the electrode application process 
and to guarantee long-term stability, no single method has been widely accepted. 

Instruments are available for measuring impedance between electrode pairs. The procedure is recom- 
mended strongly as good practice, since high impedance leads to distortions that may be difficult to 
separate from actual EEG signals. In fact, electrode impedance monitors are built into some commercially 
available EEG devices. Note that standard dc ohmmeters should not be used, since they apply a polarizing 
current that can result in a buildup of noise at the skin-electrode interface. 

From carefully applied electrodes, signal amplitudes of 1 to 10 (j,V can be obtained. Considerable 
amplification (gain = 10 6 ) is required to bring signal strength up to an acceptable level for input to 
recording devices. Because of the length of electrode leads and the electrically noisy environment where 
recordings commonly take place, differential amplifiers with inherently high input impedance and high 
common-mode rejection ratios are essential for high-quality EEG recordings. 

In some facilities, special electrically shielded rooms minimize environmental electrical noise, partic- 
ularly 60-Hz alternating current (ac) line noise. Since much of the information of interest in the EEG 
lies in frequency bands below 40 Hz, low-pass filters in the amplifier can be used to greatly reduce 60-Hz 
noise. For attenuating ac noise when the low-pass cutoff is above 60 Hz, many EEG amplifiers employ 
a notch filter specific only for frequencies in a narrow band centered around 60 Hz. 

When trying to eliminate or minimize the effect of 60-Hz sources, it is sometimes useful to use a 
dummy source, such as a fixed 100-kQ resistor attached to the electrodes. By employing a dummy source 
as one of the input signals, the output of the differential amplifier represents only contributions from 
interfering sources. If noise can be reduced to an acceptable level (at least by a factor of 10 less than EEG 
signals), it is likely that uncontaminated EEG records can be obtained. 

Different types of recording instruments obtain a temporary or permanent record of the EEG. The 
most common recording device is a pen or chart recorder (usually multichannel), which is an integral 
part of most commercially available EEG instruments. Recordings are on a long sheet of continuous 
paper (from a folded stack) fed past the moving pen at one of several selectable constant speeds. Paper 
speed is selected according to the monitoring situation at hand: slow speed (10 mm/s) for observing the 
spiking characteristically associated with seizure activity and faster speeds (up to 120 mm/s) to identify 
the presence of individual frequency bands in the EEG. 

In addition to (or instead of) a pen recorder, the EEG may be recorded on a multichannel frequency- 
modulated (FM) analog tape recorder. During such recordings, a visual output device such as an oscil- 
loscope or video display is often used to allow visual monitoring of signals. 

Sophisticated FM tape recording and playback systems allow clinicians to review long EEG recordings 
over a greatly reduced time compared with that required to flip through stacks of paper or to observe 
recordings in real time. Such systems take advantage of time-compensation schemes, whereby a signal 
recorded at one speed can be played back at a faster speed. The ratio of playback to recording speed is 
known, so the appropriate correction factor can be applied to played-back data generating a properly 
scaled video display. A standard ratio of 60:1 is often used. Thus a trained clinician can review each 
minute of real-time EEG in 1 s. The display is scrolled at a high rate horizontally across the display screen. 
Features of such instruments allow the clinician to freeze a segment of EEG on the display and to slow 
down or accelerate tape speed from the standard playback as needed. A vertical "tick" mark is usually 
displayed at periodic intervals by one channel as a time mark to provide a convenient timing reference. 

Computers also can be used as recording devices. In such systems, one or more channels of analog 
EEG signal are repeatedly sampled at a fixed time interval (sampling interval), and each sample is 
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converted into a digital representation by an analog-to-digital (A/D) converter. The A/D converter is 
interfaced to a computer system so that each sample can be saved in the computer's memory. The 
resolution of the A/D converter is determined by the smallest amplitude that can be sampled. This is 
determined by dividing the voltage range of the A/D converter by 2 raised to the power of the number 
of bits of the A/D converter. For example, an A/D converter with a range of ±5 V and 12-bit resolution 
can resolve sample amplitudes as small as ±2.4 mV. Appropriate matching of amplification and A/D 
converter sensitivity permits resolution of the smallest signal while preventing clipping of the largest 
signal amplitudes. 

A set of such samples, acquired at a sufficient sampling rate (at least 2x the highest frequency 
component of interest in the sampled signal), is sufficient to represent all the information in the waveform. 
To ensure that the signal is band-limited, a low-pass filter with a cutoff frequency equal to the highest 
frequency of interest is used. Since physically realizable filters do not have ideal characteristics, the 
sampling rate is usually set to 2x the cutoff frequency of the filter or more. Furthermore, once converted 
to digital format, digital filtering techniques can be used. 

Online computer recordings are only practical for short-term recordings or for situations in which 
the EEG is immediately processed. This limitation is primarily due to storage requirements. For example, 
a typical sampling rate of 128 Hz yields 128 new points per second that require storage. For an 8-s sample, 
1024 points are acquired per channel recorded. A 10-minute recording period yields 76,800 data points 
per channel. Assuming 12-bit resolution per sample, one can see that available computer memory quickly 
becomes a significant factor in determining the length (in terms of time) as well as the number of channels 
of EGG activity to be acquired in real time by the computer. 

Further data processing can consist of compression for more efficient storage (with associated loss of 
total information content), as in the use of compressed spectral arrays, determination of a reduced features 
set including only data needed for quantification, as in evoked response recordings, or feature extraction 
and subsequent pattern recognition, as in automated spike detection during monitoring for epileptic 
seizure activity. 

In addition to the information available from spontaneous EEG activity, the brain's electrical response 
to sensory stimulation is also important. Due to the relatively small amplitude of a stimulus-evoked 
potential compared with that of spontaneous EEG potentials, the technique of signal averaging is often 
used to enhance the characteristics of stimulus-evoked responses. Stimulus averaging takes advantage of 
the fact that the brain's electrical response is time locked to the onset of the stimulus, while nonevoked, 
background potential changes are randomly distributed in time. Consequently, the averaging of multiple 
stimulus-evoked responses results in the enhancement of the time-locked activity, while average random 
background activity approaches zero. The result is an evoked response that consists of a number of 
discrete and replicable peaks that occur, depending on the stimulus and recording parameters, at pre- 
dictable latencies associated with the onset of stimulation. 

15.3 Use of Amplitude Histographs to Quantify the EEG 

In general, the EEG contains information regarding changes in the electrical potential of the brain 
obtained from a given set of recording electrodes. These data include the characteristic waveforms with 
accompanying variations in amplitude, frequency, phase, etc., as well as the brief occurrence of electrical 
patterns, such as spindles. Any analysis procedure cannot simultaneously provide information regarding 
all these variables. Consequently, the selection of any analytical technique will emphasize changes in one 
particular variable at the expense of the others. This observation is extremely important if one is to 
properly interpret the results obtained using a given technique. 

In the computation of amplitude distributions of the EEG, for example, successive EEG amplitudes 
must be measured and ordered into specific amplitude classes, or bins. The amplitude histogram that 
results from this process is often a symmetrical, essentially gaussian distribution. The primary charac- 
teristics of the gaussian distribution are summarized simply by specifying its mean and standard deviation, 
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since the higher control moments of the distribution, such as skewness and kurtosis, are equal to zero. 
However, in nongaussian distributions, the measures of skewness and kurtosis assume nonzero values 
and can be used to characterize that particular amplitude distribution. The four primary statistical 
measures used to characterize an EEG amplitude histogram include the mean, standard amplitude, 
skewness, and kurtosis. 

Mean 

Since the sum of (positive and negative) EEG potential is usually on the order of a few microvolts when 
the analysis time is not too short, the mean is essentially a constant, although of small value. Any shifts 
in values of the mean, therefore, are indicative of changes in potential that are of technical origin, such 
as amplifier drifts, etc. 

Standard Amplitude 

The variance of the EEG amplitude distribution is directly related to the total power of the EEG. For 
example, a flat EEG will provide low variance values, while a widely oscillating EEG will yield high 
variance values. To avoid confusion and use units that are more familiar to electroencephalographers, 
the term standard amplitude is often used. 

Skewness 

The degree of deviation from the symmetry of a normal or gaussian distribution is measured by skewness. 
This third central moment of the amplitude histogram has a value of zero when the distribution is 
completely symmetrical and assumes some nonzero value when the EEG waveforms are asymmetrical 
with respect to the baseline (as is the case in some characteristic sleep patterns, murhythms, morphine 
spindles, barbiturate spiking, etc.). In general, a nonzero value of the skewness index reflects the presence 
of monophasic events in the waveform. The following methods can be used to obtain the measure of 
skewness. 

Moment Coefficient of Skewness: 
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Pearson's Second Coefficient of Skewness: 
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Centile Index of Skewness: 
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Kurtosis 

The kurtosis measure reveals the peakedness or flatness of a distribution. A kurtosis value greater than 
that of a normal distribution means that the distribution is leptokurtic, or simply more peaked than the 
normal curve. A value less than that of a normal distribution indicates a flatter distribution. In clinical 
electroencephalography, when analyzing EEGs with little frequency and amplitude modulation, one 
observes negative values of kurtosis. High positive values of kurtosis are present when the EEG contains 
transient spikes, isolated high-voltage wave groups, etc. The following methods can be used to obtain 
the measure of kurtosis. 



Moment Coefficient of Kurtosis: 
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Centile Index of Kurtosis: 
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A normal distribution will have a value of 0.5 for this measures. 

Figure 15.1 illustrates the sensitivity of these measures in analyzing the effect of systemic (IP) admin- 
istration of morphine sulfate (30 mg/kg) on the cortical EEG. It will be noted that the skewness measure 
changes abruptly only immediately after the morphine injection, when the EEG was dominated by the 
appearance of spindles. However, the index of kurtosis characterizes the entire extent of the drug effect 
from onset to its return to baseline. 



MEAN 



STD. AMP. 



SKEW 



KURTOSIS 





, ^WVm 







inj 



1 2 

TIME (Hrs.) 



FIGURE 15.1 Plot of the indices of the amplitude distribution, i.e., the mean, standard amplitude, skewness, and 
kurtosis, of the EEG recorded from a rat prior to and for 3 hours following intraperitoneal injection of morphine 
sulfate (30 mg/kg). Arrow (inj) indicates time of injection. 
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The central moments of the EEG amplitude histogram, therefore, are capable of (1) characterizing the 
amplitude distributions of the EEG and (2) quantifying alterations in these electrical processes brought 
about by pharmacologic manipulations. In addition, use of the centile index for skewness and kurtosis 
provides a computer-efficient method for obtaining these measures in real-time. 

15.4 Frequency Analysis of the EEG 



DATA ACQUISITION PARAMETERS 

EEG Sample Length (sees.) 

Analog/Digital sampling rate (Hz) 
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In early attempts to correlate the EEG with behavior, analog fre- 
quency analyzers were used to examine single channels of EEG 
data. Although disappointing, these initial efforts did introduce 
the use of frequency analysis to the study of gross brain wave 
activity. Although power spectral analysis, i.e., the magnitude 
square of the Fourier transform, provides a quantitative measure 
of the frequency distribution of the EEG, it does so, as mentioned 
above, at the expense of other details in the EEG such as the 
amplitude distribution and information concerning the presence 
of specific EEG patterns. 

The first systematic application of power spectral analysis by 
general-purpose computers was reported in 1963 by Walter; how- 
ever, it was not until the introduction of the fast Fourier transform 
(FFT) by Cooley and Tukey in 1965 that machine computation 
of the EEG became commonplace. Although an individual FFT 
is ordinarily calculated for a short section of EEG data (e.g., from 
1 to 8 s), such signal segmentation with subsequent averaging of 
individual modified periodograms has been shown to provide a 
consistent estimator of the power spectrum. An extension of this 
technique, the compressed spectral array, has been particularly 
useful for evaluating EEG spectra over long periods of time. A 
detailed review of the development and use of various methods 
to analyze the EEG is provided by Bronzino et al. [1984] and 
Givens and Remond [1987]. 

Figure 15.2 provides an overview of the computational pro- 
cesses involved in performing spectral analysis of the EEG, i.e., 
including computation of auto- and cross-spectra [Bronzino 
et al., 1984]. It is to be noted that the power spectrum is the 
autocorrelellogram, i.e., the correlation of the signal with itself. As 

a result, the power spectrum provides only magnitude information in the frequency domain; it does not 
provide any data regarding phase. The power spectrum is computed by 
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FIGURE 1 5.2 Block diagram illustrat- 
ing the steps involved in conventional 
(linear) spectral analysis of EEG activity. 



p(f)=R 2 Uf)}+il\x(f) 



(15.6) 



where X(f) is the Fourier transform of the EEG signal. 

Power spectral analysis not only provides a summary of the EEG in a convenient graphic form but 
also facilitates statistical analysis of EEG changes that may not be evident on simple inspection of the 
records. In addition to absolute power derived directly from the power spectrum, other measures calcu- 
lated from absolute power have been demonstrated to be of value in quantifying various aspects of the 
EEG. Relative power expresses the percentage contribution of each frequency band to the total power 
and is calculated by dividing the power within a band by the total power across all bands. Relative power 
has the benefit of reducing the intersubject variance associated with absolute power that arises from 
intersubject differences in skull and scalp conductance. The disadvantage of relative power is that an 
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increase in one frequency band will be reflected in the calculation by a decrease in other bands; for 
example, it has been reported that directional shifts between high and low frequencies are associated 
with changes in cerebral blood flow and metabolism. Power ratios between low (0 to 7 Hz) and high 
(10 to 20 Hz) frequency bands have been demonstrated to be an accurate estimator of changes in cerebral 
activity during these metabolic changes. 

Although the power spectrum quantifies activity at each electrode, other variables derivable from the 
FFT offer a means of quantifying the relationships between signals recorded from multiple electrodes or 
sites. Coherence (which is a complex number), calculated from the cross-spectrum analysis of two signals, 
is similar to cross-correlation in the time domain. 

The cross-spectrum is computed by 

Cross spectrum=x(/)7 + (/) (15.7) 

where X{f) and Y(f) are Fourier transforms, and * indicates the complex conjugate. 
Coherence is calculated by 

Coherence = cro ss spectrum (15 . 8 ) 

^Px(f)-PY(f) 

The magnitude squared coherence (MSC) values range from 1 to 0, indicating maximum and no 
synchrony, respectively. The temporal relationship between two signals is expressed by the phase angle, 
which is a measure of the lag between two signals of common frequency components or bands. 

Since coherence is a complex number, the phase is simply the angle associated with the polar expression 
of that number. MSC and phase then represent measures that can be employed to investigate interactions 
of cerebral activity recorded from separate brain sites. For example, short (intracortical) and long 
(corticocortical) pathways have been proposed as the anatomic substrates underlying the spatial frequency 
and patterns of coherence. Therefore, discrete cortical regions linked by such fiber systems should 
demonstrate a relatively high degree of synchrony, while the temporal difference between signals, repre- 
sented by the phase measure, quantifies the extent to which one signal leads another. 

15.5 Nonlinear Analysis of the EEG 

As mentioned earlier, the EEG has been studied extensively using signal-processing schemes, most of 
which are based on the assumption that the EEG is a linear, gaussian process. Although linear analysis 
schemes are computationally efficient and useful, they only utilize information retained in the autocor- 
relation function (i.e., the second-order cumulant). Additional information stored in higher-order cumu- 
lants is therefore ignored by linear analysis of the EEG. Thus, while the power spectrum provides the 
energy distribution of a stationary process in the frequency domain, it cannot distinguish nonlinearly 
coupled frequency from spontaneously generated signals with the same resonance condition [Nikias and 
Raghweer, 1987]. 

There is evidence showing that the amplitude distribution of the EEG often deviates from gaussian 
behavior. It has been reported, for example, that the EEG of humans involved in the performance of 
mental arithmetic tasks exhibits significant nongaussian behavior. In addition, the degree of deviation 
from gaussian behavior of the EEG has been shown to depend on the behavioral state, with the state of 
slow-wave sleep showing less gaussian behavior than quiet waking, which is less gaussian than rapid eye 
movement (REM) sleep [Ning and Bronzino, 1989a,b]. Nonlinear signal-processing algorithms such as 
bispectral analysis are therefore necessary to address nongaussian and nonlinear behavior of the EEG in 
order to better describe it in the frequency domain. 
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But what exactly is the bispectrum? For a zero-mean, stationary process {X(k)}, the bispectrum, by 
definition, is the Fourier transform of its third-order cumulant (TOC) sequence: 



a a 



b(co i; co 2 )= £ Y, C ( m ' n ) f (15 " 9 ) 



m--a «--a 



The TOC sequence [C(m, n)} is defined as the expected value of the triple product 

dm,n) = Elx{k)x(k+m)xlk+n)\ (15.10) 

If process X(k) is purely gaussian, then its third-order cumulant C(m, n) is zero for each (m, n), and 
consequently, its Fourier transform, the bispectrum, B(a> v (0 2 ) is also zero. This property makes the 
estimated bispectrum an immediate measure describing the degree of deviation from gaussian behavior. 
In our studies [Ning and Bronzino, 1989a, b], the sum of magnitude of the estimated bispectrum was 
used as a measure to describe the EEG's deviation from gaussian behavior that is, 



D= V ^((DpOoJ (15.11) 



Using bispectral analysis, the existence of significant quadratic phase coupling (QPC) in the hippoc- 
ampal EEG obtained during REM sleep in the adult rat was demonstrated [Ning and Bronzino, 1989a,b, 
1990]. The result of this nonlinear coupling is the appearance, in the frequency spectrum, of a small 
peak centered at approximately 13 to 14 Hz (beta range) that reflects the summation of the two theta 
frequency (i.e., in the 6- to 7-Hz range) waves (Fig. 15.3). Conventional power spectral (linear) approaches 
are incapable of distinguishing the fact that this peak results from the interaction of these two generators 
and is not intrinsic to either. 

To examine the phase relationship between nonlinear signals collected at different sites, the cross- 
bispectrum is also a useful tool. For example, given three zero-mean, stationary processes {xlri)j = 1, 
2, 3}, there are two conventional methods for determining the cross-bispectral relationship, direct and 
indirect. Both methods first divide these three processes into M segments of shorter but equal length. 
The direct method computes the Fourier transform of each segment for all three processes and then 
estimates the cross-bispectrum by taking the average of triple products of Fourier coefficients over 
M segments, that is, 

M 

where X" (ra) is the Fourier transform of the mth segment of {xi n)}, and * indicates the complex conjugate. 
The indirect method computes the third-order cross-cumulant sequence for all segments: 

c w 3 M=L<("k(" +fc k(" +z ) (i5 - i3 > 

we? 

where x is the admissible set for argument n. The cross-cumulant sequences of all segments will be 
averaged to give a resultant estimate: 

M 

c w 3 M=^L c w.,( u ) (15 - 14) 

m-l 
© 2000 by CRC Press LLC 




To.00 8.00 16.80 24.00 32.00 40.00 
Frequency 




(8.00 8.00 



18.00 24.00 
Frequency 



40.00 





FIGURE 15.3 Plots a and b represent the averaged power spectra of sixteen 8-s epochs of REM sleep (digital 
sampling rate = 128 Hz) obtained from hippocampal subfields CA1 and the dentate gyrus, respectively. Note that 
both spectra exhibit clear power peaks at approximately 8 Hz (theta rhythm) and 16 Hz (beta activity). Plots cand 
d represent the bispectra of these same epochs, respectively. Computation of the bicoherence index at f(l) = 8 Hz, 
f(2) = 8 Hz showed significant quadratic phase coupling (QPC), indicating that the 16-Hz peak seen in the power 
spectra is not spontaneously generated but rather results from the summation of activity between the two recording 
sites. 

The cross-bispectrum is then estimated by taking the Fourier transform of the third-order cross- 
cumulant sequence: 



5 W3 K» 2 )= 



EE C - 
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(15.15) 



Since the variance of the estimated cross-bispectrum is inversely proportional to the length of each 
segment, computation of the cross-bispectrum for processes of finite data length requires careful con- 
sideration of both the length of individual segments and the total number of segments to be used. 

The cross-bispectrum can be applied to determine the level of cross-QPC occurring between {x^n)} 
and \x 2 (n)} and its effects on {x 3 (n)}. For example, a peak at Bx 1 x 2 x 3 (co 1 , co 2 ) suggests that the energy 
component at frequency (Dj + co 2 of {x 3 («)} is generated due to the QPC between frequency a l of {x^n)} 
and frequency co 2 of {x 2 {n)}. In theory, the absence of QPC will generate a flat cross-bispectrum. However, 
due to the finite data length encountered in practice, peaks may appear in the cross-bispectrum at 
locations where there is no significant cross-QPC. To avoid improper interpretation, the cross-bicoher- 
ence index, which indicates the significance level of cross-QPC, can be computed as follows: 
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FIGURE 15.4 Cross-bispectral plots of -B C ai dg-cai ( ffl ii m 2) computed using (a) the direct method and (b) the 
indirect method. 
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where P xj ((d) is the power spectrum of process {*•(«)}. The theoretical value of the bicoherence index 
ranges between and 1, and i.e., from nonsignificant to highly significant. 

In situations where the interest is the presence of QPC and its effects on {x{n)}, the cross-bispectrum 
equations can be modified by replacing jxj(n)} and {x 3 (n)} with {x(«)} and {x^n)} with {y(n)}, that is, 



M 



(15.17) 



In theory, both methods will lead to the same cross-bispectrum when data length is infinite. However, 
with finite data records, direct and indirect methods generally lead to cross-bispectrum estimates with 
different shapes (Fig. 15.4). Therefore, like power spectrum estimation, users have to choose an appro- 
priate method to extract the information desired. 

Defining Terms 

Bispectra: Computation of the frequency distribution of the EEG exhibiting nonlinear behavior. 
Cross-spectra: Computation of the energy in the frequency distribution of two different electrical 

signals. 
Electroencephalogram (EEG): Recordings of the electrical potentials produced by the brain. 
Fast Fourier transform (FFT): Algorithms that permit rapid computation of the Fourier transform of 

an electrical signal, thereby representing it in the frequency domain. 
Magnitude squared coherence (MSC): A measure of the degree of synchrony between two electrical 

signals at specified frequencies. 
Power spectral analysis: Computation of the energy in the frequency distribution of an electrical signal. 
Quadratic phase coupling (QPC): A measure of the degree to which specific frequencies interact to 

produce a third frequency. 
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Since the first detection of the magnetocardiogram (MCG) in 1963 by Baule and McFee [Baule and 
McFee, 1963], new diagnostic information from biomagnetic signals has been widely anticipated. The 
first recording of the magnetoencephalogram (MEG) was made in 1968 by David Cohen [Cohen, 1968] , 
but it was not possible to record biomagnetic signals with good signal quality before the invention of 
the superconducting quantum interference device (SQUID) in 1970 [Zimmerman et al., 1970]. 

16.1 Theoretical Background 

Origin of Bioelectric and Biomagnetic Signals 

In 1819, Hans Christian Orsted demonstrated that when an electric current flows in a conductor, it 
generates a magnetic field around it (Orsted, 1820]. This fundamental connection between electricity 
and magnetism was expressed in exact form by lames Clerk Maxwell in 1864 [Maxwell, 1865]. In 
bioelectromagnetism, this means that when electrically active tissue produces a bioelectric field, it simul- 
taneously produces a biomagnetic field as well. Thus the origin of both the bioelectric and the biomagnetic 
signals is the bioelectric activity of the tissue. 
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The following equations describe the electric potential field (16.1) and the magnetic field (16.2) of a 
volume source distribution /' in an inhomogeneous volume conductor. The inhomogeneous volume 
conductor is represented by a piecewise homogeneous conductor where the regions of different conduc- 
tivity ct are separated by surfaces S. 



47ta<&(r)= f ;'V - dv+Y\ f (<j"-o'.)<&V 



V 



dS. 



(16.1) 



47tH(r)= f/'x V -Uv+V [ { a "r a ',)^ V 



V 



dS. 



(16.2) 



The first term on the right-hand side of Eqs. (16.1) and (16.2) describes the contribution of the volume 
source, and the second term describes the contribution of boundaries separating regions of different 
conductivity, i.e., the contribution of the inhomogeneities within the volume conductor. These equations 
were developed by David Geselowitz [Geselowitz, 1967, 1970]. 

Measurement of the Biomagnetic Signals 

The amplitude of the biomagnetic signals is very low. The strongest of them is the MCG, having an 
amplitude on the order of 50 pT This is roughly one-millionth of the static magnetic field of the earth. 
The amplitude of the MEG is roughly 1% of that of the MCG. This means that, in practice, the MEG 
can only be measured with the SQUID and that the measurements must be done in a magnetically 
shielded room. The MCG, instead, can be measured in the clinical environment without magnetic 
shielding. 

Independence of Bioelectric and Biomagnetic Signals 

The source of the biomagnetic signal is the electric activity of the tissue. Therefore, the most interesting 
and most important question in biomagnetism is whether the biomagnetic signals contain new infor- 
mation that cannot be obtained from bioelectric signals; in other words, whether the bioelectric and 
biomagnetic signals are fully independent or whether there is some interdependence. If the signals were 
fully independent, the biomagnetic measurement would possibly give about the same amount of new 
information as the bioelectric method. If there were some interdependence, the amount of new infor- 
mation would be reduced. 

Helmholtz's theorem states that "A general vector field, that vanishes at infinity, can be completely 
represented as the sum of two independent vector fields, one that is irrotational (zero curl) and another 
that is solenoidal (zero divergence)" [Morse and Feshbach, 1953; Plonsey and Collin, 1961]. The 
impressed current density /' is a vector field that vanishes at infinity and, according to the theorem, may 
be expressed as the sum of two components: 



r=u+K 



(16.3) 



where the subscripts F and V denote flow and vortex, respectively. By definition, these vector fields satisfy 
V x /p = and V • J\ = 0. We first examine the independence of the electric and magnetic signals in the 
infinite homogeneous case, when the second term on the right-hand side of Eqs. (16.1) and (16.2), caused 
by inhomogeneities, is zero. The equation for the electric potential may be rewritten as 
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= f V - ■J i dv= f ^-dv (16.4) 

and that for the magnetic field may be rewritten as 

47lH = -(*V - x]'dv = -[ XJ dv. (16.5) 

J,. I r J Jv r 



Substituting Eq. (16.3) into Eqs. (16.4) and (16.5) shows that under homogeneous and unbounded 
conditions, the bioelectric field arises from V ■ ]\, which is the flow source, and the biomagnetic field 
arises from V x /{,, which is the vortex source. For this reason, in the early days of biomagnetic research 
it was generally believed that the bioelectric and biomagnetic signals were fully independent. However, 
it was soon recognized that this could not be the case. For example, when the heart beats, it produces 
an electric field recorded as the P, QRS, and T waves of the ECG, and it simultaneously produces the 
corresponding magnetic waves recorded as the MCG. Thus the ECG and MCG signals are not fully 
independent. 

There have been several attempts to explain the independence/interdependence of bioelectric and 
biomagnetic signals. Usually these attempts discuss different detailed experiments and fail to give a 
satisfying general explanation. This important issue may be easily explained by considering the sensitivity 
distributions of the ECG and MCG lead systems, and this will be discussed in the next section. 

16.2 Sensitivity Distribution of Dipolar Electric 
and Magnetic Leads 

Concepts of Lead Vector and Lead Field 
Lead Vector 

Let us assume that two electrodes (or sets of electrodes) are placed on a volume conductor to form a 
lead. Let us further assume that inside the volume conductor in a certain location Q there is placed a 
unit dipole consecutively in the x, y, and z directions (Fig. 16. la). Due to the sources, we measure from 
the lead the signals c x , c , and c z , respectively. Due to linearity, if instead of the unit dipoles we place in 
the source location dipoles that are p x , p , and p z times the unit vectors, we measure signals that are c x p x , 
c p , and c z p z , respectively. 

If these dipoles are placed simultaneously to the source location, due to the principle of superposition, 
we measure from the lead a voltage that is 

V = c x p x + c y p, + c M p M . (16.6) 

These dipoles can be considered to be components of a dipole p, that is, p = p x i+ p •) + p z k. We may 
understand the coefficients c x , c , and c z to be components of a vector c, that is, c = c x i + c j + c z k. Now 
we may express the lead voltage Eq. (16.6) as the scalar product of the vector c and the dipole p as 

V=c-p. (16.7) 

The vector c is a three-dimensional transfer coefficient that describes how a dipole source p at a fixed 
point Q inside a volume conductor influences the voltage measured from the lead and is called the lead 
vector. 
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LINEARITY 
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Pxi ->V x =c x p x 
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V =c 

y ^y 



k ->V z =c 2 
P»7 ->V y =c y p y p z "k -> V z =c z p z 

Because of linearity, in each case V is linearly proportional to the dipole magnitude. 



SUPERPOSITION 




Pxi+Pyi+P z k 
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c = c x i+c y j+c z k 



VECTOR ALGEBRA 

Mathematically, the voltage V 
is the scalar product of dipole ~p 
and the lead vector c 



V = c»p 




Because of superpositon, V is proportional to 
the sum of the potentials of each dipole component. 
The proportionality coefficient is three-dimensional. 
It is the lead vector c. 



V = c«p = Icllplcosa 



FIGURE 16.1 The concepts of (a) lead vector and (b) lead field. See the text for more details. 

The lead vector c describes what is the sensitivity of the lead to a source locating at the source location. 
It is self-evident that for another source location the sensitivity may have another value. Thus the 
sensitivity, i.e., the lead vector, varies as a function of the location, and we may say that it has a certain 
distribution in the volume conductor. This is called the sensitivity distribution. 

Lead Field 

We may define the value of the lead vector at every point in the volume conductor. If we then place the 
lead vectors to the points for which they are defined, we have a field of lead vectors throughout the 
volume conductor. This field of lead vectors is called the lead field J L . The lead field illustrates the behavior 
of the sensitivity in the volume conductor and is a very powerful tool in analyzing the properties of 
electric and magnetic leads (see Fig. 16. 1 fo). 

It follows from the principle of reciprocity, described by Hermann von Helmholtz in 1853 [Helmholtz, 
1853], that the lead field is identical to the electric current field that arises in the volume conductor if a 
unit current, called reciprocal current I R , is fed to the lead. 

When we know the lead field / L , we can determine the signal V L in the lead due to the volume source 
distribution /'. For each source element the signal is, of course, proportional to the dot product of the 
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The field of the lead vectors c k is the lead field J L 



Each dipole element p k con- 
tributes to the lead voltage by V k = c k «p k 
The total lead voltage is the sum _ _ 

of the lead voltage elements V L = Xc k »p k 




ALTERNATIVE ILLUSTRATION 



J L »J dv 




Because of reciprocity, the field of lead vectors J L 
is the same as the current field J L raised by 
feeding a reciprocal current of 1Ato the lead. 



The lead field may also be illustrated 
with lead field current flow lines. 



FIGURE 16.1 (continued) 



source element and the lead field at the source location, as shown in Eq. (16.7). The contributions of 
the whole volume source is obtained by integrating this throughout the volume source. Thus the signal 
the volume source generates to the lead is 



V, 



J a 



-l-Vdv. 



(16.8) 



The lead field may be illustrated either with lead vectors in certain locations in the volume conductor 
or as the flow lines of the distribution of the reciprocal current in the volume conductor. This is called 
the lead current field. In the latter presentation, the lead field current flow lines are oriented in the direction 
of the sensitivity, and their density is proportional to the magnitude of the sensitivity. 
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Lead Fields of Leads Detecting the Electric and Magnetic Dipole 
Moments of a Volume Source 

Electric Lead 

The sensitivity of a lead system that detects the electric dipole moment of a volume source consists of 
three orthogonal components (Fig. 16.2a). Each of these is linear and homogeneous. In other words, 
one component of the electric dipole moment is detected when the corresponding component of all 
elements of the impressed current density /' are detected with the same sensitivity throughout the source 
area. 

Magnetic Lead 

The sensitivity distribution of a lead system that detects the magnetic dipole moment of a volume source 
also consists of three orthogonal components (Fig. 16.2b). Each of these has such a form that the sensitivity 
is always tangentially oriented around the symmetry axis (the coordinate axis). The magnitude of the 
sensitivity is proportional to the radial distance from the symmetry axis and is zero on the symmetry axis. 

Independence of Dipolar Electric and Magnetic Leads 
Electric Lead 

The sensitivity distributions of the three components of the lead system detecting the electric dipole 
moment of a volume source are orthogonal. This means that none of them can be obtained as a linear 
combination of the two other ones. (Note that any fourth measurement having a similar linear sensitivity 
distribution would always be a linear combination of the three previous ones.) Thus the sensitivity 
distributions, i.e., the leads, are orthogonal and thus independent. However, because the three electric 
signals are only different aspects of the same volume source, they are not (fully) independent. 

Magnetic Lead 

The sensitivity distributions of the three components of the lead system detecting the magnetic dipole 
moment of a volume source are also orthogonal, meaning that no one of them can be obtained as a 
linear combination of the two other ones. Thus, similarly, as in measurement of the electric dipole 
moment, the sensitivity distributions, i.e., the leads, are orthogonal and thus independent. However, 
because the three magnetic signals are only different aspects of the same volume source, they are not 
(fully) independent. 

On the basis of the sensitivity distributions, we also can similarly explain the independence between 
the electric and magnetic signals. According to Helmholtz's theorem, the electric leads are orthogonal to 
the three magnetic leads. This means that none of these six leads can be obtained as a linear combination 
of the other five. However, the six signals, which they measure, are not independent because they arise 
from the same electrically active volume source. 

16.3 Magnetocardiography (MCG) 
Selection of the Source Model for MCG 

In ECG and MCG it is the clinical problem to solve the inverse problem, i.e., to solve the source of the 
detected signal in order to get information about the anatomy and physiology of the source. Although 
the actual clinical diagnostic procedure is based on measuring certain parameters, such as time intervals 
and amplitudes, from the detected signal and actually not to display the components of the source, the 
selection of the source model is very important from the point of view of available information. 

In clinical ECG, the source model is a dipole. This is the model for both the 12-lead ECG and 
vectorcardiography (VCG). In 12-lead ECG, the volume conductor (thorax) model is not considered, 
which causes considerable distortion of the leads. In VCG, only the form of the volume conductor is 
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FIGURE 16.2 Sensitivity distributions, i.e., lead fields of lead systems detecting (a) electric and (b) magnetic dipole 
moments of a volume source. The lead field of the electric lead is shown both with vectors representing the magnitude 
and direction of the lead field (on the left) and with lead field current flow lines (on the right). 
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modeled. This decreases the distortion in the lead fields but does not eliminate it completely. Note that 
today the display systems used in these ECG and VCG systems do not play any role in the diagnostic 
procedure because the computerized diagnosis is always based on the signals, not on the display. 

In selection of the source model for MCG, it is logical, at least initially, to select the magnetic source 
model to be on the same theoretical level with the ECG. Only in this way is it possible to compare the 
diagnostic performance of these methods. It is clear, of course, that if the source model is more accurate, 
i.e., has more independent variables, the diagnostic performance is better, but when comparing ECG 
and MCG, the comparison is relevant only if their complexity is similar [Malmivuo and Plonsey, 1995]. 

Detection of the Equivalent Magnetic Dipole of the Heart 

The basic detection method of the equivalent magnetic dipole moment of a volume source is to measure 
the magnetic field on each coordinate axis in the direction of that axis. To idealize the sensitivity 
distribution throughout the volume source, the measurements must be made at a distance that is large 
compared with the source dimensions. This, of course, decreases the signal amplitude. The quality of 
the measurement may be increased considerably if bipolar measurements are used; i.e., measurements 
are made on both sides of the source. Measurement of the magnetic field on each coordinate axis is, 
however, difficult to perform in MCG due to the geometry of the human body. It would require either 
six sequential measurements with one magnetometer (dewar) or six simultaneous measurements using 
six dewars (Fig. 16.3). 

It has been shown [Malmivuo, 1976] that all three components of the magnetic dipole also can be 
measured from a single location. Applying this unipositional method symmetrically so that measurements 
are made on both the anterior and posterior sides of the thorax at the same distance from the heart, only 
two dewars are needed and a very high quality of lead fields is obtained. Figure 16.4 illustrates the 
sensitivity distributions in nonsymmetrical and symmetrical measurements [Malmivuo and Plonsey, 
1995]. 





(b) 



FIGURE 1 6.3 Measurement of the three orthogonal components of the magnetic dipole moment of the heart ( a) on 
the coordinate axis (xyz lead system) and ( b) at a single location over and under the chest (unipositional lead system). 
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FIGURE 16.4 Sensitivity distributions in the measurement of the magnetic dipole moment of the heart. 
(a) Nonsymmetrical and (b) symmetrical measurements of the x component, (c) Symmetrical measurement of the 
y and z components. 



Diagnostic Performance of ECG and MCG 

The diagnostic performances of ECG and MCG were compared in an extensive study made at the Ragnar 
Granit Institute [Oja, 1993]. The study was made using the asymmetrical unipositional lead system, i.e., 
making measurements only on the anterior side of the thorax. The patient material was selected, however, 
so that myocardial changes were located dominantly on the anterior side. 

This study consisted of 290 normal subjects and 259 patients with different myocardial disorders. It 
was found that the diagnostic performance of ECG and MCG is about the same (83%). Diagnostic 
parameters were then selected from both ECG and MCG. With this combined method, called electro- 
magnetocardiogram (EMCG), a diagnostic performance of 90% was obtained. This improvement in 
diagnostic performance was obtained without increasing the number of parameters used in the diagnostic 
procedure. Moreover, this improvement is significant because it means that the number of incorrectly 
diagnosed patients was reduced by approximately 50%. 

This important result may be explained as follows: The lead system recording the electric dipole 
moment of the volume source has three independent leads. (This is also the case in the 12-lead ECG 
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FIGURE 16.4 (continued) 



system.) Similarly, the lead system detecting the magnetic dipole moment of the volume source has three 
independent leads. Therefore, the diagnostic performances of these methods are about the same. However, 
because the sensitivity distributions of electric and magnetic leads are different, the patient groups 
diagnosed correctly with both methods are not identical. 
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FIGURE 16.4 (continued) 

As stated before, the electric leads are independent of the magnetic leads. If the diagnostic procedure 
simultaneously uses both the ECG and the MCG leads, we obtain 3 + 3 = 6 independent leads, and the 
correctly diagnosed patient groups may be combined. Thus the diagnostic performance of the combined 
method is better than that of either method alone. This is the first large-scale statistically relevant study 
of the clinical diagnostic performance of biomagnetism. 
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Technical Reasons to Use the MCG 

The technical differences between ECG and MCG include the MCG's far better ability to record static 
sources, sources on the posterior side of the heart, monitor the fetal heart, and perform electrodeless 
recording. As a technical drawback, it should be mentioned that the MCG instrument costs 2 to 3 times 
more. An important feature of MCG is that, unlike the MEG instrument, it does not need a magnetically 
shielded room. This is very important because the shielded room is not only very expensive but also 
limits application of the technique to a certain laboratory space. 

Theoretical Reasons to Use the MCG 

It has been shown that MCG has clinical value and that it can be used either alone or in combination 
with ECG as a new technique called the electromagnetocardiogram (EMCG). The diagnostic performance 
of the combined method is better than that of either ECG or MCG alone. With the combined method, 
the number of incorrectly diagnosed patients may be reduced by approximately 50%. 



16.4 Magnetoencephalography (MEG) 



Similarly as in the cardiac applications, in the magnetic measurement of the electric activity of the brain, 
the benefits and drawbacks of the MEG can be divided into theoretical and technical ones. First, the 
theoretical aspects are discussed. 

The two main theoretical aspects in favor of MEG are that it is believed that because the skull is 
transparent for magnetic fields, the MEG should be able to concentrate its measurement sensitivity in a 
smaller region than the EEG, and that the sensitivity distribution of these methods are fundamentally 
different. These questions are discussed in the following: The analysis is made using the classic spherical 
head model introduced by Rush and Driscoll [Rush and Driscoll, 1969]. In this model, the head is 
represented with three concentric spheres, where the outer radii of the scalp, skull, and brain are 92, 85, 
and 80 mm, respectively. The resistivities of the scalp and the brain are 2.22 Q-cm, and that of the skull 
is 80 times higher, being 177 Q-cm. 

The two basic magnetometer constructions in use in MEG are axial and planar gradiometers. In the 
former, both coils are coaxial, and in the latter, they are coplanar. The minimum distance of the coil 
from the scalp in a superconducting magnetometer is about 20 mm. The coil radius is usually about 10 
mm. It has been shown [Malmivuo and Plonsey, 1995] that with this measurement distance, decreasing 
the coil radius does not change the distribution of the sensitivity in the brain region. In the following 
the sensitivity distribution of these gradiometer constructions is discussed. 

To indicate the magnetometer's ability to concentrate its sensitivity to a small region, the concept of 
half-sensitivity volume has been defined. This concept means the region in the source area (brain) where 
the detector sensitivity is one-half or more from the maximum sensitivity. The smaller the half-sensitivity 
volume, the better is the detector's ability to focus its sensitivity to a small region. 

In magnetocardiography, it is relevant to detect the magnetic dipole moment of the volume source of 
the heart and to make the sensitivity distribution within the heart region as independent of the position 
in the axial direction as possible. In magnetoencephalography, however, the primary purpose is to detect 
the electric activity of the cortex and to localize the regions of certain activity. 

Sensitivity Distribution of the Axial Magnetometer 

In a cylindrically symmetrical volume conductor model, the lead field flow lines are concentric circles 
and do not cut the discontinuity boundaries. Therefore, the sensitivity distribution in the brain area of 
the spherical model equals that in an infinite, homogeneous volume conductor. 

Figure 16.5 illustrates the sensitivity distribution of an axial magnetometer. The thin solid lines illustrates 
the lead field flow lines. The dashed lines join the points where the sensitivity has the same value, being 
thus so-called isosensitivity lines. The half-sensitivity volume is represented by the shaded region. 
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FIGURE 16.5 Sensitivity distribution of the axial magnetometer in measuring the MEG (spherical lead model). 



Sensitivity Distribution of the Planar Gradiometer 

Figure 16.6 illustrates the sensitivity distribution of a planar gradiometer. Again, the thin solid lines 
illustrate the lead field flow lines, and the dashed lines represent the isosensitivity lines. The half-sensitivity 
volume is represented by the shaded region. The sensitivity of the planar gradiometer is concentrated 
under the center of the two coils and is mainly linearly oriented. Further, there exist two zero-sensitivity 
lines. 

Half-Sensitivity Volumes of Electro- and Magnetoencephalography Leads 

The half-sensitivity volumes for different EEG and MEG leads as a function of electrode distance and 
gradiometer baselines are shown in Fig. 16.7. The minimum half-sensitivity volume is, of course, achieved 
with the shortest distance/baseline. For three- and two-electrode EEG leads, the half-sensitivity volumes 
at 1 degree of electrode distance are 0.2 and 1.2 cm 3 , respectively. For 10-mm-radius planar and axial 
gradiometer MEG leads, these volumes at 1 degree of coil separation (i.e., 1.6-mm baseline for axial 
gradiometer) are 3.4 and 21.8 cm 3 , respectively. 
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FIGURE 16.6 Sensitivity distribution of the planar gradiometer (half-space model). 

The 20-mm coil distance from scalp and 10-mm coil radii are realistic for the helmet-like whole-head 
MEG detector. There exist, however, MEG devices for recording at a limited region where the coil distance 
and the coil radii are on the order of 1 mm. Therefore, the half-sensitivity volumes for planar gradiometers 
with 1-mm coil radius at 0- to 20-mm recording distances are also illustrated in Fig. 16.7. These curves 
show that when the recording distance is about 12 mm and the distance/baseline is 1 mm, such a planar 
gradiometer has about the same half-sensitivity volume as the two-electrode EEG. 

Short separation will, of course, also decrease the signal amplitude. An optimal value is about 
10 degrees of separation. Increasing the separation to 10 degrees increases the EEG and MEG signal 
amplitudes to approximately 70 to 80% of their maximum value, but the half-sensitivity volumes do not 
increase considerably from their values at 1 degree of separation. 

Thus, contrary to general belief, the EEG has a better ability to focus its sensitivity to a small region 
in the brain than the whole-head MEG. At about 20 to 30 degrees of separation, the two-electrode EEG 
lead needs slightly smaller separation to achieve the same half- sensitivity volume as the planar gradiom- 
eter. The sensitivity distributions of these leads are, however, very similar. Note that if the sensitivity 
distributions of two different lead systems, whether they are electric or magnetic, are the same, they 
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FIGURE 16.7 Half-sensitivity volumes of different EEG leads (dashed lines) and MEG leads (solid lines) as a function 
of electrode distance and gradiometer baseline, respectively. 

detect exactly the same source and produce exactly the same signal. Therefore, the planar gradiometer 
and two-electrode EEG lead detect very similar source distributions. 

Sensitivity of EEG and MEG to Radial and Tangential Sources 

The three-electrode EEG has its maximum sensitivity under that electrode which forms the terminal 
alone. This sensitivity is mainly directed radially to the spherical head model. With short electrode 
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distances, the sensitivity of the two-electrode EEG is directed mainly tangentially to the spherical head 
model. Thus with the EEG it is possible to detect sources in all three orthogonal directions, i.e., in the 
radial and in the two tangential directions, in relation to the spherical head model. 

In the axial gradiometer MEG lead, the sensitivity is directed tangentially to the gradiometer symmetry 
axis and thus also tangentially to the spherical head model. In the planar gradiometer, the sensitivity has 
its maximum under the center of the coils and is directed mainly linearly and tangentially to the spherical 
head model. The MEG lead fields are oriented tangentially to the spherical head model everywhere. This 
may be easily understood by recognizing that the lead field current does not flow through the surface of 
the head because no electrodes are used. Therefore, the MEG can only detect sources oriented in the two 
tangential directions in relation to the spherical head model. 

References 

Baule GM, McFee R. 1963. Detection of the magnetic field of the heart. Am Heart J 55(7):95. 

Cohen D. 1968. Magnetoencephalography: Evidence of magnetic fields produced by alpha-rhythm cur- 
rents. Science 161:784. 

Geselowitz DB. 1967. On bioelectric potentials in an inhomogeneous volume conductor. Biophys J 7(1):1. 

Geselowitz DB. 1970. On the magnetic field generated outside an inhomogeneous volume conductor by 
internal current sources. IEEE Trans Magn MAG-6(2):346. 

Helmholtz HLF. 1853. Ueber einige Gesetze der Vertheilung elektrischer Strome in korperlichen Leitern 
mit Anwendung auf die thierisch-elektrischen Versuche. Ann Physik Chem 89:211. 

Malmivuo J, Plonsey R. 1995. Bioelectromagnetism: Principles and Applications of Bioelectric and Bio- 
magnetic Fields. New York, Oxford University Press. 

Malmivuo JA. 1976. On the detection of the magnetic heart vector: An application of the reciprocity 
theorem. Acta Polytechnol Scand 39:112. 

Maxwell J. 1865. A dynamical theory of the electromagnetic field. Phil Trans R Soc (Lond) 155:459. 

More PM, Feshbach H. 1953. Methods of Theoretical Physics, part I. New York, McGraw-Hill. 

Oja OS. 1993. Vector Magnetocardiogram in Myocardial Disorders, MD thesis, University of Tampere, 
Medical Faculty. 

Orsted HC. 1820. Experimenta circa effectum conflictus electrici in acum magneticam. J F Chem Phys 
29:275. 

Plonsey R, Collin R. 1 96 1 . Principles and Applications of Electromagnetic Fields. New York, McGraw-Hill. 

Rush S, Driscoll DA. 1969. EEG-electrode sensitivity: An application of reciprocity. IEEE Trans Biomed 
EngBME-16(l):15. 

Zimmerman JE, Thiene P, Hardings J. 1970. Design and operation of stable rf biased superconducting 
point-contact quantum devices. J Appl Phys 41:1572. 



3 2000 by CRC Press LLC 



Durand, D. M. "Electric Stimulation of Excitable Tissue . 
The Biomedical Engineering Handbook: Second Edition. 
Ed. Joseph D. Bronzino 
Boca Raton: CRC Press LLC, 2000 



17 



Electric Stimulation 
of Excitable Tissue 



17.1 Electric Stimulation of Neural Tissue 

17.2 Physiology of Excitation 

17.3 Electric Fields in Volume Conductors 
Quasi-State Formulation ■ Equivalence Between Dielectric 
and Conductive Media • Potential from a Monopole 
Source • Potential from Bipolar Electrodes and 

Dipoles • Inhomogeneous Volume Conductors 

17.4 Electric Field Interactions with Excitable Tissue 
Discrete Cable Equation for Myelinated Axons • Equivalent 
Cable Equation for Myelinated Fibers • Unmyelinat- 
ed/Myelinated Fibers • Anodic/Cathodic Stimulation • 
Large/Small-Diameter Axons • Spatial Derivative of the 
Electric Field • Activating Function • Net Driving 
Function • Current-Distance Relationship ■ 
Longitudinal/Transverse Field • Anodal Surround 

Block • Unidirectionally Propagated Action Potentials 

17.5 Electrode-Tissue Interface 

Effect of Stimulation Waveform on Threshold • 
Electrochemistry of Stimulation ■ Stimulation of Brain 
Tissue • Stimulation of the Peripheral Nerve • Stimulation 

Dominique M. Durand of Muscle Tissue • Corrosion ■ Tissue Damage • Biphasic 

Case Western Reserve University Waveforms 

17.6 Conclusion 

17.1 Electric Stimulation of Neural Tissue 

Functional electric stimulation (FES) of neural tissue provides a method to restore normal function to 
neurologically impaired individuals. By inserting electrodes inside or near nerves, it is possible to activate 
pathways to the brain or to muscles. Functional nerve stimulation (FNS) is often used to describe 
applications of electric stimulation in the peripheral nervous system. Neural prostheses refer to applica- 
tions for which electric stimulation is used to replace a function previously lost or damaged. 

Electric stimulation has been used to treat several types of neurologic dysfunction with varying 
amounts of success [see Hambrecht, 1979]. For example, electric stimulation of the auditory nerves to 
restore hearing in deaf patients has proved to be not only feasible but also clinically useful [Clark et al., 
1990]. Similarly, the phrenic nerve of patients with high-level spinal cord injury can be stimulated to 
produce diaphragm contractions and restore ventilation [Glenn et al., 1984]. Electric stimulation of the 
visual cortex produces visual sensations called phosphenes [Brindley and Lewin, 1968], and a visual 
prosthesis for the blind is currently being tested. Electric stimulation in the peripheral nerves of paralyzed 
patients can restore partial function of both the upper extremities for hand function [Peckham et al., 
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1977] and lower extremities for gait [Marsolais and Kobetic, 1988]. Several other attempts were not so 
successful. Electric stimulation of the cerebellum cortex for controlling epileptic seizures has been tried 
but was not reliable. However, a method involving stimulation of the vagus nerve has been successful in 
decreasing seizure frequency in patients with epilepsy [Rutecki, 1990; Fisher et al., 1997]. There are many 
other applications of electric stimulation of the nervous system and many problems associated with each 
one, but it is now clear that the potential and the limits of the technique have not yet been realized. 
Other excitable tissues such as cardiac muscle can also be excited by externally applied electric fields. The 
underlying principles are similar to those reviewed below for neural tissue. 

Stimulation of the nervous system also can be achieved with magnetic fields [Chokroverty, 1990]. A 
coil is placed near the excitable tissue, and a capacitor is rapidly discharged into the coil. Large magnetic 
fluxes are generated, and the induced electric fields can generate excitation. Magnetic stimulation has 
several advantages over electric stimulation. Magnetic fields can easily penetrate low-conductivity tissues 
such as bone, and the stimulation is completely noninvasive. However, magnetic stimulation requires a 
large amount of energy, and the magnetic field is difficult to localize. Magnetic stimulation of excitable 
tissue shares many aspects with electric stimulation, since the electric field is, in both cases, the source 
of the stimulus [Roth and Basser, 1990]. However, there are several important differences [Durand et al., 
1992; Nagarajan et al., 1997] that will not be reviewed below. 

What is the basis for the effect of electric stimulation of the nervous system? Clearly, it comes from 
the fact that a propagated action potential can be triggered by applying a rapidly changing electric field 
near excitable tissue. This fact was demonstrated early in this century, and clinical experimental appli- 
cations in the 1950s resulted in the highly successful cardiac pacemaker. Other clinical applications have 
been slow in coming for several reasons to be discussed below. One of the difficulties is that the 
fundamental principles of the interaction of electric fields and neurons is not completely understood. In 
order to understand how applied currents can generate excitation, it will necessary to describe the 
mechanisms underlying excitation (Sec. 17.2), the distribution of currents inside the volume conductor 
(Sec. 17.3), and the interaction between the axon and applied electric fields (Sec. 17.4). 

Another difficulty lies at the interface between electrodes applying the current and neural tissue to be 
stimulated. Current in the wires to the electrodes is carried by electrons, whereas current in the volume 
conductor is carried by ions. Chemical reasons at the interface will take place, and these reactions are 
still poorly understood. The waveforms used to apply the current can significantly affect the threshold, 
the electrochemistry at the electrode site, and tissue damage. These issues will be reviewed in Sec. 17.5. 

17.2 Physiology of Excitation 

Electric stimulation of excitable tissue is mediated by artificially depolarizing membrane-containing chan- 
nels capable of producing action potentials. Action potentials are normally elicited by synaptic currents, 
which in turn produce depolarization of the somatic membrane. The sodium channels are responsible for 
the generation of the depolarizing phase of the action potential and are sensitive to membrane voltage 
[Ferreira and Marshal, 1985]. Once initiated in the soma, the action potential is carried unattenuated 
along the axon to its intended target, such as another neuron or neuromuscular junction for muscle 
contraction (Fig. 17.1). Unmyelinated axons have ionic channels distributed throughout their membranes, 
and the action potential is carried smoothly along their length. The membranes of the axons containing 
the channels behave as resistive and capacitive elements, limiting the conduction velocity. Myelinated axons 
are surrounded by a insulation sheath of myelin that significantly decreases the membrane capacitance, 
thereby increasing the speed of propagation. Conduction of the action potential is no longer smooth but 
takes place in discrete steps (saltatory conduction). The myelin sheath is broken at regularly spaced intervals 
(nodes of Ranvier) to allow the current to flow through the membrane [Aidley, 1978]. 

Action potentials can be generated artificially by placing electrodes directly inside a cell. Current 
flowing from the inside to the outside will produce depolarization followed by excitation, provided that 
the current amplitude is large enough. This technique cannot be used for functional stimulation because 
we do not yet have the technology to interface electrodes with large numbers of single axons. Therefore, 
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FIGURE 17.1 Electric stimulation of the myelinated fiber. An electrode is located near the axon, and a cathodic 
stimulus is applied to the electrode. The current flow in and around the axon is described in this chapter and causes 
depolarization of the membrane at the sites closest to the electrodes. Action potentials are generated underneath the 
electrode and propagate orthodromically and antidromically. 



electrodes must be placed in the extracellular space near the tissue to be excited. It is then possible to 
activate many cells simultaneously. However, stimulation must be selective. Selectivity is defined as the 
ability of a stimulation system to activate any chosen set of axons. For example, the nervous system 
chooses to recruit small fibers connected to small motor units followed by large fibers connected to large 
motor units for smooth motor control. Applied current pulses activate first large fibers and then small 
fibers with increasing current (reverse recruitment) for reasons described in Sec. 17.4. The reverse 
recruitment order as well as our inability to recruit a chosen set of axons within a nerve bundle makes 
electric stimulation a powerful but difficult-to-control tool for activation of the nervous system. 

17.3 Electric Fields in Volume Conductors 

The excitable tissue to be stimulated (peripheral nerves, motor neurons, CNS neurons) are in all cases 
surrounded by an extracellular fluid with a relatively high conductivity (80 to 300 Q-cm). The electrodes 
used for electric stimulation are always placed in this "volume conductor" and it is essential to understand 
how the currents and the electric fields are distributed [Heringa et al., 1982]. The calculation of current 
density and electric fields can be done easily in simple cases such as a homogeneous (conductivity the 
same everywhere) and isotropic (conductivity the same in all directions) medium. 

Quasi-Static Formulation 

The calculations of electric fields generated by an electrode located in a conducting volume conductor 
can be done by solving Maxwell equations. The frequencies used for electric stimulation are generally 
under 10 kHz, and therefore, a simplified set of equations known as the quasi-static formulation can be 
used [Plonsey, 1969]: 



Conservation of charge: 



V-J = 



(17.1) 



Gauss' law: 



■E=P 



(17.2) 
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Ohm's law for conductors: J = 0E 

Electric field: E = — V(j) 



(17.3) 
(17.4) 



where E is the electric field (V/m) defined as gradient of the scalar potential (j>; J is the current density 
(defined as the current crossing a given surface, in A/m 2 ); a is the conductivity (inverse of resistivity), 
in s/m; p is the charge density, in C/m 3 ; e is the permittivity of the medium; and V-A is the divergence 
of vector A. 

Equivalence Between Dielectric and Conductive Media 

Assuming an infinite homogeneous conducting medium with conductivity o with a single point source 
as shown in Fig. 17.2a, the current density J at any point is the sum of a source term J s and an ohmic 
term aE: 



Using Eq. (17.1), 



J = oE + J. 



V J=V aE+V J =0 



(17.5) 



(17.6) 
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FIGURE 17.2 Voltage and electric field along an axon, (a) The current density J within an infinite, homogeneous, 
and isotropic volume conductor is radial and is inversely proportional to the square of the distance. ( b) Voltage along 
an axon located 1 cm from an anode with 1 mA of current, (c) Electric field along the same axon. 
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Since the volume conductor is homogeneous, V-(aE) = aV-E, and we then have 

V-E = -V-J s (17.7) 

Since E = -V<|) and V-(VA) is by definition the laplacian of A, V 2 A, we then have 

V 2 $ = V-jJa = -lJa (17.8) 

where I v is a source term (in A/m 3 ). The source term /,, is zero everywhere except where the sources are 
located. This equation is nearly identical to the Poisson equation derived from Eq. (17.2) [Kraus and 
Carver, 1973]; 

V 2 (j) = -^ (17.9) 

e 

derived for dielectric media. Using the following equivalence, 



the solution of the Poisson equation for dielectric problems can then be used for the solution of the 
current in volume conductors. 

Potential from a Monopole Source 

Given a point source (monopolar) in an infinite homogeneous and isotropic volume conductor connected 
to a current source /, the potential and currents anywhere can be derived easily. Using spherical symmetry, 
the current density J at a point P located at a distance r from the source is equal to the total current 
crossing a spherical surface with radius r (see Fig. 17.2): 

J=- / -u. (17.10) 

47tr 

where u r is the unit radial vector, and r is the distance between the electrode and the measurement point. 
The electric field is then obtained from Eq. (17.3): 



4710 r 2 



(17.11) 



The electric field is the gradient of the potential. In spherical coordinates, 



E = -^u (17.12) 

dr r 



Therefore, the potential at point P is obtained by integration: 



<j) = — — (17.13) 

A%Or 



) 2000 by CRC Press LLC 



It can be shown easily that this solution satisfies the Poisson equation. For a monopolar electrode, the 
current distribution is radial and is inversely proportional to the conductivity of the medium and the 
distance to the source. The potential decays to zero far away from the electrode and goes to infinity on 
the electrode. The singularity at r = can be eliminated by assuming that the electrode is spherical with 
finite radius a. Equation (17.13) is then valid on the surface of the electrode for r = a and for r > a 
[Nunez, 1981]. 

Equation (17.13) can be generalized to several monopolar electrodes. Assuming n electrodes with a 
current I t located at a distance r, from the recording point, the voltage is then given by 



l -Y^ (17.14) 

rrr> *-^ r. 



47C0 

n ' 

For an axon located in the volume conductor as shown in Fig. 17.2, the voltage along the axon located 
1 cm away from an anode with a 1-mA current sources is given by the following equation and is plotted 
in Fig. 17.2b: 

<\> = I (17.15) 

4710 \lcf + X 2 

The electric field along the axon can be obtained by taking the spatial derivative of Eq. (17.15) and is 
plotted in Fig. 17.2c. 

Potential From Bipolar Electrodes and Dipoles 

In the derivation of the potential generated by a monopolar electrode, the current enters the volume 
conductor at the tip of the electrode and exits at infinity. However, in the following example (saltwater 
tank), a current is applied through two monopolar electrodes separated by a distance d as shown in 
Fig. 17. 3a. The potential generated by this bipolar configuration can be calculated at point P (assuming 
that the voltage reference is at infinity) as follows: 



4710 



1 1 



(17.16) 



When the distance d between the two electrodes is small compared with the distance r, the equation for 
the potential is given by the following dipole equation valid for §> d: 

Id cos 9 

<\> = (17.17) 

47U0r 

where the angle 6 is defined as in Fig. 17.3. The current distribution for a current dipole is no longer 
radial and is shown in Fig. 17.3b. The voltage along a line perpendicular to the axis of the dipole and 
passing through a point equidistant from the electrodes is zero. Therefore, an axon located in that region 
would not be excited regardless of the current amplitude. The potential generated by a dipole is inversely 
proportional to the square of the distance and therefore decays faster than the monopole. Therefore, the 
dipole configuration will have a more localized excitation region. However, the excitation thresholds are 
higher because most of the current flows around the electrode. For an observation point located at a 
distance r from a dipole or monopole with the same current, the ratio of the monopole voltage to that 
of the dipole voltage is proportional to rid. Since r is assumed to be much larger than d, then the monopole 
voltage is much larger than the voltage generated by the dipole. 
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FIGURE 17.3 Voltage generated by bipolar electrodes, (a) Saltwater tank. Two electrodes are located within a large 
tank filled with a conduction solution. The voltage generated by the bipolar arrangement can be measured as shown 
provided that the reference electrode is located far away from the stimulation electrode. The effect of the 
stimulation electrode on the reference potential also can be taken into account, (h) Current lines and equipotential 
voltage distributions (dashed lines) for a dipole. The distance r between the observation Pand the electrodes is much 
greater that the distance d between the electrodes. 



Inhomogeneous Volume Conductors 

For practical applications of electric stimulation, electrodes are placed in various parts of the body. The 
volume conductors are clearly not homogeneous because we must consider the presence of bone with a 
conductivity significantly higher than that of extracellular space or even air above the skin with a 
conductivity of zero. How do those conductivities affect the potentials generated by the electrode? This 
question usually can only be answered numerically by computer models that take into account these 
various compartments such as finite-differences, finite-elements, or boundary-elements methods. A 
simple solution, however, can be obtained in the case of a semi-infinite homogeneous volume conductor 
using the method of images. Consider two volume conductors with conductivities o l and rj 2 separated 
by an infinite plane as shown in Fig. 17.4. A monopolar stimulating electrode is placed in region 1. 
Potential recordings are made in that same region. It can be shown that the inhomogeneous volume 
conductor can be replaced by a homogeneous volume by adding another current source located on the 
other side of the plane (see Fig. 17.4) with an amplitude equal to [Nunez, 1981] 






(17.18) 
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FIGURE 17.4 Method of images. The method of images can be used to calculate the voltage generated by an 
electrode in a homogeneous volume conductor. The two semiinfinite volume conductors with conductivities c^ and 
a 2 (a) are replaced by a single infinite volume conductor with conductivity a, and an additional image electrode (b). 

The voltage at point P is then given by Eq. (17.14). The mirror-image theory is only applicable in simple 
cases but can be useful to obtain approximations when the distance between the recording electrode and 
the surface of discontinuity is small, thereby approximating an infinite surface [Durand et al., 1992]. 

17.4 Electric Field Interactions with Excitable Tissue 

When axons are placed inside a volume conductor with a stimulation electrode, current flows according 
to equations derived in the preceding section. Some of the current lines enter and exit the axon at different 
locations and will produce excitation or inhibition. An example of this current distribution is shown in 
Fig. 17.5a for a monopolar anodic electrode. A length Ax of the axonal membrane can be modeled at 
rest by a capacitance C m in parallel with a series of combination of a battery (E T ) for the resting potential 
and a resistance R m stimulating the combined resistance at rest of all the membrane channels (see 
Fig. 17.5b). Nonlinear ionic channel conductances can be added in parallel with the membrane resistance 
and capacitance. Their contribution at rest is small and becomes significant only around threshold. Since 
we are interested here in how to drive the membrane potential toward threshold from resting values, 
their contribution is ignored. When current enters the membrane flowing from the outside to the inside, 
the membrane is hyperpolarized (moved closer to threshold), as illustrated in Fig. 17.5c. Similarly, when 
the current exits the membrane, depolarization is generated (the membrane voltage is moved closer to 
the threshold). In the case of an anodic electrode, illustrated in Fig. 17.5, stimulation should not take 
place directly underneath the electrode but further along the axon where the membrane is depolarized. 
A more quantitative approach to this problem can be obtained by modeling the interaction of the 
model with the applied current. The applied current J generates a voltage distribution in the extracellular 
space that can be calculated using Eq. (17.15) assuming a homogeneous and isotropic medium. An 
unmyelinated fiber is modeled as a one-dimensional network by linking together electrical models of the 
membrane with a resistance -R (1 to account for the internal resistance of the axon, as shown in Fig. 17.6a. 
The circuit can be stimulated using numerical methods [Koch and Segev, 1989] or by using already 
available general software packages such as Pspice or neuronal simulation packages such as Neuron 
[Hines, 1984]. The variable of interest is the transmembrane potential V m , since the sodium channels 
are sensitive to the voltage across the membrane. V m is defined as the difference between the intracellular 
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FIGURE 17.5 Effect of extracellular current on axon membrane polarization, (a) A monopolar anode is located 
near an axon, and current enters and exits the membrane, (b) At rest, the membrane can be modeled by a simple 
RC network, (c) When current enters the membrane, charge is added to the membrane capacitance, and additive 
membrane polarization is generated. Therefore, the membrane is hyperpolarized. (d) When current exits the axon, 
the membrane is depolarized. 

voltage Vj and the extracellular voltage V e minus the resting potential E r in order to reflect the change 
from resting values. Stimulation of the fiber can occur when the extracellular voltage difference between 
two nodes is large enough to generate transmembrane voltage greater than the firing threshold. Applying 
Kirchoff's law at each node and taking the limit when the length of membrane Ax goes to zero, one 
obtains the following inhomogeneous cable equation [Rail, 1979; Clark and Plonsey, 1966; Altman and 
Plonsey, 1988; Rattay, 1989]: 



d 2 V dV „ 
+ X »-- V 



d 2 V 



dx 1 



dt 



(17.19) 



V m and V e are the transmembrane and extracellular voltages, respectively. A, is the space constant of the 
fiber and depends only on the geometric and electric properties of the axon: 



1 \R S J 



(17.20) 



where -R^is the specific membrane resistance, .R^is the axoplasmic-specific resistance, and d is the 
diameter of the axon. x m is the time constant of the axon and is given by 



x =R C 

m mm 



(17.21) 
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FIGURE 1 7.6 Model of extracellular voltage and axon interactions. ( a) The effect of the extracellular voltage generated 
by the stimulating electrode on the axon can be modeled by directly connecting the membrane compartment to the 
extracellular source. The effect of the ion channels can be taken into account by adding nonlinear conductances and 
an equilibrium battery (not shown), (b) The axon with an extracellular voltage is a is equivalent to an axon inside an 
infinite conductance medium (zero voltage) and an equivalent voltage source inside the membrane. 

The term on the right-hand side of the equation is called the source term or forcing function and is the 
product of the square of the space constant with the second spatial derivative of the extracellular voltage. 
In order to explain the significance of this term, it can be shown easily that the model in Fig. 17.6a is 
equivalent to a model in which the extracellular space with its electrode and voltage has been replaced 
by a set of equivalent voltage sources inside the nerve given by 
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d 2 V 
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(17.22) 



The amplitude of the equivalent voltage sources is plotted in Fig. 17.7a for a 10-um axon stimulated by 
a 1-mA anodic current located 1 cm away from the axon. A positive value for the equivalent voltage 
source indicates membrane depolarization, while a negative value indicates hyperpolarization. The peak 
value of the depolarization indicates the excitation site. Therefore, one would predict that for anodic 
stimulation, excitation would take place at the peak of the two depolarized regions, and two action 
potentials would propagate toward the ends of the axon away from the electrode. Note that the shape of 
V calculated is similar to that predicted by simply examining the current flow in and out of the axon 
in Fig. 17.5. This analysis is valid only at the onset of the pulse, since during the pulse currents will be 
distributed throughout the cable and will affect the transmembrane potential [Warman et al., 1994]. 
However, it has been shown that for short pulses these effects are small, and the shape of the transmem- 
brane voltage can be predicted using the source term of the cable equation. 

Discrete Cable Equation for Myelinated Axons 

Electric stimulation of myelinated fibers is significantly different from that of unmyelinated fibers because 
the presence of a myelin sheath around the axon forces the current to flow in and out of the membrane 
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FIGURE 17.7 Eqivalent source voltage ( V eq ). A positive value of the V indicates membrane depolarization, and a 
negative value indicates membrane hyperpolarization. (a) Equivalent source voltage plotted along the axon for an 
anodic stimulus. The peak value is negative, and therefore, the membrane is hyperpolarized underneath the electrode. 
Depolarization occurs at the sites of maximum value of V eq (arrows), (b) Cathodic stimulation generates depolariza- 
tion underneath the electrode, and the amplitude of V eq is larger than for anodic stimulation. Therefore, a cathodic 
stimulus has a lower threshold than an anodic stimulus, (c) V eq is larger for fibers with larger diameters; therefore, 
large-diameter fibers are recruited first. 

only at the nodes (see Fig. 17.1). The action potential propagates from one node to the next (saltatory 
conduction). The effect of an applied electric field on a myelinated fiber can be described using a discrete 
model of the axon and extracellular voltage [McNeal, 1976; Rattay, 1989]. Such a model is shown in 
Fig. 17.8, in which voltages are applied at each node. The resistance R n represents the membrane resistance 
at the node of Ranvier only because the myelin resistance is considered very high for this model and is 
neglected. C„ is the capacitance at the node of Ranvier. R a represents the resistance between two nodes. 
The battery for the resting potential is removed for simplification by shifting the resting potential to zero. 
-R„, C„, and R t are given by 
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FIGURE 17.8 Discrete model for myelinated fibers. The resistance R n and capacitance C„ of the node of Ranvier 
is modeled with an additional internodal resistance R„. The resistance of the myelin is assumed to be infinite in this 
model. Extracellular voltages are calculated or measured at each node, and the transmembrane voltage can be 
estimated. 
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where d is the inner fiber diameter, I is the width of the node, and L is the internodal distance (see 
Fig. 17.8). The cable equation for this model can be derived using Kirchoff's law: 
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A 2 is the second different operator: A 2 V= V n _ l — 2V n + V n+V Using Eqs. (17.23) to (17.25), one can show 
that R„C n and R n /R a are independent of the diameter, since LID = 100, dID = 0.7 and I is constant. 
Therefore, the left side of Eq. (17.26) is independent of the diameter. The source term, however, does 
depend on the diameter but only implicity. Since the distance between the node increases with the 
diameter of the fiber, the voltage across the nodes also increases, suggesting that fibers with larger 
diameters are more easily excitable because they "see" larger potentials along the nodes. 

Equivalent Cable Equation for Myelinated Fibers 

The diameter dependence of these fibers can be expressed explicitly by using an equivalent cable equation 
recently derived [Basser, 1993] and adapted here to take into effect the extracellular voltage: 
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where X is the equivalent space constant for the case of an axon with myelin sheath of infinite resistance 
and is defined as 
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Unmyelinated/Myelinated Fibers 

The equivalent cable equation for the myelinated fibers is similar to Eq. (17.20) derived for unmyelinated 
axons. The forcing function is also proportional to the first derivative of the extracellular electric field 
along the nerve. The dependence on the diameter can be observed directly by expressing the equivalent 
voltage source for myelinated fibers ( V ' ) as function of the inner diameter d: 

R s d 1 d 2 V 

V =35.7^ '- (17.29) 

R s I dx 2 

a 

Equation (17.29) shows that V is proportional to the square of the diameter of fiber d, while the 
equivalent voltage source for an unmyelinated fiber is proportional to its diameter (17.23). Therefore, 
the change in membrane voltage at t = for myelinated and unmyelinated fibers is proportional to d 2 
and d respectively. 

Anodic/Cathodic Stimulation 

It has been demonstrated experimentally that in the case of electric stimulation of peripheral nerves, 
cathodic stimulation has a lower threshold (less current required) than anodic stimulation. This exper- 
imental result can be explained directly by plotting V for a cathodic and an anodic electrode (1 mA) 
located 1 cm away from a 10-um unmyelinated fiber (see Fig. 17.7). The maximum value of V for anodic 
stimulation is 0.05 mV at the two sites indicated by the arrow. However, the maximum depolarization 
for the cathodic electrode is significantly larger at 0.2 mV, with the site of excitation located directly 
underneath the electrode. In special cases, such as an electrode located on the surface of a cortex, cathodic 
stimulation can have a higher threshold [Ranck, 1975]. 

Large/Small-Diameter Axons 

The equivalent voltage source of the cable equation is proportional to the square of the space constant X . 
X 2 is proportional to the diameter of the fiber (17.20) for myelinated fibers and to the square of the 
diameter (17.29) for myelinated fibers. Therefore, in both cases, V is higher for fibers with larger 
diameters (see Fig. 17.7c), and large-diameter fibers have a lower threshold. Since the physiologic recruit- 
ment order by the CNS is to first recruit the small fibers followed by large ones, electric stimulation 
produces a reverse recruitment order. However, techniques have been developed to recruit small fibers 
before large fibers by using a different stimulation waveform [Fang and Mortimer, 1991]. Since X 2 is also 
dependent on the electrical properties of the axons, it is then possible to predict that fibers with a larger 
membrane resistance or lower axoplasmic resistance also will have lower thresholds. 

Spatial Derivative of the Electric Field 

The first spatial derivative (or the second spatial derivative of the voltage along the nerve) is responsible 
for electric excitation of the nerve. Therefore, an electric field with a nonzero second spatial derivative 
is required for excitation. An axon with a linearly decreasing voltage distribution would not be excited 
despite the presence of a large voltage difference along the axon. This is due to the fact that a linear 
voltage distribution gives a constant electric field, and therefore, the spatial derivative of the field is zero. 

Activating Function 

The second spatial derivative term of the equivalent voltage is also known as the activation function 
[Rattay, 1990]: 
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This function can be evaluated from knowledge of the extracellular voltage alone and can be used to 
predict the location of excitation. For unmyelinated fibers, the activation function does not contain any 
information about the axon to be stimulated. In the case of myelinated fibers, where the voltage is 
evaluated at the nodes of Ranvier, the activating function becomes 
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(17.31) 



The new function contains implicit information about the fiber diameter, since the distance L between 
the nodes of Ranvier is directly proportional to the fiber diameter D (L = 100 x D). the diameter 
dependence can be made explicit in Eq. (17.29). 

Net Driving Function 

The equivalent voltage source or the activating function represents only the source term in the electrical 
model of the axon (see fig. 17.6). However, the transmembrane voltage is determined by a weighted sum 
of the currents flowing at all the nodes. A net driving function that takes into account both the source 
term at each node and the passive redistribution from sources at other nodes has been defined and found 
useful for accurate prediction of the excitation threshold for any applied field [Warman et al., 1994]. 

Current-Distance Relationship 

The amount of current required to activate a fiber with a given diameter depends on its geometry but 
also on its distance from the electrode. The farther away the fiber, the lower is the voltage along the fiber 
(17.15); therefore, larger current will be required to reach threshold. This effect is illustrated in Fig. 17.9 
for myelinated fibers. The distance as a function of current amplitude at threshold is plotted for several 
experiments [Ranck, 1975]. With a current of 1mA, all fibers within 2 mm are activated. The calculated 
current-distance relationship for a 10-u.m fiber has been shown to approximate well the experimental 
data (see dashed line in Fig. 17.9) [Rattay, 1989]. The current- distance relationship is linear only for 
small distances. For distances above 1mm, doubling the distance will require four times the current 
amplitude. 
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FIGURE 17.9 Current-distance relationship for monopolar cathodic stimulation of myelinated axons. The distance 
between the axon and the electrode is plotted as a function of current threshold amplitude for many experiments 
from several authors. The dashed line shows the current-distance relationship calculated for a 10-um fiber stimulated 
with a 200-us pulse. [From Rattay, 1989, with permission.] 
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FIGURE 17.10 (a) Longitudinal/transverse electrode placement, (b) Anodal surround block. 



Longitudinal/Transverse Field 

The equivalent voltage source is also proportional to the second spatial derivative of the potential present 
at each point along the axon. It is important to note that it is the longitudinal component of the electric 
field that is responsible for exciting the nerve. Therefore, electrodes placed longitudinally generate the 
most efficient stimulus, since this placement would produce a large longitudinal electric field component. 
Conversely, electrodes placed transversely (on each side of the axon) require a much higher current, since 
the largest component of the field does not contribute to excitation of the nerve (Fig. 17.10a). 

Anodal Surround Block 

As shown by the current-distance relation, the current amplitudes required for excitation decrease with 
distance. This is not entirely true for cathodic stimulation. Cathodic stimulation produces membrane 
depolarization underneath the electrode and membrane hyperpolarization on both sides of the electrodes 
(see Fig. 17.7fe). As the current amplitude is increased, the hyperpolarization also increases and can block 
propagation of the action potential along the axon. This effect is known as anodal surround block and is 
shown in Fig. 17.10k It is possible to identify three regions around the electrode each giving different 
responses. There is a spherical region close to the electrode (I) in which no excitation will take place due 
to the surround block. Fibers located in the region (II) are excited, and fibers still further away (III) from 
the electrode are below threshold and are not excited. 
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Unidirectionally Propagated Action Potentials 

Electric stimulation of nerves with electrodes normally depolarizes the membrane to threshold, producing 
two action potentials propagating in opposite directions, as shown in Fig. 17.1. Stimulation techniques 
have been developed to generate action potentials propagating in one direction only [Sweeney and 
Mortimer, 1986]. The techniques rely on the fact that bipolar stimulation generates depolarization under 
the cathode and hyperpolarization under the anode. By increasing the amount of hyperpolarization 
relative to the depolarization, the active potential generated underneath the cathode and traveling toward 
the anode can be blocked, while the action potential traveling in the other direction can escape. 

17.5 Electrode-Tissue Interface 

At the interface between the electrode and the tissue, the shape of the waveform can influence the 
threshold for activation as well as the corrosion of the electrode and the tissue damage generated. 

Effect of Stimulation Waveform on Threshold 

Strength-Duration Curve 

It has been known for a long time that it is the time change in the applied current and not the continuous 
application of the external stimulus that can excite. Direct current (dc current) cannot excite and even 
in small amplitude can cause significant tissue damage. It also has been observed experimentally that the 
relationship between the pulse width and the amplitude suggests that it is the total charge injected that 
is the important parameter. This relationship between the amplitude and the width of a pulse required 
to bring an excitable tissue to threshold is shown in Fig. 17.11a. The amplitude of the current threshold 
stimulus J lh decreases with increasing pulse width W and can be modeled by the following relationship 
derived experimentally [Lapicque, 1907]: 

L = t^TTa (17 - 32) 

w 

1 — exp 

The smallest current amplitude required to cause excitation is known as the rheobase current I rh . Tis the 
membrane time constant of the axon is stimulated intracellularly. For extracellular stimulation, T is a 
time constant that takes into account the extracellular space resistance. The relationship between current 
amplitude and pulse width also can be derived theoretically using the cable equation by assuming that 
total charge on the cable for excitation is constant [Jack et al., 1983]. 

Charge Duration Curve 

The threshold charge injected Q th = I th x W is plotted in Fig. 17.1 lfo and increases as the pulse width 
increases: 

I W 

Q th = V^T\ ( 17 - 33 ) 



1 — exp 



W 



The increase in the amount of charge required to fire the axon with increasing pulse width is due to the 
fact that for long pulse duration, the charge is distributed along the cable and does not participate directly 
in raising the membrane voltage at the excitation site. The minimum amount of charge Q min required 
for stimulation is obtained by taking the limit of Q th (17.33) when W goes to zero is equal to I th T. In 
practice, this minimum charge can be nearly achieved by using narrow, current pulses. 



© 2000 by CRC Press LLC 



Strength-Duration Curve 




100 200 300 

Chronarie Pul Be Width (uS) 



B 



60 



o 




e 






so 


•o 








O 




■s 


40 


S 




e 


30 


a 




$ 




2 


20 


o 





Qmin10 



Charge-Duration Curve 



100 200 300 400 500 

Pulse Width (uS) 

FIGURE 17.11 Effect of pulse width on excitation threshold, (a) Strength-duration curve. The amplitude of the 
current required to reach threshold decreases with width of the pulse. This effect can be derived theoretically by 
assuming that the charge on the cable at threshold is constant, (b) The threshold charge (amplitude times pulsewidth) 
injected into the tissue increases with the pulsewidth. Narrow pulses are recommended to minimize charge injection. 

Anodic Break 

Excitation generated by cathodic current threshold normally takes place at the onset of the pulse. However, 
long-duration, subthreshold cathodic or anodic current pulses have been observed experimentally to 
generate excitation at the end of the pulse. This effect has been attributed to the voltage sensitivity of 
the sodium channel. The sodium channel is normally partially inactivated at rest (see Chap. 1). However, 
when the membrane is hyperpolarized during the long-duration pulse, the inactivation of the sodium 
channel is completely removed. Upon termination of the pulse, an action potential is generated, since 
the inactivation gate has a slow time constant relative to the activation time constant and cannot recover 
fast enough [Mortimer, Chap. 3 in Agnew and McCreery, 1990]. This effect can be observed with an 
anodic or cathodic pulse, since both can generate hyperpolarization. Anodic break can be prevented by 
avoiding abrupt termination of the current. Pulse shapes with slow decay phases such as exponential or 
trapezoidal decay shapes have been issued successfully [Fang and Mortimer, 1991]. 

Electrochemistry of Stimulation 

Conduction in metal is carried by electrons, while in tissue, current is carried by ions. Although capacitive 
mechanisms have been tested, electrodes have not yet been developed that can store enough charge for 
stimulation. Therefore, most electric stimulation electrodes rely on faradaic mechanisms at the interface 
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between the metal and the tissue. Faradaic mechanisms require that oxidation and reduction take place 
at the interface [Roblee and Rose, Chap. 2 in Agnew and McCreery, 1990]. Faradaic stimulation mech- 
anisms can be divided into reversible and nonreversible mechanisms. Reversible mechanisms occur at 
or near the electrode potential and include oxide formation and reduction and hydrogen plating. Irre- 
versible mechanisms occur when the membrane is driven far away from its equilibrium potential and 
include corrosion and hydrogen or oxygen evolution. These irreversible processes can cause damage to 
both the electrode and the tissue, since they alter the composition of the electrode surface and can generate 
toxic products with pH changes in the surrounding tissue. During charge injection, the electrode potential 
is modified by an amount related to the charge density (total charge divided by the surface area). In 
order to maintain the electrode potential within regions producing only minimal irreversible changes, 
this charge density must be kept below some values. The maximum charge density allowed depends on 
the metal used for the electrode, the stimulation waveform, the type of electrode used, and the location 
of the electrode within the body. 

Stimulation of Brain Tissue 

Electrodes can be placed on the surface of the brain and directly into the brain to activate CNS pathways. 
Experiments with stimulation of electrode arrays made of platinum and placed on the surface of the 
brain indicate that damage was produced for charge densities between 50 to 300 uC/cm 2 but that the 
total charge per phase also was an important factor and should be kept below 3 uC [Pudenz et al., 1975a, b] . 
Intracortical electrodes with small surface areas can tolerate a charge density as high as 1600 uC/cm 2 , 
provided that the charge per phase remains below 0.0032 uC [Agnew et al., 1986]. 

Stimulation of the Peripheral Nerve 

The electrodes used in the peripheral nerves are varied and include extraneural designs such as the spiral 
cuffs or helix electrodes [Naples et al., 1991] with electrode contacts directly on the surface of the nerve 
or intraneural designs placing electrodes contacts directly inside the nerve [Nannini and Horch, 1991; 
Rutten et al., 1991]. Intraneural electrodes can cause significant damage, since electrodes are inserted 
directly into the nerve through the perineurium. However, these electrodes can display good selectivity. 
Extraneural electrodes are relatively safe because the newer designs such as the spiral or helix are self- 
sizing and allow for swelling without compression but display poor selectivity. New designs are aimed 
at producing selective stimulation by recruiting only small portions of the nerve and little damage [Veraart 
et al., 1990; Tyler and Durand, 1997]. Damage in peripheral nerve stimulation can be caused by the 
constriction of the nerve as well as by neuronal hyperactivity and irreversible reactions at the electrode 
[McCreery et al, 1992]. 

Stimulation of Muscle Tissue 

Muscle tissue can best be excited by electrodes located on the nerve supplying the muscle [Popovic, 1991]. 
However, for some applications, electrodes can be placed directly on the surface of the skin (surface 
stimulation) [Myklebust et al., 1985], directly into the muscle (intramuscular electrode) [Caldwell and 
Reswick, 1975], or on the surface of the muscle (epimysial electrode) [Grandjean and Mortimer, 1985]. 
The current thresholds are higher when compared with nerve stimulation unless the electrode is carefully 
placed near the point of entry of the nerve [Mortimer, 1981]. Stainless steel is often used for these 
electrodes and is safe below 40 uC/cm 2 for coiled wire intramuscular electrodes [Mortimer et al., 1980]. 

Corrosion 

Corrosion of the electrode is a major concern because it can cause electrode damage, metal dissolution, 
and tissue damage. However, corrosion occurs only during the anodic phase of the stimulation. Therefore, 
by using monophasic waveforms as shown in Fig. \7.\2b, corrosion can be avoided. Conversely, the 
monophasic anodic waveform (Fig. 17.12a) must be avoided because it will cause corrosion. (This not 
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FIGURE 17.12 Comparison of stimulation waveforms. The waveforms are ranked for their ability to generate low 
threshold stimulation, low corrosion, and low tissue damage. 

true, however, for capacitive electrode metals such as tantalum, for which a dielectric layer of tantalum 
pentoxide is formed during the anodic phase and reduced during the cathodic phase.) For most appli- 
cations, cathodic stimulation has a lower threshold than anodic stimulation. It appears, therefore, that 
monophasic cathodic waveforms (Fig. 17.11a) would be a preferred stimulation waveform, since it 
minimizes both the current to be injected and the corrosion. However, since the current only flows in 
one direction, the chemical reactions at the interface are not reversed, and the electrode is driven in the 
irreversible region. 

Tissue Damage 

Electrodes operating in the irreversible region can cause significant tissue damage because irreversible 
processes can modify the pH of the surrounding tissue and generate toxic products. Balanced biphasic 
waveforms are preferred because the second phase can completely reverse the charge injected into the 
tissue. Provided that the amplitude of the current is small, the electrode voltage can then be maintained 
within the reversible region. Waveforms that have the most unrecoverable charge are the most likely to 
induce tissue damage. Tissue damage also can be caused by generating a high rate of neural activity 



(D 2000 by CRC Press LLC 



[Agnew et al., Chap. 6 in Agnew and McCreery, 1990]. The mechanisms underlying this effect are still 
unclear but could include damage to the blood-nerve barrier, ischemia, or a large metabolic demand on 
the tissue leading to changes in ionic concentration both intracellularly and extracellularly. 

Biphasic Waveforms 

Common biphasic waveforms for stainless steel or platinum use a cathodic pulse followed by an anodic 
phase. An example of a square balanced biphasic waveform is shown in Fig. 17.12c. Another commonly 
used biphasic waveform easily implemented with capacitor and switches is shown in Fig. 17.12e. This 
waveform ensures that the charge is exactly balanced, since a capacitor is inserted in series with the tissue 
to be stimulated and the charge injected is then reversed by discharging the capacitor [Mortimer, 1981]. 
Biphasic cathodic-first waveforms have a higher threshold than monophasic waveforms because the 
maximum depolarization induced by the cathodic pulse is decreased by the following anodic pulse 
[Mortimer, Chap. 3 in Agnew and McCreery, 1990]. A delay can then be inserted between the cathodic 
and anodic phases, as shown in Fig. 17.12d. However, the time delay also can prevent adequate charge 
reversal and can be dangerous to the electrode and the tissue. An alternative method is to decrease the 
maximum amplitude of the anodic phase but increase its length as shown in Fig. 17.12/ However, this 
also can be damaging to the electrode because the charge is not reversed fast enough following the 
cathodic pulse. The various waveforms shown in Fig. 17.12 are ranked for their effect on tissue damage, 
corrosion, and threshold of activation. 

17.6 Conclusion 

Electric stimulation of excitable tissue has been used for over a century, and important discoveries have 
been made concerning the mechanisms underlying the interaction between the applied fields with the 
tissue and the electrochemistry at the electrode-tissue interface. It is now clear that electric stimulation 
is a powerful technique that could potentially activate any excitable tissue in the body and replace damaged 
functions. It is also clear, however, that if our goal is to provide intimate interfacing between excitable 
tissue and electrodes in order to reproduce the normal function of the nervous system, present technology 
is not adequate. The electrodes are too big and can only access either a few elements separately or a large 
number of elements simultaneously. Moreover, it is also clear that our understanding of the electrochem- 
istry at the interface between the electrode and tissue is limited as well as the mechanisms underlying 
tissue damage. However, the substantial gains to be made are worth the efforts required to solve these 
problems. 
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MECHANICS IS THE ENGINEERING SCIENCE that deals with studying, defining, and math- 
ematically quantifying "interactions" that take place among "things" in our universe. Our 
ability to perceive the physical manifestation of such interactions is embedded in the concept 
of a force, and the "things" that transmit forces among themselves are classified for purposes of analysis 
as being solid, fluid, or some combination of the two. The distinction between solid behavior and fluid 
behavior has to do with whether or not the "thing" involved has disturbance-response characteristics 
that are time-rate-dependent. A constant force transmitted to a solid material will generally elicit a 
discrete, finite, time-independent deformation response, whereas the same force transmitted to a fluid 
will elicit a continuous, time-dependent response called flow. In general, whether or not a given material 
will behave as a solid or a fluid often depends on its thermodynamic state (i.e., its temperature, pressure, 
etc.). Moreover, for a given thermodynamic state, some "things" are solid-like when deformed at certain 
rates but show fluid behavior when disturbed at other rates, so they are appropriately called viscoelastic, 
which literally means, "fluid-solid." Thus a more technical definition of mechanics is the science that deals 
with the action of forces on solids, fluids, and viscoelastic materials. What makes mechanics biomechanics 
is the fact that biomechanics is the science that deals with the time and space response characteristics of 
biologic solids, fluids, and viscoelastic materials to imposed systems of internal and external forces. 

As early as the fourth century B.C., we find in the works of Aristotle (384-322 B.C.) attempts to describe 
through geometric analysis the mechanical action of muscles in producing locomotion of parts or all of 
the animal body. Nearly two thousand yeas later, in his famous anatomic drawings, Leonardo da Vinci 
(a.d. 1452-1519) sought to describe the mechanics of standing, walking up and down hill, rising from 
a sitting position, and jumping, and Galileo (a.d. 1564-1643) followed a hundred years later with some 
of the earliest attempts to mathematically analyze physiologic function. Because of his pioneering efforts 
in defining the anatomic circulation of blood, William Harvey (a.d. 1578-1657) is credited by many as 
being the father of modern-day bio fluid mechanics, and Alfonso Borelli (a.d. 1608-1679) shares the same 
honor for contemporary biosolid mechanics because of his efforts to explore the amount of force 
produced by various muscles and his theorization that bones serve as levers that are operated and 
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controlled by muscles. The early work of these pioneers of biomechanics was followed up by the likes of 
Sir Isaac Newton (a.d. 1642-1727), Daniel Bernoulli (a.d.- 1700-1782), Jean L. M. Poiseuille 
(a.d. 1799-1869), Thomas Young (a.d. 1773-1829), Euler (whose work was published in 1862), and others 
of equal fame. To enumerate all their individual contributions would take up much more space than is 
available in this short introduction, but there is a point to be made if one takes a closer look at the names 
involved. 

In reviewing the preceding list of biomechanical scientists, it is interesting to observe that many of the 
earliest contributions to our ultimate understanding of the fundamental laws of physics and engineering 
(e.g., Bernoulli's equation of hydrodynamics, the famous Young's modulus in elasticity theory, Poiseuille 
flow, and so on) came from physicians, physiologists, and other health care practitioners seeking to study 
and explain physiologic structure and function. The irony in this is that as history has progressed, we 
have just about turned this situation completely around. That is, more recently, it has been biomedical 
engineers who have been making the greatest contributions to the advancement of the medical and 
physiologic sciences. These contributions will become more apparent in the chapters that follow in this 
section. The individual chapters address the subjects of biosolid mechanics and biofluid mechanics as 
they pertain to various subsystems of the human body. 

Since the physiologic organism is 60 to 75% fluid, it is not surprising that the subject of biofluid 
mechanics should be so extensive, including — but not limited to — lubrication of human synovial joints 
(Chapter 21), cardiac biodynamics (Chapter 28), mechanics of heart valves (Chapter 29), arterial macro- 
circulatory hemodynamics (Chapter 30), mechanics and transport in the microcirculation (Chapter 31), 
venous hemodynamics (Chapter 33), mechanics of the lymphatic system (Chapter 34), cochlear mechan- 
ics (Chapter 35), and vestibular mechanics (Chapter 36). The area of biosolid mechanics is somewhat 
more loosely defined — since all physiologic tissue is viscoelastic and not strictly solid in the engineering 
sense of the word. Also generally included under this heading are studies of the kinematics and kinetics 
of human posture and locomotion, i.e., biodynamics, so that under the generic section on biosolid 
mechanics in this Handbook you will find chapters addressing the mechanics of hard tissue (Chapter 18), 
the mechanics of blood vessels (Chapter 19) or, more generally, the mechanics of viscoelastic tissue, 
mechanics of joint articulating surface motion (Chapter 20), musculoskeletal soft tissue mechanics 
(Chapter 22), mechanics of the head/neck (Chapter 23), mechanics of the chest/ abdomen (Chapter 24), 
the analysis of gait (Chapter 25), exercise physiology (Chapter 26), biomechanics and factors affecting 
mechanical work in humans (Chapter 27), and mechanics and deformability of hematocytes (blood cells) 
(Chapter 32). In all cases, the ultimate objectives of the science of biomechanics are generally twofold. 
First, biomechanics aims to understand fundamental aspects of physiologic function for purely medical 
purposes, and second, it seeks to elucidate such function for mostly nonmedical applications. 

In the first instance above, sophisticated techniques have been and continue to be developed to monitor 
physiologic function, to process the data thus accumulated, to formulate inductively theories that explain 
the data, and to extrapolate deductively, i.e., to diagnose why the human "engine" malfunctions as a result 
of disease (pathology), aging (gerontology), ordinary wear and tear from normal use (fatigue), and/or 
accidental impairment from extraordinary abuse (emergency medicine). In the above sense, engineers 
deal directly with causation as it relates to anatomic and physiologic malfunction. However, the work 
does not stop there, for it goes on to provide as well the foundation for the development of technologies 
to treat and maintain {therapy) the human organism in response to malfunction, and this involves 
biomechanical analyses that have as their ultimate objective an improved health care delivery system. 
Such improvement includes, but is not limited to, a much healthier lifestyle (exercise physiology and 
sports biomechanics), the ability to repair and/or rehabilitate body parts, and a technology to support 
ailing physiologic organs (orthotics) and/or, if it should become necessary, to replace them completely 
(with prosthetic parts). Nonmedical applications of biomechanics exploit essentially the same methods 
and technologies as do those oriented toward the delivery of health care, but in the former case, they 
involve mostly studies to define the response of the body to "unusual" environments — such as subgravity 
conditions, the aerospace milieu, and extremes of temperature, humidity, altitude, pressure, acceleration, 
deceleration, impact, shock, and vibration, and so on. Additional applications include vehicular safety 
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considerations, the mechanics of sports activity, the ability of the body to "tolerate" loading without 
failing, and the expansion of the envelope of human performance capabilities — for whatever purpose! 
And so, with this very brief introduction, let us take somewhat of a closer look at the subject of 
biomechanics. 




Free body diagram of the foot. 
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Hani tissue, mineralized tissue, and calcified tissue are often used as synonyms for bone when describing 
the structure and properties of bone or tooth. The hard is self-evident in comparison with all other 
mammalian tissues, which often are referred to as soft tissues. Use of the terms mineralized and calcified 
arises from the fact that, in addition to the principle protein, collagen, and other proteins, glycoproteins, 
and protein-polysaccherides, comprising about 50% of the volume, the major constituent of bone is a 
calcium phosphate (thus the term calcified) in the form of a crystalline carbonate apatite (similar to 
naturally occurring minerals, thus the term mineralized). Irrespective of its biological function, bone is 
one of the most interesting materials known in terms of structure-property relationships. Bone is an 
anisotropic, heterogeneous, inhomogeneous, nonlinear, thermorheologically complex viscoelastic mate- 
rial. It exhibits electromechanical effects, presumed to be due to streaming potentials, both in vivo and 
in vitro when wet. In the dry state, bone exhibits piezoelectric properties. Because of the complexity of 
the structure-property relationships in bone, and the space limitation for this chapter, it is necessary to 
concentrate on one aspect of the mechanics. Currey [1984] states unequivocally that he thinks, "the most 
important feature of bone material is its stiffness." This is, of course, the premiere consideration for the 
weight-bearing long bones. Thus, this chapter will concentrate on the elastic and viscoelastic properties 
of compact cortical bone and the elastic properties of trabecular bone as exemplar of mineralized tissue 
mechanics. 

18.1 Structure of Bone 

The complexity of bone's properties arises from the complexity in its structure. Thus it is important to 
have an understanding of the structure of mammalian bone in order to appreciate the related properties. 
Figure 18.1 is a diagram showing the structure of a human femur at different levels [Park, 1979]. For 
convenience, the structures shown in Fig. 18.1 will be grouped into four levels. A further subdivision of 
structural organization of mammalian bone is shown in Fig. 18.2 [ Wainwright et al., 1982]. The individual 
figures within this diagram can be sorted into one of the appropriate levels of structure shown on Fig. 18.1 
as described in the following. At the smallest unit of structure we have the tropocollagen molecule and 
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the associated apatite crystallites (abbreviated Ap). The former is approximately 1.5 by 280 nm, made 
up of three individual left-handed helical polypeptide (alpha) chains coiled into a right-handed triple 
helix. Ap crystallites have been found to be carbonate-substituted hydroxyapatite, generally thought to 
be nonstoichiometric. The crystallites appear to be about 4 x 20 x 60 nm in size. This level is denoted 
the molecular. The next level we denote the ultrastructural. Here, the collagen and Ap are intimately 
associated and assembled into a microfibrilar composite, several of which are then assembled into fibers 
from approximately 3 to 5 um thick. At the next level, the microstructural, these fibers are either randomly 
arranged (woven bone) or organized into concentric lamellar groups (osteons) or linear lamellar groups 
(plexiform bone). This is the level of structure we usually mean when we talk about bone tissue properties. 
In addition to the differences in lamellar organization at this level, there are also two different types of 
architectural structure. The dense type of bone found, for example, in the shafts of long bone is known 
as compact or cortical bone. A more porous or spongy type of bone is found, for example, at the 
articulating ends of long bones. This is called cancellous bone. It is important to note that the material 
and structural organization of collagen-Ap making up osteonic or haversian bone and plexiform bone 
are the same as the material comprising cancellous bone. 

Finally, we have the whole bone itself constructed of osteons and portions of older, partially destroyed 
osteons (called interstitial lamellae) in the case of humans or of osteons and/or plexiform bone in the 
case of mammals. This we denote the macrostructural level. The elastic properties of the whole bone 
results from the hierarchical contribution of each of these levels. 

18.2 Composition of Bone 

The composition of bone depends on a large number of factors: the species, which bone, the location 
from which the sample is taken, and the age, sex, and type of bone tissue, e.g., woven, cancellous, cortical. 
However, a rough estimate for overall composition by volume is one-third Ap, one-third collagen and 
other organic components, and one-third H 2 0. Some data in the literature for the composition of adult 
human and bovine cortical bone are given in Table 18.1. 
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FIGURE 18.2 Diagram showing the structure of mammalian bone at different levels. Bone at the same level is 
drawn at the same magnification. The arrows show what types may contribute to structures at higher levels [Wain- 
wright et al„ 1982] (Courtesy Princeton University Press), (a) Collagen fibril with associated mineral crystals. 
( b) Woven bone. The collagen fibrils are arranged more or less randomly. Osteocytes are not shown. ( c) Lamellar 
bone. There are separate lamellae, and the collagen fibrils are arranged in "domains" of preferred fibrillar orientation 
in each lamella. Osteocytes are not shown, (d) Woven bone. Blood channels are shown as large black spots. At this 
level woven bone is indicated by light dotting, (e) Primary lamellar bone. At this level lamellar bone is indicated by 
fine dashes, (f) Haversian bone. A collection of Haversian systems, each with concentric lamellae round a central 
blood channel. The large black area represents the cavity formed as a cylinder of bone is eroded away. It will be filled 
in with concentric lamellae and form a new Haversian system, (g) Laminar bone. Two blood channel networks are 
exposed. Note how layers of woven and lamellar bone alternate, (h) Compact bone of the types shown at the lower 
levels, (i) Cancellous bone. 



TABLE 18.1 Composition of Adult Human and Bovine Cortical Bone 
Species % H 2 Q Ap % Dry Weight Collagen GAG* 



Reference 



Bovine 
Human 



9.1 

7.3 



76.4 
67.2 



21.5 
21.2 



N.Dt Herring, 1977 

0.34 Pellagrino and Blitz, 1965; Vejlens, 1971 



* Glycosaminoglycan 
f Not determined 
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18.3 Elastic Properties 

Although bone is a viscoelastic material, at the quasi-static strain rates in mechanical testing and even 
at the ultrasonic frequencies used experimentally, it is a reasonable first approximation to model cortical 
bone as an anisotropic, linear elastic solid with Hooke's law as the appropriate constitutive equation. 
Tensor notation for the equation is written as: 



' ^ijkl e kl 



(18.1) 



where a,-.- and e kl are the second-rank stress and infinitesimal second rank strain tensors, respectively, and 
C- t : kl is the fourth-rank elasticity tenor. Using the reduced notation, we can rewrite Eq. (18.1) as 



'<) 



:lt06 



(18.2) 



where the Q are the stiffness coefficients (elastic constants). The inverse of the Q, the S,v, are known as 
the compliance coefficients. 

The anisotropy of cortical bone tissue has been described in two symmetry arrangements. Lang [1969], 
Katz and Ukraincik [1971], and Yoon and Katz [1976a,b] assumed bone to be transversely isotropic with 
the bone axis of symmetry (the 3 direction) as the unique axis of symmetry. Any small difference in 
elastic properties between the radial (1 direction) and transverse (2 direction) axes, due to the apparent 
gradient in porosity from the periosteal to the endosteal sides of bone, was deemed to be due essentially 
to the defect and did not alter the basic symmetry. For a transverse isotropic material, the stiffness matrix 
[Qj] is given by 
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(18.3) 



where C 66 = 1/2 (C n - C 12 ). Of the 12 nonzero coefficients, only 5 are independent. 

However, Van Buskirk and Ashman [1981] used the small differences in elastic properties between the 
radial and tangential directions to postulate that bone is an orthotropic material; this requires that 9 of 
the 12 nonzero elastic constants be independent, that is, 
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(18.4) 



Corresponding matrices can be written for the compliance coefficients, the S, ■, based on the inverse 
equation to Eq. (18.2): 



So 

'i 1 



i, j = 1 to 6 



(18.5) 
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where the S,yth compliance is obtained by dividing the [ CJ stiffness matrix, minus the rth row and ;th 
column, by the full [Q] matrix and vice versa to obtain the C« in terms of the S, •. Thus, although S 33 = 
l/£ 3 , where £ 3 is Young's modulus in the bone axis direction, £ 3 ■£ C 33 , since C 33 and S 33 , are not reciprocals 
of one another even for an isotropic material, let alone for transverse isotropy or orthotropic symmetry. 
The relationship between the compliance matrix and the technical constants such as Young's modulus 
(Ei) shear modulus (Gi) and Poisson's ratio (v ; ) measured in mechanical tests such as uniaxial or pure 
shear is expressed in Eq. (18.6). 
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(18.6) 



Again, for an orthotropic material, only 9 of the above 12 nonzero terms are independent, due to the 
symmetry of the S, tensor: 



(18.7) 



For the transverse isotropic case, Eq. (18.5) reduces to only 5 independent coefficients, since 



E 1 = E 2 



G, 



G, 



32 

E 



(U 



12 ~ 2 ( l + Vn) 



In addition to the mechanical tests cited above, ultrasonic wave propagation techniques have been 
used to measure the anisotropic elastic properties of bone [Lang, 1969; Yoon and Katz, 1976a,b; Van 
Buskirk and Ashman, 1981]. This is possible, since combining Hooke's law with Newton's second law 
results in a wave equation which yields the following relationship involving the stiffness matrix: 



pv 2 u„, 



C N N U 

mrns r s n 



(18.9) 



where p is the density of the medium, Vis the wave speed, and U and N are unit vectors along the particle 
displacement and wave propagation directions, respectively, so that Um, Nr, etc. are direction cosines. 

Thus to find the five transverse isotropic elastic constants, at least five independent measurements are 
required, e.g., a dilatational longitudinal wave in the 2 and 1(2) directions, a transverse wave in the 
13 (23) and 12 planes, etc. The technical moduli must then be calculated from the full set of Q. For 
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TABLE 18.2 Elastic Stiffness Coefficients for Various Human and Bovine Bones; All Measurements Made 
with Ultrasound except for Knets [1978] Mechanical Tests 



Experiments 
(Bone Type) 


(GPa) 


C„ 
(GPa) 


(GPa) 


C 44 
(GPa) 


Q 5 

(GPa) 


Q6 

(GPa) 


c 12 

(GPa) 


C 13 

(GPa) 


c 2 , 

(GPa) 


Van Buskirk and Ashman 


14.1 


18.4 


25.0 


7.00 


6.30 


5.28 


6.34 


4.84 


6.94 


[1981] (bovine femur) 




















Knets [1978] (human tibia) 


11.6 


14.4 


22.5 


4.91 


3.56 


2.41 


7.95 


6.10 


6.92 


Van Buskirk and Ashman 


20.0 


21.7 


30.0 


6.56 


5.85 


4.74 


10.9 


11.5 


11.5 


[1981] (human femur) 




















Maharidge [1984] (bovine 


21.2 


21.0 


29.0 


6.30 


6.30 


5.40 


11.7 


12.7 


11.1 


femur haversian) 
Maharidge [1984] (bovine 


22.4 


25.0 


35.0 


8.20 


7.10 


6.10 


14.0 


15.8 


13.6 


femur plexiform) 





















improved statistics, redundant measurements should be made. Correspondingly, for orthotropic sym- 
metry, enough independent measurements must be made to obtain all 9 Q; again, redundancy in 
measurements is a suggested approach. 

One major advantage of the ultrasonic measurements over mechanical testing is that the former can 
be done with specimens too small for the latter technique. Second, the reproducibility of measurements 
using the former technique is greater than for the latter. Still a third advantage is that the full set of either 
five or nine coefficients can be measured on one specimen, a procedure not possible with the latter 
techniques. Thus, at present, most of the studies of elastic anisotropy in both human and other mam- 
malian bone are done using ultrasonic techniques. In addition to the bulk wave type measurements 
described above, it is possible to obtain Young's modulus directly. This is accomplished by using samples 
of small cross sections with transducers of low frequency so that the wavelength of the sound is much 
larger than the specimen size. In this case, an extensional longitudinal (bar) wave is propagated (which 
experimentally is analogous to a uniaxial mechanical test experiment), yielding 

V 2 =- (18.10) 

P 

This technique was used successfully to show that bovine plexiform bone was definitely orthotropic while 
bovine haversian bone could be treated as transversely isotropic [Lipson and Katz, 1984]. The results 
were subsequently confirmed using bulk wave propagation techniques with considerable redundancy 
[Maharidge, 1984]. 

Table 18.2 lists the C,-. (in GPa) for human (haversian) bone and bovine (both haversian and plexiform) 
bone. With the exception of Knet's [1978] measurements, which were made using quasi-static mechanical 
testing, all the other measurements were made using bulk ultrasonic wave propagation. 

In Maharidge's study [1984], both types of tissue specimens, haversian and plexiform, were obtained 
from different aspects of the same level of an adult bovine femur. Thus the differences in C« reported 
between the two types of bone tissue are hypothesized to be due essentially to the differences in micro- 
structural organization (Fig. 18.3) [Wainwright et al., 1982]. The textural symmetry at this level of 
structure has dimensions comparable to those of the ultrasound wavelengths used in the experiment, 
and the molecular and ultrastructural levels of organization in both types of tissues are essentially 
identical. Note that while C u , almost equals C 22 and that C 44 and C 55 are equal for bovine haversian bone, 
C n and C 22 and C 44 and C 55 differ by 11.6 and 13.4%, respectively, for bovine plexiform bone. Similarly, 
although C 66 and V4 (C u - C 12 ) differ by 12.0% for the haversian bone, they differ by 31.1% for plexiform 
bone. Only the differences between C 13 and C 23 are somewhat comparable: 12.6% for haversian bone and 
13.9% for plexiform. These results reinforce the importance of modeling bone as a hierarchical ensemble 
in order to understand the basis for bone's elastic properties as a composite material- structure system in 
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FIGURE 18.3 Diagram showing how laminar (plexiform) bone {a) differs more between radial and tangential 
directions (R and T) than does haversian bone (b). The arrows are vectors representing the various directions 
[Wainwright et al., 1982] (Courtesy Princeton University Press). 

which the collagen-Ap components define the material composite property. When this material property 
is entered into calculations based on the microtextural arrangement, the overall anisotropic elastic 
anisotropy can be modeled. 

The human femur data [Van Buskirk and Ashman, 1981] support this description of bone tissue. 
Although they measured all nine individual Q, treating the femur as an orthotropic material, their results 
are consistent with a near transverse isotropic symmetry. However, their nine Q.- for bovine femoral bone 
clearly shows the influence of the orthotropic microtextural symmetry of the tissue's plexiform structure. 

The data of Knets [1978] on human tibia are difficult to analyze. This could be due to the possibility 
of significant systematic errors due to mechanical testing on a large number of small specimens from a 
multitude of different positions in the tibia. 

The variations in bone's elastic properties cited earlier above due to location is appropriately illustrated 
in Table 18.3, where the mean values and standard deviations (all in GPa) for all g orthotropic C« are 
given for bovine cortical bone at each aspect over the entire length of bone. 

Since the Cs are simply related to the "technical" elastic moduli, such as Young's modulus (£), shear 
modulus (G), bulk modulus (K), and others, it is possible to describe the moduli along any given direction. 
The full equations for the most general anisotropy are too long to present here. However, they can be 

TABLE 18.3 Mean Values and Standard Deviations for the 
Cjj Measured by Van Buskirk and Ashman [1981] at Each 
Aspect over the Entire Length of Bone (all values in GPa) 





Anterior 


Medial 


Posterior 


Lateral 


c„ 


18.7 ± 1.7 


20.9 ± 0.8 


20.1 ± 1.0 


20.6 ± 1.6 


c„. 


20.4 ± 1.2 


22.3 ± 1.0 


22.2 ± 1.3 


22.0 ± 1.0 


c„ 


28.6 ± 1.9 


30.1 ±2.3 


30.8 ± 1.0 


30.5 ± 1.1 


c„ 


6.73 ± 0.68 


6.45 ± 0.35 


6.78 ± 1.0 


6.27 ± 0.28 


Q, 


5.55 ± 0.41 


6.04 ± 0.51 


5.93 ± 0.28 


5.68 ± 0.29 


Q, 


4.34 ± 0.33 


4.87 ± 0.35 


5.10 ± 0.45 


4.63 ± 0.36 


c n 


11.2 ±2.0 


11.2 ± 1.1 


10.4 ± 1.0 


10.8 ± 1.7 


c„ 


11.2 ± 1.1 


11.2 ±2.4 


11.6 ± 1.7 


11.7 ± 1.8 


c 23 


10.4 ± 1.4 


11.5 ± 1.0 


12.5 ± 1.7 


11.8 ± 1.1 
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found in Yoon and Katz [1976a]. Presented below are the simplified equations for the case of transverse 
isotropy. Young's modulus is 



E(y 3 ) ^ ' 



+ Y 2 3(l-Y 2 3)(2S 13 + S 44 ) 



(18.11) 



where y 3 = cos (j), and (]) is the angle made with respect to the bone (3) axis. 

The shear modulus (rigidity modulus or torsional modulus for a circular cylinder) is 

G r r j=2( s « + s; 5 ) = s 44 + (s 11 -s 12 )-is 44 (i-Y^) 

+ 2(s n + S }3 -2S u -S u )yl(l-yl) 

where, again y 3 = cos (|>. 

The bulk modulus (reciprocal of the volume compressibility) is 



(18.12) 



K 



■S 3 , + 2(s u + S u+ 2S u y C " + C * +2C *- 4C " 



M)3\M 1 + C 12 J 2C 13 



(18.13) 



Conversion of Eqs. (18.11) and (18.12) from S^ to Qcan be done by using the following transformation 
equations: 



C. C —C. C C —C. 

c _ °22 ^33 °23 c _ °33 Ml M3 

A A 



C C —C C C —C C 

_ O u <^ 12 Q 12 _ M3 M3 M2 M3 

33 _ A 12 _ . 

A A 

c _ M2 M>3 ~ M3 M22 „ _ M2 M3~M3 Ml 

13 ~~ a 23 _ . 

A A 



(18.14) 



where 



°44 „ °55 r °66 „ 

L 44 M>5 M36 



Ui ^n ^i 



12 13 



C 12 C 22 C 23 

M3 ^23 ^33 



: Ml M22 ^33 + ^ C 12 ^23 M3 ' 



~(mi Ml3 + M!2 M^ + MW M^) (18.15) 



In addition to data on the elastic properties of cortical bone presented above, there is also available a 
considerable set of data on the mechanical properties of cancellous (trabecullar) bone including mea- 
surements of the elastic properties of single trabeculae. Indeed as early as 1993, Keaveny and Hayes (1993) 
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TABLE 18.4 Elastic Moduli of Trabecular Bone Material Measured by Different 
Experimental Methods 



Study 


Method 


Average Modulus 


(GPa) 


Townsend et al. (1975) 


Buckling 


11.4 


(Wet) 




Buckling 


14.1 


(Dry) 


Ryan and Williams (1989) 


Uniaxial tension 


0.760 




Choietal. (1992) 


4-point bending 


5.72 




Ashman and Rho (1988) 


Ultrasound 


13.0 


(Human) 




Ultrasound 


10.9 


(Bovine) 


Rho et al. (1993) 


Ultrasound 


14.8 






Tensile Test 


10.4 




Rho et al. (1999) 


Nanoindentation 


19.4 


(Longitudinal) 




Nanoindentation 


15.0 


(Transverse) 


Turner et al. (1999) 


Acoustic microscopy 


17.5 






Nanoindentation 


18.1 




Bumrerraj (1999) 


Acoustic microscopy 


17.4 





presented an analysis of 20 years of studies on the mechanical properties of trabecular bone. Most of the 
earlier studies used mechanical testing of bulk specimens of a size reflecting a cellular solid, i.e., of the 
order of cubic mm or larger. These studies showed that both the modulus and strength of trabecular 
bone are strongly correlated to the apparent density, where apparent density, p a , is defined as the product 
of individual trabeculae density, p„ and the volume fraction of bone in the bulk specimen, V f , and is 
given by p a = p t V f . 

Elastic moduli, E, from these measurements generally ranged from approximately 10 MPa to the order 
of 1 GPa depending on the apparent density and could be correlated to the apparent density in g/cc by 
a power law relationship, E = 6.13P 1 j 4 , calculated for 165 specimens with an r 2 = 0.62 [Keaveny and 
Hayes, 1993]. 

With the introduction of micromechanical modeling of bone, it became apparent that in addition to 
knowing the bulk properties of trabecular bone it was necessary to determine the elastic properties of 
the individual trabeculae. Several different experimental techniques have been used for these studies. 
Individual trabeculae have been machined and measured in buckling, yielding a modulus of 11.4 GPa 
(wet) and 14.1 GPa (dry) [Townsend et al., 1975], as well as by other mechanical testing methods 
providing average values of the elastic modulus ranging from less than 1 GPa to about 8 GPa (Table 18.4). 
Ultrasound measurements [Ashman and Rho, 1988; Rho et al., 1993] have yielded values commensurate 
with the measurements of Townsend et al. (1975) (Table 18.4). More recently, acoustic microscopy and 
nanoindentation have been used, yielding values significantly higher than those cited above. Rho et al. 
(1999) using nanoindentation obtained average values of modulus ranging from 15.0 to 19.4 GPa 
depending on orientation, as compared to 22.4 GPa for osteons and 25.7 GPa for the interstitial lamellae 
in cortical bone (Table 18.4). Turner et al. (1999) compared nanoindentation and acoustic microscopy 
at 50 MHz on the same specimens of trabecular and cortical bone from a common human donor. While 
the nanoindentation resulted in Young's moduli greater than those measured by acoustic microscopy by 
4 to 14%, the anisotropy ratio of longitudinal modulus to transverse modulus for cortical bone was 
similar for both modes of measurement; the trabecular values are given in Table 18.4. Acoustic microscopy 
at 400 MHz has also been used to measure the moduli of both human trabecular and cortical bone 
[Bumrerraj, 1999], yielding results comparable to those of Turner et al. (1999) for both types of bone 
(Table 18.4). 

These recent studies provide a framework for micromechanical analyses using material properties 
measured on the microstructural level. They also point to using nano-scale measurements, such as those 
provided by atomic force microscopy (AFM), to analyze the mechanics of bone on the smallest unit of 
structure shown in Figure 18.1. 
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18.4 Characterizing Elastic Anisotropy 

Having a full set of five or nine Q does permit describing the anisotropy of that particular specimen of 
bone, but there is no simple way of comparing the relative anisotropy between different specimens of 
the same bone or between different species or between experimenters' measurements by trying to relate 
individual C« between sets of measurements. Adapting a method from crystal physics [Chung and 
Buessem, 1968], Katz and Meunier [1987] presented a description for obtaining two scalar quantities 
defining the compressive and shear anisotropy for bone with transverse isotropic symmetry. Later, they 
developed a similar pair of scalar quantities for bone exhibiting orthotropic symmetry [Katz and Meunier, 
1990]. For both cases, the percentage compressive (Ac*) and shear (As*) elastic anisotropy are given, 
respectively, by 



(18.16) 



where K v and K R are the Voigt (uniform strain across an interface) and Reuss (uniform stress across an 
interface) bulk moduli, respectively, and G v and G R are the Voigt and Reuss shear moduli, respectively. 
The equations for K v , K R , G v , and G R are provided for both transverse isotropy and orthotropic symmetry 
in App. 20A. 

Table 18.5 lists the values of K v , K R , G v , G R , Ac*, and As* for the five experiments whose Q are given 
in Table 18.2. 

It is interesting to not that haversian bones, whether human or bovine, have both their compressive 
and shear anisotropy factors considerably lower than the respective values for plexiform bone. Thus, not 
only is plexiform bone both stiffer and more rigid than haversian bone, it is also more anisotropic. The 
higher values of Ac* and As*, especially the latter at 7.88% for the Knets [1978] mechanical testing data 
on human haversian bone, supports the possibility of the systematic errors in such measurements 
suggested above. 

18.5 Modeling Elastic Behavior 

Currey [ 1964] first presented some preliminary ideas of modeling bone as a composite material composed 
of a simple linear superposition of collagen and Ap. He followed this later [1969] with an attempt to 
take into account the orientation of the Ap crystallites using a model proposed by Cox [1952] for fiber- 
reinforced composites. Katz [1971a] and Piekarski [1973] independently showed that the use of Voigt 
and Reuss or even Hashin-Shtrikman [1963] composite modeling showed the limitations of using linear 
combinations of either elastic moduli or elastic compliances. The failure of all these early models could 
be traced to the fact that they were based only on considerations of material properties. This is comparable 
to trying to determine the properties of an Eiffel Tower built using a composite material by simply 

TABLE 18.5 Values of K v , K R , G v , and G R , (all in GPa), and Ac* and As* (%) for the Bone 
Specimens Given in Table 18.2 

Experiments (Bone Type) JC K R G v G R Ac* As* 

Van Buskirk and Ashman [1981] (bovine femur) 
Knets [1978] (human tibia) 
Van Buskirk and Ashman [1981] (human femur) 
Maharidge [1984] (bovine femur haversian) 
Maharidge [1984] (bovine femur plexiform) 
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10.4 


9.87 


6.34 


6.07 


2.68 


2.19 


10.1 


9.52 


4.01 


3.43 


2.68 


7.88 


15.5 


15.0 


5.95 


5.74 


1.59 


1.82 


15.8 


15.5 


5.98 


5.82 


1.11 


1.37 


18.8 


18.1 


6.88 


6.50 


1.84 


2.85 
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FIGURE 18.4 Variation in Young's modulus of bovine femur specimens (£) with the orientation of specimen axis 

to the long axis of the bone, for wet (o) and dry x) conditions compared with the theoretical curve ( ) predicted 

from a fibre-reinforced composite model [Bonfield and Grynpas, 1977] (Courtesy Nature 1977 270:453. © Macmillan 
Magazines Ltd.). 

modeling the composite material properties without considering void spaces and the interconnectivity 
of the structure [Lakes, 1993]. In neither case is the complexity of the structural organization involved. 
This consideration of hierarchical organization clearly must be introduced into the modeling. 

Katz in a number of papers [1971b, 1976] and meeting presentations put forth the hypothesis that 
haversian bone should be modeled as a hierarchical composite, eventually adapting a hollow fiber 
composite model by Hashin and Rosen [1964]. Bonfield and Grynpas [1977] used extensional (longitu- 
dinal) ultrasonic wave propagation in both wet and dry bovine femoral cortical bone specimens oriented 
at angles of 5, 10, 20, 40, 50, 70, 80, and 85 degrees with respect to the long bone axis. They compared 
their experimental results for Young's moduli with the theoretical curve predicted by Currey's model 
[1969]; this is shown in Fig. 18.4. The lack of agreement led them to "conclude, therefore that an 
alternative model is required to account for the dependence of Young's modulus on orientation" [Bonfield 
and Grynpas, 1977]. Katz [1980, 1981], applying his hierarchical material-structure composite model, 
showed that the data in Fig. 18.4 could be explained by considering different amounts of Ap crystallites 
aligned parallel to the long bone axis; this is shown in Fig. 18.5. This early attempt at hierarchical 
micromechanical modeling is now being extended with more sophisticated modeling using either finite- 
element micromechanical computations [Hogan, 1992] or homogenization theory [Crolet et al., 1993]. 
Further improvements will come by including more definitive information on the structural organization 
of collagen and Ap at the molecular- ultrastructural level [Wagner and Weiner, 1992; Weiner and Traub, 
1989]. 

18.6 Viscoelastic Properties 

As stated earlier, bone (along with all other biologic tissues) is a viscoelastic material. Clearly, for such 
materials, Hooke's law for linear elastic materials must be replaced by a constitutive equation which 
includes the time dependency of the material properties. The behavior of an anisotropic linear viscoelastic 
material may be described by using the Boltzmann superposition integral as a constitutive equation: 
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Orienlation of sample relative to longitudinal axis 
of model anil bono (0 = cos* 1 y ) 

FIGURE 18.5 Comparison of predictions of Katz two-level composite model with the experimental data of Bonfield 
and Grynpas. Each curve represents a different lamellar configuration within a single osteon, with longitudinal fibers; 
A, 64%; B, 57%; C, 50%; D, 37%; and the rest of the fibers assumed horizontal. (From Katz JL, Mechanical Properties 
of Bone, AMD, Vol. 45, New York, American Society of Mechanical Engineers, 1981, with permission.) 
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(18.17) 



where oJt) and e H (x) are the time-dependent second rank stress and strain tensors, respectively, and 
Cm{t— t) is the fourth-rank relaxation modulus tensor. This tensor has 36 independent elements for the 
lowest symmetry case and 12 nonzero independent elements for an orthotropic solid. Again, as for linear 
elasticity, a reduced notation is used, i.e., 11 — » 1, 22 — > 2, 33 — > 3, 23 — > 4, 31 — > 5, and 12 — > 6. If we 
apply Eq. (18.17) to the case of an orthotropic material, e.g., plexiform bone, in uniaxial tension 
(compression) in the 1 direction [Lakes and Katz, 1974] , in this case using the reduced notation, we obtain 
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for all t, and 
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for all t. 

Having the integrands vanish provides an obvious solution to Eqs. (18.19) and (18.20). Solving them 

fd€«] fdeW] 

simultaneously for and - — - — - and substituting these values in Eq. (18.17) yields 

dx dx 



o,(t) = f ^(t-xj-^dr (18.21) 

where, if for convenience we adopt the notation Q = Q (f- x), then Young's modulus is given by 



1 (t-x) = C n + C u 
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(18.22) 



In this case of uniaxial tension (compression), only nine independent orthotropic tensor components 
are involved, the three shear components being equal to zero. Still, this time-dependent Young's modulus 
is a rather complex function. As in the linear elastic case, the inverse form of the Boltzmann integral can 
be used; this would constitute the compliance formulation. 

If we consider the bone being driven by a strain at a frequency ©, with a corresponding sinusoidal 
stress lagging by an angle 8, then the complex Young's modulus £*(co) may be expressed as 



E*(co) = £'(co)+i£"(a)) 



(18.23) 



where E'(co), which represents the stress-strain ratio in phase with the strain, is known as the storage 
modulus, and £"(oj), which represents the stress-strain ratio 90 degrees out of phase with the strain, is 
known as the loss modulus. The ratio of the loss modulus to the storage modulus is then equal to tan 8. 
Usually, data are presented by a graph of the storage modulus along with a graph of tan 8, both against 
frequency. For a more complete development of the values of £'(©) and £"(oj), as well as for the derivation 
of other viscoelastic technical moduli, see Lakes and Katz [ 1974] ; for a similar development of the shear 
storage and loss moduli, see Cowin [1989]. 

Thus, for a more complete understanding of bone's response to applied loads, it is important to know 
its rheologic properties. There have been a number of early studies of the viscoelastic properties of various 
long bones [Sedlin, 1965; Smith and Keiper, 1965; Laird and Kingsbury, 1973; Lugassy, 1968; Black and 
Korostoff, 1973]. However, none of these was performed over a wide enough range of frequency (or 
time) to completely define the viscoelastic properties measured, e.g., creep or stress relaxation. Thus it 
is not possible to mathematically transform one property into any other to compare results of three 
different experiments on different bones [Lakes and Katz, 1974], 

In the first experiments over an extended frequency range, the biaxial viscoelastic as well as uniaxial 
viscoelastic properties of wet cortical human and bovine femoral bone were measured using both dynamic 
and stress relaxation techniques over eight decades of frequency (time) [Lakes et al., 1979]. The results 
of these experiments showed that bone was both nonlinear and thermorheologically complex, i.e., time- 
temperature superposition could not be used to extend the range of viscoelastic measurements. A 
nonlinear constitutive equation was developed based on these measurements [Lakes and Katz, 1979a]. 
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FIGURE 18.6 Comparison of relaxation spectra for wet human bone, specimens 5 and 6 [Lakes et al., 1979] in 
simple torsion; T= 37°C. First approximation from relaxation and dynamic data. • Human tibial bone, specimen 6. 
A Human tibial bone, specimen 5, G !ld = G(10 s). G std (5) = G(10 s). G S J5) = 0.590 x 10 6 lb/in 2 . GJfi) x 0.602 x 
10 6 lb/in 2 (Courtesy Journal of Biomechanics, Pergamon Press). 

In addition, relaxation spectrums for both human and bovine cortical bone were obtained; Fig. 18.6 
shows the former [Lakes and Katz, 1979b]. The contributions of several mechanisms to the loss tangent 
of cortical bone is shown in Fig. 18.7 [Lakes and Katz, 1979b]. It is interesting to note that almost all the 
major loss mechanisms occur at frequencies (times) at or close to those in which there are "bumps," 
indicating possible strain energy dissipation, on the relaxation spectra shown on Fig. 18.6. An extensive 
review of the viscoelastic properties of bone can be found in the CRC publication Natural and Living 
Biomaterials [Lakes and Katz, 1984]. 

Following on Katz's [1976, 1980] adaptation of the Hashin-Rosen hollow fiber composite model [1964], 
Gottesman and Hashin [1979] presented a viscoelastic calculation using the same major assumptions. 

18.7 Related Research 

As stated earlier, this chapter has concentrated on the elastic and viscoelastic properties of compact 
cortical bone and the elastic properties of trabecular bone. At present there is considerable research 
activity on the fracture properties of the bone. Professor William Bonfield and his associates at Queen 
Mary and Westfield College, University of London and Professor Dwight Davy and his colleagues at Case 
Western Reserve University are among those who publish regularly in this area. Review of the literature 
is necessary in order to become acquainted with the state of bone fracture mechanics. 

An excellent introductory monograph which provides a fascinating insight into the structure-property 
relationships in bones including aspects of the two areas discussed immediately above is Professor John 
Currey's The Mechanical Adaptations of Bones, published in 1984 by Princeton University Press. 
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FIGURE 18.7 Contributions of several relaxation mechanisms to the loss tangent of cortical bone. A: Homogeneous 
thermoelastic effect. B: Inhomogeneous thermoelastic effect. C: Fluid flow effect. D: Piezoelectric effect [Lakes and 
Katz, 1984]. (Courtesy CRC Press) 



Defining Terms 

Apatite: Calcium phosphate compound, stoichiometric chemical formula Ca 5 (P0 4 ) 3 -X, where X is OHr 
(hydroxyapatite), F~ (fluorapatite), Cl~ (chlorapatite), etc. There are two molecules in the basic 
crystal unit cell. 

Cancellous bone: Also known as porous, spongy, trabecular bone. Found in the regions of the articulating 
ends of tubular bones, in vertebrae, ribs, etc. 

Cortical bone: The dense compact bone found throughout the shafts of long bones such as the femur, 
tibia, etc. also found in the outer portions of other bones in the body. 

Haversian bone: Also called osteonic. The form of bone found in adult humans and mature mammals, 
consisting mainly of concentric lamellar structures, surrounding a central canal called the haversian 
canal, plus lamellar remnants of older haversian systems (osteons) called interstitial lamellae. 

Interstitial lamellae: See Haversian bone above. 

Orthotropic: The symmetrical arrangement of structure in which there are three distinct orthogonal 
axes of symmetry. In crystals this symmetry is called orthothombic. 

Osteons: See Haversian bone above. 

Plexiform: Also called laminar. The form of parallel lamellar bone found in younger, immature non- 
human mammals. 

Transverse isotropy: The symmetry arrangement of structure in which there is a unique axis perpen- 
dicular to a plane in which the other two axes are equivalent. The long bone direction is chosen 
as the unique axis. In crystals this symmetry is called hexagonal. 
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Further Information 

Several societies both in the United States and abroad hold annual meetings during which many presen- 
tations, both oral and poster, deal with hard tissue biomechanics. In the United States these societies 
include the Orthopaedic Research Society, the American Society of Mechanical Engineers, the Biomate- 
rials Society, the American Society of Biomechanics, the Biomedical Engineering Society, and the Society 
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for Bone and Mineral Research. In Europe there are alternate year meetings of the European Society of 
Biomechanics and the European Society of Biomaterials. Every four years there is a World Congress of 
Biomechanics; every three years there is a World Congress of Biomaterials. All of these meetings result 
in documented proceedings; some with extended papers in book form. 

The two principal journals in which bone mechanics papers appear frequently are the Journal of 
Biomechanics published by Elsevier and the Journal of Biomechanical Engineering published by the Amer- 
ican Society of Mechanical Engineers. Other society journals which periodically publish papers in the 
field are the Journal of Orthopaedic Research published for the Orthopaedic Research Society, the Annals 
of Biomedical Engineering published for the Biomedical Engineering Society, and the Journal of Bone and 
Joint Surgery (both American and English issues) for the American Academy of Orthopaedic Surgeons 
and the British Organization, respectively. Additional papers in the field may be found in the journal 
Bone and Calcified Tissue International. 

The 1984 CRC volume, Natural and Living Biomaterials (Hastings GW and Ducheyne P, Eds.) provides 
a good historical introduction to the field. A more advanced book is Bone Mechanics (Cowin SC, 1989); 
a second edition is due out in late 1999 or early 2000. 

Many of the biomaterials journals and society meetings will have occasional papers dealing with hard 
tissue mechanics, especially those dealing with implant-bone interactions. 
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Appendix 

The Voigt and Reuss moduli for both transverse isotropic and orthotropic symmetry are given below: 

Voigt Transverse Isotropic 

2(C 11 + C 12 ) + 4(C 1 3 + C3 3 ) 

9 
(C 11 + C 12 )-4C 13 +2C 3 3 + 12(C 44 + C 66 ) 
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Voigt Orthotropic 
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where A is given in Eq. (18.15). 
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19.1 Assumptions 

This chapter is concerned with the mechanical behavior of blood vessels under static loading conditions 
and the methods required to analyze this behavior. The assumptions underlying this discussion are for 
ideal blood vessels that are at least regionally homogeneous, incompressible, elastic, and cylindrically 
orthotropic. Although physiologic systems are nonideal, much understanding of vascular mechanics has 
been gained through the use of methods based upon these ideal assumptions. 

Homogeneity of the Vessel Wall 

On visual inspection, blood vessels appear to be fairly homogeneous and distinct from surrounding 
connective tissue. The inhomogeneity of the vascular wall is realized when one examines the tissue under 
a low-power microscope, where one can easily identify two distinct structures: the media and adventitia. 
For this reason the assumption of vessel wall homogeneity is applied cautiously. Such an assumption 
may be valid only within distinct macroscopic structures. However, few investigators have incorporated 
macroscopic inhomogeneity into studies of vascular mechanics [17]. 

Incompressibility of the Vessel Wall 

Experimental measurement of wall compressibility of 0.06% at 270 cm of H 2 indicates that the vessel 
can be considered incompressible when subjected to physiologic pressure and load [2]. In terms of the 
mechanical behavior of blood vessels, this is small relative to the large magnitude of the distortional 
strains that occur when blood vessels are deformed under the same conditions. Therefore, vascular 
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compressibility may be important to understanding other physiologic processes related to blood vessels, 
such as the transport of interstitial fluid. 

Inelasticity of the Vessel Wall 

That blood vessel walls exhibit inelastic behavior such as length-tension and pressure-diameter hysteresis, 
stress relaxation, and creep has been reported extensively [1, 10]. However, blood vessels are able to 
maintain stability and contain the pressure and flow of blood under a variety of physiologic conditions. 
These conditions are dynamic but slowly varying with a large static component. 

Residual Stress and Strain 

Blood vessels are known to retract both longitudinally and circumferentially are excision. This retraction 
is caused by the relief of distending forces resulting from internal pressure and longitudinal tractions. 
The magnitude of retraction is influenced by several factors. Among these factors are growth, aging, and 
hypertension. Circumferential retraction of medium-caliber blood vessels, such as the carotid, iliac, and 
bracheal arteries, can exceed 70% following reduction of internal blood pressure to zero. In the case of 
the carotid artery, the amount of longitudinal retraction tends to increase during growth and to decrease 
in subsequent aging [5]. It would seem reasonable to assume that blood vessels are in a nearly stress-free 
state when they are fully retracted and free of external loads. This configuration also seems to be a 
reasonable choice for the reference configuration. However, this ignores residual stress and strain effects 
that have been the subject of current research [4, 11-14, 16]. 

Blood vessels are formed in a dynamic environment which gives rise to imbalances between the forces 
that tend to extend the diameter and length and the internal forces that tend to resist the extension. This 
imbalance is thought to stimulate the growth of elastin and collagen and to effectively reduce the stresses 
in the underlying tissue. Under these conditions it is not surprising that a residual stress state exists when 
the vessel is fully retracted and free of external tractions. This process has been called remodeling [11]. 
Striking evidence of this remodeling is found when a cylindrical slice of the fully retracted blood vessel 
is cut longitudinally through the wall. The cylinder springs open, releasing bending stresses kept in 
balance by the cylindrical geometry [16]. 

19.2 Vascular Anatomy 

A blood vessel can be divided anatomically into three distinct cylindrical sections when viewed under 
the optical microscope. Starting at the inside of the vessel, they are the intima, the media, and the 
adventitia. These structures have distinct functions in terms of the blood vessel physiology and mechanical 
properties. 

The intima consists of a thin monolayer of endothelial cells that line the inner surface of the blood 
vessel. The endothelial cells have little influence on blood vessel mechanics but do play an important 
role in hemodynamics and transport phenomena. Because of their anatomical location, these cells are 
subjected to large variations in stress and strain as a result of pulsatile changes in blood pressure and flow. 

The media represents the major portion of the vessel wall and provides most of the mechanical strength 
necessary to sustain structural integrity. The media is organized into alternating layers of interconnected 
smooth muscle cells and elastic lamellae. There is evidence of collagen throughout the media. These small 
collagen fibers are found within the bands of smooth muscle and may participate in the transfer of forces 
between the smooth muscle cells and the elastic lamellae. The elastic lamellae are composed principally 
of the fiberous protein elastin. The number of elastic lamellae depends upon the wall thickness and the 
anatomical location [18]. In the case of the canine carotid, the elastic lamellae account for a major 
component of the static structural response of the blood vessel [6]. This response is modulated by the 
smooth-muscle cells, which have the ability to actively change the mechanical characteristics of the wall [7] . 

The adventitia consists of loose, more disorganized fiberous connective tissue, which may have less 
influence on mechanics. 
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FIGURE 19.1 Cylindrical geometry of a blood vessel: top: stress-free reference configuration; middle: fully retracted 
vessel free of external traction; bottom: vessel in situ under longitudinal tether and internal pressurization. 



19.3 Axisymmetric Deformation 

In the following discussion we will concern ourselves with deformation of cylindrical tubes, see Fig. 19.1. 
Blood vessels tend to be nearly cylindrical in situ and tend to remain cylindrical when a cylindrical section 
is excised and studied in vitro. Only when the vessel is dissected further does the geometry begin to 
deviate from cylindrical. For this deformation there is a unique coordinate mapping 



(a,8,z)-»(r,8,z) 



(19.1) 



where the undeformed coordinates are given by {R, &, Z) and the deformed coordinates are given by 
(r, 0, z). The deformation is given by a set of restricted functions 



■M 



(19.2) 



e = pe 



(19.3) 
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(19.4) 



where the constants u and (3 have been introduced to account for a uniform longitudinal strain and a 
symmetric residual strain that are both independent of the coordinate 0. 
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If (5 = 1, there is no residual strain. If P ^ 1, residual stresses and strains are present. If P > 1, a 
longitudinal cut through the wall will cause the blood vessel to open up, and the new cross-section will 
form a c-shaped section of an annulus with larger internal and external radii. If (3 < 1, the cylindrical 
shape is unstable, but a thin section will tend to overlap itself. In Choung and Fung's formulation, P = 
n/0 o , where the angle O is half the angle spanned by the open annular section [4]. 

For cylindrical blood vessels there are two assumed constraints. The first assumption is that the 
longitudinal strain is uniform through the wall and therefore 

A, =|I = a constant (19.5) 

for any cylindrical configuration. Given this, the principal stretch ratios are computed from the above 
function as 

X =— (19.6) 

r dR 



\ = V L (19.7) 

R 

X z = [i (19.8) 

The second assumption is wall incompressibility, which can be expressed by 

Wz 51 ( 19 - 9 ) 



R dR 
and therefore 
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Integration of this expression yields the solution 



rdr=—RdR (19.11) 
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As a result, the principal stretch ratios can be expressed in terms of _R as follows: 
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19.4 Experimental Measurements 

The basic experimental setup required to measure the mechanical properties of blood vessels in vitro is 
described in [7]. It consists of a temperature-regulated bath of physiologic saline solution to maintain 
immersed cylindrical blood vessel segments, devices to measure diameter, an apparatus to hold the vessel 
at a constant longitudinal extension and to measure longitudinal distending force, and a system to deliver 
and control the internal pressure of the vessel with 100% oxygen. Typical data obtained from this type 
of experiment are shown in Figs. 19.2 and 19.3. 

19.5 Equilibrium 

When blood vessels are excised, they retract both longitudinally and circumferentially. Restoration to 
natural dimensions requires the application of internal pressure, p t , and a longitudinal tether force, F T . 
The internal pressure and longitudinal tether are balanced by the development of forces within the vessel 
wall. The internal pressure is balanced in the circumferential direction by a wall tension, T. The longi- 
tudinal tether force and pressure are balanced by the retractive force of the wall, F R 



T-- 



M 



(19.16) 



F R= F T + Pi^ 



(19.17) 
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FIGURE 19.2 Pressure-radius curves for the canine carotid artery at various degrees of longitudinal extension. 
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FIGURE 19.3 Longitudinal distending force as a function of radius at various degrees of longitudinal extension. 

The first equation is the familiar law of Laplace for a cylindrical tube with internal radius r r It indicates 
that the force due to internal pressure, p { , must be balanced by a tensile force (per unit length), T, within 
the wall. This tension is the integral of the circumferentially directed force intensity (or stress, cr e ) across 
the wall: 



■r 



O a dr = O a h 



(19.18) 



where a e is the mean value of the circumferential stress and h is the wall thickness. Similarly, the 
longitudinal tether force, F T , and extending force due to internal pressure are balanced by a retractive 
internal force, F R , due to axial stress, a z , in the blood vessel wall: 



F„ = 2n \ O rdr = o nh[ r +r.\ 

R I z z \ e i J 



(19.19) 



where a z is the mean value of this longitudinal stress. The mean stresses are calculated from the above 
equation as 



r. 



(19.20) 



%h(r e + r) 



2 h 



119.21) 



The mean stresses are a fairly good approximation for thin-walled tubes where the variations through 
the wall are small. However, the range of applicability of the thin-wall assumption depends upon the 
material properties and geometry. In a linear elastic material, the variation in a e is less than 5% for 
r/h > 20. When the material is nonlinear or the deformation is large, the variations in stress can be more 
severe (see Fig. 19.10). 
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The stress distribution is determined by solving the equilibrium equation, 

iAU)-^^ (19 . 22) 

r dr y ' r 

This equation governs how the two stresses are related and must change in the cylindrical geometry. 
For uniform extension and internal pressurization, the stresses must be functions of a single radial 
coordinate, r, subject to the two boundary conditions for the radial stress: 

r (r,u) = -£ (19.23) 

oir,(x) = (19.24) 

19.6 Strain Energy Density Functions 

Blood vessels are able to maintain their structural stability and contain steady oscillating internal pres- 
sures. This property suggests a strong elastic component, which has been called the pseudoelasticity [10]. 
This elastic response can be characterized by a single potential function called the strain energy density. 
It is a scalar function of the strains that determines the amount of stored elastic energy per unit volume. 
In the case of a cylindrically orthotropic tube of incompressible material, the strain energy density can 
be written in the following functional form: 

W=W*(A. r ,A, e ,A. z ) + A. r A. e A. z p, (19.25) 

where p is a scalar function of position, R. The stresses are computed from the strain energy by the 
following: 

0=A..^— +p (19.26) 

' ' 3A, 

We make the following transformation [3] 

l_ Pr 



which upon differentiation gives 



dX 

r — : 
dr 



(19.27) 
^(r 2 -c 2 ) 



:P _1 (PX-|XA. 3 ) (19.28) 



After these expressions and the stresses in terms of the strain energy density function are introduced into 
the equilibrium equation, we obtain an ordinary differential equation for p 



dk (3A, = (xA, 3 dk 



(19.29) 
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subject to the boundary conditions 






(19.30) 



(19.31) 



Isotropic Blood Vessels 



A blood vessel generally exhibits anisotropic behavior when subjected to large variations in internal 
pressure and distending force. When the degree of anisotropy is small, the blood vessel may be treated 
as isotropic. For isotropic materials it is convenient to introduce the strain invariants: 



J, =A 2 +A 2 +A 



(19.32) 



2,2 , .2,2 ^2 

z z 



I 2 — A„A + A Q A_ + A_A" 



(19.33) 



7,=A 2 r A 2 e A 2 z (19.34) 

These are measures of strain that are independent of the choice of coordinates. If the material is incompressible 



I 3 = f = l 

and the strain energy density is a function of the first two invariants, then 

w=w(i l ,i 2 ). 



(19.35) 



(19.36) 



The least complex form for an incompressible material is the first-order polynomial, which was first 
proposed by Mooney to characterize rubber: 



W* 



('i-3) + *(l 2 -3) 



(19.37) 



It involves only two elastic constants. A special case, where k = 0, is the neo-Hookean material, which 
can be derived from thermodynamics principles for a simple solid. Exact solutions can be obtained for 
the cylindrical deformation of a thick-walled tube. In the case where there is no residual strain, we have 
the following: 



P = -G\ 



(l + *H 2 ) 



log A 1 

— — + 

(X 2li 2 A 2 



+ c„ 



(19.38) 



a=G 



A.V 2 



+ k 



1 1 

p7 + A 2 J 



v 



+ P 



(19.39) 



o a = G 



X 2 + k 



-+aV 



+ p 



(19.40) 
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FIGURE 19.4 Pressure-radius curves for a Mooney-Rivlin tube with the approximate dimensions of the carotid. 
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FIGURE 19.5 Longitudinal distending force as a function of radius for the Mooney-Rivlin tube. 
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However, these equations predict stress softening for a vessel subjected to internal pressurization at fixed 
lengths, rather than the stress stiffening observed in experimental studies on arteries and veins (see 
Figs. 19.4 and 19.5). 

An alternative isotropic strain energy density function which can predict the appropriate type of stress 
stiffening for blood vessels is an exponential where the arguments is a polynomial of the strain invariants. 
The first-order form is given by 



W* = -S~ exp 



k 1 (l 1 -3)+k 2 (l 2 -3) 



(19.42) 
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FIGURE 19.6 Pressure-radius curves for tube with the approximate dimensions of the carotid calculated using an 
isotropic exponential strain energy density function. 
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FIGURE 19.7 Longitudinal distending force as a function of radius for the isotropic tube. 

This requires the determination of only two independent elastic constants. The third, G , is introduced 
to facilitate scaling of the argument of the exponent (see Figs. 19.6 and 19.7). This exponential form is 
attractive for several reasons. It is a natural extension of the observation that biologic tissue stiffness is 
proportional to the load in simple elongation. This stress stiffening has been attributed to a statistical 
recruitment and alignment of tangled and disorganized long chains of proteins. The exponential forms 
resemble statistical distributions derived from these same arguments. 

Anisotropic Blood Vessels 

Studies of the orthotropic behavior of blood vessels may employ polynomial or exponential strain energy 
density functions that include all strain terms or extension ratios. In particular, the strain energy density 
function can be of the form 
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FIGURE 19.8 Pressure-radius curves for a fully orthotropic vessel calculated with an exponential strain energy 
density function. 
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FIGURE 19.9 Longitudinal distending force as a function of radius for the orthotropic vessel. 



W* = qJk„k B ,K) 



(19.43) 



W* 



q I A..) A.g, ^ z ) 



(19.44) 



where q n is a polynomial of order n. Since the material is incompressible, the explicit dependence upon 
X r can be eliminated either by substituting X r = \~l X~ z or by assuming that the wall is thin and hence 
that the contribution of these terms is small. Figures 19.8 and 19.9 illustrate how well the experimental 
data can be fitted to an exponential strain density function whose argument is a polynomial of order n = 3. 
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Care must be taken to formulate expressions that will lead to stresses that behave properly. For this 
reason it is convenient to formulate the strain energy density in terms of the lagrangian strains 



a/2 



M 



(19.45) 



and in this case we can consider polynomials of the lagrangian strains, q n (e r , e 9 , e z ). 
Vaishnav et al. [15] proposed using a polynomial of the form 



W* 



X5>. 



(19.46) 



to approximate the behavior of the canine aorta. They found better correlation with order-three poly- 
nomials over order-two, but order-four polynomials did not warrant the addition work. 
Later, Fung et al. [10] found very good correlation with an expression of the form 



w C 

W = — exp 

2 



«i( e e " e ?) + « 2 ( e z " < 2 ) + 2 «4( e e e z " e e<) 



(19.47) 



for the canine carotid artery, where e\ and e* are the strains in a reference configuration at in situ length 
and pressure. Why should this work? One answer appears to be related to residual stresses and strains. 

When residual stresses are ignored, large-deformation analysis of thick-walled blood vessels predicts 
steep distributions in a e and a z through the vessel wall, with the highest stresses at the interior. This 
prediction is considered significant because high tensions in the inner wall could inhibit vascularization 
and oxygen transport to vascular tissue. 

When residual stresses are considered, the stress distributions flatten considerably and become almost 
uniform at in situ length and pressure. Fig. 19.10 shows the radial stress distributions computed for a 
vessel with (3 = 1 and [3 = 1.11. Takamizawa and Hayashi have even considered the case where the strain 
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FIGURE 19.10 Stress distributions through the wall at various pressures for the orthotropic vessel. 
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distribution is uniform in situ [13]. The physiologic implications are that vascular tissue is in a constant 
state of flux. New tissue is synthesized in a state of stress that allows it to redistribute the internal loads 
more uniformly. There probably is no stress-free reference state [8, 11, 12]. Continuous dissection of the 
tissue into smaller and smaller pieces would continue to relieve residual stresses and strains [14]. 
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20.1 Ankle 
GeometryoftheArticulatingSurfaces • JointContact ■ Axes 
of Rotation 

20.2 Knee 

Geometry of the Articulating Surfaces • JointContact • Axes 
of Rotation 

20.3 Hip 

Geometry of the Articulating Surfaces ■ JointContact ■ Axes 
of Rotation 

20.4 Shoulder 

Geometry of the Articulating Surfaces ■ JointContact ■ Axes 
of Rotation 

20.5 Elbow 

Geometry of the Articulating Surfaces • JointContact • Axes 
of Rotation 

20.6 Wrist 

Geometry of the Articulating Surfaces ■ JointContact ■ Axes 
of Rotation 

20.7 Hand 
GeometryoftheArticulatingSurfaces ■ JointContact ■ Axes 
of Rotation 

20.8 Summary 



Knowledge of joint-articulating surface motion is essential for design of prosthetic devices to restore 
function; assessment of joint wear, stability, and degeneration; and determination of proper diagnosis 
and surgical treatment of joint disease. In general, kinematic analysis of human movement can be 
arranged into two separate categories: (1) gross movement of the limb segments interconnected by joints, 
or (2) detailed analysis of joint articulating surface motion which is described in this chapter. Gross 
movement is the relative three-dimensional joint rotation as described by adopting the Eulerian angle 
system. Movement of this type is described in Chapter 25: Analysis of Gait. In general, the three- 
dimensional unconstrained rotation and translation of an articulating joint can be described utilizing 
the concept of the screw displacement axis. The most commonly used analytic method for the description 
of 6-degree-of-freedom displacement of a rigid body is the screw displacement axis [Kinzel et al. 1972; 
Spoor & Veldpaus, 1980; Woltring et al. 1985]. 

Various degrees of simplification have been used for kinematic modeling of joints. A hinged joint is 
the simplest and most common model used to simulate an anatomic joint in planar motion about a 
single axis embedded in the fixed segment. Experimental methods have been developed for determination 
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FIGURE 20.1 Three types of articulating surface motion in human joints. 

of the instantaneous center of rotation for planar motion. The instantaneous center of rotation is defined 
as the point of zero velocity. For a true hinged motion, the instantaneous center of rotation will be a 
fixed point throughout the movement. Otherwise, loci of the instantaneous center of rotation or centrodes 
will exist. The center of curvature has also been used to define joint anatomy. The center of curvature is 
defined as the geometric center of coordinates of the articulating surface. 

For more general planar motion of an articulating surface, the term sliding, rolling, and spinning are 
commonly used (Fig. 20.1). Sliding (gliding) motion is defined as the pure translation of a moving 
segment against the surface of a fixed segment. The contact point of the moving segment does not change, 
while the contact point of the fixed segment has a constantly changing contact point. If the surface of 
the fixed segment is flat, the instantaneous center of rotation is located at infinity. Otherwise, it is located 
at the center of curvature of the fixed surface. Spinning motion (rotation) is the exact opposite of sliding 
motion. In this case, the moving segment rotates, and the contact points on the fixed surface does not 
change. The instantaneous center of rotation is located at the center of curvature of the spinning body 
that is undergoing pure rotation. Rolling motion occurs between moving and fixed segments where the 
contact points in each surface are constantly changing and the arc lengths of contact are equal on each 
segment. The instantaneous center of rolling motion is located at the contact point. Most planar motion 
of anatomic joints can be described by using any two of these three basic descriptions. 

In this chapter, various aspects of joint-articulating motion are covered. Topics include the anatomical 
characteristics, joint contact, and axes of rotation. Joints of both the upper and lower extremity are 
discussed. 

20.1 Ankle 

The ankle joint is composed of two joints: the talocrural (ankle) joint and the talocalcaneal (subtalar 
joint). The talocrural joint is formed by the articulation of the distal tibia and fibula with the trochlea 
of the talus. The talocalcaneal joint is formed by the articulation of the talus with the calcaneus. 

Geometry of the Articulating Surfaces 

The upper articular surface of the talus is wedge-shaped, its width diminishing from front to back. The 
talus can be represented by a conical surface. The wedge shape of the talus is about 25% wider in front 
than behind with an average difference of 2.4 mm ±1.3 mm and a maximal difference of 6 mm [Inman, 
1976]. 
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TABLE 20.1 Talocalcaneal (Ankle) Joint Contact Area 



Investigators 


Plantarflexion 


Neutral 


Dorsiflexion 


Ramsey and Hamilton [1976] 




4.40 ± 1.21 




Kimizuka et al. [1980] 




4.83 




Libotteetal. [1982] 


5.01 (30°) 


5.41 


3.60 (30°) 


Paaretal. [1983] 


4.15 (10°) 


4.15 


3.63 (10°) 


Mackoetal. [1991] 


3.81 ±0.93 (15°) 


5.2 ± 0.94 


5.40 ± 0.74 (10°) 


Driscoll et al. [1994] 


2.70 ± 0.41 (20°) 


3.27 ± 0.32 


2.84 ± 0.43 (20°) 


Hartford et al. [1995] 




3.37 ± 0.52 




Pereiraetal. [1996] 


1.49 (20°) 


1.67 


1.47 (10°) 



Note: The contact area is expressed in square centimeters. 



Joint Contact 

The talocrural joint contact area varies with flexion of the ankle (Table 20.1). During plantarflexion, such 
as would occur during the early stance phase of gait, the contact area is limited and the joint is incon- 
gruous. As the position of the joint progresses from neutral to dorsiflexion, as would occur during the 
midstance of gait, the contact area increases and the joint becomes more stable. The area of the subtalar 
articulation is smaller than that of the talocrural joint. The contact area of the subtalar joint is 0.89 ± 
0.21 cm 2 for the posterior facet and 0.28 ± 15 cm 2 for the anterior and middle facets [Wang et al., 1994]. 
The total contact area (1.18 + 0.35 cm 2 ) is only 12.7% of the whole subtalar articulation area (9.31 + 
0.66 cm 2 ) [Wang et al., 1994]. The contact area/joint area ratio increases with increases in applied load 
(Fig. 20.2). 

Axes of Rotation 

Joint motion of the talocrural joint has been studied to define the axes of rotation and their location 
with respect to specific anatomic landmarks (Table 20.2). The axis of motion of the talocrural joint 
essentially passes through the inferior tibia at the fibular and tibial malleoli (Fig. 20.3). Three types of 
motion have been used to describe the axes of rotation: fixed, quasi-instantaneous, and instantaneous 
axes. The motion that occurs in the ankle joints consists of dorsiflexion and plantarflexion. Minimal or 
no transverse rotation takes place within the talocrural joint. The motion in the talocrural joint is 
intimately related to the motion in the talocalcaneal joint which is described next. 

The motion axes of the talocalcaneal joint have been described by several authors (Table 20.3). The 
axis of motion in the talocalcaneal joint passes from the anterior medial superior aspect of the navicular 
bone to the posterior lateral inferior aspect of the calcaneus (Fig. 20.4). The motion that occurs in the 
talocalcaneal joint consists of inversion and eversion. 

20.2 Knee 

The knee is the intermediate joint of the lower limb. It is composed of the distal femur and proximal 
tibia. It is the largest and most complex joint in the body. The knee joint is composed of the tibiofemoral 
articulation and the patellofemoral articulation. 

Geometry of the Articulating Surfaces 

The shape of the articular surfaces of the proximal tibia and distal femur must fulfill the requirement 
that they move in contact with one another. The profile of the femoral condyles varies with the condyle 
examined (Fig. 20.5 and Table 20.4). The tibial plateau widths are greater than the corresponding widths 
of the femoral condyles (Fig. 20.6 and Table 20.5). However, the tibial plateau depths are less than those 
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FIGURE 20.2 Ratio of total contact area to joint area in the (A) anterior/middle facet and (B) posterior facet of 
the subtalar joint as a function of applied axial load for three different positions of the foot. Source: Wagner UA, 
Sangeorzan BJ, Harrington RM, Tencer AE 1992. Contact characteristics of the subtalar joint: load distribution 
between the anterior and posterior facets. J Orthop Res 10:535. With permission. 

of the femoral condyle distances. The medial condyle of the tibia is concave superiorly (the center of 
curvature lies above the tibial surface) with a radius of curvature of 80 mm [Kapandji, 1987]. The lateral 
condyle is convex superiorly (the center of curvature lies below the tibial surface) with a radius of 
curvature of 70 mm [Kapandji, 1987] . The shape of the femoral surfaces is complementary to the shape 
of the tibial plateaus. The shape of the posterior femoral condyles may be approximated by spherical 
surfaces (Table 20.4). 

The geometry of the patellofemoral articular surfaces remains relatively constant as the knee flexes. 
The knee sulcus angle changes only ±3.4° from 15 to 75° of knee flexion (Fig. 20.7). The mean depth 
index varies by only ±4% over the same flexion range (Fig. 20.7). Similarly, the medial and lateral patellar 
facet angles (Fig. 20.8) change by less than a degree throughout the entire knee flexion range (Table 20.6). 
However, there is a significant difference between the magnitude of the medial and lateral patellar facet 
angles. 

Joint Contact 

The mechanism of movement between the femur and tibia is a combination of rolling and gliding. 
Backward movement of the femur on the tibia during flexion has long been observed in the human knee. 
The magnitude of the rolling and gliding changes through the range of flexion. The tibial-femoral contact 
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TABLE 20.2 Axis of Rotation for the Ankle 



Elftman [1945] 


Fix 


Isman and Inman [1969] 


Fix 


Inman and Mann [1979] 


Fix 


Allardetal. [1987] 


Fix 



Investigators Axis* Position 

67.6° ± 7.4° with respect to sagittal plane 

8 mm anterior, 3 mm inferior to the distal tip of the lateral malleolus; 1 mm posterior, 

5 mm inferior to the distal tip of the medial malleolus 
79° (68-88°) with respect to the sagittal plane 

95.4° ± 6.6° with respect to the frontal plane, 77.7° ± 12.3° with respect to the sagittal 
plane, and 17.9° ± 4.5° with respect to the transverse plane 
Fix. 3.0 mm anterior, 2.5 mm inferior to distal tip of lateral malleolus, 2.2 mm posterior, 

10 mm inferior to distal tip of medial malleolus 
Ins. Inside and outside the body of the talus 
Ins. No consistent pattern 
Ins. 96% within 12 mm of a point 20 mm below the articular surface of the tibia along 

the long axis. 
Ins. At an approximate right angle to the longitudinal direction of the foot, passing through 
the corpus tali, with a direction from anterolaterosuperior to posteromedioinferior 
Q-I Dorsiflexion: down and lateral 

Plantarflexion: down and medial 
Q-I Dorsiflexion: 5 mm inferior to tip of lateral malleolus to 15 mm anterior to tip of 
medial malleolus 
Plantarflexion: 5 mm superior to tip of lateral malleolus to 15 mm anterior, 10 mm 
inferior to tip of medial malleolus 

* Fix. = fixed axis of rotation; Ins. = instantaneous axis of rotation; Q-I = quasi-instantaneous axis of rotation 



Singh etal. [1992] 

Sammarco et al. [1973] 
D Ambrosia et al. [1976] 
Parlasca et al. [1979] 

VanLangelaan [1983] 

Barnett and Napier 

Hicks [1953] 



OBLIQUITY OF ANKLE AXIS 



Number 

of 
Specimens 




78 82 86 90 
Angle (degrees) 



94 



s :o.=*± 3 - 6 



FIGURE 20.3 Variations in angle between middle of tibia and empirical axis of ankle. The histogram reveals a 
considerable spread of individual values. Source: Inman VT. 1976. The Joints of the Ankle, Baltimore, Williams and 
Wilkins. With permission. 

point has been shown to move posteriorly as the knee is flexed, reflecting the coupling of anterior/pos- 
terior motion with flexion/extension (Fig. 20.9). During flexion, the weight-bearing surfaces move back- 
ward on the tibial plateaus and become progressively smaller (Table 20.7). 

It has been shown that in an intact knee at full extension the center of pressure is approximately 25 mm 
from the anterior edge of the knee joint line [Andriacchi etal., 1986]. This net contact point moves 
posteriorly with flexion to approximately 38.5 mm from the anterior edge of the knee joint. Similar 
displacements have been noted in other studies (Table 20.8). 
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TABLE 20.3 Axis of Rotation for the Talocalcaneal (Subtalar) Joint 



Investigators 


Axis" 


Manter [1941] 


Fix, 


Shephard [1951] 


Fix. 


Hicks [1953] 


Fix. 


Rootetal. [1966] 


Fix. 


Isman and Inman [1969] 


Fix. 


Kirby [1947] 


Fix. 


Rastegar et al. [1980] 


Ins. 


VanLangelaan [1983] 


Ins. 


Engsberg [1987] 


Ins. 



Position 

16° (8-24°) with respect to sagittal plane, and 42° (29-47°) with respect to transverse 

plane 
Tuberosity of the calcaneus to the neck of the talus 

Posterolateral corner of the heel to superomedial aspect of the neck of the talus 
17° (8-29°) with respect to sagittal plane, and 41° (22-55°) with respect to transverse 

plane 
23° ±11° with respect to sagittal plane, and 41° ± 9° with respect to transverse plane 
Extends from the posterolateral heel, posteriorly, to the first intermetatarsal space, 

anteriorly 
Instant centers of rotation pathways in posterolateral quadrant of the distal 

articulating tibial surface, varying with applied load 
A bundle of axes that make an acute angle with the longitudinal direction of the foot 

passing through the tarsal canal having a direction from anteromediosuperior to 

posterolateroinferior 
A bundle of axes with a direction from anteromediosuperior to posterolateroinferior 

* Fix. = fixed axis of rotation; Ins. = instantaneous axis of rotation 

The patellofemoral contact area is smaller than the tibiofemoral contact area (Table 20.9). As the knee 
joint moves from extension to flexion, a band of contact moves upward over the patellar surface 
(Fig. 20.10). As knee flexion increases, not only does the contact area move superiorly, but it also becomes 
larger. At 90° of knee flexion, the contact area has reached the upper level of the patella. As the knee 
continues to flex, the contact area is divided into separate medial and lateral zones. 

Axes of Rotation 

The tibiofemoral joint is mainly a joint with two degrees of freedom. The first degree of freedom allows 
movements of flexion and extension in the sagittal plane. The axis of rotation lies perpendicular to the 
sagittal plane and intersects the femoral condyles. Both fixed axes and screw axes have been calculated 
(Fig. 20.11). In Fig. 20.11, the optimal axes are fixed axes, whereas the screw axis is an instantaneous axis. 
The symmetric optimal axis is constrained such that the axis is the same for both the right and left knee. 
The screw axis may sometimes coincide with the optimal axis but not always, depending upon the motions 
of the knee joint. The second degree of freedom is the axial rotation around the long axis of the tibia. 
Rotation of the leg around its long axis can only be performed with the knee flexed. There is also an 
automatic axial rotation which is involuntarily linked to flexion and extension. When the knee is flexed, 
the tibia internally rotates. Conversely, when the knee is extended, the tibia externally rotates. 

During knee flexion, the patella makes a rolling/gliding motion along the femoral articulating surface. 
Throughout the entire flexion range, the gliding motion is clockwise (Fig. 20.12). In contrast, the direction 
of the rolling motion is counter-clockwise between 0° and 90° and clockwise between 90° and 120° 
(Fig. 20.12). The mean amount of patellar gliding for all knees is approximately 6.5 mm per 10° of flexion 
between 0° and 80° and 4.5 mm per 10° of flexion between 80° and 120°. The relationship between the 
angle of flexion and the mean rolling/ gliding ratio for all knees is shown in Fig. 20.13. Between 80° and 
90° of knee flexion, the rolling motion of the articulating surface comes to a standstill and then changes 
direction. The reversal in movement occurs at the flexion angle where the quadriceps tendon first contacts 
the femoral groove. 



20.3 Hip 



The hip joint is composed of the head of the femur and the acetabulum of the pelvis. The hip joint is one 
of the most stable joints in the body. The stability is provided by the rigid ball-and-socket configuration. 
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FIGURE 20.4 (A) Variations in inclination of axis of subtalar joint as projected upon the sagittal plane. The 
distribution of the measurements on the individual specimens is shown in the histogram. The single observation of 
an angle of almost 70° was present in a markedly cavus foot. (B) Variations in position of subtalar axis as projected 
onto the transverse plane. The angle was measured between the axis and the midline of the foot. The extent of 
individual variation is shown on the sketch and revealed in the histogram. Source: Inman VT. 1976. The Joints of 
the Ankle, Baltimore, Williams and Wilkins. With permission. 

Geometry of the Articulating Surfaces 

The femoral head is spherical in its articular portion which forms two-thirds of a sphere. The diameter 
of the femoral head is smaller for females than for males (Table 20.10). In the normal hip, the center of 
the femoral head coincides exactly with the center of the acetabulum. The rounded part of the femoral 
head is spheroidal rather than spherical because the uppermost part is flattened slightly. This causes the 
load to be distributed in a ringlike pattern around the superior pole. The geometrical center of the femoral 
head is traversed by the three axes of the joint, the horizontal axis, the vertical axis, and the anterior/pos- 
terior axis. The head is supported by the neck of the femur, which joins the shaft. The axis of the femoral 
neck is obliquely set and runs superiorly, medially, and anteriorly. The angle of inclination of the femoral 
neck to the shaft in the frontal plane is the neck-shaft angle (Fig. 20.14). In most adults, this angle is 
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Lateral Medial 

FIGURE 20.5 Geometry of distal femur. The distances are defined in Table 20.4. 



TABLE 20.4 Geometry of the Distal Femur 







Condyle 












Lateral 




Medial 


Overall 


Parameter 


Symbol 


Distance (mm) 


Symbol 


Distance (mm) 


Symbol 


Distance (mm) 


Medial/lateral distance 


Kl 


31 + 2.3 (male) 
28 ± 1.8 (female) 


K2 


32 ± 31 (male) 
27 + 3.1 (female) 






Anterior/posterior distance 


K3 


72 ± 4.0 (male) 
65 ± 3.7 (female) 


K4 


70 ± 4.3 (male) 
63 ± 4.5 (female) 






Posterior femoral condyle 


K6 


19.2 ± 1.7 


K7 


20.8 ± 2.4 






spherical radii 














Epicondylar width 










K5 


90 ± 6 (male) 
80 ± 6 (female) 


Medial/lateral spacing of 










K8 


45.9 ± 3.4 


center of spherical surfaces 















Note: See Fig. 20.5 for location of measurements. 

Sources: Yoshioka Y, Siu D, Cooke TDV. 1987. The anatomy of functional axes of the femur. J Bone Joint Surg 69A(6):873-880. 
Kurosawa H, Walker PS, Abe S, Garg A, Hunter T. 1985. Geometry and motion of the knee for implant and orthotic design. 
J Biomech 18(7):487. 



about 130° (Table 20.10). An angle exceeding 130° is known as coxa valga; an angle less than 130° is 
known as coxa vara. The femoral neck forms an acute angle with the transverse axis of the femoral 
condyles. This angle faces medially and anteriorly and is called the angle of anteversion (Fig. 20.15). In 
the adult, this angle averages about 7.5° (Table 20.10). 

The acetabulum receives the femoral head and lies on the lateral aspect of the hip. The acetabulum of 
the adult is a hemispherical socket. Its cartilage area is approximately 16 cm 2 [Von Lanz & Wauchsmuth, 
1938]. Together with the labrum, the acetabulum covers slightly more than 50% of the femoral head 
[Tonnis, 1987]. Only the sides of the acetabulum are lined by articular cartilage, which is interrupted 
inferiorly by the deep acetabular notch. The central part of the cavity is deeper than the articular cartilage 
and is nonarticular. This part is called the acetabular fossae and is separated from the interface of the 
pelvic bone by a thin plate of bone. 
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FIGURE 20.6 Contour of the tibial plateau (transverse plane). The distances are defined in Table 20.5. 



TABLE 20.5 Geometry of the Proximal Tibia 



Parameter 


Symbols 


All Limbs 


Male 


Female 


Tibial plateau with widths (mm) 










Medial plateau 


T, 


32 ± 3.8 


34 ± 3.9 


30 ± 22 


Lateral plateau 


T 3 


33 ± 2.6 


35 ± 1.9 


31 ± 1.7 


Overall width 


T t + T, + T 3 


76 + 6.2 


81 ± 4.5 


73 ± 4.5 


Tibial plateau depths (mm) 










AP depth, medial 


T 4 


48 ± 5.0 


52 ± 3.4 


45 ± 4.1 


AP depth, lateral 


T 5 


42 ± 3.7 


45 ± 3.1 


40 ± 2.3 


Interspinous width (mm) 


T 2 


12 ± 1.7 


12 ± 0.9 


12 ± 2.2 


Intercondylar depth (mm) 


T 6 


48 ± 5.9 


52 ± 5.7 


45 ± 3.9 



Source: Yoshioka Y, Siu D, Scudamore RA, Cooke TDV. 1989. Tibial anatomy in 
functional axes. ] Orthop Res 7:132. 

Joint Contact 

Miyanaga et al. [1984] studied the deformation of the hip joint under loading, the contact area between 
the articular surfaces, and the contact pressures. They found that at loads up to 1000 N, pressure was 
distributed largely to the anterior and posterior parts of the lunate surface with very little pressure applied 
to the central portion of the roof itself. As the load increased, the contact area enlarged to include the 
outer and inner edges of the lunate surface (Fig. 20.16). However, the highest pressures were still measured 
anteriorly and posteriorly. Of five hip joints studied, only one had a pressure maximum at the zenith or 
central part of the acetabulum. 

Davy et al. [1989] utilized a telemetered total hip prosthesis to measure forces across the hip after total 
hip arthroplasty. The orientation of the resultant joint contact force varies over a relatively limited range 
during the weight-load-bearing portions of gait. Generally, the joint contact force on the ball of the hip 
prosthesis is located in the anterior/superior region. A three-dimensional plot of the resultant joint force 
during the gait cycle, with crutches, is shown in Fig. 20.17. 
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FIGURE 20.7 The trochlear geometry indices. The sulcus angle is the angle formed by the lines drawn from the top 
of the medial and lateral condyles to the deepest point of the sulcus. The depth index is the ratio of the width of the 
groove (WG) to the depth (DG). Mean and SD; n = 12. Source: Farahmand et al. Quantitative study of the quadriceps 
muscles and trochlear groove geometry related to instability of the patellofemoral joint, J Orthopaedic Res, 16:1, 140. 
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FIGURE 20.8 Medial (y m ) and lateral (y n ) patellar facet angles. Source: Ahmed AM, Burke DL, Hyder A. 1987. Force 
analysis of the patellar mechanism. J Orthopedic Res 5:69-85. 



TABLE 20.6 Patellar Facet Angles 



Facet 




Knee Flexion Angle 




Angle 


0° 


30° 


60° 


90° 


120° 


Tm (deg) 


60.88 


60.96 


61.43 


61.30 


60.34 




3.89" 


4.70 


4.12 


4.18 


4.51 


Tm (deg) 


67.76 


68.05 


69.36 


68.39 


68.20 




4.15 


3.97 


3.63 


4.01 


3.67 



Source: Ahmed AM, Burke DL, Hyder A. 1987. Force 
analysis of the patellar mechanism. J Orthopedic 
Research 5:69-85. 
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FIGURE 20.9 Tibio-femoral contact area as a function of knee flexion angle. Source: Iseki F, Tomatsu T. 1976. The 
biomechanics of the knee joint with special reference to the contact area. Keio J Med 25:37. With permission. 

TABLE 20.7 Tibiofemoral Contact Area 



Knee Flexion (deg) 


Contact Area (cm 2 ) 


-5 


20.2 


5 


19.8 


15 


19.2 


25 


18.2 


35 


14.0 


45 


13.4 


55 


11.8 


65 


13.6 


75 


11.4 


85 


12.1 



Source: Maquet PC, Vandberg AJ, Simonet JC 
1975. Femorotibial weight bearing areas: Experi- 
mental determination. J Bone Joint Surg 57A(6):766. 

TABLE 20.8 Posterior Displacement of the Femur 
Relative to the Tibia 







A/P 






Displacement 


Authors 


Condition 


(mm) 


Kurosawa [1985] 


In vitro 


14.8 


Andriacchi [1986] 


In vitro 


13.5 


Draganich [1987] 


In vitro 


13.5 


Nahass [1991] 


In vivo (walking) 


12.5 




In vivo (stairs) 


13.9 



Axes of Rotation 

The human hip is a modified spherical (ball-and-socket) joint. Thus, the hip possesses three degrees of 
freedom of motion with three correspondingly arranged, mutually perpendicular axes that intersect at 
the geometric center of rotation of the spherical head. The transverse axis lies in the frontal plane and 
controls movements of flexion and extension. An anterior/posterior axis lies in the sagittal plane and 
controls movements of adduction and abduction. A vertical axis which coincides with the long axis of 
the limb when the hip joint is in the neutral position controls movements of internal and external rotation. 
Surface motion in the hip joint can be considered as spinning of the femoral head on the acetabulum. 
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TABLE 20.9 Patellofemoral Contact Area 



Knee Flexion (degrees) 


Contact Area (cm 2 ) 


20 


2.6 ± 0.4 


30 


3.1 ±0.3 


60 


3.9 ± 0.6 


90 


4.1 ± 1.2 


120 


4.6 ± 0.7 



Source: Hubert HH, Hayes WC. 1984. Patel- 
lofemoral contact pressures: The influence of 
Q-angle and tendofemoral contact. J Bone and 
Joint Surgery 66A(5):715-725. 




20° 45° 90° 



FIGURE 20.10 Diagrammatic representation of patella con- 
tact areas for varying degrees of knee flexion. Source: Goodfellow 
J, Hungerford DS, Zindel M, Patellofemoral joint mechanics and 
pathology. J Bone and Joint Surgery 58-B:3, 288. 




135° 



The pivoting of the bone socket in three planes around the center of rotation in the femoral head produces 
the spinning of the joint surfaces. 

20.4 Shoulder 

The shoulder represents the group of structures connecting the arm to the thorax. The combined 
movements of four distinct articulations — glenohumeral, acromioclavicular, sternoclavicular, and scapu- 
lothoracic — allow the arm to be positioned in space. 

Geometry of the Articulating Surfaces 

The articular surface of the humerus is approximately one-third of a sphere (Fig. 20.18). The articular 
surface is oriented with an upward tilt of approximately 45° and is retroverted approximately 30° with 
respect to the condylar line of the distal humerus [Morrey & An, 1990]. The average radius of curvature 
of the humeral head in the coronal plane is 24.0 ±2.1 mm [Iannotti et al., 1992]. The radius of curvature 
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O NONSYMMETRICAL OPTIMAL AXIS 

• SCREW AXIS 

X SYMMETRICAL OPTIMAL AXIS 

FIGURE 20.11 Approximate location of the optimal axis (case 1 — nonsymmetric, case 3 — symmetric), and the 
screw axis (case 2) on the medial and lateral condyles of the femur of a human subject for the range of motion of 
0-90° flexion (standing to sitting, respectively). Source: Lewis JL, Lew WD. 1978. A method for locating an optimal 
"fixed" axis of rotation for the human knee joint. J Biomech Eng 100:187. With permission. 




FIGURE 20.12 Position of patellar ligament, patella, and quadriceps tendon and location of the contact points as 
a function of the knee flexion angle. Source: van Eijden TMGJ, Kouwenhoven E, Verburg J, et al. A mathematical 
model of the patellofemoral joint. J Biomechanics 19(3):227. 

in the anteroposterior and axillary-lateral view is similar, measuring 13.1 ± 1.3 mm and 22.9 ± 2.9 mm, 
respectively [McPherson et al., 1997] . The humeral articulating surface is spherical in the center. However, 
the peripheral radius is 2 mm less in the axial plane than in the coronal plane. Thus the peripheral 
contour of the articular surface is elliptical with a ratio of 0.92 [Iannotti et al., 1992]. The major axis is 
superior to inferior and the minor axis is anterior to posterior [McPherson et al., 1997]. More recently, 
the three-dimensional geometry of the proximal humerus has been studied extensively. The articular 
surface, which is part of a sphere, varies individually in its orientation with respect to inclination and 
retroversion, and it has variable medial and posterior offsets [Boileau and Walch, 1997]. These findings 
have great impact in implant design and placement in order to restore soft-tissue function. 

The glenoid fossa consists of a small, pear-shaped, cartilage-covered bony depression that measures 
39.0 ± 3.5 mm in the superior/inferior direction and 29.0 ± 3.2 mm in the anterior/posterior direction 
[Iannotti et al., 1992]. The anterior/posterior dimension of the glenoid is pear-shaped with the lower 
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FIGURE 20.13 Calculated rolling/gliding ratio for the patellofemoral joint as a function of the knee flexion angle. 
Source: van Eijden TMGJ, Kouwenhoven E, Verburg J, et al. A mathematical model of the patellofemoral joint. 
J Biomechanics 19(3):226. 





20 




■i- 

40 




60 




80 



TABLE 20.10 Geometry of the Proximal Femur 



Parameter 

Femoral head diameter (mm) 
Neck shaft angle (degrees) 
Anteversion (degrees) 



Females 



Males 



45.0 ± 3.0 52.0 ± 3.3 

133 ± 6.6 129 ± 7.3 

8 + 10 7.0 ± 6.8 



Source: Yoshioka Y, Siu D, Cooke TDV. 1987. The anat- 
omy and functional axes of the femur. J Bone Joint Surg 
69A(6):873. 




FIGURE 20.14 The neck-shaft angle. 

half being larger than the top half. The ratio of the lower half to the top half is 1:0.80 ± 0.01 [Iannotti 
et al., 1992]. The glenoid radius of curvature is 32.2 ± 7.6 mm in the anteroposterior view and 40.6 ± 
14.0 mm in the axillary-lateral view [McPherson et al., 1997]. The glenoid is therefore more curved 
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FIGURE 20.15 The normal anteversion angle formed by a line tangent to the femoral condyles and the femoral 
neck axis, as displayed in the superior view. 
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FIGURE 20.16 Pressure distribution and contact area of hip joint. The pressure is distributed largely to the anterior 
and posterior parts of the lunate surface. As the load increased, the contact area increased. Source: Miyanaga Y, 
Fukubayashi T, Kurosawa H. 1984. Contact study of the hip joint: Load deformation pattern, contact area, and contact 
pressure. Arch Orth Trauma Surg 103:13. With permission. 

superior to inferior (coronal plane) and relatively flatter in an anterior to posterior direction (sagittal 
plane). Glenoid depth is 5.0 + 1.1 mm in the anteroposterior view and 2.9 ± 1.0 mm in the axillary- 
lateral [McPherson et al., 1997], again confirming that the glenoid is more curved superior to inferior. 
In the coronal plane the articular surface of the glenoid comprises an arc of approximately 75° and in 
the transverse plane the arc of curvature of the glenoid is about 50° [Morrey & An, 1990]. The glenoid 
has a slight upward tilt of about 5° [Basmajian & Bazant, 1959] with respect to the medial border of the 
scapula (Fig. 20.19) and is retroverted a mean of approximately 7° [Saha, 1971]. The relationship of the 
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FIGURE 20.17 Scaled three-dimensional plot of resultant force during the gait cycle with crutches. The lengths of 
the lines indicate the magnitude offeree. Radial line segments are drawn at equal increments of time, so the distance 
between the segments indicates the rate at which the orientation of the force was changing. For higher amplitudes 
of force during stance phase, line segments in close proximity indicate that the orientation of the force was changing 
relatively little with the cone angle between 30 and 40° and the polar angle between -25 and -15°. Source: Davy DT, 
Kotzar DM, Brown RH, et al. 1989. Telemetric force measurements across the hip after total arthroplasty. J Bone 
Joint Surg 70A(1):45, with permission. 



LJ 30-40° 





FIGURE 20.18 The two-dimensional orientation of the articular surface of the humerus with respect to the 
bicondylar axis. By permission of Mayo Foundation. 

dimension of the humeral head to the glenoid head is approximately 0.8 in the coronal plane and 0.6 in 
the horizontal or transverse plane [Saha, 1971]. The surface area of the glenoid fossa is only one-third 
to one-fourth that of the humeral head [Kent, 1971]. The arcs of articular cartilage on the humeral head 
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FIGURE 20.19 The glenoid faces slightly superior and posterior (retroverted) with respect to the body of the 
scapula. By permission of Mayo Foundation. 

and glenoid in the frontal and axial planes were measured [Jobe and Iannotti, 1995] . In the coronal plane, 
the humeral heads had an arc of 159° covered by 96° of glenoid, leaving 63° of cartilage uncovered. In 
the transverse plane, the humeral arc of 160° is opposed by 74° of glenoid, leaving 86° uncovered. 

Joint Contact 

The degree of conformity and constraint between the humeral head and glenoid has been represented 
by conformity index (radius of head/radius of glenoid) and constraint index (arc of enclosure/360) 
[McPherson, 1997]. Based on the study of 93 cadaveric specimens, the mean conformity index was 0.72 
in the coronal and 0.63 in the sagittal plane. There was more constraint to the glenoid in the coronal vs. 
sagittal plane (0.18 vs. 0.13). These anatomic features help prevent superior-inferior translation of the 
humeral head but allow translation in the sagittal plane. Joint contact areas of the glenohumeral joint 
tend to be greater at mid-elevation positions than at either of the extremes of joint position (Table 20.11). 
These results suggest that the glenohumeral surface is maximum at these more functional positions, thus 
distributing joint load over a larger region in a more stable configuration. The contact point moves 

TABLE 20. 1 1 Glenohumeral Contact Areas 







Contact areas 


Elevation 


Contact areas 


at 20° internal to 


angle (°) 


at SR (cm 2 ) 


SR (cm 2 ) 





0.87 ± 1.01 


1.70 ± 1.68 


30 


2.09 ± 1.54 


2.44 ± 2.15 


60 


3.48 ± 1.69 


4.56 ± 1.84 


90 


4.95 ±2.15 


3.92 ±2.10 


120 


5.07 ± 2.35 


4.84 ± 1.84 


150 


3.52 ± 2.29 


2.33 ± 1.47 


180 


2.59 ± 2.90 


2.51 ± NA 



SR = Starting external rotation which allowed the shoulder 
to reach maximal elevation in the scapular plane (=40° ± 8°); 
NA = Not applicable 

Source: Soslowsky LJ, Flatow EL, Bigliani LU, Pablak R], 
Mow VC, Athesian GA. 1992. Quantitation of in situ contact 
areas at the glenohumeral joint: A biomechanical study. 
J Orthop Res 10(4):524. With permission. 
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FIGURE 20.20 Humeral contact positions as a function of glenohumeral motion and positions. Source: Morrey 
BF, An KN. 1990. Biomechanics of the Shoulder. In CA Rockwood and FA Matsen (eds), The Shoulder, pp 208-245, 
Philadelphia, Saunders. With permission. 

forward and inferior during internal rotation (Fig. 20.20). With external rotation, the contact is poste- 
rior/inferior. With elevation, the contact area moves superiorly. Lippitt and associates [1998] calculated 
the stability ratio, which is defined as a force necessary to translate the humeral head from the glenoid 
fossa divided by the compressive load times 100. The stability ratios were in the range of 50 to 60% in 
the superior-inferior direction and 30 to 40% in the anterior-posterior direction. After the labrum was 
removed, the ratio decreased by approximately 20%. Joint conformity was found to have significant 
influence on translations of humeral head during active positioning by muscles [Karduna, et al. 1996]. 

Axes of Rotation 

The shoulder complex consists of four distinct articulations: the glenohumeral joint, the acromioclavic- 
ular joint, the sternoclavicular joint, and the scapulothoracic articulation. The wide range of motion of 
the shoulder (exceeding a hemisphere) is the result of synchronous, simultaneous contributions from 
each joint. The most important function of the shoulder is arm elevation. Several investigators have 
attempted to relate glenohumeral and scapulothoracic motion during arm elevation in various planes 
(Table 20.12). About two-thirds of the motion takes place in the glenohumeral joint and about one-third 
in the scapulothoracic articulation, resulting in a 2:1 ratio. 

Surface motion at the glenohumeral joint is primarily rotational. The center of rotation of the gleno- 
humeral joint has been defined as a locus of points situated within 6.0 ±1.8 mm of the geometric center 
of the humeral head [Poppen & Walker, 1976]. However, the motion is not purely rotational. The humeral 
head displaces, with respect to the glenoid. From 0-30°, and often from 30-60°, the humeral head moves 
upward in the glenoid fossa by about 3 mm, indicating that rolling and/or gliding has taken place. 
Thereafter, the humeral head has only about 1 mm of additional excursion. During arm elevation in the 
scapular plane, the scapula moves in relation to the thorax [Poppen & Walker, 1976]. From 0-30° the 
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TABLE 20.12 Arm Elevation: Glenohumeral- 
Scapulothoracic Rotation 



Investigator 



Glenohumeral/Scapulothoracic 
Motion Ratio 



Inman etal. [1994] 
Freedman & Munro 
Doody etal. [1970] 
Poppen & Walker [1976] 

Saha [1971] 



2:1 
1966] 1.35:1 

1.74:1 

4.3:1 (<24° elevation) 
1.25:1 (>24° elevation) 
2.3:1 (30-135° elevation) 




MED. 
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FIGURE 20.21 Rotation of the scapula on the thorax in the scapular plane. Instant centers of rotation (solid dots) 
are shown for each 30° interval of motion during shoulder elevation in the scapular plane from zero to 150°. The 
x and y axes are fixed in the scapula, whereas the X and Y axes are fixed in the thorax. From zero to 30° in the scapula 
rotated about its lower midportion; from 60° onward, rotation took place about the glenoid area, resulting in a 
medial and upward displacement of the glenoid face and a large lateral displacement of the inferior tip of the scapula. 
Source: Poppen NK, Walker PS. 1976. Normal and abnormal motion of the shoulder. J Bone Joint Surg 58A:195. 
With permission. 

scapula rotates about its lower mid portion, and then from 60° onward the center of rotation shifts toward 
the glenoid, resulting in a large lateral displacement of the inferior tip of the scapula (Fig. 20.21). The 
center of rotation of the scapula for arm elevation is situated at the tip of the acromion as viewed from 
the edge on (Fig. 20.22). The mean amount of scapular twisting at maximum arm elevation is 40°. The 
superior tip of the scapula moves away from the thorax, and the inferior tip moves toward it. 
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FIGURE 20.22 (A) A plot of the tips of the acromion and coracoid process on roentgenograms taken at successive 
intervals of arm elevation in the scapular plane shows upward movement of the coracoid and only a slight shift in 
the acromion relative to the glenoid face. This finding demonstrates twisting, or external rotation, of the scapula 
about the x axis. (B) A lateral view of the scapula during this motion would show the coracoid process moving 
upward while the acromion remains on the same horizontal plane as the glenoid. Source: Poppen NK, Walker PS. 
1976. Normal and abnormal motion of the shoulder. J Bone Joint Surg 58A:195. With permission. 



20.5 Elbow 

The bony structures of the elbow are the distal end of the humerus and the proximal ends of the radius 
and ulna. The elbow joint complex allows two degrees of freedom in motion: flexion/extension and 
pronation/supination. The elbow joint complex is three separate synovial articulations. The humeral- 
ulnar joint is the articulation between the trochlea of the distal radius and the trochlear fossa of the 
proximal ulna. The humero-radial joint is formed by the articulation between the capitulum of the distal 
humerus and the head of the radius. The proximal radioulnar joint is formed by the head of the radius 
and the radial notch of the proximal ulna. 

Geometry of the Articulating Surfaces 

The curved, articulating portions of the trochlea and capitulum are approximately circular in a cross- 
section. The radius of the capitulum is larger than the central trochlear groove (Table 20.13). The centers 
of curvature of the trochlea and capitulum lie in a straight line located on a plane that slopes at 45-50° 
anterior and distal to the transepicondylar line and is inclined at 2.5° from the horizontal transverse 
plane [Shiba et al., 1988]. The curves of the ulnar articulations form two surfaces (coronoid and olecra- 
non) with centers on a line parallel to the transepicondylar line but are distinct from it [Shiba et al., 
1988]. The carrying angle is an angle made by the intersection of the longitudinal axis of the humerus 
and the forearm in the frontal plane with the elbow in an extended position. The carrying angle is 
contributed to, in part, by the oblique axis of the distal humerus and, in part, by the shape of the proximal 
ulna (Fig. 20.23). 
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TABLE 20.13 Elbow Joint Geometry 



Parameter 


Size (mm) 


Capitulum radius 


10.6 + 1.1 


Lateral trochlear flange radius 


10.8 ± 1.0 


Central trochlear groove radius 


8.8 ± 0.4 


Medial trochlear groove radius 


13.2 ± 1.4 


Distal location of flexion/extension 




axis from transepicondylar line: 




Lateral 


6.8 ± 0.2 


Medial 


8.7 ± 0.6 



Source: Shiba R, Sorbie C, Siu DW, Bryant JT, 
Cooke TDV, Weavers HW. 1988. Geometry of the 
humeral-ulnar joint. J Orthop Res 6:897. 



FIGURE 20.23 Components contributing to the car- 
rying angles: a + X + \|/. Key: a, angle between C-line 
and TEL; y, inclination of central groove (eg); X, angle 
between trochlear notch (tn); \|/, reverse angulation of 
shaft of ulna; TLE, transepicondylar line; C-line, line 
joining centers of curvature of the trochlea and capitel- 
lum; eg, central groove; op, olecranon process; tr, tro- 
chlear ridge; cp, coronoid process, a = 2.5 ± 0.0; A. = 
17.5 ± 5.0 (females) and 12.0 ± 7.0 (males); v|/ = -6.5 ± 
0.7 (females) and -9.5 ± 3.5 (males). Source: Shiba R, 
Sorbie C, Siu DW, Bryant JT, Cooke TDV, Weavers HW. 
1988. Geometry of the humeral-ulnar joint. J Orthop 
Res 6:897. With permission. 




Joint Contact 

The contact area on the articular surfaces of the elbow joint depends on the joint position and the loading 
conditions. Increasing the magnitude of the load not only increases the size of the contact area but shifts 
the locations as well (Fig. 20.25). As the axial loading is increased, there is an increased lateralization of 
the articular contact [Stormont et al., 1985]. The area of contact, expressed as a percentage of the total 
articulating surface area, is given in Table 20.14. Based on a finite element model of the humero-ulnar 
joint, Merz et al. [1997] demonstrated that the humero-ulnar joint incongruity brings about a bicentric 
distribution of contact pressure, a tensile stress exists in the notch that is the same order of magnitude 
as the compressive stress [Merz, 1997]. 
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(POSTERIOR VIEW) 



(ANTERIOR VIEW) 
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FIGURE 20.24 Contact of the ulnohumeral joint with varus and valgus loads and the elbow at 90°. Notice only 
minimal radiohumeral contact in this loading condition. Source: Stormont TJ, An KN, Morrey BF, Chae EY. 1985. In 
Elbow joint contact study: Comparison of techniques. J Biomech 18(5):329. Reprinted with permission of Elsevier 
Science Inc. 




FIGURE 20.25 Very small locus of instant center of rotation for the elbow joint demonstrates that the axis may be 
replicated by a single line drawn from the inferior aspect of the medial epicondyle through the center of the lateral 
epicondyle, which is in the center of the lateral projected curvature of the trochlea and capitellum. Source: Modified 
from Morrey BF, Chao EYS. 1976. In Passive motion of the elbow joint: a biomechanical analysis. J Bone Joint Surg 
58A(4):501. Reprinted with permission of the Journal of Bone and Joint Surgery. 

Axes of Rotation 

The axes of flexion and extension can be approximated by a line passing through the center of the trochlea, 
bisecting the angle formed by the longitudinal axes of the humerus and the ulna [Morrey & Chao, 1976] . 
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TABLE 20.14 Elbow Joint Contact Area 





Total Articulating Surface 






Area of Ulna and 




Position 


Radial Head (mm 2 ) 


Contact Area (%) 


Full extension 


1598 ± 103 


8.1 ±2.7 


90° flexion 


1750 ± 123 


10.8 ± 2.0 


Full flexion 


1594 ± 120 


9.5 ±2.1 



Source: Goel VK, Singh D, Bijlani V. 1982. Contact areas in 
human elbow joints. J Biomech Eng 104:169. 

The instant centers of flexion and extension vary within 2-3 mm of this axis (Fig. 20.25). With the elbow 



11.0 
9.75 

75 

5.25 

3.0 

en 

c 

'5. -1.5 



o 



-3.5 



-6.0 




20 40 60 80 

Elbow Flexion <|>° 



I00 



120 



FIGURE 20.26 During elbow flexion and extension, a linear change in the carrying angle is demonstrated, typically 
going from valgus in extension to varus in flexion. Source: Morrey BF, Chao EYS. 1976. In Passive motion of the 
elbow joint: A biomechanical analysis. J Bone Joint Surg 58A(4):501. Reprinted with permission of the Journal of 
Bone and Joint Surgery. 



fully extended and the forearm fully supinated, the longitudinal axes of humerus and ulna normally 
intersect at a valgus angle referred to as the carrying angle. In adults, this angle is usually 10-15° and 
normally is greater on average in women [Zuckerman & Matsen, 1989]. As the elbow flexes, the carrying 
angle varies as a function of flexion (Fig. 20.26). In extension there is a valgus angulation of 10°; at full 
flexion there is a varus angulation of 8° [Morrey & Chao, 1976]. More recently, the three-dimensional 
kinematics of the ulno-humeral joint under simulated active elbow joint flexion-extension was obtained 
by using an electromagnetic tracking device [Tanaka et al. 1998]. The optimal axis to best represent 
flexion-extension motion was found to be close to the line joining the centers of the capitellum and the 
trochlear groove. Furthermore, the joint laxity under valgus-varus stress was also examined. With the 
weight of the forearm as the stress, a maximum of 7.6° valgus-varus and 5.3° of axial rotation laxity were 
observed. 

20.6 Wrist 

The wrist functions by allowing changes of orientation of the hand relative to the forearm. The wrist 
joint complex consists of multiple articulations of eight carpal bones with the distal radius, the structures 
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of the ulnocarpal space, the metacarpals, and each other. This collection of bones and soft tissues is 
capable of a substantial arc of motion that augments hand and finger function. 

TABLE 20.15 Changes of Wrist Geometry with Grasp 











Analysis of 




Resting 


Grasp 


Variance (p = level) 


Distal radioulnar joint space (mm) 


1.6 ± 0.3 


1.8 ± 


0.6 


0.06 


Ulnar variance (mm) 


-0.2 ± 1.6 


0.7 ± 


1.8 


0.003 


Lunate, uncovered length (mm) 


6.0 ± 1.9 


7.6 ± 


2.6 


0.0008 


Capitate length (mm) 


21.5 ± 2.2 


20.8 ± 


2.3 


0.0002 


Carpal height (mm) 


33.4 ± 3.4 


31.7 ± 


3.4 


0.0001 


Carpal ulnar distance (mm) 


15.8 ± 4.0 


15.8 ± 


3.0 


NS 


Carpal radial distance (mm) 


19.4 ± 1.8 


19.7 ± 


1.8 


NS 


Third metacarpal length (mm) 


63.8 ± 5.8 


62.6 ± 


5.5 


NS 


Carpal height ratio 


52.4 ± 3.3 


50.6 ± 


4.1 


0.02 


Carpal ulnar ratio 


24.9 ± 5.9 


25.4 ± 


5.3 


NS 


Lunate uncovering index 


36.7 ± 12.1 


45.3 ± 


14.2 


0.002 


Carpal radial ratio 


30.6 ± 2.4 


31.6 ± 


2.3 


NS 


Radius — third metacarpal angle (degrees) 


-0.3 ± 9.2 


-3.1 ± 


12.8 


NS 


Radius — capitate angle (degrees) 


0.4 ± 15.4 


-3.8 ± 


22.2 


NS 



Note: 15 normal subjects with forearm in neutral position and elbow at 90° flexion. 
Source: Schuind FA, Linscheid RL, An KN, Chao EYS. 1992. Changes in wrist and forearm 
configuration with grasp and isometric contraction of elbow flexors. J Hand Surg 17A:698. 



Geometry of the Articulating Surfaces 

The global geometry of the carpal bones has been quantified for grasp and active isometric contraction 
of the elbow flexors [Schuind et al., 1992]. During grasping there is a significant proximal migration of 
the radius of 0.9 mm, apparent shortening of the capitate, a decrease in the carpal height ratio, and an 
increase in the lunate uncovering index (Table 20.15). There is also a trend toward increase of the distal 
radioulnar joint with grasping. The addition of elbow flexion with concomitant grasping did not signif- 
icantly change the global geometry, except for a significant decrease in the forearm interosseous space 
[Schuind et al, 1992]. 

Joint Contact 

Studies of the normal biomechanics of the proximal wrist joint have determined that the scaphoid and 
lunate bones have separate, distinct areas of contact on the distal radius/triangular fibrocartilage complex 
surface [Viegas et al., 1987] so that the contact areas were localized and accounted for a relatively small 
fraction of the joint surface, regardless of wrist position (average of 20.6%). The contact areas shift from 
a more volar location to a more dorsal location as the wrist moves from flexion to extension. Overall, 
the scaphoid contact area is 1.47 times greater than that of the lunate. The scapho-lunate contact area 
ratio generally increases as the wrist position is changed from radial to ulnar deviation and/or from 
flexion to extension. Palmer and Werner [1984] also studied pressures in the proximal wrist joint and 
found that there are three distinct areas of contact: the ulno-lunate, radio-lunate, and radio-scaphoid. 
They determined that the peak articular pressure in the ulno-lunate fossa is 1.4 N/mm 2 , in the radio- 
ulnate fossa is 3.0 N/mm 2 , and in the radio-scaphoid fossa is 3.3 N/mm 2 . Viegas et al. [1989] found a 
nonlinear relationship between increasing load and the joint contact area (Fig. 20.27). In general, the 
distribution of load between the scaphoid and lunate was consistent with all loads tested, with 60% of 
the total contact area involving the scaphoid and 40% involving the lunate. Loads greater than 46 lbs 
were found to not significantly increase the overall contact area. The overall contact area, even at the 
highest loads tested, was not more than 40% of the available joint surface. 
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FIGURE 20.27 The nonlinear relation between the contact area and the load at the proximal wrist joint. The contact 
area was normalized as a percentage of the available joint surface. The load of 1 1, 23, 46, and 92 lbs. was applied at 
the position of neutral pronation/supination, neutral radioulnar deviation, and neutral flexion/extension. Source: 
Viegas SF, Patterson RM, Peterson PD, Roefs J, Tencer A, Choi S. 1989. The effects of various load paths and different 
loads on the load transfer characteristics of the wrist. J Hand Surg 14A(3):458. With permission. 

Horii et al. [1990] calculated the total amount of force born by each joint with the intact wrist in the 
neutral position in the coronal plane and subjected to a total load of 143 N (Table 20.16). They found 

TABLE 20.16 Force Transmission at the Intercarpal Joints 



Radio-ulno-carpal 
Ulno-triquetral 
Ulno-lunate 
Radio-lunate 
Radio-scaphoid 

Midcarpal 

Triquetral-hamate 
Luno-capitate 
Scapho-capitate 
Scapho-trapezial 



Force (N) 



12 ±3 
23 ± 8 
52 ± 8 
74 ± 13 

36 ±6 
51 ±6 
32 ±4 
51 ±8 



Note: A total of 143 N axial force applied across the wrist. 

Source: Horii E, Garcia-Elias M, An KN, Bishop AT, Cooney 
WP, Linscheid RL, Chao EY. 1990. Effect on force transmission 
across the carpus in procedures used to treat Kienbock's Disease. 
J Bone Joint Surg 15A(3):393. 



that 22% of the total force in the radio-ulno-carpal joint is dissipated through the ulna 14% through the 
ulno-lunate joint, and 18% through the ulno-triquetral joint) and 78% through the radius (46% through 
the scaphoid fossa and 32% through the lunate fossa). At the midcarpal joint, the scapho-trapezial joint 
transmits 31% of the total applied force, the scapho-capitate joint transmits 19%, the luno-capitate joint 
transmits 29%, and the triquetral-hamate joints transmits 21% of the load. 
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FIGURE 20.28 The location of the center of rotation during ulnar deviation (left) and extension (right), determined 
graphically using two metal markers embedded in the capitate. Note that during radial-ulnar deviation the center 
lies at a point in the capitate situated distal to the proximal end of this bone by a distance equivalent to approximately 
one-quarter of its total longitudinal length. During flexion-extension, the center of rotation is close to the proximal 
cortex of the capitate. Source: Youm Y, McMurty RY, Flatt AE, Gillespie TE. 1978. Kinematics of the wrist: an 
experimental study of radioulnar deviation and flexion/extension. J Bone Joint Surg 60A(4):423. With permission. 



A limited amount of studies have been done to determine the contact areas in the midcarpal joint. 
Viegas et al. [1990] have found four general areas of contact: the scapho-trapezial-trapezoid (STT), the 
scapho-capitate (SC), the capito-lunate (CL), and the triquetral-hamate (TH). The high pressure contact 
area accounted for only 8% of the available joint surface with a load of 32 lbs and increased to a maximum 
of only 15% with a load of 1 18 lbs. The total contact area, expressed as a percentage of the total available 
joint area for each fossa was: STT = 1.3%, SC = 1.8%, CL = 3.1%, and TH = 1.8%. 

The correlation between the pressure loading in the wrist and the progress of degenerative osteoarthritis 
associated with pathological conditions of the forearm was studied in a cadaveric model [Sato, 1995]. 
Malunion after distal radius fracture, tear of triangular fibrocartilage, and scapholunate dissociation were 
all responsible for the alteration of the articulating pressure across the wrist joint. Residual articular 
incongruity of the distal radius following intra-articular fracture has been correlated with early osteoar- 
thritis. In an in vitro model, step-offs of the distal radius articular incongruity were created. Mean contact 
stress was significantly greater than the anatomically reduced case at only 3 mm of step-off [Anderson 
etal., 1996]. 

Axes of Rotation 

The complexity of joint motion at the wrist makes it difficult to calculate the instant center of motion. 
However, the trajectories of the hand during radioulnar deviation and flexion/extension, when they occur 
in a fixed plane, are circular, and the rotation in each plane takes place about a fixed axis. These axes are 
located within the head of the capitate and are not altered by the position of the hand in the plane of 
rotation [Youm et al., 1978]. During radioulnar deviation, the instant center of rotation lies at a point 
in the capitate situated distal to the proximal end of this bone by a distance equivalent to approximately 
one-quarter of its total length (Fig. 20.28). During flexion/extension, the instant center is close to the 
proximal cortex of the capitate, which is somewhat more proximal than the location for the instant center 
of radioulnar deviation. 

Normal carpal kinematics were studied in 22 cadaver specimens using a biplanar radiography method. 
The kinematics of the trapezium, capitate, hamate, scaphoid, lunate, and triquetrum were determined 
during wrist rotation in the sagittal and coronal plane [Kobagashi et al., 1997] . The results were expressed 
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using the concept of the screw displacement axis and covered to describe the magnitude of rotation about 
and translation along three orthogonal axes. The orientation of these axes is expressed relative to the 
radius during sagittal plane motion of the wrist (Table 20.17). The scaphoid exhibited the greatest 



TABLE 20.17 Individual Carpal Rotation Relative to the Radius (Degrees) (Sagittal Plane Motion of the Wrist) 















Axis 


i of Rotation 
















X 






Y 










Z 




Wrist Motion" 


(+) Pronation; (-} Supination 


(+) Flexion; (- 
N-E60 N-E30 


-) Extension 
N-F60 N-E60 


(+) Ulnar Deviation; (-) Radial 
N-E60 N-E30 N-F30 


Deviation 


Carpal Bone 


N-E60 


N-E30 


N-F30 


N-F60 


N-F60 


Trapezium (N-13) 


-0.9 


-1.3 


0.9 


-1.4 


-59.4 


-29.3 


28.7 


54.2 


1.2 


0.3 


-0.4 


2.5 


S.D. 


2.8 


2.2 


2.6 


2.7 


2.3 


1 


1.8 


3 


4 


2.7 


1.3 


2.8 


Capitate (N=22) 


0.9 


-1 


1.3 


-1.6 


60.3 


-30.2 


21.5 


63.5 








0.6 


3.2 


S.D. 


2.7 


1.8 


2.5 


3.5 


2.5 


1.1 


1.2 


2.8 


2 


1.4 


1.6 


3.6 


Hamate (N=9) 


0.4 


-1 


1.3 


-0.3 


-59.5 


-29 


28.8 


62.6 


2.1 


0.7 


0.1 


1.8 


S.D. 


3.4 


1.7 


2.5 


2.4 


1.4 


0.8 


10.2 


3.6 


4.4 


1.8 


1.2 


4.1 


Scaphoid (N=22) 


-2.5 


-0.7 


1.6 


2 


-52.3 


-26 


20.6 


39.7 


4.5 


0.8 


2.1 


7.8 


S.D. 


3.4 


2.6 


2.2 


3.1 


3 


3.2 


2.8 


4.3 


3.7 


2.1 


2.2 


4.5 


Lunate (N=22) 


1.2 


0.5 


0.3 


-2.2 


-29.7 


-15.4 


11.5 


23 


4.3 


0.9 


3.3 


11.1 


S.D. 


2.8 


1.8 


1.7 


2.8 


6.6 


3.9 


3.9 


5.9 


2.6 


1.5 


1.9 


3.4 


Triquetrum (N— 22) 


-3.5 


-2.5 


2.5 


-0.7 


-39.3 


-20.1 


15.5 


30.6 





-0.3 


2.4 


9.8 


S.D. 


3.5 


2 


2.2 


3.7 


4.8 


2.7 


3.8 


5.1 


2.8 


1.4 


2.6 


4.3 



a N-E60: neutral to 60° of extension; N-E30: neutral to 30° of extension; N-F30: neutral to 30° of flexion; N-F60: neutral to 60° of 
flexion. 

S.D. = standard deviation. 

Source: Kobayashi M, Berger RA, Nagy L, et al. 1997. Normal kinematics of carpal bones: A three-dimensional analysis of carpal 
bone motion relative to the radius. J Biomech 30:8, 787. 



magnitude of rotation and the lunate displayed the least rotation. The proximal carpal bones exhibited 
some ulnar deviation in 60° of wrist flexion. During coronal plane motion (Table 20.18), the magnitude 
of radial-ulnar deviation of the distal carpal bones was mutually similar and generally of a greater 
magnitude than that of the proximal carpal bones. The proximal carpal bones experienced some flexion 
during radial deviation of the wrist and extension during ulnar deviation of the wrist. 



20.7 Hand 



The hand is an extremely mobile organ that is capable of conforming to a large variety of object shapes 
and coordinating an infinite variety of movements in relation to each of its components. The mobility 
of this structure is possible through the unique arrangement of the bones in relation to one another, the 
articular contours, and the actions of an intricate system of muscles. Theoretical and empirical evidence 
suggest that limb joint surface morphology is mechanically related to joint mobility, stability, and strength 
[Hamrick, 1996]. 

Geometry of the Articulating Surfaces 

Three-dimensional geometric models of the articular surfaces of the hand have been constructed. The 
sagittal contours of the metacarpal head and proximal phalanx grossly resemble the arc of a circle [Tamai 
et al., 1988]. The radius of curvature of a circle fitted to the entire proximal phalanx surface ranges from 
11-13 mm, almost twice as much as that of the metacarpal head, which ranges from 6-7 mm 
(Table 20.19). The local centers of curvature along the sagittal contour of the metacarpal heads are not 
fixed. The locus of the center of curvature for the subchondral bony contour approximates the locus of 



S 2000 by CRC Press LLC 



TABLE 20.18 Individual Carpal Rotation to the Radius (Degrees) (Coronal Plane Motion of the Wrist) 













Axis of Rotation 












X 






Y 






Z 




Wrist Motion" 


(+) Pronation; (-} Supination 


(+) Flexion; (-) Extension 
N-RD15 N-UD15 N-UD30 


(+) Ulnar Deviation; (-) Ra 
N-RD15 N-UD15 


dial Deviation 


Carpal Bone 


N-RD15 


N-UD15 


N-UD30 


N-UD30 


Trapezium (N=13) 


-4.8 


9.1 


16.3 





4.9 


9.9 


-14.3 


16.4 


32.5 


S.D. 


2.4 


3.6 


3.6 


1.5 


1.3 


2.1 


2.3 


2.8 


2.6 


Capitate (N=22) 


-3.9 


6.8 


11.8 


1.3 


2.7 


6.5 


-14.6 


15.9 


30.7 


S.D. 


2.6 


2.6 


2.5 


1.5 


1.1 


1.7 


2.1 


1.4 


1.7 


Hamate (N=9) 


-4.8 


6.3 


10.6 


1.1 


3.5 


6.6 


-15.5 


15.4 


30.2 


S.D. 


1.8 


2.4 


3.1 


3 


3.2 


4.1 


2.4 


2.6 


3.6 


Scaphoid (N=22) 


0.8 


2.2 


6.6 


8.5 


-12.5 


-17.1 


-4.2 


4.3 


13.6 


S.D. 


1.8 


2.4 


3.1 


3 


3.2 


4.1 


2.4 


2.6 


3.6 


Lunate (N=22) 


-1.2 


1.4 


3.9 


7 


-13.9 


-22.5 


-1.7 


5.7 


15 


S.D. 


1.6 





3.3 


3.1 


4.3 


6.9 


1.7 


2.8 


4.3 


Triquetrum (N=22) 


-1.1 


-1 


0.8 


4.1 


-10.5 


-17.3 


-5.1 


7.7 


18.4 


S.D. 


1.4 


2.6 


4 


3 


3.8 


6 


2.4 


2.2 


4 



a N-RD15: neutral to 15° of radial deviation; N-UD30: neutral to 30° of ulnar deviation; N-UD15: neutral to 15° of ulnar deviation. 
S.D. = standard deviation. 

Source: Kobayashi M, Berger RA, Nagy L, et al. 1997. Normal kinematics of carpal bones: A three-dimensional analysis of carpal 
bone motion relative to the radius. J Biomech 30:8, 787. 

the center for the acute curve of an ellipse (Fig. 20.29). However, the locus of center of curvature for the 



Metacarpal Head 




Local center of cartilage contour 



FIGURE 20.29 The loci of the local centers of curvature for subchondral bony contour of the metacarpal head 
approximates the loci of the center for the acute curve of an ellipse. The loci of the local center of curvature for articular 
cartilage contour of the metacarpal head approximates the loci of the bony center of the obtuse curve of an ellipse. 
Source: Tamai K, Ryu J, An KN, Linscheid RL, Cooney WP, Chao EYS. 1988. In Three-dimensional geometric analysis 
of the metacarpophalangeal joint. J Hand Surg 13A(4):521. Reprinted with permission of Churchill Livingstone. 



articular cartilage contour approximates the locus of the obtuse curve of an ellipse. 

The surface geometry of the thumb carpometacarpal (CMC) joint has also been quantified [Athesian 
et al., 1992]. The surface area of the CMC joint is significantly greater for males than for females 
(Table 20.20). The minimum, maximum, and mean square curvature of these joints is reported in 
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TABLE 20.19 Radius of Curvature of the Middle Sections 
of the Metacarpal Head and Proximal Phalanx Base 





Radius (mm) 




Bony Contour 


Cartilage Contour 


MCH Index 


6.42 ± 1.23 


6.91 ± 1.03 


Long 


6.44 ± 1.08 


6.66 ± 1.18 


PPB Index 


13.01 ± 4.09 


12.07 ± 3.29 


Long 


11.46 ±2.30 


11.02 ± 2.48 



Source: Tamai K, Ryu J, An KN, Linscheid RL, Cooney WP, 
Chao EYS. 1988. Three-dimensional geometric analysis of the 
metacarpophalangeal joint. J Hand Surg 13A(4):521. 

Table 20.20. The curvature of the surface is denoted by k and the radius of curvature is p = 1/k. The 
curvature is negative when the surface is concave and positive when the surface is convex. 

Joint Contact 

The size and location of joint contact areas of the metacarpophalangeal (MCP) joint changes as a function 
of the joint flexion angle (Fig. 20.30) The radioulnar width of the contact area becomes narrow in the 
neutral position and expands in both the hyperextended and fully flexed positions [An & Cooney, 1991]. 
In the neutral position, the contact area occurs in the center of the phalangeal base, this area being slightly 
larger on the ulnar than on the radial side. 

The contact areas of the thumb carpometacarpal joint under the functional position of lateral key 
pinch and in the extremes of range of motion were studied using a stereophotogrammetric technique 
[Ateshian et al., 1995]. The lateral pinch position produced contact predominately on the central, volar, 
and volar-ulnar regions of the trapezium and the metacarpals (Fig. 20.31). Pelligrini et al. [1993] noted 
that the palmar compartment of the trapeziometacarpal joint was the primary contact area during flexion 
adduction of the thumb in lateral pinch. Detachment of the palmar beak ligament resulted in dorsal 
translation of the contact area producing a pattern similar to that of cartilage degeneration seen in the 
osteoarthritic joint. 

Axes of Rotation 

Rolling and sliding actions of articulating surfaces exist during finger joint motion. The geometric shapes 
of the articular surfaces of the metacarpal head and proximal phalanx, as well as the insertion location 
of the collateral ligaments, significantly govern the articulating kinematics, and the center of rotation is 
not fixed but rather moves as a function of the angle of flexion [Pagowski & Piekarski, 1977]. The instant 
centers of rotation are within 3 mm of the center of the metacarpal head [Walker & Erhman, 1975]. 
Recently the axis of rotation of the MCP joint has been evaluated in vivo by Fioretti [1994], The 
instantaneous helical axis of the MCP joint tends to be more palmar and tends to be displaced distally 
as flexion increases (Fig. 20.32). 

The axes of rotation of the CMC joint have been described as being fixed [Hollister et al., 1992], but 
others believe that a polycentric center of rotation exists [Imaeda et al., 1994]. Hollister et al. [1992] 
found that axes of the CMC joint are fixed and are not perpendicular to each other, or to the bones, and 
do not intersect. The flexion/extension axis is located in the trapezium, and the abduction/adduction 
axis is on the first metacarpal. In contrast, Imaeda et al. [1994] found that there was no single center of 
rotation, but rather the instantaneous motion occurred reciprocally between centers of rotations within 
the trapezium and the metacarpal base of the normal thumb. In flexion/extension, the axis of rotation 
was located within the trapezium, but for abduction/adduction the center of rotation was located distally 
to the trapezium and within the base of the first metacarpal. The average instantaneous center of 
circumduction was at approximately the center of the trapezial joint surface (Table 20.21). 
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FIGURE 20.30 (A) Contact area of the MCP joint in five joint positions. (B) End on view of the contact area on 
each of the proximal phalanx bases. The radioulnar width of the contact area becomes narrow in the neutral position 
and expands in both the hyperextended and fully flexed positions. Source: An KN, Cooney WP. 1991. Biomechanics, 
Section II. The hand and wrist. In BF Morrey (ed), Joint Replacement Arthroplasty, pp 137-146, New York, Churchill 
Livingstone. By permission of Mayo Foundation. 
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FIGURE 20.31 Summary of the contact areas for all specimens, in lateral pinch with a 25 N load. All results from 
the right hand are transposed onto the schema of a carpometacarpal joint from the left thumb. Source: Ateshian, 
GA, Ark JW, Rosenwasser MP, et al. Contact areas on the thumb carpometacarpal joint. J Orthop Res 13:450, 1995. 
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FIGURE 20.32 Intersections of the instantaneous helical angles with the metacarpal sagittal plane. They are relative 
to one subject tested twice in different days. The origin of the graph is coincident with the calibrated center of the 
metacarpal head. The arrow indicates the direction of flexion. Source: Fioretti S. 1994. Three-dimensional in-vivo 
kinematic analysis of finger movement. In F Schuind et al. (eds). Advances in the Biomechanics of the Hand and 
Wrist, pp 363-375, New York, Plenum Press. With permission. 



TABLE 20.20 Curvature of Carpometacarpal Joint Articular Surfaces 









Area 


K min 


K max 


K rms 






n 


(cm 2 ) 


(m-') 


(nr 1 ) 


(m->) 


Trapezium 














Female 




8 


1.05 ± 0.21 


-61 ± 22 


190 ± 36 


165 ± 32 


Male 




5 


1.63 ±0.18 


-87 ± 17 


114 ± 19 


118 ± 6 


Total 




13 


1.27 ± 0.35 


-71 ± 24 


161 ± 48 


147 ± 34 


Female versus 


male 




p<0.01 


p < 0.05 


p<0.01 


p<0.01 


Metacarpal 














Female 




8 


1.22 ± 0.36 


-49 ± 10 


175 ± 25 


154 ± 20 


Male 




5 


1.74 ± 0.21 


-37 ± 11 


131 ± 17 


116 ± 8 


Total 




13 


1.42 ± 0.40 


-44 ± 12 


158 ± 31 


140 ± 25 


Female versus 


male 




p<0.01 


p < 0.05 


p<0.01 


p<0.01 



Note: Radius of curvature: p = 1/k 

Source: Athesian JA, Rosenwasser MP, Mow VC. 1992. Curvature characteristics 
and congruence of the thumb carpometacarpal joint: differences between female 
and male joints. J Biomech 25(6):591. 



The axes of rotation of the thumb interphalangeal and metacarpophalangeal joint were located using 
a mechanical device [Hollister et al., 1995]. The physiologic motion of the thumb joints occur about 
these axes (Fig. 20.33 and Table 20.22). The interphalangeal joint axis is parallel to the flexion crease of 
the joint and is not perpendicular to the phalanx. The metacarpophalangeal joint has two fixed axes: a 
fixed flexion-extension axis just distal and volar to the epicondyles, and an abduction-adduction axis 
related to the proximal phalanx passing between the sesamoids. Neither axis is perpendicular to the 
phalanges. 
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FIGURE 20.33 (A) The angles and length and breadth measurements defining the axis of rotation of the interpha- 
langeal joint of the right thumb. (t/T = ratio of anatomic plane diameter; 1/L = ratio of length). (B) The angles and 
length and breadth measurements of the metacarpophalangeal flexion-extension axis' position in the metacarpal. 
(C) The angles and length and breadth measurements that locate the metacarpophalangeal abduction-adduction 
axis. The measurements are made in the metacarpal when the metacapophalangeal joint is at neutral flexion extension. 
The measurements are made relative to the metacarpal because the axis passes through this bone, not the proximal 
phalanx with which it moves. This method of recording the abduction-adduction measurements allows the mea- 
surements of the axes to each other at a neutral position to be made. The metacarpophalangeal abduction-adduction 
axis passes through the volar plate of the proximal phalanx. Source: Hollister A, Giurintano DJ, Buford WL, et al. 
The axes of rotation of the thumb interphalangeal and metacarpophalangeal joints. Clinical Orthopaedics and Related 
Research 320:188, 1995. 



20.8 Summary 

It is important to understand the biomechanics of joint-articulating surface motion. The specific char- 
acteristics of the joint will determine the musculoskeletal function of that joint. The unique geometry 
of the joint surfaces and the surrounding capsule ligamentous constraints will guide the unique charac- 
teristics of the articulating surface motion. The range of joint motion, the stability of the joint, and the 
ultimate functional strength of the joint will depend on these specific characteristics. A congruent joint 
usually has a relatively limited range of motion but a high degree of stability, whereas a less congruent 
joint will have a relatively larger range of motion but less degree of stability. The characteristics of the 
joint-articulating surface will determine the pattern of joint contact and the axes of rotation. These 
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TABLE 20.21 Location of Center of Rotation 
of Trapeziometacarpal Joint 



Mean ± SD (mm) 


Circumduction 










X 








0.1 ± 1.3 


Y 








-0.6 ± 1.3 


Z 








-0.5 ± 1.4 


Flexion/Extension 


(in x- 


■y plane) 




X 










Centroid 








-4.2 ± 1.0 


Radius 








2.0 ± 0.5 


Y 










Centroid 








-0.4 ± 0.9 


Radius 








1.6 ± 0.5 


Abduction/Adduction (i 


in x-z 


plane) 




X 










Centroid 








6.7 ± 1.7 


Radius 








4.6 ± 3.1 


Z 










Centroid 








-0.2 ± 0.7 


Radius 








1.7 ± 0.5 



Note: The coordinate system is denned with the x axis cor- 
responding to internal/external rotation, the y axis correspond- 
ing to abduction/adduction, and the z axis corresponding to 
flexion/extension. The x axis is positive in the distal direction, 
the y axis is positive in the dorsal direction for the left hand 
and in the palmar direction for the right hand, and the z axis 
is positive in the radial direction. The origin of the coordinate 
system was at the intersection of a line connecting the radial 
and ulnar prominences and a line connecting the volar and 
dorsal tubercles. 

Source: Imaeda T, Niebur G, Cooney WP, Linscheid RL, An 
KN. 1994. Kinematics of the normal trapeziometacarpal joint. 
J Orthop Res 12:197. 
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TABLE 20.22 Measurement of Axis Location and Values 
for Axis Position in the Bone 3 

Interphalangeal joint flexion-extension axis (Fig. 20.33A) 

t/T 44 ± 17% 

1/L 90 ± 5% 

© 5 ±2° 

P 83 ± 4° 

Metacarpophalangeal joint flexion-extension axis (Fig. 20.33B) 

t/T 57 ± 17% 

1/L 87 ± 5% 

a 101 ± 6° 

P 5 ±2° 

Metacarpophalangeal joint abduction-adduction axis (Fig. 20.33C) 

t/T 45 ± 8% 

1/L 83 ± 13% 

a 80 ± 9° 

P 74 ± 8° 

M 

a The angle of the abduction-adduction axis with respect to the 
flexion-extension axis is 84.8° ± 12.2°. The location and angulation 
of the K-wires of the axes with respect to the bones were measured 
(©, a, P) directly with a goniometer. The positions of the pins in 
the bones were measured (T, L) with a Vernier caliper. 

Source: Hollister A, Giurintano DJ, Buford WL, et al. The axes of 
rotation of the thumb interphalangeal and metacarpophalangeal 
joints. Clinical Orthopaedics and Related Research, 320:188, 1995. 



characteristics will regulate the stresses on the joint surface which will influence the degree of degeneration 
of articular cartilage in an anatomic joint and the amount of wear of an artificial joint. 
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"The Fabric of the Joints in the Human Body is a subject so much the more entertaining, as it must strike 
every one that considers it attentively with an Idea of fine Mechanical Composition. Wherever the Motion 
of one Bone upon another is requisite, there we find an excellent Apparatus for rendering that Motion safe 
and free: We see, for Instance, the Extremity of one Bone molded into an orbicular Cavity, to receive the 
Head of another, in order to afford it an extensive Play. Both are covered with a smooth elastic Crust, to 
prevent mutual Abrasion; connected with strong Ligaments, to prevent Dislocation; and inclosed in a Bag 
that contains a proper Fluid Deposited there, for lubricating the Two contiguous Surfaces. So much in 
general." 

The above is the opening paragraph of the classic paragraph of the classic paper by the surgeon, Sir 
William Hunter, "Of the Structure and Diseases of Articulating Cartilages" which he read to a meeting 
of the Royal Society, June 2, 1743 [1]. Since then, a great deal of research has been carried out on the 
subject of synovial joint lubrication. However, the mechanisms involved are still unknown. 

21.1 Introduction 

The purpose of this article is twofold: (1) to introduce the reader to the subject of tribology — the study 
of friction, wear, and lubrication; and (2) to extend this to the topic of biotribology, which includes the 
lubrication of natural synovial joints. It is not meant to be an exhaustive review of joint lubrication 
theories; space does not permit this. Instead, major concepts or principles will be discussed not only in 
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the light of what is known about synovial joint lubrication but perhaps more importantly what is not 
known. Several references are given for those who wish to learn more about the topic. It is clear that 
synovial joints are by far the most complex and sophisticated tribological systems that exist. We shall see 
that although numerous theories have been put forth to attempt to explain joint lubrication, the mech- 
anisms involved are still far from being understood. And when one begins to examine possible connections 
between tribology and degenerative joint disease or osteoarthritis, the picture is even more complex and 
controversial. Finally, this article does not treat the (1) tribological behavior of artificial joints or partial 
joint replacements, (2) the possible use of elastic or poroplastic materials as artificial cartilage, and (3) new 
developments in cartilage repair using transplanted chondrocytes. These are separate topics, which would 
require detailed discussion and additional space. 

21.2 Tribology 

The word tribology, derived from the Greek "to rub," covers all frictional processes between solid bodies 
moving relative to one another that are in contact [2]. Thus tribology may be defined as the study of 
friction, wear, and lubrication. 

Tribological processes are involved whenever one solid slides or rolls against another, as in bearings, 
cams, gears, piston rings and cylinders, machining and metalworking, grinding, rock drilling, sliding 
electrical contacts, frictional welding, brakes, the striking of a match, music from a cello, articulation of 
human synovial joints (e.g., hip joints), machinery, and in numerous less obvious processes (e.g., walking, 
holding, stopping, writing, and the use of fasteners such as nails, screws, and bolts). 

Tribology is a multidisciplinary subject involving at least the areas of materials science, solid and surface 
mechanics, surface science and chemistry, rheology, engineering, mathematics, and even biology and bio- 
chemistry. Although tribology is still an emerging science, interest in the phenomena of friction, wear, and 
lubrication is an ancient one. Unlike thermodynamics, there are no generally accepted laws in tribology. 
But there are some important basic principles needed to understand any study of lubrication and wear and 
even more so in a study of biotribology or biological lubrication phenomena. These basic principles follow. 

Friction 

Much of the early work in tribology was in the area of friction — possibly because frictional effects are 
more readily demonstrated and measured. Generally, early theories of friction dealt with dry or unlu- 
bricated systems. The problem was often treated strictly from a mechanical viewpoint, with little or no 
regard for the environment, surface films, or chemistry. 

In the first place, friction may be defined as the tangential resistance that is offered to the sliding of one 
solid body over another. Friction is the result of many factors and cannot be treated as something as 
singular as density or even viscosity. Postulated sources of friction have included (1) the lifting of one 
asperity over another (increase in potential energy), (2) the interlocking of asperities followed by shear, 
(3) interlocking followed by plastic deformation or plowing, (4) adhesion followed by shear, (5) elastic 
hysteresis and waves of deformation, (6) adhesion or interlocking followed by tensile failure, 
(7) intermolecular attraction, (8) electrostatic effects, and (9) viscous drag. The coefficient of friction, 
indicated in the literature by \x or f is defined as the ratio F/W where F = friction force and W = the 
normal load. It is emphasized that friction is a force and not a property of a solid material or lubricant. 

Wear and Surface Damage 

One definition of wear in a tribological sense is that it is the progressive loss of substance from the operating 
surface of a body as a result of relative motion at the surface. In comparison with friction, very little theoretical 
work has been done on the extremely important area of wear and surface damage. This is not too surprising 
in view of the complexity of wear and how little is known of the mechanisms by which it can occur. 
Variations in wear can be, and often are, enormous compared with variations in friction. For example, 
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practically all the coefficients of sliding friction for diverse dry or lubricated systems fall within a relatively 
narrow range of 0.1 to 1. In some cases (e.g., certain regimes of hydrodynamic or "boundary" lubrication), 
the coefficient of friction may be <0.1 and as low as 0.001. In other cases (e.g., very clean unlubricated 
metals in vacuum), friction coefficients may exceed one. Reduction of friction by a factor of two through 
changes in design, materials, or lubricant would be a reasonable, although not always attainable, goal. On 
the other hand, it is not uncommon for wear rates to vary by a factor of 100, 1,000, or even more. 

For systems consisting of common materials (e.g., metals, polymers, ceramics), there are at least four 
main mechanisms by which wear and surface damage can occur between solids in relative motion: 
(1) abrasive wear, (2) adhesive wear, (3) fatigue wear, and (4) chemical or corrosive wear. A fifth, fretting 
wear and fretting corrosion, combines elements of more than one mechanism. For complex biological 
materials such as articular cartilage, most likely other mechanisms are involved. 

Again, wear is the removal of material. The idea that friction causes wear and therefore, low friction 
means low wear, is a common mistake. Brief descriptions of five types of wear; abrasive, adhesive, fatigue, 
chemical or corrosive, and fretting — may be found in [2] as well as in other references in this article. 
Next, it may be useful to consider some of the major concepts of lubrication. 

21.3 Lubrication 

Lubrication is a process of reducing friction and/or wear (or other forms of surface damage) between 
relatively moving surfaces by the application of a solid, liquid, or gaseous substance (i.e., a lubricant). 
Since friction and wear do not necessarily correlate with each other, the use of the word and in place of 
and/or in the above definition is a common mistake to be avoided. The primary function of a lubricant 
is to reduce friction or wear or both between moving surfaces in contact with each other. 

Examples of lubricants are wide and varied. They include automotive engine oils, wheel bearing greases, 
transmission fluids, electrical contact lubricants, rolling oils, cutting fluids, preservative oils, gear oils, 
jet fuels, instrument oils, turbine oils, textile lubricants, machine oils, jet engine lubricants, air, water, 
molten glass, liquid metals, oxide films, talcum powder, graphite, molybdenum disulfide, waxes, soaps, 
polymers, and the synovial fluid in human joints. 

A few general principles of lubrication may be mentioned here. 

1. The lubricant must be present at the place where it can function. 

2. Almost any substance under carefully selected or special conditions can be shown to reduce friction 
or wear in a particular test, but that does not mean these substances are lubricants. 

3. Friction and wear do not necessarily go together. This is an extremely important principle which 
applies to nonlubricated (dry) as well as lubricated systems. It is particularly true under conditions 
of "boundary lubrication," to be discussed later. An additive may reduce friction and increase wear, 
reduce wear and increase friction, reduce both or increase both. Although the reasons are not fully 
understood, this is an experimental observation. Thus, friction and wear should be thought of as 
separate phenomena — an important point when we discuss theories of synovial joint lubrication. 

4. The effective or active lubricating film in a particular system may or may not consist of the original 
or bulk lubricant phase. 

In a broad sense, it may be considered that the main function of a lubricant is to keep the surfaces 
apart so that interaction (e.g., adhesion, plowing, and shear) between the solids cannot occur; thus 
friction and wear can be reduced or controlled. 

The following regimes or types of lubrication may be considered in the order of increasing severity 
or decreasing lubricant film thickness (Fig. 21.1): 

1. Hydrodynamic lubrication 

2. Elastohydrodynamic lubrication 

3. Transition from hydrodynamic and elastohydrodynamic lubrication to boundary lubrication 

4. Boundary lubrication. 
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FIGURE 21.1 Regimes of lubrication 



A fifth regime, sometimes referred to as dry or unlubricated, may also be considered as an extreme or 
limit. In addition, there is another form of lubrication that does not require relative movement of the 
bodies either parallel or perpendicular to the surface, i.e., as in externally pressurized hydrostatic or 
aerostatic bearings. 

Hydrodynamic Lubrication Theories 

In hydrodynamic lubrication, the load is supported by the pressure developed due to relative motion 
and the geometry of the system. In the regime of hydrodynamic or fluid film lubrication, there is no 
contact between the solids. The film thickness is governed by the bulk physical properties of the lubricants, 
the most important being viscosity; friction arises purely from shearing of viscous lubricant. 

Contributions to our knowledge of hydrodynamic lubrication, with special focus on journal bearings, 
have been made by numerous investigators including Reynolds. The classic Reynolds treatment consid- 
ered the equilibrium of a fluid element and the pressure and shear forces on this element. In this treatment, 
eight assumptions were made (e.g., surface curvature is large compared to lubricant film thickness, fluid 
is Newtonian, flow is laminar, viscosity is constant through film thickness). Velocity distributions due to 
relative motion and pressure buildup were developed and added together. The solution of the basic 
Reynolds equation for a particular bearing configuration results in a pressure distribution throughout 
the film as a function of viscosity, film shape, and velocity. 

The total load Wand frictional (viscous) drag Fcan be calculated from this information. For rotating 
disks with parallel axes, the "simple" Reynolds equation yields: 



h 

^ = 4.9 

R 






(21.1) 



where h is the minimum lubricant film thickness, r\ is the absolute viscosity, U is the average velocity 
(U l + U 2 )/2, Wis the applied normal load per unit width of disk, and R is the reduced radius of curvature 
(1AR= l/i*, + 1AR 2 ). 

The dimensionless term (r\U/W) is sometimes referred to as the hydrodynamic factor. It can be seen 
that doubling either the viscosity or velocity doubles the film thickness, and that doubling the applied 
load halves the film thickness. This regime of lubrication is sometimes referred to as the rigid isoviscous 
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or classical Martin condition, since the solid bodies are assumed to be perfectly rigid (non-deformable), 
and the fluid is assumed to have a constant viscosity. 

At high loads with systems such as gears, ball bearings, and other high-contact-stress geometries, two 
additional factors have been considered in further developments of the hydrodynamic theory of lubri- 
cation. One of these is that the surfaces deform elastically; this leads to a localized change in geometry 
more favorable to lubrication. The second is that the lubricant becomes more viscous under the high 
pressure existing in the contact zone, according to relationships such as: 



r|/r| = expa(p-p o ) (21.2) 



where 77 is the viscosity at pressure p, r\ is the viscosity at atmospheric pressure p , and a is the pressure- 
viscosity coefficient (e.g., in Far 1 ). In this concept, the lubricant pressures existing in the contact zone 
approximate those of dry contact Hertzian stress. This is the regime of elastohydrodynamic lubrication, 
sometimes abbreviated as EHL or EHD. It may also be described as the elastic-viscous type or mode of 
lubrication, since elastic deformation exists and the fluid viscosity is considerably greater due to the 
pressure effect. 

The comparable Dowson-Higginson expression for minimum film thickness between cylinders or 
disks in contact with parallel axes is: 



h 

^ = 2.6 

R 



W 



aW 



W 
RE' 



(21.3) 



The term E' represents the reduced modulus of elasticity: 

1 M.M 



E 1 E 2 



(21.4) 



where E is the modulus, v is Poisson's ratio, and the subscripts 1 and 2 refer to the two solids in contact. 
All the other terms are the same as previously stated. In addition to the hydrodynamic factor (r\U/W), 
a pressure-viscosity factor (aW/R), and an elastic deformation factor ( WIRE 1 ) can be considered. Thus, 
properties of both the lubricant and the solids as materials are included. In examining the elastohydro- 
dynamic film thickness equations, it can be seen that the velocity U is an important factor (h oc U °' 7 ) 
but the load Wis rather unimportant (h oc W~ 013 ). 

Experimental confirmation of the elastohydrodynamic lubrication theory has been obtained in certain 
selected systems using electrical capacitance, x-ray transmission, and optical interference techniques to 
determine film thickness and shape under dynamic conditions. Research is continuing in this area, 
including studies on micro-EHL or asperity lubrication mechanisms, since surfaces are never perfectly 
smooth. These studies may lead to a better understanding of not only lubricant film formation in high- 
contact-stress systems but lubricant film failure as well. 

Two other possible types of hydrodynamic lubrication, rigid-viscous and elastic-isoviscous, complete 
the matrix of four, considering the two factors of elastic deformation and pressure-viscosity effects. In 
addition, squeeze film lubrication can occur when surfaces approach one another. For more information 
on hydrodynamic and elastohydrodynamic lubrication, see Cameron [3] and Dowson and Higginson [4]. 

Transition from Hydrodynamic to Boundary Lubrication 

Although prevention of contact is probably the most important function of a lubricant, there is still 
much to be learned about the transition from hydrodynamic and elastohydrodynamic lubrication to 
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boundary lubrication. This is the region in which lubrication goes from the desirable hydrodynamic 
condition of no contact to the less acceptable "boundary" condition, where increased contact usually 
leads to higher friction and wear. This regime is sometimes referred to as a condition of mixed lubrication. 
Several examples of experimental approaches to thin-film lubrication have been reported [3]. It is 
important in examining these techniques to make the distinction between methods that are used to 
determine lubricant film thickness under hydrodynamic or elastohydrodynamic conditions (e.g., optical 
interference, electrical capacitance, or x-ray transmission), and methods that are used to determine the 
occurrence or frequency of contact. As we will see later, most experimental studies of synovial joint 
lubrication have focused on friction measurements, using the information to determine the lubrication 
regime involved; this approach can be misleading. 

Boundary Lubrication 

Although there is no generally accepted definition of boundary lubrication, it is often described as a 
condition of lubrication in which the friction and wear between two surfaces in relative motion are 
determined by the surface properties of the solids and the chemical nature of the lubricant rather than 
its viscosity. An example of the difficulty in defining boundary lubrication can be seen if the term bulk 
viscosity is used in place of viscosity in the preceding sentence — another frequent form. This opens the 
door to the inclusion of elastohydrodynamic effects which depend in part on the influence of pressure 
on viscosity. Increased friction under these circumstances could be attributed to increased viscous drag 
rather than solid-solid contact. According to another common definition, boundary lubrication occurs 
or exists when the surfaces of the bearing solids are separated by films of molecular thickness. That may 
be true, but it ignores the possibility that "boundary" layer surface films may indeed be very thick (i.e., 
10, 20, or 100 molecular layers). The difficulty is that boundary lubrication is complex. 

Although a considerable amount of research has been done on this topic, an understanding of the 
basic mechanisms and processes involved is by no means complete. Therefore, definitions of boundary 
lubrication tend to be nonoperational. This is an extremely important regime of lubrication because it 
involves more extensive solid-solid contact and interaction as well as generally greater friction, wear, and 
surface damage. In many practical systems, the occurrence of the boundary lubrication regime is unavoid- 
able or at least quite common. The condition can be brought about by high loads, low relative sliding 
speeds (including zero for stop-and-go, motion reversal, or reciprocating elements) and low lubricant 
viscosity — factors that are important in the transition from hydrodynamic to boundary lubrication. 

The most important factor in boundary lubrication is the chemistry of the tribological system — the 
contacting solids and total environment including lubricants. More particularly, the surface chemistry 
and interactions occurring with and on the solid surfaces are important. This includes factors such as 
physisorption, chemisorption, intermolecular forces, surface chemical reactions, and the nature, struc- 
ture, and properties of thin films on solid surfaces. It also includes many other effects brought on by the 
process of moving one solid over another, such as: (1) changes in topography and the area of contact, 
(2) high surface temperatures, (3) the generation of fresh reactive metal surfaces by the removal of oxide 
and other layers, (4) catalysis, (5) the generation of electrical charges, and (6) the emission of charged 
particles such as electrons. 

In examining the action of boundary lubricant compounds in reducing friction or wear or both 
between solids in sliding contact, it may be helpful to consider at least the following five modes of film 
formation on or protection of surfaces: (1) physisorption, (2) chemisorption, (3) chemical reactions with 
the solid surface, (4) chemical reactions on the solid surface, and (5) mere interposition of a solid or 
other material. These modes of surface protection are discussed in more detail in [2]. 

The beneficial and harmful effects of minor changes in chemistry of the environment (e.g., the 
lubricant) are often enormous in comparison with hydrodynamic and elastohydrodynamic effects. Thus, 
the surface and chemical properties of the solid materials used in tribological applications become 
especially important. One might expect that this would also be the case in biological (e.g., human joint) 
lubrication where biochemistry is very likely an important factor. 
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FIGURE 21.2 In any tribological system, friction, wear, and surface damage depend on four interrelated factors 

General Comments on Tribological Processes 

It is important to recognize that friction and wear depend upon four major factors, i.e., materials, design, 
operating conditions, and total environment (Fig. 21.2). This four-block figure may be useful as a guide 
in thinking about synovial joint lubrication either from a theoretical or experimental viewpoint — the 
topic discussed in the next section. 

Readers are cautioned against the use of various terms in tribology which are either vaguely defined 
or not defined at all. These would include such terms as "lubricating ability," "lubricity," and even 
"boundary lubrication." For example, do "boundary lubricating properties" refer to effects on friction 
or effects on wear and damage? It makes a difference. It is emphasized once again that friction and wear 
are different phenomena. Low friction does not necessarily mean low wear. We will see several examples 
of this common error in the discussion of joint lubrication research. 

21.4 Synovial Joints 

Examples of natural synovial or movable joints include the human hip, knee, elbow, ankle, finger, and 
shoulder. A simplified representation of a synovial joint is shown in Fig. 21.3. The bones are covered by 
a thin layer of articular cartilage bathed in synovial fluid confined by synovial membrane. Synovial joints 
are truly remarkable systems — providing the basis of movement by allowing bones to articulate on one 
another with minimal friction and wear. Unfortunately, various joint diseases occur even among the 
young — causing pain, loss of freedom of movement, or instability. 

Synovial joints are complex, sophisticated systems not yet fully understood. The loads are surprisingly 
high and the relative motion is complex. Articular cartilage has the deceptive appearance of simplicity 
and uniformity. But it is an extremely complex material with unusual properties. Basically, it consists of 
water (approximately 75%) enmeshed in a network of collagen fibers and proteoglycans with high 
molecular weight. In a way, cartilage could be considered as one of Nature's composite materials. Articular 
cartilage also has no blood supply, no nerves, and very few cells (chondrocytes). 

The other major component of an articular joint is synovial fluid, named by Paracelsus after "synovia" 
(egg-white). It is essentially a dialysate of blood plasma with added hyaluronic acid. Synovial fluid contains 
complex proteins, polysaccharides, and other compounds. Its chief constituent is water (approximately 
85%). Synovial fluid functions as a joint lubricant, nutrient for cartilage, and carrier for waste products. 
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FIGURE 21.3 Representation of a synovial joint 



For more information on the biochemistry, structure, and properties of articular cartilage, Freeman 
[5], Sokoloff [6], Stockwell [7] and articles referenced in these works are suggested. 

21.5 Theories on the Lubrication of Natural 
and Normal Synovial Joints 

As stated, the word tribology means the study of friction, wear, and lubrication. Therefore, biotribology 
may be thought of as the study of biological lubrication processes, e.g., as in synovial joints. A surprisingly 
large number of concepts and theories of synovial joint lubrication have been proposed [8-10] (as shown 
in Table 21.1). And even if similar ideas are grouped together, there are still well over a dozen funda- 
mentally different theories. These have included a wide range of lubrication concepts, e.g., hydrodynamic, 
hydrostatic, elasto-hydrodynamic, squeeze-film, "boundary", mixed-regime, "weeping", osmotic, synovial 
mucin gel, "boosted", lipid, electrostatic, porous layers, and special forms of boundary lubrication (e.g., 
"lubricating glycoproteins", structuring of boundary water "surface-active" phospholipids). This chapter 
will not review these numerous theories, but excellent reviews on the lubrication of synovial joints have 
been written by McCutchen [11], Swanson [12], and Higginsworth and Unsworth [13]. The book edited 
by Dumbleton is also recommended [14]. In addition, theses by Droogendijk [15] and Burkhardt [16] 
contain extensive and detailed reviews of theories of joint lubrication. 

McCutchen was the first to propose an entirely new concept of lubrication, "weeping lubrication", 
applied to synovial joint action [17,18]. He considered unique and special properties of cartilage and 
how this could affect flow and lubrication. The work of Mow et al. continued along a more complex and 
sophisticated approach in which a biomechanical model is proposed for the study of the dynamic 
interaction between synovial fluid and articular cartilage [19,20]. These ideas are combined in the more 
recent work of Ateshian [21] which uses a framework of the biphasic theory of articular cartilage to 
model interstitial fluid pressurization. Several additional studies have also been made of effects of porosity 
and compliance, including the behavior of elastic layers, in producing hydrodynamic and squeeze-film 
lubrication. A good review in this area was given by Unsworth who discussed both human and artificial 
joints [22]. 

The following general observations are offered on the theories of synovial joint lubrication that have 
been proposed: 
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1. Most of the theories are strictly mechanical or rheological — involving such factors as deformation, 
pressure, and fluid flow. 

2. There is a preoccupation with friction, which of course is very low for articular cartilage systems. 

3. None of the theories consider wear — which is neither the same as friction nor related to it. 

4. The detailed structure, biochemistry, complexity, and living nature of the total articular cartilage- 
synovial fluid system are generally ignored. 

These are only general impressions. And although mechanical/rheological concepts seem dominant 
(with a focus on friction), wear and biochemistry are not completely ignored. For example, Simon [23] 
abraded articular cartilage from human patellae and canine femoral heads with a stainless steel rotary 
file, measuring the depth of penetration with time and the amount of wear debris generated. Cartilage 
wear was also studied experimentally by Bloebaum and Wilson [24], Radin and Paul [25], and Lipshitz, 
Etheredge, and Glimcher [26-28]. The latter researchers carried out several in vitro studies of wear of 
articular cartilage using bovine cartilage plugs or specimens in sliding contact against stainless steel plates. 
They developed a means of measuring cartilage wear by determining the hydroxyproline content of both 
the lubricant and solid wear debris. Using this system and technique, effects of variables such as time, 
applied load, and chemical modification of articular cartilage on wear and profile changes were deter- 
mined. This work is of particular importance in that they addressed the question of cartilage wear and 
damage rather than friction, recognizing that wear and friction are different phenomena. 

Special note is also made of two researchers, Swann and Sokoloff, who considered biochemistry as an 
important factor in synovial joint lubrication. Swann et al. very carefully isolated fractions of bovine 
synovial fluid using sequential sedimentation techniques and gel permeation chromatography. They 
found a high molecular weight glycoprotein to be the major constituent in the articular lubrication 
fraction from bovine synovial fluid and called this LGP-I (from lubricating glycoprotein). This was based 
on friction measurements using cartilage in sliding contact against a glass disc. An excellent summary 
of this work with additional references is presented in a chapter by Swann in The Joints and Synovial 
Fluid: I [6]. 

Sokoloff et al. examined the "boundary lubricating ability" of several synovial fluids using a latex-glass 
test system and cartilage specimens obtained at necropsy from knees [29]. Measurements were made of 
friction. The research was extended to other in vitro friction tests using cartilage obtained from the nasal 
septum of cows and widely differing artificial surfaces [30]. As a result of this work, a new model of 
boundary lubrication by synovial fluid was proposed — the structuring of boundary water. The postulate 
involves adsorption of one part of a glycoprotein on a surface followed by the formation of hydration 
shells around the polar portions of the adsorbed glycoprotein; the net result is a thin layer of viscous 
"structured" water at the surface. This work is of particular interest in that it involves not only a specific 
and more detailed mechanism of boundary lubrication in synovial joints but also takes into account the 
possible importance of water in this system. 

In more recent research by Jay, an interaction between hyaluronic acid and a "purified synovial 
lubricating factor" (PSLF) was observed, suggesting a possible synergistic action in the boundary lubri- 
cation of synovial joints [31]. The definition of "lubricating ability" was based on friction measurements 
made with a latex-covered stainless steel stud in oscillating contact against polished glass. 

The above summary of major synovial joint lubrication theories is taken from [10,31] as well as the 
thesis by Burkhardt [33]. 

Two more recent studies are of interest since cartilage wear was considered although not as a part of 
a theory of joint lubrication. Stachowiak et al. [34] investigated the friction and wear characteristics of 
adult rat femur cartilage against a stainless steel plate using an environmental scanning microscope (ESM) 
to examine damaged cartilage. One finding was evidence of a load limit to lubrication of cartilage, beyond 
which high friction and damage occurred. Another study, by Hayes et al. [35] on the influence of crystals 
on cartilage wear, is particularly interesting not only in the findings reported (e.g., certain crystals can 
increase cartilage wear), but also in the full description of the biochemical techniques used. 
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TABLE 21.1 Examples of Proposed Mechanisms and Studies of Synovial Joint Lubrication 



Mechanism 


Authors 


Date 


1. 


Hydrodynamic 


MacConnail 


1932 


2. 


Boundary 


Jones 


1934 


3. 


Hydrodynamic 


Jones 


1936 


4. 


Boundary 


Charnley 


1959 


5. 


Weeping 


McCutchen 


1959 


6. 


Floating 


Barnett and Cobbold 


1962 


7. 


Elastohydrodynamic 


Tanner 


1966 






Dowson 


1967 


8. 


Thixotropic/elastic fluid 


Dintenfass 


1963 


9. 


Osmotic (boundary) 


McCutchen 


1966 


10. 


Squeeze-film 


Fein 


1966 






Higginson et al. 


1974 


11. 


Synovial gel 


Maroudas 


1967 


12. 


Thin-film 


Faber et al. 


1967 


13. 


Combinations of hydrostatic, 
boundary, & EHL 


Linn 


1968 


14. 


Boosted 


Walker et al. 


1968 


15. 


Lipid 


Little et al. 


1969 


16. 


Weeping + boundary 


McCutchen and Wilkins 


1969 






McCutchen 


1969 


17. 


Boundary 


Caygill and West 


1969 


18. 


Fat (or mucin) 


Freeman et al. 


1970 


19. 


Electrostatic 


Roberts 


1971 


20. 


Boundary + fluid squeeze-film 


Radin and Paul 


1972 


21. 


Mixed 


Unsworth et al. 


1974 


22. 


Imbibe/exudate composite model 


Ling 


1974 


23. 


Complex biomechanical model 


Mow et al. 


1974 






Mansour and Mow 


1977 


24. 


Two porous layer model 


Dinnar 


1974 


25. 


Boundary 


Reimann et al. 


1975 


26. 


Squeeze-film + fluid film + boundary 


Unsworth, Dowson et al. 


1975 


27. 


Compliant bearing model 


Rybicki 


1977 


28. 


Lubricating glycoproteins 


Swann et al. 


1977 


29. 


Structuring of boundary water 


Sokoloff et al. 


1979 


30. 


Surface flow 


Kenyon 


1980 


31. 


Lubricin 


Swann et al. 


1985 


32. 


Micro-EHL 


Dowson and Jin 


1986 


33. 


Lubricating factor 


Jay 


1992 


34. 


Lipidic component 


LaBerge et al. 


1993 


35. 


Constitutive modeling of cartilage 


Lai et al. 


1993 


36. 


Asperity model 


Yao et al. 


1993 


37. 


Bingham fluid 


Tandon et al. 


1994 


38. 


Filtration/gel/squeeze film 


Hlavacek et al. 


1995 


39. 


Surface-active phospholipid 


Schwarz and Hills 


1998 


40. 


Interstitial fluid pressurization 


Ateshian et al. 


1998 



A special note should be made concerning the doctoral thesis by Lawrence Malcom in 1976 [36]. This 
is an excellent study of cartilage friction and deformation, in which a device resembling a rotary plate 
rheometer was used to investigate the effects of static and dynamic loading on the frictional behavior of 
bovine cartilage. The contact geometry consisted of a circular cylindrical annulus in contact with a 
concave hemispherical section. It was found that dynamically loaded specimens in bovine synovial fluid 
yielded the more efficient lubrication based on friction measurements. The Malcom study is thorough 
and excellent in its attention to detail (e.g., specimen preparation) in examining the influence of type of 
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loading and time effects on cartilage friction. It does not, however, consider cartilage wear and damage 
except in a very preliminary way. And it does not consider the influence of fluid biochemistry on cartilage 
friction, wear, and damage. In short, the Malcom work represents a superb piece of systematic research 
along the lines of mechanical, dynamic, rheological, and viscoelastic behavior — one important dimension 
of synovial joint lubrication. 

21.6 In Vitro Cartilage Wear Studies 

Over the past fifteen years, studies aimed at exploring possible connections between tribology and 
mechanisms of synovial joint lubrication and degeneration (e.g., osteoarthritis) have been conducted by 
the author and his graduate and undergraduate students in the Department of Mechanical Engineering 
at Virginia Polytechnic Institute and State University. The basic approach used involved in vitro tribo- 
logical experiments using bovine articular cartilage, with an emphasis on the effects of fluid composition 
and biochemistry on cartilage wear and damage. This research is an outgrowth of earlier work carried 
out during a sabbatical study in the Laboratory for the Study of Skeletal Disorders, The Children's Hospital 
Medical Center, Harvard Medical School in Boston. In that study, bovine cartilage test specimens were 
loaded against a polished steel plate and subjected to reciprocating sliding for several hours in the presence 
of a fluid (e.g., bovine synovial fluid or a buffered saline reference fluid containing biochemical constit- 
uents kindly provided by Dr. David Swann). Cartilage wear was determined by sampling the test fluid 
and determining the concentration of 4-hydroxyproline — a constituent of collagen. The results of that 
earlier study have been reported and summarized elsewhere [37-40]. Figure 21.4 shows the average 
hydroxyproline contents of wear debris obtained from these in vitro experiments. These numbers are 
related to the cartilage wear which occurred. However, since the total quantities of collected fluids varied 
somewhat, the values shown in the bar graph should not be taken as exact or precise measures of fluid 
effects on cartilage wear. 

The main conclusions of that study were as follows: 

1. Normal bovine synovial fluid is very effective in reducing cartilage wear under these in vitro 
conditions as compared to the buffered saline reference fluid. 

2. There is no significant difference in wear between the saline reference and distilled water. 

3. The addition of hyaluronic acid to the reference fluid significantly reduces wear; but its effect 
depends on the source. 

4. Under these tests conditions, Swann's LGP-I (Lubricating Glycoprotein-!), known to be extremely 
effective in reducing friction in cartilage-on-glass tests, does not reduce cartilage wear. 




Buffered Saline Reference Fluid 
Bovine Synovial Fluid 
Distilled Water 
Reference + Hyaluronic Acid 

Reference + LGP-I 
Reference + Protein Complex 



400 



FIGURE 21.4 Relative cartilage wear based on hydroxyproline content of debris {in vitro tests with cartilage on 
stainless steel). 
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FIGURE 21.5 Friction and wear are different phenomena 

5. However, a protein complex isolated by Swann is extremely effective in reducing wear — producing 
results similar to those obtained with synovial fluid. The detailed structure of this constituent is 
complex and has not yet been fully determined. 

6. Last, the lack of an added fluid in these experiments leads to extremely high wear and damage of 
the articular cartilage. 

In discussing the possible significance of these findings from a tribological point of view, it may be 
helpful first of all to emphasize once again that friction and wear are different phenomena. Furthermore, 
as suggested by Fig. 21.5, certain constituents of synovial fluid (e.g., Swann's Lubricating Glycoprotein) 
may act to reduce friction in synovial joints while other constituents (e.g., Swann's protein complex or 
hyaluronic acid) may act to reduce cartilage wear. Therefore, it is necessary to distinguish between 
biochemical anti-friction and anti-wear compounds present in synovial fluid. 

In more recent years, this study has been greatly enhanced by the participation of interested faculty 
and students from the Virginia-Maryland College of Veterinary Medicine and Department of Biochem- 
istry and Animal Science at Virginia Tech. One major hypothesis tested is a continuation of previous 
work showing that the detailed biochemistry of the fluid- cartilage system has a pronounced and possibly 
controlling influence on cartilage wear. A consequence of the above hypothesis is that a lack or deficiency 
of certain biochemical constituents in the synovial joint may be one factor contributing to the initiation 
and progression of cartilage damage, wear, and possibly osteoarthritis. A related but somewhat different 
hypothesis concerns synovial fluid constituents which may act to increase the wear and further damage 
of articular cartilage under tribological contact. 

To carry out continued research on biotribology, a new device for studies of cartilage deformation, 
wear, damage, and friction under conditions of tribological contact was designed by Burkhardt [33] and 
later modified, constructed, and instrumented. A simplified sketch is shown in Fig. 21.6. The key features 
of this test device are shown in Table 21.2. The apparatus is designed to accommodate cartilage-on- 
cartilage specimens. Motion of the lower specimen is controlled by a computer- driven x-y table, allowing 
simple oscillating motion or complex motion patterns. An octagonal strain ring with two full semi- 
conductor bridges is used to measure the normal load as well as the tangential load (friction). An LVDT, 
not shown in the figure, is used to measure cartilage deformation and linear wear during a test. However, 
hydroxyproline analysis of the wear debris and washings is used for the actual determination of total 
cartilage wear on a mass basis. 

In one study by Schroeder [41], two types of experiments were carried out, i.e., cartilage-on-stainless 
steel and cartilage-on-cartilage at applied loads up to 70N — yielding an average pressure of 2.2 MPa in 
the contact area. Reciprocating motion (40 cps) was used. The fluids tested included: (1) a buffered saline 
solution, (2) saline plus hyaluronic acid, and (3) bovine synovial fluid. In cartilage-on-stainless steel tests, 
scanning electron microscopy, and histological staining showed distinct effects of the lubricants on surface 
and subsurface damage. Tests with the buffered saline fluid resulted in the most damage, with large wear 
tracks visible on the surface of the cartilage plug, as well as subsurface voids and cracks. When hyaluronic 
acid, a constituent of the natural synovial joint lubricant, was added to the saline reference fluid, less 
severe damage was observed. Little or no cartilage damage was evident in tests in which the natural 
synovial joint fluid was used as the lubricant. 
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FIGURE 21.6 Device for in vitro cartilage-on-cartilage wear studies 



TABLE 21.2 Key Features of Test Device Designed for Cartilage Wear Studies [33] 

Contact system Cartilage-on-cartilage 

Contact geometry Flat-on-flat, convex-on-flat, irregular-on-irregular 

Cartilage type Articular, any source (e.g., bovine) 

Specimen size Upper specimen, 4 to 6 mm diam., lower specimen, ca. 15 to 25 mm diam. 

Applied load 50-660 N 

Average pressure 0.44-4.4 MPa 

Type of motion Linear, oscillating; circular, constant velocity; more complex patterns 

Sliding velocity to 20 mm/s 

Fluid temperature Ambient (20°C); or controlled humidity 

Environment Ambient or controlled humidity 

Measurements Normal load, cartilage deformation, friction; cartilage wear and damage, biochemical analysis of 
cartilage specimens, synovial fluid, and wear debris; sub-surface changes 

These results were confirmed in a later study by Owellen [42] in which hydroxyproline analysis was 
used to determine cartilage wear. It was found that increasing the applied load from 20 to 65N increased 
cartilage wear by eight-fold for the saline solution and approximately three-fold for synovial fluid. Fur- 
thermore, the coefficient of friction increased from an initial low value of 0.01 to 0.02 to a much higher 
value, e.g., 0.20 to 0.30 and higher, during a normal test which lasted 3 hours; the greatest change occurred 
during the first 20 minutes. Another interesting result was that a thin film of transferred or altered material 
was observed on the stainless steel disks — being most pronounced with the buffered saline lubricant and 
not observed with synovial fluid. Examination of the film with Fourier Transfer Infrared Microspectrom- 
etry shows distinctive bio-organic spectra which differs from that of the original bovine cartilage. We 
believe this to be an important finding since it suggests a possible bio-tribochemical effect [43]. 
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FIGURE 21.7 Cartilage damage produced by sliding contact 

In another phase of this research, the emphasis is on the cartilage-on-cartilage system and the influence 
of potentially beneficial as well as harmful constituents of synovial fluid on wear and damage. In cartilage- 
on-cartilage tests, the most severe wear and damage occurred during tests with buffered saline as the 
lubricant. The damage was less severe than in the stainless steel tests, but some visible wear tracks were 
detectable with scanning electron microscopy. Histological sectioning and staining of both the upper and 
lower cartilage samples show evidence of elongated lacunae and coalesced voids that could lead to wear 
by delamination. An example is shown in Fig. 21.7 (original magnification of 500X on 35mm slide). The 
proteoglycan content of the subsurface cartilage under the region of contact was also reduced. When 
synovial fluid was used as the lubricant, no visible wear or damage was detected [44]. These results 
demonstrate that even in in vitro tests with bovine articular cartilage, the nature of the fluid environment 
can have a dramatic affect on the severity of wear and subsurface damage. 

In a more recent study carried out by Berrien in the biotribology program at Virginia Tech, a different 
approach was taken to examine the role of joint lubrication in joint disease, particularly osteoarthritis. 
A degradative biological enzyme, collagenase-3, suspected of playing a role in a cartilage degeneration 
was used to create a physiologically adverse biochemical fluid environment. Tribological tests were 
performed with the same device and procedures described previously. The stainless steel disk was replaced 
with a 1 in. diameter plug of bovine cartilage to create a cartilage sliding on cartilage configuration more 
closely related to the in vivo condition. Normal load was increased to 78.6 N and synovial fluid and 
buffered saline were used as lubricants. Prior to testing, cartilage plugs were exposed to a fluid medium 
containing three concentrations of collagenase-3 for 24 hr. The major discovery of this work was that 
exposure to the collagenase-3 enzyme had a substantial adverse effect on cartilage wear in vitro, increasing 
average wear values by three and one-half times those of the unexposed cases. Figure 21.8 shows an 
example of the effect of enzyme treatment when bovine synovial fluid was used as the lubricant. Scanning 
electron microscopy showed disruption of the superficial layer and collagen matrix with exposure to 
collagenase-3, where unexposed cartilage showed none. Histological sections showed a substantial loss 
of the superficial layer of cartilage and a distinct and abnormal loss of proteoglycans in the middle layer 
of collagenase-treated cartilage. Unexposed cartilage showed only minor disruption of the superficial 
layer [45]. 

This study indicates that some of the biochemical constituents that gain access to the joint space, 
during normal and pathological functions, can have a significant adverse effect on the wear and damage 
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FIGURE 21.8 Effect of collagenase-3 on cartilage wear 

of the articular cartilage. Future studies will include determination of additional constituents that have 
harmful effects on cartilage wear and damage. This research, using bovine articular cartilage in in vitro 
sliding contact tests, raises a number of interesting questions: 

1. Has 'Nature' designed a special biochemical compound which has as its function the protection 
of articular cartilage? 

2. What is the mechanism (or mechanisms) by which biochemical constituents of synovial fluid can 
act to reduce wear of articular cartilage? 

3. Could a lack of this biochemical constituent lead to increased cartilage wear and damage? 

4. Does articular cartilage from osteoarthritic patients have reduced wear resistance? 

5. Do any of the findings on the importance of synovial fluid biochemistry on cartilage wear in our 
in vitro studies apply to living or in vitro systems as well? 

6. How does collagenase-3 treatment of cartilage lead to increased wear and does this finding have 
any significance in the in vivo situation? This question is addressed in the next section. 

21.7 Biotribology and Arthritis: Are There Connections? 

Arthritis is an umbrella term for more than 100 rheumatic diseases affecting joints and connective tissue. 
The two most common forms are osteoarthritis (OA) and rheumatoid arthritis (RA). Osteoarthritis — also 
referred to as osteoarthrosis or degenerative joint disease — is the most common form of arthritis. It is 
sometimes simplistically described as the "wear and tear" form of arthritis. The causes and progression 
of degenerative joint disease are still not understood. Rheumatoid arthritis is a chronic and often pro- 
gressive disease of the synovial membrane leading to release of enzymes which attack, erode, and destroy 
articular cartilage. It is an inflammatory response involving the immune system and is more prevalent 
in females. Rheumatoid arthritis is extremely complex. Its causes are still unknown. 

Sokoloff defines degenerative joint disease as "an extremely common, noninflammatory, progressive 
disorder of movable joints, particularly weight-bearing joints, characterized pathologically by deteriora- 
tion of articular cartilage and by formation of new bone in the sub-chondral areas and at the margins 
of the "joint" [46]. As mentioned, osteoarthritis or osteoarthrosis is sometimes referred to as the "wear 
and tear" form of arthritis; but, wear itself is rarely a simple process even in well-defined systems. 

It has been noted by the author that tribological terms occasionally appear in hypotheses which describe 
the etiology of osteoarthritis (e.g., "reduced wear resistance of cartilage" or "poor lubricity of synovial 
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fluid"). It has also been noted that there is a general absence of hypotheses connecting normal synovial 
joint lubrication (or lack thereof) and synovial joint degeneration. Perhaps it is natural (and unhelpful) 
for a tribologist to imagine such a connection and that, for example, cartilage wear under certain 
circumstances might be due to or influenced by a lack of proper "boundary lubrication" by the synovial 
fluid. In this regard, it may be of interest to quote Swanson [12] who said in 1979 that "there exists at 
present no experimental evidence which certainly shows that a failure of lubrication is or is not a causative 
factor in the first stages of cartilage degeneration." A statement made by Professor Glimcher [52] may 
also be appropriate here. Glimcher fully recognized the fundamental difference between friction and wear 
as well as the difference between joint lubrication (one area of study) and joint degeneration (another 
area of study). Glimcher said that wearing or abrading cartilage with a steel file is not osteoarthritis; and 
neither is digesting cartilage in a test tube with an enzyme. But both forms of cartilage deterioration can 
occur in a living joint and in a way which is still not understood. It is interesting that essentially none 
of the many synovial joint lubrication theories consider enzymatic degradation of cartilage as a factor 
whereas practically all the models of the etiology of degenerative joint disease include this as an important 
factor. 

It was stated earlier that there are at least two main areas to consider, i.e., (1) mechanisms of synovial 
joint lubrication and (2) the etiology of synovial joint degeneration (e.g., as in osteoarthrosis). Both areas 
are extremely complex. And the key questions as to what actually happens in each have yet to be answered 
(and perhaps asked). It may therefore be presumptuous of the present author to suggest possible con- 
nections between two areas which in themselves are still not fully understood. 

Tribological processes in a movable joint involve not only the contacting surfaces (articular cartilage), 
but the surrounding medium (synovial fluid) as well. Each of these depends on the synthesis and transport 
of necessary biochemical constituents to the contact region or interface. As a result of relative motion 
(sliding, rubbing, rolling, and impact) between the joint elements, friction and/or wear can occur. 

It has already been shown and discussed — at least in in vitro tests with articular cartilage — that 
compounds which reduce friction do not necessarily reduce wear; the latter was suggested as being more 
important [10]. It may be helpful first of all to emphasize once again that friction and wear are different 
phenomena. Furthermore, certain constituents of synovial fluid (e.g., Swann's Lubricating Glycoprotein) 
may act to reduce friction in synovial joints while other constituents (e.g., Swann's protein complex or 
hyaluronic acid) may act to reduce cartilage wear. 

A significant increase in joint friction could lead to a slight increase in local temperatures or possibly 
to reduce mobility. But the effects of cartilage wear would be expected to be more serious. When cartilage 
wear occurs, a very special material is lost and the body is not capable of regenerating cartilage of the 
same quality nor at the desired rate. Thus, there are at least two major tribological dimensions 
involved — one concerning the nature of the synovial fluid and the other having to do with the properties 
of articular cartilage itself. Changes in either the synovial fluid or cartilage could conceivably lead to 
increased wear or damage (or friction) as shown in Fig. 21.9. 

A simplified model or illustration of possible connections between osteoarthritis and tribology is 
offered in Fig. 21.10 taken from Furey [53]. Its purpose is to stimulate discussion. There are other 
pathways to the disease, pathways which may include genetic factors. 
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FIGURE 21.9 Two tribological aspects of synovial joint lubrication 



) 2000 by CRC Press LLC 



Synthesis of 
biochemical 
constituents 



Transport to 

synovial 
fluid/cartilage 



Synovial 
fluid 



Tribo- 
processes 



Articular 
cartilage 



Softening and 

degradation of 

cartilage 



Joint 
friction 



Cartilage 
repair 



Cartilage 
wear 



Wear 
particles 



Inflammation 



Enzyme 
release 



Reduced 
mobility? 



Progressive 

thinning 
of cartilage 



Bony 
growth 



Trauma, 
shock, 
injury 



Other 
pathways 



Osteoarthritis 



FIGURE 21.10 Osteoarthritis-tribology connections? 

In some cases, the body makes an unsuccessful attempt at repair, and bone growth may occur at the 
periphery of contact. As suggested by Fig. 21.10, this process and the generation of wear particles could 
lead to joint inflammation and the release of enzymes which further soften and degrade the articular 
cartilage. This softer, degraded cartilage does not possess the wear-resistance of the original. It has been 
shown previously that treatment of cartilage with collagenase-3 increases wear significantly, thus sup- 
porting the idea of enzyme release as a factor in osteoarthritis. Thus, there exists a feedback process in 
which the occurrence of cartilage wear can lead to even more damage. Degradative enzymes can also be 
released by trauma, shock, or injury to the joint. Ultimately, as the cartilage is progressively thinned and 
bony growth occurs, a condition of osteoarthritis or degenerative joint disease may exist. There are other 
pathways to the disease, pathways which may include genetic factors. It is not argued that arthritis is a 
tribological problem. However, the inclusion of tribological processes in one set of pathways to osteo- 
arthrosis would not seem strange or unusual. 

A specific example of a different tribological dimension to the problem of synovial joint lubrication 
(i.e., third-body abrasion), was shown by the work of Hayes et al. [54] . In an excellent study of the effect 
of crystals on the wear of articular cartilage, they carried out in vitro tests using cylindrical cartilage sub- 
chondral bone plugs obtained from equine fetlock joints in sliding contact against a stainless steel plate. 
They examined the effects of three types of crystals (orthorhombic calcium pyrophosphate tetrahydrate, 
monoclinic calcium pyrophosphate dehydrate, and calcium hydroxyapatite) on wear using a Ringer's 
solution as the carrier fluid. Concentration of cartilage wear debris in the fluid was determined by 
analyzing for inorganic sulphate derived from the proteoglycans present. Several interesting findings were 
made, one of them being that the presence of the crystals roughly doubled cartilage wear. This is an 
important contribution which should be read by anyone seriously contemplating research on the tribol- 
ogy of articular cartilage. The careful attention to detail and potential problems, as well as the precise 
description of the biochemical procedures and diverse experimental techniques used, set a high standard. 

21.8 Recapitulation and Final Comments 

It is obvious from the unusually large number of theories of synovial joint lubrication proposed, that 
very little is known about the subject. Synovial joints are undoubtedly the most sophisticated and complex 
tribological systems that exist or will ever exist. It will require a great deal more research — possibly very 
different approaches — before we even begin to understand the processes involved. 
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Some general comments and specific suggestions are offered — not for the purpose of criticizing any 
particular study but hopefully to provide ideas which may be helpful in further research as well as in the 
re-interpretation of some past research. 

Terms and Definitions 

First of all, as mentioned earlier in this chapter, part of the problem has to do with the use and misuse 
of various terms in tribology — the study of friction, wear, and lubrication. A glance at any number of 
the published papers on synovial joint lubrication will reveal such terms and phrases as "lubricating 
ability," "lubricity," "lubricating properties," "lubricating component," and many others, all undefined. 
We also see terms like "boundary lubricant," "lubricating glycoprotein," or "lubricin". There is nothing 
inherently wrong with this but one should remember that lubrication is a process of reducing friction 
and/or wear between rubbing surfaces. Saying that a fluid is a "good" lubricant does not distinguish 
between friction and wear. And assuming that friction and wear are correlated and go together is the 
first pitfall in any tribological study. It cannot be overemphasized that friction and wear are different, 
though sometimes related, phenomena. Low friction does not mean low wear. The terms and phrases 
used are therefore extremely important. For example, in a brief and early review article by Wright and 
Dowson [55], it was stated that "Digestion of hyaluronate does not alter the boundary lubrication," 
referring to the work of Radin, Swann, and Weisser [56]. In another article, McCutchen re-states this 
conclusion in another way, saying "... the lubricating ability did not reside in the hyaluronic acid" and 
later asks the question "Why do the glycoprotein molecules (of Swann) lubricate?" [57] These statements 
are based on effects of various constituents on friction, not wear. The work of the present author showed 
that in tests with bovine articular cartilage, Swann's Lubricating Glycoprotein LGP-I which was effective 
in reducing friction did not reduce cartilage wear. However, hyaluronic acid — shown earlier not to be 
responsible for friction-reduction — did reduce cartilage wear. Thus, it is important to make the distinc- 
tion between friction-reduction and wear-reduction. It is suggested that operational definitions be used 
in place of vague "lubricating ability," etc. terms in future papers on the subject. 

Experimental Contact Systems 

Secondly, some comments are made on the experimental approaches that have been reported in the 
literature on synovial joint lubrication mechanisms. Sliding contact combinations in in vitro studies have 
consisted of (1) cartilage-on-cartilage, (2) cartilage-on-some other surface (e.g., stainless steel, glass), 
and (3) solids other than cartilage sliding against each other in X-on-X or X-on-Y combinations. 

The cartilage-on-cartilage combination is of course the most realistic and yet most complex contact 
system. But variations in shape or macroscopic geometry, microtopography, and the nature of contact 
present problems in carrying out well- controlled experiments. There is also the added problem of 
acquiring suitable specimens which are large enough and reasonably uniform. 

The next combination — cartilage-on-another material — allows for better control of contact, with the 
more elastic, deformable cartilage loaded against a well-defined hard surface (e.g., a polished, flat solid 
made of glass or stainless steel). This contact configuration can provide useful tribological information 
on effects of changes in biochemical environment (e.g., fluids), on friction, wear, and sub-surface damage. 
It also could parallel the situation in a partial joint replacement in which healthy cartilage is in contact 
with a metal alloy. 

The third combination, which appears in some of the literature on synovial joint lubrication, does not 
involve any articular cartilage at all. For example, Jay made friction measurements using a latex-covered 
stainless steel stud in oscillating contact against polished glass [31]. Williams et al., in a study of a lipid 
component of synovial fluid, used reciprocating contact of borosilicate glass-on-glass [58]. And in a 
recent paper on the action of a surface-active phospholipid as the "lubricating component of lubricin," 
Schwarz and Hills carried out friction measurements using two optically flat quartz plates in sliding 
contact [59]. In another study, a standard four-ball machine using alloy steel balls was used to examine 
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the "lubricating ability" of synovial fluid constituents. Such tests, in the absence of cartilage, are easiest 
to control and carry out. However, they are not relevant to the study of synovial joint lubrication. With 
a glass sphere sliding against a glass flat, almost anything will reduce friction — including a wide variety 
of chemicals, biochemicals, semi-solids, and fluids. This has little if anything to do with the lubrication 
of synovial joints. 

Fluids and Materials Used as Lubricants 
in In Vitro Biotribology Studies 

Fluids used as lubricants in synovial joint lubrication studies have consisted of (1) "normal" synovial 
fluid (e.g., bovine), (2) buffered saline solution containing synovial fluid constituents (e.g., hyaluronic 
acid), and (3) various aqueous solutions of surface active compounds neither derived from nor present 
in synovial fluid. In addition, a few studies used synovial fluids from patients suffering from either 
osteoarthritis or rheumatoid arthritis. 

The general comment made here is that the use of synovial fluids — whether derived from human or 
animal sources and whether "healthy" or "abnormal" — is important in in vitro studies of synovial joint 
lubrication. The documented behavior of synovial fluid in producing low friction and wear with articular 
cartilage sets a reference standard and demonstrates that useful information can indeed come from in vitro 
tests. 

Studies that are based on adding synovial fluid constituents to a reference fluid (e.g., a buffered saline 
solution) can also be useful in attempting to identify which biochemical compound or compounds are 
responsible for reductions in frictions or wear. But if significant interactions between compounds exist, 
then such an approach may require an extensive program of tests. It should also be mentioned that in 
the view of the present author, the use of a pure undissolved constituent of synovial fluid, either derived 
or synthetic, in a sliding contact test is not only irrelevant but may be misleading. An example would be 
the use of a pure lipid (e.g., phospholipid) at the interface rather than in the concentration and solution 
form in which this compound would normally exist in synovial fluid. This is basic in any study of 
lubrication and particularly in the case of boundary lubrication where major effects on wear or friction 
can be brought on by minor, seemingly trivial, changes in chemistry. 

The Preoccupation with Rheology and Friction 

The synovial joint as a system — the articular cartilage and underlying bone structure as well as the 
synovial fluid as important elements — is extremely complex and far from being understood. It is noted 
that there is a proliferation of mathematical modeling papers stressing rheology and the mechanics of 
deformation, flow, and fluid pressures developed in the cartilage model. One recent example is the paper 
"The Role of Interstitial Fluid Pressurization and Surface Properties on the Boundary Friction of Articular 
Cartilage" by Ateshian et al. [21]. This study, a genuine contribution, grew out of the early work by Mow 
and connects also with the "weeping lubrication" model of McCutchen. Both McCutchen and Mow have 
made significant contributions to our understanding of synovial joint lubrication, although each 
approach is predominantly rheological and friction-oriented with little regard for biochemistry and wear. 
This is not to say that rheology is unimportant. It could well be that, as suggested by Ateshian, the 
mechanism of interstitial fluid pressurization that leads to low friction in cartilage could also lead to low 
wear rates [60]. 

The Probable Existence of Various Lubrication Regimes 

In an article by Wright and Dowson, it is suggested that a variety of types of lubrication operate in human 
synovial joints at different parts of a walking cycle stating that, "At heel-strike a squeeze-film situation 
may develop, leading to elastohydrodynamic lubrication and possibly both squeeze-film and boundary 
lubrication, while hydrodynamic lubrication may operate during the free-swing phase of walking" [55]. 
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FIGURE 21.11 (a) Hip joint forces and angular velocities at different parts of the walking cycle (after Graham and 
Walker [61]). (b) Calculated ratio of velocity to force for the hip joint (from Figure 22.11a) 

In a simplified approach to examining the various regimes of lubrication that could exist in a human 
joint, it may be useful to look at Fig. 21.11a which shows the variation in force (load) and velocity for a 
human hip joint at different parts of the walking cycle (taken from Graham and Walker [6 1 ] ) . As discussed 
earlier in this chapter, theories of hydrodynamic and elastohydrodynamic lubrication all include the 
hydrodynamic factor (r\U/W) as the key variable, where r\ = fluid viscosity, U= the relative sliding 
velocity, and W = the normal load. High values of (r\ U/W) lead to thicker hydrodynamic films — a more 
desirable condition if one wants to keep surfaces apart. It can be seen from Fig. 21.11a that there is 
considerable variation in load and velocity, with peaks and valleys occurring at different parts of the 
cycle. Note also that in this example, the loads can be quite high (e.g., up to three times body weight). 
The maximum load occurs at 20% of the walking cycle illustrated in Fig. 21.11a, with a secondary 
maximum occurring at a little over the 50% point. The maximum angular velocity occurs at approxi- 
mately 67% of the cycle. If one now creates a new curve of relative velocity/load or ( U/W) from Fig. 21.11a, 
the result obtained is shown in Fig. 21.1 lb. We see now a very different and somewhat simplified picture. 
There is a clear and distinct maximum in the ratio of velocity to load ( U/W) at 80% of walking cycle, 
favoring the formation of a hydrodynamic film of maximum thickness. However, for most of the cycle 
(e.g., from to 60%), the velocity/load ratio is significantly lower, thus favoring a condition of minimum 
film thickness and "boundary lubrication". However, we also know that synovial fluid is non-Newtonian; 
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at higher rates of shear, its viscosity decreases sharply, approaching that of water. The shear rate is equal 
to the relative velocity divided by fluid film thickness (U/h) and is expressed in s _1 . This means that at 
the regions of low ( U/W) ratios or thinner hydrodynamic films, the viscosity term in (r| U/W) is even 
lower, thus pushing the minima to lower values favoring a condition of boundary lubrication. This is 
only a simplified view and does not consider those periods in which the relative sliding velocity is zero 
at motion reversal and where squeeze-film lubrication may come into play. A good example of the 
complexity of load and velocity variation in a human knee joint — including several zero-velocity peri- 
ods — may be found in the chapter by Higginson and Unsworth [62] citing the work of Seedhom et al., 
which deals with biomechanics in the design of a total knee replacement [63]. 

The major point made here is that (1) there are parts of a walking cycle that would be expected to 
approach a condition of minimum fluid film thickness and boundary lubrication and (2) it is during 
these parts of the cycle that cartilage wear and damage resulting from contact is more likely to occur. 
Thus, approaches to reducing cartilage wear in a synovial joint could be broken down into two categories 
(i.e., promoting thicker hydrodynamic films and providing special forms of "boundary lubrication.") 

Recent Developments 

Recent developments in addressing some of the problems that involve cartilage damage and existing joint 
replacements include (1) progress in promoting cartilage repair [64], (2) possible use of artificial cartilage 
materials (e.g., synthetic hydrogels) [65,66], and (3) the development and application of more compliant 
joint replacement materials to promote a more favorable formation of an elastohydrodynamic film [67]. 
Although these are not strictly "lubricant- oriented" developments, they do and will involve important 
tribological aspects discussed in this chapter. For example, if new cartilage growth can be promoted by 
transplanting healthy chondrocytes to a platform in a damaged region of a synovial joint, how long will 
this cartilage last? If a hydrogel is used as an artificial cartilage, how long will it last? And if softer, 
elastomeric materials are used as partial joint replacements or coatings, how long will they last? These 
are questions of wear, not friction. And although the early fundamental studies of hydrogels as artificial 
cartilage measured only friction, and often only after a few moments of sliding, we know from recent 
work that even for hydrogels, low friction does not mean low wear [68]. 

21.9 Conclusions 

The following main conclusions relating to the tribological behavior of natural, "normal" synovial joints 
are presented: 

1. An unusually large number of theories and studies of joint lubrication have been proposed over 
the years. All of the theories focus on friction, none address wear, many do not involve experimental 
studies with cartilage, and very few consider the complexity and detailed biochemistry of the 
synovial-fluid articular-cartilage system. 

2. It was shown by in vitro tests with bovine articular cartilage that the detailed biochemistry of 
synovial fluid has a significant effect on cartilage wear and damage. "Normal" bovine synovial 
fluid was found to provide excellent protection against wear. Various biochemical constituents 
isolated from bovine synovial fluid by Dr. David Swann, of the Shriners Burns Institute in Boston, 
showed varying effects on cartilage wear when added back to a buffered saline reference fluid. 
This research demonstrates once again the importance of distinguishing between friction and wear. 

3. In a collaborative study of biotribology involving researchers and students in Mechanical Engi- 
neering, the Virginia-Maryland College of Veterinary Medicine, and Biochemistry, in vitro tribo- 
logical tests using bovine articular cartilage demonstrated among other things that (1) normal 
synovial fluid provides better protection than a buffered saline solution in a cartilage-on-cartilage 
system, (2) tribological contact in cartilage systems can cause subsurface damage, delamination, 
changes in proteoglycan content, and in chemistry via a "biotribochemical" process not under- 
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stood, and (3) pre-treatment of articular cartilage with the enzyme collagenase-3 — suspected as 
a factor in osteoarthritis — significantly increases cartilage wear. 

4. It is suggested that these results could change significantly the way mechanisms of synovial joint 
lubrication are examined. Effects of biochemistry of the system on wear of articular cartilage are 
likely to be important; such effects may not be related to physical/rheological models of joint 
lubrication. 

5. It is also suggested that connections between tribology/normal synovial joint lubrication and 
degenerative joint disease are not only possible but likely; however, such connections are undoubt- 
edly complex. It is not argued that osteoarthritis is a tribological problem or that it is necessarily 
the result of a tribological deficiency. Ultimately, a better understanding of how normal synovial 
joints function from a tribological point of view could conceivably lead to advances in the pre- 
vention and treatment of osteoarthritis. 

6. Several problems exist that make it difficult to understand and interpret many of the published 
works on synovial joint lubrication. One example is the widespread use of non-operational and 
vague terms such as "lubricating activity," "lubricating factor," "boundary lubricating ability", and 
similar undefined terms which not only fail to distinguish between friction (which is usually 
measured) and cartilage wear (which is rarely measured), but tend to lump these phenomena 
together — a common error. Another problem is that a significant number of the published exper- 
imental studies of biotribology do not involve cartilage at all — relying on the use of glass-on-glass, 
rubber-on-glass, and even steel-on-steel. Such approaches may be a reflection of the incorrect 
view that "lubricating activity" is a property of a fluid and can be measured independently. Some 
suggestions are offered. 

7. Last, the topic of synovial joint lubrication is far from being understood. It is a complex subject 
involving at least biophysics, biomechanics, biochemistry, and tribology. For a physical scientist 
or engineer, carrying out research in this area is a humbling experience. 
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Further Information 

For more information on synovial joints and arthritis, the following books are suggested: The Biology of 
Degenerative Joint Disease [46], Adult Articular Cartilage [5], The Joints and Synovial Fluid: I [6], Textbook 
of Rheumatology [47], Osteoarthritis: Diagnosis and Management [48], Degenerative Joints: Test Tubes, 
Tissues, Models, and Man [49], Biology of the Articular Cartilage in Health and Disease [50], and Crystals 
and Joint Disease [51]. 
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Skeletal muscles are length- and velocity-dependent force generators [Zajac, 1989]. Numerous extrinsic 
and intrinsic factors determine the magnitude of muscle force generated, maximum muscle contractile 
velocity, and the sensitivity of force generation to length and velocity. In this chapter, we briefly describe 
and summarize experimental values reported for these factors. Such information is necessary to generate 
accurate and physiologically relevant musculoskeletal models. Since muscles transmit force to bones via 
tendons, a summary of tendon biomechanical properties is also presented. 

22.2 Fundamentals of Soft Tissues Biomechanics 
Muscle Architecture 

Muscles are organized arrays of multinucleated myofibers. Their mechanical properties depend on both 
the intrinsic properties of those fibers and their extrinsic arrangement, or architecture. Muscle architec- 
ture is typically described in terms of muscle length, mass, myofiber length, pennation angle (the angle 
between the line of action and the myofiber long axis), and physiological cross-sectional area (PCSA). 
PCSA is an approximation of the total cross-sectional area of all muscle fibers, projected along the muscle's 
line of action; it is calculated as: 



PCSA (mm 2 ] = 



M 



(g)»cos9 



p(g/mm 3 )»L f (mm) 



where M= muscle mass, p = muscle density (1.056 g/cm 3 in fresh tissue), = surface pennation angle, 
and Lr= myofiber length. This formulation provides a good estimate of experimentally measured iso- 
metric muscle force output [Powell et al., 1984]. 
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TABLE 22.1 


Skeletal Muscle Spec 


ific Tension 






Species 


Muscle Type 


Preparation 


Specific Tension (kPa) 


Ref. 


Rat 


SO 


Single fiber 


134 


Fitts et al., 1991 


Human 


Slow 


Single fiber 


133 


Fittsetal., 1991 


Rat 


FOG 


Single fiber 


108 


Fitts et al., 1991 


Rat 


FG 


Single fiber 


108 


Fittsetal., 1991 


Human 


Fast 


Single fiber 


166 


Fittsetal., 1991 


Cat 


1 


Motor unit 


59 


Dumetal., 1982 


Cat 


S 


Motor unit 


172 


Bodineetal., 1987 


Cat 


2A 


Motor unit 


284 


Dumetal., 1982 


Cat 


FR 


Motor unit 


211 


Bodineetal., 1987 


Cat 


2B + 2AB 


Motor unit 


343 


Dumetal., 1982 


Cat 


FF/FI 


Motor unit 


249 


Bodineetal., 1987 


Human 


Elbow flexors 


Whole muscle 


230-420 


Edgerton et al., 1990 


Human 


Ankle plantarflexors 


Whole muscle 


45-250 


Fukunaga et al., 1996 


Rat 


Tibialis anterior 


Whole muscle 


272 


Wells, 1965 


Rat 


Soleus 


Whole muscle 


319 


Wells, 1965 


Guinea pig 


Hindlimb muscles 


Whole muscle 


225 


Powell et al., 1984 


Guinea pig 


Soleus 


Whole muscle 


154 


Powell et al., 1984 



Maximum Muscle Stress 

Maximum active stress (often termed specific tension) varies somewhat among fiber types and species 
(Table 22.1) around a typical value of 250 kPa. Specific tension can be determined in any system in which 
it is possible to measure force and estimate the area of contractile material. In practice, area measurements 
may be difficult to make, giving rise to the large variability of reported values. Given muscle PCSA, the 
maximum force produced by a muscle can be predicted by multiplying this PCSA by specific tension 
(Table 22.1). Specific tension can also be calculated for isolated muscle fibers or motor units in which 
estimates of cross-sectional area have been made. 

Maximum Muscle Contraction Velocity 

Muscle maximum contraction velocity is primarily dependent on the type and number of sarcomeres in 
series along the muscle fiber length [Gans, 1982]. This number has been experimentally determined for 
a number of skeletal muscles (Table 22.2). Maximum contraction velocity of a given muscle can thus be 

TABLE 22.2 Skeletal Muscle Dynamic Properties 



Species Muscle Type Preparation 



a/Po b/V m 



Ref. 



Rat 


SO 


Single fiber 


1.49 L/s 


Human 


Slow 


Single fiber 


0.86 L/s 


Rat 


FOG 


Single fiber 


4.91 L/s 


Rat 


FG 


Single fiber 


8.05 L/s 


Human 


Fast 


Single fiber 


4.85 L/s 


Mouse 


Soleus 


Whole muscle 


31.7 um/s 


Rat 


Soleus 


Whole muscle 


18.2 um/s 


Rat 


Soleus 


Whole muscle 


5.4 cm/s 


Cat 


Soleus 


Whole muscle 


13 um/s 


Mouse 


EDL 


Whole muscle 


60.5 um/s 


Rat 


EDL 


Whole muscle 


42.7 um/s 


Cat 


EDL 


Whole muscle 


31 um/s 


Rat 


TA 


Whole muscle 


14.4 cm/s 



0.214 



0.356 



0.23 



0.38 



Fittsetal., 1991 
Fitts et al., 1991 
Fitts et al., 1991 
Fitts et al., 1991 
Fitts et al., 1991 
Close, 1972 
Close, 1972 
Wells, 1965 
Close, 1972 
Close, 1972 
Close, 1972 
Close, 1972 
Wells, 1965 



L/s fiber or sarcomere lengths per second, um/s sarcomere velocity; cm/s whole muscle 



velocity. 
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TABLE 22.3 Architectural Properties of the Human Arm and Forearm a ' b 





Muscle 


Muscle 


Fiber 


Pennation 


Cross-Sectional 






Muscle 


Mass (g) 


Length (mm) 


Length (mm) 


Angle (°) 


Area (cm 2 ) 


FL/ML Ratio 


BR (n = 8) 


16.6 


± 


2.8 


175 


±8.3 


121 


± 8.3 


2.4 ± 


0.6 


1.33 


±0.22 


0.69 


± 0.062 


PT (n = 8) 


15.9 


± 


1.7 


130 


±4.7 


36.4 


± 1.3 


9.6 ± 


0.8 


4.13 


±0.52 


0.28 


± 0.012 


PQ (n = 8) 


5.21 


± 


1.0 


39.3 


±2.3 


23.3 


±2.0 


9.9 ± 


0.3 


2.07 


±0.33 


0.58 


± 0.021 


EDC I (n = 8) 


3.05 


± 


0.45 


114 


±3.4 


56.9 


±3.6 


3.1 ± 


0.5 


0.52 


±0.08 


0.49 


± 0.024 


EDC M (n = 5) 


6.13 


± 


1,2 


112 


±4.7 


58.8 


±3.5 


3.2 ± 


1.0 


1.02 


±0.20 


0.50 


± 0.014 


EDC R (n = 7) 


4.70 


± 


0.75 


125 


± 10.7 


51.2 


± 1.8 


3.2 ± 


0.54 


0.86 


±0.13 


0.42 


± 0.023 


EDC S (n = 6) 


2.23 


± 


0.32 


121 


±8.0 


52.9 


±5.2 


2.4 ± 


0.7 


0.40 


±0.06 


0.43 


± 0.029 


EDQ (n = 7) 


3.81 


± 


0.70 


152 


±9.2 


55.3 


± 3.7 


2.6 ± 


0.6 


0.64 


± 0.10 


0.36 


± 0.012 


EIP (n = 6) 


2.86 


± 


0.61 


105 


±6.6 


48.4 


±2.3 


6.3 ± 


0.8 


0.56 


± 0.11 


0.46 


± 0.023 


EPL (n = 7) 


4.54 


± 


0.68 


138 


± 7.2 


43.6 


±2.6 


5.6 ± 


1.3 


0.98 


±0.13 


0.31 


± 0.020 


PL (n = 6) 


3.78 


± 


0.82 


134 


± 11.5 


52.3 


± 3.1 


3.5 + 


1.2 


0.69 


±0.17 


0.40 


± 0.032 


FDS I(P) (n = 6) 


6.0 


± 


1.1 


92.5 


± 8.4 


31.6 


± 3.0 


5.1 ± 


0.2 


1.81 


±0.83 


0.34 


± 0.022 


FDS 1(D) (n = 9) 


6.6 


± 


0.8 


119 


±6.1 


37.9 


± 3.0 


6.7 ± 


0.3 


1.63 


±0.22 


0.32 


± 0.013 


FDS 1(C) (n = 6) 


12.4 


± 


2.1 


207 


± 10.7 


67.6 ± 2.8 


5.7 ± 


0.2 


1.71 


±0.28 


0.33 


± 0.025 


FDS M (n = 9) 


16.3 


± 


2.2 


183 


± 11.5 


60.8 ± 3.9 


6.9 ± 


0.7 


2.53 


±0.34 


0.34 


± 0.014 


FDS R (n = 9) 


10.2 


± 


1.1 


155 


± 7.7 


60.1 ±2.7 


4.3 ± 


0.6 


1.61 


± 0.18 


0.39 


± 0.023 


FDS S (n = 9) 


1.8 


± 


0.3 


103 


± 6.3 


42.4 


±2.2 


4.9 + 


0.7 


0.40 


±0.05 


0.42 


± 0.014 


FDP I (n = 9) 


11.7 


± 


1.2 


149 


±3.8 


61.4 


±2.4 


7.2 ± 


0.7 


1.77 


± 0.16 


0.41 


± 0.018 


FDP M (n = 9) 


16.3 


± 


1.7 


200 


±8.2 


68.4 


±2.7 


5.7 ± 


0.3 


2.23 


±0.22 


0.34 


± 0.011 


FDP R (n = 9) 


11.9 


± 


1.4 


194 


± 7.0 


64.6 


±2.6 


6.8 ± 


0.5 


1.72 


± 0.18 


0.33 


± 0.009 


FDP S (n = 9) 


13.7 


± 


1.5 


150 


±4.7 


60.7 


± 3.9 


7.8 ± 


0.9 


2.20 


±0.30 


0.40 


±0.015 


FPL (n = 9) 


10.0 


± 


1.1 


168 


± 10.0 


45.1 


±2.1 


6.9 ± 


0.2 


2.08 


±0.22 


0.24 


±0.010 



• Data from Lieber et al., 1990, 1992. 

b BR: brachioradialis; EDC I, EDC M, EDC R, and EDC S: extensor digitorum communis to the index, middle, ring 
and small fingers, respectively; EDQ: extensor digiti quinti; EIP: extensor indicis proprious; EPL: extensor pollicis longus; 
FDP I, FDP M, FDP R, and FDP S: flexor digitorum profundus muscles; FDS I, FDS M, FDS R, and FDS S: flexor 
digitorum superficialis muscles; FDS I (P) and FDS I (D): proximal and distal bellies of the FDS I; FDS I (C): the 
combined properties of the two bellies as if they were a single muscle; FPL: flexor pollicis longus; PQ: pronator quadratus; 
PL: palmaris longus; PT: pronator teres, FL, fiber length; ML, muscle length. 

calculated based on a knowledge of the number of serial sarcomeres along the muscle length (Tables 22.3, 
22.4, and 22.5) multiplied by the maximum contraction velocity of an individual sarcomere (Table 22.2). 
Sarcomere shortening-velocity varies widely among species and fiber types (Table 22.2). 

Types of Muscle Models 

There are three general classes of models for predicting muscle force: (1) Huxley biochemical models, 
(2) Hill phenomenological models, and (3) constitutive models. Huxley style models attempt to deter- 
mine muscle force based on rates of attachment and detachment of muscle cross-bridges [Huxley, 1957]. 
Though this model is an excellent predictor of steady-state muscle force and energetics, it is generally 
computationally prohibitive to model a whole muscle in this manner. Constitutive models, such as those 
described by Ma and Zahalak [1991] model muscle behavior by describing populations of cross-bridges. 
While this represents a novel and potentially powerful approach, this technique has not yet been widely 
adopted. The vanguard of muscle modeling remains the phenomenological model first described by Hill 
[1938]. The Hill model describes a muscle as an active contractile component with series and parallel 
passive elastic components. The contractile component is generally described by its steady-state isometric 
force-length relation (Fig. 22.1) and isotonic force-velocity relation (Fig. 22.2). 

Muscle Force-Length Relationship 

Under conditions of constant length, the muscle force generated is proportional to the magnitude of the 
interaction between the actin and myosin contractile filament arrays. Since myosin filament length in all 
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TABLE 22.4 Architectural Properties of Human Lower Limb"' 







Muscle 


Muscle 


Fiber 


Pennation 


Cross-Sectional 






Muscle 




Mass (g) 


Length (mm) 


Length (mm) 


Angle (°) 


Area (cm 2 ) 


FL/ML Ratio 


RF (n = 


3) 


84.3 ± 14 


316 


± 5.7 


66.0 


± 1.5 


5.0 ± 


0.0 


12.7 ± 


1.9 


0.209 


± 0.002 


VL(n = 


'3) 


220 ± 56 


324 


± 14 


65.7 


±0.88 


5.0 ± 


0.0 


30.6 ± 


6.5 


0.203 


± 0.007 


VM(m 


= 3) 


175 ±41 


335 


± 15 


70.3 


±3.3 


5.0 ± 


0.0 


21.1 ± 


4.3 


0.210 


± 0.005 


VI (n = 


3) 


160 ± 59 


329 


± 15 


68.3 


±4.8 


3.3 ± 


1.7 


22.3 ± 


8.7 


0.208 


± 0.007 


SM (n = 


= 3) 


108 ± 13 


262 


± 1.5 


62.7 


±4.7 


15 ± 


2.9 


16.9 + 


1.5 


0.239 


± 0.017 


BF, (n = 


: 3) 


128 ± 28 


342 


± 14 


85.3 


±5.0 


0.0 ± 


0.0 


12.8 + 


2.8 


0.251 


± 0.022 


BF S (n = 


= 3) 




271 


+ 11 


139 


±3.5 


23 ± 


0.9 






0.517 


± 0.032 


ST(n = 


2) 


76.9 ± 7.7 


317 


± 4 


158 


±2.0 


5.0 ± 


0.0 


5.4 ± 


1.0 


0.498 


±0.0 


SOL(n 


= 2) 


215 
(n=l) 


310 


± 1.5 


19.5 


±0.5 


25 ± 


5.0 


58.C 
(n = 


) 
1) 


0.063 


± 0.002 


MG (a-- 


= 3) 


150 ± 14 


248 


± 9.9 


35.3 


±2.0 


16.7 ± 


4.4 


32.4 + 


3.1 


0.143 


± 0.010 


LG(n = 


3) 




217 


± 11 


50.7 


± 5.6 


8.3 ± 


1.7 






0.233 


± 0.016 


PLT (n ■■ 


= 3) 


5.30 ± 1.9 


85.0 


± 15 


39.3 


±6.7 


3.3 ± 


1.7 


1.2 + 


0.4 


0.467 


± 0.031 


FHL(n 


= 3) 


21.5 ± 3.3 


222 


±5.0 


34.0 


± 1.5 


10.0 ± 


2.9 


5.3 ± 


0.6 


0.154 


± 0.010 


FDL (n 


= 3) 


16.3 ± 2.8 


260 


± 15 


27.0 


±0.58 


6.7 ± 


1.7 


5.1 ± 


0.7 


0.104 


± 0.004 


PL(n = 


3) 


41.5 ± 8.5 


286 


± 17 


38.7 


±3.2 


10.0 ± 


0.0 


12.3 ± 


2.9 


0.136 


± 0.010 


PB(n = 


3) 


17.3 ± 2.5 


230 


± 13 


39.3 


±3.5 


5.0 ± 


0.0 


5.7 ± 


1.0 


0.170 


± 0.006 


TP (n = 


3) 


53.5 ± 7.3 


254 


± 26 


24.0 


± 4.0 


11.7 ± 


1.7 


20.8 ± 


3 


0.095 


± 0.015 


TA(n = 


3) 


65.7 ± 10 


298 


± 12 


77.3 


± 7.8 


5.0 ± 


0.0 


9.9 ± 


1.5 


0.258 


± 0.015 


EDL(n 


= 3) 


35.2 ± 3.6 


355 


± 13 


80.3 


± 8.4 


8.3 ± 


1.7 


5.6 ± 


0.6 


0.226 


± 0.024 


EHL(n 


= 3) 


12.9 ± 1.6 


273 


± 2.4 


87.0 


±8.0 


6.0 ± 


1.0 


1.8 ± 


0.2 


0.319 


± 0.030 


SAR(n 


= 3) 


61.7 ± 14 


503 


± 27 


455 


± 19 


0.0 ± 


0.0 


1.7 ± 


0.3 


0.906 


± 0.017 


GR(n = 


= 3) 


35.3 ± 7.4 


335 


± 20 


277 


± 12 


3.3 ± 


1.7 


1.8 + 


0.3 


0.828 


± 0.017 


AM (n^ 


= 3) 


229 ± 32 


305 


± 12 


115 


±7.9 


0.0 ± 


0.0 


18.2 ± 


2.3 


0.378 


± 0.013 


AL (n = 


3) 


63.5 ± 16 


229 


± 12 


108 


± 2.0 


6.0 ± 


1.0 


6.8 + 


1.9 


0.475 


± 0.023 


AB(n = 


'3) 


43.8 ± 8.4 


156 


± 12 


103 


± 6.4 


0.0 ± 


0.0 


4.7 ± 


1.0 


0.663 


± 0.036 


PEC(n 


= 3) 


26.4 ± 6.0 


123 


± 4.5 


104 


± 1.2 


0.0 ± 


0.0 


2.9 ± 


0.6 


0.851 


± 0.040 


POP (n 


= 2) 


20.1 ± 2.4 


108 


± 7.0 


29.0 


± 7.0 


0.0 ± 


0.0 


7.9 ± 


1.4 


0.265 


± 0.048 



" Data from Wickiewicz et al., 1983. 

b AB, adductor brevis; AL, adductor longus; AM, adductor magnus; BF P biceps femoris, long head; BF S , biceps 
femoris, short head; EDL, extensor digitorum longus; EHL, extensor hallucis longus; FDL, flexor digitorum longus; 
GR, gracilis; FHL, flexor hallucis longus; LG, lateral gastrocnemius; MG, medial gastrocnemius; PEC, pectineus; 
PB, peroneus brevis; PL, peroneus longus; PLT, plantaris; POP, popliteus; RF, rectus femoris; SAR, sartorius; SM, 
semimembranosus; SOL, soleus; ST, semitendinsus; TA, tibialis anterior; TP, tibialis posterior; VI, vastus interme- 
dius; VL, vastus lateralis; VM, vastus medialis, FL, fiber length; ML, muscle length. 

species is approximately 1.6 um and actin filament length varies (Table 22.6), optimal sarcomere length 
and maximum sarcomere length are calculated using these values. For optimal force generation, each 
half-myosin filament must completely overlap a half-actin filament, without opposing actin filaments 
overlapping. No active force is produced at sarcomere spacings shorter than the myosin filament length 
nor longer than the sum of actin and myosin filament lengths. The range of operating sarcomere lengths 
varies between muscles, but generally spans the range of optimal length ± 20% (Fig. 22.3). 

Muscle Force-Velocity Relationship 

Under conditions of constant contractile load the relationship between muscle force and velocity is given 
by the Hill equation [Hill, 1938]. The behavior of muscle is quite different when it is allowed to shorten 
(often termed "concentric" contraction) compared to when it is forced to lengthen (often termed "eccen- 
tric" contraction) and thus, the shortening force-velocity relation can be described by: 



(p+a)v=b(p-P) 



(22.1) 
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TABLE 22.5 Architectural Properties of Human Foot Muscles 1 ' 





Muscle 


Muscle 


Fiber 


Cross-Sectional 


Muscle 


Volume (cm 3 ) 


Length (mm) 


Length (mm) 


Area (cm 2 ) 


ABDH 


15.2 ± 


5.3 


115.8 


±4.9 


23.0 


± 5.5 


6.68 


± 2.07 


ABDM 


8.8 ± 


4.7 


112.8 


± 19.0 


23.9 


± 7.4 


3.79 


± 1.83 


ADHT 


1.1 ± 


0.6 


24.8 


±4.2 


18.7 


± 5.2 


0.62 


±0.26 


ADHO 


9.1 ± 


3.1 


67.4 


±4.6 


18.6 


± 5.3 


4.94 ± 1.36 


EDB2 


2.1 ± 


1.2 


69.8 


± 16.8 


28.0 


±6.5 


0.79 


±0.43 


EDB3 


1.3 ± 


0.7 


82.2 


±20.7 


26.4 


±5.1 


0.51 


± 0.30 


EDB4 


1.0 ± 


0.7 


70.4 


±21.1 


23.1 


±3.8 


0.44 


± 0.29 


EHB 


3.6 ± 


1.5 


65.7 


± 8.5 


27.9 


± 5.7 


1.34 


± 0.66 


FDB2 


4.5 ± 


2.3 


92.9 


± 15.0 


25.4 


±4.5 


1.78 


± 0.79 


FDB3 


3.2 ± 


1.5 


98.8 


± 18.1 


22.8 


±4.0 


1.49 


± 0.71 


FDB4 


2.6 ± 


1.0 


103.0 


±9.2 


20.8 


±4.5 


1.26 


± 0.47 


FDB5 


0.7 ± 


0.3 


83.2 


± 3.0 


18.2 


± 2.2 


0.35 


± 0.16 


FDMB 


3.4 ± 


1.7 


51.0 


± 5.3 


17.7 


± 3.8 


2.00 


± 1.02 


FHBM 


3.1 ± 


1.3 


76.0 


± 19.8 


17.5 


±4.8 


1.80 ± 0.75 


FHBL 


3.4 ± 


1.4 


65.3 


± 7.1 


16.5 ± 3.4 


2.12 


± 0.84 


DI1 


2.7 ± 


1.4 


51.0 


±4.9 


16.1 


±4.4 


1.70 


± 0.64 


DI2 


2.5 ± 


1.4 


49.9 


± 5.1 


15.3 


±4.0 


1.68 


±0.80 


DI3 


2.5 ± 


1.2 


44.3 


± 5.6 


15.6 


± 5.4 


1.64 


±0.58 


DI4 


4.2 ± 


2.0 


61.4 


±4.5 


16.0 


±4.8 


2.72 


± 1.33 


LB2 


0.6 ± 


0.4 


53.9 


± 11.8 


22.4 


± 6.5 


0.28 


± 0.17 


LB3 


0.5 ± 


0.4 


45.2 


± 8.7 


22.3 


± 6.7 


0.28 


± 0.09 


LB4 


0.6 ± 


0.4 


37.3 


± 19.9 


21.1 


±9.3 


0.30 


±0.32 


LB5 


0.4 ± 


0.4 


41.0 


± 12.1 


16.2 


±7.0 


0.18 


± 0.13 


PI1 


1.5 ± 


0.5 


46.2 


±4.0 


13.6 


± 3.7 


1.23 


± 0.65 


PI2 


1.9 ± 


0.7 


56.6 


±6.6 


13.9 


± 3.5 


1.41 


± 0.48 


PI3 


1.8 ± 


0.6 


48.8 


± 9.9 


14.2 


± 5.9 


1.38 


± 0.55 


QPM 


5.6 ± 


3.4 


81.3 


± 20.1 


27.5 


± 7.0 


1.96 


± 0.94 


QPL 


2.4 ± 


1.2 


55.3 


± 3.9 


23.4 


± 7.1 


1.00 


± 0.41 



a Data from Kura et al., 1997. 

b ABDH, abductor hallucis; FHBM flexor hallucis brevis medialis; FHBL, flexor 
hallucis brevis lateralis; ADHT, adductor hallucis transverse; ADHO, adductor 
hallucis oblique; ABDM, abductor digiti minimi; FDMB, flexor digiti minimi 
brevis; DI, dorsal interosseous; PI, plantar interosseous; FDB, flexor digitorum 
brevis; LB, lumbrical; QPM, quadratus plantaris medialis; QPL, quadratus plan- 
taris lateralis; EHB, extensor hallucis brevis; EDB, extensor digitorum brevis. 

where a and b are constants derived experimentally (usually about 0.25, Table 22.2), P is muscle force, 
P is maximum tetanic tension, and v is muscle velocity. The lengthening relation can be described by: 

V +V 

F = 1.8-0.8 — — . 

V -7.6 V 

max 

The dynamic parameters (a, b and V max ) vary with species and fiber type (Table 22.2). 

Tendon Biomechanics 

Tendons, as most biological structures, have highly nonlinear stress-strain properties. This tissue is 
composed of a network of collagen fibrils in a hydrated matrix of elastin and proteoglycans. At rest, the 
collagen fibrils are significantly crimped or wavy so that initial loading acts primarily to straighten these 
fibrils. At higher strains, the straightened collagen fibrils must be lengthened which requires greater stress. 
As a result, tendons are more compliant at low loads and less compliant at high loads. The highly nonlinear 
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FIGURE 22.1 Sarcomere length-tension curve elucidated for isolated frog skeletal muscle fibers by Gordon, Huxley, 
and Julian (1966). This demonstrates the length dependence of skeletal muscle on isometric strength. 
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FIGURE 22.2 Force-velocity relationship for isolated skeletal muscle for shortening and lengthening. These data 
are plots of Eq. 22.1 and 22.2 normalized for the maximum contraction velocity of the muscle. 

low-load region has been referred to as the "toe" region and occurs up to approximately 3% strain and 
5 MPa [Butler et al., 1979; Zajac, 1989]. Typically tendons have linear properties from about 3% strain 
until ultimate strain which is approximately 10% (Table 22.7). The tangent modulus in this linear region 
is approximately 1.5 GPa. Ultimate tensile stress reported for tendons is approximately 100 MPa 
(Table 22.7). Tendons operate physiologically at stresses of approximately 5 to 10 MPa (Table 22.7). Thus, 
tendons operate with a safety factor of 10 to 20 MPa. 
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TABLE 22.6 Actin Filament Lengths 



Species 


1/2 Actin filament length (pm) 


Ref. 


Human 


1.18 




Lieber etal., 1994 


Cat 


1.12 




Herzogetal., 1992 


Rat 


1.09 




Herzogetal., 1992 


Rabbit 


1.09 




Herzogetal., 1992 


Frog 


0.98 




Page and Huxley, 1963 


Monkey 


1.16 




Walker and Schrodt, 1973 


Human 


1.27 




Walker and Schrodt, 1973 


Hummingbird 


1.75 




Mathieu-Costello etal., 1992 


Chicken 


0.95 




Page, 1969 


Wild Rabbit 


1.12 




Dimery, 1985 


Carp 


0.98 




Sosnicki et al., 1991 



1.00 
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FIGURE 22.3 Sarcomere length excursion range during normal movement summarized from 48 different scientific 
studies. Since thin filament length varies among species, these lengths were normalized to the thin filament plus the 
thick filament length. Thick filaments are 1.65 pm long while thin filament length varies from 2.0 to 2.5 pm. 



TABLE 22.7 Tendon 


Biomechan 


ical Properties 










Ultimate 


Ultimate 


Stress 


Strain 








Stress 


Strain 


Under Normal 


Under Normal 


Tangent 




Tendon 


(MPa) 


(%) 


Loads (MPa) 


Loads (%) 


Modulus (GPa) 


Ref. 


Swine 


40 


9 








Woo etal., 1980 


Wallaby 






15-40 




1.56 


Bennett et al., 1986 


Porpoise 










1.53 


Bennett et al., 1986 


Dolphin 










1.43 


Bennett et al., 1986 


Deer 






28-74 




1.59 


Bennett et al., 1986 


Sheep 










1.65 


Bennett et al., 1986 


Donkey 






22-44 




1.25 


Bennett et al., 1986 


Human leg 






53 




1.0-1.2 


Bennett et al., 1986 


Cat leg 










1.21 


Bennett et al., 1986 


Pig tail 










0.9 


Bennett et al., 1986 


Rat tail 










0.8-1.5 


Bennett et al., 1986 


Horse 








4-10 




Ker et al., 1988 


Dog leg 






84 






Ker et al., 1988 


Camel ankle 






18 






Ker etal., 1988 


Human limb (various) 


60-120 










McElhaney et al., 1976 


Human calcaneal 


55 


9.5 








McElhaney et al., 1976 


Human wrist 


52-74 


11-17 


3.2-3.3 


1.5-3.5 




Loren and Lieber, 1994 
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Injury is a major societal problem in the United States. Approximately 140,000 fatalities occur each year 
due to both intentional and unintentional injuries. Two-thirds of these are unintentional, and of these, 
about one-half are attributable to automotive-related injuries. In 1993, the estimated number of auto- 
motive-related fatalities dipped under 40,000 for the first time in the last three decades due to a continuing 
effort by both the industry and government to render vehicles safer in crash situations. However, for 
people under 40 years of age, automotive crashes, falls, and other unintentional injuries are the highest 
risks of fatality in the United States in comparison with all other causes. 

The principal aim of impact biomechanics is the prevention of injury through environmental modi- 
fication, such as the provision of an airbag for automotive occupants to protect them during a frontal 
crash. To achieve this aim effectively, it is necessary that workers in the field have a clear understanding 
of the mechanisms of injury, be able to describe the mechanical response of the tissues involved, have some 
basic information on human tolerance to impact, and be in possession of tools that can be used as human 
surrogates to assess a particular injury [Viano et al., 1989]. This chapter deals with the biomechanics of 
blunt impact injury to the head and neck. 

23.1 Mechanisms of Injury 
Head Injury Mechanisms 

Among the more popular theories of brain injury due to blunt impact are changes in intracranial pressure 
and the development of shear strains in the brain. Positive pressure increases are found in the brain 
behind the site of impact on the skull. Rapid acceleration of the head, in-bending of the skull, and the 
propagation of a compressive pressure wave are proposed as mechanisms for the generation of intracranial 
compression that causes local contusion of brain tissue. At the contrecoup site, there is an opposite 
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response in the form of a negative-pressure pulse that also causes bruising. It is not clear as to whether 
the injury is due to the negative pressure itself (tensile loading) or to a cavitation phenomenon similar 
to that seen on the surfaces of propellers of ships (compression loading). The pressure differential across 
the brain necessarily results in a pressure gradient that can give rise to shear strains developing within 
the deep structures of the brain. Furthermore, when the head is impacted, it not only translates but also 
rotates about the neck. It is postulated that the relative motion of the brain surface with respect to the 
rough inner surface of the skull results in surface contusions and the tearing of bridging veins between 
the brain and the dura mater, the principal membrane protecting the brain beneath the skull. Gennarelli 
[1983] has found that rotational acceleration of the head can cause a diffuse injury to the white matter 
of the brain in animal models, as evidenced by retraction balls developing along the axons of injured 
nerves. This injury was described by Strich [1961] as diffuse axonal injury (DAI) that she found in the 
white matter of autopsied human brains. Other researchers, including Lighthall et al. [1990], have been 
able to cause the development of DAI in the brain of an animal model (ferrets) by the application of 
direct impact to the brain without the associated head angular acceleration. Adams et al. [1986] indicated 
that DAI is the most important factor in severe head injury because it is irreversible and leads to 
incapacitation and dementia. It is postulated that DAI occurs as a result of the mechanical insult but 
cannot be detected by staining techniques at autopsy unless the patient survives the injury for at least 
several hours. 

Neck Injury Mechanisms 

The neck or the cervical spine is subjected to several forms of unique injuries that are not seen in the 
thoracolumbar spine. Injuries to the upper cervical spine, particularly at the atlanto-occipital joint, are 
considered to be more serious and life-threatening than those at the lower level. The atlanto-occipital 
joint can be dislocated either by an axial torsional load or a shear force applied in the anteroposterior 
direction, or vice versa. A large compression force can cause the arches of CI to fracture, breaking it up 
into two or four sections. The odontoid process of C2 is also a vulnerable area. Extreme flexion of the 
neck is a common cause of odontoid fractures, and a large percentage of these injuries are related to 
automotive accidents [Pierce and Barr, 1983] . Fractures through the pars interarticularis of C2, commonly 
known as "hangman's fractures" in automotive collisions, are the result of a combined axial compression 
and extension (rearward bending) of the cervical spine. Impact of the forehead and face of unrestrained 
occupants with the windshield can result in this injury. Garfin and Rothman [1983] discussed this injury 
in relation to hanging and traced the history of this mode of execution. It was estimated by a British 
judiciary committee that the energy required to cause a hangman's fracture was 1708 N-m (1260 ft-lb). 
In automotive-type accidents, the loading on the neck due to head contact forces is usually a combi- 
nation of an axial or shear load with bending. Bending loads are almost always present, and the degree 
of axial of shear force depends on the location and direction of the contact force. For impacts near the 
crown of the head, compressive forces predominate. If the impact is principally in the transverse plane, 
there is less compression and more shear. Bending modes are infinite in number because the impact can 
come from any angle around the head. To limit the scope of the discussion, the following injury modes 
are considered: tension-flexion, tension-extension, compression-flexion and compression-extension in 
the midsagittal plane and lateral bending. 

Tension-Flexion Injuries. Forces resulting from inertial loading of the head-neck system can result in 
flexion of the cervical spine while it is being subjected to a tensile force. In experimental impacts of 
restrained subjects undergoing forward deceleration, Thomas and Jessop [1983] reported atlanto-occip- 
ital separation and CI — C2 separation occurring in subhuman primates at 120 g. Similar injuries in 
human cadavers were found at 34 to 38 gby Cheng et al. [1982], who used a preinflated driver airbag 
system that restrained the thorax but allowed the head and neck to rotate over the bag. 



© 2000 by CRC Press LLC 



Tension-Extension Injuries. The most common type of injury due to combined tension and extension 
of the cervical spine is the "whiplash" syndrome. However, a large majority of such injuries involve the 
soft tissues of the neck, and the pain is believed to reside in the joint capsules of the articular facets of 
the cervical vertebrate [Lord et al., 1993]. In severe cases, teardrop fractures of the anterosuperior aspect 
of the vertebral body can occur. Alternately, separation of the anterior aspect of the disk from the vertebral 
endplate is known to occur. More severe injuries occur when the chin impacts the instrument panel or 
when the forehead impacts the windshield. In both cases, the head rotates rearward and applies a tensile 
and bending load on the neck. In the case of windshield impact by the forehead, hangman's fracture of 
C2 can occur. Garfin and Rothman [1983] suggested that it is caused by spinal extension combined with 
compression on the lamina of C2, causing the pars to fracture. 

Compression-Flexion Injuries. When a force is applied to the posterosuperior quadrant of the head or 
when a crown impact is administered while the head is in flexion, the neck is subjected to a combined 
load of axial compression and forward bending. Anterior wedge fractures of vertebral bodies are com- 
monly seen, but with increased load, burst fractures and fracture-dislocations of the facets can result. 
The latter two conditions are unstable and tend to disrupt or injure the spinal cord, and the extent of 
the injury depends on the penetration of the vertebral body or its fragments into the spinal canal. Recent 
experiments by Pintar et al. [1989, 1990] indicate that burst fractures of lower cervical vertebrae can be 
reproduced in cadaveric specimens by a crown impact to a flexed cervical spine. A study by Nightingale 
et al. [1993] showed that fracture-dislocations of the cervical spine occur very early in the impact event 
(within the first 10 ms) and that the subsequent motion of the head or bending of the cervical spine 
cannot be used as a reliable indicator of the mechanism of injury. 

Compression-Extension Injuries. Frontal impacts to the head with the neck in extension will cause 
compression-extension injuries. These involve the fracture of one or more spinous processes and, possibly, 
symmetrical lesions of the pedicles, facets, and laminae. If there is a fracture-dislocation, the inferior 
facet of the upper vertebra is displaced posteriorly and upward and appears to be more horizontal than 
normal on x-ray. 

Injuries Involving Lateral Bending. If the applied force or inertial load on the head has a significant 
component out of the midsagittal plane, the neck will be subjected to lateral or oblique along with axial 
and shear loading. The injuries characteristic of lateral bending are lateral wedge fractures of the vertebral 
body and fractures to the posterior elements on one side of the vertebral column. 

Whenever there is lateral or oblique bending, there is the possibility of twisting the neck. The associated 
torsional loads may be responsible for unilateral facet dislocations or unilateral locked facets [Moffat et 
al., 1978]. However, the authors postulated that pure torsional loads on the neck are rarely encountered 
in automotive accidents. It was shown by Wismans and Spenny [1983] that, in a purely lateral impact, 
the head rotated axially about the cervical axis while it translated laterally and vertically and rotated 
about an anteroposterior axis. These responses were obtained from lateral impact tests performed by the 
Naval Biodynamics Laboratory on human subjects who were fully restrained at and below the shoulders. 

23.2 Mechanical Response 
Mechanical Response of the Head 

A large number of cadaveric studies on blunt head impact have been carried out over the past 50 years. 
The head was impacted by rigid and padded surfaces and by impactors of varying shapes to simulate flat 
surfaces and knobs encountered in the automotive environment. In general, the impact responses were 
described in terms of head acceleration or impact force. Both of these responses are dependent on a 
variety of factors, including the inertial properties of the head and the surface impacted by the head. In 
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TABLE 23.1 Average Male Head Mass 







No. of 


Average Body 


Mass 


Average Head Mass 


Reference 




Subjects 


(kg) 




(kg) 


Walker et al., 1973 




16 


67.1 




4.49 


Hubbard and McLeod, 


1974 


11 






4.54 


Reynolds et al., 1975 




6 


65.2 




3.98 


Adjusted per HMRTF 




6 


76.9 




4.69 


Beier et al., 1980 




19 


74.7 




4.32 


McConville et al, 1980 




31 


77.5 




4.55* 


Robbins, 1983 




25 


76.7 




4.54* 



* Based on adjusted head volume of 95% of the reported head volume (4396 cm 3 ) 
and a head specific gravity of 1.097. 

f Based on an estimated head volume of 4137 cm' and a head specific gravity of 1.097. 



TABLE 23.2 Average Mass Moments of Inertia* of the Male 
Head (kg-m 2 x 1Q- 3 ) 



Reference 


L 


l yy 


I zz 


Walker et al., 1973 




2?>3 




Hubbard and McLeod, 1974 


17.4 


16.4 


20.3 


Adjusted per HMRTF 


22.6 


21.3 


26.3 


Beier et al., 1980 (16 male subjects only) 


20.7 


22.6 


14.9 


McConville et al., 1980 


20.4 


23.2 


15.1 


Adjusted by sp. gr. 1.097 


22.4 


25.5 


16.6 


Robbins, 1983 


20.0 


22.2 


14.5 


Adjusted by sp. gr. 1.097 


22.0 


24.2 


15.9 



* The mass moments of inertia given are about the x, y, or z 
anatomic axes through the center of gravity of the head. 

this section, the inertial properties of the head will be described, and response data for head impact 
against a flat, rigid surface will be provided. It should be noted that while response data against surfaces 
with a variety of shapes and stiffnesses are of interest, the only generally applicable and reproducible data 
are those of impacts to flat, rigid surfaces. 

Inertial Properties of the Head. There are several sources of data on the inertial properties of the 
human head. Mass data are shown in Table 23.1, while mass moment of inertial data can be found in 
Table 23.2. The data by Walker et al. [1973] shown in Table 23.1 were analyzed by Hubbard and McLeod 
[1974], who found that 16 of the 20 heads used by Walker et al. [1973] had dimensions that were close 
to those of the average male and provided the data for the average body and head mass. Similarly, Hubbard 
and McLeod [1974] analyzed the head-mass data of Hodgson and Thomas [1971] and Hodgson et al. 
[ 1 972 ] for 1 1 heads, the dimensions of which were consistent with those of a fiftieth percentile male head 
and found an average value of 4.54 kg, as shown in Table 23.1. The data of Reynolds et al. [1975] were 
obtained from 6 cadavers that were of low body weight. The Human Mechanical Response Task Force 
(HMRTF) of the Society of Automotive Engineers (SAE) made an adjustment for the body and head 
mass, and the results are shown just below the original data of Reynolds et al. [1975] in Table 23.1. The 
data of Beier et al. [1980] are based on their original study. The studies of McConville et al. [1980] and 
Robbins [1983] were derived from anthropometric measurement of living subjects. In these studies, head 
mass was estimated from volumetric measurements and a previously determined value of the specific 
gravity of the head — 1.097. For the average values of the mass moment of inertia, shown in Table 23.2, 
the origin of the coordinate axes is at the center of gravity (eg) of the head, and the x axis is in the 
posteroanterior direction, while the z axis is in the inferosuperior direction and the y axis is in a lateral 
direction. The adjustments made in Table 23.1 are applicable to Table 23.2 as well. 
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FIGURE 23.1 Impact response of the head in terms of peak force. 
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FIGURE 23.2 Impact response of the head in terms of peak acceleration. 

Cranial Impact Response. Impact response of the head against a flat, rigid surface was obtained by 
Hodgson and Thomas [1973, 1975], who performed a series of drop tests using embalmed cadavers. The 
responses in terms of peak force and acceleration are shown in Figs. 23.1 and 23.2, respectively, as a 
function of an equivalent free-fall drop height. Details of adjustments made to the original data are 
described by Prasad et al. [1985]. It should be noted that the data shown were from the frontal, lateral, 
and occipital directions and that there was a large scatter in the peak values. For this reason, the data 
were pooled, and individual data points were not shown. 

The difficulty with acceleration measurements in head impact is twofold. The head is not a rigid body, 
and accelerometers cannot be mounted at the center of gravity of the head. The center of gravity 
acceleration can be computed if head angular acceleration is measured, but the variation in skull stiffness 
cannot be corrected for easily. It is recommended that in all future head-impact studies, head angular 
acceleration be measured. Several methods for measuring this parameter have been proposed. At present, 
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the most reliable method appears to be that proposed by Padgaonkar et al. [1975] using an array of 9 
linear accelerometers arranged in a 3-2-2-2 cluster. 

The data presented in this section do not refer to the response of the brain during an injury-producing 
impact. For intact heads, the motion of the brain inside the skull has not been studied exhaustively. There 
is evidence that relative motion of the brain with respect to the skull occurs [Nusholtz et al., 1984], 
particularly during angular acceleration of the head. However, this motion does not fully explain injuries 
seen in the center of the brain and in the brainstem. More research is needed to explore the mechanical 
response of the brain to both linear and angular acceleration and to relate this response to observed 
injuries, such as diffuse axonal injury. 

Mechanical Response of the Neck 

The mechanical response of the cervical spine was studied by Mertz and Patrick [1967, 1971], Patrick 
and Chou [1976], Schneider et al. [1975], and Ewing et al. [1978]. Mertz et al. [1973] quantified the 
response in terms of rotation of the head relative to the torso as a function of bending moment at the 
occipital condyles. Loading corridors were obtained for flexion and extension, as shown in Figs. 23.3 and 
23.4. An exacting definition of the impact environments to be used in evaluating dummy necks relative 
to the loading corridors illustrated in these figures is included in SAE J 1460 [1985]. It should be noted 
that the primary basis for these curves is volunteer data and that the extension of these corridors to 
dummy tests in the injury-producing range is somewhat surprising. Static and dynamic lateral response 
data were provided by Patrick and Chou [1976]. A response envelope for lateral flexion is shown in 
Fig. 23.5. A limited amount of the voluminous data obtained by Ewing et al. [1978] (six runs) was 
analyzed by Wismans and Spenny [1983, 1984] for lateral and sagittal flexion. The rotations were 
represented in three dimensions by a rigid link of fixed length pivoted at Tl at the bottom and within 
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FIGURE 23.3 Loading corridor for neck flexion (forward bending). 
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FIGURE 23.4 Loading corridor for neck extension (rearward bending). 
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FIGURE 23.5 Lateral flexion response envelop. 
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FIGURE 23.6 The Wayne State Tolerance Curve for head injury. 

the head at the top. In terms of torque at the occipital condyles and head rotation, the results fell within 
the corridor for forward and lateral flexion shown in Figs. 23.3 and 23.5, respectively. 

23.3 Regional Tolerance of the Head 
and Neck to Blunt Impact 

Regional Tolerance of the Head 

The most commonly measured parameter for head injury is acceleration. It is therefore natural to express 
human tolerance to injury in terms of head acceleration. The first known tolerance criterion is the Wayne 
State Tolerance Curve, proposed by Lissner et al. [1960] and subsequently modified by Patrick et al. 
[1965] by the addition of animal and volunteer data to the original cadaveric data. The modified curve 
is shown in Fig. 23.6. The head can withstand higher accelerations for shorter durations, and any exposure 
above the curve is injurious. When this curve is plotted on logarithmic paper, it becomes a straight line 
with a slope of -2.5. This slope was used as an exponent by Gadd [1961] in his proposed severity index, 
now known as the Gadd Severity Index (GSI): 



GST 



■JV 

Jo 



dt 
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where a = instantaneous acceleration of the head 
T = duration of the pulse 

If the integrated value exceeds 1 000, a severe injury will result. A modified form of the GSI, now known 
as the Head Injury Criterion (HIC), was proposed by Versace [1970] to identify the most damaging part 
of the acceleration pulse by finding the maximum value of the following integral: 



HIC = 






where a(t) 
U- 



= resultant instantaneous acceleration of the head 
^ = time interval over which HIC is a maximum 



A severe but not life-threatening injury would have occurred if the HIC reached or exceeded 1000. 
Subsequently, Prasad and Mertz [1985] proposed a probabilistic method of assessing head injury and 
developed the curve shown in Fig. 23.7. At an HIC of 1000, approximately 16% of the population would 
sustain a severe to fatal injury. It is apparent that this criterion is useful in automotive safety design and 
in the design of protective equipment for the head, such as football and bicycle helmets. However, there 
is another school of thought that believes in the injurious potential of angular acceleration in its ability 
to cause cerebral contusion of the brain surface and rupture of the parasagittal bridging veins between 
the brain and the dura mater. A proposed limit for angular acceleration is 4500 rad/s 2 , based on a 
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FIGURE 23.7 Head injury risk curve based on HIC. 



© 2000 by CRC Press LLC 



mathematical model developed by Lowenhielm [1974]. This limit has not received universal acceptance. 
Many other criteria have been proposed, but HIC is the current criterion for Federal Motor Vehicle Safety 
Standard (FMVSS) 214, and attempts to replace it have so far been unsuccessful. 

Regional Tolerance of the Neck 

Currently, there are no tolerance values for the neck for the various injury modes. This is not due to a 
lack of data but rather to the many injury mechanisms and several levels of injury severity, ranging from 
life-threatening injuries to the spinal cord to minor soft-tissue injuries that cannot be identified on 
radiographic or magnetic scans. It is likely that a combined criterion of axial load and bending moment 
about one or more axes will be adopted as a future FMVSS. 

23.4 Human Surrogates of the Head and Neck 

The Experimental Surrogate 

The most effective experimental surrogate for impact biomechanics research is the unembalmed cadaver. 
This is also true for the head and neck, despite the fact that the cadaver is devoid of muscle tone because 
the duration of impact is usually too short for the muscles to respond adequately. It is true, however, 
that muscle pretensioning in the neck may have to be added under certain circumstances. Similarly, for 
the brain, the cadaveric brain cannot develop DAI, and the mechanical properties of the brain change 
rapidly after death. If the pathophysiology of the central nervous system is to be studied, the ideal 
surrogate is an animal brain. Currently, the rat is frequently used as the animal of choice, and there is 
some work in progress using the minipig. 

The Injury-Assessment Tool 

The response and tolerance data acquired from cadaveric studies are used to design human-like surro- 
gates, known as anthropomorphic test devices (ATD). These surrogates are required to have biofidelity, 
the ability to simulate human response, but also need to provide physical measurements that are repre- 
sentative of human injury. In addition, they are designed to be repeatable and reproducible. The current 
frontal impact dummy is the Hybrid III, which is human-like in many of its responses, including that 
of the head and neck. The head consists of an aluminum headform covered by an appropriately designed 
vinyl "skin" to yield human-like acceleration responses for frontal and lateral impacts against a flat, rigid 
surface. Two-dimensional physical models of the brain were proposed by Margulies et al. [1990] using 
a silicone gel in which preinscribed grid lines would deform under angular acceleration. No injury 
criterion is associated with this gel model. 

The dummy neck was designed to yield responses in flexion and extension that would fit within the 
corridors shown in Figs. 23.3 and 23.4. The principal function of the dummy neck is to place the head 
in the approximate position of a human head in the same impact involving a human occupant. 

Computer Models 

Models of head impact first appeared over 50 years ago [Holbourn, 1943]. Extensive reviews of such 
models were made by King and Chou [1977] and Hardy et al. [1994]. The use of the finite-element 
method (FEM) to simulate the various components of the head appears to be the most effective and 
popular means of modeling brain response. Despite the large numbers of nodes and elements used, the 
models are still not detailed enough to predict the location of DAI development following a given impact. 
The research is also hampered by the limited amount of animal DAI data currently available. 

A large number of neck and spinal models also have been developed over the past four decades. A 
recent paper by Kleinberger [1993] provides a brief and incomplete review of these models. However, 
the method of choice for modeling the response of the neck is the finite-element method, principally 
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because of the complex geometry of the vertebral components and the interaction of several different 
materials. A fully validated model for impact responses is still not available. 
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Injury is caused by energy transfer to the body by an impacting object. It occurs when the body is struck 
by a blunt object, such as a vehicle instrument panel or side interior, and sufficient force is concentrated 
on the chest or abdomen. The risk of injury is influenced by the object's shape, stiffness, point of contact, 
and orientation. It can be reduced by energy-absorbing padding or crushable materials, which allow the 
surfaces in contact to deform, extend the duration of impact, and reduce loads. The torso is viscoelastic, 
so reaction force substantially increases with the speed of body deformation. 

The biomechanical response of the body has three components: (1) inertial resistance by acceleration 
of body masses, (2) elastic resistance by compression of stiff structures and tissues, and (3) viscous 
resistance by rate-dependent properties of tissue. For low impact speeds, the elastic stiffness protects 
from crush injuries, whereas, for high rates of body deformation, the inertial and viscous properties 
determine the force developed and limit deformation. The risk of skeletal and internal organ injury relates 
to energy stored or absorbed by the elastic and viscous properties. The reaction load is related to these 
responses and inertial resistance of body masses that combine to resist deformation and prevent injury. 
When tissues are deformed beyond their recoverable limit, injuries result. 

24.1 Chest and Abdomen Injury Mechanisms 

The primary mechanism of chest and abdomen injury is compression of the body at high rates of loading. 
This causes deformation and stretching of internal organs and vessels. When the compression of the 
torso exceeds the ribcage tolerance, fractures occur and internal organs and vessels can be contused or 
ruptured. In some chest impacts, however, internal injury occurs without skeletal damage. This can 
happen during high-speed loading. It is due to the viscous or rate-sensitive nature of human tissue as 
biomechanical responses differ for low- and high-speed impact. 

When organs or vessels are loaded slowly, the input energy is absorbed gradually through deformation, 
which is resisted by elastic properties and pressure buildup in tissue. When loaded rapidly, reaction force 
is proportional to the speed of tissue deformation as the viscous properties of the body resist deformation 
and provide a natural protection from impact. However, there is also a considerable inertial component 
to the reaction force. In this case, the body develops high internal pressure, and injuries can occur before 
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the ribs deflect much. The ability of an organ or other biologic system to absorb impact energy without 
failure is called tolerance. 

If an artery is stretched beyond its tensile strength, the tissue will tear. Organs and vessels can be 
stretched in different ways, which result in different types of injury. Motion of the heart during chest 
compression stretches the aorta along its axis from points of tethering in the body. This elongation 
generally leads to a transverse laceration when the strain limit is exceeded. In contrast, an increase in 
vascular pressure dilates the vessel and produces biaxial strain that is larger in the transverse than axial 
direction. If pressure rises beyond the vessel's limit, it will burst. For severe impacts, intraaortic pressure 
exceeds 500 to 1000 mmHg, which is a significant, nonphysiologic level but is tolerable for short durations. 
When laceration occurs, the predominant mode of aortic failure is axial, so the combined effects of stretch 
and internal pressure contribute to injury. Chest impact also compresses the ribcage, causing tensile 
strain on the outer surface of the ribs. As compression increases, the risk of rib fracture increases. In 
both cases, the mechanism of injury is tissue deformation. 

The abdomen is more vulnerable to injury than the chest because there is little bony structure below 
the ribcage to protect internal organs in front and lateral impacts. Blunt impact of the upper abdomen 
can compress and injure the liver and spleen before significant whole-body motion occurs. In the liver, 
compression increases intrahepatic pressure and generates tensile or shear strains. If the tissue is suffi- 
ciently deformed, laceration of the major hepatic vessels can result in hemoperitoneum. Abdominal 
deformation also causes lobes of the liver to move relative to each other, stretching and shearing the 
vascular attachment at the hilar region. 

Effective occupant restraints, safety systems, and protective equipment not only spread impact energy 
over the strongest body structures but also reduce contact velocity between the body and the impacted 
surface or object. The design of protective systems is aided by an understanding of injury mechanisms, 
quantification of human tolerance levels, and development of numerical relationships between measur- 
able engineering parameters, such as force, acceleration, or deformation, and injury. These relationships 
are called injury criteria. 



24.2 Injury Tolerance Criteria 



Acceleration Injury 

Stapp [1970] conducted rocket-sled experiments on belt-restraint systems and achieved a substantial 
human tolerance to long-duration, whole-body acceleration. Safety belts protected military personnel 
exposed to rapid but sustained acceleration. The experiments enabled Eiband [1959] to show in Fig. 24.1 
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FIGURE 24.1 Whole-body human tolerance to vehicle acceleration based on impact duration. (Redrawn from 
Eiband [1959] and from Viano [1988], with permission.) 
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that the tolerance to whole-body acceleration increased as the exposure duration decreased. This linked 
human tolerance and acceleration for exposures of 2- to 1000-ms duration. The tolerance data are based 
on average sled acceleration rather than the acceleration of the volunteer subject, which would be higher 
due to compliance of the restraint system. Even with this limitation, the data provide useful early 
guidelines for the development of military and civilian restraint systems. 

More recent side impact tests have led to other tolerance formulas for chest injury. Morgan et al. 
[1986] evaluated rigid, side-wall cadaver tests and developed TTI, a thoracic trauma index, which is the 
average rib and spine acceleration. TTI limits human tolerance to 85 to 90g in vehicle crash tests. Better 
injury assessment was achieved by Cavanaugh [1993] using average spinal acceleration (ASA), which is 
the average slope of the integral of spinal acceleration. ASA is the rate of momentum transfer during 
side impact, and a value of 30g is proposed. In most cases, the torso can withstand 60 to 80g peak whole- 
body acceleration by a well-distributed load. 

Force Injury 

Whole-body tolerance is related to Newton's second law of motion, where acceleration of a rigid mass 
is proportional to the force acting on it, or F = ma. While the human body is not a rigid mass, a well- 
distributed restraint system allows the torso to respond as though it were fairly rigid when load is applied 
through the shoulder and pelvis. The greater the acceleration, the greater is the force and risk of injury. 
For a high-speed frontal crash, a restrained occupant can experience 60g acceleration. For a body mass 
of 76 kg, the inertial load is 44.7 kN (10,000 lb) and is tolerable if distributed over strong skeletal elements. 

The ability to withstand high acceleration for short durations implies that tolerance is related to 
momentum transfer, because an equivalent change in velocity can be achieved by increasing the accel- 
eration and decreasing its duration, since AV = aAt. The implication for occupant protection systems is 
that the risk of injury can be decreased if the crash deceleration is extended over a greater period of time. 
For occupant restraint in 25 ms, a velocity change of 14.7 m/s (32.7 mi/h) occurs with 60g whole-body 
acceleration. This duration can be achieved by crushable vehicle structures and occupant restraints. 

Prior to the widespread use of safety belts, safety engineers needed information on the tolerance of 
the chest to design energy- absorbing instrument panels and steering systems. The concept was to limit 
impact force below human tolerance by crushable materials and structures. Using the highest practical 
crush force, safety was extended to the greatest severity of vehicle crashes. GM Research and Wayne State 
University collaborated on the development of the first crash sled that was used to simulate progressively 
more severe frontal impacts. Embalmed human cadavers were exposed to head, chest, and knee impact 
on 15-cm-diameter (6-in) load cells until bone fracture was observed on x-ray. Patrick [1965] demon- 
strated that blunt chest loading of 3.3 kN (740 lb) could be tolerated with minimal risk of serious injury. 
This is a pressure of 187 kPa. Gadd and Patrick [1968] later found a tolerance of 8.0 kN (1800 lb) if the 
load was distributed over the shoulders and chest by a properly designed steering wheel and column. 
Cavanaugh [1993] found that side-impact tolerance is similar to frontal tolerance and that shoulder 
contact is also an important load path. 

Compression Injury 

High-speed films of cadaver impacts show that whole-body acceleration does not describe torso impact 
biomechanics. Tolerance of the chest and abdomen must consider body deformation. Force acting on the 
body causes two simultaneous responses: (1) compression of the complaint structures of the torso and 
(2) acceleration of body masses. The previously neglected mechanism of injury was compression, which 
causes the sternum to displace toward the spine as ribs bends and possibly fracture. Acceleration and force 
per se are not sufficient indicators of impact tolerance because they cannot discriminate between the two 
responses. Numerous studies have shown that acceleration is less related to injury than compression. 

The importance of chest deformation was confirmed by Kroell [1971, 1974] in blunt thoracic impacts 
of unembalmed cadavers. Peak spinal acceleration and impact force were poorer injury predictors than 
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FIGURE 24.2 (a) Injury severity from blunt impact of human cadavers as a function of the maximum chest 
compression. (From Viano [1988], with permission.) (b) Severity of skeletal injury and incidence of internal organ 
injury as a function of maximum chest compression for blunt impacts of human cadavers. (From Viano [1988], with 
permission.) 

the maximum compression of the chest, as measured by the percentage change in the anteroposterior 
thickness of the body. A relationship between injury risk and compression involves energy stored by 
elastic deformation of the body. Stored energy E s by a spring representing the ribcage and soft tissues is 
related to the displacement integral of force: £ s = )F dx. Force in a spring is proportional to deformation: 
F= kx, where k is a spring constant. Stored energy is E s = k\x dx = Q.bkx 2 . Over a reasonable range, stored 
energy is proportional to deformation or compression, so E s ~ C. 

Tests with human volunteers showed that compression up to 20% during moderate-duration loading 
was fully reversible. Cadaver impacts with compression greater than 20% (Fig. 24.2a) an increase in rib 
fractures and internal organ injury as the compression increased to 40%. The deflection tolerance was 
originally set at 8.8 cm (3.5 in) for moderate but recoverable injury. This represents 39% compression. 
However, at this level of compression, multiple rib fractures and serious injury can occur, so a more 
conservative tolerance of 34% is used to avert the possibility of flail chest (Fig. 24.2b). This reduces the risk 
of direct loading on the heart, lungs, and internal organs by a loss of the protective function of the ribcage. 

Viscous Injury 

The velocity of body deformation is an important factor in impact injury. For example, when a fluid- 
filled organ is compressed slowly, energy can be absorbed by tissue deformation without damage. When 
loaded rapidly, the organ cannot deform fast enough, and rupture may occur without significant change 
in shape, even though the load is substantially higher than for the slow-loading condition. 

The viscoelastic behavior of soft tissues is important when the velocity of deformation exceeds 3 m/s. 
For lower speeds, such as in slow crushing loads or for a belt-restrained occupant in a frontal crash, 
tissue compression is limited by elastic properties resisting skeletal and internal organ injury. For higher 
speeds of deformation, such as occupant loading by the door in a side impact or for an unrestrained 
occupant or pedestrian, maximum compression does not adequately address the viscous and inertial 
properties of the torso, nor the time of greatest injury risk. In these conditions, the tolerance to com- 
pression is progressively lower as the speed of deformation increases, and the velocity of deformation 
becomes a dominant factor in injury. 

Insight on a rate-dependent injury mechanism came from over 20 years of research by Clemedson 
and Jonsson [1979] on high-speed impact and blast-wave exposures. The studies confirmed that tolerable 
compression inversely varied with the velocity of impact. The concept was studied further in relation to 
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the abdomen by Lau [1981] for frontal impacts in the range of 5 to 20 m/s (10 to 45 mi/h). The liver 
was the target organ. Using a maximum compression of 16%, the severity of injury increased with the 
speed of loading, including serious mutilation of the lobes and major vessels in the highest-speed impacts. 
While the compression was within limits of volunteer loading at low speeds, the exposure produced 
critical injury at higher speeds. Subsequent tests on other animals and target organs verified an interre- 
lationship between body compression, deformation velocity, and injury. 

The previous observations led Viano and Lau [1988] to propose a viscous injury mechanism for soft 
biologic tissues. The viscous response VC is defined as the product of velocity of deformation V and 
compression C, which is a time-varying function in an impact. The parameter has physical meaning to 
absorbed energy E a by a viscous dashpot under impact loading. Absorbed energy is related to the 
displacement integral of force: E a = \F dx, and force in a dashpot is proportional to the velocity of 
deformation: F = cV, where c is a dashpot parameter. Absorbed energy is E a = c\V dx, or a time integral 
by substitution: E a = cjV 2 dt. The integrand is composed of two responses, so E a = c[\d( Vx) - \ax dt], 
where a is acceleration across the dashpot. The first term is the viscous response, and the second is an 
inertial term related to the deceleration of fluid set in motion. Absorbed energy is given by E a = c(Vx — 
lax dt). The viscous response is proportional to absorbed energy, or E a ~ VC, during the rapid phase of 
impact loading prior to peak compression. 

Subsequent tests by Lau and Viano [1988, 1986] verified that serious injury occurred at the time of 
peak VC. For blunt chest impact, peak VC occurs in about half the time for maximum compression. Rib 
fractures also occur progressively with chest compression, as early as 9 to 14 ms — at peak VC — in a 
cadaver impact requiring 30 ms to reach peak compression. Upper abdominal injury by steering wheel 
contact also relates to viscous loading. Lau [1987] showed that limiting the viscous response by a self- 
aligning steering wheel reduced the risk of liver injury, as does force limiting an armrest in side impacts. 
Animal tests also have shown that VC is a good predictor of functional injury to the heart and respiratory 
systems. In these experiments, Stein [1982] found that the severity of cardiac arrhythmia and traumatic 
apnea was related to VC. This situation is important to baseball impact protection of children [Viano 
et al., 1992] and in the design of bulletproof protective vests [Quatros, 1994]. 

Figure 24.3 summarizes injury mechanisms associated with impact deformation. For low speeds of 
deformation, the limiting factor is crush injury from compression C of the body. This occurs at C = 35 to 
40% depending on the contact area and orientation of loading. For deformation speeds above 3 m/s, 
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FIGURE 24.3 Biomechanics of chest injury by a crushing injury mechanism limited by tolerable compression at 
C max = 35%, a viscous injury mechanism limited by the product of velocity and extent of deformation at VC m „ = 
1.0 m/s, and a blast injury mechanism for shock- wave loading. 
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injury is related to peak viscous response of VC = 1.0 m/s. In a particular situation, injury can occur by 
a compression or viscous responses, or both, since these responses occur at different times in an impact. 
At extreme rates of loading, such as in a blast-wave exposure, injury occurs with less than 10 to 15% 
compression by high energy transfer to viscous elements of the body. 

24.3 Biomechanical Responses During Impact 

The reaction force developed by the chest varies with the velocity of deformation and biomechanics is 
best characterized by a family of force-deflection responses. Figure 24.4 summarizes frontal and lateral 
chest biomechanics for various impact speeds. The dynamic compliance is related to viscous, inertial, 
and elastic properties of the body. The initial rise in force is due to inertia as the sternal mass is rapidly 
accelerated to the impact speed. The plateau force is related to the viscous component, which is rate- 
dependent, and a superimposed elastic stiffness that increases force with chest compression. Unloading 
provides a hysteresis loop representing the energy absorbed by body deformation. 

Melvin [1988] analyzed frontal biomechanics and modeled the force-deflection response as an initial 
stiffness A = 0.26 + 0.60(V- 1.3) and a plateau force B = 1.0 + 0.75( V- 3.7), where A is in kN/cm, B 
is in kN, and V in m/s. The force B reasonably approximates the plateau level for lateral chest and 
abdominal impacts, but the initial stiffness is considerably lower at A = 0.12( V- 1.2) for side loadings. 

A simple, but relevant, lumped-mass model of the chest was developed by Lobdell [1973] and is shown 
in Fig. 24.5. The impacting mass is m,, and skin compliance is represented by k l2 . An energy-absorbing 
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FIGURE 24.4 Frontal and lateral force-deflection response of the human cadaver chest at various speeds of blunt 
pendulum impact. The initial stiffness is followed by a plateau force until unloading. (From Kroell [1974] and Viano 
[1989], summarized by Cavanaugh [1993], with permission.) 
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FIGURE 24.4 (continued) 

interface was added by Viano [1987] to evaluate protective padding. Chest structure is represented by a 
parallel Voigt and Maxwell spring-dashpot system that couples the sternal m 2 and spinal m 3 masses. When 
subjected to a blunt sternal impact, the model follows established force-deflection corridors. The bio- 
mechanical model is effective in studying compression and viscous responses. It also simulates military 
exposure to high-speed nonpenetrating projectiles (Fig. 24.6), even though the loading conditions are 
quite different from the cadaver database used to develop the model. This mechanical system characterizes 
the elastic, viscous, and inertial components of the torso. 
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FIGURE 24.5 Lumped-mass model of the human thorax with impacting mass and energy-absorbing material 
interface. The biomechanical parameters are given for mass, spring, and damping characteristics of the chest in blunt 
frontal impact. (Modified from Lobdell [1973] by Viano [1987], with permission.) 
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FIGURE 24.6 Tolerance levels for blunt loading as a function of impact mass and velocity. The plot includes 
information from automotive impact situations and from high-speed military projectile impacts. The Lobdell model 
is effective over the entire range of impact conditions. (Modified from Quatros [1993], with permission.) 



© 2000 by CRC Press LLC 



The Hybrid III dummy reported on by Foster [1977] was the first to demonstrate human-like chest 
responses typical of the biomechanical data for frontal impacts. Rouhana [1989] developed a frangible 
abdomen, useful in predicting injury for lap-belt submarining. More recent work by Schneider [1992] 
led to a new prototype frontal dummy. Lateral impact tests of cadavers against a rigid wall and blunt 
pendulum led to side-impact dummies such as the Eurosid and Biosid [Mertz, 1993]. 

24.4 Injury Risk Assessment 

Over years of study, tolerances have been established for most responses of the chest and abdomen. 
Table 24. 1 provides tolerance levels from reviews by Cavanaugh [1993] and Rouhana [1993]. While these 
are single thresholds, they are commonly used to evaluate safety systems. The implication is that for 
biomechanical responses below tolerance, there is no injury, and for responses above tolerance, there is 
injury. An additional factor is biomechanical response scaling for individuals of different size and weight. 
The commonly accepted procedure involves equal stress and velocity, which enabled Mertz et al. [1989] 
to predict injury tolerances and biomechanical responses for different-sized adult dummies. 

Injury risk assessment is frequently used. It evaluates the probability of injury as a continuous function 
of a biomechanical response. A logist function relates injury probability p to a biomechanical response 
x by p(x) = [1 + exp(a - fix)] -1 , where a and [3 are parameters derived from statistical analysis of 
biomechanical data. This function provides a sigmoidal relationship with three distinct regions in 
Fig. 24.7. For low biomechanical response levels, there is a low probability of injury. Similarly, for very 
high levels, the risk asymptotes to 100%. The transition region between the two extremes involves risk 
that is proportional to the biomechanical response. A sigmoidal function is typical of human tolerance 
because it represents the distribution in weak through strong subjects in a population exposed to impact. 
Table 24.2 summarizes available parameters for chest and abdominal injury risk assessment. 



TABLE 24. 1 Human Tolerance for Chest and Abdomen Impact 





Chest 


Abdomen 




Criteria 


Frontal 


Lateral 


Frontal 


Lateral 


Criteria 


Acceleration 










Acceleration 


3-ms limit 

TTI 

ASA 

AIS4+ 


60s 


85-90g 

30g 

45g 




39g 


AIS4+ 


Force 










Force 


Sternum 


3.3 kN 










Chest + shoulder 


8.8 kN 


10.2 kN 








AIS3+ 






2.9 kN 


3.1 kN 


AIS3+ 


AIS4+ 




5.5 kN 


3.9 kN 


6.7 kN 


AIS4+ 


Pressure 






166 kPa 
216 kPa 




Pressure 
AIS3+ 
AIS4+ 


Compression 
Rib fracture 


20% 








Compression 


Ribcage 


32% 




38% 




AIS3+ 


Flail chest 


40% 


38% 


48% 


44% 


AIS4+ 


Viscous 










Viscous 


AIS3+ 


1.0 m/s 








AIS3+ 


AIS4+ 


1.3 m/s 


1.47 m/s 


1.4 m/s 


1.98 m/s 


AIS4+ 



Adapted from Cavanaugh [1993] and Rouhana [1993]. 
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2.5 



FIGURE 24.7 Typical logist injury-probability function relating the risk of serious injury to the viscous response 
of the chest. (From Viano [1988], with permission.) 



TABLE 24.2 Injury Probability Functions for Blunt Impact (modified from 
Viano [1989]) 



Body Region 


ED 25% 


a 


P 


x- 


P 


R 








Frontal 


Impact 






Chest (AIS 4+) 














VC 


1.0 m/s 


11.42 


11.56 


25.6 


0.000 


0.68 


C 


34% 


10.49 


0.277 
Lateral 


15.9 
Impact 


0.000 


0.52 


Chest (AIS 4+) 














VC 


1.5 m/s 


10.02 


6.08 


13.7 


0.000 


0.77 


C 


38% 


31.22 


0.79 


13.5 


0.000 


0.76 


Abdomen (AIS 4+) 














VC 


2.0 m/s 


8.64 


3.81 


6.1 


0.013 


0.60 


C 


47% 


16.29 


0.35 


4.6 


0.032 


0.48 


Pelvis (pubic ramus fracture) 














C 


27% 


84.02 


3.07 


11.5 


0.001 


0.91 
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Gait analysis is the quantitative measurement and assessment of human locomotion including both 
walking and running. A number of different disciplines use gait analysis. Basic scientists seek a better 
understanding of the mechanisms that normal ambulators use to translate muscular contractions about 
articulating joints into functional accomplishment, e.g., level walking and stair climbing. Increasingly, 
researchers endeavor to better appreciate the relationship between the human motor control systems and 
gait dynamics. With respect to running, athletes and their coaches use gait analysis techniques in a ceaseless 
quest for meaningful improvements in performance while avoiding injury. Sports equipment manufac- 
turers seek to quantify the perceived advantages of their products relative to a competitor's offering. 

In the realm of clinical gait analysis, medical professionals apply an evolving knowledge base in the 
interpretation of the walking patterns of impaired ambulators for the planning of treatment protocols, 
e.g., orthotic prescription and surgical intervention. Clinical gait analysis is an evaluation tool that allows 
the clinician to determine the extent to which an individual's gait has been affected by an already 
diagnosed disorder [Brand & Crowninshield, 1981]. Examples of clinical pathologies currently served by 
gait analysis include amputation, cerebral palsy (CP), degenerative joint disease, poliomyelitis, multiple 
sclerosis, muscular dystrophy, myelodysplasia, rheumatoid arthritis, stroke, and traumatic brain injury. 

Generally, gait analysis data collection protocols, measurement precision, and data reduction models 
have been developed to meet the requirements specific to the research, sport, or clinical setting. For 
example, gait measurement protocols in a research setting might include an extensive physical examina- 
tion to characterize the anthropometrics of each subject. This time expenditure may not be possible in 
a clinical setting. Also, sport assessments generally require higher data acquisition rates because of 
increased velocity amplitudes relative to walking. The focus of this chapter is on the methods for the 
assessment of walking patterns of persons with locomotive impairment, i.e., clinical gait analysis. The 
discussion will include a description of the available measurement technology, the components of data 
collection and reduction, the type of gait information produced for clinical interpretation, and the 
strengths and limitations of clinical gait analysis. 



25.1 Fundamental Concepts 



Clinical Gait Analysis Information 

Gait is a cyclic activity for which certain discrete events have been defined as significant. Typically, the 
gait cycle is defined as the period of time from the point of initial contact (also referred to as foot contact) 
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of the subject's foot with the ground to next point of initial contact for that same limb. Dividing the gait 
cycle in stance and swing phases is the point in the cycle where the stance limb leaves the ground, called 
toe off or foot off. Gait variables that change over time such as the patient's joint angular displacements 
are normally presented as a function of the individual's gait cycle for clinical analysis. This is done to 
facilitate the comparison of different walking trials and the use of a normative data base [Ounpuu et al., 
1991]. Data that are currently provided for the clinical interpretation of gait include 

• Static physical examination measures, such as passive joint range of motion, muscle strength and 
tone, and the presence and degree of bony deformity 

• Stride and temporal parameters, such as step length and walking velocity 

• Segment and joint angular displacements commonly referred to as kinematics 

• The forces and torque applied to the subject's foot by the ground, or ground reaction forces 

• The reactive joint moments produced about the lower extremity joints by active and passive soft tissue 
forces as well as the associated mechanical power of the joint moment, collectively referred to as kinetics 

• Indications of muscle activity during gait, i.e., voltage potentials produced by contracting muscles, 
known as dynamic electromyography (EMG) 

• A measure of metabolic energy expenditure, e.g., oxygen consumption, energy cost. 

• A videotape of the individual's gait trial for qualitative review and quality control purposes 

Data Collection Protocol 

The steps involved in the gathering of data for the interpretation of gait pathologies usually include a 
complete physical examination, biplanar videotaping, and multiple walks of the "instrumented" subject 
along a walkway that is commonly both level and smooth. The time to complete these steps can range 
from three to five hours (Table 25.1). Although the standard for analysis is barefoot gait, subjects are 
tested in other conditions as well, e.g., lower extremity orthoses and crutches. Requirements and con- 
straints associated with clinical gait data gathering include the following: 

• The patient should not be intimidated or distracted by the testing environment. 

• The measurement equipment and protocols should not alter the subject's gait. 

• Patient preparation and testing time must be minimized, and rest (or play) intervals must be 
included in the process. 

• Data collection techniques must be reasonably repeatable. 

• Methodology must be sufficiently robust and flexible to allow the evaluation of a variety of 
pathological gait abnormalities where the dynamic range of motion and anatomy may be signif- 
icantly different from normal. 

• The collected data must be validated before the end of the test period, e.g., raw data fully processed 
before the patient leaves the facility. 

TABLE 25.1 A Typical Gait Data Collection Protocol 

Test Component Estimate Time (minutes) 

Pretest tasks: Test explanation to the adult subject or child and parent, system calibration 10 

Videotaping: Brace, barefoot, close-up, standing 15-25 

Clinical examination: Range of motion, muscle strength, etc. 30-45 

Motion marker placement 15-20 

Motion data collection: Subject calibration and multiple walks, per test condition (barefoot 30-60 

and orthosis) 

Electromyography (surface electrodes and fine-wire electrodes) 20-60 

Data reduction of all trials 30-90 

Data interpretation 20-30 

Report dictation, generation, and distribution 120-180 
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Measurement Approaches and Systems 

The purpose of this section is to provide an overview of the several technologies that are available to 
measure the dynamic gait variables listed above, including stride and temporal parameters, kinematics, 
kinetics, and dynamic EMG. Methods of data reduction will be described in a section that follows. 

Stride and Temporal Parameters 

The timing of the gait cycle events of initial contact and toe off must be measured for the computation 
of the stride and temporal quantities. These measures may be obtained through a wide variety of 
approaches ranging from the use of simple tools such as a stop watch and tape measure to sophisticated 
arrays of photoelectric monitors. Foot switches maybe applied to the plantar aspect of the subject's foot 
over the bony prominences of the heel and metatarsal heads in different configurations depending on 
the information desired. A typical configuration is the placement of a switch on the heel, first (and fifth) 
metatarsal heads and great toe. In a clinical population, foot switch placement is challenging because of 
the variability of foot deformities and foot-ground contact patterns. This switch placement difficulty is 
avoided through the use of either shoe insoles instrumented with one or two large foot switches or entire 
contact sensitive walkways. Alternatively, video cameras may be employed with video frame counters to 
determine the timing of initial contact and toe-off events. These gait events may also be measured using 
either the camera-based motion measurement systems or the force platform technology described below. 

Motion Measurement 

A number of alternative technologies are available for the measurement of body segment spatial position 
and orientation. These include the use of electrogoniometry, high-speed photography, accelerometry, 
and video-based digitizers. These approaches are described below. 

Electrogoniometry. A simple electrogoniometer consists of a rotary potentiometer with arms fixed to 
the shaft and base for attachment to the extremity across the joint of interest. The advantages of multiaxial 
goniometers (more appropriate for human joint motion measurement) include the capability for real- 
time display and rapid collection of single-joint information on many subjects. Electrogoniometers are 
limited to the measurement of relative angles and may be cumbersome in many typical clinical applica- 
tions such as the simultaneous, bilateral assessment of hip, knee, and ankle motion. 

Cinefilm. High-speed photography offers particular advantages in the assessment of activities such as 
sprinting that produce velocity and acceleration magnitudes greater than those realized in walking. This 
approach is not attractive for clinical use because it is labor intensive, e.g., each frame of data is digitized 
individually and requires an unacceptably long processing time. 

Accelerometry. Multiaxis accelerometers can be employed to measure both linear and angular acceler- 
ations (if multiple transducers are properly configured). Velocity and position data may then be derived 
through numerical integration, although care must be taken with respect to the selection of initial 
conditions and the handling of gravitational effects. 

Videocamera-based Systems. This approach to human motion measurement involves the use of exter- 
nal markers that are placed on the subject's body segments and aligned with specific bony landmarks. 
The marker trajectories produced by the subject's ambulation through a specific measurement volume 
are then monitored by a system of cameras (generally from two to seven) placed around the measurement 
volume. In a frame-by-frame analysis, stereophotogrammetric techniques are then used to produce the 
instantaneous three-dimensional (3-D) coordinates of each marker (relative to a fixed laboratory coor- 
dinate system) from the set of two-dimensional camera images. The processing of the 3-D marker 
coordinate data is described in a later section. 

The videocamera-based systems employ either passive (retroflective) or active (light-emitting diodes) 
markers. Passive marker camera systems use either strobe light sources (typically infrared light- emitting 
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diodes (LEDs) configured in rings around the camera lens) or electronically shuttered cameras. The 
cameras then capture the light returned from the highly reflective markers (usually small spheres). Active 
marker camera systems record the light that is produced by small LED markers that are placed directly 
on the subject. Advantages and disadvantages are associated with each approach. For example, the 
anatomical location (or identity) of each marker used in an active marker system is immediately known 
because the markers are sequentially pulsed by the controlling computer. User interaction is required 
currently for marker identification in passive marker systems, although algorithms have been developed 
to expedite this process, i.e., automatic tracking. The system of cables required to power and control the 
LEDs of the active marker system may increase the possibility for subject distraction and gait alteration. 

Ground Reaction Measurement 

Force Platforms. The three components of the ground reaction force vector, the ground reaction torque 
(vertical), and the point of application of the ground reaction force vector (i.e., center of pressure) are 
measured with force platforms embedded in the walkway. Force plates with typical measurement surface 
dimensions of 0.5 x 0.5 m are comprised of several strain gauges or piezoelectric sensor arrays rigidly 
mounted together. 

Foot Pressure Distributions. The dynamic distributed load that corresponds to the vertical ground 
reaction force can be evaluated with the use of a flat, two-dimensional array of small piezo resistive sensors. 
Overall resolution of the transducer is dictated by the size of the individual sensor "cell." Sensor arrays 
configured as shoe insole inserts and flat plates offer the clinical user two measurement alternatives. 
Although the currently available technology does afford the clinical practitioner better insight into the 
qualitative force distribution patterns across the planar surface of the foot, its quantitative capability is 
limited because of the challenge of calibration and signal stability (e.g., temperature-dependent sensors). 

Dynamic Electromyography (EMG) 

Electrodes placed on the skin's surface and fine wires inserted into muscle are used to measure the voltage 
potentials produced by contracting muscles. The activity of the lower limb musculature is evaluated in 
this way with respect to the timing and the intensity of the contraction. Data collection variables that 
affect the quality of the EMG signal include the placement of and distance between recording electrodes, 
skin surface conditions, distance between electrode and target muscle, signal amplification and filtering, 
and the rate of data acquisition. The phasic characteristics of the muscle activity may be estimated from 
the raw EMG signal. The EMG data may also be presented as a rectified and/or integrated waveform. To 
evaluate the intensity of the contraction, the dynamic EMG amplitudes are typically normalized by a 
reference value, e.g., the EMG amplitude during a maximum voluntary contraction. This latter require- 
ment is difficult to achieve consistently for patients who have limited isolated control of individual 
muscles, such as children with CP. 

25.2 Gait Data Reduction 

The predominant approach for the collection of clinical gait data involves the placement of external 
markers on the surface of body segments that are aligned with particular bony landmarks. These markers 
are commonly attached to the subjects as either discrete units or in rigidly connected clusters (Fig. 25.1). 
As described briefly above, the products of the data acquisition process are arrays containing the 3-D 
coordinates (relative to an inertially fixed laboratory coordinate system) of the spatial trajectory of each 
marker over a gait cycle. If at least three markers or reference points are identified for each body segment, 
then the six degrees of freedom associated with the position of the segment may be determined. The 
following example illustrates this straightforward process. 

Assume that a cluster of three markers has been attached to the thigh and shank of the test subject as 
shown in Fig. 25.2. A body-fixed coordinate system may be computed for each cluster. For example, for the 
thigh, the vector cross product of the unit vectors from markers B to A and B to C produces a vector that 
is perpendicular to the cluster plane. From these vectors, the unit vectors T TX and T TY may be determined 
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FIGURE 25.1 Videocamera-based 
motion measurement systems monitor 
the displacement of external markers 
that are placed on the subject's body 
segments and aligned with specific bony 
landmarks. These markers are com- 
monly attached to the subject as either 
discrete units, e.g., the pelvis, or in rig- 
idly connected clusters, e.g., on the 
thigh and shank. 




FIGURE 25.2 A body-fixed coordinate system may be com- 
puted for each cluster of three or more markers. On the thigh, 
for example, the vector cross product of the unit vectors from 
markers B to A and B to C produces a vector that is perpendicular 
to the cluster plane. From these vectors, the unit vectors T TX and 
T TY may be determined and used to compute the third orthogonal 
coordinate direction T T7 . 




and used to compute the third orthogonal coordinate direction T TZ . In a similar manner the marker-based, 
or technical, coordinate system may be calculated for the shank, i.e., S TX , S TY , and S TZ . At this point, one 
might use these two technical coordinate systems to provide an estimate of the absolute orientation of the 
thigh or shank or the relative angles between the thigh and shank. This assumes that the technical coordinate 
systems reasonably approximately the anatomical axes of the body segments, e.g., that T TZ approximates 
the long axis of the thigh. A more rigorous approach incorporates the use of a subject calibration procedure 
to relate technical coordinate systems with pertinent anatomical directions [Cappozzo, 1984]. 
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In a subject calibration, usually performed with the sub- 
ject standing, additional data are collected by the measure- 
ment system that connects the technical coordinate systems 
to the underlying anatomical structure. For example, as 
shown in Fig. 25.3, the medial and lateral femoral condyles 
and the medial and lateral malleoli may be used as anatom- 
ical references with the application of additional markers. 
With the hip center location estimated from markers placed 
on the pelvis [Bell et al., 1989], and knee and ankle center 
locations based on the additional markers, anatomical coor- 
dinate systems may be computed, e.g., {T A } and {S A }. The 
relationship between the respective anatomical and technical 
coordinate system pairs as well as the location of the joint 
centers in terms of the appropriate technical coordinate sys- 
tem may be stored, to be recalled in the reduction of each 
frame of the walking data. In this way, the technical coordi- 
nate systems (shown in Fig. 25.3) are transformed into align- 
ment with the anatomical coordinate systems. 

Once anatomically aligned body-fixed coordinate systems 
have been computed for each body segment under investi- 
gation, one may compute the angular position of the joints 
and segments in a number of ways. The classical approach 
of Euler angles is commonly used in clinical gait analysis to 
describe the motion of the thigh relative to the pelvis (or hip 
angles), the motion of the shank relative to the thigh (or 
knee angles), the motion of the foot relative to the shank (or 
ankle angles), as well as the absolute orientation of the pelvis 

and foot in space [Grood & Suntay, 1983; Ounpuu et al., 1991]. The joint rotation sequence commonly 
used for the Euler angle computation is flexion-extension, adduction-abduction, and transverse plane 
rotation. Alternatively, joint motion has been described through the use of helical axes [Woltring et al., 
1985]. 

The moments that soft tissue (e.g., muscle, ligaments, joint capsule) forces produce about approximate 
joint centers may be computed through the use of inverse dynamics, i.e., Newtonian mechanics. For 
example, the free body diagram of the foot shown in Fig. 25.4 shows the various external loads to the 
foot as well as the reactions produced at the ankle. The mass, mass moments of inertia, and location of 
the center of mass may be estimated from regression-based anthropometric relationships [Dempster 
et al., 1959], and linear and angular velocity and acceleration may be determined by numerical differen- 
tiation. If the ground reaction loads, F G and T, are provided by a force platform, then the unknown ankle 
reaction force F A may be solved for with Newton's second law. Euler's equations of motion may then be 
applied to compute the net ankle reaction moment, M A . In the application of Euler's equations of motion, 
care must be taken to perform the vector operations with all vectors transformed into the foot coordinate 
system which has been chosen to approximate the principles axes and located at the center of mass of 
the foot [Greenwood, 1965]. This process may then be repeated for the shank and thigh by using distal 
joint loads to solve for the proximal joint reactions. The mechanical power associated with a joint moment 
and the corresponding joint angular velocity may be computed from the vector dot product of the two 
vectors, e.g., ankle power is computed through M A - co A where ca A is the angular velocity of the foot relative 
to the shank. 

Although commonly referred to as muscle moments these net joint reaction moments are generated 
by several mechanisms, e.g., ligamentous forces, passive muscle and tendon force, and active muscle 
contractile force, in response to external loads. Currently, clinical muscle force evaluation is not possible 
because of the scarcity of data related to location of the instantaneous line of muscle force as well as the 



FIGURE 25.3 A subject calibration relates 
technical coordinate systems with anatomical 
coordinate systems, e.g., {T T j with )T A }, 
through the identification of anatomical land- 
marks, e.g., the medial and lateral femoral 
condyles and medial and lateral malleoli. 
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FIGURE 25.4 The moments that soft-tissue forces produce about approximate joint centers may be computed 
through the use of Newtonian mechanics. This free-body diagram of the foot illustrates the external loads to the 
foot, e.g., the ground reaction loads, F G and T, and the weight of the foot, m foot g, as well as the unknown reactions 
produced at the ankle, F A and M A . 

overall indeterminacy of the problem. The possibility of the cocontraction of opposing muscle groups 
also impacts the estimation of bone-on-bone force values, i.e., the magnitude of F A found above reflects 
the minimum estimated value of the bone-on-bone force at the ankle. 

With respect to the kinematic data reduction, the body segments are assumed to be rigid, e.g., soft- 
tissue movement relative to underlying bony structures is small. Consequently, marker of instrumentation 
attachment sites must be selected carefully, e.g., over tendonous structures of the distal shank as opposed 
to the more proximal muscle masses of the gastrocnemius and soleus. The models described above also 
assume that the joint center locations remain fixed relative to the respective segmental coordinate systems, 
e.g., the knee center is fixed relative to the thigh coordinate system. The measurement technology and 
associated protocols cannot currently produce data of sufficient quality for the reliable determination of 
instantaneous centers of rotation. 

The addition assumptions associated with the gait kinetics model described above are related to the 
inertial properties of the body segments. Body segment mass and mass distribution, i.e., mass moments 
of inertia, are generally estimated from statistical relationships derived from cadaver studies. Differences 
between body types are not typically incorporated into these anthropometric models. Moreover, the mass 
distribution changes are assumed to be negligible during motion. 

25.3 Illustrative Clinical Case 

As indicated above, the information available for clinical gait interpretation may include static physical 
examination measures, stride and temporal data, segment and joint kinematics, joint kinetics, elec- 
tromyograms, and a video record. With this information the clinical team can assess the patient's gait 
deviations, attempt to identify the etiology of the abnormalities, and recommend treatment alternatives. 
In this way, clinicians are able to isolate the biomechanical insufficiency that may produce a locomotive 
impairment and require a compensatory response from the patient. For example, a patient may excessively 
elevate a hip (compensatory) in order to gain additional foot clearance in swing which is perhaps 
inadequate due to a weak ankle dorsiflexor (primary problem). The knee kinematics and moments and 
associated electromyographic data shown in Fig. 25.5 were generated by an 11-year-old male with a 
diagnosis of cerebral palsy, spastic diplegia. Based on the limited knee range of motion, insufficient knee 
flexion in the swing phase of the gait cycle, and abnormal activity of the rectus femoris in the swing 
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FIGURE 25.5 These are the knee kinematics and moments and associated electromyographic data for a represen- 
tative subject with a diagnosis of cerebral palsy, spastic diplegia (including shaded bands/bars indicating one standard 
deviation about normal). 

phase, a transfer of the distal end of the rectus femoris to the sartorius was recommended. Moreover, 
the excessive knee flexion at initial contact and throughout the stance phase due to overactivity of the 
hamstrings suggested intramuscular lengthenings of the medial and lateral hamstrings. Transverse plane 
deviations apparent from the gait analysis (not shown) lead to recommendations of derotational osteot- 
omies of both the subject's femurs and tibias. 



25.4 Gait Analysis: Current Status 



As indicated in the modeling discussion above, the utility of gait analysis information may be limited by 
sources of error such as soft tissue displacement relative to bone, estimates of joint center locations, 
particularly the hip, approximations of the inertial properties of the body segments, and the numerical 
differentiation of noisy displacement data. Other errors associated with data collection alter the results 
as well, for example, a marker is improperly placed or a force platform is inadvertently contacted by the 
swing limb. The evaluation of small subjects weakens the data because intermarker distances are reduced, 
thereby reducing the precision of angular computations. It is essential that the potential adverse effects 
of these errors on the gait information be understood and appreciated by the clinician in the interpretation 
process. 

Controversies related to gait analysis techniques include the use of individually placed markers versus 
clusters of markers, the estimation of quasi-static, body-fixed locations of joint centers (as described 
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above) versus the dynamic determination of the instantaneous locations, and the application of helical 
or screw axes versus the use of Euler angles. Additional research and development is needed to resolve 
these fundamental methodological issues. 

Despite these limitations, gait analysis facilitates the systematic quantitative documentation of walking 
patterns. With the various gait data, the clinician has the opportunity to separate the primary causes of 
a gait abnormality from compensatory gait mechanisms. Apparent contradictions between the different 
types of gait information can result in a more carefully developed understanding of the gait deviations. 
It provides the clinical user the capability to more precisely (than observational gait analysis alone) plan 
complex multilevel surgeries and evaluate the efficacy of different surgical approaches or orthotic designs. 
Through gait analysis, movement in planes of motion not easily observed, such as about the long axes 
of the lower limb segments, may be quantified. Finally, quantities that cannot be observed may be assessed, 
e.g., muscular activity and joint kinetics. In the future, it is anticipated that our understanding of gait 
will be enhanced through the application of pattern recognition strategies, coupled dynamics, and the 
linkage of empirical results derived through inverse dynamics with the simulations provided by forward 
dynamics modeling. 
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The study of exercise is important to medical and biological engineers. Cognizance of acute and chronic 
responses to exercise gives an understanding of the physiological stresses to which the body is subjected. 
To appreciate exercise responses requires a true systems approach to physiology, because during exercise 
all physiological responses become a highly integrated, total supportive mechanism for the performance 
of the physical stress of exercise. Unlike the study of pathology and disease, the study of exercise physiology 
leads to a wonderful understanding of the way the body is supposed to work while performing at its 
healthy best. 

For exercise involving resistance, physiological and psychological adjustments begin even before the 
start of the exercise. The central nervous system (CNS) sizes up the task before it, assessing how much 
muscular force to apply and computing trial limb trajectories to accomplish the required movement. 
Heart rate may begin rising in anticipation of increased oxygen demands and respiration may also 



26.1 Muscle Energetics 

Deep in muscle tissue, key components have been stored for this moment. Adenosine triphosphate (ATP), 
the fundamental energy source for muscle cells, is at maximal levels. Also stored are significant amounts 
of creatine phosphate and glycogen. 

When the actinomyocin filaments of the muscles are caused to move in response to neural stimulation, 
ATP reserves are rapidly used, and ATP becomes adenosine diphosphate (ADP), a compound with much 
less energy density than ATP. Maximally contracting mammalian muscle uses approximately 1.7 x 
10~ 5 mole of ATP per gram per second [White et al., 1959]. ATP stores in skeletal muscle tissue amount 
to 5 x 10~ 6 mole per gram of tissue, or enough to meet muscle energy demands for no more than 0.5 s. 

Initial replenishment of ATP occurs through the transfer of creatine phosphate (CP) into creatine. 
The resting muscle contains 4 to 6 times as much CP as it does ATP, but the total supply of high-energy 
phosphate cannot sustain muscle activity for more than a few seconds. 

Glycogen is a polysaccharide present in muscle tissues in large amounts. When required, glycogen is 
decomposed into glucose and pyruvic acid, which, in turn, becomes lactic acid. These reactions form 
ATP and proceed without oxygen. They are thus called anaerobic. 
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When sufficient oxygen is available (aerobic conditions), either in muscle tissue or elsewhere, these 
processes are reversed. ATP is reformed from ADP and AMP (adenosine monophosphate), CP is reformed 
from creatine and phosphate (P), and glycogen is reformed from glucose or lactic acid. Energy for these 
processes is derived from the complete oxidation of carbohydrates, fatty acids, or amino acids to form 
carbon dioxide and water. These reactions can be summarized by the following equations: 

Anaerobic: 



ATP <-> ADP + P + free energy 



(26.1) 



CP + ADP O creatine + ATP 
glycogen or glucose + P + ADP — » lactate + ATP 



(26.2) 
(26.3) 



Aerobic: 



Glycogen or fatty acids + P + ADP + d 2 ^> C0 2 + H 2 Q + ATP 



(26.4) 



All conditions: 



2ADP<->ATP + AMP 



(26.5) 



The most intense levels of exercise occur anaerobically [Mole, 1983] and can be maintained for only 
a minute or two (Fig. 26.1). 




1.0 1.5 

TIME (min) 



2.0 



2.5 



FIGURE 26.1 Muscle energy sources at the beginning of exercise. (Redrawn with permission from Mole, 1983) 
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26.2 Cardiovascular Adjustments 

Mechanoreceptors in the muscles, tendons, and joints send information to the CNS that the muscles 
have begun movement, and this information is used by the CNS to increase heart rate via the sympathetic 
autonomic nervous system. Cardiac output, the rate of blood pumped by the heart, is the product of 
heart rate and stroke volume (amount of blood pumped per heart beat). Heart rate increases nearly 
exponentially at the beginning of exercise with a time constant of about 30 s. Stroke volume does not 
change immediately but lags a bit until the cardiac output completes the loop back to the heart. 

During rest a large volume of blood is stored in the veins, especially in the extremities. When exercise 
begins, this blood is transferred from the venous side of the heart to the arterial side. Attempting to push 
extra blood flow through the resistance of the arteries causes a rise in both systolic (during heart 
ventricular contraction) and diastolic (during the pause between contractions) blood pressures. The 
increased blood pressures are sensed by baroreceptors located in the aortic arch and carotid sinus 
(Fig. 26.2). 

As a consequence, small muscles encircling the entrance to the arterioles (small arteries) are caused 
to relax by the CNS. Since, by Poiseuille's Law: 



8ILX 

4 

%r 



(26.6) 



where R = resistance of a tube, N-s/m 5 

L = length of the tube, m 

u. = viscosity of the fluid, kg/(m-s) or N-s/m 2 

r = radius of the tube lumen, m 

increasing the arteriole radius by 19% will decrease its resistance to one-half. Thus, systolic pressure 
returns to its resting value, and diastolic pressure may actually fall. Increased blood pressures are called 
afterload on the heart. 

To meet the oxygen demand of the muscles, blood is redistributed from tissues and organs not directly 
involved with exercise performance. Thus, blood flows to the gastrointestinal tract and kidneys are 
reduced, whereas blood flows to skeletal muscle, cardiac muscle, and skin are increased. 

The heart is actually two pumping systems operated in series. The left heart pumps blood throughout 
the systemic blood vessels. The right heart pumps blood throughout the pulmonary system. Blood 
pressures in the systemic vessels are higher than blood pressures in the pulmonary system. 
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FIGURE 26.2 General scheme for blood pressure regulation. Dashed lines indicate neural communication, and 
solid lines indicate direct mechanical effect. 
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Two chambers comprise each heart. The atrium is like an assist device that produces some suction and 
collects blood from the veins. Its main purpose is to deliver blood to the ventricle, which is the more powerful 
chamber that develops blood pressure. The myocardium (heart muscle) of the left ventricle is larger and 
stronger than the myocardium of the right ventricle. With two hearts and four chambers in series, there 
could be a problem matching flow rates from each of them. If not properly matched, blood could easily 
accumulate downstream from the most powerful chamber and upstream from the weakest chamber. 

Myocardial tissue exerts a more forceful contraction if it is stretched before contraction begins. This 
property (known as Starling's law of the heart) serves to equalize the flows between the two hearts by 
causing a more powerful ejection from the heart in which more blood accumulates during diastole. The 
amount of initial stretching of the cardiac muscle is known as preload. 

26.3 Maximum Oxygen Uptake 

The heart has been considered to be the limiting factor for delivery of oxygen to the tissues. As long as 
oxygen delivery is sufficient to meet demands of the working muscles, exercise is considered to be aerobic. 
If oxygen delivery is insufficient, anaerobic metabolism continues to supply muscular energy needs, but 
lactic acid accumulates in the blood. To remove lactic acid and reform glucose requires the presence of 
oxygen, which must usually be delayed until exercise ceases or exercise level is reduced. 

Fitness of an individual is characterized is characterized by a mostly reproducible measurable quantity 
known as maximal oxygen uptake ( V , max ) that indicates a person's capacity for aerobic energy transfer 
and the ability to sustain high-intensity exercise for longer than 4 or 5 minutes (Fig. 26.3). The more fit, 
the higher is V 02lMI . Typical values are 2.5 L/min for young male nonathletes, 5.0 L/min for well-trained 
male athletes; women have V 02max values about 70-80% as large as males. Maximal oxygen uptake declines 
with age steadily at 1% per year. 

Exercise levels higher than those that result in V , mix can be sustained for various lengths of time. The 
accumulated difference between the oxygen equivalent of work and V 02max is called the oxygen deficit 
incurred by an individual (Fig. 26.4). There is a maximum oxygen deficit that cannot be exceeded by an 
individual. Once this maximum deficit has been reached, the person must cease exercise. 

The amount of oxygen used to repay the oxygen deficit is called the excess postexercise oxygen con- 
sumption (EPOC). EPOC is always larger than the oxygen deficit because: (1) elevated body temperature 
immediately following exercise increases bodily metabolism in general, which requires more than resting 
levels of oxygen to service, (2) increased blood epinephrine levels increase general bodily metabolism, 
(3) increased respiratory and cardiac muscle activity requires oxygen, (4) refilling of body oxygen stores 
requires excess oxygen, and (5) there is some thermal inefficiency in replenishing muscle chemical stores. 
Considering only lactic acid oxygen debt, the total amount of oxygen required to return the body to its 
normal resting state is about twice the oxygen debt; the efficiency of anaerobic metabolism is about 50% 
of aerobic metabolism. 

26.4 Respiratory Responses 

Respiratory also increases when exercise begins, except that the time constant for respiratory response is 

about 45 s instead of 30 s for cardiac responses (Table 26.1). Control of respiration (Fig. 26.5) appears 

to begin with chemoreceptors located in the aortic arch, in 

the carotid bodies (in the neck), and in the ventral medula TABLE 26.1 Comparison of Response 

(in the brain). These receptors are sensitive to oxygen, carbon Time Constants for Three Major Systems 

dioxide, and acidity levels but are most sensitive to carbon of the Body 

dioxide and acidity. Thus, the function of the respiratory System Dominant Time Constant, s 

system appears to be remove excess carbon dioxide and, sec- Heart 30 

ondarily, to supply oxygen. Perhaps this is because excess CO, Respiratory system 45 

• n- i • Oxygen uptake 49 

has narcotic effects, but insufficient oxygen does not produce Thermal system 3600 

severe reactions until oxygen levels in the inhaled air fall to 
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FIGURE 26.3 Concurrent typical changes in blood and respiratory parameters during exercise progressing from 
rest to maximum [adapted and redrawn from Skinner and McLellan (1980) by permission of the American Alliance 
for Health, Physical Education, Recreation and Dance]. 

one-half of normal. There is no well-established evidence that respiration limits oxygen delivery to the 
tissues in normal individuals. 

Oxygen is conveyed by convection in the upper airways and by diffusion in the lower airways to the 
alveoli (lower reaches of the lung where gas exchange with the blood occurs). Oxygen must diffuse from 
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FIGURE 26.4 Oxygen uptake at the beginning of constant-load exercise increases gradually, accumulating an oxygen 
deficit that must be repaid at the end of exercise. 
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FIGURE 26.5 General scheme of respiratory control. 



the alveoli, through the extremely thin alveolocapillary membrane into solution in the blood. Oxygen 
diffuses further into red blood cells where it is bound chemically to hemoglobin molecules. The order 
of each of these processes is reversed in the working muscles where the concentration gradient of oxygen 
is in the opposite direction. Complete equilibration of oxygen between alveolar air and pulmonary blood 
requires about 0.75 seconds. Carbon dioxide requires somewhat less, about 0.50 seconds. Thus, alveolar 
air more closely reflects levels of blood carbon dioxide than oxygen. 

Both respiration rate and tidal volume (the amount of air moved per breath) increase with exercise, 
but above the anaerobic threshold the tidal volume no longer increases (remains at about 2-2.5 L). From 
that point, increases in ventilation require greater increases in respiration rate. A similar limitation occurs 
for stroke volume in the heart (limited to about 120 mL). 



© 2000 by CRC Press LLC 



The work of respiration, representing only about 1-2% of the body's oxygen consumption at rest, 
increases to 8-10% or more of the body's oxygen consumption during exercise. Contributing greatly to 
this is the work to overcome resistance to movement of air, lung tissue, and chest wall tissue. Turbulent 
airflow in the upper airways (those nearest and including the mouth and nose) contributes a great deal 
of pressure drop. The lower airways are not as rigid as the upper airways and are influenced by the 
stretching and contraction of the lung surrounding them. High exhalation pressures external to the 
airways coupled with low static pressures inside (due to high flow rates inside) tend to close these airways 
somewhat and limit exhalation airflow rates. Resistance of these airways becomes very high, and the 
respiratory system appears like a flow source, but only during extreme exhalation. 

26.5 Optimization 

Energy demands during exercise are so great that optimal courses of action are followed for many 
physiological responses (Table 26.2). Walking occurs most naturally at a pace that represents the smallest 
energy expenditure; the transition from walking to running occurs when running expends less energy 
than walking; ejection of blood from the left ventricle appears to be optimized to minimize energy 
expenditure; respiratory rate, breathing waveforms, the ratio of inhalation time to exhalation time, 
airways resistance, tidal volume, and other respiratory parameters all appear to be regulated to minimize 
energy expenditure [Johnson, 1993]. 

26.6 Thermal Response 

When exercise extends for a long enough time, heat begins to build up in the body. In order for heat 
accumulation to become important, exercise must be performed at a relatively low rate. Otherwise, 
performance time would not be long enough for significant amounts of heat to be stored. 

Muscular activities are at most 20-25% efficient, and, in general, the smaller the muscle, the less 
efficient it is. Heat results from the other 75-80% of the energy supplied to the muscle. 

Thermal challenges are met in several ways. Blood sent to the limbs and blood returning from the 
limbs are normally conveyed by arteries and veins in close proximity deep inside the limb. This tends to 
conserve heat by countercurrent heat exchange between the arteries and veins. Thermal stress causes 
blood to return via surface veins rather than deep veins. Skin surface temperature increases and heat loss 
by convection and radiation also increases. In addition, vasodilation of cutaneous blood vessels augments 
surface heat loss but puts an additional burden on the heart to deliver added blood to the skin as well 
as the muscles. Heart rate increases as body temperature rises. 

Sweating begins. Different areas of the body begin sweating earlier than others, but soon the whole 
body is involved. If sweat evaporation occurs on the skin surface, then the full cooling power of evapo- 
rating sweat (670 W-h/kg) is felt. If the sweat is absorbed by clothing, then the full benefit of sweat 
evaporation is not realized at the skin. If the sweat falls from the skin, no benefit accrues. 

Sweating for a long time causes loss of plasma volume (plasma shift), resulting in some hemoconcentration 
(2% or more). This increased concentration increases blood viscosity, and cardiac work becomes greater. 

26.7 Applications 

Knowledge of exercise physiology imparts to the medical or biological engineer the ability to design 
devices to be used with or by humans or animals, or to borrow ideas from human physiology to apply 
to other situations. There is need for engineers to design the many pieces of equipment used by sports 
and health enthusiasts, to modify prostheses or devices for the handicapped to allow for performance of 
greater than light levels of work and exercise, to alleviate physiological stresses caused by personal 
protective equipment and other occupational ergonometric gear, to design human-powered machines 
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TABLE 26.2 Summary of Exercise Responses for a Normal Young Male 







Light 


Moderate 


Heavy 


Maximal 




Rest 


Exercise 


Exercise 


Exercise 


Exercise 


Oxygen uptake (L/min) 


0.30 


0.60 


2.2 


3.0 


3.2 


Maximal oxygen uptake (%) 


10 


20 


70 


95 


100 


Physical work rate (watts) 





10 


140 


240 


430 


Aerobic fraction (%) 


100 


100 


98 


85 


50 


Performance time (min) 


00 


480 


55 


9.3 


3.0 


Carbon dioxide production (L/min) 


0.18 


1.5 


2.3 


2.8 


3.7 


Respiratory exchange ratio 


0.72 


0.84 


0.94 


1.0 


1.1 


Blood lactic acid (mMol/L) 


1.0 


1.8 


4.0 


7.2 


9.6 


Heart rate (beats/min) 


70 


130 


160 


175 


200 


Stroke volume (L) 


0.075 


0.100 


0.105 


0.110 


0.110 


Cardiac output (L/min) 


5.2 


13 


17 


19 


22 


Minute volume (L/min) 


6 


22 


50 


80 


120 


Tidal volume (L) 


0.4 


1.6 


2.3 


2.4 


2.4 


Respiration rate (breaths/min) 


15 


26 


28 


57 


60 


Peak flow (L/min) 


216 


340 


450 


480 


480 


Muscular efficiency (%) 





5 


18 


20 


20 


Aortic hemoglobin saturation (%) 


98 


97 


94 


93 


92 


Inhalation time (s) 


1.5 


1.25 


1.0 


0.7 


0.5 


Exhalation time (s) 


3.0 


2.0 


1.1 


0.75 


0.5 


Respiratory work rate (watts) 


0.305 


0.705 


5.45 


12.32 


20.03 


Cardiac work rate (watts) 


1.89 


4.67 


9.61 


11.81 


14.30 


Systolic pressure (mmHg) 


120 


134 


140 


162 


172 


Diastolic pressure (mmHg) 


80 


85 


90 


95 


100 


End-inspiratory lung volume (L) 


2.8 


3.2 


4.6 


4.6 


4.6 


End-expiratory lung volume (L) 


2.4 


2.2 


2.1 


2.1 


2.1 


Gas partial pressures (mmHg) 












Arterial pCO, 


40 


41 


45 


48 


50 


p0 2 


100 


98 


94 


93 


92 


Venous pC0 2 


44 


57 


64 


70 


72 


P o 2 


36 


23 


17 


10 


9 


Alveolar pC0 2 


32 


40 


28 


20 


10 


p0 2 


98 


94 


110 


115 


120 


Skin conductance [watts/(m 2 -°C] 


5.3 


7.9 


12 


13 


13 


Sweat rate (kg/s) 


0.001 


0.002 


0.008 


0.007 


0.002 


Walking/running speed (m/s) 





1.0 


2.2 


6.7 


7.1 


Ventilation/perfusion of the lung 


0.52 


0.50 


0.54 


0.82 


1.1 


Respiratory evaporative water loss (L/min) 


1.02 x 10- 5 


4.41 x 10- 5 


9.01 x 10-' 


1.35 x 10- 3 


2.14 x 10- 3 


Total body convective heat loss (watts) 


24 


131 


142 


149 


151 


Mean skin temperature (°C) 


34 


32 


30.5 


29 


28 


Heat production (watts) 


105 


190 


640 


960 


1720 


Equilibrium rectal temperature (°C) 


36.7 


38.5 


39.3 


39.7 


500 


Final rectal temperature (°C) 


37.1 


38.26 


39.3 


37.4 


37 



that are compatible with the capabilities of the operators, and to invent systems to establish and maintain 
locally benign surroundings in otherwise harsh environments. Recipients of these efforts include athletes, 
the handicapped, laborers, firefighters, space explorers, military personnel, farmers, power-plant workers, 
and many others. The study of exercise physiology, especially in the language used by medical and 
biological engineers, can result in benefits to almost all of us. 
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Defining Terms 

Anaerobic threshold: The transition between exercise levels that can be sustained through nearly 
complete aerobic metabolism and those that rely on at least partially anaerobic metabolism. Above 
the anaerobic threshold, blood lactate increases and the relationship between ventilation and 
oxygen uptake becomes nonlinear. 

Maximum oxygen consumption: The maximum rate of oxygen use during exercise. The amount of 
maximum oxygen consumption is determined by age, sex, and physical condition. 

Excess postexercise oxygen consumption (EPOC): The difference between resting oxygen consump- 
tion and the accumulated rate of oxygen consumption following exercise termination. 

Oxygen deficit: The accumulated difference between actual oxygen consumption at the beginning of 
exercise and the rate of oxygen consumption that would exist if oxygen consumption rose imme- 
diately to its steady-state level corresponding to exercise level. 
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Further Information 

A comprehensive treatment of quantitative predictions in exercise physiology is presented in Biomechanics 

and Exercise Physiology by Arthur T. Johnson (John Wiley and Sons, 1991). There are a number 

of errors in the book, but an errata sheet is available from the author. 
P.O. Astrand and K. Rodahl's Textbook of Work Physiology (McGraw-Hill, 1970) contains a great deal of 

exercise physiology and is probably considered to be the standard textbook on the subject. 
Biological Foundations of Biomedical Engineering, edited by J. Kline (Little, Brown, Boston), is a very good 

textbook for physiology written for engineers. 
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High technology has entered our diversions and leisure activities. Sports, exercise, and training are no 
longer just physical activities but include machines and techniques attuned to individual capabilities and 
needs. This chapter will consider several factors related to exercise and training that help in understanding 
human performance. 

Physiological work performance is determined by energy transformation that begin with the process 
of photosynthesis and end with the production of biological work (Fig. 27.1). Energy in the form of 
nuclear transformations is converted to radiant energy, which then transforms the energy from carbon 
dioxide and water into oxygen and glucose through photosynthesis. In plants the glucose can also be 
converted to fats and proteins. Upon ingesting plants or other animals that eat plants, humans convert 
this energy through cellular respiration (the reverse of photosynthesis) to chemical energy in the form 
of adenosine triphosphate (ATP). The endergonic reactions (energy absorbed from the surroundings) 
that produce ATP are coupled with exergonic reactions (energy released to surroundings) that release 
energy from its breakdown to produce chemical and mechanical work in the human body. The steps 
involved in the synthesis and breakdown of carbohydrates, fats, and proteins produce chemical work and 
provide energy for the mechanical work produced from muscular contractions. The purpose of this 
chapter is to provide a brief summary of some factors that can affect mechanical work in humans. 

27.1 Exercise Biomechanics 
Equilibrium 

Any body, including the human body, remains in stable equilibrium if the vectorial sum of all forces and 
torques acting on the body is zero. An unbalanced force results in linear acceleration, and an unbalanced 
torque results in rotational acceleration. Static equilibrium requires that: 
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FIGURE 27.1 Schematic of energy transformations leading to muscular mechanical work. 



F = 



(27.1) 



r=o 



(27.2) 



where F = vectorial forces, N, and T = vectorial torques, N-m. 
Some sport activities, such as wrestling, weightlifting, and 
fencing, require stability, whereas other activities, including run- 
ning, jumping, and diving, cannot be performed unless there is 
managed instability. Shifting body position allows the proper 
control. The mass of the body is distributed as in Table 29.1, and 
center of mass is located at approximately 56% of a person's 
height and midway from side-to-side and front-to-back. The 
center of mass can be made to shift by extending the limbs or 
by bending the torso. 



TABLE 27.1 Fraction of Body Weights 
for Various Parts of the Body 



Body Part 


Fraction 


Head and neck 


0.07 


Trunk 


0.43 


Upper Arms 


0.07 


Forearms and hands 


0.06 


Thighs 


0.23 


Lower legs and feet 


0.14 




1.00 



Muscular Movement 

Mechanical movement results from contraction of muscles that are attached at each end to bones that 
can move relative to each other. The arrangement of this combination is commonly known as a class 3 
lever (Fig. 27.2), where one joint acts as the fulcrum (Fig. 27.3), the other bone acts as the load, and the 
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FIGURE 27.2 A class 3 lever is arranged with the applied force interposed between the fulcrum and the load. Most 
skeletal muscles are arranged in this fashion. 




FIGURE 27.3 The biceps muscle of the arm is arranged as a class 3 lever. The load is located at the hand and the 
fulcrum at the elbow. 

muscle provides the force interposed between fulcrum and load. This arrangement requires that the 
muscle force be greater than the load, sometimes by a very large amount, but the distance through which 
the muscle moves is made very small. These characteristics match muscle capabilities well (muscles can 
produce 7 x 10 5 N/m 2 , but cannot move far). Since the distance is made smaller, the speed of shortening 
of the contracting muscle is also slower than it would be if the arrangement between the force and the 
load were different: 
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(27.3) 



where S 
d 
L,M 



speed, m/s 

distance from fulcrum, m 

denote load and muscle 



Muscular Efficiency 

Efficiency relates external, or physical, work produced to the total chemical energy consumed: 



External work produced 
Chemical energy consumed 



(27.4) 



Muscular efficiencies range from close to zero to about 20-25%. The larger numbers would be obtained 
for leg exercises that involve lifting body weight. In carpentry and foundry work, where both arms and 
legs are used, average mechanical efficiency is approximately 10% [Johnson, 1991]. For finer movements 
that require exquisite control, small muscles are often arranged in antagonistic fashion, that is, the final 
movement is produced as a result of the difference between two or more muscles working against each 
other. In this case, efficiencies approach zero. Isometric muscular contraction, where a force is produced 
but no movement results, has an efficiency of zero. 

Muscles generally are able to exert the greatest force when the velocity of muscle contraction is zero. 
Power produced by this muscle would be zero. When the velocity of muscle contraction is about 8 m/s, 
the force produced by the muscle becomes zero, and the power produced by this muscle again becomes 
zero. Somewhere between, the power produced, and the efficiency, becomes a maximum (Fig. 27.4). 




2 4 6 

VELOCITY OF SHORTENING (m/il 

FIGURE 27.4 Force and power output of a muscle as a function of velocity. (Adapted and used with permission 
from Milsum, 1966) 
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FIGURE 27.5 The gross efficiency for hand cranking or bicycling as a function of the rate of work [Goldman, 1978]. 

The isometric length-tension relationship of a muscle shows that the maximum force developed by a 
muscle is exerted at its resting length (the length of a slightly stretched muscle attached by its tendons 
to the skeleton) and decreases to zero at twice its resting length. Maximum force also decreases to zero 
at the shortest possible muscular length. Since muscular contractile force depends on the length of the 
muscle and length changes during contraction, muscular efficiency is always changing (Fig. 27.5). 

Negative (eccentric) work is produced by a muscle when it maintains a force against an external force 
tending to stretch the muscle. An example of negative work is found as the action of the leg muscle 
during a descent of a flight of stairs. Since the body is being lowered, external work is less than zero. The 
muscles are using physiological energy to control the descent and prevent the body from accumulating 
kinetic energy as it descends. 

Muscular efficiencies for walking downhill approach — 120% [McMahon, 1984]. Since heat produced 
by the muscle is the difference between 100% and the percent efficiency, heat produced by muscles 
walking downhill is about 220% of their energy expenditure. Energy expenditure of muscles undergoing 
negative work is about one-sixth that of a muscle doing positive work [Johnson, 1991], so a leg muscle 
going uphill produces about twice as much heat as a leg muscle going downhill. 

Locomotion 

The act of locomotion involves both positive and negative work. There are four successive stages of a 
walking stride. In the first stage, both feet are on the ground, with one foot ahead of the other. The 
trailing foot pushes forward, and the front foot is pushing backward. In the second stage the trailing foot 
leaves the ground and the front foot applies a braking force. The center of mass of the body begins to 
lift over the front foot. In the third stage the trailing foot is brought forward, and the supporting foot 
applies a vertical force. The body center of mass is at its highest point above the supporting foot. In the 
last stage, the body's center of mass is lowered, and the trailing foot provides an acceleration force. 

This alteration of the raising and lowering of the body center of mass, along with the pushing and 
braking provided by the feet, makes walking a low-efficiency maneuver. Walking has been likened to 
alternately applying the brakes and accelerator while driving a car. Just as the fuel efficiency of the car 
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FIGURE 27.6 Power required for walking, running, and cycling by an adult male. Curves for walking and running 
interact at about 2.3 m/s and show that walking is more efficient below the intersection and running is more efficient 
above. (Redrawn with permission from Alexander, 1984) 

would suffer from this mode of propulsion, so the energy efficiency of walking suffers from the way 
walking is performed. 

There is an optimum speed of walking. Faster than this speed, additional muscular energy is required 
to propel the body forward. Moving slower than the optimal speed requires additional muscular energy 
to retard leg movement. Thus, the optimal speed is related to the rate at which the leg can swing forward. 
Simple analysis of the leg as a physical pendulum shows that the optimal walking speed is related to leg 
length: 



S°< \j L 



(27.5) 



Unlike walking, there is a stage of running during which both feet leave the ground. The center of 
mass of the body does not rise and fall as much during running as during walking, so the efficiency for 
running can be greater than for walking. 

At a speed of about 2.5 m/s, running appears to be more energy-efficient than walking, and the 
transition is usually made between these forms of locomotion (Fig. 27.6). Unlike walking, there does not 
appear to be a functional relationship between speed and leg length, so running power expenditure is 
linearly related to speed alone. 

Why would anyone want to propel the extra weight of a bicycle in addition to body weight? On the 
surface it would appear that cycling would cost more energy than running or walking. However, the 
center of mass of the body does not move vertically as long as the cyclist sits on the seat. Without the 
positive and negative work associated with walking or running, cycling is a much more efficient form of 
locomotion than the other two (Fig. 29.6), and the cost of moving the additional weight of the bicycle 
can easily be supplied. 

Many sports or leisure activities have a biomechanical basis. Understanding of the underlying biome- 
chanical processes can lead to improved performance. Yet, there are limits to performance that cause 
frustrations for competitive athletes. Hence, additional factors are sometimes employed to expand these 
limits. Following is a brief discussion of some of these factors. 
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27.2 Exercise Training 

Many compensatory reactions allow the body to adapt to minor stresses, such as mild exercise, so that 
homeostasis (equilibrium) can be maintained. For example, the increased energy demands of exercise 
stimulate an increase in heart rate, respiration, blood flow, and many other reactions that allow the body 
to maintain homeostasis. As the intensity of exercise increases, it becomes more difficult for compensatory 
mechanisms to maintain homeostasis. After exceeding about 80% of an untrained person's maximal 
capacity, homeostasis can no longer be maintained for more than a few minutes before exhaustion results. 
Regular exercise (training) elevates the level that a single exercise session can be performed before 
disturbing homeostasis. It does this by elevating the maximal physiological capacity for homeostasis so 
that the same amount of exercise may no longer disrupt homeostasis because it is a lower percentage of 
maximal capacity. In addition, training produces specific adaptations during submaximal exercise that 
permit greater and longer amounts of work before losing homeostasis. A good example of this is when 
blood lactate rises with increased intensity of exercise. Prior to training, blood lactate concentration rises 
substantially when the intensity of exercise exceeds about 75% of maximal oxygen consumption (V 02max ). 
Following training, the same intensity of exercise results in a lower concentration of blood lactate, so 
exercise can now be performed at a higher fraction of V 02mii before blood lactate reaches the same level 
as before training [Hurley et al., 1984a]. Thus, exercise training allows an individual to perform a much 
greater amount of work before homeostasis is disturbed to the point at which muscular exhaustion results. 

27.3 Age 

It is well established that the maximal capacity to exercise declines with age. However, many factors that 
change with age also affect an individual's maximal capacity to exercise. These include a gain in body fat, 
a decrease in lean body mass, onset of disease, and a decline in the level of physical activity. For this reason 
it is difficult to determine exactly how much of the loss in capacity to exercise with age can really be 
attributed solely to the effects of aging. V0 2max starts to decline after the age of 25 to 30 years at about 
the rate of 10% per decade in healthy sedentary women and men. This rate of decline is only half as great 
(~5%) when people maintain their physical activity levels as they age. Thus, there is no doubt that maximal 
work capacity declines with advancing age, but regular exercise appears to reduce this decline substantially. 

27.4 Gender 

Cardiovascular fitness and muscular strength are substantially higher in men compared to women. The 
aerobic capacity in men is about 40 to 50% higher than women. In addition, upper body strength is 
approximately 100% higher and lower body strength is about 50% higher in men [Lynch et al., 1998]. 
However, these differences are diminished substantially when body composition is taken into consideration. 
For example, when aerobic capacity is expressed with reference to body mass (ml 2 /kg/min) this difference 
decreases to about 20% and to about 10% when differences in muscle mass are taken into consideration 
[Lynch etal., 1998]. Similar declines in the differences in muscular strength can be demonstrated when 
taking lean body mass differences into consideration. There do not appear to be any differences in respon- 
siveness to training when expressed on a relative basis (% change), but recent evidence suggests that men 
have greater muscle mass gains than women when expressed in absolute terms [Tracey et al., 1988]. 



27.5 Ergogenic Aids 



Anabolic Steroids 

Anabolic steroids are drugs that function similar to the male sex hormone testosterone. Upon binding 
with specific receptor sites, these drugs contribute greatly to male secondary sex characteristics. These 
drugs have been used frequently by a large number of competitive athletes, particularly those involved 
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FIGURE 27.7 Blood viscosity related to blood hematocrit. 

in strength and power sports. Increases in muscular strength, total body mass, and lean body mass have 
been reported [Bhasin et al., 1996]. It has been argued, however, that at least some of the increase in 
total body mass resulting from anabolic steroid use results from an increase in water retention [Casner 
etal., 1971]. Evidence indicates that many of these drugs act by decreasing muscle protein degradation 
rather than by enhancing protein synthesis [Hickson etal., 1990]. Hence, these drugs might be more 
appropriately named "anticatabolic" steroids. There is also some indication that these drugs work by 
increasing aggressive behavior and thereby promotes a greater quantity and quality of training. 

The factors that are responsible for transporting oxygen from the heart to skeletal muscle are enhanced 
following anabolic steroid administration through increases in blood volume, red blood cells (RBC), and 
hemoglobin. This would appear to improve cardiovascular fitness by increasing oxygen transport capacity. 
However, most studies show that anabolic steroids do not significantly increase cardiovascular fitness 
[Johnson et al., 1975]. This may be due to the increased blood viscosity resulting from the hemoconcen- 
tration effects of increasing RBCs and hemoglobin. Increases in blood viscosity may reduce blood flow 
to and from the heart and cause the heart to work harder in order to maintain cardiac output (Fig. 27.7). 

It is clear that the use of anabolic steroids produces adverse effects on the liver and reproductive and 
cardiovascular systems. Effects on the liver include peliosis hepatitis (blood-filled cysts), impaired excre- 
tory function (jaundice), and liver tumors [ACSM, 1984]. Cardiovascular effects include an increase in 
blood pressure, abnormal alterations in cardiac tissue, and abnormal lipoprotein-lipid profiles [Hurley 
etal., 1984fo]. Males can experience a significant reduction in sperm production, testicular size, and 
testosterone and gonadotrophin production, and females often experience a deepening of voice, male 
pattern baldness, enlargement of the clitoris, a reduction in breast size, a disruption in their menstrual 
cycle, and an increase in facial hair. Most of these effects are irreversible even after the drugs are 
discontinued. There are also many psychological effects including an increase in aggressive behavior, an 
increase in anger and hostility, large deviations in mood, and sudden changes in libido. 
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Growth Hormone 

Human growth hormone, also known as somatotropic hormone, is secreted from the pituitary gland 
and is involved in many anabolic processes in the body including normal growth and tissue synthesis. 
Athletes have become more interested in the use of growth hormone (GH) in recent years because of the 
many reports touting its benefits, the increased awareness of the dangers of anabolic steroids, and the 
reduced price due to the increased availability of synthetic forms. The appeal to athletes is that GH 
stimulates amino acid uptake and protein synthesis in skeletal muscle and a degradation of fat in adipose 
tissue. It has been shown to increase fat-free mass and reduce body fat independent of diet and exercise. 
Nevertheless, these benefits do not come without side effects, including acromegaly (enlargement of bony 
structures in the extremities) and impaired glucose tolerance. 

Blood Doping 

This procedure is also called red blood cell (RBC) reinfusion, induced erythrocythemia, or blood boosting. 
It involves withdrawing between 1 and 4 units (1 unit = 450 ml of blood) of a person's blood and then 
separating the plasma (liquid portion) from the RBCs. The plasma portion is immediately reinfused, 
and the packed RBCs are frozen. Each unit of blood is withdrawn over a 3- to 8-week period to prevent 
sudden reduction in hematocrit (RBC concentration). The RBCs are then reinfused 1 to 7 days prior to 
an endurance event. This results in a 10 to 20% increase in RBC and hemoglobin levels as well as an 
increase in total blood volume. It is believed that blood doping enhances oxygen transport to the working 
muscles, and since oxygen transport is considered a limiting factor for increasing V 0lmax , this adaptation 
would appear to enhance cardiovascular endurance performance. Whether blood doping actually 
improves endurance performance depends on the balance between enhanced oxygen transport and 
increased viscosity of blood. The method of storing blood appears to be an important factor in deter- 
mining the potential effects of blood doping on performance. When the proper procedures have been 
used it can increase V 02mai by 5 to 13% and reduce submaximal heart rate and blood lactate during a 
standard exercise test [Sawka et al., 1996]. 

Oxygen Inhalation 

Inhaling high concentrations of oxygen gas mixtures prior to athletic competition or during rest periods 
of sporting events are common observations. This practice is based on the notion that it will increase 
oxygen transport (more oxygen binding to hemoglobin), thereby increasing oxygen delivery to the 
working muscle and resulting in a delay in the onset of fatigue [McArdle et al., 1996]. There are at least 
three problems with this belief. First, hemoglobin is already almost totally saturated with oxygen during 
normal breathing of ambient air at rest (95 to 98%). Hence, even if this practice allowed total oxygen 
saturation of hemoglobin it would only add a small amount of oxygen to the arterial blood leaving the 
lungs (10 ml of extra oxygen for every liter of whole blood). Second, for this practice to result in a delay 
of fatigue one must assume that oxygen transport and delivery is a limiting factor for the specific event 
or sport being performed. This assumption cannot be made for many of the power-type events in which 
oxygen inhalation is often used (e.g., football). Third, any enhancement of oxygen transport from this 
procedure is short-lived. Yet athletes are often observed inhaling oxygen on the sidelines for relatively 
long periods prior to participation. Thus, this practice does not appear to offer much more than psy- 
chological benefits when used prior to athletic competition or during prolonged rest periods. 

However, breathing oxygen during prolonged steady rate exercise results in reduced heart rates, ven- 
tilation, and blood lactate levels. Apparently even small increases in oxygen saturation of hemoglobin 
and some added oxygen dissolved in plasma not bound to hemoglobin results in a substantial increase 
in oxygen availability under conditions that approach hypoxia. It may also increase oxygen diffusion 
capacity across muscle capillaries by elevating the partial pressure of oxygen in the blood. Nevertheless, 
the timing required for oxygen inhalation to be effective makes this procedure impractical to use for 
most athletic events, even if it were considered ethical and legal. 
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Defining Terms 

Anabolic steroids: Synthetic forms of sex hormones (androgens) that possess both synthesizing and 

masculinizing characteristics. 
Biceps muscle: The major muscle group in the upper arm responsible for flexion, joined by tendons 

at the shoulder and elbow. Two collaborating muscles are involved in the biceps. 
Endergonic: An energy-absorbing process. 
Ergogenic aids: Factors that affect human performance. 
Exergonic: An energy-liberating process. 
Hematocrit: The concentration of red blood cells. 
Homeostasis: The tendency to maintain equilibrium. 
Negative work: Work where the applied force and the direction of movement are opposite. Positive 

work involves an applied force in the same direction as the movement. Negative work is known 

as eccentric work, and positive work is known as concentric by biomechanists. 
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Further Information 

A comprehensive treatment of quantitative predictions in exercise physiology and biomechanics is pre- 
sented in Biomechanics and Exercise Physiology by A. T. Johnson (John Wiley and Sons, 1991). 
There are a number of errors in the book, but an errata sheet is available from the author. 

Clear and simple explanations of the biomechanics of sports are given in Physics in Biology and Medicine 
by P. Davidovits (Prentice-Hall, 1975). 

The book by Tom McMahon, Muscles, Reflexes, and Locomotion (Princeton University Press, 1984), is a 
classic not to be missed. 

The article by McArdle and colleagues in Essentials of Exercise Physiology (Williams & Wilkens, 1996) is 
an excellent source for a better understanding of physiological principles of work performance. 

See D.A. Winter's Biomechanics and Motor Control of Human Movement (John Wiley, 1990) for a good 
in-depth explanation of more traditional biomechanics. 
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28.1 Introduction 

The primary function of the heart, to pump blood through the circulatory system, is fundamentally 
mechanical. In this chapter, cardiac function is discussed in the context of the mechanics of the ventricular 
walls from the perspective of the determinants of myocardial stresses and strains (Table 28.1). Many 
physiological, pathophysiological and clinical factors are directly or indirectly affected by myocardial 
stress and strain (Table 28.2). Of course, the factors in Tables 28.1 and 28.2 are interrelated — most of the 
factors affected by myocardial stress and strain in turn affect the stress and strain in the ventricular wall. 
For example, changes in wall stress due to altered hemodynamic load may cause ventricular remodeling, 
which in turn alters geometry, structure, and material properties. This chapter is organized around the 
governing determinants in Table 28.1, but mention is made where appropriate of some of the factors in 
Table 28.2. 

28.2 Cardiac Geometry and Structure 

The mammalian heart consists of four pumping chambers, the left and right atria and ventricles, com- 
municating through the atrioventricular (mitral and tricuspid) valves, which are structurally connected 
by chordae tendineae to papillary muscles that extend from the anterior and posterior aspects of the 
right and left ventricular lumens. The muscular cardiac wall is perfused via the coronary vessels that 
originate at the left and right coronary ostia located in the sinuses of Valsalva immediately distal to the 
aortic valve leaflets. Surrounding the whole heart is the collagenous parietal pericardium that fuses with 
the diaphragm and great vessels. These are the anatomical structures that are most commonly studied 
in the field of cardiac mechanics. Particular emphasis in this chapter is given to the ventricular walls, 
which are the most important for the pumping function of the heart. Most studies of cardiac mechanics 
have focused on the left ventricle, but many of the important conclusions apply equally to the right 
ventricle. 
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TABLE 28.1 Basic Determinants of Myocardial Stress and Strain 

Geometry and Structure 

3D Shape Wall thickness 

Curvature 

Stress-free and unloaded reference configurations 
Tissue structure Muscle fiber architecture 

Connective tissue organization 

Pericardium, epicardium, and endocardium 

Coronary vascular anatomy 

Boundary/Initial Conditions 

Pressure Filling pressure (preload) 

Arterial pressure (afterload) 

Direct and indirect ventricular interactions 

Thoracic and pericardial pressure 

Constraints Effects of inspiration and expiration 

Constraints due to the pericardium and its attachments 
Valves and fibrous valve annuli, chordae tendineae 
Great vessels, lungs 

Material Properties 

Resting or passive Nonlinear finite elasticity 
Quasilinear viscoelasticity 
Anisotropy 
Biphasic poroelasticity 

Active dynamic Activation sequence 

Myofiber isometric and isotonic contractile dynamics 

Sarcomere length and length history 

Cellular calcium kinetics and metabolic energy supply 



TABLE 28.2 Factors Affected by Myocardial Stress and Strain 



Direct Factors 



Electrophysiological Responses 



Development and Morphogenesis 



Vulnerability to Injury 



Remodeling, Repair, and Adaptation 



Progression of Disease 



Regional muscle work 

Myocardial oxygen demand and energetics 

Coronary blood flow 

Action potential duration (QT interval) 

Repolarization (T wave morphology) 

Excitability 

Risk of arrhythmia 

Growth rate 

Cardiac looping and septation 

Valve formation 

Ischemia 

Arrhythmia 

Cell dropout 

Aneurysm rupture 

Eccentric and concentric hypertrophy 

Fibrosis 

Scar formation 

Transition from hypertrophy to failure 

Ventricular dilation 

Infarct expansion 

Response to reperfusion 

Aneurysm formation 
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Inner 


Outer 


Wall Thickness 


Radius (mm) 


Radius (mm) 


Inner Radius 


16 


26 


0.62 


19 


28 


0.47 


22 


30 


0.36 


14 


26 


0.86 


42 


47 


0.12 


1.4 


3.5 


1.50 


3.2 


5.8 


0.81 


24 


32 


0.34 


27 


42 


0.56 


32 


42 


0.33 



TABLE 28.3 Representative Left Ventricular Minor- Axis Dimensions J 

Species Comments 

Dog (21 kg) Unloaded diastole (0 mm Hg) 

Normal diastole (2-12 mm Hg) 

Dilated diastole (24-40 mm Hg) 

Normal systole (1-9 mm Hg EDP) 

Long axis, apex-equator (normal diastole) 
Young rat Unloaded diastole (0 mm Hg) 

Mature rat Unloaded diastole (0 mm Hg) 
Human Normal 

Compensated pressure overload 

Compensated volume overload 

* Dog data from Ross et al. [Ross et al., 1967] and Streeter and Hanna [Streeter and Hanna, 1973]. Human data 
from Grossman et al. [Grossman et al., 1975; Grossman, 1980]. Rat data from unpublished observations in the 
author's laboratory. 

Ventricular Geometry 

From the perspective of engineering mechanics, the ventricles are three-dimensional thick-walled pressure 
vessels with substantial variations in wall thickness and principal curvatures both regionally and tempo- 
rally through the cardiac cycle. The ventricular walls in the normal heart are thickest at the equator and 
base of the left ventricle and thinnest at the left ventricular apex and right ventricular free wall. There 
are also variations in the principal dimensions of the left ventricle with species, age, phase of the cardiac 
cycle, and disease (Table 28.3). But, in general, the ratio of wall thickness to radius is too high to be 
treated accurately by all but the most sophisticated thick- wall shell theories [Taber, 1991]. 

Ventricular geometry has been studied in quantitative detail in the dog heart [Streeter and Hanna, 
1973; Nielsen et al., 1991]. Geometric models have been very useful in the analysis, especially the use of 
confocal and nonconfocal ellipses of revolution to describe the epicardial and endocardial surfaces of the 
left and right ventricular walls (Fig. 28.1). The canine left ventricle is reasonably modeled by a thick 




FIGURE 28.1 Truncated ellipsoid representation of ventricular geometry, showing major left ventricular radius (a), 
minor radius (b), focal length (d), and prolate spheroidal coordinates (^,u,9). 



(D 2000 by CRC Press LLC 



ellipsoid of revolution truncated at the base. The crescentic right ventricle wraps about 180° around the 
heart wall circumferentially and extends longitudinally about two-thirds of the distance from the base 
to the apex. Using a truncated ellipsoidal model, left ventricular geometry in the dog can be defined by 
the major and minor radii of two surfaces, the left ventricular endocardium, and a surface defining the 
free wall epicardium and the septal endocardium of the right ventricle. Streeter and Hanna [1973] 
described the position of the basal plane using a truncation factor, f b , defined as the ratio between the 
longitudinal distances from equator-to-base and equator-to-apex. Hence, the overall longitudinal distance 
from base to apex is ( 1 + f b ) times the major radius of the ellipse. Since variations in f h between diastole 
and systole are relatively small (0.45 to 0.51), they suggested a constant value of 0.5. 

The focal length d of an ellipsoid is defined from the major and minor radii {a and b) by d 1 = a 1 - b 2 , 
and varies only slightly in the dog from endocardium to epicardium between end-diastole (37.3 to 37.9 
mm) and end-systole (37.7 to 37.1 mm) [Streeter and Hanna, 1973]. Hence, within the accuracy that 
the boundaries of the left ventricular wall can be treated as ellipsoids of revolution, the assumption that 
the ellipsoids are confocal appears to be a good one. This has motivated the choice of prolate spheroidal 
(elliptic-hyperbolic-polar) coordinates (A,u,0) as a system for economically representing ventricular 
geometries obtained post-mortem or by noninvasive tomography [Nielsen et al., 1991; Young and Axel, 
1992]. The Cartesian coordinates of a point are given in terms of its prolate spheroidal coordinates by 

x = d cosh A, cos(X, 

x 2 = dsinhA, sin|i cos9, (28.1) 

x, = d sinh X sin|X sinG. 

Here, the focal length d defines a family of coordinate systems that vary from spherical polar when 
d = to cylindrical polar in the limit when d — > oo. A surface of constant transmural coordinate X 
(Fig. 28.1) is an ellipse of revolution with major radius a = d coshA, and minor radius b = d sinhA. In an 
ellipsoidal model with a truncation factor of 0.5, the longitudinal coordinate u varies from zero at the 
apex to 120° at the base. Integrating the Jacobian in prolate spheroidal coordinates gives the volume of 
the wall or cavity: 

d 3 | (sinh 2 A + sin 2 |l)sinh A sin |X d A, d (J. d0 = 

0X < (28.2) 



27Td- 
3 



II — cos |i 2 )cosh 3 X — II — cos 3 \i 2 jcosh A, 



The scaling between heart mass M H and body mass M within or between species is commonly described 
by the allometric formula: 

M H = kM a (28.3) 

Using combined measurements from a variety of mammalian species with M expressed in kilograms, 
the coefficient k is 5.8 g and the power a is close to unity (0.98) [Stahl, 1967]. Within individual species, 
the ratio of heart weight to body weight is somewhat lower in mature rabbits and rats (about 2 g/kg) 
than in humans (5 g/kg) and higher in horses and dogs (8 g/kg) [Rakusan, 1984]. The rate a of heart 
growth with body weight decreases with age in most species but not humans. At birth, left and right 
ventricular weights are similar, but the left ventricle is substantially more massive than the right by 
adulthood. 
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Myofiber Architecture 



Epicardium 



The cardiac ventricles have a complex three-dimen- 
sional muscle fiber architecture (for a comprehensive 
review see [Streeter, 1979]). Although the myocytes are 
relatively short, they are connected such that at any 
point in the normal heart wall there is a clear predom- 
inant fiber axis that is approximately tangent with the 
wall (within 3 to 5° in most regions, except near the 
apex and papillary muscle insertions). Each ventricular 
myocyte is connected via gap junctions at intercalated 
disks to an average of 11.3 neighbors, 5.3 on the sides, 
and 6.0 at the ends [Saffitz etal., 1994]. The classical 
anatomists dissected discrete bundles of fibrous swirls, 
though later investigations showed that the ventricular 
myocardium could be unwrapped by blunt dissection 
into a single continuous muscle "band" [Torrent-Guasp, 
1973]. However, more modern histological techniques 
have shown that in the plane of the wall, the muscle 
fiber angle makes a smooth transmural transition from 
epicardium to endocardium (Fig. 28.2). Similar pat- 
terns have been described for humans, dogs, baboons, 
macaques, pigs, guinea pigs, and rats. In the human or 
dog left ventricle, the muscle fiber angle typically varies 
continuously from about -60° (i.e., 60° clockwise from 
the circumferential axis) at the epicardium to about 
+70° at the endocardium. The rate of change of fiber 
angle is usually greatest at the epicardium, so that cir- 
cumferential (0°) fibers are found in the outer half of 
the wall, and begins to slowly approach the inner third 
near the trabeculata-compacta interface. There are also 
small increases in fiber orientation from end-diastole to 
systole (7 to 19°), with greatest changes at the epicar- 
dium and apex [Streeter et al., 1969]. 

Regional variations in ventricular myofiber orienta- 
tions are generally smooth except at the junction 
between the right ventricular free wall and septum. A 
detailed study in the dog that mapped fiber angles 
throughout the entire right and left ventricles described 
the same general transmural pattern in all regions 
including the septum and right ventricular free wall, but 
with definite regional variations [Nielsen etal., 1991]. 
Transmural differences in fiber angle were about 120 to 
140° in the left ventricular free wall, larger in the septum 
(160 to 180°), and smaller in the right ventricular free 
wall (100 to 120°). A similar study of fiber angle distri- 
butions in the rabbit left and right ventricles has recently been reported [Vetter and McCulloch, 1998]. 
For the most part, fiber angles in the rabbit heart were very similar to those in the dog, except on the 
anterior wall, where average fiber orientations were 20 to 30° counterclockwise of those in the dog. 

The locus of fiber orientations at a given depth in the ventricular wall has a spiral geometry that may 
be modeled as a general helix by simple differential geometry. The position vector x of a point on a helix 




Endocardium 

FIGURE 28.2 Cardiac muscle fiber orientations 
vary continuously through the left ventricular wall 
from a negative angle at the epicardium to near zero 
(circumferential) at the midwall and to increasing 
positive values toward the endocardium. (Micro- 
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inscribed on an ellipsoidal surface that is symmetric about the Xj axis and has major and minor radii, a 
and b, is given by the parametric equation, 



sinf e l + b cost sinwf e 2 + b cost coswf e 3 



(28.4) 



where the parameter is t, and the helix makes tv/4 full turns between apex and equator. A positive w 
defines a left-handed helix with a positive pitch. The fiber angle or helix pitch angle r), varies along the 
arc length: 



sin r\ - 



2 2 . , j2 • 2 . 

a cos t + b sin t 



V(a 2 + b 2 w 2 ) 



cos t+b sin t 



(28.5) 



If another, deformed configuration x is defined in the same way as Eq. (28.4), the fiber segment 
extension ratio ds associated with a change in the ellipsoid geometry [McCulloch et al., 1989] can be 
derived from: 




(28.6) 



Although the traditional notion of discrete myo fiber bundles has been revised in view of the continuous 
transmural variation of muscle fiber angle in the plane of the wall, there is a transverse laminar structure 
in the myocardium that groups fibers together in sheets an average of 4 ± 2 myocytes thick (48 ± 20 urn), 
separated by histologically distinct cleavage planes [Spotnitz et al., 1974; Smaill and Hunter, 1991; LeGrice 
et al., 1995]. LeGrice and colleagues investigated these structures in a detailed morphometric study of 
four dog hearts [LeGrice et al., 1995]. They describe an ordered laminar arrangement of myocytes with 
extensive cleavage planes running approximately radially from endocardium toward epicardium in trans- 
mural section. Like the fibers, the sheets also have a branching pattern with the number of branches 
varying considerably through the wall thickness. 

The fibrous architecture of the myocardium has motivated models of myocardial material symmetry 
as transversely isotropic. The transverse laminae are the first structural evidence for material orthotropy 
and have motivated the development of a model describing the variation of fiber, sheet, and sheet-normal 
axes throughout the ventricular wall [LeGrice et al., 1997]. This has led to the idea that the laminar 
architecture of the ventricular myocardium affects the significant transverse shears [Waldman et al., 1985] 
and myofiber rearrangement [Spotnitz et al., 1974] described in the intact heart during systole. By 
measuring three-dimensional distributions of strain across the wall thickness using biplane radiography 
of radiopaque markers, LeGrice and colleagues [LeGrice et al., 1995] found that the cleavage planes 
coincide closely with the planes of maximum shearing during ejection, and that the consequent reori- 
entation of the myocytes may contribute 50% or more of normal systolic wall thickening. 

A detailed description of the morphogenesis of the muscle fiber system in the developing heart is not 
available but there is evidence of an organized myofiber pattern by day 12 in the fetal mouse heart that 
is similar to that seen at birth (day 20) [McLean et al., 1989]. Abnormalities of cardiac muscle fiber 
patterns have been described in some disease conditions. In hypertrophic cardiomyopathy, which is often 
familial, there is substantial myofiber disarray, typically in the interventricular septum [Maron et al., 
1987; Karlon et al, 1998]. 
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FIGURE 28.3 Schematic representation of cardiac tissue structure showing the association of endomysial and 
perimysial collagen fibers with cardiac myocytes. (Courtesy of Dr. Deidre MacKenna). 



Extracellular Matrix Organization 

The cardiac extracellular matrix consists primarily of the fibrillar collagens, type I (85%) and III (11%), 
synthesized by the cardiac fibroblasts, the most abundant cell type in the heart. Collagen is the major 
structural protein in connective tissues, but only comprises 2 to 5% of the myocardium by weight, 
compared with the myocytes, which make up 90% [Weber, 1989]. The collagen matrix has a hierarchical 
organization (Fig. 28.3), and has been classified according to conventions established for skeletal muscle 
into endomysium, perimysium, epimysium [Caulfield and Borg, 1979; Robinson et al., 1983]. The 
endomysium is associated with individual cells and includes a fine weave surrounding the cell and 
transverse structural connections 120 to 150 nm long connecting adjacent myocytes, with attachments 
localized near the z-line of the sarcomere. The primary purpose of the endomysium is probably to 
maintain registration between adjacent cells. The perimysium groups cells together and includes the 
collagen fibers that wrap bundles of cells into the laminar sheets described above, as well as large coiled 
fibers typically 1 to 3 um in diameter which are composed of smaller collagen fibrils (40 to 50 nm) 
[Robinson et al., 1 988 ] . The helix period of the coiled perimysial fibers is about 20 um and the convolution 
index (ratio of fiber arclength to midline length) is approximately 1.3 in the unloaded state of the ventricle 
[MacKenna et al., 1996; MacKenna et al., 1997]. These perimysial fibers are most likely to be the major 
structural elements of the collagen extracellular matrix though they probably contribute to myocardial 
strain energy by uncoiling rather than stretching [MacKenna et al., 1996]. Finally, a thick epimysial 
collagen sheath surrounds the entire myocardium forming the protective epicardium (visceral pericar- 
dium) and endocardium. 

Collagen content, organization, and ratio of types I to III change with age and in various disease 
conditions including myocardial ischemia and infarction, hypertension, and hypertrophy (Table 28.4). 
Changes in myocardial collagen content and organization coincide with alterations in diastolic myocardial 
stiffness [MacKenna and McCulloch, 1996]. Hence the collagen matrix plays an important role in 
determining the elastic material properties of the resting ventricular myocardium. 
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TABLE 28.4 Changes in Ventricular Collagen Structure and Mechanics With Age and Disease 



Condition 

Pressure 

Overload 

Hypertrophy 



Volume 

Overload 

Hypertrophy 



Acute Ischemia/ 
Stunning 



Chronic 

Myocardial 

Infarction 



Age 



Collagen Morphology 

[Hydroxyproline]: ft-ftftft 
[Medugorac, 1980; Weber etal, 

1988] 
Area fraction: ftftft [Jalil et al., 

1988; Weber et al., 1988] 

[Hydroxyproline]: no 

change-U: [Michel et al., 1986; 

Iimotoetal., 1988] 
Area Fraction: no change 

[Medugorac, 1980; Weber 

etal., 1990] 
[Hydroxyproline]: U Charney, 

1992 #1118] 
Light microscopy: no change/U 

[Whittaker et al., 1991] 
I}*} endomysial fibers [Zhao 

etal., 1987] 
[Hydroxyproline]: ftftft 

[Connelly et al., 1985; Jugdutt 

and Amy, 1986] 
Loss of birefringence 

[Whittaker et al., 1989] 
[Hydroxyproline]: ft-ftfrft 

[Eghbaliet al, 1989;Takahashi 

etal., 1990] collagen fiber 

diameter ft [Eghbali et al., 

1989] 



Types & Crosslinking 

Type III: ft [Mukherjee 

and Sen, 1990] 
Crosslinks: no change 

[Harper etal., 1993] 



Crosslinks: ft [Iimoto 
etal., 1988; Harper 
etal., 1993] 

Type III/I: ft [Iimoto 
etal., 1988] 



Passive Stiffness 



Other 



Chamber: ft-ftft [Jalil Perivascular fibrosis: 



Type III: ft [Jensen et al., 
1990] 



Type III/I: 11 [Medugorac 

and Jacob, 1983] 
Crosslinks: ft 

[Medugorac and Jacob, 

1983] 



et al., 1988; Weber 
et al., 1988] 
Tissue: ftft [Omens 
et al., 1995] 

Chamber: V [Corin 

etal., 1991] 
Tissue: no change/ft 

[Corin etal., 1991] 



<S early [Forrester 

et al., 1972] 
ft late [Pirzada et al., 

1976] 



Chamber: ft early 
[Pfeffer et al., 1991] 

Chamber: U late 
[Pfeffer et al., 1991] 

Chamber: ft [Borg 

etal., 1981] 
Papillary muscle: ft 

[Anversa et al., 1989] 



ftft [Weber et al., 

1988] 
Focal scarring: [Silver 

et al., 1990; Contard 

etal., 1991] 
Parallel changes 



Collagenase activity: ft 
[Takahashi et al., 
1990; Charney et al, 
1992] 



Organization: ft-ftftft 
[Whittaker et al., 
1991; Holmes et al., 
1994] 

Light microscopy: fibril 
diameter ft [Eghbali 
et al., 1989] 



28.3 Cardiac Pump Function 
Ventricular Hemodynamics 

The most basic mechanical parameters of the cardiac pump are blood pressure and volume fiowrate, 
especially in the major pumping chambers, the ventricles. From the point of view of wall mechanics, the 
ventricular pressure is the most important boundary condition. Schematic representations of the time- 
courses of pressure and volume in the left ventricle are shown in Fig. 28.4. Ventricular filling immediately 
following mitral valve opening (MVO) is initially rapid because the ventricle produces a diastolic suction 
as the relaxing myocardium recoils elastically from its compressed systolic configuration below the resting 
chamber volume. The later slow phase of ventricular filling (diastasis) is followed finally by atrial con- 
traction. The deceleration of the inflowing blood reverses the pressure gradient across the valve leaflets 
and causes them to close (MVC). Valve closure may not, however, be completely passive, because the 
atrial side of the mitral valve leaflets, which unlike the pulmonic and aortic valves are cardiac in embry- 
ological origin, have muscle and nerve cells and are electrically coupled to atrial conduction [Sonnenblick 
etal., 1967]. 

Ventricular contraction is initiated by excitation, which is almost synchronous (the duration of the 
QRS complex of the ECG is only about 60 msec in the normal adult) and begins about 0.1 to 0.2 sec 
after atrial depolarization. Pressure rises rapidly during the isovolumic contraction phase (about 50 msec 
in adult humans), and the aortic valve opens (AVO) when the developed pressure exceeds the aortic 
pressure (afterload). Most of the cardiac output is ejected within the first quarter of the ejection phase 
before the pressure has peaked. The aortic valve closes (AVC) 20 to 30 msec after AVO when the ventricular 
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FIGURE 28.4 Left ventricular pressure, aortic pressure, and left ventricular volume during a single cardiac cycle 
showing the times of mitral valve closure (MVC), aortic valve opening (AVO), aortic valve closure (AVC), and mitral 
valve opening (MVO). 

pressure falls below the aortic pressure owing to the deceleration of the ejecting blood. The dichrotic 
notch, a characteristic feature of the aortic pressure waveform and a useful marker of aortic valve closure, 
is caused by pulse wave reflections in the aorta. Since the pulmonary artery pressure against which the 
right ventricle pumps is much lower than the aortic pressure, the pulmonic valve opens before and closes 
after the aortic valve. The ventricular pressure falls during isovolumic relaxation, and the cycle continues. 
The rate of pressure decay from the value P at the time of the peak rate of pressure falls until the mitral 
valve opening is commonly characterized by a single exponential time constant, i.e. 



P(t) 



P e- t/T 



+ P„ 



(28.7) 



where P w is the (negative) baseline pressure to which the ventricle would eventually relax if MVO were 
prevented [Yellin et al., 1986]. In dogs and humans, x is normally about 40 msec, but it is increased by 
various factors including elevated afterload, asynchronous contraction associated with abnormal activa- 
tion sequence or regional dysfunction, and slowed cytosolic calcium reuptake to the sarcoplasmic retic- 
ulum associated with cardiac hypertrophy and failure. The pressure and volume curves for the right 
ventricle look essentially the same, however the right ventricular and pulmonary artery pressures are 
only about a fifth of the corresponding pressures on the left side of the heart. The intraventricular septum 
separates the right and left ventricles and can transmit forces from one to the other. An increase in right 
ventricular volume may increase the left ventricular pressure by deformation of the septum. This direct 
interaction is most significant during filling [Janicki and Weber, 1980]. 

The phases of the cardiac cycle are customarily divided into systole and diastole. The end of dias- 
tole — the start of systole — is generally defined as the time of mitral valve closure. Mechanical end-systole 
is usually defined as the end of ejection, but Brutsaert and colleagues proposed extending systole until 
the onset of diastasis (see the review [Brutsaert and Sys, 1989]) since there remains considerable myo- 
filament interaction and active tension during relaxation. The distinction is important from the point 
of view of cardiac muscle mechanics: the myocardium is still active for much of diastole and may never 
be fully relaxed at sufficiently high heart rates (over 150 beats per minute). Here, we will retain the 
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traditional definition of diastole, but consider the ventricular myocardium to be "passive" or "resting" 
only in the final slow-filling stage of diastole. 

Ventricular Pressure-Volume Relations and Energetics 

A useful alternative to Fig. 28.4 for displaying ventricular pressure and volume changes is the pressure- 
volume loop shown in Fig. 28.5a. During the last 20 years, the ventricular pressure-volume relationship 
has been explored extensively, particularly by Sagawa, Suga et al. [1988], who wrote a comprehensive 
book on the approach. The isovolumic phases of the cardiac cycle can be recognized as the vertical 
segments of the loop, the lower limb represents ventricular filling, and the upper segment is the ejection 
phase. The difference on the horizontal axis between the vertical isovolumic segments is the stroke volume, 
which expressed as a fraction of the end-diastolic volume is the ejection fraction. The effects of altered 
loading on the ventricular pressure-volume relation have been studied in many preparations, but the 
best controlled experiments have used the isolated cross-circulated canine heart in which the ventricle 
fills and ejects against a computer-controlled volume servo-pump. 
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FIGURE 28.5 Schematic diagram of left ventricular pressure-volume loops: (a) End-systolic pressure-volume rela- 
tion (ESPVR), end-diastolic pressure volume relation (EDPVR) and stroke work. The three P-V loops show the 
effects of changes in preload and afterload. (b) Time-varying elastance approximation of ventricular pump function 
(see text). 
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Changes in the filling pressure of the ventricle (preload) move the end-diastolic point along the unique 
end-diastolic pressure-volume relation (EDPVR), which represents the passive filling mechanics of the 
chamber that are determined primarily by the thick- walled geometry and nonlinear elasticity of the resting 
ventricular wall. Alternatively, if the afterload seen by the left ventricle is increased, stroke volume decreases 
in a predictable manner. The locus of end-ejection points (AVC) forms the end-systolic pressure-volume 
relation (ESPVR), which is approximately linear in a variety of conditions and also largely independent of 
the ventricular load history. Hence, the ESPVR is almost the same for isovolumic beats as for ejecting beats, 
although consistent effects of ejection history have been well characterized [Hunter, 1989]. Connecting 
pressure-volume points at corresponding times in the cardiac cycle also results in a relatively linear rela- 
tionship throughout systole with the intercept on the volume axis V remaining nearly constant (Fig. 28.5b). 
This leads to the valuable approximation that the ventricular volume V(t) at any instance during systole is 
simply proportional to the instantaneous pressure P{t) through a time-varying elastance E{t): 



p(t) = E(t){v(t)-V }. (28.8) 



The maximum elastance E max , the slope of the ESPVR, has acquired considerable significance as an 
index of cardiac contractility that is independent of ventricular loading conditions. As the inotropic state 
of the myocardium increases, for example with catecholamine infusion, E 11MX increases, and with a negative 
inotropic effect such as a reduction in coronary artery pressure, it decreases. 

The area of the ventricular pressure-volume loop is the external work (EW) performed by the myo- 
cardium on the ejecting blood: 



H) 



EW= P[t)dV (28.9) 



Plotting this stroke work against a suitable measure of preload gives a ventricular function curve, which 
illustrates the single most important intrinsic mechanical property of the heart pump. In 1914, Patterson 
and Starling [Patterson and Starling, 1914] performed detailed experiments on the canine heart-lung 
preparation, and Starling summarized their results with his famous "Law of the Heart", which states that 
the work output of the heart increases with ventricular filling. The so-called Frank-Starling mechanism 
is now well recognized to be an intrinsic mechanical property of cardiac muscle (see Section 28.4). 

External stroke work is closely related to cardiac energy utilization. Since myocardial contraction is fueled 
by ATP, 90 to 95% of which is normally produced by oxidative phosphorylation, cardiac energy consumption 
is often studied in terms of myocardial oxygen consumption, V0 2 (ml Oj.g^'.beat -1 ). Since energy is also 
expended during non- working contractions, Suga and colleagues [Suga etal., 1981] defined the pressure- 
volume area PVA (J.g^'.beat -1 ) as the loop area (external stroke work) plus the end-systolic potential energy 
(internal work) which is the area under the ESPVR left of the isovolumic relaxation line (Fig. 28.5a), 

PVA=EW + PE. (28.10) 

The PVA has strong linear correlation with V0 2 independent of ejection history. Equation (28.11) has 
typical values for the dog heart: 

V0 2 = 0.12(PVA) + 2.0 x 10- 4 . (28.11) 

The intercept represents the sum of the oxygen consumption for basal metabolism and the energy 
associated with activation of the contractile apparatus, which is primarily used to cycle intracellular Ca 2+ 
for excitation-contraction coupling [Suga etal., 1981]. The reciprocal of the slope is the contractile 
efficiency [Suga and Goto, 1991; Suga et al., 1993]. The V0 2 -PVA relation shifts its elevation but not its 
slope with increments in E max with most positive and negative inotropic interventions [Suga et al., 1988; 
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Suga, 1990; Suga and Goto, 1991; Zhao et al., 1993; Namba et al„ 1994]. However, ischemic-reperfused 
viable but "stunned" myocardium has a smaller 2 cost of PVA [Ohgoshi et al., 1991]. 

Although the PVA approach has also been useful in many settings, it is fundamentally phenomeno- 
logical. Because the time-varying elastance assumptions ignores the well-documented load-history 
dependence of cardiac muscle tension [Guccione and McCulloch, 1993; ter Keurs and de Tombe, 1993; 
Burkhoff et al., 1995], theoretical analyses that attempt to reconcile PVA with crossbridge mechanoen- 
ergetics [Taylor et al., 1993] are usually based on isometric or isotonic contractions. So that regional 
oxygen consumption in the intact heart can be related to myofiber biophysics, regional variations on the 
pressure-volume area have been proposed, such as the tension-area [Goto et al., 1993], normalization of 
E max [Sugawara et al., 1995], and the fiber stress-strain area [Delhaas et al., 1994]. 

In mammals, there are characteristic variations in cardiac function with heart size. In the power law 
relation for heart rate as a function of body mass [analogous to Eq. (28.3)], the coefficient k is 
241 beats. min -1 and the power a is -0.25 [Stahl, 1967]. In the smallest mammals, like soricine shrews 
that weigh only a few grams, maximum heart rates exceeding 1000 beats. min -1 have been measured 
[Vornanen, 1992]. Ventricular cavity-volume scales linearly with heart weight, and ejection fraction and 
blood pressure are reasonably invariant from rats to horses. Hence, stroke work also scales directly with 
heart size [Holt et al., 1962], and thus work rate and energy consumption would be expected to increase 
with decreased body size in the same manner as heart rate. However, careful studies have demonstrated 
only a twofold increase in myocardial heat production as body mass decreases in mammals ranging from 
humans to rats, despite a 4.6-fold increase in heart rate [Loiselle and Gibbs, 1979]. This suggests that 
cardiac energy expenditure does not scale in proportion to heart rate and that cardiac metabolism is a 
lower proportion of total body metabolism in the smaller species. 

The primary determinants of the end-diastolic pressure-volume relation (EDPVR) are the material 
properties of resting myocardium, the chamber dimensions and wall thickness, and the boundary con- 
ditions at the epicardium, endocardium, and valve annulus [Gilbert and Glantz, 1989]. The EDPVR has 
been approximated by an exponential function of volume (see for example, Chapter 9 in Gaasch and 
LeWinter [Gaasch and LeWinter, 1994]), though a cubic polynomial also works well. Therefore, the 
passive chamber stiffness dP/dV is approximately proportional to the filling pressure. Important influ- 
ences on the EDPVR include the extent of relaxation, ventricular interaction and pericardial constraints, 
and coronary vascular engorgement. The material properties and boundary conditions in the septum 
are important since they determine how the septum deforms [Glantz et al., 1978; Glantz and Parmley, 
1978]. Through septal interaction, the end-diastolic pressure-volume relationship of the left ventricle 
may be directly affected by changes in the hemodynamic loading conditions of the right ventricle. The 
ventricles also interact indirectly since the output of the right ventricle is returned as the input to the 
left ventricle via the pulmonary circulation. Slinker and Glantz [1986], using pulmonary artery and venae 
caval occlusions to produce direct (immediate) and indirect (delayed) interaction transients, concluded 
that the direct interaction is about half as significant as the indirect coupling. The pericardium provides 
a low friction mechanical enclosure for the beating heart that constrains ventricular overextension 
[Mirsky and Rankin, 1979]. Since the pericardium has stiffer elastic properties than the ventricles [Lee 
et al., 1987], it contributes to direct ventricular interactions. The pericardium also augments the mechan- 
ical coupling between the atria and ventricles [Maruyama et al., 1982]. Increasing coronary perfusion 
pressure has been seen to increase the slope of the diastolic pressure-volume relation (an "erectile" effect) 
[Salisbury et al, 1960; May-Newman et al., 1994]. 

28.4 Myocardial Material Properties 
Muscle Contractile Properties 

Cardiac muscle mechanics testing is far more difficult than skeletal muscle testing mainly owing to the 
lack of ideal test specimens like the long single fiber preparations that have been so valuable for studying 
the mechanisms of skeletal muscle mechanics. Moreover, under physiological conditions, cardiac muscle 
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FIGURE 28.6 Cardiac muscle isometric twitch tension generated by a model of rat cardiac contraction (courtesy 
of Dr. Julius Guccione): (a) Developed twitch tension as a function of time and sarcomere length; (b) Peak isometric 
twitch tension vs. sarcomere length for low and high calcium concentration. 

cannot be stimulated to produce sustained tetanic contractions due to the absolute refractory period of 
the myocyte cell membrane. Cardiac muscle also exhibits a mechanical property analogous to the relative 
refractory period of excitation. After a single isometric contraction, some recovery time is required before 
another contraction of equal amplitude can be activated. The time constant for this mechanical restitution 
property of cardiac muscle is about 1 sec [Bers, 1991]. 

Unlike skeletal muscle, in which maximal active force generation occurs at a sarcomere length that 
optimizes myofilament overlap ( ~2. 1 um), the isometric twitch tension developed by isolated cardiac muscle 
continues to rise with increased sarcomere length in the physiological range (1.6 to 2.4 um) (Fig. 28.6a). 
Early evidence for a descending limb of the cardiac muscle isometric length-tension curve was found to be 
caused by shortening in the central region of the isolated muscle at the expense of stretching at the damaged 
ends where the specimen was tethered to the test apparatus. If muscle length is controlled so that sarcomere 
length in the undamaged part of the muscle is indeed constant, or if the developed tension is plotted against 
the instantaneous sarcomere length rather than the muscle length, the descending limb is eliminated [ter 
Keurs etal., 1980]. Thus, the increase with chamber volume of end-systolic pressure and stroke work is 
reflected in isolated muscles as a monotonic increase in peak isometric tension with sarcomere length 
(Fig. 28.6b). Note that the active tension shown in Fig. 28.6 is the total tension minus the resting tension 
which, unlike in skeletal muscle, becomes very significant at sarcomere lengths over 2.3 um. The increase 
in slope of the ESPVR associated with increased contractility is mirrored by the effects of increased calcium 
concentration in the length-tension relation. The duration, as well as the tension developed in the active 
cardiac twitch, also increases substantially with sarcomere length (Fig. 28.6a). 

The relationship between cytosolic calcium concentration and isometric muscle tension has mostly 
been investigated in muscle preparations in which the sarcolemma has been chemically permeabilized. 
Because there is evidence that this chemical "skinning" alters the calcium sensitivity of myofilament 
interaction, recent studies have also investigated myofilament calcium sensitivity in intact muscles teta- 
nized by high frequency stimulation in the presence of a compound such as ryanodine that opens calcium 
release sites in the sarcoplasmic reticulum. Intracellular calcium concentration was estimated using 
calcium-sensitive optical indicators such as Fura. The myofilaments are activated in a graded manner by 
microMolar concentrations of calcium, which binds to troponin-C according to a sigmoidal relation 
[Riiegg, 1988]. Half-maximal tension in cardiac muscle is developed at intracellular calcium concentra- 
tions of 10~ 6 to 10~ 5 M (the C 50 ) depending on factors such as species and temperature [Bers, 1991]. 
Hence, relative isometric tension T /T max may be modeled using [Tozeren, 1985; Hunter et al., IS 
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The Hill coefficient, n, governs the steepness of the sigmoidal curve. A wide variety of values have 
been reported but most have been in the range 3 to 6 [Kentish et al., 1986; Yue et al., 1986; Gao et al., 
1994; Backx et al., 1995]. The steepness of the isometric length-tension relation (Fig. 28.6b), compared 
with that of skeletal muscle is due to length-dependent calcium sensitivity. That is, the C 50 (M) and n 
both change with sarcomere length, L (um). Hunter et al. (1998) used the following approximations to 
fit the data of Kentish et al. (1986) from rat right ventricular trabeculae: 



n ■■ 



= 4.25{l + 1.95(l/l ref -l)}, pC 50 =-log 10 C 50 = 5.33{l + 0.3l(l/L ref -l)} (28.13) 



where the reference sarcomere length L ref was taken to be 2.0 um. 

The isotonic force-velocity relation of cardiac muscle is similar to that of skeletal muscle, and A.V. 
Hill's well-known hyperbolic relation is a good approximation except at larger forces greater than about 
85% of the isometric value. The maximal (unloaded) velocity of shortening is essentially independent 
of preload, but does change with time during the cardiac twitch and is affected by factors that affect 
contractile ATPase activity and hence crossbridge cycling rates. De Tombe and colleagues [de Tombe and 
ter Keurs, 1992] using sarcomere length-controlled isovelocity release experiments found that viscous 
forces impose a significant internal load opposing sarcomere shortening. If the isotonic shortening 
response is adjusted for the confounding effects of passive viscoelasticity, the underlying crossbridge 
force-velocity relation is found to be linear. 

Cardiac muscle contraction also exhibits other significant length-history-dependent properties. An 
important example is "deactivation" associated with length transients. The isometric twitch tension 
redevelops following a brief length transient that dissociates crossbridges that reach the original isometric 
value when the transient is imposed early in the twitch before the peak tension is reached. But following 
transients applied at times after the peak twitch tension has occurred, the fraction of tension redeveloped 
declines progressively since the activator calcium has fallen to levels below that necessary for all cross- 
bridges to reattach [ter Keurs et al., 1980]. 

There have been many model formulations of cardiac muscle contractile mechanics, too numerous to 
summarize here. In essence they may be grouped into three categories. Time-Varying Elastance models 
include the essential dependence of cardiac active force development on muscle length and time. These 
models would seem to be well suited to the continuum analysis of whole heart mechanics [Arts et al., 
1979; Chadwick, 1982; Taber, 1991] by virtue of the success of the time-varying elastance concept of 
ventricular function (see section 28.3 above). In Hill models, the active fiber stress development is mod- 
ified by shortening or lengthening according to the force-velocity relation, so that fiber tension is reduced 
by increased shortening velocity [Arts et al., 1982; Nevo and Lanir, 1989]. Fully History-Dependent models 
are more complex and are generally based on A.F. Huxley's crossbridge theory [Panerai, 1980; Guccione 
et al., 1993; Landesberg and Sideman, 1994; Landesberg et al., 1996]. A statistical approach known as the 
Distribution Moment model has also been shown to provide an excellent approximation to the crossbridge 
theory [Ma and Zahalak, 1991]. An alternative, more phenomenological approach, is Hunter's Fading 
Memory Theory, which captures the complete length-history dependence of cardiac muscle contraction 
without requiring all the biophysical complexity of crossbridge models [Hunter et al., 1998]. The appro- 
priate choice of model will depend on the purpose of the analysis. For many models of global ventricular 
function, a time-varying elastance model will suffice, but for an analysis of sarcomere dynamics in isolated 
muscle or the ejecting heart, a history-dependent analysis is more appropriate. 

Although Hill's basic assumption that resting and active muscle fiber tension are additive is axiomatic 
in one-dimensional tests of isolated cardiac mechanics, there remains little experimental information on 
how the passive and active material properties of myocardium superpose in two dimensions or three. 
The simplest and most common assumption is that active stress is strictly one-dimensional and adds to 
the fiber component of the three-dimensional passive stress. However, even this addition will indirectly 
affect all the other components of the stress response, since myocardial elastic deformations are finite, 
nonlinear, and approximately isochoric (volume conserving). In an interesting and important new 
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development, biaxial testing of tetanized and barium-contracted ventricular myocardium has shown that 
developed systolic stress also has a large component in directions transverse to the mean myofiber axis 
that can exceed 50% of the axial fiber component [Lin and Yin, 1998]. The magnitude of this transverse 
active stress depends significantly on the biaxial loading conditions. Moreover, evidence from osmotic 
swelling and other studies suggests that transverse strain can affect contractile tension development along 
the fiber axis by altering myofibril lattice spacing [Schoenberg, 1980; Zahalak, 1996]. The mechanisms 
of transverse active stress development remain unclear but two possible contributors are the geometry 
of the crossbridge head itself which is oriented oblique to the myofilament axis [Schoenberg, 1980], and 
the dispersion of myofiber orientation, which tends to be distributed in intact myocardium with a 
standard deviation of about 12° [Karlon et al., 1998]. 

Resting Myocardial Properties 

Since, by the Frank-Starling mechanism, end-diastolic volume directly affects systolic ventricular work, 
the mechanics of resting myocardium also have fundamental physiological significance. Most biome- 
chanical studies of passive myocardial properties have been conducted in isolated, arrested whole heart 
or tissue preparations. Passive cardiac muscle exhibits most of the mechanical properties characteristic 
of soft tissues in general [Fung, 1993]. In cyclic uniaxial loading and unloading, the stress-strain rela- 
tionship is nonlinear with small but significant hysteresis. Depending on the preparation used, resting 
cardiac muscle typically requires from 2 to 10 repeated loading cycles to achieve a reproducible (precon- 
ditioned) response. Intact cardiac muscle experiences finite deformations during the normal cardiac 
cycle, with maximum Lagrangian strains (which are generally radial and endocardial) that may easily 
exceed 0.5 in magnitude. Hence, the classical linear theory of elasticity is quite inappropriate for resting 
myocardial mechanics. The hysteresis of the tissue is consistent with a viscoelastic response, which is 
undoubtedly related to the substantial water content of the myocardium (about 80% by mass). Changes 
in water content, such as edema, can cause substantial alterations in the passive stiffness and viscoelastic 
properties of myocardium. The viscoelasticity of passive cardiac muscle has been characterized in creep 
and relaxation studies of papillary muscle from cat and rabbit. In both species, the tensile stress in 
response to a step in strain relaxes 30 to 40% in the first 10 sec [Pinto and Patitucci, 1977; Pinto and 
Patitucci, 1980]. The relaxation curves exhibit a short exponential time constant (<0.02 sec) and a long 
one (about 1000 sec), and are largely independent of the strain magnitude, which supports the approx- 
imation that myocardial viscoelasticity is quasilinear. Myocardial creep under isotonic loading is 2 to 3% 
of the original length after 100 sec of isotonic loading and is also quasilinear with an exponential 
timecourse. There is also evidence that passive ventricular muscle exhibits other anelastic properties such 
as maximum strain-dependent "strain softening" [Emery et al., 1997; Emery et al., 1997], a well-known 
property in elastomers first described by Mullins [Mullins, 1947]. 

Since the hysteresis of passive cardiac muscle is small and only weakly affected by changes in strain 
rate, the assumption of pseudoelasticity [Fung, 1993] is often appropriate. That is, the resting myocardium 
is considered to be a finite elastic material with different elastic properties in loading versus unloading. 
Although various preparations have been used to study resting myocardial elasticity, the most detailed 
and complete information has come from biaxial and multiaxial tests of isolated sheets of cardiac tissue, 
mainly from the dog [Demer and Yin, 1983; Halperin et al., 1987; Humphrey et al., 1990]. These exper- 
iments have shown that the arrested myocardium exhibits significant anisotropy with substantially greater 
stiffness in the muscle fiber direction than transversely. In equibiaxial tests of muscle sheets cut from 
planes parallel to the ventricular wall, fiber stress was greater than the transverse stress (Fig. 28.7) by an 
average factor of close to 2.0 [Yin et al., 1987]. Moreover, as suggested by the structural organization of 
the myocardium described in Section 28.2, there may be also be significant anisotropy in the plane of 
the tissue transverse to the fiber axis. 

The biaxial stress-strain properties of passive myocardium display some heterogeneity. Novak et al. 
[1994] measured regional variations of biaxial mechanics in the canine left ventricle. Specimens from 
the inner and outer thirds of the left ventricular free wall were stiffer than those from the midwall and 
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FIGURE 28.7 Representative stress-strain curves for passive rat myocardium computed using Eqs. (28.17) and 
(28.19). Fiber and crossiiber stress are shown for equibiaxial strain (courtesy of Dr. Jeffrey Omens). 

interventricular septum, but the degree of anisotropy was similar in each region. Significant species 
variations in myocardial stiffness have also been described. Using measurements of two-dimensional 
regional strain during left ventricular inflation in the isolated whole heart, a parameter optimization 
approach showed that canine cardiac tissue was several times stiffer than that of the rat, though the 
nonlinearity and anisotropy were similar [Omens et al., 1993]. Biaxial testing of the collagenous parietal 
pericardium and epicardium have shown that these tissues have distinctly different properties than the 
myocardium, being very compliant and isotropic at low biaxial strains (<0.1 to 0.15), but rapidly 
becoming very stiff and anisotropic as the strain is increased [Lee et al., 1987; Humphrey et al., 1990]. 

Various constitutive models have been proposed for the elasticity of passive cardiac tissues. Because 
of the large deformations and nonlinearity of these materials, the most useful framework has been 
provided by the pseudostrain-energy formulation for hyperelasticity. For a detailed review of the material 
properties of passive myocardium and approaches to constitutive modeling, see Chapters 1-6 of Glass 
et al. [1991]. In hyperelasticity, the components of the stress 1 are obtained from the strain energy Was 
a function of the Lagrangian (Green's) strain E RS . 

The myocardium is generally assumed to be an incompressible material, which is a good approximation 
in the isolated tissue, although in the intact heart significant redistribution of tissue volume is sometimes 
associated with phasic changes in regional coronary blood volume. Incompressibility is included as a 
kinematic constraint in the finite elasticity analysis, which introduces a new pressure variable that is 
added as a Lagrange multiplier in the strain energy. The examples that follow are various strain-energy 
functions, with representative parameter values (for Win kPa, i.e., mj.mt 1 ), that have been suggested 
for cardiac tissues. For the two-dimensional properties of canine myocardium, Yin and colleagues [Yin 
et al., 1987] obtained reasonable fits to experimental data with an exponential function, 
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where E u is the fiber strain and E 22 is the crossfiber in-plane strain. Humphrey and Yin [1987] proposed 
a three-dimensional form for W as the sum of an isotropic exponential function of the first principal 
invariant I, of the right Cauchy-Green deformation tensor and another exponential function of the fiber 
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The isotropic part of this expression has also been used to model the myocardium of the embryonic 
chick heart during the ventricular looping stages, with coefficients of 0.02 kPa during diastole and 0.78 kPa 
at end-systole, and exponent parameters of 1.1 and 0.85, respectively [Lin and Taber, 1994]. Another, 
related transversely isotropic strain-energy function was used by Guccione et al. [1991] and Omens et al. 
[1991] to model material properties in the isolated mature rat and dog hearts: 
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where, in the dog 
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and, in the rat 
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(28.18) 



In Eqs. (28.17) and (28.18), normal and shear strain components involving the radial (x 3 ) axis are 
included. Humphrey and colleagues [Humphrey et al., 1990] determined a new polynomial form directly 
from biaxial tests. Novak et al. [1994] gave representative coefficients for canine myocardium from three 
layers of the left ventricular free wall. The outer third follows as 

W = 4.8(a. f -i) +3.4(X F -l) +0.77(l 1 -3)-6.l(l 1 -3)(A. F -l) + 6.2(l 1 -3) . (28.19) 
The midwall region follows as 

W = 5.3(a, f -i) +7.5(?i F -l) +0.43(l 1 -3)-7.7(l 1 -3)(A, F -l)+5.6(l 1 -3) . (28.20) 
The inner layer of the wall follows as 

W = 0.5l(A. F -l) +27.6(X F -l) +0.74(l 1 -3)-7.3(l 1 -3)(A, F -l)+7.o(l 1 -3) . (28.21) 

A power law strain-energy function expressed in terms of circumferential, longitudinal, and transmural 
extension ratios (X lt X 2 , and A, 3 ) was used [Gupta et al., 1994] to describe the biaxial properties of sheep 
myocardium two weeks after experimental myocardial infarction, in the scarred infarct region: 
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and in the remote, non-infarcted tissue: 



W = 0.11 
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Finally, based on the observation that resting stiffness rises steeply at strains that extend coiled collagen 
fibers to the limit of uncoiling, Hunter and colleagues have proposed a pole-zero constitutive relation in 
which the stresses rise asymptotically as the strain approaches a limiting elastic strain [Hunter et al., 1998] . 

The strain in the constitutive equation, must generally be referred to the stress-free state of the tissue. 
However, the unloaded state of the passive left ventricle is not stress-free; residual stress exists in the 
intact, unloaded myocardium, as shown by Omens and Fung [1990]. Cross-sectional equatorial rings 
from potassium-arrested rat hearts spring open elastically when the left ventricular wall is resected 
radially. The average opening angle of the resulting curved arc is 45 + 10° in the rat. Subsequent radial 
cuts produce no further change. Hence, a slice with one radial cut is considered to be stress free, and 
there is a nonuniform distribution of residual strain across the intact wall, being compressive at the 
endocardium and tensile at the epicardium, with some regional differences. Stress analyses of the diastolic 
left ventricle show that residual stress acts to minimize the endocardial stress concentrations that would 
otherwise be associated with diastolic loading [Guccione et al., 1991]. An important physiological con- 
sequence of residual stress is that sarcomere length is nonuniform in the unloaded resting heart. Rodriguez 
et al. [1993] showed that sarcomere length is about 0.13 um greater at epicardium than endocardium in 
the unloaded rat heart, and this gradient vanishes when residual stress is relieved. Three-dimensional 
studies have also revealed the presence of substantial transverse residual shear strains [Costa et al., 1997]. 
Residual stress and strain may have an important relationship to cardiac growth and remodeling. The- 
oretical studies have shown that residual stress in tissues can arise from growth fields that are kinematically 
incompatible [Skalak et al., 1982; Rodriguez et al., 1994]. 

28.5 Regional Ventricular Mechanics: Stress and Strain 

Although ventricular pressures and volumes are valuable for assessing the global pumping performance 
of the heart, myocardial stress and strain distributions are need to characterize regional ventricular 
function, especially in pathological conditions, such as myocardial ischemia and infarction, where pro- 
found localized changes may occur. The measurement of stress in the intact myocardium involves 
resolving the local forces acting on defined planes in the heart wall. Attempts to measure local forces 
[Feigl et al., 1967; Huisman et al., 1980] have had limited success because of the large deformations of 
the myocardium and the uncertain nature of the mechanical coupling between the transducer elements 
and the tissue. Efforts to measure intramyocardial pressures using miniature implanted transducers have 
been more successful but have also raised controversy over the extent to which they accurately represent 
changes in interstitial fluid pressure. In all cases, these methods provide an incomplete description of 
three-dimensional wall stress distributions. Therefore, the most common approach for estimating myo- 
cardial stress distributions is the use of mathematical models based on the laws of continuum mechanics 
[Hunter and Smaill, 1989]. Although there is not room to review these analyses here, the important 
elements of such models are the geometry and structure, boundary conditions and material properties, 
described in the foregoing sections. An excellent review of ventricular wall stress analysis is given by Yin 
[1981]. The most versatile and powerful method for ventricular stress analysis is the finite element 
method, which has been used in cardiac mechanics for over 20 years [Yin, 1985]. However, models must 
also be validated with experimental measurements. Since the measurement of myocardial stresses is not 
yet reliable, the best experimental data for model validation are measurements of strains in the ventricular 
wall. 

The earliest myocardial strain gauges were mercury-in-rubber transducers sutured to the epicardium. 
Today, local segment length changes are routinely measured with various forms of the piezoelectric crystal 
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sonomicrometer. However, since the ventricular myocardium is a three-dimensional continuum, the local 
strain is only fully defined by all the normal and shear components of the myocardial strain tensor. 
Villarreal et al. [1988] measured two-dimensional midwall strain components by arranging three piezo- 
electric crystals in a small triangle so that three segment lengths could be measured simultaneously. They 
showed that the principal axis of greatest shortening is not aligned with circumferential midwall fibers, 
and that this axis changes with altered ventricular loading and contractility. Therefore, uniaxial segment 
measurements do not reveal the full extent of alterations in regional function caused by an experimental 
intervention. Another approach to measuring regional myocardial strains is the use of clinical imaging 
techniques, such as contrast ventriculography, high-speed X-ray tomography, magnetic resonance imag- 
ing (MRI), or two-dimensional echocardiography. But the conventional application of these techniques 
is not suitable for measuring regional strains because they can not be used to identify the motion of 
distinct myocardial points. They only produce a profile or silhouette of a surface, except in the unusual 
circumstance when radiopaque markers are implanted in the myocardium during cardiac surgery or 
transplantation [Ingels et al., 1975]. Hunter and Zerhouni [1989] describe the prospects for non-invasive 
imaging of discrete points in the ventricular wall. The most promising method is the use of MRI tagging 
methods, which are now being used to map three-dimensional ventricular strain fields in conscious 
subjects [Young and Axel, 1992]. 

In experimental research, implantable radiopaque markers are used for tracking myocardial motions 
with high spatial and temporal resolution. Meier et al. [1980a,b] placed triplets of metal markers 10-15 
mm apart near the epicardium of the canine right ventricle and reconstructed their positions from biplane 
cineradiographic recordings. By polar decomposition, they obtained the two principal epicardial strains, 
the principal angle, and the local rotation in the region. The use of radiopaque markers was extended 
to three dimensions by Waldman and colleagues [Waldman et al., 1985], who implanted three closely 
separated columns of 5 to 6 metal beads in the ventricular wall. With this technique, it is possible to find 
all six components of strain and all three rigid-body rotation angles at sites through the wall. For details 
of this method, see the review by Waldman in Chapter 7 of Glass et al. [1991]. An enhancement to this 
method uses high-order finite element interpolation of the marker positions to compute continuous 
transmural distributions of myocardial deformation [McCulloch and Omens, 1991]. 

Studies and models like these are producing an increasingly detailed picture of regional myocardial 
stress and strain distributions. Of the many interesting observations, there are some useful generalizations, 
particularly regarding the strain. Myocardial deformations are large and three-dimensional, and hence 
the nonlinear finite strain tensors are more appropriate measures than the linear infinitesimal Cauchy 
strain. During filling in the normal heart, the wall stretches biaxially but nonuniformly in the plane of 
the wall, and thins in the transmural direction. During systole, shortening is also two-dimensional and 
the wall thickens. There are substantial regional differences in the time-course, magnitude, and pattern 
of myocardial deformations. In humans and dogs, in-plane systolic myocardial shortening and diastolic 
lengthening vary with longitudinal position on the left and right ventricular free walls generally increasing 
in magnitude from base to apex. 

Both during systole and diastole, there are significant shear strains in the wall. In-plane (torsional) 
shears are negative during diastole, consistent with a small left-handed torsion of the left ventricle during 
filling, and positive as the ventricular twist reverses during ejection. Consequently, the principal axes of 
greatest diastolic segment lengthening and systolic shortening are not circumferential or longitudinal but 
at oblique axes, that are typically rotated 10 to 60° clockwise from circumferential. There are circumfer- 
ential variations in the regional left ventricular strain. The principal axes of greatest diastolic lengthening 
and systolic shortening tend to be more longitudinal on the posterior wall and more circumferentially 
oriented on the anterior wall. Perhaps the most significant regional variations are transmural. In-plane 
and transmural, normal or principal strains, are usually significantly greater in magnitude at the endocar- 
dium than the epicardium both in filling and ejection. However, when the strain is resolved in the local 
muscle fiber direction, the transmural variation of fiber strain becomes insignificant. The combination 
of torsional deformation and the transmural variation in fiber direction means that systolic shortening 
and diastolic lengthening tend to be maximized in the fiber direction at the epicardium and minimized 
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at the endocardium. Hence, whereas maximum shortening and lengthening are closely aligned with 
muscle fibers at the subepicardium, they are almost perpendicular to the fibers at the subendocardium. 
In the left ventricular wall there are also substantial transverse shear strains (i.e., in the circumferential- 
radial and longitudinal-radial planes) during systole, though during filling they are smaller. Their func- 
tional significance remains unclear, though they change substantially during acute myocardial ischemia 
or ventricular pacing and are apparently associated with the transverse laminae described earlier [LeGrice 
etal., 1995]. 

Sophisticated continuum mechanics models are needed to determine the stress distributions associated 
with these complex myocardial deformations. With modern finite element methods it is now possible to 
include in the analysis the three-dimensional geometry and fiber architecture, finite deformations, non- 
linear material properties, and muscle contraction of the ventricular myocardium. Some models have 
included other factors such as viscoelasticity, poroelasticity, coronary perfusion, growth and remodeling, 
regional ischemia, and residual stress. To date, continuum models have provided some valuable insight 
into regional cardiac mechanics. These include the importance of muscle fiber orientation, torsional 
deformations and residual stress, and the substantial inhomogeneities associated with regional variations 
in geometry and fiber angle or myocardial ischemia and infarction. 
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The heart has four valves that control the direction of blood flow through the heart, permitting forward 
flow and preventing back flow. On the right side of the heart, the tricuspid and pulmonic valves regulate 
the flow of blood that is returned from the body to the lungs for oxygenation. The mitral and aortic 
valves control the flow of oxygenated blood from the left side of the heart to the body. The aortic and 
pulmonic valves allow blood to be pumped from the ventricles into arteries on the left and right side of 
the heart, respectively. Similarly, the mitral and tricuspid valves lie between the atria and ventricles of 
the left and right sides of the heart, respectively. The aortic and pulmonic valves open during systole 
when the ventricles are contracting, and close during diastole when the ventricles are filling through the 
open mitral and tricuspid valves. During isovolumic contraction and relaxation, all four valves are closed 
(Fig. 29.1). 

When closed, the pulmonic and tricuspid valves must withstand a pressure of approximately 30 mmHg. 
However, the closing pressures on the left side of the heart are much higher. The aortic valve withstands 
pressures of approximately 100 mmHg, while the mitral valve closes against pressures up to 150 mmHg. 
Since diseases of the valves on the left side of the heart are more prevalent, most of this chapter will focus 
on the aortic and mitral valves. Where pertinent, reference will be made to the pulmonic and tricuspid 
valves. 

29.1 Aortic and Pulmonic Valves 

The aortic valve is composed of three semilunar cusps, or leaflets, contained within a connective tissue 
sleeve. The valve cusps are attached to a fibrous ring embedded in the fibers of the ventricular septum 
and the anterior leaflet of the mitral valve. Each of the leaflets is lined with endothelial cells and has a 
dense collagenous core adjacent to the high pressure aortic side. The side adjacent to the aorta is termed 
the fibrosa and is the major fibrous layer within the belly of the leaflet. The layer covering the ventricular 
side of the valve is called the ventricularis and is composed of both collagen and elastin. The ventricularis 
is thinner than the fibrosa and presents a very smooth surface to the flow of blood [Christie, 1990]. The 
central portion of the valve, called the spongiosa, contains variable loose connective tissue and proteins 
and is normally not vascularized. The collagen fibers within the fibrosa and ventricularis are unorganized 
in the unstressed state. When a stress is applied, they become oriented primarily in the circumferential 
direction with a lower concentration of elastin and collagen in the radial direction [Christie, 1990; 
Thubrikar, 1990]. 
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FIGURE 29. 1 Typical pressure and flow curves for the aortic and mitral valves. 

The fibrous annular ring of the aortic valve separates the aorta from the left ventricle and superior to 
this ring is a structure called the sinus of Valsalva, or aortic sinus. The sinus is comprised of three bulges 
at the root of the aorta, with each bulge aligned with the belly or central part of the specific valve leaflet. 
Each valve cusp and corresponding sinus is named according to its anatomical location within the aorta. 
Two of these sinuses give rise to coronary arteries that branch off the aorta, providing blood flow to the 
heart itself. The right coronary artery is based at the right or right anterior sinus, the left coronary artery 
exits the left or left posterior sinus, and the third sinus is called the non-coronary or right posterior sinus. 
Figure 29.2 shows the configuration of the normal aortic sinuses and valve in the closed position. Because 
the length of the aortic valve cusps is greater than the annular radius, a small overlap of tissue from each 
leaflet protrudes and forms a coaptation surface within the aorta when the valve is closed [Emery and 
Arom, 1991]. This overlapped tissue, called the lunula, may help to ensure that the valve is sealed. When 
the valve is open, the leaflets extend to the upper edge of the sinuses of Valsalva. The anatomy of the 
pulmonic valve is similar to that of the aortic valve, but the surrounding structure is slightly different. 
The main differences are that the sinuses are smaller in the pulmonary artery and the pulmonic valve 
annulus is slightly larger than that of the aortic valve. 

The dimensions of the aortic and pulmonic valves and their leaflets have been measured in a number 
of ways. Before noninvasive measurement techniques such as echocardiography became available, valve 
measurements were recorded from autopsy specimens. An examination of 160 pathologic specimens 
revealed the aortic valve diameter to be 23.2 ± 3.3 mm, whereas the diameter of the pulmonic valve was 
measured at 24.3 ± 3.0 mm [Westaby et al., 1984]. However, according to M-mode echocardiographic 
measurements, the aortic root diameter at end systole was 35 ± 4.2 mm and 33.7 ± 4.4 mm at the end 
of diastole [Gramiak and Shah, 1970]. The differences in these measurements reflect the fact that the 
autopsy measurements were not performed under physiologic pressure conditions and that intrinsic 
differences in the measurement techniques exist. On average, pulmonic leaflets are thinnner than aortic 
leaflets: 0.49 mm vs. 0.67 mm [David et al., 1994], although the leaflets of the aortic valve show variable 
dimensions depending on the respective leaflet. For example, the posterior leaflet tends to be thicker, 
have a larger surface area, and weigh more than the right or left leaflet [Silver and Roberts, 1985; Sahasakul 
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FIGURE 29.2 The aortic sinuses and valve in the closed position. The noncoronary cusp (NCC) is in front. The 
left and right coronary cusps (LCC and RCC) are positioned as marked. The aorta is above the closed valve in this 
orientation and the left ventricle is below the dashed line. 

et al., 1988], and the average width of the right aortic leaflet is greater than that of the other two 
[Vollebergh and Becker, 1977]. 

Mechanical Properties 

Due to the location and critical function of the aortic valve, it is difficult to obtain measurements of its 
mechanical properties in vivo; however, reports are available from a small number of animal studies. This 
section will reference the in vivo data whenever possible and defer to the in vitro data when necessary. 
Since little mathematical modeling of the aortic valve's material properties has been reported, it will be 
sufficient to describe the known mechanical properties of the valve. Like most biological tissues, the 
aortic valve is anisotropic, inhomogeneous, and viscoelastic. The collagen fibers within each valve cusp 
are aligned along the circumferential direction. Vesely and Noseworthy [1992] found that both the 
ventricularis and fibrosa were stiffer in the circumferential direction than in the radial direction. However, 
the ventricularis was more extensible radially than circumferentially, while the fibrosa had uniform 
extensibility in both directions. 

There are also elastin fibers, at a lesser concentration, that are oriented orthogonal to the collagen. It 
is this fiber structure that accounts for the anisotropic properties of the valve. The variation in thickness 
and composition across the leaflets is responsible for their inhomogeneous material properties. Although 
the aortic valve leaflet as a whole is asymmetric in its distensibility, the basal region tends to be relatively 
isotropic while the central region shows the greatest degree of anisotropy [Lo and Vesely, 1995] . The role 
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and morphology of elastin and how elastin is coupled to collagen remain points of investigation. Scott 
and Vesely [1996] have shown that the elastin in the ventricularis consists of continuous amorphous 
sheets or compact meshes, while elastin in the fibrosa consists of complex arrays of large tubes that extend 
circumferentially across the leaflet. These tubes may surround the large circumferential collagen bundles 
in the fibrosa. Mechanical testing of elastin structures from the fibrosa and ventricularis separately have 
shown that the purpose of elastin in the aortic valve leaflet is to maintain a specific collagen fiber 
configuration and return the fibers to that state during cyclic loading [Vesely, 1998]. The valve's viscoelas- 
tic properties are actually dominated by the elastic component (over the range of in vitro testing) so that 
the viscous effects, which are largely responsible for energy losses, are small [Thubrikar, 1990] . In addition 
to the collagen and elastin, clusters of lipids have been observed in the central spongiosa of porcine aortic 
valves. Vesely et al. [1994] have shown that the lipids tend to be concentrated at the base of the valve 
leaflets, while the coaptation regions and free edges of the leaflets tend to be devoid of these lipids. In 
addition, the spatial distribution of the lipids within the spongiosal layer of the aortic leaflets corre- 
sponded to areas in which calcification is commonly observed on bioprosthetic valves suggesting that 
these lipid clusters may be potential nucleation sites for calcification. In contrast, pulmonic leaflets showed 
a substantially lower incidence of lipids [Dunmore-Buyze et al., 1995]. The aortic valve leaflets have also 
been shown to be slightly stiffer than pulmonary valve leaflets, although the extensibilities and relaxation 
rates of the two tissues are similar [Leeson-Dietrich et al., 1995]. 

Using marker fluoroscopy, in which the aortic valve leaflets were surgically tagged with radio-opaque 
markers and imaged with high speed X-rays, the leaflets have been shown to change length during the 
cardiac cycle [Thubrikar, 1990]. The cusps are longer during diastole than systole in both the radial and 
circumferential direction. The variation in length is greatest in the radial direction, approximately 20%, 
while the strain in the circumferential direction is about 10% of the normal systolic length [Lo and Vesely, 
1995]. The difference in strain is due to the presence of the compliant elastin fibers aligned in this radial 
direction. The length change in both directions results in an increased valve surface area during diastole. 
During systole, the shortening of the valve leaflets helps to reduce obstruction of the aorta during the 
systolic ejection of blood. It should be noted that this change in area is by no means an active mechanism 
within the aortic valve; the valve simply reacts to the stresses it encounters in a passive manner. 

In addition to this change in surface area, the aortic valve leaflets also undergo bending in the 
circumferential direction during the cardiac cycle. In diastole when the valve is closed, each leaflet is 
convex toward the ventricular side. During systole when the valve is open, the curvature changes and 
each leaflet is concave toward the ventricle. This bending is localized on the valve cusp near the wall of 
the aorta. This location is often thicker than the rest of the leaflet. The total diastolic stress in a valve 
leaflet has been estimated at 2.5 x 10 6 dynes/cm 2 for a strain of 15% [Thubrikar, 1990] The stress in the 
circumferential direction was found to be the primary load bearing element in the aortic valve. Due to 
the collagen fibers oriented circumferentially, the valve is relatively stiff in this direction. The strain that 
does occur circumferentially is primarily due to scissoring of the fibrous matrix and straightening of 
collagen fibers that are kinked or crimped in the presence of no external stress. However, in the radial 
direction, because elastin is the primary element, the valve can undergo a great deal of strain, ranging 
from 20 to 60% in tissue specimens [Christie, 1990; Lo and Vesely, 1995]. In the closed position, the 
radial stress levels are actually small compared to those in the circumferential direction. This demonstrates 
the importance of the lunula in ensuring that the valve seals tightly to prevent leakage. Because of their 
anatomical location, the lunula cause these high circumferential stress levels by enabling the aortic 
pressure to pull each leaflet in the circumferential direction towards the other leaflets. 

The composition, properties, and dimensions of the aortic valve change with age and in the presence 
of certain diseases. The valve leaflets become thicker, the lunula become fenestrated, or mesh-like, and 
in later stages of disease the central portion of the valve may become calcified [Davies, 1980]. This 
thickening of the valve typically occurs on the ventricular side of the valve, in the region where the tips 
of the leaflets come together. Another site of calcification and fibrosis is the point of maximum cusp 
flexion and is thought to be a response to fatigue in the normal valve tissue. 
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Valve Dynamics 

The aortic valve opens during systole when the ventricle is contracting and then closes during diastole 
as the ventricle relaxes and fills from the atrium. Systole lasts about one third of the cardiac cycle and 
begins when the aortic valve opens, which typically takes only 20 to 30 msec [Bellhouse, 1969]. Blood 
rapidly accelerates through the valve and reaches a peak velocity after the leaflets have opened to their 
full extent and start to close again. Peak velocity is reached during the first third of systole and the flow 
begins to decelerate rapidly after the peak is reached, albeit not as fast as its initial acceleration. The 
adverse pressure gradient that is developed affects the low momentum fluid near the wall of the aorta 
more than that at the center; this causes reverse flow in the sinus region [Reul and Talukdar, 1979]. 
Figure 29.2 illustrates the pressure and flow relations across the aortic valve during the cardiac cycle. 
During systole, the pressure difference required to drive the blood through the aortic valve is on the 
order of a few millimeters of mercury; however, the diastolic pressure difference reaches 80 mmHg in 
normal individuals. The valve closes near the end of the deceleration phase of systole with very little 
reverse flow through the valve. 

During the cardiac cycle the heart undergoes translation and rotation due to its own contraction 
pattern. As a result, the base of the aortic valve varies in size and also translates, mainly along the axis 
of the aorta. Using marker fluoroscopy to study the base of the aortic valve in dogs, Thubrikar et al. 
[1993] found that the base perimeter is at its largest at end diastole and decreases in size during systole; 
it then reaches a minimum at the end of systole and increases again during diastole. The range of this 
perimeter variation during the cardiac cycle was 22% for an aortic pressure variation of 120/80 mmHg. 
The valve annulus also undergoes translation, primarily parallel to the aortic axis. The aortic annulus 
moves downward toward the ventricle during systole and then recoils back toward the aorta as the 
ventricle fills during diastole. The annulus also experiences a slight side-to-side translation with its 
magnitude approximately one half the displacement along the aortic axis. 

During systole, vortices develop in all three sinuses behind the leaflets of the aortic valve. The function 
of these vortices was first described by Leonardo da Vinci in 1513, and they have been researched 
extensively in this century primarily through the use of in vitro models [Bellhouse, 1969; Reul and 
Talukdar, 1979]. It has been hypothesized that the vortices help to close the aortic valve so that blood is 
prevented from returning to the ventricle during the closing process. These vortices create a transverse 
pressure difference that pushes the leaflets toward the center of the aorta and each other at the end of 
systole, thus minimizing any possible closing volume. However, as shown in vitro by Reul and Talukdar 
[1979], the axial pressure difference alone is enough to close the valve. Without the vortices in the sinuses, 
the valve still closes but its closure is not as quick as when the vortices are present. The adverse axial 
pressure difference within the aorta causes the low inertia flow within the developing boundary layer 
along the aortic wall to decelerate first and to reverse direction. This action forces the belly of the leaflets 
away from the aortic wall and toward the closed position. When this force is coupled with the vortices 
that push the leaflet tips toward the closed position, a very efficient and fast closure is obtained. Closing 
volumes have been estimated to be less than 5% of the forward flow [Bellhouse and Bellhouse, 1969]. 

The parameters that describe the normal blood flow through the aortic valve are the velocity profile, 
time course of the blood velocity or flow, and magnitude of the peak velocity. These are determined in 
part by the pressure difference between the ventricle and aorta and by the geometry of the aortic valve 
complex. As seen in Fig. 29.3, the velocity profile at the level of the aortic valve annulus is relatively flat. 
However there is usually a slight skew toward the septal wall (less than 10% of the center-line velocity) 
which is caused by the orientation of the aortic valve relative to the long axis of the left ventricle. This 
skew in the velocity profile has been shown by many experimental techniques, including hot film 
anemometry, Doppler ultrasound, and MRI [Paulsen and Hasenkam, 1983; Rossvol et al., 1991; Kilner 
et al., 1993]. In healthy individuals, blood flows through the aortic valve at the beginning of systole and 
then rapidly accelerates to its peak value of 1.35 ± 0.35 m/s; for children this value is slightly higher at 
1.5 ± 0.3 m/s [Hatle and Angelson, 1985]. At the end of systole there is a very short period of reverse 
flow that can be measured with Doppler ultrasound. This reverse flow is probably either a small closing 
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FIGURE 29.3 Velocity profiles measured 2 cm downstream of the aortic valve with hot film anemometry in dogs 
[Paulsen and Hasenkam, 1983]. The timing of the measurements during the cardiac cycle is shown by the marker 
on the aortic flow curve. 

volume or the velocity of the valve leaflets as they move toward their closed position. The flow patterns 
just downstream of the aortic valve are of particular interest because of their complexity and relation to 
arterial disease. Highly skewed velocity profiles and corresponding helical flow patterns have been 
observed in the human aortic arch using magnetic resonance phase velocity mapping [Kilner et al., 1993] . 
The pulmonic valve flow behaves similarly to that of the aortic valve, but the magnitude of the velocity 
is not as great. Typical peak velocities for healthy adults are 0.75 ± 0. 1 5 m/s; again these values are slightly 
higher for children at 0.9 ± 0.2 m/s [Weyman, 1994] . As seen in Fig. 29.4, a rotation of the peak velocity 
can be observed in the pulmonary artery velocity profile. During acceleration, the peak velocity is 
observed inferiorly with the peak rotating counterclockwise throughout the remainder of the ejection 
phase [Sloth et al., 1994]. The mean spatial profile is relatively flat, however, although there is a region 
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FIGURE 29.4 Velocity profiles downstream of the human pulmonary valve obtained with magnetic resonance phase 
velocity mapping [Sloth, 1994]. Again the timing of the measurements is shown by the marker on the flow curve. 

of reverse flow that occurs in late systole which may be representative of flow separation. Typically, there 
is only a slight skew to the profile. The peak velocity is generally within 20% of the spatial mean 
throughout the cardiac cycle. Secondary flow patterns can also be observed in the pulmonary artery and 
its bifurcation. In vitro laser Doppler anemometry experiments have shown that these flow patterns are 
dependent on the valve geometry and thus can be used to evaluate function and fitness of the heart valve 
[Sung and Yoganathan, 1990]. 



29.2 Mitral and Tricuspid Valves 



The mitral (Fig. 29.5) and tricuspid valves are similar in structure with both valves composed of four 
primary elements: (1) the valve annulus, (2) the valve leaflets, (3) the papillary muscles, and (4) the 
chordae tendineae. The base of the mitral leaflets form the mitral annulus, which attaches to the atrial 
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FIGURE 29.5 Schematic of the mitral valve showing the valve leaflets, papillary muscles, and chordae tendineae. 

and ventricular walls, and aortic root. At the free edge of the leaflets, the chordae tendinae insert at 
multiple locations and extend to the tips of the papillary muscles. This arrangement provides continuity 
between the valve and ventricular wall to enhance valvular function. The valvular apparatus, or complex, 
requires an intricate interplay between all components throughout the cardiac cycle. 

The mitral annulus is an elliptical ring composed of dense collagenous tissue surrounded by muscle. 
It goes through dynamic changes during the cardiac cycle by not only changing in size, but also by moving 
three-dimensionally. The circumference of the mitral annulus ranges from 8 to 12 cm during diastole. 
Recent studies involving the measurement of annular shape have also shown that the mitral annulus is 
not planar, but instead has a three-dimensional form. The annulus actually forms a saddle, or ski-slope 
shape [Glasson et al., 1996; Pai et al., 1995; Komoda et al., 1994; Levine et al., 1987]. This three-dimen- 
sional shape must be taken into account when non-invasively evaluating valvular function. 

The mitral valve is a bileaflet valve comprised of an anterior and posterior leaflet. The leaflet tissue is 
primarily collagen-reinforced endothelium, but also contains striated muscle cells, non-myelinated nerve 
fibers, and blood vessels. The anterior and posterior leaflets of the valve are actually one continuous piece 
of tissue, as shown in Fig. 29.6. The free edge of this tissue shows several indentations of which two are 
regularly placed, called the commisures. The commisures separate the tissue into the anterior and 
posterior leaflets. The location of the commisures can be identified by the fan-like distribution of chordae 
tendinae and the relative positioning of the papillary muscles. The combined surface area of both leaflets 
is approximately twice the area of the mitral orifice; this extra surface area permits a large line of 



Anterior leaflet 




Posterior leaflet 




"N. Posteromedial 
\ commisure 




Anterolateral 
commisure 


\ AtfV Medial /\ 


Lateral 
V Central i(ttMiV? callo P / / 
\ scaUop ./wmW 




Chordae 1 
tendineae { 




Chordae 1 1 
tendineaeL— 1 




Posteromedial 


Anterolateral 




papill 
must 


ary 
;le 


papillary 
muscle 





FIGURE 29.6 Diagram of the mitral valve as a continuous piece of tissue. The posterior and anterior leaflets are 
indicated, as are the scallops, chordae tendineae, and papillary muscles. 
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coaptation and ample coverage of the mitral orifice during normal function and provides compensation 
in cases of disease [He, 1997, 1999]. The posterior leaflet encircles roughly two-thirds of the mitral 
annulus and is essentially an extension of the mural endocardium from the free walls of the left atrium. 
The anterior leaflet portion of the annulus is a line of connection for the leaflet, the wall of the ascending 
aorta, the aortic valve, and the atrial septum. The anterior leaflet is slightly larger than the posterior 
leaflet, and is roughly semi-lunar in shape as opposed to the quadrangular shaped posterior leaflet. The 
normal width and height of the anterior leaflet is approximately 3.3 cm and 2.3 cm, respectively. The 
height of the posterior leaflet is 1.3 cm, while the commisure height is less than 1.0 cm. The posterior 
leaflet typically has indentations, called scallops, that divide the leaflet into three regions: (1) the medial, 
(2) central, and (3) lateral scallop [Kunzelman et al„ 1994; Barlow, 1987; Roberts, 1983; Raganathan 
et al., 1970; Silverman and Hurst, 1968]. 

The mitral leaflet tissue can be divided into both a rough and clear zone. The rough zone is the thicker 
part of the leaflet and is defined from the free edge of the valve to the valve's line of closure. The term 
"rough" is used to denote the texture of the leaflet due to the insertion of the chordae tendineae in this 
area. The clear zone is thinner and translucent and extends from the line of closure to the annulus in 
the anterior leaflet and to the basal zone in the posterior leaflet. Unlike the mitral valve, the tricuspid 
valve has three leaflets: (1) an anterior leaflet, (2) a posterior leaflet with a variable number of scallops, 
and (3) a septal leaflet. The tricuspid valve is larger and structurally more complicated than the mitral 
valve and the separation of the valve tissue into distinct leaflets is less pronounced than with the mitral 
valve. The surface of the leaflets is similar to that of the mitral valve; however, the basal zone is present 
in all of the leaflets [Silver et al., 1971]. 

Chordae tendineae from both leaflets attach to each of the papillary muscles. The chordae tendineae 
consist of an inner core of collagen surrounded by loosely meshed elastin and collagen fibers with an 
outer layer of endothelial cells. In the mitral complex structure, there are marginal and basal chordae 
that insert into the mitral leaflets. From each papillary muscle, several chordae originate and branch into 
the marginal and basal chordae. The thinner marginal chordae insert into the leaflet free edge at multiple 
insertion points, while the thicker basal chordae insert into the leaflets at a higher level towards the 
annulus. The marginal chordae function to keep the leaflets stationary while the basal chordae seem to 
act more as supports [Kunzelman, 1994]. 

The left side of the heart has two papillary muscles, called anterolateral and posteromedial, that attach 
to the ventricular free wall and tether the mitral valve in place via the chordae tendinae. This tethering 
prevents the mitral valve from prolapsing into the atrium during ventricular ejection. On the right side 
of the heart, the tricuspid valve has three papillary muscles. The largest one, the anterior papillary muscle, 
attaches to the valve at the commissure between the anterior and posterior leaflets. The posterior papillary 
muscle is located between the posterior and septal leaflets. The smallest papillary muscle, called the septal 
muscle, is sometimes not even present. Improper tethering of the leaflets will result in valve prolapse 
during ventricular contraction, permitting the valve leaflets to extend into the atrium. This incomplete 
apposition of the valve leaflets can cause regurgitation, which is leaking of the blood being ejected back 
into the atrium. 

Mechanical Properties 

Studies on the mechanical behavior of the mitral leaflet tissue have been conducted to determine the key 
connective tissue components which influence the valve function. Histological studies have shown that 
the tissue is composed of three layers which can be identified by differences in cellularity and collagen 
density. Analysis of the leaflets under tension indicated that the anterior leaflet would be more capable 
of supporting larger tensile loads than the posterior leaflet. The differences between the mechanical 
properties between the two leaflets may require different material selection for repair or replacement of 
the individual leaflets [Kunzelman et al., 1993a, b]. 

Studies have also been done on the strength of the chordae tendinae. The tension of chordae tendineae 
in dogs was monitored throughout the cardiac cycle by Salisbury and co-workers [1963]. They found 
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that the tension only paralleled the left ventricular pressure tracings during isovolumic contraction, 
indicating slackness at other times in the cycle. Investigation of the tensile properties of the chordae 
tendineae at different strain rates by Lim and Bouchner [1975] found that the chordae had a non-linear 
stress-strain relationship. They found that the size of the chordae had a more significant effect on the 
development of the tension than did the strain rate. The smaller chordae with a cross-sectional area of 
0.001 to 0.006 cm 2 had a modulus of 2 x 10 9 dynes/cm 2 , while larger chordae with a cross-sectional area 
of 0.006 to 0.03 cm 2 had a modulus of 1 x 10 9 dynes/cm 2 . 

A theoretical study of the stresses sustained by the mitral valve was performed by Ghista and Rao 
[1972]. This study determined that the stress level can reach as high as 2.2 x 10 6 dynes/cm 2 just prior to 
the opening of the aortic valve, with the left ventricular pressure rising to 150 mmHg. A mathematical 
model has also been created for the mechanics of the mitral valve. It incorporates the relationship between 
chordae tendineae tension, left ventricular pressure, and mitral valve geometry [Arts et al., 1983]. This 
study examined the force balance on a closed valve, and determined that the chordae tendinae force was 
always more than half the force exerted on the mitral valve orifice by the transmitral pressure gradient. 
During the past 10 years, computational models of mitral valve mechanics have been developed, with 
the most advanced modeling being three-dimensional finite element models (FEM) of the complete 
mitral apparatus. Kunzelman and co-workers [1993, 1998] have developed a model of the mitral complex 
that includes the mitral leaflets, chordae tendinae, contracting annulus, and contracting papillary muscles. 
From these studies, the maximum principal stresses found at peak loading (120 mmHg) were 5.7 x 
10 6 dynes/cm 2 in the annular region, while the stresses in the anterior leaflet ranged from 2 x 10 6 to 4 x 
10 6 dynes/cm 2 . This model has also been used to evaluate mitral valve disease, repair in chordal rupture, 
and valvular annuloplasty. 

Valve Dynamics 

The valve leaflets, chordae tendineae, and papillary muscles all participate to ensure normal functioning 
of the mitral valve. During isovolumic relaxation, the pressure in the left atrium exceeds that of the left 
ventricle, and the mitral valve cusps open. Blood flows through the open valve from the left atrium to 
the left ventricle during diastole. The velocity profiles at both the annulus and the mitral valve tips have 
been shown to be skewed [Kim et al., 1994] and therefore are not flat as is commonly assumed. This 
skewing of the inflow profile is shown in Fig. 29.7. The initial filling is enhanced by the active relaxation 
of the ventricle, maintaining a positive transmitral pressure. The mitral velocity flow curve shows a peak 
in the flow curve, called the E-wave, which occurs during the early filling phase. Normal peak E-wave 
velocities in healthy individuals range from 50 to 80 cm/s [Samstad et al., 1989; Oh et al., 1997] . Following 
active ventricular relaxation, the fluid begins to decelerate and the mitral valve undergoes partial closure. 
Then the atrium contracts and the blood accelerates through the valve again to a secondary peak, termed 
the A-wave. The atrium contraction plays an important role in additional filling of the ventricle during 
late diastole. In healthy individuals, velocities during the A-wave are typically lower than those of the 
E-wave, with a normal E/A velocity ratio ranging from 1.5 to 1.7 [Oh et al., 1997]. Thus, normal diastolic 
filling of the left ventricle shows two distinct peaks in the flow curve with no flow leaking back through 
the valve during systole. 

The tricuspid flow profile is similar to that of the mitral valve, although the velocities in the tricuspid 
valve are lower because it has a larger valve orifice. In addition, the timing of the valve opening is slightly 
different. Since the peak pressure in the right ventricle is less than that of the left ventricle, the time for 
right ventricular pressure to fall below the right atrial pressure is less than the corresponding time period 
for the left side of the heart. This leads to a shorter right ventricular isovolumic relaxation and thus an 
earlier tricuspid opening. Tricuspid closure occurs after the mitral valve closes since the activation of the 
left ventricle precedes that of the right ventricle [Weyman, 1994]. 

A primary focus in explaining the fluid mechanics of mitral valve function has been understanding 
the closing mechanism of the valve. Bellhouse [1972] first suggested that the vortices generated by 
ventricular filling were important for the partial closure of the mitral valve following early diastole. Their 
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FIGURE 29.7 Two-dimensional transmitral velocity profiles recorded at the level of the mitral annulus in a pig 
[Kim et al., 1994]. (a) systole; (b) peak E-wave; (c) deceleration phase of early diastole; (d) mid-diastolic period 
(diastasis); (e) peak A-wave; (f) time averaged diastolic cross-sectional mitral velocity profile. Reprinted with per- 
mission from the American College of Cardiology (J Am Coll Cardiol 24: 532-545). 

in vitro experiments suggested that without the strong outflow tract vortices, the valve would remain 
open at the onset of ventricular contraction, thus resulting in a significant amount of mitral regurgitation 
before complete closure. Later in vitro experiments by Reul and Talukdar [1981] in a left ventricle model 
made from silicone suggested that an adverse pressure differential in mid-diastole could explain both the 
flow deceleration and the partial valve closure, even in the absence of a ventricular vortex. Thus, the 
studies by Reul and Talukdar suggest that the vortices may provide additional closing effects at the initial 
stage; however, the pressure forces are the dominant effect in valve closure. A more unified theory of 
valve closure put forth by Yellin and co-workers [1981] includes the importance of chordal tension, flow 
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FIGURE 29.8 Magnetic resonance image of a healthy individual during diastole. An outline of the interior left 
ventricle (LV) is indicated in white as are the mitral valve leaflets (MV) and the aorta (AO). Velocity vectors were 
obtained from MRI phase velocity mapping and superimposed on the anatomical image. 

deceleration, and ventricular vortices, with chordal tension being a necessary condition for the other 
two. Their animal studies indicated that competent valve closure can occur even in the absence of vortices 
and flow deceleration. Recent studies using magnetic resonance imaging to visualize the three-dimen- 
sional flow field in the left ventricle showed that in normal individuals a large anterior vortex is present 
at initial partial closure of the valve, as well as following atrial contraction [Kim et al., 1995]. Studies 
conducted in our laboratory using magnetic resonance imaging of healthy individuals clearly show the 
vortices in the left ventricle [Walker et al., 1 996] , which may be an indication of normal diastolic function. 
An example of these vortices is presented in Fig. 29.8. 

Another area of interest has been the motion of the mitral valve complex. The heart moves throughout 
the cardiac cycle; similarly, the mitral apparatus moves and changes shape. Recent studies have been 
conducted which examined the three-dimensional dynamics of the mitral annulus during the cardiac 
cycle. [Pai et al, 1995; Glasson et al, 1996; Komoda et al., 1994; Ormiston et al, 1981]. These studies 
have shown that during systole the annular circumference decreases from the diastolic value due to the 
contraction of the ventricle, and this reduction in area ranges from 10 to 25%. This result agrees with 
an animal study of Tsakiris and co-workers [1971] that looked at the difference in the size, shape, and 
position of the mitral annulus at different stages in the cardiac cycle. They noted an eccentric narrowing 
of the annulus during both atrial and ventricular contractions that reduced the mitral valve area by 10 to 
36% from its peak diastolic area. This reduction in the annular area during systole is significant, resulting 
in a smaller orifice area for the larger leaflet area to cover. Not only does the annulus change size, but it 
also translates during the cardiac cycle. The movement of the annulus towards the atrium has been 
suggested to play a role in ventricular filling, possibly increasing the efficiency of blood flow into the 
ventricle. During ventricular contraction, there is a shortening of the left ventricular chamber along its 
longitudinal axis, and the mitral and tricuspid annuli move towards the apex [Pai et al., 1995; Alam and 
Rosenhamer, 1992; Hammarstrom et al., 1991; Simonson and Schiller, 1989]. 

The movement of the papillary muscles is also important in maintaining proper mitral valve function. 
The papillary muscles play an important role in keeping the mitral valve in position during ventricular 
contraction. Abnormal strain on the papillary muscles could cause the chordae to rupture, resulting in 
mitral regurgitation. It is necessary for the papillary muscles to contract and shorten during systole to 
prevent mitral prolapse; therefore, the distance between the apex of the heart to the mitral apparatus is 
important. The distance from the papillary muscle tips to the annulus was measured in normal individuals 
during systole and was found to remain constant [Sanfilippo et al., 1992]. In patients with mitral valve 
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prolapse, however, this distance decreased, corresponding to a superior displacement of the papillary 
muscle towards the annulus. 

The normal function of the mitral valve requires a balanced interplay between all of the components 
of the mitral apparatus, as well as the interaction of the atrium and ventricle. Engineering studies into 
mitral valve function have provided some insight into its mechanical properties and function. Further 
fundamental and detailed studies are needed to aid surgeons in repairing the diseased mitral valve and 
in understanding the changes in function due to mitral valve pathologies. In addition, these studies are 
crucial for improving the design of prosthetic valves that more closely replicate native valve function. 
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The arterial circulation is a multiply branched network of compliant tubes. The geometry of the network 
is complex, and the vessels exhibit nonlinear viscoelastic behavior. Flow is pulsatile, and the blood flowing 
through the network is a suspension of red blood cells and other particles in plasma which exhibits 
complex non-Newtonian properties. Whereas the development of an exact biomechanical description of 
arterial hemodynamics is a formidable task, surprisingly useful results can be obtained with greatly 
simplified models. 

The geometrical parameters of the canine systemic and pulmonary circulations are summarized in 
Table 30.1. Vessel diameters vary from a maximum of 19 mm in the proximal aorta to 0.008 mm 
(8 microns) in the capillaries. Because of the multiple branching, the total cross-sectional area increases 
from 2.8 cm 2 in the proximal aorta to 1357 cm 2 in the capillaries. Of the total blood volume, approximately 
83% is in the systemic circulation, 12% is in the pulmonary circulation, and the remaining 5% is in the 
heart. Most of the systemic blood is in the venous circulation, where changes in compliance are used to 
control mean circulatory blood pressure. This chapter will be concerned with flow in the larger arteries, 
classes 1-5 in the systemic circulation and 1-3 in the pulmonary circulation in Table 30.1. Flow in the 
microcirculation is discussed in Chapter 31, and venous hemodynamics is covered in Chapter 33. 

30.1 Blood Vessel Walls 

The detailed properties of blood vessels were described earlier in this section, but a few general obser- 
vations are made here to facilitate the following discussion. Blood vessels are composed of three layers, 
the intima, media, and adventitia. The inner layer, or intima, is composed primarily of endothelial cells, 
which line the vessel and are involved in control of vessel diameter. The media, composed of elastin, 
collagen, and smooth muscle, largely determines the elastic properties of the vessel. The outer layer, or 
adventitia, is composed mainly of connective tissue. Unlike in structures composed of passive elastic 
materials, vessel diameter and elastic modulus vary with smooth-muscle tone. Dilation in response to 
increases in flow and myogenic constriction in response to increases in pressure have been observed in 



© 2000 by CRC Press LLC 



TABLE 30.1 Model ofVascular Dimensions in 20-kg dog 



Class 



Vessels 



Total Total 

Mean Number Mean Cross- Blood Percentage 

Diam. of Length section Volume of Total 

(mm) Vessels (mm) (cm 2 ) (ml) Volume 



1 
2 
3 
4 

5 

6 

7 

8 

9 

10 

11 

12 

13 



Aorta 

Arteries 

Arteries 

Arteries 

Arteries 

Arterioles 

Capillaries 

Venules 

Veins 

Veins 

Veins 

Veins 

Venae cavae 



(19-4.5) 
4.000 
1.300 
0.450 
0.150 
0.050 
0.008 
0.100 
0.280 
0.700 
1.800 
4.500 
(5-14) 



Systemic 

1 
40 
500 
6000 
110,000 
2.8 x 10" 
2.7 x 10" 
1.0 x 10 7 
660,000 
40,000 
2,100 
110 
2 



150.0 
45.0 

13.5 
4.0 
1.2 

0.65 
1.6 
4.8 

13.5 

45.0 
150.0 



(2.8-0.2) 

5.0 

6.6 

9.5 

19.4 

55.0 

1357.0 

785.4 

406.4 

154.0 

53.4 

17.5 

(0.2-1.5) 



Total 



60 

75 

30 

13 

8 

7 

88 

126 

196 

208 

240 

263 

92 

1406 



11% 



5% 



67% 



Main artery 

Arteries 

Arteries 

Arterioles 

Capillaries 

Venules 

Veins 

Veins 

Main veins 



1.600 
4.000 
1.000 
0.100 
0.008 
0.110 
1.100 
4.200 
8.000 



Pulmonary 

1 
20 
1550 
1.5 x 10" 
2.7 x 10* 
2.0 x 10" 
1650 
25 
4 



28.0 

10.0 

14.0 

0.7 

0.5 

0.7 

14.0 

100.0 

30.0 



2.0 

2.5 

12.2 

120.0 

1357.0 

190.0 

15.7 



Total 



6 

25 
17 



13 

22 

35 

6 

200 



3% 



4% 



5% 



Total 

Total circulation 



Atria 
Ventricles 



Heart 

2 
2 



30 

54 

84 

1690 



5% 
100% 



Source: Milnor WR. 1989. Hemodynamics, 2d ed, p 45. Baltimore, Williams and Wilkins. With permission. 

some arteries. Smooth-muscle tone is also affected by circulating vasoconstrictors such as norepinephrine 
and vasodilators such as nitroprusside. Blood vessels, like other soft biological tissues, generally do not 
obey Hooke's law, becoming stiffer as pressure is increased. They also exhibit viscoelastic characteristics 
such as hysteresis and creep. Fortunately, for many purposes a linear elastic model of blood vessel behavior 
provides adequate results. 

30.2 Flow Characteristics 

Blood is a complex substance containing water, inorganic ions, proteins, and cells. Approximately 50% 
is plasma, a nearly Newtonian fluid consisting of water, ions, and proteins. The balance contains eryth- 
rocytes (red blood cells), leukocytes (white blood cells), and platelets. Whereas the behavior of blood in 
vessels smaller than approximately 100 um exhibits significant non-Newtonian effects, flow in larger 
vessels can be described reasonably accurately using the Newtonian assumption. There is some evidence 
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TABLE 30.2 Normal Average Hemodynamics Values in Man and Dog 







Dog (20 kg) 






Man (70 kg, 


1.8 m 2 ) 




N w 


Velocity (cm/s) 


N s 


K 


Velocity (cm/s) 


N R 


Systemic Vessels 














Ascending aorta 


16 


15.8 (89/0)* 


870 (4900)* 


21 


18 (112/0)* 


1500 (9400)* 


Abdominal aorta 


9 


12 (60.0) 


370 (1870) 


12 


14 (75/0) 


640 (3600) 


Renal artery 


3 


41 (74/26) 


440 (800) 


4 


40 (73/26) 


700 (1300) 


Femoral artery 


4 


10 (42/1) 


130 (580) 


4 


12 (52/2) 


200 (860) 


Femoral vein 


5 


5 


92 


7 


4 


104 


Superior vena cava 


10 


8 (20/0) 


320 (790) 


15 


9 (23/0) 


550 (1400) 


Inferior vena cava 


11 


19 (40/0) 


800 (1800) 


17 


21 (46/0) 


1400 (3000) 


Pulmonary vessels 














Main artery 


14 


18 (72/0) 


900 (3700) 


20 


19 (96/0) 


1600 (7800) 


Main vein:): 


7 


18 (30/9) 


270 (800) 


10 


19 (38/10) 


800 (2200) 



* Mean (systolic/diastolic) 

f Mean (peak) 

' One of the usually four terminal pulmonary veins 

Source: Milnor WR. 1989. Hemodynamics, 2d ed, p 148, Baltimore, Williams and Wilkins. With permission. 



suggesting that in blood analog fluids wall shear stress distributions may differ somewhat from Newtonian 
values [Liepsch et al., 1991]. 

Flow in the arterial circulation is predominantly laminar with the possible exception of the proximal 
aorta and main pulmonary artery. In steady flow, transition to turbulence occurs at Reynolds numbers 
(N R ) above approximately 2300 



N. 



2rV 
v 



(30.1) 



where r = vessel radius, V = velocity, v = kinematic viscosity = viscosity/density. 

Flow in the major systemic and pulmonary arteries is highly pulsatile. Peak-to-mean flow amplitudes 
as high as 6 to 1 have been reported in both human and dog [Milnor, 1989, p. 149]. Womersley's analysis 
of incompressible flow in rigid and elastic tubes [1957] showed that the importance of pulsatility in the 
velocity distributions depended on the parameter 



N 



CO 

V v 



(30.2) 



where co = frequency. 

This is usually referred to as the Womersley number (N w ) or a-pammeter. Womersley's original report is 
not readily available; however, Milnor provides a reasonably complete account [Milnor, 1989, pp. 106-121]. 

Mean and peak Reynolds numbers in human and dog are given in Table 30.2, which also includes mean, 
peak, and minimum velocities as well as the Womersley number. Mean Reynolds numbers in the entire 
systemic and pulmonary circulations are below 2300. Peak systolic Reynolds numbers exceed 2300 in the 
aorta and pulmonary artery, and some evidence of transition to turbulence has been reported. In dogs, 
distributed flow occurs at Reynolds numbers as low as 1000, with higher Womersley numbers increasing 
the transition Reynolds number [Nerem & Seed, 1972]. The values in Table 30.2 are typical for individuals 
at rest. During exercise, cardiac output and hence Reynolds numbers can increase severalfold. The Wom- 
ersley number also affects the shape of the instantaneous velocity profiles as discussed below. 
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TABLE 30.3 Pressure Wave Velocities in Arteries* 







Wave 


Artery 


Species 


Velocity (cm/s) 


Ascending Aorta 


Man 


440-520 




Dog 


350-472 


Thoracic aorta 


Man 


400-650 




Dog 


400-700 


Abdominal aorta 


Man 


500-620 




Dog 


550-960 


Iliac 


Man 


700-880 




Dog 


700-800 


Femoral 


Man 


800-1800 




Dog 


800-1300 


Popliteal 


Dog 


1220-1310 


Tibial 


Dog 


1040-1430 


Carotid 


Man 


680-830 




Dog 


610-1240 


Pulmonary 


Man 


168-182 




Dog 


255-275 




Rabbit 


100 




Pig 


190 



* All data are apparent pressure wave velocities (although 
the average of higher frequency harmonics approximates 
the true velocity in many cases), from relatively young 
subjects with normal cardiovascular systems, at approxi- 
mately normal distending pressures. 

f Ranges for each vessel and species taken from Table 9. 1 
of source. 

Source: Milnor WR. 1989. Hemodynamics, 2d ed, 
p 235, Baltimore, Williams and Wilkins. With permission. 

30.3 Wave Propagation 

The viscoelasticity of blood vessels affects the hemodynamics of arterial flow. The primary function of 
arterial elasticity is to store blood during systole so that forward flow continues when the aortic valve is 
closed. Elasticity also causes a finite wave propagation velocity, which is given approximately by the 
Moens-Korteweg relationship 



I Eh 
"\J2pr 



(30.3) 



where E = wall elastic modulus, h = wall thickness, p = blood density, r = vessel radius. 

Although Moens [1878] and Korteweg [1878] are credited with this formulation, Fung [1984, p. 107] 
has pointed out that the formula was first derived much earlier [Young, 1808]. Wave speeds in arterial 
blood vessels from several species are given in Table 30.3. In general, wave speeds increase toward the 
periphery as vessel radius decreases and are considerably lower in the main pulmonary artery than in 
the aorta owing primarily to the lower pressure and consequently lower elastic modulus. 

Wave reflections occur at branches where there is not perfect impedance matching of parent and 
daughter vessels. The input impedance of a network of vessels is the ratio of pressure to flow. For rigid 
vessels with laminar flow and negligible inertial effects, the input impedance is simply the resistance and 
is independent of pressure and flow rate. For elastic vessels, the impedance is dependent on the frequency 
of the fluctuations in pressure and flow. The impedance can be described by a complex function expressing 
the amplitude ratio of pressure to flow oscillations and the phase difference between the peaks. 
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Z(co) 
Z(co) 



Q(co) 
P(co) 



Q(co) 

e i (co)=e (p(co)) -e (q(co 



(30.4) 



where Z { is the complex impedance, \Z\ is the amplitude, and 0, is the phase. 

For an infinitely long straight tube with constant properties, input impedance will be independent of 
position in the tube and dependent only on vessel and fluid properties. The corresponding value of input 
impedance is called the characteristic impedance Z given by 



pc 

A 



(30.5) 



where A = vessel cross-sectional area. 

In general, the input impedance will vary from point to point in the network because of variations in 
vessel sizes and properties. If the network has the same impedance at each point (perfect impedance 
matching), there will be no wave reflections. Such a network will transmit energy most efficiently. The 
reflection coefficient _R, defined as the ratio of reflected to incident wave amplitude, is related to the 
relative characteristic impedance of the vessels at a junction. For a parent tube with characteristic 
impedance Z branching into two daughter tubes with characteristic impedances Z l and Z 2 , the reflection 
coefficient is given by 
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and perfect impedance matching requires 
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The arterial circulation exhibits partial impedance matching; however, wave reflections do occur. At each 
branch point, local reflection coefficients typically are less than 0.2. Nonetheless, global reflection coef- 
ficients, which account for all reflections distal to a given site, can be considerably higher [Milnor, 1989, 
p217]. 

In the absence of wave reflections, the input impedance is equal to the characteristic impedance. 
Womersley's analysis predicts that impedance modulus will decrease monotonically with increasing 
frequency, whereas the phase angle is negative at low frequency and becomes progressively more positive 
with increasing frequency. Typical values calculated from Womersley's analysis are shown in Fig. 30.1. In 
the actual circulation, wave reflections cause oscillations in the modulus and phase. Figure 30.2 shows 
input impedance measured in the ascending aorta of a human. Measurements of input resistance, 
characteristic impedance, and the frequency of the first minimum in the input impedance are summarized 
in Table 30.4. 
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4 6 

Womersley Number 

FIGURE 30.1 Characteristic impedance calculated from Womersley 's analysis. The top panel contains the phase of 
the impedance and the bottom panel the modulus, both plotted as a function of the Womersley number N w , which 
is proportional to frequency. The curves shown are for an unconstrained tube and include the effects of wall viscosity. 
The original figure has an error in the scale of the ordinate which has been corrected. Source: Milnor WR. 1989. 
Hemodynamics, 2d ed, p 172, Baltimore, Williams and Wilkins. With permission. 



30.4 Velocity Profiles 



Typical pressure and velocity fluctuations throughout the cardiac cycle in man are shown in Fig. 30.3. 
Although mean pressure decreases slightly toward the periphery due to viscous effects, peak pressure 
shows small increases in the distal aorta due to wave reflection and vessel taper. A rough estimate of 
mean pressure can be obtained as 1/3 of the sum of systolic pressure and twice the diastolic pressure. 
Velocity peaks during systole, with some backflow observed in the aorta early in diastole. Flow in the 
aorta is nearly zero through most of the diastole; however, more peripheral arteries such as the iliac and 
renal show forward flow throughout the cardiac cycle. This is a result of capacitive discharge of the central 
arteries as arterial pressure decreases. 
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FIGUR 30.2 Input impedance derived from the pressure and velocity data in the ascending aorta of Fig. 30.3. The 
top panel contains the modulus and the bottom panel the phase, both plotted as functions of frequency. The peripheral 
resistance (DC impedance) for this plot was 16470 dyne s/cm 5 . Source: Mills CJ, Gabe IT, Gault JN, et al. 1970. 
Pressure-flow relationships and vascular impedance in man. Cardiovasc Res 4:405. With permission. 



TABLE 30.4 Characteristic Arterial Impedances in Some Mammals: 
Average (±SE)*' f 



Species 


Artery 


K 


z a 


U 


Dog 


Aorta 


2809-6830 


125-288 


6-8 


Dog 


Pulmonary 


536-807 


132-295 


2-3.5 


Dog 


Femoral 


110-162* 


4.5-15.8* 


8-13 


Dog 


Carotid 


69 ! 


7.0-9.4* 


8-11 


Rabbit 


Aorta 


20-50' 


1.8-2.1* 


4.5-9.8 


Rabbit 


Pulmonary 




1.1* 


3.0 


Rat 


Aorta 


153' 


11.2* 


12 



Abbreviations: R m , input resistance (mean arterial pressure/flow) in dyn 
s/cm 5 ; Z Q , characteristic impedance, in dyn s/cm 5 , estimated by averaging 
high-frequency input impedances in aorta and pulmonary artery; value at 
5Hz for other arteries, f min , frequency of first minimum of Z,. 

* Values estimated from published figures if averages were not reported. 

* Ranges for each species and vessel taken from values in Table 7.2 of source. 
' 10' dyn s/cm 5 . 

Source: Milnor WR. 1989. Hemodynamics, 2d ed, p 183, Baltimore, Wil- 
liams and Wilkins. With permission. 
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FIGURE 30.3 Simultaneous pressure and blood velocity patterns recorded at points in the systemic circulation of 
a human. Velocities were recorded with a catheter-tip electromagnetic flowmeter probe. The catheter included a 
lumen for simultaneous pressure measurement. Source: Mills CJ, Gabe IT, Gault JN, et al. 1970. Pressure-flow 
relationships and vascular impedance in man. Cardiovasc Res 40:405, With permission. 
V = velocity (cm/s), P = pressure (mmHg) 

Velocity varies across the vessel due to viscous and inertial effects as mentioned earlier. The velocities 
in Fig. 30.3 were measured at one point in the artery. Velocity profiles are complex because the flow is 
pulsatile and vessels are elastic, curved, and tapered. Profiles measured in the thoracic aorta of a dog at 
normal arterial pressure and cardiac output are shown in Fig. 30.4. Backflow occurs during diastole, and 
profiles are flattened even during peak systolic flow. The shape of the profiles varies considerably with 
mean aortic pressure and cardiac output [Ling et al., 1973]. 

In more peripheral arteries the profiles are resembling parabolic ones as in fully developed laminar 
flow. The general features of these fully developed flow profiles can be modeled using Womersley's 
approach, although nonlinear effects may be important in some cases. The qualitative features of the 
profile depend on the Womersley number N w . Unsteady effects become more important as N w increases. 
Below a value of about 2 the instantaneous profiles are close to the steady parabolic shape. Profiles in 
the aortic arch are skewed due to curvature of the arch. 

30.5 Pathology 

Atherosclerosis is a disease of the arterial wall which appears to be strongly influenced by hemodynamics. 
The disease begins with a thickening of the intimal layer in locations which correlate with the shear stress 
distribution on the endothelial surface [Friedman et al., 1993]. Over time the lesion continues to grow 
until a significant portion of the vessel lumen is occluded. The peripheral circulation will dilate to 
compensate for the increase in resistance of the large vessels, compromising the ability of the system to 
respond to increases in demand during exercise. Eventually the circulation is completely dilated, and 
resting flow begins to decrease. A blood clot may form at the site or lodge in a narrowed segment, causing 
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FIGURE 30.4 Velocity profiles obtained with a hot-film anemometer probe in the descending thoracic aorta of a 
dog at normal arterial pressure and cardiac output. The velocity at f = time/(cardiac period) is plotted as a function 
of radial position. Velocity w is normalized by the maximum velocity w m and radial position at each time by the 
instantaneous vessel radius R(t). The aortic valve opens at t = 0. Peak velocity occurs 11% of the cardiac period after 
aortic valve opening. Source: Ling SC, Atabek WG, Letzing WG, et al. 1973. Nonlinear analysis of aortic flow in living 
dogs. Circ Res 33:198. With permission. 

an acute loss of blood flow. The disease is particularly dangerous in the coronary and carotid arteries 
due to the critical oxygen requirements of the heart and brain. 

In addition to intimal thickening, the arterial wall properties also change with age. Most measurements 
suggest that arterial elastic modulus increases with age (hardening of the arteries); however, in some 
cases arteries to become more compliant (inverse of elasticity) [Learoyd & Taylor, 1966]. Local weakening 
of the wall may also occur, particularly in the descending aorta, giving rise to an aneurysm, which, if 
ruptures, can cause sudden death. 

Defining Terms 

Aneurysm: A ballooning of a blood vessel wall caused by weakening of the elastic material in the wall. 
Atherosclerosis: A disease of the blood vessels characterized by thickening of the vessel wall and 

eventual occlusion of the vessel. 
Collagen: A protein found in blood vessels which is much stiffer than elastin. 
Elastin: A very elastic protein found in blood vessels. 
Endothelial: The inner lining of blood vessels. 

Impedance: A (generally) complex number expressing the ratio of pressure to flow. 
Myogenic: A change in smooth-muscle tone due to stretch or relaxation, causing a blood vessel to resist 

changes in diameter. 
Newtonian: A fluid whose stress-rate-of-strain relationship is linear, following Newton's law. The fluid 

will have a viscosity whose value is independent of rate of strain. 
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Pulmonary: The circulation which delivers blood to the lungs for reoxygenation and carbon dioxide 

removal. 
Systemic: The circulation which supplies oxygenated blood to the tissues of the body. 
Vasoconstrictor: A substance which causes an increase in smooth-muscle tone, thereby constricting 

blood vessels. 
Vasodilator: A substance which causes a decrease in smooth-muscle tone, thereby dilating blood vessels. 
Viscoelastic: A substance which exhibits both elastic (solid) and viscous (liquid) characteristics. 
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Further Information 

A good introduction to cardiovascular biomechanics, including arterial hemodynamics, is provided by 
K. B. Chandran in Cardiovascular Biomechanics. Y. C. Fung's Biodynamics — Circulation is also an excellent 
starting point, somewhat more mathematical than Chandran. Perhaps the most complete treatment of 
the subject is in Hemodynamics by W. R. Milnor, from which much of this chapter was taken. Milnor's 
book is quite mathematical and may be difficult for a novice to follow. 

Current work in arterial hemodynamics is reported in a number of engineering and physiological 
journals, including the Annals of Biomedical Engineering, Journal of Biomechanical Engineering, Circulation 
Research, and The American Journal of Physiology, Heart and Circulatory Physiology. Symposia sponsored 
by the American Society of Mechanical Engineers, Biomedical Engineering Society, American Heart 
Association, and the American Physiological Society contain reports of current research. 
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Microcirculation comprises blood vessels — arterioles, capillaries, and venules — with diameters of less 
than approximately 150 urn. The importance of the microcirculation is underscored by the fact that most 
of the hydrodynamic resistance of the circulatory system lies in the microvessels, especially in arterioles, 
and most of the exchange of nutrients and waste products occurs at the level of the smallest microvessels. 
The subjects of microcirculatory research are: (1) blood flow and molecular transport in microvessels, 
(2) mechanical interactions and molecular exchange between these vessels and the surrounding tissue, 
and (3) regulation of blood flow and molecular transport. Quantitative knowledge of microcirculatory 
mechanics and mass transport has been accumulated primarily in the past 30 years owing to significant 
innovations in methods and techniques to measure microcirculatory parameters and methods to analyze 
microcirculatory data. The development of these methods has required joint efforts of physiologists and 
biomedical engineers. Key innovations include: (1) significant improvements in intravital microscopy, 
(2) the dual-slit method (Wayland-Johnson) for measuring velocity in microvessels, (3) the servo-null 
method (Wiederhielm-Intaglietta) for measuring pressure in microvessels, (4) the recessed oxygen micro- 
electrode method (Whalen) for polarographic measurements of partial pressure of oxygen, and (5) the 
microspectrophotometric method (Pittman-Duling) for measuring oxyhemoglobin saturation in 
microvessels. The single-capillary cannulation method (Landis-Michel) has provided a powerful tool for 
studies of the transport of water and solutes through the capillary endothelium. In the last decade, new 
experimental techniques have appeared, many adapted from cell biology and modified for in vivo studies, 
that are having a tremendous impact on the field. Examples include (1) confocal microscopy for better 
three-dimensional resolution of microvascular structures, (2) methods of optical imaging using fluorescent 
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labels (e.g., labeling blood cells for velocity measurements), and fluorescent dyes (e.g., calcium ion 
sensitive dyes for measuring the dynamics of calcium ion concentration in arteriolar smooth muscle and 
endothelial cells in vivo), (3) development of sensors (glass filaments and optical and magnetic tweezers) 
for measuring forces in the nanonewton range that are characteristic of cell-cell interactions, 
(4) phosphorescence decay measurements as an indicator of oxygen tension, and (5) methods of manip- 
ulating receptors on the surfaces of blood cells and endothelial cells. In addition to the dramatic devel- 
opments in experimental techniques, quantitative knowledge and understanding of the microcirculation 
have been significantly enhanced by theoretical studies, perhaps having a larger impact than in other 
areas of physiology. Extensive theoretical work has been conducted on ( 1 ) the mechanics of the red blood 
cell (RBC) and leukocyte, (2) mechanics of blood flow in single microvessels and microvascular networks, 

(3) oxygen (0 2 ) and carbon dioxide (C0 2 ) exchange between microvessels and surrounding tissue, and 

(4) water and solute transport through capillary endothelium and the surrounding tissue. These theo- 
retical studies not only aid in the interpretation of experimental data, but in many cases also serve as a 
framework for quantitative testing of working hypotheses and as a guide in designing and conducting 
further experiments. The accumulated knowledge has led to significant progress in our understanding 
of mechanisms of regulation of blood flow and molecular exchange in the microcirculation in many 
organs and tissues under a variety of physiological and pathological conditions (e.g., hypoxia, hyperten- 
sion, sickle-cell anemia, diabetes, inflammation, sepsis, and cancer). Discussions are under way to start 
organizing the enormous amount of information on the microcirculation in the form of a database or 
a network of databases encompassing anatomical and functional data, and conceptual (pathway) and 
computational models [Popel et al„ 1998]. This effort is called the Microcirculation Physiome project. 

The goal of this chapter is to give an overview of the current status of research on the systemic 
microcirculation. Issues of pulmonary microcirculation are not discussed. Because of space limitations, 
it is not possible to recognize numerous important contributions to the field of microcirculatory mechan- 
ics and mass transport. In most cases reference is made to recent reviews, when available, and journal 
articles where earlier references can be found. A discussion of experimental and theoretical findings is 
presented along with the gaps in the understanding of microcirculatory flow phenomena. 

31.2 Mechanics of Microvascular Blood Flow 

Vessel dimensions in the microcirculation are small enough so that the effects of the particulate nature 
of blood are significant [Lipowsky, 1987]. Blood is a suspension of formed elements [red blood cells, 
white blood cells (leukocytes) and platelets], in plasma. Plasma is an aqueous solution of mostly proteins 
(albumins, globulins, and fibrinogen) and electrolytes. Under static conditions, human RBCs are bicon- 
cave discs with diameter ~7 to 9 um. The chief function of the RBC is delivery of 2 to tissue. Most of 
the 2 carried by the blood is chemically bound to hemoglobin inside the RBCs. The mammalian RBC 
comprises a viscoelastic membrane filled with a viscous fluid, concentrated hemoglobin solution. The 
membrane consists of the plasma membrane and underlying cytoskeleton. The membrane can undergo 
deformations without changing its surface area, which is nearly conserved locally. RBCs are so easily 
deformable that they can flow through small pores with diameter <3 um. Leukocytes (grouped into three 
categories: granulocytes, monocytes, and lymphocytes) are spherical cells with diameter ~10 to 20 um. 
They are stiffer than RBCs. The main function of these cells is immunologic, i.e., protection of the body 
against microorganisms causing disease. In contrast to mammalian RBCs, leukocytes are nucleated and 
are endowed with an internal structural cytoskeleton. Leukocytes are capable of active ameboid motion, 
the property which allows their migration from the blood stream into the tissue. Platelets are disc-shaped 
blood elements with diameter ~2 to 3 um. Platelets play a key role in thrombogenic processes and blood 
coagulation. The normal volume fraction of RBCs (hematocrit) in humans is 40 to 45%. The total volume 
of RBCs in blood is approximately 500 times greater than the volume of leukocytes and platelets. 
Rheological properties of blood in arterioles and venules and larger vessels are determined primarily by 
RBCs; however, leukocytes play an important mechanical role in capillaries and small venules. 
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Blood plasma is a Newtonian fluid with viscosity of approximately 1 .2 cP. The viscosity of whole blood 
in a rotational viscometer or a large-bore capillary viscometer exhibits shear-thinning behavior, i.e., 
decreases when shear rate increases. At shear rates >100 s _1 and a hematocrit of 40% typical viscosity 
values are 3 to 4 cP. The dominant mechanism of the non-Newtonian behavior is RBC aggregation and 
the secondary mechanism is RBC deformation under shear forces. The cross-sectional distribution of 
RBCs in vessels is nonuniform, with a core of concentrated RBC suspension and a cell-free or cell-depleted 
marginal layer, typically 2 to 5 um thick, adjacent to the vessel wall. The nonuniform RBC distribution 
results in the Fahraeus effect (the microvessel hematocrit is smaller than the feed or discharge hematocrit) 
due to the fact that, on the average, RBCs move with a higher velocity than blood plasma, and the 
concomitant Fahraeus-Lindqvist effect (the apparent viscosity of blood is lower than the bulk viscosity 
measured with a rotational viscometer or a large-bore capillary viscometer at high shear rate). The 
apparent viscosity of a fluid flowing in a cylindrical vessel of radius R and length L under the influence 
of a pressure difference AP is defined as 

nAPR 4 

11 = 

" SQL 

where Q is the volumetric flow rate. For a Newtonian fluid the apparent viscosity becomes the dynamic 
viscosity of the fluid and the above equation represents Poiseuille's Law. The apparent viscosity is a 
function of hematocrit, vessel radius, blood flow rate, and other parameters. In the microcirculation, 
blood flows through a complex branching network of arterioles, capillaries, and venules. Arterioles are 
typically 10 to 150 um in diameter, capillaries are 4 to 8 um, and venules are 10 to 200 um. Now we will 
discuss vascular wall mechanics and blood flow in vessels of different size in more detail. 

Mechanics of the Microvascular Wall 

The wall of arterioles comprises the intima, which contains a single layer of contiguous endothelial cells; 
the media, which contains a single layer of smooth muscle cells in terminal and medium-size arterioles 
or several layers in the larger arterioles; and the adventitia, which contains collagen fibers with occasional 
fibroblasts and mast cells. Fibers situated between the endothelium and the smooth muscle cells comprise 
the elastica lamina. The single layer of smooth muscle cells terminates at the capillaries and reappears at 
the level of small venules; the capillary wall is devoid of smooth muscle cells. Venules typically have a 
larger diameter and smaller wall thickness-to-diameter ratio than arterioles of the corresponding branch- 
ing order. 

Most of our knowledge of the mechanics of the microvascular wall comes from in vivo or in vitro 
measurements of vessel diameter as a function of transmural pressure [Davis, 1993; Shoukas and Bohlen, 
1990]. Development of isolated arteriole preparations has made it possible to precisely control the 
transmural pressure during experiments. In addition, these preparations allow one to separate the effects 
of metabolic factors and blood flow rate from the effect of pressure by controlling both the chemical 
environment and the flow rate through the vessel. Arterioles and venules exhibit vascular tone, i.e., their 
diameter is maximal when smooth muscle is completely relaxed (inactivated). When the vascular smooth 
muscle is constricted, small arterioles may even completely close their lumen to blood flow, presumably 
by buckling endothelial cells. Arterioles exhibit a myogenic response not observed in other blood vessels, 
with the exception of cerebral arteries: within a certain physiological pressure range the vessels constrict 
in response to elevation of transmural pressure and dilate in response to reduction of transmural pressure; 
in other words, in a certain range of pressures the slope of the pressure-diameter relationship is negative. 
Arterioles of different size exhibit different degrees of myogenic responsiveness. This effect has been 
documented in many tissues both in vivo and in vitro (in isolated arterioles), and it has been shown to 
play an important role in regulation of blood flow and capillary pressure (see Section 3 1 .4 on Regulation 
of Blood Flow). 



© 2000 by CRC Press LLC 



The stress-strain relationship for a thin-walled microvessel can be derived from the experimentally 
obtained pressure-diameter relationship using the Law of Laplace. Stress in the vessel wall can be decom- 
posed into passive and active components. The passive component corresponds to the state of complete 
vasodilation. The active component determines the vascular tone and the myogenic response. Steady- 
state stress-strain relationships are, generally, nonlinear. For arterioles, diameter variations of 50 to 100% 
under physiological conditions are not unusual, so that finite deformations have to be considered in 
formulating the constitutive relationship for the wall (relationship between stress, strain, and their 
temporal derivatives). Pertinent to the question of microvascular mechanics is the mechanical interaction 
of a vessel with its environment that consists of connective tissue, parenchymal cells, and extracellular 
fluid. There is ultrastructural evidence that blood vessels are tethered to the surrounding tissue, so that 
mechanical forces can be generated when the vessels constrict or dilate, or when the tissue is moving, 
e.g., in contracting striated muscle, myocardium, or intestine. Little quantitative information is currently 
available about the magnitude of these forces, chiefly because of the difficulty of such measurements. A 
recently reported technique, magnetic tweezers [Guilford and Gore, 1995], opens a new way of investi- 
gating the mechanics of the microvascular wall in vivo and its interaction with the surrounding tissue. 

Under time-dependent conditions microvessels exhibit viscoelastic behavior. In response to a stepwise 
change in the transmural pressure, arterioles typically respond with a fast "passive" change in diameter 
followed by a slow "active" response with a characteristic time on the order of tens of seconds. For 
example, when the pressure is suddenly increased, the vessel diameter will quickly increase, with subse- 
quent vasoconstriction which may result in a lower value of steady-state diameter than that prior to the 
increase in pressure. Therefore, to accurately describe the time-dependent vessel behavior, the constitutive 
relationship between stress and strain or pressure and diameter must also contain temporal derivatives 
of these variables. Theoretical analysis of the resulting nonlinear equations shows that such constitutive 
equations lead to predictions of spontaneous oscillations of vessel diameter (vasomotion) under certain 
conditions [Ursino and Fabbri, 1992]. Vasomotion has been observed in vivo in various tissues and under 
various physiological conditions [Griffith, 1996]. Whether experimentally observed vasomotion and its 
effect on blood flow (flow motion) can be quantitatively described by the published theoretical studies 
remains to be established. It should be noted that there are mechanisms leading to spontaneous flow 
oscillations that are associated with blood rheology and not with vascular wall mechanics [Kiani et al., 
1994]. 

For most purposes, capillary compliance is not taken into account. However, in some situations, such 
as analysis of certain capillary water transport experiments or leukocyte flow in a capillary, this view is 
not adequate and capillary compliance has to be accounted for. Since the capillary wall is devoid of 
smooth muscle cells, much of this compliance is passive and its magnitude is small. However, the presence 
of contractile proteins in the cytoskeleton of capillary endothelial cells opens a possibility of active 
capillary constriction or dilation. 

Capillary Blood Flow 

Progress in this area is closely related to studies of the mechanics of blood cells described elsewhere in 
this book. In narrow capillaries RBCs flow in single file, separated by gaps of plasma. They deform and 
assume a parachute-like shape, generally non-axisymmetric, leaving a submicron plasma sleeve between 
the RBC and endothelium. In the smallest capillaries, their shape is sausage-like. The hemoglobin solution 
inside an RBC is a Newtonian fluid. The constitutive relationship for the membrane is expressed by the 
Evans-Skalak finite-deformations model. The coupled mechanical problem of membrane and fluid 
motion has been extensively investigated using both analytical and numerical approaches [Skalak et al., 
1989]. An important result of the theoretical studies is the prediction of the apparent viscosity of blood. 
While these predictions are in agreement with in vitro studies in glass tubes, they underestimate a few 
available in vivo capillary measurements of apparent viscosity. In addition, in vivo capillary hematocrit 
in some tissues is lower than predicted from in vitro studies with tubes of the same size. To explain the 
low values of hematocrit, RBC interactions with the endothelial glycocalyx have been implicated 
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[Desjardins and Duling, 1990]. Recent direct measurements of microvascular resistance and theoretical 
analysis provide further evidence of the role of the glycocalyx [Pries et al., 1997; Secomb et al., 1998]. 

The motion of leukocytes through blood capillaries has also been studied thoroughly in recent years. 
Because leukocytes are larger and stiffer than RBCs, under normal flow conditions an increase in capillary 
resistance caused by a single leukocyte may be 1000 times greater than that caused by a single RBC 
[Schmid-Schonbein et al., 1989]. Under certain conditions flow stoppage may occur caused by leukocyte 
plugging. After a period of ischemia, red blood cell and leukocyte plugging may prevent tissue reperfusion 
(ischemia-reperfusion injury) [Granger and Schmid-Schonbein, 1994]. Chemical bonds between mem- 
brane-bound receptors and endothelial adhesion molecules play a crucial role in leukocyte-endothelium 
interactions. Methods of cell and molecular biology permit manipulation of the receptors and thus make 
it possible to study leukocyte microcirculatory mechanics at the molecular level. More generally, methods 
of cell and molecular biology open new and powerful ways to study cell micromechanics and cell-cell 
interactions [Chien, 1992]. 

Arteriolar and Venular Blood Flow 

The cross-sectional distribution of RBCs in arterioles and venules is nonuniform. A concentrated sus- 
pension of RBCs forms a core surrounded by a cell-free or cell-depleted layer of plasma. This "lubrication" 
layer of lower viscosity fluid near the vessel wall results in lower values of the apparent viscosity of blood 
compared to its bulk viscosity, producing the Fahraeus-Lindqvist effect [Lipowsky, 1987], There is 
experimental evidence that velocity profiles of RBCs are generally symmetric in arterioles, except very 
close to vascular bifurcations, but are generally asymmetric in venules; the profiles are blunted in vessels 
of both types [Ellsworth and Pittman, 1986; Tangelder et al., 1986]. Moreover, flow in the venules may 
be stratified as the result of converging blood streams that do not mix rapidly. The key to understanding 
the pattern of arteriolar and venular blood flow is the mechanics of flow at vascular bifurcations, diverging 
for arteriolar flow and converging for venular flow [Das et al., 1997]. One of the main unresolved 
questions is under what physiological conditions does RBC aggregation affect venular velocity distribu- 
tion and vascular resistance. Much is known about aggregation in vitro, but in vivo knowledge is incom- 
plete [CabeletaL, 1997]. 

Problems of leukocyte distribution in the microcirculation and their interaction with the microvascular 
endothelium have attracted considerable attention in recent years. Leukocyte rolling along the walls of 
venules, but not arterioles, has been demonstrated. This effect results from differences in the microvas- 
cular endothelium, not the flow conditions [Ley and Gaehtgens, 1991]. Platelet distribution in the lumen 
is important because of their role in blood coagulation. Detailed studies of platelet distribution in 
arterioles and venules show that the cross-sectional distribution of these disk-shaped blood elements is 
dependent on the blood flow rate and vessel hematocrit [Woldhuis et al., 1992]. 

To conclude, many important features of blood flow through arterioles and venules are qualitatively 
known and understood; however, a rigorous theoretical description of flow as a suspension of discrete 
particles is not available. Such description is necessary for a quantitative understanding of the mechanisms 
of the nonuniform distribution of blood cells in microvessels. 

Microvascular Networks: Structure and Hemodynamics 

Microvascular networks in different organs and tissues differ in their appearance and structural organi- 
zation. Methods have been developed to quantitatively describe network architectonics and hemodynam- 
ics [Popel, 1987; Hudetz, 1997]. The microvasculature is an adaptable structure capable of changing its 
structural and functional characteristics in response to various stimuli [Skalak and Price, 1996] . Microvas- 
cular angiogenesis and rarefaction (anti-angiogenesis) are important examples of this adaptive behavior 
that play important physiological and pathophysiological roles. Methods of fractal analysis have been 
applied to interpret experimental data on angio architecture and blood flow distribution in the micro- 
circulation; these methods explore the property of geometrical and flow similarity that exists at different 
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scales in the network [Bassingthwaighte et al., 1994]. Microvascular hydraulic pressure varies systemat- 
ically between consecutive branching orders, decreasing from the systemic values down to 20 to 25 mm Hg 
in the capillaries and decreasing further by 10 to 15 mm Hg in the venules. Mean microvascular blood 
flow rate in arterioles decreases towards the capillaries, in inverse proportion to the number of "parallel" 
vessels, and increases from capillaries through the venules. In addition to this longitudinal variation of 
blood flow and pressure among different branching orders, there are significant variations among vessels 
of the same branching order, referred to as flow heterogeneity. The heterogeneity of blood flow and RBC 
distribution in microvascular networks has been well documented in a variety of organs and tissues. This 
phenomenon may have important implications for tissue exchange processes, so significant efforts have 
been devoted to the quantitative analysis of blood flow in microvascular networks. A mathematical model 
of blood flow in a network can be formulated as follows: (1) network topology or vessel interconnections 
have to be specified, (2) the diameter and length of every vascular segment has to be known (alternatively, 
vessel diameter can be specified as a function of transmural pressure and perhaps some other parameters; 
these relationships are discussed in the preceding section on wall mechanics), (3) the apparent viscosity 
of blood has to be specified as a function of vessel diameter, local hematocrit, and shear rate, (4) a 
relationship between RBC flow rates and bulk blood flow rates at diverging bifurcations has to be specified; 
this relationship is often referred to as the "bifurcation law," and (5) boundary conditions have to be 
specified at the inlet vessel branches: bulk flow rate as well as RBC flow rate, or hematocrit (alternatively, 
pressure can be specified at both inlet and outlet branches). This set of generally nonlinear equations 
can be solved to yield pressure at each bifurcation, blood flow rate through each segment, and discharge 
or microvessel hematocrit in each segment. These equations also predict vessel diameters if vessel com- 
pliance is taken into account. The calculated variables can then be compared with experimental data. 
Such a detailed comparison was reported for rat mesentery [Pries et al., 1996]. The authors found a good 
agreement between theoretical and experimental data when histograms of parameter distributions were 
compared, but poor agreement, particularly for vessel hematocrit, when comparison was done on a 
vessel-by-vessel basis. In these calculations, the expression for apparent viscosity was taken from in vitro 
experiments. The agreement was improved when the apparent viscosity was increased substantially from 
its in vitro values, particularly in the smallest vessels. Thus, a working hypothesis was put forward that 
in vivo apparent viscosity in small vessels is higher than the corresponding in vitro viscosity in glass tubes. 
Recent experimental and theoretical studies support this hypothesis [Pries et al., 1997; Secomb et al., 
1998]. 

31.3 Mass Transport in the Microcirculation 

Transport of Oxygen and Carbon Dioxide 

One of the most important functions of the microcirculation is the delivery of 2 to tissue and the 
removal of waste products, particularly of C0 2 , from tissue. 2 is required for aerobic intracellular 
respiration for the production of adenosine triphosphate (ATP). C0 2 is produced as a by-product of 
these biochemical reactions. Tissue metabolic rate can change drastically, e.g., in aerobic muscle in the 
transition from rest to exercise, which necessitates commensurate changes in blood flow and 2 delivery. 
One of the major issues studied is how 2 delivery is matched to 2 demand under different physiological 
and pathological conditions. This question arises for short term or long term regulation of 2 delivery 
in an individual organism, organ, or tissue, as well as in the evolutionary sense, phylogeny. The hypothesis 
of symmorphosis, a fundamental balance between structure and function, has been formulated for the 
respiratory and cardiovascular systems and tested in a number of animal species [Weibel et al., 1992; Roy 
andPopel, 1996]. 

In the smallest exchange vessels (capillaries, small arterioles, and venules) , 2 molecules are ( 1 ) released 
from hemoglobin inside RBCs, (2) diffuse through the plasma, and (3) cross the endothelium, the 
extravascular space, and parenchymal cells until they reach the mitochondria where they are utilized in 
the process of oxidative phosphorylation. The nonlinear relationship between hemoglobin saturation 
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with 2 and the local 2 tension (P0 2 ) is described by the oxyhemoglobin dissociation curve (ODC). 
The theory of 2 transport from capillaries to tissue was conceptually formulated by August Krogh in 
1918 and it has dominated the thinking of physiologists for eight decades. The model he formulated 
considered a cylindrical tissue volume supplied by a single central capillary; this element was considered 
the building block for the entire tissue. A constant metabolic rate was assumed and P0 2 at the capillary- 
tissue interface was specified. The solution to the corresponding transport equation is the Krogh-Erlang 
equation describing the radial variation of 2 tension in tissue. Over the years, the Krogh Tissue Cylinder 
model has been modified by many investigators to include transport processes in the capillary and P0 2 - 
dependent consumption. However, in the past few years new conceptual models of 2 transport have 
emerged. First, it was discovered experimentally and subsequently corroborated by theoretical analysis 
that capillaries are not the only source of oxygen, but arterioles (precapillary 2 transport) and to a 
smaller extent venules (postcapillary 2 transport), also participate in tissue oxygenation; in fact, a 
complex pattern of 2 exchange may exist between arterioles, venules, and adjacent capillary networks 
[Ellsworth et al„ 1994] . Second, theoretical analysis of intracapillary transport suggested that a significant 
part of the resistance to 2 transport, on the order of 50%, is located within the capillary, primarily due 
to poor diffusive conductance of the plasma gaps between the erythrocytes [Roy and Popel, 1996]. Third, 
the effect of myoglobin-facilitated 2 diffusion in red muscle fibers and cardiac myocytes has been re- 
evaluated, however, its significance must await additional experimental studies. Fourth, geometric and 
hemodynamic heterogeneities in 2 delivery have been quantified experimentally and attempts have been 
made to model them. Discussion of theoretical issues of 2 transport can be found in Popel [1989] and 
Heliums et al. [1996] and experimental findings are reviewed in Pittman [1995] and Sibbald et al. [1998]. 
One important emerging area of application of this knowledge is artificial oxygen carriers, hemoglobin- 
based and non-hemoglobin-based [Winslow et al., 1997]. 

Transport of C0 2 is coupled with 2 through the Bohr effect (effect of C0 2 tension on the blood 2 
content) and the Haldane effect (effect of P0 2 on the blood C0 2 content). Diffusion of C0 2 is faster than 
that of 2 because C0 2 solubility in tissue is higher; theoretical studies predict that countercurrent 
exchange of C0 2 between arterioles and venules is of major importance so that equilibration of C0 2 
tension with surrounding tissue should occur before capillaries are reached. Experiments are needed to 
test these theoretical predictions. 

Transport of Solutes and Water 

The movement of solute molecules across the capillary wall occurs primarily by two mechanisms: 
(1) diffusion and (2) solvent drag. Diffusion is the passive mechanism of transport which rapidly and 
efficiently transports small solutes over the small distances (tens of microns) between the blood supply 
(capillaries) and tissue cells. Solvent drag refers to the movement of solute that is entrained in the bulk 
flow of fluid across the capillary wall and is generally negligible, except in cases of large molecules with 
small diffusivities and high transcapillary fluid flow. 

The capillary wall is composed of a single layer of endothelial cells about 1 um thick. Lipid soluble 
substances (e.g., 2 ) can diffuse across the entire wall surface, whereas water soluble substances (e.g., 
glucose) are restricted to small aqueous pathways equivalent to cylindrical pores 8 to 9 nm in diameter. 
Total pore area is about 0.1% of the surface area of a capillary. The permeability of the capillary wall to 
a particular substance depends upon the relative size of the substance and the pore ("restricted" diffusion). 
The efficiency of diffusive exchange can be increased by increasing the number of perfused capillaries 
(e.g., heart and muscle tissue from rest to exercise), since this increases the surface area available for 
exchange and decreases the distances across which molecules must diffuse. 

The actual pathways through which small solutes traverse the capillary wall appear to be in the form 
of clefts between adjacent endothelial cells. Rather than being open slits, these porous channels appear 
to contain a matrix of small cylindrical fibers (perhaps glycosaminoglycans) that occupy about 5% of 
the volume of these pathways. The permeability properties of the capillary endothelium are modulated 
by a number of factors, among which are plasma protein concentration and composition, rearrangement 
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of the endothelial cell glycocalyx, calcium influx into the endothelial cell, and endothelial cell membrane 
potential. Many of the studies that have established our current understanding of the endothelial exchange 
barrier have been carried out on single perfused capillaries of the frog mesentery; some recent investi- 
gations have examined these transport issues in mammalian tissues [Curry, 1994; Michel, 1996; Wein- 
baum, 1998]. There could be, in addition to the porous pathways, nonporous pathways that involve 
selective uptake of solutes and subsequent transcellular transport. In order to study such pathways, one 
must try to minimize the contributions to transcapillary transport from solvent drag. 

The processes whereby water passes back and forth across the capillary wall are called filtration and 
absorption. The flow of water depends upon the relative magnitude of hydraulic and osmotic pressures 
across the capillary wall and is described quantitatively by the Kedem-Katchalsky equations (the particular 
form of the equations applied to capillary water transport is referred to as Starling's Law). Overall, in 
the steady-state there is an approximate balance between hydraulic and osmotic pressures which leads 
to a small net flow of water. Generally, more fluid is filtered than is reabsorbed; the overflow is carried 
back to the vascular system by the lymphatic circulation. The lymphatic network is composed of a large 
number of small vessels, the terminal branches of which are closed. Flap valves (similar to those in veins) 
ensure unidirectional flow of lymph back to the central circulation. The smallest (terminal) vessels are 
very permeable, even to proteins which occasionally leak from systemic capillaries. Lymph flow is deter- 
mined by interstitial fluid pressure and the lymphatic "pump" (one-way flap valves and skeletal muscle 
contraction). Control of interstitial fluid protein concentration is one of the most important functions 
of the lymphatic system. If more net fluid is filtered than can be removed by the lymphatics, the volume 
of interstitial fluid increases. This fluid accumulation is called edema. This circumstance is important 
clinically since solute exchange (e.g., 2 ) decreases due to the increased diffusion distances produced 
when the accumulated fluid pushes the capillaries, tethered to the interstitial matrix, away from each other. 

31.4 Regulation of Blood Flow 

The cardiovascular system controls blood flow to individual organs (1) by maintaining arterial pressure 
within narrow limits and (2) by allowing each organ to adjust its vascular resistance to blood flow so 
that each receives an appropriate fraction of the cardiac output. There are three major mechanisms that 
control the function of the cardiovascular system: local, neural, and humoral. The sympathetic nervous 
system and circulating hormones both provide overall vasoregulation, and thus coarse flow control, to 
all vascular beds. The local mechanisms provide finer regional control within a tissue, usually in response 
to local changes in tissue activity or local trauma. The three mechanisms can work independently of each 
other, but there are also interactions among them. 

The classical view of blood flow control involved the action of vasomotor influences on a set of vessels 
called the "resistance vessels," generally arterioles and small arteries smaller than about 100 um diameter, 
that controlled flow to and within an organ. The notion of "precapillary sphincters" that control flow in 
individual capillaries has been abandoned in favor of the current idea that the terminal arterioles control 
the flow in small capillary networks that branch off of these arterioles. In recent years, it has become 
clear that the resistance to blood flow is distributed over a wider range of vessel branching orders with 
diameters up to 500 um. There are at least two mechanisms to be discussed below that are available for 
coordinating the actions of local control processes over wider regions. 

Neurohumoral Regulation of Blood Flow 

The role of neural influences on the vasculature varies greatly from organ to organ. Although all organs 
receive sympathetic innervation, regulation of blood flow in the cerebral and coronary vascular beds 
occurs mostly through intrinsic local (metabolic) mechanisms. The circulations in skeletal muscle, skin, 
and some other organs, however, are significantly affected by the sympathetic nerves. In general, the level 
of intrinsic myogenic activity and sympathetic discharge sets the state of vascular smooth muscle 
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contraction (basal vascular tone) and hence vascular resistance in organs. This basal tone is modulated 
by circulating and local vasoactive influences, e.g., endothelium-derived relaxing factor (EDRF), identified 
as nitric oxide (NO), endothelin, and vasoactive substances released from parenchymal cells. 

Local Regulation of Blood Flow 

In addition to neural and humoral mechanisms for regulating the function of the cardiovascular system, 
there are mechanisms intrinsic to the various tissues which can operate independently of neurohumoral 
influences. The site of local regulation is the microcirculation. Common examples of local control 
processes are (1) autoregulation of blood flow, (2) reactive hyperemia, and (3) active (or functional) 
hyperemia. The two major theories of local regulation are (1) the myogenic hypothesis which states, in 
essence, that the vascular smooth muscle actively contracts in response to stretch (see section on Mechan- 
ics of the Microvascular Wall in this chapter) and (2) the metabolic hypothesis which states that there is 
a link between blood flow and tissue metabolism [Duling, 1991]. 

Cells have a continuous need for 2 and also continuously produce metabolic wastes, some of which 
are vasoactive (usually vasodilators). Under normal conditions there is a balance between 2 supply and 
demand, but imbalances give rise to adjustments in blood flow that bring supply back into register with 
demand. Consider exercising the skeletal muscle as an example. With the onset of exercise, metabolite 
production and 2 requirements increase. The metabolites diffuse away from their sites of production 
and reach the vasculature. Vasodilation ensues, lowering resistance to blood flow. The resulting increase 
in blood flow increases the 2 supply and finally a new steady-state is achieved in which 2 supply and 
demand are matched. This scenario operates for other tissues in which metabolic activity changes. 

The following 2 -linked metabolites have been implicated as potential chemical mediators in the 
metabolic hypothesis: adenosine (from ATP hydrolysis: ATP— >ADP— »AMP— ^adenosine), H + , and lactate 
(from lactic acid generated by glycolysis). Their levels are increased when there is a reduction in 2 supply 
relative to demand (i.e., tissue hypoxia). The production of more C0 2 as a result of increased tissue 
activity (leading to increased oxidative metabolism) leads to vasodilation through increased H + concen- 
tration. Increased potassium ions and interstitial fluid osmolarity (i.e., more osmotically active particles) 
transiently cause vasodilation under physiological conditions associated with increased tissue activity. 

A recent novel idea for oxygen-linked vasoregulation is that the RBC itself could act as a mobile sensor 
for hypoxia [Ellsworth et al., 1995]. The mechanism works as follows. Under conditions of low oxygen 
and pH, the RBC releases ATP which binds to purinergic receptors on the endothelial cells. This leads 
to the production in the endothelial cells of the vasodilator, NO. Since the most likely location for hypoxia 
would be in or near the venular network, the local vasodilatory response to NO is propagated to upstream 
vessels causing arteriolar vasodilation (see the section below on Coordination of Vasomotor Responses). 

Some Functions of the Endothelium 

Endothelial cells form the lining of all blood vessels. They provide a smooth, non-thrombogenic surface 
with which the blood is in intimate contact. In addition to their passive permeability barrier function, 
endothelial cells produce a number of important vasoactive substances, among which are: 

(1) prostacyclin, a potent vascular smooth muscle relaxant and inhibitor of platelet aggregation; 

(2) endothelin, a potent vasoconstrictor peptide; and (3) NO that mediates the vasodilatory effect of a 
number of vasodilators (e.g., acetylcholine) [Furchgott and Vanhouttee, 1989; Ignarro, 1989]. 

It has been observed in arteries and arterioles that increases in blood flow lead to vasodilation 
(flow-dependent dilation) [Smiesko and Johnson, 1993]. This phenomenon appears to be mediated by 
NO release from the endothelium in those vessels. The sequence of events is: (1) blood flow increases; 
(2) shear stress at the vessel wall increases (thereby increasing the viscous drag on the endothelial lining 
of the vessel); (3) NO is released in response to the mechanical stimulus; and (4) vascular smooth muscle 
relaxation (vasodilation) occurs in response to the elevated level of NO. 
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Coordination of Vasomotor Responses 

Communication via gap junctions between the two active cell types in the blood vessel wall, smooth 
muscle and endothelial cells, appears to play an important role in coordinating the responses among 
resistance elements in the vascular network [Duling, 1991; Segal, 1992]. There is chemical and electrical 
coupling between the cells of the vessel wall, and this signal, in response to locally released vasoactive 
substances (e.g., from vessel wall, RBCs, or parenchymal cells) can travel along a vessel in either direction 
with a length constant of about 2 mm. There are two immediate consequences of this communication. 
A localized vasodilatory stimulus of metabolic origin will be propagated to contiguous vessels, thereby 
lowering the resistance to blood flow in a larger region. In addition, this more generalized vasodilation 
should increase the homogeneity of blood flow in response to the localized metabolic event. The increase 
in blood flow produced as a result of this vasodilation will cause flow also to increase at upstream sites. 
The increased shear stress on the endothelium as a result of the flow increase will lead to vasodilation 
of these larger upstream vessels. Thus, the neurohumoral and local responses are linked together in a 
complex control system that matches regional perfusion to the local metabolic needs [Segal and Kurjiaka, 
1995; Muller et al., 1996]. 
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Defining Terms 

Angiogenesis: The growth and development of the vessels. 

Apparent viscosity: The viscosity of a Newtonian fluid that would require the same pressure difference 

to produce the same blood flow rate through a circular vessel as the blood. 
Fahraeus effect: Microvessel hematocrit is smaller than hematocrit in the feed or discharge reservoir. 
Fahraeus-Lindqvist effect: Apparent viscosity of blood in a microvessel is smaller than the bulk vis- 
cosity measured with a rotational viscometer or a large-bore capillary viscometer. 
Krogh Tissue Cylinder model: A cylindrical volume of tissue supplied by a central cylindrical capillary. 
Myogenic response: Vasoconstriction in response to elevated transmural pressure and vasodilation in 

response to reduced transmural pressure. 
Myoglobin-facilitated 2 diffusion: An increase of 2 diffusive flux as a result of myoglobin molecules 

acting as a carrier for 2 molecules. 
Oxyhemoglobin dissociation curve: The equilibrium relationship between hemoglobin saturation and 

2 tension. 
Poiseuille's Law: The relationship between volumetric flow rate and pressure difference for steady flow 

of a Newtonian fluid in a long circular tube. 
Precapillary 2 transport: 2 diffusion from arterioles to the surrounding tissue. 
Starling's Law: The relationship between water flux through the endothelium and the difference 

between the hydraulic and osmotic transmural pressures. 
Vasomotion: Spontaneous rhythmic variation of microvessel diameter. 
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The term hematocytes refers to the circulating cells of the blood. These are divided into two main classes: 
erythrocytes, or red cells, and leukocytes, or white cells. In addition to these are specialized cell-like 
structures called platelets. The mechanical properties of these cells are of special interest because of their 
physiological role as circulating corpuscles in the flowing blood. The importance of the mechanical 
properties of these cells and their influence on blood flow is evident in a number of hematological 
pathologies. The properties of the two main types of hematocytes are distinctly different. The essential 
character of a red cell is that of an elastic bag enclosing a newtonian fluid of comparatively low viscosity. 
The essential behavior of white cells is that of a high viscous fluid drop with a more or less constant 
cortical (surface) tension. Under the action of a given force, red cells deform much more readily than 
white cells. In this chapter we focus on descriptions of the behavior of the two cell types separately, 
concentrating on the viscoelastic characteristics of the red cell membrane and the fluid of characteristics 
of the white cell cytosol. 

32.1 Fundamentals 

Stresses and Strains in Two Dimensions 

The description of the mechanical deformation of the membrane is cast in terms of principal force 
resultants and principle extension ratios of the surface. The force resultants, like conventional three- 
dimensional strain, are generally expressed in terms of a tensorial quantity, the components of which 
depend on coordinate rotation. For the purposes of describing the constitutive behavior of the surface, 
it is convenient to express the surface resultants in terms of rotationally invariant quantities. These can 
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be either the principal force resultants Nj and N 2 , or the isotropic resultant N and the maximum shear 
resultant N s . The surface strain is also a tensorial quantity but may be expressed in terms of the principal 
extension ratios of the surface X l and X 2 . The rate of surface shear deformation [Evans & Skalak, 1979] 
is given by 



V 



\& 



v x iy 



d_ 

dt 



\Kj 



1/2 



(32.1) 



The membrane deformation is calculated from observed macroscopic changes in cell geometry, usually 
with the use of simple geometric shapes to approximate the cell shape. The membrane force resultants 
are calculated from force balance relationships. For example, in the determination of the area expansivity 
modulus of the red cell membrane or the cortical tension in neutrophils, the force resultants in the plane 
of the membrane of the red cell or the white cell are isotropic. In this case, as long as the membrane 
surface of the cell does not stick to the pipette, the membrane force resultant can be calculated from the 
law of Laplace: 



AP=2N 



1 
R 



(32.2) 



where _R is the radius of the pipette, R c is the radius of the spherical portion of the cell outside the 
pipette, N is the isotropic force resultant (tension) in the membrane, and AP is the aspiration pressure 
in the pipette. 

Basic Equations for Newtonian Fluid Flow 

The constitutive relations for fluid flow in a sphere undergoing axisymmetric deformation can be written: 



-P+2T1- 
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(32.3) 
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where a rr and CT r6 are components of the stress tensor, p is the hydrostatic pressure r is the radial coordinate, 
is the angular coordinate in the direction of the axis of symmetry in spherical coordinates, and V T and 
V e are components of the fluid velocity vector. These equations effectively define the material viscosity, 
r\. In general, r\ may be a function of the strain rate. The second term in Eq. (32.3) contains the radial 
strain rate s rr and the bracketed term in Eq. (32.4) corresponds to e r8 . For the purposes of evaluating 
this dependence, it is convenient to define the mean shear rate y m averaged over the cell volume and 
duration of the deformation process t e : 
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where repeated indices indicate summation. 
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TABLE 32.1 Parameter Values for a Typical 
Red Blood Cell (37°C) 



Area 


132 pm 2 


Volume 


96 pm 2 


Sphericity 


0.77 


Membrane area modulus 


480 mN/m 


Membrane shear modulus 


0.006 mN/m 


Membrane viscosity 


0.00036 mN-s/m 


Membrane bending stiffness 


0.2 x 10- 18 ] 


Thermal area expansivity 


0.12%/°C 


1 dV 


-0.14%/°C 


V dT 





32.2 Red Cells 
Size and Shape 

The normal red cell is a biconcave disk at rest. The average human cell is approximately 7.7 urn in 
diameter and varies in thickness from -2.8 pm at the rim to -1.4 urn at the center [Fung et al., 1981]. 
However, red cells vary considerably in size even within a single individual. The mean surface area 
-130 pm 2 and the mean volume is 96 pm 3 (Table 32.1), but the range of sizes within a population is 
gausian-distributed with standard deviations of -15.8 pm 2 for the area and -16.1 pm 3 for the volume 
[Fung et al., 1981]. Cells from different species vary enormously in size, and tables for different species 
have been tabulated elsewhere [Hawkey et al., 1991]. 

Red cell deformation takes place under two important constraints: fixed surface area and fixed volume. 
The constraint of fixed volume arises from the impermeability of the membrane to cations. Even though 
the membrane is highly permeable to water, the inability of salts to cross the membrane prevents 
significant water loss because of the requirement for colloidal osmotic equilibrium [Lew & Bookchin, 
1986] . The constraint of fixed surface area arises from the large resistance of bilayer membranes to changes 
in area per molecule [Needham & Nunn, 1990]. These two constraints place strict limits on the kinds 
of deformations that the cell can undergo and the size of the aperture that the cell can negotiate. Thus, 
a major determinant of red cell deformability is its ratio of surface area to volume. One measure of this 
parameter is the sphericity, defined as the dimensionless ratio of the two-thirds power of the cell volume 
to the cell area times a constant that makes its maximum value 1.0: 

S= — ^^-^— . (32.6) 



( 4J r/3)' 



The mean value of sphericity of a normal population of cells was measured by interference microscopy 
to be 0.79 with a standard deviation (SD) of 0.05 at room temperature [Fung et al., 1981]. Similar values 
were obtained using micropipettes: mean = 0.81, SD = 0.02 [Waugh 8c Agre, 1988]. The membrane area 
increases with temperature, and the membrane volume decreases with temperature, so the sphericity at 
physiological temperature, is expected to be somewhat smaller. Based on measurements of the thermal 
area expansivity of 0.12%/°C [Waugh & Evans, 1979] and a change in volume of -0.14%/°C [Waugh & 
Evans, 1979], the mean sphericity at 37°C is estimated to be 0.76-0.78. (See Table 32.1.) 

Red Cell Cytosol 

The interior of a red cell is a concentrated solution of hemoglobin, the oxygen-carrying protein, and it 
behaves as a newtonian fluid [Cokelet 8c Meiselman, 1968]. In a normal population of cells there is a 
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TABLE 32.2 Viscosity of Red Cell Cytosol (37°C) 



Hemoglobin 




Measured 


Best Fit 


Concentration, 


g/1 


Viscosity*, mPa-s 


Viscosity*, mPa-s 


290 




4.1-5.0 


4.2 


310 




5.2-6.6 


5.3 


330 




6.6-9.2 


6.7 


350 




8.5-13.0 


8.9 


370 




10.8-17.1 


12.1 


390 




15.0-23.9 


17.2 



* Data taken from Cokelet and Meiselman [1968] and 
Chien and coworkers [1970]. 

f Fitted curve from Ross and Minton [1977]. 

distribution of hemoglobin concentrations in the range 29 to 39 g/dl. The viscosity of the cytosol depends 
on the hemoglobin concentration as well as temperature. (See Table 32.2.) Based on theoretical models 
[Ross & Minton, 1977] , the temperature dependence of the cytosolic viscosity is expected to be the same 
as that of water, that is, the ratio of cytosolic viscosity at 37°C to the viscosity at 20°C is the same as the 
ratio of water viscosity at those same temperatures. In most cases, even in the most dense cells, the 
resistance to flow of the cytosol is small compared with the viscoelastic resistance of the membrane when 
membrane deformations are appreciable. 

Membrane Area Dilation 

The large resistance of the membrane to area dilation has been characterized in micromechanical exper- 
iments. The changes in surface area that can be produced in the membrane are small, and so they can 
be characterized in terms of a simple hookean elastic relationship between the isotropic force resultant N 
and the fractional change in surface area a = AIA - 1 : 

N=Ka. (32.7) 

The proportionality constant K is called the area compressibility modulus or the area expansivity modulus. 
Early estimates placed its value at room temperature at -450 mN/m [Evans & Waugh, 1977] and showed 
a dependence of the modulus on temperature, its value changing from -300 mN/m at 45°C to a value 
of -600 mN/m at 5°C [Waugh & Evans, 1979]. Subsequently it was shown that the measurement of this 
parameter using micropipettes is affected by extraneous electric fields, and the value at room temperature 
was corrected upward to -500 mN/m [Katnik & Waugh, 1990]. The values in Table 32.3 are based on 
this measurement, and the fractional change in the modulus with temperature is based on the original 
micropipette measurements [Waugh & Evans, 1979]. 

TABLE 32.3 Temperature Dependence of Viscoelastic 
Coefficients of the Red Cell Membrane 



Temperature 


TO 


K (mN/m)* 


u m (mN/m) 1 


r\ m (mN-s/m)* 


5 




670 


0.0078 


0.0021 


15 




580 


0.0072 


0.0014 


25 




500 


0.0065 


0.00074 


37 




400 


0.0058 


0.00036 


45 




330 


0.0053 


— 



* Based on a value of the modulus at 35°C of 500 mN/m and the 
fractional change in modulus with temperature measured by Waugh 
and Evans [1979], 

f Based on linear regression to the data of Waugh and Evans [1979]. 

* Data from Hochmuth and colleagues [1980]. 
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Membrane Shear Deformation 

The shear deformations of the red cell surface can be large, and so a simple linear relationship between 
force and extension is not adequate for describing the membrane behavior. The large resistance of the 
membrane composite to area dilation led early investigators to postulate that the membrane maintained 
constant surface density during shear deformation, that is, that the surface was two-dimensionally 
incompressible. Most of what exists in the literature about the shear deformation of the red cell membrane 
is based on this assumption. Only very recently has experimental evidence emerged that this assumption 
is an oversimplification of the true cellular behavior, and that deformation produces changes in the local 
surface density of the membrane elastic network [Discher et al., 1994]. Nevertheless, the older simpler 
relationships provide a description of the cell behavior that can be used for many applications, and so 
the properties of the cell defined under that assumption are summarized here. 

For a simple, two-dimensional, incompressible, hyperelastic material, the relationship between the 
membrane shear force resultant N s and the material deformation [Evans & Skalak, 1979] is 



N =^a 

5 2 






+ 2Tl m V s (32.8) 



where X l and X 2 are the principal extension ratios for the deformation and V s is the rate of surface shear 
deformation [Eq. (32.1)]. The membrane shear modulus u,„ and the membrane viscosity r\ m are defined 
by this relationship. Values for these coefficients at different temperatures are given in Table 32.3. 

Stress Relaxation and Strain Hardening 

Subsequent to these original formulations a number of refinements to these relationships have been 
proposed. Observations of persistent deformations after micropipette aspiration for extended periods of 
time formed the basis for the development of a model for long-term stress relaxation [Markle et al., 
1983]. The characteristic times for these relaxations were on the order of 1-2 h, and these times were 
thought to correlate with permanent rearrangements of the membrane elastic network. 

Another type of stress relaxation is thought to occur over very short times (~0.1 s) after rapid 
deformation of the membrane either by micropipette [Chien et al., 1978] or in cell extension experiments 
(Waugh & Bisgrove, unpublished observations). This phenomenon is thought to be due to transient 
entanglements within the deforming network. Whether the phenomenon actually occurs remains con- 
troversial. The stresses relax rapidly, and it is difficult to account for inertial effects of the measuring 
system and to reliably assess the intrinsic cellular response. 

Finally, there has been some evidence that the coefficient for shear elasticity may be a function of the 
surface extension, increasing with increasing deformation. This was first proposed by Fischer in an effort 
to resolve discrepancies between theoretical predictions and observed behavior of red cells undergoing 
dynamic deformations in fluid shear [Fischer et al., 1981]. Increasing elastic resistance with extension 
has also been proposed as an explanation for discrepancies between theoretical predictions based on a 
constant modulus and measurements of the length of a cell projection into a micropipette [Waugh & 
Marchesi, 1990]. However, due to the approximate nature of the mechanical analysis of cell deformation 
in shear flow, and the limits of optical resolution in micropipette experiments, the evidence for a 
dependence of the modulus on extension is not clear-cut, and this issue remains unresolved. 

New Constitutive Relations for the Red Cell Membrane 

The most modern picture of membrane deformation recognizes that the membrane is a composite of 
two layers with distinct mechanical behavior. The membrane bilayer, composed of phospholipids and 
integral membrane proteins, exhibits a large elastic resistance to area dilation but is fluid in surface shear. 
The membrane skeleton, composed of a network of structural proteins at the cytoplasmic surface of the 
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bilayer, is locally compressible and exhibits an elastic resistance to surface shear. The assumption that 
the membrane skeleton is locally incompressible is no longer applied. This assumption had been chal- 
lenged over the years on the basis of theoretical considerations, but only very recently has experimentally 
evidence emerged that shows definitively that the membrane skeleton is compressible. [Discher et al., 
1994]. This has led to a new constitutive model for membrane behavior [Mohandas & Evans, 1994]. The 
principal stress resultants in the membrane skeleton are related to the membrane deformation by 



*i=H, 



v x 2 



and 



+ K h 



XtX '-m» 



(32.9) 



N 2 =\l t 






+ K. 



KK ~m« 



(32.10) 



where |% and K N are the shear and isotropic moduli of the membrane skeleton respectively, and n is a 
parameter to account for molecular crowding of the skeleton in compression. The original modulus 
based on the two-dimensionally incompressible case is related to these moduli by 






(32.11) 



Values for the coefficients determined from fluorescence measurements of skeletal density distributions 
during micropipette aspiration studies are |i N ~ 6 x 10~ 3 mN/m, and K N /[l N ~ 2. The value for n is 
estimated to be > 2 [Mohandas & Evans, 1994; Discher, et al, 1994]. 

These new concepts for membrane constitutive behavior have only recently been introduced and have 
yet to be thoroughly explored. The temperature dependence of these moduli is unknown, and the 
implications such a model will have on interpretation of dynamic deformations of the membrane remain 
to be resolved. 

Bending Elasticity 

Even though the membrane is very thin, it has a high resistance to surface dilation. This property, coupled 
with the finite thickness of the membrane, gives the membrane a small but finite resistance to bending. 
This resistance is characterized in terms of the membrane-bending modulus. The bending resistance of 
biological membranes is inherently complex because of their lamellar structure. There is a local resistance 
to bending due to the inherent stiffness of the individual leaflets of the membrane bilayer. (Because the 
membrane skeleton is compressible, it is thought to contribute little if anything to the membrane-bending 
stiffness.) In addition to this local stiffness, there is a nonlocal bending resistance due to the net compression 
and expansion of the adjacent leaflets resulting from the curvature change. The nonlocal contribution 
is complicated by the fact that the leaflets may be redistributed laterally within the membrane capsule 
to equalize the area per molecule within each leaflet. The situation is further complicated by the likely 
possibility that molecules may exchange between leaflets to alleviate curvature-induced dilation/com- 
pression. Thus, the bending stiffness measured by different approaches probably reflects contributions 
from both local and nonlocal mechanisms, and the measured values may differ because of different 
contributions from the two mechanisms. Estimates based on buckling instabilities during micropipette 
aspiration give a value of -0.18 x 10~ 18 J [Evans, 1983]. Measurements based on the mechanical formation 
of lipid tubes from the cell surface give a value of -0.20 x 10~ 18 J [Hwang and Waugh, 1997]. 
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32.3 White Cells 

Whereas red cells account for approximately 40% of the blood volume, white cells occupy less than 1% 
of the blood volume. Yet because white cells are less deformable, they can have a significant influence on 
blood flow, especially in the microvasculature. Unlike red cells, which are very similar to each other, as 
are platelets, there are several different kinds of white cells or leukocytes. In general, the white cells are 
classified into groups according to their appearance when viewed with the light microscope. Thus, there 
are granulocytes, monocytes, and lymphocytes [Alberts et al., 1994]. The granulocytes with their many 
internal granules are separated into neutrophils, basophils, and eosinophils according to the way each cell 
stains. The neutrophil, also called a polymorphonuclear leukocyte because of its segmented or multilobed 
nucleus, is the most common white cell in the blood. (See Table 32.4). The lymphocytes, which constitute 
20-40% of the white cells and which are further subdivided into B lymphocytes and killer and helper 
T lymphocytes, are the smallest of the white cells. The other types of leukocytes are found with much less 
frequency. Most of the geometric and mechanical studies of white cells reported below have focused on 
the neutrophil because it is the most common cell in the circulation, although the lymphocyte is now 
starting to receive more attention. 

Size and Shape 

White cells at rest are spherical. The surfaces of white cells contain many folds, projections, and 
"microvilli" to provide the cells with sufficient membrane area to deform as they enter capillaries with 
diameters much smaller than the resting diameter of the cell. (Without the reservoir of membrane area 
in these folds, the constraints of constant volume and membrane area would make a spherical cell 
essentially undeformable.) The excess surface area of the neutrophil, when measured in a wet preparation, 
is slightly more than twice the apparent surface area of a smooth sphere with the same diameter [Evans 
& Yeung, 1989; Ting-Beall et al., 1993]. It is interesting to note that each type of white cell has its own 
unique surface topography, which allows one to readily determine if a cell is, for example, either a 
neutrophil or monocyte or lymphocyte [Hochmuth et al., 1995]. 

The cell volumes listed in Table 32.4 were obtained with the light microscope, either by measuring the 
diameter of the spherical cell or by aspirating the cell into a small glass pipette with a known diameter 
and then measuring the resulting length of the cylindrically shaped cell. Other values for cell volume 
obtained using transmission electron microscopy are somewhat smaller, probably because of cell shrink- 
age due to fixation and drying prior to measurement [Schmid-Schonbein et al., 1980; Ting-Beall et al., 
1995]. Although the absolute magnitude of the cell volume measured with the electron microscope may 
be erroneous, if it is assumed that all parts of the cell dehydrate equally when they are dried in preparation 
for viewing, then this approach can be used to determine the volume occupied by the nucleus (Table 32.4) 

TABLE 32.4 Size and Appearance of White Cells in the Circulation 





Occurrence,* 


Cell 


Cell 




Nucleus,' 


Cortical 




%ofWBCs 


Volume,* pm 3 


Diameter, 1 


fim 


% Cell Volume 


Tension, mN/m 


Granulocytes 














Neutrophils 


50-70 


300-310 


8.2-8.4 




21 


0.024-0.035* 


Basophils 


0-1 












Eosinophils 


1-3 








18 




Monocytes 


1-5 


400 


9.1 




26 


0.06! 


Lymphocytes 


20-40 


220 


7.5 




44 


0.0355 



* Diggsetal. [1985]. 

f Ting-Beall et al. [1993]. (Diameter calculated from the volume of a sphere). 

' Schmid-Schonbein et al. [1980], 

s Evans and Yeung [1989], Needham and Hochmuth [1992], Tsai et al. [1993, 1994]. 

' Preliminary data, Hochmuth, Zhelev, and Ting-Beall. 
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and other organelles of various white cells. The volume occupied by the granules in the neutrophil and 
eosinophil (recall that both are granulocytes) is 15 and 23%, respectively, whereas the granular volume 
in monocytes and lymphocytes is less than a few percent. 

Mechanical Behavior 

The early observations of Bagge and colleagues [1977] led them to suggest that the neutrophil behaves 
as a simple viscoelastic solid with a Maxwell element (an elastic and viscous element in series) in parallel 
with an elastic element. This elastic element in the model was thought to pull the unstressed cell into its 
spherical shape. Subsequently, Evans and Kukan [1984] and Evans and Yeung [1989] showed that the 
cells flow continuously into a pipette, with no apparent approach to a static limit, when a constant suction 
pressure was applied. Thus, the cytoplasm of the neutrophil should be treated as a liquid rather than a 
solid, and its surface has a persistent cortical tension that causes the cell to assume a spherical shape. 

Cortical Tension 

Using a micropipet and a small suction pressure to aspirate a hemispherical projection from a cell body 
into the pipette, Evans and Yeung measured a value for the cortical tension of 0.035 mN/m. Needham 
and Hochmuth [1992] measured for cortical tension of individual cells that were driven down a tapered 
pipette in a series of equilibrium positions. In many cases the cortical tension increased as the cell moved 
further into the pipette, which means that the cell has an apparent area expansion modulus [Eq. (32.7)]. 
They obtained an average value of 0.04 mN/m for the expansion modulus and an extrapolated value for 
the cortical tension (at zero area dilation) in the resting state of 0.024 mN/m. The importance of the 
actin cytoskeleton in maintaining cortical tension was demonstrated by Tsai et al. [1994]. Treatment of 
the cells with a drug that disrupts actin filament structure (CTB = cytochalasin B) resulted in a decrease 
in cortical tension from 0.027 mN/m to 0.022 mN/m at a CTB concentration of 3 u.M and to 0.014 mN/m 
at 30 uM. 

Preliminary measurements in one of the authors' laboratories indicate that the value for the cortical 
tension of a monocyte is about double that for a granulocyte, that is, 0.06 mN/m, and the value for a 
lymphocyte is about 0.035 mN/m. 

Bending Rigidity 

The existence of a cortical tension suggests that there is a cortex — a relatively thick layer of F-actin 
filaments and myosin — that is capable of exerting a finite tension at the surface. If such a layer exists, it 
would have a finite thickness and bending rigidity. Zhelev and colleagues [1994] aspirated the surface of 
neutrophils into pipettes with increasingly smaller diameters and determined that the surface had a 
bending modulus of about 1 to 2 x 10~ 18 J, which is 5 to 50 times the bending moduli for erythrocyte 
or lipid bilayer membranes. The thickness of the cortex should be smaller than the radius of smallest 
pipette used in this study, which was 0.24 Lim. 

Apparent Viscosity 

Using their model of the neutrophil as a newtonian liquid drop with a constant cortical tension and (as 
they showed) a negligible surface viscosity, Yeung and Evans [1989] analyzed the flow of neutrophils into 
a micropipette and obtained a value for the cytoplasmic viscosity of about 200 Pa- s. In their experiments, 
the aspiration pressures were on the order of 10 to 1000 Pa. Similar experiments by Needham and 
Hochmuth [1990] using the same newtonian model (with a negligible surface viscosity) but using higher 
aspiration pressures (ranging from 500 to 2000 Pa) gave an average value for the cytoplasmic viscosity 
of 135 Pa-s for 151 cells from five individuals. The apparent discrepancy between these two sets of 
experiments was resolved to a large extent by Tsai et al. [1993], who demonstrated that the neutrophil 
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viscosity decreases with increasing rate of deformation. They proposed a model of the cytosol as a power 
law fluid: 



T) = \ 



f ■ \ 

Y, 



(32.12) 



where b = 0.52, y m is denned by Eq. (32.5), and r\ c is a characteristic viscosity of 130 Pa-s when the 
characteristic mean shear rate, y c is 1 s _1 . These values are based on an approximate method for calculating 
the viscosity from measurements of the total time it takes for a cell to enter a micropipette. Because of 
different approximations used in the calculations, the values of viscosity reported by Tsai et al. [1993] 
tend to be somewhat smaller than those reported by Evans and coworkers or Hochmuth and coworkers. 
Nevertheless, the shear rate dependence of the viscosity is the same, regardless of the method of calcu- 
lation. Values for the viscosity are given in Table 32.5. 

In addition to the dependence of the viscosity on shear rate, there is also evidence that it depends on 
the extent of deformation. In micropipette experiments the initial rate at which the cell enters the pipette 
is significantly faster than predicted, even when the shear rate dependence of the viscosity is taken into 
account. In a separate approach, the cytosolic viscosity was estimated from observation of the time course 
of the cell's return to a spherical geometry after expulsion from a micropipette. When the cellular 
deformations were large, a viscosity of 150 Pa-s was estimated [Tran-Son-Tay et al., 1991], but when the 
deformation was small, the estimated viscosity was only 60 Pa-s [Hochmuth et al., 1993]. Thus.it appears 
that the viscosity is smaller when the magnitude of the deformation is small and increases as deformations 
become large. 

An alternative attempt to account for the initial rapid entry of the cell into micropipettes involved the 
application of a Maxwell fluid model with a constant cortical tension. Dong and coworkers [1988] used 
this model to analyze both the shape recovery of neutrophils following small, complete deformations in 
pipettes and the small-deformation aspiration of neutrophils into pipettes. However, in another study 
by Dong and coworkers [1991], they used a finite-element, numerical approach and a Maxwell model 
with constant cortical tension to describe the continuous, finite-deformation flow of a neutrophil into a 
pipette. But in order to fit the theory to the data for the increase in length of the cell in the pipette with 
time, Dong and colleagues [1991] had to steadily increase both the elastic and viscous coefficients in 
their finite-deformation Maxwell model. This shows that even a Maxwell model is not adequate for 
describing the rheological properties of the neutrophil. 

Although it is clear that the essential behavior of the cell is fluid, the simple fluid drop model with a 
constant and uniform viscosity does not match the observed time course of cell deformation in detail 
[Drury and Dembo, 1998]. Better agreement between theory and experiment is obtained if approximate 
account is taken for the shear rate dependence of the viscosity, but some discrepancies remain. A 



TABLE 32.5 Viscous Parameters of White Blood Cells 



Range of Viscosities (Pa-s)* Characteristic Shear Rate 



Cell Type 


Min 


Max 


Viscosity (Pa-s) 


Dependence (b) 


Neutrophil 


50 


500 


130f 


0.52f 


(30 uM CTB) 


41 


52 


54f 


0.26f 


Monocyte 


70 


1000 






LC60 (Gl) 






220* 


0.53$ 


HL60 (S) 






330$ 


0.56$ 



* Evans and Yeung [1989], Needham and Hochmuth [1992], Tsai et al. [1993, 1994]. 
t Tsaietal. [1993, 1994]. 
$ Tsai and Waugh [1996a]. 
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quantitative model that accounts for the dependence of viscosity on the magnitude of deformation has 
not yet been formulated, and a complete analysis in which the viscosity is made to vary in space and 
time according to the local shear rate and extent of deformation has not yet been performed. Nevertheless, 
the fluid drop model captures the essential behavior of the cell, and when it is applied consistently (that 
is, for similar rate and extent of deformation) it provides a sound basis for predicting cell behavior and 
comparing the behaviors of different types of cells. 

Although the mechanical properties of the neutrophil have been studied extensively as discussed above, 
the other white cells have not been studied in depth. Preliminary unpublished results from Robert 
Hochmuth's laboratory indicate that monocytes are somewhat more viscous (from roughly 30% to a 
factor of 2) than neutrophils under similar conditions in both recovery experiments and experiments in 
which the monocyte flows into a pipette. A lymphocyte, when aspirated into a small pipette so that its 
relatively large nucleus is deformed, behaves as an elastic body in that the projection length into the 
pipette increases linearly with the suction pressure. This elastic behavior appears to be due to the 
deformation of the nucleus, which has an apparent area elastic modulus of 2 mN/m. A lymphocyte 
recovers its shape somewhat more quickly than the neutrophil does, although this recovery process is 
driven both by the cortical tension and the elastic nucleus. These preliminary results are discussed by 
Tran-Son-Tay and coworkers [ 1994] . Finally, the properties of a human myeloid leukemic cell line (HL60) 
thought to resemble immature neutrophils of the bone marrow have also been characterized, as shown 
in Table 32.5. The apparent cytoplasmic viscosity varies both as a function of the cell cycle and during 
maturation toward a more neutrophil-like cell. The characteristic viscosity (y c = 1 s _1 ) is 220 Pa-s for 
HL60 cells in the Gl stage of the cell cycle. This value increases to 400 Pa-s for cells in the S phase, but 
decreases with maturation, so that 7 days after induction the properties approach those of neutrophils 
(130 Pa-s) [Tsai & Waugh, 1996b]. 

It is important to note in closing that the characteristics described above apply to passive leukocytes. 
It is the nature of these cells to respond to environmental stimulation and engage in active movements 
and shape transformations. White cell activation produces significant heterogeneous changes in cell 
properties. The cell projections that form as a result of stimulation (called pseudopodia) are extremely 
rigid, whereas other regions of the cell may retain the characteristics of a passive cell. In addition, the 
cell can produce protrusive or contractile forces ranging from 2.0-10.7 nN [Guilford, et al., 1995]. The 
changes in cellular mechanical properties that result from cellular activation are complex and only 
beginning to be formulated in terms of mechanical models. This is expected to be a major focus of 
research in the coming years. 

Summary 

Constitutive equations that capture the essential features of the responses of red blood cells and passive 
leukocytes have been formulated, and material parameters characterizing the cellular behavior have been 
measured. The red cell response is dominated by the cell membrane which can be described as a hyper- 
viscoelastic, two-dimensional continuum. The passive white cell behaves like a highly viscous fluid drop, 
and its response to external forces is dominated by the large viscosity of the cytosol. Refinements of these 
constitutive models and extension of mechanical analysis to activated white cells is anticipated as the 
ultrastructural events that occur during cellular deformation are delineated in increasing detail. 

Defining Terms 

Area expansivity modulus: A measure of the resistance of a membrane to area dilation. It is the 
proportionality between the isotropic force resultant in the membrane and the corresponding 
fractional change in membrane area. (Units: 1 mN/m = 1 dyn/cm) 

Cortical tension: Analogous to surface tension of a liquid drop, it is a persistent contractile force per 
unit length at the surface of a white blood cell. (Units: 1 mN/m = 1 dyn/cm) 

Cytoplasmic viscosity: A measure of the resistance of the cytosol to flow. (Units: 1 Pa-s = 10 poise) 
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Force resultant: The stress in a membrane integrated over the membrane thickness. It is the two- 
dimensional analog of stress with units of force/length. (Units: 1 mN/m = 1 dyn/cm) 

Maxwell fluid: A constitutive model in which the response of the material to applied stress includes 
both an elastic and viscous response in series. In response to a constant applied force, the material 
will respond elastically at first, then flow. At fixed deformation, the stresses in the material will 
relax to zero. 

Membrane-bending modulus: The intrinsic resistance of the membrane to changes in curvature. It is 
usually construed to exclude nonlocal contributions. It relates the moment resultants (force times 
length per unit length) in the membrane to the corresponding change in curvature (inverse length). 
(Units: 1 N-m = 1 joule = 10 7 erg) 

Membrane shear modulus: A measure of the elastic resistance of the membrane to surface shear 
deformation; that is, changes in the shape of the surface at constant surface area [Eq. (32.8)]. 
(Units: 1 mN/-m = 1 dyn/cm) 

Membrane viscosity: A measure of the resistance of the membrane to surface shear flow, that is, to 
the rate of surface shear deformation [Eq. (32.8)]. (Units: 1 mN-s/m = 1 m Pas m = 1 dyn • s/cm = 
1 surface poise) 

Nonlocal bending resistance: A resistance to bending resulting from the differential expansion and 
compression of the two adjacent leaflets of a lipid bilayer. It is termed nonlocal because the leaflets 
can move laterally relative to one another to relieve local strains, such that the net resistance to 
bending depends on the integral of the change in curvature of the entire membrane capsule. 

Power law fluid: A model to describe the dependence of the cytoplasmic viscosity on rate of deformation 
[Eq. (32.12)]. 

Principal extension ratios: The ratios of the deformed length and width of a rectangular material 
element (in principal coordinates) to the undeformed length and width. 

Sphericity: A dimensionless ratio of the cell volume (to the 2/3 power) to the cell area. Its value ranges 
from near zero to one, the maximum value corresponding to a perfect sphere [Eq. (32.6)]. 

White cell activation: The response of a leukocyte to external stimuli that involves reorganization and 
polymerization of the cellular structures and is typically accompanied by changes in cell shape and 
cell movement. 
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Further Information 

Basic information on the mechanical analysis of biomembrane deformation can be found in Evans and 
Skalak [1979], which also appeared as a book under the same title (CRC Press, Boca Raton, Fla., 1980). 
A more recent work that focuses more closely on the structural basis of the membrane properties is Berk 
and colleagues, Chapter 15, pp. 423-454, in the book Red Blood Cell Membranes: Structure, Function, 
Clinical Implications edited by Peter Agre and John Parker (Marcel Dekker, New York, 1989). More detail 
about the membrane structure can be found in other chapters of that book. 

Basic information about white blood cell biology can be found in the book by Alberts and coworkers 
[1994]. A more thorough review of white blood cell structure and response to stimulus can be found in 
two reviews by T P. Stossel, one entitled, "The mechanics response of white blood cells," in the book 
Inflammation: Basic Principles and Clinical Correlates, edited by J. I. Galin and colleagues (Raven Press, 
New York, 1988), pp. 325-342, and the second entitled, "The molecular basis of white blood cell motility," 
in the book The Molecular Basis of Blood Diseases, edited by G. Stamatoyannopoulos and colleagues (W. B. 
Saunders, Philadelphia, 1994), pp. 541-562. Recent advances in white cell rheology can be found in the 
book, Cell Mechanics and Cellular Engineering, edited by Van C. Mow and coworkers (Springer Verlag, 
New York, 1994). 
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The venous system not only serves as a conduit for the return of blood from the capillaries to the heart 
but also provides a dynamic, variable blood storage compartment that influences cardiac output. The 
systemic (noncardiopulmonary) venous system contains more than 75% of the blood volume of the 
entire systemic circulation. Although the heart is the source of energy for propelling blood throughout 
the circulation, filling of the right heart before the subsequent beat is primarily passive. The subsequent 
amount of blood ejected is exquisitely sensitive to the transmural filling pressure. (For example, a change 
of right heart filling pressure of 1 cm water can cause the cardiac output to change by about 50%). 

Because the blood vessels are elastic and have smooth muscle in their walls, contraction or relaxation 
of the smooth muscle can quickly redistribute blood between the periphery and the heart to influence 
cardiac filling and thus cardiac output. Even though the right ventricle is not essential for life, its 
functioning acts to reduce the central venous pressure to facilitate venous return [ 1 ] . It largely determines 
the magnitude of the cardiac output by influencing the degree of filling of the left heart. Dynamic changes 
in venous tone, by redistributing blood volume, can thus, at rest, change cardiac output over a range of 
more than ±20%. The dimensions of the vasculature influence both blood flow — by way of their resistive 
properties — and contained blood volume — by way of their capacitive properties. The arteries have about 
10 times the resistance of the veins, and the veins are more than 10 times as compliant as the arteries. 

The conduit characteristics of the venous system primarily depend on the anatomy of the system. 
Valves in the veins of the limbs are crucial for reducing the pressure in dependent parts of the body. Even 
small movements from skeletal muscle activity tend to compress the veins and move blood toward the 
heart. A competent valve then blocks back flow, thus relieving the pressure when the movement stops. 
Even a few steps can reduce the transmural venous pressure in the ankle from as much as 100 mmHg 
to about 20 mmHg. Without this mechanism, transcapillary movement of fluid into the extravascular 
spaces results in edema. Varicose (swollen) veins and peripheral pooling of blood can result from damage 
to the venous valves. During exercise, the rhythmic contraction of the skeletal muscles, in conjunction 
with venous valves, provides an important mechanism — the skeletal muscle pump — aiding the large 
increases in blood flow through the muscles without excessive increases in capillary pressure and blood 
pooling in the veins of the muscles. Without this mechanism, the increase in venous return leading to 
the dramatic increases in cardiac output would be greatly limited. 
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33.1 Definitions 
Capacitance 

Capacitance is a general term that relates the magnitude of contained volume to the transmural pressure 
across the vessel walls and is defined by the pressure-volume relationship. In living blood vessels, the 
pressure-volume relationship is complex and nonlinear. At transmural pressure near zero, there is a finite 
volume within the vessels (see definition of unstressed volume). If this volume is then removed from the 
vessels, there is only a small decrease in transmural pressure as the vessel collapses from a circular cross- 
section to an elliptical one. This is especially true for superficial or isolated venous vessels. However, for 
vessels which are tethered or embedded in tissue a negative pressure may result without appreciably 
changing the shape of the vessels. With increases in contained volume, the vessel becomes distended, and 
there is a concomitant increase in transmural pressure. The incremental change in volume to incremental 
change in transmural pressure is often relatively constant. At very high transmural pressures vessels 
become stiffer, and the incremental volume change to transmural pressure change is small. Because all 
blood vessels exhibit these nonlinearities, no single parameter can describe capacitance; instead the entire 
pressure-volume relationship must be considered. 

Compliance 

Vascular compliance (C) is defined as the slope of the pressure-volume relationship. It is the ratio of the 
change in incremental volume (AV) to a change in incremental transmural pressure (AP). Thus C = 
A WAP. Because the pressure-volume relationship is nonlinear, the slope of the relationship is not constant 
over its full range of pressures, and so the compliance should be specified at a given pressure. Units of 
compliance are those of volume divided by pressure, usually reported in ml/mmHg. Values are typically 
normalized to wet tissue weight or to total body weight. When the compliance is normalized by the total 
contained blood volume, it is termed the vascular distensibility and represents the fractional change in 
volume (AV7V) per change in transmural pressure; D = (AV7V) AP, where Vis the volume at control or 
at zero transmural pressure. 

Unstressed Volume 

Unstressed volume ( V ) is the volume in the vascular system when the transmural pressure is zero. It is 
a calculated volume obtained by extrapolating the relatively linear segment of the pressure-volume 
relationship over the normal operating range to zero transmural pressure. Many studies have shown that 
reflexes and drugs have quantitatively more influence on V than on the compliance. 

Stressed Volume 

The stressed volume ( V) is the volume of blood in the vascular system that must be removed to change 
the computed transmural pressure from its prevailing value to zero transmural pressure. It is computed 
as the product of the vascular compliance and transmural distending pressure: V s = C x P The total 
contained blood volume at a specific pressure (P) is the sum of stressed and unstressed volume. The 
unstressed volume is then computed as the total blood volume minus the stressed volume. Because of 
the marked nonlinearity around zero transmural pressure and the required extrapolation, both V and 
V s are virtual volumes. 

Capacity 

Capacity refers to the amount of blood volume contained in the blood vessels at a specific distending 
pressure. It is the sum of the unstressed volume and the stressed volume, V = V + V s . 
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Mean Filling Pressure 

If the inflow and outflow of an organ are suddenly stopped, and blood volume is redistributed so that 
all pressures within the vasculature are the same, this pressure is the mean filling pressure [5] . This pressure 
can be measured for the systemic or pulmonary circuits or the body as a whole. The arterial pressure 
often does not equal the venous pressure as flow is reduced to zero, because blood must move from the 
distended arterial vessels to the venous beds during the measurement maneuver, and flow may stop before 
equilibrium occurs. This is because smooth-muscle activity in the arterial vessels, rheological properties 
of blood, or high interstitial pressures act to impede the flow. Thus corrections must often be made [4,5]. 
The experimentally measured mean filling pressure provides a good estimate of P v , (the pressure in the 
minute venules), for estimating venous stressed volume. 

Venous Resistance 

Venous Resistance (R) refers to the hindrance to blood flow through the venous vasculature caused by 
friction of the moving blood along the venous vascular wall. By definition it is the ratio of the pressure 
gradient between the entrance of the venous circulation, namely the capillaries, and the venous outflow 
divided by the venous flow rate. Thus 

R=^ ^ (33.1) 

F 

where _R is the venous resistance, P c is the capillary pressure, P m is the right atrial pressure, and F is the 
venous flow. As flow is decreased to zero, arterial closure may occur, leading to a positive perfusion 
pressure at zero flow. With partial collapse of veins, a Starling resistorlike condition is present in which 
an increase in outlet pressure has no influence on flow until the outlet pressure is greater than the 
"waterfall" pressure. 

Venous Inertance 

Venous inertance (I v ) is the opposition to a change in flow rate related to the mass of the bolus of blood 
that is accelerated or decelerated. The inertance I v for a cylindrical tube with constant cross-sectional 
area is I v = Lp/A, where L is the length of the vessel, p is the density of the blood, and A is the cross- 
sectional area [9]. 

33.2 Methods to Measure Venous Characteristics 

Our knowledge of the nature and role of the capacitance characteristics of the venous system has been 
limited by the difficulty of measuring the various variables needed to compute parameter values. State- 
of-the-art equipment is often needed because of the low pressures and many disturbing factors present. 
Many of the techniques that have been used to measure venous capacitance require numerous assump- 
tions that may not be correct or are currently impossible to evaluate [4]. 

Resistance 

For the estimate of vascular resistance, the upstream to outflow pressure gradient across the tissues must 
be estimated along with a measure of flow. Pressures in large vessels are measured with a catheter 
connected to a pressure transducer, which typically involves measurement of minute changes in resistance 
elements attached to a stiff diaphragm which flexes proportionally to the pressure. For the veins in tissue, 
the upstream pressure, just downstream from the capillaries, is much more difficult to measure because 
of the minute size (ca 15 |im) of the vessels. For this a servo-null micropipette technique may be used. 
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A glass micropipette with a tip diameter of about 2 urn is filled with a 1-2 mol saline solution. When 
the pipette is inserted into a vein, the pressure tends to drive the lower conductance blood plasma into 
the pipette. The conductance is measured using an AC-driven bridge. A servosystem, driven by the 
imbalance signal, is used to develop a counter pressure to maintain the interface between the low- 
conductance filling solution and the plasma near the tip of the pipette. This counter pressure, which 
equals the intravascular pressure, is measured with a pressure transducer. Careful calibration is essential. 
Another approach for estimating the upstream pressure in the veins is to measure the mean filling 
pressure of the organ (see above) and assume that this pressure is the upstream venous pressure. Because 
this venous pressure must be less than the capillary pressure and because most of the blood in an organ 
is in the small veins and venules, this assumption, though tenuous, is not unreasonable. To measure flow 
many approaches are available including electromagnetic, transit-time ultrasonic, or Doppler ultrasonic 
flowmeters. Usually the arterial inflow is measured with the assumption that the outflow is the same. 
Indicator dilution techniques are also used to estimate average flow. They are based on the principle that 
the reduction in concentration of infused indicator is inversely proportional to the rate of flow. Either a 
bolus injection or a continuous infusion may be used. Adequacy of mixing of indicator across the flow 
stream, lack of collateral flows, and adequately representative sampling must be considered [2]. 

Capacitance 

For estimating the capacitance parameters of the veins, contained volume, rather than flow, and trans- 
mural pressure, rather than the longitudinal pressure gradient, must be measured. Pressures are measured 
as described above. For the desired pressure-volume relationship the total contained volume must be 
known. 

Techniques used to measure total blood volume include indicator dilution. The ratio of the integral of 
indicator concentration time to that of concentration is used to compute the mean transit time (MIT) 
following the sudden injection of a bolus of indicator [2,4]. The active volume is the product of MTT 
and flow, with flow measured as outlined above. Scintigraphy provides an image of the distribution of 
radioactivity in tissues. A radioisotope, such as technicium 99 that is bound to red blood cells which in 
turn are contained within the vasculature, is injected and allowed to equilibrate. A camera, with many 
collimating channels sensitive to the emitted radiation, is placed over the tissue. The activity recorded is 
proportional to the volume of blood. Currently it is not possible to accurately calibrate the systems to 
provide measures of blood volume because of uncertain attenuation of radiation by the tissue and 
distance. Furthermore, delimiting a particular organ within the body and separating arterial and venous 
segments of the circulation are difficult. 

Compliance 

To estimate compliance, changes in volume are needed. This is generally easier than measuring the total 
blood volume. Using plethysmography, a rigid container is placed around the organ, and a servo system 
functions to change the fluid volume in the chamber to maintain the chamber pressure-constant. The 
consequent volume change is measured and assumed to be primarily venous, because most of the vascular 
volume is venous. With a tight system and careful technique, at the end of the experiment both inflow 
and outflow blood vessels can be occluded and then the contained blood washed out and measured to 
provide a measure of the total blood volume [12]. 

Gravimetric Techniques 

Gravimetric techniques can be used to measure changes in blood volume. If the organ can be isolated and 
weighed continuously with the blood vessels intact, changes in volume can be measured in response to 
drugs or reflexes. With an important modification, this approach can be applied to an organ or the 
systemic circulation; the tissues are perfused at a constant rate, and the outflow is emptied at a constant 
pressure into a reservoir. Because the reservoir is emptied at a constant rate for the perfusion, changes 
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in reservoir volume reflect an opposite change in the perfused tissue blood volume [8). To measure 
compliance, the outflow pressure is changed (2-5 mmHg) and the corresponding change in reservoir 
volume noted. With the inflow and outflow pressure held constant, the pressure gradients are assumed 
to be constant so that 100% of an outflow pressure change can be assumed to be transmitted to the 
primary capacitance vessels. Any reflex or drug-induced change in reservoir volume may be assumed to 
be inversely related to an active change in vascular volume [7,8,10]. If resistances are also changed by 
the reflex or drug, then corrections are needed and the interpretations are more complex. 

Outflow Occlusion 

If the outflow downstream from the venous catheter is suddenly occluded, the venous pressure increases, 
and its rate of increase is measured. The rate of inflow is also measured so that the compliance can be 
estimated as the ratio flow to rate of pressure rise: Compliance in ml/mmHg = (flow in ml/min)/(rate 
of venous pressure rise in mmHg/min). The method is predicated on the assumption that the inflow 
continues at a constant rate and that there is no pressure gradient between the pressure measuring point 
and the site of compliance for the first few seconds of occlusion when the rate of pressure rise is measured. 

Integral of Inflow Minus Outflow 

With this technique both inflow and outflow are measured and the difference integrated to provide the 
volume change during an experimental forcing. If there is a decrease in contained volume, the outflow 
will be transiently greater than the inflow. The volume change gives a measure of the response to drugs 
or reflexes. Following a change in venous pressure, the technique can be used to measure compliance. 
Accurate measures of flow are needed. Serious errors can result if the inflow is not measured but is only 
assumed to be constant during the experimental protocol. With all methods dependent on measured or 
controlled flow, small changes in zero offset, which is directly or indirectly integrated, leads to serious 
error after about 10 minutes, and so the methods are not useful for long-term or slow responses. 

33.3 Typical Values 

Cardiac output, the sine qua non of the cardiovascular system, averages about 100 ml/(min-kg). It is 
about 90 in humans, is over 110 ml/(min-kg) in dogs and cats, and is even higher on a body weight basis 
in small animals such as rats and mice. The mean arterial blood pressure in relaxed, resting, conscious 
mammals averages about 90 mmHg. The mean circulatory filling pressure averages about 7 mmHg, and 
the central venous pressure just outside the right heart about 2 mmHg. The blood volume of the body 
is about 75 ml/kg, but in humans it is about 10% less, and it is larger in small animals. It is difficult to 
measure accurately because the volume of distribution of the plasma is about 10% higher than that of 
the red blood cells. 

Vascular compliance averages about 2 ml (mmHg-kg body weight). The majority is in the venules and 
veins. Arterial compliance is only about 0.05 ml/mmHg-kg). Skeletal muscle compliance is less than that 
of the body as a whole, whereas the vascular compliance of the liver is about 10 times that of other 
organs. The stressed volume is the product of compliance and mean filling pressure and so is about 
15 ml/kg. By difference, the unstressed volume is about 60 ml/kg. 

As flow is increased through a tissue, the contained volume increases even if the outflow pressure is 
held constant, because there is a finite pressure drop across the veins which is increased as flow increases. 
This increase in upstream distending pressure acts to increase the contained blood volume. The volume 
sensitivity to flow averages about 0.1 ml per 1 ml/min change in flow [6]. For the body as a whole, the 
sensitivity is about 0.25 ml per 1 ml/min with reflexes blocked, and with reflexes intact it averages about 
0.4 ml/min 3 . Using similar techniques, it appears that the passive compensatory volume redistribution 
from the peripheral toward the heart during serious left heart failure is similar in magnitude to a reflex- 
engendered redistribution from activation of venous smooth muscle [6]. 
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The high-pressure carotid sinus baroreceptor reflex system is capable of changing the venous capaci- 
tance [10]. Over the full operating range of the reflex it is capable of mobilizing up to 7.5 ml/kg of blood 
by primarily changing the unstressed vascular volume with little or no changes in venous compliance 
[7,8]. Although this represents only a 10% change in blood volume, it can cause nearly a 100% change 
in cardiac output. It is difficult to say with confidence what particular organ and/or tissue is contributing 
to this blood volume mobilization. Current evidence suggests that the splanchnic vascular bed contributes 
significantly to the capacitance change, but this also may vary between species [11]. 
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Transport of fluid and metabolites from blood to tissue is critically important for maintaining the viability 
and function of cells within the body. Similarly, transport of fluid and waste products from tissue to the 
lymphatic system of vessels and nodes is also crucial to maintain tissue and organ health. Therefore, it 
is important to understand the mechanisms for transporting fluid containing micro- and macromolecules 
from blood to tissue and the drainage of this fluid into the lymphatic system. Because of the succinct 
nature of this chapter, readers are encouraged to consult more complete reviews of blood, tissue, and 
lymphatic transport by Aukland and Reed [1993], Bert and Pearce [1984], Casley-Smith [1982], Curry 
[1984], Hargens [1986], Jain [1987], Lai-Fook [1986], Levick [1984], Reddy [1986], Schmid-Schonbein 
[1990], Schmid-Schonbein and Zweifach [1994], Staub [1988], Staub, Hogg, and Hargens [1987], Taylor 
and Granger [1984], Zweifach and Lipowsky [1984], and Zweifach and Silverberg [1985]. 

Most previous studies of blood, tissue, and lymphatic transport have used isolated organs or whole 
animals under general anesthesia. Under these conditions, transport of fluid and metabolites is artificially 
low in comparison to animals which are actively moving. In some cases, investigators employed passive 
motion by connecting an animal's limb to a motor in order to facilitate studies of blood to lymph transport 
and lymphatic flow. However, new methods and technology allow studies of physiologically active animals 
so that a better understanding of the importance of transport phenomena in moving tissues is now 
apparent. Therefore, the major focus of this chapter emphasizes recent developments in the understanding 
of the mechanics of tissue and lymphatic transport. 

The majority of the fluid that is filtered from the microcirculation into the interstitial space is carried 
out of the tissue via the lymphatic network. This unidirectional transport system originates with a set of 
blind channels in distal regions of the microcirculation. It carries a variety of interstitial molecules, 
proteins, metabolites, and even cells along channels deeply embedded in the tissue parenchyma towards 
a set of sequential lymph nodes and eventually back into the venous system via the right and left thoracic 
ducts. The lymphatics are the pathways for immune surveillance by the lymphocytes and thus are one 
of the important highways of the immune system. 
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34.2 Basic Concepts of Tissue and Lymphatic Transport 
Transcapillary Filtration 

Because lymph is formed from fluid filtered from the blood, an understanding of transcapillary exchange 
must be considered first. Usually pressure parameters favor filtration of fluid across the capillary wall to 
the interstitium (J c ) according to the Starling- Landis equation: 



J =L A 



te-- P J-°,(*c-*J 



(34.1) 



where: J c = net transcapillary fluid transport 

L = hydraulic conductivity of capillary wall 

A = capillary surface area 

P c = capillary blood pressure 

P t = interstitial fluid pressure 

a = reflection coefficient for protein 

ti c = capillary blood colloid osmotic pressure 

7t t = interstitial fluid colloid osmotic pressure 

In many tissues, fluid transported out of the capillaries is passively drained by the initial lymphatic 
vessels so that: 

Je^! (34.2) 

where: J[ = lymph flow 

and pressure within the initial lymphatic vessels Pj depends on higher interstitial fluid pressure P t for 
establishing lymph flow: 

P t >P, (34.3) 

Starling Pressures and Edema Prevention 

Hydrostatic and colloid osmotic pressures within the blood and interstitial fluid primarily govern 
transcapillary fluid shifts (Fig. 34.1). Although input arterial pressure averages about 100 mm Hg at heart 
level, capillary blood pressure P c is significantly reduced due to resistance R, according to the Poiseuille 
equation (Eq. 34.4): 

R = ^ (34.4) 

%r 

where: r| = blood viscosity 

1 = vessel length between feed artery and capillary 
r = radius 

Therefore, normally at heart level, P c is approximately 30 mm Hg. However, during upright posture, P c 
at foot level is about 90 mm Hg and only about 25 mm Hg at head level [Parazynski et al., 1991]. 
Differences in P c between capillaries of the head and feet are due to gravitational components of blood 
pressure according to pgh. For this reason, volumes of transcapillary filtration and lymph flows are 
generally higher in tissues of the lower body as compared to those of the upper body. Moreover, one 
might expect much more sparse distribution of lymphatic vessels in upper body tissues. In fact, tissues 
of the lower body of humans and other tall animals have efficient skeletal muscle pumps, prominent 

© 2000 by CRC Press LLC 




Venous 



/*"^\ 



::i 



\ 



y 




TO 

THORACIC 
DUCT AND 
VENA CAVA 



INITIAL 

LYMPHATIC 

VESSEL 



FIGURE 34.1 Starling pressures which regulate transcapillary fluid balance. Pressure parameters which determine 
direction and magnitude of transcapillary exchange include capillary blood pressure P c , interstitial fluid pressure P, 
(directed into capillary when positive or directed into tissue when negative), plasma colloidal osmotic pressure n c , 
and interstitial fluid colloidal osmotic pressure n t . Precapillary sphincters (PS) regulate P c , capillary flow, and capillary 
surface area A. It is generally agreed that a hydrostatic pressure gradient (P t > lymph pressure P,) drains off excess 
interstitial fluid under conditions of net filtration. Relative magnitudes of pressures are depicted by the size of arrows. 
Source: Hargens AR. 1986. Interstitial fluid pressure and lymph flow. In: R Skalak, S Chien (eds), Handbook of 
Bioengineering, vol 19, pp 1-35, New York, McGraw-Hill. With permission. 

lymphatic systems, and noncompliant skin and fascial boundaries to prevent dependent edema [Hargens 
etal., 1987]. 

Other pressure parameters in the Starling-Landis Eq. (34.1) such as P t , n c , and ti, are not as sensitive 
to changes in body posture as is P c . Typical values for P t range from -2 mm Hg to 10 mm Hg depending 
on the tissue or organ under investigation [Wiig, 1990]. However, during movement, P t in skeletal muscle 
increases to 150 mm Hg or higher [Murthy etal., 1994], providing a mechanism to promote lymphatic 
flow and venous return via the skeletal pump (Fig. 34.2). Blood colloid osmotic pressure n c usually ranges 
between 25 and 35 mm Hg and is the other major force for retaining plasma within the vascular system 
and preventing edema. Interstitial n t depends on the reflection coefficient of the capillary wall (ap ranges 
from 0.5 to 0.9 for different tissues) as well as washout of interstitial proteins during high filtration rates 
[Aukland and Reed, 1993]. Typically n l ranges between 8 and 15 mm Hg with higher values in upper body 
tissues compared to those in the lower body [Aukland and Reed, 1993; Parazynski etal., 1991]. Precapillary 
sphincter activity (Fig. 34.1) also decreases blood flow, decreases capillary filtration area A, and reduces 
P c in dependent tissues of the body to help prevent edema during upright posture [Aratow et al., 1991]. 

Interstitial Flow and Lymph Formation 

As stated in the Introduction, many previous investigators were convinced that interstitial flow of proteins 
was limited by simple diffusion according to Fick's equation (Eq. 34.5): 



J =-D 3c % 



(34.5) 



where: J 



5cp/9x 



: one dimensional protein flux 
■ diffusion coefficient 
concentration gradient of protein through interstitium 
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FIGURE 34.2 Simultaneous intramuscular pressure oscillations in the soleus (top panel) and the tibialis anterior 
(bottom panel) muscles during plantar- and dorsiflexion exercise. Soleus muscle is an integral part of the calf muscle 
pump. Source: Murthy G, DE Watenpaugh, RE Ballard, et al., 1994. Supine exercise during lower body negative 
pressure effectively simulates upright exercise in normal gravity. /. Appl. Physiol. 76:2742. Modified with permission. 

However, recent experimental and theoretical understandings of the dependence of volume and solute 
flows on hydrostatic and osmotic pressures [Hammel, 1994; Hargens and Akeson, 1986] strongly suggest 
that convective flow plays an important role in interstitial flow and tissue nutrition. For example, in the 
presence of osmotic or hydrostatic pressure gradients, protein transport J is coupled to fluid transport 
according to Eq. (34.6): 



:c P J v 



(34.6) 



where: c = average protein concentration 
J v = volume flow of fluid 

Most investigators note that lymph formation and flow greatly depend upon tissue movement or 
activity related to muscle contraction. It is also generally agreed that formation of initial lymph depends 
solely on the composition of nearby interstitial fluid and pressure gradients across the interstitial- 
lymphatic boundary [Hargens, 1986; Zweifach and Lipowsky, 1984]. For this reason, lymph formation 
and flow can be quantified by measuring disappearance of isotope-labeled albumin from subcutis or 
skeletal muscle [Reed et al., 1985]. 

Lymphatic Architecture 

To understand lymph transport in engineering terms it is paramount that we develop a detailed picture 
of the lymphatic network topology and vessel morphology. This task is facilitated by a number of 
morphological and ultrastructural studies from past decades which give a general picture of the mor- 
phology and location of lymphatic vessels in different tissues. Lymphatics have been studied by injections 
of macroscopic and microscopic contrast media and by light and electron microscopic sections. The 
display of the lymphatics is organ specific, there are many variations in lymphatic architecture [Schmid- 
Schonbein, 1990]. We will focus this discussion predominantly on skeletal muscle, the intestines, and 
skin. But, the mechanisms may also in part, be relevant with respect to other organs. 
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FIGURE 34.3 Tracing of a typical lymphatic channel (bottom panel) in rat spinotrapezius muscle after injection 
with a micropipette of a carbon contrast suspension. All lymphatics are of the initial type and are closely associated 
with the arcade arterioles. Few lymphatics follow the path of the arcade venules, or their side branches, the collecting 
venules or the transverse arterioles. [Skalak et al., 1986.] 

In skeletal muscle, lymphatics are positioned in immediate proximity of the arterioles [Skalak et al., 
1984]. The majority of feeder arteries in skeletal muscle and most, but not all of the arcade arterioles, 
are closely accompanied by a lymphatic vessel (Fig. 34.3). Lymphatics can be traced along the entire 
length of the arcade arterioles, but they can be traced only over relatively short distances (less than 50 um) 
into the side branches of the arcades, the transverse (terminal) arterioles which supply the blood into 
the capillary network. Systematic reconstructions of the lymphatics in skeletal muscle have yielded little 
evidence for lymphatic channels that enter into the capillary network per se [Skalak et al., 1984]. Thus, 
the network density of lymphatics is quite low compared to the high density of the capillary network in 
the same tissue, a characteristic feature of lymphatics in most organs [Skalak et al., 1986]. The close 
association between lymphatics and the vasculature is also present in the skin [Ikomi and Schmid- 
Schonbein, 1995], and in other organs and may extend into the central vasculature. 

Lymphatic Morphology 

Histological sections of the lymphatics permit the classification into distinctly different subsets, initial 
lymphatics and collecting lymphatics. The initial lymphatics (denoted also as terminal or capillary lym- 
phatics) form a set of blind endings in the tissue, which feed into the collecting lymphatics, and which 
in turn are the conduits into the lymph nodes. While both initial and collecting lymphatics are lined by 
a highly attenuated endothelium, only the collecting lymphatics have a smooth muscle in their media. 
In accordance, collecting lymphatics exhibit spontaneous narrowing of their lumen, while there is no 
evidence for contractility in the initial lymphatics. Contractile lymphatics are capable of peristaltic smooth 
muscle contractions, which in conjunction with periodic opening and closing of the intraluminal valves 
permits unidirectional fluid transport. The lymphatic smooth muscle has an adrenergic innervation 
[Ohhashi et al., 1982], it exhibits myogenic contraction [Hargens and Zweifach, 1977; Mizuno et al., 
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1997], and reacts to a variety of vasoactive stimuli [Ohhashi et al., 1978; Benoit, 1997], including signals 
which involve nitric oxide [Ohhashi and Takahashi, 1991; Bohlen and Lash, 1992; Yokoyama and Ohhashi, 
1993]. None of these contractile features have been documented in the initial lymphatics. 

The lymphatic endothelium has a number of similarities with the vascular endothelium. It forms a 
continuous lining. There are numerous caveolae, Weibel Palade bodies, but lymphatic endothelium has 
fewer interendothelial adhesion complexes and there is a discontinuous basement membrane. The resi- 
dues of the basement membrane are attached to interstitial collagen via anchoring filaments [Leak and 
Burke, 1968] which provide relatively firm attachment of the endothelium to the interstitial structures. 

Lymphatic Network Display 

One of the interesting aspects regarding lymphatic transport in skeletal muscle is the fact that all 
lymphatics inside the muscle parenchyma are of the non-contractile initial type [Skalak et al., 1984]. 
Collecting lymphatics can only be observed outside the muscle fibers as a conduit to adjacent lymph 
nodes. The fact that all lymphatics inside the tissue parenchyma are of the initial type is not unique to 
skeletal muscle but has been demonstrated in other organs [Unthank and Bohlen, 1988; Yamanaka et al., 
1995]. The initial lymphatics are positioned in the adventitia of the arcade arterioles surrounded by 
collagen fibers (Fig. 34.4). In this position they are in immediate proximity to the arteriolar smooth 
muscle, and adjacent to myelinated nerve fibers and a set of mast cells that accompany the arterioles. 
The initial lymphatics are frequently sandwiched between arteriolar smooth muscles and their paired 
venules, and they in turn are embedded between the skeletal muscle fibers [Skalak et al., 1984] . The initial 
lymphatics are firmly attached to the adjacent basement membrane and collagen fibers via anchoring 
filaments [Leak and Burke, 1968]. The basement membrane of the lymphatic endothelium is discontin- 
uous, especially at the interendothelial junctions, so that macromolecular material and even large colloidal 
particles enter the initial lymphatics [Casley- Smith, 1962; Bach and Lewis, 1973; Strand and Persson, 
1979; Bollinger et al., 1981; Ikomi et al, 1996]. 

The lumen cross-section of the initial lymphatics is highly irregular in contrast to the overall circular 
cross-section of the collecting lymphatics (Fig. 34.4). The lumen cross-section of the initial lymphatics is 
partially or completely collapsed and may frequently span around the arcade arteriole. In fact, we have 
documented cases in which the arcade arteriole is completely surrounded by an initial lymphatic channel, 
highlighting the fact that the activity of the lymphatics is closely linked to that of the arterioles [Ikomi 
and Schmid-Schonbein, 1995] . Initial lymphatics in skeletal muscle have intra-luminal valves which consist 
of bileaflets and a funnel structure [Mazzoni et al., 1987] . The leaflets are flexible structures and are opened 
and closed by a viscous pressure drop along the valve funnel. In a closed position the leaflets are able to 
support considerable pressures [Eisenhoffer etal., 1995; Ikomi etal., 1997]. This arrangement serves to 
preserve normal valve function even in initial lymphatics with irregularly shaped lumen cross-sections. 

The lymphatic endothelial cells are attenuated and have many of the morphological characteristics of 
vascular endothelium, including expression of P-selectin, von Willebrand factor [Di Nucci et al., 1996], 
and factor VIII [Schmid-Schonbein, 1990]. An important difference between vascular and lymphatic 
endothelium lies in the arrangement of the endothelial junctions. In the initial lymphatics, the endothelial 
cells lack tight junctions [Schneeberger and Lynch, 1984] and are frequently encountered in an overlap- 
ping but open position, so that proteins, colloid particles, and even chylomicron particles can readily 
pass through the junctions [Casley- Smith, 1962; Casley- Smith, 1964; Leak, 1970]. Examination of the 
junctions with scanning electron microscopy shows that there exists a periodic inter digitating arrange- 
ment of endothelial extensions. Individual extensions are attached via anchoring filaments to the under- 
lying basement membrane and connective tissue, but the two extensions of adjacent endothelial cell 
resting on top of each other are not attached by interendothelial adhesion complexes. Mild mechanical 
stretching of the initial lymphatics shows that the endothelial extensions can be separated in part from 
each other, indicating that the membranes of two neighboring lymphatic endothelial cells are not attached 
to each other, but are firmly attached to the underlying basement membrane [Castenholz, 1984]. Lym- 
phatic endothelium does not exhibit continuous junctional complexes, and instead has a 'streak and dot' 
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FIGURE 34.4 Histological cross-sections of lymphatics (LYM) in rat skeletal muscle before (pg 34-7) and after 
(pg 34-8) contraction of the paired arcade arterioles (ART). The lymphatic channel is of the initial type with a single 
attenuated endothelial layer (curved arrows) . Note, that in the dilated arteriole, the lymphatic is essentially compressed 
(pg 34-7) while the lymphatic is expanded after arteriolar contraction (pg 34-8) which is noticeable by the folded 
endothelial cells in the arteriolar lumen. In both cases the lumen cross-sectional shape of the initial lymphatic channels 
is highly irregular. All lymphatics within skeletal muscle (SKM) have these characteristic features. [Skalaket al„ 1984.] 

immunostaining pattern of VE-cadherin and associated intracellular proteins — desmoplakin and plako- 
globulin [Schmelz et al., 1994]. But the staining pattern is not uniform for all lymphatics, and in larger 
lymphatics a more continuous pattern is present. This highly specialized arrangement has been referred 
to in the following as the lymphatic endothelial microvalves [Schmid-Schonbein, 1990]. 

Mechanics of Lymphatic Valves 

In contrast to the central large valves in the heart, which are closed by inertial fluid forces, the lymphatic 
valves are small and the fluid Reynolds number is almost zero. Thus, since no inertial forces are available 
to open and close these valves, a different valve morphology has evolved. They form long funnel shaped 
channels which are inserted into the lymph conduits and attached at their base and the funnel is prevented 
from inversion by attachment via a buttress to the lymphatic wall. The valve wall structure consists of a 



© 2000 by CRC Press LLC 




FIGURE 34.4 (continued) 

collagen layer sandwiched between two endothelial layers, and the entire structure is quite deformable 
under mild physiological fluid pressures. The funnel structure serves to create a viscous pressure gradient 
which is sufficient to generate, during forward fluid motion, a pressure drop to open the values and upon 
flow reversal to close the valves [Mazzoni et al., 1987]. 

Lymph Pump Mechanisms 

One of the important questions which is fundamental in lymphology is: How do fluid particles in the 
interstitium find their way into the initial lymphatics? In light of the relative sparse display of the initial 
lymphatics, a directed convective transport is required — provided by either a hydrostatic or a colloid 
osmotic pressure drop [Zweifach and Silberberg, 1979]. The documentation of the exact mechanism has 
remained an elusive target. Several proposals have been advanced (which are discussed in detail in Schmid- 
Schonbein, 1990). Briefly, a number of authors have postulated that there exists a constant pressure drop 
from the interstitium into the initial lymph which may support a steady fluid motion into the lymphatics. 
But, repeated measurements with different techniques have uniformly failed to provide supporting 
evidence for a steady pressure drop to transport fluid into the initial lymphatics [Zweifach and Prather, 
1975; Clough and Smaje, 1978]. Under steady-state conditions, a steady pressure drop does not exist in 



© 2000 by CRC Press LLC 



the vicinity of the initial lymphatics in skeletal muscle within the resolution of the measurement (about 
0.2 cm H 2 0) [Skalak et al., 1984]. An order of magnitude estimate of the pressure drop to be expected 
at the relatively slow flow rates of the lymphatics shows, however, that the pressure drop from the 
interstitium may be significantly lower [Schmid-Schonbein, 1990]. Furthermore, the assumption of a 
steady pressure drop is not in agreement with the substantial evidence that lymph flow rate is enhanced 
under unsteady conditions (see below). Some investigators have postulated an osmotic pressure in the 
lymphatics to aspirate fluid into the initial lymphatics [Casley- Smith, 1972] due to ultrafiltration across 
the lymphatic endothelium, a mechanism that has been referred to as 'bootstrap effect' [Perl, 1975]. 
Critical tests of this hypothesis, such as the microinjection of hyperosmotic protein solutions, have not 
lead to a uniformly accepted hypothesis for lymph formation involving an osmotic pressure. Others have 
suggested a retrograde aspiration mechanism, such that the recoil in the collecting lymphatics serves to 
lower the pressure in the initial lymphatics upstream of the collecting lymphatics [Reddy, 1986; Reddy 
and Patel, 1995], or an electric charge difference across lymphatic endothelium [O'Morchoe et al., 1984]. 

Tissue Mechanical Motion and Lymphatic Pumping 

An intriguing feature of the lymphatic pressure is that lymphatic flow rates depend on tissue motion. In 
a resting tissue, the lymph flow rate is relatively small. But different forms of tissue motion serve to 
enhance the lymph flow. This was originally shown for pulsatile pressure in the rabbit ear. Perfusion of 
the ear with steady pressure (even at the same mean pressure) serves to stop most lymph transport, while 
pulsatile pressure promotes lymph transport [Parsons and McMaster, 1938]. In light of the paired 
arrangement of the arterioles and lymphatics, periodic expansion of the arterioles lead to compression 
of the adjacent lymphatics, and vice versa, a reduction of the arteriolar diameter during the pressure 
reduction phase leads to expansion of the adjacent lymphatics [Skalak et al., 1984] (Fig. 34.4). Vasomotion 
which is associated with a slower contraction of the arterioles, but with a larger amplitude than pulsatile 
pressure, serves to increase lymph formation [Intaglietta and Gross, 1982; Colantuoni et al., 1984]. In 
addition, muscle contractions, simple walking [Olszewski and Engeset, 1980], respiration, intestinal 
peristalsis, skin compression [Ohhashi et al., 1991] and other tissue motions are associated with an 
increase in the lymph flow rates. Periodic tissue motions are significantly more effective to enhance the 
lymph flow than for example, elevation of the venous pressure [Ikomi et al., 1996], which is also associated 
with enhanced fluid filtration [Renkin et al., 1977]. 

A requirement for lymph fluid flow is the periodic expansion and compression of the initial lymphatics. 
Since initial lymphatics do not have their own smooth muscle, the expansion and compression of the 
initial lymphatics depends on the motion of tissue in which they are embedded. In skeletal muscle, the 
strategic location of the initial lymphatics in the adventitia of the arterioles provides the opportunity 
that the expansion and compression can be achieved via several mechanisms: (1) arteriolar pressure 
pulsations or vasomotion, (2) active or passive skeletal muscle contractions, or (3) external muscle 
compression. Direct measurements of the cross-sectional area of the initial lymphatics during arteriolar 
contractions or during skeletal muscle shortening support this hypothesis [Skalak et al., 1984; Mazzoni 
et al., 1990] (Fig. 34.5). The different lymph pump mechanisms are additive. Resting skeletal muscle has 
much lower lymph flow rates (provided largely by the arteriolar pressure pulsation and vasomotion) 
than skeletal muscle during exercise (produced by a combination of intramuscular pressure pulsations 
and skeletal muscle shortening). 

Measurements of lymph flow rates in an afferent lymph vessel (diameter about 300 to 500 um, proximal 
to the popliteal node) in the hind leg [Ikomi and Schmid-Schonbein, 1996] serve to demonstrate that 
lymph fluid formation can be influenced by passive or active motion of the surrounding tissue. The 
lymphatics in this tissue region drain muscle and skin of the hind leg, and the majority are of the initial 
type, whereas collecting lymphatics are detected outside the tissue parenchyma in the fascia proximal to 
the node. Without whole leg rotation, lymph flow remains at low non-zero values. If the pulse pressure 
is stopped, the lymph flow falls to values below the detection limit (less than about 10% of the values 
during pulse pressure). Introduction of whole leg passive rotation causes a strong, frequency-dependent 
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FIGURE 34.5 Histograms of initial lymphatic cross-sectional area in rat spinotrapezius m. before (left) and after 
(middle) contraction of the paired arteriole with norepinephrine. Lymphatic cross-sectional area as a function of 
muscle length during active contraction or passive stretch (right). Cross-sectional area and muscle length are 
normalized with respect to the values in-vivo in resting muscle. Note the expansion of the initial lymphatics with 
contraction of the arterioles or muscle stretch. [Skalak et al., 1984; Mazzoni et al., 1990.] 
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FIGURE 34.6 Lymph flow rates in a prenodal afferent lymphatic draining the hindleg as a function of the frequency 
of a periodic surface shear motion (massage) without (panels A, B) and with (panels C, D) elevation of the venous 
pressure by placement of a cuff. Zero frequency refers to a resting leg with a lymph flow rate which depends on pulse 
pressure. The amplitudes of the tangential skin shear motion were 1 cm and 0.5 cm (panels A, B) and 1cm in the 
presence of the elevated venous pressure (panels C, D). Note that the ordinates in panels C and D are larger than 
those in panels A and B. [Ikomi and Schmid-Schonbein, 1996.] 

lymph flow rate which increases linearly with the logarithm of frequency between 0.03 Hz and 1.0 Hz 
(Fig. 34.6). Elevation of the venous pressure, which serves to enhance fluid filtration from the vasculature 
and elevates the flow rates, does not significantly alter the dependency on periodic tissue motion [Ikomi 
etal., 1996]. 
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FIGURE 34.7 Lymph leukocyte count (left) and leukocyte flux (right) before and after application of periodic hind 
leg skin shear motion (massage) at a frequency of about 1 Hz and amplitude of 1 cm. The flux rates were computed 
from the product of lymph flow rates and the lymphocyte counts. *Statistically significant differences from case 
without massage. Adapted from Ikomi et al., 1996. 

Similarly, application of passive tissue compression on the skin serves to elevate the lymph flow rate in 
a frequency-dependent manner. The lymph flow rates are determined to a significant degree by the local 
action of the lymph pump, since arrest of the heart beat and reduction of the central blood pressure to 
zero do not stop lymph flow, and instead reduces the lymph flow rates only about 50% during continued 
leg motion or application of periodic shear stress to the skin for several hours [Ikomi and Schmid- 
Schonbein, 1996]. Periodic compression of the initial lymphatics also serves to enhance proteins and even 
lymphocyte counts in the lymphatics [Ikomi et al., 1996] (Fig. 34.7). Thus either arteriolar smooth muscle 
or parenchymal skeletal muscle activity serve to expand and compress the initial lymphatics in skeletal 
muscle. The mechanisms serve to adjust lymph flow rates according to organ activity, such that a resting 
skeletal muscle has low lymph flow rates. During normal daily activity or mild or strenuous exercise, the 
lymph flow rates as well as the protein and cell transport in the lymphatics increases [Olszewski et al., 1977] . 

In conclusion, the lymphatics are a unique transport system that is even present in primitive physio- 
logical systems. It likely carries out a multitude of functions, many of which have yet to be discovered. 
Details of its operation and its growth kinetics must await a more detailed bioengineering analysis, 
especially at the molecular level [Jeltsch et al., 1997]. 
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Defining Terms 

Capillary: The smallest blood vessel of the body which provides oxygen and other nutrients to nearby 

cells and tissues. 
Colloid Osmotic Pressure: A negative pressure which depends on protein concentration (mainly of 

albumin and globulins) and prevents excess filtration across the capillary wall. 
Edema: Excess fluid or swelling within a given tissue. 
Interstitium: The space between cells of various tissues of the body. Normally fluid and proteins within 

this space are transported from the capillary to the initial lymphatic vessel. 
Lymphatic System: The clear network of vessels which return excess fluid and proteins to the blood 

via the thoracic duct. 
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35.1 Introduction 







The inner ear is a transducer of mechanical force to appropriate neural excitation. The key element is 
the receptor cell, or hair cell, which has cilia on the apical surface and afferent (and sometimes efferent) 
neural synapses on the lateral walls and base. Generally for hair cells, mechanical displacement of the 
cilia in the forward direction toward the tallest cilia causes the generation of electrical impulses in the 
nerves, while backward displacement causes inhibition of spontaneous neural activity. Displacement in 
the lateral direction has no effect. For moderate frequencies of sinusoidal ciliary displacement (20 to 
200 Hz), the neural impulses are in synchrony with the mechanical displacement, one impulse for each 
cycle of excitation. Such impulses are transmitted to the higher centers of the brain and can be perceived 
as sound. For lower frequencies, however, neural impulses in synchrony with the excitation are apparently 
confused with the spontaneous, random firing of the nerves. Consequently, there are three mechanical 
devices in the inner ear of vertebrates that provide perception in the different frequency ranges. At zero 
frequency, i.e., linear acceleration, the otolith membrane provides a constant force acting on the cilia of 
hair cells. For low frequencies associated with rotation of the head, the semicircular canals provide the 
proper force on cilia. For frequencies in the hearing range, the cochlea provides the correct forcing of 
hair cell cilia. In nonmammalian vertebrates, the equivalent of the cochlea is a bent tube, and the upper 
frequency of hearing is around 7 kHz. For mammals, the upper frequency is considerably higher, 20 kHz 
for man but extending to almost 200 kHz for toothed whales and some bats. Other creatures, such as 
certain insects, can perceive high frequencies, but do not have a cochlea nor the frequency discrimination 
of vertebrates. 

Auditory research is a broad field [Keidel and Neff, 1976]. The present article provides a brief guide 
of a restricted view, focusing on the transfer of the input sound pressure into correct stimulation of hair 
cell cilia in the cochlea. In a general sense, the mechanical functions of the semicircular canals and the 
otoliths are clear, as are the functions of the outer ear and middle ear; however, the cochlea continues 
to elude a reasonably complete explanation. Substantial progress in cochlear research has been made in 
the past decade, triggered by several key discoveries, and there is a high level of excitement among workers 
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in the area. It is evident that the normal function of the cochlea requires a full integration of mechanical, 
electrical, and chemical effects on the milli-, micro-, and nanometer scales. Recent texts, which include 
details of the anatomy, are by Pickles [1988] and Gulick and coworkers [1989]. A summary of analysis 
and data related to the macromechanical aspect up to 1982 is given by Steele [1987], and more recent 
surveys specifically on the cochlea are by de Boer [1991], Dallos [1992], Hudspeth [1989], Ruggero 
[1993], and Nobili and colleagues [1998]. 



35.2 Anatomy 



The cochlea is a coiled tube in the shape of a snail shell (Cochlea = Schnecke = Snail), with length about 
35 mm and radius about 1 mm in man. There is not a large size difference across species: the length is 
60 mm in elephant and 7 mm in mouse. There are two and one-half turns of the coil in man and dolphin, 
and five turns in guinea pig. Despite the correlation of coiling with hearing capability of land animals 
[West, 1985], no significant effect of the coiling on the mechanical response has yet been identified. 

Components 

The cochlea is filled with fluid and divided along its length by two partitions. The main partition is at 
the center of the cross-section and consists of three segments: ( 1 ) on one side — the bony shelf, (or primary 
spiral osseous lamina), (2) in the middle, an elastic segment (basilar membrane) (shown in Fig. 35.1) and 
(3) on the other side, a thick support [spiral ligament). The second partition is Reissner's membrane, 
attached at one side above the edge of the bony shelf and attached at the other side to the wall of the 
cochlea. Scala media is the region between Reissner's membrane and the basilar membrane, and is filled 
with endolymphatic fluid. This fluid has an ionic content similar to intracellular fluid, high in potassium 
and low in sodium, but with a resting positive electrical potential of around +80 mV. The electrical 
potential is supplied by the stria vascularis on the wall in scala media. The region above Reissner's 
membrane is scala vestibuli, and the region below the main partition is scala tympani. Scala vestibuli and 
scala tympani are connected at the apical end of the cochlea by an opening in the bony shelf, the 
helicotrema, and are filled with perilymphatic fluid. This fluid is similar to extracellular fluid, low in 
potassium and high in sodium with zero electrical potential. Distributed along the scala media side of 
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FIGURE 35.1 Finite element calculation for the deformation of the cochlear partition due to pressure on the basilar 
membrane (BM). Outer hair cell (OHC) stereocilia are sheared by the motion of the pillars of Corti and reticular 
lamina relative to the tectorial membrane (TM). The basilar membrane is supported on the left by the bony shelf 
and on the right by the spiral ligament. The inner hair cells (IHC) are the primary receptors, each with about 
20 afferent synapses. The inner sulcus (IC) is a fluid region in contact with the cilia of the inner hair cells. 
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the basilar membrane is the sensory epithelium, the organ of Corti. This contains one row of inner hair 
cells and three rows of outer hair cells. In humans, each row contains about 4000 cells. Each of the inner 
hair cells has about twenty afferent synapses; these are considered to be the primary receptors. In 
comparison, the outer hair cells are sparsely innervated but have both afferent (5%) and efferent (95%) 
synapses. 

The basilar membrane is divided into two sections. Connected to the edge of the bony shelf, on the 
left in Fig. 35.1, is the arcuate zone, consisting of a single layer of transverse fibers. Connected to the edge 
of the spiral ligament, on the right in Fig. 35.1, is the pectinate zone, consisting of a double layer of 
transverse fibers in an amorphous ground substance. The arches of Corti form a truss over the arcuate 
zone, which consist of two rows of pillar cells. The foot of the inner pillar is attached at the point of 
connection of the bony shelf to the arcuate zone, while the foot of the outer pillar cell is attached at the 
common border of the arcuate zone and pectinate zone. The heads of the inner and outer pillars are 
connected and form the support point for the recticular lamina. The other edge of the recticular lamina 
is attached to the top of Henson cells, which have bases connected to the basilar membrane. The inner 
hair cells are attached on the bony shelf side of the inner pillars, while the three rows of outer hair cells 
are attached to the recticular lamina. The region bounded by the inner pillar cells, the recticular lamina, 
the Henson cells, and the basilar membrane forms another fluid region. This fluid is considered to be 
perilymph, since it appears that ions can flow freely through the arcuate zone of the basilar membrane. 
The cilia of the hair cells protrude into the endolymph. Thus the outer hair cells are immersed in 
perilymph at mV, have an intracellular potential of -70 mV, and have cilia at the upper surface immersed 
in endolymph at a potential of +80 mV. In some regions of the ears of some vertebrates [Freeman and 
Weiss, 1990], the cilia are free standing. However, mammals always have a tectorial membrane, originating 
near the edge of the bony shelf and overlying the rows of hair cells parallel to the recticular lamina. The 
tallest rows of cilia of the outer hair cells are attached to the tectorial membrane. Under the tectorial 
membrane and inside the inner hair cells is a fluid space, the inner sulcus, filled with endolymph. The 
cilia of the inner hair cells are not attached to the overlying tectorial membrane, so the motion of the 
fluid in the inner sulcus must provide the mechanical input to these primary receptor cells. Since the 
inner sulcus is found only in mammals, the fluid motion in this region generated by acoustic input may 
be crucial to high frequency discrimination capability. 

With a few exceptions of specialization, the dimensions of all the components in the cross section of 
the mammalian cochlea change smoothly and slowly along the length, in a manner consistent with high 
stiffness at the base, or input end, and low stiffness at the apical end. For example, in the cat the basilar 
membrane width increases from 0.1 to 0.4 mm while the thickness decreases from 13 to 5 um. The density 
of transverse fibers decreases more than the thickness, from about 6000 fibers per um at the base to 
500 per um at the apex [Cabezudo 1978]. 

Material Properties 

Both perilymph and endolymph have the viscosity and density of water. The bone of the wall and the 
bony shelf appear to be similar to compact bone, with density approximately twice that of water. The 
remaining components of the cochlea are soft tissue with density near that of water. The stiffnesses of 
the components vary over a wide range, as indicated by the values of Young's modulus listed in Table 35.1. 
These values are taken directly or estimated from many sources, including the stiffness measurements in 
the cochlea by Bekesy [1960], Gummer and coworkers [1981], Strelioff and Flock [1984], Miller [1985], 
Zwislocki and Cefaratti [1989], and Olson and Mountain [1994]. 

35.3 Passive Models 

The anatomy of the cochlea is complex. By modeling, one attempts to isolate and understand the essential 
features. Following is an indication of proposition and controversy associated with a few such models. 
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TABLE 35.1 Typical Values and Estimates for Young's Modulus E 

Compact bone 

Keratin 

Basilar membrane fibers 

Microtubules 

Collagen 

Reissner's membrane 

Actin 

Red blood cell, extended (assuming thickness =10 nm) 

Rubber, elastin 

Basilar membrane ground substance 

Tectorial membrane 

Jell-O 

Henson's cells 



Resonators 

The ancient Greeks suggested that the ear consisted of a set of tuned resonant cavities. As each component 
in the cochlea was discovered subsequently, it was proposed to be the tuned resonator. The most well 
known resonance theory is Helmholtz's. According to this theory, the transverse fibers of the basilar 
membrane are under tension and respond like the strings of a piano. The short strings at the base respond 
to high frequencies and the long strings toward the apex respond to low frequencies. The important 
feature of the Helmholz theory is the place principle, according to which the receptor cells at a certain 
place along the cochlea are stimulated by a certain frequency. Thus the cochlea provides a real-time 
frequency separation (Fourier analysis) of any complex sound input. This aspect of the Helmholtz theory 
has since been validated, since each of the some 30,000 fibers exiting the cochlea in the auditory nerve 
is sharply tuned to a particular frequency. A basic difficulty with such a resonance theory is that sharp 
tuning requires small damping, which is associated with a long ringing after the excitation ceases. Yet, 
the cochlea is remarkable for combining sharp tuning with short time delay for the onset of reception 
and the same short time delay for the cessation of reception. 

A particular problem with the Helmholtz theory arises from the equation for the resonant frequency 
for a string under tension: 



1 '• T 
f = — — (35.1) 

2b\ph 

in which T is the tensile force per unit width, p is the density, b is the length, and h is the thickness of 
the string. In man, the frequency range over which the cochlea operates is /= 200 to 20,000 Hz, a factor 
of 100, while the change in length b is only a factor of 5 and the thickness of the basilar membrane h 
varies the wrong way by a factor of 2 or so. Thus to produce the necessary range of frequency, the 
tension T would have to vary by a factor of about 800. In fact the spiral ligament, which would supply 
such tension, varies in area by a factor of only 10. 

Traveling Waves 

No theory anticipated the actual behavior found in the cochlea in 1928 by Bekesy [I960]. He observed 
traveling waves moving along the cochlea from base toward apex which have a maximum amplitude at 
a certain place. The place depends on the frequency, as in the Helmholz theory, but the amplitude envelope 
in not very localized. In Bekesy 's experimental models, and in subsequent mathematical and experimental 
models, the anatomy of the cochlea is greatly simplified. The coiling, Reissner's membrane, and the organ 
of Corti are all ignored, so the cochlea is treated as a straight tube with a single partition. (An exception 
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is in Fuhrmann and colleagues [ 1986] ). A gradient in the partition stiffness similar to that in the cochlea, 
gives beautiful traveling waves in both experimental and mathematical models. 

One-Dimensional Model 

A majority of work has been based on the assumption that the fluid motion is one-dimensional. With 
this simplification the governing equations are similar to those for an electrical transmission line and for 
the long wavelength response of an elastic tube containing fluid. The equation for the pressure p in a 
tube with constant cross-sectional area A and with constant frequency of excitation is: 

d 2 p 2pco 2 

— i - + -^ — p = 0, 35.2 

dx 2 AK F 

in which x is the distance along the tube, p is the density of the fluid, ro is the frequency in radians per 
second, and K is the generalized partition stiffness, equal to the net pressure divided by the displaced 
area of the cross-section. The factor of 2 accounts for fluid on both sides of the elastic partition. Often 
K is represented in the form of a single degree-of-freedom oscillator: 

K=k+i(Dd-m(£> 2 , (35.3) 

in which k is the static stiffness, d is the damping, and m is the mass density: 

h 
m = p p - (35.4) 

b 

in which p p is the density of the plate, h is the thickness, and b is the width. Often the mass is increased 
substantially to provide better curve fits. A good approximation is to treat the pectinate zone of the basilar 
membrane as transverse beams with simply supported edges, for which 

k= WEh ^ (35.5) 

b 5 

in which E is the Young's modulus, and c ( is the volume fraction of fibers. Thus for the moderate changes 
in the geometry along the cochlea as in the cat, h decreasing by a factor of 2, c ( decreasing by a factor of 
12, b increasing by a factor of 5, the stiffness k from Eq. (35.4) decreases by five orders of magnitude, 
which is ample for the required frequency range. Thus it is the bending stiffness of the basilar membrane 
pectinate zone and not the tension which governs the frequency response of the cochlea. The solution 
of Eq. (35.2) can be obtained by numerical or asymptotic (called WKB or CLG) methods. The result is 
traveling waves for which the amplitude of the basilar membrane displacement builds to a maximum 
and then rapidly diminishes. The parameters of K are adjusted to obtain agreement with measurements 
of the dynamic response in the cochlea. Often all the material of the organ of Corti is assumed to be 
rigidly attached to the basilar membrane so that h is relatively large and the effect of mass m is large. 
Then the maximum response is near the in vacua resonance of the partition given by 

co 2 = — . (35.6) 

hp 

The following are objections to the 1-D model [e.g., Siebert, 1974]: (1) The solutions of Eq. (35.2) 
show wavelengths of response in the region of maximum amplitude that are small in comparison with 
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the size of the cross section, violating the basic assumption of 1-D fluid flow. (2) In the drained cochlea, 
Bekesy [1960] observed no resonance of the partition, so there is no significant partition mass. The 
significant mass is entirely from the fluid and therefore Eq. (35.6) is not correct. This is consistent with 
the observations of experimental models. (3) In model studies by Bekesy [1960] and others, the local- 
ization of response is independent of the area A of the cross-section. Thus Eq. (35.2) cannot govern the 
most interesting part of the response, the region near the maximum amplitude for a given frequency. 
(4) Mechanical and neural measurements in the cochlea show dispersion which is incompatible with the 
1-D model [Lighthill, 1991]. (5) The 1-D model fails badly in comparison with experimental measure- 
ments in models for which the parameters of geometry, stiffness, viscosity, and density are known. 

Nevertheless, the simplicity of Eq. (35.2) and the analogy with the transmission line have made the 
1-D model popular. We note that there is interest in utilizing the principles in an analog model built on 
a silicon chip, because of the high performance of the actual cochlea. Watts [1993] reports on the first 
model with an electrical analog of 2-D fluid in the scali. An interesting observation is that the transmission 
line hardware models are sensitive to failure of one component, while the 2-D model is not. In experi- 
mental models, Bekesy found that a hole at one point in the membrane had little effect on the response 
at other points. 

Two-Dimensional Model 

The pioneering work with two-dimensional fluid motion was begun in 1931 by Ranke, as reported in 
Ranke [1950] and discussed by Siebert [1974]. Analysis of 2-D and 3-D fluid motion without the a priori 
assumption of long or short wavelengths and for physical values of all parameters is discussed by Steele 
[1987]. The first of two major benefits derived from the 2-D model is the allowance of short wavelength 
behavior, i.e, the variation in fluid displacement and pressure in the duct height direction. Localized fluid 
motion near the elastic partition generally occurs near the point of maximum amplitude and the exact 
value of A becomes immaterial. The second major benefit of a 2-D model is the admission of a stiffness- 
dominated elastic partition (i.e., massless) which better approximates the physiological properties of the 
basilar membrane. The two benefits together address all the objections the 1-D model discussed previously. 
Two-dimensional models start with the Navier- Stokes and continuity equations governing the fluid 
motion, and an anisotropic plate equation governing the elastic partition motion. The displacement 
potential cp for the incompressible and inviscid fluid must satisfy Laplace's equation: 

<P,„+<P, zz = (35.7) 

where x is the distance along the partition and z the distance perpendicular to the partition, and the 
subscripts with commas denote partial derivatives. The averaged potential and the displacement of the 
partition are: 



<p = — lydz w=(f]x, o) (35.8) 



H 

o 

so Eq. (35.7) yields the "macro" continuity condition (for constant H): 

(p„-w=0. (35.9) 

An approximate solution is 



pi x, z, t\ = Flxlcosh nlxllz — HI 



(35.10) 
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where F is an unknown amplitude function, n is the local wave number, H is the height of the duct, f is 
time, and co is the frequency. This is an exact solution for constant properties and is a good approximation 
when the properties vary slowly along the partition. The conditions at the plate fluid interface yield the 
dispersion relation: 

«tanh(«H) = ^ 3C °-H (35.11) 

The averaged value Eq. (35.8) of the approximate potential Eq. (35.10) is: 

^_ FsinhnH 

<p = (35.12) 

nH 

so the continuity condition Eq. (35.9) yields the equation: 

^ xx + n 2 tp = (35.13) 

For small values of the wave number n, the system Eq. (35.11) andEq. (35.13) reduce to the 1-D problem 
Eq. (35.2). 

For physiological values of the parameters, the wave number for a given frequency is small at the stapes 
and becomes large (i.e., short wave lengths) toward the end of the duct. With this formulation, the form 
of the wave is not assumed. It is clear that for large n the WKB solution will give excellent approximation 
to the solution of Eq. (35.13) in exponential form. So it is possible to integrate Eq. (35.13) numerically 
for small n for which the solution is not exponential and match the solution with the WKB approximation. 
This provides a uniformly valid solution for the entire region of interest without an a prior assumption 
of the wave form. 

For a physically realistic model, the mass of the membrane can be neglected and K written as: 



K=k(l+ie) (35.14) 



in which k is the static stiffness. For many polymers, the material damping s is nearly constant. If the 
damping comes from the viscous boundary layer of the fluid, then e is approximated by 



(35.15) 



2pco 



in which u is the viscosity. For water, e is small with a value near 0.05 at the point of maximum amplitude. 
The actual duct is tapered, so H= H{x) and additional terms must be added to Eq. (35.13). 

The best verification of the mathematical model and calculation procedure comes from comparison 
with measurements in experimental models for which the parameters are known. Zhou and coworkers 
[1994] provide the first life-sized experimental model, designed to be similar to the human cochlea, but 
with fluid viscosity 28 times that of water to facilitate optical imaging. Results are shown in Figs. 35.2 
and 35.3. Eq. (35.13) gives rough agreement with the measurements. 

Three-Dimensional Model 

A further improvement in the agreement with experimental models can be obtained by adding the 
component of fluid motion in the direction across the membrane for a full 3-D model. The solution by 
direct numerical means is computationally intensive, and was first carried out by Raftenberg [1990] , who 
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FIGURE 35.2 Comparison of 3-D model calculations (solid curves) with experimental results of Zhou and co- 
workers [1994] (dashed curves) for the amplitude envelopes for different frequencies. This is a life-sized model, but 
with an isotropic BM and fluid viscosity 28 times that of water. The agreement is reasonable, except for the lower 
frequencies. 

reports a portion of his results for the fluid motion around the organ of Corti. Bohnke and colleagues 
[1996] use the finite element code ANSYS for the most accurate description to date of the structure of 
the organ of Corti. However, the fluid is not included and only a restricted segment of the cochlea 
considered. The fluid is also not included in the finite element calculations of Zhang and colleagues 
[1996]. A "large finite element method", which combines asymptotic and numerical methods for shell 
analysis, can be used for an efficient computation of all the structural detail as shown in Fig. 35.1. Both 
the fluid and the details of the structure are considered with a simplified element description and 
simplified geometry by Kolston and Ashmore [1996], requiring some 10 5 degrees of freedom and hours 
of computing time (on a 66 MHz PC) for the linear solution for a single frequency. The asymptotic WKB 
solution, however, provides the basis for computing the 3-D fluid motion [Steele, 1987] and yields 
excellent agreement with older measurements of the basilar membrane motion in the real cochlea (with 
a computing time of 1 second per frequency). The 2-D analysis Eq. (35.13) can easily be extended to 
3-D. The 3-D WKB calculations are compared with the measurements of the basilar membrane displace- 
ment in the experimental model of Zhou and coworkers [1994] in Figs. 35.2 and 35.3. As shown by Taber 
and Steele [1979], the 3-D fluid motion has a significant effect on the pressure distribution. This is 
confirmed by the measurements by Olson [1998] for the pressure at different depths in the cochlea, that 
show a substantial increase near the partition. 

35.4 The Active Process 

Before around 1980, it was thought that the processing may have two levels. First the basilar membrane 
and fluid provide the correct place for a given frequency (a purely mechanical "first filter"). Subsequently, 
the micromechanics and electrochemistry in the organ of Corti, with possible neural interactions, per- 
form a further sharpening (a physiologically vulnerable "second filter"). 

A hint that the two-filter concept had difficulties was in the measurements of Rhode [1971], who 
found significant nonlinear behavior of the basilar membrane in the region of the maximum amplitude 
at moderate amplitudes of tone intensity. Passive models cannot explain this, since the usual mechanical 
nonlinearities are significant only at very high intensities, i.e., at the threshold of pain. Russell and Sellick 
[1977] made the first in vivo mammalian intracellular hair cell recordings and found that the cells are as 
sharply tuned as the nerve fibers. Subsequently, improved measurement techniques in several laboratories 
found that the basilar membrane is actually as sharply tuned as the hair cells and the nerve fibers. Thus 
the sharp tuning occurs at the basilar membrane. No passive cochlear model, even with physically 
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FIGURE 35.3 Comparison of 3-D model calculations with experimental results of Zhou and coworkers [1994] for 
amplitude at the place x = 19 mm as a function of frequency. The scales are logarithmic (20 dB is a factor of 10 in 
amplitude). Case 1 shows a direct comparison with the physical parameters of the experiment, with isotropic BM 
and viscosity 28 times that of water. Case 2 is computed for the viscosity reduced to that of water. Case 3 is computed 
for the BM made of transverse fibers. Case 4 shows the effect of active OHC feed-forward, with the pressure gain 
a = 0.21 and feed-forward distance Ax = 25 microns. Thus lower viscosity, BM orthotropy, and active feed-forward 
all contribute to higher amplitude and increased localization of the response. 

unreasonable parameters, has yielded amplitude and phase response similar to such measurements. 
Measurements in a damaged or dead cochlea show a response similar to that of a passive model. Further 
evidence for an active process comes from Kemp [1978], who discovered that sound pulses into the ear 
caused echoes coming from the cochlea at delay times corresponding to the travel time to the place for 
the frequency and back. Spontaneous emission of sound energy from the cochlea has now been measured 
in the external ear canal in all vertebrates [Probst, 1990]. Some of the emissions can be related to the 
hearing disability of tinnitus (ringing in the ear). The conclusion drawn from these discoveries is that 
normal hearing involves an active process in which the energy of the input sound is greatly enhanced. 
A widely accepted concept is that spontaneous emission of sound energy occurs when the local amplifiers 
are not functioning properly and enter some sort of limit cycle (Zweig and Shera, 1995). However, there 
remains doubt about the nature of this process (Allen and Neely [1992], Hudspeth [1989], Nobili and 
coworkers [1998]). 

Outer Hair Cell Electromotility 

Since the outer hair cells have sparse afferent innervation, they have long been suspected of serving a 
basic motor function, perhaps beating and driving the subtectorial membrane fluid. Nevertheless, it was 
surprising when Brownell and colleagues [1985] found that the outer hair cells have electromotility: the 
cell expands and contracts in an oscillating electric field, either extra- or intracellular. The electromotility 
exists at frequencies far higher than possible for normal contractile mechanisms [Ashmore, 1987], The 
sensitivity is about 20 nm/mV (about 10 5 better than PZT-2, a widely used piezoelectric ceramic). It has 
not been determined if the electromotility can operate to the 200 kHz used by high frequency mammals. 
However, a calculation of the cell as a pressure vessel with a fixed charge in the wall [Jen and Steele, 1987] 
indicates that, despite the small diameter (10 um), the viscosity of the intra- and extracellular fluid is 
not a limitation to the frequency response. In a continuation of the work reported by Hemmert and 
coworkers [1996], the force generation is found to continue to 80 kHz in the constrained cell. In 
contradiction, however, the same laboratory (Preyer and coworkers [1996]) finds that the intracellular 
voltage change due to displacement of the cilia drops off at a low frequency. 

The motility appears to be due to a passive piezoelectric behavior of the cell plasma membrane [Kalinec 
and colleagues, 1992]. Iwasa and Chadwick [1992] measured the deformation of a cell under pressure 
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loading and voltage clamping and computed the elastic properties of the wall, assuming isotropy. It 
appears that for agreement with both the pressure and axial stiffness measurements, the cell wall must 
be orthotropic, similar to a filament reinforced pressure vessel with close to the optimum filament angle 
of 38° (Tolomeo and Steele, 1995). Holley [1990] finds circumferential filaments of the cytoskeleton with 
an average nonzero angle of about 26°. Mechanical measurements of the cell wall by Tolomeo and 
coworkers [1996] directly confirm the orthotropic stiffness. 



Hair Cell Gating Channels 

In 1984, Pickles and colleagues discovered tip links connecting the cilia of the hair cell, as shown in 
Fig. 35.4, that are necessary for the normal function of the cochlea. These links are about 6 nm in diameter 
and 200 nm long [Pickles, 1988]. Subsequent work by Hudspeth [1989] and Assad and Corey [1992] 
shows convincingly that there is a resting tension in the links. A displacement of the ciliary bundle in 
the excitatory direction causes an opening of ion channels in the cilia, which in turn decreases the 
intracellular potential. This depolarization causes neural excitation, and in the piezoelectric outer hair 
cells, a decrease of the cell length. 

A purely mechanical analog model of the gating is in Steele [1992], in which the ion flow is replaced 
by viscous fluid flow and the intracellular pressure is analogous to the voltage. A constant flow-rate pump 
and leak channel at the base of the cell establish the steady-state condition of negative intracellular 
pressure, tension in the tip links, and a partially opened gate at the cilia through which there is an average 
magnitude of flow. The pressure drop of the flow through the gate has a nonlinear negative spring effect 
on the system. If the cilia are given a static displacement, the stiffness for small perturbation displacement 
is dependent on the amplitude of the initial displacement, as observed by Hudspeth [ 1989] . For oscillatory 
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FIGURE 35.4 Model of outer hair cell. The normal pumping of ions produces negative intracellular electrical 
potential. Displacement of the cilia in the excitatory direction causes an opening of the ion channels in the cilia. 
There is evidence that the channels are located at the point of closest proximity of the two cilia. The tip and lateral 
links are important to maintain the correct stiffness and position. The opening of the channels decreases the 
intracellular potential, causing a piezoelectric contraction of the cell and excitation of the neural synapses. This can 
be modeled by a constant flow rate pump, leak channel, and spring controlled gate. The mechanical effect of the 
flow on the gate is important. The inner hair cell also has cilia, no piezoelectric property, but about twenty afferent 
synapses. 
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forcing of the cilia, the fluid analog shows that a gain in power is possible, as in an electrical or fluidic 
amplifier, and that a modest change in the parameters can lead to instability. 

Thus it appears that amplification in the cochlea resides in the gating of the outer hair cell cilia, while 
the motility is due to passive piezoelectric properties of the cell wall. The flow through the gate has 
significant nonlinearity at small amplitudes of displacement of the cilia (10 nm). Sufficiently high 
amplitudes of displacement of the cilia will cause the tip links to buckle. We estimate that this will occur 
at around 70 dB sound pressure level, thereby turning off the active process for higher sound intensity. 

There is evidence reported by Hackney and colleagues [1996] that the channels do not occur at either 
end of the tip links, but at the region of closest proximity of the cilia, as shown in Fig. 35.4. Mechanical 
models such as in Furness and colleagues [1997], indicate that the channels at such a location can also 
be opened by force on the cilia. More elaborate models show that the stiffness of the tip links remains 
important to the mechanism. 

There may be a connection between the gating channels and the discovery by Canlon and coworkers 
[1988]. They found that acoustic stimulation of the wall of the isolated outer hair cell caused a tonic (DC) 
expansion of the cell over a narrow frequency band, which is related to the place for the cell along the 
cochlea. Khanna and coworkers [1989] observe a similar tonic displacement of the whole organ of Corti. 

35.5 Active Models 

De Boer, [1991], Geisler [1993], and Hubbard [1993] discuss models in which the electromotility of the 
outer hair cells feeds energy into the basilar membrane. The partition stiffness K is expanded from 
Eq. (35.3) into a transfer function, containing a number of parameters and delay times. These are classed 
as phenomenological models, for which the physiological basis of the parameters is not of primary 
concern. The displacement gain may be defined as the ratio of ciliary shearing displacement to cell 
expansion. For these models, the gain used is larger by orders of magnitude than the maximum found 
in laboratory measurements of isolated hair cells. 

Another approach (Steele and colleagues [1993]), which is physiologically based, appears promising. 
The outer hair cells are inclined in the propagation direction. Thus the shearing of the cilia at the 
distance x causes a force from the hair cells acting on the basilar membrane at the distance x + Ax. This 
"feed-forward" law can be expressed in terms of the pressure as 



Po l , C ( X+Ax ) = a p( X ) = a \ 2 P f ( X )+Po,, C ( X ) 



(35.16) 



where p, the total pressure acting on the basilar membrane, consists of the effective pressure acting on 
the basilar membrane from the hair cells p ohc and the pressure from the fluid p,. The coefficient a is the 
force gain supplied by the outer hair cells. With this law, Eq. (35.11) is replaced by: 



1— e \n 



tanhnH = 2^ C °-H (35.17) 

AK 



from which the local wave number n must be computed numerically. Only two new parameters are 
needed, the gain a and the spacing Ax. With physiologically reasonable gain a = 0.18 and spacing Ax = 
20 um, the result is an increase of the response of the basilar membrane for higher frequencies by a factor 
of 10 2 in a narrow sector, apical to the passive peak. The simple feed-forward addition in Eq. (35.17) 
enhances a narrow band of wave lengths without a closed control loop. At this time, it appears that much 
of the elaborate structure of the organ of Corti is for the purpose of such a "feed-forward". This approach 
is also found to work well for the 1-D model by Geisler and Sang [1995]. The results from the 3-D model 
for the effect of adding some feed-forward are shown in Fig. 35.3. One defect of the current feed-forward 
results is that the shift of the maximum response point is about one octave, as seen in Fig. 35.3, rather 
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than one-half octave consistently shown in normal cochlear measurements of mechanical and neural 
response. 

The significant nonlinear effect is the saturation of the active process at high amplitudes. This can be 
computed by letting the gain a be a function of the amplitude. 

In the normal, active cochlea, it was first observed by Khanna and colleagues [1989] and subsequently 
by Gummer and colleagues [1996], that the tectorial membrane (TM in Fig. 35.1) has a substantially 
higher amplitude than the basilar membrane. Presumably, the electromotile expansion of the outer hair 
cells encounters less resistance from the tectorial membrane than the basilar membrane. This shows the 
importance of getting the correct stiffness and geometry into a model. 



35.6 Fluid Streaming? 



Bekesy [1960] and many others have observed significant fluid streaming in the actual cochlea and in 
experimental models. Particularly for the high frequencies, it is tempting to seek a component of steady 
streaming as the significant mechanical stimulation of the inner hair cells [Lighthill, 1992]. Passive models 
indicated that such streaming occurs only at high sound intensity. Among the many open questions is 
whether or not the enhancement of amplitude provided by the feed-forward of energy by the outer hair 
cells and the mechanical nonlinearity at low amplitudes of displacement provided by the ciliary gating 
can trigger significant streaming. It is clear from the anatomy Fig. 35. 1, that the motion and corresponding 
pressure in the fluid of the inner sulcus is the primary source of excitation for the inner hair cells. The 
DC pressure associated with DC streaming would be an important effect. 

35.7 Clinical Possibilities 

A better understanding of the cochlear mechanisms would be of clinical value. Auditory pathology related 
to the inner ear is discussed by Pickles [1988] and Gulick and colleagues [1989]. The spontaneous and 
stimulated emissions from the cochlea raise the possibility of diagnosing local inner ear problems, which 
is being pursued at many centers around the world. The capability for more accurate, physically realistic 
modeling of the cochlea should assist in this process. A significant step is provided by Zweig and Shera 
[ 1 995 ] , who find that a random distribution of irregularities in the properties along the cochlea explains 
much of the emissions. 

A patient with a completely nonfunctioning cochlea is referred to as having "nerve deafness". In fact, 
there is evidence that in many cases the nerves may be intact, while the receptor cells and organ of Corti 
are defective. For such patients, a goal is to restore hearing with cochlear electrode implants to stimulate 
the nerve endings directly. Significant progress has been made. However, despite electrode stimulation 
of nerves at the correct place along the cochlea for a high frequency, the perception of high frequency 
has not been achieved. So, although substantial advance in cochlear physiology has been made in the 
recent past, several such waves of progress may be needed to adequately understand the functioning of 
the cochlea. 
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The vestibular system is responsible for sensing motion and gravity and using this information for control 
of postural and body motion. This sense is also used to control eyes position during head movement, 
allowing for a clear visual image. Vestibular function is rather inconspicuous and for this reason is 
frequently not recognized for its vital roll in maintaining balance and equilibrium and in controlling eye 
movements. Vestibular function is truly a sixth sense, different from the five originally defined by Greek 
physicians. 

The vestibular system is named for its position within the vestibule of the temporal bone of the skull. 
It is located in the inner ear along with the auditory sense. The vestibular system has both central and 
peripheral components. This chapter deals with the mechanical sensory function of the peripheral end 
organ and its ability to measure linear and angular inertial motion of the skull over the frequency ranges 
encountered in normal activities. 

36.1 Structure and Function 

The vestibular system in each ear consists of the utricle and saccule (collectively called the otolithic organs) 
which are the linear motion sensors, and the three semicircular canals (SCCs) which sense rotational 
motion. The SCC are oriented in three nearly mutually perpendicular planes so that angular motion 
about any axis may be sensed. The otoliths and SCC consist of membranous structures which are situated 
in hollowed out sections of the temporal bone. This hollowed out section of the temporal bone is called 
the bony labyrinth, and the membranous labyrinth lies within this bony structure. The membranous 
labyrinth is filled with a fluid called endolymph, which is high in potassium, and the volume between 
the membranous and bony labyrinths is filled with a fluid called perilymph, which is similar to blood 
plasma. 

The otoliths sit within the utricle and saccule. Each of these organs is rigidly attached to the temporal 
bone of the skull with connective tissue. The three semicircular canals terminate on the utricle forming 
a complete circular fluid path, and the membranous canals are also rigidly attached to the bony skull. 
This rigid attachment is vital to the roll of measuring inertial motion of the skull. 
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Each SCC has a bulge called the ampulla near the one end, and inside the ampulla is the cupula, which 
is formed of saccharide gel. The capula forms a complete hermetic seal with the ampulla, and the cupula 
sits on top of the crista, which contains the sensory receptor cells called hair cells. These hair cells have 
small stereocilia (hairs) which extend into the cupula and sense its deformation. When the head is rotated 
the endolymph fluid, which fills the canal, tends to remain at rest due to its inertia, the relative flow of 
fluid in the canal deforms the cupula like a diaphragm, and the hair cells transduce the deformation into 
nerve signals. 

The otolithic organs are flat layered structures covered above with endolymph. The top layer consists 
of calcium carbonate crystals with otoconia which are bound together by a saccharide gel. The middle 
layer consists of pure saccharide gel, and the bottom layer consists of receptor hair cells which have 
stereocilia that extend into the gel layer. When the head is accelerated, the dense otoconial crystals tend 
to remain at rest due to their inertia as the sensory layer tends to move away from the otoconial layer. 
This relative motion between the octoconial layer deforms the gel layer. The hair cell stereocilia sense 
this deformation, and the receptor cells transduce this deformation into nerve signals. When the head is 
tilted, weight acting on the otoconial layer also will deform the gel layer. The hair cell stereocilia also 
have directional sensitivity which allows them to determine the direction of the acceleration acting in 
the plane of the otolith and saccule. The planes of the two organs are arranged perpendicular to each 
other so that linear acceleration in any direction can be sensed. The vestibular nerve, which forms half 
of the VIII cranial nerve, innervates all the receptor cells of the vestibular apparatus. 

36.2 Otolith Distributed Parameter Model 

The otoliths are an overdamped second-order system whose structure is shown in Fig. 36.1. In this model 
the otoconial layer is assumed to be a rigid and nondeformable, the gel layer is a deformable layer of 
isotropic viscoelastic material, and the fluid endolymph is assumed to be a newtonian fluid. A small 
element of the layered structure with surface area dA is cut from the surface, and a vertical view of this 
surface element, of width dx, is shown in Fig. 36.2. To evaluate the forces that are present, free-body 
diagrams are constructed of each elemental layer of the small differential strip. See the nomenclature 
table for a description of all variables used in the following formulas, and for derivation details see Grant 
and colleagues [1984, 1991]. 




(a) Top View 



(b) Cross-Section 

FIGURE 36.1 Schematic of the otolith organ: (a) Top view showing the peripheral region with differential area dA 
where the model is developed, (b) Cross-section showing the layered structure where dx is the width of the differential 
area dA shown in the top view at the left. 
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Endolymph Fluid Layer The spatial coordinate in 
the vertical direction is y f , and the velocity of the en- 
dolymph fluid is u(y fi ,t), a function of the fluid depth 
y f and time t. 



Otoconial Layer The otoconial layer with thickness 
b and at a height b above the gel layer vertical coordi- 
nate origin. The velocity of the otoconial layer is v(t), 
which is a function of time only. 




b- 



8. 



T ( dA 



TfdA 



v(t) 



TgdA 



Gel Layer Gel layer of thickness b, vertical coordi- 
nate y g , and horizontal coordinate 8 g the gel deflec- 
tion. The gel deflection is a function of both y g and 
time t. The velocity of the gel is w(y g> t), a function of 
y g andt. 




FIGURE 36.2 The free-body diagrams of each layer of the otolith with the forces that act on each layer. The interfaces 
are coupled by shear stresses of equal magnitude that act in opposite directions at each surface. The x shear stress 
acts between the gel-otoconial layer, and the Tracts between the fluid-otoconial layer. The forces acting at these interfaces 
are the product of shear stress x and area dA. The B x and W x forces are respectively the components of the buoyant 
and weight forces acting in the plane of the otoconial layer. See the nomenclature table for definitions of other variables. 

In the equation of motion for the endolymph fluid, the force x f dA acts on the fluid, at the fluid- 
otoconial layer interface. This shear stress x, is responsible for driving the fluid flow. The linear Navier- 
Stokes equations for an incompressible fluid are used to describe this endolymph flow. Expressions for 
the pressure gradient, the flow velocity of the fluid measured with respect to an inertial reference frame, 
and the force due to gravity (body force) are substituted into the Navier-Stokes equation for flow in the 
x-direction yielding 



du d 2 u 

' dt ' dy f 

with boundary and initial conditions: u(0, t) = v(f)> m(oo, t) = 0; u (y f , 0) = 0. 
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(36.1) 



The gel layer is treated as a Kelvin-Voight viscoelastic material where the gel shear stress has both an 
elastic component and a viscous component acting in parallel. This viscoelastic material model is sub- 
stituted into the momentum equation, and the resulting gel layer equation of motion is 



g dt Jo 



n 



dy 2 
's ) 



dt + \i 



d 2 w 



(36.2) 



with boundary and initial conditions: w(b,t) = v(t); w(0,f) = 0; w(y ,0) = 0; S (y„,0) = 0. The elastic term 
in the equation is written in terms of the integral of velocity with respect to time, instead of displacement, 
so the equation is in terms of a single dependent variable, the velocity. 

The otoconial layer equation was developed using Newton's second law of motion, equating the forces 
that act on the otoconial layer — fluid shear, gel shear, buoyancy, and weight — to the product of mass 
and inertial acceleration. The resulting otoconial layer equation is 
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(36.3) 



with the initial condition v(0) = 0. 

36.3 Nondimensionalization of the Motion Equations 

The equations of motion are then nondimensionalized to reduce the number of physical and dimensional 
parameters and combine them into some useful nondimensional numbers. The following nondimen- 
sional variables, which are indicated by overbars, are introduced into the motion equations: 



y f - 



y±_ 



p b 2 
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t u = — v = — w = — 
V V V 



(36.4) 



Several nondimensional parameters occur naturally as a part of the nondimensionalization process. These 
parameters are 
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(36.5) 



These parameters represent the following: R is the density ratio, s is a nondimensional elastic parameter, 
M is the viscosity ratio and represents a major portion of the system damping, and ~g x is the nondimen- 
sional gravity. 

The governing equations of motion in nondimensional form are then as follows. For the endolymph 
fluid layer 



t du d 2 u 
^~dyj 



(36.6) 



with boundary conditions of u (0, t) = v (f ) and u (<x>, t) = and initial conditions of u (y«0) = 0. For 
the otoconial layer 
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with an initial condition of v (0) = 0. For the gel layer 
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(36.8) 



with boundary conditions of w(l, f) = v(f) and w(0, f) = and initial conditions of w (/d.0) = 
and 5 g (y^ 0) = 0. 

These equations can be solved numerically for the case of a step change in velocity of the skull. This 
solution is shown in Figure 36.3 for a step change in the velocity of the head and a step change in the 
acceleration of the head, and can be found in Grant and Cotton [1991]. 



Otolith Model Short Time Solution for a Step Change in Velocity 
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FIGURE 36.3 This figure shows the time response of the otolith model for various values of the nondimensional 
parameters. Parts (a) through (d) show the response to a step change in the velocity of the head. This response can 
be thought of as a transient response since it is an impulse in acceleration. The step change has a nondimensional 
skull velocity of magnitude one. Parts (e) and (f ) show the response to a step change in the acceleration of the skull 
or simulates a constant acceleration stimulus. Again the magnitude of the nondimensional step change is one. (a) Short 
time response for various values of the damping parameter M. Note the effect on maximum displacement produced 
by this parameter, (b) Long time response for various M values. Note the small undershoot for M = 1, no conditions 
of an overdamped system were put on the model. As the system is highly overdamped this set of nondimensional 
parameters (M = 1, R = 0.75, and e = 0.2) would not be an acceptable solution, (c) Short time response for various 
values of e. Note that this elastic parameter does not have much effect on the otoconial layer maximum displacement 
for a step change in velocity solution. This is not true for a step change in acceleration, see part (f). (d) Effect of the 
mass or density parameter R on response. R changes the maximum otoconial layer displacement, (e) Response to a 
step change in acceleration for various values of the damping parameter M. Note the overshoot for M = 1. 
(f ) Acceleration step change time response for various values of the elastic parameter £. Note that this parameter affects 
the final displacement of the system for this type of constant stimulus (compare with part (c)). 
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Otolith Model Long Time Solution for a Step Change in Velocity 
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Otolith Model Short Term Solution for a Step Change in Velocity 



S 0.005 - 



1 I I 1 
: R=0.75: 






: 0=0.1 


' : 8=1.0 : e=u -^ 








e=2.0 








: 



























0.1 



0.4 0.5 0.6 0.7 

Nondimensional time 



(c) 
FIGURE 36.3 (continued) 



36.4 Otolith Transfer Function 

A transfer function of otoconial layer deflection related to skull acceleration can be obtained from the 
governing equations. For details of this derivation see Grant and colleagues [1994]. 

Starting with the nondimensional fluid and gel layer equations, taking the Laplace transform with 
respect to time, and using the initial conditions give two ordinary differential equations. These equations 
can then be solved, using the boundary conditions. Taking the Laplace transform of the otoconial layer 
motion equation, combining with the two differential equation solutions, and integrating otoconial layer 
velocity to get deflection produces the transfer function for displacement re acceleration 
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Otolith Model Short Term Solution for a Step Change in Velocity 
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Otolith Model Time Solution for a Step Change in Acceleration 
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FIGURE 36.3 (continued) 
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where the overbars denoting nondimensional variables have been dropped, s is the Laplace transform 
variable, and a general acceleration term A is defined as 
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Otolith Model Time Solution for a Step Change in Acceleration 
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FIGURE 36.3 (continued) 



A-- 



dV 

s_ 

dt 



(36.10) 



36.5 Otolith Frequency Response 



This transfer function can now be studied in the frequency domain. It should be noted that these are 
linear partial differential equations and that the process of frequency domain analysis is appropriate. The 
range of values of e = 0.01-0.2, M = 5-20, and R = 0.75 have been established by Grant and Cotton 
[1991] in a numerical finite difference solution of the governing equations. With these values established, 
the frequency response can be completed. 

In order to construct a magnitude- and phase-versus-frequency plot of the transfer function, the 
nondimensional time will be converted back to real time for use on the frequency axis. For the conversion 
to real time the following physical variables will be used: p = 1.35 g/cm 3 , b = 15 urn, \i, = 0.85 mPa-s. 
The general frequency response is shown in Fig. 36.4. The flat response from DC up to the first corner 
frequency establishes this system as an accelerometer. These are the range-of-motion frequencies encoun- 
tered in normal motion environments where this transducer is expected to function. 

The range of flat response can be easily controlled with the two parameters s and M. It is interesting 
to note that both the elastic term and the system damping are controlled by the gel layer, and thus an 
animal can easily control the system response by changing the parameters of this saccharide gel layer. 
The cross-linking of saccharide gels is extremely variable, yielding vastly different elastic and viscous 
properties of the resulting structure. 

The otoconial layer transfer function can be compared to recent data from single-fiber neural recording. 
The only discrepancy between the experimental data and theoretical model is a low-frequency phase lead 
and accompanying amplitude reduction. This has been observed in most experimental single-fiber 
recordings and has been attributed to the hair cell. 
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FIGURE 36.4 General performance of the octoconial layer transfer function shown for various values of the 
nondimensional elastic parameter E and M. For these evaluations the other parameter was held constant and R = 
0.75. The value of M = 0.1 shows an underdamped response which is entirely feasible with the model formulation, 
since no restriction was incorporated which limits the response to that of an overdamped system. 

36.6 Semicircular Canal Distributed Parameter Model 

The membranous SCC duct is modeled as a section of a rigid torus filled with an incompressible 
newtonian fluid. The governing equations of motion for the fluid are developed from the Navier-Stokes 
equations. Refer to the nomenclature section for definition of all variables, and Fig. 36.5 for a cross- 
section of the SCC and utricle. 
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FIGURE 36.5 Schematic structure of the semicircular canal showing a cross-section through the canal duct and 
utricle. Also shown in the upper right corner is a cross-section of the duct. R is the radius of curvature of the 
semicircular canal, a is the inside radius to the duct wall, and r is the spacial coordinate in the radial direction of 
the duct. 



We are interested in the flow of endolymph fluid with respect to the duct wall, and this requires that 
the inertial motion of the duct wall RD. be added to the fluid velocity u measured with respect to the 
duct wall. The curvature of the duct can be shown to be negligible since a <§ R, and no secondary flow 
is induced; thus the curve duct can be treated as straight. Pressure gradients arise from two sources in 
the duct: (1) the utricle, (2) the cupula. The cupula when deflected exerts a restoring force on the 
endolymph. The cupula can be modeled as a membrane with linear stiffness K = Ap/AV, where Ap is the 
pressure difference across the cupula and A Vis the volumetric displacement, where 



AV = 271 



fJ/M 



rdr dt 



(36.11) 



If the angle subtended by the membranous duct is denoted by (3, the pressure gradient in the duct 
produced by the cupula is 



dp _KAV 
The utricle pressure gradient can be approximated [see Van Buskirk, 1977] by 
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(36.12) 
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When this information is substituted into the Navier-Stokes equation, the following governing equation 
for endolymph flow relative to the duct wall is obtained: 
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This equation can be nondimensionalized using the following nondimensional variables denoted by 
overbars 
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where Q is a characteristic angular velocity of the canal. In terms of the nondimensional variables, the 
governing equation for endolymph flow velocity becomes 
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where the nondimensional parameter e is defined by 
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The boundary conditions for this equation are as follows 



"M- SM 



and the initial condition is u (r, 0) = 0. 

36.7 Semicircular Canal Frequency Response 

To examine the frequency response of the SCC, we will first get a solution to the nondimensional canal 
equation for the case of a step change in angular velocity of the skull. A step in angular velocity corresponds 
to an impulse in angular acceleration, and in dimensionless form this impulse is 



a(f) = -Q8(f) 



(36.18) 



where 8(f) is the unit impulse function and Q is again a characteristic angular velocity of the canal. The 
nondimensional volumetric displacement is defined as 



(()= lurdn dt 



(36.19) 



the dimensional volumetric displacement is given by AV = {AnRQ.d l lv)§ and the solution (for e <^ 1) is 
given by 
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(36.20) 
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where X n represents the roots of the equation J (x) = 0, where J B is the Bessel function of zero order (X 1 = 
2.405, X 2 = 5.520), and for infinite time ()) = 7i/8p\ For details of the solution see Van Buskirk and Grant 
[1987] and Van Buskirk and coworkers [1978]. 

A transfer function can be developed from the previous solution for a step change in angular velocity 
of the canal. The transfer function of mean angular displacement of endolymph 8 related to co, the angular 
velocity of the head, is 
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(36.21) 



where x L = 8pv[3-R/K7ia 4 , t s = a 2 /\\v, and s is the Laplace transform variable. 

The utility of the above transfer function is apparent when used to generate the frequency response 
of the system. The values for the various parameters are as follows: a = 0.15 mm, R = 3.2 mm, the 
dynamic viscosity of endolymph u = 0.85 mPa-s (v= u/p), p = 1000 kg/m 3 , (3 = 1.4k, and K = 3.4 GPa/m 3 . 
This produces values of the two time constants of T L = 20.8 s and t s = 0.00385 s. The frequency response 
of the system can be see in Fig. 36.6. The range of frequencies from 0.01 Hz to 30 Hz establishes the 
SCCs as angular velocity transducers of head motion. This range of frequencies is that encountered in 
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FIGURE 36.6 Frequency response of the human semicircular canals for the transfer function of mean angular 
displacement of endolymph fluid 6 related to angular velocity of the head w. 
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everyday movement. Environments such as an aircraft flight can produce frequencies outside the linear 
range for these transducers. 

Rabbit and Damino [1992] have modeled the flow of endolymph in the ampulla and its interaction 
with a cupula. This model indicates that the cupula in the mechanical system appears to add a high- 
frequency gain enhancement as well as phase lead over previous mechanical models. This is consistent 
with measurements of vestibular nerve recordings of gain and phase. Prior to this work this gain and 
phase enhancement were thought to be of hair cell origin. 

Defining Terms 

Endolymph: Fluid similar to intercellular fluid (high in potassium) which fills the membranous laby- 
rinth, canals, utricle, and saccule. 

Kelvin- Voight viscoelastic material: The simplest of solid materials which have both elastic and viscous 
responses of deformation. The viscous and elastic responses appear to act in parallel. 

Otolith: Linear accelerometers of the vestibular system whose primary transduced signal is the sum of 
linear acceleration and gravity in the frequency range from DC (static) up to the maximum 
experienced by an animal. 

Semicircular canals: Angular motion sensors of the vestibular system whose primary transduced signal 
is angular velocity in the frequency range of normal animal motion. 
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Nomenclature 

Otolith Variables 

x = coordinate direction in the plane of the otoconial layer 

y = coordinate direction normal to the plane of the otolith with origin at the gel base 
y, = coordinate direction normal to the plane of the otolith with origin at the fluid base 
t = time 
u(yat) = velocity of the endolymph fluid measured with respect to the skull 

v(t) = velocity of the otoconial layer measured with respect to the skull 
w{y„,t) = velocity of the gel layer measured with respect to the skull 
8„{y„,t) = displacement of the gel layer measured with respect to the skull 

8 = displacement of the otoconial layer measured with respect to the skull 

V s = skull velocity in the x direction measured with respect to an inertial reference frame 

V = a characteristic velocity of the skull in the problem (magnitude of a step change) 
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p = density of the otoconial layer 
p, = density of the endolymph fluid 
x = gel shear stress in the x direction 
u = viscosity of the gel material 
[if = viscosity of the endolymph fluid 
G = shear modulus of the gel material 
b = gel layer and otoconial layer thickness (assumed equal) 
g x = gravity component in the x-direction 

Semicircular Canal Variables 

u(r,t) = velocity of endolymph fluid measured with respect to the canal wall 
r = radial coordinate of canal duct 
a = inside radius of the canal duct 
R = radius of curvature of semicircular canal 
p = density of endolymph fluid 
v = endolymph kinematic viscosity 

co = angular velocity of the canal wall measured with respect to an inertial frame 
a = angular acceleration of the canal wall measured with respect to an inertial frame 
K = pressure-volume modulus of the cupula = Ap/AV 
Ap = differential pressure across the cupula 
AF = volumetric displacement of endolymph fluid 
P = angle subtended by the canal in radians (P = jt for a true semicircular canal) 
X„ = roots of / (x) = 0, where J a is Bessel's function of order (X; = 2.405, X 2 = 5.520) 
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ABIOMATERIAL IS USED to make devices to replace a part or a function of the body in a safe, 
reliable, economic, and physiologically acceptable manner [Hench and Erthridge, 1982] . A variety 
of devices and materials are used in the treatment of disease or injury. Commonplace examples 
include sutures, needles, catheters, plates, tooth fillings, etc. A biomaterial is a synthetic material used to 
replace part of a living system or to function in intimate contact with living tissue. The Clemson University 
Advisory Board for Biomaterials has formally defined a biomaterial to be "a systemically and pharma- 
cologically inert substance designed for implantation within or incorporation with living systems." Black 
defined biomaterials as "a nonviable material used in a medical device, intended to interact with biological 
systems" [Black, 1992]. Others include "materials of synthetic as well as of natural origin in contact with 
tissue, blood, and biological fluids, and intended for use for prosthetic, diagnostic, therapeutic, and 
storage applications without adversely affecting the living organism and its components" [Bruck, 1980]. 
Still another definition of biomaterials is stated as "any substance (other than drugs) or combination of 
substances, synthetic or natural in origin, which can be used for any period of time, as a whole or as a 
part of a system which treats, augments, or replaces any tissue, organ, or function of the body" [Williams, 
1987] adds to the many ways of looking the same but expressing in different ways. By contrast, a biological 
material is a material such as skin or artery, produced by a biological system. Artificial materials which 
simply are in contact with the skin, such as hearing aids and wearable artificial limbs are not included 
in our definition of biomaterials since the skin acts as a barrier with the external world. 

According to these definitions one must have a vast field of knowledge or collaborate with different 
specialties in order to develop and use biomaterials in medicine and dentistry as Table IV.l indicates. 
The uses of biomaterials, as indicated in Table IV.2, include replacement of a body part which has lost 
function due to disease or trauma, to assist in healing, to improve function, and to correct abnormalities. 
The role of biomaterials has been influenced considerably by advances in many areas of biotechnology 
and science. For example, with the advent of antibiotics, infectious disease is less of a threat than in 
former times, so that degenerative disease assumes a greater importance. Moreover, advances in surgical 
technique and instruments have permitted materials to be used in ways which were not possible previ- 
ously. This section of the handbook is intended to develop in the reader a familiarity with the uses of 
materials in medicine and dentistry and with some rational basis for these applications. 

TABLE IV.l Fields of Knowledge to Develop Biomaterials 



Discipline 



Examples 



Science and Materials sciences: structure-property relationship of synthetic and biological materials including metals, 
engineering ceramics, polymers, composites, tissues (blood and connective tissues), etc. 

Biology and Cell and molecular biology, anatomy, animal and human physiology, histopathology, experimental surgery, 
physiology immunology, etc. 

Clinical All the clinical specialties: dentistry, maxillofacial, neurosurgery, obstetrics and gynecology, ophthalmology, 
sciences orthopedics, otolaryngology, plastic and reconstructive surgery, thoracic and cardiovascular surgery, 

veterinary medicine, and surgery, etc. 



Source: Modified from von Recum [1994]. 



TABLE IV.2 Uses of Biomaterials 



Problem Area 

Replacement of diseased or damaged part 

Assist in healing 

Improve function 

Correct functional abnormality 

Correct cosmetic problem 

Aid to diagnosis 

Aid to treatment 



Examples 

Artificial hip joint, kidney dialysis machine 

Sutures, bone plates, and screws 

Cardiac pacemaker, intraocular lens 

Cardiac pacemaker 

Augmentation mammoplasty, chin augmentation 

Probes and catheters 

Catheters, drains 
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The performance of materials in the body can be classified in many ways. First, biomaterials may be 
considered from the point of view of the problem area which is to be solved, as in Table IV.2. Second, 
we may consider the body on a tissue level, an organ level (Table IV.3), or a system level (Table IV.4). 
Third, we may consider the classification of materials as polymers, metals, ceramics, and composites as 
is done in Table IV.5. In that vein, the role of such materials as biomaterials is governed by the interaction 
between the material and the body, specifically, the effect of the body environment on the material, and 
the effect of the material on the body [Black, 1992; Bruck, 1980; Greco, 1994; Hench and Erthridge, 1982; 
Park and Lakes, 1992; von Recum, 1986; Williams and Roaf, 1973]. 

It should be evident from any of these perspectives that most current applications of biomaterials 
involve structural functions even in those organs and systems which are not primarily structural in their 
nature, or very simple chemical or electrical functions. Complex chemical functions such as those of the 
liver, and complex electrical or electrochemical functions such as those of the brain and sense organs, 
cannot be carried out by biomaterials at this time. 

TABLE IV.3 Biomaterials in Organs 



Organ Examples 

Heart Cardiac pacemaker, artificial heart valve, total artificial heart 

Lung Oxygenator machine 

Eye Contact lens, intraocular lens 

Ear Artificial stapes, cochlea implant 

Bone Bone plate, intramedullary rod 

Kidney Kidney dialysis machine 

Bladder Catheter and stent 



TABLE IV.4 Biomaterials in Body Systems 



System 



Examples 



Skeletal Bone plate, total joint replacements 

Muscular Sutures, muscle stimulator 

Circulatory Artificial heart valves, blood vessels 

Respiratory Oxygenator machine 

Integumentary Sutures, burn dressings, artificial skin 

Urinary Catheters, stent, kidney dialysis machine 

Nervous Hydrocephalus drain, cardiac pacemaker, nerve stimulator 

Endocrine Microencapsulated pancreatic islet cells 

Reproductive Augmentation mammoplasty, other cosmetic replacements 



TABLE IV.5 Materials for Use in the Body 



Materials Advantages 

Polymers (nylon, silicone rubber, Resilient 

polyester, polytetrafuoroethylene, Easy to fabricate 
etc.) 

Metals (Ti and its alloys, Co-Cr alloys, Strong, tough 

stainless steels, Au, Ag, Pt, etc.) ductile 

Ceramics (aluminum oxide, calcium Very biocompatible, 



phosphates including 
hydroxyapatite, carbon) 

Composites (carbon-carbon, wire or 
fiber reinforced bone cement) 



Inert 
Strong in 

compression 
Strong, tailor-made 



Disadvantages 

Not strong 

Deforms with time 

May degrade 

May corrode 

Dense 

Difficult to make 

Brittle 

Not resilient 

Difficult to make 

Difficult to make 



Examples 

Sutures, blood vessels, hip socket, 
ear, nose, other soft tissues, 
sutures 

Joint replacements, bone plates 
and screws, dental root 
implants, pacer and suture wires 

Dental; femoral head of hip 
replacement, coating of dental 
and orthopedic implants 

Joint implants, heart valves 
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Historical Background 

The use of biomaterials did not become practical until the advent of an aseptic surgical technique 
developed by Dr. J. Lister in the 1860s. Earlier surgical procedures, whether they involved biomaterials 
or not, were generally unsuccessful as a result of infection. Problems of infection tend to be exacerbated 
in the presence of biomaterials, since the implant can provide a region inaccessible to the body's immu- 
nologically competent cells. The earliest successful implants, as well as a large fraction of modern ones, 
were in the skeletal system. Bone plates were introduced in the early 1900s to aid in the fixation of long 
bone fractures. Many of these early plates broke as a result of unsophisticated mechanical design; they 
were too thin and had stress concentrating corners. Also, materials such as vanadium steel which was 
chosen for its good mechanical properties, corroded rapidly in the body and caused adverse effects on 
the healing processes. Better designs and materials soon followed. Following the introduction of stainless 
steels and cobalt chromium alloys in the 1930s, greater success was achieved in fracture fixation, and the 
first joint replacement surgeries were performed. As for polymers, it was found that warplane pilots in 
World War II who were injured by fragments of plastic (polymethyl methacrylate) aircraft canopy, did 
not suffer adverse chronic reactions from the presence of the fragments in the body. Polymethyl meth- 
acrylate (PMMA) became widely used after that time for corneal replacement and for replacements of 
sections of damaged skull bones. Following further advances in materials and in surgical technique, blood 
vessel replacements were tried in the 1950s; and heart valve replacements and cemented joint replacements 
in the 1960s. Table IV.6 lists notable developments relating to implants. Recent years have seen many 
further advances. 



TABLE IV.6 Notable Developments Relating to Implants 



Year 



Investigators 



Development 



Late 18th-19th century 



1860-1870 


J. Lister 


1886 


H. Hansmann 


1893-1912 


W.A. Lane 


1912 


W.D. Sherman 


1924 


A.A. Zierold 


1926 


M.Z. Lange 


1926 


E.W. Hey-Groves 


1931 


M.N. Smith-Petersen 


1936 


C.S. Venable, W.G. Stuck 


1938 


P. Wiles 


1939 


J.C. Burch, H.M Carney 


1946 


J. and R. Judet 


1940s 


M.J. Dorzee, A. Franceschetti 


1947 


J. Cotton 


1952 


A.B. Voorhees, A. Jaretzta, 




A.B. Blackmore 


1958 


S. Furman, G. Robinson 


1958 


J. Charnley 


1960 


A. Starr, M.L. Edwards 


1970s 


W.J. Kolff 



Various metal devices to fix bone fractures; wires and pins from Fe, Au, Ag, 

and Pt. 
Aseptic surgical techniques 
Ni-plated steel bone fracture plate 
Steel screws and plates (Lane fracture plate) 
Vanadium steel plates, first developed for medical use; lesser stress 

concentration and corrosion (Sherman plate) 
Introduced Stellites® (CoCrMo alloy) 

Introduced 18-8sMo stainless steel, better than 18-8 stainless steel 
Used carpenter's screw for femoral neck fracture 
First femoral neck fracture fixation device made of stainless steel 
Introduced Vitallium* (19-9 stainless steel), later changed the material to 

CoCr alloys 
First total hip replacement prosthesis 
Introduced tantalum (Ta) 
First biomechanically designed femoral head replacement prosthesis. First 

plastics (PMMA) used in joint replacements. 
First used acrylics(PMMA) for corneal replacement 
Introduced Ti and its alloys. 
First successful blood vessel replacement made of cloth for issue ingrowth. 

First successful direct heart stimulation 

First use of acrylic bone cement in total hip replacement on the advice of 

Dr. D. Smith. 
First commercial heart valves 
Total heart replacement 



Source: Park [1984]. 
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Performance of Biomaterials 

The success of a biomaterials in the body depends on factors such as the material properties, design, and 
biocompatibility of the material used, as well as other factors not under control of the engineer, including 
the technique used by the surgeon, the health and condition of the patient, and the activities of the 
patient. If we can assign a numerical value /to the probability of failure of an implant, then the reliability 
can be expressed as 

r = 1 -f (IV. 1) 

If, as is usually the case, there are multiple modes of failure, the total reliability r t is given by the 
product of the individual reliabilities r 1 = (1 - fj), etc. 

r, = rrr 2 ---r n (IV.2) 

Consequently, even if one failure mode such as implant fracture is perfectly controlled so that the 
corresponding reliability is unity, other failure modes such as infection could severely limit the utility 
represented by the total reliability of the implant. One mode of failure which can occur in a biomaterial, 
but not in engineering materials used in other contexts, is an attack by the body's immune system on 
the implant. Another such failure mode is an unwanted effect of the implant upon the body; for example, 
toxicity, inducing allergic reactions, or causing cancer. Consequently, biocompatibility is included as a 
material requirement in addition to those requirements associated directly with the function of the 
implant. 

Biocompatibility involves the acceptance of an artificial implant by the surrounding tissues and by the 
body as a whole. Biocompatible materials do not irritate the surrounding structures, do not provoke an 
abnormal inflammatory response, do not incite allergic or immunologic reactions, and do not cause 
cancer. Other compatibility characteristics which may be important in the function of an implant device 
made of biomaterials include ( 1 ) adequate mechanical properties such as strength, stiffness, and fatigue 
properties; (2) appropriate optical properties if the material is to be used in the eye, skin, or tooth; and 
(3) appropriate density. Sterilizability, manufacturability, long term storage, and appropriate engineering 
design are also to be considered. 

The failure modes may differ in importance as time passes following the implant surgery. For example, 
consider the case of a total joint replacement in which infection is most likely soon after surgery, while 
loosening and implant fracture become progressively more important as time goes on. Failure modes 
also depend on the type of implant and its location and function in the body. For example, an artificial 
blood vessel is more likely to cause problems by inducing a clot or becoming clogged with thrombus 
than by breaking or tearing mechanically. 

Defining Terms 

Biomaterial: A synthetic material used to make devices to replace part of a living system or to function 

in intimate contact with living tissue. 
Biological material: A material produced by a biological system. 
Biocompatibility: Acceptance of an artificial implant by the surrounding tissues and by the body as a 

whole. 
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37.1 Introduction 







Metals are used as biomaterials due to their excellent electrical and thermal conductivity and mechanical 
properties. Since some electrons are independent in metals, they can quickly transfer an electric charge 
and thermal energy. The mobile free electrons act as the binding force to hold the positive metal ions 
together. This attraction is strong, as evidenced by the closely packed atomic arrangement resulting in 
high specific gravity and high melting points of most metals. Since the metallic bond is essentially 
nondirectional, the position of the metal ions can be altered without destroying the crystal structure 
resulting in a plastically deformable solid. 

Some metals are used as passive substitutes for hard tissue replacement such as total hip and knee 
joints, for fracture healing aids as bone plates and screws, spinal fixation devices, and dental implants 
because of their excellent mechanical properties and corrosion resistance. Some metallic alloys are used 
for more active roles in devices such as vascular stents, catheter guide wires, orthodontic archwires, and 
cochlea implants. 

The first metal alloy developed specifically for human use was the "vanadium steel" which was used 
to manufacture bone fracture plates (Sherman plates) and screws. Most metals such as iron (Fe), chro- 
mium (Cr), cobalt (Co), nickel (Ni), titanium (Ti), tantalum (Ta), niobium (Nb), molybdenum (Mo), 
and tungsten (W), that were used to make alloys for manufacturing implants can only be tolerated by 
the body in minute amounts. Sometimes those metallic elements, in naturally occurring forms, are 
essential in red blood cell functions (Fe) or synthesis of a vitamin B 12 (Co), but cannot be tolerated in 
large amounts in the body [Black, 1992]. The biocompatibility of the metallic implant is of considerable 
concern because these implants can corrode in an in vivo environment [Williams, 1982]. The conse- 
quences of corrosion are the disintegration of the implant material per se, which will weaken the implant, 
and the harmful effect of corrosion products on the surrounding tissues and organs. 
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37.2 Stainless Steels 



The first stainless steel utilized for implant fabrication was the 18-8 (type 302 in modern classification), 
which is stronger and more resistant to corrosion than the vanadium steel. Vanadium steel is no longer 
used in implants since its corrosion resistance is inadequate in vivo. Later 18-8sMo stainless steel was 
introduced which contains a small percentage of molybdenum to improve the corrosion resistance in 
chloride solution (saltwater). This alloy became known as type 316 stainless steel. In the 1950s the carbon 
content of 316 stainless steel was reduced from 0.08 to a maximum amount of 0.03% (all are weight 
percent unless specified) for better corrosion resistance to chloride solution and to minimize the sensi- 
tization and hence, became known as type 316L stainless steel. The minimum effective concentration of 
chromium is 11% to impart corrosion resistance in stainless steels. The chromium is a reactive element, 
but it and its alloys can be passivated by 30% nitric acid to give excellent corrosion resistance. 

The austenitic stainless steels, especially type 316 and 
316L, are most widely used for implant fabrication. These 
cannot be hardened by heat treatment but can be hardened 
by cold-working. This group of stainless steels is nonmag- 
netic and possesses better corrosion resistance than any 
others. The inclusion of molybdenum enhances resistance 
to pitting corrosion in salt water. The American Society of 
Testing and Materials (ASTM) recommends type 316L 
rather than 316 for implant fabrication. The specifications 
for 316L stainless steel are given in Table 37.1. The only 
difference in composition between the 316L and 316 stain- 
less steel is the maximum content of carbon, i.e., 0.03% 
and 0.08%, respectively, as noted earlier. 

The nickel stabilizes the austenitic phase [y, face cen- 
tered cubic crystal (fee) structure], at room temperature 

and enhances corrosion resistance. The austenitic phase formation can be influenced by both the Ni and 
Cr contents as shown in Fig. 37.1 for 0.10% carbon stainless steels. The minimum amount of Ni for 
maintaining austenitic phase is approximately 10%. 



TABLE 37.1 Compositions of 316L 
Stainless Steel [American Society for Testing 
and Materials, F139-86, p.61, 1992] 



Element 


Composition(%) 


Carbon 


0.03 max 


Manganese 


2.00 max 


Phosphorus* 


0.03 max 


Sulfur 


0.03 max 


Silicon 


0.75 max 


Chromium 


17.00-20.00 


Nickel 


12.00-14.00 


Molybdenum 


2.00-4.00 



Ni 
(w/o) 



25 
20 

15 
10 

5 




Ferri ti c or 
intermediate structure" 



10 15 

Cr(w/o) 



20 25 



FIGURE 37.1 The effect of Ni and Cr contents on the austenitic phase of stainless steels containing 0. 1 % C [ Keating 
19561. 
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TABLE 37.2 Mechanical Properties of 3 1 6L Stainless Steel for Implants [American 
Society for Testing and Materials, F139-86, p.61, 1992] 



Condition 



Ultimate Tensile 

Strength, min. 

(MPa) 



Yield Strength 

(0.2% offset), min. 

(MPa) 



Elongation 
2 in.(50.8mm) Rockwell 
min. % Hardness 



Annealed 
Cold-worked 



485 
860 



172 
690 



40 
12 



95HRB 



Table 37.2 gives the mechanical proper- 
ties of 316L stainless steel. A wide range of 
properties exists depending on the heat 
treatment (annealing to obtain softer mate- 
rials) or cold working (for greater strength 
and hardness). Figure 37.2 shows the effect 
of cold working on the yield and ultimate 
tensile strength of 18-8 stainless steels. The 
engineer must consequently be careful when 
selecting materials of this type. Even the 
316L stainless steels may corrode inside the 
body under certain circumstances in a 
highly stressed and oxygen depleted region, 
such as the contacts under the screws of the 
bone fracture plate. Thus, these stainless 
steels are suitable to use only in temporary 
implant devices such as fracture plates, 
screws, and hip nails. Surface modification 
methods such as anodization, passivation, 
and glow-discharge nitrogen-implantation, 
are widely used in order to improve corro- 
sion resistance, wear resistance, and fatigue 
strength of 316L stainless steel [Bordiji et al., 
1996]. 
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FIGURE 37.2 Effect of cold-work on the yield and ultimate 
tensile strength of 18-8 stainless steel [ASTM, 1980]. 



37.3 CoCr Alloys 



There are basically two types of cobalt-chromium alloys: (1) the castable CoCrMo alloy and (2) the 
CoNiCrMo alloy which is usually wrought by (hot) forging. The castable CoCrMo alloy has been used 
for many decades in dentistry and, relatively recently, in making artificial joints. The wrought CoNiCrMo 
alloy is relatively new, now used for making the stems of prostheses for heavily loaded joints such as the 
knee and hip. 

The ASTM lists four types of CoCr alloys which are recommended for surgical implant applications: 
(1) cast CoCrMo alloy (F75), (2) wrought CoCrWNi alloy (F90), (3) wrought CoNiCrMo alloy (F562), 
and (4) wrought CoNiCrMoWFe alloy (F563). The chemical compositions of each are summarized in 
Table 37.3. At the present time only two of the four alloys are used extensively in implant fabrications, 
the castable CoCrMo and the wrought CoNiCrMo alloy. As can be noticed from Table 37.3, the compo- 
sitions are quite different from each other. 

The two basic elements of the CoCr alloys form a solid solution of up to 65% Co. The molybdenum 
is added to produce finer grains which results in higher strengths after casting or forging. The chromium 
enhances corrosion resistance as well as solid solution strengthening of the alloy. 
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TABLE 37.3 Chemical Compositions of Co-Cr Alloys [American Society for Testing and 
Materials, F75-87, p.42; F90-87, p.47; F562-84, p. 150, 1992] 





CoCrMo (F75) 


CoCrWNi (F90) 


CoNiCrMo (F562) 


CoNiCrMoWFe (F563) 


Element 


Min. 


Max. 


Min. 


Max. 


Min. 


Max. 


Min. 


Max. 


Cr 


27.0 


30.0 


19.0 


21.0 


19.0 


21.0 


18.00 


22.00 


Mo 


5.0 


7.0 


— 


— 


9.0 


10.5 


3.00 


4.00 


Ni 


— 


2.5 


9.0 


11.0 


33.0 


37.0 


15.00 


25.00 


Fe 


— 


0.75 


— 


3.0 


— 


1.0 


4.00 


6.00 


C 


— 


0.35 


0.05 


0.15 


— 


0.025 


— 


0.05 


Si 


— 


1.00 


— 


1.00 


— 


0.15 


— 


0.50 


Mn 


— 


1.00 


— 


2.00 


— 


0.15 


— 


1.00 


W 


— 


— 


14.0 


16.0 


— 


— 


3.00 


4.00 


P 


— 


— 


— 


— 


— 


0.015 


— 


— 


s 


— 


— 


— 


— 


— 


0.010 


— 


0.010 


Ti 


— 


— 


— 


— 


— 


1.0 


0.50 


3.50 


Co 






Balance 













ULTIMATE 

TESILE 2-) 
STRENGTH 
(GPa) 



CoNiCrMo 



15 25 35 45 

REDUCTION (%) 



55 



65 



75 



FIGURE 37.3 Relationship between ultimate tensile and the amount of cold-work for CoNiCrMo alloy [Devine 
andWulff, 1975], 

The CoNiCrMo alloy originally called MP35N (Standard Pressed Steel Co.) contains approximately 
35% Co and Ni each. The alloy is highly corrosion resistant to seawater (containing chloride ions) under 
stress. Cold working can increase the strength of the alloy considerably as shown in Fig. 37.3. However, 
there is a considerable difficulty of cold working on this alloy, especially when making large devices such 
as hip joint stems. Only hot-forging can be used to fabricate a large implant with the alloy. 

The abrasive wear properties of the wrought CoNiCrMo alloy are similar to the cast CoCrMo alloy 
(about 0.14 mm/yr in joint simulation tests with ultra high molecular weight polyethylene acetabular 
cup); however, the former is not recommended for the bearing surfaces of joint prosthesis because of its 
poor frictional properties with itself or other materials. The superior fatigue and ultimate tensile strength 
of the wrought CoNiCrMo alloy make it suitable for the applications which require long service life 
without fracture or stress fatigue. Such is the case for the stems of the hip joint prostheses. This advantage 
is better appreciated when the implant has to be replaced, since it is quite difficult to remove the failed 
piece of implant embedded deep in the femoral medullary canal. Furthermore, the revision arthroplasty 
is usually inferior to the primary surgery in terms of its function due to poorer fixation of the implant. 

The mechanical properties required for CoCr alloys are given in Table 37.4. As with the other alloys, 
the increased strength is accompanied by decreased ductility. Both the cast and wrought alloys have 
excellent corrosion resistance. 
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TABLE 37.4 Mechanical Property Requirements of Co-Cr Alloys (ASTM F76, F90, 
F562) [American Society for Testing and Materials, F75-87, p.42; F90-87, p.47; 
F562-84, p.150, 1992] 







Wrought 


Wrought CoNiCrMo (F562) 




Cast CoCrMo 


CoCrWNi 


Solution 
Annealed 


Cold-Worked 


Property 


(F75) 


(F90) 


and Aged 


Tensile strength 


655 


860 


793-1000 


1793 min. 


(MPa) 










Yield strength 










(0.2% offset) (MPa) 


450 


310 


240-655 


1585 


Elongation (%) 


8 


10 


50.0 


8.0 


Reduction 










of area (%) 


8 


— 


65.0 


35.0 


Fatigue strength (MPa) a 


310 


— 


— 


— 



" From Semlitsch [1980]. 

Experimental determination of the rate of nickel release from the CoNiCrMo alloy and 3f6L stainless 
steel in 37°C Ringer's solution showed an interesting result. Although the cobalt alloy has more initial 
release of nickel ions into the solution, the rate of release was about the same (3 x 10~ 10 g/cm 2 /day) for 
both alloys [Richards-Mfg-Company, f980]. This is rather surprising since the nickel content of the 
CoNiCrMo alloy is about three times that of 3f6L stainless steel. 

The metallic products released from the prosthesis because of wear, corrosion, and fretting may impair 
organs and local tissues. In vitro studies have indicated that particulate Co is toxic to human osteoblast- 
like cell lines and inhibits synthesis of type-1 collagen, osteocalcin and alkaline phosphatase in the culture 
medium. However, particulate Cr and CoCr alloy are well tolerated by cell lines with no significant 
toxicity. The toxicity of metal extracts in vitro have indicated that Co and Ni extracts at 50% concentration 
appear to be highly toxic since all viability parameters were altered after 24 h. However, Cr extract seems 
to be less toxic than Ni and Co [Granchi et al., f 996]. 

The modulus of elasticity for the CoCr alloys does not change with the changes in their ultimate tensile 
strength. The values range from 220 to 234 GPa which are higher than other materials such as stainless 
steels. This may have some implications of different load transfer modes to the bone in artificial joint 
replacements, although the effect of the increased modulus on the fixation and longevity of implants is 
not clear. Low wear (average linear wear on the MeKee-Farrar component was 4.2 um/yr) has been 
recognized as an advantage of metal-on-metal hip articulations because of its hardness and toughness 
[Schmalzried et al, 1996]. 

37.4 Ti Alloys 



Pure Ti and Ti6Al4V 

Attempts to use titanium for implant fabrication dates to the late 
1930s. It was found that titanium was tolerated in cat femurs, as 
was stainless steel and Vitallium® (CoCrMo alloy). Titanium's light- 
ness [4.5 g/cm 3 , see Table 37.5] and good mechanochemical prop- 
erties are salient features for implant application. 

There are four grades of unalloyed commercially pure (cp) 
titanium for surgical implant applications as given in Table 37.6. 
The impurity contents separate them; oxygen, iron, and nitrogen 
should be controlled carefully. Oxygen in particular has a great 
influence on the ductility and strength. 



TABLE 37.5 Specific Gravities 
of Some Metallic Implant Alloys 



Alloys 


Density (g/cm 3 ) 


Ti and its alloys 


4.5 


316 Stainless steel 


7.9 


CoCrMo 


8.3 


CoNiCrMo 


9.2 


NiTi 


6.7 
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TABLE 37.6 Chemical Compositions of Titanium and its Alloy 
[American Society for Testing and Materials, F67-89, p.39; 
F136-84, p.55, 1992] 



Element 


Grade 1 


Grade 2 


Grade 3 


Grade 4 


Ti6A14V 


Nitrogen 


0.03 


0.03 


0.05 


0.05 


0.05 


Carbon 


0.10 


0.10 


0.10 


0.10 


0.08 


Hydrogen 


0.015 


0.015 


0.015 


0.015 


0.0125 


Iron 


0.20 


0.30 


0.30 


0.50 


0.25 


Oxygen 


0.18 


0.25 


0.35 


0.40 


0.13 


Titanium 






Balance 







> Aluminum 6.00% (5.50-6.50), vanadium 4.00% (3.50-4.50), and 
other elements 0.1% maximum or 0.4% total. 
All are maximum allowable weight percent. 



TABLE 37.7 Mechanical Properties of Ti and its Alloys (ASTM F136) [American Society for Testing 
and Materials, F67-89, p.39; F136-84, p.55, 1992 and [Davidson et al„ 1994] 













Ti6A14 




Properties 


Grade 1 


Grade 2 


Grade 3 


Grade 4 


V 


Til3Nbl3Zr 


Tensile strength (MPa) 


240 


345 


450 


550 


860 


1030 


Yield strength (0.2% offset) (MPa) 


170 


275 


380 


485 


795 


900 


Elongation (%) 


24 


20 


18 


15 


10 


15 


Reduction of area (%) 


30 


30 


30 


25 


25 


45 



One titanium alloy (Ti6Al4V) is widely used to manufacture implants and its chemical requirements 
are given in Table 37.7. The main alloying elements of the alloy are aluminum (5.5-6.5%) and vanadium 
(3.5-4.5%). The Ti6Al4V alloy has approximately the same fatigue strength (550 MPa) of CoCr alloy 
after rotary bending fatigue tests [Imam et al., 1983]. Titanium is an alio tropic material, which exists as 
a hexagonal close packed structure (hep, ct-Ti) up to 882°C and body-centered cubic structure (bec, P-Ti) 
above that temperature. Titanium alloys can be strengthened and mechanical properties varied by con- 
trolled composition and thermomechanical processing techniques. The addition of alloying elements to 
titanium enables it to have a wide range of properties: (1) Aluminum tends to stabilize the a-phase, that 
is increase the transformation temperature from a- to (3-phase (Fig. 37.4). (2) Vanadium stabilizes the 
P-phase by lowering the temperature of the transformation from a to p. 

The a-alloy has a single-phase microstructure (Fig. 37.5a) which promotes good weldability. The 
stabilizing effect of the high aluminum content of these groups of alloys makes excellent strength char- 
acteristics and oxidation resistance at high temperature (300~600°C). These alloys cannot be heat-treated 
for precipitation hardening since they are single-phased. 

The addition of controlled amounts of P-stabilizers causes the higher strength P-phase to persist below 
the transformation temperature which results in the two-phase system. The precipitates of P-phase will 
appear by heat treatment in the solid solution temperature and subsequent quenching, followed by aging 
at a somewhat lower temperature. The aging cycle causes the coherent precipitation of some fine a 
particles from the metastable P, imparting a structure may produce local strain field capable of absorbing 
deformation energy. Cracks are stopped or deterred at the a particles, so that the hardness is higher than 
for the solid solution. (Fig. 37.5b). 

The higher percentage of P-stabilizing elements (13%V in Til3VllCr3Al alloy) results in a micro- 
structure that is substantially P which can be strengthened by heat-treatment (Fig. 37.5c). Another Ti 
alloy (Til3Nbl3Zr) withl3%Nb and 13%Zr showed martensite structure after water quenched and aged, 
which showed high corrosion resistant with low modulus (E = 79MPa) [Davidson et al., 1994] . Formation 
of plates of martensite induces considerable elastic distortion in the parent crystal structure and increases 
strength (Fig. 37. 5d). 
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FIGURE 37.4 Part of Phase-diagram of Ti-Al-V at 4 w/o V [Smith and Hughes, 1966] . 
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FIGURE 37.5 Microstructure of Ti alloys (all are 500X) 
annealed, (c) fj-alloy, annealed, (d) Ti6Al4V, heat-treated 



[Hille, 1966], (a) Annealed a-alloy. (b) Ti6Al4V, a-fj alloy, 
at 1650°C and quenched [Imam et al., 1983]. 



The mechanical properties of the commercially pure titanium and its alloys are given in Table 37.7. 
The modulus of elasticity of these materials is about 110 GPa except 13Nbl3Zr alloy. From Table 37.7 
one can see that the higher impurity content of the cp-Ti leads to higher strength and reduced ductility. 
The strength of the material varies from a value much lower than that of 316 stainless steel or the CoCr 
alloys to a value about equal to that of annealed 316 stainless steel of the cast CoCrMo alloy. However, 
when compared by the specific strength (strength per density) the titanium alloys exceed any other 
implant materials as shown in Fig. 37.6. Titanium, nevertheless, has poor shear strength making it less 
desirable for bone screws, plates, and similar applications. It also tends to gall or seize when in sliding 
contact with itself or another metal. 
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FIGURE 37.6 Yield strength to density ratio of some implant materials [Hille, 1966]. 

Titanium derives its resistance to corrosion by the formation of a solid oxide layer to a depth of 10 nm. 
Under in vivo conditions the oxide (Ti0 2 ) is the only stable reaction product. However, micromotion at 
the cement-prosthesis and cement-bone are inevitable and consequently, titanium oxide and titanium 
alloy particles are released in cemented joint prosthesis. Sometimes this wear debris accumulates as 
periprosthetic fluid collections and triggers giant cell response around the implants. This cystic collection 
continued to enlarge and aspiration revealed "dark" heavily stained fluid containing titanium wear 
particles and histiocytic cells. Histological examination of the stained soft tissue showed "fibrin necrotic 
debris" and collagenous, fibrous tissue containing a histiocytic and foreign body giant cell infiltrate. The 
metallosis, black staining of the periprosthetic tissues, has been implicated in knee implant [Breen and 
Stoker, 1993]. 

The titanium implant surface consists of a thin oxide layer and the biological fluid of water molecules, 
dissolved ions, and biomolecules (proteins with surrounding water shell) as shown in Fig. 37.7. The 
microarchitecture (microgeometry, roughness, etc.) of the surface and its chemical compositions are 
important due to the following reasons: 

1. Physical nature of the surface either at the atomic, molecular, or higher level relative to the 
dimensions of the biological units may cause different contact areas with biomolecules, cells, etc. 
The different contact areas, in turn, may produce different perturbations and types of bonding of 
the biological units, which may influence their conformation and function. 

2. Chemical composition of the surface may produce different types of bonding to the biomolecules, 
which may then also affect their properties and function. Metals undergo chemical reactions at 
the surface depending on the environment which cause the difficulties of understanding the exact 
nature of the interactions. 

The surface-tissue interaction is dynamic rather than static, i.e., it will develop into new stages as time 
passes, especially during the initial period after implantation. During the initial few seconds after implan- 
tation, there will be only water, dissolved ions, and free biomolecules in the closest proximity of the 
surface but no cells. The composition of biofluid will then change continuously as inflammatory and 
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FIGURE 37.7 (a) Interface between a titanium implant and bioliquid and (b) the cell surface interaction [Kasemo 
and Lausma, 1988]. 

healing processes proceed, which in turn also probably causes changes in the composition of the adsorbed 
layer of biomolecules on the implant surface until quasiequilibrium sets in. Eventually, cells and tissues 
will approach the surface and, depending on the nature of the adsorbed layer, they will respond in specific 
ways that may further modify the adsorbed biomolecules. The type of cells closest to the surface and 
their activities will change with time. For example, depending on the type of initial interaction, the final 
results may be fibrous capsule formation or tissue integration [Hazan et al., 1993; Kasemo and Lausma, 
1988; Takeshita et al, 1997; Takatsuka et al, 1995; Yan et al, 1997]. 

Osseointegration is defined as direct contact without intervening soft tissue between viable remodeled 
bone and an implant. Surface roughness of titanium alloys have a significant effect on the bone apposition 
to the implant and on the bone implant interfacial pull out strength. The average roughness increased from 
0.5 to 5.9 um and the interfacial shear strength increased from 0.48 to 3.5 MPa [Feighan et al., 1995]. Highest 
levels of osteoblast cell attachment are obtained with rough sand blast surfaces where cells differentiated 
more than those on the smooth surfaces [Keller etal., 1994]. Chemical changes of the titanium surface 
following heat treatment is thought to form a Ti0 2 hydrogel layer on top of the Ti0 2 layer as shown in 
Fig. 37.8. The Ti0 2 hydrogel layer may induce the apatite crystal formation [Kim et al., 1996]. 

In general, on the rougher surfaces there are lower cell numbers, decreased rate of cellular proliferation, 
and increased matrix production compared to smooth surface. Bone formation appears to be strongly 
related to the presence of transforming growth factor fi r in the bone matrix [Kieswetter etal., 1996]. 

TiNi Alloys 

The titanium-nickel alloys show unusual properties i.e., after it is deformed the material can snap back 
to its previous shape following heating of the material. This phenomenon is called shape memory effect 
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FIGURE 37.8 Chemical change of titanium implant surface of alkali following heat treatment [Kim et al., 1996], 

(SME). The SME of TiNi alloy was first observed by Buehler and Wiley at the U.S. Naval Ordnance 
Laboratory [Buehler et al., 1963]. The equiatomic TiNi or NiTi alloy (Nitinol) exhibits an exceptional 
SME near room temperature: if it is plastically deformed below the transformation temperature, it reverts 
back to its original shape as the temperature is raised. The SME can be generally related to a diffusionless 
martensitic phase transformation which is also thermoelastic in nature, the thermoelasticity being attrib- 
uted to the ordering in the parent and martensitic phases [ Wayman and Shimizu, 1 972 ] . Another unusual 
property is the superelasticity, which is shown schematically in Fig. 37.9. As can be seen, the stress does 
not increase with increased strain after the initial elastic stress region and upon release of the stress or 
strain the metal springs back to its original shape in contrast to other metals such as stainless steel. The 
superlastic property is utilized in orthodontic archwires since the conventional stainless steel wires are 
too stiff and harsh for the tooth. In addition, the shape memory effect can also be utilized. 

Some possible applications of shape memory alloys are orthodontic dental archwire, intracranial 
aneurysm clip, vena cava filter, contractile artificial muscles for an artificial heart, vascular stent, catheter 
guide wire, and orthopedic staple [Duerig et al., 1990]. 

In order to develop such devices, it is necessary to understand fully the mechanical and thermal 
behavior associated with the martensitic phase transformation. A widely known NiTi alloy is 55-Nitinol 
(55 weight % or 50 atomic % Ni), which has a single phase and the mechanical memory plus other 
properties, for example, high acoustic damping, direct conversion of heat energy into mechanical energy, 
good fatigue properties, and low temperature ductility. Deviation from the 55-Nitinol (near stoichio- 
metric NiTi) in the Ni-rich direction yields a second group of alloys which are also completely non- 
magnetic but differ from 55-Nitinol in their ability to be thermally hardened to higher hardness levels. 
Shape recovery capability decreases and heat treatability increases rapidly as the Ni content approaches 
60%. Both 55 and 60-Nitinols have relatively low modulus of elasticity and can be tougher and more 
resilient than stainless steel, NiCr, or CoCr alloys. 

Efficiency of 55-Nitinol shape recovery can be controlled by changing the final annealing temperatures 
during preparation of the alloy device [Lee et al., 1988] . For the most efficient recovery, the shape is fixed 
by constraining the specimen in a desired configuration and heating to 482~510°C. If the annealed wire 
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FIGURE 37.9 Schematic illustration of the stainless steel wire and TiNi SMA wire springs for orthodontic archwire 
behavior (modified from [Wayman and Duerig, 1990]). 

is deformed at a temperature below the shape recovery temperature, shape recovery will occur upon 
heating, provided the deformation has not exceeded crystallographic strain limits (~8% strain in tension). 
The NiTi alloys also exhibit good biocompatibility and corrosion resistance in vivo. 

There is no significant difference between titanium and NiTi 
in the inhibition of mitosis in human fibroblasts. NiTi showed 
lower percentage bone and bone contact area than titanium 
and the Ti6Al4V alloy [Takeshita et al., 1997]. 

The mechanical properties of NiTi alloys are especially sen- 
sitive to the stoichiometry of composition (typical composition 
is given in Table 37.8) and the individual thermal and mechan- 
ical history. Although much is known about the processing, 
mechanical behavior, and properties relating to the shape 
memory effect, considerably less is known about the thermo- 
mechanical and physical metallurgy of the alloy. 



TABLE 37.8 


Chemical Composition 


of Ni-Ti 


Alloy 


Wire 


Element 




Composition(%) 


Ni 




54.01 


Co 




0.64 


Cr 




0.76 


Mil 




0.64 


Fe 




0.66 


Ti 




Balance 



37.5 Dental Metals 



Dental amalgam is an alloy made of liquid mercury and other solid metal particulate alloys made of 
silver, tin, copper, etc. The solid alloy is mixed with (liquid) mercury in a mechanical vibrating mixer 
and the resulting material is packed into the prepared cavity. One of the solid alloys is composed of at 
least 65% silver, and not more than 29% tin, 6% copper, 2% zinc, and 3% mercury. The reaction during 
setting is thought to be 

y + Hg-> y + y, + y 2 (37.1) 

in which the y phase is Ag 3 Sn, the y : phase is Ag 2 Hg 3 , and the y 2 phase is Sn 7 Hg. The phase diagram for 
the Ag-Sn-Hg system shows that over a wide compositional range all three phases are present. The final 
composition of dental amalgams typically contain 45% to 55% mercury, 35% to 45% silver, and about 
15% tin after fully set in about one day. 

Gold and gold alloys are useful metals in dentistry as a result of their durability, stability, and corrosion 
resistance [Nielsen, 1986]. Gold fillings are introduced by two methods: casting and malleting. Cast 
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restorations are made by taking a wax impression of the prepared cavity, making a mold from this 
impression in a material such as gypsum silica, which tolerates high temperature, and casting molten 
gold in the mold. The patient is given a temporary filling for the intervening time. Gold alloys are used 
for cast restorations, since they have mechanical properties which are superior to those of pure gold. 
Corrosion resistance is retained in these alloys provided they contain 75% or more of gold and other 
noble metals. Copper, alloyed with gold, significantly increases its strength. Platinum also improves the 
strength, but no more than about 4% can be added, or the melting point of the alloy is elevated excessively. 
Silver compensates for the color of copper. A small amount of zinc may be added to lower the melting 
point and to scavenge oxides formed during melting. Gold alloys of different composition are available. 
Softer alloys containing more than 83% gold are used for inlays which are not subjected to much stress. 
Harder alloys containing less gold are chosen for crowns and cusps which are more heavily stressed. 

Malleted restorations are built up in the cavity from layers of pure gold foil. The foils are welded 
together by pressure at ambient temperature. In this type of welding, the metal layers are joined by 
thermal diffusion of atoms from one layer to the other. Since intimate contact is required in this procedure, 
it is particularly important to avoid contamination. The pure gold is relatively soft, so this type of 
restoration is limited to areas not subjected to much stress. 

37.6 Other Metals 

Several other metals have been used for a variety of specialized implant applications. Tantalum has been 
subjected to animal implant studies and has been shown very biocompatible. Due to its poor mechanical 
properties (Table 37.9) and its high density (16.6 g/cm 3 ) it is restricted to few applications such as wire 
sutures for plastic surgeons and neurosurgeons and a radioisotope for bladder tumors. 

Platinum group metals (PGM) such as Pt, Pd, Rh, Ir, Ru, and Os are extremely corrosion resistant but 
have poor mechanical properties [Wynblatt, 1986]. They are mainly used as alloys for electrodes such as 
pacemaker tips because of their high resistance to corrosion and low threshold potentials for electrical 
conductivity. 

Thermoseeds made of 70% Ni and 30% Cu have been produced which possess Curie points in the 
therapeutic hyperthermia range, approximately 40 to 50°C [Ferguson et al., 1992]. Upon the application 
of an alternating magnetic field, eddy currents are induced, which will provide a continuous heat source 
through resistive heating of the material. As the temperature of a ferromagnetic substance nears its Curie 
point, however, there is a loss of ferromagnetic properties and a resulting loss of heat output. Thus, self- 
regulation of temperature is achieved and can be used to deliver a constant hyperthermic temperature 
extracorporeally at any time and duration. 

Surface modifications of metal alloys such as coatings by plasma spray, physical or chemical vapor 
deposition, ion implantaion, and fluidized bed deposition have been used in industry [Smith, 1993]. 
Coating implants with tissue compatible materials such as hydroxyapatite, oxide ceramics, Bioglass®, and 
pyrolytic carbon are typical applications in implants. Such efforts have been largely ineffective if the 
implants are permanent and particularly if the implants are subjected to a large loading. The main 
problem is the delamination of the coating or eventual wear of the coating. The added cost of coating 
or ion implanting hinders the use of such techniques unless the technique shows unequivocal superiority 
compared to the non-treated implants. 

TABLE 37.9 Mechanical Properties of Tantalum [American 
Society for Testing and Materials, F560-86, p.143, 1992] 



Properties Annealed Cold-Worked 



Tensile strength (MPa) 


207 


517 


Yield strength (0.2% offset) (MPa) 


138 


345 


Elongation (%) 


20-30 


2 


Young's modulus (GPa) 


— 


190 
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37.7 Corrosion of Metallic Implants 

Corrosion is the unwanted chemical reaction of a 
metal with its environment, resulting in its contin- 
ued degradation to oxides, hydroxides, or other 
compounds. Tissue fluid in the human body con- 
tains water, dissolved oxygen, proteins, and various 
ions such as chloride and hydroxide. As a result, the 
human body presents a very aggressive environ- 
ment for metals used for implantation. Corrosion 
resistance of a metallic implant material is conse- 
quently an important aspect of its biocompatibility. 



+ Cathode 
Anode 

Cations^- 
^-Anions 

Electrolyte 

FIGURE 37.10 Electrochemical cell. 



Electrochemical Aspects 

The lowest free energy state of many metals in an 
oxygenated and hydrated environment is that of the 
oxide. Corrosion occurs when metal atoms become 
ionized and go into solution, or combine with oxy- 
gen or other species in solution to form a com- 
pound which flakes off or dissolves. The body environment is very aggressive in terms of corrosion since 
it is not only aqueous but also contains chloride ions and proteins. A variety of chemical reactions occur 
when a metal is exposed to an aqueous environment, as shown in Fig. 37.10. The electrolyte, which 
contains ions in solution, serves to complete the electric circuit. In the human body, the required ions 
are plentiful in the body fluids. Anions are negative ions which migrate toward the anode, and cations 
are positive ions which migrate toward the cathode. At the anode, or positive electrode, the metal oxidizes 
by losing valence electrons as in the following. 



M -> M +n + ne~ 
At the cathode, or negative electrode, the following reduction reactions are important. 



M +n + ne~ 



M 



M ++ + OH-+ 2e ->MOH 
2H 3 + + 2e "-» H 2 T + 2H 2 



l/20 2 + H 2 + 2e~ 



20H- 



(37.2) 

(37.3) 
(37.4) 
(37.5) 
(37.6) 



The tendency of metals to corrode is expressed most simply in the 
standard electrochemical series of Nernst potentials, shown in 
Table 37.10. These potentials are obtained in electrochemical measure- 
ments in which one electrode is a standard hydrogen electrode formed 
by bubbling hydrogen through a layer of finely divided platinum black. 
The potential of this reference electrode is defined to be zero. Noble metals 
are those which have a potential higher than that of a standard hydrogen 
electrode; base metals have lower potentials. 

If two dissimilar metals are present in the same environment, the one 
which is most negative in the galvanic series will become the anode, and 
bimetallic (or galvanic) corrosion will occur. Galvanic corrosion can be 
much more rapid than the corrosion of a single metal. Consequently, 
implantation of dissimilar metals (mixed metals) is to be avoided. Gal- 
vanic action can also result in corrosion within a single metal, if there is 
inhomogeneity in the metal or in its environment, as shown in Fig. 37.11. 



TABLE 37.10 Standard 
Electrochemical Series 



Reaction 


AE„ [volts] 


LioLi+ 


-3.05 


Na<->Na + 


-2.71 


A1<->A1 +++ 


-1.66 


Tk->Ti +++ 


-1.63 


CroCr ++ 


-0.56 


FeoFe ++ 


-0.44 


Cu<->Cu ++ 


-0.34 


Co<->Co ++ 


-0.28 


NioNi ++ 


-0.23 


H,<->2H+ 


0.00 


Ag<->Ag + 


+0.80 


Au<->Au + 


+ 1.68 
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FIGURE37.il Micro-corrosion cells, (a) Grain boundaries are anodic with respect to the grain interior, (b) Crevice 
corrosion due to oxygen-deficient zone in metal's environment. 

The potential difference, E, actually observed depends on the concentration of the metal ions in 
solution according to the Nernst equation, 



o + (RT/nF)] 



E = E + RT/nF In 



M + 



(37.7) 



in which _R is the gas constant, £ is the standard electrochemical potential, T is the absolute temperature, 
F is Faraday's constant (96,487 C/mole), and n is the number of moles of ions. 

The order of nobility observed in actual practice may differ from that predicted thermodynamically. The 
reasons are that some metals become covered with a passivating film of reaction products which protects 
the metal from further attack. The dissolution reaction may be strongly irreversible so that a potential 
barrier must be overcome. In this case, corrosion may be inhibited even though it remains energetically 
favorable. The kinetics of corrosion reactions are not determined by the thermodynamics alone. 

Pourbaix Diagrams in Corrosion 

The Pourbaix diagram is a plot of regions of corrosion, passivity, and immunity as they depend on electrode 
potential and pH [Pourbaix, 1974]. The Pourbaix diagrams are derived from the Nernst equation and 
from the solubility of the degradation products and the equilibrium constants of the reaction. For the 
sake of definition, the corrosion region is set arbitrarily at a concentration of greater than 10~ 6 gram atom/L 
(molar) or more of metal in the solution at equilibrium. This corresponds to about 0.06 mg/L for metals 
such as iron and copper, and 0.03 mg/L for aluminum. Immunity is defined as equilibrium between metal 
and its ions at less than 10 6 molar. In the region of immunity, the corrosion is energetically impossible. 
Immunity is also referred to as cathodic protection. In the passivation domain, the stable solid constituent 
is an oxide, hydroxide, hydride, or a salt of the metal. Passivity is defined as equilibrium between a metal 
and its reaction products (oxides, hydroxides, etc.) at a concentration of 10~ 6 molar or less. This situation 
is useful if reaction products are adherent. In the biomaterials setting, passivity may or may not be 
adequate; disruption of a passive layer may cause an increase in corrosion. The equilibrium state may 
not occur if reaction products are removed by the tissue fluid. Materials differ in their propensity to re- 
establish a passive layer which has been damaged. This layer of material may protect the underlying metal 
if it is firmly adherent and nonporous; in that case further corrosion is prevented. Passivation can also 
result from a concentration polarization due to a buildup of ions near the electrodes. This is not likely 
to occur in the body since the ions are continually replenished. Cathodic depolarization reactions can 
aid in the passivation of a metal by virtue of an energy barrier which hinders the kinetics. Equations (37.5) 
and (37.6) are examples. 
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FIGURE 37.12 Pourbaix diagram for chromium, showing regions associated with various body fluids (modified 
from [Black, 1992]). 

There are two diagonal lines in the diagrams shown in Fig. 37.12. The top oxygen line represents the 
upper limit of the stability of water and is associated with oxygen rich solutions or electrolytes near 
oxidizing materials. In the region above this line, oxygen is evolved according to 2H 2 — > 2 T + 4H + + 
4e~. In the human body, saliva, intracellular fluid, and interstitial fluid occupy regions near the oxygen 
line, since they are saturated with oxygen. The lower hydrogen diagonal line represents the lower limit 
of the stability of water. Hydrogen gas is evolved according to Eq. (37.5). Aqueous corrosion occurs in 
the region between these diagonal lines on the Pourbaix diagram. In the human body, urine, bile, the 
lower gastrointestinal tract, and secretions of ductless glands, occupy a region somewhat above the 
hydrogen line. 

The significance of the Pourbaix diagram is as follows. Different parts of the body have different pH 
values and oxygen concentrations. Consequently, a metal which performs well [is immune or passive] in 
one part of the body may suffer an unacceptable amount of corrosion in another part. Moreover, pH can 
change dramatically in tissue that has been injured or infected. In particular, normal tissue fluid has a pH 
of about 7.4, but in a wound it can be as low as 3.5, and in an infected wound the pH can increase to 9.0. 

Pourbaix diagrams are useful, but do not tell the whole story; there are some limitations. Diagrams 
are made considering equilibrium among metal, water, and reaction products. The presence of other 
ions, e.g., chloride, may result in very much different behavior and large molecules in the body may also 
change the situation. Prediction of passivity may in some cases be optimistic, since reaction rates are not 
considered. 

Rate of Corrosion and Polarization Curves 

The regions in the Pourbaix diagram specify whether corrosion will take place, but they do not determine 
the rate. The rate, expressed as an electric current density [current per unit area], depends upon electrode 
potential as shown in the polarization curves shown in Fig. 37.13. From such curves, it is possible to 
calculate the number of ions per unit time liberated into the tissue, as well as the depth of metal removed 
by corrosion in a given time. An alternative experiment is one in which the weight loss of a specimen of 
metal due to corrosion is measured as a function of time. 

The rate of corrosion also depends on the presence of synergistic factors, such as those of mechanical 
origin (uneven distribution of mechanical stress). The stressed alloy failures occur due to the propagation 
of cracks in corrosive environments. For example, in corrosion fatigue (stress corrosion cracking), 
repetitive deformation of a metal in a corrosive environment results in acceleration of both the corrosion 
and the fatigue microdamage. Since the body environment involves both repeated mechanical loading 
and a chemically aggressive environment, fatigue testing of implant materials should always be performed 
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FIGURE 37.13 Potential-current density curves for some biomaterials [Greener et al., 1972]. 

under physiological environmental conditions; under Ringer's solution at body temperature. In fretting 
corrosion, rubbing of one part on another disrupts the passivation layer, resulting in accelerated corrosion. 
In pitting, the corrosion rate is accelerated in a local region. Stainless steel is vulnerable to pitting. 
Localized corrosion can occur if there is inhomogeneity in the metal or in the environment. Grain 
boundaries in the metal may be susceptible to the initiation of corrosion, as a result of their higher energy 
level. Crevices are also vulnerable to corrosion, since the chemical environment in the crevice may differ 
from that in the surrounding medium. The area of contact between a screw and a bone plate, for example, 
can suffer crevice corrosion. 

Corrosion of Available Metals 

Choosing a metal for implantation should take into account the corrosion properties discussed above. 
Metals which are in current use as biomaterials include gold, cobalt chromium alloys, type 316 stainless 
steel, cp-titanium, titanium alloys, nickel-titanium alloys, and silver-tin-mercury amalgam. 

The noble metals are immune to corrosion and would be ideal materials if corrosion resistance were 
the only concern. Gold is widely used in dental restorations and in that setting it offers superior perfor- 
mance and longevity. Gold is not, however, used in orthopaedic applications as a result of its high density, 
insufficient strength, and high cost. 

Titanium is a base metal in the context of the electrochemical series, however, it forms a robust 
passivating layer and remains passive under physiological conditions. Corrosion currents in normal saline 
are very low: 10" 8 A/cm 2 . Titanium implants remain virtually unchanged in appearance. Ti offers superior 
corrosion resistance but is not as stiff or strong as steel or Co-Cr alloys. 

Cobalt-chromium alloys, like titanium, are passive in the human body. They are widely in use in 
orthopedic applications. They do not exhibit pitting corrosion. 

Stainless steels contain enough chromium to confer corrosion resistance by passivity. The passive layer 
is not as robust as in the case of titanium or the cobalt chrome alloys. Only the most corrosion resistant 
of the stainless steels are suitable for implants. These are the austenitic types — 316, 316L, and 317, which 
contain molybdenum. Even these types of stainless steel are vulnerable to pitting and to crevice corrosion 
around screws. 

The phases of dental amalgam are passive at neutral pH, the transpassive potential for the y 2 phase is easily 
exceeded, due to interphase galvanic couples or potentials due to differential aeration under dental plaque. 
Amalgam, therefore, often corrodes and is the most active (corrosion prone) material used in dentistry. 
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Corrosion of an implant in the clinical setting can result in symptoms such as local pain and swelling 
in the region of the implant, with no evidence of infection; cracking or flaking of the implant as seen 
on x-ray films, and excretion of excess metal ions. At surgery, gray or black discoloration of the sur- 
rounding tissue may be seen and flakes of metal may be found in the tissue. Corrosion also plays a role 
in the mechanical failures of orthopaedic implants. Most of these failures are due to fatigue, and the 
presence of a saline environment certainly exacerbates fatigue. The extent to which corrosion influences 
fatigue in the body is not precisely known. 

Stress Corrosion Cracking 

When an implant is subjected to stress, the corrosion process could be accelerated due to the mechanical 
energy. If the mechanical stress is repeated then fatigue stress corrosion takes place such as in the femoral 
stem of the hip joint and hip nails made of stainless steels [Dobbs and Scales, 1979; Sloter and Piehler, 
1979] . However, other mechanisms of corrosion such as fretting may also be involved at point of contact 
such as in the counter-sink of the hip nail or bone fracture plate for the screws. 

37.8 Manufacturing of Implants 
Stainless Steels 

The austenitic stainless steels work-harden very rapidly as shown in Fig. 37.2 and therefore, cannot be 
cold-worked without intermediate heat treatments. The heat treatments should not induce, however, the 
formation of chromium carbide (CCr 4 ) in the grain boundaries; this may cause corrosion. For the same 
reason, the austenitic stainless steel implants are not usually welded. 

The distortion of components by the heat treatments can occur but this problem can be solved by 
controlling the uniformity of heating. Another undesirable effect of the heat treatment is the formation 
of surface oxide scales which have to be removed either chemically (acid) or mechanically (sand-blasting). 
After the scales are removed the surface of the component is polished to a mirror or mat finish. The 
surface is then cleaned, degreased, and passivated in nitric acid (ASTM Standard F86). The component 
is washed and cleaned again before packaging and sterilizing. 

Co-Cr Alloys 

The CoCrMo alloy is particularly susceptible to work-hardening so that the normal fabrication procedure 
used with other metals cannot be employed. Instead, the alloy is cast by a lost wax (or investment casting) 
method which involves making a wax pattern of the desired component. The pattern is coated with a 
refractory material, first by a thin coating with a slurry (suspension of silica in ethyl silicate solution) 
followed by complete investing after drying: (1) the wax is then melted out in a furnace (100~150°C), 

(2) the mold is heated to a high temperature burning out any traces of wax or gas forming materials, 

(3) molten alloy is poured with gravitational or centrifugal force, and (4) the mold is broken after cooled. 
The mold temperature is about 800~1,000°C and the alloy is at 1,350- 1,400°C. 

Controlling the mold temperature will have an effect on the grain size of the final cast; coarse ones 
are formed at higher temperatures which will decrease the strength. However, high processing temper- 
ature will result in larger carbide precipitates with greater distances between them resulting in a less 
brittle material. Again there is a complementary (trade off) relationship between strength and toughness. 

Ti and its Alloys 

Titanium is very reactive at high temperature and burns readily in the presence of oxygen. Therefore, it 
requires an inert atmosphere for high temperature processing or is processed by vacuum melting. Oxygen 
diffuses readily in titanium and the dissolved oxygen embrittles the metal. As a result, any hot working 
or forging operation should be carried out below 925°C. Machining at room temperature is not the 
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solution to all the problems since the material also tends to gall or seize the cutting tools. Very sharp 
tools with slow speeds and large feeds are used to minimize this effect. Electrochemical machining is an 
attractive means. 

Defining Terms 

Amalgam: An alloy obtained by mixing silver tin alloy with mercury. 

Anode: Positive electrode in an electrochemical cell. 

Cathode: Negative electrode in an electrochemical cell. 

Corrosion: Unwanted reaction of metal with environment. In a Pourbaix diagram, it is the region in 
which the metal ions are present at a concentration of more than 10~ 6 molar. 

Crevice corrosion: A form of localized corrosion in which concentration gradients around pre-existing 
crevices in the material drive corrosion processes. 

Curie temperature: Transition temperature of a material from ferromagnetic to paramagnetic. 

Galvanic corrosion: Dissolution of metal driven by macroscopic differences in electrochemical poten- 
tial, usually as a result of dissimilar metals in proximity. 

Galvanic series: Table of electrochemical potentials (voltage) associated with the ionization of metal 
atoms. These are called Nernst potentials. 

Hyperthermia: Application of high enough thermal energy (heat) to suppress the cancerous cell activ- 
ities. Above 41.5°C (but below 60°C) is needed to have any effect. 

Immunity: Resistance to corrosion by an energetic barrier. In a Pourbaix diagram, it is the region in 
which the metal is in equilibrium with its ions at a concentration of less than 10~ 6 molar. Noble 
metals resist corrosion by immunity. 

Martensite: A metastable structure formed by quenching of austenite (g) structure in alloys such as steel 
and Ti alloys. They are brittle and hard, and therefore, are further treated with heat to make tougher. 

Nernst potential: Standard electrochemical potential measured with respect to a standard hydrogen 
electrode. 

Noble: Type of metal with a positive standard electrochemical potential. 

Passivation: Production of corrosion resistance by a surface layer of reaction products (Normally oxide 
layer which is impervious to gas and water.) 

Passivity: Resistance to corrosion by a surface layer of reaction products. In a Pourbaix diagram, it is 
the region in which the metal is in equilibrium with its reaction products at a concentration of 
less than 10~ 6 molar. 

Pitting: A form of localized corrosion in which pits form on the metal surface. 

Pourbaix diagram: Plot of electrical potential versus pH for a material in which the regions of corro- 
sion, passivity, and immunity are identified. 

Shape memory effect (SME): Thermoelastic behavior of some alloys which can revert back to their 
original shape when the temperature is greater than the phase transformation temperature of the alloy. 

Superelasticity: Minimal stress increase beyond the initial strain region resulting in very low modulus 
in the region for some shape memory alloys. 

References 

ASTM. (1980). Annual Book ofASTM Standards (Philadelphia, PA). 

Black, J. (1992). Biological Performance of Materials, 2nd ed. (New York: M. Dekker, Inc.). 

Bordiji, K., Jouzeau, J., Mainard, D., Payan, E., Delagoutte, J., and Netter, P. (1996). Evaluation of the 

effect of three surface treatments on the biocompatibility of 316L stainless steel using human 

differentiated cells. Biomaterials. 17, 491-500. 
Breen, D. J. and Stoker, D. J. (1993). Titanium lines: a manifestation of metallosis and tissue response to 

titanium alloy megaprostheses at the knee. Clin. Radiology 43, 274-277. 
Buehler, W. J., Gilfrich, J. V., and Wiley, R. C. (1963). Effect of low-temperature phase changes on the 

mechanical properties of alloys near composition Ti-Ni. /. Appl. Phys. 34, 1475-1477. 

© 2000 by CRC Press LLC 



Davidson, J. A., Mishra, A. K., Kovacs, P., and Poggie, R. A. (1994). New surface hardened, low-modulus, 

corrosion-resistant Ti-13Nb-13Zr alloy for total hip arthroplasty. Biomed. Mater. Eng. 4, 231-243. 
Devine, T. M. and Wulff, J. (1975). Cast vs. wrought cobalt-chromium surgical implant alloys. /. Biomed. 

Mater. Res. 9, 151-167. 
Dobbs, H. S. and Scales, J. T. (1979). Fracture and corrosion in stainless steel hip replacement stems. In 

Corrosion and Degradation of Implant Materials, B. C. Syrett and A. Acharya, Eds. (Philadelphia: 

American Society for Testing and Materials), pp. 245-258. 
Duerig, T. W., Melton, K. N., Stockel, D., and Wayman, C. M. (1990). Engineering Aspects of Shape Memory 

Alloys (London: Butterworth-Heinemann). 
Feighan, J. E., Goldberg, V. M., Davy, D., Parr, J. A., and Stevenson, S. (1995). The influence of surface- 
blasting on the incorporation of titanium-alloy implants in a rabbit intramedullary model. 77 A, 

1380-1395. 
Ferguson, S. D., Paulus, J. A., Tucker, R. D., Loening, S. A., and Park, J. B. (1992). Effect of Thermal 

Treatment on Heating Characteristics of Ni-Cu Alloy for Hyperthermie: Preliminary Studies. 

/. Appl. Biomater. 4, 55-60. 
Granchi, D., Ciapetti, C, Savarino, L., Cavedagna, D., Donati, M. E., and Pizzoferrato, A. (1996). 

Assessment of metal extract toxicity on human lymphocytes cultured in vitro. J. Biomed. Mater. 

Res. 31, 183-191. 
Greener, E. H., Harcourt, J. K., and Lautenschlager, E. P. (1972). Materials Science in Dentistry. (Baltimore, 

MD: Williams and Wilkins). 
Hazan, R., Brener, R., and Oron, U. (1993). Bone growth to metal implants is regulated by their surface 

chemical properties. Biomaterials. 570-574. 
Hille, G. H. (1966). Titanium for surgical implants. /. Mater. 1, 373-383. 
Imam, M. A., Fraker, A. C, Harris, J. S., and Gilmore, C. M. (1983). Influence of heat treatment on the 

fatigue lives of Ti-6A1-4V and Ti-4.5Al-5Mo-l.5CR. In Titanium Alloys in Surgical Implants, H. A. 

Luckey and F. E. Kubli, Eds. (Philadelphia, PA: ASTM special technical publication 796), 

pp. 105-119. 
Kasemo, B. and Lausma, J. (1988). Biomaterial and implant surface: A surface science approach. Int. J. 

Oral Maxillofac Impl. 3, 247-259. 
Keating, F. H. (1956). Chromium-Nickel Autentic Steels. (London: Butterworths). 
Keller, J. C, Stanford, C. M., Wightman, J. P., Draughn, R. A., and Zaharias, R. (1994). Characterizations 

of titanium implant surfaces. III. /. Biomed. Mater. Res. 28, 939-946. 
Kieswetter, K., Schwartz, Z., Hummert, T. W., Cochran, D. L., Simpson, J., and Boyan, B. D. (1996). 

Surface roughness modulates the local production of growth factors and cytokines by osteoblast- 

like MG-63 cells. /. Biomed. Mater. Res. 32, 55-63. 
Kim, H., Miyaji, R, Kokubo, T, and Nakamura, T (1996). Preparation of bioactive Ti and its alloys via 

simple chemical surface treatment. /. Biomed. Mater. Res. 32, 409-417. 
Lee, J. H., Park, J. B., Andreasen, G. F., and Lakes, R. S. (1988). Thermomechanical study of Ni-Ti alloys. 

/. Biomed. Mater. Res. 22, 573-588. 
Nielsen, J. P. (1986). Dental Noble-Metal Casting Alloys: Composition and Properties. In Encyclopedia of 

Materials Science and Engineering, M. B. Bever, Ed. (Oxford, Cambridge: Pergamon Press), 

pp. 1093-1095. 
Pourbaix, M. (1974). Atlas of Electrochemical Equilibria in Aqueous Solutions, 2nd ed. (Houston/CEBEL- 

COR, Brussels: NACE). 
Richards-Mfg-Company. (1980). Biophase implant material, technical information publication 3846 

(Memphis, TN). 
Schmalzried, T. P., Peters, P. C, Maurer, B. T, Bragdon, C. R., and Harris, W. H. (1996). Long-duration 

metal-on-metal total hip arthroplasties with low wear of the articulating surfaces. /. Arthroplasty 

iJ, 322-331. 
Semlitch, M. (1980). Properties of wrought CoNiCrMo alloy Protasul-10, a highly corrosion and fatigue 

resistant implant material for joint endoprostheses. Eng. Med. 9, 201-207. 

© 2000 by CRC Press LLC 



Sloter, L. E. and Piehler, H. R. (1979). Corrosion-fatigue performance of stainless steel hip nails — Jewett 
type. In Corrosion and Degradation of Implant Materials, B. C. Syrett and A. Acharya, Eds. (Phil- 
adelphia, PA: American Society for Testing and Materials), pp. 173-195. 

Smith, C. J. E. and Hughes, A. N. (1966). The corrosion-fatigue behavior of a titanium-6w/o aluminum- 
4w/o vanadium alloy. Eng. Med. 7, 158-171. 

Smith, W. F. (1993). Structure and Properties of Engineering Alloys, 2nd ed. (New York: McGraw-Hill). 

Takatsuka, K., Yamamuro, T, Nakamura, T, and Kokubo, T. (1995). Bone-bonding behavior of titanium 
alloy evaluated mechanically with detaching failure load. /. Biomed. Mater. Res. 29, 157-163. 

Takeshita, F., Ayukawa, Y., Iyama, S., Murai, K., and Suetsugu, T. (1997). Long-term evaluation of bone- 
titanium interface in rat tibiae using light microscopy, transmission electron microscopy, and image 
processing. /. Biomed. Mater. Res. 37, 235-242. 

Wayman, C. M. and Duerig, T. W. (1990). An introduction of martensite and shape memory. In Engi- 
neering Aspects of Shape Memory Alloys, T. W. Duerig, K. N. Melton, D. Stockel, and C. M. Wayman, 
Eds. (London: Butterworth-Heinemann), pp. 3-20. 

Wayman, C. M. and Shimizu, K. (1972). The shape memory ('Marmem') effect in alloys. Metal Sci. J. 6, 
175-183. 

Williams, D. F. (1982). Biocompatibliity in Clinial Practice (Boca Raton, FL: CRC Press). 

Wynblatt, P. (1986). Platinum Group Metals and Alloys. In Encyclopedia of Materials Science and Engi- 
neering, M. B. Bever, Ed. (Oxford, Cambridge: Pergamon Press), pp. 3576-3579. 

Yan, W., Nakamura, T, Kobayashi, M., Kim, H., Miyaji, F., and Kokubo, T. (1997). Bonding of chemically 
treated titanium implants to bone. /. Biomed. Mater. Res. 37, 267-275. 

Further Reading 

American Society for Testing and Materials. 1992. Annual Book of ASTM Standards, vol. 13, Medical 

Devices and Services, American Society for Testing and Materials, Philadelphia, PA . 
Bardos, D.I. 1977. Stainless steels in medical devices, in Handbook of Stainless Steels. Peckner, D. and 

Bernstein, I.M., Eds. pp. 1-10, McGraw-Hill, New York. 
Bechtol, C. O., Ferguson, Jr., A. B., and Laing, P. G. 1959. Metals and Engineering in Bone and Joint 

Surgery, Williams and Wilkins, Baltimore, MD. 
Comte, T.W. 1984. Metallurgical observations of biomaterials, in Contemporary Biomaterials, Boretos, 

J.W and Eden, M., Eds., pp. 66-91, Noyes, Park Ridge, NJ. 
Duerig, T. W, Melton, K. N., Stockel, D., and Wayman, C. M., Eds. 1990. Engineering Aspects of Shape 

Memory Alloys, Butterworth-Heinemann, London. 
Dumbleton, J.H. and Black, J. 1975. An Introduction to Orthopaedic Materials, C. Thomas, Springfield, IL. 
Fontana, M.G. and Greene, N.O. 1967. Corrosion Engineering, pp. 163-168, McGraw-Hill, New York. 
Greener, E. H., Harcourt, J. K., and Lautenschlager, E. P. 1972. Materials Science in Dentistry, Williams 

and Wilkins, Baltimore, MD. 
Hildebrand, H.E and Champy, M., Eds. 1988. Biocompatibility of Co-Cr-Ni Alloys. Plenum Press, New York. 
Levine, S.N., Ed. 1968. Materials in Biomedical Engineering, Annals of New York Academy of Science, 

vol. 146. New York. 
Luckey H. A., Ed. 1983. Titanium Alloys in Surgical Implants, ASTM Special Technical Publication 796, 

Philadelphia, PA 
Mears, D.C. 1979. Materials and Orthopaedic Surgery, Williams and Wilkins, Baltimore, MD. 
Park, J. B. 1984. Biomaterials Science and Engineering, Plenum Pub., New York. 
Perkins, J., Ed. 1975. Shape Memory Effects in Alloys, Plenum Press, New York. 
Puckering, F.B., Ed. 1979. The Metallurgical Evolution of Stainless Steels, 1-42, American Society for Metals 

and the Metals Society, Metals Park, OH. 
Smith, W.F. 1993. Structure and Properties of Engineering Alloys, 2nd ed., McGraw-Hill, New York. 
Weinstein, A., Horowitz, E., and Ruff, A.W., Eds. 1977. Retrieval and Analysis of Orthopaedic Implants, 

NBS, U.S. Department of Commerce, Washington, D.C. 
Williams, D.F. and Roaf, R. 1973. Implants in Surgery, W.B. Sauders Co., LTD, London. 

© 2000 by CRC Press LLC 



Billotte, W. G. "Ceramic Biomaterials." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



38 



W.G. Billotte 

University of Dayton 

38.1 Introduction 



Ceramic Biomaterials 



38.1 Introduction 

38.2 Nonabsorbable or Relatively Bioinert Bioceramics 
Relatively Bioinert Ceramics 

38.3 Biodegradable or Resorbable Ceramics 
Calcium Phosphate ■ Aluminum-Calcium-Phosphate 
(ALCAP) Ceramics • Coralline • Tricalcium Phosphate 
(TCP) Ceramics • Zinc-Calcium-Phosphorous Oxide 
(ZCAP) Ceramics ■ Zinc-Sulfate-Calcium-Phosphate 
(ZSCAP) Ceramics • Ferric-Calcium-Phosphorous-Oxide 
(FECAP) Ceramics 

38.4 Bioactive or Surface-Reactive Ceramics 
Glass Ceramics • Ceravital 

38.5 Deterioration of Ceramics 

38.6 Bioceramic Manufacturing Techniques 
Hard Tissue Replacement • Tissue Integration 



Ceramics are defined as the art and science of making and using solid articles that have as their essential 
component, inorganic nonmetallic materials [Kingery et al., 1976]. Ceramics are refractory, polycrystal- 
line compounds, usually inorganic, including silicates, metallic oxides, carbides and various refractory 
hydrides, sulfides, and selenides. Oxides such as A1 2 3 , MgO, Si0 2 , and Zr0 2 contain metallic and 
nonmetallic elements and ionic salts, such as NaCl, CsCl, and ZnS [Park and Lakes, 1992]. Exceptions 
to the preceding include covalently bonded ceramics such as diamond and carbonaceous structures like 
graphite and pyrolized carbons [Park and Lakes, 1992]. 

Ceramics in the form of pottery have been used by humans for thousands of years. Until recently, 
their use was somewhat limited because of their inherent brittleness, susceptibility to notches or micro- 
cracks, low tensile strength, and low impact strength. However, within the last 100 years, innovative 
techniques for fabricating ceramics have led to their use as "high tech" materials. In recent years, humans 
have realized that ceramics and their composites can also be used to augment or replace various parts 
of the body, particularly bone. Thus, the ceramics used for the latter purposes are classified as bioceramics. 
Their relative inertness to the body fluids, high compressive strength, and aesthetically pleasing appear- 
ance led to the use of ceramics in dentistry as dental crowns. Some carbons have found use as implants 
especially for blood interfacing applications such as heart valves. Due to their high specific strength as 
fibers and their biocompatibility, ceramics are also being used as reinforcing components of composite 
implant materials and for tensile loading applications such as artificial tendon and ligaments [Park and 
Lakes, 1992]. 

Unlike metals and polymers, ceramics are difficult to shear plastically due to the (ionic) nature of the 
bonding and minimum number of slip systems. These characteristics make the ceramics non-ductile and 
are responsible for almost zero creep at room temperature [Park and Lakes, 1992]. Consequently, ceramics 
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TABLE 38.1 Desired Properties of Implantable Bioceramics 

1. Should be nontoxic. 

2. Should be noncarcinogenic. 

3. Should be nonallergic. 

4. Should be noninflammatory. 

5. Should be biocompatible. 

6. Should be biofunctional for its lifetime in the host. 

are very susceptible to notches or microcracks because instead of undergoing plastic deformation (or 
yield) they will fracture elastically on initiation of a crack. At the crack tip the stress could be many times 
higher than the stress in the material away from the tip, resulting in a stress concentration which weakens 
the material considerably. The latter makes it difficult to predict the tensile strength of the material 
(ceramic). This is also the reason ceramics have low tensile strength compared to compressive strength. 
If a ceramic is flawless, it is very strong even when subjected to tension. Flawless glass fibers have twice 
the tensile strengths of high strength steel (~7 GPa) [Park and Lakes, 1992]. 

Ceramics are generally hard; in fact, the measurement of hardness is calibrated against ceramic 
materials. Diamond is the hardest, with a hardness index of 10 on Moh's scale, and talc (Mg 3 Si,O 10 COH) 
is the softest ceramic (Moh's hardness 1), while ceramics such as alumina (A1 2 3 ; hardness 9), quartz 
(Si0 2 ; hardness 8), and apatite (Ca 5 P 3 12 F; hardness 5) are in the middle range. Other characteristics of 
ceramic materials are (1) their high melting temperatures and (2) low conductivity of electricity and 
heat. These characteristics are due to the chemical bonding within ceramics. 

In order to be classified as a bioceramic, the ceramic material must meet or exceed the properties listed 
in Table 38.1. The number of specific ceramics currently in use or under investigation cannot be accounted 
for in the space available for bioceramics in this book. Thus, this chapter will focus on a general overview 
of the relatively bioinert, bioactive or surface reactive ceramics, and biodegradable or resorbable bioce- 
ramics. 

Ceramics used in fabricating implants can be classified as nonabsorbable (relatively inert), bioactive 
or surface reactive (semi-inert) [Hench, 1991, 1993] and biodegradable or resorbable (non-inert) [Hen- 
trich et al., 1971; Graves et al., 1972]. Alumina, zirconia, silicone nitrides, and carbons are inert biocer- 
amics. Certain glass ceramics and dense hydroxyapatites are semi-inert (bioreactive) and calcium 
phosphates and calcium aluminates are resorbable ceramics [Park and Lakes, 1992]. 

38.2 Nonabsorbable or Relatively Bioinert Bioceramics 
Relatively Bioinert Ceramics 

Relatively bioinert ceramics maintain their physical and mechanical properties while in the host. They 
resist corrosion and wear and have all the properties listed for bioceramics in Table 38.1. Examples of 
relatively bioinert ceramics are dense and porous aluminum oxides, zirconia ceramics, and single phase 
calcium aluminates (Table 38.2). Relatively bioinert ceramics are typically used as structural-support 
implants. Some of these are bone plates, bone screws, and femoral heads (Table 38.3). Examples of non- 
structural support uses are ventilation tubes, sterilization devices [Feenstra and de Groot, 1983] and drug 
delivery devices (see Table 38.3). 

Alumina (A1 2 3 ) 

The main source of high purity alumina (aluminum oxide, A1 2 3 ) is bauxite and native corundum. The 
commonly available alumina (alpha, a) can be prepared by calcining alumina trihydrate. The chemical 
composition and density of commercially available "pure" calcined alumina are given in Table 38.4. The 
American Society for Testing and Materials (ASTM) specifies that alumina for implant use should contain 
99.5% pure alumina and less than 0.1% combined Si0 2 and alkali oxides (mostly Na 2 0) [F603-78]. 
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TABLE 38.2 Examples of Relatively Bioinert Bioceramics 



Bioinert Ceramics 



1. Pyrolitic carbon coated devices 



2. Dense and nonporous aluminum oxides 



3. Porous aluminum oxides 



4. Zirconia ceramics 



5. Dense hydroxyapatites 



6. Calcium aluminates 
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Alpha alumina has a rhombohedral crystal structure (a = 4.758 A and c= 12.991 A). Natural alumina 
is known as sapphire or ruby, depending on the types of impurities which give rise to color. The single 
crystal form of alumina has been used successfully to make implants [Kawahara, 1989; Park 1991]. Single 
crystal alumina can be made by feeding fine alumina powders onto the surface of a seed crystal which 
is slowly withdrawn from an electric arc or oxy-hydrogen flame as the fused powder builds up. Single 
crystals of alumina up to 10 cm in diameter have been grown by this method [Park and Lakes, 1992]. 

The strength of p oh/crystalline alumina depends on its grain size and porosity. Generally, the smaller 
the grains, the lower the porosity and the higher the strength [Park and Lakes, 1992] . The ASTM standards 
(F603-78) requires a flexural strength greater than 400 MPa and elastic modulus of 380 GPa (Table 38.5). 

Aluminum oxide has been used in the area of orthopaedics for more than twenty-five years [Hench, 
1991]. Single crystal alumina has been used in orthopaedics and dental surgery for almost 20 years. 
Alumina is usually a quite hard material, its hardness varies from 20 to 30 GPa. This high hardness 
permits its use as an abrasive (emery) and as bearings for watch movements [Park and Lakes, 1992]. 
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TABLE 38.3 Uses of Bioinert Bioceramics 



Bioinert Ceramics 

1 . In reconstruction of acetabular cavities. 

2. As bone plates and screws. 

3. In the form of ceramic-ceramic composites. 



4. In the form of ceramic-polymer composites. 

5. As drug delivery devices. 

6. As femoral heads. 



7. As middle ear ossicles. 

8. In the reconstruction of orbital rims. 

9. As components of total and partial hips. 

10. In the form of sterilization tubes. 

11. As ventilation tubes. 

12. In the repair of the cardiovascular area. 
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TABLE 38.4 Chemical Composition of 
Calcined Alumina 



TABLE 38.5 Physical Property Requirements 
of Alumina and Partially Stabilized Zirconia 



Chemicals 



Composition (Weight %) 



A1 2 3 
Si0 2 
FeA 
Na,0 



99.6 
0.12 
0.03 
0.04 



Source: Park, J.B., and Lakes, R.S. 1992. Ceramic 
Implant Materials In: Biomaterials An Introduction, 
2nd ed., p. 121 Plenum Press, New York. 



Properties 


Alumina 


Zirconia 


Elastic modulus (GPa) 


380 


190 


Flexural strength (GPa) 


>0.4 


1.0 


Hardness, Mohs 


9 


6.5 


Density (g/cm 3 ) 


3.8-3.9 


5.95 


Grain size (um) 


4.0 


0.6 



Note: Both ceramics contain 3 mole % Y 2 3 . 
Source: J.B. Park, personal communication, 1993. 



Both polycrystalline and single crystal alumina have been used clinically. The high hardness is accom- 
panied by low friction and wear and inertness to the in vivo environment. These properties make alumina 
an ideal material for use in joint replacements [Park and Lakes, 1992]. Aluminum oxide implants in 
bones of rhesus monkeys have shown no signs of rejection or toxicity for 350 days (Graves et al., 1972; 
Hentrich etal., 1971). One of the most popular uses for aluminum oxide is in total hip protheses. 
Aluminum oxide hip protheses with an ultra-high molecular weight polyethylene (UHMWPE) socket 
have been claimed to be a better device than a metal prostheses with a UHMWPE socket [Oonishi, 1992] . 
However, the key for success of any implant, besides the correct surgical implantation, is the highest 
possible quality control during fabrication of the material and the production of the implant [Hench, 
1991]. 

Zirconia (ZrO z ) 

Pure zirconia can be obtained from chemical conversion of zircon (ZrSi0 4 ), which is an abundant mineral 
deposit [Park and Lakes, 1992]. Zirconia has a high melting temperature (T m = 2953 K) and chemical 
stability with a = 5.145 A, b = 0.521 A, c= 5.311 A, and (3 = 99°14 in [Park and Lakes, 1992]. It undergoes 
a large volume change during phase changes at high temperature in pure form; therefore, a dopant oxide 
such as Y 2 3 is used to stabilize the high temperature (cubic) phase. We have used 6 mole% Y 2 3 as 
dopant to make zirconia for implantation in bone [Hentrich etal., 1971]. Zirconia produced in this 
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manner is referred to as partially stabilized zirconia [Drennan and Steele, 1991]. However, the physical 
properties of zirconia are somewhat inferior to that of alumina (Table 38.5). 

High density zirconia oxide showed excellent compatibility with autogenous rhesus monkey bone and 
was completely nonreactive to the body environment for the duration of the 350 day study [Hentrich 
et al., 1971]. Zirconia has shown excellent biocompatibility and good wear and friction when combined 
with ultra-high molecular weight polyethylene [Kumar et al., 1989; Murakami and Ohtsuki, 1989]. 

Carbons 

Carbons can be made in many allotropic forms: crys- 
talline diamond, graphite, noncrystalline glassy car- 
bon, and quasicrystalline pyrolitic carbon. Among 
these, only pyrolitic carbon is widely utilized for 
implant fabrication; it is normally used as a surface 
coating. It is also possible to coat surfaces with dia- 
mond. Although the techniques of coating with dia- 
mond have the potential to revolutionize medical 
device manufacturing, it is not yet commercially avail- 
able [Park and Lakes, 1992]. 

The crystalline structure of carbon, as used in 
implants, is similar to the graphite structure shown in 
Fig. 38.1. The planar hexagonal arrays are formed by 
strong covalent bonds in which one of the valence 
electrons or atoms is free to move, resulting in high 

but anisotropic electric conductivity. Since the bonding between the layers is stronger than the van der 
Waals force, it has been suggested that the layers are cross-linked. However, the remarkable lubricating 
property of graphite cannot be attained unless the cross-links are eliminated [Park and Lakes, 1992]. 

The poorly crystalline carbons are thought to contain unassociated or unoriented carbon atoms. The 
hexagonal layers are not perfectly arranged, as shown in Fig. 38.2. Properties of individual crystallites 
seem to be highly anisotropic. However, if the crystallites are randomly dispersed, the aggregate becomes 
isotropic [Park and Lakes, 1992]. 




O.I4l7nm*| :— — ■; 
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FIGURE 38.1 Crystal structure of graphite. (Source: 
Shobert E.I. 1964. Carbon and Graphite. New York, 
Academic Press.) 
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FIGURE 38.2 Schematic presentation of poorly crystalline carbon, (a) Single-layer plane, (b) Parallel layers in a 
crystallite, (c) Unassociated carbon, (d) An aggregate of crystallites, single layers and unassociated carbon. {Source: 
Bokros JC. 1972. Deposition Structure and Properties of Pyrolitic Carbon. Chemistry and Physics of Carbon. 5:70-81, 
New York, Marcel Dekker.) 
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FIGURE 38.3 Fracture stress versus density for unalloyed LTI pyrolite carbons. (Source: Kaae JL. 1971. Structure 
and Mechanical Properties of Isotropic Pyrolitic Carbon Deposited Below 1600°C. /. Nucl. Mater. 38:42-50.) 
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FIGURE 38.4 Elastic modulus versus density for unal- 
loyed LTI pyrolite carbons. {Source: Kaae JL. 1971. 
Structure and Mechanical Properties of Isotropic 
Pyrolitic Carbon Deposited Below 1600°C. /. Nucl. 
Mater. 38:42-50.) 
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The mechanical properties of carbon, especially pyrolitic carbon, are largely dependent on its density, 
as shown in Figs. 38.3 and 38.4. The increased mechanical properties are directly related to increased 
density, which indicates that the properties of pyrolitic carbon depend mainly on the aggregate structure 
of the material [Park and Lakes, 1992]. 

Graphite and glassy carbon have a much lower mechanical strength than pyrolitic carbon (Table 38.6). 
However, the average modulus of elasticity is almost the same for all carbons. The strength of pyrolitic 
carbon is quite high compared to graphite and glassy carbon. Again.this is due to the fewer number of 
flaws and unassociated carbons in the aggregate. 

Table 38.6 Properties of Various Types of Carbon 





Types of Carbon 


Properties 


Graphite 


Glassy 


Pyrolitic 8 


Density (g/cm 3 ) 
Elastic modulus (GPa) 
Compressive strength (MPa) 
Toughness (Mn/cm 3 ) b 


1.5-1.9 
24 
138 
6.3 


1.5 

24 

172 
0.6 


1.5-2.0 
28 

517 (575") 
4.8 



■ 1.0 w/o Si-alloyed pyrolitic carbon, Pyrolite™ (Carbomedics, Aus- 
tin, TX) 

b 1 m-N/cm 3 = 1.45 x 10" 3 in-lb/in 3 

Source: Park JB, Lakes RS. 1992. Ceramic Implant Materials In: Bio- 
materials An Introduction, 2nd ed, p. 133. Plenum Press, New York. 
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TABLE 38.7 Mechanical Properties of Carbon Fiber- Reinforced Carbon 







Fiber 


lay-up 




Property 


Unidirectional 


0-90 


D Crossply 


Flexural Modulus (GPa) 










Longitudinal 


140 






60 


Transverse 


7 






60 


Flexural Strength (MPa) 










Longitudinal 


1,200 






500 


Transverse 


15 






500 


Interlaminar shear strength (MPa) 


18 






18 



Source: Adams D. and Williams DF. 1978. Carbon Fiber-Reinforced Carbon 
as a Potential Implant Material. J. Biomed. Mater. Res. 12:38. 

A composite carbon which is reinforced with carbon fiber has been considered for making implants. 
However, the carbon-carbon composite is highly anisotropic, and its density is in the range of 1.4 to 
1.45 g/cm 3 with a porosity of 35 to 38% (Table 38.7). 

Carbons exhibit excellent compatibility with tissue. Compatibility of pyrolitic carbon-coated devices 
with blood have resulted in extensive use of these devices for repairing diseased heart valves and blood 
vessels (Park and Lakes, 1992). 

Pyrolitic carbons can be deposited onto finished implants from hydrocarbon gas in a fluidized bed at 
a controlled temperature and pressure. The anisotropy, density, crystallite size and structure of the 
deposited carbon can be controlled by temperature, composition of the fluidized gas, the bed geometry, 
and the residence time (velocity) of the gas molecules in the bed. The microstructure of deposited carbon 
should be highly controlled, since the formation of growth features associated with uneven crystallization 
can result in a weaker material (Fig. 38.5). It is also possible to introduce various elements into the 
fluidized gas and co-deposit them with carbon. Usually silicon (10 to 20 w/o) is co-deposited (or alloyed) 
to increase hardness for applications requiring resistance to abrasion, such as heart valve discs. 

Recently, success was achieved in depositing pyrolitic carbon onto the surfaces of blood vessel implants 
made of polymers. This type of carbon is called ultra low temperature isotropic (ULTI) carbon instead 
of low temperature isotropic (LTI) carbon. The deposited carbon has excellent compatibility with blood 
and is thin enough not to interfere with the flexibility of the grafts [Park and Lakes, 1992]. 

The vitreous or glassy carbon is made by controlled pyrolysis of polymers such as phenolformaldehyde, 
Rayon (cellulose), and polyacrylnitrite at a high temperature in a controlled environment. This process 
is particularly useful for making carbon fibers and textiles which can be used alone or as components of 
composites. 

38.3 Biodegradable or Resorbable Ceramics 

Although Plaster of Paris was used in 1892 as a bone substitute [Peltier, 1961], the concept of using 
synthetic resorbable ceramics as bone substitutes was introduced in 1969 [Hentrich et al., 1969; Graves 
et al., 1972]. Resorbable ceramics, as the name implies, degrade upon implantation in the host. The 
resorbed material is replaced by endogenous tissues. The rate of degradation varies from material to 
material. Almost all bioresorbable ceramics except Biocoral and Plaster of Paris (calcium sulfate dihydrate) 
are variations of calcium phosphate (Table 38.8). Examples of resorbable ceramics are aluminum calcium 
phosphate, coralline, Plaster of Paris, hydroxyapatite, and tricalcium phosphate (Table 38.8). 

Calcium Phosphate 

Calcium phosphate has been used in the form of artificial bone. This material has been synthesized and 
used for manufacturing various forms of implants, as well as for solid or porous coatings on other 
implants (Table 38.9). 
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FIGURE 38.5 Microstructure of carbons deposited in a fluidized bed. (a) A granular carbon with distinct growth 
features, (b) An isotropic carbon without growth features. Both under polarized light. 240X. (Source: Bokros JC, 
LaGrange LD, and Schoen GJ. 1972. Control of Structure of Carbon for Use in Bioengineering. Chemistry and Physics 
of Carbon. 9:103-171. New York, Marcel Dekker.) 

Calcium phosphate can be crystallized into salts such as hydroxyapatite and (3-whitlockite depending 
on the Ca:P ratio, presence of water, impurities, and temperature. In a wet environment and at lower 
temperatures (<900°C), it is more likely that hydroxyl- or hydroxyapatite will form, while in a dry 
atmosphere and at a higher temperature, (3-whitlockite will be formed [Park and Lakes 1992]. Both forms 
are very tissue compatible and are used as bone substitutes in a granular form or a solid block. The apatite 
form of calcium phosphate is considered to be closely related to the mineral phase of bone and teeth. 
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TABLE 38.8 Examples of Biodegradable Bioceramics 



Biodegradable or Resorbable Bioceramics 

1. Aluminum-Calcium-Phosphorous Oxides. 

2. Glass Fibers and their composites. 

3. Corals. 



4. Calcium Sulfates, including Plaster of Paris. 



5. Ferric Calcium Phosphorous Oxides. 



6. Hydroxyapatites. 



7. Tricalcium Phosphate. 



8. Zinc-Calcium-Phosphorous Oxides. 



9. Zinc-Sulfate-Calcium-Phosphorous Oxides. 
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Bajpai, 1983 

Peltier, 1961 

Scheidler and Bajpai, 1992 

Fuskietal., 1993 

Larrabee et al., 1993 

Strieker et al., 1992 

Bajpai and Fuchs, 1985 

Bajpai, 1983 

Jenei et al., 1986 

Riccietal., 1986 

Bajpai, 1983 

Bajpaietal., 1988 

Lemons etal., 1988 

Morris and Bajpai, 1989 

Araretal., 1989 

Bajpai, US. Patent No. 4778471 

Binzer and Bajpai, 1987 

Gromofsky et al., 1988 

Scheidler and Bajpai, 1992 



TABLE 38.9 Uses of Biodegradable Bioceramics 



Biodegradable or Resorbable Ceramics 
1 . As drug delivery devices. 



2. For repairing damaged bone due to disease or trauma. 



3. For filling space vacated by bone screws, donor bone, excised 

tumors, and diseased bone loss. 

4. For repairing and fusion of spinal and lumbo-sacral vertebrae. 

5. For repairing herniated discs. 

6. For repairing maxillofacial and dental defects. 

7. Hydroxyapatite Ocular Implants. 



References 

Abrams and Bajpai, 1994 
Bajpai, 1992 
Bajpai, 1994 
Benghuzzi et al., 1991 
Moldovan and Bajpai, 1994 
Nagy and Bajpai, 1994 
Bajpai, 1990 
Gromofsky et al., 1988 
Khavari and Bajpai, 1993 
Morris and Bajpai, 1987 
Scheidler and Bajpai, 1992 
Bajpai and Fuchs, 1985 
Riccietal., 1986 
Bajpaietal., 1984 
Yamamuro et al., 1988 
Bajpaietal., 1984 
Freeman et al., 1981 
De Potter et al., 1994 
Shields et al., 1993 



© 2000 by CRC Press LLC 




FIGURE 38.6 Hydroxyapatite structure projected down the c-axis onto the basal plane. (Source: Posner AS, Perloff 
A, and Diorio AD. 1958. Refinement of Hydroxyapatite structure, Acta. Cryst. 11:308-309.) 

The mineral part of bone and teeth is made of a crystalline form of calcium phosphate similar to 
hydroxyapatite [Ca 10 (PO 4 ) 6 (OH) 2 ]. The apatite family of mineral [A 10 (BO 4 ) 6 X 2 ] crystallizes into hexag- 
onal rhombic prisms and has unit cell dimensions a = 9.432 A and c= 6.881 A. The atomic structure of 
hydroxyapatite projected down the c-axis onto the basal plane is shown in Fig. 38.6. Note that the hydroxyl 
ions lie on the corners of the projected basal plane and they occur at equidistant intervals (3.44 A) along 
the columns perpendicular to the basal plane and parallel to the c-axis. Six of the ten calcium ions in 
the unit cell are associated with the hydroxyls in these columns, resulting in strong interactions among 
them [Park and Lakes, 1992]. 

The ideal Ca:P ratio of hydroxyapatite is 10:6 and the calculated density is 3.219 g/cm 3 . Substitution 
of OH with fluoride gives the apatite greater chemical stability due to the closer coordination of fluoride 
(symmetric shape) as compared to the hydroxyl (asymmetric, two atoms) by the nearest calcium. This 
is why fluoridation of drinking water helps in resisting caries of the teeth. [Park and Lakes, 1992]. 

The mechanical properties of synthetic calcium phosphates vary considerably (Table 38.10). The wide 
variations in properties of polycrystalline calcium phosphates are due to the variations in the structure 
and manufacturing processes. Depending on the final firing conditions, the calcium phosphate can be 
calcium hydroxyapatite or P-whitlockite. In many instances, both types of structures exist in the same 
final product [Park and Lakes, 1992]. 

Polycrystalline hydroxyapatite has a high elastic modulus (40 to 117 GPa). Hard tissue such as bone, 
dentin, and dental enamel are natural composites which contain hydroxyapatite (or a similar mineral), 
as well as protein, other organic materials, and water. Enamel is the stiffest hard tissue, with an elastic 
modulus of 74 GPa, and contains the most mineral. Dentin (E = 21 GPa) and compact bone (£ = 12 to 
18 GPa) contain comparatively less mineral. The Poisson's ratio for the mineral or synthetic hydroxy- 
apatite is about 0.27 which is close to that of bone (~0.3) [Park and Lakes, 1992]. 
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Table 38.10 Physical Properties of Calcium Phosphate 



Properties 

Elastic modulus (GPa) 
Compressive strength (MPa) 
Bending strength (MPa) 
Hardness (Vickers, GPa) 
Poisson's ratio 
Density (theoretical, g/cm 1 ) 



Values 

4.0-117 

294 

147 
3.43 
0.27 
3.16 



Source: Park JB and Lakes RS. 1992. Ceramic Implant 
Materials In: Biomaterials An Introduction, 2nd ed., p 125. 
Plenum Press, New York. 



Hontsu et al. (1997) were able to deposit an amorphous HA film on Ti, a-Al 2 3 , SiO//Si(100), and 
SrTi0 3 using a pulsed ArF excimer laser. Upon heat treatment the amorphous film was converted to the 
crystalline form of HA. The HA film's electrical properties were measured for the first time (Table 38.14). 

Among the most important properties of hydroxyapatite as a biomaterial is its excellent biocomaptibil- 
ity. Hydroxyapatite appears to form a direct chemical bond with hard tissues [Piattelli and Trisi, 1994]. 
On implantation of hydroxyapatite particles or porous blocks in bone, new lamellar cancellous bone 
forms within 4 to 8 weeks [Bajpai and Fuchs, 1985]. Scanning electron micrograph (500X) of a set and 
hardened hydroxyapatite-cysteine composite is shown in Fig. 38.7. The composite sets and hardens on 
addition of water. 

Many different methods have been developed to make precipitates of hydroxyapatite from an aqueous 
solution of Ca(NO,) 2 and NaH 2 P0 4 There has been successful use of modifications to Jarcho and 
colleagues wet precipitation procedure for synthesizing hydroxyapatites for use as bone implants [Jarcho 
et al., 1979; Bajpai and Fuchs, 1985], and drug delivery devices [Abrams and Bajpai, 1994; Bajpai, 1992, 
1994; Parker and Bajpai, 1993]. The dried, filtered precipitate is placed in a high-temperature furnace 




FIGURE 38.7 Scanning electron micrograph (X500) of a set and hardened hydroxyaptite (HA) -cysteine composite. 
The small white cysteine particles can be seen on the larger HA particles. 
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and calcined at 1150°C for one hour. The calcined powder is then ground in a ball mill, and the particles 
are separated by an automatic sieve shaker and sieves. The sized particles are then pressed in a die and 
sintered at 1200°C for 36 h for making drug delivery devices [Abrams and Bajpai, 1994; Bajpai, 1989, 
1992]. Above 1250°C, hydroxyapatite shows a second precipitation phase along the grain boundaries 
[Park and Lakes 1992]. 

Aluminum-Calcium-Phosphate (ALCAP) Ceramics: 

Initially we fabricated a calcium aluminate ceramic containing phosphorous pentoxide [Hentrich et al., 
1969, 1971; Graves et al., 1972]. Aluminum-calcium-phosphorous oxide ceramic (ALCAP) was devel- 
oped later [Bajpai and Graves, 1980]. ALCAP has insulating dielectric properties but no magnetic or 
piezoelectric properties [Allaire et al., 1989]. ALCAP ceramics are unique because they provide a multi- 
purpose crystallographic system where one phase of the ceramic on implantation can be more rapidly 
resorbed than the others [Bajpai, 1983; Mattie and Bajpai, 1988; Wyatt et al, 1976]. ALCAP is prepared 
from stock powders of aluminum oxide, calcium oxide, and phosphorous pentoxide. A ratio of 50:34:16 
by weight of A10 2 :CaO:P 2 5 is used to obtain the starting mixture for calcining at 1350°C in a high 
temperature furnace for 12 h. The calcined material is ground in a ball mill and sieved by an automatic 
siever to obtain particles of the desired size. The particulate powder is then pressed into solid blocks or 
hollow cylinders (green shape) and sintered at 1400°C for 36 h to increase the mechanical strength. 
ALCAP ceramic implants have given excellent results in terms of biocompatibility and gradual replace- 
ment of the ceramic material with endogenous bone [Bajpai, 1982; Mattie and Bajpai, 1988]. A scanning 
electron micrograph (1000X) of sintered porous ALCAP is shown in Fig. 38.8. 

Coralline 

Coral is a natural substance made by marine invertebrates. According to Holmes et al. [ 1984] , the marine 
invertebrates live in the limestone exostructure, or coral. The porous structure of the coral is unique for 
each species of marine invertebrate [Holmes et al., 1984]. Corals for use as bone implants are selected 




FIGURE 38.8 Scanning electron micrograph (X1000) of porous sintered ALCAP. 
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on the basis of structural similarity to bone [Holmes et al., 1984]. Coral provides an excellent structure 
for the ingrowth of bone, and the main component, calcium carbonate, is gradually resorbed by the body 
[Khavari and Bajpai, 1993]. Corals can also be converted to hydroxyapatite by a hydrothermal exchange 
process. Interpore 200, a coral hydroxyapatite, resembles cancellous bone [Sartoris et al., 1986]. Both 
pure coral (Biocoral) and coral transformed to hydroxyapatite are currently used to repair traumatized 
bone, replace diseased bone, and correct various bone defects. 

Biocoral is composed of crystalline calcium carbonate or aragonite, the metastable form of calcium 
carbonate. The compressive strength of Biocoral varies from 26 (50% porous) to 395 MPa (dense) and 
depends on the porosity of the ceramic. Likewise, the modulus of elasticity (Young's Modulus) of Biocoral 
varies from 8 (50% porous) to 100 GPa (dense) [Biocoral, 1989]. 

Tricalcium Phosphate (TCP) Ceramics: 

A multicrystalline porous form of [3-tricalcium phosphate [(3-Ca 3 (P0 4 ) 2 ] has been used successfully to 
correct periodontal defects and augment bony contours [Metsger et al., 1982]. X-ray diffraction of 
)3-tricalcium phosphate shows an average interconnected porosity of over lOOum [Lemons et al., 1979]. 
Often tribasic calcium phosphate is mistaken for )3-tricalcium phosphate. According to Metsger and 
colleagues [ 1982] , tribasic calcium phosphate is a nonstoichiometric compound often bearing the formula 
of hydroxyapatite [Ca 10 (PO 4 ) 6 (OH) 2 ]. 

[3-Tricalcium phosphate is prepared by a wet precipitation procedure from an aqueous solution of 
Ca(N0 3 ) 2 and NaH 2 P0 4 [Bajpai et al, 1988]. The precipitate is calcined at 1150°C for 1 h, ground, and 
sieved to obtain the desired size particles for use as bone substitutes [Bajpai et al., 1988; Bajpai, 1990] 
and for making ceramic matrix drug delivery systems [Morris and Bajpai, 1989; Nagy and Bajpai, 1994; 
Moldovan and Bajpai, 1994]. These particles are used as such or pressed into cylindrical shapes and 
sintered at 1 150 to 1200 C C for 36 h to achieve the appropriate mechanical strength for use as drug delivery 
devices [Bajpai, 1989, 1992, 1994; Benghuzzi et al., 1991]. A scanning electron micrograph (500X) of a 
set and hardened TCP-cysteine composite is shown in Fig. 38.9. The composite sets and hardens on the 
addition of water. TCP is usually more soluble than synthetic hydroxyapatite and, on implantation, allows 
good bone ingrowth and eventually is replaced by endogenous bone. 

Zinc-Calcium-Phosphorous Oxide (ZCAP) Ceramics 

Zinc is essential for human metabolism and is a component of at least 30 metalloenzymes [Pories and 
Strain, 1970]. In addition, zinc may also be involved in the process of wound healing [Pories and Strain 
1970]. Thus zinc-calcium-phosphorous oxide polyphasic ceramics (ZCAP) were synthesized to repair 
bone defects and deliver drugs [Binzer and Bajpai, 1987; Bajpai, 1988, 1993; Arar and Bajpai, 1992]. 
ZCAP is prepared by a thermal mixing of zinc oxide, calcium oxide, and phosphorous pentoxide powders 
[Bajpai, 1988]. ZCAP, like ALCAP, has insulating dielectric properties but no magnetic or piezoelectric 
properties [Allaire et al., 1989]. Various ratios of these powders have been used to produce the desired 
material [Bajpai, 1988]. The oxide powders are mixed in a ball mill and subsequently calcined at 800°C 
for 24 h. The calcined ceramic is then ground and sieved to obtain the desired size particles. Scanning 
electron micrograph (500X) of a set and hardened ZCAP-cysteine composite is shown in Fig. 38.10. The 
composite sets and hardens on addition of water. To date, ZCAP ceramics have been used to repair 
experimentally induced defects in bone and for delivering drugs [Binzer and Bajpai, 1987; Bajpai, 1993]. 

Zinc-Sulfate-Calcium-Phosphate (ZSCAP) Ceramics 

Zinc-sulfate-calcium-phosphate polyphasic ceramics (ZSCAP) are prepared from stock powders of zinc 
sulfate, zinc oxide, calcium oxide, and phosphorous pentoxide [Bajpai, 1988]. A ratio of 15:30:30:25 by 
weight of ZnS0 4 :ZnO:CaO:P 2 5 is mixed in a crucible and allowed to cool for 30 min after the exothermal 
reaction has subsided. The cooled mixture is calcined in a crucible at 650°C for 24 h. The calcined ceramic 
is ground in a ball mill and the particles of the desired size are separated by sieving in an automatic 
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FIGURE 38.9 Scanning electron micrograph (X500) of a set and hardened TCP-cysteine composite. The small 
white cysteine particles can be seen on the larger TCP particles. 




FIGURE 38.10 Scanning electron micrograph (X500) of a set and hardened ZCAP-cysteine composite. The small 
white cysteine particles have blended with the ZCAP particles. 
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FIGURE 38.11 Scanning electron micrograph (X2000) of a set and hardened ZSCAP particles (45-63 um). Sulfate 
is hardly visible between the cube-shaped ZCAP particles. 

siever. Scanning electron micrograph (2000X) of set and hardened ZSCAP particles (45 to 63 pm) is 
shown in Fig. 38.11. ZSCAP sets and hardens on addition of water. ZSCAP particles, on implantation in 
bone, set and harden on contact with blood and have been used to repair experimentally induced defects 
in bone [Scheidler and Bajpai, 1992]. 

Ferric-Calcium-Phosphorous-Oxide (FECAP) Ceramics 

Ferric-Calcium-Phosphorous-Oxide polyphasic ceramic (FECAP) is prepared from powders of ferric 
(III) oxide, calcium oxide, and phosphorous pentoxide [Fuski et al., 1993; Larrabee et al., 1993; Strieker 
et al.,1992] . The powders are combined in various ratios by weight and mixed in a blender. Blocks of the 
mixture are then pressed in a die by means of a hydraulic press and calcined at 1100°C for 12 h. The 
calcined ceramic blocks are crushed and ground in a ball mill. The calcined ceramic is ground in a ball 
mill and the particles of the desired size are separated by sieving in an automatic siever. A scanning 
electron micrograph (1000X) of a set and hardened FECAP-a ketoglutaric acid composite is shown in 
Fig. 38.12. The composite sets and hardens on the addition of water. Studies conducted to date suggest 
complete resorption of FECAP particles implanted in bone within 60 days [Larrabee et al., 1993]. This 
particular ceramic could be used in patients suffering from anemia and similar diseases [Fuski et al., 1993] . 

38.4 Bioactive or Surface-Reactive Ceramics 

Upon implantation in the host, surface reactive ceramics form strong bonds with adjacent tissue. Exam- 
ples of surface reactive ceramics are dense nonporous glasses, Bioglass and Ceravital, and hydroxyapatites 
(Table 38.11). One of their many uses is the coating of metal prostheses. This coating provides a stronger 
bonding to the adjacent tissues, which is very important for protheses. A list of the uses of surface-reactive 
ceramics is shown in Table 38.12. 
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FIGURE 38.12 Scanning electron micrograph (X1000) of a set and hardened FECAP-a- ketoglutaric acid compos- 
ite. Plate-shaped FECAP particles have been aggregated by the acid. 



Table 38.11 Examples of Surface Reactive Bioceramics 



Surface Reactive Bioceramics 



1. Bioglasses and Ceravital" 



2. Dense and non-porous glasses 



3. Hydroxyapatite 



References 

Ducheyne, 1985 
Gheyson et al., 1983 
Hench, 1991 
Hench, 1993 
Ogino et al., 1980 
Ritteretal., 1979 
Andersson etal., 1992 
Blencke et al., 1978 
Li etal, 1991 
Ohtsuki et al., 1992 
Ohura et al., 1992 
Schepers etal., 1993 
Takatsuko et al., 1993 
Bagambisa et al., 1993 
Bajpai, 1990 
Fredette et al., 1989 
Huaxiaetal., 1992 
Knowles and Bonfield, 1993 
Niwaetal., 1980 
Park and Lakes, 1992 
Posner et al., 1958 
Schwartz et al., 1993 
Whitehead et al., 1993 
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TABLE 38.12 Uses of Surface Reactive Bioceramics 



Surface Reactive Bioceramics 
1. For coating of metal prostheses. 



2. In reconstruction of dental defects. 



3. For filling space vacated by bone screws, donor bone, excised tumors, 

and diseased bone loss. 

4. As bone plates and screws. 



5. As replacements of middle ear ossicles. 



6. For lengthening of rami. 

7. For correcting periodontal defects. 

8. In replacing subperiosteal teeth. 



References 

Cotelletal., 1992 

Huaxia et al„ 1992 

Ritter et al„ 1979 

Takatsuko et al., 1993 

Whitehead et al, 1993 

Hulbert et al., 1987 

Gheysen et al., 1983 

Schepers et al., 1988 

Schepers et al., 1989 

Hulbert et al., 1987 

Schepers et al., 1993 

Terry etal., 1989 

Doyle, 1990 

Ducheyne and McGuckin, 1990 

Yamamuro etal., 1988 

Feenstra and de Groot, 1983 

Grote, 1987 

Hench, 1991 

Hench, 1993 

Reck et al., 1988 

Feenstra and de Groot, 1983 

Feenstra and de Groot, 1983 

Hulbert, 1992 

Hulbert, 1992 



Glass Ceramics 

Several variations of Bioglass and Ceravital glass ceramics have been used by various workers within the 
last decade. Glass ceramics used for implantation are silicon oxide based systems with or without 
phosphorous pentoxide. 

Glass ceramics are p oh/crystalline ceramics made by controlled crystallization of glasses developed by 
S.D. Stookey of Corning Glass Works in the early 1960s [Park and Lakes, 1992] . Glass ceramics were first 
utilized in photosensitive glasses, in which small amounts of copper, silver, and gold are precipitated by 
ultraviolet light irradiation. These metallic precipitates help to nucleate and crystallize the glass into a 
fine grained ceramic which possesses excellent mechanical and thermal properties. Both Bioglass and 
Ceravital glass ceramics have been used as implants [Yamamuro et al., 1990]. 

The formation of glass ceramics is influenced by the nucleation and growth of small ( < 1 um diameter) 
crystals as well as the size distribution of these crystals. It is estimated that about 10 12 to 10 15 nuclei per 
cubic centimeter are required to achieve such small crystals. In addition to the metallic agents already 
mentioned, Pt groups, Ti0 2 , Zr0 2 , and P 2 5 are widely used for nucleation and crystallization. The 
nucleation of glass is carried out at temperatures much lower than the melting temperature. During 
processing the melt viscosity is kept in the range of 10 11 and 10 12 Poise for 1 to 2 h. In order to obtain a 
larger fraction of the microcrystalline phase, the material is further heated to an appropriate temperature 
for maximum crystal growth. Deformation of the product, phase transformation within the crystalline 
phases, or redissolution of some of the phases should be avoided. The crystallization is usually more than 
90% complete with grain sizes 0.1 to 1 um. These grains are much smaller than those of conventional 
ceramics. Figure 38.13 shows a schematic representation of temperature-time cycle for a glass ceramic 
[Park and Lakes, 1992]. 

The glass ceramics developed for implantation are Si0 2 -CaO-Na 2 0-P 2 5 and Li 2 0-ZnO-Si0 2 systems. 
Two major groups are experimenting with the Si0 2 -CaO-Na 2 0-P 2 5 glass ceramic. One group varied 
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FIGURE 38.13 Temperature-time cycle for a glass-ceramic. (Source: Kingery WD, Bowen HK, and Uhlmann DR. 
1976. Introduction to Ceramics, 2nd ed., p.368. New York, J. Wiley and Sons.) 

TABLE 38.13 Compositions of Bioglass and Ceravital Glass Ceramics 

Type Code SiO, CaO Na,0 P,0 5 MgO K,0 

Bioglass 



42S5.6 


42.1 


29.0 


26.3 


2.6 


(45S5)46S5.2 


46.1 


26.9 


24.4 


2.6 


49S4.9 


49.1 


25.3 


23.8 


2.6 


52S4.6 


52.1 


23.8 


21.5 


2.6 


55S4.3 


55.1 


22.2 


20.1 


2.6 


60S3.8 


60.1 


19.6 


17.7 


2.6 


'Bioactive 


40-50 


30-35 


5-10 


10-15 


"Nonbioactive 


30-35 


25-30 


3.5-7.5 


7.5-12 



Ceravital "Bioactive 40-50 30-35 5-10 10-15 2.5-5 0.5-3 

1-2.5 0.5-2 

"The Ceravital® composition is in weight % while the Bioglass® compositions are in 
mol %. 

"In addition A1 2 3 (5.0-15.0), TiO 2 (1.0-5.0) and Ta 2 O s (5-15) are added. 

Source: Park JB and Lakes RS. 1992. Ceramic Implant Materials In: Biomaterials An 
Introduction, 2nd ed., p. 127. Plenum Press, New York. 

the compositions (except for P 2 5 ) in order to obtain the best glass ceramic composition for inducing 
direct bonding with bone (Table 38.13). The bonding to bone is related to the simultaneous formation 
of a calcium phosphate and Si0 2 -rich film layer on the surface, as exhibited by the 46S5.2 type Bioglass. 
If a Si0 2 -rich layer forms first and a calcium phosphate film develops later (46 to 55 mol % Si0 2 samples) 
or no phosphate film is formed (60 mole % Si0 2 ) then direct bonding with bone does not occur [Park 
and Lakes, 1992]. The approximate region of the Si0 2 -CaO-Na 2 system for the tissue-glass-ceramic 
reaction is shown in Fig. 38.14. As can be seen, the best region (region A) for good tissue bonding is the 
composition given for 46S5.2 type Bioglass (see Table 38.13) [Park and Lakes, 1992]. 

Ceravital 

The composition of Ceravital is similar to that of Bioglass in Si0 2 content but differs somewhat in other 
components (see Table 38.13). In order to control the dissolution rate, A1 2 3 , Ti0 2 , and Ta 2 5 are added in 
Ceravital glass ceramic. The mixtures, after melting in a platinum crucible at 1500°C for three h, are annealed 
and cooled. The nucleation and crystallization temperatures are 680°C and 750°C, respectively, each for 
24 h. When the size of crystallites reaches approximately 4 A and the characteristic needle structure is not 
formed, the process is stopped to obtain a fine grain structured glass ceramic. [Park and Lakes, 1993] 

Glass ceramics have several desirable properties compared to glasses and ceramics. The thermal 
coefficient of expansion is very low, typically 10~ 7 to 10~ 5o C _1 , and in some cases it can even be made 
negative. Due to the controlled grain size and improved resistance to surface damage, the tensile strength 
of these materials can be increased by at least a factor of two, from about 100 to 200 MPa. The resistance 
to scratching and abrasion of glass ceramics is similar to that of sapphire [Park and Lakes, 1992]. 
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FIGURE 38.14 Approximate regions of the tissue-glass-ceramic bonding for the Si0 2 -CaO-Na 2 system. A: Bond- 
ing within 30 days. B: Nonbonding; reactivity is too low. D: Bonding does not form glass. (Source: Hench LL, Ethridge 
EC. 1982. Biomateriah: An Interfacial Approach. p.147, New York, Academic Press.) 

A transmission electron micrograph of Bioglass glass ceramic implanted in the femur of rats for six 
weeks showed intimate contacts between the mineralized bone and the Bioglass (Fig. 38.15). The mechan- 
ical strength of the interfacial bond between bone and Bioglass ceramic is on the same order of magnitude 
as the strength of the bulk glass ceramic (850 kg/cm 2 or 83.3 MPa), which is about three-fourths that of 
the host bone strength [Park and Lakes, 1992]. 

A negative characteristic of the glass ceramic is its brittleness. In addition, limitations on the compo- 
sitions used for producing a biocompatibile (or osteoconductive) glass ceramic hinders the production 
of a glass ceramic which has substantially higher mechanical strength. Thus glass ceramics cannot be 
used for making major load-bearing implants such as joint implants. However, they can be used as fillers 
for bone cement, dental restorative composites, and coating material (see Table 38.12). A glass ceramic 
containing 36 wt% of magnetite in a (3-wollastonite- and CaOSi0 2 -based glassy matrix has been synthe- 
sized for treating bone tumors by hyperthermia [Kokubo et al., 1992]. 

38.5 Deterioration of Ceramics 

It is of great interest to know whether the inert ceramics such as alumina undergo significant static or 
dynamic fatigue. Even for the biodegradable ceramics, the rate of degradation in vivo is of paramount 
importance. Controlled degradation of an implant with time on implantation is desirable. Above a critical 
stress level, the fatigue strength of alumina is reduced by the presence of water. This is due to the delayed 
crack growth, which is accelerated by the water molecules [Park and Lakes, 1992]. Reduction in strength 
occurs if water penetrates the ceramic. Decrease in strength was not observed in samples which did not 
show water marks on the fractured surface (Fig. 38.16). The presence of a small amount of silica in one 
sample lot may have contributed to the permeation of water molecules that is detrimental to the strength 
[Park and Lakes, 1992]. It is not clear whether the static fatigue mechanism operates in single crystal 
alumina. It is reasonable to assume, that static fatigue will occur if the ceramic contains flaws or impurities, 
because these will act as the source of crack initiation and growth under stress [Park and Lakes, 1992]. 

Studies of the fatigue behavior of vapor-deposited pyrolitic carbon fibers (4000 to 5000 A thick) onto 
a stainless steel substrate showed that the film does not break unless the substrate undergoes plastic 
deformation at 1.3 x 10~ 2 strain and up to one million cycles of loading. Therefore, the fatigue is closely 
related to the substrate, as shown in Fig. 38.17. Similar substrate-carbon adherence is the basis for the 
pyrolitic carbon deposited polymer arterial grafts [Park and Lakes, 1992]. 

The fatigue life of ceramics can be predicted by assuming that the fatigue fracture is due to the slow 
growth of preexisting flaws. Generally, the strength distribution, Cj, of ceramics in an inert environment 
can be correlated with the probability of failure F by the following equation: 
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FIGURE 38.15 Transmission electron micrograph of well-mineralized bone (b) juxtaposed to the glass-ceramic (c) 
which fractured during sectioning. X5 1,500. Insert a is the diffraction pattern from ceramic area and b is from bone 
area. (Source: Beckham CA, Greenlee TK Jr, Crebo AR. 1971. Bone Formation at a Ceramic Implant Interfaces, Calc. 
Tiss. Res. 8:165-171.) 
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Both m and s are constants in the equation. Figure 38.18 shows a good fit for Bioglass coated alumina 
[Park and Lakes, 1992]. 

A minimum service life (t min ) of a specimen can be predicted by means of a proof test wherein it is 
subjected to stresses that are greater than those expected in service. Proof tests also eliminate the weaker 
pieces. This minimum life can be predicted from the following equation. 



(38.2) 



Here ct is the proof test stress, a a is the applied stress, and B and N are constants. 
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FIGURE 38.16 Flexural strength of dense alumina rods after aging under stress in Ringer's solution. Lot 1 and 2 
are from different batches of production. (Source: Krainess FE, Knapp WJ. 1978. Strength of a Dense Alumina Ceramic 
after Aging in vitro. J. Biomed. Mater. Res. 12:245.) 
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FIGURE 38.17 Strain versus number of cycles to failure (o = absence of fatigue cracks in carbon film; • = fracture 
of carbon film due to fatigue failure of substrates; □ = data from substrate determined in single-cycle tensile test). 
(Source: Shimm HS, Haubold AD. 1980. The Fatigue Behavior of Vapor Deposited Carbon Films. Biomater. Med. Dev. 
Art. Org. 8:333-344.) 
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FIGURE 38.18 Plot of Ln Ln [1/(1-F)] versus Ln s for Bioglass-coated alumina in a tris-hydroxyaminomethane 
buffer and liquid nitrogen. Pis the probability of failure and S is strength. (Source: Ritter JE Jr, Greenspan DC, Palmer 
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Equation (38.2) after rearrangement, reads as follows: 

t . 2 = B(0 /a) (38.3) 

min a \ p / a J v ' 

Figure 38.19 shows a plot of Eq. (38.3) for alumina on a logarithmic scale [Park and Lakes, 1992]. 

38.6 Bioceramic Manufacturing Techniques 

In order to fabricate bioceramics in more and more complex shapes, scientists are investigating the use 
of old and new manufacturing techniques. These techniques range from the adaptation of an age old 
pottery technique to the latest manufacturing methods for high temperature ceramic parts for airplane 
engines. No matter where the technique is perfected, the ultimate goal is the fabrication of bioceramic 
particles or devices in a desired shape in a consistent manner with the desired properties. The technique 
used to fabricate the bioceramic device will depend greatly on the ultimate application of the device, 
whether it is for hard tissue replacement or the integration of the device within the surrounding tissue. 

Hard Tissue Replacement 

Hard tissue replacement implies that the bioceramic device will be used for load bearing applications. 
Although it is desirable to have a device with a sufficient porosity for the surrounding tissue to infiltrate 
and attach to the device, the most important and immediate property is the strength of the device. In 
order to accomplish this, one must manufacture a bioceramic implant with a density and strength 
sufficient to mimic that of bone. However, if the bioceramic part is significantly stronger than the 
surrounding bone, one runs into the common problem seen with metals called stress shielding. The 
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TABLE 38.14 Electrical properties of an HA Film 

Dielectric constant (s r ) 5.7 (25°C 1 MHz) 

Loss tangent (tan 5) <2% 

Breakdown electric field 10 4 Vcnr' 

Source: Hontsu etal., 1997. 

density of the bioceramic greatly determines its overall strength. As the density increases so does the 
overall strength of the bioceramic. Some of the techniques used to manufacture dense bioceramics are 
injection molding, gel casting, bicontinuous microemulsion, inverse microemulsion, emulsion, and additives. 

Injection molding is a common technique used to form plastic parts for many commercial applications 
such as automobile parts. Briefly, the process involves forcing a heated material into a die and then 
ejecting the formed piece from the die. Injection molding allows for making complex shapes. Cihlar and 
Trunec (1996) found that by calcining (1273 K, 3 hr) and milling the hydroxyapatite (HA) prior to mixing 
with a binder, an ethylene vinyl acetate copolymer (EVA)/HA mixture of 63% HA, they achieved 98% 
relative density with only 16% shrinkage using injection molding. The maximum flexural strength was 
60 MPa for HA products sintered at 1473 K. However, this is still not strong enough for load bearing 
applications. They also observed that HA decomposed to a-TCP at temperatures greater than 1573 K 
[Cihlar and Trunec, 1996]. 

In gel casting, HA is formed using the standard chemical precipitation. The calcium phosphate precip- 
itant (30% w/v) is then mixed with glycerol and filtered. The 'gel cake' is sintered at 1200°C for two hours. 
This yielded a density of >99% and a highly uniform microstructure [Varma and Sivakumar, 1996]. 

Bicontinuous microemulsion, inverse microemulsion, and emulsion are all wet chemistry based meth- 
ods to produce nanometer size HA powders. All three methods yield >97% relative density upon sintering 
at 1200°C for two hours. The biocontinuous and inverse microemulsion resulted in the two smallest HA 
particle sizes, 22 and 24 nm respectively [Lim et al., 1997]. 

Another strategy to increase the density of ceramics is to use additives or impurities in small weight 
percents during sintering. The main disadvantages to this technique include (1) the possible decompo- 
sition of the original pure bioceramic and (2) results may end in all or portions of the bioceramic being 
non-biocompatible. 

Suchanek etal. (1997) studied the addition of several different additives to HA in 5 wt% amounts. 
The additives studied were K 2 C0 3 , Na 2 C0 3 , H 3 B0 3 , KF, CaCl 2 , KC1, KH 2 P0 4 , (KP0 3 ) n , Na 2 Si 2 5 , Na 2 P 2 7 , 
Na 3 P0 4 , (NaP0 3 ) n , Na 5 P 3 O 10 , and (3-NaCaP0 4 . HA has a fracture toughness of 1 MPa-m 1 ' 2 , whereas 
human bone has a fracture toughness of 2 to 12 MPa-m 1 ' 2 . One of the ways to improve the mechanical 
properties is to improve the densification of HA. Suchanek et al. (1997) found that the following additives 
(5 wt.%) did not improve the densification of HA: H 3 BO„ CaCl 2 , KC1, KH 2 P0 4 , (KP0 3 ) n , and Na 2 Si 2 5 . 
The densification of HA was improved through the addition (5 wt.%) of K 2 C0 3 , Na 2 C0 3 , KF, Na 2 P 2 7 , 
Na 3 P0 4 , (NaP0 3 ) n , Na 5 P 3 O 10 , and P-NaCaP0 4 . However, H 3 B0 3 , CaCl 2 , KH 2 P0 4 , (KP0 3 ) n , Na 2 Si 2 5 , 
K 2 C0 3 , Na 2 C0 3 , and KF produced the formation of P-TCP or CaO. The sodium phosphates used in this 
study were added to HA without the formation of P-TCP or CaO. The only compound that improved 
densification, did not cause formation of P-TCP or CaO, and provided a weak interface for HA was P~ 
NaCaP0 4 . 

Another additive that has been investigated to improve the performance of HA is lithium (Li). The 
addition of lithium can increase the microhardness and produces a fine microstructure in HA. Fanovich 
etal. (1998) found that the addition of 0.2 wt% of Li to HA produced the maximum microhardness 
(5.9 GPa). However, the addition of high amounts of Li to HA results in abnormal grain growth and 
large pores. Furthermore, Li addition to HA results in the formation of P-TCP upon sintering. 

Zirconia has been used as an additive to HA in order to improve its mechanical strength. Kawashima 
et al. (1997) found that addition of partially stabilized zirconia (PSZ) to HA can be used to increase the 
fracture toughness to 2.8 MPa-m 1 ' 2 . Bone has a fracture toughness of 2-12 MPa-m 1 ' 2 [Suchanek etal., 
1997]. PSZ was added to HA in different percentages (17, 33, 50 wt%) and it was found that 50 wt% 
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Table 38.15 Bioceramic Manufacturing Techniques for Hard Tissue 
Replacement or Tissue Integration 

Manufacturing Technique References 

Hard Tissue Replacement 

Injection Molding Cihlar and Trunec, 1996 

Bicontinuous microemulsion Lim etal., 1997 

Inverse microemulsion 
Emulsion 

Additives Fanovich et al., 1998; Kawashima et al., 

1997; Suchanek et al., 1997 

Tissue Integration 

Drip casting Liu, 1996; Lyckfeldt and Ferreira, 1998 

Starch consolidation Lyckfeldt and Ferreira, 1998] 

Polymeric sponge method 

Foaming method 

Organic additives 

Gel casting 

Slip casting 

Direct coagulation consolidation 

Hydrolysis assisted solidification 

Freezing 

PSZ had the highest fracture toughness. The surface energy of the PSZ-HA was not significantly different 
from HA alone. This suggests that the PSZ-HA composite could be biocompatible because of the similarity 
of the surface with HA [Kawashima et al., 1997]. 

Tissue Integration 

The porosity is a critical factor for growth and integration of a tissue into the bioceramic implant. In 
particular the open porosity, that which is connected to the outside surface, is critical to the integration 
of tissue into the ceramic especially if the bioceramic is inert. Several methods have been developed to 
form porous ceramics, two of these are starch consolidation and drip casting. 

In starch consolidation (Table 38.15), starch powders of a specific size are mixed with a bioceramic 
slurry at a predetermined weight percent. Upon heating, the starch will uptake water from the slurry 
mixture and swell. Upon sintering of the starch-bioceramic mixture, the starch is burned out and the 
pores are left in their place. Starch consolidation has been used to form complex shapes in alumina with 
ultimate porosities between 23 and 70 volume %. By controlling the starch content, one can control the 
ultimate porosity and resulting pore sizes. Large pores formed using starch consolidation in alumina 
were in the size range 10 to 80 um whereas small pores varied between 0.5 to 9.5 um [Lyckfeldt and 
Ferreira, 1998]. 

Liu (1996) used a drip casting technique (Table 38.15) to form porous HA granules with pore sizes from 
95 to 400 um. The granules had a total porosity from 24 to 76 volume %. The HA was made into a slurry 
using water and poly( vinyl butyral) powders. The slurry was then dripped onto a spherical mold surface. 
This technique is similar to that of drip casting by dripping an HA slurry into a liquid nitrogen bath. In 
both instances, calcining and sintering procedures are used to produce the final product [Liu, 1996]. 
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In Memory Of 

In memory of Dr. P.K. Bajpai who left us early in 1998, I would like to share our lab's recipe for 
hydroxyapatite. Dr. Bajpai's lab and research at the University of Dayton have ended after 30 plus years, 
but I felt it would be fitting to share this recipe as a way to encourage other scientists to continue exploring 
the possibilities of bioceramics. 

Hydroxyapatite Synthesis Method 

Prepare the following solutions: 

Solution 1: Dissolve 157.6 g calcium nitrate tetrahydrate [Ca(N0 3 ) 2 4H 2 0] in 500 ml DI water. Bring 
the solution to a pH of 11 by adding » 70 ml ammonium hydroxide [NH 4 OH]. Bring 
the solution to 800 ml with DI water. 

Solution 2: Dissolve 52.8 g ammonium phosphate dibasic [(NH 4 ) 2 HP0 4 ] in 500 ml DI water. Bring 
the solution to a pH of 11 by adding ~ 150 ml ammonium hydroxide [NH 4 OH]. Add 
DI water until the precipitate is completely dissolved, 250-350 ml. 

Special Note: If you use calcium nitrate [Ca(N0 3 ) 2 nH20] instead of calcium nitrate tetrahydrate you 
need to recalculate the amount of calcium nitrate to add to make Solution 1 on the basis of the absence 
of the extra four waters. If you do not, you will have to add a large amount of ammonium hydroxide to 
pH the solution. 

1. Add one-half of Solution 1 to a 2 L separatory funnel. 

2. Add one-half of Solution 2 to a 2 L separatory funnel. 

3. Titrate both solutions into a 4 L beaker with heat and constant stirring. 

4. Boil gently for 30 min. 

5. Repeat steps 1-4 for the rest of Solutions 1 and 2. 

6. Let cool completely allowing precipitate to settle to bottom of beaker. 

7. Pour contents of beaker into 250 ml polypropylene bottles. 

8. Centrifuge bottles for 10 min at (10,000 rpm) 16,000 g. 

9. Collect precipitate from the six bottles into two and resuspend with DI water. 

10. Fill the four empty bottles with the reaction mixture and centrifuge all six as before. 

11. Collect precipitant from the two bottles that were resuspended with DI water. 

12. Combine remaining four bottles into two bottles and resuspend with DI water. 

13. Repeat steps 10-12 as necessary. 

14. Dry precipitate for 24-48 h at 70°C. 

15. Calcine the precipitate for one hour at 1140°C. 

16. Grind and sieve product as desired. 

Defining Terms 

Alumina: Aluminum oxide (A1 2 3 ) which is very hard (Mohs hardness is 9) and strong. Single crystals 
are called sapphire or ruby depending on color. Alumina is used to fabricate hip joint socket 
components or dental root implants. 

Calcium phosphate: A family of calcium phosphate ceramics including aluminum calcium phosphate, 
ferric calcium phosphate, hydroxyapatite and tricalcium phosphate (TCP), and zinc calcium phos- 
phate which are used to substitute or augment bony structures and deliver drugs. 

Glass-ceramics: A glass crystallized by heat treatment. Some of those have the ability to form chemical 
bonds with hard and soft tissues. Bioglass and Ceravital are well known examples. 

Hydroxyapatite: A calcium phosphate ceramic with a calcium to phosphorus ratio of 5/3 and nominal 
composition Ca 10 (PO 4 ) 6 (OH) 2 . It has good mechanical properties and excellent biocompatibility. 
Hydroxyapatite is the mineral constituent of bone. 
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LTI carbon: A silicon alloyed pyrolitic carbon deposited onto a substrate at low temperature with 
isotropic crystal morphology. It is highly compatible with blood and used for cardiovascular 
implant fabrication such as artificial heart valve. 

Maximum radius ratio: The ratio of atomic radii computed by assuming the largest atom or ion which 
can be placed in a crystal's unit cell structure without deforming the structure. 

Mohs scale: A hardness scale in which 10 (diamond) is the hardest and 1 (talc) is the softest. 
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Introduction 

Polymerization and Basic Structure 
Polymerization • Basic Structure • Effect of Structural 
Modification on Properties 

Polymers Used as Biomaterials 
Polyvinylchloride (PVC) • Polyethylene (PE) ■ 
Polypropylene (PP) • Polymethylmethacrylate 
(PMMA) • Polystyrene (PS) and its Co-Polymers • 
Polyester • Polyamide (Nylon) ■ Fluorocarbon 
Polymers ■ Rubbers ■ Polyurethanes • Polyacetal, 
Polysulfone, and Polycarbonate ■ Biodegradable Polymers 

Sterilization 

Surface Modifications for Improving 

Biocompatibility 

Chemogradient Surfaces for Cell and Protein 

Interaction 



Synthetic polymeric materials have been widely used in medical disposable supply, prosthetic materials, 
dental materials, implants, dressings, extracorporeal devices, encapsulants, polymeric drug delivery sys- 
tems, tissue engineered products, and orthodoses as that of metal and ceramics substituents [Lee, 1989]. 
The main advantages of the polymeric biomaterials compared to metal or ceramic materials are ease of 
manufacturability to produce various shapes (latex, film, sheet, fibers, etc.), ease of secondary process- 
ability, reasonable cost, and availability with desired mechanical and physical properties. The required 
properties of polymeric biomaterials are similar to other biomaterials, that is, biocompatibility, steriliz- 
ability, adequate mechanical and physical properties, and manufacturability as given in Table 39.1. 

The objectives of this chapter are (1) the review of basic chemical and physical properties of the 
synthetic polymers, (2) the sterilization of the polymeric biomaterials, (3) the importance of the surface 
treatment for improving biocompatability, and (4) the application of the chemogradient surface for the 
study on cell to polymer interactions. 

39.2 Polymerization and Basic Structure 
Polymerization 

In order to link the small molecules one has to force them to lose their electrons by the chemical processes 
of condensation and addition. By controlling the reaction temperature, pressure, and time in the presence 
of catalyst(s), the degree to which repeating units are put together into chains can be manipulated. 
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TABLE 39.1 Requirements for Biomedical Polymers 



Properties Description 

Biocompatibility Noncarcinogenesis, nonpyrogenicity, nontoxicity, and nonallergic response 

Sterilizability Autoclave, dry heating, ethylenoxide gas, and radiation 

Physical property Strength, elasticity, and durability 

Manufacturability Machining, molding, extruding, and fiber forming 

Modified from Ikada [1989]. 



TABLE 39.2 Typical Condensation Polymers 



Type 


Interunit Linkage 


Polyester 







-C-O- 


Polyamide 


O H 




-C-N- 


Polyurea 


H O H 




-N-C-N- 


Polyurethane 


O H 




-O-C-N- 


Polysiloxane 


R 




-Si-O- 




R 


Protein 


O H 




-C-N- 



Condensation or Step Reaction Polymerization 

During condensation polymerization a small molecule such as water will be condensed out by the 
chemical reaction. For example: 

R-NH 2 + R'COOH -> R'CONHR+ H 2 (39.1) 

(amine) I carboxylic acid) (amide) (condensed molecule) 

This particular process is used to make polyamides (Nylons). Nylon was the first commercial polymer, 
made in the 1930s. 

Some typical condensation polymers and their interunit linkages are given in Table 39.2. One major 
drawback of condensation polymerization is the tendency for the reaction to cease before the chains 
grow to a sufficient length. This is due to the decreased mobility of the chains and reactant chemical 
species as polymerization progresses. This results in short chains. However, in the case of Nylon the 
chains are polymerized to a sufficiently large extent before this occurs and the physical properties of the 
polymer are preserved. 

Natural polymers, such as polysaccharides and proteins are also made by condensation polymerization. 
The condensing molecule is always water (H 2 0). 

Addition or Free Radical Polymerization 

Addition polymerization can be achieved by rearranging the bonds within each monomer. Since each 
'mer' has to share at least two covalent electrons with other mers the monomer should have at least one 
double bond. For example, in the case of ethylene: 
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TABLE 39.3 Monomers for Addition Polymerization and Suitable Processes 

Monomer Chemical Polymerization Mechanism 

Names Structure Radical Cationic Anionic Coordination 



Acrylonitrile 


CH 2 =CH 




C=N 


Ethylene 


CH 2 =CH 2 


Methacrylate 


CH 2 =CH 




COOCH3 


Methylmethacrylate 


CH 2 =CCH, 




COOCH3 


Propylene 


CH 2 =CH 




CH 


Styrene 


CH 2 =CH 




CH 


Vinylchloride 


CH 2 =CH 




CI 




CI 


Vinylidenechloride 


CH,=C 




CI 



+: high polymer formed. -: no reaction or oligomers only. 
Modified from Billmeyer [1984]. 



H H H H H H 

n {C=C} -> -C4C-C4C- (39.2) 

1 111 

H H H H H H 

The breaking of a double bond can be made with an initiator. This is usually a free radical such as 
benzoyl peroxide (H 5 C 6 COO-OOCC 6 H 5 ). The initiation can be activated by heat, ultraviolet light, and 
other chemicals. The free radicals (initiators) can react with monomers and this free radical can react 
with another monomer and the process can continue on. This process is called propagation. The prop- 
agation process can be terminated by combining two free radicals, by transfer, or by disproportionate 
processes. Some of the free radical polymers are given in Table 39.3. There are three more types of 
initiating species for addition polymerization beside free-radicals; cations, anions, and coordination 
(stereospecific) catalysts. Some monomers can use two or more of the initiation processes but others can 
use only one process as given in Table 39.3. 

Basic Structure 

Polymers have very long chain molecules which are formed by covalent bonding along the backbone 
chain. The long chains are held together either by secondary bonding forces such as van der Waals and 
hydrogen bonds or primary covalent bonding forces through crosslinks between chains. The long chains 
are very flexible and can be tangled easily. In addition, each chain can have side groups, branches and 
copolymeric chains or blocks which can also interfere with the long-range ordering of chains. For 
example, paraffin wax has the same chemical formula as polyethylene (PE) [(CH 2 CH 2 )J, but will 
crystallize almost completely because of its much shorter chain lengths. However, when the chains become 
extremely long {from 40 to 50 repeating units [-CH 2 CH 2 -] to several thousands as in linear PE} they 
cannot be crystallized completely (up to 80 to 90% crystallization is possible). Also, branched PE in 
which side chains are attached to the main backbone chain at positions normally occupied by a hydrogen 
atom, will not crystallize easily due to the steric hindrance of side chains resulting in a more noncrystalline 
structure. The partially crystallized structure is called semicrystalline which is the most commonly 
occurring structure for linear polymers. The semicrystalline structure is represented by disordered 
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Crystalline(folded & extended chains) 
Amorphous(uncoordinated chains) 



FIGURE 39.1 Fringed-micelle model of a linear polymer with semi-crystalline structure. 

noncrystalline (amorphous) regions and ordered crystalline regions which may contain folded chains as 
shown in Fig. 39.1. 

The degree of polymerization (DP) is defined as an average number of mers, or repeating units, per 
molecule, i.e., chain. Each chain may have a different number of mers depending on the condition of 
polymerization. Also, the length of each chain may be different. Therefore, it is assumed there is an 
average degree of polymerization or average molecular weight (MW). The relationship between molecular 
weight and degree of polymerization can be expressed as: 

MW of polymer = DP x MW of mer (or repeating unit) (39.3) 

The two average molecular weights most commonly used are defined in terms of the numbers of 
molecules, Ni, having molecular weight, Mi; or wi, the weight of species with molecular weights Mi as 
follows: 

1. The number- average molecular weight, Mn, is defined by the equation, 

w EMM/' „ .,,. EWi 1 

Mn = = E«iMj = — 7 , v = — i — . — r (39.4) 

EMM/' L\Wi/Mij L[wi/ Mi] 

2. The weight average molecular weight, Mw, is defined by the equation, 

LWiMi „ .,,. EMM/' 2 

Mw = =Ew/M/ = (39.5) 

ZWi EMMi 

An absolute method of measuring the molecular weight is one that depends on theoretical consider- 
ations, counting molecules and their weight directly. The relative methods require calibration based on 
an absolute method and include intrinsic viscosity and gel permeation chromatography (GPC). Absolute 
methods of determining the number-average molecular weight (Mn) include osmometry and other 
colligative methods, and end group analysis. Light-scattering yields an absolute weight-average molecular 
weight (Mw). 

As the molecular chains become longer by the progress of polymerization, their relative mobility 
decreases. The chain mobility is also related to the physical properties of the final polymer. Generally, 
the higher the molecular weight, the less the mobility of chains which results in higher strength and 
greater thermal stability. The polymer chains can be arranged in three ways; linear, branched, and a cross- 
linked (or three-dimensional) network as shown in Fig. 39.2. Linear polymers such as polyvinyls, polya- 
mides, and polyesters are much easier to crystallize than the cross-linked or branched polymers. However, 
they cannot be crystallized 100% as with metals. Instead they become semicrystalline polymers. The 
arrangement of chains in crystalline regions is believed to be a combination of folded and extended 
chains. The chain folds, which are seemingly more difficult to form, are necessary to explain observed 
single crystal structures in which the crystal thickness is too small to accommodate the length of the 
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FIGURE 39.2 Arrangement of polymer chains into linear, branched, and network structure depending on the 
functionality of the repeating units. 

chain as determined by electron and X-ray diffraction studies. The classical "fringed-micelle" model in 
which the amorphous and crystalline regions coexist has been modified to include chain folds in the 
crystalline regions. The cross-linked or three-dimensional network polymers such as polyphenolformal- 
dehyde cannot be crystallized at all and they become noncrystalline, amorphous polymers. 

Vinyl polymers have a repeating unit -CH 2 -CHX- where X is some monovalent side group. There 
are three possible arrangements of side groups (X): (1) atactic, (2) isotactic, and (3) syndiotactic. In 
atactic arrangements the side groups are randomly distributed while in syndiotactic and isotactic arrange- 
ments they are either in alternating positions or on one side of the main chain. If side groups are small 
like polyethylene (X=H) and the chains are linear, the polymer crystallizes easily. However, if the side 
groups are large as in polyvinyl chloride (X=C1) and polystyrene (X=C 6 H 5 , benzene ring) and are 
randomly distributed along the chains (atactic), then a noncrystalline structure will be formed. The 
isotactic and syndiotactic polymers usually crystallize even when the side groups are large. 

Copolymerization, in which two or more homopolymers (one type of repeating unit throughout its 
structure) are chemically combined, always disrupts the regularity of polymer chains thus promoting the 
formation of a noncrystalline structure. Possible arrangement of the different copolymerization is shown 
in Fig. 39.3. The addition of plasticizers to prevent crystallization by keeping the chains separated from 
one another will result in more flexible polymers, a noncrystalline version of a polymer which normally 
crystallizes. An example is celluloid which is normally made of crystalline nitrocellulose plasticized with 
camphor. Plasticizers are also used to make rigid noncrystalline polymers like polyvinylchloride (PVC) 
into a more flexible solid (a good example is Tygon® tubing). 

Elastomers, or rubbers, are polymers which exhibit large stretchability at room temperature and can 
snap back to their original dimensions when the load is released. The elastomers are non-crystalline 
polymers which have an intermediate structure consisting of long chain molecules in three-dimensional 
networks (see next section for more details). The chains also have "kinks" or "bends" in them which 
straighten when a load is applied. For example, the chains of cw-polyisoprene (natural rubber) are bent 
at the double bond due to the methyl group interfering with the neighboring hydrogen in the repeating 
unit [— CH 2 — C(CH 3 )=CH— CH 2 — ]. If the methyl group is on the opposite side of the hydrogen then it 
becomes frans-polyisoprene which will crystallize due to the absence of the steric hindrance present in 
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FIGURE 39.3 Possible arrangements of copolymers. 

the cis form. The resulting polymer is a very rigid solid called gutta percha which is not an elastomer. 
Below the glass transition temperature (T ; second order transition temperature between viscous liquid 
and solid) natural rubber loses its compliance and becomes a glass-like material. Therefore, to be flexible, 
all elastomers should have T well below room temperature. What makes the elastomers not behave like 
liquids above T is in fact due to the cross-links between chains which act as pinning points. Without 
cross-links the polymer would deform permanently. An example is latex which behaves as a viscous 
liquid. Latex can be cross-linked with sulfur (vulcanization) by breaking double bonds (C=C) and 
forming C-S-S-C bonds between the chains. The more cross-links are introduced the more rigid the 
structure becomes. If all the chains are cross-linked together, the material will become a three-dimensional 
rigid polymer. 

Effect of Structural Modification on Properties 

The physical properties of polymers can be affected in many ways. In particular, the chemical composition 
and arrangement of chains will have a great effect on the final properties. By such means the polymers 
can be tailored to meet the end use. 

Effect of Molecular Weight and Composition 

The molecular weight and its distribution have a great effect on the properties of a polymer since its 
rigidity is primarily due to the immobilization or entanglement of the chains. This is because the chains 
are arranged like cooked spaghetti strands in a bowl. By increasing the molecular weight the polymer 
chains become longer and less mobile and a more rigid material results as shown in Fig. 39.4. Equally 
important is that all chains should be equal in length since if there are short chains they will act as 
plasticizers. Another obvious way of changing properties is to change the chemical composition of the 
backbone or side chains. Substituting the backbone carbon of a polyethylene with divalent oxygen or 
sulfur will decrease the melting and glass transition temperatures since the chain becomes more flexible 
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TABLE 39.4 Effect of Side 
Chain Substitution on Melting 
Temperature in Polyethylene 
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FIGURE 39.4 Approximate relations among molecular weight, T , T m , and polymer properties. 

due to the increased rotational freedom. On the other hand if the backbone chains can be made more 
rigid then a stiffer polymer will result. 

Effect of Side Chain Substitution, Cross-linking, 
and Branching 

Increasing the size of side groups in linear polymers such as polyeth- 
ylene will decrease the melting temperature due to the lesser perfection 
of molecular packing, i.e., decreased crystallinity. This effect is seen 
until the side group itself becomes large enough to hinder the move- 
ment of the main chain as shown in Table 39.4. Very long side groups 
can be thought of as being branches. 

Cross-linking of the main chains is in effect similar to the side-chain 
substitution with a small molecule, i.e., it lowers the melting temper- 
ature. This is due to the interference of the cross-linking which causes 
decreased mobility of the chains resulting in further retardation of the 
crystallization rate. In fact, a large degree of cross-linking can prevent 
crystallization completely. However, when the cross-linking density 
increases for a rubber, the material becomes harder and the glass tran- 
sition temperature also increases. 

Effect of Temperature on Properties 

Amorphous polymers undergo a substantial change in their properties as a function of temperature. The 
glass transition temperature, T , is a boundary between the glassy region of behavior in which the polymer 
is relatively stiff and the rubbery region in which it is very compliant. T can also be defined as the 
temperature at which the slope of volume change versus temperature has a discontinuity in slope as 
shown in Fig. 39.5. Since polymers are non-crystalline or at most semicrystalline, the value obtained in 
this measurement depends on how fast it is taken. 

39.3 Polymers Used as Biomaterials 

Although hundreds of polymers are easily synthesized and could be used as biomaterials only ten to 
twenty polymers are mainly used in medical device fabrications from disposable to long-term implants 
as given in Table 39.5. In this section, the general information of the characteristics, properties, and 
applications of the most commonly used polymers will be discussed [Billmeyer, 1984; Park, 1984; 
Leininger and Bigg, 1986; Shalaby, 1988; Brandrup and Immergut, 1989; Sharma and Szycher, 1991; Park 
and Lakes, 1992; Dumitriu, 1993; Lee and Lee, 1995; Ratner et al., 1996]. 
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FIGURE 39.5 Change of volume versus temperature of a solid. The glass transition temperature (T g ) depends on 
the rate of cooling and below (T g ) the material behaves as a solid like a window glass. 

TABLE 39.5 Biomedical Application of Polymeric Biomaterials 



Synthetic Polymers 



Applications 



Polyvinylchloride (PVC) 

Polyethylene (PE) 

Polypropylene (PP) 

Polymethylmetacrylate (PMMA) 

Polystyrene (PS) 
Polyethylenterephthalate (PET) 
Polytetrafluoroethylene (PTFE) 
Polyurethane (PU) 
Polyamide (Nylon) 



Blood and solution bag, surgical packaging, IV sets, dialysis devices, 

catheter bottles, connectors, and cannulae. 
Pharmaceutical bottle, nonwoven fabric, catheter, pouch, flexible 

container, and orthopedic implants. 
Disposable syringes, blood oxygenator membrane, suture, 

nonwoven fabric, and artificial vascular grafts. 
Blood pump and reservoirs, membrane for blood dialyzer, 

implantable ocular lens, and bone cement. 
Tissue culture flasks, roller bottles, and filterwares. 
Implantable suture, mesh, artificial vascular grafts, and heart valve. 
Catheter and artificial vascular grafts. 
Film, tubing, and components. 
Packaging film, catheters, sutures, and mold parts. 



Polyvinylchloride (PVC) 

PVC is an amorphous, rigid polymer due to the large side group (CI, chloride) with a Tg of 75~105°C. 
It has a high melt viscosity hence it is difficult to process. To prevent the thermal degradation of the 
polymer (HC1 could be released), thermal stabilizers such as metallic soaps or salts are incorporated. 
Lubricants are formulated on PVC compounds to prevent adhesion to metal surfaces and facilitate the 
melt flow during processing. Plasticizers are used in the range of 10 to 100 parts per 100 parts of PVC 
resin to make it flexible. Di-2-ethylhexylphthalate (DEHP or DOP) is used in medical PVC formulation. 
However, the plasticizers of trioctyltrimellitate (TOTM), polyester, azelate, and phosphate ester are also 
used to prevent extraction by blood, aqueous solution, and hot water during autoclaving sterilization. 

PVC sheets and films are used in blood and solution storage bags and surgical packaging. PVC tubing 
is commonly used in intravenous (IV) administration, dialysis devices, catheters, and cannulae. 

Polyethylene (PE) 

PE is available commercially in five major grades: (1) high density (HDPE), (2) low density (LDPE), 
(3) linear low density (LLDPE), (4) very low density (VLDPE), and (5) ultra high molecular weight 
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(UHMWPE). HDPE is polymerized in a low temperature (60~80°C), and at a low pressure (-10 kg/cm 2 ) 
using metal catalysts. A highly crystalline, linear polymer with a density ranging from 0.94 to 0.965 g/cm 3 
is obtained. LDPE is derived from a high temperature (150~300°C) and pressures (1,000-3,000 kg/cm 2 ) 
using free radical initiators. A highly branched polymer with lower crystallinity and densities ranging 
from 0.915 to 0.935 g/cm 3 is obtained. LLDPE (density: 0.91-0.94 g/cm 3 ) and VLDPE (density: 
0.88-0.89 g/cm 3 ), which are linear polymers, are polymerized under low pressures and temperatures 
using metal catalysts with comonomers such as 1-butene, 1-hexene, or 1-octene to obtain the desired 
physical properties and density ranges. 

HDPE is used in pharmaceutical bottles, nonwoven fabrics, and caps. LDPE is found in flexible 
container applications, nonwoven-disposable and laminated (or coextruded with paper) foil, and poly- 
mers for packaging. LLDPE is frequently employed in pouches and bags due to its excellent puncture 
resistance and VLDPE is used in extruded tubes. UHMWPE (MW > 2 x 10 6 g/mol) has been used for 
orthopedic implant fabrications, especially for load-bearing applications such as an acetabular cup of 
total hip and the tibial plateau and patellar surfaces of knee joints. Biocompatability tests for PE are given 
by ASTM standards in F981, F639, and F755. 

Polypropylene (PP) 

PP can be polymerized by a Ziegler-Natta stereospecific catalyst which controls the isotactic position of 
the methyl group. Thermal (T :-12°C, T m :125~167°C and density: 0.85-0.98 g/cm 3 ) and physical prop- 
erties of PP are similar to PE. The average molecular weight of commercial PP ranges from 2.2-7.0 x 
10 5 g/mol and has a wide molecular weight distribution (polydispersity) which is from 2.6 to 12. Additives 
for PP such as antioxidants, light stabilizer, nucleating agents, lubricants, mold release agents, antiblock, 
and slip agents are formulated to improve the physical properties and processability. PP has an excep- 
tionally high flex life and excellent environment stress-cracking resistance, hence it had been tried for 
finger joint prostheses with an integrally molded hinge design [Park, 1984]. The gas and water vapor 
permeability of PP are in-between those of LDPE and HDPE. PP is used to make disposable hypothermic 
syringes, blood oxygenator membrane, packaging for devices, solutions, and drugs, suture, artificial 
vascular grafts, nonwoven fabrics, etc. 

Polymethylmetacrylate (PMMA) 

Commercial PMMA is an amorphous (Tg:105°C and density:l. 15-1. 195 g/cm 3 ) material with good 
resistance to dilute alkalis and other inorganic solutions. PMMA is best known for its exceptional light 
transparency (92% transmission), high refractive index (1.49), good weathering properties, and as one 
of the most biocompatible polymers. PMMA can be easily machined with conventional tools, molded, 
surface coated, and plasma etched with glow or corona discharge. PMMA is used broadly in medical 
applications such as a blood pump and reservoir, an IV system, membranes for blood dialyzer, and in 
in vitro diagnostics. It is also found in contact lenses and implantable ocular lenses due to excellent optical 
properties, dentures, and maxillofacial prostheses due to good physical and coloring properties, and bone 
cement for joint prostheses fixation (ASTM standard F451). 

Another acrylic polymer such as polymethylacrylate (PMA), polyhydroxyethyl-methacrylate 
(PHEMA), and polyacrylamide (PAAm) are also used in medical applications. PHEMA and PAAm are 
hydrogels, lightly cross-linked by ethyleneglycoldimethylacrylate (EGDM) to increase their mechanical 
strength. The extended wear soft contact lenses are synthesized from PMMA and N-vinylpyrollidone or 
PHEMA which have high water content (above 70%) and a high oxygen permeability. 

Polystyrene (PS) and its Co-Polymers 

PS is polymerized by free radical polymerization and is usually atactic. Three grades are available; 
unmodified general purpose PS (GPPS, Tg:100°C), high impact PS (HIPS), and PS foam. GPPS has good 
transparency, lack of color, ease of fabrication, thermal stability, low specific gravity (1.04-1.12 g/cm 3 ), 
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and relatively high modulus. HIPS contains a rubbery modifier which forms chemical bonding with the 
growing PS chains. Hence the ductility and impact strength are increased and the resistance to environ- 
mental stress-cracking is also improved. PS is mainly processed by injection molding at 180~250°C. To 
improve processability additives such as stabilizers, lubricants, and mold releasing agents are formulated. 
GPPS is commonly used in tissue culture flasks, roller bottles, vacuum canisters, and filterware. 

Acrylonitrile-butadiene-styrene (ABS) copolymers are produced by three monomers; acrylonitrile, 
butadiene, and styrene. The desired physical and chemical properties of ABS polymers with a wide range 
of functional characteristics can be controlled by changing the ratio of these monomers. They are resistant 
to the common inorganic solutions, have good surface properties, and dimensional stability. ABS is used 
for IV sets, clamps, blood dialyzers, diagnostic test kits, and so on. 

Polyesters 

Polyesters such as polyethyleneterephthalate (PET) are frequently found in medical applications due to 
their unique chemical and physical properties. PET is so far the most important of this group of polymers 
in terms of biomedical applications such as artificial vascular graft, sutures, and meshes. It is highly 
crystalline with a high melting temperature (T m : 265°C), hydrophobic and resistant to hydrolysis in dilute 
acids. In addition, PET can be converted by conventional techniques into molded articles such as luer 
filters, check valves, and catheter housings. Polycaprolactone is crystalline and has a low melting tem- 
perature (T m : 64°C). Its use as a soft matrix or coating for conventional polyester fibers was proposed by 
recent investigation [Leininger and Bigg, 1986]. 

Polyamides (Nylons) 

Polyamides are known as nylons and are designated by the number of carbon atoms in the repeating 
units. Nylons can be polymerized by step-reaction (or condensation) and ring-scission polymerization. 
They have excellent fiber-forming ability due to interchain hydrogen bonding and a high degree of 
crystallinity, which increases strength in the fiber direction. 

The presence of -CONH- groups in polyamides attracts the chains strongly toward one another by 
hydrogen bonding. Since the hydrogen bond plays a major role in determining properties, the number 
and distribution of -CONH- groups are important factors. For example, Tg can be decreased by decreas- 
ing the number of -CONH- groups . On the other hand, an increase in the number of -CONH- groups 
improves physical properties such as strength as one can see that Nylon 66 is stronger than Nylon 610 
and Nylon 6 is stronger than Nylon 11. 

In addition to the higher Nylons (610 and 11) there are aromatic polyamides named aramids. One of 
them is poly (p-phenylene terephthalate) commonly known as Kevlar®, made by DuPont. This material 
can be made into fibers. The specific strength of such fibers is five times that of steel, therefore, it is most 
suitable for making composites. 

Nylons are hygroscopic and lose their strength in vivo when implanted. The water molecules serve as 
plasticizers which attack the amorphous region. Proteolytic enzymes also aid in hydrolyzing by attacking 
the amide group. This is probably due to the fact that the proteins also contain the amide group along 
their molecular chains which the proteolytic enzymes could attack. 

Fluorocarbon Polymers 

The best known fluorocarbon polymer is polytetrafluoroethylene (PTFE), commonly known as Teflon® 
(DuPont). Other polymers containing fluorine are polytrifluorochloroethylene (PTFCE), polyvinylfluo- 
ride (PVF), and fluorinated ethylene propylene (FEP). Only PTFE will be discussed here since the others 
have rather inferior chemical and physical properties and are rarely used for implant fabrication. 

PTFE is made from tetrafluoroethylene under pressure with a peroxide catalyst in the presence of 
excess water for removal of heat. The polymer is highly crystalline (over 94% crystallinity) with an average 
molecular weight of 0.5-5 x 10 6 g/mol. This polymer has a very high density (2.15-2.2 g/cm 3 ), low 
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modulus of elasticity (0.5 GPa) and tensile strength (14 MPa). It also has a very low surface tension 
(18.5 erg/cm 2 ) and friction coefficient (0.1). 

Standard specifications for the implantable PTFE are given by ASTM F754. PTFE also has an unusual 
property of being able to expand on a microscopic scale into a microporous material which is an excellent 
thermal insulator. PTFE cannot be injection molded or melt extruded because of its very high melt 
viscosity and it cannot be plasticized. Usually the powders are sintered to above 327°C under pressure 
to produce implants. 

Rubbers 

Silicone, natural, and synthetic rubbers have been used for the fabrication of implants. Natural rubber 
is made mostly from the latex of the Hevea brasiliensis tree and the chemical formula is the same as that 
of cis- 1,4 polyisoprene. Natural rubber was found to be compatible with blood in its pure form. Also, 
cross-linking by X-ray and organic peroxides produces rubber with superior blood compatibility com- 
pared with rubbers made by the conventional sulfur vulcanization. 

Synthetic rubbers were developed to substitute for natural rubber. The Ziegler-Natta types of ste- 
reospecific polymerization techniques have made this variety possible. The synthetic rubbers have rarely 
been used to make implants. The physical properties vary widely due to the wide variations in preparation 
recipes of these rubbers. 

Silicone rubber, developed by Dow Corning company, is one of the few polymers developed for medical 
use. The repeating unit is dimethyl siloxane which is polymerized by a condensation polymerization. 
Low molecular weight polymers have low viscosity and can be cross-linked to make a higher molecular 
weight, rubber-like material. Medical grade silicone rubbers contain stannous octate as a catalyst and 
can be mixed with a base polymer at the time of implant fabrication. 

Polyurethanes 

Polyurethanes are usually thermosetting polymers: they are widely used to coat implants. Polyurethane 
rubbers are produced by reacting a prepared prepolymer chain with an aromatic di-isocyanate to make 
very long chains possessing active isocyanate groups for cross-linking. The polyurethane rubber is quite 
strong and has good resistance to oil and chemicals. 

Polyacetal, Polysulfone, and Polycarbonate 

These polymers have excellent mechanical, thermal, and chemical properties due to their stiffened main 
backbone chains. Polyacetals and polysulfones are being tested as implant materials, while polycarbonates 
have found their applications in the heart/lung assist devices, food packaging, etc. 

Polyacetals are produced by reacting formaldehyde. These are also sometimes called polyoxymethylene 
(POM) and known widely as Delrin® (DuPont). These polymers have a reasonably high molecular weight 
(>2 x 10 4 g/mol) and have excellent mechanical properties. More importantly, they display an excellent 
resistance to most chemicals and to water over wide temperature ranges. 

Polysulfones were developed by Union Carbide in the 1960s. These polymers have a high thermal 
stability due to the bulky side groups (therefore, they are amorphous) and rigid main backbone chains. 
They are also highly stable to most chemicals but are not so stable in the presence of polar organic 
solvents such as ketones and chlorinated hydrocarbons. 

Polycarbonates are tough, amorphous, and transparent polymers made by reacting bisphenol A and 
diphenyl carbonate. It is noted for its excellent mechanical and thermal properties (high T :150°C), 
hydrophobicity, and antioxidative properties. 

Biodegradable Polymers 

Recently, several biodegradable polymers such as polylactide (PLA), polyglycolide (PGA), poly(glycolide-co- 
lactide) (PLGA), poly(dioxanone), poly(trimethylene carbonate), poly( carbonate), and so on are extensively 
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used or tested on a wide range of medical applications due to their good biocompatibility, controllable 
biodegradability, and relatively good processability [Khang et al., 1997]. PLA, PGA, and PLGA are biore- 
sorbable polyesters belonging to the group of poly a-hydroxy acids. These polymers degrade by nonspe- 
cific hydrolytic scission of their ester bonds. The hydrolysis of PLA yields lactic acid which is a normal 
byproduct of anaerobic metabolism in the human body and is incorporated in the tricarboxylic acid 
(TCA) cycle to be finally excreted by the body as carbon dioxide and water. PGA biodegrades by a 
combination of hydrolytic scission and enzymatic (esterase) action producing glycolic acid which can 
either enter the TCA cycle or is excreted in urine and can be eliminated as carbon dioxide and water. 
The degradation time of PLGA can be controlled from weeks to over a year by varying the ratio of 
monomers and the processing conditions. It might be a suitable biomaterial for use in tissue engineered 
repair systems in which cells are implanted within PLGA films or scaffolds and in drug delivery systems 
in which drugs are loaded within PLGA microspheres. PGA (T m : 225-230°C, Tg: 35-40°C) can be melt 
spun into fibers which can be converted into bioresorbable sutures, meshes, and surgical products. PLA 
(T m : 173-178 °C, Tg: 60-65°C) exhibit high tensile strength and low elongation resulting in a high 
modulus suitable for load-bearing applications such as in bone fracture fixation. Poly-p-dioxanone (T m : 
107-1 12°C, Tg: ~10°C) is a bioabsorbable polymer which can be fabricated into flexible monofilament 
surgical sutures. 

39.4 Sterilization 

Sterilizability of biomedical polymers is an important aspect of the properties because polymers have 
lower thermal and chemical stability than other materials such as ceramics and metals, consequently, 
they are also more difficult to sterilize using conventional techniques. Commonly used sterilization 
techniques are dry heat, autoclaving, radiation, and ethylene oxide gas [Block,1977]. 

In dry heat sterilization, the temperature varies between 160 and 190°C. This is above the melting and 
softening temperatures of many linear polymers like polyethylene and PMMA. In the case of polyamide 
(Nylon), oxidation will occur at the dry sterilization temperature although this is below its melting 
temperature. The only polymers which can safely be dry sterilized are PTFE and silicone rubber. 

Steam sterilization (autoclaving) is performed under high steam pressure at relatively low temperature 
(125-130°C). However, if the polymer is subjected to attack by water vapor, this method cannot be 
employed. PVC, polyacetals, PE (low-density variety), and polyamides belong to this category. 

Chemical agents such as ethylene and propylene oxide gases [Glaser,1979], and phenolic and hypochlo- 
ride solutions are widely used for sterilizing polymers since they can be used at low temperatures. 
Chemical agents sometimes cause polymer deterioration even when sterilization takes place at room 
temperature. However, the time of exposure is relatively short (overnight), and most polymeric implants 
can be sterilized with this method. 

Radiation sterilization [Sato, 1983] using the isotopic 60 Co can also deteriorate polymers since at high 
dosage the polymer chains can be dissociated or cross-linked according to the characteristics of the 
chemical structures, as shown in Table 39.6. In the case of PE, at high dosage (above 10 6 Gy) it becomes 
a brittle and hard material. This is due to a combination of random chain scission cross-linking. PP 
articles will often discolor during irradiation giving the product an undesirable color tint but the more 
severe problem is the embrittlement resulting in flange breakage, luer cracking, and tip breakage. The 
physical properties continue to deteriorate with time, following irradiation. These problems of coloration 
and changing physical properties are best resolved by avoiding the use of any additives which discolor at 
the sterilizing dose of radiation [Khang, 1996c]. 

39.5 Surface Modifications for Improving Biocompatability 

Prevention of thrombus formation is important in clinical applications where blood is in contact such 
as hemodialysis membranes and tubes, artificial heart and heart-lung machines, prosthetic valves, and 
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TABLE 39.6 Effect of Gamma Irradiation on Polymers 
Which Could be Cross-Linked or Degraded 

Cross-Linking Polymers Degradable Polymers 

Polyethylene Polyisobutylene 

Polypropylene Poly-a-methylstyrene 

Polystyrene Polymethylmetacrylate 

Polyarylates Polymethacrylamide 

Polyacrylamide Polyvinylidenechloride 

Polyvinylchloride Cellulose and derivatives 

Polyamides Polytetrafluoroethylene 

Polyesters Polytrifluorochloroethylene 

Polyvinylpyrrolidone 

Polymethacrylamide 

Rubbers 

Polysiloxanes 

Polyvinylalcohol 

Polyacroleine 



artificial vascular grafts. In spite of the use of anticoagulants, considerable platelet deposition and 
thrombus formation take place on the artificial surfaces [Branger, 1990]. 

Heparin, one of the complex carbohydrates known as mucopolysaccharides or glycosaminoglycan is 
currently used to prevent formation of clots. In general, heparin is well tolerated and devoid of serious 
consequences. However, it allows platelet adhesion to foreign surfaces and may cause hemorrhagic 
complications such as subdural hematoma, retroperitoneal hematoma, gastrointestinal bleeding, hemor- 
rage into joints, ocular and retinal bleeding, and bleeding at surgical sites [Lazarus, 1980]. These diffi- 
culties give rise to an interest in developing new methods of hemocompatible materials. 

Many different groups have studied immobilization of heparin [Kim and Feijen, 1985; Park et al., 
1988] on the polymeric surfaces, heparin analogues and heparin-prostaglandin or heparin-fibrinolytic 
enzyme conjugates [Jozefowicz and Jozefowicz, 1985]. The major drawback of these surfaces is that they 
are not stable in the blood environment. It has not been firmly established that a slow leakage of heparin 
is needed for it to be effective as an immobilized antithrombogenic agent, if not its effectiveness could 
be hindered by being "coated over" with an adsorbed layer of more common proteins such as albumin 
and fibrinogen. Fibrinolytic enzymes, urokinase, and various prostaglandins have also been immobilized 
by themselves in order to take advantage of their unique fibrin dissolution or antiplatelet aggregation 
actions [Ohshiro, 1983]. 

Albumin-coated surfaces have been studied because surfaces that resisted platelet adhesion in vitro 
were noted to adsorb albumin preferentially [Keogh et at., 1992]. Fibronectin coatings have been used 
in in vitro endothelial cell seeding to prepare a surface similar to the natural blood vessel lumen [Lee 
et al., 1989]. Also, algin-coated surfaces have been studied due to their good biocompatibility and 
biodegradability [Lee et al, 1990b; Lee et al., 1997b]. 

Recently, plasma gas discharge [Khang et al., 1997a] and corona treatment [Khang et al., 1996d] with 
reactive groups introduced on the polymeric surfaces have emerged as other ways to modify biomaterial 
surfaces [Lee et al., 1991; Lee et al, 1992], 

Hydrophobic coatings composed of silicon- and fluorine-containing polymeric materials as well as 
polyurethanes have been studied because of the relatively good clinical performances of Silastic®, Teflon®, 
and polyurethane polymers in cardiovascular implants and devices. Polymeric fluorocarbon coatings 
deposited from a tetrafluoroethylene gas discharge have been found to greatly enhance resistance to both 
acute thrombotic occlusion and embolization in small diameter Dacron® grafts. 

Hydrophilic coatings have also been popular because of their low interfacial tension in biological 
environments [Hoffman, 1981]. Hydrogels as well as various combinations of hydrophilic and hydro- 
phobic monomers have been studied on the premise that there will be an optimum polar-dispersion 
force ratio which could be matched on the surfaces of the most passivating proteins. The passive surface 
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TABLE 39.7 Physical and Chemical Surface Modification Methods for Polymeric Biomaterials 

To modify blood compatibility Octadecyl group attachment to surface 

Silicon containing block copolymer additive 

Plasma fluoropolymer deposition 

Plasma siloxane polymer deposition 

Radiation-grafted hydrogels 

Chemically modified polystyrene for heparin-like activity 
To influence cell adhesion and growth Oxidized polystyrene surface 

Ammonia plasma-treated surface 

Plasma-deposited acetone or methanol film 

Plasma fluoropolymer deposition 
To control protein adsorption Surface with immobilized polyethyelenglycol 

Treated ELISA dish surface 

Affinity chromatography particulates 

Surface cross-linked contact lens 
To improve lubricity Plasma treatment 

Radiation-grafted hydrogels 

Interpenetrating polymeric networks 
To improve wear resistance and Ion implantation 

corrosion resistance Diamond deposition 

Anodization 
To alter transport properties Plasma deposition (methane, fluoropolymer, siloxane) 

To modify electrical characteristics Plasma deposition 

Solvent coatings 

Parylene coatings 

Source: Ratneretal., [1996], pl06. 

may induce less clot formation. Polyethylene oxide coated surfaces have been found to resist protein 
adsorption and cell adhesion and have therefore been proposed as potential "blood compatible" coatings 
[Lee et al., 1990a]. General physical and chemical methods to modify the surfaces of polymeric bioma- 
terials are listed in Table 39.7 [Ratner et al., 1996]. 

Another way of making antithrombogenic surfaces is the saline perfusion method, which is designed 
to prevent direct contacts between blood and the surface of biomaterials by means of perfusing saline 
solution through the porous wall which is in contact with blood [Park and Kim, 1993; Khang et al., 
1996a; Khang et al., 1996b]. It has been demonstrated that the adhesion of the blood cells could be 
prevented by the saline perfusion through PE, alumina, sulfonated/nonsulfonated PS/SBR, ePTFE 
(expanded polytetrafluoroethylene), and polysulfone porous tubes. 

39.6 Chemogradient Surfaces for Cell 
and Protein Interaction 

The behavior of the adsorption and desorption of blood proteins or adhesion and proliferation of different 
types of mammalian cells on polymeric materials depends on the surface characteristics such as wetta- 
bility, hydrophilicity/hydrophobicity ratio, bulk chemistry, surface charge and charge distribution, surface 
roughness, and rigidity. 

Many research groups have studied the effect of the surface wettability on the interactions of biological 
species with polymeric materials. Some have studied the interactions of different types of cultured cells 
or blood proteins with various polymers with different wettabilities to correlate the surface wettability 
and blood- or tissue-compatibility [Baier et al., 1984]. One problem encountered from the study using 
different kinds of polymers is that the surfaces are heterogeneous, both chemically and physically (dif- 
ferent surface chemistry, roughness, rigidity, crystallinity, etc.), which caused widely varying results. Some 
others have studied the interactions of different types of cells or proteins with a range of methacrylate 
copolymers with different wettabilities and have the same kind of chemistry but are still physically 
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FIGURE 39.6 Schematic diagram showing corona discharge apparatus for the preparation of wettability chemo- 
gradient surfaces. 

heterogeneous [van Wachem et al., 1987]. Another methodological problem is that such studies are often 
tedious, laborious, and time-consuming because a large number of samples must be prepared to char- 
acterize the complete range of the desired surface properties. 

Many studies have been focused on the preparation of surfaces whose properties are changed gradually 
along the material length. Such chemogradient surfaces are of particular interest in basic studies of the 
interactions between biological species and synthetic materials surfaces since the affect of a selected 
property can be examined in a single experiment on one surface preparation. A chemogradient of methyl 
groups was formed by diffusion of dimethyldichlorosilane through xylene on flat hydrophilic sili- 
conedioxide surfaces [Elwing et al., 1989]. The wettability chemogradient surfaces were made to inves- 
tigate hydrophilicity-induced changes of adsorbed proteins. 

Recently, a method for preparing wettability chemogradients on various polymer surfaces was devel- 
oped [Lee et al., 1989 and 1990; Khang et al., 1997b]. The wettability chemogradients were produced via 
radio frequency (RF) and plasma discharge treatment by exposing the polymer sheets continuously to 
the plasma [Lee et al., 1991]. The polymer surfaces oxidized gradually along the sample length with 
increasing plasma exposure time and thus the wettability chemogradient was created. Another method 
for preparing a wettability chemogradient on polymer surfaces using corona discharge treatment has 
been developed as shown in Fig. 39.6 [Lee et al., 1992]. The wettability chemogradient was produced by 
treating the polymer sheets with corona from a knife-type electrode whose power was gradually changed 
along the sample length. The polymer surface gradually oxidized with the increasing power and the 
wettability chemogradient was created. Chemogradient surfaces with different functional gruops such as 
-COOH, -CH 2 OH, -CONH 2 and -CH 2 NH 2 were produced on PE surfaces by the above corona treatment 
followed by vinyl monomer grafting and substitution reactions [Kim et al., 1993; Lee et al., 1994a; Lee 
et al., 1994b]. We have also prepared chargeable functional groups [Lee et al., 1997c; Lee et al., 1997d; Lee 
et al., 1998a], comb-like polyethyleneoxide (PEO) [Jeong et al., 1996; Lee et al., 1997a] and phospholipid 
polymer chemogradient surfaces [Iwasaki et al., 1997] by the corona discharge treatment, followed by 
the graft copolymerization with subsequent substitution reaction of functional vinyl monomers as acrylic 
acid, sodium p-sulfonic styrene and N, N-dimethyl aminopropyl acrylamide, poly(ethyleneglycol) mono- 
methacrylate, and ro-methacryloyloxyalkyl phosphorylcholine (MAPC), respectively. 

The water contact angles of the corona-treated PE surfaces gradually decrease along the sample length 
with increasing corona power (from about 95° to about 45°) as shown in Fig. 39.7. The decrease in contact 
angles, i.e., the increase in wettability along the sample length was due to the oxygen-based polar 
functionalities incorporated on the surface by the corona treatment. It was also confirmed also by fourier- 
transform infrared spectroscopy in the attenuated total reflectance mode and electron spectroscopy for 
chemical analysis (ESCA). 
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FIGURE 39.7 Changes in water contact angle of corona-treated PE surface along the sample length. Sample num- 
bers, n = 3. 

In order to investigate the interaction of different types of cells in terms of the surface hydrophilic- 
ity/hydrophobicity of polymeric materials, Chinese hamster ovaries (CHO), fibroblasts, and bovine aortic 
endothelial cells (EC) were cultured for 1 and 2 days on the PE wettability chemogradient surfaces. The 
maximum adhesion and growth of the cells appeared around a water contact angle of 50-55° as shown 
in Fig. 39.8. The observation of scanning electron microscopy (SEM) also verified that the cells are more 
adhered, spread, and grown onto the sections with moderate hydrophilicity as shown in Fig. 39.9. 
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FIGURE 39.8 CHO, fibroblast, and endothelial cell growth on wettability chemogradient PE surfaces after 2 days 
culture (number of seeded cells, 4 x lOVcm 2 ). n = 3. 
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FIGURE 39.9 SEM microphotographs of CHO, fibroblast, and endothelial cells grown on PE wettability chemo- 
gradient surface along the sample length after 2 days culture (original magnification; x400). 

To determine the cell proliferation rates, the migration of fibroblasts on PE wettability chemogradient 
surfaces were observed [Khang et al., 1998b]. After the change of culture media at 24 h, cell growth 
morphology was recorded for 1 or 2 hr intervals at the position of 0.5, 1.5, 2.5, and 4.5 cm for the 
counting of grown cells and the observation of cell morphology with a video tape recorder. The prolif- 
eration rates of fibroblast cells were calculated from the slopes of Fig. 39.10 as given in Table 39.8. The 
proliferation rates on the PE surfaces with wettability chemogradient showed that as the surface wetta- 
bility increased, it increased and then decreased. The maximum proliferation rate of the cells as 
1,111 cells/hem 2 appeared at around the position 2.5 cm. 

To observe the effect of serum proteins on the cell adhesion and growth behaviors, fetal bovine serum 
(FBS), which contains more than 200 kinds of different proteins, was adsorbed onto the wettability 
gradient PE surfaces for 1 h at 37°C. Figure 39.11 shows the relative adsorbed amount of serum proteins 
on the wettability gradient surfaces determined by ESCA. The maximum adsorption of the proteins 
appeared at around the 2.5 cm position, which is the same trend as the cell adhesion, growth, and 
migration behaviors. It can be explained that preferential adsorption of some serum proteins, like 
fibronectin and vitronectin from culture medium, onto the moderately wettable surfaces may be a reason 
for better cell adhesion, spreading, and growth. Proteins like fibronectin and vitronectin are well known 
as cell-adhesive proteins. Cells attached on surfaces are spread only when they are compatible on the 
surfaces. It seems that surface wettability plays an important role for cell adhesion, spreading, and 
migration. 

Also investigated were: (1) platelet adhesion on wettability chemogradient [Lee and Lee, 1998b], (2) cell 
interaction on microgrooved PE surfaces (groove depth, 0.5 um; groove width, 0.45 um; and pitch, 
0.9 urn) with wettability chemogradient [Khang et al., 1997c], (3) detachment of human endothelial 
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TABLE 39.8. Proliferation Rates of Fibroblast Cells on Wettability 
Gradient PE Surfaces 

Positions (cm) Contact Angle (°) Cell Proliferation Rate (#cell/h cm 2 ' 

1,111 
924 

838 
734 



2.5 


55 


4.5 


45 


1.5 


67 


0.5 


85 



Note: 24 to 60 h culture. 
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FIGURE39.il Serum protein adsorption on PE wettability chemogradient surface (1 h adsorption), n = 3. 
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under flow from wettability gradient surface with different functional groups [Ruardy et al., 1997] , (4) cell 
interaction on microporous polycarbonate membrane with wettability chemogradient [Lee et al., 1998c] , 
and (5) cell interaction on poly(lactide-co-glycolide) surface with wettability chemogradient [Khang et al., 
1998a]. 

During the last several years, "chemogradient surfaces" have evolved into easier and more popular 
tools for the study of protein adsorption and platelet or cell interactions continuously which relate to 
the surface properties such as wettability, chemistry and charge, or dynamics of polymeric materials. In 
many studies, different kinds of polymeric materials with widely varying surface chemistries are used 
and the explanation of the results is often in controversy due to the surface heterogeneity. In addition, 
these studies are tedious, laborious, and time-consuming, and biological variations are more likely to 
occur. The application of chemogradient surfaces for these studies can reduce these discomforts and 
problems, and eventually save time and money. Also, chemogradient surfaces are valuable in investigating 
the basic mechanisms by which complicated systems such as proteins or cells interact with surfaces, since 
a continuum of selected and controlled physical-chemical properties can be studied in one experiment 
on the polymeric surface. 

The possible applications of chemogradient surfaces in the near future are: (1) separation devices of 
cells and/or biological species by different surface properties, (2) column packing materials for separation, 
(3) biosensoring, etc. 
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Defining Terms 

Acetabulum: The socket portion of the hip joint. 

Addition (or free radical) polymerization: Polymerization in which monomers are added to the grow- 
ing chains, initiated by free radical agents. 

Biocompatibility: Acceptance of an artificial implant by the surrounding tissues and as a whole. The 
implant should be compatible with tissues in terms of mechanical, chemical, surface, and phar- 
macological properties. 

Biomaterials: Synthetic materials used to replace part of a living system or to function in intimate 
contact with living tissue. 

Bone cement: Mixture of polymethylmethacrylate powder and methylmethacrylate monomer liquid 
to be used as a grouting material for the fixation of orthopedic joint implants. 

Branching: Chains grown from the sides of the main backbone chains. 

Chemogradient surface: The surface whose properties such as wettability, surface charge, and hydro- 
philicity/hydrophobicity ratio are changed gradually along the material length. 

Condensation (step reaction) polymerization: Polymerization in which two or more chemicals are 
reacted to form a polymer by condensing out small molecules such as water and alcohol. 

Copolymers: Polymers made from two or more monomers which can be obtained by grafting, block, 
alternating, or random attachment of the other polymer segment. 

Covalent bonding: Bonding of atoms or molecules by sharing valence electrons. 

Dacron®: Polyethyleneterephthalate polyester that is made into fiber. If the same polymer is made into 
a film, it is called Mylar®. 

Delrin®: Polyacetal made by Union Carbide. 

Elastomers: Rubbery materials. The restoring force comes from uncoiling or unkinking of coiled or 
kinked molecular chains. They can be highly stretched. 

Embolus: Any foreign matter, as a blood clot or air bubble, carried in the blood stream. 
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Fibrinogen: A plasma protein of high molecular weight that is converted to fibrin through the action 

of thrombin. This material is used to make (absorbable) tissue adhesives. 
Filler: Materials added as a powder to a rubber to improve its mechanical properties. 
Free volume: The difference in volume occupied by the crystalline state (minimum) and non-crystalline 

state of a material for a given temperature and a pressure. 
Glass transition temperature: Temperature at which solidification without crystallization takes place 

from viscous liquid. 
Grafts: A transplant. 
Heparin: A substance found in various body tissues, especially in the liver, that prevents the clotting 

of blood. 
Hydrogel: Polymer which can absorb 30% or more of its weight in water. 
Hydrogen bonding: A secondary bonding through dipole interactions in which the hydrogen ion is 

one of the dipoles. 
Hydroquinone: Chemical inhibitor added to the bone cement liquid monomer to prevent accidental 

polymerization during storage. 
Initiator: Chemical used to initiate the addition polymerization by becoming a free radical which in 

turn reacts with a monomer. 
Ionic bonding: Bonding of atoms or molecules through electrostatic interaction of positive and negative 

ions. 
Kevlar®: Aromatic polyamides made by DuPont. 
Lexan®: Polycarbonate made by General Electric. 
Oxygenator: An apparatus by which oxygen is introduced into blood during circulation outside the 

body, as during open-heart surgery. 
Plasticizer: Substance made of small molecules, mixed with (amorphous) polymers to make the chains 

slide more easily past each other, making the polymer less rigid. 
Refractive index: Ratio of speed of light in vacuum to speed of light in a material. It is a measure of 

the ability of a material to refract (bend) a beam of light. 
Repeating unit: Basic molecular unit which can represent a polymer backbone chain. The average 

number of repeating units is called the degree of polymerization. 
Repeating unit: The smallest unit representing a polymer molecular chain. 
Semi-crystalline solid: Solid which contains both crystalline and noncrystalline regions and usually 

occurs in polymers due to their long chain molecules. 
Side group: Chemical group attached to the main backbone chain. It is usually shorter than the branches 

and exists before polymerization. 
Steric hindrance: Geometrical interference which restrains movements of molecular groups such as 

side chains and main chains of a polymer. 
Suture: Material used in closing a wound with stitches. 
Tacticity: Arrangement of asymmetrical side groups along the backbone chain of polymers. Groups 

could be distributed at random (atactic), one side (isotactic), or alternating (syndiotactic). 
Teflon®: Polytetrafluoroethylene made by DuPont. 
Thrombus: The fibrinous clot attached at the site of thrombosis. 
Udel®: Polysulfone made by General Electric. 
Valence electrons: The outermost (shell) electrons of an atom. 

van der Waals bonding: A secondary bonding arising through the fluctuating dipole-dipole interactions. 
Vinyl polymers: Thermoplastic linear polymers synthesized by free radical polymerization of vinyl 

monomers having a common structure of CH 2 =CHR. 
Vulcanization: Cross-linking of a (natural) rubber by adding sulfur. 

Ziegler-Natta catalyst: Organometallic compounds which have the remarkable capacity of polymeriz- 
ing a wide variety of monomers to linear and stereoregular polymers. 
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Composite materials are solids which contain two or more distinct constituent materials or phases, on 
a scale larger than the atomic. The term "composite" is usually reserved for those materials in which the 
distinct phases are separated on a scale larger than the atomic, and in which properties such as the elastic 
modulus are significantly altered in comparison with those of a homogeneous material. Accordingly, 
reinforced plastics such as fiberglass as well as natural materials such as bone are viewed as composite 
materials, but alloys such as brass are not. A foam is a composite in which one phase is empty space. 
Natural biological materials tend to be composites. Natural composites include bone, wood, dentin, 
cartilage, and skin. Natural foams include lung, cancellous bone, and wood. Natural composites often 
exhibit hierarchical structures in which particulate, porous, and fibrous structural features are seen on 
different micro-scales [Katz, 1980; Lakes, 1993]. In this segment, composite material fundamentals and 
applications in biomaterials [Park and Lakes, 1988] are explored. Composite materials offer a variety of 
advantages in comparison with homogeneous materials. These include the ability for the scientist or 
engineer to exercise considerable control over material properties. There is the potential for stiff, strong, 
lightweight materials as well as for highly resilient and compliant materials. In biomaterials, it is important 
that each constituent of the composite be biocompatible. Moreover, the interface between constituents 
should not be degraded by the body environment. Some applications of composites in biomaterial 
applications are: (1) dental filling composites, (2) reinforced methyl methacrylate bone cement and ultra- 
high molecular weight polyethylene, and (3) orthopedic implants with porous surfaces. 

40.1 Structure 

The properties of composite materials depend very much upon structure. Composites differ from 
homogeneous materials in that considerable control can be exerted over the larger scale structure, and 
hence over the desired properties. In particular, the properties of a composite material depend upon the 
shape of the heterogeneities, upon the volume fraction occupied by them, and upon the interface among 
the constituents. The shape of the heterogeneities in a composite material is classified as follows. The 
principal inclusion shape categories are (1) the particle, with no long dimension, (2) the fiber, with one 
long dimension, and (3) the platelet or lamina, with two long dimensions, as shown in Fig. 40.1. The 
inclusions may vary in size and shape within a category. For example, particulate inclusions may be 
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FIGURE 40.1 Morphology of basic composite inclusions, (a) particle, (b) fiber, (c) platelet. 

spherical, ellipsoidal, polyhedral, or irregular. If one phase consists of voids, filled with air or liquid, the 
material is known as a cellular solid. If the cells are polygonal, the material is a honeycomb; if the cells 
are polyhedral, it is a foam. It is necessary in the context of biomaterials to distinguish the above structural 
cells from biological cells, which occur only in living organisms. In each composite structure, we may 
moreover make the distinction between random orientation and preferred orientation. 

40.2 Bounds on Properties 

Mechanical properties in many composite materials depend on structure in a complex way, however for 
some structures, the prediction of properties is relatively simple. The simplest composite structures are 
the idealized Voigt and Reuss models, shown in Fig. 40.2. The dark and light areas in these diagrams 
represent the two constituent materials in the composite. In contrast to most composite structures, it is 
easy to calculate the stiffness of materials with the Voigt and Reuss structures, since in the Voigt structure 
the strain is the same in both constituents; in the Reuss structure the stress is the same. The Young's 
modulus, E, of the Voigt composite is: 



E = E.V.+E 1-V. 



(40.1) 



in which E i is the Young's modulus of the inclusions, and Vj is the volume fraction of inclusions, and E m 
is the Young's modulus of the matrix. The Voigt relation for the stiffness is referred to as the rule of 
mixtures. 






FIGURE 40.2 Voigt (a, laminar; b, fibrous) and Reuss (c) composite models, subjected to tension force indicated 
by arrows. 
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FIGURE 40.3 Stiffness vs. volume fraction for Voigt and Reuss models, as well as for dilute isotropic suspensions 
of platelets, fibers, and spherical particles embedded in a matrix. Phase moduli are 200 GPa and 3 GPa. 

The Reuss stiffness E, 



L+ \ — / 



(40.2) 



is less than that of the Voigt model. The Voigt and Reuss models provide upper and lower bounds 
respectively upon the stiffness of a composite of arbitrary phase geometry [Paul, I960]. The bounds are 
far apart if, as is commonplace, the phase moduli differ a great deal, as shown in Fig. 40.3. For composite 
materials which are isotropic, the more complex relations of Hashin and Shtrikman (1963) provide tighter 
bounds upon the moduli (Fig. 40.3); both the Young's and shear moduli must be known for each 
constituent to calculate these bounds. 

40.3 Anisotropy of Composites 

Observe that the Reuss laminate is identical to the Voigt laminate, except for a rotation with respect to 
the direction of load. Therefore, the stiffness of the laminate is anisotropic, that is, dependent on direction 
[Agarwal and Broutman, 1980; Nye, 1976; Lekhnitskii, 1963]. Anisotropy is characteristic of composite 
materials. The relationship between stress a, -and strain e H in anisotropic materials is given by the tensorial 
form of Hooke's law as follows: 



/ , / , C .jki £ ki 



(40.3) 



Here C™ is the elastic modulus tensor. It has 3 4 = 81 elements, however since the stress and strain are 
represented by symmetric matrices with six independent elements each, the number of independent 
modulus tensor elements is reduced to 36. An additional reduction to 21 is achieved by considering 
elastic materials for which a strain energy function exists. Physically, C 2323 represents a shear modulus 
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since it couples a shear stress with a shear strain. C lm couples axial stress and strain in the 1 or x direction, 
but it is not the same as Young's modulus. The reason is that Young's modulus is measured with the 
lateral strains free to occur via the Poisson effect, while C ml is the ratio of axial stress to strain when 
there is only one non-zero strain value; there is no lateral strain. A modulus tensor with 21 independent 
elements describes a tridinic crystal, which is the least symmetric crystal form. The unit cell has three 
different oblique angles and three different side lengths. A triclinic composite could be made with groups 
of fibers of three different spacings, oriented in three different oblique directions. Triclinic modulus 
elements such as C 2311 , known as cross-coupling constants, have the effect of producing a shear stress in 
response to a uniaxial strain; this is undesirable in many applications. An orthorhombic crystal or an 
orthotropic composite has a unit cell with orthogonal angles. There are nine independent elastic moduli. 
The associated engineering constants are three Young's moduli, three Poisson's ratios, and three shear 
moduli; the cross-coupling constants are zero when stresses are aligned to the symmetry directions. An 
example of such a composite is a unidirectional fibrous material with a rectangular pattern of fibers in 
the cross-section. Bovine bone, which has a laminated structure, exhibits orthotropic symmetry, as does 
wood. In a material with hexagonal symmetry, out of the nine C elements, there are five independent 
elastic constants. For directions in the transverse plane the elastic constants are the same, hence the 
alternate name transverse isotropy. A unidirectional fiber composite with a hexagonal or random fiber 
pattern has this symmetry, as does human Haversian bone. In cubic symmetry, there are three independent 
elastic constants, a Young's modulus, E, a shear modulus, G, and an independent Poisson's ratio, v. Cross- 
weave fabrics have cubic symmetry. Finally, an isotropic material has the same material properties in any 
direction. There are only two independent elastic constants, hence E, G, v, and also the bulk modulus B 
are related in an isotropic material. Isotropic materials include amorphous solids, polycrystalline metals 
in which the grains are randomly oriented, and composite materials in which the constituents are 
randomly oriented. 

Anisotropic composites offer superior strength and stiffness in comparison with isotropic ones. Mate- 
rial properties in one direction are gained at the expense of properties in other directions. It is sensible, 
therefore, to use anisotropic composite materials only if the direction of application of the stress is known 
in advance. 



40.4 Particulate Composites 



It is often convenient to stiffen or harden a material, commonly a polymer, by the incorporation of 
particulate inclusions. The shape of the particles is important [see, e.g., Christensen, 1979]. In isotropic 
systems, stiff platelet (or flake) inclusions are the most effective in creating a stiff composite, followed 
by fibers; and the least effective geometry for stiff inclusions is the spherical particle, as shown in Fig. 40.3. 
A dilute concentration of spherical particulate inclusions of stiffness E ; and volume fraction V y in a matrix 
(with Poisson's ratio assumed to be 0.5) denoted by the subscript m, gives rise to a composite with a 
stiffness E: 

5(E. -E V. 

E = -^ m ^ + E m (40.4) 

E. 

3 + 2 — l 

E m 

The stiffness of such a composite is close to the Hashin-Shtrikman lower bound for isotropic composites. 
Even if the spherical particles are perfectly rigid compared with the matrix, their stiffening effect at low 
concentrations is modest. Conversely, when the inclusions are more compliant than the matrix, spherical 
ones reduce the stiffness the least and platelet ones reduce it the most. Indeed, soft platelets are suggestive 
of crack-like defects. Soft platelets, therefore result not only in a compliant composite, but also a weak 
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FIGURE 40.4 Microstructure of a dental composite. Miradapt® ( Johnson & Johnson) 50% by volume filler: Barium 
glass and colloidal silica [Park and Lakes, 1992]. 

one. Soft spherical inclusions are used intentionally as crack stoppers to enhance the toughness of 
polymers such as polystyrene [high impact polystyrene] , with a small sacrifice in stiffness. 

Particle reinforcement has been used to improve the properties of bone cement. For example, inclusion 
of bone particles in PMMA cement somewhat improves the stiffness and improves the fatigue life 
considerably [Park et al., 1986]. Moreover, the bone particles at the interface with the patient's bone are 
ultimately resorbed and are replaced by ingrown new bone tissue. This approach is in the experimental 
stages. 

Rubber used in catheters, rubber gloves, etc. is usually reinforced with very fine particles of silica (Si0 2 ) 
to make the rubber stronger and tougher. 

Teeth with decayed regions have traditionally been restored with metals such as silver amalgam. Metallic 
restorations are not considered desirable for anterior teeth for cosmetic reasons. Acrylic resins and silicate 
cements had been used for anterior teeth, but their poor material properties led to short service life and 
clinical failures. Dental composite resins have virtually replaced these materials and are very commonly 
used to restore posterior teeth as well as anterior teeth [Cannon, 1988]. 

The dental composite resins consist of a polymer matrix and stiff inorganic inclusions [Craig, 1981]. 
A representative structure is shown in Fig. 40.4. The particles are very angular in shape. The inorganic 
inclusions confer a relatively high stiffness and high wear resistance on the material. Moreover, since they 
are translucent and their index of refraction is similar to that of dental enamel, they are cosmetically 
acceptable. Available dental composite resins use quartz, barium glass, and colloidal silica as fillers. Fillers 
have particle size from 0.04 um to 13 um, and concentrations from 33 to 78% by weight. In view of the 
greater density of the inorganic filler phase, a 77% weight percent of filler corresponds to a volume 
percent of about 55%. The matrix consists of a polymer, typically BIS-GMA. In restoring a cavity, the 
dentist mixes several constituents, then places them in the prepared cavity to polymerize. For this 
procedure to be successful the viscosity of the mixed paste must be sufficiently low and the polymerization 
must be controllable. Low viscosity liquids such as triethylene glycol dimethacrylate are used to lower 
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TABLE 40.1 Properties of Bone, Teeth, and Biomaterials 





Young's 










modulus 


Density p 


Strength 




Material 


E[GPa] 


(g/cm 3 ) 


(MPa) 


Refs. 


Hard Tissue 


17 


1.8 


130 (tension) 


(Craig, and Peyton, 


Tooth, bone, human compact bone, 








1958; Peters et al., 


longitudinal direction 








1984; Park and Lakes, 
1992; Reilly and 
Burstein, 1975) 


Tooth dentin 


18 


2.1 


138 (compression) 




Tooth enamel 


50 


2.9 






Polymers 








(Park and Lakes, 1992) 


Polyethylene (UHMW) 


1 


0.94 


30 (tension) 




Polymethyl methacrylate, PMMA 


3 


1.1 


65 (tension) 




PMMA bone cement 


2 


1.18 


30 (tension) 




Metals 








(Park and Lakes, 1992) 


316L Stainless steel (wrought) 


200 


7.9 


1000 (tension) 




Co-Cr-Mo (cast) 


230 


8.3 


660 (tension) 




Co Ni Cr Mo (wrought) 


230 


9.2 


1800 (tension) 




Ti6A14V 


110 


4.5 


900 (tension) 




Composites 










Graphite-epoxy (unidirectional fibrous, 


215 


1.63 


1240 (tension) 


(Schwartz, 1997) 


high modulus) 










Graphite-epoxy (quasi-isotropic fibrous) 


46 


1.55 


579 (tension) 


(Schwartz, 1997) 


Dental composite resins (particulate) 


10-16 




170-260 (compression) 


(Cannon, 1988) 


Foams 








(Gibson and Ashby, 


Polymer foams 


10 4 -l 


0.002-0.8 


0.01-1 (tension) 


1988) 



the viscosity and inhibitors such as BHT (butylated trioxytoluene) are used to prevent premature poly- 
merization. Polymerization can be initiated by a thermochemical initiator such as benzoyl peroxide, or 
by a photochemical initiator (benzoin alkyl ether) which generates free radicals when subjected to 
ultraviolet light from a lamp used by the dentist. 

Dental composites have a Young's modulus in the range 10 to 16 GPa, and the compressive strength 
from 170 to 260 MPa [Cannon, 1988]. As shown in Table 40.1, these composites are still considerably 
less stiff than dental enamel, which contains about 99% mineral. Similar high concentrations of mineral 
particles in synthetic composites cannot easily be achieved, in part because the particles do not pack 
densely. Moreover, an excessive concentration of particles raises the viscosity of the unpolymerized paste. 
An excessively high viscosity is problematical since it prevents the dentist from adequately packing the 
paste into the prepared cavity; the material will then fill in crevices less effectively. 

The thermal expansion of dental composites, as with other dental materials, exceeds that of tooth 
structure. Moreover, there is a contraction during polymerization of 1.2 to 1.6%. These effects are thought 
to contribute to leakage of saliva, bacteria, etc., at the interface margins. Such leakage in some cases can 
cause further decay of the tooth. 

Use of colloidal silica in the so-called "microfilled" composites allows these resins to be polished, so 
that less wear occurs and less plaque accumulates. It is more difficult, however, to make these with a high 
fraction of filler. All the dental composites exhibit creep. The stiffness changes by a factor of 2.5 to 4 
(depending on the particular material) over a time period from 10 sec to three h under steady load 
[Papadogianis et al., 1985] . This creep may result in indentation of the restoration, but wear seems to be 
a greater problem. 

Dental composite resins have become established as restorative materials for both anterior and pos- 
terior teeth. The use of these materials is likely to increase as improved compositions are developed and 
in response to concern over long term toxicity of silver-mercury amalgam fillings. 
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40.5 Fibrous Composites 

Fibers incorporated in a polymer matrix increase the stiffness, strength, fatigue life, and other properties 
[Agarwal and Broutman, 1980; Schwartz, 1992]. Fibers are mechanically more effective in achieving a 
stiff, strong composite than are particles. Materials can be prepared in fiber form with very few defects 
which concentrate stress. Fibers such as graphite are stiff (Young's modulus is 200 to 800 GPa) and strong 
(the tensile strength is 2.7 to 5.5 GPa). Composites made from them can be as strong as steel but much 
lighter, as shown in Table 40.1. The stiffness of a composite with aligned fibers, if it is loaded along the 
fibers, is equivalent to the Voigt upper bound, Eq. (40.1). Unidirectional fibrous composites, when loaded 
along the fibers, can have strengths and stiffnesses comparable to that of steel, but with much less weight 
(Table 40.1). However if it is loaded transversly to the fibers, such a composite will be compliant, with 
a stiffness not much greater than that of the matrix alone. While unidirectional fiber composites can be 
made very strong in the longitudinal direction, they are weaker than the matrix alone when loaded 
transversely, as a result of stress concentration around the fibers. If stiffness and strength are needed in 
all directions, the fibers may be oriented randomly. For such a three-dimensional isotropic composite, 
for a low concentration of fibers, 



E.V. 
E = -*-*- + E m> (40.5) 

6 

so the stiffness is reduced by about a factor of six in comparison with an aligned composite as illustrated 
in Fig. 40.3. However if the fibers are aligned randomly in a plane, the reduction in stiffness is only a 
factor of three. The degree of anisotropy in fibrous composites can be very well controlled by forming 
laminates consisting of layers of fibers embedded in a matrix. Each layer can have fibers oriented in a 
different direction. One can achieve quasi-isotropic behavior in the laminate plane; such a laminate is 
not as strong or as stiff as a unidirectional one, as illustrated in Table 40.1. Strength of composites depends 
on such particulars as the brittleness or ductility of the inclusions and the matrix. In fibrous composites 
failure may occur by (1) fiber breakage, buckling, or pullout, (2) matrix cracking, or (3) debonding of 
fiber from matrix. 

Short fiber composites are used in many applications. They are not as stiff or as strong as composites 
with continuous fibers, but they can be formed economically by injection molding or by in situ poly- 
merization. Choice of an optimal fiber length can result in improved toughness, due to the predominance 
of fiber pull-out as a fracture mechanism. 

Carbon fibers have been incorporated in the high density polyethylene used in total knee replacements 
(Fig. 40.5). The standard ultra high molecular weight polyethylene (UHMWPE) used in these implants 
is considered adequate for most purposes for implantation in older patients. A longer wear-free implant 
lifetime is desirable for use in younger patients. It is considered desirable to improve the resistance to 
creep of the polymeric component, since excessive creep results in an indentation of that component 
after long term use. Representative properties of carbon reinforced ultra high molecular weight polyeth- 
ylene are shown in Fig. 40.6 [Sclippa and Piekarski, 1973]. Enhancements of various properties by a 
factor of two are feasible. 

Polymethyl methacrylate (PMMA) used in bone cement is compliant and weak in comparison with 
bone. Therefore several reinforcement methods have been attempted. Metal wires have been used clini- 
cally as macroscopic "fibers" to reinforce PMMA cement used in spinal stabilization surgery [Fishbane 
and Pond, 1977]. The wires are made of a biocompatible alloy such as cobalt-chromium alloy or stainless 
steel. Such wires are not currently used in joint replacements owing to the limited space available. Graphite 
fibers have been incorporated in bone cement [Knoell et al., 1975] on an experimental basis. Significant 
improvements in the mechanical properties have been achieved. Moreover, the fibers have an added 
beneficial effect of reducing the rise in temperature which occurs during the polymerization of the PMMA 
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FIGURE 40.5 Knee prostheses with polyethylene tibial components reinforced with carbon fiber . 

in the body. Such high temperature can cause problems such as necrosis of a portion of the bone into 
which it is implanted. Thin, short titanium fibers have been embedded in PMMA cement [Topoleski 
et al., 1992]; a toughness increase of 51% was observed with a 5% volumetric fiber content. Fiber 
reinforcement of PMMA cement has not found much acceptance since the fibers also increase the viscosity 
of the unpolymerized material. It is consequently difficult for the surgeon to form and shape the 
polymerizing cement during the surgical procedure. 

Metals are currently used in bone plates for immobilizing fractures and in the femoral component of 
total hip replacements. A problem with currently used implant metals is that they are much stiffer than 
bone, so they shield the nearby bone from mechanical stress. Stress-shielding results in a kind of disuse 
atrophy: the bone resorbs [Engh and Bobyn, 1988]. Therefore composite materials have been investigated 
as alternatives [Bradley et al., 1980; Skinner, 1988]. Fibrous composites can deform to higher strains (to 
about 0.01) than metals (0.001 for a mild steel) without damage. This resilience is an attractive charac- 
teristic for more flexible bone plates and femoral stems. Flexible composite bone plates are effective in 
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FIGURE 40.6 Properties of carbon fiber reinforced ultra high molecular weight polyethylene. Replotted from 
Sclippa and Piekarski [1973]. 

promoting healing [Jockish, 1992]. Composite hip replacement prostheses have been made with carbon 
fibers in a matrix of polysulfone and polyetherether ketone (PEEK). These prostheses experience heavy 
load with a static component. Structural metals such as stainless steel and cobalt chromium alloys do 
not creep significantly at room or body temperature. In composites which contain a polymer constituent, 
creep behavior is a matter of concern. The carbon fibers exhibit negligible creep, but polymer constituents 
tend to creep. Prototype composite femoral components were found to exhibit fiber dominated creep of 
small magnitude and are not expected to limit the life of the implant [Maharaj and Jamison, 1993]. 

Fibrous composites have also been used in external medical devices such as knee braces [Yeaple, 1989], 
in which biocompatibility is not a concern but light weight is crucial. 

40.6 Porous Materials 

The presence of voids in porous or cellular solids will reduce the stiffness of the material. For some 
purposes, that is both acceptable and desirable. Porous solids are used for many purposes: flexible 
structures such as (1) seat cushions, (2) thermal insulation, (3) filters, (4) cores for stiff and lightweight 
sandwich panels, (5) flotation devices, and (6) to protect objects from mechanical shock and vibration; 
and in biomaterials, as coatings to encourage tissue ingrowth. Representative cellular solid structures are 
shown in Fig. 40.7. 

The stiffness of an open-cell foam is given by [Gibson and Ashby, 1988] 



E = E V 



(40.6) 



in which E s is the Young's modulus and V s is the volume fraction of the solid phase of the foam; V s is 
also called the relative density. 

The strength for crushing of a brittle foam and the elastic collapse of an elastomeric foam is given, 
respectively, by 



a .=0.65 a, V 

crush r,s I s 



3/2 



(40.7) 
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FIGURE 40.7 Cellular solids structures, after Gibson and Ashby [1988], Left: synthetic cellular solids: (a) open-cell 
polyurethane, (b) closed-cell polyethylene, (c) foamed nickel, (d) foamed copper, (e) foamed zirconia, (f) foamed 
mullite, (g) foamed glass, (h) polyester foam with both open and closed cells. Right: natural cellular solids: (a) cork, 
(b) balsa wood, (c) sponge, (d) cancellous bone, (e) coral, (f) cuttlefish bone, (g) iris leaf, (h) plant stalk. 



= 0.05E V 



(40.8) 



Here, o, s is the fracture strength of the solid phase. These strength relations are valid for relatively small 
density. Their derivation is based on the concept of bending of the cell ribs and is presented by Gibson 
and Ashby [ 1988] . Most man-made closed cell foams tend to have a concentration of material at the cell 
edges, so that they behave mechanically as open cell foams. The salient point in the relations for the 
mechanical properties of cellular solids is that the relative density dramatically influences the stiffness 
and the strength. As for the relationship between stress and strain, a representative stress strain curve is 
shown in Fig. 40.8. The physical mechanism for the deformation mode beyond the elastic limit depends 
on the material from which the foam is made. Trabecular bone, for example, is a natural cellular solid, 
which tends to fail in compression by crushing. Many kinds of trabecular bone appear to behave 
mechanically as an open cell foam. For trabecular bone of unspecified orientation, the stiffness is 
proportional to the cube of the density and the strength as the square of the density [Gibson and Ashby, 
1988], which indicates behavior dominated by bending of the trabeculae. For bone with oriented trabe- 
cule, both stiffness and strength in the trabecular direction are proportional to the density, a fact which 
indicates behavior dominated by axial deformation of the trabeculae. 

Porous materials have a high ratio of surface area to volume. When porous materials are used in 
biomaterial applications, the demands upon the inertness and biocompatibility are likely to be greater 
than for a homogeneous material. 

Porous materials, when used in implants, allow tissue ingrowth [Spector et al., 1988a,b]. The ingrowth 
is considered desirable in many contexts, since it allows a relatively permanent anchorage of the implant 
to the surrounding tissues. There are actually two composites to be considered in porous implants: (1) the 
implant prior to in growth, in which the pores are filled with tissue fluid which is ordinarily of no 
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FIGURE 40.8 Representative stress-strain curve for a cellular solid. The plateau region for compression in the case 
of elastomeric foam (a ruhbery polymer) represents elastic buckling; for an elastic-plastic foam (such as metallic 
foam), it represents plastic yield, and for an elastic-brittle foam (such as ceramic) it represents crushing. On the 
tension side, point 'A' represents the transition between cell wall bending and cell wall alignment. In elastomeric 
foam, the alignment occurs elastically, in elastic plastic foam it occurs plastically, and an elastic-brittle foam fractures 
at A. 

mechanical consequence; and (2) the implant filled with tissue. In the case of the implant prior to 
ingrowth, it must be recognized that the stiffness and strength of the porous solid are much less than in 
the case of the solid from which it is derived. 

Porous layers are used on bone compatible implants to encourage bony ingrowth [Galante et al., 1971; 
Ducheyne, 1984]. The pore size of a cellular solid has no influence on its stiffness or strength (though 
it does influence the toughness), however pore size can be of considerable biological importance. Spe- 
cifically, in orthopedic implants with pores larger than about 150 um, bony ingrowth into the pores 
occurs and this is useful to anchor the implant. This minimum pore size is on the order of the diameter 
of osteons in normal Haversian bone. It was found experimentally that pores <75 um in size did not 
permit the ingrowth of bone tissue. Moreover, it was difficult to maintain fully viable osteons within 
pores in the 75 to 150 um size range. Representative structure of such a porous surface layer is shown in 
Figure 40.9. Porous coatings are also under study for application in anchoring the artificial roots of dental 
implants to the underlying jawbone. Porous hydroxyapatite has been studied for use in repairing large 
defects in bone [Holmes et al., 1986; Meffert et al., 1985]. Hydroxyapatite is the mineral constituent of 
bone, and it has the nominal composition Ca 10 (PO 4 ) 6 (OH) 2 . Implanted hydroxyapatite is slowly resorbed 
by the body over several years and replaced by bone. Tricalcium phosphate is resorbed more quickly and 
has been considered as an implant constituent to speed healing. 

When a porous material is implanted in bone, the pores become filled first with blood which clots, 
then with osteoprogenitor mesenchymal cells, then, after about 4 weeks, bony trabeculae. The ingrown 
bone then becomes remodeled in response to mechanical stress. The bony ingrowth process depends on 
a degree of mechanical stability in the early stages of healing. If too much motion occurs, the ingrown 
tissue will be collagenous scar tissue, not bone. 

Porous materials used in soft tissue applications include polyurethane, polyamide, and polyester 
velours used in percutaneous devices. Porous reconstituted collagen has been used in artificial skin, and 
braided polypropylene has been used in artificial ligaments. As in the case of bone implants, the porosity 
encourages tissue ingrowth which anchors the device. 

Blood vessel replacements are made with porous materials which encourage soft tissue to grow in, 
eventually forming a new lining, or neointima. The new lining consists of the patient's own cells. It is a 
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FIGURE 40.9 Irregular pore structure of porous coating in Ti5Al4V alloy for bony ingrowth, from Park and Lakes 
[1992]. The top scanning electron microscopic picture is a 5x magnification of the rectangular region of the bottom 
picture (200x). 

natural nonthrombogenic surface resembling the lining of the original blood vessel. This is a further 
example of the biological role of porous materials as contrasted with the mechanical role. 

Ingrowth of tissue into implant pores is not always desirable. For example, sponge (polyvinyl alcohol) 
implants used in early mammary augmentation surgery underwent ingrowth of fibrous tissue, and 
contracture and calcification of that tissue, resulting in hardened, calcified breasts. Current mammary 
implants make use of a balloon-like nonporous silicone rubber layer enclosing silicone oil or gel, or 
perhaps a saline solution in water. A porous layer of polyester felt or velour attached to the balloon is 
provided at the back surface of the implant so that limited tissue ingrowth will anchor it to the chest 
wall and prevent it from migrating. 

Foams are also used externally to protect the human body from injury. Examples include knee pads, 
elbow pads, wrestling mats, and wheelchair cushions. Since these foams are only in contact with skin 
rather than any internal organs, they are not subject to rigorous biocompatibility requirements. They 
are therefore designed based on mechanical considerations. Foam used in sports equipment must have 
the correct compliance to limit impact force without bottoming out. Foam used in wheelchair cushions 
is intended to prevent pressure sores in people who suffer limited mobility. The properties of cushions 
are crucial in reducing illness and suffering in people who are confined to wheelchairs or hospital beds 
for long periods. Prolonged pressure on body parts can obstruct circulation in the capillaries. If this lasts 
too long it may cause a sore or ulcer called a pressure sore, also called a bed sore. In its most severe 
manifestation, a pressure sore can form a deep crater-like ulcer in which underlying muscle or bone is 
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exposed [Dinsdale, 1 974] . A variety of flexible cushion materials have been tried to minimize the incidence 
and severity of pressure sores [Garber, 1985]. Viscoelastic foam allows the cushion to progressively 
conform to the body shape. However, progressive densification of the foam due to creep results in a 
stiffer cushion which must be periodically replaced. 

Porous materials are produced in a variety of ways. For example, in the case of bone compatible 
surfaces they are formed by sintering of beads or wires. Vascular and soft tissue implants are produced 
by weaving or braiding fibers as well as by nonwoven "felting" methods. Protective foams for use outside 
the body are usually produced by use of a "blowing agent" which is a chemical which evolves gas during 
the polymerization of the foam. An interesting approach to producing micro-porous materials is the 
replication of structures found in biological materials: the replamineform process [White et al., 1976]. 
The rationale is that the unique structure of communicating pores is thought to offer advantages in the 
induction of tissue ingrowth. The skeletal structure of coral or echinoderms (such as sea urchins) is 
replicated by a casting process in metals and polymers; these have been tried in vascular and tracheal 
prostheses as well as in bone substitutes. 

40.7 Biocompatibility 

Carbon itself has been successfully used as a biomaterial. Carbon based fibers used in composites are 
known to be inert in aqueous (even seawater) environments, however they do not have a track record 
in the biomaterials setting. In vitro studies by Kovacs [1993] disclose substantial electrochemical activity 
of carbon fiber composites in an aqueous environment. If such composites are placed near a metallic 
implant, galvanic corrosion is a possibility. Composite materials with a polymer matrix absorb water 
when placed in a hydrated environment such as the body. Moisture acts as a plasticizer of the matrix 
and shifts the glass transition temperature towards lower values [Delasi and Whiteside, 1978], hence a 
reduction in stiffness and an increase in mechanical damping. Water immersion of a graphite epoxy 
cross-ply composite [Gopalan et al., 1989] for 20 days reduced the strength by 13% and the stiffness by 
9%. Moisture absorption by polymer constituents also causes swelling. Such swelling can be beneficial 
in dental composites since it offsets some of the shrinkage due to polymerization. 

Flexible composite bone plates are effective in promoting healing [Jockish, 1992] , but particulate debris 
from composite bone plates gives rise to a foreign body reaction similar to that caused by ultra high 
molecular weight polyethylene. 

Summary 

Composite materials are a relatively recent addition to the class of materials used in structural applica- 
tions. In the biomaterials field, the ingress of composites has been even more recent. In view of their 
potential for high performance, composite materials are likely to find increasing use as biomaterials. 
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The term biodegradation is loosely associated with materials that could be broken down by nature either 
through hydrolytic mechanisms without the help of enzymes and/or enzymatic mechanism. Other terms 
like absorbable, erodible, and resorbable have also been used in the literature to indicate biodegradation. 
The interests in biodegradable polymeric biomaterials for biomedical engineering use have increased 
dramatically during the past decade. This is because this class of biomaterials has two major advantages 
that non-biodegradable biomaterials do not have. First, they do not elicit permanent chronic foreign- 
body reactions due to the fact that they are gradually absorbed by the human body and do not perma- 
nently leave traces of residual in the implantation sites. Second, some of them have recently been found 
to be able to regenerate tissues, so called tissue engineering, through the interaction of their biodegra- 
dation with immunologic cells like macrophages. Hence, surgical implants made from biodegradable 
biomaterials could be used as a temporary scaffold for tissue regeneration. This approach toward the 
reconstruction of injured, diseased, or aged tissues is one of the most promising fields in the next century. 
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TABLE 41.1 Properties of Commercially Important Synthetic Absorable Polymers 
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" Glycolide per lactide = 9/1. 

b Glycolide per trimethylene carbonate = 9/1. 

c Depending on the copolymer composition. 

d 2/0 size Monocryl (glycolide-£-caprolactone copolymer). 

' PSI unit 

Source: Kimura, Y., 1993. Biodegradable Polymers, In: Biomedical Applications of Polymeric Materials, T. Tsuruta, 
T. Hayashi, K. Kataoka, K. Ishihara, and Y. Kimura, Eds. CRC Press, Boca Raton, Fla. and Chu, C.C., von Fraunhofer, 
J.A., and Greisler, HP, 1996. Wound Closure Biomaterials and Devices. CRC Press, Boca Raton, Fla. 

Although the earliest and most commercially significant biodegradable polymeric biomaterials were 
originated from linear aliphatic polyesters like polyglycolide and polylactide from poly(a-hydroxyacetic 
acids), recent introduction of several new synthetic and natural biodegradable polymeric biomaterials 
extends the domain beyond this family of simple polyesters. These new commercially significant biode- 
gradable polymeric biomaterials include poly(orthoesters), polyanhydrides, polysaccharides, polyester- 
amides), tyrosine-based polyarylates or polyiminocarbonates or polycarbonates, poly(D,L-lactide-ure- 
thane), poly(P-hydroxybutyrate), poly(s-caprolactone), poly[bis(carboxylatophenoxy) phosphazene], 
poly( amino acids), pseudo -poly (amino acids), and copolymers derived from amino acids and non-amino 
acids. 

All the above biodegradable polymeric biomaterials could be generally divided into eight groups based 
on their chemical origin: (1) Biodegradable linear aliphatic polyesters (e.g., polyglycolide, polylactide, 
polycaprolactone, polyhydroxybutyrate) and their copolymers within the aliphatic polyester family like 
poly(glycolide-L-lactide) copolymer and poly(glycolide-e-caprolactone) copolymer; (2) Biodegradable 
copolymers between linear aliphatic polyesters in ( 1 ) and monomers other than linear aliphatic polyesters 
like, poly(glycolide-trimethylene carbonate) copolymer, poly(L-lactic acid-L-lysine) copolymer, Tyrosine- 
based polyarylates or polyiminocarbonates or polycarbonates, poly(D,L-lactide-urethane), and polyester- 
amide); (3) Polyanhydrides; (4) Poly(orthoesters); (5) Poly( ester-ethers) like poly-p-dioxanone; (6) Bio- 
degradable polysaccharides like hyaluronic acid, chitin and chitson; (7) polyamino acids like poly-L- 
glutamic acid and poly-L-lysine; (8) Inorganic biodegradable polymers like polyphosphazene and 
poly[bis(carboxylatophenoxy)phosphazene] which have a nitrogen-phosphorus backbone instead of ester 
linkage. Recently, there is a new approach of making new biodegradable polymers through melt-blending 
of highly accepted biodegradable polymers like those of glycolide and lactide base [Shalaby, 1994]. 

The earliest, most successful, and frequent biomedical applications of biodegradable polymeric bio- 
materials have been in wound closure [Chu et al., 1996]. All biodegradable wound closure biomaterials 
are based upon the glycolide and lactide families. For example, polyglycolide (Dexon from American 
Cyanamid), poly(glycolide-L-lactide) random copolymer with 90 to 10 molar ratio (Vicryl from Ethicon), 
poly(ester-ether) (PDS from Ethicon), poly(glycolide-trimethylene carbonate) random block copolymer 
(Maxon from American Cyanamid), and poly(glycolide-s-caprolactone) copolymer (Monocryl from 
Ethicon). This class of biodegradable polymeric biomaterials is also the one most studied for their 
chemical, physical, mechanical, morphological, and biological properties and their changes with degra- 
dation time and environment. Some of the above materials like Vicryl have been commercially used as 
surgical meshes for repair of a hernia or the body wall. 
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The next largest biomedical application of biodegradable polymeric biomaterials that are commercially 
satisfactory is drug control/release devices. Some well-known examples in this application are polyanhy- 
drides and poly(ortho-ester). Biodegradable polymeric biomaterials, particularly totally resorbable com- 
posites, have also been experimentally used in the field of orthopedics, mainly as components for internal 
bone fracture fixation like PDS pins. However, their wide acceptance in other parts of orthopaedic 
implants may be limited due to their inherent mechanical properties and their biodegradation rate. 
Besides the commercial uses described above, biodegradable polymeric biomaterials have been experi- 
mented with as (1) vascular grafts, (2) vascular stents, (3) vascular couplers for vessel anastomosis, 
(4) nerve growth conduits, (5) augmentation of defected bone, (6) ligament/tendon prostheses, 
(7) intramedullary plug during total hip replacement, (8) anastomosis ring for intestinal surgery, and 
(9) stents in ureteroureterostomies for accurate suture placement. 

Due to space limitation, the emphasis of this chapter will be on the commercially most significant and 
successful biomedical biodegradable polymers based on (1) linear aliphatic polyesters, (2) some very 
recent research and development of important classes of synthetic biodegradable polymers, (3) a new 
theoretical approach to modeling the hydrolytic degradation of glycolide/lactide based biodegradable 
polymers, (4) the effects of some new extrinsic factors on the degradation of the most commercially 
significant biodegradable polymers, and (5) the new biomedical applications of this class of synthetic 
biodegradable polymers in tissue engineering and regeneration. The details of the applications of this 
family and other biodegradable polymeric biomaterials and their chemical, physical, mechanical, biolog- 
ical, and biodegradation properties can be found in other recent reviews [Barrows, 1986; Kimura, 1993; 
Shalaby, 1994; Chu et al., 1996; Hollinger, 1995; Vert et al., 1992; Park et al., 1993]. 

41.2 Glycolide/Lactide Based Biodegradable 
Linear Aliphatic Polyesters 

This class of biodegradable polymers is the most successful, important, and commercially widely used 
biodegradable biomaterials in surgery. It is also the class of biodegradable biomaterials that were most 
extensively studied in terms of degradation mechanisms and structure-property relationships. Among 
them, polyglycolide or polyglycolic acid (PGA) is the most important one because most other biode- 
gradable polymers are derived from PGA either through copolymerization, e.g., poly(glycolide-L-lactide) 
copolymer or modified glycolide monomer, e.g., poly-p-dioxanone. 

Glycolide Based Biodegradable Homopolymers Polyesters 

PGA can be polymerized either directly or indirectly from glycolic acid. The direct polycondensation 
produces a polymer of M n less than 1 0,000 because of the requirement of a very high degree of dehydration 
(99.28% up) and the absence of monofunctional impurities. For PGA of molecular weight higher than 
10,000 it is necessary to proceed through the ring-opening polymerization of the cyclic dimers of glycolic 
acid. Numerous catalysts are available for this ring-opening polymerization. They include organometallic 
compounds and Lewis acids [Chujo et al., 1967; Wise et al., 1979]. For biomedical applications, stannous 
chloride dihydrate or trialkyl aluminum are preferred. PGA was found to exhibit an orthorhombic unit 
cell with dimensions a = 5.22 A,b = 6.19 A, and c (fiber axis) = 7.02 A. The planar zigzag-chain molecules 
form a sheet structure parallel to the ac plane and do not have the polyethylene type arrangement [Chatani 
et al., 1968]. The molecules between two adjacent sheets orient in opposite directions. The tight molecular 
packing and the close approach of the ester groups might stabilize the crystal lattice and contribute to 
the high melting point, T m , of PGA (224-230°C). The glass transition temperature, T , ranges from 36 to 
40°C. The specific gravities of PGA are 1.707 for a perfect crystal and 1.50 in a completely amorphous 
state [Chujo et al, 1967a]. The heat of fusion of 100% crystallized PGA is reported to be 12 KJ/mole 
(45.7 cal/gram) [Brandrup et al., 1975]. A recent study of injection molded PGA disks reveals their IR 
spectroscopic characteristics [Chu et al., 1995]. As shown in Fig. 41.1, the four bands at 850, 753, 713, 
and 560 cm 1 are associated with the amorphous regions of the PGA disks and could be used to assess 
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FIGURE 41.1 FTIR spectra of polyglycolic acid disks as a function of in vitro hydrolysis time in phosphate buffer 
of pH 7.44 at 37°C. (a): day; (b) 55 hrs; (c) 7 days; (d) 21 days. 

the extends of hydrolysis. Peaks associated with the crystalline phase included those at 972, 901, 806, 
627, and 590 cm -1 . Two broad, intense peaks at 1142 and 1077 cnr 1 can be assigned to C-0 stretching 
modes in the ester and oxymethylene groups, respectively. These two peaks are associated mainly with 
ester and oxymethylene groups originating in the amorphous domains. Hydrolysis could cause both of 
these C-O stretching modes to substantially decrease in intensity. 

Glycolide-Based Biodegradable Copolyesters Having Aliphatic 
Polyester Based Co-Monomers 

Other commercially successful glycolide-based biodegradable polymeric biomaterials are the copolymers 
of glycolide with other monomers within linear aliphatic polyesters like lactides, carbonates, and e-capro- 
lactone. The glycolide-lactide random copolymers are the most studied and have a wide range of prop- 
erties and applications, depending on the composition ratio of glycolide to lactide. Figure 41.2 illustrates 
the dependence of biodegradation rate on the composition of glycolide to lactide in the copolymer. For 
wound closure purposes, a high concentration of glycolide monomer is required for achieving proper 
mechanical and degradation properties. Vicryl sutures, sometime called polyglactin 910, contain a 90/10 
molar ratio of glycolic to L-lactide and this molar ratio is important for the Vicryl suture to retain 
crystalline characteristics. For biomedical use, Lewis acid catalysts are preferred for the copolymers [Wise 
et al., 1979]. If DL- instead of L-lactide is used as the co-monomer, the U-shape relationship between 
the level of crystallinity and glycolide composition disappears. This is because polylactide from 100% 
DL-lactide composition is totally amorphous. IR bands associated with Vicryl molecules in the amor- 
phous domains are 560, 710, 850 and 888 cm 1 , while 590, 626, 808, 900 and 972 cm 1 are associated 
with the crystalline domains [Frederick et al., 1984]. Like PGA, these IR bands could be used to assess 
the extent of hydrolysis. 

A relatively new block copolymer of glycolide and carbonates, such as trimethylene carbonate, has 
been commercialized. Maxon is made from a block copolymer of glycolide and l,3-dioxan-2-one (trim- 
ethylene carbonate or GTMC) and consists of 32.5% by weight (or 36 mol%) of trimethylene carbonate 
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FIGURE 41.2 The effect of poly(L-lactide) composition in polyglycolide on the time required for 50% mass loss 
implanted under the dorsal skin of rat. (Source: Miller, R.A., Brady, J.M., and Outright, D.E., 1977. "Degradation 
Rates of Oral Resorbable Implants (Polylactates and Polyglycolates): Rate Modification with Changes in PLA/PGA 
Copolymer Ratios." /. Biomed. Mater. Res., 11(5):711. (With permission.) 

[Katz et al., 1985; Casey et al., 1984]. Maxon is a poly(ester-carbonate). The polymerization process of 
Maxon is divided into two stages. The first stage is the formation of a middle block which is a random 
copolymer of glycolide and l,3-dioxan-2-one. Diethylene glycol is used as an initiator and stannous 
chloride dihydrate (SnCl 2 -2H 2 0) serves as the catalyst. The polymerization is conducted at about 180°C. 
The weight ratio of glycolide to trimethylene carbonate in the middle block is 15:85. After the synthesis 
of the middle block, the temperature of the reactive bath is raised to about 220°C to prevent the 
crystallization of the copolymer and additional glycolide monomers as the end blocks are added into the 
reaction bath to form the final triblock copolymer. 

The latest glycolide-based copolymer that has become commercially successful is Monocryl suture. It 
is a segmented block copolymer consisting of both soft and hard segments. The purpose of having soft 
segments in the copolymer is to provide good handling properties like pliability, while the hard segments 
are used to provide adequate strength. The generic copolymerization process between glycolic acid and 
e-caprolactone was recently reported by Fukuzaki et al. in Japan [1989, 1991]. The resulting copolymers 
were low molecular weight biodegradable copolymers of glycolic acid and various lactones for potential 
drug delivery purposes. The composition of lactone ranged from as low as 15 to as high as 50 mol% and 
the weight average for molecular weight ranged from 4,510 to 16,500. The glass transition temperature 
ranged from 18 to -43°C, depending on the copolymer composition and molecular weight. 

Monocryl is made from two stages of the polymerization process [Bezwada et al., 1995]. In the first 
stage, soft segments of prepolymer of glycolide and s-caprolactone are made. This soft segmented 
prepolymer is further polymerized with glycolides to provide hard segments of polyglycolide. Monocryl 
has a composition of 75% glycolide and 25% e-caprolactone and should have a higher molecular weight 
than those glycolide/e-caprolactone copolymers reported by Fukuzaki et al., for adequate mechanical 
properties required by sutures. The most unique aspect of Monocryl monofilament suture is its pliability 
as claimed by Ethicon [Bezwada et al., 1995]. The force required to bend a 2/0 suture is only about 2.8 x 
10 4 lb-in 2 for Monocryl, while the same size PDSII and Maxon monofilament sutures require about 
3.9 and 11.6 x 10 4 lb-in 2 force, respectively. This inherent pliability of Monocryl is due to the presence 
of soft segments and T resulting from the e-caprolactone co-monomer unit. Its T is expected to be 
between 15 and -36°C. 
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Glycolide-Based Biodegradable Copolyesters with Non-Aliphatic 
Polyester-Based Co-Monomers 

In this category, the most important one is the glycolide copolymer consisting of poly(ethylene 1,4- 
phenylene-bis-oxyacetate) (PEPBO) [Jamiokowski and Shalaby, 1991]. The development of this type of 
glycolide-based copolymer was initiated because of the adverse effect of y irradiation on the mechanical 
properties of glycolide-based synthetic absorbable sutures. There is a great desire to develop y-irradiation 
sterilizable, synthetic, absorbable polymers to take advantage of the highly convenient and reliable method 
of sterilization. Shalaby et al. recently reported that the incorporation of about 10 mol% of a polymeric 
radiostabilizer like PEPBO into PGA backbone chains would make the copolymer sterilizable by y 
irradiation without a significant accelerated loss of mechanical properties upon hydrolysis when com- 
pared with the unirradiated copolymer control (MPG) [Jamiokowski et al., 1991]. The changes in tensile 
breaking force of both MPG and PGA sutures implanted intramuscularly and subcutaneously in rats for 
various periods show the great advantage of such copolymers. MPG fibers y-irradiated at 2.89 Mrads did 
not show any loss in tensile breaking force during the first 14 days postimplantation when compared 
with unimplanted samples. On the contrary, PGA sutures y-irradiated at 2.75 Mrads lost 62% of the 
tensile breaking force of their unimplanted samples. There was no tensile breaking force remaining for 
the irradiated PGA at the end of 21 days, while both 2.89 and 5 Mrads irradiated MPG retained 72 and 
55% of their corresponding day controls, respectively. The inherent more hydrolytic resistance of MPG 
must be attributed to the presence of an aromatic group in the backbone chains. This aromatic polyester 
component is also responsible for the observed y-irradiation stability. It is not known at this time whether 
the new y-irradiation-resistant MPG is biocompatible with biologic tissues due to the lack of published 
histologic data. 

Glycolide-Derived Biodegradable Polymers Having Ether Linkage 

Poly-p-dioxanone (PDS) is derived from the glycolide family with better flexibility. It is polymerized 
from ether-containing lactones, l,4-dioxane-2,5-dione (i.e.,p-dioxanone) monomers with a hydroxylic 
initiator and tin catalyst [Shalaby, 1994]. The resulting polymer is semi- crystalline with T m about 106 to 
115°C and T -10 to 0°C. The improved flexibility of PDS relative to PGA as evidenced in its lower T 
is due to the incorporation of an ether segment in the repeating unit which reduces the density of ester 
linkages for intermolecular hydrogen bonds. Because of the less dense ester linkages in PDS when 
compared with PGA or glycolide-L-lactide copolymers, PDS is expected and has been shown to degrade 
at a slower rate in vitro and in vivo. PDS having an inherent viscosity of 2.0 dL/g in hexafiuoroisopropanol 
is adequate for making monofilament sutures. Recently, an advanced version of PDS, PDSII, was intro- 
duced. PDSII was achieved by subjecting the melt-spun fibers to a high temperature (128°C) for a short 
period of time. This additional treatment partially melts the outermost surface layer of PDS fibers and 
leads to a distinctive skin-core morphology. The heat employed also results in larger crystallites in the 
core of the fiber than the untreated PDS fiber. The tensile strength-loss profile of PDSII sutures is better 
than that of PDS sutures. 

A variety of copolymers having high molar ratios of PDS compared to other monomers within the 
same linear aliphatic polyester family have been reported for the purpose of improving the mechanical 
and biodegradation properties [Shalaby, 1994]. For example, copolymer of PDS (80%) and PGA (up to 
20%) has an absorption profile similar to Dexon and Vicryl sutures but it has compliance similar to PDS. 
Copolymer of PDS (85%) and PLLA (up to 15%) results in a more compliant (low modulus) suture 
than homopolymer PDS but with absorption profiles similar to PDS [Bezwada et al., 1990]. 

Copolymer fibers made from PDS and monomers other than linear aliphatic polyester like morpholine- 
2,5-dione (MD) exhibit rather interesting biodegradation properties. This copolymer fiber was absorbed 
10 to 25% earlier than PDS. The copolymer, however, retained a tensile breaking strength profile similar 
to PDS with a slightly faster strength loss during the earlier stage, i.e., the first 14 days [Shalaby, 1994]. 
This ability to break the inherent fiber structure-property relationship through copolymerization is a 
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major improvement in biodegradation properties of absorbable sutures. It is interesting to recognize that 
a small % (3%) of MD in the copolymer suture is sufficient to result in a faster mass loss profile without 
the expense of its tensile strength-loss profile. The ability to achieve this ideal biodegradation property 
might be attributed to both an increasing hydrophilicity of the copolymer and the disruption of crystalline 
domains due to MD moiety. As described later, the loss of suture mass is mainly due to the destruction 
of crystalline domains, while the loss of tensile breaking strength is chiefly due to the scission of tie-chain 
segments located in the amorphous domains. The question is why MD-PDS copolymeric suture retains 
its strength-loss similar to PDS. The possible explanation is that the amide functional groups in MD 
could form stronger intermolecular hydrogen bonds than ester functional groups. This stronger hydrogen 
bond contributes to the strength retention of the copolymer of PDS and MD during in vivo biodegra- 
dation. The incorporation of MD moiety into PDS also lowers the unknot and knot strength of 
unhydrolyzed specimens, but increases elongation at break. This suggests that the copolymer of PDS and 
MD should have a lower level of crystallinity than PDS which is consistent with its observed faster mass 
loss in vivo. 

To improve y-irradiation stability of PDS, radiostabilizers like PEPBO have been copolymerized with 
PDS to form segmented copolymers the same way as PEPBO with glycolide described above [Shabaly, 
1994; Koelmel et al., 1991]. The incorporation of 5 to 10% of such stabilizer in PDS has been shown not 
only to improve y-irradiation resistance considerably but to also increase the compliance of the material. 
For example, PEPBO-PDS copolymer retained 79, 72, and 57% of its original tensile breaking strength 
at 2, 3, and 4 weeks in in vivo implantation, while PDS homopolymer retained only 43, 30, and 25% at 
the corresponding periods. It appears that an increasing (CH 2 ) group between the two ester functional 
groups of the radiation stabilizers improves the copolymer resistance toward y-irradiation. 

Lactide Biodegradable Homopolymers and Copolymers 

Polylactides, particularly poly-L-lactide (PLLA), and copolymers having >50% L- or DL-lactide have 
been explored for medical use without much success mainly due to their much slower absorption and 
difficulty in melt processing. PLLAs are prepared in solid state through ring-opening polymerization due 
to their thermal instability and should be melt-processed at the lowest possible temperature [Shalaby, 
1994]. Other methods like solution spinning, particularly for high molecular weight, and suspension 
polymerization have been reported as better alternatives. PLLA is a semi-crystalline polymer with T m = 
170°C and T = 56°C. This high T is mainly responsible for the extremely slow biodegradation rate 
reported in the literature. The molecular weight of lactide-based biodegradable polymers suitable for 
medical use ranges from 1.5 to 5.0 dL/g inherent viscosity in chloroform. Ultra high molecular weight 
of polylactides have been reported [Tunc, 1983; Leenslag et al., 1984]. For example, an intrinsic viscosity 
as high as 13 dL/g was reported by Leenslag et al. High strength PLLA fibers from this ultra high molecular 
weight polylactide was made by hot-drawing fibers from solutions of good solvents. The resulting fibers 
had tensile breaking strength close to 1.2 GPa [Gogolewski et al., 1983]. Due to a dissymmetric nature 
of lactic acid, the polymer made from the optically inactive racemic mixture of D and L enantiomers, 
poly-DL-lactide, however, is an amorphous polymer. 

Lactide-based copolymers having a high percentage of lactide have recently been reported, particularly 
those copolymerized with aliphatic polycarbonates like trimethylene carbonate (TMC) or 3,3-dimethyl- 
trimethylene carbonate (DMTMC) [Shieh et al., 1990]. The major advantage of incorporating TMC or 
DMTMC units into lactide is that the degradation products from TMC or DMTMC are largely neutral 
pH and hence are considered to be advantageous. Both in vitro toxicity and in vivo non-specific foreign 
body reactions like sterile sinuses have been reported in orthopaedic implants made from PGA and/or 
PLLA [Bostman et al, 1990; Daniels et al, 1992; Eitenmuller et al., 1989; Winet et al, 1993; Hofmann, 
1992]. Several investigators indicated that the glycolic or lactic-acid rich- degradation products have the 
potential to significantly lower the local pH in a closed and less body-fluid buffered regions surrounded 
by bone [Sugnuma et al., 1992]. This is particularly true if the degradation process proceeds with a burst 
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mode (i.e., a sudden and rapid release of degradation products). This acidity tends to cause abnormal 
bone resorption and/or demineralization. The resulting environment may be cytotoxic [Daniels et al., 
1992]. Indeed, inflammatory foreign body reactions with a discharging sinus and osteolytic foci visible 
on X-ray have been encountered in clinical studies [Eitenmuller et al., 1989]. Hollinger et al. recently 
confirmed the problem associated with PGA and/or PLLA orthopaedic implants [Winet et al., 1993]. A 
rapid degradation of a 50:50 ratio of glycolide-lactide copolymer in bone chambers of rabbit tibias has 
been found to inhibit bone regeneration. However, emphasis has been placed on the fact that extrapo- 
lation of in vitro toxicity to in vivo biocompatibility must consider microcirculatory capacity. The increase 
in the local acidity due to a faster accumulation of the highly acidic degradation products is also known 
to lead to an accelerated acid-catalyzed hydrolysis in the immediate vicinity of the biodegradable device. 
This acceleration in hydrolysis could lead to a faster loss of mechanical property of the device than we 
expect. This finding suggests the need to use components in totally biodegradable composites so that 
degradation products with less acidity would be released into the surrounding area. A controlled slow 
release rather than a burst release of degradation products at a level that the surrounding tissue could 
timely metabolize them would also be helpful in dealing with the acidity problem. Copolymers of 
composition ratio of 10DMTMC/90LLA or 10TMC/90LLA appear to be a promising absorbable ortho- 
paedic device. Other applications of this type of copolymers include nerve growth conduits, tendon 
prostheses, and coating materials for biodegradable devices. 

Another unique example of L-lactide copolymer is the copolymer of L-lactide and 3-(S)[(alkyloxycar- 
bonyl) methyl]-l,4-dioxane-2,5-dione, a cyclic diester [Kimura, 1993]. The most unique aspect of this 
new biodegradable copolymer is the carboxyl acid pendant group which obviously would make the new 
polymer not only more hydrophilic and hence faster biodegradation but also more reactive toward future 
chemical modification through the pendant carboxyl group. The availability of these carboxyl reactive 
pendant sites could be used to chemically bond antimicrobial agents or other biochemicals like growth 
factors for making future wound closure biomaterials having new and important biological functions. 
Unfortunately, there are no reported data to evaluate the performance of this new absorbable polymer 
for biomedical engineering use up to the present time. 

Block copolymers of PLLA with poly(amino acids) have also been reported as a potential controlled 
drug delivery system [Nathan et al., 1994]. This new class of copolymers consists of both ester and amide 
linkages in the backbone molecules and is sometimes referred as poly(depsipeptides) or polyesters- 
amides). Poly(depsipeptides) could also be synthesized from ring-opening polymerization of morpho- 
line-2,5-dione and its derivatives [Helder et al., 1986]. Barrows has also made a series of poly(ester- 
amides) from polyesterification of diols that contain preformed amide linkages, such as amidediols 
[Barrows, 1994]. Katsarava and Chu et al. just reported the synthesis of high-molecular- weight poly(ester- 
amides) of M w from 24,000 to 167,000 with narrow polydispersity (MJM n = 1.20 - 1.81) via solution 
polycondensation of di-p-toluenesulfonic acid salts of bis-(a-amino acid) a,co-alkylene diesters and di- 
p-nitrophenyl esters of diacids [Katsarava et al., In press]. These poly(ester-amide)s consisted of naturally 
occurring and non-toxic building blocks and had excellent film forming properties. These polymers were 
mostly amorphous materials with T from 11 to 59°C. The rationale for making poly(ester-amides) is to 
combine the well-known absorbability and biocompatibility of linear aliphatic polyesters with the high 
performance and the flexibility of potential chemical reactive sites of amide of polyamides. Polyester- 
amides) could be degraded either by enzyme and/or nonenzymatic mechanisms. There is no commercial 
use of this class of copolymers at the present time. 

The introduction of poly( ethylene oxide) (PEO) into PLLA in order to modulate the hydrophilicity 
and degradability of PLLA for drug control/release biomaterials has been reported and an example is the 
triblock copolymer of PLA/PEO/PLA [Li et al., 1998a]. Biomaterials having an appropriate PLLA and 
PEO block length were found to have a hydrogel property that could deliver hydrophilic drugs as well 
as hydrophobic ones like steroids and hormones. Another unique biodegradable biomaterial consisting 
of a star-block copolymer of PLLA, PGA, and PEO was also reported for protein drug delivery devices 
[Li et al., 1998b]. This star-shaped copolymer has 4 or 8 arms made of PEO, PLLA, and PGA. The glass 
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transition temperature and the crystallinity of this star-shaped block copolymer were significantly lower 
than the corresponding linear PLLA and PGA. 

Because of the characteristic of very slow biodegradation rate of PLLA and the copolymers having a 
high composition ratio of PLLA, their biomedical applications have been mainly limited to 
(1) orthopaedic surgery, (2) drug control/release devices, (3) coating materials for suture, (4) vascular 
grafts, and (5) surgical meshes to facilitate wound healing after dental extraction. 

41.3 Non-Glycolide/Lactide Based Linear Aliphatic Polyesters 

All glycolide/lactide based linear aliphatic polyesters are based on poly(a-hydroxy acids). Recently, there 
are two unique groups of linear aliphatic polyesters based on poly(co-hydroxy acids) and the most famous 
ones are poly(e-caprolactone) [Kimura, 1993],poly(P-hydroxybutyrate) (PHB),poly(P-hydroxyvalerate) 
(PHV) and the copolymers of PHB/PHV [Gross, 1994]. Poly(e-caprolactone) has been used as a comono- 
mer with a variety of glycolide/lactide based linear aliphatic polyesters described earlier. PHB and PHV 
belong to the family of poly(hydroxyalkanoates) and are mainly produced by prokaryotic types of 
microorganisms like Pseudomonas olevorans or Alcaligenes eutrophus through biotechnology. PHB and 
PHV are the principal energy and carbon storage compounds for these microorganisms and are produced 
when there are excessive nutrients in the environment. These naturally produced PHB and PHV are 
stereochemically pure and are isotactic. They could also be synthesized in labs, but the characteristics of 
steroregularity is lost. 

This family of biodegradable polyesters is considered to be environmentally friendly because they are 
produced from propionic acid and glucose and could be completely degraded to water, biogas, biomass, 
and humic materials [Gross, 1994]. Their biodegradation requires enzymes. Hence, PHB, PHV, and their 
copolymers are probably the most important biodegradable polymers for environmental use. However, 
the biodegradability of this class of linear aliphatic polyesters in human or animal tissues has been 
questionable. For example, high molecular weight PHB or PHB/PHV fibers do not degrade in tissues or 
simulated environments over periods of up to six months [Williams, 1990]. The degradability of PHB 
could be accelerated by y-irradiation or copolymerization with PHV. 

An interesting derivative of PHB, poly([5-malic acid) (PMA), has been synthesized from (3-benzyl 
malolactonate followed by catalytic hydrogenolysis. PMA differs from PHB in that the P-(CH 3 ) substit- 
uent is replaced by -COOH [Kimura, 1993]. The introduction of pendant carboxylic acid group would 
make PMA more hydrophilic and easier to be absorbed. 

41.4 Non- Aliphatic Polyesters Type Biodegradable Polymers 
Aliphatic and Aromatic Polycarbonates 

The most significant aliphatic polycarbonates are based upon DMTMC and TMC. They are made by the 
same ring-opening polymerization as glycolide-based biodegradable polyesters. The homopolymers are 
biocompatible with a controllable rate of biodegradation. Pellets of poly(ethylene carbonate) were 
absorbed completely in two weeks in the peritoneal cavity of rats. A slight variation of this polycarbonate, 
i.e., poly(propylene carbonate), however, did not show any sign of absorption after two months [Barrows, 
1986]. Copolymers of DMTMC/e-caprolactone and DMTMC/TMC have been reported to have adequate 
properties for wound closure, tendon prostheses, and vascular grafts. The most important advantage of 
aliphatic polycarbonates is the neutral pH of the degradation products. 

Poly(BPA-carbonates) made from bisphenol A (BPA) and phosgene is non-biodegradable, but an 
analog of poly(BPA-carbonate) like poly(iminocarbonates) have been shown to degrade in about 200 days 
[Barrows, 1986]. In general, this class of aromatic polycarbonates takes an undesirably long period to 
degrade, presumably due to the presence of an aromatic ring which could protect adjacent ester bonds 
to be hydrolyzed by water or enzymes. Different types of degradation products of this polymer under 
different pH environments are produced. At pH > 7.0, the degradation products of this polymer are BPA, 
and ammonia and C0 2 , while insoluble poly(BPA-carbonate) oligomers were produced with pH < 7.0 
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[Barrows, 1986]. The polymer had good mechanical properties and acceptable tissue biocompatibility. 
Unfortunately, there is currently no commercial use of this class of polymer in surgery. 

Poly(alkylene oxalates) and Copolymers 

This class of high crystalline biodegradable polymers was initially developed [Shalaby, 1994] for absorb- 
able sutures and their coating. They consist of [-ROOC— COO— ] n repeating unit where _R is (CH 2 ) X with 
x ranging from 4 to 12. _R could also be cyclic (1,4-trans-cyclohexanedimethanol) or aromatic (1,4- 
benzene, 1,3-benzene dimethanol) for achieving higher melting temperature. The biodegradation prop- 
erties depend on the number of (CH 2 ) group, x, and the type of _R group (i.e., acyclic vs cyclic or aromatic) . 
In general, a higher number of methylene group and/or the incorporation of cyclic or aromatic _R group 
would retard the biodegradation rate and hence make the polymer absorbed slower. For example, there 
was no mass of the polymer with x = 4 remaining in vivo (rats) after 28 days, while the polymer with 
x = 6 retained 80% of its mass after 42 days in vivo. An isomorphic copolyoxalate consisting of 80% cyclic 
-R group like 1,4-trans-cyclohexanedimethanol and 20% with acyclic _R group like 1,6-hexanediol retained 
56% of its original mass after 180 days in vivo. By varying the ratio of cyclic to acyclic monomers, 
copolymers with a wide range of melting temperatures could be made, e.g., copolymer of 95/5 ratio of 
cyclic (i.e., l,4-trans-cyclohexanedimethanol)/acyclic (i.e., 1,6-hexanediol) monomers had a T m = 210°C, 
while the copolymer with 5/95 ratio had a T m = 69°C. Poly(alkylene oxalates) with x= 3 or 6 had been 
experimented with as drug control/release devices. The tissue reaction to this class of biodegradable 
polymers has been minimal. 

41.5 Biodegradation Properties of Synthetic 
Biodegradable Polymers 

The reported biodegradation studies of a variety of biodegradable polymeric biomaterials have mainly 
focused on their tissue biocompatibility, the rate of drug release, or loss of strength and mass. Recently, 
the degradation mechanisms and the effects of intrinsic and extrinsic factors, such as pH [Chu, 1981, 
1982], enzymes [Chu et al., 1983; Williams, 1979; Williams et al, 1977, 1984], y-irradiation [Chu et al., 
1982, 1983; Campbell et al, 1981; Williams et al, 1984; Zhang et al, 1993], electrolytes [Pratt et al, 
1993a], cell medium [Chu et al., 1992], annealing treatment [Chu et al., 1988], plasma surface treatment 
[Loh et al., 1992], external stress [Chu, 1985a; Miller et al., 1984], and polymer morphology [Chu et al., 
1989] and on a chemical means to examine the degradation of PGA fibers [Chu et al., 1985] have been 
systemically examined and the subject has been recently reviewed [Chu, 1985b, 1991, 1995a, 1995b; Chu 
et al., 1996; Hollinger, 1995]. Table 41.2 is an illustration of structural factors of polymers that could 
control their degradation [Kimura, 1993]. Besides these series of experimental studies of a variety of 
factors that could affect the degradation of biodegradable polymeric biomaterials, there are two new 
areas that broaden the above traditional study of biodegradation properties of biodegradable polymers 
into the frontier of science. They are: theoretical modeling and the role of free radicals. 



TABLE 41.2 Structural Factors to Control the Polymer Degradability 

Factors Methods of Control 

Chemical structure of main chain and side groups Selection of chemical bonds and functional groups 

Aggregation state Processing, copolymerization 

Crystalline state Polymer blend 

Hydrophilic/hydrophobic balance Copolymerization, introduction of functional groups 

Surface area Micropores 

Shape and morphology Fiber, film, composite 

Source: Kimura, Y., 1993, Biodegradable Polymers, in Biomedical Applications of Polymeric Materials, T.Tsu- 
ruta, T. Hayashi, K.Kataoka, K. Ishihara, and Y. Kimura, Eds. p. 164-190. CRC Press, Boca Raton, FL. 
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Theoretical Modeling of Degradation Properties 

The most systematic theoretical modeling study of degradation properties of biodegradable biomaterials 
was reported by Pratt and Chu who used computational chemistry to theoretically model the effects of 
a variety of substituents which could exert either steric effect and/or inductive effect on the degradation 
properties of glycolide/lactide based biodegradable polymers [Pratt et al., 1993b; 1994a and b]. This new 
approach could provide scientists with a better understanding of the relationship between the chemical 
structure of biodegradable polymers and their degradation behavior at a molecular level. It also could 
help the future research and development of this class of polymers through the intelligent prediction of 
structure-property relationships. In those studies, Pratt and Chu examined the affect of various derivatives 
of linear aliphatic polyester (PGA) and a naturally occurring linear polysaccharide (hyaluronic acid) on 
their hydrolytic degradation phenomena and mechanisms. 

The data showed a decrease in the rate of hydrolysis by about a factor of 10 6 with isopropyl a-substit- 
uents, but nearly a six-fold increase with t-butyl ot-substituents [Pratt et al., 1993b]. The role of electron 
donating and electron withdrawing groups on the rate of hydrolytic degradation of linear aliphatic 
polyesters was also theoretically modeled by Pratt and Chu [Pratt et al., 1994a]. Electron withdrawing 
substituents a to the carbonyl group would be expected to stabilize the tetrahedral intermediate resulting 
from hydroxide attack, i.e., favoring hydroxide attack but disfavoring alkoxide elimination. Electron 
releasing groups would be expected to show the opposite effect. Similarly, electronegative substituents 
on the alkyl portion of the ester would stabilize the forming alkoxide ion and favor the elimination step. 
Pratt and Chu found that the rate of ester hydrolysis is greatly affected by halogen substituents due 
primarily to charge derealization. The data suggest that the magnitude of the inductive effect on the 
hydrolysis of glycolic esters decreases significantly as the location of the substituent is moved further 
away from the a-carbon because the inductive effect is very distance-sensitive. In all three locations of 
substitutions (a, [3, and y), CI and Br substituents exhibited the largest inductive effect compared to other 
halogen elements. 

Therefore, Pratt and Chu concluded that the rate of ester hydrolysis is greatly affected by both alkyl 
and halogen substituents due primarily to either steric hinderance or charge derealization. In the steric 
effect, alkyl substituents on the glycolic esters cause an increase in activation enthalpies, and a corre- 
sponding decrease in reaction rate, up to about three carbon sizes, while bulkier alkyl substituents other 
than isopropyl make the rate-determining elimination step more facile. It appears that aliphatic polyesters 
containing a isopropyl groups, or slightly larger linear alkyl groups, such as n-butyl, n-pentyl, etc., would 
be expected to show a longer strength retention, given the same fiber morphology. In the inductive effect, 
a-substituents on the acyl portion of the ester favor the formation of the tetrahedral intermediate through 
charge derealization, with the largest effect seen with CI substitution, but retard the rate-determining 
alkoxide elimination step by stabilizing the tetrahedral intermediate. The largest degree of stabilization 
is caused by the very electronegative F substituent. 

The Role of Free Radicals in Degradation Properties 

Salthouse et al. had demonstrated that the biodegradation of synthetic absorbable sutures is closely related 
to macrophage activity through the close adhesion of macrophage onto the surface of the absorbable 
sutures [Matlaga et al., 1980]. It is also known that inflammatory cells, particularly leukocytes and 
macrophages are able to produce highly reactive oxygen species like superoxide (-0 2 ) an d hydrogen 
peroxide during inflammatory reactions toward foreign materials [Badwey et al., 1980; Devereux et al., 
1991]. These highly reactive oxygen species participate in the biochemical reaction, frequently referred 
to as a respiratory burst, which is characterized by the one electron reduction of 2 into superoxide via 
either NADPH or NADH oxidase as shown below. The reduction of 2 results in an increase in 2 uptake 
and the consumption of glucose. 

(NADPH Oxidast) 

20 2 + NADPH— i 2_^2-0; + NADP + + H + (41.1) 
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The resulting superoxide radicals are then neutralized to H 2 2 via cytoplasmic enzyme superoxide 
dismutase (SOD). 

i (sod) 
2-0;+2H + — i — UH 2 2 +0 2 , (41.2) 

Very recently, Williams et al. suggested that these reactive oxygen species may be harmful to polymeric 
implant surfaces through their production of highly reactive, potent, and harmful hydroxyl radicals -OH 
in the presence of metals like iron as shown in the following series of redox reactions [Ali et al., 1993; 
Zhong et al, 1994; Williams et al, 1991]. 

•0 2 +M +n ^0 2 +M + ( n ~ 1 ' (41.3) 

H 2 2 +M + ' n " 1 '-^«OH + HO"+M +n (41.4) 

The net reaction will be: 

•0; + H 2 2 ^»OH + HCT + 2 (41.5) 

and is often referred to as the metal-catalyzed Haber-Weiss reaction [Haber et al., 1934]. 

Although the role of free radicals in the hydrolytic degradation of synthetic biodegradable polymers 
is largely unknown, a very recent study using absorbable sutures like Vicryl in the presence of an aqueous 
free radical solution prepared from H 2 2 and ferrous sulfate, FeS0 4 , raised the possibility of the role of 
free radicals in the biodegradation of synthetic absorbable sutures [Williams et al., 1991; Zhong et al., 
1994]. As shown below, both -OH radicals and OH~ are formed in the process of oxidation of Fe +2 by 
H 2 2 and could exert some influence on the subsequent hydrolytic degradation of Vicryl sutures. 

Fe +2 + H 2 2 -> Fe +3 + 'OH + OH- 

SEM results indicated that Vicryl sutures in the presence of free radical solutions exhibited many 
irregular surface cracks at both 7 and 14 days in vitro, while the same sutures in the two controls (H 2 2 
or FeS0 4 solutions) did not have these surface cracks. Surprisingly, the presence of surface cracks of 
Vicryl sutures treated in the free radical solutions did not accelerate the tensile breaking strength-loss as 
would be expected. Thermal properties of Vicryl sutures under the free radical and 3% H 2 2 media 
showed the classical well-known maximum pattern of the change of the level of crystallinity with 
hydrolysis time. The level of crystallinity of Vicryl sutures peaked at 7 days in both media (free radical 
and 3% H 2 2 ). The time for peak appearance in these two media was considerably earlier than Vicryl 
sutures in conventional physiological buffer media. Based on the Chu's suggestion of using the time of 
the appearance of the crystallinity peak as an indicator of degradation rate, it appears that these two 
media accelerated the degradation of Vicryl sutures when compared with regular physiological buffer 
solution. Based on their findings, Williams et al. proposed the possible routes of the role of -OH radicals 
in the hydrolytic degradation of Vicryl sutures [Zhong et al., 1994]. Unfortunately, the possible role of 
OH", one of the byproducts of Fenton reagents (H 2 2 /FeS0 4 ), was not considered in the interpretation 
of their findings. OFT species could be more potent than *OH toward hydrolytic degradation of synthetic 
absorbable sutures. This is because hydroxyl anions are the sole species which attack carbonyl carbon of 
the ester linkages during alkaline hydrolysis. Since an equal amount of -OH and OH~ are generated in 
Fenton reagents, the observed changes in morphological, mechanical, and thermal properties could be 
partially attributed to OFT ions as well as -OH radicals. 

Besides hydroxyl radicals, the production of superoxide ions and singlet oxygen during phagocytosis 
has been well documented [Babior et al., 1973]. Although the role of superoxide in simple organic ester 
hydrolysis has been known since the 1970s [Forrester et al., 1984, 1987; Johnson, 1976; Mango et al., 
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1976; San Fillipo et al., 1976], its role in the hydrolytic degradation of synthetic biodegradable polyester- 
based biomaterials has remained largely unknown. Such an understanding of the superoxide ion role 
during the biodegradation of foreign materials has become increasing desirable because of the advanced 
understanding of how the human immune system reacts to foreign materials and the increasing use of 
synthetic biomaterials for human body repair. 

Lee and Chu very recently examined the reactivity of the superoxide ion towards biodegradable 
biomaterials having an aliphatic polyester structure at different reaction conditions such as temperature, 
time, and superoxide ion concentration [Lee et al., 1996a] . Due to the extreme reactivity of the superoxide 
ion, it has been observed that the effect of superoxide ion-induced hydrolytic degradation of PDLLA and 
PLLA was significant in terms of changes in molecular weights and thermal properties. The superoxide 
ion-induced fragmentation of PDLLA would result in a mixture of various species with different chain 
lengths. A combined GPC method with a chemical tagging method revealed that the structure of oligomer 
species formed during the superoxide-induced degradation of PDLLA and PLLA was linear. The signif- 
icant reduction in molecular weight of PDLLA by superoxide ion was also evident in the change of 
thermal properties like T . The linear low molecular species (oligomer, trimers, and dimers) in the reaction 
mixture could act as an internal plasticizer to provide the synergetic effects of lowering X by increasing 
free volume. The effect of the superoxide ion-induced hydrolytic degradation on molecular weight of 
PLLA was similar to PDLLA but with a much smaller magnitude. The mechanism of simple hydrolysis 
of ester by superoxide ion proposed by Forrester et al. was subsequently modified to interpret the data 
obtained from the synthetic biodegradable polymers. 

In addition to the PDLLA and PLLA, superoxide ions also have a significant adverse effect on the 
hydrolytic degradation of synthetic absorbable sutures [Lee et al., 1996c]. A significant reduction in 
molecular weight has been found along with mechanical and thermal properties of these sutures over a 
wide range of superoxide ion concentrations, particularly during the first few hours of contact with 
superoxide ions. For example, the PGA suture lost almost all of its mass at the end of 24 h contact with 
superoxide ions at 25°C, while the same suture would take at least 50 days in an in vitro buffer for a 
complete mass loss. The surface morphology of these sutures was also altered drastically. The exact 
mechanism, however, is not fully known yet; Lee et al. suggested the possibility of simultaneous occur- 
rence of several main-chain scissions by three different nucleophilic species. 

Lee and Chu also reported that the addition of Fenton agent or hydrogen peroxide to the degradation 
medium would retard the well-known adverse effect of the conventional y-irradiation sterilization of 
synthetic absorbable sutures [Lee et al., 1996a]. They found that these y-irradiated sutures retained better 
tensile breaking strength in the Fenton medium than in the regular buffer media. Chu et al. postulated 
that the y-irradiation induced a-carbon radicals in these sutures react with the hydroxyl radicals from 
the Fenton agent medium and hence neutralize the adverse effect of a-carbon radicals on the backbone 
chain scission. This mechanism is supported by the observed gradual loss of ESR signal of the sutures 
in the presence of the Fenton agent in the medium. 

Instead of the adverse effect of free radicals on the degradation properties of synthetic biodegradable 
polyesters, Lee and Chu described an innovative approach of covalent bonding nitroxyl radicals onto these 
biodegradable polymers so that the nitroxyl radical attached polymers would have biological functions 
similar to nitric oxide [Lee et al., 1996b, 1998]. A preliminary in vitro cell culture study of these new 
biologically active biodegradable polymers indicated that they could retard the proliferation of human 
smooth muscle cells as native nitric oxide. The full potential of this new class of biologically active 
biodegradable polymers is currently under investigation by Chu for a variety of therapeutic applications. 

41.6 The Role of Linear Aliphatic Biodegradable Polyesters 
in Tissue Engineering and Regeneration 

The use of biodegradable polymers as the temporary scaffolds either to grow cells/tissues in vitro for 
tissue engineering applications or to regenerate tissues in vivo has very recently become a highly important 



© 2000 by CRC Press LLC 



aspect of research and development that broadens this class of biodegradable polymers beyond their 
traditional use in wound closure and drug control/release biomaterials. The scaffolds used in either tissue 
engineering or regeneration are to provide support for cellular attachment and subsequent controlled 
proliferation into a predefined shape or form. Obviously, a biodegradable scaffold would be preferred 
because of the elimination of chronic foreign body reaction and the generation of additional volume for 
regenerated tissues. 

Although many other biodegradable polymers of natural origin like alginate [Atala et al., 1994], 
hyaluronate [Benedetti et al., 1993; Larsen et al., 1993], collagen [Hirai et al., 1995] and laminin [Dixit, 
1994] have been experimented with for such a purpose, synthetic biodegradable polymers of linear 
aliphatic polyesters like PGA, PLA, and their copolymers [Bowald etal, 1979, 1980; Freed etal, 1993; 
Greisler, 1982; Greisler et al., 1985, 1987 (a and b), 1988 (a, b and c), 1991a; Kim et al., 1998 (a and b); 
Mikos et al, 1993; Mooney et al., 1994, 1995, 1996 (a, b, and c); Yu et al., 1993, 1994] have received more 
attention because of their consistent sources, reproducible properties, means to tailor their properties, 
and versatility in manufacturing processes. 

Biodegradable polymers must be fabricated into stable textile structures before they can be used as 
the scaffold for tissue engineering or regeneration. The stability of the scaffold structure is important 
during tissue engineering and regeneration in order to maintain its proper size, shape, or form upon the 
shear force imposed by the circulating culture media in a bioreactor, the contractile force imposed by 
the growing cells on the scaffold surface, and other forces like the compression from surrounding tissues. 

Kim et al. reported that, although ordinary non-woven PGA matrices have very good porosity (to 
facilitate diffusion of nutrients) with a high surface to volume ratio (to promote cell attachment and 
proliferation) and have been used to engineer dental pulp and smooth muscle tissues having comparable 
biological contents as the native tissues [Kim et al., 1998b; Mooney et al., 1996c], these non-woven PGA 
matrices could not maintain their original structure during tissue engineering due to the relatively weak 
non-woven textile structure and stronger contractile force exerted by the attached and proliferated 
cells/tissues [Kim etal., 1998a]. This led to deformed engineered tissues that may have undesirable 
properties; for example, the smooth muscle engineered on collagen gels exhibited significant contraction 
over time [Hirai et al, 1995; Zeigler et al, 1994]. 

Because of this shortcoming of the existing non-woven PGA matrices, Kim et al. very recently reported 
the use of PLLA to stabilize the PGA matrices [Kim et al., 1998a]. A 5% w/v PLLA solution in chloroform 
was sprayed onto PGA non-woven matrices (made of 12 urn diameter PGA fibers) of 97% porosity and 
either 3 mm or 0.5 mm thickness. The PLLA-impregnated PGA non-wovens could be subjected to 
additional heat treatment at 195°C to enhance their structural stability further. Figure 41.3 shows the 
morphology of such a heat annealed PLLA-impregnated PGA non-woven matrix [Kim et al., 1998]. The 
PLLA was deposited mainly on the crosspoints of PGA fibers and hence interlocked the possible sliding 
of PGA fibers upon external force. Depending on the amount of PLLA used and subsequent heat 
treatment, the resulting PLLA-impregnated PGA non-woven matrices had an increase in compressive 
modulus of 10- to 35-fold when compared with the original PGA non-woven. The PLLA-impregnated 
PGA non-woven matrices also retained their initial volume (101 ± 4%) and about same shape as the 
original during the seven weeks in culture, while the untreated PGA non-woven exhibited severe distor- 
tion in shape and contracted about 5% of its original volume. Since PLLA is well-known to degrade at 
a much slower rate than PGA, its presence on the PGA fibers surface would be expected to make the 
treated PGA non-woven matrices degrade at a much slower rate than the untreated PGA non-woven. 
For example, the PLLA treated PGA non-woven retained about 80% of its initial mass, while the untreated 
PGA control had only 10% at the end of the seven week culture. 

Linear aliphatic polyesters like PGA, its lactide copolymer, and poly-p-dioxanone have also been 
fabricated into both woven and knitted forms for the in vivo regeneration of blood vessels in animals 
[Bowald et al, 1979, 1980; Greisler, 1982; Greisler et al., 1985, 1987a, 1988c, 1991b; Yu et al, 1993, 1994]. 
The published results from a variety of animals like dogs and rabbits indicate that full-wall healing with 
pseudo-endothelial lining was observed. This class of synthetic biodegradable polymers are promising 
candidates for the regeneration of vascular tissue. 
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FIGURE 41.3 Scanning electron micrograph of the exterior of PLLA- impregnated and annealed PGA matrix. 
(Source: Kim, B.S. and Mooney, D.J., 1998a. Engineering smooth muscle tissue with a predefined structure. /. Biomed. 
Mater. Res., 41(2):322-332). 



These encouraging findings were believed to be associated with the intense macrophage/biomaterial 
interactions. [Greisler, 1988a; Greisler etal., 1989]. This interaction leads to a differential activation of 
the macrophage which, in turn, yields different macrophage products being released into the microen- 
vironment [Greisler et al., 1991b]. Greisler et al. have documented active stimulatory or inhibitory effects 
of various bioresorbable and non-resorbable materials on myofibroblast, vascular smooth muscle cell, 
and endothelial cell regeneration, and has shown a transinterstitial migration to be their source when 
lactide/glycolide copolymeric prostheses are used [Greisler etal., 1988b]. The rate of tissue ingrowth 
parallels the kinetics of macrophage mediated prosthetic resorption in all lactide/glycolides studied 
[Gresler, 1982; Greisler etal., 1985, 1987a, 1988a]. Macrophage phagocytosis of the prosthetic material 
is observed histologically as early as one week following implantation of a rapidly resorbed material, such 
as PGA or polyglactin 910 (PG910), and is followed by an extensive increase in the myofibroblast 
population and neovascularization of the inner capsules [Greisler, 1982; Greisler etal., 1985, 1986]. 
Autoradiographic analyses using tritiated thymidine demonstrated a significantly increased mitotic index 
within these inner capsular cells, that mitotic index paralleling the course of prosthetic resorption 
[Greisler et al., 1991a]. Polyglactin 910, for example, resulted in a mitotic index of 20.1 ± 16.6% three 
weeks following implantation, progressively decreasing to 1.2 ± 1.3% after 12 weeks. The more slowly 
resorbed polydioxanone prostheses demonstrated a persistently elevated mitotic index, 7.1 ± 3.8%, 
12 weeks after implantation, a time in which the prosthetic material was still being resorbed. By contrast 
Dacron never yielded greater than a 1.2 ± 1.3% mitotic index [Greisler etal., 1991a]. These mitotic 
indices correlated closely with the slopes of the inner capsule thickening curves suggesting that myofi- 
broblast proliferation contributed heavily to this tissue deposition. 
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Therefore, the degradation property of synthetic biodegradable polymers somehow relates to mac- 
rophage activation which subsequently leads to the macrophage production of the required growth factors 
that initiate tissue regeneration. Different degradation properties of synthetic biodegradable polymers 
would thus be expected to result in different levels of macrophage activation, i.e., different degrees of 
tissue regeneration. 

Defining Terms 

Biodegradation: Materials that could be broken down by nature either through hydrolytic mechanisms 
without the help of enzymes and/or enzymatic mechanism. It is loosely associated with absorbable, 
erodable, resorbable. 

Tissue Engineering: The ability to regenerate tissue through the help of artifical materials and devices. 

References 

Ali, S.A.M., Zhong, S.P., Doherty, P.J., and Williams, D.F., 1993. Mechanisms of polymer degradation in 

implantable devices. I. Poly(caprolactone). Biomaterials, 14:648. 
Atala, A., Kim, W., Paige, K.T., Vancanti, C.A., and Retil, A., 1994. Endoscopic treatment of vesicoureterall 

reflux with a chondrocye-alginate suspension. /. Urol, 152:641-643. 
Babior, B.M., Kipnes R.S., and Cumutte, J.T., 1973. Biological defense mechanisms. The production by 

leukocytes of superoxide, A potential bactercidal agent. /. Clin. Invest., 52: 741. 
Badwey, J.A. and Kamovsky, M.L., 1980. Active oxygen species and the functions of phagocytic leucocytes. 

Ann. Rev. Biochem., 49:695. 
Barrows, T.H., 1986, Degradable implant materials: A review of synthetic absorbable polymers and their 

applications. Clin. Mater., 1:233-257. 
Barrows, T.H., 1994. Bioabsorbable Poly(ester-amides), In: Biomedical Polymers: Designed-to-Degrade Sys- 
tems, S.W. Shalaby, Ed., Hanser, New York, chap. 4. 
Benedetti, L., Cortivo, R., Berti, T, Berti, A., and Pea, R, 1993. Biocompatibility and biodegradation of 

different hyaluronan derivatives (Hyaff) implanted in rats. Biomaterials, 14:1154-1160. 
Bezwada, R.S., Jamiolkowski, D.D., Lee, I.Y., Agarwal, V., Persivale, J., Trenka-Benthin, S., Erneta, M., 

Suryadevara, J., Yang, A., and Liu, S., 1995. Monocryl suture: A new ultra-pliable absorbable 

monofilament suture, Biomaterials, 16: 1141-1148. 
Bezwada, R.S., Shalaby, S.W., Newman, H.D. Jr., and Kafrawy, A., 1990. Bioabsorbable copolymers of 

p-dioxanone and lactide for surgical devices. Trans. Soc. for Biomater. vol. XIII, p. 194. 
Bostman, O., Hirvensalo, E., Vainionpaa, S. et al., 1990. Degradable polyglycolide rods for the internal 

fixation of displaced bimalleolar fractures. Intern. Orthop. (Germany), 14(l):l-8. 
Bowald, S., Busch, C, and Eriksson, I., 1979. Arterial regeneration following polyglactin 910 suture mesh 

grafting, Surgery, 86(5):722-729. 
Bowald, S., Busch, C, and Eriksson, I., 1980. Absorbable material in vascular prosthesis. Acta.Chir. Scand., 

146: 391-395. 
Brandrup, J., and Immergut, E. H., 1975. Polymer Handbook, 2nd ed., Wiley, New York. 
Campbell, N.D., and Chu, C.C., 1981. The effect of y-irradiation on the biodegradation of polyglycolic 

acid synthetic sutures, the Tensile Strength Study. 27th International Symposium on Macromolecules, 

Abstracts of Communications, vol. II, 1348-1352, Strasbourg, France, July 6-9, 1981. 
Casey, D. J., and Roby, M.S., 1984. Synthetic copolymer surgical articles and method of manufacturing 

the same. US Patent 4,429,080, American Cyanamid. 
Chatani, Y, Suehiro, K., Okita, Y, Tadokoro, H., and Chujo, K., 1968. Structural studies of polyesters, I. 

Crystal structure of polyglycolide. Die Makromol Chemie, 113:215-229. 
Chu, C.C., 1981. The In-vitro degradation of poly(glycolic acid) sutures: Effect of pH. /. Biomed. Mater. 

Res., 15: 795-804. 



3 2000 by CRC Press LLC 



Chu, C.C., 1982. The effect of pH on the in vitro degradation of poly(glycolide lactide) copolymer 

absorbable sutures. /. Biomed. Mater. Res., 16:117-124. 
Chu, C.C., 1985a. Strain-Accelerated Hydrolytic Degradation of Synthetic Absorbable Sutures. In: Surgical 

Research Recent Development, C.W. Hall, Ed., Pergamon Press, San Antonio, Texas. 
Chu, C.C., 1985b. The Degradation and Biocompatibility of Suture Materials. In: CRC Critical Reviews in 

Biocompat., D.F. Williams, Ed., vol. 1 (3), CRC Press, Boca Raton, FL. 261-322. 
Chu, C.C., 1991. Recent Advancements in Suture Fibers for Wound Closure. In: High-Tech Fibrous Materials: 

Composites, Biomedical Materials, Protective Clothing, and Geotextiles, T.L. Vigo and A.E Turbak, 

Eds., ACS Symposium Series #457, American Chemical Society, Washington, D.C. 167-213. 
Chu, C.C., 1995a. Biodegradable Suture Materials: Intrinsic and Extrinsic Factors Affecting Biodegradation 

Phenomena. In: Handbook of Biomaterials and Applications, D.L. Wise, D.E. Altobelli, E.R. Schwartz, 

M. Yszemski, J. D. Gresser, and D.J. Trantolo, Eds., Marcel Dekker, New York. 
Chu, C.C. 1995b. Biodegradable Suture Materials: Intrinsic and Extrinsic Factors Affecting Biodegradation. 

In: Encyclopedic Handbook of Biomaterials and Applications, Part A: Materials, Volume 1, D. L. Wise, 

Ed., Marcel Dekker, chap. 17, 543-688. 
Chu, C.C. and Browning, A., 1988. The study of thermal and gross morphologic properties of polyglycolic 

acid upon annealing and degradation treatments. /. Biomed. Mater. Res., 22(8): 699-712. 
Chu, C.C. and Campbell, N.D., 1982. Scanning electron microscope study of the hydrolytic degradation 

of poly(glycolic acid) suture. /. Biomed. Mater. Res., 16(4): 417-430. 
Chu, C.C, Hsu, A., Appel, M., and Beth, M. 1992. The effect of macrophage cell media on the in vitro 

hydrolytic degradation of synthetic absorbable sutures. 4th World Biomaterials Congress, April 

27-May 1, 1992, Berlin, Germany. 
Chu, C.C, and Kizil, Z., 1989. The effect of polymer morphology on the hydrolytic degradation of 

synthetic absorbable sutures. 3rd International ITV Conference on Biomaterials — Medical Textiles, 

Stuttgart, W. Germany, June 14-16, 1989. 
Chu, C.C. and Louie, M., 1985. A chemical means to study the degradation phenomena of polyglycolic 

acid absorbable polymer. /. Appl. Polym. Sci, 30: 3133-3141. 
Chu, C.C, von Fraunhofer, J.A., and Greisler, H.P., 1996. Wound Closure Biomaterials and Devices. CRC 

Press, Boca Raton, FL. 
Chu, C.C, and Williams, D.E, 1983. The effect of y-irradiation on the enzymatic degradation of polyg- 
lycolic acid absorbable sutures. /. Biomed. Mater. Res., 17(6):1029. 
Chu, C.C, Zhang, L., and Coyne, L., 1995. Effect of irradiation temperature on hydrolytic degradation 

properties of synthetic absorbable sutures and polymers. /. Appl. Polym. Sci. 56: 1275-1294. 
Chujo, K., Kobayashi, H, Suzuki, J., Tokuhara, S., and Tanabe, M., 1967a. Ring-opening polymerization 

of glycolide. Die Makromol. Chemie, 100:262-266. 
Chujo, K., Kobayashi, H, Suzuki, J., and Tokuhara, S., 1967b. Physical and chemical characteristics of 

polyglycolide. Die Makromol. Chemie, 100:267-270. 
Daniels, A.U., Taylor, M.S., Andriano, K.P., and Heller, J., 1992. Toxicity of absorbable polymers proposed 

for fracture fixation devices. Trans. 38th Ann. Mtg. Orthop. Res. Soc, 17:88. 
Devereux, D.E, O'Connell, S.M., Liesch, J.B., Weinstein, M., and Robertson, EM., 1991. Induction of 

leukocyte activation by meshes surgically implanted in the peritoneal cavity. Am. J. Surg., 162:243. 
Dixit, V, 1994. Development of a bioartificial liver using isolated hepatocytes. Artif. Organs, 18:371-384. 
Eitenmiiller, K.L., Schmickal, G.T., and Muhr, G., 1989. Die versorgung von sprunggelenksfrakturen unter 

verwendung von platten und schrauben aus resorbierbarem polymer material. Presented at Jahr- 

estagung der Deutschen Gesellschaft fur Unfallheilkunde, Berlin, November 1989. 
Forrester, A.R., and Purushotham, V, 1987. Reactions of carboxylic acid derivatives with superoxide. 

/. Chem. Soc. Perkin Trans. 1,945. 
Forrester, A.R., and Purushotham, V, 1984. Mechnism of hydrolysis of esters by superoxide. /. Chem. Soc, 

Chem. Commun., 1505. 



© 2000 by CRC Press LLC 



Fredericks, R.J., Melveger, A. J., and Dolegiewitz, L.J., 1984. Morphological and structural changes in a 

copolymer of glycolide and lactide occurring as a result of hydrolysis. /. Polym. Sci. Phy. Ed. 

22:57-66. 
Freed, L.E., Marquis, J.C. Nohia, A., Emmanual, J., Mikos, A.G., and Langer, R., 1993. Neocartilage 

formation in vitro and in vivo using cells cultured on synthetic biodegradable polymers. /. Biomed. 

Mater. Res., 27:11-23. 
Fukuzaki, H., Yoshida, M., Asano, M., Aiba, Y., and Kumakura, M., 1989. Direct copolymerization of 

glycolic acid with lactones in the absence of catalysts. Eur. Polym. J., 26:457-461. 
Fukuzaki, H., Yoshida, M., Asano, M., Kumakura, M., Mashimo, T., Yuasa, H., Imai, K., Yamandka, H., 

Kawaharada, U., and Suzuki, K., 1 99 1 . A new biodegradable copolymer of glycolic acid and lactones 

with relatively low molecular weight prepared by direct copolycondensation in the absence of 

catalysts. /. Biomed. Mater. Res., 25:315-328. 
Gogolewski, S., and Pennings, A.J., 1983. Resorbable materials of poly(L-lactide). II. Fibres spun from 

solutions of poly(L-lactide) in good solvents. /. Appl. Polym. Sci., 28:1045-61. 
Greisler, H.P., 1982. Arterial Regeneration Over Absorbable Prostheses. Arch. Surg., 117:1425-1431. 
Greisler, H.P., 1988a. Macrophage-Biomaterial Interactions with Bioresorbable Vascular Prostheses. Trans- 
actions ofASAIO, 34(4):1051-1059. 
Greisler, H.P., 1991a. Macrophage Activation in Bioresorbable Vascular Grafts. In: Vascular Endothelium: 

Physiological Basis of Clinical Problems. JD Catravas, AD Callow, CN Gillis, and U Ryan, Eds. 

Plenum Publishing, New York, NATO Advanced Study Institute. 253-254. 
Greisler, H.P., Dennis, J.W., Endean, E.D., Ellinger, J., Friesel, R., and Burgess, W., 1989. Macrophage/Bio- 

material interactions: The stiulation of endotherlialization. /. Vase. Surg., 9(4):588-593. 
Greisler, H.P., Dennis, J.W., Endean, E.D., and Kim, D.U., 1988b. Derivation of neointima of vascular 

grafts. Circ. Suppl. I, 78:16-112. 
Greisler, H.P., Ellinger, J., Schwarcz, T.H., Golan, J., Raymond, R.M., and Kim, D.U., 1987a. Arterial 

regeneration over polydioxanone prostheses in the rabbit. Arch. Surg., 122:715-721. 
Greisler, H.P., Endean, E.D, Klosak, J.J., Ellinger, I., Dennis, J.W., Buttle, K, and Kim, D.U., 1988c. 

Polyglactin 910/polydioxanone bicomponent totally resorbable vascular prostheses. /. Vase. Surg., 

7:697-705. 
Greisler, H.P., Kim, D.U., Dennis, J.W., Klosak, J.J., Widerborg, K.A., Endean, E.D., Raymond, R.M., and 

EllingerJ., 1987b. Compound polyglactin 910/polypropylene small vessel prostheses. /. Vase. Surg., 

5:572-583. 
Greisler, H.P., Kim, D.U., Price, J.B., and Voorhees, A.B., 1985. Arterial regenerative activity after prosthetic 

implantation. Arch. Surg., 120: 315-323. 
Greisler, H.P., Schwarcz, T.H., Ellinger, J., and Kim, D.U.,1986. Dacron inhibition of arterial regenerative 

activity. /. Vase. Surg., 747-756. 
Greisler, H.P., Tattersall, C.W., Kloask, J.J. et al., 1991b. Partially bioresorbable vascular grafts in dogs. 

Surgery, 110(4):645-655. 
Gross, R.A., 1994. Bacterial polyesters: Structural variability in microbial synthesis. In: Biomedical Poly- 
mers: Designed-to-Degrade Systems, ed. S.W. Shalaby, chap. 7. Hanser, New York. 
Haber, F. and Weiss, J., 1934. The catalytic decomposition of hydrogen peroxide by iron salts. Proc. R. 

Soc. Lond., A147:332. 
Helder, J., Feijen, J., Lee, S.J., and Kim, W., 1986. Copolyemrs of DL-lactic acid and glycine. Makromol. 

Chem., Rapid. Commun. 7:193. 
Hirai, J., & Matsuda, T, 1995. Self- organized, tubular hybrid vascular tissue composed of vascular cells 

and collagen for low pressure-loaded venous system, Cell Transplant, 4: 597-608. 
Hofmann, G.O., 1992. Biodegradable implants in orthopaedic surgery-A review of the state of the art. 

Clin. Mater., 10(1 & 2):75. 
Hollinger, J.O., 1995. Biomedical Applications of Synthetic Biodegradable Polymers, CRC Press, Boca Raton, 

FL. 



3 2000 by CRC Press LLC 



Jamiokowski, D. D., and Shalaby, S. W., 1991. A polymeric radiostabilizer for absorbable polyesters. In: 

Radiation Effect of Polymers, R. L. Clough and S. W. Shalaby, Eds., chap. 18, pp.300-309. ACS 

Symposium Series # 475, ACS, Washington, D.C. 
Johnson, R.A., 1976. Tetrahedron Lett., 331. 
Katsarava, R., Beridze, V., Arabuli, N. Kharadze, D., Chu, C. C, and Won, C.Y., Amino Acid Based 

Bioanalogous Polymers. Synthesis and Study of Regular Poly(ester amide)s Based on Bis(a-amino 

acid) a, ro-alkylene Diesters and Aliphatic Dicarboxylic Acids. /. Polym. Sci., Chemistry ed. (In 

Press). 
Katz, A., Mukherjee, D.P., Kaganov, A.L., and Gordon, S., 1985. A new synthetic monofilament absorbable 

suture made from polytrimethylene carbonate. Surg. Gynecol. Obstet., 161:213-222. 
Kim, B.S., Mooney, D.J., 1998a. Engineering smooth muscle tissue with a predefined structure. /. Biomed. 

Mater. Res., 41 (2):322. 
Kim, B.S., Putman, A.J., Kulik, T.J., and Mooney, D.J., 1998b. Optimizing seeding and culture methods 

to engineer smooth muscle tissue on biodegradable polymer matrices. Biotechnol. Bioeng., 

57:64-54. 
Kimura, Y., 1993. Biodegradable Polymers. In: Biomedical Applications of Polymeric Materials, T.Tsuruta, 

T. Hayashi, K.Kataoka, K. Ishihara, and Y. Kimura, Eds. p. 164-190. CRC Press, Inc., Boca Raton, FL. 
Koelmel, D.F., Jamiokowski, D.D., Shalaby, S.W., Bezwada, R.S., 1991. Low modulus radiation sterilizable 

monofilament sutures. Polym. Prepr, 32(2): 235-236. 
Larsen, N.E., Pollak, C.T., Reiner, K., Leshchiner, E., and Balazs, E.A., 1993. Hylan gel biomaterial: Dermal 

and immunologic compatibility. /. Biomed. Meter. Res., 27:1129-1134. 
Lee, K.H. and Chu, C. C, 1996a. The role of free radicals in hydrolytic degradation of absorable polymeric 

biomaterials. 5th World Biomaterials Congress, Toronto, Canada, May 29-June 2. 
Lee, K.H. and Chu, C.C., 1998. Molecular design of biologically active biodegradable polymers for 

biomedical applications. Macromol. Symp., 130:71. 
Lee, K.H., Chu, C.C., and Fred, J., 1996b. Aminoxyl- containing radical spin in polymers and copolymers. 

U.S. Patent 5,516,881, May 16. 
Lee, K.H. , Won, C.Y., and Chu, C.C., 1996c. Hydrolysis of absorable polymeric biomaterials by superoxide. 

5th World Biomaterials Congress, Toronto, Canada, May 29-June 2. 
Leenslag, J.W., Pennings, A.J., 1984. Synthesis of high-molecular weight poly(L-lactide) initiated with tin 

2-ethylhexanoate. Makromol. C/iem.,188(8):1809-1814. 
Li, S., Anjard, S., Tashkov, I., and Vert, M., 1998a. Hydrolytic degradation of PLA/PEO/PLA triblock 

copolymers prepared in the presence of Zn metal or CaH 2 , Polymer, 39(22): 5421-5430. 
Li, Y and Kissel, T, 1998b. Synthesis, characteristics and in vitro degradation of star-block copolymers 

consisting of L-lactide, glycolide and branched multi-arm poly(ethylene oxide). Polymer, 39(18): 

4421-4427. 
Loh, I.H., Chu, C.C., and Lin, H.L., 1992. Plasma surface modification of synthetic absorbable fibers for 

wound closure. /. Appl. Biomater. 3(2):131— 146. 
Magno, F. and Bontempelli, C, 1976. /, Electroanal. Chem., 68: 337. 
Matlaga, V.F. and Salthouse, T.N., 1980. Electron microscopic observations of polyglactin 910 suture sites, 

In First World Biomaterials Congress, Abstr., Baden, Austria, April 8-12, 2. 
Mikos, A.G., Sarakinos, G., Leite, S.M., Vacanti, J.P., and Langer, R., 1993. Laminated three-dimensional 

biodegradable forms for use in tissue engineering. Biomaterials, 14:323-330. 
Miller, N.D., and Williams, D.F., 1984. The in vivo and in vitro degradation of poly(glycolic acid) suture 

material as a function of applied strain. Biomaterials, 5:365-368. 
Mooney, D.J., Baldwin, D.F., Vacanti, J.P., and Langer, R., 1996a. Novel approach to fabricate porous 

sponges of poly(D,L-lactic-co-glycolic acid) without the use of organic solvents. Biomaterials, 

17:1417-1422. 
Mooney, D.J., Breuer, C, McNamara, K., Vacanti, J.P., and Langer, R., 1995. Fabricating tubular devices 

from polymers of lactic and glycolic acid for tissue engineering. Tissue Eng., 1: 107-118. 



© 2000 by CRC Press LLC 



Mooney, D.J., Mazzoni, C.L., Breuer, K., McNamara, J.P., Vacanti, J.P., and Langer, R., 1996b. Stabilized 

polyglycolic acid fibre-based tubes for tissue engineering. Biomaterials, 17:115-124. 
Mooney, D.J., Organ, G., Vacanti, J.P., and Langer, R., 1994. Design and fabrication of biodegradable 

polymer devices to engineer tubular tissue. Cell Transplant, 3: 203-210. 
Mooney, D.J, Powell, C, Piana, J., Rutherford, B., 1996c. Engineering dental pulp-like tissue in vitro. 

Biotechnol. Prog., 12:865-868. 
Nathan, A., and Kohn, J., 1994. Amino acid derived polymers. ln:Biomedical Polymers: Designed-to- 

Degrade Systems, S.W. Shalaby, chap. 5. Hanser Publishers, New York. 
Park, K., Shalaby, W.S.W., and Park. H., 1993. Biodegradable Hydrogels for Drug Delivery, Technomic 

Publishing, Lancaster, PA. 
Pratt, L., Chu, A., Kim, J., Hsu, A. and Chu, C.C., 1993a. The effect of electrolytes on the in vitro hydro lytic 

degradation of synthetic biodegradable polymers: Mechanical properties, thermodynamics and 

molecular modeling. /. Polym Sci. Chem Ed., 31:1759-1769. 
Pratt, L. and Chu, C.C., 1993b. Hydrolytic degradation of a-substituted polyglycolic acid: A semi- 
empirical computational study. /. Computational Chemistry, 14(7):809-817. 
Pratt, L. and Chu, C.C., 1994a. The effect of electron donating and electron withdrawing substituents 

on the degradation rate of bioabsorbable polymers: A semi-empirical computational study. 

/. Molecular Structure, 304:213-226. 
Pratt, L. and Chu, C.C., 1994b. A Computational Study of the Hydrolysis of Degradable Polysaccharide 

Biomaterials: Substituent effects on the hydrolytic mechanism. /. Computational Chemistry, 

15:241-248. 
Puelacher, W.C., Mooney, D., Langer, R., Upton, J., Vacanti, J.P., and Vananti, C.A., 1994. Design of 

nasoseptal cartilage replacements sunthesized from biodegradable polymers and chondrocytes. 

Biomaterials, 15:774-778. 
San Fillipo, Jr, J., Romano, L.J., Chem, C.I., and Valentine, J.S., 1976. Cleavage of esters by superoxide. 

/. Org. Chem., 4:586. 
Shalaby, S.W., 1994. Biomedical Polymers: Designed-to-Degrade Systems, Hanser Publishers, New York. 
Shieh, S.J., Zimmerman, M.C., and Parsons, J.R., 1990. Preliminary characterization of bioresorbable 

and nonresorbable synthetic fibers for the repair of soft tissue injuries. /. Biomed. Mater. Res., 

24:789-808. 
Sugnuma, J., Alexander, H., Traub, J., and Ricci, J.L., 1992. Biological response of intramedullary bone 

to poly-L-lactic acid. In: Tissue-Inducing Biomater, L.G. Cima and E.S. Ron, Eds., Mater. Res. Soc. 

Sump. Proc, 252:339-343. 
Tunc, D.C., 1983. A high strength absorbable polymer for internal bone fixation. Trans. Soc. Biomater., 

6:47. 
Vert, M., Feijen, J., Albertsson, A., Scott, C, and Chiellini, E., 1992. Biodegradable Polymers and Plastics, 

Royal Society of Chemistry, Cambridge, England. 
Williams, D.F., 1979. Some observations on the role of cellular enzymes in the In vivo degradation of 

polymers. ASTM Spec. Tech. Publ, 684: 61-75. 
Williams, D.F., 1990. Biodegradation of medical polymers. IN: Concise Encyclopedia of Medical & Dental 

Materials, D.F. Williams, Ed. pp. 69-74. Pergamon Press, New York. 
Williams, D.F., and Mort, E., 1977. Enzyme-accelerated hydrolysis of polyglycolic acid. /. Bioeng. 

1:231-238. 
Williams, D.F., and Chu, C.C., 1984. The effects of enzymes and gamma irradiation on the tensile strength 

and morphology of poly(p-dioxanone) fibers. /. Appl. Polym. Sci., 29:1865-1877. 
Williams, D.F. and Zhong, S.P., 1991. Are free radicals involved in the biodegradation of implanted 

polymers. Advanced Materials, 3:623. 
Winet, H., Hollinger, J.O., 1993. Incorporation of polylactide-polyglycolide in a cortical defect: Neoost- 

eogenesis in a bone chamber. /. Biomed. Mater. i?es.,27:667-676. 



) 2000 by CRC Press LLC 



Wise, D.L., Fellmann, T.D., Sanderson, J.E., & Wentworth, R.L., 1979. Lactic/Glycolic acid polymers. In 

Drug Carriers in Biology and Medicine, G. Gregoriadis, Ed., 237-270. Academic Press, New York. 
Yu, T.J. and Chu, C.C., 1993. Bicomponent vascular grafts consisting of synthetic biodegradable fibers. 

Part I. In vitro study. /. Biomed. Mater. Res. 27:1329-1339. 
Yu, T.J., Ho, D.M., & Chu, C.C., 1994. Bicomponent vascular grafts consisting of synthetic biodegradable 

fibers. Part II. In vivo Healing Response. /. Investigative Surg. 7:195-211. 
Zhang, L., Loh, I.H., and Chu, C.C., 1993. A combined y-irradiation and plasma deposition treatment 

to achieve the ideal degradation properties of synthetic absorbable polymers. /. Biomed. Mater. 

Res., 27:1425-1441. 
Zhong, S.P., Doherty, P.J., and Williams, D.F., 1994. A preliminary study on the free radical degradation 

of glycolic acid/lactic acid copolymer. Lastics, Rubber and Composites Processing and Application, 

21:89. 
Ziegler, T. and Nerem, R.M., 1994. Tissue engineering a blood vessel: Regulation of vascular biology by 

mechanical stress, /. Cell. Biochem., 56: 204-209. 

Further Information 

Several recent books have very comprehensive descriptions of a variety of biodegradable polymeric 
biomaterials, their synthesis, physical, chemical, mechanical, biodegradable, and biological properties. 

Barrows, T.H., 1986, Degradable implant materials: A review of synthetic absorbable polymers and their 

applications, Clin. Mater., 1:233-257. 
Chu, C. C, Biodegradable suture materials: Intrinsic and extrinsic factors affecting biodegradation 

phenomena, In: Handbook of Biomaterials and Applications, D.L.Wise, D.E. Altobelli, E.R. Schwartz, 

M. Yszemski, J. D. Gresser, and D.J. Trantolo, Eds. Marcel Dekker, New York, (1995). 
Chu, C.C., von Fraunhofer, J.A., and Greisler, H.P., 1996. Wound Closure Biomaterials and Devices. CRC 

Press, Boca Raton, FL. 
Hollinger, J.O., Ed. 1995. Biomedical Applications of Synthetic Biodegradable Polymers, CRC Press, Boca 

Raton, FL. 
Kimura, Y., 1993, Biodegradable Polymers, In: Biomedical Applications of Polymeric Materials, T.Tsuruta, 

T. Hayashi, K.Kataoka, K. Ishihara, and Y. Kimura, Eds., p.164-190. CRC Press, Inc., Boca Raton, 

FL. 
Park, K., Shalaby, W.S.W., and Park. H., 1993. Biodegradable Hydrogels for Drug Delivery, Technomic 

Publishing, Lancaster, PA. 
Shalaby, S.W., 1994. Biomedical Polymers: Designed-to-Degrade Systems, Hanser Publishers, New York. 
Vert, M., Feijen, J., Albertsson, A., Scott, G. and Chiellini, E., 1992. Biodegradable Polymers and Plastics, 

Royal Society of Chemistry, Cambridge, England. 

The review by Barrows is brief with an emphasis on their applications with an extensive lists of patents. 
The book and chapter by Chu et al. focuses on the most successful use of biodegradable polymers in 
medicine, namely wound closure biomaterials like sutures. It is so far the most comprehensive review of 
all aspects of biodegradable wound closure biomaterials with very detailed chemical, physical, mechanical, 
biodegradable, and biological information. The chapter by Kimura is an overview of the subject with 
some interesting new polymers. The chapter includes both enzymatically degradable natural polymers 
and non-enzymatically degradable synthetic polymers. The biodegradable hydrogel book by Park et al. 
is the only book available that focuses on hydrogel. Probably the most broad coverage of biodegradable 
polymeric biomaterials is the very recent book edited by Shalaby. It has 8 chapters and covers almost all 
commercially and experimentally available biodegradable polymers. The book edited by Vert et al. is 
based on the Proceedings of the 2nd International Scientific Workshop on Biodegradable Polymers and 
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Plastics held in Montpellier, France in November 1991. The book covers both medical and non-medical 
applications of biodegradable polymers. It has broader coverage of biodegradable polymers with far more 
chapters than Shalaby's book, but its chapters are shorter and less comprehensive than Shalaby's book 
which is far more focused on biomedical use. 

Topics of biodegradable polymeric biomaterials can also be found in /. of Biomedical Materials Research, 
J. of Applied Biomaterials, Biomaterials, J. of Biomaterials Science: Polymer Ed., J. of Applied Polymer Science, 
J. of Materials Science, and /. Polymer Science. 
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42.1 Structure and Properties of Collagen 
and Collagen-Rich Tissues 

Structure of Collagen 

Collagen is a multifunctional family of proteins of unique structural characteristics. It is the most 
abundant and ubiquitous protein in the body, its functions ranging from serving crucial biomechanical 
functions in bone, skin, tendon, and ligament to controlling cellular gene expressions in development 
[Nimni & Harkness, 1988] . Collagen molecules like all proteins are formed in vivo by enzymatic regulated 
step-wise polymerization reaction between amino and carboxyl groups of amino acids, where R is a side 
group of an amino acid residue. 

O H H 

(-C-N-C-) n (42.1) 

R 



The simplest amino acid is glycine (Gly) (R = H), where a hypothetical flat sheet organization of 
polyglycine molecules can form and be stabilized by intermolecular hydrogen bonds (Fig. 42.1a). How- 
ever, when R is a large group as in most other amino acids, the stereochemical constraints frequently 
force the polypeptide chain to adapt a less constraining conformation by rotating the bulky R groups 
away from the crowded interactions, forming a helix, where the large R groups are directed toward the 
surface of the helix (Fig. 42.1 b). The hydrogen bonds are allowed to form within a helix between the 
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FIGURE 42.1 (a) Hypothetical flat sheet structure of a protein, (fc) Helical arrangement of a protein chain. 

hydrogen attached to nitrogen in one amino acid residue and the oxygen attached to a second amino 
acid residue. Thus, the final conformation of a protein, which is directly related to its function, is governed 
primarily by the amino acid sequence of the particular protein. 

Collagen is a protein comprised of three polypeptides (a chains), each having a general amino acid 
sequence of (-Gly-X-Y-) n , where Xis any other amino acid and is frequently proline (Pro) and Yis any 
other amino acid and is frequently hydroxyproline (Hyp). A typical amino acid composition of collagen 
is shown in Table 42.1. The application of helical diffraction theory to high-angle collagen x-ray diffrac- 
tion pattern [Rich and Crick, 1961] and the stereochemical constraints from the unusual amino acid 
composition [Eastoe, 1967] led to the initial triple-helical model and subsequent modified triple helix 
of the collagen molecule. Thus, collagen can be broadly defined as a protein which has a typical triple 
helix extending over the major part of the molecule. Within the triple helix, glycine must be present as 
every third amino acid, and proline and hydroxyproline are required to form and stabilize the triple helix. 

To date, 19 proteins can be classified as collagen [Fukai et al., 1994]. Among the various collagens, 
type I collagen is the most abundant and is the major constituent of bone, skin, ligament, and tendon. 
Due to the abundance and ready accessibility of these tissues, they have been frequently used as a source 
for the preparation of collagen. This chapter will not review the details of the structure of the different 
collagens. The readers are referred to recent reviews for a more in-depth discussion of this subject [Nimni, 
1988; van der Rest et al., 1990; Fukai et al., 1994; Brodsky and Ramshaw, 1997] . It is, however, of particular 
relevance to review some salient structural features of the type I collagen in order to facilitate the 
subsequent discussions of properties and its relation to biomedical applications. 

TABLE 42. 1 Amino Acid Content of Collagen 



Amino Acids 


Content, 


residues/1000 residues* 


Gly 




334 


Pro 




122 


Hyp 




96 


Acid polar (Asp, Glu, Asn) 




124 


Basic polar (Lys, Arg, His) 




91 


Other 




233 



* Reported values are average values of 10 different determi- 
nations for tendon tissue. 
Source: Eastoe [1967]. 
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FIGURE 42.2 Diagram depicting the formation of collagen, which can be visualized as taking place in several steps: 
(a) single chain left-handed helix; (b) three single chains intertwined into a triple stranded helix; (c) a collagen 
(tropocollagen) molecule; (d) collagen molecules aligned in D staggered fashion in a fibril producing overlap and 
hole regions. 

A type I collagen molecule (also referred to as tropocollagen) isolated from various tissues has a 
molecular weight of about 283,000 daltons. It is comprised of three left-handed helical polypeptide chains 
(Figure 42.2a) which are intertwined forming a right-handed helix around a central molecular axis 
(Figure 42. 2b). Two of the polypeptide chains are identical (a x ) having 1056 amino acid residues, and 
the third polypeptide chain (a 2 ) has 1029 amino acid residues [Miller, 1984]. The triple-helical structure 
has a rise per residue of 0.286 nm and a unit twist of 108°, with 10 residues in three turns and a helical 
pitch (repeating distance within a single chain) of 30 residues or 8.68 nm [Fraser et al., 1983]. Over 95% 
of the amino acids have the sequence of G\y-X-Y. The remaining 5% of the molecule does not have the 
sequence of Gly-X-Y and is therefore not triple-helical. These nonhelical portions of the molecule are 
located at the N- and C-terminal ends and are referred to as telopeptides (9-26 residues) [Miller, 1984]. 
The whole molecule has a length of about 280 nm and a diameter of about 1.5 nm and has a conformation 
similar to a rigid rod (Fig. 42.2c). 

The triple-helical structure of a collagen molecule is stabilized by several factors (Fig. 42.3): (1) a tight 
fit of the amino acids within the triple-helix — this geometrical stabilization factor can be appreciated 
from a space-filling model constructed from a triple helix with (Gly-Pro-Hyp) sequence (Fig. 42.3); 
(2) the interchain hydrogen bond formation between the backbone carbonyl and amino hydrogen inter- 
actions; and (3) the contribution of water molecules to the interchain hydrogen bond formation. 

The telopeptides are regions where intermolecular crosslinks are formed in vivo. A common intermo- 
lecular crosslinks is formed between an allysine (the e-amino group of lysine or hydroxy-lysine has been 
converted to an aldehyde) of one telopeptide of one molecule and an e-amino group of a lysine or 
hydroxylysine in the triple helix or a second molecule (42.2). Thus the method commonly used to 
solubilize the collagen molecules from crosslinked fibrils with proteolytic enzymes such as pepsin removes 
the telopeptides (cleaves the intermolecular crosslinks) from the collagen molecule. The pepsin solubilized 
collagen is occasionally referred to as atelocollagen [Stenzl, 1974]. 



S) 2000 by CRC Press LLC 




FIGURE 42.3 A space-filling model of the collagen triple helix, showing all the atoms in a ten-residue segment of 
repeating triplet sequence (Gly-Pro-Hyp) n . The arrow shows an interchain hydrogen bond. The arrow heads identify 
the hydroxy groups of hydroxyproline in one chain. The circle shows a hydrogen-bonded water molecule. The short 
white lines identify the ridge of amino acid chains. The short black lines indicate the supercoil of one chain [Piez, 1 984 ] . 

OH 
Pr-CH 2 -CH 2 -CH 2 -CHO + H 2 N-CH 2 -CH-CH 2 -CH 2 -Pr 



Allysine 



Hydroxylysine 



(42.2) 



OH 

i 



-> Pr-CH 2 -CH 2 -CH 2 -CH=N-CH 2 -CH-CH 2 -CH 2 -Pr 
Dehydrohydroxylysinonorleucine 

Since the presence of hydroxyproline is unique in collagen {elastin contains a small amount), the deter- 
mination of collagen content in a collagen-rich tissue is readily done by assaying the hydroxyproline content. 

Collagen does not appear to exist as isolated molecules in the extracellular space in the body. Instead, 
collagen molecules aggregate into fibrils. Depending on the tissue and age, a collagen fibril varies from 
about 50 nm to about 300 nm in diameter with indeterminate length and can be easily seen under 
electron microscopy (Fig. 42.4). The fibrils are important structural building units for large fibers 
(Fig. 42.5). Collagen molecules are arranged in specific orders both longitudinally and in cross-section, 
and the organization of collagen molecules in a fibril is tissue-specific [Katz and Li, 1972; 1973fo]. The 
two-dimensional structure (the projection of a three-dimensional structure onto a two-dimensional 
plane) of a type I collagen fibril has been unequivocally defined both by an analysis of small-angle x-ray 
diffraction pattern along the meridian of a collagenous tissue [Bear, 1944] and by examination of the 
transmission electron micrographs of tissues stained with negative or positive stains [Hodge & Petruska, 
1963]. In this structure (Fig. 42.2d), the collagen molecules are staggered with respect to one another by 
a distance of D (64 nm to 67 nm) or multiple of D, where D is the fundamental repeat distance seen in 
the small-angle x-ray diffraction pattern, or the repeating distance seen in the electron micrographs. 
Since a collagen molecule has a length of about 4.4D, this staggering of collagen molecules creates overlap 
regions of about 0.4D and hole or defect regions of about 0.6D. 
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(c) 



FIGURE 42.4 (a) Scanning electron micrograph of the surface of an adult rabbit bone matrix, showing how the 
collagen fibrils branch and interconnect in an intricate, woven pattern (x4800) [Tiffit, 1980]. (b) Transmission 
electron micrographs of (x24,000) parallel collagen fibrils in tendon [Fung, 1992]. (c) Transmission electron micro- 
graphs of (x24,000) mesh work of fibrils in skin [Fung, 1993], 
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FIGURE 42.5 Diagram showing the collagen fibers of the connective tissue in general which are composed of unit 
collagen fibrils. 

One interesting and important structural aspect of collagen is its approximate equal number of acidic 
(aspartic and glutamic acids) and basic (lysines and arginines) side groups. Since these groups are charged 
under physiological conditions, the collagen is essentially electrically neutral [Li and Katz, 1976]. The 
packing of collagen molecules with a D staggering results in clusters of regions where the charged groups 
are located [Hofmann and Kuhn, 1981]. These groups therefore are in close proximity to form intra- 
and intermolecular hydrogen-bonded salt-linkages of the form (Pr-COO~ + H,N-Pr) [Li et al., 1975]. In 
addition, the side groups of many amino acids are nonpolar [alanine (Ala), valine (Val), leucine (Leu), 
isoleucine (He), proline (Pro), and phenolalanine (Phe)] in character and hence hydrophobic; therefore, 
chains with these amino acids avoid contact with water molecules and seek interactions with the nonpolar 
chains of amino acids. In fact, the result of molecular packing of collagen in a fibril is such that the 
nonpolar groups are also clustered, forming hydrophobic regions within collagen fibrils [Hofmann and 
Kuhn, 1981]. Indeed, the packing of the collagen molecules in various tissues is believed to be a result 
of intermolecular interactions involving both the electrostatic and hydrophobic interactions [Hofmann 
and Kuhn, 1981; Katz and Li, 1981; Li et al, 1975]. 

The three-dimensional organization of type I collagen molecules within a fibril has been the subject 
of extensive research over the past 40 years [Fraser et al., 1983; Katz and Li, 1972, 1973a, 1973&, 1981; 
Miller, 1976; Ramachandran, 1967; Yamuchi et al., 1986]. Many structural models have been proposed 
based on an analysis of equatorial and off-equatorial x-ray diffraction patterns of rat-tail-tendon collagen 
[Miller, 1976; North et al., 1954], intrafibrillar volume determination of various collagenous tissues [Katz 
and Li, 1972, 1973a, 1973fo], intermolecular side chain interactions [Hofmann and Kuhn, 1981; Katz and 
Li, 1981; Li et al., 1981], and intermolecular crosslinking patterns studies [Yamuchi et al., 1986]. The 
general understanding of the three-dimensional molecular packing in type I collagen fibrils is that the 
collagen molecules are arranged in hexagonal or near hexagonal arrays [Katz and Li, 1972, 1981; Miller, 
1976]. Depending on the tissue, the intermolecular distance varies from about 0.15 nm in rat tail tendon 
to as large as 0.18 nm in bone and dentin [Katz and Li, 1973fo]. The axial staggering of the molecules by 
1~4D with respect to one another is tissue-specific and has not yet been fully elucidated. 
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TABLE 42.2 Composition of Collagen-Rich Soft Tissues 

Component Composition, % 

Collagen 75 (dry), 30 (wet) 

Proteoglycans and polysaccharides 20 (dry) 

Elastin and glycoproteins < 5 (dry) 
Water 60-70 

Source: Park and Lakes [1992]. 

There are very few interspecies differences in the structure of type I collagen molecule. The extensive 
homology of the structure of type I collagen may explain why this collagen obtained from animal species 
is acceptable as a material for human implantation. 

Properties of Collagen-Rich Tissue 

The function of collagenous tissue is related to its structure and properties. This section reviews some 
important properties of collagen-rich tissues. 

Physical and Biomechanical Properties 

The physical properties of tissues vary according to the amount and structural variations of the collagen 
fibers. In general, a collagen-rich tissue contains about 75-90% of collagen on a dry weight basis. 
Table 42.2 is a typical composition of a collagen-rich soft tissue such as skin. Collagen fibers (bundles of 
collagen fibrils) are arranged in different configurations in different tissues for their respective functions 
at specific anatomic sites. For example, collagen fibers are arranged in parallel in tendon [Fig. 42.4fe] and 
ligament for their high-tensile strength requirements, whereas collagen fibers in skin are arranged in 
random arrays [Fig. 42.4c] to provide the resiliency of the tissue under stress. Other structure-supporting 
functions of collagen such as transparency for the lens of the eye and shaping of the ear or tip of the 
nose can also be provided by the collagen fiber. Thus, an important physical property of collagen is the 
three-dimensional organization of the collagen fibers. 

The collagen-rich tissues can be thought of as a composite polymeric material in which the highly 
oriented crystalline collagen fibrils are embedded in the amorphous ground substance of noncollagenous 
polysaccharides, glycoproteins, and elastin. When the tissue is heated, its specific volume increases, exhib- 
iting a glass transition at about 40°C and a melting of the crystalline collagen fibrils at about 56°C. The 
melting temperature of crystalline collagen fibrils is referred to as the denaturation temperature of 
collagenous tissues. 

The stress-strain curves of a collagenous tissue such as tendon exhibit nonlinear behavior (Fig. 42.6). 
This nonlinear behavior of stress-strain of tendon collagen is similar to that observed in synthetic fibers. 
The initial toe region represents alignment of fibers in the direction of stress. The steep rise in slope 
represents the majority of fibers stretched along their long axes. The decrease in slope following the steep 
rise may represent the breaking of individual fibers prior to the final catastrophic failure. Table 42.3 
summarizes some mechanical properties of collagen and elastic fibers. The difference in biomechanical 
properties between collagen and elastin is a good example of the requirements for these proteins to serve 
their specific functions in the body. 

Unlike tendon or ligament, skin consists of collagen fibers randomly arranged in layers or lamellae. 
Thus skin tissues show mechanics anisotropy (Fig. 42.7). Another feature of the stress-strain curve of the 
skin is its extensibility under small load as compared to tendon. At small load the fibers are straightened 
and aligned rather than stretched. Upon further stretching the fibrous lamellae align with respect to each 
other and resist further extension. When the skin is highly stretched the modulus of elasticity approaches 
that of tendon as expected of the aligned collagen fibers. 

Cartilage is another collagen-rich tissue which has two main physiological functions. One is the 
maintenance of shape (ear, tip of nose, and rings around the trachea), and the other is to provide bearing 
surfaces at joints. It contains very large and diffuse proteoglycan (protein-polysaccharide) molecules 

(D 2000 by CRC Press LLC 



STRESS 
(MPa) 




-i 1 1 1 1 r 

2 4 6 8 10 12 

STRAIN (%) 

FIGURE 42.6 A typical stress-strain curve for tendon [Rigby et al., 1959]. 



TABLE 42.3 Elastic Properties of Collagen and 
Elastic Fibers 

Modulus of Tensile Ultimate 

Elasticity, Strength, Elongation, 

Fibers MPa MPa % 

Collagen 1000 50-100 10 

Elastin 0.6 1 100 

Source: Park and Lakes [1992], 
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FIGURE 42.7 Stress-strain curves of human abdominal skin [Daly, 1966]. 



which form a gel in which the collagen-rich molecules entangled. They can affect the mechanical 
properties of the collagen by hindering the movements through the interstices of the collagenous matrix 
network. 
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TABLE 42.4 Mechanical Properties of Some 
Nonmineralized Human Tissues 





Tensile 


Ultimate 


Tissues 


Strength, MPa 


Elongation, % 


Skin 


7.6 


78.0 


Tendon 


53.0 


9.4 


Elastic cartilage 


3.0 


30.0 


Heart valves (aortic) 






Radial 


0.45 


15.3 


Circumferential 


2.6 


10.0 


Aorta 






Transverse 


1.1 


77.0 


Longitudinal 


0.07 


81.0 



Source: Park and Lakes [1992]. 

The joint cartilage has a very low coefficient of friction (<0.0 1 ) . This is largely attributed to the squeeze- 
film effect between cartilage and synovial fluid. The synovial fluid can be squeezed out through highly 
fenestrated cartilage upon compressive loading, and the reverse action will take place in tension. The 
lubricating function is carried out in conjunction with glycosaminoglycans (GAG), especially chondroitin 
sulfates. The modulus of elasticity (10.3-20.7 MPa) and tensile strength (3.4 MPa) are quite low. However, 
wherever high stress is required the cartilage is replaced by purely collagenous tissue. Mechanical prop- 
erties of some collagen-rich tissues are given in Table 42.4 as a reference. 

Physiochemical Properties 

Electrostatic Properties. A collagen molecule has a total of approximately 240 e-amino and guanidino 
groups of lysines, hydroxylysines, and arginines and 230 carboxyl groups of aspartic and glutamic acids. 
These groups are charged under physiological conditions. In a native fibril, most of these groups interact 
either intra- or intermolecularly forming salt-linkages providing significant stabilization energy to the 
collagen fibril [Li et al., 1975] . Only a small number of charged groups are free. However, the electrostatic 
state within a collagen fibril can be altered by changing the pH of the environment. Since the pK for an 
amino group is about 10 and about 4 for a carboxyl group, the electrostatic interactions are significantly 
perturbed at a pH below 4 and above 10. The net result of the pH change is a weakening of the intra- 
and intermolecular electrostatic interactions, resulting in a swelling of the fibrils. The fibril swelling can 
be prevented by chemically introducing covalent intermolecular crosslinks. Any bifunctional reagent 
which reacts with amino, carboxyl, and hydroxyl groups can serve as a crosslinking agent. The introduc- 
tion of covalent intermolecular crosslinks fixes the physical state of the fibrillar structure and balances 
the swelling pressures obtained from any pH changes. 

Another way of altering the electrostatic state of a collagen fibril is by chemically modifying the 
electrostatic side groups. For example, the positively charged s-amino groups of lysine and hydroxylysine 
can be chemically modified with acetic anhydride, which converts the £-amino groups to a neutral acetyl 
group [Green et al., 1954]. The result of this modification increases the number of the net negative 
charges of the fibril. Conversely, the negatively charged carboxyl groups of aspartic and glutamic acid 
can be chemically modified to a neutral group by methylation [Fraenkel-Conrat, 1944] . Thus, by adjusting 
the pH of the solution and applying chemical modification methods, a range of electrostatic properties 
of collagen can be obtained. 

Ion and Macromolecular Binding Properties. In the native state and under physiological conditions, a 
collagen molecule has only about 60 free carboxyl groups [Li et al., 1975] . These groups have the capability 
of binding cations such as calcium with a free energy of formation for the protein-COO-Ca ++ of about 
1.2 Kcal/mol. This energy is not large enough to compete for the hydrogen bonded salt-linkage interac- 
tions, which have a free energy of formation of about -1.6 Kcal/mol. The extent of ion binding, however, 
can be enhanced in the presence of lyotropic salts such as KCNS, which breaks the salt-linkages, or by 
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shifting the pH away from the isoelectric point of collagen. Macromolecules can bind to collagen via 
covalent bonding, cooperative ionic binding, entrapment, entanglement, and a combination of the above. 
In addition, binding of charged ions and macromolecules can be significantly increased by modifying 
the charge profile of collagen as described previously. For example, a complete N-acetylation of collagen 
will eliminate all the positively charged s-amino groups and, thus, will increase the free negatively charged 
groups. The resulting acetylated collagen enhances the binding of positively charged ions and macro- 
molecules. On the other hand, the methylation of collagen will eliminate the negatively charged carboxyl 
groups and, thus, will increase the free positively charge moieties. The methylated collagen, therefore, 
enhances the binding of negatively charged ions and macromolecules [Li and Katz, 1976]. 

Fiber-Forming Properties. Native collagen molecules are organized in tissues in specific orders. Polymor- 
phic forms of collagen can be reconstituted from the collagen molecules, obtained either from enzymatic 
digestion of collagenous tissues or by extracting the tissues with salt solutions. The formation of poly- 
morphic aggregates of collagen depends on the environment for reconstitution [Piez, 1984]. Native 
arrangement of the collagen molecules is formed under physiological conditions. Various polymorphic 
molecular aggregates may be formed by changing the state of intermolecular interactions. For example, 
when collagen molecules are aggregated under high concentrations of a neutral salt or under nonaqueous 
conditions, the collagen molecules associate into random arrays having no specific regularities detectable 
by electron microscopy. The collagen molecules can be induced to aggregate into other polymorphic 
forms such as the segment-long-spacing (SLS) form where all heads are aligned in parallel and the fibrous- 
long-spacing (FLS) form where the molecules are randomly aligned in either a head-to-tail, tail-to-tail, 
or head-to-head orientation. 

Biologic Properties 

Hemostatic Properties. Native collagen aggregates are intrinsically hemostatic. The mechanism of col- 
lagen-induced hemostasis has been the subject of numerous investigations [Jaffe and Deykin, 1974; Wang 
et al., 1978; Wilner et al., 1968]. The general conclusion from these studies is that platelets first adhere 
to a collagen surface. This induces the release of platelet contents, followed by platelet aggregation, leading 
to the eventual hemostatic plug. The hemostatic activity of collagen is dependent on the size of the 
collagen aggregate and the native organization of the molecules [Wang et al., 1978]. Denatured collagen 
(gelatin) is not effective in inducing hemostasis [Jonas et al., 1988]. 

Cell Interaction Properties. Collagen forms the essential framework of the tissues and organs. Many cells, 
such as epithelial and endothelial cells, are found resting on the collagenous surfaces or within a collag- 
enous matrix such as that of many connective tissue cells. Collagen-cell interactions are essential features 
during the development stage and during wound healing and tissue remodeling in adults [Kleinman 
et al., 1981; Linbold and Kormos, 1991]. Studying collagen-cell interactions is useful in developing 
simulated tissue and organ structures and in investigating cell behavior in the in vivo simulated systems. 
Numerous studies have aimed at developing viable tissues and organs in vitro for transplantation appli- 
cations [Bell etal, 1981; Montesano et al., 1983; Silbermann, 1990; Bellamkonda and Aebischer, 1994; 
Hubbel, 1995; Moghe et al, 1996; Sittinger et al, 1996]. 

Immunologic Properties. Soluble collagen has long been known to be a poor immunogen [Timpl, 1982] . 
A significant level of antibodies cannot be raised without the use of Freund's complete adjuvant (a mixture 
of mineral oil and heat-killed mycobacteria) which augments antibody response. It is known that insoluble 
collagen is even less immunogenic [Stenzel et al., 1974]. Thus, xenogeneic collagenous tissue devices such 
as porcine and bovine pericardial heart valves are acceptable for long-term implantation in humans. The 
reasons for the low antibody response against collagen are not known. It may be related to the homology 
of the collagen structure from different species (low level of foreignness) or to certain structural features 
associated with collagen [Timpl, 1982]. 
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42.2 Biotechnology of Collagen 
Isolation and Purification of Collagen 

There are two distinct ways of isolating and purifying collagen material. One is the molecular technology 
and the other is the fibrillar technology. These two technologies are briefly reviewed here. 

Isolation and Purification of Soluble Collagen Molecules 

The isolation and purification of soluble collagen molecules from a collagenous tissue is achieved by 
using a proteolytic enzyme such as pepsin to cleave the telopeptides [Miller and Rhodes, 1982]. Since 
telopeptides are the natural crosslinking sites of collagen, the removal of telopeptides renders the collagen 
molecules and small collagen aggregates soluble in an aqueous solution. The pepsin-solubilized collagen 
can be purified by repetitive precipitation with a neutral salt. Pepsin-solubilized collagen in monomeric 
form is generally soluble in a buffer solution at low temperature. The collagen molecules may be 
reconstituted into fibrils of various polymorphisms. However, the reconstitution of the pepsin-solubilized 
collagen into fibrils of native molecular packing is not as efficient as the intact molecules, since the 
telopeptides facilitate fibril formation [Comper and Veis, 1977]. 

Isolation and Purification of Fibrillar Collagen 

The isolation and purification of collagen fibers relies on the removal of noncollagenous materials from 
the collagenous tissue. Salt extraction removes the newly synthesized collagen molecules that have not 
been covalently incorporated into the collagen fibrils. Salt also removes the noncollagenous materials 
that are soluble in aqueous conditions and are bound to collagen fibrils by nonspecific interactions. Lipids 
are removed by low-molecular- weight organic solvents such as low-molecular-weight ethers and alcohols. 
Acid extraction facilities the removal of acidic proteins and glycosaminoglycans due to weakening of the 
interactions between the acidic proteins and collagen fibrils. Alkaline extraction weakens the interaction 
between the basic proteins and collagen fibrils and thus facilitates the removal of basic proteins. In 
addition, various enzymes other than collagenase can be used to facilitate the removal of the small 
amounts of glycoproteins, proteoglycans, and elastins from the tissue. Purified collagen fibers can be 
obtained through these sequential extractions and enzymatic digestions from the collagen-rich tissues. 

Matrix Fabrication Technology 

The purified collagen materials obtained from either the molecular technology or from the fibrillar 
technology are subjected to additional processing to fabricate the materials into useful devices for specific 
medical applications. The different matrices and their medical applications are summarized in Table 42.5. 
The technology in fabricating these matrices are briefly outlined below. 

TABLE 42.5 Summary of Different Collagen Matrices and Their Medical Applications 

Matrix Form Medical Application 

Membrane (film, sheet) Oral tissue repair; wound dressings; dura repair; patches 

Porous (sponge, felt, fibers) Hemostats; wound dressings; cartilage repair; soft-tissue augmentation 

Gel Drug and biologically active macromolecule delivery; soft- and hard-tissue augmentation 

Solution Soft-tissue augmentation; drug delivery 

Filament Tendon and ligament repair; sutures 

Tubular (membrane, sponge) Nerve repair; vascular repair 

Composite 

Collagen/synthetic polymer Vascular repair; skin repair; wound dressings 

Collagen/biological polymer Soft-tissue augmentation; skin repair 

Collagen/ceramic Hard-tissue repair 



S) 2000 by CRC Press LLC 



Membranous Matrix 

Collagen membranes can be produced by drying a collagen solution or a fibrillar collagen dispersion cast 
on a nonadhesive surface. The thickness of the membrane is governed by the concentration and the 
initial thickness of the cast solution or dispersion. In general, membrane thickness of up to 0.5 mm can 
be easily obtained by air drying a cast collagen material. Additional chemical crosslinking is required to 
stabilize the membrane from dissolution or dissociation. The membrane produced by casting and air 
drying does not permit manipulation of the pore structure. Generally, the structure of a cast membrane 
is dense and amorphous with minimal permeability to macromolecules [Li et al., 1991]. Porous mem- 
branes may be obtained by freeze-drying a cast solution or dispersion of a predetermined density or by 
partially compressing a preformed porous matrix to a predetermined density and pore structure. 

Porous Matrix 

Porous collagen matrices are generally obtained by freeze-drying an aqueous volume of collagen solution 
or dispersion. The freeze-dried porous matrix requires chemical crosslinking to stabilize the structure. 
A convenient way to stabilize the porous matrix is to crosslink the matrix by vapor using a volatile 
crosslinking agent such as formaldehyde or glutaraldehyde. The pore structure of the matrix depends, 
to a large extent, on the concentration of the collagen in the solution or dispersion. Other factors that 
contribute to the pore structure include the rate of freezing, the size of fibers in the dispersion, and the 
presence and absence of other macromolecules. Apparent densities from 0.05 to 0.3 gram matrix per cubic 
centimeter matrix volume can be obtained. These porous matrices generally have pores from about 50 
|jm to as large as 1500 um. 

Gel Matrix 

A gel matrix may be defined as a homogeneous phase between a liquid and a solid. As such, a gel may 
vary from a simple viscous fluid to a highly concentrated puttylike material. Collagen gels may be formed 
by shifting the pH of a dispersion away from its isoelectric point. Alternatively, the collagen material may 
be subjected to a chemical modification procedure to change its charge profile to a net positively charged 
or a net negatively charged protein before hydrating the material to form a gel matrix. For example, 
native fibers dispersed in water at pH 7 will be in the form of two phases. The dispersed fibers become 
gel when the pH changes from 7 to 3. Succinylating the primary amino groups of collagen, which converts 
the positively charged amino groups to negatively charged carboxyl groups, changes the isoelectric point 
of collagen from about 7 to about 4.5. Such a collagen material swells to a gel at a pH of 7. 

Solution Matrix 

A collagen solution is obtained by dissolving the collagen molecules in an aqueous solution. Collagen 
molecules are obtained by digesting the insoluble tissue with pepsin to cleave the crosslinking sites of 
collagen (telopeptides) as previously described. The solubility of collagen depends on the pH, the tem- 
perature, the ionic strength of the solution, and the molecular weight. Generally, collagen is more soluble 
in the cold. Collagen molecules aggregate into fibrils when the temperature of the solution increases to 
the body temperature. pH plays an important role in solubilizing collagen. Collagen is more soluble at 
a pH away from the isoelectric point of the protein. Collagen is less soluble at higher ionic strength of a 
solution. The solubility of collagen decreases with increasing the size of molecular aggregates. Thus, 
collagen becomes increasingly less soluble with increasing the extent of crosslinking [Bailey et al., 1970] . 

Filamentous Matrix 

Collagen filaments can be produced by extrusion techniques [Kemp et al., 1995; Li and Stone, 1993; 
Schimpf and Rodriquez, 1976). A collagen solution or dispersion having a concentration in the range of 
0.5-1.5% (w/v) is first prepared. Collagen is extruded into a coacervation bath containing a high con- 
centration of a salt or into an aqueous solution at a pH of the isoelectric point of the collagen. Tensile 
strength of 30 MPa has been obtained for the reconstituted filaments. 
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Tubular Matrix 

Tubular matrices may be formed by extrusion through a coaxial cylinder [Stenzl et al., 1974], or by 
coating collagen onto a mandrel [Li, 1990]. Different properties of the tubular membranes can be 
obtained by controlling the drying properties. 

Composite Matrix 

Collagen can form a variety of homogeneous composites with other water-soluble materials. Ions, 
peptides, proteins, and polysaccharides can all be uniformly incorporated into a collagen matrix. The 
methods of homogeneous composite formation include ionic and covalent bonding, entrapment, entan- 
glement, and coprecipitation. A heterogeneous composite can be formed between collagen, ceramics, 
and synthetic polymers that have distinct properties for medical applications [Li, 1988]. 

42.3 Design of a Resorbable Collagen-Based 
Medical Implant 

Designing a medical implant for tissue or organ repair requires a thorough understanding of the structure 
and function of the tissue and organ to be repaired, the structure and properties of the materials used 
for repair, and the design requirements. There are at present two schools of thought regarding the design 
of an implant, namely the permanent implant and the resorbable implant. The permanent implants are 
intended to permanently replace the damaged tissues or organs are fabricated from various materials 
including metals and natural or synthetic polymers. For example, most of the weight-bearing orthopedic 
and oral implants are made of metals or alloys. Non-weight-bearing tissues and organs are generally 
replaced with implants that are fabricated either from synthetic or natural materials. Implants for blood 
vessel, heart valve, and most soft tissue repair fall into this class. Permanent implants, particularly those 
made of synthetic and biological materials, frequently suffer from the long-term effects of material 
degradation. Material degradation can result from biological processes such as enzymatic degradation 
or environmentally induced degradation from mechanical, metal-catalyzed oxidation, and from the 
permeation of body fluids into the polymeric devices [Bruck, 1991]. The material degradation is partic- 
ularly manifested in applications where there is repetitive stress-strain on the implant, such as artificial 
blood vessels and heart valves. 

As a result of the lack of suitable materials for long-term implantation, the concept of using a resorbable 
template to guide host tissue regeneration (guided tissue regeneration) has received vigorous attention 
in recent years. This area of research can be categorized into synthetic and biological templates. Polyglycolic 
acid (PGA), polylactic acid (PLA), polyglycolic-polylactic acid copolymers, and polydioxanone are among 
the polymers most selected for resorbable medical implant development. Among the biological materials 
used for resorbable medical implant development, collagen has been one of the most popular materials 
in this category. Collagen-based templates have been developed for skin [Yannas and Burke, 1981], 
peripheral nerve [Li et al., 1990; Yannas et al., 1985], oral tissue [Altman and Li, 1990; Blumenthal, 1988], 
and meniscal regeneration [Li et al., 1994; Stone et al., 1997] . A variety of other collagen based templates 
are being developed for tissue repair and regeneration applications [Goldstein et al., 1989; Ma et al., 1990; 
Li, etal, 1997]. 

The following discussion is useful in designing a template for tissue repair and regeneration applications. 
By way of an example, the design parameters listed below are specifically applied to the development of 
a resorbable collagen based template for guiding meniscal tissue repair and regeneration in the knee joint. 

Menisci are semilunar fibrocartilages that are anatomically located between the femoral condyles and 
tibial plateau, providing stability, weight bearing, shock absorption and assisting in lubrication of the 
knee joint. A major portion of the meniscal tissue is avascular except the peripheral rim, which comprises 
about 10-30% of the total width of the structure and which is nourished by the peripheral vasculature 
[Arnoczky and Warren, 1982]. Collagen is the major matrix material of the meniscus, and the fibers are 
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oriented primarily in the circumferential direction in the line of stress for mechanical function. Repair 
of damaged meniscal tissue in the peripheral vascular rim can be accomplished with sutures. However, 
in cases where the injured site is in the avascular region, partial or total removal of the meniscal tissue 
is often indicated. This is primarily due to the inadequacy of the fibrochondrocytes alone to self-repair 
the damaged meniscal tissue. Studies in animals and humans have shown that removal of the meniscus 
is a prelude to degenerative knees manifested by the development of osteoarthritis [Hede and Sarberg, 
1992; Shapiro and Glimcher, 1980]. At present there is no suitable permanent substitute for meniscal 
tissue. 

Biocompatibility 

Biocompatibility of the materials and their degraded products is a prerequisite for resorbable implant 
development. Purified collagen materials have been used either as implants or have been extensively 
tested in clinical studies as implants without adverse effects. The meniscus template can be fabricated 
from purified type I collagen fibers that are further crosslinked chemically to increase the stability and 
reduce the immunogenicity in vivo. In addition, small amounts of noncollagenous materials such as 
glycosaminoglycans and growth factors can be incorporated into the collagen matrix to improve the 
osmotic properties as well as the rate of tissue ingrowth. 

Since the primary structure of a collagen molecule from bovine is homologous to human collagen 
[Miller, 1984], the in vivo degradation of bovine collagen implant should be similar to the normal host 
tissue remodeling process during wound healing. For a resorbable collagen template, the matrix is slowly 
degraded by the host over time. It is known that a number of cell types such as polymorphonuclear 
leukocytes, fibroblasts, and macrophages, during the wound healing period, are capable of secreting enzyme 
collagenases which cleave a collagen molecule at 1/4 position from the C-terminal end of the molecule 
[Woolley, 1984]. The enzyme first reduces a collagen molecule to two smaller triple helices which are 
not stable at body temperature and are subsequently denatured to random coiled polypeptides. These 
polypeptides are further degraded by proteases into amino acids and short peptides that are metabolized 
through normal metabolic pathways [Nimni and Harkness, 1988]. 

Despite the safety record of collagen materials for implantation, during the process of preparing the 
collagen template, small amounts of unwanted noncollagenous materials could be incorporated into the 
device such as salts and crosslinking agents. Therefore, a series of biocompatibility testing must be 
conducted to ensure the residuals of these materials do not cause any safety issues. The FDA has published 
a new guideline for biocompatibility testing of implantable devices [Biological Evaluation of Medical 
Devices, 1995]. 

Physical Dimension 

The physical dimension of a template defines the boundary of regeneration. Thus, the size of the collagen 
template should match the tissue defect to be repaired. A properly sized meniscal substitute has been 
found to function better than a substitute which mismatches the physical dimension of the host meniscus 
[Rodkey et al., 1998; Sommerlafh et al., 1991]. For a porous, elastic matrix such as the one designed from 
collagen for meniscal tissue repair, the shape of the meniscus is further defined in vivo by the space 
available between the femoral condyles and tibial plateau within the synovial joint. 

Apparent Density 

The apparent density as defined as the weight of the dry matrix in a unit volume of matrix. Thus, the 
apparent density is a direct measure of the empty space which is not occupied by the matrix material 
per se in the dry state. For example, for a collagen matrix of an apparent density 0.2 g/cm 3 , the empty 
space would be 0.86 cm 3 for a 1 cm 3 total space occupied by the matrix, taking the density of collagen 
to be 1.41 g/cm 3 [Noda, 1992]. The apparent density is also directly related to the mechanical strength 
of a matrix. In weight-bearing applications, the apparent density has to be optimized such that the 
mechanical properties are not compromised for the intended function of the resorbable implant as 
described in the mechanical properties section. 
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Pore Structure 

The dimension of a mammalian fibrogenic cell body is on the order of 10-50 urn, depending on the 
substrate to which the cell adheres [Folkman et al., 1978]. In order for cells to infiltrate into the interstitial 
space of a matrix, the majority of the pores must be significantly larger than the dimension of a cell such 
that both the cell and its cellular processes can easily enter the interstitial space. In a number of studies 
using collagen-based matrices for tissue regeneration, it has been found that pore size plays an important 
role in the effectiveness of the collagen matrix to induce host tissue regeneration [Chvapil, 1982; Dagalailis 
et al., 1980; Yannas, 1996]. It was suggested that pore size in the range of 100-400 urn was optimal for 
tissue regeneration. Similar observations were also found to be true for porous metal implants in total 
hip replacement [Cook et al., 1991]. The question of interconnecting pores may not be a critical issue 
in a collagen template as collagenases are synthesized by most inflammatory cells during wound healing 
and remodeling processes. The interporous membranes which exist in the noninterconnecting pores 
should be digested as part of resorption and wound healing processes. 

Mechanical Property 

In designing a resorbable collagen implant for weight-bearing applications, not only the initial mechanical 
strength is important, but the gradual strength reduction of the partially resorbed template has to be 
compensated by the strength increase from the regenerated tissue such that at any given time point, the 
total mechanical properties of the template are maintained. In order to accomplish this goal, one must 
first be certain that the initial mechanical properties are adequate for supporting the weight-bearing 
application. For example, compressing the implant with multiple body weights should not cause fraying 
of the collagen matrix material. It is also of particular importance to design an implant having an adequate 
and consistent suture pullout strength in order to reduce the incidence of detachment of the implant 
from the host tissue. The suture pullout strength is also important during surgical procedures as the lack 
of suture pull strength may result in retrieval and reimplantation of the template. In meniscal tissue 
repair the suture pullout strength of 1 kg has been found to be adequate for arthroscopically assisted 
surgery in simulated placement procedures in human cadaver knees, and this suture pullout strength 
should be maintained as the minimal strength required for this particular application. 

Hydrophilicity 

Hydration of an implant facilitates nutrient diffusion. The extent of hydration would also provide 
information on the space available for tissue ingrowth. The porous collagen matrix is highly hydrophilic 
and therefore facilitates cellular ingrowth. The biomechanical properties of the hydrophilic collagen 
matrix such as fluid outflow under stress, fluid inflow in the absence of stress, and the resiliency for 
shock absorption are the properties also found in the weight-bearing cartilagenous tissues. 

Permeability 

The permeability of ions and macromolecules is of primary importance in tissues that do not rely on 
vascular transport of nutrients to the end organs. The diffusion of nutrients into the interstitial space 
ensures the survival of the cells and their continued ability of growth and synthesis of tissue specific 
extracellular matrix. Generally, the permeability of a macromolecule the size of the bovine serum albumin 
(MW 67,000) can be used as a guideline for probing accessibility of the interstitial space of a collagen 
template [Li et al, 1994]. 

In Vivo Stability 

As stated above, the rate of template resorption and the rate of new tissue regeneration have to be balanced 
so that the adequate mechanical properties are maintained at all times. The rate of in vivo resorption of 
a collagen-based implant can be controlled by controlling the density of the implant and the extent of 
intermolecular crosslinking. The lower the density, the greater the interstitial space and generally the 
larger the pores for cell infiltration, leading to a higher rate of matrix degradation. The control of the 
extent of intermolecular crosslinking can be accomplished by using bifunctional crosslinking agents under 
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conditions that do not denature the collagen. Glutaraldehyde, formaldehyde, adipyl chloride, hexame- 
thylene diisocyanate, and carbodiimides are among the many agents used in crosslinking the collagen- 
based implants. Crosslinking can also be achieved through vapor phase of a crosslinking agent. The vapor 
phase crosslinking is effective using crosslinking agents of high vapor pressures such as formaldehyde 
and glutaraldehyde. The vapor crosslinking is particularly useful for thick implants of vapor permeable 
dense fibers where crosslinking in solution produces nonuniform crosslinking. In addition, intermolec- 
ular crosslinking can be achieved by heat treatment under high vacuum. This treatment causes the 
formation of an amide bond between an amino group of one molecule and the carboxyl group of an 
adjacent molecule and has often been referred to in the literature as dehydrothermal crosslinking. 

The shrinkage temperature of the crosslinked matrix has been used as a guide for in vivo stability of 
a collagen implant [Li, 1988]. The temperature of shrinkage of collagen fibers measures the transition 
of the collagen molecules from the triple helix to a random coil conformation. This temperature depends 
on the number of intermolecular crosslinks formed by chemical means. Generally, the higher the number 
of intermolecular crosslinks, the higher the thermal shrinkage temperature and more stable the material 
in vivo. 

A second method of assessing the in vivo stability is to determine the crosslinking density by applying 
the theory of rubber elasticity to denatured collagen [Wiederhorn and Beardon, 1952]. Thus, the in vivo 
stability can be directly correlated with the number of intermolecular crosslinks introduced by a given 
crosslinking agent. 

Another method that has been frequently used in assessing the in vivo stability of a collagen-based 
implant is to conduct an in vitro collagenase digestion of a collagen implant. Bacterial collagenase is 
generally used in this application. The action of bacterial collagenase on collagen is different from that 
of mammalian collagenase [Woolley, 1984]. In addition, the enzymatic activity used in in vitro studies 
is arbitrarily defined. Thus, the data generated from the bacterial collagenase should be viewed with 
caution. The bacterial collagenase digestion studies, however, are useful in comparing a prototype with 
a collagen material of known rate of in vivo resorption. 

Each of the above parameters should be considered in designing a resorbable implant. The interde- 
pendency of the parameters must also be balanced for maximal efficacy of the implant. 



42.4 Tissue Engineering for Tissue 
and Organ Regeneration 



Biomedical applications of collagen have entered a new era in the past decade. The potential use of 
collagen materials in medicine has increasingly been appreciated as the science and technology advances. 

One major emerging field of biomedical research which has received rigorous attention in recent years 
is tissue engineering. Tissue engineering is an interdisciplinary science of biochemistry, cell and molecular 
biology, genetics, materials science, biomedical engineering, and medicine to produce innovative three- 
dimensional composites having structure/function properties that can be used either to replace or correct 
poorly functioning components in humans and animals or to introduce better functional components 
into these living systems. Thus, the field of tissue engineering requires a close collaboration among various 
disciplines for success. 

Tissue engineering consists primarily of three components: (1) extracellular matrix, (2) cells, and 
(3) regulatory signals (e.g., tissue specific growth factors). One of the key elements in tissue engineering 
is the extracellular matrix which either provides a scaffolding for cells or acts as a delivery vehicle for 
regulatory signals such as growth factors. 

Type I collagen is the major component of the extracellular matrix and is intimately associated with 
development, wound healing, and regeneration. The development of the type I collagen based matrices 
described in this review article will greatly facilitate the future development of tissue engineering products 
for tissue and organ repair and regeneration applications. 
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TABLE 42.6 Survey of Collagen-Based Medical Products, and Research and Development Activities 

Applications Comments 

Hemostasis Commercial Products: Sponge, fiber, and felt forms are used in cardiovascular [Abbott and Austin, 1975]; 

neurosurgical [Rybock and Long, 1977]; dermatological [Larson, 1988]; ob/gyn [Cornell etal., 1985]; 
orthopaedic [Blanche and Chaux, 1988]; oral surgical applications [Stein et al., 1985]. 
Dermatology Commercial Products: Injectable collagen for soft tissue augmentation [Webster et al., 1984]; collagen 

based artificial skins [Bell et al., 1981; Yannas and Burke, 1981]. Research and Development: Collagen 
based wound dressings [Armstrong et al., 1986]. 
Cardiovascular Commercial Products: Collagen coated and gelatin coated vascular grafts [Jonas et al., 1988, Li, 1988]; 
Surgery and chemically processed human vein graft [Dardik et al., 1974]; bovine arterial grafts [Sawyer et al., 1977]; 

Cardiology porcine heart valves [Angell et al., 1982]; bovine pericardial heart valves [Walker et al., 1983]; vascular 

puncture hole seal device [Merino et al., 1992]. 
Neurosurgery Research and Development: Guiding peripheral nerve regeneration [Archibald et al., 1991; Yannas et al., 

1985]; dura replacement material [Collins etal., 1991]. 
Periodontal and Research and Development: Collagen membranes for periodontal ligament regeneration [Blumenthal, 
Oral Surgery 1988]; resorbable oral tissue wound dressings [Ceravalo and Li, 1988]; collagen/hydroxyapatite for 

augmentation of alveolar ridge [Gongloff and Montgomery, 1985]. 
Ophthalmology Commercial Products: Collagen corneal shield to facilitate epithelial healing [Ruffmi et al., 1989] . Research 

and Development: Collagen shield for drug delivery to the eye [Reidy et al., 1990]. 
Orthopaedic Commercial Products: Collagen with hydroxyapatite and autogenous bone marrow for bone repair 

Surgery [Hollinger et al., 1989]. Research and Development: Collagen matrix for meniscus regeneration [Li et al., 

1994]; collagenous material for replacement and regeneration of Achilles tendon [Kato et al., 1991]; 
reconstituted collagen template for ACL reconstruction [Li et al., 1997]. 
Other Research and Development: Drug delivery support [Sorensen et al., 1990]; delivery vehicles for growth 

Applications factors and bioactive macromolecules [Deatherage and Miller, 1987; Li et al., 1996]; collagenous matrix 

for delivery of cells for tissue and organ regeneration [Bell et al., 1981]. 

To date, collagen-based implants have been attempted for many tissue and organ repair and regener- 
ation applications. A complete historical survey of all potential medical applications of collagen is a 
formidable task but a selected survey of collagen-based medical products and the research and develop- 
ment activities are summarized in Table 42.6 as a reference. 

Defining Terms 

Alanine (Ala): One of the amino acids in collagen molecules. 

Allysine: The e-amino group of lysine has been enzymatically modified to an aldehyde group. 

Apparent density: Calculated as the weight of the dry collagen matrix per unit volume of matrix. 

Arginine (Arg): One of the amino acids in collagen molecules. 

Aspartic acid (Asp): One of the amino acids in collagen molecules. 

Atelocollagen: A collagen molecule without the telopeptides. 

Chondroitin sulfate: Sulfated polysaccharide commonly found in cartilages, bone, corea, tendon, and 

skin. 
Collagen: A family fibrous insoluble proteins having a triple helical conformation extending over a 

major part of the molecule. Glycine is present at every third amino acid in the triple helix and 

proline and hydroxyproline are required in the triple helix. 
Collagenase: A proteolytic enzyme that specifically catalyzes the degradation of collagen molecules. 
Dehydrohydroxylysinonorleucine (deH-HLNL): A covalently crosslinked product between an allysine 

and a hydroxylysine residues in collagen fibrils. 
D spacing: The repeat distance observed in collagen fibrils by electron microscopic and x-ray diffraction 

methods. 
Elastin: One of the proteins in connective tissue. It is highly stable at high temperatures and in 

chemicals. It also has rubberlike properties. 
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Fiber: A bundled group of collagen fibrils. 

Fibril: A self-assembled group of collagen molecules. 

Fibroblast: Any cell from which connective tissue is developed. 

Fibrochondrocyte: Type of cells that are associated with special types of cartilage tissues such as 

meniscus of the knee and intervertebral disc of the spine. 
Fibrous long spacing (FLS): One of the polymorphic forms of collagen where the collagen molecules 

are randomly aligned in either head-to-tail, tail-to-tail, or head-to-head orientation. 
Gelatin: A random coiled form (denatured form) of collagen molecules. 
Glutamic acid (Glu): One of the amino acids in collagen molecules. 

Glycine (Gly): One of the amino acids in collagen molecules having the simplest structure. 
Glycoprotein: A compound consisting of a carbohydrate protein. The carbohydrate is generally hex- 

osamine, an amino sugar. 
Glycosaminoglycan (GAG): A polymerized sugar (see polysaccharide) commonly found in various 

connective tissues. 
Helical pitch: Repeating distance within a single polypeptide chain in a collagen molecule. 
Hemostat: Device or medicine which arrests the flow of blood. 
Hydrophilicity: The tendency to attract and hold water. 
Hydrophobicity: The tendency to repel or avoid contact with water. Substances generally are nonpolar 

in character, such as lipids and nonpolar amino acids. 
Hydroxylysine (Hyl): One of the amino acids in collagen molecules. 
Hydroxyproline (Hyp): One of the amino acids uniquely present in collagen molecules. 
Inflammatory cell: Cells associated with the succession of changes which occur in living tissue when 

it is injured. These include macrophages, polymorphonuclear leukocytes, and lymphocytes. 
Intermolecular crosslink: Covalent bonds formed in vivo between a side group of one molecule and 

a side group of another molecule; covalent bonds formed between a side group of one molecule 

and one end of a bifunctional agent and between a side group of a second molecule and the other 

end of a bifunctional agent. 
Intrafibrillar volume: The volume of a fibril excluding the volume occupied by the collagen molecule. 
In vitro: In glass, as in a test tube. An in vitro test is one done in the laboratory, usually involving 

isolated tissues, organs, or cells. 
In vivo: In the living body or organism. A test performed in a living organism. 
Isoelectric point: Generally used to refer to a particular pH of a protein solution. At this pH, there is 

no net electric charge on the molecule. 
Isoleucine (He): One of the amino acids in collagen molecules. 
Leucine (Leu): One of the amino acids in collagen molecules. 
Lipid: Any one of a group of fats or fat-like substances, characterized by their insolubility in water and 

solubility in fat solvents such as alcohol, ether, and chloroform. 
Lysine (Lys): One of the amino acids in collagen molecules. 
Meniscus: A C-shaped fibrocartilage anatomically located between the femoral condyles and tibial 

plateau providing stability and shock absorption and assisting in lubrication of the knee joint. 
Macrophage: Cells of the reticuloendothelial system having the ability to phagocytose particulate 

substances and to store vital dyes and other colloidal substances. They are found in loose connective 

tissues and various organs of the body. 
Mycobacterium: A genus of acid-fast organisms belonging to the Mycobacteriaceae which includes the 

causative organisms of tuberculosis and leprosy. They are slender, nonmotile, gram-positive rods 

and do not produce spores or capsules. 
Osteoarthritis: A chronic disease involving the joint, especially those bearing the weight, characterized 

by destruction of articular cartilage, overgrown of bone with impaired function. 
Permeability: The space within a collagen matrix, excluding the space occupied by collagen molecules, 

which is accessible to a given size of molecule. 
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Pepsin: A proteolytic enzyme commonly found in the gastric juice. It is formed by the chief cells of 

gastric glands and produces maximum activity at a pH of 1.5-2.0. 
Phenolalanine (Phe): One of the amino acids in collagen molecules. 
Platelet: A round or oval disk, 2 to 4 urn in diameter, found in the blood of vertebrates. Platelets 

contain no hemoglobin. 
Polydioxanone: A synthetic polymer formed from dioxanone monomers which degrades by hydrolysis. 
Polyglycolic acid (PGA): A synthetic polymer formed from glycolic acid monomers which degrades 

by hydrolysis. 
Polylactic acid (PLA): A synthetic polymer formed from lactic acid monomers which degrades by 

hydrolysis. 
Polymorphism: Different types of aggregated states of the collagen molecules. 
Polymorphonuclear leukocyte: A white blood cell which possesses a nucleus composed of two or more 

lobes or parts; a granulocyte (neutrophil, eosinophil, basophil). 
Polypeptide: Polymerized amino acid molecules formed by enzymatically regulated stepwise polymer- 
ization in vivo between the carboxyl group of one amino acid and the amino group of a second 

amino acid. 
Polysaccharide: Polymerized sugar molecules found in tissues as lubricant (synovial fluid) or cement 

(between osteons, tooth root attachment) or complexed with proteins such as glycoproteins or 

proteoglycans. 
Proline (Pro): One of the amino acids commonly occurring in collagen molecules. 
Proteolytic enzyme: Enzymes which catalyze the breakdown of native proteins. 
Resorbable collagen: Collagen which can be biodegraded in vivo. 
Salt-linkage: An electrostatic bond formed between a negative charge group and a positive charge 

group in collagen molecules and fibrils. 
Segment-long-spacing (SLS): One of the polymorphic forms of collagen where all heads of collagen 

molecules are aligned in parallel. 
Soluble collagen: Collagen molecules that can be extracted with salts and dilute acids. Soluble collagen 

molecules contain the telopeptides. 
Telopeptide: The two short nontriple helical peptide segments located at the ends of collagen molecules. 
Valine (Val): One of the amino acids in collagen molecules. 
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43.1 Blood Interfacing Implants 



K. B. Chandran 
Introduction 

Blood comes in contact with foreign materials for a short term in extracorporeal devices such as dialysers, 
blood oxygenators, ventricular assist devices, and catheters. Long term vascular implants include heart 
valve prostheses, vascular grafts, and cardiac pacemakers among others. In this section, we will be 
concerned with development of biomaterials for long-term implants, specifically for heart valve prosthe- 
ses, total artificial heart (TAH), and vascular grafts. The primary requirements for biomaterials for long- 
term implants are biocompatibility, nontoxicity, and durability. Furthermore, the material should be 
nonirritating to the tissue, resistant to platelet and thrombus deposition, nondegradable in the physio- 
logical environment, and neither absorb blood constituents nor release foreign substances into the blood 
stream [Shim and Lenker, 1988]. In addition, design considerations include that the implant should 
mimic the function of the organ that it replaces without interfering with the surrounding anatomical 
structures and must be of suitable size and weight. The biomaterials chosen must be easily available, 
inexpensive, easily machinable, sterilizable, and have a long storage life. The selection of material will 
also be dictated by the strength requirement for the implant being made. As an example, an artificial 
heart valve prosthesis is required to open and close on an average once every second. The biomaterial 
chosen must be such that the valve is durable and will not fail under fatigue stress after implantation 
in a patient. As sophisticated measurement techniques and detailed computational analyses become 
available with the advent of super computers, our knowledge on the complex dynamics of the functioning 
of the implants is increasing. Improvements in design based on such knowledge and improvements in 
selection and manufacture of biomaterials will minimize problems associated with blood interfacing 
implants and significantly improve the quality of life for patients with implants. We will discuss the 
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TABLE 43.1 Heart Valve Prostheses Developed and Currently Available in the U.S. 



Type 

Caged ball 
Tilting disc 



Bileaflet 



Porcine bioprostheses 



Pericardial bioprostheses 



Name 

Starr-Edwards 

Medtronic-Hall 

Lillehei-Kaster 

Omni-Science 

St. Jude Medical 

Carbomedics 

ATS Valve* 

On-X valve" 

Carpentier-Ed wards 

Standard 

Hancock Standard 

Hancock modified orifice 

Hancock II 

Carpentier-Edwards 



Manufacturer 

Baxter Health Care, Irvine, CA 

Medtronic Blood Systems, Minneapolis, MN 

Medical Inc., Inner Grove Heights, MN 

St. Jude Medical, Inc., St. Paul, MN 

Carbomedics, Austin, TX 

ATS Medical, St. Paul, MN 

Medical Carbon Research Inst., Austin, TX 

Baxter Health Care, Irvine, CA 

Medtronic Blood Systems, Santa Ana, CA 



Edwards Laboratories, Santa Ana, CA 



" FDA approval pending 

development of biomaterials for the blood interfacing implants, problems associated with the same, and 
future directions in the development of such implants. 



Heart Valve Prostheses 

Attempts at replacing diseased natural human valves with prostheses began about four decades ago. The 
details of the development of heart valve prostheses, design considerations, in vitro functional testing, 
and durability testing of valve prototypes can be found in several monographs [Shim and Lenker, 1988; 
Chandran, 1992] . The heart valve prostheses can be broadly classified into mechanical prostheses (made 
of non-biological material) and bioprostheses (made of biological tissue) . Currently available mechanical 
and tissue heart valve prostheses in the U.S. are listed in Table 43.1. 

Mechanical Heart Valves 

Lefrak and Starr [1970] describe the early history of mechanical valve development. The initial designs 
of mechanical valves were of centrally occluding caged ball or caged disc type. The Starr-Edwards caged 
ball prostheses, commercially available at the present time, was successfully implanted in the mitral 
position in 1961. The caged ball prostheses is made of a polished Co-Cr alloy (Stellite 21®) cage and a 
silicone rubber ball (Silastic®) which contains 2% by weight barium sulfate for radiopacity (Fig. 43.1). 
The valve sewing rings use a silicone rubber insert under a knitted composite polytetrafiuorefhylene 
(PTFE-Teflon®) and polypropylene cloth. Even though these valves have proven to be durable, the 
centrally occluding design of the valve results in a larger pressure drop in flow across the valve and higher 
turbulent stresses distal to the valve compared to other designs of mechanical valve prostheses [Yoga- 
nathan et al., 1979a, b; 1986; Chandran et al., 1983]. The relatively large profile design of caged ball or 
disc construction also increases the possibility of interference with anatomical structures after implan- 
tation. The tilting disc valves, with improved hemodynamic characteristics, were introduced in the late 
1960s. The initial design consisted of a polyacetal (Delrin®) disc with a Teflon® sewing ring. Delrin acetal 
resins are thermoplastic polymers manufactured by the polymerization of formaldehyde [Shim and 
Lenker, 1988]. Even though Delrin exhibited excellent wear resistance and mechanical strength with 
satisfactory performance after more than 20 years of implantation, it was also found to swell when exposed 
to humid environments such as autoclaving and blood contact. To avoid design and manufacturing 
difficulties due to the swelling phenomenon, the Delrin disc was soon replaced by the pyrolytic carbon 
disc and has become the preferred material for mechanical valve prostheses occluders to date. Pyrolytic 
carbons are formed in a fluidized bed by pyrolysis of a gaseous hydrocarbon in the range of 1000 to 
2400°C. For biomedical applications, carbon is deposited onto a preformed polycrystalline graphite substrate 
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FIGURE 43.1 A caged-ball heart valve prosthesis. (Courtesy of Baxter Health Care, Irvine, CA.) 

at temperatures below 1500°C (low temperature isotropic pyrolytic carbon, LTI Pyrolite®). Increase in 
strength and wear resistance is obtained by codepositing silicone (up to 10% by weight) with carbon in 
applications for heart valve prostheses. The pyrolytic carbon discs exhibit excellent blood compatibility, 
as well as wear and fatigue resistance. The guiding struts of tilting disc valves are made of titanium or 
Co-Cr alloys (Haynes 25® and Stellite 21®). The Co-Cr based alloys, along with pure titanium and its 
alloy (Ti6A14V) exhibit excellent mechanical properties as well as resistance to corrosion and thrombus 
deposition. A typical commercially available tilting disc valve with a pyrolytic carbon disk is shown in 
Fig. 43.2a. A tilting disc valve with the leaflet made of ultra high molecular weight polyethylene (Chitra 
valve — Fig. 43.2b) is currently marketed in India. The advantages of leaflets with relatively more flexibility 
compared to pyrolytic carbon leaflets are discussed in Chandran et al. [1994a]. Another new concept in 
a tilting disc valve design introduced by Reul et al. [1995] has an S-shaped leaflet with leading and trailing 
edges being parallel to the direction of blood flow. The housing for the valve is nozzle-shaped to minimize 




FIGURE 43.2a Photograph of a typical tilting disc valve prosthesis. (Courtesy of Medtronic Heart Valves, Minne- 
apolis, MN.) 



S) 2000 by CRC Press LLC 




FIGURE 43.2b Chitra tilting disc valve prosthesis with the occluder made of ultra high molecular weight poly- 
ethylene. (Courtesy of Sree Chitra Tirunal Institute for Medical Sciences and Technology, India). 

flow separation at the inlet and energy loss in flow across the valve. Results from in vitro evaluation and 
animal implantation have been encouraging. 

In the late 1970s, a bileaflet design was introduced for mechanical valve prostheses and several different 
bileaflet models are being introduced into the market today. The leaflets as well as the housing of the bileaflet 
valves are made of pyrolytic carbon and the bileaflet valves show improved hemodynamic characteristics 
especially in smaller sizes compared to tilting disc valves. A typical bileaflet valve is shown in Fig. 43.3. 
Design features to improve the hydrodynamic characteristics of the mechanical valves include the opening 
angle of the leaflets [Baldwin et al., 1997] as well as having an open-pivot design in which the pivot area 
protrudes into the orifice and is exposed to the washing action of flowing blood [Drogue and Villafana, 
1997]. Other design modifications to improve the mechanical valve function include: the use of double 
polyester (Dacron®) velour material for the suture ring to encourage rapid and controlled tissue ingrowth, 




FIGURE 43.3 A CarboMedics bileaflet valve with pyrolytic carbon leaflets and housing (Courtesy of Sulzer-Car- 
boMedics, Austin, TX.) 
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FIGURE 43.4 

CA.) 



A tri-leaflet heart valve prosthesis under development. (Courtesy of Triflo Medical, Inc., Costa Mesa, 



and mounting the cuff on a rotation ring which surrounds the orifice ring to protect the cuff mounting 
mechanism from deeply placed annulus sutures. A PTFE (Teflon®) insert in the cuff provides pliability 
without excessive drag on the sutures. Tungsten (20% by weight) is incorporated into the leaflet substrate 
in order to visualize the leaflet motion in vivo. 

Another attempt to design a mechanical valve which mimics the geometry and function of the tri- 
leaflet aortic valve is that of Lapeyre et al. [1994: Fig.43.4a, b]. The geometry of the valve affords true 
central flow characteristics with reduced backflow. Accelerated fatigue tests have also shown good wear 
characteristics for this design and the valve is undergoing further evaluation including animal studies. 
Other improvements in the mechanical valves which augment performance include: machining of the 
valve housing to fit a disk so as to produce optimal washing and minimal regurgitation [McKenna, 1997] ; 
a supra-annular design so that a larger-sized valve can be inserted in the aortic position in the case of 
patients with small aortic annulus [Bell, 1997]; and coating of a titanium alloy ring with a thin, uniform, 
and strongly adherent film of high-density turbostratic carbon (Carbofilm™) [Bona et al., 1997] in order 
to integrate the structural stability of the metal alloy to the non-thrombogenecity of pyrolytic carbon. 
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Details of contemporary design efforts in mechanical valve design and potential future biomaterials such 
as Boralyn® (boron carbide) are discussed in Wieting [1997]. 

In spite of the desirable characteristics of the biomaterials used in the heart valve prostheses, problems 
with thrombo-embolic complications are significant with implanted valves and patients with mechanical 
valves are under long-term anticoagulant therapy. The mechanical stresses induced by the flow of blood 
across the valve prostheses have been linked to the lysis and activation of formed elements of blood (red 
blood cells, white blood cells, and platelets) resulting in the deposition of thrombi in regions with relative 
stasis in the vicinity of the prostheses. Numerous in vitro studies with mechanical valves in pulse dupli- 
cators simulating physiological flow have been reported in the literature and have been reviewed by 
Chandran [1988] and Dellsperger and Chandran [1991]. Such studies have included measurement of 
velocity profiles and turbulent stresses distal to the valve due to flow across the valve. The aim of these 
studies has been the correlation of regions prone to thrombus deposition and tissue overgrowth with 
explanted valves and the experimentally measured bulk turbulent shear stresses as well as regions of 
relative stasis. In spite of improvements in design of the prostheses to afford a centralized flow with 
minimal flow disturbances and fluid mechanical stresses, the problems with thrombus deposition remain 
significant. 

Reports of strut failure, material erosion and leaflet escapes, as well as pitting and erosion of valve 
leaflets and housing, have resulted in numerous investigations of the closing dynamics of mechanical 
valves. The dynamics of the leaflet motion and its impact with the valve housing or seat stop is very 
complex and a number of experimental and numerical studies have appeared recently in the literature. 
As the leaflet impacts against the seat stop and comes to rest instantaneously, high positive and negative 
pressure transients are present on the outflow and inflow side of the occluder, respectively, at the instant 
when the leaflet impacts against the seat stop or the guiding strut [Leuer, 1986; Chandran et al., 1994a]. 
The negative pressure transients have been shown to reach magnitudes below the liquid vapor pressure 
and have been demonstrated to be a function of the loading rate on the leaflet inducing the valve closure. 
As the magnitudes of negative pressure transients go below the liquid vapor pressure, cavitation bubbles 
are initiated and the subsequent collapse of the cavitation bubbles may also be a factor in the lysis of red 
blood cells, platelets, and valvular structures [Chandran et al., 1994a; Lee et al, 1994]. Typical cavitation 
bubbles visualized in an in vitro study with tilting disc and bileaflet valves are shown in Fig. 43.5. A 
correlation is also observed between the region where cavitation bubbles are present, even though for a 
period of time less than a millisecond after valve closure, and sites of pitting and erosion reported in the 
pyrolytic carbon material in the valve housing and on the leaflets with explanted valves [Kafesjian, 1994] 
as well as those used in total artificial hearts [Leuer, 1987]. An electron micrograph of pitting and erosion 
observed in the pyrolytic carbon valve housing of an explanted bileaflet mechanical valve is shown in 
Fig. 43.6. The pressure transients at valve closure are substantially smaller in mechanical valves with a 
flexible occluder and leaflets made of ultra high molecular weight polyethylene (Fig. 43.2b) may prove 
to be advantageous based on the closing dynamic analysis [Chandran et al., 1994a] . A correlation between 
the average velocity of the leaflet edge and the negative pressure transients in the same region at the 
instant of valve closure, as well as the presence of cavitation bubbles has been reported recently [Chandran 
etal., 1997]. This study demonstrated that for the valves of the same geometry (e.g., tilting disk) and 
size, the leaflet edge velocity as well as the negative pressure transients were similar. However, the presence 
of cavitation bubbles depended on the local interaction between the leaflet and the seat stop. Hence, it 
was pointed out that magnitudes of leaflet velocity or presence of pressure transients below the liquid 
vapor pressure might not necessarily indicate cavitation inception with mechanical valve closure. Chan- 
dran et al. [1998] have also demonstrated the presence of negative pressure transients in the atrial chamber 
with implanted mechanical valves in the mitral position in animals, demonstrating that potential for 
cavitation exists with implanted mechanical valves. Similar to the in vitro results, the transients were of 
smaller magnitudes with the Chitra valve made of flexible leaflets, and no pressure transients were 
observed with tissue valve implanted in the mitral position in vivo. The demonstration of the negative 
pressure transients with mechanical valve closure also shows that this phenomenon is localized and the 
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FIGURE 43.5 Cavitation bubbles visualized on the inflow side of the valves in vitro [Chandran et al„ 1994a]: 
(a) Medtronic-Hall tilting disc valve; (b) Edwards-Duromedics bileaflet valve; (c) CarboMedics bileaflet valve. 

flow chamber or valve holder rigidity with the in vitro experiments will not affect the valve closing 
dynamics. 

The pressure distribution on the leaflets and impact forces between the leaflets and guiding struts have 
also been experimentally measured in order to understand the causes for strut failure [Chandran et al., 
1994b] . The flow through the clearance between the leaflet and the housing at the instant of valve closure 
[Lee and Chandran, 1994a, b] and in the fully closed position [Reif, 1991] and the resulting wall shear 
stresses within the clearance are also being suggested as responsible for clinically significant hemolysis 
and thrombus initiation. Detailed analysis of the complex closing dynamics of the leaflets may also be 
exploited in improving the design of the mechanical valves to minimize problems with structural failure 
[Cheon and Chandran, 1994]. Further improvements in the design of the valves based on the closing 
dynamics as well as improvements in material may result in minimizing thrombo-embolic complications 
with implanted mechanical valves. 



© 2000 by CRC Press LLC 




FIGURE 43.5 (continued) 




FIGURE 43.6 Photographs showing pitting on pyrolytic carbon surface of a mechanical heart valve. (Courtesy of 
Baxter Health Care, Irvine, CA.) 



Biological Heart Valves 

The first biological valves implanted were homografts with valves explanted from cadavers within 
48 hours after death. Preservation of the valves included various techniques of sterilization, freeze drying, 
and immersing in antibiotic solution. The use of homografts is not popular due to problems with long 
term durability and due to limited availability except in a few centers [Shim and Lenker, 1988; Lee and 
Boughner, 1991]. Attempts were also made in the early 1960s in the use of xenografts (valves made from 
animal tissue) and porcine bioprostheses became commercially available after the introduction of the 
gluteraldehyde (rather than formaldehyde which was initially used) fixation technique. Gluteraldehyde 
reacts with tissue proteins to form crosslinks and results in improved durability [Carpentier et al., 1969]. 
The valves are harvested from 7 to 12 month old pigs and attached to supporting stents and preserved. 
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(a) 



(b) 




FIGURE 43.7 Typical bioprostheses: (a) Hancock porcine bioprosthesis (courtesy of Medtronic Heart Valves, Min- 
neapolis, MN); (b) PhotoFix™ a pericardial prosthesis (courtesy of Sulzer-CarboMedics, Inc., Austin, TX.) 

The stent provided support to preserve the valve in the natural shape and to achieve normal opening 
and closing. Initial supports were made of metal and subsequently flexible polypropylene stents were 
introduced. The flexible stents provided the advantage of ease of assembling the valve and finite element 
analyses have demonstrated reduction in stresses at the juncture between the stent and tissue leaflets 
resulting in increased durability and increased leaflet coaptation area [Reis et al., 1971; Hamid et al., 
1985]. A typical porcine bioprosthesis is included in Fig. 43.7a. 

Fixed bovine pericardial tissue is also used to construct heart valves in which design characteristics 
such as orifice area, valve height, and degree of coaptation can be specified and controlled. Thus, the 
geometry and flow dynamics past pericardial prostheses mimic those of the natural human aortic valves 
more closely. Due to the low profile design of pericardial prostheses and increased orifice area, these 
valves are less stenotic compared to porcine bioprostheses, especially in smaller sizes [Chandran et al., 
1984]. In the currently available bioprostheses, the stents are constructed from polypropylene, Acetol® 
homopolymer or copolymer, Elgiloy wire, or titanium. A stainless steel radiopaque marker is also intro- 
duced to visualize the valve in vivo. Other biomaterials, which have been employed in making the 
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TABLE 43.2 Biomaterial Used in Heart Valve Prostheses [Shim and Lenker, 1988; Dellsperger 
and Chandran, 1988] 

Biomaterial 

Silastic 

Stellite 21®/Titanium 

Silicone rubber insert under knitted composite Teflon®/polypropylene cloth 

Delrin®; Pyrolytic carbon (carbon deposited on graphite substrate); ultra high 

molecular polyethylene (UHMPE) 
Haynes 25®/Titanium 
Teflon®/Dacron® 
Pyrolytic carbon 
Pyrolytic carbon 

Double velour Dacron® tricot knit polyester 
Porcine aortic valve fixed by stabilized gluteraldehyde 
Polypropylene stent covered with Dacron®; lightweight Elgiloy wire covered with 

porous knitted Teflon® cloth 
Dacron®; soft silicone rubber insert covered with porous, seamless Teflon® cloth 
Porcine pericardial tissue fixed by stabilized gluteraldehyde before leaflets are sewn 

to the valve stents 
Polypropylene stent covered with Dacron®; Elgiloy wire and nylon support band 

covered with polyester and Teflon® cloth 
PTFE fabric over silicone rubber filter 



Type 


Component 


Caged ball 


Ball/occluder 




Cage 




Suture ring 


Tilting disc 


Leaflet 




Housing/strut 




Suture ring 


Bileaflet 


Leaflets 




Housing 




Suture ring 


Porcine 


Leaflets 


bioprostheses 


Stents 




Suture ring 


Pericardial 


Leaflets 


bioprostheses 






Stents 




Suture ring 



bioprostheses, include fascia lata tissue as well as human duramater tissue. The former was prone to 
deterioration and hence unsuitable for bioprosthetic application, while the latter lacked commercial 
availability. 

The advantage with bioprostheses is the freedom from thrombo-embolism and hence not requiring 
long term anticoagulant therapy in general. These prostheses are preferable in patients who do not tolerate 
anticoagulants. On the other hand, bioprosthetic valves are prone to calcification and leaflet tear with 
an average lifetime of about 10 years before replacement is necessary, and is generally attributed to the 
tissue fixation process. Numerous attempts are being made to improve the design as well as fixation in 
bioprostheses in order to minimize problems with calcification and increase duration of the function of 
the implant. As an example, a bovine pericardial trileafiet valve (Fig. 43.7b) treated with a non-aldehyde 
fixation resulting in collagen crosslink formation without a new "foreign" chemical process [Phillips and 
Printz, 1997] has been introduced in the European market. A non-aldehyde iodine-based sterilization 
process also sterilizes the valve. 

Numerous studies linking the mechanical stresses on the leaflets with calcification, focal thinning, and 
leaflet failure [Thubrikar et al., 1982a; Sabbah etal., 1985], and design improvements to minimize the 
stresses on the leaflets [Thubrikar etal., 1982b] have been reported in the literature. Further details on 
the effects of tissue fixation and mechanical effects of fixation on the leaflets are reported elsewhere [Lee 
and Boughner, 1991]. Improvements in fixation techniques as well as in design of the bioprostheses are 
continually being made in order to minimize problems with calcification of the leaflets and improve the 
durability and functional characteristics of bioprosthetic heart valves [Piwnica and Westaby, 1998]. The 
biomaterials used in commercially available mechanical and bioprosthetic heart valves are included in 
Table 43.2. Table 43.3 includes a summary of the problems associated with implanted artificial heart valves. 

Synthetic Heart Valves 

Concurrently, efforts have also been made in the development of valve prostheses made of synthetic 
material. Several attempts to make bileaflet [Braunwald et al., 1960] and trileafiet valves [Roe et al., 1958; 
Hufnagel, 1977; Gerring etal., 1974; Ghista and Reul, 1977] made of polyurethanes, polyester fabrics, 
and silicone rubber were not successful due to problems with durability of relatively thin leaflets made 
of synthetic material. With the advent of the total artificial hearts (TAH) and left ventricular assist 
devices (LVAD) in the 1980s, an additional impetus on the development of synthetic valves is present. 
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TABLE 43.3 Common Problems with Implanted 
Prosthetic Heart Valves [Yoganathan et al., 1979a; 
Shim and Lenker, 1988; Chandran, 1992] 

I, Mechanical Valves: 

a) Thromboembolism 

b) Structural failure 

c) Red blood cell and platelet destruction 

d) Tissue overgrowth 

e) Damage to endothelial lining 

f) Paravalvular/perivalvular leakage 

g) Tearing of sutures 
h) Infection 

II. Bioprosthetic Valves: 

a) Tissue calcification 

b) Leaflet rupture 

c) Paravalvular/perivalvular leakage 

d) Infection 

Due to problems with thrombus deposition in the vicinity of the mechanical valves used in the TAH and 
subsequent stroke episodes in patients with permanent implants, the use of the device is currently 
restricted as a bridge to transplantation. In such temporary use before a donor heart becomes available 
(on an average of several weeks), the four mechanical prostheses used in the TAH results in substantial 
cost. Hence, efforts are being made to replace the mechanical valves with those made with synthetic 
material. With vacuum forming or solution casting techniques, synthetic valves can be made at a fraction 
of the cost of mechanical valves, provided their function in a TAH environment for several weeks will 
be satisfactory. Implantation of synthetic trileaflet valves [Russel et al., 1980; Harold et al., 1987], even 
more recently, have resulted in limited success due to leaflet failure and calcification. Hemodynamic 
comparison of vacuum formed and solution cast trileaflet valves to currently available bioprostheses have 
produced satisfactory results [Chandran et al., 1989a, b]. Finite element analysis of synthetic valves can 
be exploited in design improvements similar to those reported for bioprostheses [Chandran et al., 1991a] . 

Total Artificial Hearts (TAH) or Ventricular Assist Devices (VAD) 

Artificial circulatory support can be broadly classified into two categories. The first category is for those 
patients who undergo open heart surgery to correct valvular disorders, ventricular aneurysm, or coro- 
nary artery disease. In several cases, the heart may not recover sufficiently after surgery to take over the 
pumping action. In such patients ventricular assist devices are used as extracorporeal devices to maintain 
circulation until the heart recovers. Other ventricular assist devices include intra-aortic balloon pumps 
as well as cardiopulmonary bypass. Within several days or weeks, when the natural heart recovers, these 
devices will be removed. In the second category are patients with advanced stages of cardiomyopathy 
and are subjects for heart transplantation. Due to problems in the availability of suitable donor hearts, 
not all patients with a failed heart are candidates for heart transplantation. For those patients not selected 
for transplantation, the concept of replacing the natural heart with a total artificial heart has gained 
attention in recent years [Akutsu and Kolff, 1958; Jarvick, 1981; DeVries and Joyce, 1983; Unger, 1989; 
Kambic and Nose, 1991]. A number of attempts in the permanent implantation of TAH with pneumat- 
ically powered units were made in the 1980s. However, due to neurological complications as a result of 
thrombo-embolism, infection, and hematological and renal complications, permanent implantations are 
currently suspended. If a suitable donor heart is not readily available, TAHs can be used as "bridge to 
transplantation" for several weeks until a donor heart becomes available. Until recently, most of the 
circulatory assist devices were pneumatically driven and a typical pneumatic heart is shown in Fig. 43.8a. 
It has two chambers for the left and right ventricle with inlet and outlet valves for each of the chambers. 
A line coming from the external pneumatic driver passes through the skin and is attached to the 
diaphragm housing through the connector shown in the photograph. Thus, the patient is tethered to an 
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FIGURE 43.8 Typical prototype designs of total artificial hearts: (a) pneumatically powered TAH. The right and 
left ventricular chambers, inflow and outflow valves, as well as the connector for the pneumatic line are visible in 
the photograph; (b) electrically powered TAH. Shown are the external battery pack, transcutaneous energy trans- 
mission system (TETS) primary and secondary coils, implanted electronics, energy converter and the blood pumps, 
compliance chamber and the subcutaneous access port. (Courtesy of G. Rosenberg, Pennsylvania State University.) 
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external pneumatic drive. He can move around for a short period of time by attaching the pneumatic 
line to a portable driver that he can carry. 

Electrically driven blood pumps, which can afford tether-free operation within the body, unlike those 
of the pneumatically powered pumps, are currently at various stages of development for long-term use 
(of more than two years). The components of such devices include the blood pump in direct contact 
with blood, energy converter (from electrical to mechanical energy), variable column compensator, 
implantable batteries, transcutaneous energy transmission system, and external batteries. The blood 
pump configuration in these devices includes sac, diaphragm, and pusher plate devices. Materials used 
in blood contacting surfaces in these devices are synthetic polymers (polyurethanes, segmented polyure- 
thanes, Biomer®, and others). Segmented polyurethane elastomer used in prosthetic ventricles with a 
thromboresistant additive modifying the polymeric surface have resulted in improved blood compatibility 
and reduced thrombo-embolic risk in animal trials [Farrar et al., 1988]. Design considerations include 
reduction of regions of stagnation of blood within the blood chamber and minimizing the mechanical 
stresses induced on the formed elements in blood. Apart from the characteristics of these materials to 
withstand repetitive high mechanical stresses and minimize failure due to fatigue, surface interaction 
with blood is also another crucial factor. An electrically powered total artificial heart intended for long 
term implantation is shown in Fig. 43.8b. The details of the design considerations for the circulatory 
assist devices are included in Rosenberg [1995a] and details of the evaluation of the electrically powered 
heart is included in Rosenberg et al. [1995b]. 

Due to significant problems with thrombo-embolic complications and subsequent neurological prob- 
lems with long-term implantation of TAH in humans, attention has been focused on minimizing factors 
responsible for thrombus deposition. In order to eliminate crevices formed with the quick connect system, 
valves sutured in place at the inflow and outflow orifices were offered as an alternative in the Philadelphia 
Heart [Wurzel et al., 1988]. An alternative quick connect system using precision machined components 
has been demonstrated to reduce valve- and connector-associated thrombus formation substantially 
[Holfert et al., 1987]. Several in vitro studies have been reported in the literature in order to assess the 
effect of fluid dynamic stresses on thrombus deposition [Phillips et al., 1979; Tarbell et al., 1986; Baldwin 
et al., 1990; Jarvis etal., 1991]. These have included flow visualization and laser Doppler anemometry 
velocity and turbulence measurements within the ventricular chamber as well as in the vicinity of the 
inflow and outflow orifices. The results of such studies indicate that the flow within the chamber generally 
has a smooth washout of blood in each pulsatile flow cycle with relatively large turbulent stresses and 
regions of stasis found near the valves. The thrombus deposition found with implanted TAH in the 
vicinity of the inflow valves also indicates that the major problem with the working of these devices are 
still with the flow dynamics across the mechanical valves. Computational flow dynamic analysis within 
the ventricular chamber may also be exploited to improve the design of the valve chambers and the 
mechanical valves in order to reduce the turbulent stresses near the vicinity of the inflow and outflow 
orifices [Kim etal., 1992]. Structural failure of the mechanical valves, initially reported with the TAH 
may have been the result of increased load on the valves during closure due to the relatively large dp/dt 
(p is pressure, t is time) at which the TAH was operated. Attempts at reducing the dp/dt during closure 
of the inflow valves have also been reported with modified designs of the artificial heart driver [Wurzel 
et al., 1988]. Due to the relatively large dp/dt at which TAHs are operated, there is increased possibility 
of cavitation bubble initiation and subsequent collapse of the bubbles may also be another important 
reason for thrombus deposition near the mechanical valve at the inflow orifice. Introducing synthetic 
valves to replace the mechanical valves [Chandran etal., 1991b] may prove to be advantageous with 
respect to cavitation initiation and may minimize thrombus formation. 

Vascular Prostheses 

In advanced stages of vascular diseases such as obstructive atherosclerosis and aneurysmal dilatation, 
when other treatment modalities fail, replacement of diseased segments with vascular prostheses is a 
common practice. Vascular prostheses can be classified as given in Table 43.4. 
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TABLE 43.4 Classification of Vascular Prostheses 



Prosthesis Comments 

Surgically-Implanted Biological Grafts 

Autograft Graft transplanted from part of a patient's body to another. 

Example: saphenous vein graft for peripheral bypass. 
Allograft Homograft. Transplanted vascular graft tissues derived 

from the same species as recipient. Example: 

glutaraldehyde treated umbilical cord vein graft. 
Xenograft Heterograft. Surgical graft of vascular tissues derived from 

one species to a recipient of another species. Example: 

modified bovine heterograft. 

Surgically Implanted Synthetic Grafts 

Dacron (polyethylene terephthalate) Woven, knitted 
PTFE (polytetrafluoroethylene) Expanded, knitted 

Other Nylon, polyurethane 



Surgically Implanted Biological Grafts 

Arterial homografts, even though initially used in large scale, resulted in aneurysm formation especially 
in the proximal suture line [Strandness and Sumner, 1975]. Still, a viable alternative is to use the 
saphenous vein graft from the same patient. Vein grafts have a failure rate of about 20% in one year and 
up to 30% in five years after implantation. Vein grafts from the same patients are also unavailable or 
unsuitable in about 10 to 30% of the patients [Abbott and Bouchier-Hayes, 1978]. Modified bovine 
heterograft and gluteraldehyde treated umbilical cord vein grafts have also been employed as vascular 
prostheses with less success compared to autologous vein grafts. 

Surgically Implanted Synthetic Grafts 

Prostheses made of synthetic material for vascular replacement have been used for over 40 years. Poly- 
meric material currently used as implants include nylon, polyester, polytetrafluoroethylene (PTFE), 
polypropylene, polyacrylonitrile, and silicone rubber [Park and Lakes, 1992]. However, Dacron® (poly- 
ethylene terephthalate) and PTFE are the more common vascular prostheses materials currently available. 
These materials exhibit the essential qualities for implants — they are biocompatible, resilient, flexible, 
durable, and resistant to sterilization and biodegradation. Detailed discussion on the properties, manu- 
facturing techniques, and testing of Dacron® prostheses is included in Guidoin and Couture [1992]. 
Figure 43.9a depicts a Dacron vascular graft having a bifurcated configuration. Figure 43.9b shows 
expanded PTFE vascular grafts having a variety of configurations and sizes: straight, straight with external 
reinforcement rings (to resist external compression), and bifurcated. 

Synthetic vascular grafts implanted as large-vessel replacements have resulted in reasonable degrees of 
success. However, in medium- and small- diameter prostheses (less than 6 mm in diameter), loss of 
patency within several months after implantation is more acute. Graft failure due to thrombosis or 
intimal hyperplasia with thrombosis is primarily responsible in failures within 30 days after implantation, 
and intimal hyperplasia formation is the reason for failure within 6 months after surgery. Soon after 
implantation, a layer of fibrin and fibrous tissue covers the intimal and outer surface of the prosthesis, 
respectively. A layer of fibroblasts replaces the fibrin and is referred to as neointima. In the later stages, 
neointimal hyperplasia formation occurs and ultimately results in the occlusion of the vessels in small- 
diameter vascular grafts. Attempts are being made currently in suitably modifying the surface character- 
istics of the prostheses in order to reduce the problems with loss of patency. Studies are also being 
performed in order to understand the mechanical stresses induced at the anastomotic region, which may 
result in deposits on the intimal surface and occlusion of the vessels [Chandran and Kim, 1994]. The 
alterations in mechanical stresses with the implantation of vascular prostheses in the arterial circulation 
may include changes in the deformation and stress concentrations at the anastomotic site. Altered fluid 
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FIGURE 43.9a Photograph of a Dacron vascular graft having a bifurcated configuration. (Courtesy of W. L. Gore 
and Associates, Inc., Flagstaff, AZ.) 




FIGURE 43.9b Photographs of expanded PTFE vascular grafts with straight, straight with external reinforcement 
rings to resist compression, and bifurcated configurations. (Courtesy of W. L. Gore and Associates, Inc., Flagstaff, AZ.) 

shear stresses at the intimal surface in the vicinity of the anastomosis has also been suggested as important 
particularly since the loss of patency is present more often at the distal anastomosis. The vascular 
prostheses should have the same dynamic response after implantation as the host artery in order to reduce 
the effect of abnormal mechanical stresses at the junction. For a replacement graft of the same size as 
the host artery, mismatch in compliance may be the most important factor resulting in graft failure 
[Abbott and Bouchier-Hayes, 1978]. In implanting the prostheses, end-to-end configuration is common 
in the reconstruction of peripheral arteries. End-to-side configuration is common in coronary artery 
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bypass where blood will flow from the host artery (aorta) to the prosthesis branching out at the anasto- 
motic site. At the other end, the graft is attached distal to the occlusion in the host (coronary) vessel to 
enable perfusion of the vascular bed downstream from the occlusion. Numerous studies analyzing the 
abnormal flow dynamics within the anastomotic geometry and stress distribution within the vascular 
material at the junction to the prostheses have been reported in delineating the causes for intimal 
hyperplasia formation and loss of patency [Kim and Chandran, 1993; Kim et al., 1993; Ojha et al., 1990; 
Keynton et al., 1991: Chandran et al, 1992; Rodgers et al, 1987; Rhee and Tarbell, 1994] and a detailed 
discussion on the mechanical aspects of vascular prostheses can be found in Chandran and Kim [1994]. 
Improvements in the blood-surface interactions are also being attempted in order to improve the func- 
tioning capability of vascular grafts. Attempts at seeding the grafts with endothelial cells [Hunter et al., 
1983], and modifying the graft material properties by removing the crimping and heat fusing a coil of 
bendable and dimensionally stable polypropylene at the outer surface to make it kink resistant [Guidoin 
et al., 1983], and employing a compliant and biodegradable graft which will promote regeneration of 
arterial wall in small caliber vessels [VanderLeiet al., 1985, 1986] are a few examples of such improvements. 

Transluminally Placed Endovascular Prostheses (Stent-Grafts) 

Endoluminal approaches to treating vascular disease involve the insertion of a prosthetic device into the 
vasculature through a small, often percutaneous, access site created in a remote vessel, followed by the 
intraluminal delivery and deployment of a prosthesis via transcatheter techniques [Veith etal., 1995]. In 
contrast to conventional surgical therapies for vascular disease, the use of transluminally placed endovascular 
prostheses are distinguished by their "minimally invasive" nature. Because these techniques do not require 
extensive surgical intervention, they have the potential to simplify the delivery of vascular therapy, improve 
procedural outcomes, decrease procedural costs, reduce morbidity, and broaden the patient population that 
may benefit from treatment. Not surprisingly, endoluminal therapies have generated intense interest within 
the vascular surgery, interventional radiology, and cardiology communities over recent years. 

The feasibility of using transluminally placed endovascular prostheses, or stent-grafts, for the treatment 
of traumatic vascular injury [Marin etal., 1994], atherosclerotic obstructions [Cragg and Dake, 1993], 
and aneurysmal vascular disease [Parodi etal., 1991; Yusuf etal., 1994; Dake etal., 1994] has been 
demonstrated in human beings. Endoluminal stent-grafts continue to evolve to address a number of 
cardiovascular pathologies at all levels of the arterial tree. Figure 43.10a depicts endoluminal stent-grafts 
having a variety of configurations (straight, bifurcated) and functional diameters (peripheral, aortic) that 
are currently under clinical investigation. 

Endoluminal stent-grafts are catheter-deliverable endoluminal prostheses comprised of an intravas- 
cular stent component and a biocompatible graft component. The function of these devices is to provide 
an intraluminal conduit that enables blood flow through pathologic vascular segments without the need 
for open surgery. The stent component functions as an arterial attachment mechanism and provides 
structural support to both the graft and the treated vascular segment. By design, stents are delivered to 
the vasculature in a low profile, small diameter delivery configuration, and can be elastically or plastically 
expanded to a secondary, large diameter configuration upon deployment. Vascular attachment is achieved 
by the interference fit created when a stent is deployed within the lumen of a vessel having a diameter 
smaller than that of the stent. The graft component, on the other hand, is generally constructed from a 
biocompatible material such as expanded polytetrafluoroethylene (ePTFE), woven polyester (Dacron), 
or polyurethane. The graft component has a number of real and theoretical functions, including: segre- 
gating potential thromboemboli or atheroemboli from the bloodstream, presenting a physical barrier to 
mass transport between the bloodstream and arterial wall, and mitigating cellular infiltration and the 
host inflammatory response. Figure 43.10b shows a stent-graft implant consisting of an expanded poly- 
tetrafluoroethylene (ePTFE) graft that is externally supported along its entire length by a self-expanding 
nitinol stent. The implant is radially constrained and attached to the leading end of a dual lumen 
polyethylene delivery catheter that allows transluminal delivery and deployment. Following introduction 
into the vascular system, the implant is positioned fluoroscopically within the diseased segment and 
released from the delivery system. 
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FIGURE 43.10a Endoluminal stent-grafts of straight and bifurcated configurations and sizes currently under clin- 
ical investigation. (Courtesy of W. L. Gore and Associates, Inc., Flagstaff, AZ.) 




FIGURE 43.10b A stent-graft implant consisting of an expanded PTFE graft that is externally supported by a self- 
expanding nitinol stent. (Courtesy of W. L. Gore and Associates, Inc., Flagstaff, AZ.) 

Mechanical properties play an important role in determining the in vivo performance of an endolu- 
minal stent-graft. Since the graft component typically lacks significant structural integrity, the mechanical 
behavior of the stent-graft predominantly depends upon the mechanical properties of its stent compo- 
nent. The type of mechanism required to induce dilatation from the delivery (small diameter) configu- 
ration, to the deployed (large diameter) configuration typically classifies stents. Self- expanding stents are 
designed to spontaneously dilate (i.e., elastically recover) from the delivery diameter up to a maximal, 
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pre-determined deployed diameter; whereas balloon-expandable stents are designed to be plastically 
enlarged over a range of values with the use of appropriately sized and pressurized dilatation balloons. 
Consequently, self-expanding stents exert a continuous, radially outward directed force on periluminal 
tissues, while balloon-expandable stents assume a fixed diameter that resists recoil of the surrounding 
periluminal tissues. Both types of stents exhibit utilitarian features. For example, in comparison to 
balloon-expandable devices, self-expanding stents can be rapidly deployed without the use of dilatation 
balloons, are elastic and therefore less prone to external compression, can radially adapt to post-deploy- 
ment vascular remodeling, and retain some of the natural compliance of the vascular tissues. In contrast, 
balloon-expandable stents are much more versatile when it comes to conforming to irregular vascular 
morphologies because their diameter can be radially adjusted via balloon dilatation. Since the luminal 
diameter of self-expanding stents cannot be adjusted (i.e., enlarged) to any appreciable degree, accurate 
sizing of the host vessel is critical. A sizing mismatch resulting in oversizing can cause overcompression 
of the self-expanding stent and obstructive invagination of the stent into the lumen. Undersizing, in turn, 
can result in a poor interference fit, inadequate anchoring, device migration, and/or leakage of blood 
into the abluminal compartment. In either case, the stent provides a scaffold that structurally supports 
the graft material. Ongoing work in the field of biomedical engineering is directed at optimizing the 
biomechanical and biological performance of these devices. 

Conclusions 

In the last four decades, we have observed significant advances in the development of biocompatible 
materials to be used in blood interfacing implants. In the case of mechanical heart valve prostheses, 
pyrolytic carbon has become the material of choice for the occluder and the housing. The pyrolytic 
carbon is chemically inert and exhibits very little wear even after more than 20 years of use. However, 
thrombo-embolic complications still remain significant with mechanical valve implantation. The complex 
dynamics of valve function and the resulting mechanical stresses on the formed elements of blood appear 
to be the main cause for initiation of thrombus. More recent reports of structural failure with implanted 
mechanical valves and pitting and erosion observed on the pyrolytic carbon surfaces have resulted in 
investigations on cavitation bubble formation during valve closure. Along with further improvements in 
biomaterials for heart valves, detailed analysis of the closing dynamics and design improvements to 
minimize the adverse effects of mechanical stresses may be the key to reducing thrombus deposition. 
Improvements on mechanical heart valves or further developments in durable synthetic leaflet valves may 
also be vital for the development of TAHs for long-term implantation without neurologic complications. 
In the case of vascular grafts, the mismatch of material properties (compliance) between the host 
artery and the graft, as well as geometric considerations in end-to-side anastomoses, appear to be 
important for the loss of patency within several months after implantation particularly with medium 
and small diameter arterial replacement. Most of the vascular grafts are stiffer compared to the host 
artery and it has been suggested that the mechanical stresses resulting from the discontinuity at the 
junction is the major cause for neointimal hyperplasia formation and subsequent occlusion of the conduit. 
Developments with more compliant grafts and in modifying the surface interaction of the graft with 
blood (endothelialization or other treatment of the graft material) may result in reducing the problems 
with loss of patency. Recent advances in the use of minimally invasive stent-grafts also show promise in 
improving the quality of life of patients with vascular disease. 

Defining Terms 

Acetol: Product of the addition of two moles of alcohol to one of an aldehyde. 

Aneurysms: Abnormal bulging or dilatation of a segment of a blood vessel or myocardium. 

Atherosclerosis: Lipid deposits in the intima of arteries. 

Artery: Blood vessel transporting blood in a direction away from the heart. 

ATS valve: A bileaflet mechanical valve made by ATS (Advancing The Standard) Inc. 

Autoclaving: Sterilizing by steam under pressure. 
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Biomer®: Segmented polyurethane elastomer. 

Bioprostheses: Prosthetic heart valves made of biological tissue. 

Blood oxygenators: Extracorporeal devices to oxygenate blood during heart bypass surgery. 

Bovine heterograft: Graft material (arterial) transplanted from bovine species. 

Calcification: Deposition of insoluble salts of calcium. 

Cardiac pacemakers: Prosthesis implanted to stimulate cardiac muscles to contract. 

Cardiopulmonary bypass: Connectors bypassing circulation to the heart and the lungs. 

Catheters: Hollow cylindrical tubing to be passed through the blood vessels or other canals. 

Cavitation bubbles (vapor cavitation): Formation of vapor bubbles due to transient reduction in 

pressure to below the liquid vapor pressure. 
Closing dynamics: Dynamics during the closing phase of heart valves. 
Compliance: A measure of ease with which a structure can be deformed; ratio of volumetric strain to 

increase in unit pressure. 
Crimping: Creasing of the synthetic vascular grafts in the longitudinal direction to accommodate the 

large intermittent flow of blood. 
Delrin®: Polyacetal made by Union Carbide. 

Dialysers: Devices to filter the blood of waste products taking over the function of the kidney. 
dp/dt: Slope of the pressure vs. time curve of the ventricles. 

Duramater: A tough fibrous membrane forming the outer cover of the brain and the spinal cord. 
Electrohydraulic blood pump: Blood pumps energized by the conversion of electrical to hydraulic 

energy. 
End-to-end configuration: End of the vascular graft anastamosed to the end of the host artery. 
End-to-side configuration: End of the vascular graft anastamosed to the side of the host. 
Endothelial cells: A layer of flat cells lining the intimal surface of blood vessels. 
Erosion: A state of being worn away. 

Fascia lata: A sheet of fibrous tissue enveloping the muscles of the thigh. 
Fatigue stress: Level of stress below which the material would not undergo fatigue failure (10 7 cycles 

is used as the normal limit). 
Fibrin: An elastic filamentous protein derived from fibrinogen in coagulation of the blood. 
Fibroblasts: An elongated cell with cytoplasmic processes present in connective tissue capable of 

forming collagen fibers. 
Finite element analysis: Structural analysis with the aid of a computer which divides the structure into 

finite elements and applies the laws of mechanics on each element. 
Formaldehyde: Formic aldehyde, methyl aldehyde, a pungent gas used as antiseptic. 
Formed elements in blood: Red blood cells, white blood cells, platelets, and other cells in whole blood. 
Haynes 25®: Co-Cr alloy. 

Homografts: Transplants (heart valves, arterial segments, etc.) from the same species. 
Intra-aortic balloon pumps: A balloon catheter inserted in the descending aorta and alternately inflated 

and deflated timed to the EKG in order to assist the ventricular pumping. 
Laser Doppler anemometry: A velocity measurement device using the principle of Doppler shifted 

frequency of laser light by particles moving with the fluid. 
Leaflets: Occluders on valves which open and close to aid blood flow in one direction. 
Left ventricular assist devices: Prosthetic devices to assist the left ventricle in pumping blood. 
Liquid vapor pressure: Pressure at which liquid vaporizes. 
LTI: Low temperature (below 1500°C) isotropic pyrolytic carbon. 
Mechanical prostheses: Prostheses made of non-biological material. 
Negative pressure transients: Reduction in pressure for a short duration. 
Neointima: Newly formed intimal surface. 

Neointimal hyperplasia: Growth of new intimal surface formed by fibroblasts. 
Nylon: Synthetic polymer with condensation polymerization. 
Patency: State of being freely open. 
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Pericardial prostheses: Heart valve prosthesis made with fixed bovine pericardial tissue. 

Pitting: Depression or indent on a surface. 

Platelet: One of the formed elements of blood responsible for blood coagulation. 

Polypropylene: One of the vinyl polymers with good flex life and good environmental stress crack 

resistance. 
Polytetrafluoroethylene (PTFE): A fluorocarbon polymer known as Teflon®. 
Pusher plate devices: Artificial heart devices working with pusher plates moving the blood. 
Pyrolytic carbon: Carbon deposited onto preformed polycrystalline graphite substrate. 
Radiopacity: Being opaque to x-ray. 
Sewing rings: Rings surrounding the housing of artificial heart valves used to sew the valve to the tissue 

orifice with suture. 
Solution casting: Casting by pouring molten material on dyes to form a structure. 
Stellite 21®: Co-Cr alloy. 

Stent: A device used to maintain the bodily orifice or cavity. 
Strut: A projection in the structure such as guiding struts in heart valves used to guide the leaflets 

during opening and closing. 
TAH: Total artificial heart replacing a failed natural heart. 
Teflon®: See PTFE. 
Thrombo-embolic complications: Complications due to breaking away (emboli) of thrombus blocking 

the distal blood vessels. 
Thrombus: A clot in the blood vessels or in the cavities of the heart formed from the constituents of 

blood. 
Tilting disc valves: Valves with a single leaflet tilting open and shut. 
Titanium: Highly reactive metal having low density, good mechanical properties, and biocompatibility 

due to tenacious oxide layer formation. 
Turbulent stresses: Stresses generated in the fluid due to agitated random motion of particles. 
Ultra high molecular weight polyethylene: Linear thermoplastics with very high molecular weight 

(>2 x 10 6 g/mol) used for orthopedic devices such as acetabular cup for hip joint replacement. 
Umbilical cord vein grafts: Vascular graft made from umbilical cord veins. 
Vacuum forming: A manufacturing technique for thermoplastic polymer in which a sheet is heated 

and formed over a mold while a vacuum is present under the sheet. 
Valvular disorders: Diseased states of valves such as stenosis. 
Valvular structures: Components of valves such as leaflets, struts, etc. 
Vascular grafts: Grafts to replace segments of diseased vessels. 
Xenografts: Grafts obtained from species other than that of the recipient. 
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43.2 Non-Blood-Interfacing Implants for Soft Tissues 

K.J.L. Burg and S.W. Shalaby 

Most tissues other than bone and cartilage are of the soft category. Implants do not generally interface 
directly with blood; the exceptions are located primarily in the cardiovascular systems. Non-blood- 
interfacing soft tissue implants are used to augment or replace natural tissues or to redirect specific 
biological functions. The implants can be transient; that is, of short-term function and thus made of 
absorbable materials, or they can be long-term implants which are expected to have prolonged functions 
and are made of nonabsorbable materials. 

Toward the successful development of a new biomedical device or implant, including those used for 
soft tissues, the following milestones must be achieved: (1) acquire certain biologic and biomechanic 
data about the implant site and its function to aid in the selection of materials and engineering design 
of such an implant, to meet carefully developed product requirements; (2) construct a prototype and 
evaluate its physical and biologic properties both in vitro and in vivo, using the appropriate animal model; 
and (3) conduct a clinical study following a successful battery of animal safety studies depending on 
intended application and availability of historical safety and clinical data on the material or design. Extent 
of the studies associated with any specific milestone can vary considerably. Although different applications 
require different materials with specific properties, minimum requirements for soft-tissue implants 
should be met. The implant must (1) exhibit physical properties (e.g., flexibility and texture) which are 
equivalent or comparable to those called for in the product profile; (2) maintain the expected physical 
properties after implantation for a specific period; (3) elicit no adverse tissue reaction; (4) display no 
carcinogenic, toxic, allergenic, and/or immunogenic effect; and (5) achieve assured sterility without 
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compromising the physicochemical properties. In addition to these criteria, a product of potentially broad 
applications is expected to (1) be easily mass produced at a reasonable cost; (2) have acceptable aesthetic 
quality; (3) be enclosed in durable, properly labeled, easy-access packaging; and (4) have adequate shelf 
stability. 

The most common types of soft-tissue implants are (1) sutures and allied augmentation devices; 
(2) percutaneous and cutaneous systems; (3) maxillofacial devices; (4) ear and eye prostheses; (5) space- 
filling articles; and (6) fluid transfer devices. 

Sutures and Allied Augmentation Devices 

Sutures and staples are the most common types of augmentation devices. In recent years, interest in 
using tapes and adhesives has increased and may continue to do so, should new efficacious systems be 
developed. 

Sutures and Suture Anchors 

Sutures are usually packaged as a thread attached to a metallic needle. Although most needles are made 
of stainless steel alloys, the thread component can be made of various materials, and the type used 
determines the class of the entire suture. In fact, it is common to refer to the thread as the suture. Presently, 
most needles are drilled (mechanically or by laser) at one end for thread insertion. Securing the thread 
in the needle hole can be achieved by crimping or adhesive attachment. Among the critical physical 
properties of sutures are their diameter, in vitro knot strength, needle-holding strength, needle penetra- 
tion force, ease of knotting, knot security, and in vitro strength retention profile. 

Two types of threads are used in suture manufacturing and are distinguished according to the retention 
of their properties in the biologic environment, namely, absorbable and nonabsorbable. These may also 
be classified according to their source of raw materials, that is, natural (catgut, silk, and cotton), synthetic 
(nylon, polypropylene, polyethylene terephthalate, and polyglycolide and its copolymers), and metallic 
sutures (stainless steel and tantalum). Sutures may also be classified according to their physical form, 
that is, monofilament and twisted or braided multifilament (or simply braids). 

The first known suture, the absorbable catgut, is made primarily of collagen derived from sheep 
intestinal submucosa. It is usually treated with a chromic salt to increase its in vivo strength retention 
and through imparted crosslinking that retards absorption. Such treatment extends the functional per- 
formance of catgut suture from 1 to 2 weeks up to about 3 weeks. The catgut sutures are packaged in a 
specially formulated fluid to prevent drying and maintain necessary compliance for surgical handling 
and knot formation. 

The use of synthetic absorbable sutures exceeded that of catgut over the past two decades. This is 
attributed to many factors including (1) higher initial breaking strength and superior handling charac- 
teristics; (2) availability of sutures with a broad range of in vivo strength retention profiles; (3) consid- 
erably milder tissue reactions and no immunogenic response; and (4) reproducible properties and highly 
predictable in vivo performance. Polyglycolide (PG) was the first synthetic absorbable suture to be 
introduced, about three decades ago. Because of the high modulus of oriented fibers, PG is made mostly 
in the braided form. A typical PG suture braid absorbs in about 4 months and retains partial in vivo 
strength after 3 weeks. However, braids made of the 90/10 glycolide/1-lactide copolymer have a compa- 
rable or improved strength retention profile and faster absorption rate relative to PG. The copolymeric 
sutures absorb in about 3 months and have gained wide acceptance by the surgical community. 

As with other types of braided sutures, an absorbable coating which improves suture handling and 
knot formation has been added to the absorbable braids. To minimize the risk of infection and tissue 
drag that are sometimes associated with braided sutures, four types of monofilament sutures have been 
commercialized. The absorbable monofilaments were designed specifically to approach the engineering 
compliance of braided sutures, by combining appropriate materials to achieve low moduli, e.g., poly- 
dioxanone and copolymers of glycolide with caprolactone or trimethylene carbonate. 
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Members of the nonabsorbable family of sutures include braided silk (a natural protein), nylon, and 
polyethylene terephthalate (PET). These braids are used as coated sutures. Although silk sutures have 
retained wide acceptance by surgeons, nylon and particularly PET sutures are used for critical procedures 
where high strength and predictable long-term performance are required. Meanwhile, the use of cotton 
sutures is decreasing constantly because of their low strength and occasional tissue reactivity due to 
contaminants. Monofilaments are important forms of nonabsorbable sutures and are made primarily of 
polypropylene, nylon, and stainless steel. An interesting application of monofilament sutures is illustrated 
in the use of polypropylene loops (or haptics) for intraocular lenses. The polypropylene sutures exhibit 
not only the desirable properties of monofilaments but also the biologic inertness reflected in the minimal 
tissue reactions associated with their use in almost all surgical sites. With the exception of its natural 
tendency to undergo hydrolytic degradation and, hence, continued loss of mechanical strength postop- 
eratively, nylon monofilament has similar attributes to those of polypropylene. Because of their excep- 
tionally high modulus, stainless steel sutures are not used in soft-tissue repair because they can tear these 
tissues. All sutures can be sterilized by gamma radiation except those made of synthetic absorbable 
polymers, polypropylene, or cotton, which are sterilized by ethylene oxide. 

Related to the suture is the tissue suture anchor, used to attach soft tissue to bone. The anchor is 
embedded into bone and the suture can be used to reattach the soft tissue. The most common anchor 
is polylactide-based and is used in shoulder repair. 

Nonsuture Fibrous and Microporous Implants 

Woven PET and polypropylene fabrics are commonly used as surgical meshes for abdominal wall repair 
and similar surgical procedures where surgical "patching" is required. Braid forms and similar construc- 
tion made of multifilament PET yarns have been used for repairing tendons and ligaments. Microporous 
foams of polytetrafluoroethylene (PTFE) are used as pledgets (to aid in anchoring sutures to soft tissues) 
and in repair of tendons and ligaments. Microporous collagen-based foams are used in wound repair to 
accelerate healing. 

Clips, Staples, and Pins 

Ligating clips are most commonly used for temporary or long-term management of the flow in tubular 
tissues. Titanium clips are among the oldest and still-versatile types of clips. Thermoplastic polymers 
such as nylon can be injection-molded into different forms of ligating clips. These are normally designed 
to have a latch and living hinge. Absorbable polymers made of lactide/glycolide copolymers and poly- 
dioxanone have been successfully converted to ligating clips with different design features for a broad 
range of applications. 

Metallic staples were introduced about three decades ago as strong competitors to sutures for wound 
augmentation; their use has grown considerably over the past 10 years for everything from skin closure 
procedures to a multiplicity of internal surgical applications. Major advantages associated with the use 
of staples are ease of application and minimized tissue trauma. Metallic staples can be made of tantalum, 
stainless steel, or titanium-nickel alloys. Staples are widely used to facilitate closure of large incisions 
produced in procedures such as Caesarean sections and intestinal surgery. Many interesting applications 
of small staples have been discovered for ophthalmic and endoscopic use, a fast-growing area of minimally 
invasive surgery. 

Thermoplastic materials based on lactide/glycolide copolymers have been used to produce absorbable 
staples for skin and internal wound closures. These staples consist primarily of two interlocking compo- 
nents, a fastener and receiver. They are advantageous in that they provide a quick means of closure with 
comparable infection resistance. They are limited to locations which do not have large tensile loads and/or 
thicker or more sensitive tissue. 

A new form of ligating device is the subcutaneous pin. This is designed with a unique applicator to 
introduce the pin parallel to the axis of the wound. During its application, the linear pin acquires a zig- 
zag-like configuration for stabilized tissue anchoring. The pins are made of lactide/glycolide polymers. 
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Surgical Tapes 

Surgical tapes are intended to minimize necrosis, scar tissue formation, problems of stitch abscesses, and 
weakened tissues. The problems with surgical tapes are similar to those experienced with traditional skin 
tapes. These include (1) misaligned wound edges, (2) poor adhesion due to moisture or dirty wounds, 
and (3) late separation of tapes when hematoma or wound drainage occur. 

Wound strength and scar formation in skin may depend on the type of incision made. If the subcu- 
taneous muscles in the fatty tissue are cut and the overlying skin is closed with tape, then the muscles 
retract. This, in turn, increases the scar area, causing poor cosmetic appearance when compared to a 
suture closure. Tapes also have been used successfully for assembling scraps of donor skin for skin graft. 

Tissue Adhesives 

The constant call for tissue adhesives is particularly justified when dealing with the repair of exceptionally 
soft tissues. Such tissues cannot be easily approximated by sutures, because sutures inflict substantial 
mechanical damage following the traditional knotting scheme and associated shear stresses. However, 
the variable biological environments of soft tissues and their regenerative capacity make the development 
of an ideal tissue adhesive a difficult task. Experience indicates that an ideal tissue adhesive should (1) be 
able to wet and bond to tissues; (2) be capable of onsite formation by the rapid polymerization of a 
liquid monomer without producing excessive heat or toxic byproducts; (3) be absorbable; (4) not interfere 
with the normal healing process; and (5) be easily applied during surgery. The two common types of 
tissue adhesives currently used are based on alkyl-o-cyanoacrylates and fibrin. The latter is a natural 
adhesive derived from fibrinogen, which is one of the clotting components of blood. Although fibrin is 
used in Europe, its use in the U.S. has not been approved because of the risk of its contamination with 
hepatitis and/or immune disease viruses. Due to its limited mechanical strength (tensile strength and 
elastic modulus of 0.1 and 0.15 MPa, respectively), fibrin is used mostly as a sealant and for adjoining 
delicate tissues as in nerve anastomoses. Meanwhile, two members of the cyanoacrylate family of adhe- 
sives, n-butyl- and iso-butyl-cyano-acrylates, are used in a number of countries as sealants, adhesives, 
and blocking agents. They are yet to be approved for use in the U.S. because of the lack of sufficient 
safety data. Due to a fast rate of polymerization and some limited manageability in localizing the adhesive 
to the specific surgical site, the in vivo performance of cyanoacrylates can be unpredictable. Because of 
the low strength of the adhesive joints or sealant films produced on in vivo polymerization of these 
cyanoacrylates, their applications generally are limited to use in traumatized fragile tissues (such as spleen, 
liver, and kidney) and after extensive surgery on soft lung tissues. A major safety concern of these alkyl 
cyanoacrylates is related to their nonabsorbable nature. Hence, a number of investigators have directed 
their attention to certain alkoxy-alkyl cyanoacrylates which can be converted to polymeric adhesives with 
acceptable absorbable profiles and rheological properties. Methoxypropyl cyanoacrylate, for example, 
has demonstrated both the absorbability and high compliance that is advantageous to soft tissue repair. 

Percutaneous and Skin Implants 

The need for percutaneous (trans or through the skin) implants has been accelerated by the advent of 
artificial kidneys and hearts and the need for prolonged injection of drugs and nutrients. Artificial skin 
is urgently needed to maintain the body temperature and prevent infection in severely burned patients. 
Actual permanent replacement of skin by biomaterials is still a great clinical challenge. 

Percutaneous Devices 

The problem of obtaining a functional and viable interface between the tissue (skin) and an implant 
(percutaneous device) is primarily due to the following factors. First, although initial attachment of the 
tissue into the interstices of the implant surface occurs, attachment cannot be maintained for a sustained 
time since the dermal tissue cells turn over continuously. Downgrowth of epithelium around the implant 
or overgrowth on the implant leads to extrusion or invagination, respectively. Second, any opening near 
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the implant that is large enough for bacteria to penetrate may result in infection, even though initially 
there may be a tight seal between skin and implant. Several factors are involved in the development of 
percutaneous devices: 

1. Type of end use — this may deal with transmission of information (biopotentials, temperature, 
pressure, blood flow rate), energy (electrical stimulation, power for heart-assist devices), transfer 
of matter (cannula for blood), and load (attachment of a prosthesis); 

2. Engineering factors — these may address materials selection (polymers, ceramics, metals, and com- 
posites), design variation (button, tube with and without skirt, porous or smooth surface), and 
mechanical stresses (soft and hard interface, porous or smooth interface); 

3. Biologic factors — these are determined by the implant host (human, dog, hog, rabbit, sheep), and 
implant location (abdominal, dorsal, forearm); 

4. Human factors — these can pertain to postsurgical care, implantation technique, and esthetic look. 

No percutaneous devices are completely satisfactory. Nevertheless, some researchers believe that 
hydroxyapatite may be part of a successful approach. In one experimental trial, a hydroxyapatite-based 
percutaneous device was associated with less epidermal downgrowth (1 mm after 17 months vs. 4.6 mm 
after 3 months) when compared with a silicone rubber control specimen in the dorsal skin of canines. 
Researchers have also investigated coatings such as laminin-5 which has been shown to enhance epithelial 
attachment. 

Artificial Skins 

Artificial skin is another example of a percutaneous implant, and the problems are similar to those 
described above. Important for this application is a material which can adhere to a large (burned) surface 
and thus prevent the loss of fluids, electrolytes, and other biomolecules until the wound has healed. 

In one study on wound-covering materials with controlled physicochemical properties, an artificial 
skin was designed with a crosslinked collagen-polysaccharide (chondroitin 6-sulfate) composite mem- 
brane. This was specifically chosen to have controlled porosity (5-150 um in diameter), flexibility (by 
varying crosslink density), and moisture flux rate. 

Several polymeric materials and reconstituted collagen have also been examined as burn dressings. 
Among the synthetic ones are the copolymers of vinyl chloride and vinyl acetate as well as polymethyl 
cyanoacrylate (applied as a fast-polymerizing monomer). The latter polymer and/or its monomer were 
found to be too brittle and histotoxic for use as a burn dressing. The ingrowth of tissue into the pores 
of polyvinyl alcohol sponges and woven fabric (nylon and silicone rubber velour) was also attempted 
without much success. Nylon mesh bonded to a silicone rubber membrane, another design attempt, 
prevented water evaporation but has not been found to induce fibrovascular growth. 

Rapid epithelial layer growth by culturing cells in vitro from the skin of the burn patient for covering 
the wound area may offer a practical solution for less severely burned patients. Implantation of an 
allogenic fibroblast/polymer construct has proven useful for providing long-term skin replacement. 
Related to this, temporary tissue engineered replacements are possible alternatives for burns requiring 
larger area coverage. These can be similar to the synthetic dressing, a nylon mesh and silicone rubber 
component, but incorporates allogeneic fibroblasts. This temporary covering hopefully will stimulate or 
allow fibrovascular growth into the wound bed by providing the appropriate matrix proteins and growth 
factors. 

Maxillofacial Implants 

There are two types of maxillofacial implants: extraoral and intraoral. The former deals with the use of 
artificial substitutes for reconstructing defective regions in the maxilla, mandible, and face. Useful poly- 
meric materials for extraoral implants require: (1) match of color and texture with those of the patient; 
(2) mechanical and chemical stability (i.e., material should not creep or change color or irritate skin); 
and (3) ease of fabrication. Copolymers of vinyl chloride and vinyl acetate (with 5 to 20% acetate), 
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polymethyl methacrylate, silicones, and polyurethane rubbers are currently used. Intraoral implants are 
used for repairing maxilla, mandibular, and facial bone defects. Material requirements for the intraoral 
implants are similar to those of the extraoral ones. For the latter group of implants, metallic materials 
such as tantalum, titanium, and CoCr alloys are commonly used. For soft tissues, such as gum and chin, 
polymers such as silicone rubber and polymethylmethacrylate are used for augmentation. 

Ear and Eye Implants 

Implants can be used to restore conductive hearing loss from otosclerosis (a hereditary defect which 
involves a change in the bony tissue of the ear) and chronic otitis media (the inflammation of the middle 
ear which can cause partial or complete impairment of the ossicular chain). A number of prostheses are 
available for correcting these defects. The porous polyethylene total ossicular implant is used to achieve 
a firm fixation by tissue ingrowth. The tilt-top implant is designed to retard tissue ingrowth into the 
section of the shaft which may diminish sound conduction. Materials used in fabricating these implants 
include polymethyl methacrylate, polytetrafluoroethylene, polyethylene, silicone rubber, stainless steel, 
and tantalum. More recently, polytetrafluoroethylene-carbon composites, porous polyethylene, and pyro- 
lytic carbon have been described as suitable materials for cochlear (inner ear) implants. 

Artificial ear implants capable of processing speech have been developed with electrodes to stimulate 
cochlear nerve cells. Cochlear implants also have a speech processor that transforms sound waves into 
electrical impulses that can be conducted through coupled external and internal coils. The electrical 
impulses can be transmitted directly by means of a percutaneous device. 

Eye implants are used to restore the functionality of damaged or diseased corneas and lenses. Usually 
the cornea is transplanted from a suitable donor. In cataracts, eye lenses become cloudy and can be 
removed surgically. Intraocular lenses (IOL) are implanted surgically to replace the original eye lens and 
to restore function. IOL are made from transparent acrylics, particularly polymethyl methacrylate, which 
has excellent optical properties. Infection and fixation of the lens to the tissues are frequent concerns, 
and a number of measures are being used to address them. Transplantation of retinal pigmented epithe- 
lium can be used in the treatment of adult onset blindness; the challenge is developing readily detachable 
or absorbable materials on which to culture sheets of these cells. 

Space-Filling Implants 

Breast implants are common space-filling implants. At one time, the enlargement of breasts was done 
with various materials such as paraffin wax and silicone fluids, by direct injection or by enclosure in a 
rubber balloon. Several problems have been associated with directly injected implants, including pro- 
gressive instability and ultimate loss of original shape and texture as well as infection and pain. One of 
the early efforts in breast augmentation was to implant a sponge made of polyvinyl alcohol. However, 
soft tissues grew into the pores and then calcified with time, and the so-called marble breast resulted. 
Although the enlargement or replacement of breasts for cosmetic reasons alone is not recommended, 
prostheses have been developed for patients who have undergone radical mastectomy or who have 
nonsymmetrical deformities. The development of the tissue-engineered breast is ongoing, where fat or 
normal breast tissue may be derived from the patient and combined with an absorbable scaffold for 
transplantation. A silicone rubber bag filled with silicone gel and backed with polyester mesh to permit 
tissue ingrowth for fixation is a widely used prosthesis, primarily for psychological reasons. The artificial 
penis, testicles, and vagina fall into the same category as breast implants, in that they make use of silicones 
and are implanted for psychological reasons rather than to improve physical health. 

Fluid Transfer Implants 

Fluid transfer implants are required for cases such as hydrocephalus, urinary incontinence, glaucoma- 
related elevated intraocular pressure, and chronic ear infection. Hydrocephalus, caused by abnormally 
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high pressure of the cerebrospinal fluid in the brain, can be treated by draining the fluid (essentially an 
ultrafiltrate of blood) through a cannula. Earlier shunts had two one-way valves at either end. However, 
the more recent Ames shunt has simple slits at the discharging end, which opens when enough fluid 
pressure is exerted. The Ames shunt empties the fluid in the peritoneum while others drain into the 
blood stream through the right internal jugular vein or right atrium of the heart. The simpler peritoneal 
shunt shows less incidence of infection. 

The use of implants for correcting the urinary system has not been successful because of the difficulty 
of adjoining a prosthesis to the living system for achieving fluid tightness. In addition, blockage of the 
passage by deposits from urine, salt for example, and constant danger of infection are major concerns. 
Several materials have been used for producing these implants, with limited long-term success; these 
include Dacron™, glass, polyvinyl alcohol, polyethylene, rubber, silver, tantalum, Teflon™, and Vital- 
lium™. Tissue engineered devices also have application in urinary repair; for example, an alginate- 
chondrocyte system has been used clinically to treat vesicoureteral reflux. Preliminary results suggest that 
by transplanting the hydrogel system as a bulking agent below a refluxing ureter, neocartilage gradually 
develops to correct the reflux. Uroepithelial cells, combined with porous, absorbable polyester matrices 
show promise in the replacement of urologic tissues. 

Drainage tubes, which are impermanent implants for chronic ear infection, can be made from poly- 
tetrafluoroethylene (Teflon™). Glaucoma-related elevated intraocular pressure may be relieved by 
implanting a tube connecting the anterior eye chamber to the external subconjunctival space in order to 
direct aqueous humor. Complications can arise with occlusion of the tube due to wound healing processes; 
however, researchers are investigating combining this device with an absorbable drug delivery plug to 
regulate flow and deliver drugs to regulate the wound healing process. 

Technologies of Emerging Interest 

A new process for uniaxial solid-state orientation, using a range of compressive forces and temperatures, 
has been developed to produce stock sheets of polyether-ether ketone (PEEK) for machining dental 
implants with substantially increased strength and modulus as compared to their unoriented counter- 
parts. 

Surface treatment of materials is another emerging interest, a process potentially influencing both 
surface charge, topography, and conductivity. The former has obvious effects on cell adhesion and 
integration of traditional implants with surrounding tissues, and it has a profound effect on the success 
or failure of a tissue-engineered device. 

Pertinent to the effect of surface chemistry and texture on tissue regeneration/ingrowth, recent studies 
on surface-phosphonylation to create hydroxyapatite-like substrates show that (1) surface-microtextured 
polypropylene and polyethylene transcortical implants in goat tibia having phosphonate functionalities 
(with or without immobilized calcium ions) do induce bone ingrowth, and (2) microtexture and surface 
phosphonylated (with and without immobilized calcium ions) rods made of PEEK and similarly treated 
rods of carbon fiber-reinforced PEEK induce bone ingrowth when implanted in the toothless region of 
the lower jaw of goats. The use of surface-phosphonylated and post-treated (with a bridging agent) 
UHMW-PE fibers and fabric do produce high strength and modulus composites at exceptionally low 
filler loading. Among these composites are those based on methyl methacrylate matrices, similar to those 
used in bone cement. 

Conductivity manipulation may be extremely useful in such areas as biosensor design. Surface-con- 
ducting phosphonylated, ultra-high molecular weight polyethylene may be formed by exposure to aque- 
ous pyrole solution. Preliminary results suggest that this is a stable process, yielding no apparent cytotoxic 
effects due to the material or leachables. New surface treatments will be instrumental in the development 
of the two rapidly expanding areas of biosensor design and tissue engineering, both areas which encom- 
pass non-blood-interfacing soft tissue implants. 
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44.1 Bone Repair and Joint Implants 

S-H Park, A. Llinas, and V.K. Goel 

The use of biomaterials to restore the function of traumatized or degenerated connective tissues and thus 
improve the quality of life of a patient has become widespread. In the past, implants were designed with 
insufficient cognizance of biomechanics. Accordingly, the clinical results were not very encouraging. An 
upsurge of research activities into the mechanics of joints and biomaterials has resulted in better designs 
with better in vivo performance. The improving long-term success of total joint replacements for the 
lower limb is testimony to this. As a result, researchers and surgeons have developed and used fixation 
devices for the joints, including artificial spine discs. A large number of devices are also available for the 
repair of the bone tissue. This chapter provides an overview of the contemporary scientific work related 
to the use of biomaterials for the repair of bone (e.g., fracture) and joint replacements ranging from a 
hip joint to a spine. 

Long Bone Repair 

The principal functions of the skeleton are to provide a frame to support the organ-systems, and to 
determine the direction and range of body movements. Bone provides an anchoring point (insertion), 
for most skeletal muscles and ligaments. When the muscles contract, long bones act as levers, with the 
joints functioning as pivots, to cause body movement. 

Bone is the only tissue able to undergo spontaneous regeneration and to remodel its micro- and macro- 
structure. This is accomplished through a delicate balance between an osteogenic (bone forming) and 
osteoclastic (bone removing) process [Brighton, 1984]. Bone can adapt to a new mechanical environment 
by changing the equilibrium between osteogenesis and osteoclasis. These processes will respond to 
changes in the static and dynamic stress applied to bone; that is, if more stress than the physiological is 
applied, the equilibrium tilts toward more osteogenic activity. Conversely, if less stress is applied the 
equilibrium tilts toward osteoclastic activity (this is known as Wolff's Law of bone remodeling) [Wolff, 
1986]. 
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Nature provides different types of mechanisms to repair fractures in order to be able to cope with 
different mechanical environments about a fracture [Hulth, 1989; Schenk, 1992]. For example, incomplete 
fractures (cracks), which only allow micromotion between the fracture fragments, heal with a small 
amount of fracture-line callus, known as primary healing. In contrast, complete fractures which are 
unstable, and therefore generate macromotion, heal with a voluminous callus stemming from the sides 
of the bone, known as secondary healing [Hulth, 1989; Brighton, 1984]. 

The goals of fracture treatment are obtaining rapid healing, restoring function, and preserving cosmesis 
without general or local complications. Implicit in the selection of the treatment method is the need to 
avoid potentially deleterious conditions, for example, the presence of excessive motion between bone 
fragments which may delay or prevent fracture healing [Brand and Rubin, 1987; Brighton, 1984]. 

Each fracture pattern and location results in a unique combination of characteristics ("fracture per- 
sonality") that require specific treatment methods. The treatments can be non-surgical or surgical. 
Examples of non-surgical treatments are immobilization with casting (plaster or resin) and bracing with 
a plastic apparatus. The surgical treatments are divided into external fracture fixation, which does not 
require opening the fracture site, or internal fracture fixation, which requires opening the fracture. 

With external fracture fixation, the bone fragments are held in alignment by pins placed through the 
skin onto the skeleton, structurally supported by external bars. With internal fracture fixation, the bone 
fragments are held by wires, screws, plates, and/or intramedullary devices. Figure 44.1a and b show 
radiographs of externally and internally fixed fractures. 

All the internal fixation devices should meet the general requirement of biomaterials, that is, biocom- 
patability, sufficient strength within dimensional constraints, and corrosion resistance. In addition, the 
device should also provide a suitable mechanical environment for fracture healing. From this perspective, 
stainless steel, cobalt-chrome alloys, and titanium alloys are most suitable for internal fixation. Detailed 
mechanical properties of the metallic alloys are discussed in the chapter on metallic biomaterials. Most 
internal fixation devices persist in the body after the fracture has healed, often causing discomfort and 
requiring removal. Recently, biodegradable polymers, e.g., polylactic acid (PLA) and polyglycolic acid 
(PGA), have been used to treat minimally loaded fractures, thereby eliminating the need for a second 
surgery for implant removal. A summary of the basic application of biomaterials in internal fixation is 
presented in Table 44.1. A description of the principal failure modes of internal fixation devices is 
presented in Table 44.2. 

Wires 

Surgical wires are used to reattach large fragments of bone, like the greater trochanter, which is often 
detached during total hip replacement. They are also used to provide additional stability in long-oblique 
or spiral fractures of long bones which have already been stabilized by other means (Fig. 44.1b). Similar 
approaches, based on the use of wires, have been employed to restore stability in the lower cervical spine 
region and in the lumbar segment as well (Fig. 44.1c). 

Twisting and knotting is unavoidable when fastening wires to bone; however, by doing so, the strength 
of the wire can be reduced by 25% or more due to stress concentration [Tencer et al., 1993]. This can 
be partially overcome by using a thicker wire, since its strength increases directly proportional to its 
diameter. The deformed regions of the wire are more prone to corrosion than the un-deformed because 
of the higher strain energy. To decrease this problem and ease handling, most wires are annealed to 
increase the ductility. 

Braided multistrain (multifilament) wire is an attractive alternative because it has a similar tensile 
strength than a monofilament wire of equal diameter, but more flexibility and higher fatigue strength 
[Taitsman and Saha, 1977]. However, bone often grows into the grooves of the braided multistrain wire, 
making it exceedingly difficult to remove, since it prevents the wire from sliding when pulled. When a 
wire is used with other metallic implants, the metal alloys should be matched to prevent galvanic corrosion 
[Park and Lakes, 1992]. 
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FIGURE 44.1 Radiographs of (a) tibial fracture fixed with four pins and an external bar; (b) a total hip joint 
replacement in a patient who sustained a femoral fracture and was treated with double bone plates, screws, and 
surgical wire (arrows); (c) application of wires, screws, and plates in the spine. 

Pins 

Straight wires are called Steinmann pins; however if the pin diameter is less than 2.38 mm, it is called 
Kirschner wire. They are widely used primarily to hold fragments of bones together provisionally or 
permanently and to guide large screws during insertion. To facilitate implantation, the pins have different 
tip designs which have been optimized for different types of bone (Fig. 44.2). The trochar tip is the most 
efficient in cutting; hence, it is often used for cortical bone. 

The holding power of the pin comes from elastic deformation of surrounding bone. In order to increase 
the holding power to bone, threaded pins are used. Most pins are made of 316L stainless steel; however, 
recently, biodegradable pins made of polylactic or polyglycolic acid have been employed for the treatment 
of minimally loaded fractures. 

The pins can be used as part of elaborate frames designed for external fracture fixation (Fig. 44.1a). 
In this application, several pins are placed above and below the fracture, but away from it. After the 
fracture fragments are manually approximated (reduced) to resemble the intact bone, the pins are 
attached to various bars, which upon assembly will provide stability to the fracture. 
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FIGURE 44.1 (continued) 



Screws 

Screws are the most widely used devices for fixation of bone fragments. There are two types of bone 
screws: (1) cortical bone screws, which have small threads, and (2) cancellous screws, which have large 
threads to get more thread-to-bone contact. They may have either V or buttress threads (Fig. 44.3). The 
cortical screws are subclassified further according to their ability to penetrate, into self-tapping and non- 
self-tapping (Fig. 44.3). The self-tapping screws have cutting flutes which thread the pilot drill-hole during 
insertion. In contrast, the non-self-tapping screws require a tapped pilot drill-hole for insertion. 

The holding power of screws can be affected by the size of the pilot drill-hole, the depth of screw 
engagement, the outside diameter of the screw, and quality of the bone [Cochran, 1982; DeCoster et al., 
1990] . Therefore, the selection of the screw type should be based on the assessment of the quality of the 
bone at the time of insertion. Under identical conditions, self-tapping screws provide a slightly greater 
holding power than non-self-tapping screws [Tencer et al., 1993]. 

Screw pullout strength varies with time after insertion in vivo, and it depends on the growth of bone into 
the screw threads and/or resorption of the surrounding bone [Schatzker et al., 1975] . The bone immediately 
adjacent to the screw often undergoes necrosis initially, but if the screw is firmly fixed, when the bone 
revascularizes, permanent secure fixation may be achieved. This is particularly true for titanium alloy screws 
or screws with a roughened thread surface, with which bone ongrowth results in an increase in removal 
torque [Hutzschenreuter and Brtimmer, 1980]. When the screw is subject to micro- or macro-movement, 
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TABLE 44. 1 Biomaterials Applications in Internal Fixation 



Materials 
Stainless Steel 

Ti alloy 

Co-Cr alloys (wrought) 



Properties 
Low cost, easy fabrication 



High cost 

Low density and modulus 

Excellent bony contact 

High cost 

High density and modulus 

nifflrult fabrication 



Application 

Surgical wire (annealed) 

Pin, plate, screw 

IM nail 

Surgical wire 

Plate, screws, IM nails 

Surgical Wire 
IM Nails 



Poly lactic acid 
Poly glycolic acid 
Nylon 



Resorbable 
Weak strength 
Non-resorbable plastic 



Pin, screw 
Cerclage band 
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TABLE 44.2 Failure Modes of Internal Fixation Devices 



Failure Mode 
Overload 

Fatigue 

Corrosion 

Loosening 



Failure Location 

Bone fracture site 
Implant screw hole 
Screw thread 
Bone fracture site 
Implant screw hole 
Screw thread 

Screw head-plate hole 
Bent area 



Screw 



Reasons for Failure 

Small size implant 
Unstable reduction 
Early weight bearing 
Early weight bearing 
Small size implant 
Unstable reduction 
Fracture non-union 
Different alloy implants 
Over-tightening screw 
Misalignment of screw 
Over-bent 
Motion 

Wrong choice of screw type 
Osteoporotic bone 




FIGURE 44.2 Types of metallic pin tip: (a) trochar end, and (b) diamond end. 

the contacting bone is replaced by a membrane of fibrous tissue, the purchase is diminished, and the 
screw loosens. 

The two principal applications of bone screws are (1) as interfragmentary fixation devices to "lag" or 
fasten bone fragments together, or (2) to attach a metallic plate to bone. Interfragmentary fixation is 
used in most fractures involving cancellous bone and in those oblique fractures in cortical bone. In order 
to lag the fracture fragments, the head of the screw must engage the cortex on the side of insertion 
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FIGURE 44.3 Bone screws: (a) a self-tapping V-threaded (has a cutting flute), and (b) a non-self-tapping, buttress 
threaded screw. 

without gripping the bone, while the threads engage cancellous bone and/or the cortex on the opposing 
side. When screws are employed for bone plate fixation, the bone screw threads must engage both cortices. 
Screws are also used for the fixation of spine fractures (for plate fixation or compression of bone fragment; 
Fig. 44.1c). 

Plates 

Plates are available in a wide variety of shapes and are intended to facilitate fixation of bone fragments. 
They range from the very rigid, intended to produce primary bone healing, to the relatively flexible, 
intended to facilitate physiological loading of bone. 

The rigidity and strength of a plate in bending depend on the cross-sectional shape (mostly thickness) 
and material of which it is made. Consequently, the weakest region in the plate is the screw hole, especially 
if the screw hole is left empty, due to a reduction of the cross-sectional area in this region. The effect of 
the material on the rigidity of the plate is defined by the elastic modulus of the material for bending, 
and by the shear modulus for twisting [Cochran, 1982]. Thus, given the same dimensions, a titanium 
alloy plate will be less rigid than a stainless steel plate, since the elastic modulus of each alloy is 1 10 GPa 
and 200 GPa, respectively. 

Stiff plates often shield the underlying bone from the physiological loads necessary for its healthful 
existence [O'Slullivan et al., 1989; Perren et al., 1988]. Similarly, flat plates closely applied to the bone 
prevent blood vessels from nourishing the outer layers of the bone [Perren, 1988]. For these reasons, the 
current clinical trend is to use more flexible plates (titanium alloy) to allow micromotion, and low- 
contact plates (only a small surface of the plate contacts the bone, LCP), to allow restoration of vascularity 
to the bone [Uhthoff abd Finnegan, 1984; Claes, 1989]. The underlying goals of this philosophical change 
are to increase the fracture healing rate, to decrease the loss of bone mass in the region shielded by the 
plate, and consequently, to decrease the incidence of re-fractures which occur following plate removal. 

The interaction between bone and plate is extremely important since the two are combined into a 
composite structure. The stability of the plate-bone composite and the service life of the plate depends 
upon accurate fracture reduction. The plate is most resistant in tension; therefore, in fractures of long 
bones, the plate is placed along the side of the bone which is typically loaded in tension. Having excellent 
apposition of the bone fragments, as well as developing adequate compression between them, is critical 
in maintaining the stability of the fixation and preventing the plate from repetitive bending and fatigue 
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FIGURE 44.4 Principle of a dynamic compression plate for fracture fixation. During tightening a screw, the screw 
head slides down on a ramp in a plate screw hole which results in pushing the plate away from a fracture end and 
compressing the bone fragments together. 

failure. Interfragmentary compression also creates friction at the fracture surface, increasing resistance 
to torsional loads [Perren, 1991; Tencer et al., 1993]. On the contrary, too much compression causes 
micro fractures and necrosis of contacting bone due to the collapse of vesicular canals. 

Compression between the fracture fragments can be achieved with a special type of plate called a 
dynamic compression plate (DCP). The dynamic compression plate has elliptic shape screw holes with its 
long axis oriented parallel to that of the plate. The screw hole has a sliding ramp to the long axis of the 
plate. Figure 44.4 explains the principle of the dynamic compression plate. 

Bone plates are often contoured in the operating room in order to conform to an irregular bone shape 
to achieve maximum contact of the fracture fragments. However, excessive bending decreases the service 
life of the plate. The most common failure modes of a bone plate-screw fixation are screw loosening and 
plate failure. The latter typically occurs through a screw hole due to fatigue and/or crevice corrosion 
[Weinstein et al., 1979]. 

In the vicinity of the joints, where the diameter of long bones is wider, the cortex thinner, and cancellous 
bone abundant, plates are often used as a buttress or retaining wall. A buttress plate applies force to the 
bone perpendicular to the surface of the plate, and prevents shearing or sliding at the fracture site. Buttress 
plates are designed to fit specific anatomic locations and often incorporate other methods of fixation 
besides cortical or cancellous screws, for example, a large lag screw or an I-beam. For the fusion of 
vertebral bodies following diskectomy, spinal plates are used along with bone grafts. These plates are 
secured to the vertebral bodies using screws (Fig. 44.1c). Similar approaches have been employed to 
restore stability in the thoracolumbar and cervical spine region as well. Figure 44.5 illustrates a variety 
of types of bone plates. 

Intramedullary Nails 

Intramedullary devices (IM nails) are used as internal struts to stabilize long bone fractures. IM nails are 
also used for fixation of femoral neck or intertrochanteric bone fractures; however, this application 
requires the addition of long screws. A gamut of designs are available, going from solid to cylindrical, 
with shapes such as cloverleaf, diamond, and "C" (slotted cylinders). Figure 44.6 shows a variety of 
intramedullary devices. 

Compared to plates, IM nails are better positioned to resist multi-directional bending than a plate or 
an external fixator, since they are located in the center of the bone. However, their torsional resistance 
is less than that of the plate [Cochran, 1982]. Therefore, when designing or selecting an IM nail, a high 
polar moment of inertia is desirable to improve torsional rigidity and strength. The torsional rigidity is 
proportional to the elastic modulus and the moment of inertia. For nails with a circular cross-section, 
torsional stiffness is proportional to the fourth power of the nail's radius. The wall thickness of the nail 
also affects the stiffness. A slotted, open section nail is more flexible in torsion and bending, and it allows 
easy insertion into a curved medullary canal for example, that of the femur [Tencer, 1993]. However, in 
bending, a slot is asymmetrical with respect to rigidity and strength. For example, a slotted nail is strongest 
when bending is applied so that the slot is near the neutral plane; the nail is weakest when oriented so 
that the slot is under tension. 
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FIGURE 44.5 Bone Plates: (a) dynamic compression plate, (b) hybrid compression plate (lower part has dynamic 
compression screw holes), (c) reconstruction bone plate (easy contouring), (d) buttress bone plate, (e) L shaped 
buttress plate, (f) nail plate (for condylar fracture), and (g) dynamic compression hip screw. 

In addition to the need to resist bending and torsion, it is vital for an IM nail to have a large contact 
area with the internal cortex of the bone to permit torsional loads to be transmitted and resisted by shear 
stress. Two different concepts are used to develop shear stress: (1) a three-point, high pressure contact, 
achieved with the insertion of curved pins, and (2) a positive interlocking between the nail and intramed- 
ullary canal, to produce a unified structure. Positive interlocking can be enhanced by reaming the 
intramedullary canal. Reaming permits a larger, longer, nail-bone contact area and allows the use of a 
larger nail, with increased rigidity and strength [Kessler et al„ 1986]. 

The addition of screws through the bone and nail, proximal and distal to the fracture, known as 
interlocking, increases torsional stability and prevents shortening of the bone, especially in unstable 
fractures [Perren, 1989] . The IM nail which has not been interlocked allows interfragmentary compressive 
force due to its low resistance to axial load. Another advantage of IM nails is that they do not require 
opening the fracture site, since they can be inserted through a small skin incision, typically located in 
one extreme of the bone. The insertion of an intramedullary nail, especially those that require reaming 
of the medullary canal, destroys the intramedullary vessels which supply two thirds of the cortex. However, 
this is not of clinical significance because revascularization occurs rapidly [Kessler et al., 1986; O'Slullivan 
etal., 1989]. 

Joint Replacements 

Our ability to replace damaged joints with prosthetic implants has brought relief to millions of patients 
who would otherwise have been severely limited in their most basic activities and doomed to a life in 
pain. It is estimated that about 16 million people in the U.S. are affected by osteoarthritis, one of the 
various conditions that may cause joint degeneration and may lead a patient to a total joint replacement. 
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FIGURE 44.6 Intramedullary devices: (a) Gross-Kempf (slotted), (b) Uniflex (Ti alloy, slotted), (c) Kuntscher, 
(d) Samson, (e) Harris, (f) Brooker- Wills distal locking pin, and (g) Enders pins. 

Joint degeneration is the end-stage of a process of destruction of the articular cartilage, which results 
in severe pain, loss of motion, and occasionally, an angular deformity of the extremity [Buckwalter et al., 
1993]. Unlike bone, cartilage has a very limited capacity for repair [Salter, 1989]. Therefore, when exposed 
to a severe mechanical, chemical, or metabolic injury, the damage is permanent and often progressive. 

Under normal conditions, the functions of cartilage are to provide a congruent articulation between 
bones, to transmit load across the joint, and to allow low-friction movements between opposing joint 
surfaces. The sophisticated way in which these functions are performed becomes evident from some of 
the mechanical characteristics of normal cartilage. For example, due to the leverage geometry of the 
muscles and the dynamic nature of human activity, the cartilage of the hip is exposed to about eight 
times body weight during fast walking [Paul, 1976]. Over a period of 10 years, an active person may 
subject the cartilage of the hip to more than 17 million weight bearing cycles [Jeffery, 1994]. From the 
point of view of the optimal lubrication provided by synovial fluid, cartilage's extremely low frictional 
resistance makes it 15 times easier to move opposing joint surfaces than to move an ice-skate on ice 
[Mow and Hayes, 1991; Jeffery, 1994]. 

Cartilage functions as a unit with subchondral bone, which contributes to shock absorption by 
undergoing viscoelastic deformation of its fine trabecular structure. Although some joints, like the hip, 
are intrinsically stable by virtue of their shape, the majority require an elaborate combination of ligaments, 
meniscus, tendons, and muscles for stability. Because of the large multidirectional forces that pass through 
the joint, its stability is a dynamic process. Receptors within the ligaments fire when stretched during 
motion, producing an integrated muscular contraction that provides stability for that specific displace- 
ment. Therefore, the ligaments are not passive joint restraints as once believed. The extreme complexity 
and high level of performance of biologic joints determine the standard to be met by artificial implants. 

Total joint replacements are permanent implants, unlike those used to treat fractures, and the extensive 
bone and cartilage removed during implantation makes this procedure irreversible. Therefore, when 
faced with prosthesis failure and the impossibility to reimplant, the patient will face severe shortening 
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of the extremity, instability or total rigidity of the joint, difficulty in ambulation, and often will be confined 
to a wheel chair. 

The design of an implant for joint replacement should be based on the kinematics and dynamic load 
transfer characteristic of the joint. The material properties, shape, and methods used for fixation of the 
implant to the patient determines the load transfer characteristics. This is one of the most important 
elements that determines long-term survival of the implant, since bone responds to changes in load 
transfer with a remodeling process, mentioned earlier as Wollff's law. Overloading the implant-bone 
interface or shielding it from load transfer may result in bone resorption and subsequent loosening of 
the implant [Sarmiento et al., 1990]. The articulating surfaces of the joint should function with minimum 
friction and produce the least amount of wear products [Charnley, 1973] . The implant should be securely 
fixed to the body as early as possible (ideally immediately after implantation); however, removal of the 
implant should not require destruction of a large amount of surrounding tissues. Loss of tissue, especially 
of bone, makes re-implantation difficult and often shortens the life span of the second joint replacement 
[Dupont and Charnley, 1972]. 

Decades of basic and clinical experimentation have resulted in a vast number of prosthetic designs 
and material combinations (Tables 44.3 and 44.4) [Griss, 1984]. In the following section, the most relevant 
achievements in fixation methods and prosthetic design for different joints will be discussed at a con- 
ceptual level. Most joints can undergo partial replacement (hemiarthroplasty), that is, reconstruction of 
only one side of the joint while retaining the other. This is indicated in selected conditions when global 

TABLE 44.3 Biomaterials for Total Joint Replacements 



Materials 


Applications 


Properties 


Co-Cr alloy 


Stem, head (ball) 


Heavy, hard, stiff 


(casted or wrought) 


Cup, porous coating 
Metal backing 


High wear resistance 


Ti alloy 


Stem, porous coating 


Low stiffness 




Metal backing 


Low wear resistance 


Pure titanium 


Porous coating 


Excellent osseousintegration 


Tantalum 


Porous structure 


Excellent osseousintegration 
Good mechanical strength 


Alumina 


Ball, cup 


Hard, brittle 

High wear resistance 


Zirconia 


Ball 


Heavy and high toughness 
High wear resistance 


UHMWPE 


Cup 


Low friction, wear debris 
Low creep resistance 


PMMA 


Bone cement fixation 


Brittle, weak in tension 
Low fatigue strength 



Note: Stem: femoral hip stem/chondylar knee stem; head: femoral head of 
the hip stem; cup: acetabular cup of the hip. 



TABLE 44.4 Types of Total Joint Replacements 

Joint Types 

Hip Ball and socket 

Knee Hinged, semiconstrained, surface replacement 

Unicompartment or bicompartment 

Shoulder Ball and socket 

Ankle Surface replacement 

Elbow Hinged, semiconstrained, surface replacement 

Wrist Ball and socket, space filler 

Finger Hinged, space filler 
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joint degeneration has not taken place. This section will focus on total joint replacement, since this allows 
for a broader discussion of the biomaterials used. 

Implant Fixation Method 

The development of a permanent fixation mechanism of implants to bone has been one of the most 
formidable challenges in the evolution of joint replacement. There are three types of methods of fixation: 
(1) mechanical interlock, which is achieved by press-fitting the implant [Cameron, 1994a], by using 
polymethylmethacrylate, which is called bone cement, as a grouting agent [Charnley, 1979], or by using 
threaded components [Albrektsson et al., 1994]; (2) biological fixation, which is achieved by using tex- 
tured or porous surfaces, which allow bone to grow into the interstices [Cameron, 1994b]; and (3) direct 
chemical bonding between implant and bone, for example, by coating the implant with calcium hydroxya- 
patite, which has a similar mineral composition to bone [Morscher, 1992]. Recently, direct bonding with 
bone was observed with Bioglass, a glass-ceramic, through selective dissolution of the surface film [Hench, 
1994]; however, the likelihood of its clinical application is still under investigation. 

Each of the fixation mechanisms has an idiosyncratic behavior, and their load transfer characteristics 
as well as the failure mechanisms are different. Further complexity arises from prostheses which combine 
two or more of the fixation mechanisms in different regions of the implant. Multiple mechanisms of 
fixation are used in an effort to customize load transfer to requirements of different regions of bone in 
an effort to preserve bone mass. Loosening, unlocking, or de-bonding between implant and bone 
constitute some of the most important mechanisms of prosthetic failure. 

Bone Cement Fixation 

Fixation of implants with polymethylmethacrylate (PMMA, bone cement) provides immediate stability, 
allowing patients to bare all of their weight on the extremity at once. In contrast, implants which depend 
on bone ingrowth require the patient to wait about 12 weeks to bear full weight. 

Bone cement functions as a grouting material; consequently, its anchoring power depends on its ability 
to penetrate between bone trabeculae during the insertion of the prosthesis [Charnley, 1979]. Being a 
viscoelastic polymer, it has the ability to function as a shock absorber. It allows loads to be transmitted 
uniformly between the implant and bone, reducing localized high-contact stress. 

Fixation with bone cement creates bone-cement and cement-implant interfaces, and loosening may 
occur at either one. The mechanisms to enhance the stability of the metal-cement interface constitute 
an area of controversy in joint replacement. Some investigators have focused their efforts on increasing 
the bond between metal and cement by roughening the implant, or pre-coating it with PMMA to prevent 
sinking of the prosthesis within the cement mantle, and circulation of debris within the interface [Park 
et al., 1978; Barb etal., 1982; Harris, 1988]. In contrast, others polish the implant surfaces and favor 
wedge-shaped designs which encourage sinking of the prosthesis within the cement, to profit from the 
viscoelastic deformation of the mantle by loading the cement in compression [Ling, 1992]. 

The problems with bone-cement interface may arise from intrinsic factors, such as the properties of 
the PMMA and bone, as well as extrinsic factors such as the cementing technique. Refinements in the 
cementing technique, such as pulsatile lavage of the medullary canal, optimal hemostasis of the cancellous 
bone, as well as drying of the medullary canal and pressurized insertion of the prosthesis, can result in a 
cement-bone interface free of gaps, with maximal interdigitation with cancellous bone [Harris and Davies, 
1988]. Despite optimal cementing technique, a thin fibrous membrane may appear in various regions of 
the interface, due to various factors such as the toxic effect of free methylmethacrylate monomer, necrosis 
of the bone resulting from high polymerization temperatures, or devascularization during preparation of 
the canal [Goldring etal., 1983]. Although a fibrous membrane in the bone cement interface may be 
present in a well-functioning implant, it may also increase in width over time (most probably as a result 
of the accumulation of polyethylene wear debris from the bearing couple), and may result in macromo- 
tion, bone loss, and eventual loosening [Ebramzadeh et al., 1994]. Finally, the cement strength itself may 
be improved by removing air bubbles by mixing monomer and polymer under vacuum and/or centri- 
fuging it [Harris, 1988]. During implantation, various devices are used to guarantee uniform thickness 
of the mantle to minimize risk of fatigue failure of the cement [Oh et al., 1978]. 
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Porous Ingrowth Fixation 

Bone ingrowth can occur with inert implants which provide pores larger than 25 um in diameter, which 
is the size required to accommodate an osteon. For the best ingrowth in clinical practice, pore size range 
should be 100 to 350 urn and pores should be interconnected with each other with similar size of opening 
[Cameron, 1994b] . Implant motion inhibits bony ingrowth and large bone-metal gaps prolong or prevent 
the osseointegration [Curtis et al., 1992]. Therefore, precise surgical implantation and prevention of 
post-operative weight bearing for about 12 weeks are required for implant fixation. 

The porous coated implants require active participation of the bone in the fixation of the implant, in 
contrast to cementation where the bone has a passive role. Therefore, porous coated implants are best 
indicated in conditions where the bone mass is near-normal. The implant design should allow ingrown 
bone to be subjected to continuous loading within a physiologic range in order to prevent loss of bone 
mass due to stress shielding. Porous ingrowth prostheses are notoriously difficult to remove, and 
substantial bone damage often results from the removal process. For this reason, they should be optimized 
to provide predictable ingrowth with a minimal area of surgically accessible porous coated surface. 

Commercially pure titanium, titanium alloy, tantalum, and calcium hydroxyapatite (HA) are currently 
used as porous coating materials. With pure titanium, three different types of porosity can be achieved: 
(1) plasma spray coating, (2) sintering of wire mesh, or (3) sintering of beads on an implant surface 
(Fig. 44.7) [Morscher, 1992]. Thermal processing of the porous coating may weaken the underlying metal 
(implant). Additional problems may result from flaking of the porous coating materials, since loosened 
metal particles may cause severe wear when they migrate into the articulation (bearing couple) [Agins 
et al., 1988]. A thin calcium hydroxyapatite coating over the porous titanium surface has been used in 
an effort to enhance osseointegration; however, it improves only early-stage interfacial strength [Fried- 
man, 1992; Capello and Bauer, 1994]. The long-term degradation and/or resorption of hydroxyapatite 
is still under investigation. 

Recently, a cellular, structural biomaterial comprised of 15 to 25% tantalum (75 to 85% porous) has 
been developed. The average pore size is about 550 um, and the pores are fully interconnected. The 
porous tantalum is a bulk material (i.e., not a coating) and is fabricated via a proprietary chemical vapor 




FIGURE 44.7 Scanning electron micrographs of four different types of porous structures: (a) plasma sprayed coating 
(7X), (b) sintered wire mesh coating (7X), (c) sintered beads coating (20X), and (d) Hedrocell porous tantalum (50X). 
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infiltration process in which pure tantalum is uniformly precipitated onto a reticulated vitreous carbon 
skeleton. The porous tantalum possesses sufficient compressive strength for most physiological loads, 
and tantalum exhibits excellent biocompatibility [Black, 1994] . This porous tantalum can be mechanically 
attached or diffusion bonded to substrate materials such as Ti alloy. Current commercial applications 
included polyethylene- porous tantalum acetabular components for total hip joint replacement and repair 
of defects in the acetabulum. 

Total Joint Replacements 

Hip Joint Replacement 

The prosthesis for total hip replacement consists of a femoral component and an acetabular component 
(Fig. 44.8a). The femoral stem is divided into head, neck, and shaft. The femoral stem is made of Ti alloy 
or CoCr alloy (316L stainless steel was used earlier) and is fixed into a reamed medullary canal by 
cementation or press fitting. The femoral head is made of CoCr alloy, alumina, or zirconia. Although Ti 
alloy heads function well under clean articulating conditions, they have fallen into disuse because of their 
low wear resistance to third bodies, e.g., bone or cement particles. The acetabular component is generally 
made of ultra-high molecular weight polyethylene (UHMWPE). 
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FIGURE 44.8(a) Radiograph of bone cement fixed Charnley hip joint (monolithic femoral and acetabular com- 
ponent, 15-year follow-up). 
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FIGURE 44.8(b) Modular total hip system: head, femoral stem, porous coated proximal wedge, porous coated 
metal backing for cup, UHMWPE cup, and fixation screws. 

The prostheses can be monolithic when they consist of one part, or modular when they consist of two 
or more parts and require assembly during surgery. Monolithic components are often less expensive, and 
less prone to corrosion or disassembly. However, modular components allow customizing of the implant 
intraoperatively, and during future revision surgeries, for example, modifying the length of an extremity 
by using a different femoral neck length after the stem has been cemented in place, or exchanging a worn 
polyethylene bearing surface for a new one without removing the well-functioning part of the prosthesis 
from the bone. In modular implants (Fig. 44.8b), the femoral head is fitted to the femoral neck with a 
Morse taper, which allows changes in head material and size, and neck length. Table 44.5 illustrates the 
most frequently used combinations of material in total hip replacement. 

When the acetabular component is monolithic, it is made of UHMWPE; when it is modular, it consists 
of a metallic shell and a UHMWPE insert. The metallic shell seeks to decrease the microdeformation of 
the UHMWPE and to provide a porous surface for fixation of the cup [Skinner, 1992]. The metallic shell 
allows worn polyethylene liners to be exchanged. In cases of repetitive dislocation of the hip after surgery, 
the metallic shell allows replacing the old liner with a more constrained one, to provide additional stability. 
Great effort has been placed on developing an effective retaining system for the insert, as well as on 
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TABLE 44.5 Possible Combination of Total Hip Replacements 



FEMORAL COMPONENT ACETABULAR COMPONENT 

FIXATION STEM BALL CUP BACKING FIXATION 



PMMA v « PMMA 

CoCr alloy _ ► CoCr alloy 



CoCr alloy V J " Alumina „ UHMWPE \ \ ' METAL 







Screw, or 
Bone Ingrowth ^ A I /\ f\ \\ A *■ P ress fitting 

Ti alloy ^-^^ \ / \ ^^- — ■*" None 

Zirconia Alumina 

Bone Ingrowth 
Press fitting 



maximizing the congruity between insert and metallic shell (Fig. 44.8b). Dislodgment of the insert results 
in dislocation of the hip and damage of the femoral head, since it contacts the metallic shell directly. 
Micromotion between insert and shell produces additional polyethylene debris which can eventually 
contribute to bone loss [Friedman, 1994]. 

The hip joint is a ball-and-socket joint, which derives its stability from congruity of the implants, 
pelvic muscles, and capsule. The prosthetic hip components are optimized to provide a wide range of 
motion without impingement of the neck of the prosthesis on the rim of the acetabular cup to prevent 
dislocation. The design characteristics must enable implants to support loads that may reach more than 
eight times body weight [Paul, 1976]. Proper femoral neck length and correct restoration of the center 
of motion and femoral offset decrease the bending stress on the prosthesis-bone interface. High-stress 
concentration or stress shielding may result in bone resorption around the implant. For example, if the 
femoral stem is designed with sharp corners (diamond shaped in a cross-section), the bone in contact 
with the corners of the implant may necrose and resorb. 

Load bearing and motion of the prosthesis produces wear debris from the articulating surface, and 
from the interfaces were there is micromotion, e.g., stem-cement interface. Bone chip, cement chip, or 
broken porous coating are often entrapped in the articulating space and cause severe polyethylene wear 
(third-body wear). The principal source of wear under normal conditions is the UHMWPE bearing 
surface in the cup. Approximately 150,000 particles are generated with each step and a large proportion 
of these particles are smaller than 1 um. Cells from the immune system of the host, for example, 
macrophages, are able to identify the polyethylene particles as foreign and initiate a complex inflammatory 
response. This response may lead to rapid focal bone loss (osteolysis), bone resorption, loosening, and/or 
fracture of the bone. Recently, low-wear UHMWPE has been developed using a cross-linking of poly- 
ethylene molecular chains. There are several effective methods of cross-linking polyethylene, including 
irradiation of cross-linking, peroxide cross-linking, and silane cross-linking [Shen et al., 1996]. However, 
none of the cross-linked polyethylene has been clinically tested yet. Numerous efforts are underway to 
modify the material properties of articulating materials to harden and improve the surface finish of the 
femoral head [Friedman, 1994]. There is growing interest in metal-metal and ceramic-ceramic hip 
prostheses as a potential solution to the problem of osteolysis induced by polyethylene wear debris. 

Knee Joint Replacements 

The prosthesis for total knee joint replacement consists of femoral, tibial, and/or patellar components. 

Compared to the hip joint, the knee joint has a more complicated geometry and movement biomechanics, 



) 2000 by CRC Press LLC 




FIGURE 44.9 Various types of knee joints: (a) metal hinged, (b) hinged with plastic liner, (c) intramedullary fixed 
semiconstrained, (d) surface replacement, (e) uni-compartmental replacement, and (f) bi-compartmental replacement. 

and it is not intrinsically stable. In a normal knee, the center of movement is controlled by the geometry 
of the ligaments. As the knee moves, the ligaments rotate on their bony attachments and the center of 
movement also moves. The eccentric movement of the knee helps distribute the load throughout the 
entire joint surface [Burstein and Wright, 1993]. 

The prostheses for total knee replacement (Fig. 44.9) can be divided according to the extent to which 
they rely on the ligaments for stability: (1) Constrained: these implants have a hinge articulation, with 
a fixed axis of rotation, and are indicated when all of the ligaments are absent, for example in recon- 
structive procedures for tumoral surgery. (2) Semi-constrained: these implants control posterior dis- 
placement of the tibia on the femur and medial-lateral angulation of the knee, but rely on the remaining 
ligaments and joint capsule to provide the rest of the constraint. Semi-constrained prostheses are often 
used in patients with severe angular deformities of the extremities, or in those that require revision 
surgery, when moderate ligamentous instability has developed. (3) Non-constrained: these implants 
provide minimal or no constraint. The prosthesis that provides minimal constraint requires resection of 
the posterior cruciate ligament during implantation, and the prosthetic constraint reproduces that nor- 
mally provided by this ligament. The ones that provide no constraint spare the posterior cruciate ligament. 
These implants are indicated in patients who have joint degeneration with minimal or no ligamentous 
instability. As the degree of constraint increases with knee replacements, the need for the use of femoral 
and tibial intramedullary extensions of the prosthesis is greater, since the loads normally shared with the 
ligaments are then transferred to the prosthesis-bone interface. 

Total knee replacements can be implanted with or without cement, the latter relying on porous coating 
for fixation. The femoral components are typically made of CoCr alloy and the monolithic tibial com- 
ponents are made of UHMWPE. In modular components, the tibial polyethylene component assembles 
onto a titanium alloy tibial tray. The patellar component is made of UHMWPE, and a titanium alloy 
back is added to components designed for uncemented use. The relatively small size of the patellar 
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component compared to the forces that travel through the extensor mechanism, and the small area of 
bone available for anchorage of the prosthesis, make the patella vulnerable. 

The wear characteristic of the surface of tibial polyethylene is different from that of acetabular com- 
ponents. The point contact stress and sliding motion of the components result in delamination and 
fatigue wear of the UHMWPE [Walker, 1993]. Presumably because of the relatively larger particle size 
of polyethylene debris, osteolysis around a total knee joint is less frequent than in a total hip replacement. 

Ankle Joint Replacement 

Total ankle replacements have not met with as much success as total hip and knee replacements, and 
typically loosen within a few years of service [Claridge et al., 1991]. This is mainly due to the high load 
transfer demand over the relatively small ankle surface area, and the need to replace three articulating 
surfaces (tibial, talar, and fibular). The joint configurations that have been used are cylindrical, reverse 
cylindrical, and spherical. The materials used to construct ankle joints are usually CoCr alloy and 
UHMWPE. Degeneration of the ankle joint is currently treated with fusion of the joint, since prostheses 
for total ankle replacement are still considered to be under initial development. Figure 44.10 shows ankle 
and other total joint replacements. 

Shoulder Joint Replacements 

The prostheses for total shoulder replacement consist of a humeral and a glenoid component. Like the 
femoral stem, the humeral component can be divided into head, neck, and shaft. Variations in the length 
of the neck result in changes in the length of the extremity. Even though the patient's perception of length 
of the upper extremity is not as accurate as that of the lower extremity, the various lengths of the neck 
are used to fine-tune the tension of the soft tissues to obtain maximal stability and range of motion. 





FIGURE 44.10 Miscellaneous examples of prostheses for total joint replacement: (a) ankle, (b) socket-ball shoulder 
joint, (c) hinged elbow joint, and (d) encapsulated finger joint. 
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The shoulder has the largest range of motion in the body, which results from a shallow ball and socket 
joint, which allows a combination of rotation and sliding motions between the joint surfaces. To com- 
pensate for the compromise in congruity, the shoulder has an elaborate capsular and ligamentous 
structure, which provides the basic stabilization. In addition, the muscle girdle of the shoulder provides 
additional dynamic stability. A decrease in the radius of curvature of the implant to compensate for soft 
tissue instability will result in a decrease in the range of motion [Neer, 1990]. 

Elbow Joint Replacements 

The elbow joint is a hinge-type joint allowing mostly flexion and extension, but having a polycentric 
motion [Goel and Blair, 1985]. The elbow joint implants are either hinged, semi-constrained, or uncon- 
strained. These implants, like those of the ankle, have a high failure rate and are not used commonly. 
The high loosening rate is the result of high rotational moments, limited bone stock for fixation, and 
minimal ligamentous support [Morrey, 1993]. In contrast to fusions of the ankle which function well, 
fusions of the elbow result in a moderate degree of incapacitation. 

Finger Joint Replacements 

Finger joint replacements are divided into three types: (1) hinge, (2) polycentric, and (c) space-filler. The 
most widely used are the space-filler type. These are made of high performance silicone rubber (poly- 
dimethylsiloxane) and are stabilized with a passive fixation method. This method depends on the devel- 
opment of a thin, fibrous membrane between implant and bone, which allows pistoning of the prosthesis. 
This fixation can provide only minimal rigidity of the joint [Swanson, 1973] . Implant wear and cold flow 
associated with erosive cystic changes of adjacent bone have been reported with silicone implants [Carter 
etal., 1986; Maistrelli, 1994]. 

Prosthetic Intervertebral Disk 

Because of adjacent-level degeneration and other complications, such as non-fusion, alternatives to 
fusions have been proposed. One of the most recent developments for non-fusion treatment alternatives 
is replacement of the intervertebral discs [Hedman et al., 1991]. The goal of this treatment alternative is 
to restore the original mechanical function of the disc. One of the stipulations of artificial disc replacement 
is that remaining osseous spinal and paraspinal soft tissue components are not compromised by patho- 
logic changes. Several artificial disk prostheses have been developed to achieve these goals (Fig. 44.11). 

Prostheses for Limb Salvage 

Prosthetic implant technology has brought new lifestyles to thousands of patients who would lose their 
limbs due to bone cancer. In the past, the treatment for primary malignant bone cancer of the extremities 
was amputation. Significant advances in bone tumor treatment have taken place during the last two 
decades. The major treatment methods for limb reconstruction following bone tumor resection are 
resection arthrodesis (fusion of two bones), allograft-endoprosthetic composite, and endoprosthetic 
reconstruction. Endoprosthetic reconstruction is an extension of a total joint replacement(s) component 
to the resected bone area, and is the most popular option due to an advantage of fast postoperative 
recovery (Fig. 44.12). Therefore, material and fixation methods for limb salvage endoprostheses are 
exactly the same as for total joint replacements. Femur, tibia, humerus, pelvis, and scapular are often 
resected and replaced endoprostheses. Similar to total joint replacements, disadvantages of the endopros- 
thetic reconstruction are prosthesis loosening due to polyethylene wear and cement failure, and mechan- 
ical failure of the prostheses. 

Most of the endoprostheses for limb salvage are of the expandable type [Ward et al., 1996]. Expandable 
endoprostheses are required for children who have a potential for skeletal growth. Several expandable 
prostheses require an open surgical procedure to be lengthened, whereas others have been developed that 
can be lengthened by servomechanisims within the endoprosthesis. The modular segmental system is a 
new option for expandable endoprosthesis. They can be revised easily to elongate modular components 
to gain length over time. The modular segmental system has several advantages over the mechanically 
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FIGURE 44.11 Experimental artificial discs "used" to restore function of the degenerated spine disc. 



expandable one. The use of the modular system allows intraoperative customization of the endoprosthesis 
during surgery. It minimizes discrepancies between custom implants and actual skeletal defects due to 
the radiographic magnification and uncertainty of margin of tumor resection. It allows the surgeon to 
assemble the prosthesis intraoperatively. The cost of the modular system is less than the cost of an 
expandable custom endoprosthesis. The modular system allows for simpler and less expensive revision 
when failures occur, obviating the need for an entirely new prosthesis when only one part needs to be 
replaced. On the other hand, the modular system has a high chance of corrosion failure at the Morse 
tapers and dislodgment at the Morse taper fittings. It can be lengthened only in certain increments. The 
modular component system has additional applications: metastatic bone disease, failure of internal 
fixation, severe acute fractures with poor bone quality, and failure of total joints with insufficient bone 
stock. 
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FIGURE 44.12 Radiographic appearance (montage) of a modular endoprosthetic replacement. Entire femur, hip 
joint, and knee joint of the bone tumor patient were replaced with prostheses for a limb salvage. 



Defining Terms 

Bone resorption: A type of bone loss due to the greater osteoclastic activity than the osteogenic activity. 
Callus: Unorganized meshwork of woven bone which is formed following fracture of bone to achieve 

early stability of the fracture. 
Fibrous membrane: Thin layer of soft tissue which covers an implant to isolate it from the body. 
Necrosis: Cell death caused by enzymes or heat. 

Osseointegration: Direct contact of bone tissues to an implant surface without fibrous membrane. 
Osteolysis: Dissolution of bone mineral from the bone matrix. 

Primary Healing: Bone healing in which union occurs directly without forming callus. 
Secondary Healing: Bone union with a callus formation. 
Stress shielding: Bone is protected from stress by the stiff implant. 
Wolff's Law: Bone develops or adapts its structure to that most suited to resist the forces acting upon it. 

© 2000 by CRC Press LLC 



References 

Agins HJ, Alcock NW, Bansal M et al., 1988. Metallic wear in failed Titanium-alloy total hip replacements. 

A histological and quantitative analysis. J Bone Joint Surg 70-A(3): 347. 
Albrektsson T, Carlsson LV, Morberg P et al., 1994. Directly bone-anchored implants. In Bone Implant 

Interface, ed. R Hurley, pp 97-120, Mosby, St. Louis, MO. 
Barb W, Park JB, von Recum AF et al., 1982. Intramedullary fixation of artificial hip joints with bone 

cement precoated implants: I. Interfacial strengths. J Biomed Mater Res 16:447. 
Black J. 1994. Biological performance of tantalum. Clinical Materials 16:167. 
Brand R A, Rubin, C T. 1987. Fracture Healing, In Scientific Basics of Orthopaedics, 2nd ed. J. Albert 

and R. Brand, pp 325-340, Appleton & Lange, Norwalk, CT. 
Brighton CT. 1984. Principle of Fracture Healing, In Instructional Course Lectures, ed. J. Murray, 

pp 60-106, The American Academy of Orthopaedic Surgeons. 
Buckwalter JA, Woo S et al., 1993. Soft-tissue aging and musculoskeletal function. J Bone Joint Surg 

75A(10): 1533. 
Burstein AH, Wright TH. 1993. Biomechanics. In J Insall, R Windsor, W Scott et al. (eds), Surgery of the 

knee, 2nd ed. Vol 7 pp 43-62, Churchill Livingstone, New York. 
Cameron HU. 1994a. Smooth metal-bone interface. In Bone Implant Interface, ed. R Hurley, pp 121-144, 

Mosby, St. Louis, MO. 
Cameron HU. 1994b. The implant-bone interface: porous metals. In Bone Implant Interface, ed. 

R. Hurley, pp 145-168, Mosby, St. Louis, MO. 
Capello WN, Bauer TW. 1994. Hydroxyapatite in orthopaedic surgery. In Bone Implant Interface, ed. 

R. Hurley, pp 191-202, Mosby, St. Louis, MO. 
Carter P, Benton L, Dysert P. 1986. Silicone rubber carpal implants; A study of the incidence of late 

osseous complications, J Hand Surg 11A(5):639. 
Charnley J. 1979. Low Friction Arthroplasty of the Hip. Springer- Verlag, Berlin. 
Claes L. 1989. The mechanical and Morphological properties of bone beneath internal fixation plates of 

differing rigidity. J Orthop Res 7:170. 
Claridge R J, Hart MB, Jones RA et al., 1991. Replacement arthroplasties of the ankle and foot. In Disorder 

of the Foot & Ankle, 2nd ed. M. Jahss, pp 2647-2664, Saunders, Philadephia, PA. 
Cochran GVB. 1982. Biomechanics of orthopaedic Structures. In Primer in Orthopaedic Biomechanics, 

pp 143-215, Churchill Livingstone, New York. 
Curtis MJ, Jinnah RH, Wilson VD, Hungerford DS. 1992. The initial stability of uncemented acetabular 

components. J Bone Joint Surg 74B(3): 372. 
DeCoster TA, Heetderks DB et al., 1990. Optimizing Bone Screw Pullout Force. J Orthop Trauma 4(2): 

169. 
Dupont JA, Charnley J. 1972. Low-friction arthroplasty of the hip for the failures of previous operations. 

J Bone Joint Surg 54B(1): 77. 
Ebramzadeh E, Sarmiento A, McKellop HA et al., 1994. The cement mantle in total hip arthroplasty. 

Analysis of long-term radiographic results. J Bone Joint Surg 76A(1): 77-87. 
Friedman RF, Black J, Galante JO et al., 1994. Current concepts in orthopaedic biomaterials and implant 

fixation. In JM Schafer (ed), Instructional Course Lectures, pp 233-255. The American Academy 

of Orthopaedic Surgeons. 
Friedman RJ. 1992. Advanced in biomaterials and factors affecting implant fixation. In RE Eilert (ed), 

Instructional Course Lectures, pp 127-135, The American Academy of Orthopaedic Surgeons. 
Goel VK, Blair W. 1985. Biomechanics of the elbow joint. Automedica 6:119. 
Goldring SR., Schiller AL, Roelke M et al., 1983. The synovial-like membrane at the bone-cement interface 

in loose total hip replacements and its proposed role in bone lysis. J Bone Joint Surg 65A(5): 575. 
Griss P. 1984. Assessment of clinical status of total joint replacement., In P Ducheyne, and GW Hastings 

(eds), Functional Behavior of Orthopaedic Biomaterials pp 21-48, CRC Press, Boca Raton, FL. 



© 2000 by CRC Press LLC 



Harris WH, Davies JP.1988. Modern use of modern cement for total hip replacement. Orthop. Clin. 

North Am 19(3): 581. 
Hedman TP, Kostuik PJ, Fernie GR et al., 1991. Design of an intervertebral disc prosthesis. Spine 16: 256. 
Hench LL. 1994. Bioactive glasses, ceramics and composites. In R Hurley (ed), Bone Implant Interface 

pp 181-190, Mosby, St. Louis, MO. 
Hulth A. 1989. Current concepts of fracture healing. Clin Orthop 249:265. 
Hutzschenreuter P, Brummer H. 1980. Screw design and stability. In H Uhthoff (ed), Current concepts 

of Internal Fixation, pp 244-250, Springer- Verlag, Berlin. 
Jeffery AK. 1994. Articular cartilage and the orthopaedic surgeon. Part 1: Structure and Function. Current 

Orthopaedic 8: 38. 
Kessler SB, Hallfeldt KK, Perren SM et al., 1986. The effects of reaming and intramedullary nailing on 

fracture healing. Clin Orthop 212:18. 
Ling RS. 1992. The use of a collar and precoating on cemented femoral stems is unnecessary and 

detrimental. Clin. Orthop 285:73. 
Maistrelli GL. 1994. Polymer in orthopaedic surgery, In R Hurley (ed), Bone Implant Interface, 

pp 169-190, Mosby, St. Louis, MO. 
McKellop HA, Campbell P, Park SH et al., 1995 The origin of submicron polyethylene wear debris in 

total hip arthroplasty. Clin Orthop 311:3. 
Morrey BF. 1993. The Elbow and its Disorders, 2nd ed, Saunders, Philadelphia, PA. 
Morscher EW. 1992. Current status of acetabular fixation in primary total hip arthroplasty. Clin. Orthop 

274:172. 
Mow VC, Hayes WC. 1991. Basic Orthopaedic Biomechanics, Raven Press, New York. 
Neer CS. 1990. Shoulder Reconstruction, Saunders, Philadelphia, PA. 
Oh I, Carlson CE, Tomford WW et al., 1978. Improved fixation of the femoral component after total hip 

replacement using a methacrylate intramedullary plug. J Bone Joint Surg 60A(5):608. 
O'Slullivan ME, Chao EYS, Kelly PJ. 1989. Current concepts review, The effects of fixation on fracture 

healing. J Bone Joint Surg. 71A(2):306. 
Park JB, Lakes RS. 1992. Biomaterials: An Introduction, 2nd ed, Plenum, London. 
Park JB, Malstrom CS, von Recum AF. 1978 Intramedullary fixation of implants precoated with bone 

cement: A preliminary study. Biomater Med Dev Artif Org 6(4):361. 
Paul JP. 1976. Loading on normal hip and knee joints and joint replacement, In M Schaldach, 

D Hohmann (eds), Advances in Hip and Knee Joint Technology, pp 53-77, Springer- Verlag, Berlin. 
Perren SM, 1989. The biomechnics and biology of internal fixation using plates and nails, Orthopaedics 

12(1):21. 
Perren SM. 1991. Basic aspects of internal fixation. In M Miiller, M Allgower, R Schneider, H Willenegger 

(eds), Manual of Internal Fixation, 3rd ed, pp 1-112, Springer- Verlag, Berlin. 
Salter RB. 1988. The biologic concept of continuous passive motion of synovial joints. The first 18 years 

of basic research and its clinical application. Clin. Orthop 242:12. 
Sarmiento A, Ebramzadeh E, Gogan WJ et al., 1990. Cup containment and orientation in cemented total 

hip arthroplasties. J Bone Joint Surg 72B(6):996. 
Schatzker J, Sanderson R, Murnaghan JP. 1975. The holding power of orthopaedic screws in vivo. Clin 

Orthop 108:115. 
Schenk R K. 1992. Biology of fracture repair. In B Browner, J Jupitor, A Levine, P Trafton (eds). Skeletal 

Trauma, pp 31-75, W.B. Saunders, Philadephia, PA. 
Shen FW, McKellop H, Salovey R. 1996. Irradiation of chemically crosslinked ultrahigh molecular weight 

polyethylene. J Polymer Science: Part B: Polymer Physics 34:1063. 
Skinner HB. 1992. Current biomaterial problems in implants. In RE Eilert (ed), Instructional Course 

Lectures, pp 137-144, The American Academy of Orthopaedic Surgeons. 
Swanson AB. 1973. Concepts of flexible implant design. In A Swanson (ed), Flexible Implant Reconstruc- 
tion Arthroplasty in the Hand and Extremity, pp 47-59, Mosby, St. Louis, MO. 



© 2000 by CRC Press LLC 



Taitsman JP, Saha S. 1977. Tensile strength of wire-reinforced bone cement and twisted stainless-steel 
wire. J Bone Joint Surg. 59A: 419. 

Tencer AF, Johnson KD, Kely RF et al., 1993. Biomechanics of fractures and fracture fixation. In JD 
Heckman (ed), Instructional Course Lectures, pp 19-55, The American Academy of Orthopaedic 
Surgeons. 

Uhthoff HK, Finnegan MA. 1984. The role of rigidity in fracture fixation. Arch Orthop Trauma Surg 
102:163. 

Walker PS. 1993. Design of total knee arthroplasty. In J Insall, R Windsor, W Scott et al. (eds), Surgery 
of the Knee, 2nd ed, Vol 7, pp. 723-738, Churchill Livingstone, New York. 

Ward WG, Yang RS., Eckart JJ. 1996. Endoprosthetic bone reconstruction following malignant tumor 
resection in skeletally immature patients. Orthop Clin North Am 27:493. 

Weinstein AM, Spires WP Jr, Klawitter et al., 1979. Orthopaedic implant retrieval and analysis study. In 
BC Syrett, A Acharya (eds), Corrosion and Degradation of Implant Materials, pp 212-228, Amer- 
ican Society for Testing and Materials Tech. Pub. No. 684, Philadelphia, PA. 

Wolff J. 1986. The Law of Bone Remodeling, R Maquet, R Furlong (trans), Springer- Verlag, Berlin. 

44.2 Dental Implants: The Relationship of Materials 
Characteristic to Biologic Properties 

/. C. Keller 

As dental implants have become an acceptable treatment modality for partially and fully edentulous 
patients, it has become increasingly apparent that the interaction of the host tissue with the underlying 
implant surface is of critical importance for long-term prognosis [Smith, 1993; Young, 1988]. From the 
anatomical viewpoint, it is generally accepted that dental implants must contact and become integrated 
with several types of host tissues. Due largely to the work of Branemark and his colleagues [ Albrektsson 
et al., Branemark, 1983], the importance of developing and maintaining a substantial bone-implant 
interface for mechanical retention and transmission of occlusal forces was realized. Despite documented 
long-term success of dental implants, longer-term implant failures are noted due to poor integration of 
connective and epithelial tissues and subsequent failure to develop a permucosal seal akin to that with 
natural tooth structures. From a biologic point of view, the characteristics of the implant substrate which 
permit hard and soft tissue integration and prevent adhesion of bacteria and plaque need to be further 
understood. It is likely that as a more complete understanding of the basic biologic responses of host 
tissues becomes known, refinements in the currently employed materials as well as new and improved 
materials will become available for use in the dental implant field. 

It is important to realize that the overall biologic response of host tissue to dental implants can be 
divided into two distinct but interrelated phases (as given in Table 44.6). Phase I consists of the tissue 
responses which occur during the clinical healing phase immediately following implantation of dental 
implants. During this healing phase, the initial biologic processes of protein and molecular deposition 
on the implant surface are followed by cellular attachment, migration, and differentiation [Stanford & 
Keller, 1991]. It is therefore important to understand the characteristics of the implant material which 
affect the initial formation of the host tissue-implant interface. The characteristics include materials 
selection and the physical and chemical properties of the implant surface. These initial tissue responses 
lead to the cellular expression and maturation of extracellular matrix and ultimately to the development 
of bony interfaces with the implant material. After the initial healing phase is complete, usually between 
3-6 months according to the two-stage Branemark implant design, the bone interface remodels under 
the occlusal forces placed on the implant during the Phase II functional period [Brunski, 1992; Skalak, 
1985] . The overall bioresponses including bone remodeling during the functional phase of implant service 
life are then strongly influenced by the characteristics of loading and distribution of stress at the interface 
[Brunski, 1992]. The ability of the maturing interface to "remodel" as stresses are placed on the implant 
thus depend in large part on the original degree of tissue-implant surface interaction. 
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TABLE 44.6 Correlation Between the Clinical Phases of Implant Service Life with Biologic 
Events and Important Implant Materials Characteristics 



Clinical Phase 



Biological Events 



Influential Materials 
Characteristics 



I (healing) Protein deposition 

Cell attachment 
Cell migration 

Development of extracellular matrix 
Bone deposition 



Materials selection 

Metals 

Ceramics 
Chemical and physical characteristics 

Topography 

Micro 

Macro 

Surface chemistry 

Inert 

Dissolution 



II (functional) Matrix and bone remodeling 



This chapter will focus on the factors concerning dental implants which affect biologic properties of 
currently available dental implant materials. As pertains to each major topic, the influence of the materials 
properties on biologic responses will be emphasized. 

Effects of Materials Selection 
Metals and Alloys 

Previously, dental implants have been fabricated from several metallic systems, including stainless steel 
and cobalt-chrome alloys as well as from the titanium family of metals. Several studies have reported on 
the ability of host bone tissues to "integrate" with various metal implant surfaces [Albrektsson et al., 
1983; Johansson et al., 1989; Katsikeris et al., 1987]. In current paradigms, the term osseointegration refers 
to the ability of host tissues to form a functional interface with implant surfaces without an intervening 
layer of connective tissue akin to a foreign body tissue capsule observable at the light microscopic level 
[Albrektsson et al., 1983; Branemark, 1983]. By this definition, it becomes apparent that several biomed- 
ical materials including Ti and Ti alloy fulfill this general criterion. Ultrastructural investigations using 
transmission electron microscopy (TEM) approaches have further refined descriptions of the tissue 
implant interface, and the early work by Albrektsson and colleagues [1983] has become the descriptive 
standard by which other materials interfaces are compared. When bone was allowed to grow on Ti, a 
partially calcified amorphous ground substance was deposited in immediate contact with the implant, 
followed by a collagenous fibril-based extracellular matrix, osteoblast cell processes, and a more highly 
calcified matrix generally 200-300 A from the implant surface. 

However, other metallic materials have fallen from favor for use as dental implants due to widely 
differing mechanical properties compared to bone (Table 44.7), which can result in a phenomenon termed 

TABLE 44.7 Approximate Room Temperature Mechanical Properties of Selected Implant 
Materials Compared to Bone 





Elastic Modulus 


Proportional 


Ultimate Tensile 


Percent 




(MPa x 10 3 ) 


Limit (MPa) 


Strength (MPa) 


Elongation 


316L SS 










Annealed 


200 


240 


550 


50 


Cold worked 


200 


790 


965 


20 


CoCrMo(ASTM-F75) 


240 


500 


700 


10 


Ti(ASTM-F67) 


100 


520 


620 


18 


Ti-6A1-4V(ASTM-F136) 


110 


840 


900 


12 


Cortical bone 


18 


130 


140 


1 



Source: Keller and Lautenshlager [1986]. 
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stress shielding [Brunski, 1992; Slalak, 1985], and the propensity for formation of potentially toxic 
corrosion products due to insufficient corrosion resistance properties [Lucas et al., 1987; Van Orden, 
1985]. Ultrastructurally, the interface between bone and 316L stainless steel was described as consisting 
of a multiple-cell layer separating the bone from metal. Inflammatory cells were prominent in this layer, 
and a thick proteoglycan noncollagenous coating was present. This histologic appearance resembled that 
of a typical foreign body reaction and typifies a nonosseointegration-type response. The poor biologic 
response to stainless steel alloys has been reconfirmed by recently conducted in vitro studies which related 
the inability of host tissue to attach to the metal surface to the toxicity associated with metal ion release 
[Vrouwenvelder et al., 1993]. 

Due in large measure to the introduction and overall clinical success of the Branemark system, the 
range of metallic materials utilized for dental implants has become limited largely to commercially pure 
titanium (cpTi > 99.5%) and its major alloy, Ti-6A1-4V [De Porter et al., 1986; Keller et al., 1987]. 
Controversy remains, largely due to commercial advertising interests, as to which material provides a 
more suitable surface for tissue integration. Early work by Johansson and coworkers [1989] reported that 
sputtercoated Ti alloy surfaces resulted in wide (5000 A) amorphous zones devoid of collagen filaments, 
compared to the thinner 200-400 A collagen-free amorphous zone surrounding cpTi surfaces. Subsequent 
studies revealed differences in the oxide characteristics between these sputtercoated cpTi and Ti alloy 
surfaces used for histologic and ultrastructural interfacial analyses. Significant surface contamination and 
the presence of V was observed in the Ti alloy surface, which led to an overall woven bone interface 
compared to the cpTi surface which had a more compact bone interface. Orr and colleagues [1992] 
demonstrated a similar ultrastructural morphology for interfaces of bone to Ti and Ti alloy, respectively. 
In each case an afibrillar matrix with calcified globular accretions, similar in appearance to cement lines 
in haversian systems, were observed in intimate contact with the oxide surface. Any slight differences in 
morphology were attributed to minor differences in surface topography or microtexture rather than 
chemical differences between the two surface oxides. 

This is an area that is still under investigation, although recent research indicates that the stable oxide 
of cpTi and Ti alloy provide suitable surfaces for biologic integration [Keller et al., 1994]. Studies involving 
comprehensive surface analyses of prepared bulk cpTi and Ti alloy indicated that although the oxide on 
Ti alloy is somewhat thicker following standard surface preparations (polishing, cleaning, and acid 
passivation), the overall topography, the chemical characteristics, including presence and concentration 
of contaminants, and surface energetics were virtually identical for both materials as given in Table 44.8. 
In vitro experiments confirmed that the inherently clean condition of these oxides supports significant 
osteoblast cell attachment and migration and provides a hospitable surface to allow in vitro mineralization 
processes to occur [Orr et al., 1992]. 

TABLE 44.8 Surface Characterizations of cpTi and Ti 
Alloy (means ± standard deviations) 





cpTi 


Ti-6A1-4V 


Surface roughness (Ra) (mil) 






Sandblasted 


0.9 ± 0.2 


0.7 ± 0.03 


600 grit polish 


0.2 ± 0.02 


0.1 ± 0.02 


Smooth, 1 |rm polish 


0.04 ± 0.01 


0.03 ± 0.01 


Atomic ratios to Ti 






C 


1.5 ± 0.2 


1.2 ± 0.1 


O 


2.8 ± 0.1 


3.1 ± 0.2 


N 


0.08 ± 0.01 


0.05 ± 0.01 


Al 


— 


0.2 ± 0.04 


V* 


— 


(0.02)* 


Oxide thickness (A) 


32 ± 8 


83 ± 12 


Wetting angles (°) 


52 ±2 


56 ±4 



* One specimen 
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In vitro experiments designed to study the ultrastructural details of bone-implant interfaces made 
from cpTi and Ti alloy may provide additional clues as to the histologic and ultrastructural differences 
which have been observed with these materials. Since clinical implants made from both materials appear 
to be successful [Branemark, 1983; De Porter et al„ 1986], it is possible that because of the difference in 
mechanical properties between unalloyed and Ti alloy material, the longer-term tissue interface results 
from differences in bone remodeling due to the local biomechanical environment surrounding these 
materials [Brunski, 1992]. This hypothesis requires continued investigation for more definitive answers. 

Ceramics and Ceramic Coatings 

The use of single crystal sapphire or A1 2 3 ceramic implants has remained an important component in 
the dental implant field [Driskell et al., 1973]. Although this material demonstrates excellent biologic 
compatibility, implants fabricated from A1 2 3 have not reached a high degree of popularity in the United 
States. Morphologic analyses of the soft tissue interface with A1 2 3 revealed a hemidesmosomal external 
lamina attachment adjacent to the junctional epithelium — implant interface [Steflik et al., 1984]. This 
ultrastructural description is often used for comparison purposes when determining the extent of soft- 
tissue interaction with dental implants. Similarly, in vivo studies of the bone — A1 2 3 implant interface 
reveal high levels of bone-to-implant contact, with areas of intervening fibrous connective tissue. 
Although fibrous tissue was present at the interface, the implant remained immobile, and the interface 
was consistent with a dynamic support system. More recent ultrastructural studies have demonstrated a 
mineralized matrix in immediate apposition to the A1 2 3 implants similar to that described for Ti 
implants [Steflik et al, 1993]. 

An approach to enhancing tissue responses at dental implant interfaces has been the introduction of 
ceramics like, calcium-phosphate-containing (CP) materials as implant devices. The use of calcium- 
phosphate materials, in bulk or particulate form or as coatings on metal substrates has taken a predom- 
inant position in the biomedical implant area and has been the focus of several recent reviews [Kay, 1992; 
Koeneman et al., 1990] . One of the most important uses of CP materials has been as a coating on metallic 
(cpTi and Ti alloy) substrates. This approach has taken advantage of thin-film-coating technology to 
apply thin coatings of hydroxyapatite (HA) and tricalcium phosphate (TCP) materials to the substrate 
in order to enhance bone responses at implant sites. The most popular method of coating has been the 
plasma spray process [Herman, 1988]; although this process has some advantages, there are reports of 
nonuniform coatings, interfacial porosity, and vaporization of elements in the powder [Cook et al., 1991; 
Kay, 1992]. 

The use of this class of materials is based on the premise that a more natural hydroxyapatite (HA-like) 
could act as a scaffold for enhanced bone response — osseointegration — and thereby minimize the long- 
term healing periods currently required for uncoated metal implants. 

Numerous in vivo investigations have clearly demonstrated that HA-like coatings can enhance bone 
responses at implant interfaces [Cook et al., 1991; Jarcho et al., 1997], although the mechanisms respon- 
sible for the development of the interface between hard tissue and these ceramic coatings are not well 
understood [Jansen et al., 1993] . Histologically, the overall bone-coating interface is similar in appearance 
and chronologic development to that reported for uncoated implant surfaces. Initially, an immature, 
trabecular, woven bone interface is formed followed by more dense, compact lamellar supporting bone 
structure. Ultrastructurally, the interface is reported to consist of a globular, afibrillar matrix directly on 
the HA surface, an electron-dense, proteoglycan rich layer (20-60 nm thick) and the presence of a 
mineralized collagenous matrix [De Bruijn et al., 1993]. Although the morphologic descriptions of the 
HA and Ti interfaces are similar, numerous studies have shown that the bone response to HA coatings 
is more rapid than with uncoated Ti surfaces, requiring approximately one-third to one-half the time to 
establish a firm osseous bed as uncoated Ti. Likewise, the extent of the bone response to HA coatings is 
superior and, according to some studies, leads to a several-fold increase in interfacial strength compared 
to uncoated Ti [Cook et al., 1991]. 

The cellular events which take place and lead to the interfacial ultrastructure with bone tissue and 
ceramic surfaces are under current investigation. Based upon preliminary findings, the advantageous 
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biologic properties of HA coatings do not appear to be related to recruitment of additional cells during 
the early attachment phase of healing. Although recent work indicated that bone cells and tissue form 
normal cellular focal contacts during attachment to HA coatings, the level of initial in vitro attachment 
generally only approximates that observed with Ti [Keller et al., 1992; Puleo et al., 1991]. Rather, the 
mechanisms for the enhanced in vitro cell responses appear to be related, to a certain degree, to the 
degradation properties and release of Ca +2 and PO~| ions into the biologic milieu. This surface corrosion 
is associated with highly degradable amorphous components of the coating and leads to surface irregu- 
larities which may enhance the quality of cell adhesion to these roughened materials [Bowers et al., 1992; 
Chehroudi et al., 1992]. Cellular events which occur following attachment may be influenced by the 
nature of the ceramic surface. Emerging evidence from a number of laboratories suggests that cellular- 
mediated events, including proliferation, matrix expression, and bone formation are enhanced following 
attachment to HA coatings and appear to be related to the gene expression of osteoblasts when cultured 
on different ceramic materials. These early cellular events lead to histologic and ultrastructural descrip- 
tions of bone healing which take place on these surfaces and are very similar to those reported from 
in vivo studies [Orr et al, 1992; Steflik et al, 1993]. 

As determined from in vitro dissolution studies, there is general agreement that the biodegradation 
properties of the pertinent CP materials can be summarized as a-TCP > P-TCP >>> HA, whereas 
amorphous HA is more prone to biodegradation than crystalline HA [Koeneman et al., 1990]. Consid- 
erable attempts to investigate the effects of coating composition (relative percentages of HA, TCP) on 
bone integration have been undertaken. Using an orthopedic canine total hip model, Jasty and coworkers 
(1992) reported that by 3 weeks, a TCP/HA mixed coating resulted in significantly more woven bone 
apposition to the implants than uncoated implants. As determined by x-ray diffraction, the mixed coating 
consisted of 60% TCP, 20% crystalline HA, and 20% unknown Ca-P0 4 materials. Jansen and colleagues 
[1993] reported that, using HA-coated implants (90% HA, 10% amorphous CP), bony apposition was 
extensive in a rabbit tibia model by 12 weeks. However, significant loss of the coating occurred as early 
as 6 weeks after implantation. Most recently, Maxian and coworkers [1993] reported that poorly crys- 
tallized HA (60% crystalline) coatings demonstrated significant degradation and poor bone apposition 
in vivo compared to amorphous coatings. Both these reports suggest that although considerable biore- 
sorption of the coating occurred in the cortical bone, there was significant bone apposition (81 ± 2% 
for amorphous HA at 12 weeks, 77% for crystalline HA, respectively) which was not significantly affected 
by bioresorption. 

From these in vivo reports, it is clear that HA coatings with relatively low levels of crystallinity are 
capable of significant bone apposition. However, as reported in a 1990 workshop report, the FDA is 
strongly urging commercial implant manufacturers to use techniques to increase the postdeposition 
crystallinity and to provide adequate adhesion of the coating to the implant substrate [Filiaggi et al., 
1993]. Although the biologic responses to HA coatings are encouraging, other factors regarding HA 
coatings continue to lead to clinical questions regarding their efficacy. Although the overall bone response 
to HA-coated implants occurs more rapidly than with uncoated devices, with time an equivalent bone 
contact area is formed for both materials [Jasty et al., 1992]. These results have questioned the true need 
for HA-coated implants, especially when there are a number of disadvantages associated with the coating 
concept. Clinical difficulties have arisen due to failures within the coating itself and with continued 
dissolution of the coating, and to catastrophic failure at the coating-substrate interface [Koeneman et al., 
1988]. 

Recent progress is reported in terms of the improvements in coating technology. Postdeposition heat 
treatments are often utilized to control the crystallinity (and therefore the dissolution characteristics) of 
the coatings, although there is still debate as to the relationship between compositional variations 
associated with differing crystallinity and optimization of biologic responses. Additional coating- related 
properties are also under investigation in regard to their effects on bone. These include coating thickness, 
level of acceptable porosity in the coating, and adherence of the coating to the underlying substrate. 
However, until answers concerning these variables have been more firmly established, HA coatings used 
for dental implants will remain an area of controversy and interest. 
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FIGURE 44.13 Examples of current dental implant designs, illustrating the variety of macroscopic topographies 
which are used to encourage tissue ingrowth. Left to right: Microvent, Corevent, Screw-vent, Swede-vent, Branemark, 
IMZ implant. 

Effects of Surface Properties 
Surface Topography 

The effects of surface topography are different than the overall three-dimensional design or geometry of 
the implant, which is related to the interaction of the host tissues with the implant on a macroscopic 
scale as shown in Fig. 44.13. This important consideration in overall biologic response to implants is 
discussed later in this chapter. In this discussion the concept of surface topography refers to the surface 
texture on a microlevel. It is on this microscopic level that the intimate cell and tissue interactions leading 
to osseointegration are based as shown in Fig. 44.14. 

The effects of surface topography on in vitro and in vivo cell and tissue responses have been a field of 
intense study in recent years. The overall goal of these studies is to identify surface topographies which 
mimic the natural substrata in order to permit tissue integration and improve clinical performance of 
the implant. In terms of cell attachment, the in vitro work by Bowers and colleagues [1992] established 
that levels of short-term osteoblast cell attachment were higher on rough compared to smooth surfaces 
and cell morphology was directly related to the nature of the underlying substrate. After initial attachment, 
in many cases, cells of various origin often take on the morphology of the substrate as shown in Fig. 44.15. 
Increased surface roughness, produced by such techniques as sand or grit blasting or by rough polishing, 
provided the rugosity necessary for optimum cell behavior. 

Work in progress in several laboratories is attempting to relate the nature of the implant surface to 
cell morphology, intracellular cytoskeletal organization, and extracellular matrix development. Pioneer- 
ing work by Chehroudi and coworkers [1992] suggests that microtextured surfaces (via micromachining 
or other techniques) could help orchestrate cellular activity and osteoblast mineralization by several 
mechanisms including proper orientation of collagen bundles and cell shape and polarity. This concept 
is related to the theory of contact guidance and the belief that cell shape will dictate cell differentiation 
through gene expression. In Chehroudi's work, both tapered pitted and grooved surfaces (with specific 
orientation and sequence patterns) supported mineralization with ultrastructural morphology similar 
in appearance to that observed by Davies and colleagues [1990]. However, mineralization was not 
observed on smooth surfaces in which osteoblastlike cells did not have a preferred growth orientation. 
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Thus the control of surface microtopography by such procedures as micromachining may prove to be a 
valuable technology for the control and perhaps optimization of bone formation on implant surfaces. 

It is apparent that macroscopic as well as microscopic topography may affect osteoblast differentiation 
and mineralization. In a recent study by Groessner-Schrieber and Tuan [1992], osteoblast growth, dif- 
ferentiation, and synthesis of matrix and mineralized nodules were observed on rough, textured, or 
porous coated titanium surfaces. It may be possible therefore, not only to optimize the interactions of 
host tissues with implant surfaces during the Phase I tissue responses but also to influence the overall 
bone responses to biomechanical forces during the remodeling phase (Phase II) of tissue responses. 

Based on these concepts, current implant designs employ microtopographically roughened surfaces 
with macroscopic grooves, threads, or porous surfaces to provide sufficient bone ingrowth for mechanical 
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FIGURE 44.14 Laboratory-prepared cpTi surfaces with 
(a-c, top to bottom) smooth (1 um polish), grooved 
(600 grit polish), and rough (sandblasted) surfaces. 



FIGURE 44.15 Osteoblastlike cell morphology 
after 2 hours' attachment on (a-c, top to bottom) 
smooth, grooved, and rough cpTi surfaces. 
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stabilization and the prevention of detrimental micro- 
motion as shown in Figs. 44.16 and 44.17 [Brunski, 
1992; De Porter etal., 1986; Keller et al., 1987; Pilliar 
etal., 1991]. 

Surface Chemistry 

Considerable attention has focused on the properties 
of the oxide found on titanium implant surfaces fol- 
lowing surface preparation. Sterilization procedures 
are especially important and are known to affect not 
only the oxide condition but also the subsequent 
in vitro [Stanford etal, 1994; Swart etal, 1992] and 
in vivo [Hartman et al., 1989] biologic responses. 
Interfacial surface analyses and determinations of sur- 
face energetics strongly suggest that steam autoclaving 
is especially damaging to titanium oxide surfaces. 
Depending upon the purity of the autoclave water, con- 
taminants have been observed on the metal oxide and 
are correlated with poor tissue responses on a cellular 
[Keller et al., 1990, 1994] and tissue [Baier et al, 1984; 
Hartman et al, 1989; Meenaghan et al, 1979] level. 

The role of multiple sterilization regimens on the 
practice of implant utilization is also under scrutiny. 
Many implants and especially bone plate systems are 
designed for repackaging if the kit is not exhausted. 
However, early evidence indicates that this practice is 
faulty and, depending on the method of sterilization, 
may affect the integrity of the metal oxide surface 
chemistry [Vezeau etal., 1991]. In vitro experiments 
have verified that multiple-steam-autoclaved and eth- 
ylene-oxide-treated implant surfaces adversely affected 
cellular and morphologic integration. However, the 
effects of these treatments on long-term biological 
responses including in vivo situations remain to be 
clarified. 

Other more recently introduced techniques such as radiofrequency argon plasma cleaning treatments 
have succeeded in altering metal oxide chemistry and structure [Baier etal., 1984; Swart etal., 1992]. 
Numerous studies have demonstrated that PC treatments produce a relatively contaminant-free surface 
with improved surface energy (wettability), but conflicting biologic results have been reported with these 
surfaces. Recent in vitro studies have demonstrated that these highly energetic surfaces do not necessarily 
improve cellular responses such as attachment and cell expression. This has been confirmed by in vivo 
studies which indicate that the overall histologic and ultrastructural morphology of the bone-implant 
interface is similar for plasma-cleaned and dry-heat-sterilized implant surfaces [Albrektsson et al., 1983]. 
Another promising technique for the sterilization of implant materials is the exposure of the implant 
surface to ultraviolet light [Singh and Schaff, 1989] or gamma irradiation [Keller et al., 1994]. Both these 
methods of sterilization produce a relatively contaminant-free thin oxide layer which fosters high levels 
of cell attachment [Keller et al., 1994] and inflammatory-free long-term in vivo responses [Hartman et al., 
1989]. Currently, gamma irradiation procedures are widely used for the sterilization of metallic dental 
implant devices. 




FIGURE 44.16 Light microscopic photomicro- 
graph of a bone-smooth cpTi interface with inter- 
vening layer of soft connective tissue. This implant 
was mobile in the surgical site due to lack of tissue 
ingrowth. (Original magnification = 50x.) 



© 2000 by CRC Press LLC 




, 



FIGURE 44.17 Light microscopic photomicrograph of a bone-porous Ti alloy implant interface. Note significant 
bone ingrowth in open porosity at the apical end of the implant. (Original magnification = lOx.) 

Metallic Corrosion 

Throughout the history of the use of metals for biomedical implant applications, electrochemical cor- 
rosion with subsequent metal release has been problematic [Galante et al., 1991]. Of the biomedical metal 
systems available today, Ti and its major medical alloy, Ti-6A1-4V, are thought to be the most corrosion 
resistant; however, Ti metals are not totally inert in vivo [Woodman et al., 1984]. Release of Ti ions from 
Ti oxides can occur under relatively passive conditions [Ducheyne, 1988]. Whereas other factors such as 
positioning of the implant and subsequent biomechanical forces may play important roles in the overall 
tissue response to implants, it is not unreasonable to predict that electrochemical interactions between 
the implant surface and host tissue may affect the overall response of host bone [Blumenthal and Cosma, 
1989]. For example, it has been shown by several groups [De Porter et al., 1986; Keller et al., 1987] that 
the percentages of intimate bony contact with the implant is inconsistent, at best, and generally averages 
approximately 50% over a 5-year period. Continued studies involving the effects of dissolution products 
on both local and systemic host responses are required in order to more fully understand the consequences 
of biologic interaction with metal implants. 

Future Considerations for Implant Surfaces 

It is clear that future efforts to improve the host tissue responses to implant materials will focus, in large 
part, on controlling cell and tissue responses at implant interfaces. This goal will require continued 
acquisition of fundamental knowledge of cell behavior and cell response to specific materials' character- 
istics. It is likely that a better understanding of the cellular-derived extracellular matrix-implant interface 
will offer a mechanism by which biologic response modifiers such as growth and attachment factors or 
hormones may be incorporated. Advancements of this type will likely shift the focus of future research 
from implant surfaces which as osseoconductive (permissive) to those which are osseoinductive (bioactive). 
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Defining Terms 

Amorphous zone: A region of the tissue-implant interface immediately adjacent to the implant sub- 
strate. This zone is of variable thickness (usually < 1000 A), is free of collagen, and is comprised 
of proteoglycans of unknown composition. 

Calcium phosphate: A family of calcium- and phosphate-containing materials of synthetic or natural 
origin which are utilized for implants and bone augmentation purposes. The most prominent 
materials are the tricalcium-phosphate- and hydroxyapatite-based materials, although most syn- 
thetic implants are a mixture of the various compositions. 

Contact guidance: The theory by which cells attach to and migrate on substrates of specific micro - 
structure and topographic orientation. The ability of the cell to attach and migrate on a substrate 
is related to the cytoskeletal and attachment molecules present on the cell membrane. 

Junctional epithelium: The epithelial attachment mechanism which occurs with teeth, and has been 
observed infrequently with implants by some researchers. Less than 1 cell layers thick, the hemides- 
mosomal attachments of the basal cells to the implant surface provide a mechanical attachment 
for epithelium and prevent bacterial penetration into the sulcular area. 

Osseointegration: A term developed by P.I. Branemark and his colleagues indicating the ability of host 
bone tissues to form a functional, mechanically immobile interface with the implant. Originally 
described for titanium only, several other materials are capable of forming this interface, which 
presumes a lack of connective tissue (foreign body) layer. 

Plasma spray: A high-temperature process by which calcium-phosphate-containing materials are 
coated onto a suitable implant substrate. Although the target material may be of high purity, the 
high-temperature softening process can dramatically affect and alter the resultant composition of 
the coating. 
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Biomaterials — i.e., proteins, cells, tissues, and organs — are used daily to preserve life. Uses such as blood 
transfusions, artificial insemination, burn repair, transplantation, and pharmaceuticals rely on their 
availability. Natural materials, however, are labile and often deteriorate over time. To counter this effect, 
preservation procedures for retarding deterioration rates have been developed. Furthermore, since each 
biomaterial is characterized by unique compositional and physical complexities, a variety of storage 
techniques exists. 

Table 45.1 lists examples of biomaterials that have been preserved successfully using various procedures. 
The list, although abbreviated, illustrates the wide range of cells, tissues, organs, and macromolecule 
structures that have been stored successfully and demonstrates how, in some cases (e.g., red blood cells), 
multiple preservation techniques may be appropriate. In the discussion that follows, four biomaterial 
storage procedures — nonfreezing, freeze-thaw, freeze-drying (lyophilization), and vitrification — are sum- 
marized. Nonfreezing techniques enable biomaterial storage by retarding metabolic processes and chem- 
ical reactions during cooling from physiologic to nonfreezing temperatures. With freeze-thaw techniques, 
the biomaterial is stored at low temperatures (usually less than — 70°C) in crystalline form and then 
thawed for actual use. Lyophilized biomaterials are first frozen, then dehydrated by sublimation for storage 
at ambient temperature, and finally reconstituted for use. With vitrification, the biomaterials is cooled 
to subzero temperature in such a way that it is transformed to an amorphous solid for storage and then 
rewarmed for use. Each procedure — i.e., its applications and risks — will be described in more detail 
below. This discussion is not a comprehensive review but a general overview of the principles governing 
each of these four common biomaterial storage techniques. 

45.1 Phase Behavior 

As mentioned previously, biomaterials in their naturally occurring forms tend to deteriorate over time. 
Hence to achieve safe long-term storage, it is generally necessary to alter the physicochemical state of the 
biomaterial. One approach is to promote the transformation of the original substance to a form that can 
be stored safely and then, when needed, restored to its original state. These types of transformations can 
best be represented using phase diagrams. 
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TABLE 45.1 Some Examples of Biomaterials 
Stored Using Various Storage Techniques 



Nonfreezing 



Freeze-thaw 



Lyophilization (freeze-dry) 



Vitrification 



Liver 

Kidney 

Heart valves 

Protein solutions 

Red blood cells 

Cartilage 

Bone marrow 

Skin 

Red blood cells 

Platelets 

Penicillin 

Collagen 

Liposomes 

Embryos 

Islets of Langerhans 

Corneal tissue 

Drosophila melanogaster 



S> 



t 

I 




-140 



~60 

SOLUTE CONCENTRATION. % (W/W) 

FIGURE 45.1 Phase diagram for a hypothetical solute additive. [Adapted from Fahy et al., 1984] 

Figure 45.1 illustrates a temperature versus concentration phase diagram, where concentration corre- 
sponds to the quantity of a hypothetical solute additive present in the solution. The diagram is particularly 
useful for describing phase transformations in which a thermodynamic phase of reduced energy nucleates 
within a parent phase. Crystallization is one example of this type of phase transformation and is a two- 
step process of nucleation of the new phase and its subsequent growth. If the formation of the new phase 
is catalyzed from the surface of foreign particles or a substrate, the growth process is triggered by 
heterogeneous nucleation (T HET ). If, however, the new phase develops from the clustering of individual 
water molecules, growth of the new phase occurs from homogeneous nucleation (T HOM ) [Hobbs, 1974]. 
The latter type of nucleation requires higher activation energies than heterogeneous nucleation and thus 
occurs at lower temperatures. Figure 45.1 also illustrates the relative positions of the melting (T M ) and 
the glass-transition (T G ) temperature curves. The melting-temperature curve shows the melting-point 
depression of a crystalline sample relative to solute concentration. The glass-transition curves signifies 
the temperatures for which a supercooled solution becomes glassy during cooling. The final curve in 
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Fig. 45. 1 ( T D ) demonstrates the devitrification temperature profile and illustrates the conditions for which 
a substance may experience crystallization damage during warming. For glasslike solids, damage is 
incurred as the glass transforms to a crystalline form at T D , while for previously crystalline or partially 
vitrified solids, recrystallization (the coalescence of small crystals during warming) produces damage. 
The described elements of the phase diagram are relevant because all the biomaterial storage techniques 
described in this chapter involve either a phase transformation or its avoidance. Furthermore, if a phase 
transformation is improperly controlled, the effects can be detrimental to the biomaterial. Four tech- 
niques relevant to biomaterial storage are now described. 

45.2 Nonfreezing Storage: Hypothermic 

Given the high cost of organ transplantation, longer preservation times are urgently needed to make the 
procedure cost-effective [Evans, 1993] and expand the geographic area over which organs can be harvested 
and transplanted. Presently, hypothermic storage is the clinically employed organ preservation technique. 

There are essentially two techniques for hypothermic preservation of organs for subsequent transplan- 
tation. The first is static cold storage by ice immersion at ~4°C — to reduce metabolism as much as possible 
without the formation of deleterious ice crystals [Belzer and Southard, 1988]. The second procedure is 
continuous cold machine perfusion at ~10°C to provide diminished metabolism and to remove delete- 
rious end products [Southard and Belzer, 1988]. The perfusate used for static cold storage mimics the 
intracellular ionic composition of the organ and has impermeant solutes added to prevent cell swelling. 
However, for perfusion preservation techniques in general, modified plasma or solutions of plasma-like 
composition usually have been preferred. This is presumably due to the slightly higher temperatures used 
in perfusion techniques that permit some degree of metabolism to occur during storage. 

In most practical instances, simple cold storage is the preservation mode of choice, since the storage time 
is only slightly increased by using continuous perfusion. Furthermore, the development of the University 
of Wisconsin (UW) solution has dramatically extended the preservation times. In the case of human liver, 
for example, the storage time increased from less than 10 to about 30 hours, thus alleviating the need for 
continuous-perfusion storage. As a result of extensive studies, it seems that lactobionate is an essential 
component of the UW solution, which is believed to act as an effective osmotic agent while suppressing 
cell swelling in metabolically depressed cells and tissues [Southard and Belzer, 1993]. 

Another nonfreezing technique that can be used to retard the chemical reaction rates of a biomaterial is 
undercooled storage. During undercooled storage, the biomaterial is exposed to subzero temperatures in the 
absence of deleterious ice crystal formation. Such a method may be particularly applicable to the medium- 
term (months) storage of proteins. Small droplets of aqueous-based solutions are dispersed in an inert oil 
phase, and the final preparation is then put into test tubes for freezer storage at X > -20°C. By keeping the 
quantity of aqueous droplets much greater than the number of heterogeneous nucleation sites in the bulk 
solution, one can effectively deter heterogeneous nucleation during storage. When needed, the dispersions 
are removed from the freezer and warmed to room temperature and reconstituted [Franks, 1988]. 

Unfortunately, a chief problem in using the undercooled storage technique in biochemical application 
lies in the scale-up of the procedure to the larger volumes needed for clinical and commercial applications. 
Furthermore, methods for "efficiently" separating the innocuous oil from the biomaterial are a problem 
that must be solved if the previously stored protein solutions are to be used in therapeutic applications. 
Apparently, it is also difficult to conduct all steps of the undercool procedure under sterile conditions 
[Franks, 1988]. 

45.3 Freeze-Thaw Technology 

Freeze-thaw technology is the most commonly used method for storing cells and tissues; hence it is an 
important cryopreservation methodology. When a transplantation emergency arises, the preserved bio- 
material can be thawed and used to save lives. In addition, the development of cell and tissue transplan- 
tation procedures also benefits the growing field of tissue engineering — where frozen biomaterials are 
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utilized in organ replacement devices (e.g., bioartificial liver) [Borel-Rinkes et al., 1992; Karlsson et al., 
1993]. Even the simple cell, however, is thermophysically complex. Consequently, the design of feasible 
cryopreservation protocols must incorporate knowledge from biology, engineering, and medicine to be 
successful. 

In 1949, Polge and coworkers made an important contribution to biomaterial preservation research. 
In work with sperm, they were the first to report the protective effects of additives or cryoprotectant 
agents (CPAs), i.e., glycerol, on biomaterials at low temperatures. The mechanisms by which CPAs protect 
cells from freeze injury are of fundamental importance but are, unfortunately poorly understood. There 
are four major protective actions of these compounds. First, CPAs act to stabilize the proteins of biom- 
aterials under low-temperature conditions. Experimental evidence indicates that for cells, this effect 
results from the interaction of sugars with the polar head groups of phospholipids [McGann, 1978]. 
Recent thermodynamic analyses demonstrate that the preferential exclusion of CPAs from the protein 
hydration shell, at low temperatures, results in stabilization [Arakawa et al., 1990]. Second, CPAs lower 
the electrolyte concentration of the suspending medium of the cell at a given temperature by altering 
the phase relationship during cooling. Third, CPAs reduce the temperature at which cells undergo lethal 
intracellular ice formation. Fourth, CPAs promote the formation of the vitreous, rather than crystalline, 
phases inside the cell during cooling and help prevent intracellular ice formation (see Fig. 45.1) [Karlsson 
etal., 1994]. 

Unfortunately, the addition and removal of CPAs from the biomaterial introduce a separate set of 
problems. In the case of penetrating additives (i.e., dimethyl sulfoxide and glycerol), the permeability of 
the cell membrane is typically several orders of magnitude less than the permeability of the membrane 
to water [McGrath, 1988]. During the prefreeze addition of these compounds, the biomaterial recognizes 
the CPA as another extracellular solute. Hence the cell responds by initiating the rapid transport of water 
across the cell membrane and into the extracellular medium. Meanwhile, if the CPA is permeable, it 
gradually diffuses into the cell, thus contributing to the state of chemical nonequilibrium experienced 
by the cell. These transport processes continue (usually for minutes) until equilibrium is regained, at 
which time the volume of the biomaterial returns to normal. If the initial concentration of CPA is too 
large, the cell volume variations may be so severe that the biomaterial experiences osmotic stress damage. 
The reverse process, of cell swelling and the subsequent return to normal volume, is observed during the 
removal of CPAs. Osmotic injury from the addition or removal of penetrating compounds can be 
minimized by incrementally increasing or decreasing, respectively, their concentration. 

Unfortunately, the benefits of using stepwise CPA addition and removal in reducing osmotic damage 
is counterbalanced by the increased risk of toxic damage to the biomaterial from longer exposure times 
[Fahy, 1986]. Also, in some cases, a single-step CPA removal process has been shown to be effective 
[Friedler et al., 1988] . The balance between these considerations can be optimized if both the permeability 
of the biomaterial to CPAs and the effects of the CPA on the biomaterial are known. Methodologies for 
measuring these critical parameters are discussed in McGrath [1988]. 

In the case of impermeable CPAs (i.e., polyvinylpyrrolidone, hydroxyethyl starch), the potentially 
injurious effects of exosmosis are also experienced. However, because impermeable CPAs remain in the 
extracellular medium, they are relatively easier to remove from the biomaterial than permeable additives. 
Their major disadvantage is believed to be their inability to afford direct protection to intracellular 
structures during freezing. 

Figure 45.2 illustrates extrema observed when cooling biologic substances to subzero temperatures. 
To minimize cryoinjury, a suitable CPA is added to the biomaterial before the actual freeze-thaw protocol 
is begun. Cooling then proceeds at a controlled rate. As the material is cooled along the freezing path 
(Fig. 45.2), ice formation initiates first in the extracellular medium. This phase transformation is initiated 
either by external seeding or by heterogeneous nucleation from the walls of the container and is crucial 
because it corresponds to the initial temperature at which physico chemical changes occur within the 
biomaterial during cooling. The formation of extracellular ice results in a chemical potential gradient 
across the cell membrane that can be balanced by exosmosis of the intracellular water. If the freezing 
process occurs at slow cooling rates, the intracellular fluid has sufficient time to leave the cell. However, 
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FIGURE 45.2 Schematic of the physicochemical processes experienced by cells during cryopreservation. 

if excessive exosmosis occurs, the result is excessive cell dehydration and shrinkage and subsequent cell 
death from the high concentration of solutes remaining within the cell after water diffusion. If the freezing 
rate is rapid, however, water entrapped in the cell becomes supercooled. Then, at some subzero temper- 
ature, thermodynamic equilibrium is achieved through intracellular ice formation. Unfortunately, intra- 
cellular ice formation is usually associated with irreversible cell damage. Although the exact damage 
mechanism is not known, mechanical effects and high electrolyte concentrations caused by the crystal 
formation are believed to be major modes of cell damage [Mazur, 1984]. 

Figure 45.3 schematically illustrates the relationship between cell survival and cooling rate during a 
freeze-thaw protocol. As shown, cell damage at suboptimal cooling rates is predominantly caused by 
"solution" effects (e.g., exposure to high electrolyte concentrations, decreased unfrozen fractions, exces- 
sive dehydration), until a range of peak survival is obtained at the optimal cooling rate. Experimental 
evidence suggests that the optimal cooling rate for maximum cellular survival is the fastest rate that will 
not result in intracellular ice formation. Beyond this peak, cells are exposed to supraoptimal cooling 
conditions, in which damage from intracellular ice formation dominates. 

Theoretical models have been developed to describe both the kinetics of water loss and the probability 
of intracellular ice formation [Mazur, 1984; Pitt, 1992; Muldrew and McGann, 1993] . The value of these 
models in designing effective freezing protocols for biomaterial preservation also has been demonstrated 
[e.g., Karlsson et al, 1993]. 

Once a cell or tissue sample has been frozen successfully, it can then be stored in liquid nitrogen in a 
-70°C freezer until needed. This step is usually not very crucial, since, as long as the freezer temperature 
is kept stable, storage does not cause cell damage. 

The next challenge in freeze-thaw technology is to thaw the biomaterial using a warming protocol that 
minimizes the risk of recrystallization, or devitrification if applicable, and promotes high survival. The 
optimal thawing rate correlates directly with the rate at which the sample was cooled before storage. If 
the material was frozen slowly at a suboptimal rate (Fig. 45.3), a wide range of thawing rates usually can 
be utilized with negligible effects on survival. If the biomaterial experienced slightly supraoptimal cooling 
rates (Fig. 45.3), rapid thawing is almost exclusively required to avoid recrystallization of small intracel- 
lular ice crystals formed during the initial cooling step. The rapid thawing of volumes of appreciable size 
is particularly difficult because the thermal mass of the specimen dictates the warming rate that can be 
achieved. 
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FIGURE 45.3 A diagram illustrating the relationship between cell survival and cooling rate during a freeze-thaw 
protocol. 



45.4 Freeze-Drying 



Freeze-drying, or lyophilization, is a dehydration storage technique that is advantageous because it 
produces a final product that can be stored at suprazero temperatures and reconstituted with the addition 
of an appropriate solvent. The procedure is commonly applied to protein products (e.g., collagen, 
penicillin) and less frequently applied to cells (e.g., platelets, red blood cells) [Doebbler et al., 1966; 
Goodrich and Sowemimo-Coker, 1993]. Freeze-drying techniques also have been applied to lipo- 
somes — vesicles utilized in drug delivery systems that can trap water-soluble substances within their 
lumina. Lyophilization is a dual stage process that consists of rapidly cooling the liquid material to its 
solid form and subsequent drying (or removing) of the solidified solvent. The intricacies of the procedures 
employed directly influence the shelf life of the final dehydrated product. The drying stage of the process 
utilizes vacuum sublimation (transformation of a solid phase directly to the vapor phase) and may itself 
consist of multiple steps. The complexities of the drying procedure are determined by the continuity of 
the ice phase throughout the frozen specimen [MacKenzie, 1976]. Franks [1982] explains that the quality 
of the final lyophilized product partially depends on the freezing protocol experienced by the original 
sample, the size and distribution of the resultant ice crystals, the degree of heterogeneity, the presence 
or absence of amorphous regions, and the conditions imposed on the sample during drying. As one 
might expect, these factors can produce differentiations between the desired goals of the freeze-drying 
process and the actual final product. 

The effects of the freezing protocol and the size and distribution of the resultant crystals are important 
for the reasons discussed previously. It should be mentioned that in the case of protein lyophilization, 
carbohydrates are often added for protection, while for cells, glycerol is a common lyoprotectant. The 
next two factors — the degree of heterogeneity and presence of amorphous regions — directly relate to the 
drying procedure. For simple cases in which large areas of ice (e.g., continuous channels) have formed 
in the sample, ice is removed by direct sublimation. However, for the more complex configurations 
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commonly encountered with biomaterials, sublimation alone is inadequate because the ice crystal for- 
mation is discontinuous and amorphous regions are present. The amorphous regions correspond to 
pockets of water bound by freeze-concentrated solutes and biomaterial components (e.g., cell mem- 
branes). Water bound in this way serves to satisfy the hydration shell requirements of the specimen 
[Steinbrecht and Miiller, 1987], hence contributing to the stabilization of the biomaterial. It is now 
recognized that the formation of amorphous regions is necessary for the successful lyophilization of 
biomaterials [Crowe et al., 1993a; Levine and Slade, 1992]. Since material destabilization is one source 
of damage during freeze-drying, this is an important area of lyophilization research. 

Regardless of the complexity of the frozen matrix, biomaterial injury can occur during drying. Factors 
such as the effective drying temperature and the rate of drying are important in determining the degree 
of damage incurred. The drying step entails the removal of first the free water and then the bound water. 
However, the complete removal of this bound water during freeze-drying can be injurious to biomaterials 
and can promote protein denaturation and possibly aggregation into insoluble precipitates. Denaturation 
is the disruption of the folded structure of a protein and is affected by variables such as pH, surface 
interactions, and thermal and chemical changes [Darby and Creighton, 1993]. It is unfavorable because 
the final lyophilized product may no longer resemble the original biomaterial once denaturation occurs. 
Hence the avoidance of protein denaturation is crucial to effective biomaterial lyophilization. Investiga- 
tions with liposomes and proteins reveal that the survival of biomaterials in the dry state is linked to the 
stabilizing influence of disaccharides (such as sucrose and trehalose) present in the system [Crowe et al., 
1993b]. However, to minimize denaturation during freeze-drying protocols, lyoprotectants and CPAs 
should be used [Cleland et al., 1993]. It should be mentioned that the removal of these additives can be 
problematic in the large-scale industrial use of freeze-drying techniques. 

Once a biomaterial has been lyophilized successfully, it is stored for future use. For the special case of 
protein storage — an area that is of particular interest to the pharmaceutical industry — the stability of the 
drug or protein must be guaranteed throughout a reasonable shelf life (the time period, measured in 
years, for which the drug maintains its original physical and functional properties). Two means of 
biomaterial destabilization during storage are the occurrence of oxidative and chemical reactions after 
lyophilization. Oxidation effects can be reduced by the exclusion of oxygen from containers of the dried 
materials and the use of antioxidants. The chemical reactions can be inhibited through the maintenance 
of low residual moisture levels [Cleland et al., 1993]. 

The final step, in the use of freeze-drying techniques for biomaterial storage, is the reconstitution of 
the lyophilized product. If pure water is used as the rehydration solvent, concentration gradients and 
osmotic imbalances can result in severe injury. To counter these effects, biomaterials are commonly 
rehydrated using isotonic solutions or media. The effects of using additives in the reconstituting solvent 
of biomaterials has recently been addressed. Apparently sugars are also effective in reducing damage 
during this rehydration step [Crowe et al., 1993b]. 

45.5 Vitrification 

The hazards of ice crystal formation are significantly reduced by rapidly cooling the biomaterial to low 
temperatures at sufficient rates to produce an amorphous solid. This alternative, depicted in Fig. 45.2, 
was originally proposed in the 1930s and is called vitrification [Luyet, 1937; Goetz and Goetz, 1938]. 
Vitrification is the kinetic process by which a liquid solidifies into a glass. It requires the rapid cooling 
of the liquid to the glass-transition temperature T G and is most easily achieved in high-viscosity liquids 
[Doremus, 1973]. The molecular configuration of the supercooled liquid (T> T G ) is the same as that of 
the glass (T< T G ). Hence rapid cooling is necessary to prevent the supercooled liquid molecules from 
reorganizing into a regular (e.g., lattice) configuration. Vitrification is a second-order phase transition. 
Hence, by definition, the specific volumes of both phases (near T G ) are identical, although the thermo- 
dynamic property values (i.e., heat capacity, coefficient of thermal expansion) are not [Kauzmann, 1948]. 
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The difficulty in successfully vitrifying a material lies in reaching its glass transition temperature T G prior 
to crystal formation. Hence reducing the distance between T M and T G by increasing the solute concen- 
tration (Fig. 45.1) increases the probability that a given liquid will form a glass. Two alternative ways to 
achieve glassy state are ( 1 ) to cool biomaterials at ultrarapid rates such that T G is reached before nucleation 
can proceed and (2) to increase the pressure of the system such that the intersection of T H0M and T G 
(Fig. 45.1) occurs at lower CPA concentrations [Fahy et al., 1984; Kanno and Angell, 1977]. Since glass 
formation circumvents the deleterious effects of freeze injury during cooling, it is becoming an increas- 
ingly important biomaterial storage technique. 

We have already mentioned the roles of viscosity and rapid cooling in vitrification. The more viscous 
a liquid, the slower it can be cooled to achieve its vitrified form. Fairly large cryoprotectant concentrations 
are necessary to obtain the vitrified form of aqueous solutions by slow cooling, and the requirement 
increases with specimen volume. This is undesirable, since high concentrations of CPAs are toxic to 
biomaterials. Hence, in practical applications of vitrification technology, a balance between the thermo- 
dynamic conditions necessary to achieve vitrification and the physicochemical conditions suitable for 
survival is crucial. 

The most common difficulty encountered in attempts to vitrify biologic materials is their susceptibility 
to ice crystal formation. As mentioned previously, the larger the sample volume, the higher the CPA 
concentration necessary to reduce the probability of crystallization [Coger et al., 1990]. Drosophila mel- 
anogaster cells, corneal tissue, and pancreatic islets of Langerhans are three examples of biomaterials that 
have been vitrified successfully [Steponkus et al., 1990; Bourne, 1986; Jutte et al., 1987]. Of these, only 
D. melanogaster has not been preserved previously by the freeze-thaw technique. In fact, the freeze-thaw 
technique remains the chief method of cells storage. For many cell and tissue types, the appropriate 
combination of effective temperature, cryoprotectant concentration, and cooling and warming rate 
conditions necessary for their vitrification have yet to be determined. For long-term organ storage, the 
freeze-thaw technique is not an alternative, because the mechanical stress and toxic damage that the 
organ would incur during freezing would be lethal [Bernard et al., 1988; Fahy et al., 1984]. Organ 
vitrification attempts, although unsuccessful, have demonstrated that the CPA toxicity limitation is 
especially evident in organ preservation such that toxic death prevention is an unavoidable challenge in 
organ vitrification [Fahy, 1986] . With respect to cooling rate, the relatively large volumes of organs require 
the use of slow cooling protocols to ensure a uniform thermal history throughout the total volume. 
Organs therefore require high-pressure, high-cryoprotectant concentrations and slow cooling conditions 
to achieve the vitreous state. 

Once a biomaterial has been vitrified successfully, it must be stored at temperatures below T G (Fig. 45.1) 
to encourage stability. When restoring the sample to physiologic conditions, special care must be taken 
to avoid crystallization during the warming protocol. Crystallization under these circumstances, termed 
devitrification, is possible (or even probable) since at temperatures greater than T G , the crystalline phase 
is more stable than the amorphous solid. If such a transformation occurs, cryoinjury is unavoidable. The 
probability of devitrification is significantly reduced by warming the biomaterial at rates equivalent to 
those imposed during the original cooling [Fahy, 1988; MacFarlane et al., 1992]. The use of vitrification 
as a biomaterial storage technique is an ongoing and important area of cryopreservation research. 
Presently, it is the only apparent solution for the long-term storage of organs. 

45.6 Summary 

Biomaterial storage is an exciting research area whose advances are of value to a variety of fields, including 
medicine, biologic research, and drug design. However, the compositional and physical complexities of 
the various biosubstances (e.g., proteins, cells, tissues, and organs) require the development of specialized 
preservation procedures. In the present discussion, four stage techniques — nonfreezing, freeze-thaw, 
lyophilization, and vitrification — and their relevance to specific biomaterial examples have been reviewed. 
Although there have been definite advances, important challenges still remain for future investigations. 
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Defining Terms 

Cryopreservation: Techniques utilized to store cells, tissues, and organs under subzero conditions for 

future use in clinical applications. 
Cryoprotectant: Chemical additive used to protect biomaterials during cooling to low temperatures 

by reducing freezing injury. 
Devitrification: Crystallization of an amorphous substance during warming. 
Freeze-drying: Dehydration of a sample by vacuum sublimation. 
Lyoprotectant: Expedients added to lyophilization formulations to protect the biomaterial from the 

damaging effects of the process. 
Recrystallization: The coalescence of small ice crystals during warming. 
Shelf life: Length of time in which the stability of the components of a biomaterial (e.g., pharmaceutical 

drugs) is guaranteed during storage. 
Vitrification: Solidification process in which an amorphous (glasslike) solid, devoid of crystals, is 

formed. 
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Further Information 

In addition to the review works cited throughout the text, the following publications also may be of 
interest. 

For more detailed information concerning lyophilization, as applied specifically to proteins, see The 
development of stable protein formulations: A close look at protein aggregation, deamidation, and 
oxidation, by Cleland et al., an especially helpful article (1993. Crit. Rev. Ther. Drug Carrier Syst 10:307). 
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A good introduction to the fields of cryobiology and cryopreservation is presented in The Clinical 
Applications of Cryobiology, by B. J. Fuller and B. W. W. Grout (CRC Press, Boca Raton, Fla., 1991). For 
review work relevant to vitrification and organ preservation, The Biophysics of Organ Cryopreservation, 
edited by D. E. Pegg and A. M. Karow Jr. (Plenum Press, New York, 1987) should be consulted. 

For additional information on the physicochemical processes encountered during cooling of bioma- 
terials, refer to Korber's 1988 review (Q. Rev. Biophys. 21:229). For an overview of bioheat transfer 
processes, refer to Diller's review in Bioengineering Heat Transfer 22:157, 1992. For additional information 
on the fundamental principles of intracellular ice formation, consult the 1993 review by Toner (pp. 1-51 
in Advances in Low Temperature Biology, edited by P. L. Steponkus, vol. 2, JAI Press, London). 
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Total hip joint replacement (THR) has been very successful mainly due to the introduction of bone 
cement for the fixation by Dr. John Charnley on the advice of Dr. Dennis Smith in the late 1950s [Charnley, 
1970; Charnley, 1972; Charnley and Cupic, 1973; Eftekhar, 1978; Wroblewski, 1986]. One of the inherent 
problems of orthopedic joint prosthesis implantation is the fixation and maintenance of a stable interface 
between the device and the host tissue at the cellular and organ levels [Crowninshield, 1988; Ducheyne, 
1988; Park, 1992; Park, 1995]. The fixation can be classified into several categories as given in Table 46.1. 
Basically, the hip joint can be considered as three parts, (1) acetabular cup, (2) cup-femoral head-articu- 
lating surface, and (3) femoral stem (Fig. 46.1). The problems and possible solutions related to each of 
the three parts will be examined. 

Most frequent fixation problems are related to (1) infection, (2) wear and wear particulate, (3) migration 
and failure of implants, and (4) loosening of which the "long-term loosening" of the implant is especially 
important [Ducheyne, 1988; Park, 1992]. These problems manifest into osteolysis in the bone bed which 
is the major cause of long-term loosening mostly for the femoral stem [Anthony et al., 1990; Harris, 
1995; Maloney etal., 1990; Maloney and Smith, 1995]. Some major factors related to (late) loosening 
are (1) mismatch of the physical properties between tissues and implant, (2) biocompatibility of the 
implant, (3) deterioration of physical properties of implant materials, (4) surgical techniques, (5) design 
of the implant, (6) selection of patients, and (7) post surgical care, etc. [Amstutz et al., 1976; Carlsson 
et al., 1983; Crowninshield, 1988; Dowd et al, 1995; Gruen et al., 1979; Harris and McGann, 1984; Havelin 
et al., 1995; Lu et al., 1993; Mohler et al, 1995; Mulroy and Harris, 1990; Nilsen and Wiig, 1996; Riegels- 
Nielsen et al., 1995; Stromberg et al., 1996]. In this review, recently reported clinical cases based on the 
large studies will be examined, especially the Norwegian Arthroplasty Register which will shed some light 
on the cemented vs. uncemented fixation of the total hip arthroplasty. This is mainly due to the fact that 
the individuals reporting and their clinical results could be widely varied and biased. 

Variables related to the total (hip) joint replacement are (1) materials (Table 46.2), (2) design, and 
(3) fixation method (Table 46.3). One should keep in mind that any particular type of prosthesis is made 
to have a particular fixation method, e.g., the Charnley prosthesis is designed to be used with bone 
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TABLE 46.1 Summary of Various Methods of Prosthesis Fixation 



Methods of Fixation 

1. Mechanical Fixation 

a. Active — use of screws, bolts, nuts, wires, etc. 

b. Passive — interference fit and noninterference fit 

2. Bone Cement Fixation 

a. Pure cement 

b. Modified cement — composite cement 

3. Biological Fixation 

a. Porous ingrowth 

b. Modified porous ingrowth — electrical and pulsed 
electromagnetic field (PEMF) stimulation 

4. Direct (Chemical) Bonding Fixation 

a. Osteogenic/inductive-glass-ceramics 

b. Osteoconductive hydroxyapatite 



Examples 



Pre-Charnley implants [Williams and Roaf, 1973] 
[Moore, 1952; Mittlemeier, 1975, 1976] 

[Charnley, 1970; 1972] 

[Dai et al., 1991; Henrich et al., 1993] resorbable particle 
impregnated cement, [Liu et al., 1987; Park et al., 1986] 

[Klawitter and Hulbert, 1972; Sauer et al., 1974; Hirshhorn 
et al., 1972; Homsy et al., 1972; Smith, 1963] 
[Park, 1983; Park and Kenner, 1975; Weinstein et al., 1976] 



[Blencke et al., 1978; Hench and Paschall, 1973] 
[de Groot, 1983; Kay, 1988] 



ACETABULAR CUP 



ARTICULATING 
SURFACE 



FEMORAL STEM 



CEMENT PLUG 




FIGURE 46.1 Schematic diagram showing the three important areas of THR. 

cement at both the cup and femoral prosthesis sites while some were designed to be used as uncemented 
such as the BIAS® femoral stem prosthesis [Onsten et al., 1995]. Also, sometimes the outcome of the 
THR may depend on the type of bone cement used; (1) low- vs. high-viscosity, (2) plain vs. antibiotics- 
impregnated, and (3) low vs. high heat of polymerization cement, etc. A more mitigating problem is that 
the outcome of the THR could not be clearly attributed to any one specific factor, e.g., the osteolysis may 
be due to the particulates which are due to wear and corrosion products which in turn could be initiated 
between the articulating surfaces or between the metal backing and polyethylene. However, the critical 
stage may be reached due to the accelerated rate of particulate production due to the loosening of the 
implant, which causes a larger and unnatural motion of the articulation. Once the lysis starts, the cycle 
becomes more vicious — further aggravating the situation. 

The design and fixation related variables are summarized in Table 46.3. It is not that uncommon to 
find that a prosthesis designed for uncemented fixation is used with cement, making the outcome study 
more complicated. 
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TABLE 46.2 Physical Properties of Materials Used for Joint Prosthesis and Bone (Park and Lakes, 1992) 



Materials 


Young's Modulus 


UTS a 


Elongation 


Density 




(GPa) 


(MPa) 


(%) 


(g/cm 3 ) 


Metals 










316LS.S. (wrought) 


200 


1000 


9 


7.9 


CoCrMo (cast) 


230 


660 


8 


8.3 


CoNiCrMo (wrought) 


230 


1800 


8 


9.2 


Ti6Al4V 


110 


900 


10 


4.5 


Ceramics 










Alumina (A1 2 3 , polycrystalline) 


400 


260 


<0.1 b 


3.9 


Glass-ceramic (Bioglass®) 


200 


200 


~0.1 b 


2.5 b 


Calcium Phosphate (Dense Hydroxyapatite) 


120 


200 


~0.1 b 


3.2 


Polymers 










PMMA'(Solid) 


3 


65 


5 


1.18 


PMMA Bone Cement 


2 


30 


3 


1.1 


UHMW d Polyethylene 


1 


30 


200 


0.94 


Polysulfone 


2.5 


70 


50 


1.24 


Silicone Rubber 


<0.01 


6 


>350 


1.12 


Fibers and Wires 










Aramid (Kevlar) 


130 


2700 


2 


1.45 


Carbon 


400 


2500 


1 


2 


Nylon 


5 


500 


10 


1.07 


Steel (piano wire) 


200 


2450 


1.2 


7.8 


Bone 










Femur (compact), long axis 


17 


130 


3 


2.0 


Femur (compact), tangential 


12 


60 


1 


2.0 


Spongy bone 


0.1 


2 


2.5 


1.0 



" UTS: ultimate tensile strength, for ceramics the Young's modulus and UTS are for the flexural modulus 
and bending strength, respectively. 
b Estimated values. 
c PMMA: poly(methylmethacrylate) 
d UHMW: ultra high molecular weight (>2 x 10 6 g/mole) 



TABLE 46.3 List of Design and Fixation Related Variables in 
THR 



Part 



Materials Used 



Fixation Methods 



Cup 



Articulating 
surfaces 



Femoral stem 



Head 



A11PE 

Metal-backed 
Porous-coated 
Screw-holed 

All Ceramic 

PE/Metal 

PE/Ceramic 

Ceramic/ceramic 

Metal/metal 

Metal 

Composite 
Smooth 
Textured 
Porous-coated 
PMMA-coated 
HA-coated 

Metal 

Ceramic 



Cemented 

Uncemented (mechanical) 
Uncemented (mechanical) 
Uncemented (mechanical) 
Cemented/uncemented 



Cemented/uncemented 

Uncemented 

Threaded(rare) 

Uncemented 

Uncemented 

Cemented 

Uncemented 

Fixed 

Rotated (trunnion/modular) 



PE: Polyethylene (ultra high molecular weight) 
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46.1 Acetabular Cup 



The advent of the uncemented acetabular cup necessitated the metal-backing of the polyethylene liner 
although some use all polyethylene without cement [Ritter, 1995]. Due to the relatively new advocacy of 
the so-called "hybrid" THR, where the femoral prosthesis is cemented and the cup is uncemented, there 
are fewer cases of comparing the outcome of the cemented vs. uncemented acetabular cup performance. 
Havelin et al. compared 1 1 of the most widely used uncemented cups (4352 cases) [Havelin et al., 1995] . 
After 5 years, the overall cumulative revision rate was 3.2% but increased drastically to 7.1% after 6 years 
with large differences among the designs. Hydroxyapatite and porous (metal) coated cups had a failure 
rate of less than 0.1%. It is interesting that all polyethylene cups had a cumulative revision rate of 14% 
while threaded uncoated metal-backed cups varied from 0% (PM® cups) to 21% (Ti-Fit®) after 6 years. 
They concluded that the coating of the surface is the most important factor for the uncemented cups 
for the first 6 years of implantation. As is the case with all tissue ingrowth fixation, the most important 
factor for short-term results with uncemented cups is the immediate stability and possibility for the tissue 
ingrowth. They also concluded 

"It remains to be seen if the results with some uncemented cups, in some groups of patients, are better 
than for the best-cemented. Until more is known about their long-term results, uncemented acetabular 
components should be used as part of randomized trials." 

Of course, the performance of the acetabular cup may be closely interrelated to the femoral stem. 
However, Havelin's study could not detect any outstanding difference with different combinations of the 
cup and femoral stem. 

Espehaug et al. studied 12,179 hip prostheses comparing 10 different brands reported to the Norwegian 
Arthroplasty Register during 1987-1993 [Espehaug et al., 1995]. The overall revision rate after 5 years 
was 2.5% while the Charnley prosthesis was 2.9%. Elite cup and Charnley stem combination resulted in 
much poorer performance (9.84% after 5 years). However, other combinations gave better results. The 
main cause of revision was aseptic loosening after 5 years (1.8%) followed by infection (0.5% after 
5 years). In conclusion, they 

"... observed good overall results of cemented hip prostheses. However, clinically important differences 
in revision rates were demonstrated among the various cemented total hip prosthesis brands." 



Some studied the aseptic loosening of the cemented 
cups and concluded that the mechanism of late loosen- 
ing is biological (osteolysis) not mechanical, which is 
the opposite of the femoral stem [Harris, 1995; 
Schmalzried et al., 1992]. Therefore, the metal-backed 
cup which is designed to reduce the stresses in the bone- 
bone cement interface would be ineffective. Some clin- 
ical studies showed that the all polyethylene cup per- 
formed better than the metal-backed cemented cups 
[Dall etal, 1993; Harris and Penenberg, 1987; Ritter, 
1995; Ritter et al, 1990]. 

Malchau and Herberts reported revision and re-revi- 
sion rates in THR in Sweden. The reasons for revision 
were similar to the Norwegian study (Table 46.4), and 
are as follows: (1) aseptic loosening, which was the 
major factor (72.3%) (2) infection (7.2%), (3) fracture 
(4.7%), and (4) dislocation (4.2%), etc. [Malchau and 
Herberts, 1998]. The loosening of metal-backed and 



TABLE 46.4 Reason for Revision of THR 
in Sweden (Malchau and Herberts, 1998) 





Numbe 




Reason 


r 


Percent(%) 


Aseptic loosening 


6965 


72.3 


Primary deep infection 


690 


7.2 


Fracture only 


4.7 




Dislocation 


403 


4.2 


2-stage procedure 


386 


4.0 


Technical error 


372 


3.9 


Implant fracture 


161 


1.7 


Secondary infection 


94 


1.0 


Pain 


37 


0.4 


Polyethylene wear 


26 


0.3 


Miscellaneous 


33 


0.3 


Missing 


13 


0.1 


Total 


9634 


100.0 
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all-polyethylene acetabular cups were compared. Mixed results were obtained, where in one design the 
all-poly was clearly better, while in others both performed equally well. One would be tempted to advocate 
the use of all-poly since it is a lot less costly (1/2 to 1/3) and equal or better in clinical performance 
compared to the metal-backed acetabular cup. 

Basically, the Charnley type all polyethylene cup has been the 'gold standard' for long-term clinical 
results. The metal-backed, either cemented or uncemented polyethylene lined acetabular cup is still 
considered as experimental until long-term follow-up studies (more than 10 years) become available 
[Malchau and Herberts, 1998] . Also, the benefits of biological fixation due to the lack of long-term follow- 
up studies are inconclusive at this time, although some designs gave promising results. Other variations 
such as flanged vs. unflanged, elevated vs. conventional cups, etc., should be proven clinically through 
long-term trials although the flanged cup performed better in one analysis [Wroblewski, 1993]. 

In our laboratory we have been developing polymethylmethacrylate pre-coated acetabular cups and 
tibial plateau of the total knee replacement (TKR) for cemented fixation [Dowd et al., 1995; Gruen et al., 
1979; Harris and McGann, 1984; Havelin et al.] It is very difficult to adhere bone cement to the surface 
of ultra high molecular weight polyethylene (UHMWPE) with PMMA bone cement. Therefore, the 
acetabular cup is deeply grooved to make a mechanical bonding and thus causes a weak implant. The 
ultra high molecular weight polyethylene (UHMWPE) surface is treated with xylene, polymethyl- 
methacrylate, and a methylmefhacrylate monomer for chemical bonding between the two polymers. 
Preliminary results show that the interface tensile strength between the UHMWPE and bone cement 
resulted in about 10 MPa which is about one-third of the tensile strength of bone cement and UHMWPE 
[Kang, 1998; Kang and Park, 1998; Park and Lakes, 1992]. Recent studies on the UHMWPE powders 
treated with the methylmefhacrylate (MMA) monomers without the use of xylene showed a high degree 
of adhesion after sintering at a high temperature (165°C) and pressure (77.4 MPa) [Park and Park, 1998]. 
Increased interfacial tensile strength by further treating the MMA treated powders with "PMMA + MMA' 
and "PMMA + MMA + BPO" is evidenced as shown in Table 46.5. The sintering of UHMWPE with a 
mold is one way of making an acetabular cup and tibial plateau, therefore, the latest results open the 
possibility of making a graded composite structure of the UHMWPE implants from pure PE articulating 
surface to the pure PMMA surface to which the acrylic bone cement can be attached directly via chemical 
bonding as shown in Fig. 46.2 [Park, 1998]. It is also possible to sinter UHMWPE powers which were 
cross-linked, which also showed improved wear properties [Gul et al.; Marrs et al., 1998; McKellop et al., 
1998; Oonishi et al., 1998]. Glow discharge technique could be used for the high and low density 
polyethylene if one chooses to use this material for the cup as was the case for the early (before the mid 
1980s, HDPE) Charnley prostheses [Foerch et al., 1994; Khang et al., 1996]. However, the interfacial 
strength is not as high as the present values with the UHMWPE. 

TABLE 46.5 The Tensile Strength for Molded and Original Specimens (Park, 1998) 





Sintering 






Interfacial 


Sintered Layers 


Temp(°C) 


PE Powder Treatments 


Group 


Strength (MPa) 


PMMA rod/PMMA 


165 


MMA only 


165M 


2.1 ± 1.6 


powder/treated PE 




MMA+MMA/PMMA 


165PM 


10.6 ± 3.6 


powder/PMMA 




MMA+MMA/PMMA+BPO 


165PMB 


17.7 ± 2.53 


powder/PMMA rod 










PE rod/PE powder/treated PE 


145 


MMA+MMA/PMMA+BPO 


145PMB 


12.4 + 1.67 


powder/PE powder/PE rod 


165 


MMA+MMA/PMMA+BPO 


165PMB 


14.95 + 0.85 




185 


MMA+MMA/PMMA+BPO 


185PMB 


22.6 ± 1.26 


PE rod/pure PE powder/PE rod 


205 


MMA+MMA/PMMA+BPO 


205PMB 


22.8 ± 2.5 


PMMA rod 


205 


— 


205PE 


32.5 ± 1.20 


PErod 


— 


— 


PMMA 


75.9 ± 1.00 




— 


— 


UHMWPE 


36.9 ± 1.90 
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FIGURE 46.2 Schematic diagram showing the graded UHMWPE and PMMA for better fixation and wear [Park, 
19981. 



46.2 Femoral Stem 

Cemented and Uncemented Fixation 

The design of the femoral stem also involves the articulating surface with the acetabular cup which could 
be a more important factor for the longevity of the stem function. The fixation of the stem is largely 
divided into two categories, i.e., cemented and uncemented. The uncemented can be classified into 
interference fit and porous-coated for tissue ingrowth fixation. The porous-coated type can be further 
coated with a hydroxyapatite layer to aid tissue ingrowth. 

A large study involving 14,009 cases of cemented and 1,326 uncemented total hip arthroplasties were 
reported [Havelin et al., 1994]. It was found that the cumulative revision rate for the cemented hip was 
2.7% after 4.5 years and 6.5% for the uncemented. For the cup, it was 0.6% for the cemented and 1.7% 
for the uncemented, and for the femoral components 1.7% and 3.9% after 4.5 years. The results for the 
uncemented prostheses were less favorable in young patients. In men and women under 60 the revision 
rates were 6 and 3% for the uncemented and cemented respectively. There were large variations in 
performance among the individual designs of the uncemented prostheses. However, the early results 
warrant further clinical trials for the uncemented implants. Malchau and Herberts reported revision and 
re-revision rates in THR in a large Swedish population consisting of 148,359 people. The reasons for 
revision were similar to the Norwegian study (Table 46.4): (1) aseptic loosening was the major factor 
(72.3%), (2) infection (7.2%), (3) fracture (4.7%), and (4) dislocation (4.2%), etc. [Malchau and Her- 
berts, 1998]. The survival at 17 years (1979-86) was 81.8% and 9 years (1987-96) was 95.5% for the 
cemented implants. The survival at 13 years (1979-86) was 68.9% and at 9 years (1987-96) was 87.2% 
for the uncemented implants [Malchau and Herberts, 1998]. Again the results show a definitively better 
surgical outcome where cements were used regardless of the various factors involved, such as types of 
implants, materials, patients, etc. 

Using less stiff materials and filling the bone cavity with a custom-designed prosthesis has not met 
researchers' expectations [Lombardi et al., 1995; Morscher, 1995]. It is believed that the bone reacts to 
the presence of implants as if they are part of the supporting member of the body weight. Thus, the 
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filling of the femoral cavity would cause a greatly different behavior of the bone since the prosthesis alters 
the stress pattern completely. Incorporation of the bony tissue into the interstices of pores would also 
alter the stress pattern of the femur. This also complicates the situation. Any stiff material contacting the 
surface of the bone will cause bone density change due to the altered stress pattern and bone lysis due 
largely to the particulate generated at the articulating surface between the acetabular cup and femoral 
head [Maloney et al., 1990; Maloney and Smith, 1995]. Most of the osteolysis is located in the distal and 
proximal end of the prosthesis. The osteolysis can be accelerated by the aseptic loosening of the implants, 
which in turn causes the accelerated wear of the UHMWPE cup. The initial mode of failure of the femoral 
prosthesis is the debonding of the stem/bone cement interface followed by osteolysis at the distal end of 
the stem where the particulate may accumulate by gravity [Jasty et al., 1991]. 

It is interesting to note that the firm fixation of the porous and hydroxyapatite coated implants cause 
a great deal of difficulties during revision surgery, sometimes necessitating cutting of the femora [Bhamra 
et al., 1996]. This result creates the Catch-22 dilemma for the fixation of implants — to have a stable 
implant, it should be firmly embedded into the intramedullary cavity, but when it fails one should be 
able to remove it easily. 

Possible Solutions for Cemented Fixation 

There are many studies concerning the loosening of the femoral stem of the hip joint replacement but 
very few facts have been elucidated with respect to the loosening [Amstutz et al., 1976; Carlsson et al., 
1983; Crowninshield et al, 1980; Fowler et al, 1988; Harris, 1992; Hedley et al, 1979; Jasty et al, 1991]. 
For instance, some believe that the strengthening of bone cement may not be beneficial to the enhance- 
ment of fixation due to the unfavorable stress distribution [Crowninshield et al., 1980; Crugnola et al., 
1979]. On the other hand, the bone cement is the weakest link between bone and prosthesis. Failure of 
the bone cement itself and interface loosening between cement and stem may be large contributing factors 
toward the failure of the fixation of the prosthesis [Amstutz et al., 1976; Carlsson etal., 1983; Harris, 
1992; Lu etal., 1993; Wykman etal., 1991]. Recent reports on the failure rate of the Boneloc® bone 
cement implanted prostheses alarmed the European orthopedic community and it was abandoned 
[Riegels-Nielsen et al., 1995]. The Boneloc® bone cement was designed to give less exothermic heat of 
polymerization but its mechanical properties were compromised [Thanner et al., 1995; Wykman et al., 
1991]. 

Cement/Prosthesis Interface 

Bone cement fixation creates two interfaces: (1) cement/bone and (2) cement-implant. According to an 
earlier report [Amstutz etal., 1976] the incidence of loosening for the femoral prostheses were evenly 
divided at about 10 and 11% for cement/bone and cement-implant interfaces, respectively. Jasty et al. 
reported that the initiation of loosening of the cemented femoral components originates from the 
prosthesis-cement interface, especially the pores trapped between them [Jasty et al., 1991]. The cement- 
implant interface loosening can be minimized by "pre-coating" with bone cement or polymethyl- 
methacrylate polymer [Ahmed etal, 1984; Barb etal, 1982; Park, 1983; Park etal, 1982; Park etal, 
1978; Raab etal., 1982]. Pre-coating can achieve a good bonding between the "cement" and prosthesis 
during the manufacturing process. During surgery, the fresh cement adheres well to the pre-coated cement 
[Park et al., 1978; Park et al, 1979]. 

Mohler et al. reported early loosening of the femoral component at the cement-prosthesis interface 
after 1941 THRs performed between 1980 and 1990 [Mohler et al, 1995]. They analyzed 29 failed hips 
(27 patients). The femoral stems were matte finished; 20 were PMMA-precoated and 9 were not pre- 
coated. Initiation of debonding took place in zone 1 of Gruen between cement and prosthesis followed 
by progressive loosening with maintenance of the bone-cement interface. Extensive osteolysis took place 
after debonding in many hips (20/29). The average time for revision between the onset of the symptoms 
after surgery was 9 months. The authors attributed the progressive loosening to the stem geometry (a 
cylindrical shape, distal to the proximal cobra shape) and matted surface finish. The cylindrical shape 
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allowed the rotation of the prosthesis, permitting the matte-finished surface to grind the bone cement. 
This results in loosening and osteolysis. It is also not likely that the PMMA-coating would help to avoid 
rotation of the prosthesis since it is only applied in the proximal region and the resistance to rotation is 
greatly compromised by the cylindrical shaped stem. Shear strength of the PMMA bone cement is about 
1/3 of the tensile strength. Therefore, if the bone cement is subjected to shear, such as rotation of the 
stem, the cement will not be able to resist the load. It has been suggested that the stem should be more 
square with a smooth surface finish. 

Malchau and Herberts reported inconclusive results on the question of the matte or polished finish 
on the survival of the implants [Malchau and Herberts, 1998]. It would be reasonable to assume that 
the matte finish may hold the implants better initially due to the larger surface area but may act to grind 
the bone cement when the implant becomes loose. 

The high strains or stresses on the cement at the tip of the prosthesis are strong indications that this 
location is a likely site for the initial event of failure [O'Connor et al., 1991]. Others reported that high 
strains at the proximal and distal tip of the prosthesis appeared to cause debonding between the stem 
and cement and was associated with radial crack initiation at the debonded surface of the pores in the 
cement surrounding the areas of debonding [Harrigan et al., 1992]. These radial cracks seemed to be 
due to stress changes secondary to debonding. 

There has been some discussion on the merits of smooth versus rough finish on the femoral stem surface 
with regards to the finish's effect on the longevity of the implant [Harris, 1992; Ling, 1992]. Recent studies 
show that the effect of the surface roughness may depend on the effectiveness of the load transfer from 
cement to the stem as shown in Fig. 46.3. The smooth surface will generate local stress peaks in the cement 
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FIGURE 46.3 Illustration of the Von Mises stress patterns in the cement mantle from smooth to the rough surface 
finish (right) and corresponding axial load transfer effectiveness [Verdonschot and Huiskes, 1998]. 
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FIGURE 46.4 Illustration of the reinforcement of the bone cement around the prosthesis to resist radial- and hoop- 
stress on the cement mantle by incorporating a wire coil conforming to the contours of the femoral stem which can 
be placed during surgery or prefabricated during manufacturing of the implant [Kim et al., 1994]. 

mantle as well as ineffective load transfer. The ineffective (rough) irregular surface which has shallow 
profiles will be ineffective in axial load transfer and will act as a stress inducer. The effective load transfer 
profile will have deeper surface irregularity as shown in Fig. 46.3c [Verdonschot and Huiskes, 1998]. 

The cement/prosthesis interface strength could be enhanced by placing reinforcing material around 
the stem to resist the radial- and hoop-stress caused by body loading [Park, 1993]. It is also critical that 
the cement is reinforced to counteract the hoop-stress and to some extent the radial stress created by the 
prosthesis when it is loaded [Ahmed et al., 1984; Mann et al., 1991]. Therefore, a coil or mesh of wire 
conforming to the contours of the prosthesis could be placed prior to bone cement injection during 
surgery as shown in Fig. 46.4. It could also be pre-fabricated around the prosthesis. Also, some advocate 
the use of a mesh reinforcement around the prosthesis which can be fixed with fresh bone cement at the 
time of surgery. This mesh reinforcement is similar to the coil but is more difficult to incorporate 
[Davidson, 1988; Willert et al., 1991]. The simple wire coil would resist the hoop-stress developed by the 
prosthesis, which in turn will decrease the stress on the bone cement beyond the wire coil toward the bone. 

Changes to the mechanical property of the bone cement by reinforcing the cement with a wire coil in 
a simulated stem loading condition has been tried [Kim and Park, 1994] instead of the traditional 
reinforcement of adding wires, fibers, etc. to the bone cement [Saha and Pal, 1984]. The coils were made 
from 20 gage 302/304 stainless steel wire and were very effective counteracting the static radial- and 
hoop-stress as given in Table 46.6. The magnitude of the ultimate load of the control specimens (3.70 ± 
1.13 kN) is similar to the ones obtained by Mann et al. (~32 kN), whose specimen was 8 times longer 
[Mann et al., 1991]. If one extrapolates the ultimate load of the wire reinforced specimens to that of 
Mann's work (which is closer to the in vivo condition) the load to fracture would be about 72 kN; well 
above the normal loading on the prosthesis even in dynamic loading (6~8 times body weight; 4 kN) 
[Paul, 1976]. The percentage increases of improvements in mechanical properties are similar or higher 



TABLE 46.6 Summary of the Static Mechanical Property Measurements 




Specimen Type 


Maximum Strain (%) 


Maximum Load (kN) 


Stiffness (GNm/m) 


Toughness (Nm/m) 


Control 


2.6 ± 0.5 


3.70 ± 1.13 


1.73 ± 0.39 


49.33 ± 18.63 


Wire reinforced 


3.2 ± 0.9 


7.68 ± 2.33 


4.17 ± 1.16 


129.48 ± 65.3 


% Change 


+22.7 


+ 107.6 


+ 141.0 


+ 162.5 



Source: Kim and Park, 1994. 
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than other studies although the increases are not as high as predicted by the composite theory [Taitsman 
and Saha, 1977; Topoleski et al., 1990]. 

The fatigue test showed a substantial increase in its cyclic load to failure by the wire coil specimens 
[Kim and Park, 1996]. It is believed that the beneficial effects of the wire (fiber) layer lies in its ability 
to inhibit the propagation of fatigue cracks [Manley and Stern, 1985; Manley et al., 1987]. At high stress 
levels in the cement mantle, fatigue cracks could initiate from defects such as air bubbles and surface 
discontinuities, and propagate gradually over a large number of load cycles. The presence of the fiber or 
wire coil increases the energy necessary for crack propagation by crack tip blunting, crack deflection, 
and the bridging of cracks by intact fibers [Taylor, 1987; Taylor et al., 1989]. 

Another observation gained from fatigue testing was a subsidence of the stem. The subsidence is due 
to the creep of the viscoelastic materials such as bone and bone cement [Park and Lakes, 1992]. The net 
effect from the creep would be a continued subsidence of the implant shortening the leg [Kim and Park, 
1996; Manley et al., 1987]. 

All measured mechanical properties show that the wire coil reinforcement significantly enhanced the 
strength, fracture strain, stiffness, and toughness over the control [Kim and Park, 1996]. The most 
significant increases were in the toughness, indicating that the coil reinforced cement mantle will resist 
the load much more than the non-reinforced control. This may result in a prolonged fixation life of the 
prosthesis, which is the ultimate goal of all arthroplasties. 

Cement-Bone Interface 

The problems at the bone-cement interface cannot be easily overcome since these problems arise from 
the intrinsic properties of the bone cement as well as extrinsic factors such as cementing technique, 
condition of the bone surface, etc. The toxicity of the monomer, inherent weakness of the cement as a 
material due to the unavoidable inclusion of the pores (Table 46.7), and blood and tissue debris mixed 
during surgery can contribute to the problem of loosening at the bone-cement interface [Park, 1983]. 

The bone-cement interface strength may be enhanced by growing bone into the cement after fixation. 
Bone cement can be used for immediate fixation yet provides tissue ingrowth space later by incorporating 
resorbable particles (such as inorganic bone particles) as shown conceptually in Fig. 46.5. Some studies 
[Dai et al, 1991; Henrich et al, 1993; Liu et al, 1987; Park et al., 1986] indicate that the concept can be 
used effectively, at least in rabbit and canine models [Kwon et al., 1997]. In one study the bone-particle 
impregnated bone cement was used to fix femoral stem prostheses in the femora of dogs (one side was 
experimental, the other was control) and after a pre-determined time period, the femora were harvested 
and sectioned into disks for the push-out test to measure the interfacial strength between the bone and 
cement interface [Dai et al., 1991]. The results as shown in Fig. 46.6 indicate that the experimental side 
increased in strength for up to five months while the control side decreased slightly. The histology also 
showed the integration of tissues into the spaces of the dissolved particles. Of course, care should be 
taken to control the amount of particles to balance out the increased viscosity and decreased strength. 
Table 46.7 illustrates the relationship between the amount of resorbable particle (inorganic bone particles, 

TABLE 46.7 Effect of Inorganic Bone Particle Mixed with Acrylic 
Bone Cement on the Pore Size, Porosity, and Tensile Strength 



Particle 


Pore Size 


Porosity 


Tensile Strength 


Amount (%) 


(urn) 


(%) 


(MPa) 





154.7 ± 72 


7.2 ± 2.5 


23.2 ± 2.3 


10 


160.6 ± 68 


5.0 ± 1.7* 


22.6 ± 2.0* 


20 


172.9 ± 52* 


4.9 ± 1.5* 


20.1 ± 1.1* 


30 


218.0 ± 92* 


2.4 ± 0.7* 


19.7 ± 1.1* 



Note: 15 specimens for each group. 

* Statistically significant vs. control (p < 0.05). 

Source: Kim et al, 1994. 
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FIGURE 46.5 Basic concept of the bone cement with resorbable particle fixation. Immediate fixation is achieved 
as in the ordinary bone cement with the enhanced biological fixation to be achieved later for a prolonged stabilization 
of the implant [Park, 1992 and 1995]. 
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FIGURE 46.6 Maximum interfacial shear strength between bone and bone cement versus implant period when 
the femoral stems were implanted with ordinary bone cement and cement with bone particles. In both cases the 
interfacial strength was stabilized after 5 months for this canine model [Dai et al., 1991]. 

100-300 |_im) and the porosity and pore size [Kim et al., 1994]. As more particles are incorporated onto 
the bone cement, a lesser amount of porosity resulted, however, the average pore size increased. The 
tensile strength decreased with an increased amount of bone particles. Fatigue properties improved with 
higher particle inclusion, however, it was found that about 30% (by weight) of bone particles can provide 
sufficient interconnected porosity for bony ingrowth and yet give reasonable compromises to other 
parameters [Liu et al., 1987]. The bone-particle impregnated bone cement has been used clinically but 
its long-term success remains to be proven since it has been only five years without any failures [Dai, 
1991 and 1996]. 
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Uncemented Fixation 

Efforts to develop a viable interface between the tissue and implants have been made ever since Moore 
designed a femoral prosthesis which had a fenestrated large hole in the proximal region [Moore, 1952]. 
Ironically, the fixation itself is by the passive mechanical fixation technique as discussed earlier. Smith 
[Smith, 1963] tried to develop a bone substitute with porous aluminate ceramic impregnated with an 
epoxy resin called Cerocium®. Although the material demonstrated a good adherence to the tissues, the 
pore size (average 18 um diameter) was too small to allow any bony tissue ingrowth. Later, ceramics 
[Klawitter and Hulbert, 1972; Oonishi et al, 1989; Predecki et al, 1972], metals [Hirshhorn et al, 1972; 
Niles and Lapitsky, 1975], and polymers [Sauer et al., 1974] were used to test the ingrowth idea. Basically, 
any biocompatible material will allow bony tissues to grow into any space large enough to accommodate 
osteons. However, to be continuously viable the space must be large enough (more than 75 um in diameter 
for the bony tissues) and contiguous to the surface of bone matrix [Heimke, 1990]. In addition, Wolff's 
law dictates that the ingrown tissues should be subjected to bodily loading in order to prevent resorption, 
even after the initial ingrowth has taken place. The same principle also makes it difficult to have a uniform 
tissue ingrowth throughout the implant surface. This is the reason why the tissue ingrowth takes place 
where the stress transfer occurs, e.g., in the distal lateral region of the femoral stem of the artificial hip 
joint. 

Figure 46.7 shows the general trends of the fixation (interfacial) strength variation with implantation 
period in animals to be up to six months for metallic implants (Co-Cr alloys, Ti and its alloys, and 
stainless steel). A few observations can be made from this data. First, the maximum interfacial shear 
strength between bone and porous implant peaked at about 12 to 13 MPa in about 6 to 8 weeks regardless 
of the implant material, animal type, or location of the implants (i.e., cortical or cancellous bone and 
transcortical or intramedullary canal). Second, the data are widely scattered. (The original data showed 
wide variations, therefore, they are not distinguished among the variables). The decrease in interfacial 
strength with time after the peak for the metallic implants is somewhat distressing since this may also 
be true with human patients. This may be caused by two factors. The first is that of bone resorption 
from the initially ingrown area due to the stress shielding. The second is the release of a large number 
of metal ions due to the increased surface area by making the implant porous. Porous polymers did not 
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FIGURE 46.7 Maximum interfacial shear strength between bone and porous metal implants versus implant of 
various metals and animals, data from [Spector, 1982]. 
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FIGURE 46.8 Maximum interfacial shear strength between bone and porous polymeric implants versus implant. 
[Data from Spector, 1982]. 

have the decreasing trends as shown in Fig. 46.8 [Spector, 1982]. This may be interpreted as the lack of 
stress shielding (after bone is ingrown into the pores) due to the low modulus of the polymers. However, 
there are not enough data available to come to a definitive conclusion [Keet and Runne, 1989; Sadr and 
Arden, 1987; Spector, 1982; Tullos et al, 1984]. 

Further problems with biological fixation are: (1) the unforgiving nature of the surgery, (2) the long 
immobilization time for tissue ingrowth, (3) the unpredictable time to ambulate, (4) the difficulty in 
eradicating infection, and (5) once the interface is destroyed by an accidental overloading, it cannot be 
regrown with certainty. Moreover, the porous coating may weaken the underlying prosthesis itself and, 
in case of metals, there is an increased danger of corrosion fatigue due to the increased surface area 
[Brand, 1987; Morscher, 1984]. Due to these problems and clinical results of porous fixation some 
investigators have insisted that the bone cement fixation technique is still the gold standard at this time 
[Johnston, 1987; Wroblewski, 1993]. 

In order to alleviate these problems several modifications have been tried. 

Pre-Coating the Metallic Porous Surface with Ceramics or Carbons 

This method has been tried with limited success [Cranin et al., 1972; Thomas et al., 1985]. The problem 
of coating deep in the pores and the thermal expansion difference between the metal and ceramic 
materials, both make a uniform and good adherent coating very difficult. An attractive material for 
coating is hydroxyapatite [Ca 10 (PO 4 ) 6 (OH),] ceramic which is similar to bone mineral. It is not yet 
conclusive that this material is more beneficial than other ceramics. Some preliminary studies in our 
laboratory indicate that during the early period of fixation, the bioactive hydroxyapatite coating may be 
more beneficial, but the effect may diminish after three weeks as shown Fig. 46.9. The result is for the 
simple cortical bone plug experiment performed on canine femora [Park, 1988]. Others have shown 
promising results with the same material system used on rabbits instead of canines as in our study 
[Oonishi et al, 1989]. 

Pre-Coating with Porous Polymeric Materials on Metal Stem 

Theoretically, this method has two distinct advantages over the ceramics or carbon coating methods 
discussed above [Keet and Runne, 1989; Sadr and Arden, 1987; Spector, 1982; Tullos et al., 1984; Homsy 
et al., 1972]. First, the low modulus polymeric material could transfer the load from the implant to the 
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FIGURE 46.9 Maximum interfacial shear strength between bone and bioactive ceramic coated porous plug implants 
versus implant period when the plugs with and without (control) coating were implanted in the cortices of canine 
femora [Park, 1988]. 

bone more gradually and evenly and thus, prevent stress-shielding effects on the ingrown tissues. Second, 
this method would prevent the metallic-surface ion release, i.e., less corrosion of the metal. One major 
problem with this technique is the weak interfacial strength between the polymer and metallic stem, 
especially in dynamic loading conditions in vivo. 

Enhancement of Porous Ingrowth with Electrical or Electromagnetic Stimulation 

This technique combines the porous ingrowth with the stimulating effects of electrical and/or electro- 
magnetic stimulation [Park and Kenner, 1975; Weinstein etal., 1976]. Direct current stimulation can 
indeed accelerate tissue ingrowth in the early stages of healing but its effect diminishes over time 
[Weinstein et al, 1976]. Direct current stimulation has one distinct problem: the invasive nature of the 
power source. The pulsed electromagnetic field stimulation is a better method since the stimulation can 
be carried out extracorporeally. 

Porous Ingrowth with the Use of Filling Materials 

There has been some effort to use filling materials around the porous implant since it is very difficult to 
prepare the tissue bed with the exact shape of the prosthesis to eliminate the micro-motion of the 
prosthesis after implantation. Bone matrix proteins (BMP) or demineralized bone and hydroxyapatite 
crystals can be used for this purpose [Parsons et al., 1987]. The success of this technique has not been 
fully documented. 

Some glass-ceramics tend to achieve direct bonding with the bone through a selective dissolution of 
the surface film [Blencke et al., 1978; Hench and Paschall, 1973; Yamamuro et al., 1990]. Some researchers 
have reported a direct bonding of tissues to hydroxyapatites [Geesink et al., 1987; Kay, 1988]. It has been 
difficult to coat a metallic surface with these materials due to the large difference in the thermal expansion 
coefficient between the coating and base materials although dip-coating can be utilized [West et al., 1990] . 
The glass-ceramics have not been used to make load bearing implants due to their brittleness. 

46.3 Articulating Surface of the Acetabular Cup 
and Femoral Head 

Any articulating joint will involve the friction and wear of the two opposing members. Ever since the 
total joint replacement has been practiced this subject gave the orthopedic community pause and posed 
a considerable problem to the biomedical engineers due to the fact that any material or design can avoid 
this subject. The lowest friction coefficient material such as polytetrafluoroethylene (PTFE), best known 
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as Teflon®, is not sufficient as a member of a joint as it was proved to be disastrous by Dr. J. Charnley. 
Earlier than Charnley, the all metal-metal combinations were used but abandoned. Some researchers 
advocate it again [Bohler, 1995; Muller, 1995] claiming that smaller head design resulted in drastically 
reduced wear after 10 years of implantation (60, 1330, and 2100 urn for CoCrMo/CoCrMo, A1 2 3 /UHM- 
WPE, and CoCrNo/UHMWPE, respectively). Saikko reported wear of the UHMWPE cup and five 
different Al 2 0, ceramic head combinations and found a vastly different wear volume with a different 
femoral head design and stem fit [Saikko, 1995]. The head on these implants was attached to the Ti-alloy 
stem by taper-fit. The wear particles from the ceramic and head resulted in severe third-body abrasive 
wear (3170 mg) compared to minimal wear of others (5.3 to 124 mg) after 26 million cycles. Obviously 
there was a loosening between the head and the stem collar of the femoral implant for the worst case 
(Protek®) since others had similar taper-fit. In this respect, the trunnion design of the head and stem 
should be avoided at all possible costs to eliminate the possibility of third-body wear. Modularity of the 
head of the femoral stem is sound in principle since it could reduce inventory while providing the surgeons 
with versatility. The full benefits of such flexibility have not been realized in actual usage. Third body 
wear of the acetabular cup, increased chance of corrosion, and less than optimal design (although the 
optimal design remains elusive) contribute to the problem of the modularity [Chmell et al., 1995; Jacobs 
et al., 1995; Salvati et al., 1995]. 

Similar problems of third-body wear exist for the cemented polyethylene acetabular cup and metal- 
baked (porous coated) cup. PMMA bone cement particles were found in autopsies for the cemented 
cups. Ritter [Ritter, 1995] analyzed uncemented metal-backed vs. a cemented polyethylene cups and 
found statistically significant better clinical performance of the cemented cup in terms of the linear wear 
(0.05 vs. 0.11 mm/yr). There were also differences in the wear between machined and molded cups, 
indicating that the molding resulted in better polyethylene powder consolidation than the raw material 
used in the machine from rod stocks. 

It is also now widely recognized that gamma radiation causes the polyethylene to lose its original 
chemical and mechanical properties due to the oxidation about 1 mm below the surface after about 
three years of implantation [Li et al., 1995; Sutula et al., 1995] . This increases the density by crystallization 
but causes brittleness of the material. Increased localized wear could result. Inert or vacuum sterilization 
may minimize the problem unless the sterilization method is changed. 

Conclusions 

In summary, the wear of the articulating surface in a complex phenomenon like THR could be minimized 
by having the most consolidated polyethylene (molded) as a cup material with a small femoral head 
(either ceramic or metal) cemented with bone cement. Although some advocate the use of "hybrid" 
fixation techniques, i.e., cemented stem and uncemented cup, the final verdict is not yet in. A metal- 
backed polyethylene lined cup would cause more problems, either cemented or uncemented due to the 
increased complexity and decreased shock absorbing capacity of the cup during walking. The same is 
true with modularity of the femoral stem and head. 

It is strongly recommended that all nations follow the example of the Norwegian Arthroplasty Register 
in obtaining meaningful outcome data of all implantations. Thus, the clinical performance of a particular 
implant could be determined and disasters like the Boneloc® bone cement and Christianson prosthesis 
could be avoided more quickly [Linder, 1995]. 

Orthopedic surgeons often suggest that the fixation of implants should not be too strong since it would 
be difficult to remove, should it become necessary to have a revision arthroplasty to restore its function. 
The two requirements could be mutually exclusive since the interface must be strong enough to hold the 
prosthesis in vivo but not so strong that it may impair any remedial measures. 

The assessment of success or failure of the prosthesis fixation could be quite difficult clinically as 
demonstrated by some researchers. For example, some [Ling, 1992] contend that the relative motion 
between the implant and bone cement and subsidence of the prosthesis are not a direct cause of failure 
of arthroplasties while others consider it a failure [Harris, 1992]. 
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Another problem is related to the more frequent use of the implants for younger patients, which 
requires firmer fixation and longer implant life [Park et al., 1995]. The original expectation of uncemented 
porous ingrowth fixation for the young is tempered somewhat and we need to explore a longer lasting 
method of fixation. 

Acknowledgments 

This work is partially funded by the Biomaterials Research Fund, and the University of Iowa Research 
Foundation. A major part of the paper was presented at the International Symposium on Biomaterials 
and Drug Delivery Systems held at the Korea Research Institute of Chemical Technology in Taejon, Korea, 
July 4-5, 1996. 

References 

Ahmed, A. M., Raab, S., and Miller, J. E. (1984). Metal/cement interface strength in cemented stem 

fixation. /. Orthop. Res. 2, 105-118. 
Amstutz, H. C, Markolf, K. L., McNeices, G. M., and Gruen, T A. W. (1976). Loosening of total hip 

components: Cause and prevention. In: The Hip (St. Louis: Mosby), pp. 102-116. 
Anthony, P. P., Gie, G. A., Howie, C. R., and Ling, R. S. M. (1990). Localised endosteal bone lysis in relation 

to the femoral components of cemented total hip arthroplasties. /. Bone Joint Surg. 72B, 971-979. 
Barb, W., Park, J. B., von Recum, A. F., and Kenner, G. H. (1982). Intramedullary fixation of artificial 

hip joints with bone cement precoated implants: I, Interfacial strengths. /. Biomed. Mater. Res. 16, 

447-458. 
Bhamra, M., Rao, G. S., and Robson, M. (1996). Hydroxyapatite-coated hip prostheses: Difficulties with 

revision in four cases. Acta Orthop. Scand. 67, 49-52. 
Blencke, B. A., Bromer, H., and Deutscher, K. K. (1978). Compatibility and long-term stability of glass 

ceramic implants. /. Biomed. Mater. Res. 12, 307-316. 
Bohler, N. (1995). Metal/metal articulating interfaces. Orthoped. 18, 879-880. 
Brand, R. A. (1987). The Hip: Non-Cemented Hip Implants (St. Louis: C.V. Mosby Co). 
Carlsson, A. S., Gentz, C. F., and Lindberg, H. O. (1983). Thirty-two noninfected total hip arthroplasties 

revised due to stem loosening. Clin. Orthop. Rel. Res. 181, 196-203. 
Charnley, J. (1970). Acrylic Cement in Orthopaedic Surgery (Edinburgh: Churchill/Livingstone). 
Charnley, J. (1972). The long-term results of low- friction arthroplasty of the hip, performed as a primary 

intervention. /. Bone Joint Surg. 54B, 61-76. 
Charnley, J. and Cupic, Z. (1973). The nine and ten year results of the low-friction arthroplasty of the 

hip. Clin. Orthop. Rel. Res. 95, 9-25. 
Chmell, M. J., Rispler, D., and Poss, R. (1995). The impact of modularity in total hip arthroplasty. Clin. 

Orthop. Rel. Res. 319, 77-84. 
Cranin, A. N., Schnitman, P. A., Rabkin, M., Dennison, T, and Onesto, E. J. (1972). Alumina and zirconia 

coated Vitailium oral endosteal implants. /. Biomed. Mater. Res. Symposium No. 6, 257-262. 
Crowninshield, R. (1988). An overview of prosthetic materials for fixation. Clin. Orthop. Rel. Res. 235, 

166-172. 
Crowninshield, R. D., Brand, R. A., Johnston, R. C, and Milroy, J. C. (1980). An analysis of femoral 

component stem design in total hip arthroplasty. /. Bone Joint Surg. 62A, 68-78. 
Crugnola, A., Ellis, E. J., Radin, E. L., and Rose, R. M. (1979). A second generation of acrylic cements. 

Trans. Soc. Biomat. 3, 91. 
Dai, K. R. (1991 and 1996). Personal communications. 
Dai, K. R., Liu, Y. K., Park, J. B., and Zhang, Z. K. (1991). Bone particle impregnated bone cement: An 

in vivo weight-bearing study. /. Biomed. Mater. Res. 25, 141-156. 
Dall, D. M., Learmonth, I. D., Solomon, M., and Davenport, M. (1993). 811 Charnley hips followed for 

3-17 years. Acta Orthop. Scand. 64, 252-256. 

© 2000 by CRC Press LLC 



Davidson, J. A. (1988). Bone cement reinforcement and method. In Patent # 4,735,625 (USA). 

de Groot, K. (1983). Bioceramics of Calcium Phosphate (Boca Raton, FL: CRC Press). 

Dowd, J. E., Schwendeman, L., Macaulay, W., Doyle, J. S., Shanbhag, A. S., Wilson, S., Herndon, J. H., 

and Rubash, H. E. (1995). Aseptic loosening in uncemented total hip arthroplasty in a canine 

model. Clin. Orthop. Rel. Res. 319, 106-121. 
Ducheyne, P. (1988). Prosthesis fixation for orthopedics. In: Encyclopedia of Medical Devices and Instru- 
mentation, J. G. Webster, Ed. (N.Y.: Wiley-Interscience), 2146-2154. 
Eftekhar, N. S. (1978). Principles of Total Hip Arthroplasty (St. Louis: Mosby). 
Espehaug, B., Havelin, L. I., Engesaeter, L. B., Vollset, S. E., and Langeland, N. (1995). Early revision 

among 12,179 prostheses: A comparison often different brands reported to the Norwegian arthro- 
plasty register, 1987-1993. Acta Orthop. Scand. 66, 487-493. 
Foerch, R., Kill, G., and Walzak, M. (1994). Plasma Surface Modification of Polymers. In: Plasma Surface 

Modification of Polyethylene: Short-Term vs. Long-Term Plasma Treatment, M. Strobel, C. Lyons and 

K. L. Mittal, Eds. (Utrecht, Netherlands: VSP Press), 99-111. 
Fowler, J. L., Gie, G. A., Lee, A. J. C, and Ling, R. S. M. (1988). Experience with Exeter total hip 

replacement since 1970. Orthop. Clin. N. Am. 19, 477-489. 
Geesink, R. G., de Groot, K., and Klein, C. (1987). Chemical implant fixation using hydroxyaptite coatings. 

Clinic. Orthop. Rel. Res. 226, 147-170. 
Gruen, T. S., McNeice, G. M., and Amstutz, H. A. (1979). "Modes of failure" of cemented stem-type 

femoral components. Clin. Orthop. Rel. Res. 141, 17-27. 
Gul, R. M., Muratoglu, O. K., McGarry, F. J., Bragdon, C. R., Jasty, M., and Harris, W. H. (1998). The 

effect of the peroxide content on the cross-link density, mechanical properties, and wear behavior 

of UHMWPE. In: Orthop Res. Soc. (New Orleans, LA), 99. 
Harrigan, T. P., Kareh, J. A., O'Connor, D. O, Burke, D. W., and Harris, W. H. (1992). A finite element 

study of the initiation of failure of fixation in cemented femoral total hip components. /. Orthop. 

Res. 10, 134-144. 
Harris, W. H. (1992). Is it advantageous to strengthen the cement-metal interface and use a collar for 

cemented femoral components of total hip replacements? Clin. Orthop. Rel. Res. 285, 67-72. 
Harris, W H. (1995). The problem is osteolysis. Clin. Orthop. Rel. Res. 311, 46-53. 
Harris, W. H. and McGann, W. A. (1984). Loosening of the femoral component after use of the medullary- 
plug cementing technique. /. Bone Joint Surg. 67B, 222-224. 
Harris, W. H. and Penenberg, B. L. (1987). Further follow-up on socket fixation using a metal-backed 

acetabular component for total hip replacement. A minimum ten year follow-up study. /. Bone 

Joint Surg. 69A, 1140-1143. 
Havelin, L. I., Espehaug, B., Vollset, S. E., and Engesaeter, L. B. (1994). Early failures among 14,009 

cemented and 1,326 uncemented prostheses for primary coarthrosis: The Norwegian Arthroplasty 

Register, 1987-1992. Acta Orthop Scand 65, 1-6. 
Havelin, L. I., Vollset, S. E., and Engesaeter, L. B. (1995). Revision for aseptic loosening of uncemented 

cups in 4,352 primary total hip prostheses: A report from the Norwegian arthroplasty register, 

1987-1993. Acta Orthop. Scand. 66, 494-500. 
Hedley, A. K., Moreland, J., Bloebaum, R. D., Coster, I., Gruen, T, and Clarke, I. (1979). Press-fit, 

cemented, and bone ingrowth surface replacement-canine fixation model. Trans. Orthop. Res. Soc. 

4, 163. 
Heimke, G. E. (1990). Osseo-Integrated Implants, vol. I & II (Boca Raton, FL: CRC Press). 
Hench, L. L. and Paschall, H. A. (1973). Direct chemical bond of bioactive glass-ceramic materials to 

bone and muscle. /. Biomed. Mater. Res. Symposium No. 4, 25-43. 
Henrich, D. E., Cram, A. E., Park, J. B., Liu, Y. K., and Reddi, H. (1993). Inorganic bone and bone 

morphogenetic protein impregnated bone cement: A preliminary in vivo study. /. Biomed. Mater. 

Res. 27, 277-280. 
Hirshhorn, J. S., McBeath, A. A., and Dustoor, M. R. (1972). Porous titanium surgical implant materials. 

/. Biomed. Mater. Res. Symposium No. 2, 49-69. 



e 2000 by CRC Press LLC 



Homsy, C. A., Cain, T. E., Kessler, F. B., Anderson, M. S., and King, J. W. (1972). Porous implant systems 

for prosthetic stabilization. Clinic. Orthop. Rel. Res. 89, 220-231. 
Jacobs, J. J., Urban, R. M., Gilbert, J. L., Skipor, A. K., Jasty, M., Black, J., and Galante, J. O. (1995). Local 

and distant products from modularity. Clin. Orthop. Rel. Res. 319, 94-105. 
Jasty, M., Maloney, W. J., Bragdon, C. R., O'Connor, D. O., Haire, T., and Harris, W. H. (1991). The 

initiation of failure in cemented femoral components of hip arthroplasties. /. Bone Joint Surg. 73B, 

551-558. 
Johnston, R. C. (1987). The case for cemented hips. In The Hip, R. A. Brand, Ed. (St. Louis: C.V. Mosby 

Co.), 351-358. 
Kang, Y. H. (1998). Pre-coating of Ultra-High Molecular Weight Polyethylene (UHMWPE) with Polym- 
ethylmethacrylate (PMMA). In: Biomedical Engineering (Iowa City: The University of Iowa), 1-160. 
Kang, Y. H. and Park, J. B. (1998). Interfacial Tensile Strength of PMMA Pre-coated UHMWPE and Bone 

Cement. /. Biomed. Mat. Res. (Appl. Biomat.) 43, 261-269. 
Kay, J. F. (1988). Bioactive surface coatings: Cause for encouragement and caution. /. Oral Implant 16, 

43-54. 
Keet, G. G. M., and Runne, W C. (1989). The anaform endoprosthesis: A proplast-coated femoral 

endoprosthesis. Orthop. 12, 1185-1190. 
Khang, G., Kang, Y, Lee, H., and Park, J. (1996). Improved bonding strength of polyethylene/polyme- 

thylmethacrylate bone cement: A preliminary study. Biomed. Mater. Eng. 6, 335-344. 
Kim, J. K. and Park, J. B. (1996). Fatigue properties and stem subsidence of wire coil reinforced PMMA 

bone cement: An in vitro study. Biomed. Mater. Eng. 6, 453-462. 
Kim, J. K. and Park, J. B. (1994). Reinforcement of bone cement around prostheses by precoated wire 

coil: A preliminary study. Biomed. Mater. Eng. 4, 369-380. 
Kim, J. K. and Park, J. B. (1996). Reinforcement of bone cement around the femoral prosthesis tip by 

pre-coated wire coil: A human cadaver bone study. Biomed. Mater. Eng. 6, 159-164. 
Kim, Y S., Kang, Y. H., Kim, J. K., and Park, J. B. (1994). Effect of bone mineral particles on the porosity 

of bone cement. Biomed. Mater. Eng. 4, 1-10. 
Klawitter, J. J., and Hulbert, S. F. (1972). Application of porous ceramics for the attachment of load 

bearing internal orthopedic applications. /. Biomed. Mater. Res. Symp. No. 2, 161-229. 
Kwon, S. Y, Kim, Y. S., Woo, Y K, Kim, S. S., and Park, J. B. (1997). Hydroxyapatite impregnated bone 

cement: In vitro and in vivo studies. Biomed. Mar. Eng. 7, 129-140. 
Li, S., Chang, J. D., Barrena, E. G., Furman, B. D., Wright, T. M., and Salvati, E. (1995). Nonconsolidated 

polyethylene particles and oxidation in Charnley acetabular cups. Clin. Orthop. Rel. Res.319, 54-63. 
Linder, L. (1995). Boneloc® — the Christiansen experience revisited, Editorial. Acta Orthop. Scand. 66, 

205-206. 
Ling, R. S. M. (1992). The use of a collar and precoating on cemented femoral stems is unnecessary and 

detrimental. Clin. Orthop. Rel. Res. 285, 73-83. 
Liu, Y. K., Njus, G. O., Park, J. B., and Stienstra, D. (1987). Bone particle impregnated bone cement I. 

In vitro study. /. Biomed. Mater. Res. 21, 247-261. 
Lombardi, A. V., Mallory, T H., Eberle, R. W, Mitchell, M. B., Lefkowitt, M. S., and Williams, J. R. ( 1995). 

Failure of intraoperatively customized non-porous femoral components inserted without cement 

in total hip arthroplasty. /. Bone Joint Surg. 77A, 1836-1844. 
Lu, Z., Ebramzadeh, E., and Sarmiento, A. (1993). The effect of failure of the cement interfaces on gross 

loosening of cemented total hip femoral components. Trans. Orthop. Res. Soc. 18, 519. 
Malchau, H. and Herberts, P. (1998). Prognosis of total hip replacement: Revision and re-revision rate 

in IHR: A revision-risk study of 148,359 primary operations. In 65th Annual Ame. Acad. Orthop. 

Surg. (New Orleans, LA). 
Maloney, W. J., Jasty, M., Rosenberg, A., and Harris, W. H. (1990). Bone lysis in well-fixed cemented 

femoral components. /. Bone Joint Surg. 72B, 966-970. 
Maloney, W. J. and Smith, R. L. (1995). Periprosthetic osteolysis in total hip arthroplasty: The role of 

particulate wear debris. /. Bone Joint Surg. 11 A, 1448-1460. 

© 2000 by CRC Press LLC 



Manley, M. T. and Stern, L. S. (1985). The load carrying and fatigue properties of the stem-cement 

interface with smooth and porous coated femoral components. /. Biomed. Mater. Res. 19, 563-575. 
Manley, M. T., Stern, L. S., Kotzar, G., and Stulberg, B. N. (1987). Femoral component loosening in hip 

arthroplasty. Acta Orthop. Scand. 58, 485-490. 
Mann, K. A., Bartel, D. L., Wright, T. M., and Ingraffea, A. R. (1991). Mechanical characteristics of stem- 
cement interface. /. Orthop. Res., 798-808. 
Marrs, H., Barton, D. C, Ward, I. M., Doyle, C, Fisher, J., and Jones, R. A. (1998). Comparative wear 

under three different tribological conditions of acetylene crosslinked ultra high molecular weight 

polyethylene. In: Orthop Res. Soc. (New Orleans, LA), 100. 
McKellop, H. A., Shen, F. W., and Salovey, R. (1998). Extremely low wear of gamma crosslinked/remelted 

UHMW polyethylene acetabular cups. In: Orthop Res. Soc. (New Orleans, LA), 98. 
Mittlemeier, H. (1976). Anchoring hip endoprosthesis without bone cement. In: Advances in Artificial 

Hip and Knee Joint Technology, M. Schaldach and D. Hohmann, Eds. (Berlin: Springer- Verlag), 

387-402. 
Mittlemeier, H. (1975). New development of wear- resistant ceramic and metal composite prostheses with 

ribbed support shafts for cement-free implantation. Hefte zur Unfallheilkunde: Beihefe zue 

Monatsschrift fur Unfallheilkunde. Verischerings, Versorgungs und Verkehrsmedizin 126, 333-336. 
Mohler, C. G., Callaghan, J. J., Collis, D. K., and Johnston, R. C. (1995). Early loosening of the femoral 

component at cement-prosthesis interface after total hip replacement. /. Bone Joint Surg. 77A, 

1315-1322. 
Moore, A. T. (1952). Metal hip joint: A new self-locking Vitallium prosthesis. South. Med. J. 45, 10-15. 
Morscher, E. (1984). The Cementless Fixation of Hip Endoprosthesis (Heidelberg: Springer- Verlag). 
Morscher, E. W. (1995). The clinical significance of implant stiffness. Orthoped. 18, 795-800. 
Muller, M. E. (1995). The benefits of metal-on-metal total hip replacements. Clin. Orthop. Rel. Res. 311, 

54-59. 
Mulroy, R. D. and Harris, W. H. (1990). The effect of improved cementing techniques on component 

loosening in total hip replacement: An eleven year radiographic review. /. Bone Joint Surg. 72B, 757-760. 
Niles, J. L. and Lapitsky, M. (1975). Biomechanical investigations of bone-porous carbon and metal 

interfaces. Biomed. Mater. Res. Symposium 4, 63-84. 
Nilsen, A. R. and Wiig, M. (1996). Total hip arthroplasty with Boneloc®: Loosening in 102/157 cases 

after 0.5-3 years. Acta Orthop. Scand. 67, 57-59. 
O'Connor, D. O., Burke, D. W., Sedlacek, R. C, and Harris, W. H. (1991). Peak cement strains in cemented 

femoral total hip. Trans. Orthop. Res. Soc. 16, 220. 
Onsten, I., Sanzen, L., Carlsson, A., and Besjakov, J. (1995). Migration of uncemented, long-term femoral 

components in revision hip arthroplasty: A 2-8 year clinical follow-up of 45 cases and radiostere- 

ometric analysis of 13 cases. Acta Orthop. Scand. 66, 220-224. 
Oonishi, H., Saito, M., and Kadoya, Y. (1998). Wear of high-dose gamma irradiated polyethylene in total 

joint replacement. Long-term radiological evaluation. In: Orthop Res. Soc. (New Orleans, LA), 97. 
Oonishi, H., Yamamoto, M., Ishimaru, H., Tsuji, E., Kushitani, S., Aono, M., and Ukon, Y (1989). 

Comparisons of bone ingrowth into porous Ti-6A1-4V beads uncoated and coated with hydroxya- 

patite. In: Bioceramics, H. Ooonoishi, H. Aoki and K. Sawai, Eds. (Tokyo and St. Louis: Ishiyaku 

EuroAmerica, Inc.), 400-405. 
Park, J. B. (1983). Acrylic bone cement: In vitro and in vivo property-structure relationship — A selective 

review. Annals Biomed. Eng. 11, 297-312. 
Park, J. B. (1983). Implant fixation by pulsed electromagnetic field stimulation. Unpublished study (Iowa 

City, IA: The University of Iowa). 
Park, J. B. (1988). In vivo evaluation of hydroxyapatite coated implant. Unpublished study (Iowa City: 

The University of Iowa). 
Park, J. B. (1992). Orthopedic Prosthesis Fixation. Annals Biomed. Eng. 20, 583-594. 
Park, J. B. (1995). Orthopedic Prosthesis Fixation. In: Biomedical Engineering Handbook, J. D. Bronzino, 

Ed. (Boca Raton, FL: CRC Press), 704-723. 



© 2000 by CRC Press LLC 



Park, J. B. (1993). Reinforcement of bone cement around prosthesis by pre-coated wire coil/wire mesh. 

In Patent Disclosure. 
Park, J. B. (1998). Sintering of pure and cross-linked UHMWPE, MMA, and PMMA treated powders to 

create graded total joint implants for better fixation and wear. In Patent Disclosure (USA). 
Park, J. B., Barb, W., Kenner, G. H., and von Recum, A. F. (1982). Intramedullary fixation of artificial 

hip joints with bone cement precoated implants. II. Density and histological study. /. Biomed. 

Mater. Res. 16, 459-469. 
Park, J. B., Choi, W. W., Liu, Y. K., and Haugen, T. W. (1986). Bone particle impregnated polymethyl- 
methacrylate: In vitro and in vivo study. In: Tissue Integration in Oral and Facial Reconstruction, 

D. Van Steenberghe, Ed. (Amsterdam: Excerptu Medica), 118-124. 
Park, J. B. and Kenner, G. H. (1975). Effect of electrical stimulation on the tensile strength of the porous 

implant and bone interface. Biomater. Med. Dev. Artif. Org. 3, 233-243. 
Park, J. B. and Lakes, R. S. (1992). Biomaterials: An Introduction: 2nd Edition (N.Y.: Plenum Pub.). 
Park, J. B., Malstrom, C. S., and von Recum, A. F. (1978). Intramedullary fixation of implants pre-coated 

with bone cement: A preliminary study. Biomater. Med. Dev. Artif. Org. 6, 361—373. 
Park, J. B., von Recum, A. E, and Gratzick, G. E. (1979). Pre-coated orthopedic implants with bone 

cement. Med. Dev. Artif. Org. 7, 41-53. 
Park, K. D. and Park, J. B. (1998). Pre-coating UHMWPE powers with PMMA and sintering (Iowa City: 

The University of Iowa). 
Park, S.-H., Llinas, A., and Goel, V. K. (1995). Hard tissue replacement. In: Biomedical Engineering 

Handbook, J. D Bronzino, Ed. (Boca Raton, FL: CRC Press), 672-691. 
Parsons, J. R., Ricci, J. L., Liebrecht, P., Salsbury, R. L., Patras, A. S., and Alexander, H. (1987). Enhanced 

stabilization of orthopedic implants with spherical hydroxylapatite particulate (Dublin, CA: Ortho- 
Matrix, Inc.). 
Paul, J. P. (1976). Loading on normal hip and knee joints and joint replacements. In: Advances in Hip 

and Knee Joint Technology, M. Schaldach and D. Hohmann, Eds. (Berlin: Springer- Verlag), 53-70. 
Predecki, P., Stephan, J. E., Auslander, B. E., Mooney, V. L., and Kirkland, K. (1972). Kinetics of bone 

growth into cylindrical channels in aluminum oxide and titanium. /. Biomed. Mater. Res. 6, 

375-400. 
Raab, S., Ahmed, A. M., and Provan, J. W. (1982). Thin film PMMA precoating for improved implant 

bone-cement fixation. /. Biomed. Mater. Res. 16, 679-704. 
Riegels-Nielsen, P., Sorensen, L., Andersen, H. M., and Lindequist, S. (1995). Boneloc® cemented total 

hip prostheses: Loosening in 28/43 cases after 3-38 months. Acta Orthop. Scand. 66, 215-217. 
Ritter, M. A. (1995). All polyethylene versus metal backing. Clin. Orthop. Rel. Res. 311, 69-75. 
Ritter, M. A., Keating, E. M., Faris, P. M., and Brugo, G. (1990). Metal-backed acetabular cups in total 

hip arthroplasty. /. Bone Joint Surg. 72A, 672-677. 
Sadr, B., and Arden, G. P. (1987). A comparison of the stability of proplast-coated and cemented Thomp- 
son prosthesis in the treatment of subcapital femoral fractures. Injury 8, 234-237. 
Saha, S., and Pal, S. (1984). Mechanical properties of bone cement: A review. /. Biomed. Mater. Res. 18, 

435-462. 
Saikko, V. (1995). Wear of the polyethylene acetabular cup: The effect of head material, head diameter, 

and cup thickness studied with a hip simulator. Acta Orthop. Scand. 66, 501-506. 
Salvati, E. A., Lieberman, J. R., Huk, O. L., and Evans, B. G. (1995). Complications of femoral and 

acetabular modularity. Clin. Orthop. Rel. Res. 319, 85-93. 
Sauer, B. W., Weinstein, A. M., Klawitter, J. J., Hulbert, S. E, Leonard, R. B., and Bagwell, J. G. (1974). 

The role of porous polymeric materials in prosthesis attachment. /. Biomed. Mater. Res. 

Symposium 5, 145-156. 
Schmalzried, T. P., Kwong, L. M., Sedlacek, R. C, Jasty, M., Haire, T. C, O'Connor, D. O., Bragdon, C. R., 

Kabo, J. M., Malcolm, A. J., and Harris, W. H. (1992). The mechanism of loosening of cemented 

acetabular components in total hip arthroplasty: Analyis of specimens retrieved at autopsy. Clin. 

Orthop. Rel. Res. 274, 60-78. 

© 2000 by CRC Press LLC 



Smith, L. (1963). Ceramic-plastic material as a bone substitute. Arch. Surg. 87, 653-661. 

Spector, M. (1982). Bone Ingrowth into porous metals. In: Biocompatibility of Orthopedic Implants, D. R 
Williams, Ed. (Boca Raton, FL: CRC Press), 55-88. 

Spector, M. (1982). Bone Ingrowth into porous polymers. In: Biocompatibility of Orthopedic Implants, 
D. F. Williams, Ed. (Boca Raton, FL: CRC Press), 89-128. 

Stromberg, C. N., Herberts, P., Palmertz, B., and Garellick, G. (1996). Radiographic risk signs for 
loosening after cemented THA: 61 loose stems and 23 loose sockets compared with 42 controls. 
Acta Orthop. Scand. 67, 43-48. 

Sutula, L. C, Collier, J. P., Saum, K. A., Currier, B. H., Currier, J. H., Sanford, W. M., Mayor, M. B., 
Wooding, R. E., Spering, D. K., Williams, I. R., Kasprzak, D. J., and Sorprenant, V. A. (1995). 
Impact of gamma sterilization on clinical performance of polyethylene in the hip. Clin. Orthop. 
Rel. Res. 319, 28-40. 

Taitsman, J. P. and Saha, S. (1977). Tensile strength of wire-reinforced bone cement and twisted stainless- 
steel wire. /. Bone Joint Surg. 59A, 419-425. 

Taylor, D. (1987). Fatigue failure in bone cements for hip joint implants. Fatigue 87, 1353. 

Taylor, D., Clarke, E, McCormack, B., and Sheehan, J. (1989). Reinforcement of bone cement using metal 
meshes. Engng. Med. 203, 49-53. 

Thanner, J., Freij-Larsson, C, Karrholm, J., Malchau, H., and Wesslen, B. (1995). Evaluation of Boneloc®: 
Chemical and mechanical properties, and a randomized clinical study of 30 total hip arthroplasties. 
Acta Orthop. Scand. 66, 207-214. 

Thomas, K. A., Cook, S. D., Renz, E. A., Anderson, R. C, Haddad, R. J. J., Haubold, A. D., and Yapp, R. 
(1985). The effect of surface treatments on the interface mechanics of LTI pyrolytic carbon 
implants. /. Biomed. Mater. Res. 19, 145-160. 

Topoleski, L. D. T, Ducheyne, P., and Cuckler, J. M. (1990). The fracture toughness of short-titanium- 
fiber reinforced bone cement. Trans. Soc. Biomat. 13, 107. 

Tullos, H. S., McCaskill, B. L., Dickey, R., and Davidson, J. (1984). Total hip arthroplasty with a low- 
modulus porous-coated femoral component. /. Bone Joint Surg. 66A, 888-898. 

Verdonschot, N., and Huiskes, E. T (1998). Effects of prosthesis surface roughness on the failure process 
of cemented hip implants after stem-cement debonding. /. Biom. Mater. Res. 42, 554-559. 

Weinstein, A. M., Klawitter, J. J., Cleveland, T. W, and Amoss, D. C. (1976). Electrical stimulation of 
bone growth into porous A1 2 3 . /. Biomed. Mater. Res. 10, 231-247. 

West, J. K., Clark, A. E., Hall, M. B., and Turner, G. F. (1990). In vivo bone-bonding study of Bioglass®- 
coated titanium alloy. In: CRC Handbook of Bio active Ceramics vol I: Bioactive Glasses and Glass- 
Ceramics, T Yamamuro, L. L. Hench and I. Wilson, Eds. (Boca Raton, FL: CRC Press), 161-166. 

Willert, H. G., Koch, R., and Burgi, M. (1991). Reinforcement for a bone cement bed. In Patent # 5,035,714 
(USA). 

Williams, D. F. and Roaf, R. (1973). Implants in Surgery (London: W.B. Saunders). 

Wroblewski, B. M. (1986). 15-21 Year results of the charnley low-friction arthroplasty. Clin. Orthop. Rel. 
Res. 211,30-35. 

Wroblewski, B. M. (1993). Cementless versus cemented total hip arthroplasty: A scientific controversy? 
Orthop. Clin. North Am. 24, 591-597. 

Wykman, A. G. M., Olsson.A. E., Axdorph, G., and Goldie, I. (1991). Total hip arthroplasty. A comparison 
between cemented and press-fit noncemented fixation. /. Arthoplasty 6, 19-29. 

Yamamuro, T, Hench, L. L., and Wilson, I. E. (1990). CRC Handbook of Bioactive Ceramics vol I: Bioactive 
Glasses and Glass-Ceramics and vol. II Calcium Phosphate and Hydroxylapatite Ceramics (Boca 
Raton, FL: CRC Press). 



© 2000 by CRC Press LLC 



Neuman, M.R. "Biomedical Sensors." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 




Biomedical 
Sensors 



Michael R. Neuman 

Joint Program in Biomedical Engineering, 
The University of Memphis and 
University of Tennessee, Memphis 

47 Physical Measurements Michael R. Neuman 

Description of Sensors • Biomedical Applications of Physical Sensors 

48 Biopotential Electrodes Michael R. Neuman 

Sensing Bioelectric Signals • Electrical Characteristics • Practical Electrodes for Biomedical 
Measurements • Biomedical Applications 

49 Electrochemical Sensors Chung-Chiun Liu 

Conductivity/Capacitance Electrochemical Sensors • Potentiometric Sensors • Voltammetric 
Sensors • Reference Electrodes • Summary 

50 Optical Sensors Yitzhak Mendelson 

Instrumentation • Optical Fibers • General Principles of Optical Sensing • Applications 

51 Bioanalytic Sensors Richard R Buck 

Classification of Biochemical Reactions in the Context of Sensor Design and Development • 
Classification of Transduction Processes — Detection Methods • Tables of Sensors from the 
Literature • Application of Microelectronics in Sensor Fabrication 



SENSORS CONVERT SIGNALS OF ONE type of quantity such as hydrostatic fluid pressure into 
an equivalent signal of another type of quantity, for example, an electrical signal. Biomedical sensors 
take signals representing biomedical variables and convert them into what is usually an electrical 
signal. As such, the biomedical sensor serves as the interface between a biologic and an electronic system 
and must function in such a way as to not adversely affect either of these systems. In considering 
biomedical sensors, it is necessary to consider both sides of the interface: the biologic and the electronic, 
since both biologic and electronic factors play an important role in sensor performance. 

Many different types of sensors can be used in biomedical applications. Table V.l gives a general 
classification of these sensors. It is possible to categorize all sensors as being either physical or chemical. 
In the case of physical sensors, quantities such as geometric, mechanical, thermal, and hydraulic 
variables are measured. In biomedical applications these can include things such as muscle displacement, 
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blood pressure, core body temperature, blood flow, TABLE V-1 classifications of Biomedical Sensors 

cerebrospinal fluid pressure, and bone growth. Two 

„ , . , , . . . Physical sensors 

types ot physical sensors deserve special mention ' 

with regard to their biomedical application: Sensors Mechanical 

of electrical phenomena in the body, usually known Thermal 

as electrodes, play a special role as a result of their Hydraulic 

diagnostic and therapeutic applications. The most Electric 

familiar of these are sensors used to pick up the „, . , 

r r Chemical sensors 

electrocardiogram, an electrical signal produced by Q as 

the heart. The other type of physical sensor that finds Electrochemical 

many applications in biology and medicine is the Photometric 

optical sensor. These sensors can use light to collect ° ther P h y sical chemical methods 

. r . i • i r n • Bioanalytic 

information, and, in the case of fiber optic sensors, . 

light is the signal transmission medium as well. 

The second major classification of sensing devices is chemical sensors. In this case the sensors are 
concerned with measuring chemical quantities such as identifying the presence of particular chemical 
compounds, detecting the concentrations of various chemical species, and monitoring chemical activities 
in the body for diagnostic and therapeutic applications. A wide variety of chemical sensors can be classified 
in many ways. One such classification scheme is illustrated in Table V.l and is based upon the methods 
used to detect the chemical components being measured. Chemical composition can be measured in the 
gas phase using several techniques, and these methods are especially useful in biomedical measurements 
associated with the pulmonary system. Electrochemical sensors measure chemical concentrations or, 
more precisely, activities based on chemical reactions that interact with electrical systems. Photometric 
chemical sensors are optical devices that detect chemical concentrations based upon changes in light 
transmission, reflection, or color. The familiar litmus test is an example of an optical change that can be 
used to measure the acidity or alkalinity of a solution. Other types of physical chemical sensors such as 
the mass spectrometer use various physical methods to detect and quantify chemicals associated with 
biologic systems. 

Although they are essentially chemical sensors, bioanalytic sensors are often classified as a separate 
major sensor category. These devices incorporate biologic recognition reactions such as enzyme-substrate, 
antigen-antibody, or ligand-receptor to identify complex biochemical molecules. The use of biologic 
reactions gives bioanalytic sensors high sensitivity and specificity in identifying and quantifying bio- 
chemical substances. 

One can also look at biomedical sensors from the standpoint of their applications. These can be 
generally divided according to whether a sensor is used for diagnostic or therapeutic purposes in clinical 
medicine and for data collection in biomedical research. Sensors for clinical studies such as those carried 
out in the clinical chemistry laboratory must be standardized in such a way that errors that could result 
in an incorrect diagnosis or inappropriate therapy are kept to an absolute minimum. Thus these sensors 
must not only be reliable themselves, but appropriate methods must exist for testing the sensors that are 
a part of the routine use of the sensors for making biomedical measurements. 

One can also look at biomedical sensors from the standpoint of how they are applied to the patient 

or research subject. Table V.2 shows the range of general approaches to attaching biomedical sensors. At 

the top of the list we have the method that involves the least interaction with the biologic object being 

studied; the bottom of the list includes sensors that interact to the greatest extent. Clearly if a measurement 

can be made equally well by a sensor that does not contact the subject being measured or by one that 

must be surgically implanted, the former is by far the 

most desirable. However, a sensor that is used to pro- TABLE V.2 Types of Sensor- Subject Interfaces 

vide information to help control a device surgically XT . .■ , ■ ■ % 

r ° ' Noncontacting (noninvasive) 

placed in the body to replace or assist a failing organ skin sur f ace (contacting) 
should be implanted, since this is the best way to com- Indwelling (minimally invasive) 

municate with the internal device. Implantable (invasive) 



i 2000 by CRC Press LLC 



You will notice in reading this section that the majority of biomedical sensors are essentially the same 
as sensors used in other applications. The unique part about biomedical sensors is their application. 
There are, however, special problems that are encountered by biomedical sensors that are unique to them. 
These problems relate to the interface between the sensor and the biologic system being measured. The 
presence of foreign materials, especially implanted materials, can affect the biologic environment in which 
they are located. Many biologic systems are designed to deal with foreign materials by making a major 
effort to eliminate them. The rejection reaction that is often discussed with regard to implanted materials 
or transplanted tissues is an example of this. Thus, in considering biomedical sensors, one must worry 
about this rejection phenomenon and how it will affect the performance of the sensor. If the rejection 
phenomenon changes the local biology around the sensor, this can result in the sensor measuring 
phenomena associated with the reaction that it has produced as opposed to phenomena characteristic 
of the biologic system being studied. 

Biologic systems can also affect sensor performance. This is especially true for indwelling and implanted 
sensors. Biologic tissue represents a hostile environment which can degrade sensor performance. In 
addition to many corrosive ions, body fluids contain enzymes that break down complex molecules as a 
part of the body's effort to rid itself of foreign and toxic materials. These can attack the materials that 
make up the sensor and its package, causing the sensor to lose calibration or fail. 

Sensor packaging is an especially important problem. The package must not only protect the sensor 
from the corrosive environment of the body, but it must allow that portion of the sensor that performs 
the actual measurement to communicate with the biologic system. Furthermore, because it is frequently 
desirable to have sensors be as small as possible, especially those that are implanted and indwelling, it is 
important that the packaging function be carried out without significantly increasing the size of the 
sensor structure. Although there have been many improvements in sensor packaging, this remains a 
major problem in biomedical sensor research. High-quality packaging materials that do not elicit major 
foreign body responses from the biologic system are still being sought. 

Another problem that is associated with implanted sensors is that once they are implanted, access to 
them is very limited. This requires that these sensors be highly reliable so that there is no need to repair 
or replace them. It is also important that these sensors be highly stable, since in most applications it is 
not possible to calibrate the sensor in vivo. Thus, sensors must maintain their calibration once they are 
implanted, and for applications such as organ replacement, this can represent a potentially long time, 
the remainder of the patient's life. 

In the following sections we will look at some of the sensors described above in more detail. We will 
consider physical sensors with special sections on biopotential electrodes and optical sensors. We will 
also look at chemical sensors, including bioanalytic sensing systems. We would also like to refer you to 
Appendix A "Basics of Blood Gas Instrumentation", where applications of sensors for measurements of 
oxygen and carbon dioxide in blood samples and blood acid-base balance are discussed. Although it is 
not possible to cover the field in extensive detail in a handbook such as this, it is hoped that these sections 
can serve as an introduction to this important aspect of biomedical engineering and instrumentation. 
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of Tennessee, Memphis 47.2 Biomedical Applications of Physical Sensors 

Physical variables associated with biomedical systems are measured by a group of sensors known as physical 
sensors. Although many specific physical variables can be measured in biomedical systems, these can be 
categorized into a simple list as shown in Table 47.1. Sensors for these variables, whether they are measuring 
biomedical systems or other systems, are essentially the same. Thus, sensors of linear displacement can 
frequently be used equally well for measuring the displacement of the heart muscle during the cardiac cycle 
or the movement of a robot arm. There is, however, one notable exception regarding the similarity of these 
sensors: the packaging of the sensor and attachment to the system being measured. Although physical 
sensors used in nonbiomedical applications need to be packaged so as to be protected from their environ- 
ment, few of these sensors have to deal with the harsh environment of biologic tissue, especially with the 
mechanisms inherent in this tissue for trying to eliminate the sensor as a foreign body. Another notable 
exception to this similarity of sensors for measuring physical quantities in biologic and nonbiologic systems 
are the sensors used for fluidic measurements such as pressure and flow. Special needs for these measure- 
ments in biologic systems have resulted in special sensors and instrumentation systems for these measure- 
ments that can be quite different from systems for measuing pressure and flow in nonbiologic environments. 
In this chapter, we will attempt to review various examples of sensors used for physical measurement 
in biologic systems. Although it would be beyond the scope of this chapter to cover all these in detail, 
the principal sensors applied for biologic measurements will be described. Each section will include a 
brief description of the principle of operation of the sensor and the underlying physical principles, 
examples of some of the more common forms of these sensors for application in biologic systems, 
methods of signal processing for these sensors where appropriate, and important considerations for when 
the sensor is applied. 

47.1 Description of Sensors 
Linear and Angular Displacement Sensors 

A comparison of various characteristics of displacement sensors described in detail below is outlined in 
Table 47.2. 

Variable Resistance Sensor 

One of the simplest sensors for measuring displacement is a variable resistor similar to the volume control 
on an audio electronic device [11. The resistance between two terminals on this device is related to the 
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TABLE 47.1 Physical Variables and Sensors 



Physical Quantity 
Geometric 



Kinematic 

Force-Torque 
Fluidic 

Thermal 



Sensor 

Strain gauge 

LVDT 

Ultrasonic transit time 

Velocimeter 

Accelerometer 

Load cell 

Pressure transducer 

Flow meter 

Thermometer 

Thermal flux sensor 



Variable Sensed 

Strain 

Displacement 

Displacement 

Velocity 

Acceleration 

Applied force or torque 

Pressure 

Flow 

Temperature 

Heat flux 



TABLE 47.2 Comparison of Displacement Sensors 





Electrical 










Sensor 


Variable 


Measurement Circuit 


Sensitivity 


Precision 


Range 


Variable resistor 


Resistance 


Voltage divider, ohmmeter, 
bridge, current source 


High 


Moderate 


Large 


Foil strain gauge 


Resistance 


Bridge 


Low 


Moderate 


Small 


Liquid metal strain gauge 


Resistance 


Ohmmeter, bridge 


Moderate 


Moderate 


Large 


Silicon strain gauge 


Resistance 


Bridge 


High 


Moderate 


Small 


Mutual inductance coils 


Inductance 


Impedance bridge, 


Moderate 


Moderate 


Moderate 






inductance meter 


to high 


to low 


to large 


Variable reluctance 


Inductance 


Impedance bridge, 
inductance meter 


High 


Moderate 


Large 


LVDT 


Inductance 


Voltmeter 


High 


High 


High 


Parallel plate capacitor 


Capacitance 


Impedance bridge, 


Moderate 


Moderate 


Moderate 






capacitance meter 


to high 




to large 


Sonic/ultrasonic 


Time 


Timer circuit 


High 


High 


Large 



linear or angular displacement of a sliding tap along a resistance element. Precision devices are available 
that have a reproducible, linear relationship between resistance and displacement. These devices can be 
connected in circuits that measure such resistance as an ohmmeter or bridge, or they can be used as a 
part of a circuit that provides a voltage that is proportional to the displacement. Such circuits include 
the voltage divider (as illustrated in Fig. 47. 1 a) or driving a known constant current through the resistance 
and measuring the resulting voltage across it. This sensor is simple and inexpensive and can be used for 
measuring relatively large displacements. 

Strain Gauge 

Another displacement sensor based on an electrical resistance change is the strain gauge [2]. If a long 
narrow electrical conductor such as a piece of metal foil or a fine gauge wire is stretched within its elastic 
limit, it will increase in length and decrease in cross-sectional area. Because the electric resistance between 
both ends of this foil or wire can be given by 



R = p 



I 



(47.1) 



where p is the electrical resistivity of the foil or wire material, / is its length, and A is its cross-sectional 
area, this stretching will result in an increase in resistance. The change in length can only be very small 
for the foil to remain within its elastic limit, so the change in electric resistance will also be small. The 
relative sensitivity of this device is given by its gauge factor, y, which is defined as 
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FIGURE 47.1 Examples of displacement sensors: (a) variable resistance sensor, (b) foil strain gauge, (c) linear 
variable differential transformer (LVDT), (d) parallel plate capacitive sensor, and (e) ultrasonic transit time displace- 
ment sensor. 



AR/R 
Al/l 



(47.2) 



where A_R is the change in resistance when the structure is stretched by an amount A /. Foil strain gauges 
are the most frequently applied and consist of a structure such as shown in Fig. 47.1k A piece of metal 
foil that is attached to an insulating polymeric film such as polyimide that has a much greater compliance 
than the foil itself is chemically etched into the pattern shown in Fig. 47.1b. When a strain is applied in 
the sensitive direction, the direction of the individual elements of the strain gauge, the length of the 
gauge will be slightly increased, and this will result in an increase in the electrical resistance seen between 
the terminals. Since the displacement or strain that this structure can measure is quite small for it to 
remain within its elastic limit, it can only be used to measure small displacements such as occur as loads 
are applied to structural beams. If one wants to increase the range of a foil strain gauge, one has to attach 
it to some sort of a mechanical impedance converter such as a cantilever beam. If the strain gauge is 
attached to one surface of the beam as shown in Fig. 47.2, a fairly large displacement at the unsupported 
end of the beam can be translated to a relatively small displacement on the beam's surface. It would be 
possible for this structure to be used to measure larger displacements at the cantilever beam tip using a 
strain gauge on the beam. 

Because the electric resistance changes for a strain gauge are quite small, the measurement of this 
resistance change can be challenging. Generally, Wheatstone bridge circuits are used. It is important to 
note, however, that changes in temperature can also result in electric resistance changes that are of the 
same order of magnitude or even larger than the electric resistance changes due to the strain. Thus, it is 
important to temperature-compensate strain gauges in most applications. A simple method of temper- 
ature compensation is to use a double or quadruple strain gauge and a bridge circuit for measuring the 
resistance change. This is illustrated in Fig. 47.2. If one can use the strain gauge in an application such 
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FIGURE 47.2 Strain gauges on a cantilever structure to provide temperature compensation: (a) cross-sectional 
view of the cantilever and (b) placement of the strain gauges in a half bridge or full bridge for temperature 
compensation and enhanced sensitivity. 



as the cantilever beam application described above, one can place one or two of the strain gauge structures 
on the concave side of the beam and the other one or two on the convex side of the beam. Thus, as the 
beam deflects, the strain gauge on the convex side will experience tension, and that on the concave side 
will experience compression. By putting these gauges in adjacent arms of the Wheatstone bridge, their 
effects can double the sensitivity of the circuit in the case of the double strain gauge and quadruple it in 
the case where the entire bridge is made up of strain gauges on a cantilever. 

In some applications it is not possible to place strain gauges so that one gauge is undergoing tension 
while the other is undergoing compression. In this case, the second strain gauge used for temperature 
compensation can be oriented such that its sensitive axis is in a direction where strain is minimal. Thus, 
it is still possible to have the temperature compensation by having two identical strain gauges at the same 
temperature in adjacent arms of the bridge circuit, but the sensitivity improvement described in the 
previous paragraph is not seen. 

Another constraint imposed by temperature is that the material to which the strain gauge is attached 
and the strain gauge both have temperature coefficients of expansion. Thus, even if a gauge is attached 
to a structure under conditions of no strain, if the temperature is changed, the strain gauge could 
experience some strain due to the different expansion that it will have compared to the structure to which 
it is attached. To avoid this problem, strain gauges have been developed that have identical temperature 
coefficients of expansion to various common materials. In selecting a strain gauge, one should choose a 
device with thermal expansion characteristics as close as possible to those of the object upon which the 
strain is to be measured. 
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A more compliant structure that has found applications in biomedical instrumentation is the liquid 
metal strain gauge [3]. Instead of using a solid electric conductor such as the wire or metal foil, mercury 
confined to a compliant, thin wall, narrow bore elastomeric tube is used. The compliance of this strain 
gauge is determined by the elastic properties of the tube. Since only the elastic limit of the tube is of 
concern, this sensor can be used to detect much larger displacements than conventional strain gauges. 
Its sensitivity is roughly the same as a foil or wire strain gauge, but it is not as reliable. The mercury can 
easily become oxidized or small air gaps can occur in the mercury column. These effects make the sensor's 
characteristics noisy and sometimes results in complete failure. 

Another variation on the strain gauge is the semiconductor strain gauge. These devices are frequently 
made out of pieces of silicon with strain gauge patterns formed using semiconductor microelectronic 
technology. The principal advantage of these devices is that their gauge factors can be more than 50 times 
greater than that of the solid and liquid metal devices. They are available commercially, but they are a 
bit more difficult to handle and attach to structures being measured due to their small size and brittleness. 

Inductance Sensors 
Mutual Inductance 

The mutual inductance between two coils is related to many geometric factors, one of which is the 
separation of the coils. Thus, one can create a very simple displacement sensor by having two coils that 
are coaxial but with different separation. By driving one coil with an ac signal and measuring the voltage 
signal induced in the second coil, this voltage will be related to how far apart the coils are from one 
another. When the coils are close together, the mutual inductance will be high, and so a higher voltage 
will be induced in the second coil; when the coils are more widely separated, the mutual inductance will 
be lower as will the induced voltage. The relationship between voltage and separation will be determined 
by the specific geometry of the coils and in general will not be a linear relationship with separation unless 
the change of displacement is relatively small. Nevertheless, this is a simple method of measuring 
separation that works reasonably well provided the coils remain coaxial. If there is movement of the coils 
transverse to their axes, it is difficult to separate the effects of transverse displacement from those of 
displacement along the axis. 

Variable Reluctance 

A variation on this sensor is the variable reluctance sensor wherein a single coil or two coils remain fixed 
on a form which allows a high reluctance slug to move into or out of the coil or coils along their axis. 
Since the position of this core material determines the number of flux linkages through the coil or coils, 
this can affect the self-inductance or mutual inductance of the coils. In the case of the mutual inductance, 
this can be measured using the technique described in the previous paragraph, whereas self-inductance 
changes can be measured using various instrumentation circuits used for measuring inductance. This 
method is also a simple method for measuring displacements, but the characteristics are generally 
nonlinear, and the sensor generally has only moderate precision. 

Linear Variable Differential Transformer 

By far the most frequently applied displacement transducer based upon inductance is the linear variable 
differential transformer (LVDT) [4]. This device is illustrated in Fig. 47.1c and is essentially a three-coil 
variable reluctance transducer. The two secondary coils are situated symmetrically about the primary 
coil and connected such that the induced voltages in each secondary oppose each other. When the core 
is located in the center of the structure equidistant from each secondary coil, the voltage induced in each 
secondary will be the same. Since these voltages oppose one another, the output voltage from the device 
will be zero. As the core is moved closer to one or the other secondary coils, the voltages in each coil will 
no longer be equal, and there will be an output voltage proportional to the displacement of the core 
from the central, zero-voltage position. Because of the symmetry of the structure, this voltage is linearly 
related to the core displacement. When the core passes through the central, zero point, the phase of the 
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output voltage from the sensor changes by 180 degrees. Thus, by measuring the phase angle as well as 
the voltage, one can determine the position of the core. The circuit associated with the LVDT not only 
measures the voltage but often measures the phase angle as well. 

LVDTs are available commercially in many sizes and shapes. Depending on the configuration of the 
coils, they can measure displacements ranging from tens of micrometers through centimeters. 

Capacitive Sensors 

Displacement sensors can be based upon measurements of capacitance as well as inductance. The fun- 
damental principle of operation is the capacitance of a parallel plate capacitor as given by 

C=e — (47.3) 

d 

where e is the dielectric constant of the medium between the plates, d is the separation between the 
plates, and A is the cross-sectional area of the plates. Each of the quantities in Eq. (47.3) can be varied 
to form a displacement transducer as shown in Fig. 47.1c. By moving one of the plates with respect to 
the other, Eq. (47.3) shows us that the capacitance will vary inversely with respect to the plate separation. 
This will give a hyperbolic capacitance-displacement characteristic. However, if the plate separation is 
maintained at a constant value and the plates are displaced laterally with respect to one another so that 
the area of overlap changes, this can produce a capacitance-displacement characteristic that can be linear, 
depending on the shape of the actual plates. 

The third way that a variable capacitance transducer can measure displacement is by having a fixed 
parallel plate capacitor with a slab of dielectric material having a dielectric constant different from that 
of air that can slide between the plates. The effective dielectric constant for the capacitor will depend on 
how much of the slab is between the plates and how much of the region between the plates is occupied 
only by air. This, also, can yield a transducer with linear characteristics. 

The electronic circuitry used with variable capacitance transducers, is essentially the same as any other 
circuitry used to measure capacitance. As with the inductance transducers, this circuit can take the form 
of a bridge circuit or specific circuits that measure capacitive reactance. 

Sonic and Ultrasonic Sensors 

If the velocity of sound in a medium is constant, the time it takes a short burst of that sound energy to 
propagate from a source to a receiver will be proportional to the displacement between the two trans- 
ducers. This is given by 

d=cT (47 .4) 

where c is the velocity of sound in the medium, T is the transit time, and d is the displacement. A simple 
system for making such a measurement is shown in Fig. 47. le [5]. A brief sonic or ultrasonic pulse is 
generated at the transmitting transducer and propagates through the medium. It is detected by the 
receiving transducer at time T after the burst was initiated. The displacement can then be determined 
by applying Eq. (47.4). 

In practice, this method is best used with ultrasound, since the wavelength is shorter, and the device 
will neither produce annoying sounds nor respond to extraneous sounds in the environment. Small 
piezoelectric transducers to generate and receive ultrasonic pulses are readily available. The electronic 
circuit used with this instrument carries out three functions: (1) generation of the sonic or ultrasonic 
burst, (2) detection of the received burst, and (3) measurement of the time of propagation of the 
ultrasound. An advantage of this system is that the two transducers are coupled to one another only 
sonically. There is no physical connection as was the case for the other sensors described in this section. 
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Velocity Measurement 

Velocity is the time derivative of displacement, and so all the displacement transducers mentioned above 
can be used to measure velocity if their signals are processed by passing them through a differentiator 
circuit. There are, however, two additional methods that can be applied to measure velocity directly. 

Magnetic Induction 

If a magnetic field that passes through a conducting coil varies with time, a voltage is induced in that 
coil that is proportional to the time-varying magnetic field. This relationship is given by 

y=N^ (47.5) 

dt 

where v is the voltage induced in the coil, N is the number of turns in the coil, and (j) is the total magnetic 
flux passing through the coil (the product of the flux density and area within the coil). Thus a simple 
way to apply this principle is to attach a small permanent magnet to an object whose velocity is to be 
determined, and attach a coil to a nearby structure that will serve as the reference against which the 
velocity is to be measured. A voltage will be induced in the coil whenever the structure containing the 
permanent magnet moves, and this voltage will be related to the velocity of that movement. The exact 
relationship will be determined by the field distribution for the particular magnet and the orientation 
of the magnet with respect to the coil. 

Doppler Ultrasound 

When the receiver of a signal in the form of a wave such as electromagnetic radiation or sound is moving 
at a nonzero velocity with respect to the emitter of that wave, the frequency of the wave perceived by the 
receiver will be different than the frequency of the transmitter. This frequency difference, known as the 
Doppler shift, is determined by the relative velocity of the receiver with respect to the emitter and is 
given by 

f d = ^ (47.6) 

c 

where f d is the Doppler frequency shift, f is the frequency of the transmitted wave, u is the relative velocity 
between the transmitter and receiver, and c is the velocity of sound in the medium. This principle can 
be applied in biomedical applications as a Doppler velocimeter. A piezoelectric transducer can be used 
as the ultrasound source with a similar transducer as the receiver. When there is no relative movement 
between the two transducers, the frequency of the signal at the receiver will be the same as that at the 
emitter, but when there is relative motion, the frequency at the receiver will be shifted according to 
Eq. (47.6). 

The ultrasonic velocimeter can be applied in the same way that the ultrasonic displacement sensor is 
used. In this case the electronic circuit produces a continuous ultrasonic wave and, instead of detecting 
the transit time of the signal, now detects the frequency difference between the transmitted and received 
signals. This frequency difference can then be converted into a signal proportional to the relative velocity 
between the two transducers. 

Accelerometers 

Acceleration is the time derivative of velocity and the second derivative with respect to time of displace- 
ment. Thus, sensors of displacement and velocity can be used to determine acceleration when their signals 
are appropriately processed through differentiator circuits. In addition, there are direct sensors of accel- 
eration based upon Newton's second law and Hooke's law. The fundamental structure of an accelerometer 
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FIGURE 47.3 Fundamental structure of an accelerometer. 

is shown in Fig. 47.3. A known seismic mass is attached to the housing by an elastic element. As the 
structure is accelerated in the sensitive direction of the elastic element, a force is applied to that element 
according to Newton's second law. This force causes the elastic element to be distorted according to 
Hooke's law, which results in a displacement of the mass with respect to the accelerometer housing. This 
displacement is measured by a displacement sensor. The relationship between the displacement and the 
acceleration is found by combining Newton's second law and Hooke's law 



k 
a = — x 
m 



(47.7) 



where x is the measured displacement, m is the known mass, k is the spring constant of the elastic element, 
and a is the acceleration. Any of the displacement sensors described above can be used in an accelerometer. 
The most frequently used displacement sensors are strain gauges or the LVDT One type of accelerometer 
uses a piezoelectric sensor as both the displacement sensor and the elastic element. A piezoelectric sensor 
generates an electric signal that is related to the dynamic change in shape of the piezoelectric material 
as the force is applied. Thus, piezoelectric materials can only directly measure time varying forces. A 
piezoelectric accelerometer is, therefore, better for measuring changes in acceleration than for measuring 
constant accelerations. A principal advantage of piezoelectric accelerometers is that they can be made 
very small, which is useful in many biomedical applications. 

Force 

Force is measured by converting the force to a displacement and measuring the displacement with a 
displacement sensor. The conversion takes place as a result of the elastic properties of a material. Applying 
a force to the material distorts the material's shape, and this distortion can be determined by a displace- 
ment sensor. For example, the cantilever structure shown in Fig. 47.2a could be a force sensor. Applying 
a vertical force at the tip of the beam will cause the beam to deflect according to its elastic properties. 
This deflection can be detected using a displacement sensor such as a strain gauge as described previously. 
A common form of force sensor is the load cell. This consists of a block of material with known elastic 
properties that has strain gauges attached to it. Applying a force to the load cell stresses the material, 
resulting in a strain that can be measured by the strain gauge. Applying Hooke's law, one finds that the 
strain is proportional to the applied force. The strain gauges on a load cell are usually in a half-bridge 
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FIGURE 47.4 Structure of an unbonded strain gauge pressure sensor. Reproduced with permission from Neuman 
MR. 1993. Biomedical sensors. In RC Dorf (ed), The Electrical Engineering Handbook, Boca Raton, Fla, CRC Press. 

or full-bridge configuration to minimize the temperature sensitivity of the device. Load cells come in 
various sizes and configurations, and they can measure a wide range of forces. 

Measurement of Fluid Dynamic Variables 

The measurement of the fluid pressure and flow in both liquids and gases is important in many biomedical 
applications. These two variables, however, often are the most difficult variables to measure in biologic 
applications because of interactions with the biologic system and stability problems. Some of the most 
frequently applied sensors for these measurements are described in the following paragraphs. 

Pressure Measurement 

Sensors of pressure for biomedical measurements such as blood pressure [7] consist of a structure such 
as shown in Fig. 47.4. In this case a fluid coupled to the fluid to be measured is housed in a chamber 
with a flexible diaphragm making up a portion of the wall, with the other side of the diaphragm at 
atmospheric pressure. When a pressure exists across the diaphragm, it will cause the diaphragm to deflect. 
This deflection is then measured by a displacement sensor. In the example in Fig. 47.4, the displacement 
transducer consists of four fine-gauge wires drawn between a structure attached to the diaphragm and 
the housing of the sensor so that these wires serve as strain gauges. When pressure causes the diaphragm 
to deflect, two of the fine-wire strain gauges will be extended by a small amount, and the other two will 
contract by the same amount. By connecting these wires into a bridge circuit, a voltage proportional to 
the deflection of the diaphragm and hence the pressure can be obtained. 

Semiconductor technology has been applied to the design of pressure transducers such that the entire 
structure can be fabricated from silicon. A portion of a silicon chip can be formed into a diaphragm and 
semiconductor strain gauges incorporated directly into that diaphragm to produce a small, inexpensive, 
and sensitive pressure sensor. Such sensors can be used as disposable, single-use devices for measuring 
blood pressure without the need for additional sterilization before being used on the next patient. This 
minimizes the risk of transmitting blood-borne infections in the cases where the transducer is coupled 
directly to the patient's blood for direct blood pressure measurement. 

In using this type of sensor to measure blood pressure, it is necessary to couple the chamber containing 
the diaphragm to the blood or other fluids being measured. This is usually done using a small, flexible 
plastic tube known as a catheter, that can have one end placed in an artery of the subject while the other 
is connected to the pressure sensor. This catheter is filled with a physiologic saline solution so that the 
arterial blood pressure is coupled to the diaphragm. This external blood-pressure-measurement method 
is used quite frequently in the clinic and research laboratory, but it has the limitation that the properties 
of the fluid in the catheter and the catheter itself can affect the measurement. For example, both ends 
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of the catheter must be at the same vertical level to avoid a pressure offset due to hydrostatic effects. Also, 
the compliance of the tube will affect the frequency response of the pressure measurement. Air bubbles 
in the catheter or obstructions due to clotted blood or other materials can introduce distortion of the 
waveform due to resonance and damping. These problems can be minimized by utilizing a miniature 
semiconductor pressure transducer that is located at the tip of a catheter and can be placed in the blood 
vessel rather than being positioned external to the body. Such internal pressure sensors are available 
commercially and have the advantages of a much broader frequency response, no hydrostatic pressure 
error, and generally clearer signals than the external system. 

Although it is possible to measure blood pressure using the techniques described above, this remains 
one of the major problems in biomedical sensor technology. Long-term stability of pressure transducers 
is not very good. This is especially true for pressure measurements of venous blood, cerebrospinal fluid, 
or fluids in the gastrointestinal tract, where pressures are relatively low. Long-term changes in baseline 
pressure for most pressure sensors require that they be frequently adjusted to be certain of zero pressure. 
Although this can be done relatively easily when the pressure transducer is located external to the body, 
this can be a major problem for indwelling pressure transducers. Thus, these transducers must be 
extremely stable and have low baseline drift to be useful in long-term applications. The packaging of the 
pressure transducer is also a problem that needs to be addressed, especially when the transducer is in 
contact with blood for long periods. Not only must the package be biocompatible, but it also must allow 
the appropriate pressure to be transmitted from the biologic fluid to the diaphragm. Thus, a material 
that is mechanically stable under corrosive and aqueous environments in the body is needed. 

Measurement of Flow 

The measurement of true volummetric flow in the body represents one of the most difficult problems 
in biomedical sensing [8]. The sensors that have been developed measure velocity rather than volume 
flow, and they can only be used to measure flow if the velocity is measured for a tube of known cross- 
section. Thus, most flow sensors constrain the vessel to have a specific cross-sectional area. 

The most frequently used flow sensor in biomedical systems is the electromagnetic flow meter illus- 
trated in Fig. 47.5. This device consists of a means of generating a magnetic field transverse to the flow 
vector in a vessel. A pair of very small biopotential electrodes are attached to the wall of the vessel such 
that the vessel diameter between them is at right angles to the direction of the magnetic field. As the 
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FIGURE 47.5 Fundamental structure of an electromagnetic flowmeter. Reproduced with permission from Neuman 
MR. 1986. Biosensors: Transducers, electrodes, and physiologic systems. IN ID Bronzino (ed), Biomedical Engineering 
and Instrumentation: Basic Concepts and Applications, Boston, PWS Publishers. 
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FIGURE 47.6 Structure of an ultrasonic Doppler flowmeter with the major blocks of the electronic signal processing 
system. The oscillator generates a signal that, after amplification, drives the transmitting transducer. The oscillator 
frequency is usually in the range of 1-10 MHz. The reflected ultrasound from the blood is sensed by the receiving 
transducer and amplified before being processed by a detector circuit. This block generates the frequency difference 
between the transmitted and received ultrasonic signals. This difference frequency can be converted into a voltage 
proportional to frequency, and hence flow velocity, by the frequency to voltage converter circuit. 

blood flows in the structure, ions in the blood deflect in the direction of one or the other electrodes due 
to the magnetic field, and the voltage across the electrodes is given by 



v=Blu 



(47.8) 



where B is the magnetic field, / is the distance between the electrodes, and u is the average instantaneous 
velocity of the fluid across the vessel. If the sensor constrains the blood vessel to have a specific diameter, 
then its cross-sectional area will be known, and multiplying this area by the velocity will give the volume 
flow. 

Although dc flow sensors have been developed and are available commercially, the most desirable 
method is to use ac excitation of the magnetic field so that offset potential effects from the biopotential 
electrodes do not generate errors in this measurement. 

Small ultrasonic transducers can also be attached to a blood vessel to measure flow as illustrated in 
Fig. 47.6. In this case the transducers are oriented such that one transmits a continuous ultrasound signal 
that illuminates the blood. Cells within the blood diffusely reflect this signal in the direction of the second 
sensor so that the received signal undergoes a Doppler shift in frequency that is proportional to the 
velocity of the blood. By measuring the frequency shift and knowing the cross-sectional area of the vessel, 
it is possible to determine the flow. 

Another method of measuring flow that has had biomedical application is the measurement of cooling 
of a heated object by convection. The object is usually a thermistor (see section on temperature mea- 
surement, below) placed either in a blood vessel or in tissue, and the thermistor serves as both the heating 
element and the temperature sensor. In one mode of operation, the amount of power required to maintain 
the thermistor at a temperature slightly above that of the blood upstream is measured. As the flow around 
the thermistor increases, more heat is removed from the thermistor by convection, and so more power 
is required to keep it at a constant temperature. Relative flow is then measured by determining the amount 
of power supplied to the thermistor. 

In a second approach the thermistor is heated by applying a current pulse and then measuring the 
cooling curve of the thermistor as the blood flows across it. The thermistor will cool more quickly as the 
blood flow increases. Both these methods are relatively simple to achieve electronically, but both also 
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TABLE 47.3 Properties of Temperature Sensors 



Coil of fine platinum wire 


Low 


High 


-100-700°C 


Bead, disk, or rod 


High 


Moderate 


-50-100°C 


Pair of wires 


Low 


High 


-100->1000°C 


Column of Hg in glass capillary 


Moderate 


High 


-50-400°C 


Electronic component 


Moderate 


High 


-50-150°C 



Sensor Form Sensitivity Stability Range 

Metal resistance thermometer 

Thermistor 

Thermocouple 

Mercury in glass thermometer 

Silicon p-n diode 

have severe limitations. They are essentially qualitative measures and strongly depend on how the ther- 
mistor probe is positioned in the vessel being measured. If the probe is closer to the periphery or even 
in contact with the vessel wall, the measured flow will be different than if the sensor is in the center of 
the vessel. 

Temperature 

There are many different sensors of temperature [9], but three find particularly wide application to 
biomedical problems. Table 47.3 summarizes the properties of various temperature sensors, and these 
three, including metallic resistance thermometers, thermistors, and thermocouples, are described in the 
following paragraphs. 

Metallic Resistance Thermometers 

The electric resistance of a piece of metal or wire generally increases as the temperature of that electric 
conductor increases. A linear approximation to this relationship is given by 



R=R 



1 + cc [T 



(r-r )j (47.9) 



where R is the resistance at temperature T , a is the temperature coefficient of resistance, and T is the 
temperature at which the resistance is being measured. Most metals have temperature coefficients of 
resistance of the order of 0.1-0.4%/°C, as indicated in Table 47.4. The noble metals are preferred for 
resistance thermometers, since they do not corrode easily and, when drawn into fine wires, their cross- 
section will remain constant, thus avoiding drift in the resistance over time which could result in an 
unstable sensor. It is also seen from Table 47.4 that the noble metals of gold and platinum have some of 
the highest temperature coefficients of resistance of the common metals. 

Metal resistance thermometers are often fabricated from fine-gauge insulated wire that is wound into 
a small coil. It is important in doing so to make certain that there are not other sources of resistance 
change that could affect the sensor. For example, the structure should be utilized in such a way that no 
external strains are applied to the wire, since the wire could also behave as a strain gauge. Metallic films 
and foils can also be used as temperature sensors, and commercial products are available in the wire, 

TABLE 47.4 Temperature Coefficient of Resistance for Common Metals and Alloys 







Resistivity at 20°C 


Temperature Coefficient of 


Metal or Alloy 




microhm-cm 


Resistance, %/°C 


Platinum 




9.83 


0.3 


Gold 




2.22 


0.368 


Silver 




1.629 


0.38 


Copper 




1.724 


0.393 


Constantan (60% Cu, 40% Ni) 


49.0 


0.0002 


Nichrome (80% Ni, 


20% Cr) 


108.0 


0.013 



Source: Pender H, Mcllwain K. 1957. Electrical Engineers' Handbook, 4th ed, New York, 

Wiley. 
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foil, or film forms. The electric circuits used to measure resistance, and hence the temperature, are similar 
to those used with the wire or foil strain gauges. A bridge circuit is the most desirable, although ohmmeter 
circuits can also be used. It is important to make sure that the electronic circuit does not pass a large 
current through the resistance thermometer to provide self-heating due to the Joule conversion of electric 
energy to heat. 

Thermistors 

Unlike metals, semiconductor materials have an inverse relationship between resistance and temperature. 
This characteristic is very nonlinear and cannot be characterized by a linear equation such as for the 
metals. The thermistor is a semiconductor temperature sensor. Its resistance as a function of temperature 
is given by 



R=R e 



'[? »b J 



(47.10) 



where (3 is a constant determined by the materials that make up the thermistor. Thermistors can take a 
variety of forms and cover a large range of resistances. The most common forms used in biomedical 
applications are the bead, disk, or rod forms of the sensor as illustrated in Fig. 47.7. These structures can 
be formed from a variety of different semiconductors ranging from elements such as silicon and germa- 
nium to mixtures of various semiconducting metallic oxides. Most commercially available thermistors 
are manufactured from the latter materials, and the specific materials as well as the process for fabricating 
them are closely held industrial secrets. These materials are chosen not only to have high sensitivity but 
also to have the greatest stability, since thermistors are generally not as stable as the metallic resistance 
thermometers. However, thermistors are close to an order of magnitude more sensitive. 

Thermocouples 

When different regions of an electric conductor or semi- 
conductor are at different temperatures, there is an electric 
potential between these regions that is directly related to 
the temperature differences. This phenomenon, known as 
the Seebeck effect, can be used to produce a temperature 
sensor known as a thermocouple by taking a wire of metal 
or alloy A and another wire of metal or alloy B and con- 
necting them as shown in Fig. 47.8. One of the junctions 
is known as the sensing junction, and the other is the 
reference junction. When these junctions are at different 
temperatures, a voltage proportional to the temperature 
difference will be seen at the voltmeter when metals A and 
B have different Seebeck coefficients. This voltage is 
roughly proportional to the temperature difference and 
can be represented over the relatively small temperature 
differences encountered in biomedical applications by the 
linear equation 




DISK 



BEAD 



V=S AB (T S 



(47.11) 



where S AB is the Seebeck coefficient for the thermocouple 
made up of metals A and B. Although this equation is a 
reasonable approximation, more accurate data are usually 
found in tables of actual voltages as a function of temper- 
ature difference. In some applications the voltmeter is 




FIGURE 47.7 Common forms of thermistors. 
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FIGURE 47.8 Circuit arrangement for a thermocouple showing the voltage-measuring device, the voltmeter, inter- 
rupting one of the thermocouple wires (a) and at the cold junction (b). 

located at the reference junction, and one uses some independent means such as a mercury in glass 
thermometer to measure the reference junction temperature. Where precision measurements are made, 
the reference junction is often placed in an environment of known temperature such as an ice bath. 
Electronic measurement of reference junction temperature can also be carried out and used to compensate 
for the reference junction temperature so that the voltmeter reads a signal equivalent to what would be 
seen if the reference junction were at 0°C. This electronic reference junction compensation is usually 
carried out using a metal resistance temperature sensor to determine reference junction temperature. 

The voltages generated by thermocouples used for temperature measurement are generally quite small 
being on the order of tens of microvolts per degree C. Thus, for most biomedical measurements where 
there is only a small difference in temperature between the sensing and reference junction, very sensitive 
amplifiers must be used to measure these potentials. Thermocouples have been used in industry for 
temperature measurement for many years. Several standard alloys to provide optimal sensitivity and 
stability of these sensors have evolved. Table 47.5 lists these common alloys, the Seebeck coefficient for 
thermocouples of these materials at room temperature, and the full range of temperatures over which 
these thermocouples can be used. 

TABLE 47.5 Common Thermocouples 

Type Materials Seebeck Coefficient, uV/°C Temperature Range 

S Platinum/platinum 10% rhodium 6 0-1700°C 

T Copper/constantan 50 -190-400°C 

K Chromel/alumel 41 -200-1370°C 

J Iron/constantan 53 -200-760°C 

E Chromel/constantan 78 -200-970°C 
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Thermocouples can be fabricated in many different ways depending on their applications. They are 
especially suitable for measuring temperature differences between two structures, since the sensing 
junction can be placed on one while the other has the reference junction. Higher-output thermocouples 
or thermopiles can be produced by connecting several thermocouples in series. Thermocouples can be 
made from very fine wires that can be implanted in biologic tissues for temperature measurements, and 
it is also possible to place these fine-wire thermocouples within the lumen of a hypodermic needle to 
make short-term temperature measurements in tissue. 

47.2 Biomedical Applications of Physical Sensors 

Just as it is not possible to cover the full range of physical sensors in this chapter, it is also impossible to 
consider the many biomedical applications that have been reported for these sensors. Instead, some 
representative examples will be given. These are summarized in Table 47.6 and will be briefly described 
in the following paragraphs. 

Liquid metal strain gauges are especially useful in biomedical applications, because they are mechan- 
ically compliant and provide a better mechanical impedance match to most biomedical tissues than other 
types of strain gauges. By wrapping one of these strain gauges around a circumference of the abdomen, 
it will stretch and contract with the abdominal breathing movements. The signal from the strain gauge 
can then be used to monitor breathing in patients or experimental animals. The advantage of this sensor 
is its compliance so that it does not interfere with the breathing movements or substantially increase the 
required breathing effort. 

One of the original applications of the liquid metal strain gauge was in limb plethysmography [3], 
One or more of these sensors are wrapped around an arm or leg at various points and can be used to 
measure changes in circumference that are related to the cross-sectional area and hence the volume of 
the limb at those points. If the venous drainage from the limb is occluded, the limb volume will increase 
as it fills with blood. Releasing the occlusion allows the volume to return to normal. The rate of this 
decrease in volume can be monitored using the liquid metal strain gauges, and this can be used to identify 
venous blockage when the return to baseline volume is too slow. 

Breathing movements, although not volume, can be seen using a simple magnetic velocity detector. 
By placing a small permanent magnet on the anterior side of the chest or abdomen and a flat, large-area 
coil on the posterior side opposite from the magnet, voltages are induced in the coil as the chest of 
abdomen moves during breathing. The voltage itself can be used to detect the presence of breathing 
movements, or it can be electronically integrated to give a signal related to displacement. 

The LVDT is a displacement sensor that can be used for more precise applications. For example, it 
can be used in studies of muscle physiology where one wants to measure the displacement of a muscle 
or where one is measuring the isometric force generated by the muscle (using a load cell) and must 



TABLE 47.6 Examples of Biomedical Applications of Physical Sensors 



Sensor 

Liquid metal strain gauge 

Magnetic displacement sensor 
LVDT 

Load cell 

Accelerometer 

Miniature silicon pressure sensor 



Electromagnetic flow sensor 



Application 

Breathing movement 
Limb plethysmography 
Breathing movement 
Muscle contraction 
Uterine contraction sensor 
Electronic scale 
Subject activity 
Intra-arterial blood pressure 
Urinary bladder pressure 
Intrauterine pressure 
Cardiac output (with integrator) 
Organ blood flow 



Signal Range 


Reference 


0-0.05 




0-0.02 


3 


0-10 mm 


10 


0-20 mm 




0-5 mm 


11 


0-440 lbs (0-200 kg) 


12 


0-20 m/s 2 


13 


0-50 Pa (0-350 mm Hg) 




0-10 Pa (0-70 mm Hg) 




0-15 Pa (0-100 mm Hg) 


14 


0-500 ml/min 




0-100 ml/min 


15 
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ensure that there is no muscle movement. It can also be incorporated into other physical sensor such as 
a pressure sensor or a tocodynamometer, a sensor used to electronically "feel" uterine contractions of 
patients in labor or those at risk of premature labor and delivery. 

In addition to studying muscle forces, load cells can be used in various types of electronic scales for 
weighing patients or study animals. The simplest electronic scale consists of a platform placed on top of 
a load cell. The weight of any object placed on the platform will produce a force that can be sensed by 
the load cell. In some critical care situations in the hospital, it is important to carefully monitor the 
weight of a patient. For example, this is important in watching water balance in patients receiving fluid 
therapy. The electronic scale concept can be extended by placing a load cell under each leg of the patient's 
bed and summing the forces seen by each load cell to get the total weight of the patient and the bed. 
Since the bed weight remains fixed, weight changes seen will reflect changes in patient weight. 

Accelerometers can be used to measure patient or research subject activity. By attaching a small 
accelerometer to the individual being studied, any movements can be detected. This can be useful in 
sleep studies where movement can help to determine the sleep state. Miniature accelerometers and 
recording devices can also be worn by patients to study activity patterns and determine effects of disease 
or treatments on patient activity [13]. 

Miniature silicon pressure sensors are used for the indwelling measurement of fluid pressure in most 
body cavities. The measurement of intra-arterial blood pressure is the most frequent application, but 
pressures in other cavities such as the urinary bladder and the uterus are also measured. The small size 
of these sensors and the resulting ease of introduction of the sensor into the cavity make these sensors 
important for these applications. 

The electromagnetic flow sensor has been a standard method in use in the physiology laboratory for 
many years. Its primary application has been for measurement of cardiac output and blood flow to 
specific organs in research animals. New miniature inverted electromagnetic flow sensors make it possible 
to temporarily introduce a flow probe into an artery through its lumen to make clinical measurements. 

The measurement of body temperature using instruments employing thermistors as the sensor has 
greatly increased in recent years. Rapid response times of these low-mass sensors make it possible to 
quickly assess patients' body temperatures so that more patients can be evaluated in a given period. This 
can then help to reduce health care costs. The rapid response time of low-mass thermistors makes them 
a simple sensor to be used for sensing breathing. By placing small thermistors near the nose and mouth, 
the elevated temperature of exhaled air can be sensed to document a breath. 

The potential applications of physical sensors in medicine and biology are almost limitless. To be able 
to use these devices, however, scientists must first be familiar with the underlying sensing principles. It 
is then possible to apply these in a form that addresses the problems at hand. 
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Further Information 

Good overviews of physical sensors are found in these books: Doebelin EO. 1990. Measurement Systems: 
Application and Design, 4th ed, New York, McGraw-Hill; Harvey, GF (ed). 1969. Transducer Compen- 
dium, 2d ed, New York, Plenum. One can also find good descriptions of physical sensors in chapters of 
two works edited by John Webster. Chapters 2, 7, and 8 of his textbook (1992) Medical Instrumentation: 
Application and Design, 2d ed, Boston, Houghton Mifflin, and several articles in his Encyclopedia on 
Medical Devices and Instrumentation, published by Wiley in 1988, cover topics on physical sensors. 

Although a bit old, the text Transducers for Biomedical Measurements (New York, J. Wiley, 1974) by 
Richard S.C. Cobbold, remains one of the best descriptions of biomedical sensors available. By supple- 
menting the material in this book with recent manufacturers' literature, the reader can obtain a wealth 
of information on physical (and for that matter chemical) sensors for biomedical application. 

The journals IEEE Transactions on Biomedical Engineering and Medical and Biological Engineering and 
Computing are good sources of recent research on biomedical applications of physical sensors. The 
journals Physiological Measurement and Sensors and Actuators are also good sources for this material. 
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Biologic systems frequently have electric activity associated with them. This activity can be a constant 
dc electric field, a constant flux of charge-carrying particles or current, or a time-varying electric field 
or current associated with some time-dependent biologic or biochemical phenomenon. Bioelectric phe- 
nomena are associated with the distribution of ions or charged molecules in a biologic structure and the 
changes in this distribution resulting from specific processes. These changes can occur as a result of 
biochemical reactions, or they can emanate from phenomena that alter local anatomy. 

One can find bioelectric phenomena associated with just about every organ system in the body. 
Nevertheless, a large proportion of these signals are associated with phenomena that are at the present 
time not especially useful in clinical medicine and represent time-invariant, low-level signals that are not 
easily measured in practice. There are, however, several signals that are of diagnostic significance or that 
provide a means of electronic assessment to aid in understanding biologic systems. These signals, their 
usual abbreviations, and the systems they measure are listed in Table 48.1. Of these, the most familiar is 
the electrocardiogram, a signal derived from the electric activity of the heart. This signal is widely used 
in diagnosing disturbances in cardiac rhythm, signal conduction through the heart, and damage due to 
cardiac ischemia and infarction. The electromyogram is used for diagnosing neuromuscular diseases, 
and the electroencephalogram is important in identifying brain dysfunction and evaluating sleep. The 
other signals listed in Table 48.1 are currently of lesser diagnostic significance but are, nevertheless, used 
for studies of the associated organ systems. 

Although Table 48.1 and the above discussion are concerned with bioelectric phenomena in animals 
and these techniques are used primarily in studying mammals, bioelectric signals also arise from plants 
[1]. These signals are generally steady-state or slowly changing, as opposed to the time-varying signals 
listed in Table 48.1. An extensive literature exists on the origins of bioelectric signals, and the interested 
reviewer is referred to the text by Plonsey and Barr for a general overview of this area [2]. 

48.1 Sensing Bioelectric Signals 

The mechanism of electric conductivity in the body involves ions as charge carriers. Thus, picking up 
bioelectric signals involves interacting with these ionic charge carriers and transducing ionic currents 
into electric currents required by wires and electronic instrumentation. This transducing function is 
carried out by electrodes that consist of electrical conductors in contact with the aqueous ionic solutions 
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TABLE 48.1 Bioelectric Signals Sensed by Biopotential Electrodes 
and Their Sources 



Bioelectric Signal 


Abbreviation 


Biologic Source 


Electrocardiogram 


ECG 


Heart — as seen from body surface 


Cardiac electrogram 


— 


Heart — as seen from within 


Electromyogram 


EMG 


Muscle 


Electroencephalogram 


EEG 


Brain 


Electrooptigram 


EOG 


Eye dipole field 


Electroretinogram 


ERG 


Eye retina 


Action potential 


— 


Nerve or muscle 


Electrogastrogram 


EGG 


Stomach 


Galvanic skin reflex 


GSR 


Skin 



of the body. The interaction between electrons in the electrodes and ions in the body can greatly affect 
the performance of these sensors and requires that specific considerations be made in their application. 
At the interface between an electrode and an ionic solution redox (oxidation-reduction), reactions 
need to occur for a charge to be transferred between the electrode and the solution. These reactions can 
be represented in general by the following equations: 

C^C" + + ne" (48.1) 

A'"- ^A+me- (48.2) 

where n is the valence of cation material C, and m is the valence of anion material, A. For most electrode 
systems, the cations in solution and the metal of the electrodes are the same, so the atoms C are oxidized 
when they give up electrons and go into solution as positively charged ions. These ions are reduced when 
the process occurs in the reverse direction. In the case of the anion reaction, Eq. (48.2), the directions 
for oxidation and reduction are reversed. For best operation of the electrodes, these two reactions should 
be reversible, that is, it should be just as easy for them to occur in one direction as the other. 

The interaction between a metal in contact with a solution of its ions produces a local change in the 
concentration of the ions in solution near the metal surface. This causes charge neutrality not to be 
maintained in this region, causing the electrolyte surrounding the metal to be at a different electrical 
potential from the rest of the solution. Thus, a potential difference known as the half-cell potential is 
established between the metal and the bulk of the electrolyte. It is found that different characteristic 
potentials occur for different materials, and some of these potentials are summarized in Table 48.2. These 
half-cell potentials can be important when using electrodes for low frequency or dc measurements. 

The relationship between electric potential and ionic concentrations or, more precisely, ionic activities 
is frequently considered in electrochemistry. Most commonly two ionic solutions of different activity are 
separated by an ion-selective semipermeable membrane that allows one type of ion to pass freely through 

TABLE 48.2 Half-cell Potentials for Materials 

and Reactions Encountered in Biopotential Measurement 



Metal and Reaction 


Half-cell Potential, V 


Al -» AF+ + 3e- 


-1.706 


Ni -» Ni 2+ + 2e- 


-0.230 


H, -» 2H+ + 2e- 


0.000 (by definition) 


Ag + Cr^AgCl + e- 


+0.223 


Ag -> Ag + + e- 


+0.799 


Au "■> Au + + e~ 


+1.680 
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the membrane. It can be shown that an electric potential E will exist between the solutions on either side 
of the membrane, based upon the relative activity of the permeable ions in each of these solutions. This 
relationship is known as the Nernst equation 



fir, 

In 

nF 



a, 



V fl 2/ 



(48.3) 



where a x and a 2 are the activities of the ions on either side of the membrane, -R is the universal gas 
constant, T is the absolute temperature, n is the valence of the ions, and F is the Faraday constant. More 
detail on this relationship can be found in Chapter 49. 

When no electric current flows between an electrode and the solution of its ions or across an ion- 
permeable membrane, the potential observed should be the half-cell potential or the Nernst potential, 
respectively. If, however, there is a current, these potentials can be altered. The difference between the 
potential at zero current and the measured potentials while current is passing is known as the over voltage 
and is the result of an alteration in the charge distribution in the solution in contact with the electrodes 
or the ion-selective membrane. This effect is known as polarization and can result in diminished electrode 
performance, especially under conditions of motion. There are three basic components to the polarization 
over potential: the ohmic, the concentration, and the activation over potentials. Of these, the activation 
over potential is of greatest concern in bioelectric measurements. More details on these over potentials 
can be found in electrochemistry or biomedical instrumentation texts [4]. 

Perfectly polarizable electrodes pass a current between the electrode and the electrolytic solution by 
changing the charge distribution within the solution near the electrode. Thus, no actual current crosses 
the electrode-electrolyte interface. Nonpolarized electrodes, however, allow the current to pass freely 
across the electrode-electrolyte interface without changing the charge distribution in the electrolytic 
solution adjacent to the electrode. Although these types of electrodes can be described theoretically, 
neither can be fabricated in practice. It is possible, however, to come up with electrode structures that 
closely approximate their characteristics. 

Electrodes made from noble metals such as platinum are often highly polarizable. A charge distribution 
different from that of the bulk electrolytic solution is found in the solution close to the electrode surface. 
Such a distribution can create serious limitations when movement is present and the measurement 
involves low frequency or even dc signals. If the electrode moves with respect to the electrolytic solution, 
the charge distribution in the solution adjacent to the electrode surface will change, and this will induce 
a voltage change in the electrode that will appear as motion artifact in the measurement. Thus, for most 
biomedical measurements, nonpolarizable electrodes are preferred to those that are polarizable. 

The silver-silver chloride electrode is one that has characteristics similar to a perfectly nonpolarizable 
electrode and is practical for use in many biomedical applications. The electrode (Fig. 48.1«) consists of 
a silver base structure that is coated with a layer of the ionic compound silver chloride. Some of the silver 
chloride when exposed to light is reduced to metallic silver, so a typical silver-silver chloride electrode 
has finely divided metallic silver within a matrix of silver chloride on its surface. Since the silver chloride 
is relatively insoluble in aqueous solutions, this surface remains stable. Because there is minimal polar- 
ization associated with this electrode, motion artifact is reduced compared to polarizable electrodes such 
as the platinum electrode. Furthermore, due to the reduction in polarization, there is also a smaller effect 
of frequency on electrode impedance, especially at low frequencies. 

Silver-silver chloride electrodes of this type can be fabricated by starting with a silver base and 
electrolytically growing the silver chloride layer on its surface [4]. Although an electrode produced in 
this way can be used for most biomedical measurements, it is not a robust structure, and pieces of the 
silver chloride film can be chipped away after repeated use of the structure. A structure with greater 
mechanical stability is the sintered silver-silver chloride electrode in Fig. 48.1 b. This electrode consists 
of a silver lead wire surrounded by a sintered cylinder made up of finely divided silver and silver-chloride 
powder pressed together. 
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FIGURE 48.1 Silver-silver electrodes for biopotential measurements: (a) metallic silver with a silver chloride surface 
layer and ( h) sintered electrode structure. The lower views show the electrodes in cross-section. 

In addition to its nonpolarizable behavior, the silver-silver chloride electrode exhibits less electrical 
noise than the equivalent polarizable electrodes. This is especially true at low frequencies, and so sil- 
ver-silver chloride electrodes are recommended for measurements involving very low voltages for signals 
that are made up primarily of low frequencies. A more detailed description of silver-silver chloride 
electrodes and methods to fabricate these devices can be found in Janz and Ives [5] and biomedical 
instrumentation textbooks [4]. 

48.2 Electric Characteristics 

The electric characteristics of biopotential electrodes are generally nonlinear and a function of the current 
density at their surface. Thus, having the devices represented by linear models requires that they be operated 
at low potentials and currents. Under these idealized conditions, electrodes can be represented by an 
equivalent circuit of the form shown in Fig. 48.2. In this circuit R d and C d are components that represent 
the impedance associated with the electrode-electrolyte interface and polarization at this interface. R s is the 
series resistance associated with interfacial effects and the resistance of the electrode materials themselves. 



■he 



V\AA 



FIGURE 48.2 The equivalent circuit for a biopotential electrode. 
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FIGURE 48.3 An example of biopotential electrode impedance as a function of frequency. Characteristic frequencies 
will be somewhat different for electrode different geometries and materials. 

TABLE 48.3 The Effect of Electrode Properties on Electrode Impedance 



Property 

Surface area 
Polarization 
Surface roughness 
Radius of curvature 
Surface contamination 



Change in Property 

t 

T 
t 
t 
t 



Changes in Electrode Impedance 

I 

T At low frequencies 

I 
I 

t 



The battery E hc represents the half-cell potential described above. It is seen that the impedance of this 
electrode will be frequency dependent, as illustrated in Fig. 48.3. At low frequencies the impedance is 
dominated by the series combination of R s and R d , whereas at higher frequencies C d bypasses the effect of 
R d so that the impedance is now close to _R S . Thus, by measuring the impedance of an electrode at high and 
low frequencies, it is possible to determine the component values for the equivalent circuit for that electrode. 
The electrical characteristics of electrodes are affected by many physical properties of these electrodes. 
Table 48.3 lists some of the more common physical properties of electrodes and qualitatively indicates 
how these can affect electrode impedance. 

48.3 Practical Electrodes for Biomedical Measurements 

Many different forms of electrodes have been developed for different types of biomedical measurements. 
To describe each of these would go beyond the constraints of this article, but some of the more commonly 
used electrodes are presented in this section. The reader is referred to the monograph by Geddes for 
more details and a wider selection of practical electrodes [6]. 

Body-Surface Biopotential Electrodes 

This category includes electrodes that can be placed on the body surface for recording bioelectric signals. 
The integrity of the skin is not compromised when these electrodes are applied, and they can be used 
for short-term diagnostic recording such as taking a clinical electrocardiogram or long-term chronic 
recording such as occurs in cardiac monitoring. 
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Metal Plate Electrodes 

The basic metal plate electrode consists of a metallic conductor in contact with the skin with a thin layer 
of an electrolyte gel between the metal and the skin to establish this contact. Examples of metal plate 
electrodes are seen in Fig. 48.4a. Metals commonly used for this type of electrode include German silver 
(a nickel-silver alloy), silver, gold, and platinum. Sometimes these electrodes are made of a foil of the 
metal so as to be flexible, and sometimes they are produced in the form of a suction electrode (Fig. 48.4fe) 
to make it easier to attach the electrode to the skin to make a measurement and then move it to another 
point to repeat the measurement. These types of electrodes are used primarily for diagnostic recordings 
of biopotentials such as the electrocardiogram or the electroencephalogram. Metal disk electrodes with 
a gold surface in a conical shape such as shown in Fig. 48.4c are frequently used for EEG recordings. The 
apex of the cone is open so that electrolyte gel or paste can be introduced to both make good contact 
between the electrode and the head and to allow this contact medium to be replaced should it dry out 
during its use. 
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FIGURE 48.4 Examples of different skin electrodes: (a) metal plate electrodes, (b) suction electrode for ECG, 
(c) metal cup EEG electrode, (d) recessed electrode, (e) disposable electrode with electrolyte-impregnated sponge 
(shown in cross-section), (f) disposable hydrogel electrode (shown in cross-section), (g) thin-film electrode for use 
with neonates (shown in cross-section), (h.) carbon-filled elastomer dry electrode. 
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FIGURE 48.4 (continued) 
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Electrodes for Chronic Patient Monitoring 

Long-term monitoring of biopotentials such as the electrocardiogram as performed by cardiac monitors 
places special constraints on the electrodes used to pick up the signals. These electrodes must have a 
stable interface between them and the body, and frequently nonpolarizable electrodes are, therefore, the 
best for this application. Mechanical stability of the interface between the electrode and the skin can help 
to reduce motion artifact, and so there are various approaches to reduce interfacial motion between the 
electrode and the coupling electrolyte or the skin. Figure 48. 4d is an example of one approach to reduce 
motion artifact by recessing the electrode in a cup of electrolytic fluid or gel. The cup is then securely 
fastened to the skin surface using a double-sided adhesive ring. Movement of the skin with respect to 
the electrode may affect the electrolyte near the skin-electrolyte interface, but the electrode-electrolyte 
interface can be several millimeters away from this location, since it is recessed in the cup. The fluid 
movement is unlikely to affect the recessed electrode-electrolyte interface as compared to what would 
happen if the electrode was separated from the skin by just a thin layer of electrolyte. 

The advantages of the recessed electrode can be realized in a simpler design that lends itself to mass 
production through automation. This results in low per-unit cost so that these electrodes can be con- 
sidered disposable. Figure 48. 4e illustrates such an electrode in cross section. The electrolyte layer now 
consists of an open-celled sponge saturated with a thickened (high-viscosity) electrolytic solution. The 
sponge serves the same function as the recess in the cup electrodes and is coupled directly to a silver-silver 
chloride electrode. Frequently, the electrode itself is attached to a clothing snap through an insulating- 
adhesive disk that holds the structure against the skin. This snap serves as the point of connection to a 
lead wire. Many commercial versions of these electrodes in various sizes are available, including electrodes 
with a silver-silver chloride interface or ones that use metallic silver as the electrode material. 

A recently developed modification of this basic monitoring electrode structure is shown in Fig. 48.4/ 
In this case the metal electrode is a silver foil with a surface coating of silver chloride. The foil gives the 
electrode increased flexibility to fit more closely over body contours. Instead of using the sponge, a 
hydrogel film (really a sponge on a microscopic level) saturated with an electrolytic solution and formed 
from materials that are very sticky is placed over the electrode surface. The opposite surface of the hydrogel 
layer can be attached directly to the skin, and since it is very sticky, no additional adhesive is needed. 
The mobility and concentration of ions in the hydrogel layer is generally lower than for the electrolytic 
solution used in the sponge or the cup. This results in an electrode that has a higher source impedance 
as compared to these other structures. An important advantage of this structure is its ability to have the 
electrolyte stick directly on the skin. This greatly reduces interfacial motion between the skin surface and 
the electrolyte, and hence there is a smaller amount of motion artifact in the signal. This type of hydrogel 
electrode is, therefore, especially valuable in monitoring patients who move a great deal or during exercise. 

Thin-film flexible electrodes such as shown in Fig. 48.4ghave been used for monitoring neonates. They 
are basically the same as the metal plate electrodes; only the thickness of the metal in this case is less than 
a micrometer. These metal films need to be supported on a flexible plastic substrate such as polyester or 
polyimide. The advantage of using only a thin metal layer for the electrode lies in the fact that these 
electrodes will then become x-ray transparent. This is especially important in infants where repeated 
placement and removal of electrodes, so that x-rays may be taken, can cause substantial skin irritation. 

Electrodes that do not use artificially applied electrolyte solutions or gels and, therefore, are often 
referred to as dry electrodes have been used in some monitoring applications. These sensors as illustrated 
in Fig. 48.4ft can be placed on the skin and held in position by an elastic band or tape. They are made 
up of a graphite or metal-filled polymer such as silicone. The conducting particles are ground into a fine 
powder, and this is added to the silicone elastomer before it cures so to produce a conductive material 
with physical properties similar to that of the elastomer. When held against the skin surface, these 
electrodes establish contact with the skin without the need for an electrolytic fluid or gel. In actuality 
such a layer is formed by sweat under the electrode surface. For this reason these electrodes tend to 
perform better after they have been left in place for an hour or two so that this layer forms. Some 
investigators have found that placing a drop of physiologic saline solution on the skin before applying 
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the electrode accelerates this process. This type of electrode has found wide application in home infant 
cardiorespiratory monitoring because of the ease with which it can be applied by untrained caregivers. 

Intracavitary and Intratissue Electrodes 

Electrodes can be placed within the body for biopotential measurements. These electrodes are generally 
smaller than skin surface electrodes and do not require special electrolytic coupling fluid, since natural 
body fluids serve this function. There are many different designs for these internal electrodes, and only 
a few examples are given in the following paragraphs. Basically these electrodes can be classified as needle 
electrodes, which can be used to penetrate the skin and tissue to reach the point where the measurement 
is to be made, or they are electrodes that can be placed in a natural cavity or surgically produced cavity 
in tissue. Figure 48.5 illustrates some of these internal electrodes. 

A catheter tip or probe electrode is placed in a naturally occurring cavity in the body such as in the 
gastrointestinal system. A metal tip or segment on a catheter makes up the electrode. The catheter or, in 
the case where there is no hollow lumen, probe, is inserted into the cavity so that the metal electrode 
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FIGURE 48.5 Examples of different internal electrodes: (a) catheter or probe electrode, (b) needle electrode, 
(c) coaxial needle electrode, (d) coiled wire electrode. (Reprinted with permission from Webster JG (ed). 1992. 
Medical Instrumentation: Application and Design, Houghton Mifflin, Boston.) 
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makes contact with the tissue. A lead wire down the lumen of the catheter or down the center of the 
probe connects the electrode to the external circuitry. 

The basic needle electrode shown in Fig. 48.5b consists of a solid needle, usually made of stainless 
steel, with a sharp point. An insulating material coats the shank of the needle up to a millimeter or two 
of the tip so that the very tip of the needle remains exposed. When this structure is placed in tissue such 
as skeletal muscle, electrical signals can be picked up by the exposed tip. One can also make needle 
electrodes by running one or more insulated wires down the lumen of a standard hypodermic needle. 
The electrode as shown in Fig. 48.5c is shielded by the metal of the needle and can be used to pick up 
very localized signals in tissue. 

Fine wires can also be introduced into tissue using a hypodermic needle, which is then withdrawn. 
This wire can remain in tissue for acute or chronic measurements. Caldwell and Reswick have used fine 
coiled wire electrodes in skeletal muscle for several years without adverse effects [7]. 

Microelectrodes 

The electrodes described in the previous paragraphs have been applied to studying bioelectric signals at 
the organism, organ, or tissue level but not at the cellular level. To study the electric behavior of cells, 
electrodes that are themselves smaller than the cells being studied need to be used. Three types of 
electrodes have been described for this purpose: etched metal electrodes, micropipette electrodes, and 
metal-film-coated micropipette electrodes. The metal microelectrode is essentially a subminiature version 
of the needle electrode described in the previous section (Fig. 48. 6a). In this case, a strong metal such 
as tungsten is used. One end of this wire is etched electrolytically to give tip diameters on the order of a 
few micrometers. The structure is insulated up to its tip, and it can be passed through the membrane of 
a cell to contact the cytosol. The advantage of these electrodes is that they are both small and robust and 
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FIGURE 48.6 Microelectrodes: (a) metal, (h) micropipette, (c) thin metal film on micropipette. (Reprinted with 
permission from Webster JC (ed). 1992. Medical Instrumentation: Application and Design, Houghton Mifflin, Boston.) 
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can be used for neurophysiologic studies. Their principal disadvantage is the difficulty encountered in 
their fabrication and their high source impedance. 

The second and most frequently used type of microelectrode is the glass micropipette. This structure, 
as illustrated in Fig. 48.6b consists of a fine glass capillary drawn to a very narrow point and filled with 
an electrolytic solution. The point can be as narrow as a fraction of a micrometer, and the dimensions 
of this electrode are strongly dependent on the skill of the individual drawing the tip. The electrolytic 
solution in the lumen serves as the contact between the interior of the cell through which the tip has 
been impaled and a larger conventional electrode located in the shank of the pipette. These electrodes 
also suffer from high source impedances and fabrication difficulty. 

A combined form of these two types of electrodes can be achieved by depositing a metal film over the 
outside surface of a glass micropipette as shown in Fig. 48.6c. In this case, the strength and smaller 
dimensions of the micropipette can be used to support films of various metals that are insulated by an 
additional film up to a point very close to the actual tip of the electrode structure. These electrodes have 
been manufactured in quantity and made available as commercial products. Since they combine the 
features of both the metal and the micropipette electrodes, they also suffer from many of the same 
limitations. They do, however, have the advantage of flexibility due to the capability of being able to 
make films of different metals on the micropipette surface without having to worry about the strength 
of the metal, as would be the case if the metal were used alone. 

Electrodes Fabricated Using Microelectronic Technology 

Modern microelectronic technology can be used to fabricate many different types of electrodes for specific 
biomedical applications. For example, dry electrodes with high source resistances or microelectrodes with 
similar characteristics can be improved by incorporating a microelectronic amplifier for impedance con- 
version right on the electrode itself. In the case of the conventional-sized electrodes, a metal disk 5-10 mm 
in diameter can have a high input impedance microelectronic amplifier configured as a follower integrated 
into the back of the electrode so that localized processing of the high source impedance signal can produce 
one of lower, more practical impedance for signal transmission [8] . Single- and multiple-element electrodes 
can be made from thin-film or silicon technology. Mastrototaro and colleagues have demonstrated probes 
for measuring intramyocardial potentials using thin, patterned gold films on polyimide or oxidised molyb- 
denum substrates [9]. When electrodes are made from pieces of micromachined silicon, it is possible to 
integrate an amplifier directly into the electrode [10]. Multichannel amplifiers or multiplexers can be used 
with multiple electrodes on the same probe. Electrodes for contact with individual nerve fibers can be 
fabricated using micromachined holes in a silicon chip that are just big enough to pass a single growing 
axon. Electrical contacts on the sides of these holes can then be used to pick up electrical activity from these 
nerves [11]. These examples are just a few of the many possibilities that can be realized using microelectronics 
and three-dimensional micromachining technology to fabricate specialized electrodes. 

48.4 Biomedical Applications 

Electrodes can be used to perform a wide variety of measurements of bioelectric signals. An extensive 
review of this would be beyond the scope of this chapter, but some typical examples of applications are 
highlighted in Table 48.4. The most popular application for biopotential electrodes is in obtaining the 
electrocardiogram for diagnostic and patient-monitoring applications. A substantial commercial market 
exists for various types of electrocardiographic electrodes, and many of the forms described in the 
previous section are available commercially. Other electrodes for measuring bioelectric potentials for 
application in diagnostic medicine are indicated in Table 48.4. Research applications of biopotential 
electrodes are highly varied and specific for individual studies. Although a few examples are given in 
Table 48.4, the field is far too broad to be completely covered here. 

Biopotential electrodes are one of the most common biomedical sensors used in clinical medicine. 
Although their basic principle of operation is the same for most applications, they take on many forms 
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TABLE 48.4 Examples of Applications of Biopotential Electrodes 



Application 


Biopotential 


Type of Electrode 


Cardiac monitoring 


ECG 


Ag/AgCl with sponge 
Ag/AgCl with hydrogel 


Infant cardiopulmonary monitoring 


ECG impedance 


Ag/AgCl with sponge 

Ag/AgCl with hydrogel 

Thin-film 

Filled elastomer dry 


Sleep encephalography 


EEC 


Gold cups 
Ag/AgCl cups 
Active electrodes 


Diagnostic muscle activity 


EMG 


Needle 


Cardiac electrograms 


Electrogram 


Intracardiac probe 


Implanted telemetry of biopotentials 


ECG 


Stainless steel wire loops 




EMG 


Platinum disks 


Eye movement 


EOG 


Ag/AgCl with hydrogel 



and are used in the measurement of many types of bioelectric phenomena. They will continue to play 
an important role in biomedical instrumentation systems. 
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Further Information 

Good overviews of biopotential electrodes are found in Geddes LA. 1972. Electrodes and the Measurement 
of Bioelectric Events, New York, Wiley; and Ferris CD. 1974. Introduction to Bioelectrodes, New York, 
Plenum. Even though these references are more than 20 years old, they clearly cover the field, and little 
has changed since these books were written. 

Overviews of biopotential electrodes are found in chapters of two works edited by John Webster. 
Chapter 5 of his textbook, Medical Instrumentation: Application and Design, covers the material of this 
chapter in more detail, and there is a section on "Bioelectrodes" in his Encyclopedia on Medical Devices 
and Instrumentation, published by Wiley in 1988. 

The journals IEEE Transactions on Biomedical Engineering and Medical and Biological Engineering and 
Computing are good sources of recent research on biopotential electrodes. 
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Electrochemical sensors have been used extensively either as a whole or an integral part of a chemical 
and biomedical sensing element. For instance, blood gas (P02, PC02, and pH) sensing can be accom- 
plished entirely by electrochemical means. Many important biomedical enzymatic sensors, including 
glucose sensors, incorporate an enzymatic catalyst and an electrochemical sensing element. The Clark 
type of oxygen sensor [Clark, 1956] is a well-known practical biomedical sensor based on electrochemical 
principles, an amperometric device. Electrochemical sensors generally can be categorized as conductiv- 
ity/capacitance, potentiometric, amperometric, and voltammetric sensors. The amperometric and vol- 
tammetric sensors are characterized by their current-potential relationship with the electrochemical 
system and are less well-defined. Amperometric sensors can also be viewed as a subclass of voltammetric 
sensors. 

Electrochemical sensors are essentially an electrochemical cell which employs a two or three-electrode 
arrangement. Electrochemical sensor measurement can be made at steady-state or transient. The applied 
current or potential for electrochemical sensors may vary according to the mode of operation, and the 
selection of the mode is often intended to enhance the sensitivity and selectivity of a particular sensor. 
The general principles of electrochemical sensors have been extensively discussed in many electroanalytic 
references. However, many electroanalytic methods are not practical in biomedical sensing applications. 
For instance, dropping mercury electrode polarography is a well-established electroanalytic method, yet 
its usefulness in biomedical sensor development, particularly for potential in vivo sensing, is rather 
limited. In this chapter, we shall focus on the electrochemical methodologies which are useful in bio- 
medical sensor development. 

49.1 Conductivity/Capacitance Electrochemical Sensors 

Measurement of the conductivity of an electrochemical cell can be the basis for an electrochemical sensor. 
This differs from an electrical (physical) measurement, for the electrochemical sensor measures the 
conductivity change of the system in the presence of a given solute concentration. This solute is often 
the sensing species of interest. Electrochemical sensors may also involve a capacitative impedance resulting 
from the polarization of the electrodes and the faradaic or charge transfer processes. 

It has been established that the conductance of a homogeneous solution is directly proportional to 
the cross-sectional area perpendicular to the electrical field and inversely proportional to the segment of 
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solution along the electrical field. Thus, the conductance of this solution (electrolyte), G (Q '), can be 
expressed as 

G = 0A/L (49.1) 

where A is the cross-sectional area (in cm 2 ), L is the segment of the solution along the electrical field (in 
cm), and a (in Q _1 cnr 1 ) is the specific conductivity of the electrolyte and is related quantitatively to the 
concentration and the magnitude of the charges of the ionic species. For a practical conductivity sensor, 
A is the surface of the electrode, and L is the distance between the two electrodes. 

Equivalent and molar conductivities are commonly used to express the conductivity of the electrolyte. 
Equivalent conductance depends on the concentration of the solution. If the solution is a strong elec- 
trolyte, it will completely dissociate the components in the solution to ionic forms. Kohlrauch [Maclnnes, 
1939] found that the equivalent conductance of a strong electrolyte was proportional to the square root 
of its concentration. However, if the solution is a weak electrolyte which does not completely dissociate 
the components in the solution to respective ions, the above observation by Kohlrauch is not applicable. 

The formation of ions leads to consideration of their contribution to the overall conductance of the 
electrolyte. The equivalent conductance of a strong electrolyte approaches a constant limiting value at 
infinite dilution, namely, 

A =A =X + +r (49.2) 

o Jim— >o o o v ' 

where A is the equivalent conductance of the electrolyte at infinite dilution and X + and X~~ are the ionic 
equivalent conductance of cations and anions at infinite dilution, respectively. 

Kohlrauch also established the law of independent mobilities of ions at infinite dilution. This implies 
that A at infinite dilution is a constant at a given temperature and will not be affected by the presence 
of other ions in the electrolytes. This provides a practical estimation of the value of A from the values 
of X + and X~ . As mentioned, the conductance of an electrolyte is influenced by its concentration. Kohl- 
rausch stated that the equivalent conductance of the electrolyte at any concentration C in mol/1 or any 
other convenient units can be expressed as 

A = A o -pC 05 (49.3) 

where (3 is a constant depending on the electrolyte. 

In general, electrolytes can be classified as weak electrolytes, strong electrolytes, and ion-pair electro- 
lytes. Weak electrolytes only dissociate to their component ions to a limited extent, and the degree of the 
dissociation is temperature dependent. However, strong electrolytes dissociate completely, and Eq. (49.3) 
is applicable to evaluate its equivalent conductance. Ion-pair electrolytes can by characterized by their 
tendency to form ion pairs. The dissociation of ion pairs is similar to that of a weak electrolyte and is 
affected by ionic activities. The conductivity of ion-pair electrolytes is often nonlinear related to its 
concentration. 

The electrolyte conductance measurement technique, in principle, is relatively straightforward. How- 
ever, the conductivity measurement of an electrolyte is often complicated by the polarization of the 
electrodes at the operating potential. Faradaic or charge transfer processes occur at the electrode surface, 
complicating the conductance measurement of the system. Thus, if possible, the conductivity electro- 
chemical sensor should operate at a potential where no faradaic processes occur. Also, another important 
consideration is the formation of the double layer adjacent to each electrode surface when a potential is 
imposed on the electrochemical sensor. The effect of the double layer complicates the interpretation of 
the conductivity measurement and is usually described by the Warburg impedance. Thus, even in the 
absence of faradaic processes, the potential effect of the double layer on the conductance of the electrolyte 
must be carefully assessed. The influence of a faradaic process can be minimized by maintaining a high 
center constant, LI A, of the electrochemical conductivity sensor, so that the cell resistance lies in the 
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region of 1-50 kfi. This implies the desirable feature of a small electrode surface area and a relatively 
large distance between the two electrodes. Yet, a large electrode surface area enhances the accuracy of the 
measurement, since a large deviation from the null point facilitates the balance of the Wheatstone bridge, 
resulting in improvement of sensor sensitivity. These opposing features can be resolved by using a 
multiple-sensing electrode configuration in which the surface area of each electrode element is small 
compared to the distance between the electrodes. The multiple electrodes are connected in parallel, and 
the output of the sensor represents the total sum of the current through each pair of electrodes. In this 
mode of measurement, the effect of the double layer is included in the conductance measurement. The 
effects of both the double layers and the faradaic processes can be minimized by using a high-frequency, 
low- amplitude alternating current. The higher the frequency and the lower the amplitude of the imposed 
alternating current, the closer the measured value is to the true conductance of the electrolyte. 

49.2 Potentiometric Sensors 

When a redox reaction, Ox + Ze = Red, takes place at an electrode surface in an electrochemical cell, a 
potential may develop at the electrode-electrolyte interface. This potential may then be used to quantify 
the activity (on concentration) of the species involved in the reaction forming the fundamental of 
potentiometric sensors. 

The above reduction reaction occurs at the surface of the cathode and is defined as a half-cell reaction. 
At thermodynamic equilibrium, the Nernst equation is applicable and can be expressed as: 



£=£" + — In 
ZF 



a 



(49.4) 



where E and E° are the measured electrode potential and the electrode potential at standard state, 
respectively; a m and a red are the activities of Ox (reactant in this case) and Red (product in this case), 
respectively; Z is the number of electrons transferred, F the Faraday constant, R the gas constant, and T 
the operating temperature in the absolute scale. In the electrochemical cell, two half-cell reactions will 
take place simultaneously. However, for sensing purposes, only one of the two half-cell reactions should 
involve the species of interest, and the other half-cell reaction is preferably reversible and noninterfering. 
As indicated in Eq. (49.4), a linear relation exists between the measured potential E and the natural 
logarithm of the ratio of the activities of the reactant and product. If the number of electrons transferred, 
Z, is one, at ambient temperature (25°C or 298°K) the slope is approximately 60 mV/decade. This slope 
value governs the sensitivity of the potentiometric sensor. 

Potentiometric sensors can be classified based on whether the electrode is inert or active. An inert 
electrode does not participate in the half-cell reaction and merely provides the surface for the electron 
transfer or provides a catalytic surface for the reaction. However, an active electrode is either an ion 
donor or acceptor in the reaction. In general, there are three types of active electrodes: the metal/metal 
ion, the metal/insoluble salt or oxide, and metal/metal chelate electrodes. 

Noble metals such as platinum and gold, graphite, and glassy carbon are commonly used as inert 
electrodes on which the half-cell reaction of interest takes place. To complete the circuitry for the 
potentiometric sensor, the other electrode is usually a reference electrode on which a noninterference 
half-cell reaction occurs. Silver-silver chloride and calomel electrodes are the most commonly used 
reference electrodes. Calomel consists of Hg/HgCl 2 and is less desirable for biomedical systems in terms 
of toxicity. 

An active electrode may incorporate chemical or biocatalysts and is involved as either an ion donor 
or acceptor in the half-cell reaction. The other half-cell reaction takes place on the reference electrode 
and should also be noninterfering. 

If more than a single type of ion contributes to the measured potential in Eq. (49.4), the potential can 
no longer be used to quantify the ions of interest. This is the interference in a potentiometric sensor. 
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Thus, in many cases, the surface of the active electrode often incorporates a specific functional membrane 
which may be ion-selective, ion-permeable, or have ion-exchange properties. These membranes tend to 
selectivity permit the ions of interest to diffuse or migrate through. This minimizes the ionic interference. 
Potentiometric sensors operate at thermodynamic equilibrium conditions. Thus, in practical poten- 
tiometric sensing, the potential measurement needs to be made under zero-current conditions. Conse- 
quently, a high-input impedance electrometer is often used for measurements. Also, the response time 
for a potentiometric sensor to reach equilibrium conditions in order to obtain a meaningful reading can 
be quite long. These considerations are essential in the design and selection of potentiometric sensors 
for biomedical applications. 

49.3 Voltammetric Sensors 

The current-potential relationship of an electrochemical cell provides the basis for voltammetric sensors. 
Amperometric sensors, that are also based on the current-potential relationship of the electrochemical 
cell, can be considered a subclass of voltammetric sensors. In amperometric sensors, a fixed potential is 
applied to the electrochemical cell, and a corresponding current, due to a reduction or oxidation reaction, 
is then obtained. This current can be used to quantify the species involved in the reaction. The key 
consideration of an amperometric sensor is that it operates at a fixed potential. However, a voltammetric 
sensor can operate in other modes such as linear cyclic voltammetric modes. Consequently, the respective 
current potential response for each mode will be different. 

In general, voltammetric sensors examine the concentration effect of the detecting species on the 
current-potential characteristics of the reduction or oxidation reaction involved. 

The mass transfer rate of the detecting species in the reaction onto the electrode surface and the 
kinetics of the faradaic or charge transfer reaction at the electrode surface directly affect the current- 
potential characteristics. This mass transfer can be accomplished through (a) an ionic migration as a 
result of an electric potential gradient, (b) a diffusion under a chemical potential difference or concen- 
tration gradient, and (c) a bulk transfer by natural or forced convection. The electrode reaction kinetics 
and the mass transfer processes contribute to the rate of the faradaic process in an electrochemical cell. 
This provides the basis for the operation of the voltammetric sensor. However, assessment of the simul- 
taneous mass transfer and kinetic mechanism is rather complicated. Thus, the system is usually operated 
under definitive hydrodynamic conditions. Various techniques to control either the potential or current 
are used to simplify the analysis of the voltammetric measurement. A description of these techniques 
and their corresponding mathematical analyses are well documented in many texts on electrochemistry 
or electroanalysis [Adams, 1969; Bard & Faulkner, 1980; Lingane, 1958; Macdonald, 1977; Murray & 
Reilley, 1966]. 

A preferred mass transfer condition is total diffusion, which can be described by Fick's law of diffusion. 
Under this condition, the cell current, a measure of the rate of the faradaic process at an electrode, usually 
increases with increases in the electrode potential. This current approaches a limiting value when the 
rate of the faradaic process at the electrode surface reaches its maximum mass transfer rate. Under this 
condition, the concentration of the detecting species at the electrode surface is considered as zero and is 
diffusional mass transfer. Consequently, the limiting current and the bulk concentration of the detecting 
species can be related by 

i=ZFkmC* (49.5) 

where km is the mass transfer coefficient and C* is the bulk concentration of the detecting species. At 
the other extreme, when the electrode kinetics are slow compared with the mass transfer rate, the 
electrochemical system is operated in the reaction kinetic control regime. This usually corresponds to a 
small overpotential. The limiting current and the bulk concentration of the detecting species can be 
related as 
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i=ZFkcC* (49.6) 

where kc is the kinetic rate constant for the electrode process. Both Eqs. (49.5) and (49.6) show the linear 
relationship between the limiting current and the bulk concentration of the detecting species. In many 
cases, the current does not tend to a limiting value with an increase in the electrode potential. This is 
because other faradaic or nonfaradaic processes become active, and the cell current represents the 
cumulative rates of all active electrode processes. The relative rates of these processes, expressing current 
efficiency, depend on the current density of the electrode. Assessment of such a system is rather compli- 
cated, and the limiting current technique may become ineffective. 

When a voltammetric sensor operates with a small overpotential, the rate of faradaic reaction is also 
small; consequently, a high-precision instrument for the measurement is needed. An amperometric sensor 
is usually operated under limiting current or relatively small overpotential conditions. Amperometric 
sensors operate under an imposed fixed electrode potential. Under this condition, the cell current can 
be correlated with the bulk concentration of the detecting species (the solute). This operating mode is 
commonly classified as amperometric in most sensor work, but it also is referred to as the chronosuper- 
ometric method, since time is involved. 

Voltammetric sensors can be operated in a linear or cyclic sweep mode. Linear sweep voltammetry 
involves an increase in the imposed potential linearly at a constant scanning rate from an initial potential 
to a defined upper potential limit. This is the so-called potential window. The current-potential curve 
usually shows a peak at a potential where the oxidation or reduction reaction occurs. The height of the 
peak current can be used for the quantification of the concentration of the oxidation or reduction species. 
Cyclic voltammetry is similar to the linear sweep voltammetry except that the electrode potential returns 
to its initial value at a fixed scanning rate. The cyclic sweep normally generates the current peaks 
corresponding to the oxidation and reduction reactions. Under these circumstances, the peak current 
value can relate to the corresponding oxidation or reduction reaction. However, the voltammogram can 
be very complicated for a system involving adsorption (nonfaradaic processes) and charge processes 
(faradaic processes). The potential scanning rate, diffusivity of the reactant, and operating temperature 
are essential parameters for sensor operation, similar to the effects of these parameters for linear sweep 
voltammograms. The peak current may be used to quantify the concentration of the reactant of interest, 
provided that the effect of concentration on the diffusivity is negligible. The potential at which the peak 
current occurs can be used in some cases to identify the reaction, or the reactant. This identification is 
based on the half-cell potential of the electrochemical reactions, either oxidation or reduction. The values 
of these half-cell reactions are listed extensively in handbooks and references. 

The described voltammetric and amperometric sensors can be used very effectively to carry out 
qualitative and quantitative analyses of chemical and biochemical species. The fundamentals of this 
sensing technique are well established, and the critical issue is the applicability of the technique to a 
complex, practical environment, such as in whole blood or other biologic fluids. This is also the exciting 
challenge of designing a biosensor using voltammetric and amperometric principles. 

49.4 Reference Electrodes 

Potentiometric, voltammetric, and amperometric sensors employ a reference electrode. The reference 
electrode in the case of potentiometric and amperometric sensors serves as a counter electrode to complete 
the circuitry. In either case, the reaction of interest takes place at the surface of the working electrode, 
and this reaction is either an oxidation or reduction reaction. Consequently, the reaction at the counter 
electrode, i.e., the reference electrode, is a separate reduction or oxidation reaction, respectively. It is 
necessary that the reaction occurring at the reference electrode does not interfere with the reaction at 
the working electrode. For practical applications, the reaction occurring at the reference electrode should 
be highly reversible and, as stated, does not contribute to the reaction at the working electrode. In 
electrochemistry, the hydrogen electrode is universally accepted as the primary standard with which other 
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electrodes are compared. Consequently, the hydrogen electrode serves extensively as a standard reference. 
A hydrogen reference electrode is relatively simple to prepare. However, for practical applications hydro- 
gen reference electrodes are too cumbersome to be useful in practice. 

A class of electrode called the electrode of the second kind, which forms from a metal and its sparingly 
soluble metal salt, finds use as the reference electrode. The most common electrode of this type includes 
the calomel electrode, Hg/HgCl 2 and the silver-silver chloride electrode, Ag/AgCl. In biomedical appli- 
cations, particularly in in vivo applications, Ag/AgCl is more suitable as a reference electrode. 

An Ag/AgCl electrode can be small, compact, and relatively simple to fabricate. As a reference electrode, 
the stability and reproducibility of an Ag/AgCl electrode is very important. Contributing factors to 
instability and poor reproducibility of Ag/AgCl electrodes include the purity of the materials used, the 
aging effect of the electrode, the light effect, and so on. When in use, the electrode and the electrolyte 
interface contribute to the stability of the reference electrode. It is necessary that a sufficient quantity of 
Ch ions exists in the electrolyte when the Ag/AgCl electrode serves as a reference. Therefore, other 
silver-silver halides such as Ag/AgBr or Ag/AgI electrodes are used in cases where these other halide ions 
are present in the electrolyte. 

In a voltammetric sensor, the reference electrode serves as a true reference for the working electrode, 
and no current flows between the working and reference electrodes. Nevertheless, the stability of the 
reference electrode remains essential for a voltammetric sensor. 

49.5 Summary 

Electrochemical sensors are used extensively in many biomedical applications including blood chemistry 
sensors 1 , P0 2 , PC0 2 , and pH electrodes. Many practical enzymatic sensors, including glucose and lactate 
sensors, also employ electrochemical sensors as sensing elements. Electrochemically based biomedical 
sensors have found in vivo and in vitro applications. We believe that electrochemical sensors will continue 
to be an important aspect of biomedical sensor development. 
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Optical methods are among the oldest and best-established techniques for sensing biochemical analytes. 
Instrumentation for optical measurements generally consists of a light source, a number of optical 
components to generate a light beam with specific characteristics and to direct this light to some 
modulating agent, and a photodetector for processing the optical signal. The central part of an optical 
sensor is the modulating component, and a major part of this chapter will focus on how to exploit the 
interaction of an analyte with optical radiation in order to obtain essential biochemical information. 

The number of publications in the field of optical sensors for biomedical applications has grown 
significantly during the past two decades. Numerous scientific reviews and historical perspectives have 
been published, and the reader interested in this rapidly growing field is advised to consult these sources 
for additional details. This chapter will emphasize the basic concept of typical optical sensors intended 
for continuous in vivo monitoring of biochemical variables, concentrating on those sensors which have 
generally progressed beyond the initial feasibility stage and reached the promising stage of practical 
development or commercialization. 

Optical sensors are usually based on optical fibers or on planar waveguides. Generally, there are three 
distinctive methods for quantitative optical sensing at surfaces: 

1. The analyte directly affects the optical properties of a waveguide, such as evanescent waves (elec- 
tromagnetic waves generated in the medium outside the optical waveguide when light is reflected 
from within) or surface plasmons (resonances induced by an evanescent wave in a thin film 
deposited on a waveguide surface). 

2. An optical fiber is used as a plain transducer to guide light to a remote sample and return light 
from the sample to the detection system. Changes in the intrinsic optical properties of the medium 
itself are sensed by an external spectrophotometer. 

3. An indicator or chemical reagent placed inside, or on, a polymeric support near the tip of the 
optical fiber is used as a mediator to produce an observable optical signal. Typically, conventional 
techniques, such as absorption spectroscopy and fluorimetry, are employed to measure changes 
in the optical signal. 
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FIGURE 50.1 General diagram representing the basic building blocks of an optical instrument for optical sensor 
applications. 

50.1 Instrumentation 

The actual implementation of instrumentation designed to interface with optical sensors will vary greatly 
depending on the type of optical sensor used and its intended application. A block diagram of a generic 
instrument is illustrated in Fig. 50. 1. The basic building blocks of such an instrument are the light source, 
various optical elements, and photodetectors. 

Light Source 

A wide selection of light sources are available for optical sensor applications. These include: highly 
coherent gas and semiconductor diode lasers, broad spectral band incandescent lamps, and narrow-band, 
solid-state, light-emitting diodes (LEDs). The important requirement of a light source is obviously good 
stability. In certain applications, for example in portable instrumentation, LEDs have significant advan- 
tages over other light sources because they are small and inexpensive, consume lower power, produce 
selective wavelengths, and are easy to work with. In contrast, tungsten lamps provide a broader range of 
wavelengths, higher intensity, and better stability but require a sizable power supply and can cause heating 
problems inside the apparatus. 

Optical Elements 

Various optical elements are used routinely to manipulate light in optical instrumentation. These include 
lenses, mirrors, light choppers, beam splitters, and couplers for directing the light from the light source 
into the small aperture of a fiber optic sensor or a specific area on a waveguide surface and collecting 
the light from the sensor before it is processed by the photodetector. For wavelength selection, optical 
filters, prisms, and diffraction gratings are the most common components used to provide a narrow 
bandwidth of excitation when a broadwidth light source is utilized. 

Photodetectors 

In choosing photodetectors for optical sensors, a number of factors must be considered. These include 
sensitivity, detectivity, noise, spectral response, and response time. Photomultipliers and semiconductor 
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quantum photodetectors, such as photoconductors and photodiodes, are both suitable. The choice, 
however, is somewhat dependent on the wavelength region of interest. Generally, both give adequate 
performance. Photodiodes are usually more attractive because of the compactness and simplicity of the 
circuitry involved. 

Typically, two photodetectors are used in optical instrumentation because it is often necessary to 
include a separate reference detector to track fluctuations in source intensity and temperature. By taking 
a ratio between the two detector readings, whereby a part of the light that is not affected by the 
measurement variable is used for correcting any optical variations in the measurement system, a more 
accurate and stable measurement can be obtained. 

Signal Processing 

Typically, the signal obtained from a photodetector provides a voltage or a current proportional to the 
measured light intensity. Therefore, either simple analog computing circuitry (e.g., a current-to-voltage 
converter) or direct connection to a programmable gain voltage stage is appropriate. Usually, the output 
from a photodetector is connected directly to a preamplifier before it is applied to sampling and analog- 
to-digital conversion circuitry residing inside a computer. 

Quite often two different wavelengths of light are utilized to perform a specific measurement. One 
wavelength is usually sensitive to changes in the species being measured, and the other wavelength is 
unaffected by changes in the analyte concentration. In this manner, the unaffected wavelength is used as 
a reference to compensate for fluctuation in instrumentation over time. In other applications, additional 
discriminations, such as pulse excitation or electronic background subtraction utilizing synchronized 
lock-in amplifier detection, are useful, allowing improved selectivity and enhanced signal-to-noise ratio. 

50.2 Optical Fibers 

Several types of biomedical measurements can be made by using either plain optical fibers as a remote 
device for detecting changes in the spectral properties of tissue and blood or optical fibers tightly coupled 
to various indicator-mediated transducers. The measurement relies either on direct illumination of a sample 
through the endface of the fiber or by excitation of a coating on the side wall surface through evanescent 
wave coupling. In both cases, sensing takes place in a region outside the optical fiber itself. Light emanating 
from the fiber end is scattered or fluoresced back into the fiber, allowing measurement of the returning 
light as an indication of the optical absorption or fluorescence of the sample at the fiber optic tip. 

Optical fibers are based on the principle of total internal reflection. Incident light is transmitted through 
the fiber if it strikes the cladding at an angle greater than the so-called critical angle, so that it is totally 
internally reflected at the core/cladding interface. A typical instrument for performing fiber optic sensing 
consists of a light source, an optical coupling arrangement, the fiber optic light guide with or without 
the necessary sensing medium incorporated at the distal tip, and a light detector. 

A variety of high-quality optical fibers are available commercially for biomedical sensor applications, 
depending on the analytic wavelength desired. These include plastic, glass, and quartz fibers which cover the 
optical spectrum from the UV through the visible to the near IR region. On one hand, plastic optical fibers 
have a larger aperture and are strong, inexpensive, flexible, and easy to work with but have poor UV 
transmission below 400 nm. On the other hand, glass and quartz fibers have low attenuation and better 
transmission in the UV but have small apertures, are fragile, and present a potential risk in in vivo applications. 

Probe Configurations 

There are many different ways to implement fiber optic sensors. Most fiber optic chemical sensors employ 
either a single-fiber configuration, where light travels to and from the sensing tip in one fiber, or a double- 
fiber configuration, where separate optical fibers are used for illumination and detection. A single fiber 
optic configuration offers the most compact and potentially least expensive implementation. However, 
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additional challenges in instrumentation are involved in separating the illuminating signal from the 
composite signal returning for processing. 

The design of intravascular catheters requires special considerations related to the sterility and bio- 
compatibility of the sensor. For example, intravascular fiberoptic sensors must be sterilizable and their 
material nonthrombogenic and resistant to platelet and protein deposition. Therefore, these catheters 
are typically made of materials covalently bound with heparin or antiplatelet agents. The catheter is 
normally introduced into the jugular vein via a peripheral cut-down and a slow heparin flush is main- 
tained until it is removed from the blood. 

Optical Fiber Sensors 

Advantages cited for fiber sensors include their small size and low cost. In contrast to electrical measure- 
ments, where the difference of two absolute potentials must be measured, fiber optics are self-contained 
and do not require an external reference signal. Because the signal is optical, there is no electrical risk 
to the patient, and there is no direct interference from surrounding electric or magnetic fields. Chemical 
analysis can be performed in real-time with almost an instantaneous response. Furthermore, versatile 
sensors can be developed that respond to multiple analytes by utilizing multiwavelength measurements. 
Despite these advantages, optical fiber sensors exhibit several shortcomings. Sensors with immobilized 
dyes and other indicators have limited long-term stability, and their shelf life degrades over time. More- 
over, ambient light can interfere with the optical measurement unless optical shielding or special time- 
synchronous gating is performed. 

Indicator-Mediated Transducers 

Only a limited number of biochemical analytes have an intrinsic optical absorption that can be measured 
with sufficient selectivity directly by spectroscopic methods. Other species, particularly hydrogen, oxygen, 
carbon dioxide, and glucose, which are of primary interest in diagnostic applications, are not susceptible 
to direct photometry. Therefore, indicator-mediated sensors have been developed using specific reagents 
that are properly immobilized on the surface of an optical sensor. 

The most difficult aspect of developing an optical biosensor is the coupling of light to the specific 
recognition element so that the sensor can respond selectively and reversibly to a change in the concen- 
tration of a particular analyte. In fiber-optic-based sensors, light travels efficiently to the end of the fiber 
where it exists and interacts with a specific chemical or biologic recognition element that is immobilized 
at the tip of the fiber optic. These transducers may include indicators and ionophores (i.e., ion-binding 
compounds) as well as a wide variety of selective polymeric materials. After the light interacts with the 
sample, the light returns through the same or a different optical fiber to a detector which correlates the 
degree of change with the analyte concentration. 

Typical indicator-mediated fiber-optic-sensor configurations are shown schematically in Fig. 50.2. In 
(fl) the indicator is immobilized directly on a membrane positioned at the end of a fiber. An indicator 
in the form of a powder can be either glued directly onto a membrane, as shown in (b), or physically 
retained in position at the end of the fiber by a special permeable membrane (c), a tubular capillary/mem- 
brane (d), or a hollow capillary tube (e). 

50.3 General Principles of Optical Sensing 

Two major optical techniques are commonly available to sense optical changes at sensor interfaces. These 
are usually based on evanescent wave and surface plasmon resonance principles. 

Evanescent Wave Spectroscopy 

When light propagates along an optical fiber, it is not confined to the core region but penetrates to some 
extent into the surrounding cladding region. In this case, an electromagnetic component of the light 
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FIGURE 50.2 Typical configuration of different indicator-mediated fiber optic sensor tips (from Otto S. Wolfbeis, 
Fiber Optic Chemical Sensors and Biosensors, vol. 1, CRC Press, Boca Raton, 1990). 

penetrates a characteristic distance (on the order of one wavelength) beyond the reflecting surface into the 
less optically dense medium where it is attenuated exponentially according to Beer-Lambert's law (Fig. 50.3). 

The evanescent wave depends on the angle of incidence and the incident wavelength. This phenomenon 
has been widely exploited to construct different types of optical sensors for biomedical applications. 
Because of the short penetration depth and the exponential decay of the intensity, the evanescent wave 
is absorbed mainly by absorbing compounds very close to the surface. In the case of particularly weak 
absorbing analytes, sensitivity can be enhanced by combining the evanescent wave principle with multiple 
internal reflections along the sides of an unclad portion of a fiber optic tip. 

Instead of an absorbing species, a fluorophore can also be used. Light is absorbed by the fluorophore 
emitting detectable fluorescent light at a higher wavelength, thus providing improved sensitivity. Evanes- 
cent wave sensors have been applied successfully to measure the fluorescence of indicators in solution, 
for pH measurement, and in immunodiagnostics. 



Surface Plasmon Resonance 

Instead of the dielectric/dielectric interface 
used in evanescent wave sensors, it is possible 
to arrange a dielectric/metal/dielectric sand- 
wich layer such that when monochromatic 
polarized light (e.g., from a laser source) 
impinges on a transparent medium having a 
metallized (e.g., Ag or Au) surface, light is 
absorbed within the plasma formed by the 
conduction electrons of the metal. This 
results in a phenomenon known as surface 
plasmon resonance (SPR). When SPR is 
induced, the effect is observed as a minimum 
in the intensity of the light reflected off the 
metal surface. 

As is the case with the evanescent wave, an 
SPR is exponentially decaying into solution 
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FIGURE 50.3 Schematic diagram of the path of a light ray at 
the interface of two different optical materials with index of 
refraction n 1 and n-,. The ray penetrates a fraction of a wave- 
length (dp) beyond the interface into the medium with the 
smaller refractive index. 
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with a penetration depth of about 20 nm. The resonance between the incident light and the plasma wave 
depends on the angle, wavelength, and polarization state of the incident light and the refractive indices 
of the metal film and the materials on either side of the metal film. A change in the dielectric constant 
or the refractive index at the surface causes the resonance angle to shift, thus providing a highly sensitive 
means of monitoring surface reactions. 

The method of SPR is generally used for sensitive measurement of variations in the refractive index 
of the medium immediately surrounding the metal film. For example, if an antibody is bound to or 
absorbed into the metal surface, a noticeable change in the resonance angle can be readily observed 
because of the change of the refraction index at the surface, assuming all other parameters are kept 
constant (Fig. 50.4). The advantage of this concept is the improved ability to detect the direct interaction 
between antibody and antigen as an interfacial measurement. 

SPR has been used to analyze immunochemicals and to detect gases. The main limitation of SPR, 
however, is that the sensitivity depends on the optical thickness of the adsorbed layer, and, therefore, 
small molecules cannot be measured in very low concentrations. 

50.4 Applications 



Oximetry 

Oximetry refers to the colorimetric measurement of the degree of oxygen saturation, that is, the relative 
amount of oxygen carried by the hemoglobin in the erythrocytes, by recording the variation in the color 
of deoxyhemoglobin (Hb) and oxyhemoglobin (Hb0 2 ). A quantitative method for measuring blood 
oxygenation is of great importance in assessing the circulatory and respiratory status of a patient. 

Various optical methods for measuring the oxygen saturation of arterial (Sa0 2 ) and mixed venous 
(Sv0 2 ) blood have been developed, all based on light transmission through, or reflecting from, tissue 
and blood. The measurement is performed at two specific 
wavelengths: X v where there is a large difference in light 



absorbance between Hb and Hb0 2 (e.g., 660 nm red light), 
and X 2 , which can be an isobestic wavelength (e.g., 805 nm 
infrared light), where the absorbance of light is independent 
of blood oxygenation, or a different wavelength in the infrared 
region (>805 nm), where the absorbance of Hb is slightly 
smaller than that of Hb0 2 . 

Assuming for simplicity that a hemolyzed blood sample 
consists of a two-component homogeneous mixture of Hb 
and Hb0 2 , and that light absorbance by the mixture of these 
two components is additive, a simple quantitative relationship 
can be derived for computing the oxygen saturation of blood: 



O 



Oxygen saturation = A — B 



ogM 

od(x 2 ) 



where A and B are coefficients which are functions of the 
specific absorptivities of Hb and Hb0 2 , and OD is the corre- 
sponding absorbance (optical density) of the blood. 

Since the original discovery of this phenomenon over 
50 years ago, there has been progressive development in 
instrumentation to measure oxygen saturation along three 
different paths: bench-top oximeters for clinical laboratories, 
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FIGURE 50.4 Surface plasmon resonance 
at the interface between a thin metallic sur- 
face and a liquid (A). A sharp decrease in the 
reflected light intensity can be observed in 
(B). The location of the resonance angle is 
dependent on the refractive index of the 
material present at the interface. 
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FIGURE 50.5 Principle of a three-fiber optical catheter for SvO,/HCT measurement [2]. 

fiber optic catheters for invasive intravascular monitoring, and transcutaneous sensors, which are non- 
invasive devices placed against the skin. 

Intravascular Fiber Optic Sv0 2 Catheters 

In vivo fiberoptic oximeters were first described in the early 1960s by Polanyi and Heir [1]. They 
demonstrated that in a highly scattering medium such as blood, where a very short path length is required 
for a transmittance measurement, a reflectance measurement was practical. Accordingly, they showed 
that a linear relationship exists between oxygen saturation and the ratio of the infrared-to-red (IR/R) 
light backscattered from the blood 



oxygen saturation = a 



-b(lR/R) 



where a and b are catheter-specific calibration coefficients. 

Fiber optic Sv0 2 catheters consist of two separate optical fibers. One fiber is used for transmitting the 
light to the flowing blood, and a second fiber directs the backscattered light to a photodetector. In some 
commercial instruments (e.g., Oximetrix), automatic compensation for hematocrit is employed utilizing 
three, rather than two, infrared reference wavelengths. Bornzin and coworkers [2] and Mendelson and 
coworkers [3] described a 5-lumen, 7.5F thermodilution catheter that is comprised of three unequally 
spaced optical fibers, each fiber 250 urn in diameter, and provides continuous Sv0 2 reading with auto- 
matic corrections for hematocrit variations (Fig. 50.5). 

Intravenous fiberoptic catheters are utilized in monitoring Sv0 2 in the pulmonary artery and can be 
used to indicate the effectiveness of the cardiopulmonary system during cardiac surgery and in the ICU. 
Several problems limit the wide clinical application of intravascular fiberoptic oximeters. These include 
the dependence of the individual red and infrared backscattered light intensities and their ratio on 
hematocrit (especially for Sv0 2 below 80%), blood flow, motion artifacts due to catheter tip "whipping" 
against the blood vessel wall, blood temperature, and pH. 

Noninvasive Pulse Oximetry 

Noninvasive monitoring of Sa0 2 by pulse oximetry is a rapidly growing practice in many fields of clinical 
medicine [4]. The most important advantage of this technique is the capability to provide continuous, 
safe, and effective monitoring of blood oxygenation at the patient's bedside without the need to calibrate 
the instrument before each use. 
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FIGURE 50.6 Disposable finger probe of a noninvasive pulse oximeter. 

Pulse oximetry, which was first suggested by Aoyagi and colleagues [5] and Yoshiya and colleagues [6] , 
relies on the detection of the time-variant photoplethysmographic signal, caused by changes in arterial 
blood volume associated with cardiac contraction. Sa0 2 is derived by analyzing only the time-variant 
changes in absorbance caused by the pulsating arterial blood at the same red and infrared wavelengths 
used in conventional invasive type oximeters. A normalization process is commonly performed by which 
the pulsatile (ac) component at each wavelength, which results from the expansion and relaxation of the 
arterial bed, is divided by the corresponding nonpulsatile (dc) component of the photoplethysmogram, 
which is composed of the light absorbed by the blood-less tissue and the nonpulsatile portion of the 
blood compartment. This effective scaling process results in a normalized red/infrared ratio which is 
dependent on Sa0 2 but is largely independent of the incident light intensity, skin pigmentation, skin 
thickness, and tissue vasculature. 

Pulse oximeter sensors consist of a pair of small and inexpensive red and infrared LEDs and a single, 
highly sensitive, silicon photodetector. These components are mounted inside a reusable rigid spring- 
loaded clip, a flexible probe, or a disposable adhesive wrap (Fig. 50.6). The majority of the commercially 
available sensors are of the transmittance type in which the pulsatile arterial bed, e.g., ear lobe, fingertip, 
or toe, is positioned between the LEDs and the photodetector. Other probes are available for reflectance 
(backscatter) measurement where both the LEDs and photodetectors are mounted side-by-side facing 
the skin [7,8]. 

Noninvasive Cerebral Oximetry 

Another substance whose optical absorption in the near infrared changes corresponding to its reduced 
and oxidized state is cytochrome aa3, the terminal member of the respiratory chain. Although the 
concentration of cytochrome aa3 is considerably lower than that of hemoglobin, advanced instrumen- 
tation including time-resolved spectroscopy and differential measurements is being used successfully to 
obtain noninvasive measurements of hemoglobin saturation and cytochrome aa3 by transiiluminating 
areas of the neonatal brain [9-11]. 

Blood Gases 1 

Frequent measurement of blood gases, i.e., oxygen partial pressure (P0 2 ), carbon dioxide partial pressure 
(PCO,), and pH, is essential to clinical diagnosis and management of respiratory and metabolic problems 



'Additional information on blood gas sensors can be found in Appendix A. 
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in the operating room and the ICU. Considerable effort has been devoted over the last two decades to 
developing disposable extracorporeal and in particular intravascular fiber optic sensors that can be used 
to provide continuous information on the acid-base status of a patient. 

In the early 1970s, Lubbers and Opitz [12] originated what they called optodes (from the Greek, optical 
path) for measurements of important physiologic gases in fluids and in gases. The principle upon which 
these sensors was designed was a closed cell containing a fluorescent indicator in solution, with a 
membrane permeable to the analyte of interest (either ions or gases) constituting one of the cell walls. 
The cell was coupled by optical fibers to a system that measured the fluorescence in the cell. The cell 
solution would equilibrate with the P0 2 or PC0 2 of the medium placed against it, and the fluorescence 
of an indicator reagent in the solution would correspond to the partial pressure of the measured gas. 

Extracorporeal Measurement 

Following the initial feasibility studies of Lubbers and Opitz, Cardiovascular Devices (CDI, USA) devel- 
oped a GasStat™ extracorporeal system suitable for continuous online monitoring of blood gases ex vivo 
during cardiopulmonary bypass operations. The system consists of a disposable plastic sensor connected 
inline with a blood loop through a fiber optic cable. Permeable membranes separate the flowing blood 
from the system chemistry. The C0 2 -sensitive indicator consists of a fine emulsion of a bicarbonate buffer 
in a two-component silicone. The pH-sensitive indicator is a cellulose material to which hydroxypyrene 
trisulfonate (HPTS) is bonded covalently. The 2 -sensitive chemistry is composed of a solution of oxygen- 
quenching decacyclene in a one-component silicone covered with a thin layer of black PTFE for optical 
isolation and to render the measurement insensitive to the halothane anesthetic. 

The extracorporeal device has two channels, one for arterial blood and the other for venous blood, 
and is capable of recording the temperature of the blood for correcting the measurements to 37°C. Several 
studies have been conducted comparing the specifications of the GasStat™ with that of intermittent blood 
samples analyzed on bench-top blood gas analyzers [13-15]. 

Intravascular Catheters 

During the past decade, numerous efforts have been made to develop integrated fiber optic sensors for 
intravascular monitoring of blood gases. A few commercial systems for monitoring blood gases and pH 
are currently undergoing extensive clinical testing. Recent literature reports of sensor performance show 
that considerable progress has been made mainly in improving the accuracy and reliability of these 
intravascular blood gas sensors [16-19]. 

Most fiber optic intravascular blood gas sensors employ either a single- or a double-fiber configuration. 
Typically, the matrix containing the indicator is attached to the end of the optical fiber as illustrated in 
Fig. 50.7. Since the solubility of 2 and CO, gases, as well as the optical properties of the sensing chemistry 
itself, are affected by temperature variations, fiber optic intravascular sensors include a thermocouple or 
thermistor wire running alongside the fiber optic cable to monitor and correct for temperature fluctu- 
ations near the sensor tip. A nonlinear response is characteristic of most chemical indicator sensors, so 
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FIGURE 50.7 Principle diagram of an integrated fiber optic blood gas catheter (from Otto S. Wolfbeis, Fiber Optic 
Chemical Sensors and Biosensors, vol. 2, CRC Press, Boca Raton, 1990). 
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they are designed to match the concentration region of the intended application. Also, the response time 
of the optode is somewhat slower compared to electrochemical sensors. 

Intravascular fiber optic blood gas sensors are normally placed inside a standard 20-gauge catheter, 
which is sufficiently small to allow adequate spacing between the sensor and the catheter wall. The 
resulting lumen is large enough to permit the withdrawal of blood samples, introduction of a continuous 
heparin flush, and the recording of a blood pressure waveform. In addition, the optical fibers are encased 
in a protective tubing to contain any fiber fragments in case they break off. 

pH Sensors 

In 1976, Peterson and coworkers [20] originated the development of the first fiber optic chemical sensor 
for physiological pH measurement. The basic idea was to contain a reversible color-changing indicator 
at the end of a pair of optical fibers. The indicator, phenol red, was covalently bound to a hydrophilic 
polymer in the form of water-permeable microbeads. This technique stabilized the indicator concentra- 
tion. The indicator beads were contained in a sealed hydrogen-ion-permeable envelope made out of a 
hollow cellulose tubing. In effect, this formed a miniature spectrophotometric cell at the end of the fibers 
and represented an early prototype of a fiber optic chemical sensor. 

The phenol red dye indicator is a weak organic acid, and the acid form (un-ionized) and base form 
(ionized) are present in a concentration ratio determined by the ionization constant of the acid and the 
pH of the medium according to the familiar Henderson-Hasselbalch equation. The two forms of the dye 
have different optical absorption spectra, so the relative concentration of one of the forms, which varies 
as a function of pH, can be measured optically and related to variations in pH. In the pH sensor, green 
(560 nm) and red (longer than 600 nm) light emerging from the end of one fiber passes through the dye 
and is reflected back into the other fiber by light-scattering particles. The green light is absorbed by the 
base form of the indicator. The red light is not absorbed by the indicator and is used as an optical 
reference. The ratio of green to red light is measured and is related to pH by an S-shaped curve with an 
approximate high-sensitivity linear region where the equilibrium constant {pK) of the indicator matches 
the pH of the solution. 

The same principle can also be used with a reversible fluorescent indicator, in which case the concen- 
tration of one of the indicator forms is measured by its fluorescence rather than absorbance intensity. 
Light in the blue or UV wavelength region excites the fluorescent dye to emit longer wavelength light, 
and the two forms of the dye may have different excitation or emission spectra to allow their distinction. 

The original instrument design for a pH measurement was very simple and consisted of a tungsten 
lamp for fiber illumination, a rotating filter wheel to select the green and red light returning from the 
fiber optic sensor, and signal processing instrumentation to give a pH output based on the green-to-red 
ratio. This system was capable of measuring pH in the physiologic range between 7.0-7.4 with an accuracy 
and precision of 0.01 pH units. The sensor was susceptible to ionic strength variation in the order of 
0.01 pH unit per 11% change in ionic strength. 

Further development of the pH probe for practical use was continued by Markle and colleagues [21]. 
They designed the fiber optic probe in the form of a 25-gauge (0.5 mm o.d.) hypodermic needle, with an 
ion-permeable side window, using 75-mm-diameter plastic optical fibers. The sensor had a 90% response 
time of 30 s. With improved instrumentation and computerized signal processing and with a three-point 
calibration, the range was extended to ±3 pH units, and a precision of 0.001 pH units was achieved. 

Several reports have appeared suggesting other dye indicator systems that can be used for fiber optic 
pH sensing [22]. A classic problem with dye indicators is the sensitivity of their equilibrium constant to 
ionic strength. To circumvent this problem, Wolfbeis and Offenbacher [23] and Opitz and Lubbers [24] 
demonstrated a system in which a dual sensor arrangement can measure ionic strength and pH and 
simultaneously can correct the pH measurement for variations in ionic strength. 

PC0 2 Sensors 

The PC0 2 of a sample is typically determined by measuring changes in the pH of a bicarbonate solution 
that is isolated from the sample by a C0 2 -permeable membrane but remains in equilibrium with the 
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C0 2 . The bicarbonate and C0 2 , as carbonic acid, form a pH buffer system, and, by the Henderson- 
Hasselbalch equation, hydrogen ion concentration is proportional to the pC0 2 in the sample. This 
measurement is done with either a pH electrode or a dye indicator in solution. 

Vurek [25] demonstrated that the same techniques can also be used with a fiber optic sensor. In his 
design, one plastic fiber carries light to the transducer, which is made of a silicone rubber tubing about 
0.6 mm in diameter and 1.0 mm long, filled with a phenol red solution in a 35-mM bicarbonate. Ambient 
PC0 2 controls the pH of the solution which changes the optical absorption of the phenol red dye. The 
C0 2 permeates through the rubber to equilibrate with the indicator solution. A second optical fiber 
carries the transmitted signal to a photodetector for analysis. The design by Zhujun and Seitz [26] uses 
a PC0 2 sensor based on a pair of membranes separated from a bifurcated optical fiber by a cavity filled 
with bicarbonate buffer. The external membrane is made of silicone, and the internal membrane is HPTS 
immobilized on an ion-exchange membrane. 

P0 2 Sensors 

The development of an indicator system for fiber optic P0 2 sensing is challenging because there are very 
few known ways to measure P0 2 optically. Although a color- changing indicator would have been desir- 
able, the development of a sufficiently stable indicator has been difficult. The only principle applicable 
to fiber optics appears to be the quenching effect of oxygen on fluorescence. 

Fluorescence quenching is a general property of aromatic molecules, dyes containing them, and some 
other substances. In brief, when light is absorbed by a molecule, the absorbed energy is held as an excited 
electronic state of the molecule. It is then lost by coupling to the mechanical movement of the molecule 
(heat), reradiated from the molecule in a mean time of about 10 ns (fluorescence), or converted into 
another excited state with much longer mean lifetime and then reradiated (phosphorescence). Quenching 
reduces the intensity of fluorescence and is related to the concentration of the quenching molecules, such 
as 2 . 

A fiber optic sensor for measuring P0 2 using the principle of fluorescence quenching was developed 
by Peterson and colleagues [27]. The dye is excited at around 470 nm (blue) and fluoresces at about 
515 nm (green) with an intensity that depends on the P0 2 . The optical information is derived from the 
ratio of green fluorescence to the blue excitation light, which serves as an internal reference signal. The 
system was chosen for visible light excitation, because plastic optical fibers block light transmission at 
wavelengths shorter than 450 nm, and glass fibers were not considered acceptable for biomedical use. 

The sensor was similar in design to the pH probe continuing the basic idea of an indicator packing 
in a permeable container at the end of a pair of optical fibers. A dye perylene dibutyrate, absorbed on a 
macroreticular polystyrene adsorbent, is contained in a oxygen-permeable porous polystyrene envelope. 
The ratio of green to blue intensity was processed according to the Stren-Volmer equation 

I 

where I and I are the fluorescence emission intensities in the presence and absence of a quencher, 
respectively, and I is the Stern-Volmer quenching coefficient. This provides a nearly linear readout of 
P0 2 over the range of 0-150 mmHg (0-20 kPa), with a precision of 1 mm Hg (0.13 kPa). The original 
sensor was 0.5 mm in diameter, but it can be made much smaller. Although its response time in a gas 
mixture is a fraction of a second, it is slower in an aqueous system, about 1.5 min for 90% response. 

Wolfbeis and coworkers [28] designed a system for measuring the widely used halothane anesthetic 
which interferes with the measurement of oxygen. This dual-sensor combination had two semipermeable 
membranes (one of which blocked halothane) so that the probe could measure both oxygen and halothane 
simultaneously. The response time of their sensor, 15-20 s for halothane and 10-15 s for oxygen, is 
considered short enough to allow gas analysis in the breathing circuit. Potential applications of this device 
include the continuous monitoring of halothane in breathing circuits and in the blood. 
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FIGURE 50.8 Schematic diagram of a competitive binding fluorescence affinity sensor for glucose measurement 

[29]. 

Glucose Sensors 

Another important principle that can be used in fiber optic sensors for measurements of high sensitivity 
and specificity is the concept of competitive binding. This was first described by Schultz, Mansouri, and 
Goldstein [29] to construct a glucose sensor. In their unique sensor, the analyte (glucose) competes for 
binding sites on a substrate (the lectin concanavalin A) with a fluorescent indicator-tagged polymer 
[fluorescein isothiocyanate (FITC)-dextran]. The sensor, which is illustrated in Fig. 50.8, is arranged so 
that the substrate is fixed in a position out of the optical path of the fiber end. The substrate is bound 
to the inner wall of a glucose-permeable hollow fiber tubing (300 u O.D. x 200 ll I.D.) and fastened to 
the end of an optical fiber. The hollow fiber acts as the container and is impermeable to the large molecules 
of the fluorescent indicator. The light beam that extends from the fiber "sees" only the unbound indictor 
in solution inside the hollow fiber but not the indicator bound on the container wall. Excitation light 
passes through the fiber and into the solution, fluorescing the unbound indicator, and the fluorescent 
light passes back along the same fiber to a measuring system. The fluorescent indicator and the glucose 
are in competitive binding equilibrium with the substrate. The interior glucose concentration equilibrates 
with its concentration exterior to the probe. If the glucose concentration increases, the indicator is driven 
off the substrate to increase the concentration of the indicator. Thus, fluorescence intensity as seen by 
the optical fiber follows the glucose concentration. 

The response time of the sensor was found to be about 5 min. In vivo studies demonstrated fairly close 
correspondence between the sensor output and actual blood glucose levels. A time lag of about 5 min 
was found and is believed to be due to the diffusion of glucose across the hollow fiber membrane and 
the diffusion of FTIC-dextran within the tubing. 

In principle, the concept of competitive binding can be applied to any analysis for which a specific 
reaction can be devised. However, long-term stability of these sensors remains the major limiting factor 
that needs to be solved. 

Immunosensors 

Immunologic techniques offer outstanding selectivity and sensitivity through the process of antibody- 
antigen interaction. This is the primary recognition mechanism by which the immune system detects 
and fights foreign matter and has therefore allowed the measurement of many important compounds at 
trace levels in complex biologic samples. 

In principle, it is possible to design competitive binding optical sensors utilizing immobilized anti- 
bodies as selective reagents and detecting the displacement of a labeled antigen by the analyte. Therefore, 
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FIGURE 50.9 Basic principle of a fiber optic antigen-antibody sensor [33]. 

antibody-based immunologic optical systems have been the subject of considerable research in the past 
few years [30-34] . In practice, however, the strong binding of antigens to antibodies and vice versa causes 
difficulties in constructing reversible sensors with fast dynamic responses. 

Several immunologic sensors based on fiber optic waveguides have been demonstrated for monitoring 
antibody-antigen reactions. Typically, several centimeters of cladding are removed along the fiber's distal 
end, and the recognition antibodies are immobilized on the exposed core surface. These antibodies bind 
fluorophore-antigen complexes within the evanescent wave as illustrated in Fig. 50.9. The fluorescent 
signal excited within the evanescent wave is then transmitted through the cladded fiber to a fluorimeter 
for processing. 

Experimental studies have indicated that immunologic optical sensors can generally detect micromolar 
and even picomolar concentrations. However, the major obstacle that must be overcome to achieve high 
sensitivity in immunologic optical sensors is the nonspecific binding of immobilized antibodies. 
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51.1 Classification of Biochemical Reactions in the Context 
of Sensor Design and Development 

Introduction and Definitions 

Since sensors generate a measurable material property, they belong in some grouping of transducer 
devices. Sensors specifically contain a recognition process that is characteristic of a material sample at 
the molecular-chemical level, and a sensor incorporates a transduction process (step) to create a useful 
signal. Biomedical sensors include a whole range of devices that may be chemical sensors, physical sensors, 
or some kind of mixed sensor. 

Chemical sensors use chemical processes in the recognition and transduction steps. Biosensors are 
also chemical sensors, but they use particular classes of biological recognition/transduction processes. A 
pure physical sensor generates and transduces a parameter that does not depend on the chemistry per se, 
but is a result of the sensor responding as an aggregate of point masses or charges. All these when used 
in a biologic system (biomatrix) may be considered bioanalytic sensors without regard to the chemical, 
biochemical, or physical distinctions. They provide an "analytic signal of the biologic system" for some 
further use. 

The chemical recognition process focuses on some molecular-level chemical entity, usually a kind of 
chemical structure. In classical analysis this structure may be a simple functional group: SiO~ in a glass 
electrode surface, a chromophore in an indicator dye, or a metallic surface structure, such as silver metal 
that recognizes Ag + in solution. In recent times, the biologic recognition processes have been better 
understood, and the general concept of recognition by receptor or chemoreceptor has come into fashion. 
Although these are often large molecules bound to cell membranes, they contain specific structures that 
permit a wide variety of different molecular recognition steps including recognition of large and small 
species and of charged and uncharged species. Thus, chemoreceptor appears in the sensor literature as a 
generic term for the principal entity doing the recognition. For a history and examples, see references [ 1-6] . 
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FIGURE 51.1 Generic bioanalytic sensor. 

Biorecognition in biosensors has especially stressed "receptors" and their categories. Historically, 
application of receptors has not necessarily meant measurement directly of the receptor. Usually there 
are coupled chemical reactions, and the transduction has used measurement of the subsidiary products: 
change of pH, change of dissolved 2 ,' generation of H 2 2 , changes of conductivity, changes of optical 
adsorption, and changes of temperature. Principal receptors are enzymes because of their extraordinary 
selectivity. Other receptors can be the more subtle species of biochemistry: antibodies, organelles, 
microbes, and tissue slices, not to mention the trace level "receptors" that guide ants, such as pheromones, 
and other unusual species. A sketch of a generic bioanalytic sensor is shown in Fig. 51.1 

Classification of Recognition Reactions and Receptor Processes 

The concept of recognition in chemistry is universal. It almost goes without saying that all chemical 
reactions involved recognition and selection on the basis of size, shape, and charge. For the purpose of 
constructing sensors, general recognition based on these factors is not usually enough. Frequently in 
inorganic chemistry a given ion will react indiscriminantly with similar ions of the same size and charge. 
Changes in charge from unity to two, for example, do change the driving forces of some ionic reactions. 
By control of dielectric constant of phases, heterogeneous reactions can often be "tailored" to select 
divalent ions over monovalent ions and to select small versus large ions or vice versa. 

Shape, however, has more special possibilities, and natural synthetic methods permit product control. 
Nature manages to use shape together with charge to build organic molecules, called enzymes, that have 
acquired remarkable selectivity. It is in the realm of biochemistry that these natural constructions are 
investigated and catalogued. Biochemistry books list large numbers of enzymes and other selective 
materials that direct chemical reactions. Many of these have been tried as the basis of selective sensors 
for bioanalytic and biomedical purposes. The list in Table 51.1 shows how some of the materials can be 
grouped into lists according to function and to analytic substrate, both organic and inorganic. The 
principles seem general, so there is no reason to discriminate against the inorganic substrates in favor 
or the organic substrates. All can be used in biomedical analysis. 

51.2 Classification of Transduction Processes — Detection Methods 

Some years ago, the engineering community addressed the topic of sensor classification — Richard 
M. White in IEEE Trans. Ultra., Ferro., Freq. Control (UFFC), UFFC-34 (1987) 124, and Wen H. Ko in 



'Additional information on these types of sensors can be found in Chapter 50 and Appendix A. 
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TABLE 51.1 Recognition Reactions and Receptor Processes 



1. Insoluble salt-based sensors 

a. S + + R~ = (insoluble salt) 
Ion exchange with crystalline SR (homogeneous or heterogeneous crytals) 

chemical signal S +n receptor R~" 

inorganic cations inorganic anions 

examples: Ag+, Hg 2 2+ , Pb 2+ , Cd 2+ , Cu 2+ S=, Se 2 =, SCN-, I", Br, Cb 

b. S- + R +n = SR (insoluble salt) 
Ion exchange with crystalline SR (homogeneous or heterogeneous crystals) 

chemical signal S~" receptor R +n 

inorganic anions inorganic cations 

examples: F-, S=, Se 2 =, SCN-, I-, Br-, Or LaF 2 +, Ag+, Hg 2 2+ , Pb 2+ , Cd 2+ , Cu 2+ 

2. Solid ion exchanges 

a. S +n + R~° (sites) =S + "R~ n = SR (in ion exchanger phase) 

Ion exchange with synthetic ion exchangers containing negative fixed sites (homogeneous or heterogeneous, inorganic 
or organic materials) 

chemical signal S +n receptor R~ n 

inorganic and organic ions inorganic and organic ion sites 

examples: H + , Na + , K + silicate glass Si-0~ 

H + , Na + , K + , other M + " synthetic sulfonated, phosphorylated, EDTA-substituted 

polystyrenes 

b. S~ n + R +n (sites) =S~ n R +n = SR (in ion exchanger phase) 

Ion exchange with synthetic ion exchangers containing positive fixed sites (homogeneous or heterogeneous, inorganic 
or organic materials) 

chemical signal S~" receptor R +n 

organic and inorganic ions organic and inorganic ion sites 

examples: hydrophobic anions quaternized polystyrene 

3. Liquid ion exchanger sensors with electrostatic selection 

a. S~ n + R~ n (sites) = S + °R~" = SR (in ion exchanger phase) 

Plasticized, passive membranes containing mobile trapped negative fixed sites (homogeneous or heterogeneous, 
inorganic or organic materials) 

chemical signal S +n receptor R~ n 

inorganic and organic ions inorganic and organic ion sites 

examples: Ca 2+ diester of phosphoric acid or monoester of a phosphonic 

acid 
M + " dinonylnaphthalene sulfonate and other organic, 

hydrophobic anions 
R,R 2 R 3 R 4 N + and bis-Quaternary Cations tetraphenylborate anion or substituted derivatives 
cationic drugs tetrasubstituted arsonium + 

b. S~ n + R +n (sites) =S~ n R + " = SR (in ion exchanger phase) 

Plasticized, passive membranes containing mobile, trapped negative fixed sites (homogeneous or heterogeneous, 
inorganic or organic materials) 

chemical signal S~ n receptor R +n 

inorganic and organic ions inorganic and organic sites 

examples: anions, simple CL, Br~, QO,f quaternary ammonium cations: e.g. tridodecylmethyl- 

ammonium 
anions, complex, drugs quaternary ammonium cations: e.g. tridodecylmethyl- 

ammonium 

4. Liquid ion exchanger sensors with neutral (or charged) carrier selection 

a. S +n + X and R~ n (sites) =S +n X R" = SXR (in ion exchanger phase) 

Plasticized, passive membranes containing mobile, trapped negative fixed sites (homogeneous or heterogeneous, 
inorganic or organic materials) 

chemical signal S +n receptor R~" 

inorganic and organic ions inorganic and organic ion sites 

examples: Ca 2+ X = synthetic ionophore complexing agent selective to Ca 2+ 

R~" usually a substituted tetra phenylborate salt 
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TABLE 51.1 (continued) Recognition Reactions and Receptor Processes 

Na + , K + , H + X = selective ionophore complexing agent 

b. S~" + X and R +n (sites) =S~°X R +n = SXR (in ion exchanger phase) 

Plasticized, passive membranes containing mobile, trapped negative fixed sites (homogeneous or heterogeneous, 
inorganic or organic materials) 

chemical signal S~ n receptor R +n 



examples: 



inorganic and organic ions 
HP0 4 2 = 



inorganic and organic ion sites 
R +n = quaternary ammonium salt 
X = synthetic ionophore complexing agent; aryl organotin 
compound or suggested cyclic polyamido-polyamines 
HC0 3 ~ X = synthetic ionophore: trifluoro acetophenone 

Ch X = aliphatic organotin compound 

5. Bioaffinity sensors based on change of local electron densities 

S + R=SR 
chemical signal S receptor R 



protein 

saccharide 

glycoprotein 

substrate 

inhibitor 



prosthetic group 
antigen 
hormone 
substrate analogue 



dyes 
lectin 

enzyme 

Transferases 

Hydrolases (peptidases, esterases, etc.) 

Lyases 

Isomerases 

Ligases 

apoenzyme 

antibody 

"receptor" 

transport system 



6. Metabolism sensors based on substrate consumption and product formation 

S + R =SR -> P + R 
chemical signal S receptor R 



substrate 
examples: lactate (SH 2 ) 



enzyme 

hydrogenases catalyze hydrogen transfer from S to acceptor 

A (not molecular oxygen!) reversibly pyruvate + NADH + 

H + using lactate dehydrogenase 



SH 2 + A =S + AH 2 lactate + NAD + 
glucose (SH 2 ) 

SH 2 + i0 2 — S + H 2 or 

SH 2 + 2 =S + H 2 2 

glucose + 2 =gluconolactone + H 2 2 
reducing agents (S) 
2S + 2H+ + H 2 2 = 2S + + 2H 2 
Fe 2+ + H 2 2 + 2H+ =^Fe 3+ + 2H 2 
reducing agents 

L-lactate + 2 = acetate + C0 2 + H 2 
cofactor 
inhibitor 
activator 
enzyme activity 

7. Coupled and hybrid systems using sequences, competition, anti-interference and amplification concepts and reactions. 

8. Biomimetic sensors 

chemical signal S receptor R 



oxidases catalyze hydrogen transfer to molecular oxygen 
using glucose oxidase 

peroxidases catalyze oxidation of a substrate by H 2 2 using 
horseradish peroxidase 

oxygenates catalyze substrate oxidations by molecular 2 

organelle 
microbe 
tissue slice 



sound 
stress 
light 



carrier-enzyme 



Source: Adapted from [2, 6]. 
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IEEE/EMBS Symposium Abstract T.l.l 84CH2068-5 
(1984). It is interesting because the physical and chem- 
ical properties are given equal weight. There are many 
ideas given here that remain without embodiment. This 
list is reproduced as Table 51.2. Of particular interest in 
this section are "detection means used in sensors" and 
"sensor conversion phenomena." At present the princi- 
ple transduction schemes use electrochemical, optical, 
and thermal detection effects and principles. 

Calorimetric, Thermometric, 
and Pyroelectric Transducers 

Especially useful for enzymatic reactions, the genera- 
tion of heat (enthalpy change) can be used easily and 
generally. The enzyme provides the selectivity and the 
reaction enthalpy cannot be confused with other reac- 
tions from species in a typical biologic mixture. The 
ideal aim is to measure total evolved heat, i.e., to per- 
form a calorimetric measurement. In real systems 
there is always heat loss, i.e., heat is conducted away 
by the sample and sample container so that the process 
cannot be adiabatic as required for a total heat evolu- 
tion measurement. As a result, temperature difference 
before and after evolution is measured most often. It 
has to be assumed that the heat capacity of the spec- 
imen and container is constant over the small temper- 
ature range usually measured. 

The simplest transducer is a thermometer coated 
with the enzyme that permits the selected reaction to 
proceed. Thermistors are used rather than thermom- 
eters or thermocouples. The change of resistance of 

certain oxides is much greater than the change of length of a mercury column or the microvolt changes 
of thermocouple junctions. 

Pyroelectric heat flow transducers are relatively new. Heat flows from a heated region to a lower 
temperature region, controlled to occur in one dimension. The lower temperature side can be coated 
with an enzyme. When the substrate is converted, the lower temperature side is warmed. The pyroelectric 
material is from a category of materials that develops a spontaneous voltage difference in a thermal 
gradient. If the gradient is disturbed by evolution or adsorption of heat, the voltage temporarily changes. 

In biomedical sensing, some of the solid-state devices based on thermal sensing cannot be used 
effectively. The reason is that the sensor itself has to be heated or is heated quite hot by catalytic surface 
reactions. Thus pellistors (oxides with catalytic surfaces and embedded platinum wire thermometer), 
chemiresistors, and "Figaro" sensor "smoke" detectors have not found many biologic applications. 



TABLE 51.2 Detection Means and Conversion 
Phenomena Used in Sensors 

Detection means 
Biologic 
Chemical 

Electric, magnetic, or electromagnetic wave 
Heat, temperature 
Mechanical displacement of wave 
Radioactivity, radiation 
Other 
Conversion phenomena 
Biologic 

Biochemical transformation 

Physical transformation 

Effect on test organism 

Spectroscopy 

Other 
Chemical 

Chemical transformation 

Physical transformation 

Electrochemical process 

Spectroscopy 

Other 
Physical 

Thermoelectric 

Photoelectric 

Photomagnetic 

Magnetoelectric 

Elastomagnetic 

Thermoelastic 

Elastoelectric 

Thermomagnetic 

Thermooptic 

Photoelastic 

Other 



Optical, Optoelectronic Transducers 

Most optical detection systems for sensors are small, i.e., they occupy a small region of space because 
the sample size and volume are themselves small. This means that common absorption spectrophotom- 
eters and photofluorometers are not used with their conventional sample-containing cells, or with their 
conventional beam-handling systems. Instead light-conducting optical fibers are used to connect the 
sample with the more remote monochromator and optical readout system. The techniques still remain 
absorption spectrophotometry, fluorimetry including fluorescence quenching, and reflectometry. 
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The most widely published optical sensors use a miniature reagent contained or immobilized at the 
tip of an optical fiber. In most systems a permselective membrane coating allows the detected species to 
penetrate the dye region. The corresponding absorption change, usually at a sensitive externally preset 
wavelength, is changed and correlated with the sample concentration. Similarly, fluorescence can be 
stimulated by the higher-frequency external light source and the lower-frequency emission detected. 
Some configurations are illustrated in references [1, 2], Fluorimetric detection of coenzyme A, 
NAD + /NADH, is involved in many so-called pyridine-linked enzyme systems. The fluorescence of NADH 
contained or immobilized can be a convenient way to follow these reactions. Optodes, miniature encap- 
sulated dyes, can be placed in vivo. Their fluorescence can be enhanced or quenched and used to detect 
acidity, oxygen, and other species. 

A subtle form of optical transduction uses the "peeled" optical fiber as a multiple reflectance cell. The 
normal fiber core glass has a refractive index greater than that of the exterior coating; there is a range of 
angles of entry to the fiber so that all the light beam remains inside the core. If the coating is removed 
and materials of lower index of refraction are coated on the exterior surface, there can be absorption by 
multiple reflections, since the evanescent wave can penetrate the coating. Chemical reagent can be added 
externally to create selective layers on the optical fiber. 

Ellipsometry is a reflectance technique that depends on the optical constants and thickness of surface 
layer. For colorless layers, a polarized light beam will change its plane of polarization upon reflection by 
the surface film. The thickness can sometimes be determined when optical constants are known or 
approximated by constants of the bulk material. Antibody-antigen surface reaction can be detected this 
way. 

Piezoelectric Transducers 

Cut quartz crystals have characteristic modes of vibration that can be induced by painting electrodes on 
the opposite surfaces and applying a megaHertz ac voltage. The frequency is searched until the crystal 
goes into a resonance. The resonant frequency is very stable. It is a property of the material and maintains 
a value to a few parts per hundred million. When the surface is coated with a stiff mass, the frequency 
is altered. The shift in frequency is directly related to the surface mass for thin, stiff layers. The reaction 
of a substrate with this layer changes the constants of the film and further shifts the resonant frequency. 
These devices can be used in air, in vacuum, or in electrolyte solutions. 

Electrochemical Transducers 

Electrochemical transducers are commonly used in the sensor field. The main forms of electrochemistry 
used are potentiometry [zero-current cell voltage (potential difference measurements)], amperometry 
(current measurement at constant applied voltage at the working electrode), and ac conductivity of a cell. 

Potentiometric Transduction 

The classical generation of an activity- sensitive voltage is spontaneous in a solution containing both 
nonredox ions and redox ions. Classical electrodes of types 1, 2, and 3 respond by ion exchange directly 
or indirectly to ions of the same material as the electrode. Inert metal electrodes (sometimes called 
type 0) — Pt, Ir, Rh, and occasionally carbon C — respond by electrons exchange from redox pairs in 
solution. Potential differences are interfacial and reflect ratios of activities of oxidized to reduced forms. 

Amperometric Transduction 

For dissolved species that can exchange electrons with an inert electrode, it is possible to force the transfer 
in one direction by applying a voltage very oxidizing (anodic) or reducing (cathodic). When the voltage 
is fixed, the species will be, by definition, out of equilibrium with the electrode at its present applied 
voltage. Locally, the species (regardless of charge) will oxidize or reduce by moving from bulk solution 
to the electrode surface where they react. Ions do not move like electrons. Rather they diffuse from high 
to low concentration and do not usually move by drift or migration. The reason is that the electrolytes 
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in solutions are at high concentrations, and the electric field is virtually eliminated from the bulk. The 
field drops through the first 1000 Angstroms at the electrode surface. The concentration of the moving 
species is from high concentration in bulk to zero at the electrode surface where it reacts. This process 
is called concentration polarization. The current flowing is limited by mass transport and so is proportional 
to the bulk concentration. 

Conductometric Transducers 

Ac conductivity (impedance) can be purely resistive when the frequency is picked to be about 1000 to 
10,000 Hz. In this range the transport of ions is sufficiently slow that they never lose their uniform 
concentration. They simply quiver in space and carry current forward and backward each half cycle. In 
the lower and higher frequencies, the cell capacitance can become involved, but this effect is to be avoided. 

51.3 Tables of Sensors from the Literature 

The longest and most consistently complete references to the chemical sensor field is the review issue of 
Analytical Chemistry Journal. In the 1970s and 1980s these appeared in the April issue, but more recently 
they appear in the June issue. The editors are Jiri Janata and various colleagues [7-10]. Note all possible 
or imaginable sensors have been made according to the list in Table 51.2. A more realistic table can be 
constructed from the existing literature that describes actual devices. This list is Table 51.3. Book refer- 
ences are listed in Table 51.4 in reverse time order to about 1986. This list covers most of the major 
source books and many of the symposium proceedings volumes. The reviews [7-10] are a principal source 
of references to the published research literature. 



TABLE 51.3 Chemical Sensors and Properties Documented in the Literature 

1. General topics including items II- V; selectivity, fabrication, data processing 
II. Thermal sensors 

III. Mass sensors 

Gas sensors 
Liquid sensors 

IV. Electrochemical sensors 
Potentiometric sensors 

Reference electrodes 
Biomedical electrodes 
Applications to cations, anions 
Coated wire/hybrids 
ISFETs and related 
Biosensors 
Gas sensors 
Amperometric sensors 
Modified electrodes 
Gas sensors 
Biosensors 

Direct electron transfer 

Mediated electron transfer 

Biomedical 
Conductimetric sensors 

Semiconducting oxide sensors 
Zinc oxide-based 
Chemiresistors 
Dielectrometers 
V. Optical sensors 
Liquid sensors 
Biosensors 
Gas sensors 
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TABLE 51.4 Books and Long Reviews Keyed to Items in Table 51.3 
(Reviewed since 1988 in reverse time sequence) 

1. Yamauchi S (ed). 1992. Chemical Sensor Technology, vol 4, Tokyo, Kodansha Ltd. 

Flores JR, Lorenzo E. 1992. Amperometric Biosensors, In MR Smyth , JG Vos (eds), Comprehensive Analytical Chemistry 

Amsterdam, Elsevier 
Vaihinger S, Goepel W. 1991. Multicomponent analysis in chemical sensing. In W Goepel, J Hesse, J Zemel (eds), 

Sensors vol 2 Part 1, pp 191-237, Weinheim, Germany, VCH Publishers 
Wise DL (ed). 1991. Bioinstrumentation and Biosensors, New York, Marcel Dekker 
Scheller F, Schubert F. 1989. Biosensors, Basel, Switzerland, Birkhauser Verlag, see also [2]. 
Madou M, Morrison SR. 1989. Chemical Sensing with Solid State Devices, New York, Academic Press. 
Janata J. 1989. Principles of Chemical Sensors, New York, Plenum Press. 
Edmonds TE (ed). 1988. Chemical Sensors, Glasgow, Blackie. 
Yoda K. 1988. Immobilized enzyme cells. Methods Enzymology, 137:61. 
Turner APF, Karube I, Wilson GS (eds). 1987. Biosensors: Fundamentals and Applications, Oxford, Oxford University 

Press. 
Seiyama T (ed). 1986. Chemical Sensor Technology, Tokyo, Kodansha Ltd. 
II. Thermal Sensor 

There are extensive research and application papers and these are mentioned in books listed under I. However, the up- 
to-date lists of papers are given in references 7-10. 
III. Mass Sensors 

There are extensive research and application papers and these are mentioned in books listed under I. However, the up- 
to-date lists of papers are given in references 7-10. Fundamentals of this rapidly expanding field are recently reviewed: 
Buttry DA, Ward MD. 1992. Measurement of Interfacial processes at Electrode Surfaces with the Electrochemical Quartz 

Crystal Microbalance, Chemical Reviews 92:1355. 
Grate JW, Martin SJ, White RM. 1993. Acoustic Wave Microsensors, Part 1, Analyt Chem 65:940A; part 2, Analyt Chem 

65:987A. 
Ricco AT. 1994. SAW Chemical Sensors, The Electrochemical Society Interface Winter: 38-44. 
IVA. Electrochemical Sensors — Liquid Samples 

Scheller F, Schmid RD (eds). 1992. Biosensors: Fundamentals, Technologies and Applications, GBF Monograph Series, 

New York, VCH Publishers. 
Erbach R, Vogel A, Hoffmann B. 1992. Ion-sensitive field-effect structures with Langmuir-Blodgett membranes. In F 

Scheller, RD Schmid (eds). Biosensors: Fundamentals, Technologies, and Applications, GBF Monograph 17, pp 

353-357, New York, VCH Publishers. 
Ho May YK, Rechnitz GA. 1992. An Introduction to Biosensors, In RM Nakamura, Y Kasahara, GA Rechnitz (eds), 

Immunochemical Assays and Biosensors Technology, pp 275-291, Washington, DC, American Society Microbiology. 
Mattiasson B, Haakanson H. Immunochemically-based assays for process control, 1992. Advances in Biochemical 

Engineering and Biotechnology 46:81. 
Maas AH, Sprokholt R. 1990. Proposed IFCC Recommendations for electrolyte measurements with ISEs in clinical 

chemistry, In A Ivaska, A Lewenstam, R Sara (eds), Contemporary Electroanalytical Chemistry, Proceedings of the 

ElectroFinnAnalysis International Conference on Electroanalytical Chemistry, pp 311-315, New York, Plenum. 
Vanrolleghem P, Dries D, Verstreate W. RODTOX: Biosensor for rapid determination of the biochemical oxygen demand, 

1990. In C Christiansen, L Munck, J Villadsen (eds), Proceedings of the 5th European Congress Biotechnology, vol 1, 

pp 161-164, Copenhagen, Denmark, Munksgaard. 
Cronenberg C, Van den Heuvel H, Van den Hauw M, Van Groen B. Development of glucose microelectrodes for 

measurements in biofilms, 1990. In C Christiansen, L Munck, J Villadsen J (eds), Proceedings of the 5th European 

Congress Biotechnology, vol 1, pp 548-551, Copenhagen, Denmark, Munksgaard. 
Wise DL (ed). 1989. Bioinstrumentation Research, Development and Applications, Boston, MA, Butterworth- 

Heinemann. 
Pungor E (ed). 1989. Ion-Selective Electrodes — Proceedings of the 5th Symposium (Matrafured, Hungary 1988), 

Oxford, Pergamon. 
Wang J (ed). 1988. Electrochemical Techniques in Clinical Chemistry and Laboratory Medicine, New York, VCH 

Publishers. 
Evans A. 1987. Potentiometry and Ion-selective Electrodes, New York, Wiley. 
Ngo TT(ed). 1987. Electrochemical Sensors in Immunological Analysis, New York, Plenum. 
IVB. Electrochemical Sensors — Gas Samples 

Sberveglieri G (ed). 1992. Gas Sensors, Dordrecht, The Netherlands, Kluwer. 

Moseley PT, Norris JOW, Williams DE. 1991. Technology and Mechanisms of Gas Sensors, Bristol, U.K., Hilger. 

Moseley PT, Tofield BD (eds). 1989. Solid State Gas Sensors, Philadelphia, Taylor and Francis, Publishers. 
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TABLE 51.4 (continued) Books and Long Reviews Keyed to Items in Table 51.3 
(Reviewed since 1988 in reverse time sequence) 

V. Optical Sensors 

Coulet PR, Blum LJ. Luminescence in Biosensor Design, 1991. In DL Wise, LB Wingard, Jr (eds). Biosensors with 

Fiberoptics, pp 293-324, Clifton, N.J., Humana. 
Wollbeis OS. 1991. Spectroscopic Techniques, In OS Wollbeis (ed). Fiber Optic Chemical Sensors and Biosensors, vol 

1, pp 25-60. Boca Raton, Fla, CRC Press. 
Wollbeis OS. 1987. Fibre-optic sensors for chemical parameters of interest in biotechnology, In RD Schmidt (ed). GBF 

(Gesellschaft fur Biotechnologische Forschung) Monogr. Series, vol 10, pp 197-206, New York, VCH Publishers. 



51.4 Applications of Microelectronics 
in Sensor Fabrication 



The reviews of sensors since 1988 cover fabrication papers and micro fabrication methods and examples 
[7-10]. A recent review by two of the few chemical sensor scientists (chemical engineers) who also operate 
a microfabrication laboratory is C. C. Liu, Z.-R. Zhang. 1992. Research and development of chemical 
sensors using microfabrication techniques. Selective Electrode 14:147. 
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Historical Perspectives 2 

The Electrocardiograph 



The First Electrocardiogram 

Capillary Electrometer Record 

Rheotome Record 

Mammalian Electrocardiograms 

Corrected Capillary Electrometer Records 

Clinical Electrocardiography 
Leslie A. Lredaes The String Galvanometer • Vacuum Tube Electrocardiograph • Hot- 

Purdue University Stylus Record 



Recording and display of bioelectric events occupied a long time and required the development and 
adaptation of a variety of primitive instruments, not all of which were electronic. The first bioelectric 
recorder was the rheoscopic frog, consisting of a sciatic nerve and its innervated gastrocnemius muscle. 
So sensitive was the nerve that it could be stimulated by the beating heart or a contracting muscle; both 
events contracted the gastrocnemius muscle. However, such a response provided no information on the 
time course of these bioelectric events. Sensitive and rapidly responding indicators were essential for this 
purpose. 

As Etienne Jules Marey [1885], champion of the graphic method, stated: 

In effect, in the field of rigorous experimentation all the sciences give a hand. Whatever is the object 
of these studies, that which measures a force or movement, an electrical state or a temperature, whether 
he be a physician, chemist or physiologist, he has recourse to the same method and employs the same 
instruments. 

Development of the galvanometer and the electric telegraph provided design concepts and instruments 
that could be adapted to the measurement of bioelectric events. For example, Thomson's reflecting 
telegraphic galvanometer was used by Caton [1875] to display the first electroencephalogram. Ader's 
telegraphic string galvanometer [1897] was modified by Einthoven [1903] to create the instrument that 
introduced clinical electrocardiography. Gasser and Erlanger [1922] adapted the Braun cathode-ray tube 
to enable recording of short-duration nerve action potentials. Garceau [1935] used the Western Union 
telegraphic recorder, called the Undulator to create the first direct-inking electroencephalograph. How- 
ever, in the early days of bioelectricity, ingenious electrophysiologists appropriated many devices from 
physics and engineering to establish the existence of bioelectric phenomena. 

The First Electrocardiogram 

The electric activity accompanying the heartbeat was discovered with the rheoscopic frog by Kolliker and 
Mueller [1856]. When these investigations laid the nerve over the beating ventricle of a frog heart, the 
muscle twitched once and sometimes twice. Stimulation of the nerve obviously occurred with depolar- 
ization and repolarization of the ventricles. Because at that time there were no rapidly responding 
galvanometers, Donders [1872] recorded the twitches of the rheoscope to provide a graphic demonstra- 
tion of the existence of an electrocardiographic signal. 
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FIGURE HP2.1 The capillary electrometer (a) used by Marey and Lippmann in 1876 and a tortoise ventricular 
electrogram (b) made with it. This is the first cardiac electrogram from a spontaneously beating heart. 



Capillary Electrometer Record 



The capillary electrometer was created especially for recording the electrocardiogram. The principle 
underlying its operation was being investigated by Lippmann, a colleague of Marey in France. The 
phenomenon of electrocapillarity is the change in contour of a drop of mercury in dilute sulfuric acid 
when a current is passed through the mercury — sulfuric acid interface. This phenomenon was put to 
practical use by Marey [1876], who placed the interface in a capillary tube, transilluminated it, and 
recorded the contour change on a moving (falling) photographic plate. The two wires from the elec- 
trometer were connected to electrodes placed against the exposed tortoise ventricle. Figure HP2.1a 
illustrates the capillary electrometer, and Fig. HP2.1fo is a reproduction of the tortoise ventricular elec- 
trogram showing what we now call the R and T waves. 

Rheotome Record 

Probably unaware that Marey had recorded the cardiac electrogram with the capillary electrometer, 
Burdon-Sanderson [1879] in England used a slow-speed, d'Arsonval-type galvanometer, the rheotome 
[see Hoff and Geddes, 1957], and induction-coil stimulator [see Geddes et al., 1989] to reconstruct the 
ventricular electrogram of the frog heart; Fig. HP2.2 illustrates his reconstruction, showing the R and 
T waves; note their similarity with those obtained by Marey in Fig. HP2.1k 
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FIGURE HP2.2 Burdon-Sanderson's plot of the frog cardiac electrogram. Thirty-five points were determined to 
make the reconstruction. [Burdon-Sanderson and Page, 1879.] 



Mammalian Electrocardiograms 



When news of the capillary electrometer reached the United Kingdom, many investigators fabricated 
their own instruments. One of these was Waller, who used it to record the electrocardiogram of a patient 
whom he called Jimmy. In 1910, Waller revealed the identity of Jimmy, his pet bulldog, shown in 
Fig. HP2.3 having his ECG recorded with a forepaw and hindpaw in glass vessels containing saline and 
metal electrodes. 




FIGURE HP2.3 Waller's patient Jimmy having his ECG recorded with the capillary electrometer. (From Waller AD, 
Hitchcock Lectures, University of London, 1910.) 
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FIGURE HP2.4 First human apex cardiogram and capillary electrometer ECG (a) and the dipole map for the heart 
(b), (a from Waller [1887]; b from Waller [1889].) 

Waller [1887] obtained the first ECGs from human subjects; Fig. HP2.4« is one of his records which 
displays the apex cardiogram and the capillary electrometer record, showing the R and T waves. 

At that time there were no standard sites for electrode placement. Using the extremities, Waller 
experimented with different sites, discovering that there were favorable and unfavorable sites, i.e., sites 
where the amplitude was large or small; Table HP2.1 summarizes his findings. From recordings made 
with these electrodes, Waller [1887] proposed that the heart could be represented as a dipole, as shown 
in Fig. HP2.4fo. 

Corrected Capillary Electrometer Records 

By the 1890s, it was known that the response of the capillary electrometer was slow, and methods were 
developed to correct recordings to obtain a true voltage-time record. Burch [1892] in the United Kingdom 
developed a geometric method that used the tangent at each point along the recording. Figure HP2.5 is 
an illustration showing a capillary electrometer record (dark hump) and the true voltage-time record 
(biphasic waveform). 
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TABLE HP2.1 Waller's Electrode Locations 

The unfavorable combinations were: 

Left hand and left foot 
Left hand and right foot 
Right foot and left foot 
Mouth and right hand 

The favorable combinations were: 

Front of chest and back of chest 
Left hand and right hand 
Right hand and right foot 
Right hand and left foot 
Mouth and left hand 
Mouth and right foot 
Mouth and left foot 



One who was very dissatisfied with capillary-electrometer records was Einthoven in the Netherlands. 
He obtained a capillary electrometer record from a subject (Fig. HP2.6a) and applied the correct method 
to create the ECG shown in Fig. HP2.6fo, revealing the intimate details of what he called the Q and S 
waves, not visible in the capillary electrometer record, in which he used A, B, and C to designate what 
he later called the P, R, and T waves. 



Clinical Electrocardiography 
The String Galvanometer 

Einthoven [1903] set himself the task of creating a high-fidelity 
recorder by improving Ader's string telegraphic galvanometer. 
Einthoven used a silvered quarter filament as the conductor and 
increased the field strength surrounding the filament by using a 
strong electromagnet. He added a lens to focus the image of the 
filament onto a moving photographic surface. Thus the thick 
baseline for the string galvanometer recording was the image of 
the "string" (quartz filament). Electrocardiac current caused the 
silvered filament to be deflected, and its excursions, when mag- 
nified optically and recorded photographically, constituted the 
electrocardiogram. Figure HP2.7a illustrates Einthoven's string 
galvanometer, and Fig. HP2.7fo shows a patient in hospital cloth- 
ing having his ECG recorded. Measurement of calibration 
records published by Einthoven reveals a response time (10% to 
90%) of 20 ms. The corresponding sinusoidal frequency 
response is to 25 Hz (30% attenuation). 

Figure HP2.8 illustrates one of Einthoven's electromagnets 
(lower left) and one of his early cameras (center) and string 
galvanometers (right). Above the bench supporting these are 
mounted a bow and arrow, which were used by Einthoven to 
provide the tension on the quartz rod while it was being heated 
to the melting point, after which the arrow left the bow and 
extruded the rod to a long slender filament, which was later 
silvered to make it conducting. 




20 Sec. 

FIGURE HP2.5 Capillary electrometer 
record (dark hump) and the corrected 
voltage-time record (biphasic wave). 
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FIGURE HP2.6 A capillary electrometer record (a) and its corrected version (b) presented by Einthoven to show 
the inadequacy of the capillary electrometer in displaying rapidly changing waveforms. (From FA Willius, 1941. TE 
Keys, Cardiac Classics, St. Louis, Mosby.) 

Einthoven borrowed handsomely from previous work; he used Marey's recording chart speed 
(25 mm/s) and Waller's bucket electrodes, as well as some of his leads. With the string galvanometer, 
Einthoven ushered in clinical electrocardiography in the early 1900s; soon the string galvanometer was 
used worldwide. The first string galvanometer appeared in the United States in 1912, when it was used 
at the Rockefeller Institute Hospital until 1959. This instrument, which is now in the Smithsonian 
Institute, was designed by Horatio B. Williams, professor of physiology at the College of Physicians and 
Surgeons at Columbia University. Williams had spent some time with Einthoven in Leiden in 1910 and 
1911. On his return to New York, Williams had Charles F. Hindle, a machinist at Columbia, construct 
the first American string galvanometer. Soon thereafter, the Cambridge Instrument Company took over 
manufacture of the Hindle instrument and made them available for sale in the United States. 

Although it is clear that the concept of an electrical axis, i.e., a cardiac vector, was demonstrated by 
Waller's studies, it remained for Einthoven [1913] to make practical use of the concept. Einthoven 
postulated that the heart was at the center of an equilateral triangle, the apices of which were the right 
and left shoulders and the point where both legs joined the trunk. In his early studies, Einthoven used 
the right and left arms and both feet in saline-filled buckets as the three electrodes. Soon he found that 
the electrocardiogram was negligibly altered if the right foot was removed from the bucket electrode. 
Thus he adopted three standard leads: right and left arms and left leg (foot). He postulated that if the 
amplitudes of the electrocardiographic waves are plotted on this triaxial reference frame, it is possible to 
calculate the magnitude and direction of an electric vector that produces these same voltages in leads I, 
II, and III, corresponding to the limb electrodes. He further stated that the arithmetic sum of the 
amplitudes in lead I plus III equals the amplitude in lead II. This is Einthoven's law, and the relationship 
is true only for an equilateral triangle reference frame [Valentinuzzi et al., 1970]. 

Vacuum-Tube Electrocardiograph 

Not long after Einthoven described his string galvanometer, efforts were begun in the United States to 
create an electrocardiograph that used vacuum tubes. At that time, there were rapidly responding mirror 
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FIGURE HP2.7 Einthoven's string galvanometer (a) and a patient having his ECG (lead 1) recorded (b). 
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FIGURE HP2.8 Some of Einthoven's early equipment. On the bench (left) is an electromagnet. In the center is a 
camera, and on the right is a string galvanometer. On the wall are a bow and arrow; the latter was used to apply 
force to a quartz rod which was heated, and when pulled by the arrow, created the quartz filament which was later 
silvered. 

galvanometers, as well as a limited number of vacuum tubes, despite the fact that they has been patented 
only a few years earlier [1907] by DeForest. According to Marvin [1954], the first discussions relative to 
such an instrument were held in 1917 between Steinmetz, Neuman, and Robinson of the General Electric 
Engineering Laboratory. The task of establishing feasibility fell to W. R. G. Baker, who assembled a unit 
and demonstrated its operation to those just identified. However, because of urgent wartime priorities, 
the project was shelved. 

In 1921, General Electric reopened the issue of a vacuum-tube ECG. A second prototype was built 
and demonstrated to the Schenectady County Medical Association some time in 1924 by Robinson and 
Marvin. The instrument was used by Drs. Newman, Pardee, Mann, and Oppenheim, all physicians in 
New York City. Subsequently, six commercial models were made. One instrument was sent to each of 
the four physicians just identified; the fifth was sent to the General Electric Company Hospital; and the 
sixth was sent to the AMA Convention in Atlantic City in 1925. This latter instrument became a prototype 
for future models provided by the General Electric X-Ray Division. A U.S. patent application was filed 
on January 15, 1925, and the instrument was described by Mann [1930]. 

On December 22, 1931, a patent on the General Electric vacuum-tube ECG was granted to Marvin 
and Leibing; Fig. HP2.9 shows the circuit diagram of the instrument, including a specimen record {lower 
left, Fig. HP2.9). The instrument used three triode vacuum tubes in a single-sided, resistance-capacitance- 
coupled amplifier. It was battery operated, and a unique feature was a viewing screen that allowed the 
operator to see the motion of the galvanometer beam as it was being recorded by the camera. 

Between introduction of the string galvanometer and the hot-stylus recorder for ECG, attempts were 
made to create direct-inking ECG recorders. In a review of scientific instruments, Brian Matthews [1935] 
reported 
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FIGURE HP2.9 Circuit diagram of the first vacuum-tube ECG, patented on December 12, 1931. 

There are two ink-writing electrocardiographs available, the author's and that of Drs. Duschel and 
Luthi. Both utilize a moving-iron driving unit with oil damping, a tubular pen writing on moving 
paper. The former has a battery-coupled amplifier. The latter gives, in effect, D.C. amplification; the 
potentials to be recorded are interrupted about 500 times per second by a special type of buzzer, after 
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amplification by resistance capacity coupled valves the interrupted output is reflected by the output 
valve; the amplifier achieves in effect what can be done with a battery-coupled amplifier and obviates 
the coupling batteries. At present the speed of these direct-recording instruments is barely adequate 
to show the finer details of the electrocardiogram, but they enable its main features to be recorded 
instantly. 

Despite the instant availability of inked recordings of the ECG, those produced by the string galvanometer 
were superior, and it took some time for a competitor to appear. Such an instrument did appear in the 
form of the hot-stylus recorder. 

Hot-Stylus Recorder 

The final step toward modern electrocardiography was the introduction of the hot-stylus recorder by 
Haynes [1936] of the Bell Telephone Laboratories (New York). Prior to that time, there was colored, wax- 
coated recording paper, the wax being scraped off by a recording stylus, exposing the colored paper. 
Referring to the scraping method, Haynes wrote 

However, the method is not adaptable to many types of recording instruments because of the large 
amount of friction between the recording stylus and paper arising from the pressure necessary to 
engrave the wax. 

This pressure can be removed and the friction largely eliminated by the use of a special stylus consisting 
essentially of a small electric heating coil situated close to the end of a pointed rod in such a way that 
the temperature of the point may be raised to about 80°C. The point is then capable of melting wax 
and engraving its surface with only a very small fraction of the pressure before necessary. 

The described stylus, when used with waxed recording paper, will provide a means of obtaining a 
permanent record without the use of pen and ink which is adaptable to the most sensitive recording 
instruments. 

Following the end of World War II, vacuum-tube electrocardiographs with heated-stylus recorders 
became very popular; they are still in use today. However, the heritage of a thick baseline, derived from 
the string-galvanometer days, had to be preserved for some time because clinicians objected to a thin 
baseline. It took many years for the hot-stylus baseline to be narrowed without protest. 
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BIOMEDICAL SIGNAL ANALYSIS CENTERS ON the acquisition and processing of information- 
bearing signals that emanate from living systems. These vital signals permit us to probe the state 
of the underlying biologic and physiologic structures and dynamics. Therefore, their interpretation 
has significant diagnostic value for clinicians and researchers. 

The detected signals are commonly corrupted with noise. Often, the information cannot be readily 
extracted from the raw signal, which must be processed in order to yield useful results. Signals and 
systems engineering knowledge and, in particular, signal-processing expertise are therefore critical in all 
phases of signal collection and analysis. 

Biomedical engineers are called on to conceive and implement processing schemes suitable for bio- 
medical signals. They also play a key role in the design and development of biomedical monitoring devices 
and systems that match advances in signal processing and instrumentation technologies with biomedical 
needs and requirements. 

This section is organized in two main parts. In the first part, contributing authors review contemporary 
methods in biomedical signal processing. The second part is devoted to emerging methods that hold the 
promise for major enhancements in our ability to extract information from vital signals. 

The success of signal-processing applications strongly depends on the knowledge about the origin and 
the nature of the signal. Biomedical signals possess many special properties and hence require special 
treatment. Also, the need for noninvasive measurements presents unique challenges that demand a clear 
understanding of biomedical signal characteristics. In the lead chapter, entitled, "Biomedical Signals: 
Origin and Dynamic Characteristics; Frequency-Domain Analysis," Arnon Cohen provides a general 
classification of biomedical signals and discusses basics of frequency domain methods. 

The advent of digital computing coupled with fast progress in discrete-time signal processing has led 
to efficient and flexible methods to acquire and treat biomedical data in digital form. The chapter entitled, 
"Digital Biomedical Signal Acquisition and Processing," by Luca T. Mainardi, Anna M. Bianchi, and 
Sergio Cerutti, presents basic elements of signal acquisition and processing in the special context of 
biomedical signals. 

Especially in the case of long-term monitoring, digital biomedical signal-processing applications gen- 
erate vast amounts of data that strain transmission and storage resources. The creation of multipatient 
reference signal bases also places severe demands on storage. Data compression methods overcome these 
obstacles by eliminating signal redundancies while retaining clinically significant information. A. Enis 
Cetin and Hayrettin Koymen provide a comparative overview of a range of approaches from conventional 
to modern compression techniques suitable for biomedical signals. Futuristic applications involving long- 
term and ambulatory recording systems, and remote diagnosis opportunities will be made possible by 
breakthroughs in biomedical data compression. This chapter serves well as a point of departure. 

Constraints such as stationarity (and time invariance), gaussianity (and minimum phaseness), and 
the assumption of a characteristic scale in time and space have constituted the basic, and by now implicit, 
assumptions upon which the conventional signals and systems theories have been founded. However, 
investigators engaged in the study of biomedical processes have long known that they did not hold under 
most realistic situations and hence could not sustain the test of practice. 

Rejecting or at least relaxing restrictive assumptions always opens new avenues for research and yields 
fruitful results. Liberating forces in signals and systems theories have conspired in recent years to create 
research fronts that target long-standing constraints in the established wisdom (dogma?) of classic signal 
processing and system analysis. The emergence of new fields in signals and system theories that address 
these shortcomings and aim to relax these restrictions has been motivated by scientists who, rather than 
mold natural behavior into artificial models, seek methods inherently suited to represent reality. Bio- 
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medical scientists and engineers are inspired by insights gained from a deeper appreciation for the 
dynamic richness displayed by biomedical phenomena; hence, more than their counterparts in other 
disciplines, they more forcefully embrace innovations in signal processing. 

One of these novel directions is concerned with time-frequency representations tailored for nonsta- 
tionary and transient signals. Faye Boudreaux-Bartels and Robin Murray address this issue, provide an 
introduction to concepts and tools of time-frequency analysis, and point out candidate applications. 

Many physiologic structures and dynamics defy the concept of a characteristic spatial and temporal 
scale and must be dealt with employing methods compatible with their multiscale nature. Judging from 
the recent success of biomedical signal-processing applications based on time-scale analysis and wavelet 
transforms, the resolution of many outstanding processing issues may be at hand. The chapter entitled, 
"Time-Scale Analysis and Wavelets in Biomedical Signals," by Nitish V. Thakor, familiarizes the reader 
with fundamental concepts and methods of wavelet analysis and suggests fruitful directions in biomedical 
signal processing. 

The presence of nonlinearities and statistics that do not comply with the gaussianity assumption and 
the desire for phase reconstruction have been the moving forces behind investigations of higher-order 
statistics and polyspectra in signal-processing and system-identification fields. An introduction to the 
topic and potential uses in biomedical signal-processing applications are presented by Athina Petropulu 
in the chapter entitled, "Higher-Order Spectra in Biomedical Signal Processing." 

Neural networks derive their cue from biologic systems and, in turn, mimic many of the functions of 
the nervous system. Simple networks can filter, recall, switch, amplify, and recognize patterns and hence 
serve well many signal-processing purposes. In the chapter entitled, "Neural Networks in Biomedical 
Signal Processing," Evangelia Tzanakou helps the reader explore the power of the approach while stressing 
how biomedical signal-processing applications benefit from incorporating neural-network principles. 

The dichotomy between order and disorder is now perceived as a ubiquitous property inherent in the 
unfolding of many natural complex phenomena. In the last decade, it has become clear that the common 
threads shared by natural forms and functions are the "physics of disorder" and the "scaling order," the 
hallmark of broad classes of fractal entities. Biomedical signals are the global observables of underlying 
complex physical and physiologic processes. "Complexity" theories therefore hold the potential to provide 
mathematical tools that describe and possibly shed light on the internal workings of physiologic systems. 
In the next to last chapter in this section, Banu Onaral and Joseph P. Cammarota introduce the reader 
to basic tenets of complexity theories and the attendant scaling concepts with hopes to facilitate their 
integration into the biomedical engineering practice. 

The section concludes with a brief chapter on the visions of the future when biomedical signal 
processing will merge with the rising technologies in telecommunication and multimedia computing, 
and eventually with virtual reality, to enable remote monitoring, diagnosis, and intervention. The impact 
of this development on the delivery of health care and the quality of life will no doubt be profound. The 
promise of biomedical signal analysis will then be fulfilled. 
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A signal is a phenomenon that conveys information. Biomedical signals are signals, used in biomedical 
fields, mainly for extracting information on a biologic system under investigation. The complete process 
of information extraction may be as simple as a physician estimating the patient's mean heart rate by 
feeling, with the fingertips, the blood pressure pulse or as complex as analyzing the structure of internal 
soft tissues by means of a complex CT machine. 

Most often in biomedical applications (as in many other applications), the acquisition of the signal is 
not sufficient. It is required to process the acquired signal to get the relevant information "buried" in it. 
This may be due to the fact that the signal is noisy and thus must be "cleaned" (or in more professional 
terminology, the signal has to be enhanced) or due to the fact that the relevant information is not "visible" 
in the signal. In the latter case, we usually apply some transformation to enhance the required information. 
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The processing of biomedical signals poses some unique problems. The reason for this is mainly the 
complexity of the underlying system and the need to perform indirect, noninvasive measurements. A 
large number of processing methods and algorithms is available. In order to apply the best method, the 
user must know the goal of the processing, the test conditions, and the characteristics of the underlying 
signal. In this chapter, the characteristics of biomedical signals will be discussed [Cohen, 1986]. Biomed- 
ical signals will be divided into characteristic classes, requiring different classes of processing methods. 
Also in this chapter, the basics of frequency-domain processing methods will be presented. 

52.1 Origin of Biomedical Signals 

From the broad definition of the biomedical signal presented in the preceding section, it is clear that 
biomedical signals differ from other signals only in terms of the application — signals that are used in the 
biomedical field. As such, biomedical signals originate from a variety of sources. The following is a brief 
description of these sources: 

• Bioelectric signals. The bioelectric signal is unique to biomedical systems. It is generated by nerve 
cells and muscle cells. Its source is the membrane potential, which under certain conditions may 
be excited to generate an action potential. In single cell measurements, where specific microelec- 
trodes are used as sensors, the action potential itself is the biomedical signal. In more gross 
measurements, where, for example, surface electrodes are used as sensors, the electric field gen- 
erated by the action of many cells, distributed in the electrode's vicinity, constitutes the bioelectric 
signal. Bioelectric signals are probably the most important biosignals. The fact that most important 
biosystems use excitable cells makes it possible to use biosignals to study and monitor the main 
functions of the systems. The electric field propagates through the biologic medium, and thus the 
potential may be acquired at relatively convenient locations on the surface, eliminating the need 
to invade the system. The bioelectric signal requires a relatively simple transducer for its acquisi- 
tion. A transducer is needed because the electric conduction in the biomedical medium is done 
by means of ions, while the conduction in the measurement system is by electrons. All these lead 
to the fact that the bioelectric signal is widely used in most fields of biomedicine. 

• Bioimpedcmce signals. The impedance of the tissue contains important information concerning its 
composition, blood volume, blood distribution, endocrine activity, automatic nervous system 
activity, and more. The bioimpedance signal is usually generated by injecting into the tissue under 
test sinusoidal currents (frequency range of 50 kHz to 1 MHz, with low current densities of the 
order of 20 uA to 20 mA). The frequency range is chosen to minimize electrode polarization 
problems, and the low current densities are chosen to avoid tissue damage mainly due to heating 
effects. Bioimpedance measurements are usually performed with four electrodes. Two source 
electrodes are connected to a current source and are used to inject the current into the tissue. The 
two measurement electrodes are placed on the tissue under investigation and are used to measure 
the voltage drop generated by the current and the tissue impedance. 

• Bioacoustic signals. Many biomedical phenomena create acoustic noise. The measurement of this 
acoustic noise provides information about the underlying phenomenon. The flow of blood in the 
heart, through the heart's valves, or through blood vessels generates typical acoustic noise. The 
flow of air through the upper and lower airways and in the lungs creates acoustic sounds. These 
sounds, known as coughs, snores, and chest and lung sounds, are used extensively in medicine. 
Sounds are also generated in the digestive tract and in the joints. It also has been observed that 
the contracting muscle produces an acoustic noise (muscle noise). Since the acoustic energy 
propagates through the biologic medium, the bioacoustic signal may be conveniently acquired on 
the surface, using acoustic transducers (microphones or accelerometers). 

• Biomagnetic signals. Various organs, such as the brain, heart, and lungs, produce extremely weak 
magnetic fields. The measurements of these fields provides information not included in other 
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biosignals (such as bioelectric signals). Due to the low level of the magnetic fields to be measured, 
biomagnetic signals are usually of very low signal-to-noise ratio. Extreme caution must be taken 
in designing the acquisition system of these signals. 

• Biomechanical signals. The term biomechanical signals includes all signals used in the biomedicine 
fields that originate from some mechanical function of the biologic system. These signals include 
motion and displacement signals, pressure and tension and flow signals, and others. The mea- 
surement of biomechanical signals requires a variety of transducers, not always simple and inex- 
pensive. The mechanical phenomenon does not propagate, as do the electric, magnetic, and 
acoustic fields. The measurement therefore usually has to be performed at the exact site. This very 
often complicates the measurement and forces it to be an invasive one. 

• Biochemical signals. Biochemical signals are the result of chemical measurements from the living 
tissue or from samples analyzed in the clinical laboratory. Measuring the concentration of various 
ions inside and in the vicinity of a cell by means of specific ion electrodes is an example of such 
a signal. Partial pressures of oxygen (p0 2 ) and of carbon dioxide (pC0 2 ) in the blood or respiratory 
system are other examples. Biochemical signals are most often very low frequency signals. Most 
biochemical signals are actually dc signals. 

• Biooptical signals. Biooptical signals are the result of optical functions of the biologic system, 
occurring naturally or induced by the measurement. Blood oxygenation may be estimated by 
measuring the transmitted and backscattered light from a tissue (in vivo and in vitro) in several 
wavelengths. Important information about the fetus may be acquired by measuring fluorescence 
characteristics of the amniotic fluid. Estimation of the heart output may be performed by the dye 
dilution method, which requires the monitoring of the appearance of recirculated dye in the 
bloodstream. The development of fiberoptic technology has opened vast applications of biooptical 
signals. 

Table 52.1 lists some of the more common biomedical signals with some of their characteristics. 

52.2 Classification of Biosignals 

Biosignals may be classified in many ways. The following is a brief discussion of some of the most 
important classifications. 

• Classification according to source. Biosignals may be classified according to their source or physical 
nature. This classification was described in the preceding section. This classification may be used 
when the basic physical characteristics of the underlying process is of interest, e.g., when a model 
for the signal is desired. 

• Classification according to biomedical application. The biomedical signal is acquired and processed 
with some diagnostic, monitoring, or other goal in mind. Classification may be constructed 
according to the field of application, e.g., cardiology or neurology. Such classification may be of 
interest when the goal is, for example, the study of physiologic systems. 

• Classification according to signal characteristics. From point of view of signal analysis, this is the 
most relevant classification method. When the main goal is processing, it is not relevant what is 
the source of the signal or to which biomedical system it belongs; what matters are the signal 
characteristics. 

We recognize two broad classes of signals: continuous signals and discrete signals. Continuous signals 
are described by a continuous function s(t) which provides information about the signal at any given 
time. Discrete signals are described by a sequence s{m) which provides information at a given discrete 
point on the time axis. Most of the biomedical signals are continuous. Since current technology provides 
powerful tools for discrete signal processing, we most often transform a continuous signal into a discrete 
one by a process known as sampling. A given signal s{t) is sampled into the sequence s{m) by 
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TABLE 52.1 Biomedical Signals 



Classification 


Acquisition 


Frequency Range 


Dynamic Range 


Comments 


Bioelectric 










Action potential 


Microelectrodes 


100 Hz~2 kHz 


lOuV-lOOmV 


Invasive measurement of cell 
membrane potential 


Electroneurogram (ENG) 


Needle electrode 


100 Hz-1 kHz 


5 uV-10 mV 


Potential of a nerve bundle 


Electroretinogram (ERG) 


Microelectrode 


0.2-200 Hz 


0.5 uV-1 mV 


Evoked flash potential 


Electro-oculogram (EOG) 


Surface electrodes 


dc-100 Hz 


10 uV-5 mV 


Steady-corneal-retinal potential 


Electroencephalogram 










(EEG) 










Surface 


Surface electrodes 


0.5-100 Hz 


2-100 uV 


Multichannel (6-32) scalp 
potential 


Delta range 




0.5-4 Hz 




Young children, deep sleep and 
pathologies 


Theta range 




4-8 Hz 




Temporal and central areas 
during alert states 


Alpha range 




8-13 Hz 




Awake, relaxed, closed eyes 


Beta range 




13-22 Hz 






Sleep spindles 




6-15 Hz 


50-100 liV 


Bursts of about 0.2 to 0.6 s 


K-complexes 




12-14 Hz 


100-200 liV 


Bursts during moderate and 
deep sleep 


Evoked potentials (EP) 


Surface electrodes 




0.1-20 uV 


Response of brain potential to 
stimulus 


Visual (VEP) 




1-300 Hz 


1-20 liV 


Occipital lobe recordings, 
200-ms duration 


Somatosensory (SEP) 




2 Hz-3 kHz 




Sensory cortex 


Auditory (AEP) 




100 Hz-3 kHz 


0.5-10 uV 


Vertex recordings 


Electrocorticogram 


Needle electrodes 


100 Hz-5 kHz 




Recordings from exposed 
surface of brain 


Electromyography (EMG) 










Single-fiber (SFEMG) 


Needle electrode 


500 Hz-10 kHz 


1-10 liV 


Action potentials from single 
muscle fiber 


Motor unit action 


Needle electrode 


5 Hz-10 kHz 


100 uV-2 mV 




potential (MUAP) 










Surface EMG (SEMG) 


Surface electrodes 








Skeletal muscle 




2-500 Hz 


50 uV-5 mV 




Smooth muscle 




0.01-1 Hz 






Electrocardiogram (ECG) 


Surface electrodes 


0.05-100 Hz 


1-10 mV 




High-Frequency ECG 


Surface electrodes 


100 Hz-1 kHz 


100 uV-2 mV 


Notchs and slus waveforms 
superimposed on the ECG. 



s H =s W 



-1,0,1,. 



(52.1) 



where T s is the sampling interval and f s = (2n/T s ) is the sampling frequency. Further characteristic 
classification, which applies to continuous as well as discrete signals, is described in Fig. 52.1. 

We divide signals into two main groups: deterministic and stochastic signals. Deterministic signals are 
signals that can be exactly described mathematically or graphically. If a signal is deterministic and its 
mathematical description is given, it conveys no information. Real-world signals are never deterministic. 
There is always some unknown and unpredictable noise added, some unpredictable change in the 
parameters, and the underlying characteristics of the signal that render it nondeterministic. It is, however, 
very often convenient to approximate or model the signal by means of a deterministic function. 

An important family of deterministic signals is the periodic family. A periodic signal is a deterministic 
signal that may be expressed by 
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SIGNAL 




FIGURE 52.1 Classification of signals according to characteristics. 



lt) = lt+nT\ 



(52.2) 



where n is an integer, and T is the period. The periodic signal consists of a basic wave shape with a 
duration of X seconds. The basic wave shape repeats itself an infinite number of times on the time axis. 
The simplest periodic signal is the sinusoidal signal. Complex periodic signals have more elaborate wave 
shapes. Under some conditions, the blood pressure signal may be modeled by a complex periodic signal, 
with the heart rate as its period and the blood pressure wave shape as its basic wave shape. This is, of 
course, a very rough and inaccurate model. 

Most deterministic functions are nonperiodic. It is sometimes worthwhile to consider an "almost 
periodic" type of signal. The ECG signal can sometimes be considered "almost periodic." The ECG's RR 
interval is never constant; in addition, the PQRST complex of one heartbeat is never exactly the same as 
that of another beat. The signal is definitely nonperiodic. Under certain conditions, however, the RR 
interval is almost constant, and one PQRST is almost the same as the other. The ECG may thus sometimes 
be modeled as "almost periodic." 

52.3 Stochastic Signals 

The most important class of signals is the stochastic class. A stochastic signal is a sample function of a 
stochastic process. The process produces sample functions, the infinite collection of which is called the 
ensemble. Each sample function differs from the other in it fine details; however, they all share the same 
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l 1 l 2 

FIGURE 52.2 The ensemble of the stochastic process s(f). 

distribution probabilities. Figure 52.2 depicts three sample functions of an ensemble. Note that at any 
given time, the values of the sample functions are different. 

Stochastic signals cannot be expressed exactly; they can be described only in terms of probabilities 
which may be calculated over the ensemble. 

Assuming a signal s(f), the Mh-order joint probability function 



p[s(t 1 )<s 1) s(t 2 )<s 2 ,...,s(t N )<s N =p(s 1 ,s 2 ,...,s N ) 



(52.3) 



is the joint probability that the signal at time f, will be less than or equal to S, and at time t will be less 
than or equal to S;, etc. This joint probability describes the statistical behavior and intradependence of 
the process. 

It is very often useful to work with the derivative of the joint probability function; this derivative is 
known as the joint probability density function (PDF): 



p(V« 8 ..-.»5 N ) = 



3s, 3s, •••Ss. 



D ( S l> S 2>--" S iv) 



(52.4) 



Of particular interest are the first- and second-order PDFs. 

The expectation of the process s(r), denoted by E{s(t)} or by m s , is a statistical operator defined as 
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E H r )} = I s p{ s ) ds = m > ( 52 - 5 ) 

The expectation of the function s"(t) is known as the nth-order moment. The first-order moment is 
thus the expectation of the process. The wth-order moment is given by 

EJs"(f)|= j s"p(s\ds (52.6) 

Another important statistical operator is the nth central moment: 

\l n = E \(s-m s )" Lj" (s-m s )"p(s)ds (52.7) 



The second central moment is known as the variance (the square root of which is the standard 
deviation). The variance is denoted by a 2 : 

a 2 = H 2 = E\(s-m\ 2 [= \ (s-m s ) 2 p(s)ds (52.8) 

The second-order joint moment is defined by the joint PDF. Of particular interest is the autocorrelation 
function r ss : 

The cross-correlation function is defined as the second joint moment of the signal 5 at time t v s(f), 
and the signal y at time t 2 , y{t 2 ): 

^(f 1 .f 2 )=4 s ( f i)> , ( f 2 )} = JJ oo s ( f i)H^W s "^)^^ (52 - io) 

Stationary stochastic processes are processes whose statistics do not change in time. The expectation 
and the variance (as with any other statistical mean) of a stationary process will be time-independent. The 
autocorrelation function, for example, of a stationary process will thus be a function of the time difference 
x = t 2 - t , (one-dimensional function) rather than a function of t 2 and t x (two-dimensional function). 

Ergodic stationary processes possess an important characteristic: Their statistical probability distribu- 
tions (along the ensemble) equal those of their time distributions (along the time axis of any one of its 
sample functions). For example, the correlation function of an ergodic process may be calculated by its 
definition (along the ensemble) or along the time axis of any one of its sample functions: 



4 t )=#MH}=^1/(#- t )* (52 - n) 



The right side of Eq. (52.11) is the time autocorrelation function. 

Ergodic processes are nice because one does not need the ensemble for calculating the distributions; 
a single sample function is sufficient. From the point of view of processing, it is desirable to model the 
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signal as an ergodic one. Unfortunately, almost all signals are nonstationary (and hence nonergodic). 
One must therefore use nonstationary processing methods (such as, for example, wavelet transformation) 
which are relatively complex or cut the signals into short-duration segments in such a way that each may 
be considered stationary. 

The sleep EEG signal, for example, is a nonstationary signal. We may consider segments of the signal, 
in which the subject was at a given sleep state, as stationary. In order to describe the signal, we need to 
estimate its probability distributions. However, the ensemble is unavailable. If we further assume that 
the process is ergodic, the distributions may be estimated along the time axis of the given sample function. 
Most of the standard processing techniques assume the signal to be stationary and ergodic. 

52.4 Frequency-Domain Analysis 

Until now we have dealt with signals represented in the time domain, that is to say, we have described 
the signal by means of its value on the time axis. It is possible to use another representation for the same 
signal: that of the frequency domain. Any signal may be described as a continuum of sine waves having 
different amplitudes and phases. The frequency representation describes the signals by means of the 
amplitudes and phases of the sine waves. The transformation between the two representations is given 
by the Fourier transform (FT): 

s(co)= f s( [ ty> m dt = FUt ) }] (52.12) 

where co = 2 71/ is the angular frequency, and F{*} is the Fourier operator. 

The inverse Fourier transform (IFT) is the operator that transforms a signal from the frequency domain 
into the time domain: 

s(f) = — f s(co)e jtof dw = ^ 1 {s(co)} (52.13) 

The frequency domain representation S(ro) is complex; hence 



s(oo)= s(co) 



/W (52.14) 



where S(co)|, the absolute value of the complex function, is the amplitude spectrum, and 9(co), the phase 
of the complex function, is the phase spectrum. The square of the absolute value, S(oo)| 2 , is termed the 
power spectrum. The power spectrum of a signal describes the distribution of the signal's power on the 
frequency axis. A signal in which the power is limited to a finite range of the frequency axis is called a 
band-limited signal. Figure 52.3 depicts an example of such a signal. 

The signal in Fig. 52.3 is a band-limited signal; its power spectrum is limited to the frequency range 
-co max < co < co max . It is easy to show that if s(t) is real (which is the case in almost all applications), the 
amplitude spectrum is an even function and the phase spectrum is an odd function. 

Special attention must be given to stochastic signals. Applying the FT to a sample function would 
provide a sample function on the frequency axis. The process may be described by the ensemble of 
spectra. Another alternative to the frequency representation is to consider the correlation function of the 
process. This function is deterministic. The FT may be applied to it, yielding a deterministic frequency 
function. The FT of the correlation function is defined as the power spectral density function (PSD): 



PSD 



(f)] = S s (co) = F{r s (x)} = j"°° r ss [%)e-> m dx (52.15) 
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s(t) 
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FIGURE 52.3 Example of a signal described in the time and frequency domains. 

The PSD is used to describe stochastic signals; it describes the density of power on the frequency axis. 
Note that since the autocorrelation function is an even function, the PSD is real; hence no phase spectrum 
is required. 

The EEG signal may serve as an example of the importance of the PSD in signal processing. When 
processing the EEG, it is very helpful to use the PSD. It turns out that the power distribution of the EEG 
changes according to the physiologic and psychological states of the subject. The PSD may thus serve as 
a tool for the analysis and recognition of such states. 

Very often we are interested in the relationship between two processes. This may be the case, for 
example, when two sides of the brain are investigated by means of EEG signals. The time-domain 
expression of such relationships is given by the cross-correlation function (Eq. 52.10). The frequency- 
domain representation of this is given by the FT of the cross-correlation function, which is called the 
cross-power spectral density function (C-PSD) or the cross-spectrum: 



M4 



? W x )HM w ) 



(52.16) 



Note that we have assumed the signals s(t) and y(t) are stationary; hence the cross-correlation function 
is not a function of time but of the time difference x. Note also that unlike the autocorrelation function, 
r (t) is not even; hence its FT is not real. Both absolute value and phase are required. 

It can be shown that the absolute value of the C-PSD is bounded: 



s>) 2 ^>K» 



(52.17) 



The absolute value information of the C-PSD may thus be normalized to provide the coherence function: 






<i 



(52.18) 



The coherence function is used in a variety of biomedical applications. It has been used, for example, in 
EEG analysis to investigate brain asymmetry. 
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FIGURE 52.4 Amplitude spectrum of a sampled signal with sampling frequency above the Nyquist frequency ( upper 
trace) and below the Nyquist frequency (lower trace). 



52.5 Discrete Signals 



Assume now that the signal s(f) of Fig. 52.3 was sampled using a sampling frequency of f s = coJ(2n) = 
(2n)/T s . The sampled signal is the sequence s{m). The representation of the sampled signal in the 
frequency domain is given by applying the Fourier operator: 



S s (co)=_F{s(m)|=s(co) 



,/»,(•) 



(52.19) 



The amplitude spectrum of the sampled signal is depicted in Fig. 52.4. It can easily be proven that the 
spectrum of the sampled signal is the spectrum of the original signal repeated infinite times at frequencies 
of nco s . The spectrum of a sampled signal is thus a periodic signal in the frequency domain. It can be 
observed, in Fig. 52.4, that provided the sampling frequency is large enough, the wave shapes of the 
spectrum do not overlap. In such a case, the original (continuous) signal may be extracted from the 
sampled signal by low-pass filtering. A low-pass filter with a cutoff frequency of co milx will yield at its 
output only the first period of the spectrum, which is exactly the continuous signal. If, however, the 
sampling frequency is low, the wave shapes overlap, and it will be impossible to regain the continuous 
signal. 

The sampling frequency must obey the inequality 



CO >2co 



(52.20) 



Equation (52.20) is known as the sampling theorem, and the lowest allowable sampling frequency is called 
the Nyquist frequency. When overlapping does occur, there are errors between the sampled and original 
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signals. These errors are known as aliasing errors. In practical applications, the signal does not possess a 
finite bandwidth; we therefore limit its bandwidth by an antialiasing filter prior to sampling. 

The discrete Fourier transform (DFT) [Proakis & Manolakis, 1988] is an important operator that maps 
a finite sequence s(m), m = 0, 1, ... , N— 1, into another finite sequence S(k), k = 0, 1, ... , N — 1. The 
DFT is defined as 



s(fc) = DFTJs(m)| = V s(m)e 



- jkui 



(52.21) 



An inverse operator, the inverse discrete Fourier transform (IDFT), is an operator that transforms the 
sequence S(k) back into the sequence s(m). It is given by 



N-1 

s(m) = IDFt{s(A:)} = - V s{k]i 



jkm 



(52.22) 



It can be shown that if the sequence s{m) represents the samples of the band-limited signal s(f), sampled 
under Nyquist conditions with sampling interval of T s , the DFT sequence S(k) (neglecting windowing 
effects) represents the samples of the FT of the original signal: 



sW = S s (co) „ fc = 0,l,...,N-l 



(52.23) 



N 



Figure 52.5 depicts the DFT and its relations to the FT. Note that the N samples of the DFT span the 
frequency range one period. Since the amplitude spectrum is even, only half the DFT samples carry the 
information; the other half is composed of the complex conjugates of the first half. 



s(m) 



m 
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FIGURE 52.5 The sampled signal s(m) and its DFT. 
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The DFT may be calculated very efficiently by means of the fast (discrete) Fourier transform (FFT) 
algorithm. It is this fact that makes the DFT an attractive means for FT estimation. The DFT provides 
an estimate for the FT with frequency resolution of 

Af=^= 2 ^ (52.24) 

NT 

where Tis the duration of the data window. The resolution may be improved by using a longer window. 
In cases where it is not possible to have a longer data window, e.g., because the signal is not stationary, 
zero padding may be used. The sequence may be augmented with zeroes: 



1 (m) = {s(o),s(l),...,s(iV-l),0 ) ...,o} (52.25) 



The zero padded sequence s A (m), m = 0, 1, ... , L - 1, contains N elements of the original sequence 
and L — N zeroes. It can be shown that its DFT represents the samples of the FT with an increased 
resolution of A/= 2 TtfLr 1 . 

52.6 Data Windows 

Calculation of the various functions previously defined, such as the correlation function, requires knowl- 
edge of the signal from minus infinity to infinity. This is, of course, impractical because the signal is not 
available for long durations and the results of the calculations are expected at a reasonable time. We 
therefore do not use the signal itself but the windowed signal. 

A window w(t) is defined as a real and even function that is also time-limited: 



,{«): 



Vf>772 



The FT of a window W(a) is thus real and even and is not band-limited. 

Multiplying a signal by a window will zero the signal outside the window duration (the observation 
period) and will create a windowed, time-limited signal s w (t): 

s w (t)= s(t)w(t) (52.26) 

In the frequency domain, the windowed signal will be 

S w (co) = s(cd)*w(co) (52.27) 

where (*) is the convolution operator. The effect of windowing on the spectrum of the signal is thus the 
convolution with the FT of the window. A window with very narrow spectrum will cause low distortions. 
A practical window has an FT with a main lobe, where most of its energy is located, and sidelobes, which 
cover the frequency axis. The convolution of the sidelobes with the FT of the signal causes distortions 
known as spectral leakage. Many windows have been suggested for a variety of applications. 

The simplest window is the rectangular (Dirichlet) window; in its discrete form it is given by w(m) = 
1, m = 0, 1, ... , N— 1. A more useful window is the Hamming window, given by 



vim] 



w \m =0.54-0.46cos 



'2K ' 

— m 

v N J 



;m=0,l,...,N-l (52.28) 



The Hamming window was designed to minimize the effects of the first sidelobe. 
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52.7 Short-Time Fourier Transform (STFT) 

The Fourier analysis discussed in preceding sections assumed that the signal is stationary. Unfortunately, 
most signals are nonstationary. A relatively simple way to deal with the problem is to divide the signal 
into short segments. The segments are chosen such that each one by itself can be considered a windowed 
sample of a stationary process. The duration of the segments has to be determined either by having some 
a priori information about the signal or by examining its local characteristics. Depending on the signal 
and the application, the segments may be of equal or different duration. 

We want to represent such a segmented signal in the frequency domain. We define the short-time 
Fourier transform (STFT): 



STFT 



' s (co,x)=F{s(f)w(f-x)}= f s( [ t)w( [ t-x)e- ja 'dt 



(52.29) 



The window is shifted on the time axis to t = x so that the FT is performed on a windowed segment in 
the range x - ( T/2) < r < x + ( T/2). The STFT describes the amplitude and phase-frequency distributions 
of the signal in the vicinity of f = x. 

In general, the STFT is a two-dimensional, time-frequency function. The resolution of the STFT on 
the time axis depends on the duration T of the window. The narrower the window, the better the time 
resolution. Unfortunately, choosing a short-duration window means a wider-band window. The wider 
the window in the frequency domain, the larger the spectral leakage and hence the deterioration of the 
frequency resolution. One of the main drawbacks of the STFT method is the fact that the time and 
frequency resolutions are linked together. Other methods, such as the wavelet transform, are able to better 
deal with the problem. 

In highly nonstationary signals, such as speech signals, equal-duration windows are used. Window 
duration is on the order of 10 to 20 ms. In other signals, such as the EEG, variable-duration windows 
are used. In the EEG, windows on the order of 5 to 30 s are often used. 

A common way for representing the two-dimensional STFT function is by means of the spectrogram. 
In the spectrogram, the time and frequency axes are plotted, and the STFT PSD value is given by the 
gray-scale code or by a color code. Figure 52.6 depicts a simple spectrogram. The time axis is quantized 
to the window duration T. The gray scale codes the PSD such that black denotes maximum power and 
white denotes zero power. In Figure 52.6, the PSD is quantized into only four levels of gray. The 
spectrogram shows a signal that is nonstationary in the time range to 8 T. In this time range, the PSD 
possesses a peak that is shifted from about 0.6f to about 0.1 f at time 0.71! From time 0.&T, the signal 
becomes stationary with a PSD peak power in the low-frequency range and the high-frequency range. 



0.5f 
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FIGURE 52.6 A spectrogram. 
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52.8 Spectral Estimation 

The PSD is a very useful tool in biomedical signal processing. It is, however, impossible to calculate, since 
it requires infinite integration time. Estimation methods must be used to acquire an estimate of the PSD 
from a given finite sample of the process under investigation. Many algorithms for spectral estimation 
are available in the literature [Kay, 1988], each with its advantages and drawbacks. One method may be 
suitable for processes with sharp spectral peaks, while another will perform best for broad, smoothed 
spectra. An a priori knowledge on the type of PSD one is investigating helps in choosing the proper 
spectral estimation method. Some of the PSD estimation methods will be discussed here. 

The Blackman-Tukey Method 

This method estimates the PSD directly from its definition (Eq. 52.15) but uses finite integration time 
and an estimate rather than the true correlation function. In its discrete form, the PSD estimation is 



m 

M a) ) =T .L f «M e "* m5: 

m = -M 
N-i-l 

f 4 w )=iE x ( m+; ') 



rn=-M 

(52.30) 

N-i-l 



N 
;=o 

where N is the number of samples used for the estimation of the correlation coefficients, and M is the 
number of correlation coefficients used for estimation of the PSD. Note that a biased estimation of the 
correlation is employed. Note also that once the correlations have been estimated, the PSD may be 
calculated by applying the FFT to the correlation sequence. 

The Periodogram 

The periodogram estimates the PSD directly from the signal without the need to first estimate the 
correlation. It can be shown that 



(co) = lim E 



rrl-A f > 



2T J- 



(52.31) 



The PSD presented in Eq. (52.31) requires infinite integration time. The periodogram estimates the PSD 
from a finite observation time by dropping the lim operator. It can be shown that in its discrete form, 
the periodogram estimator is given by 



m 2 

(<fl) = ^DFTJx(m)| (52.32) 



N 



The great advantage of the periodogram is that the DFT operator can very efficiently be calculated by 
the FFT algorithm. 

A modification to the periodogram is weighted overlapped segment averaging (WOSA). Rather than 
using one segment of N samples, we divide the observation segment into shorter subsegments, perform 
a periodogram for each one, and then average all periodograms. The WOSA method provides a smoother 
estimate of the PSD. 
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FIGURE 52.7 Time-series model for the signal s(m). 



Time-Series Analysis Methods 

Time-series analysis methods model the signal as an output of a linear system driven by a white source. 
Figure 52.7 depicts this model in its discrete form. Since the input is a white noise process (with zero 
mean and unity variance), the PSD of the signal is given by 



S ss (co)=h(co)' 



(52.33) 



The PSD of the signal may thus be represented by the system's transfer function. Consider a general 
pole-zero system with p poles and q zeros [ARMA(p, q)]: 



y\b.z-' 



V 

l + V'a.z - ' 



=i 



Its absolute value evaluated on the frequency axis is 



(52.34) 



H(co)' 



hz'' 



1+ > a.z ' 



(52.35) 



Several algorithms are available for the estimation of the model's coefficients. The estimation of the 
ARMA model parameters requires the solution of a nonlinear set of equations. The special case of q = 
0, namely, an all-pole model [AR(p)], may be estimated by means of linear equations. Efficient AR 
estimation algorithms are available, making it a popular means for PSD estimation. Figure 52.8 shows 
the estimation of EMG PSD using several estimation methods. 

52.9 Signal Enhancement 

The biomedical signal is very often a weak signal contaminated by noise. Consider, for example, the 
problem of monitoring the ECG signal. The signal is acquired by surface electrodes that pick up the electric 
potential generated by the heart muscle. In addition, the electrodes pick up potentials from other active 
muscles. When the subject is at rest, this type of noise may be very small, but when the subject is an athlete 
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FIGURE 52.8 PSD of surface EMG. ( Upper trace) Blackman-Tukey (256 correlation coefficients and 256 padding 
zeroes). (Middle trace) Periodogram (512 samples and 512 padding zeroes). {Lower trace) AR model (p = 40). 

performing some exercise, the muscle noise may become dominant. Additional noise may enter the system 
from electrodes motion, from the power lines, and from other sources. The first task of processing is 
usually to enhance the signal by "cleaning" the noise without (if possible) distorting the signal. 
Assume a simple case where the measured signal x(t) is given by 



x(f) = s(f)+n(t) x(co) = s(co)+N(co) 



(52.36) 



where s(t) is the desired signal and n(t) is the additive noise. For simplicity, we assume that both the 
signal and noise are band-limited, namely, for the signal, S(co) = 0, for co max < CO, G) min > CO. Figure 52.9 
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FIGURE 52.9 Noisy signal in the frequency domain. 



depicts the PSD of the signal in two cases, the first where the PSD of the signal and noise do not overlap 
and the second where they do overlap (for the sake of simplicity, only the positive frequency axis was 
plotted). We want to enhance the signal by means of linear filtering. The problem is to design the linear 
filter that will provide best enhancement. Assuming we have the filter, its output, the enhanced signal, 
is given by 



y\ 



{A = At\*h[t\ f(co)=x(co)h(cd) 



(52.37) 



where y(t) = s(t) + n (t) is the enhanced output, and h(t) is the impulse response of the filter. The solution 
for the first case is trivial; we need an ideal bandpass filter whose transfer function H(a>) is 




otherwise 



(52.38) 



Such a filter and its output are depicted in Fig. 52.10. 

As is clearly seen in Fig. 52.10, the desired signal s(f) was completely recovered from the given noisy 
signal x(t). Practically, we do not have ideal filters, so some distortions and some noise contamination 
will always appear at the output. With the correct design, we can approximate the ideal filter so that the 
distortions and noise may be as small as we desire. The enhancement of overlapping noisy signals is far 
from being trivial. 

52.10 Optimal Filtering 

When the PSD of signal and noise overlap, complete, undistorted recovery of the signal is impossible. 
Optimal processing is required, with the first task being definition of the optimality criterion. Different 
criteria will result in different solutions to the problem. Two approaches will be presented here: the Wiener 
filter and the matched filter. 
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FIGURE 52.10 (a) An ideal bandpass filter. ( b) Enhancement of a nonoverlapping noisy signal by an ideal bandpass 
filter. 



Minimization of Mean Squared Error: The Wiener Filter 

Assume that our goal is to estimate, at time t + ^, the value of the signal s(t + ^), based on the observations 
x(t). The case Ej = is known as smoothing, while the case cj > is called prediction. 

We define an output error e(f) as the error between the filter's output and the desired output. The 
expectation of the square of the error is given by 



(52.39) 



E{e\t)} = E\[s(t + ^-y(t + ^) 

s(t + cj)- hU)x(t-x)dl 



The integral term on the right side of Eq. (52.39) is the convolution integral expressing the output of 
the filter. 

The minimization of Eq. (52.39) with respect of h{t) yields the optimal filter (in the sense of minimum 
squared error). The minimization yields the Wiener-Hopf equation: 
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r^ + i) = \ h(n)r x ^-ri)dr] (52.40) 

In the frequency domain, this equation becomes 

S„(co)e^ = H pt (co)s xx (co) (52.41) 



from which the optimal filter H pt (co) can be calculated 



S (go) S (oo) 



CO ) = ^7 e M = "V / e M ( 52 _4 2) 

If the signal and noise are uncorrected and either the signal or the noise has zero mean, the last equation 
becomes 

i \ S (oo) 

H , (0 = ? ; l ; / , e M (52.43) 

The optimal filter requires a priori knowledge of the PSD of noise and signal. These are very often not 
available and must be estimated from the available signal. The optimal filter given in Eqs. (52.42) and 
(52.43) is not necessarily realizable. In performing the minimization, we have not introduced a constraint 
that will ensure that the filter is causal. This can be done, yielding the realizable optimal filter. 

Maximization of the Signal-to-Noise Ratio: The Matched Filter 

The Wiener filter was optimally designed to yield an output as close as possible to the signal. In many 
cases we are not interested in the fine details of the signal but only in the question whether the signal 
exists at a particular observation or not. Consider, for example, the case of determining the heart rate 
of a subject under noisy conditions. We need to detect the presence of the R wave in the ECG. The exact 
shape of the wave is not important. For this case, the optimality criterion used in the last section is not 
suitable. To find a more suitable criterion, we define the output signal-to-noise ratio: Let us assume that 
the signal s(i) is a deterministic function. The response of the filter, s(t) = s(t) * h(t), to the signal is also 
deterministic. We shall define the output signal-to-noise ratio 

as the optimality criterion. The optimal filter will be the filter that maximizes the output SNR at a certain 
given time t = t . The maximization yields the following integral equation: 

J h(^jr m (x-^)d^ = as(t o -x) o<X<T (52.45) 

where T is the observation time and a is any constant. This equation has to be solved for any given noise 
and signal. 
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A special important case is the case where the noise is a white noise so that its autocorrelation function 
is a delta function. In this case, the solution of Eq. (52.45) is 



h ( X ) = ^- X ) 



(52.46) 



where N is the noise power. For this special case, the impulse response of the optimal filter has the form 
of the signal run backward, shifted to the time t . This type of filter is called a matched filter. 

52.11 Adaptive Filtering 

The optimal filters discussed in the preceding section assumed the signals to be stationary with known 
PSD. Both assumptions rarely occur in reality. In most biomedical applications, the signals are nonsta- 
tionary with unknown PSD. To enhance such signals, we require a filter that will continuously adjust 
itself to perform optimally under the changing circumstances. Such a filter is called an adaptive filter 
[Widrow & Stearns, 1985]. 

The general description of an adaptive filter is depicted in Fig. 52.11. The signal s{t) is to be corrected 
according to the specific application. The correction may be enhancement or some reshaping. The signal 
is given in terms of the noisy observation signal x{t). The main part of the system is a filter, and the 
parameters (gain, poles, and zeroes) are controllable by the adaptive algorithm. The adaptive algorithm 
has some a priori information on the signal and the noise (the amount and type of information depend 
on the application). It also has a correction criterion, according to which the signal is operating. The 
adaptive algorithm also gets the input and output signals of the filter so that its performance can be 
analyzed continuously. 

The adaptive filter requires a correction algorithm. This can best be implemented digitally. Most 
adaptive filters therefore are implemented by means of computers or special digital processing chips. 

An important class of adaptive filters requires a reference signal. The knowledge of the noise required 
by this type of adaptive filter is a reference signal that is correlated with the noise. The filter thus has two 
inputs: the noisy signal x(t) = s(f) + n(t) and the reference signal n R (f). The adaptive filter, functioning 
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FIGURE 52.11 Adaptive filter, general scheme. 
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as a noise canceler, estimates the noise n(t) and, by subtracting it from the given noisy input, gets an 
estimate for the signal. Hence 



y\ 



(t)=x(t)-h(t\=s(t) + n(t)-h(t) =s(t) 



(52.47) 



The output of the filter is the enhanced signal. Since the reference signal is correlated with the noise, 
the following relationship exists: 



nJco) = g(co)n(co) 



(52.48) 



which means that the reference noise may be represented as the output of an unknown filter G(co). The 
adaptive filter estimates the inverse of this unknown noise filter and from its estimates the noise: 



^t)=r l [G- l (<o)NJi<o)} 



(52.49) 



The estimation of the inverse filter is done by the minimization of some performance criterion. There 
are two dominant algorithms for the optimization: the recursive least squares (RLS) and the least mean 
squares (LMS). The LMS algorithm will be discussed here. 

Consider the mean square error 



E{e^ = E{/(t)} = E^t) + [n(t)-n(t) 



(52.50) 



The right side of Eq. (52.50) is correct, assuming that the signal and noise are uncorrelated. We are 
searching for the estimate G~ l ((o) that will minimize the mean square error: E{[n(t) — n(t)] 2 }. Since the 
estimated filter affects only the estimated noise, the minimization of the noise error is equivalent to the 
minimization of Eq. (52.50). The implementation of the LMS filter will be presented in its discrete form 
(see Fig. 52.12). 

The estimated noise is 



where 



"4 m ) = H v ' w ' =v »' w 

1 = 

y l =[ v o> n>---> V P ]= L n R [m-\),..., n R [m-p) 
w r = |w , w, ..., w\ 



(52.51) 



(52.52) 



The vector w represents the filter. The steepest descent minimization of Eq. (52.50) with respect to the 
filter's coefficients w yields the iterative algorithm 



w .,. = w + 2|le v . 



(52.53) 
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FIGURE 52.12 Block diagram of LMS adaptive noise canceler. 

where U is a scalar that controls the stability and convergence of the algorithm. In the evaluation of 
Eq. (52.53), the assumption 



dE 



HI 



3e 2 . 

j_ 



(52.54) 



was made. This is indeed a drastic approximation; the results, however, are very satisfactory. Figure 54.12 
depicts the block diagram of the LMS adaptive noise canceler. 

The LMS adaptive noise canceler has been applied to many biomedical problems, among them 
cancellation of power-line interferences, elimination of electrosurgical interferences, enhancement of fetal 
ECG, noise reduction for the hearing impaired, and enhancement of evoked potentials. 

52.12 Segmentation of Nonstationary Signals 

Most biomedical signals are nonstationary, yet the common processing techniques (such as the FT) deal 
with stationary signals. The STFT is one method of processing nonstationary signals, but it does require, 
however, the segmentation of the signal into "almost" stationary segments. The signal is thus represented 
as a piecewise-stationary signal. 

An important problem in biomedical signal processing is efficient segmentation. In very highly non- 
stationary signals, such as the speech signal, short, constant-duration (of the order of 15 ms) segments 
are used. The segmentation processing in such a case is simple and inexpensive. In other cases such as 
the monitoring of nocturnal EEG, a more elaborate segmentation procedure is called for because the 
signal may consist of "stationary" segments with very wide duration range. Segmentation into a priori 
fixed- duration segments will be very inefficient in such cases. 
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FIGURE 52.13 Adaptive segmentation of simulated EEC First 2.5 seconds and last 2.5 seconds were simulated by 
means of different AR models. {Lower trace) SEM calculated with fixed reference window. A new segment has been 
detected at t— 2.5. [From Cohen, 1986, with permission.] 

Several adaptive segmentation algorithms have been suggested. Figure 52.13 demonstrates the basic 
idea of these algorithms. A fixed reference window is used to define an initial segment of the signal. The 
duration of the reference window is determined such that it is long enough to allow a reliable PSD 
estimate yet short enough so that the segment may still be considered stationary. Some a priori infor- 
mation about the signal will help in determining the reference window duration. A second, sliding window 
is shifted along the signal. The PSD of the segment defined by the sliding window is estimated at each 
window position. The two spectra are compared using some spectral distance measure. As long as this 
distance measure remains below a certain decision threshold, the reference segment and the sliding 
segment are considered close enough and are related to the same stationary segment. Once the distance 
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measure exceeds the decision threshold, a new segment is defined. The process continues by defining the 
last sliding window as the reference window of the new segment. 
Let us define a relative spectral distance measure 



»■(-)-£ 



^H-S,(co) V 



s,H 



d(0 (52.55) 



/ 



where S K (co) and S,((o) are the PSD estimates of the reference and sliding segments, respectively, and co M 
is the bandwidth of the signal. A normalized spectral measure was chosen, since we are interested in 
differences in the shape of the PSD and not in the gain. 

Some of the segmentation algorithms use growing reference windows rather than fixed ones. This is 
depicted in the upper part of Fig. 52.13. The various segmentation methods differ in the way the PSDs 
are estimated. Two of the more well-known segmentation methods are the auto-correlation measure 
method (ACM) and the spectral error measure (SEM). 
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Biologic signals carry information that is useful for comprehension of the complex pathophysiologic 
mechanisms underlying the behavior of living systems. Nevertheless, such information cannot be available 
directly from the raw recorded signals; it can be masked by other biologic signals contemporaneously 
detected (endogenous effects) or buried in some additive noise (exogenous effects). For such reasons, 
some additional processing is usually required to enhance the relevant information and to extract from 
it parameters that quantify the behavior of the system under study, mainly for physiologic studies, or 
that define the degree of pathology for routine clinical procedures (diagnosis, therapy, or rehabilitation). 

Several processing techniques can be used for such purposes (they are also called preprocessing tech- 
niques); time- or frequency-domain methods including filtering, averaging, spectral estimation, and 
others. Even if it is possible to deal with continuous time waveforms, it is usually convenient to convert 
them into a numerical form before processing. The recent progress of digital technology, in terms of 
both hardware and software, makes digital rather than analog processing more efficient and flexible. 
Digital techniques have several advantages: Their performance is generally powerful, being able to easily 
implement even complex algorithms, and accuracy depends only on the truncation and round-off errors, 
whose effects can be predicted and controlled by the designer and are largely unaffected by other 
unpredictable variables such as component aging and temperature, which can degrade the performances 
of analog devices. Moreover, design parameters can be more easily changed because they involve software 
rather than hardware modifications. 

A few basic elements of signal acquisition and processing will be presented in the following; our aim 
is to stress mainly the aspects connected with acquisition and analysis of biologic signals, leaving to the 
cited literature a deeper insight into the various subjects for both the fundamentals of digital signal 
processing and the applications. 

53.1 Acquisition 

A schematic representation of a general acquisition system is shown in Fig. 53.1. Several physical mag- 
nitudes are usually measured from biologic systems. They include electromagnetic quantities (currents, 
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FIGURE 53.1 General block diagram of the acquisition procedure of a digital signal. 



potential differences, field strengths, etc.), as well as mechanical, chemical, or generally nonelectrical 
variables (pressure, temperature, movements, etc.). Electric signals are detected by sensors (mainly elec- 
trodes), while nonelectric magnitudes are first converted by transducers into electric signals that can be 
easily treated, transmitted, and stored. Several books of biomedical instrumentation give detailed descrip- 
tions of the various transducers and the hardware requirements associated with the acquisition of the 
different biologic signals [Cobbold, 1988; Tompkins & Webster, 1981; Webster, 1992]. 

An analog preprocessing block is usually required to amplify and filter the signal (in order to make it 
satisfy the requirements of the hardware such as the dynamic of the analog-to-digital converter), to 
compensate for some unwanted sensor characteristics, or to reduce the portion of undesired noise. 
Moreover, the continuous-time signal should be bandlimited before analog-to-digital (A/D) conversion. 
Such an operation is needed to reduce the effect of aliasing induced by sampling, as will be described in 
the next section. Here it is important to remember that the acquisition procedure should preserve the 
information contained in the original signal waveform. This is a crucial point when recording biologic 
signals, whose characteristics often may be considered by physicians as indices of some underlying 
pathologies (i.e., the ST-segment displacement on an ECG signal can be considered a marker of ischemia, 
the peak-and-wave pattern on an EEG tracing can be a sign of epilepsy, and so on). Thus the acquisition 
system should not introduce any form of distortion that can be misleading or can destroy real pathologic 
alterations. For this reason, the analog prefiltering block should be designed with constant modulus and 
linear phase (or zero-phase) frequency response, at least in the passband, over the frequencies of interest. 
Such requirements make the signal arrive undistorted up to the A/D converter. 

The analog waveform is then A/D converted into a digital signal; i.e., it is transformed into a series of 
numbers, discretized both in time and amplitude, that can be easily managed by digital processors. The 
A/D conversion ideally can be divided in two steps, as shown in Fig. 53.1: the sampling process, which 
converts the continuous signal in a discrete-time series and whose elements are named samples, and a 
quantization procedure, which assigns the amplitude value of each sample within a set of determined 
discrete values. Both processes modify the characteristics of the signal, and their effects will be discussed 
in the following sections. 

The Sampling Theorem 

The advantages of processing a digital series instead of an analog signal have been reported previously. 
Furthermore, the basic property when using a sampled series instead of its continuous waveform lies in 
the fact that the former, under certain hypotheses, is completely representative of the latter. When this 
happens, the continuous waveform can be perfectly reconstructed just from the series of sampled values. 
This is known as the sampling theorem (or Shannon theorem) [Shannon, 1949] . It states that a continuous- 
time signal can be completely recovered from its samples if, and only if, the sampling rate is greater than 
twice the signal bandwidth. 

In order to understand the assumptions of the theorem, let us consider a continuous band-limited 
signal x(t) (up to f b ) whose Fourier transform X(f) is shown in Fig. 53.2a and suppose to uniformly 
sample it. The sampling procedure can be modeled by the multiplication of x(t) with an impulse train 
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FIGURE 53.2 Effect of sampling frequency (f s ) on a band-limited signal (up to frequency^,). Fourier transform of 
the original time signal (a), of the sampled signal when f s < 2f b (b), and when f t > 2f b (c). The dark areas in part b 
indicate the aliased frequencies. 



(t)= £a(t-*r f ) 



(53.1) 



where 8(t) is the delta (Dirac) function, k is an integer, and T s is the sampling interval. The sampled 
signal becomes 



s (t)=4t)-i(t)= £x( f )-5( f - fc r) 



(53.2) 



Taking into account that multiplication in time domain implies convolution in frequency domain, we 
obtain 

X s (f) = x(f)*l(f) = x(f)*± 5>(/-fc/J=i £*(/-tf.) (53-3) 

S fc--oo co S k--co Co 

where /, = 1/T S is the sampling frequency. 

Thus X s (f), i.e., the Fourier transform of the sampled signal, is periodic and consists of a series of 
identical repeats of X{f) centered around multiples of the sampling frequency, as depicted in Fig. 53.2fc,c. 
It is worth noting in Fig. 53.2b that the frequency components of X{f) placed above f s /2 appears, when 
f s < Iff,, as folded back, summing up to the lower-frequency components. This phenomenon is known 
as aliasing (higher component look "alias" lower components). When aliasing occurs, the original infor- 
mation (Fig. 53.2a) cannot be recovered because the frequency components of the original signal are 
irreversibly corrupted by the overlaps of the shifted versions of X(f). 
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FIGURE 53.3 Power spectrum of an EEG signal (originally bandlimited up to 40 Hz). The presence of 50-Hz mains 
noise (a) causes aliasing error in the 30-Hz component (i.e., in the fj diagnostic band) in the sampled signal (b) if 
f s = 80 Hz. 

A visual inspection of Fig. 53.2 allows one to observe that such frequency contamination can be avoided 
when the original signal is bandlimited [X(f) = for /> f b ] and sampled at a frequency^ > 2f b . In this 
case, shown in Fig. 53.2c, no overlaps exist between adjacent reply of X(f), and the original waveform 
can be retrieved by low-pass filtering the sampled signal [Oppenheim &Schafer, 1975]. Such observations 
are the basis of the sampling theorem previously reported. 

The hypothesis of a bandlimited signal is hardly verified in practice, due to the signal characteristics 
or to the effect of superimposed wideband noise. It is worth noting that filtering before sampling is 
always needed even if we assume the incoming signal to be bandlimited. Let us consider the following 
example of an EEG signal whose frequency content of interest ranges between and 40 Hz (the usual 
diagnostic bands are 5, to 3.5 Hz; 9, 4 to 7 Hz, a, 8 to 13 Hz; p, 14 to 40 Hz). We may decide to sample 
it at 80 Hz, thus literally respecting the Shannon theorem. If we do it without prefiltering, we could find 
some unpleasant results. Typically, the 50-Hz mains noise will replicate itself in the signal band (30 Hz, 
i.e., the (3 band), thus corrupting irreversibly the information, which is of great interest from a physiologic 
and clinical point of view. The effect is shown in Fig. 53.3a (before sampling) and Fig. 53.3b (after 
sampling). Generally, it is advisable to sample at a frequency greater than 2f b [Gardenhire, 1964] in order 
to take into account the nonideal behavior of the filter or the other preprocessing devices. Therefore, the 
prefiltering block of Fig. 53.1 is always required to bandlimit the signal before sampling and to avoid 
aliasing errors. 

The Quantization Effects 

The quantization produces a discrete signal, whose samples can assume only certain values according to 
the way they are coded. Typical step functions for a uniform quantizer are reported in Fig. 53 Aa,b, where 
the quantization interval A between two quantization levels is evidenced in two cases: rounding and 
truncation, respectively. 

Quantization is a heavily nonlinear procedure, but fortunately, its effects can be statistically modeled. 
Figure 53 Ac,d shows it; the nonlinear quantization block is substituted by a statistical model in which 
the error induced by quantization is treated as an additive noise e{ri) (quantization error) to the signal 
x(n). The following hypotheses are considered in order to deal with a simple mathematical problem: 

1. e(n) is supposed to be a white noise with uniform distribution. 

2. e(n) and x(n) are uncorrected. 

First of all, it should be noted that the probability density of e(n) changes according to the adopted 
coding procedure. If we decide to round the real sample to the nearest quantization level, we have -A/2 < 
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FIGURE 53.4 Nonlinear relationships for rounding (a) and truncation (fa) quantization procedures. Description 
of quantization block (c) by a statistical model (d) and probability densities for the quantization noise e{n) for 
rounding (e) and truncation (/). A is the quantization interval. 

e( n) < A/2, while if we decide to truncate the sample amplitude, we have -A < e( ri) < 0. The two probability 
densities are plotted in Fig. 53.4e,/ 

The two ways of coding yield processes with different statistical properties. In the first case the mean 
and variance value of e(n) are 



m =0 



2 =A 2 /12 



while in the second case m e = -A/2, and the variance is still the same. Variance reduces in the presence 
of a reduced quantization interval as expected. 

Finally, it is possible to evaluate the signal-to-noise ratio (SNR) for the quantization process: 



SNR = 10lo gli 



v ,y 



d0log 1( 



: 6.026+10.79+ 10 log 10 (a 2 .] (53.4) 



having set A = 2~ 2h and where a 2 , is the variance of the signal and b is the number of bits used for coding. 
It should be noted that the SNR increases by almost 6 dB for each added bit of coding. Several forms of 
quantization are usually employed: uniform, nonuniform (preceding the uniform sampler with a non- 
linear block), or roughly (small number of quantization levels and high quantization step). Details can 
be found in Carassa [1983], Jaeger [1982], and Widrow [1956]. 

53.2 Signal Processing 

A brief review of different signal-processing techniques will be given in this section. They include 
traditional filtering, averaging techniques, and spectral estimators. 
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x(n) 



Digital Filter 

T[] 



y(n) 

► 



y(n) = T[x(n)] 



FIGURE 53.5 General block diagram of a digital filter. The output digital signal y(n) is obtained from the input 
x( n) by means of a transformation T[ ■ ] which identifies the filter. 



Only the main concepts of analysis and design of digital filters are presented, and a few examples are 
illustrated in the processing of the ECG signal. Averaging techniques will then be described briefly and 
their usefulness evidenced when noise and signal have similar frequency contents but different statistical 
properties; an example for evoked potentials enhancement from EEG background noise is illustrated. 
Finally, different spectral estimators will be considered and some applications shown in the analysis of 
RR fluctuations [i.e., the heart rate variability (HRV) signal]. 

Digital Filters 

A digital filter is a discrete-time system that operates some transformation on a digital input signal x(n) 
generating an output sequence y{n), as schematically shown by the block diagram in Fig. 53.5. The 
characteristics of transformation T[ ■ ] identify the filter. The filter will be time-variant if T[ ■ ] is a function 
of time or time-invariant otherwise, while is said to be linear if, and only if, having x^n) and x^n) as 
inputs producing y x (n) and y 2 {n), respectively, we have 

rfaxj + bx 2 ] = ar[x 1 ]+ for[x 2 ] = ay l + by 2 . (53.5) 

In the following, only linear, time-invariant filters will be considered, even if several interesting 
applications of nonlinear [Glaser & Ruchkin, 1976; Tompkins, 1993] or time-variant [Cohen, 1983; 
Huta & Webster, 1973; Thakor, 1987; Widrow et al., 1975] filters have been proposed in the literature for 
the analysis of biologic signals. 

The behavior of a filter is usually described in terms of input-output relationships. They are usually 
assessed by exciting the filter with different inputs and evaluating which is the response (output) of the 
system. In particular, if the input is the impulse sequence 5(n), the resulting output, the impulse response, 
has a relevant role in describing the characteristic of the filter. Such a response can be used to determine 
the response to more complicated input sequences. In fact, let us consider a generic input sequence x(n) 
as a sum of weighted and delayed impulses. 

x(n) = V x(fc)-5(n-fc) (53.6) 

/i = -oo,oo 

and let us identify the response to 8{n - k) as h{n — k). If the filter is time-invariant, each delayed impulse 
will produce the same response, but time-shifted; due to the linearity property, such responses will be 
summed at the output: 

y(n)= ^ x ( k )- h ( n ~ k ) ( 53 - 7 ) 

(.- = -00,00 

This convolution product links input and output and defines the property of the filter. Two of them 
should be recalled: stability and causality. The former ensures that bounded (finite) inputs will produce 
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bounded outputs. Such a property can be deduced by the impulse response; it can be proved that the 
filter is stable if and only if 



I I'M 



<°° (53.8) 



Causality means that the filter will not respond to an input before the input is applied. This is in agreement 
with our physical concept of a system, but it is not strictly required for a digital filter that can be 
implemented in a noncausal form. A filter is causal if and only if 

/j(fc) = forfc<0 

Even if relation (53.7) completely describes the properties of the filter, most often it is necessary to express 
the input-output relationships of linear discrete-time systems under the form of the z-transform operator, 
which allows one to express relation (53.7) in a more useful, operative, and simpler form. 

The z-Transform 

The z-transform of a sequence x(n) is defined by [Rainer et al., 1972] 

x[z)= Yx(k)-z~ k (53.9) 

where z is a complex variable. This series will converge or diverge for different z values. The set of z 
values which makes Eq. (53.9) converge is the region of convergence, and it depends on the series x(n) 
considered. 

Among the properties of the z-transform, we recall 

• The delay (shift) property: 

If w(n) = x(n-r) then w(z) = x(z) • z~ r 

• The product of convolution: 

If w(n} = Yx[k}- y[n-k} then W [ z ) = X [ z ) ' Y [ z ) 

fc=-~,°o 

The Transfer Function in the z-Domain 

Thanks to the previous property, we can express Eq. (53.7) in the z-domain as a simple multiplication: 

y(z)= H^Y x(z) (53.10) 

where H(z), known as transfer function of the filter, is the z-transform of the impulse response. H(z) 
plays a relevant role in the analysis and design of digital filters. The response to input sinusoids can be 
evaluated as follows: Assume a complex sinusoid x(n) = e> m " T s as input, the correspondent filter output 
will be 

y(n)= Y^h(k)e iaT2( "- k) = e'^ Y h(k)e~^ = x(n) ■ h(z) ^ (53.1 1) 

Jt=0,~ k=0,oo 
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Then a sinusoid in input is still the same sinusoid 
at the output, but multiplied by a complex quantity 
H((£>). Such complex function defines the response 
of the filter for each sinusoid of © pulse in input, 
and it is known as the frequency response of the 
filter. It is evaluated in the complex z plane by com- 
puting H(z) for z = e> anTs , namely, on the point locus 
that describes the unitary circle on the z plane 
(|g>»<T s | = 1). As a complex function, H(co) will be 
defined by its module H(co) and by its phase 
ZH(co) functions, as shown in Fig. 53.6 for a mov- 
ing average filter of order 5. The figure indicates that 
the lower-frequency components will come through 
the filter almost unaffected, while the higher-fre- 
quency components will be drastically reduced. It is 
usual to express the horizontal axis of frequency 
response from to n. This is obtained because only 
pulse frequencies up to co ( /2 are reconstructable 
(due to the Shannon theorem), and therefore, in the 
horizontal axis, the value of aT s is reported which 
goes from to 71. Furthermore, Fig. 53.6b demon- 
strates that the phase is piecewise linear, and in cor- 
respondence with the zeros of |H(q>)|, there is a 
change in phase of 7t value. According to their fre- 
quency response, the filters are usually classified as 
(1) low-pass, (2) high-pass, (3) bandpass, or (4) 
bandstop filters. Figure 53.7 shows the ideal fre- 
quency response for such filters with the proper low- 
and high-frequency cutoffs. 

For a large class of linear, time-invariant systems, 
H(z) can be expressed in the following general form: 



a) 



Lb z 
m 

I \ m = 0,M 



(53.12) 




b) 







71 



FIGURE 53.6 Modulus ( a) and phase ( b) diagrams of 
the frequency response of a moving average filter of 
order 5. Note that the frequency plots are depicted up 
to 7t. In fact, taking into account that we are dealing 
with a sampled signal whose frequency information is 
up to fJ2, we have co max = 2nfJ2 = nf s or <B m „ = n if 
normalized with respect to the sampling rate. 



1+ ja,z 

k = l,N 



which describes in the z domain the following difference equation in this discrete time domain: 



v(n)=-Y, a ky( n - k ) + Y, b 'A n ~ m ) 



(53.13) 



When at least one of the a k coefficients is different from zero, some output values contribute to the 
current output. The filter contains some feedback, and it is said to be implemented in a recursive form. 
On the other hand, when the a k values are all zero, the filter output is obtained only from the current 
or previous inputs, and the filter is said to be implemented in a nonrecursive form. 

The transfer function can be expressed in a more useful form by finding the roots of both numerator 
and denominator: 
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FIGURE 53.7 Ideal frequency-response moduli for low-pass (a), high-pass (&), bandpass (c), and bandstop filters (d). 



«(,). 



kz~- M n tM {z-z) 

m=l,M l m^ 

n ^i, N ( z "ft) 



(53.14) 



where z m are the zeroes and p k are the poles. It is worth noting that H(z) presents N — M zeros in 
correspondence with the origin of the z plane and M zeroes elsewhere (N zeroes totally) and N poles. The 
pole-zero form of H(z) is of great interest because several properties of the filter are immediately available 
from the geometry of poles and zeroes in the complex z plane. In fact, it is possible to easily assess stability 
and by visual inspection to roughly estimate the frequency response without making any calculations. 

Stability is verified when all poles lie inside the unitary circle, as can be proved by considering the 
relationships between the z-transform and the Laplace s-transform and by observing that the left side of 
the 5 plane is mapped inside the unitary circle [Jackson, 1986; Oppenheim & Schafer, 1975]. 

The frequency response can be estimated by noting that (z- z m )\ z= /° nT ' is a vector joining the mth zero 
with the point on the unitary circle identified by the angle (£>T S . Defining 



•{-•.) 



A = ( z ~P k )_ 



(53.15) 



.*"■, 



we obtain 



H(CO): 



fr„n , m \b 

m=l,M m\ 

n \a\ 



ZH(co)= £z5 m -£zV(N-M)coT. 



(53.16) 



k=l,N 
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b) 




FIGURE 53.8 Poles and zeroes geometry (a) and relative frequency response modulus (fe) and phase (c) charac- 
teristics. Moving around the unitary circle a rough estimation of |ff(co)| and ZH(w) can be obtained. Note the zeroes' 
effects at n and nil and modulus rising in proximity of the poles. Phase shifts are clearly evident in part c closer to 
zeros and poles. 

Thus the modulus of H(co) can be evaluated at any frequency co° by computing the distances between 
poles and zeroes and the point on the unitary circle corresponding to CO = co°, as evidenced by Fig. 53.8, 
where a filter with two pairs of complex poles and three zeros is considered. 

To obtain the estimate of H(co), we move around the unitary circle and roughly evaluate the effect of 
poles and zeroes by keeping in mind a few rules [Challis & Kitney, 1982]: (1) when we are close to a 
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zero, |H(co)| will approach zero, and a positive phase shift will appear in ZH(co) as the vector from the 
zero reverses its angle; (2) when we are close to a pole, H(co) will tend to peak, and a negative phase 
change is found in ZH(co) (the closer the pole to unitary circle, the sharper is the peak until it reaches 
infinite and the filter becomes unstable); and (3) near a closer pole-zero pair, the response modulus will 
tend to zero or infinity if the zero or the pole is closer, while far from this pair, the modulus can be 
considered unitary. As an example, it is possible to compare the modulus and phase diagram of Fig. 53.8 b,c 
with the relative geometry of the poles and zeros of Fig. 53.8a. 

FIR and IIR Filters 

A common way of classifying digital filters is based on the characteristics of their impulse response. For 
finite impulse response (FIR) filters, h(n) is composed of a finite number of nonzero values, while for 
infinite response (IIR) filters, h(n) oscillates up to infinity with nonzero values. It is clearly evident that 
in order to obtain an infinite response to an impulse in input, the IIR filter must contain some feedback 
that sustains the output as the input vanishes. The presence of feedback paths requires putting particular 
attention to the filter stability. 

Even if FIR filters are usually implemented in a nonrecursive form and IIR filters in a recursive form, 
the two ways of classification are not coincident. In fact, as shown by the following example, a FIR filter 
can be expressed in a recursive form 



4)= E--- E •-jSte 



k=Q,N-l k=0,N-l 

for a more convenient computational implementation. 

As shown previously, two important requirements for filters are stability and linear phase response. 
FIR filters can be easily designed to fulfill such requirements; they are always stable (having no poles 
outside the origin), and the linear phase response is obtained by constraining the impulse response 
coefficients to have symmetry around their midpoint. Such constrain implies 

K=±b' M - m (53-18) 

where the b m are the M coefficients of an FIR filter. The sign + or - stays in accordance with the symmetry 
(even or odd) and M value (even or odd). This is a necessary and sufficient condition for FIR filters to 
have linear phase response. Two cases of impulse response that yield a linear phase filter are shown in 
Fig. 53.9. 

It should be noted that condition (53.18) imposes geometric constrains to the zero locus of H(z). 
Taking into account Eq. (53.12), we have 



(')■ 



z M H z\ = H 



' l" 



(53.19) 



Thus, both z m and llz* m must be zeros of H(z). Then the zeroes of linear phase FIR filters must lie on 
the unitary circle, or they must appear in pairs and with inverse moduli. 

Design Criteria 

In many cases, the filter is designed in order to satisfy some requirements, usually on the frequency 
response, which depend on the characteristic of the particular application the filter is intended for. It is 
known that ideal filters, like those reported in Fig. 53.7, are not physically realizable (they would require 
an infinite number of coefficients of impulse response); thus we can design FIR or IIR filters that can 
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Center of symmetry 



b) 



Center of symmetry 



FIGURE 53.9 Examples of impulse response for linear phase FIR filters: odd (a) and even (b) number of coefficients. 

only mimic, with an acceptable error, the ideal response. Figure 53.10 shows a frequency response of a 
not ideal low-pass filter. Here, there are ripples in passband and in stopband, and there is a transition 
band from passband to stopband, defined by the interval co s - co . 

Several design techniques are available, and some of them require heavy computational tasks, which 
are capable of developing filters with defined specific requirements. They include window technique, 
frequency-sampling method, or equiripple design for FIR filters. Butterworth, Chebychev, elliptical 
design, and impulse-invariant or bilinear transformation are instead employed for IIR filters. For detailed 
analysis of digital filter techniques, see Antoniou [1979], Cerutti [1983], and Oppenheim and Schafer 
[1975]. 

Examples 

A few examples of different kinds of filters will be presented in the following, showing some applications 
on ECG signal processing. It is shown that the ECG contains relevant information over a wide range of 
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FIGURE 53.10 Amplitude response for a real low-pass filter. Ripples are admitted in both passband and stopband, 
but they are constrained into restricted areas. Limitations are also imposed to the width of the transition band. 

frequencies; the lower-frequency contents should be preserved for correct measurement of the slow ST 
displacements, while higher-frequency contents are needed to correctly estimate amplitude and duration 
of the faster contributions, mainly at the level of the QRS complex. Unfortunately, several sources of 
noise are present in the same frequency band, such as, higher-frequency noise due to muscle contraction 
(EMG noise), the lower- frequency noise due to motion artifacts (baseline wandering), the effect of 
respiration or the low-frequency noise in the skin-electrode interface, and others. 

In the first example, the effect of two different low-pass filters will be considered. An ECG signal 
corrupted by an EMG noise (Fig. 53.11a) is low-pass filtered by two different low-pass filters whose 
frequency responses are shown in Fig. 53.11&,c. The two FIR filters have cutoff frequencies at 40 and 
20 Hz, respectively, and were designed through window techniques (Weber-Cappellini window, filter 
length = 256 points) [Cappellini et al., 1978]. 

The output signals are shown in Fig. 53.11d,e. Filtering drastically reduces the superimposed noise but 
at the same time alters the original ECG waveform. In particular, the R wave amplitude is progressively 
reduced by decreasing the cutoff frequency, and the QRS width is progressively increased as well. On the 
other hand, P waves appear almost unaffected, having frequency components generally lower than 20 to 
30 Hz. At this point, it is worth noting that an increase in QRS duration is generally associated with 
various pathologies, such as ventricular hypertrophy or bundle-branch block. It is therefore necessary to 
check that an excessive band limitation does not introduce a false-positive indication in the diagnosis of 
the ECG signal. 

An example of an application for stopband filters (notch filters) is presented in Fig. 53.12. it is used 
to reduce the 50-Hz mains noise on the ECG signal, and it was designated by placing a zero in corre- 
spondence of the frequency we want to suppress. 

Finally, an example of a high-pass filter is shown for the detection of the QRS complex. Detecting the 
time occurrence of a fiducial point in the QRS complex is indeed the first task usually performed in ECG 
signal analysis. The QRS complex usually contains the higher-frequency components with respect to the 
other ECG waves, and thus such components will be enhanced by a high-pass filter. Figure 53.13 shows 
how QRS complexes (Fig. 53.13 a) can be identified by a derivative high-pass filter with a cutoff frequency 
to decrease the effect of the noise contributions at high frequencies (Fig. 53.13&). The filtered signal 
(Fig. 53.13c) presents sharp and well-defined peaks that are easily recognized by a threshold value. 

Signal Averaging 

Traditional filtering performs very well when the frequency content of signal and noise do not overlap. 
When the noise bandwidth is completely separated from the signal bandwidth, the noise can be decreased 
easily by means of a linear filter according to the procedures described earlier. On the other hand, when 
the signal and noise bandwidth overlap and the noise amplitude is enough to seriously corrupt the signal, 
a traditional filter, designed to cancel the noise, also will introduce signal cancellation or, at least, 
distortion. As an example, let us consider the brain potentials evoked by a sensory stimulation (visual, 
acoustic, or somatosensory) generally called evoked potentials (EP). Such a response is very difficult to 
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determine because its amplitude is generally much lower than the background EEG activity. Both EP 
and EEG signals contain information in the same frequency range; thus the problem of separating the 
desired response cannot be approached via traditional digital filtering [Aunon et al., 1981]. Another 
typical example is in the detection of ventricular late potentials (VLP) in the ECG signal. These potentials 
are very small in amplitude and are comparable with the noise superimposed on the signal and also for 
what concerns the frequency content [Simson, 1981]. In such cases, an increase in the SNR may be 
achieved on the basis of different statistical properties of signal and noise. 

When the desired signal repeats identically at each iteration (i.e., the EP at each sensory stimulus, the 
VLP at each cardiac cycle), the averaging technique can satisfactorily solve the problem of separating 
signal from noise. This technique sums a set of temporal epochs of the signal together with the super- 
imposed noise. If the time epochs are properly aligned, through efficient trigger-point recognition, the 
signal waveforms directly sum together. If the signal and the noise are characterized by the following 
statistical properties: 

1. All the signal epochs contain a deterministic signal component x(n) that does not vary for all the 
epochs. 

2. The superimposed noise w{ri) is a broadband stationary process with zero mean and variance cr 2 
so that 



E 
E 



{-) 
(») 



= 

(53.20) 



-a 2 



3. Signal x{n) and noise w(n) are uncorrelated so that the recorded signal y(n) at the rth iteration 
can be expressed as 

/[«) = x(«)+ win), (53.21) 

Then the averaging process yields y t : 



N N 

y{ n ) = ^H y ' =x ( n ) + H w l n ) (53 - 22) 



The noise term is an estimate of the mean by taking the average of N realizations. Such an average is a 
new random variable that has the same mean of the sum terms (zero in this case) and which has variance 
of a 2 /N. The effect of the coherent averaging procedure is then to maintain the amplitude of the signal 
and reduce the variance of the noise by a factor of N. In order to evaluate the improvement in the SNR 
(in rms value) in respect to the SNR (at the generic ith sweep): 

SNR=SNR ; -a/n (53.23) 

Thus signal averaging improves the SNR by a factor of Jn in rms value. 

A coherent averaging procedure can be viewed as a digital filtering process, and its frequency charac- 
teristics can be investigated. From expression (53.17) through the z-transform, the transfer function of 
the filtering operation results in 



4). 



1 , "* , -2* , , -( N - l ) h 

1+Z+Z+ — Yz v ' 

(53.24) 
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where N is the number of elements in the average, and h is the number of samples in each response. An 
alternative expression for H(z) is 



HI 



(')■ 



1 1- 

N 1- 



(53.25) 



This is a moving average low-pass filter as discussed earlier, where the output is a function of the preceding 
value with a lag of h samples; in practice, the filter operates not on the time sequence but in the sweep 
sequence on corresponding samples. 

The frequency response of the filter is shown in Fig. 53.14 for different values of the parameter N. In this 
case, the sampling frequency f s is the repetition frequency of the sweeps, and we may assume it to be 1 without 
loss of generality. The frequency response is characterized by a main lobe with the first zero corresponding 
to /= UN and by successive secondary lobes separated by zeroes at intervals UN. The width of each tooth 
decreases as well as the amplitude of the secondary lobes when increasing the number N of sweeps. 

The desired signal is sweep-invariant, and it will be unaffected by the filter, while the broadband noise 
will be decreased. Some leakage of noise energy takes place in the center of the sidelobes and, of course, 
at zero frequency. Under the hypothesis of zero mean noise, the dc component has no effect, and the 




FIGURE 53.14 Equivalent frequency response for the signal-averaging procedure for different values of N (see text). 
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diminishing sidelobe amplitude implies the leakage to be not relevant for high frequencies. It is important 
to recall that the average filtering is based on the hypothesis of broadband distribution of the noise and 
lack of correlation between signal and noise. Unfortunately, these assumptions are not always verified in 
biologic signals. For example, the assumptions of independence of the background EEG and the evoked 
potential may be not completely realistic [Gevins & Remond, 1987]. In addition, much attention must 
be paid to the alignment of the sweeps; in fact, slight misalignments (fiducial point jitter) will lead to a 
low-pass filtering effect of the final result. 

Example 

As mentioned previously, one of the fields in which signal-averaging technique is employed extensively 
is in the evaluation of cerebral evoked response after a sensory stimulation. Figure 53.15a shows the EEG 
recorded from the scalp of a normal subject after a somatosensory stimulation released at time t = 0. 
The evoked potential (N = 1) is not visible because it is buried in the background EEG (upper panel). 
In the successive panels there is the same evoked potential after averaging different numbers of sweeps 
corresponding to the frequency responses shown in Fig. 53.14. As N increases, the SNR is improved by 
a factor Jn (in rms value), and the morphology of the evoked potential becomes more recognizable 
while the EEG contribution is markedly diminished. In this way it is easy to evaluate the quantitative 
indices of clinical interest, such as the amplitude and the latency of the relevant waves. 

Spectral Analysis 

The various methods to estimate the power spectrum density (PSD) of a signal may be classified as 
nonparametric and parametric. 

Nonparametric Estimators of PSD 

This is a traditional method of frequency analysis based on the Fourier transform that can be evaluated 
easily through the fast Fourier transform (FFT) algorithm [Marple, 1987]. The expression of the PSD as 
a function of the frequency P(f) can be obtained directly from the time series y(n) by using the peri- 
odogram expression 



AH 



T.tkY™ -^W) 



(53.26) 



where T s is the sampling period, N is the number of samples, and Y(f) is the discrete time Fourier 
transform of y{n). 

On the basis of the Wiener-Khintchin theorem, PSD is also obtainable in two steps from the FFT of 
the autocorrelation function R yy {k) of the signal, where R yy (k) is estimated by means of the following 
expression: 



1 = 

where * denotes the complex conjugate. Thus the PSD is expressed as 

N 

p(f)=T ■ V R yy [k}e-> lltfkT ' (53.28) 

k=-N 
based on the available lag estimates RJk), where -(1/2T 5 ) </< (1/2T S ). 
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FIGURE 53.15 Enhancement of evoked potential (EP) by means of averaging technique. The EEG noise is pro- 
gressively reduced, and the EP morphology becomes more recognizable as the number of averaged sweeps (N) is 
increased. 

FFT-based methods are widely diffused, for their easy applicability, computational speed, and direct 
interpretation of the results. Quantitative parameters are obtained by evaluating the power contribution 
at different frequency bands. This is achieved by dividing the frequency axis in ranges of interest and by 
integrating the PSD on such intervals. The area under this portion of the spectrum is the fraction of the 
total signal variance due to the specific frequencies. However, autocorrelation function and Fourier 
transform are theoretically defined on infinite data sequences. Thus errors are introduced by the need 
to operate on finite data records in order to obtain estimators of the true functions. In addition, for the 
finite data set it is necessary to make assumptions, sometimes not realistic, about the data outside the 
recording window; commonly they are considered to be zero. This implicit rectangular windowing of 
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the data results in a special leakage in the PSD. Different windows that smoothly connect the side samples 
to zero are most often used in order to solve this problem, even if they may introduce a reduction in the 
frequency resolution [Harris, 1978]. Furthermore, the estimators of the signal PSD are not statistically 
consistent, and various techniques are needed to improve their statistical performances. Various methods 
are mentioned in the literature; the methods of Dariell [1946], Bartlett [1948], and Welch [1970] are the 
most diffused ones. Of course, all these procedures cause a further reduction in frequency resolution. 

Parametric Estimators 

Parametric approaches assume the time series under analysis to be the output of a given mathematical 
model, and no drastic assumptions are made about the data outside the recording window. The PSD is 
calculated as a function of the model parameters according to appropriate expressions. A critical point 
in this approach is the choice of an adequate model to represent the data sequence. The model is 
completely independent of the physiologic, anatomic, and physical characteristics of the biologic system 
but provides simply the input-output relationships of the process in the so-called black-box approach. 
Among the numerous possibilities of modeling, linear models, characterized by a rational transfer 
function, are able to describe a wide number of different processes. In the most general case, they are 
represented by the following linear equation that relates the input-driving signal w{k) and the output of 
an autoregressive moving average (ARMA) process: 

P i 

y(k} = -'Ya i y(k-i} + 'Ybw(k-j} + w{k} (53.29) 

where w{k) is the input white noise with zero mean value and variance X 2 , p and q are the orders of AR 
and MA parts, respectively, and a i and b are the proper coefficients. 

The ARMA model may be reformulated as an AR or an MA if the coefficients b or a, are, respectively, 
set to zero. Since the estimation of the AR parameters results in liner equations, AR models are usually 
employed in place of ARMA or MA models, also on the basis of the Wold decomposition theorem 
[Marple, 1987] that establishes that any stationary ARMA or MA process of finite variance can be 
represented as a unique AR model of appropriate order, even infinite; likewise, any ARMA or AR process 
can be represented by an MA model of sufficiently high order. 

The AR PPSD is then obtained from the following expression: 



# 



X 2 T \ 2 T. 



v 
1+ V a.z~ 



nM 



2=expl j2KfT s I 



1 = 1 



(53.30) 
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The right side of the relation puts into evidence the poles of the transfer function that can be plotted in 
the z-transform plane. Figure 53.16fo shows the PSD function of the HRV signal depicted in Fig. 53.16a, 
while Fig. 53.16c displays the corresponding pole diagram obtained according to the procedure described 
in the preceding section. 

Parametric methods are methodologically and computationally more complex than the nonparametric 
ones, since they require an a priori choice of the structure and of the order of the model of the signal- 
generation mechanism. Some tests are required a posteriori to verify the whiteness of the prediction 
error, such as the Anderson test (autocorrelation test) [Box & Jenkins, 1976] in order to test the reliability 
of the estimation. Postprocessing of the spectra can be performed as well as for nonparametric approaches 
by integrating the P(f) function in predefined frequency ranges; however, the AR modeling has the 
advantage of allowing a spectral decomposition for a direct and automatic calculation of the power and 
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FIGURE 53.16 (a) Interval tachogram obtained from an ECG recording as the sequence of the RR time intervals 
expressed in seconds as a function of the beat number. ( b) PSD of the signal ( a) evaluated by means of an AR model 
(see text), (c) Pole diagram of the PSD shown in (b). 



frequency of each spectral component. In the z-transform domain, the autocorrelation function (ACF) 
R(k) and the P(z) of the signal are related by the following expression: 



R(k) = —[ p{z)z k ' 1 dz 

v ; 2%jJ\z\=i y ' 



(53.31) 



If the integral is calculated by means of the residual method, the ACF is decomposed into a sum of 
dumped sinusoids, each one related to a pair of complex conjugate poles, and of dumped exponential 
functions, related to the real poles [Zetterberg, 1969]. The Fourier transform of each one of these terms 
gives the expression of each spectral component that fits the component related to the relevant pole or 
pole pair. The argument of the pole gives the central frequency of the component, while the rth spectral 
component power is the residual y t in the case of real poles and 2Re(y { ) in case of conjugate pole pairs. 
y,- is computed from the following expression: 
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(53.32) 



It is advisable to point out the basic characteristics of the two approaches that have been described 
above: the nonparametric and the parametric. The latter (parametric) has evident advantages with respect 
to the former, which can be summarized in the following: 
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• It has a more statistical consistency even on short segments of data; i.e., under certain assumptions, 
a spectrum estimated through autoregressive modeling is a maximum entropy spectrum (MES). 

• The spectrum is more easily interpretable with an "implicit" filtering of what is considered random 
noise. 

• An easy and more reliable calculation of the spectral parameters (postprocessing of the spectrum), 
through the spectral decomposition procedure, is possible. Such parameters are directly interpret- 
able from a physiologic point of view. 

• There is no need to window the data in order to decrease the spectral leakage. 

• The frequency resolution does not depend on the number of data. 

On the other hand, the parametric approach 

• Is more complex from a methodologic and computational point of view. 

• Requires an a priori definition of the kind of model (AR, MA, ARMA, or other) to be fitted and 
mainly its complexity defined (i.e., the number of parameters). 

Some figures of merit introduced in the literature may be of help in determining their value [Akaike, 
1974]. Still, this procedure may be difficult in some cases. 

Example 

As an example, let us consider the frequency analysis of the heart rate variability (HRV) signal. In 
Fig. 53.16a, the time sequence of the RR intervals obtained from an ECG recording is shown. The RR 
intervals are expressed in seconds as a function of the beat number in the so-called interval tachogram. 
It is worth noting that the RR series is not constant but is characterized by oscillations of up to the 10% 
of its mean value. These oscillations are not causal but are the effect of the action of the autonomic nervous 
system in controlling heart rate. In particular, the frequency analysis of such a signal (Fig. 53.16& shows 
the PSD obtained by mean of an AR model) has evidenced three principal contributions in the overall 
variability of the HRV signal. A very low frequency (VLF) component is due to the long-term regulation 
mechanisms that cannot be resolved by analyzing a few minutes of signal (3 to 5 minutes are generally 
studied in the traditional spectral analysis of the HRV signal). Other techniques are needed for a complete 
understanding of such mechanisms. The low-frequency (LF) component is centered around 0.1 Hz, in a 
range between 0.03 and 0.15 Hz. An increase in its power has always been observed in relation to sympathetic 
activations. Finally, the high-frequency (HF) component, in synchrony with the respiration rate, is due to 
the respiration activity mediated by the vagus nerve; thus it can be a marker of vagal activity. In particular, 
LF and HF power, both in absolute and in normalized units (i.e., as percentage value on the total power 
without the VLF contribution), and their ratio LF/HF are quantitative indices widely employed for the 
quantification of the sympathovagal balance in controlling heart rate [Malliani et al., 1991]. 

53.3 Conclusion 

The basic aspects of signal acquisition and processing have been illustrated, intended as fundamental 
tools for the treatment of biologic signals. A few examples also were reported relative to the ECG signal, 
as well as EEG signals and EPs. Particular processing algorithms have been described that use digital 
filtering techniques, coherent averaging, and power spectrum analysis as reference examples on how 
traditional or innovative techniques of digital signal processing may impact the phase of informative 
parameter extraction from biologic signals. They may improve the knowledge of many physiologic 
systems as well as help clinicians in dealing with new quantitative parameters that could better discrim- 
inate between normal and pathologic cases. 
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Defining Terms 

Aliasing: Phenomenon that takes place when, in A/D conversion, the sampling frequency f s is lower 

than twice the frequency content f h of the signal; frequency components above fJ2 are folded back 

and are summed to the lower-frequency components, distorting the signal. 
Averaging: Filtering technique based on the summation of N stationary waveforms buried in casual 

broadband noise. The SNR is improved by a factor of J~N . 
Frequency response: A complex quantity that, multiplied by a sinusoid input of a linear filter, gives 

the output sinusoid. It completely characterizes the filter and is the Fourier transform of the impulse 

response. 
Impulse reaction: Output of a digital filter when the input is the impulse sequence 5(n). It completely 

characterizes linear filters and is used for evaluating the output corresponding to different kinds 

of inputs. 
Notch filter: A stopband filter whose stopped band is very sharp and narrow. 
Parametric methods: Spectral estimation methods based on the identification of a signal generating 

model. The power spectral density is a function of the model parameters. 
Quantization error: Error added to the signal, during the A/D procedure, due to the fact that the 

analog signal is represented by a digital signal that can assume only a limited and predefined set 

of values. 
Region of convergence: In the z-transform plane, the ensemble containing the z-complex points that 

makes a series converge to a finite value. 
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Computerized electrocardiogram (ECG), electroencephalogram (EEG), and magnetoencephalogram 
(MEG) processing systems have been widely used in clinical practice [ 1 ] and they are capable of recording 
and processing long records of biomedical signals. The use of such systems (1) enables the construction 
of large signal databases for subsequent evaluation and comparison, (2) makes the transmission of 
biomedical information feasible over telecommunication networks in real time or off line, and 
(3) increases the capabilities of ambulatory recording systems such as the Holter recorders for ECG 
signals. In spite of the great advances in VLSI memory technology, the amount of data generated by 
digital systems may become excessive quickly. For example, a Holter recorder needs more than 
200 Mbits/day of memory space to store a dual-channel ECG signal sampled at a rate of 200 samples/sec. 
with 10 bit/sample resolution. Since the recorded data samples are correlated with each other, there is 
an inherent redundancy in most biomedical signals. This can be exploited by the use of data compression 
techniques which have been successfully utilized in speech, image, and video signals [2] as well. 

The aim of any biomedical signal compression scheme is to minimize the storage space without losing 
any clinically significant information, which can be achieved by eliminating redundancies in the signal, 
in a reasonable manner. 

Data compression methods can be classified into two categories: (1) lossless and (2) lossy coding 
methods. In lossless data compression, the signal samples are considered to be realizations of a random 
variable or a random process and the entropy of the source signal determines the lowest compression 
ratio that can be achieved. In lossless coding the original signal can be perfectly reconstructed. For typical 
biomedical signals lossless (reversible) compression methods can only achieve Compression Ratios 1 (CR) 
in the order of 2 to 1. On the other hand lossy (irreversible) techniques may produce CR results in the 
order of 10 to 1. In lossy methods, there is some kind of quantization of the input data which leads to 



'CR is defined as the ration of the total number of bits used to represent the digital signal before and after 
compression. 
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higher CR results at the expense of reversibility. But this may be acceptable as long as no clinically 
significant degradation is introduced to the encoded signal. The CR levels of 2 to 1 are too low for most 
practical applications. Therefore, lossy coding methods which introduce small reconstruction errors are 
preferred in practice. In this section we review the lossy biomedical data compression methods. 

54.2 Time-Domain Coding of Biomedical Signals 

Biomedical signals can be compressed in time domain, frequency domain, or time-frequency domain. 
In this subsection the time domain techniques, which are the earlier approaches to biomedical signal 
compression, are reviewed. 

Data Compression by DPCM 

Differential pulse code modulation (DPCM) is a well-known data coding technique in which the main 
idea is to decorrelate the input signal samples by linear prediction. The current signal sample, x(n), is 
estimated from the past samples by using either a fixed or adaptive linear predictor 



k{n\ = S\a k x{n-k\ (54.1) 



where x(n) is the estimate of x(ri) at discrete time instant n, and {a k } is the predictor weight. 2 The samples 
of the estimation error sequence, e(n) = x(n) - x(n) are less correlated with each other compared to the 
original signal, x{n) as the predictor removes the unnecessary information which is the predictable 
portion of the sample x(n). In a typical DPCM encoder, the error sequence is quantized by using a 
nonuniform quantizer and quantizer outputs are entropy coded by assigning variable-length codewords 
to the quantized error sequence according to the frequency of occurrence. The variable length codebook 
is constructed by Huffman coding which assigns shorter (longer) codewords to values occurring with 
higher (lower) probabilities. Huffman coding produces compression results that are arbitrarily close to 
the entropy of the quantized error sequence. 

A CR of 7.8 was reported for an ECG signal recorded at a rate of 500 Hz with 8 bit/sample resolution 
[6]. This means that about 1 bit/sample is used to represent the ECG signal. The corresponding Percent 
Root Mean Square Difference (PRD) was 3.5. The PRD is a measure of reconstruction error and it is 
defined as follows: 



PRD-- 
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where Nis the total number of samples in the ECG signal, x(n), and x rec (n) is the reconstructed ECG signal. 
A typical DPCM encoder may become a lossless coder if the quantization step is skipped. In this case 
CR drops drastically to low values. 

AZTEC ECG Compression Method 

The Amplitude Zone Time Epoch Coding (AZTEC) is one of the earliest ECG coding methods. It was 
developed by Cox et al., [12] as a preprocessing software for real-time monitoring of ECGs. It was 



2 In practical DPCM implementation, instead of x(n - k), quantized past samples of the signal are used in the 
linear predictor [2]. 

© 2000 by CRC Press LLC 




FIGURE 54. 1 AZTEC representation of an ECG waveform. 



observed to be useful for automatic analysis of ECGs such as QRS detection, but it is inadequate for 
visual presentation of the ECG signal as the reconstructed signal has a staircase appearance. 

In this method the ECG signal is considered to consist of flat regions and "slopes". If the signal value 
stays within a predetermined constant for more than three consecutive samples then that region is 
assumed to be constant and stored by its duration (number of samples) and a fixed amplitude value. 
Otherwise the signal is assumed to have a slope. A slope is stored by its duration and the amplitude of 
the last sample point. Linear interpolation is used to reconstruct the ECG signal. As a result, the resulting 
signal has a discontinuous nature as shown in Fig. 54.1. 

Even through AZTEC produces a high compression ratio (CR = 10 to 1, for 500 Hz sampled data with 
12 bit/sample resolution) the quality of the reconstructed signal is very low and it is not acceptable to 
the cardiologists. 

Various modified of AZTEC are proposed [3], One notable example is the CORTES technique [13]. 
The CORTES technique is a hybrid of the AZTEC and the Turning Point method which is described in 
the next subsection. 

Turning Point ECG Compression Method 

The Turning Point data reduction method [14] is basically an adaptive downsampling method developed 
especially for ECGs. It reduces the sampling frequency of an ECG signal by a factor of two. 

The method is based on the trends of the signal samples. Three input samples are processed at a time. 
Let x(n) be the current sample at discrete-time instant n. Among the two consecutive input samples, 
x(n+ 1) and x(n + 2), the one producing the highest slope (in magnitude) is retained and the other 
sample is dropped. In this way the overall sampling rate is reduced to one-half of the original sampling 
rate, no other coding is carried out. Therefore, the resulting CR is 2 to 1. A PRD of 5.3% is reported for 
an ECG signal sampled at 200 Hz with 12 bit/sample resolution [14]. In practice the CR value actually 
may be lower than 2 as the retained samples may not be equally spaced and some extra bits may be 
needed for timing determination. 

ECG Compression via Parameter Extraction 

In these methods, the signal is analyzed and some important features such as typical cycles, extreme 
locations, etc. are determined. These features are properly stored. Reconstruction is carried out by using 
appropriate interpolation schemes. 

The location of the extrema or peaks in an ECG signal is important in diagnosis because they basically 
determine the shape of each ECG period. The ECG compression techniques described in [15] take 
advantage of this feature and record only the maxima, minima, slope changes, zero-crossing intervals, 
etc. of the signal. During reconstruction various interpolation schemes such as polynomial fitting and 
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spline functions are used. The performance of the extrema based methods is compared to the AZTEC 
method and it was reported that for a given CR the RMS error is half of that of AZTEC method [15]. 

Other parameter extraction methods include [4, 11] where ECG signals are analyzed in a cycle- 
synchronous manner. An AR model is fitted to the MEG signal and parameters of the AR model are 
stored [16]. An ECG signal is modeled by splines and the data is compressed by storing the spline 
parameters [19]. 

Review of some other time-domain ECG data compression methods can be found in [3]. 

54.3 Frequency-Domain Data Compression Methods 

Transform Coding (TC) is the most important frequency- domain digital waveform compression method 
[2 ] . The key is to divide the signal into frequency components and judiciously allocate bits in the frequency 
domain. 

In most TC methods, the input signal is first divided into blocks of data and each block is linearly 
transformed into the "frequency" domain. Let x = [x x l . . .x N _J T be a vector obtained from a block of N 
input samples. The transform domain coefficients, v,, i— 0, 1, . .., N— 1, are given by 

v = Ax (54.3) 

where v = [v Vj... v N _ J T , and A is the N x N transform matrix representing the linear transform. A 
variety of transform matrices including the discrete Fourier transform matrix is used in digital waveform 
coding [2]. In this subsection discrete Karhunen-Loeve Transform (KLT) and Discrete Cosine Transform, 
which are the most used ones, are reviewed. 

If the input signal is a wide sense stationary random process then the so-called optimum linear 
transform, KLT is well defined and it decorrelates the entries of the input vector. This is equivalent to 
removing all the unnecessary information contained in the vector x. Therefore, by coding the entries of 
the v vector, only the useful information is retained. The entries of the v vector are quantized and stored. 
Usually, different quantizers are used to quantize the entries of the v vector. In general, more bits are 
allocated to those entries which have high energies compared to the ones with low energies. 

The KLT is constructed from the eigenvectors of the autocovariance matrix of the input vector x. In 
most practical waveforms the statistics of the input vector change from block to block. Thus for each 
block a new KLT matrix must be constructed. This is computationally very expensive (in some practical 
cases a fixed KLT matrix is estimated and it is assumed to be constant for a reasonable amount of 
duration). Furthermore, there is no fast algorithm similar to the Fast Fourier Transform (FFT) to compute 
the KLT. 

The Discrete Cosine Transform (DCT) [2] was developed by Ahmed et. al, to approximate KLT when 
there is high correlation among the input samples, which is the case in many digital waveforms including 
speech, music, and biomedical signals. 

The DCT v = [v v l ... v N _J T of the vector x is defined as follows 
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where v k is the fcth DCT coefficient. The inverse discrete cosine transform (IDCT) of v is given as follows 
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There exist fast algorithms, Order (Nlog N), to compute the DCT [2[. Thus, DCT can be implemented 
in a computationally efficient manner. Two recent image and video coding standards, JPEG and MPEG, 
use DCT as the main building block. 

A CR of 3 to 1 for single channel ECG is reported by using DCT and KLT based coders [8] . For multi- 
lead systems, two-dimensional (2-D) transform based methods can be used. A CR of 12 to 1 was reported 
[17] for a three-lead system by using a 2-D KLT. Recently, Philips [18] developed a new transform by 
using time-warped polynomials and obtained a CR of 26 to 9. In this case a DCT based coding procedure 
produced a CR of 24 to 3. 

Since signal recording conditions and noise levels vary from study to study, a thorough comparison 
of the coding methods is very difficult to make. But, frequency-domain coding methods produce higher 
coding results than time-domain coding methods. 

54.4 Wavelet or Subband Coding 

Wavelet Transform ( WT) is a powerful time-frequency signal analysis tool and it is used in a wide variety 
of applications including signal and image coding [20]. WT and Subband Coding (SBC) are closely 
related to each other. In fact the fast implementation of WTs are carried out using Subband (SB) filter 
banks. Due to this reason WT based waveform coding methods are essentially similar to the SBC based 
methods. 

In an SBC structure the basic building block is a digital filter bank which consists of a lowpass and a 
highpass filter. In the ideal case, the passband (stopband) of the lowpass filter is [0, 7i/2] [nil, n\. Let 
H,(e ;m )H u (e"°) be the frequency response of the lowpass (highpass) filter. In SBC the input signal which 
is sampled at the rate of f s is filtered by H i (e' a> ) and H u (e> m ) and the filter outputs are downsampled by a 
factor of two (every other sample is dropped) in order to maintain the overall sampling rate to be equal 
to the original rate. In this way two subsignals, x t {n) and x u (n) which both have a sampling rate of fJ2 
are obtained. The block diagram of a two-level SBC structure is shown in Fig. 54.1. 

The subsignal, x x (n) (x u (ri)), contains the lowpass (highpass) frequency domain information of the 
original signal. The subsignals, x^ri) and x u {n), can be further divided into their frequency components 
in a similar manner. This SB division process can be repeated until the frequency domain is divided in 
a sufficient manner. In [7] the two-level subband structure of Fig. 54.2 which divides the frequency 
domain into four regions, ( — , ( +1) " 1 , is used to code a single channel ECG signal. The resulting 

V 4 4 '1=0,1,2,3 -ll. • 

subsignals can be encoded by various coding methods including DPCM, Entropy coding, and transform 
coding which should be designed to exploit the special nature of the subsignals to achieve the best possible 
CR for each band. Recently, very successful special encoders which take advantage of the tree structure 
of the decomposition were developed for compressing the subsignals [26]. The signal reconstruction 
from the coded subsignals is carried out by using a filter bank consisting of two filters, G,(e' m J and G u (e> m ). 
In this case the subsignal, x t (n) [*„(«)] is first upsampled by inserting a zero between every other sample 
and filtered by G,(e' m ) [G„(e^°)]. The reconstructed signal, y(n), is the sum of the outputs of these two 
filters. 

By properly selecting the filters, H^z), H u (z), G t {z) and G u {z), perfect reconstruction is possible, i.e., 



{n) = x(n-K\ (54.7) 



>'! 

in the absence of quantization errors [20]. In other words, filtering and downsampling operations in the 
SB decomposition structure do not introduce any loss. 
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FIGURE 54.2 Two-level (four branch) subband coder structure. This structure divides the frequency domain into 
four regions. 

A CR 5.22 corresponding to a PRD of 5.94% is reported for an ECG signal sampled at 500 Hz with 
12 bit/sample resolution [7]. 

Other WT based ECG coding methods include [20-23]. 

54.5 Hybrid Multichannel ECG Coding 

In this subsection, a hybrid frequency domain multichannel compression method [24] for the so-called 
standard lead [27] ECG recording system is described. The block diagram of this system is shown in 
Fig. 54.3. The main idea is to exploit not only the correlation between the consecutive ECG samples, but 
also the inherent correlation among the ECG channels. 

The standard lead system has 12 ECG channels. In this method, the recorded digital signals are first 
passed through a preprocessor. The function of the preprocessor is to prepare raw ECG data for further 
processing. After preprocessing the input signals, the resulting discrete-time sequences are linearly trans- 
formed into another set of sequences. The aim of this linear transformation is to decorrelate the highly 
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FIGURE 54.3 Block Diagram of the Multichannel ECG data Compression Scheme. 
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correlated ECG lead signals. In a way, this idea is similar to representing the RGB color components of 
an image in terms of luminance and chrominance components. 

The transformation matrix, A, can be the matrix of the optimum transform, KLT, or the DCT matrix. 
Another approach is to use a nonlinear transform such as an Independent Component Analyzer (ICA) 
[25] . Lastly, to compress the transform domain signals, various coding schemes which exploit their special 
nature are utilized. 

In the following subsections, detailed descriptions of the sub-blocks of the multichannel ECG com- 
pression method are given. 

Preprocessor 

The standard lead ECG recording configuration consists of 12 ECG leads, I, II, III, AVR, AVL, AVF, VI, 
V2, ..., and V6. The leads, III, AVR, AVL, and AVF, are linearly related to I and II. Therefore, eight 
channels are enough to represent a standard 12-channel ECG recording system. 

The preprocessor discards the redundant channels, III, AVR, AVL, and AVF, and rearranges the order 
of the ECG channels. The six precordial (chest) leads, VI, ..., V6, represent variations of the electrical 
heart vector amplitude with respect to time from six different narrow angles. During a cardiac cycle it 
is natural to expect high correlation among precordial leads so the channels VI, ..., V6 are selected as 
the first 6 signals, i.e., X;_, = Vi, i = 1, 2, ..., 6. The two horizontal lead waveforms (I and II) which have 
relatively less energy contents with respect to precordial ECG lead waveforms are chosen as seventh, 
x 6 = I, and eighth channels, x 7 = II. A typical set of standard ECG lead waveforms, X t , i = 0, 1, ..., 7, are 
shown in Fig. 54.4. 

The aim of the reordering the ECG channels is to increase the efficiency of the linear transformation 
operation which is described in the next subsection. 

Linear Transformer 

The outputs of the preprocessor block, Xj, i = 0, 1, ..., 7, are fed to the linear transformer. In this block, 
the ECG channels are linearly transformed to another domain, and eight new transform domain signals 
y,, i = 0, 1, . . ., 7, are obtained which are significantly less correlated (ideally uncorrelated) than the ECG 
signal set, x„ i : = 0, 1, . . ., 7. The transform domain samples at discrete-time instant m are given as follows: 

Y =A-X (54.8) 

where Y m = [y (m),...,y N _ l {m)] T , X m = [x (m),..., x n _ l {rn)] T , and A is the N x N transform matrix. 

The optimum linear transform, discrete KLT, can be properly defined for stationary random processes 
and the entries of the tranform matrix, A KLT depending on the statistics of the random processes. For 
slowly varying, unstationary signals, an approximate KLT matrix can also be defined. Although ECG 
signals cannot be considered to be wide-sense stationary-random processes, a covariance matrix, C x , of 
the ECG channels is estimated as follows: 



C 

A M 



M-l 



(54.9) 



where N is the number of the ECG channels and M is the number of ECG samples per channel used. 
The N x N ECG channel covariance matrix, C x , is used in the construction of an approximate KLT 
matrix. Rows of the approximate KLT matrix are the eigenvectors of C x . Typical approximate KLT 
matrices can be found in [24]. 
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FIGURE 54.4 A typical set of standard ECG lead waveforms x- t , 



0. 1.....7. 



Although there is no fast algorithm to compute the KL transform, the computational burden is not 
high because N= 8. The DCT can also be used as a linear transformer because it approximates the KLT. 

Compression of the Transform Domain Signals 

In this subsection the compression of the uncorrelated transform domain signals y k , k= 0, 1, ..., 7, are 
described. In Fig. 54.5 a typical set of uncorrelated signals, y k , k= 0, 1, ..., 7, are shown. The signals in 
Fig. 54.5 are obtained by KL transforming the ECG signals, x k , k = 0, 1, ..., 7, shown in Fig. 54.4. 

Transform domain signals, y k , k = 0, \, ..., 7, are divided into two classes according to their energy 
contents. The first class of signals, y , y,, ..., y 4 , have higher energy than the second class of signals, y 5 , 
y 6 , and y 7 . More bits are allocated to the high energy signals, y , y x , ..., y A , compared to the low energy 
signals, y 5 , y 6 , and y 7 in coding. 

Subband Coder (SBC) 

Higher energy signals, y„(n), y 1 (ri),...,y i (ri), contain more information than the low energy signals, y 5 (n), 
y 6 (n), andy 7 (n). Therefore, the high energy signals, y„(n), ...,y 4 (n) should be compressed very accurately. 
The signals, y„{n), ..., y 4 («), are compressed using the SBC [7] because this coding scheme does not 
introduce any visual degradation as pointed out in the previous subsection (4). 
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FIGURE 54.5 Uncorrelated signals y { , i = 0, 1, 2,..., 7, corresponding to the ECG signals shown in Fig. 54.4. 

Each signal y i is decomposed into four subsignals by using a filter bank in a tree-structured manner. 
In this way the signal y t is divided into four consecutive bands, [In/ 4, (I + l)jt/4], I = 0, 1, 2, 3, in the 
frequency domain [2], For example, y m (the subsignal at branch A of Fig. 54.2) comes from the lowpass 
frequency band, [0, Jt/4], of the signal y- r In the coding of the subband signals, y lk , j = 0, 1; k = 0, 1, the 
advantage is taken of the nonuniform distribution of energy in the frequency domain to judiciously 
allocate the bits. The number of bits used to encode each frequency band can be different, so the encoding 
accuracy is always maintained at the required frequency bands [2]. 

It is observed that the energy of the signal y t is mainly concentrated in the lowest frequency band [0, 
7t/4]. Because of this, the lowband subsignal y ioo has to be carefully coded. High correlation among 
neighboring samples of y iQ0 makes this signal a good candidate for efficient predictive or transform 
coding. The subsignals y i00 are compressed using a DCT based scheme. After the application of DCT 
with a block size of 64 samples to the lowband subsignal, y iQ0 , the transform domain coefficients, g ioo {k), 
k = 0, 1, . . ., 63 are obtained and they are thresholded and quantized. The coefficients whose magnitudes 
are above a preselected threshold, p\ are retained and the other coefficients are discarded. Thresholded 
DCT coefficients are quantized for bit level representation. Thresholded and quantized nonzero coeffi- 
cients are variable-length coded by using an amplitude table and the zero values are runlength coded. 
The amplitude and runlength lookup tables are Huffman coding tables which are obtained according to 
the histograms of the DCT coefficients [24]. 

In practice, ECG recording levels do not change from one recording to another. If drastic variations 
occur, first scale the input by an appropriate factor, then apply DCT. 

Bandpass and highpass subsignals, y j0 v y t lo y ni (branches B, C, and D), are coded using non-uniform 
quantizers. After quantization, a code assignment procedure is realized using variable length amplitude 
and runlength lookup tables for zero values. The look up tables are obtained according to the histograms 
of quantized subband signals. 



) 2000 by CRC Press LLC 



«4AjJA^ 



*4A*4' / W 



^fTH+iJV^ 



*A^~Y^\, 



V1 



-|~~|— | v, ~H — ^ — ir 



V2 



— -U 




V2 



"~U 



0.00 



o.ao 
Time 



1 .60 

(sec 



2.40 



0.00 



r~ 

0.80 

T 



-r 



original 



1.60 2.40 

me (sec.) 

reconstructed 



FIGURE 54.6 The original and reconstructed ECG lead signals I, II, V„ and V 2 (CR = 6.17, APRD = 6). 

The bit streams which are obtained from coding of four subband signals are multiplexed and stored. 
Appropriate decoders are assigned to each branch to convert the bit streams into time domain samples 
and the four-branch synthesis filter bank performs the reconstruction [7]. 

The low energy signals, y 5 («), y 6 (")> y?( n )> are a l so coded by using the SB coder previously explained. 
However, it is observed that all the subsignals except the subsignal at branch A of Fig. 54.2 contain very 
little information when subband decomposition is applied to the signals y 5 (n), y 6 (n), y 7 (n). Due to this 
fact, only the subsignals at branch A (lowband signals) are processed. The other branches which have 
very little energy are discarded. 

Original and reconstructed ECG waveforms are shown in Fig. 54.6 for CR = 6.17 (CR = 7.98) with 
PRD = 6.19% when DCT (KLT) is used as the Linear Transformer. Recorded ECG signals are sampled 
at 500 Hz with 12 bit/sample resolution. Also, the raw ECG signals are filtered to attenuate the high 
frequency noise with a 33-tap equiripple Parks-McClellan FIR filter whose cutoff frequency is equal to 
125 Hz. In this case a CR = 9.41 is obtained for a PRD = 5.94%. 
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The effect of compressing the data on diagnostic computer analysis results is tested on Cardionics 
program, which is derived from the Mount Sinai program developed by Pordy et al., in conjunction with 
CRO-MED Bionics Company [10]. Morphological measurements include (1) intervals — PR, QRS, QT, 
(2) "width's— Q, R, S, R', S', T, P, (3) amplitudes— P, Q, R, R', S, S', T, JJ, and (4) areas of QRS and 
QRST. There was no difference in the measurement results of both the compressed and the original data. 

It is experimentally observed that the multichannel technique produces better compression results 
than single channel schemes. Also, the computational complexity of the multichannel scheme is compa- 
rable to single channel ECG coding schemes and the algorithm can be implemented by using digital 
signal processor for real-time applications, such as transmission of ECG signals over telephone lines. 

Conclusion 

In this section biomedical signal compression methods are reviewed. Most of the biomedical data 
compression methods have been developed for ECG signals. However, these methods can be applied to 
other biomedical signals with some modifications. 

It is difficult to compare the efficiency of biomedical data compression schemes because coding results 
of various data compression schemes are obtained under different recording conditions such as 
(1) sampling frequency, (2) bandwidth, (3) sample precision, and (4) noise level, which may drastically 
affect the currently used performance measures. 
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One-Dimensional Signal Representations 

Desirable Properties of Time-Frequency 

Representations 

TFR Classes 

Cohen's Class of TFRs • Affine Class of TFRs ■ Hyperbolic 

Class of TFRs ■ Kth Power Class 

Common TFRs and Their Use in Biomedical 

Applications 

Wigner Distribution • Smoothed Wigner Distributions • 

Spectrogram ■ Choi-Williams Exponential and Reduced 

Interference Distributions ■ Scalogram or Wavelet 

Transform Squared Magnitude • Biomedical Applications 



The Fourier transform of a signal x{t) 



X(f) = \x(t)e 



i 2 * f 'dt 



(55.1) 



is a useful tool for analyzing the spectral content of a stationary signal and for transforming difficult 
operations such as convolution, differentiation, and integration into very simple algebraic operations in 
the Fourier dual domain. The inverse Fourier transform equation 



,(t) = ^ X (f)e'^df 



(55.2) 



is a linear combination of complex sinusoids of infinite duration which implicitly assumes that each 
sinusoidal component is present at all times and, hence, that the spectral content of the signal does not 
change with time. However, many signals in bioengineering are produced by biologic systems whose 
spectral characteristics can change rapidly with time. To analyze these rapid spectral changes one needs 
a two-dimensional, mixed time-frequency signal representation (TFR) that is analogous to a musical 
score with time represented along one axis and frequency along the other, indicating which frequency 
components (notes) are present at each time instant. 

The first purpose of this chapter is to review several TFRs defined in Table 55.1 and to describe many 
of the desirable properties listed in Table 55.2 that an ideal TFR should satisfy. TFRs will be grouped into 
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TABLE 55.1 List of Time-Frequency Representations (TFR) of a Signal x(f) Whose Fourier Transform is X(f) 



Ackroyd distribution: 



Affine Wiener distribution: 



Altes-Marinovic distribution: 



(Narrowband) ambiguity function: AF 
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Butterworth distribution: 
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Cohen's nonnegative distribution: CND I 
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TABLE 55.1 (continued) List of Time-Frequency Representations (TFR) of a Signal x( i) Whose Fourier Transform is X(f) 



Cone-Kernel distribution: 



Cumulative attack spectrum: 



Cumulative decay spectrum: 



Flandrin D-distribution: 



Gabor expansion, Ge x (n, k; 



Generalized Altes distribution: 
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GAM x (t,f;a) = \f\je- a "X^fe 
Generalized exponential distribution: GED\t,f) — exp 



Generalized Wigner distribution: 
Hyperbolic ambiguity function: 
Hyperbolic wavelet transform: 

Hyperbologram: 

Levin distribution: 
Margineau-Hill distribution: 
Multiform, Tiltable distributions: 

Butterworth: 
Exponential: 



\X*\ fe {l ' \e' lK ''" du 

AF x {x,v)e' 2 " { ' v ' fi) dxdv 
\ "o 



2M / \2N 



GWD, 



I x\ t+\ — +a \x 



M«)=J 

(W) = fx(0*)x*(fe-Vy^ f/f 'hf 



i-\--d\i\e- l2 ' f 'dx 



HAF, 



HWTit, f;T = \±- X(^)r*\ ±-\ Y"""'^ ''>% 



'if 



j>: 



LCYlAriLAs^ifr) 



HYP it, 



x (t,f) = Re{ X (t)x*(f)e-w} 
Let wx, v; a, r, p, y 



f 



A LA e ' lK " lal fi f ') 



d\ = Ihwt (f, /; r)[ 



LD 



MH 



if (v) 1 ) + (x) 2 ) (vf+2r||ir| 



MTBD. 



*4-JJ 



1-lTM— ,— ;a,r,p\y 



AF 



i{x,v)e' 2 < K ^ 



dxdv 



MTED x (t,f) = [[exp] -Jtft 2X — , — ; a, r, P, y I AF (t, vj/'^'rftdv 



& 2000 by CRC Press LLC 



TABLE 55.1 (continued) List of Time-Frequency Representations (TFR) of a Signal x( f) Whose Fourier Transform is X(f) 



(Inverse) Chebyshev: 



Page distribution: 
Kth power 

Ambiguity function: 

Central member: 



Power spectral density: 



Pseudo-Altes distribution: 



Pseudo-Wigner distribution: 
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TABLE 55.1 (continued) List of Time-Frequency Representations (TFR) of a Signal x( f) Whose Fourier Transform is X(f) 



Rihaczek distribution: 



Running spectrum, past and future: 
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Smoothed Pseudo-Wigner distribution: SPWD (f, /; y, T|]= yl f — f')r| — T|* — 
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Spectrogram: 

Unterberger active distribution: 

Unterberger passive distribution: 



SP£C f , 



UADJt, 



k-fe""""* 



STFT U,/;Y 



c/h 



L/HX 



Wavelet transform: 

Wideband ambiguity function: 
Wigner distribution: 



WT 



WAR 



(^»T)=J^)Jfr{f(,-r)]*=H/')Jfr.fA/'W4r 



u u 






classes satisfying similar properties to provide a more intuitive understanding of their similarities, advan- 
tages, and disadvantages. Further, each TFR within a given class is completely characterized by a unique 
set of kernels that provide valuable insight into whether or not a given TFR (l) satisfies other ideal TFR 
properties, (2) is easy to compute, and (3) reduces nonlinear cross-terms. The second goal of this chapter 
is to discuss applications of TFRs to signal analysis and detection problems in bioengineering. Unfortu- 
nately, none of the current TFRs is ideal; some give erroneous information when the signal's spectra are 
rapidly time-varying. Researchers often analyze several TFRs side by side, keeping in mind the relative 
strengths and weaknesses of each TFR before drawing any conclusions. 

55. 1 One-Dimensional Signal Representations 

The instantaneous frequency and the group delay of a signal are one-dimensional representations that 
attempt to represent temporal and spectral signal characteristics simultaneously. The instantaneous 
frequency of the signal 



© 2000 by CRC Press LLC 



a 
a 

-:: 
Cn 



a 


U 














■■ 


O 





Xi 


U 


» 


-7- 


u- 



o 3 

O -a 

— o 

u u 





s 


CN 


un 


£ 


in 




W 


■M 


p-i 


<D 


ca 


a 


< 




H 


cl, 



I 

II 






< 



II 



a? 



EC 


"3" 


II 


3 


CO. 


.c 



< 



u 

,0 



e-t 

n 



I 



e-t 

n 



to 
"5T 



_ ij ^ 

CO. 



J§L. 
'3T 



H 
I 






I 

E-T 



II 



-tr 



±* 


X 


1-1 


1-1 






H 


^ 



II 



u 



3 

'_> 

c 
c 

id 
o 

2 



CO. 

9- 

II 

"3T 

l 



ca 

l 



or 

ii 
"3? 



9- 

II 



I 

H 



9? 



Eh 

II 



II 



II 
^3 




OS 








a 




3 


= 




3 


u 


e 




<u 


3 


e 


'.< 






h 


VU 









O 

II 
























o 
























Ki 
















































CO. 
























x_p 
















































a: 
























9< 




















-| CM 




«|s- 




















A 




T3 






















^ | s^P 




fl 
























ra 




















o 




i-H 






._ i 






- — i 








II 




II 






II 






ii 
































l~-r 




O 






o 






GO. 








<S 




yj> 






s-J 1 






O 
































a: 




X 






ta 






I 











o 
II 


o 

II 

9- u 






* 






3- 


' - >~~' 

H 


'V 




■—i | (N 






>. * 














t-» 


A 


A 


II 












9^° 

II 


9^ 
ii 


*H * 


O 


© 
II 


II 


II 


II 


IJ 




II 


II 


7 




> f V» 


i 




""^ 






















' H " 


^ 


O 


?r 




o 


5> 


> 




1 


> 


^ 


^ 


o 


e 


^_^ 




^ 


O 


e r 




H 


e r 




























u 


u 


_ u 


u 




_u 


^J 






w U 




9- 





* 


9< 


9< 




9h 


3- 


9- 


o 

A 


9- 


9- 


























.tf 


<*< 
















i': 






























v ^ . 


tu 
















*} 


^ — cT 




tu 


<*-, 
















H K 


H 




^* 


S-H 










^ 






p' 


1 


"a 




o 










•*< 


. . 




ii 


: ™5T < 


^ 


o 


-ir 


^T^ 




""ST 




i 
"to' 


1 

"Co" 




' t ^T 

^4H 


, ll , 


II 




J^-. 


00 


be 






•*-, 


"l1-~ 


**-, 


o 
A 


,_o 


^ 


^ n 


<*? 


-ri 


"« ^ 


l-H 




X 


-^ 


^f 


'^ 


'^r 


,c 


tu 


tu 


^H 1 K 

1 


-H 1 K 

1 












u | ^ 
1 


i 


^1 
1 


<S 
o 

II 


u 

,_o 
o 

II 


^ 
■*-, 


^ 
<<-, 






1 

ti" 

II 




i 

II 


1 

"tS" 

II 


"to' 

II 


to 

ii | **-. 
II 


n 


t-i" 
II 






























<*-, 


<*-, 


• — • 


, ^ . 


- "^ - 




■*-. 




^ 


"*-, 


>*-, 


"*-, 


■*-, 


>*-, 


.. ^ . 


. ^ , 


^ 


h* 


fa* 


h* 


*f 




*•" 


*•" 


*r 


*-" 


*f 


*f 


E-H* 


f-l* 


' ' 


' 


' ' 


' ' 


ti*' 




h" 


ti" 


hn" 


h" 


ti* - 


h^ 




i- 


>. 












a 








O 

ft 

ft 


c 

O. 
P 
3 


c 

3 

cr 




6 

l-i 

<-2 




a 
o 


a 
o 


o 

U 

o 


a 
.9 

N 

u 


a 
o 

J3 

"3 

S 
c 

Q 


a 
.o 


3 

1 


a 

u 
3 

■ i 


3 

o 
a 




in 

a 

S-. 

i-i 




a 


U 

_o 


a. 

l-H 


_o 
'-> 
"c 


3 


"3 


'3 


a 

v" 


p 

3 
O 
■— 


'l-l 

O 
ft 




3 


E 


1-H 

a 


Pi 


ft 

IS 


IS 


ft 


iZ 


£ 


O 




P 


j 


I 


u 


u 


"If 


•^ 


*i> 


•?, 


'ia 




*^ 


'i 


•^ 


'm 


*m 


•^. 


Oh" 


n| 


0| 


a| 


ctT 




0h~ 


oT 


qT 


cC' 


o; 1 


oT 



II 



has been used in communication theory to characterize the time-varying frequency content of narrow- 
band, frequency-modulated signals. It is a generalization of the fact that the frequency f of a complex 
sinusoidal signal x(t) = exp{j2Kf t) is proportional to the derivative of the signal's phase. A dual concept 
used in filter analysis is the group delay 



■M~=s-W)} 



which can be interpreted as the time delay or distortion introduced by the filter's frequency response 
H(f) at each frequency. Group delay is a generalization of the fact that time translations are coded in 
the derivative of the phase of the Fourier transform. Unfortunately, if the signal contains several signal 
components that overlap in time or frequency, then^(f) or x H (f) only provides average spectral charac- 
teristics, which are not very useful. 

55.2 Desirable Properties of Time-Frequency Representations 

Mixed time-frequency representations (TFRs) map a one-dimensional signal into a two-dimensional 
function of time and frequency in order to analyze the time-varying spectral content of the signal. Before 
discussing any particular TFR in Table 55.1, it is helpful to first investigate what types of properties an 
"ideal" time-frequency representation should satisfy. The list of desirable TFR properties in Table 55.2 
can be broken up conceptually into the following categories: covariance, statistical, signal analysis, 
localization, and inner products [Boashash, 1991; Claasen & Mecklenbrauker, 1980; Cohen, 1989; Flan- 
drin, 1993; Hlawatsch & Boudreaux-Bartels, 1992]. The covariance properties P l to P 6 basically state that 
certain operations on the signal, such as translations, dilations, or convolution, should be preserved, i.e., 
produce exactly the same operation on the signal's TFR. The second category of properties originates 
from the desire to generalize the concepts of the one-dimensional instantaneous signal energy |x(f)| 2 and 
power spectral density \x(f) \ 2 into a two-dimensional statistical energy distribution T x (t , f ) that provides 
a measure of the local signal energy or the probability that a signal contains a sinusoidal component of 
frequency j„ at time t . Properties P 7 to P 13 state that such an energy-distribution TFR should be real and 
nonnegative, have its marginal distributions equal to the signal's temporal and spectral energy densities 
\x(t)\ 2 and \X{f)\ 2 , respectively, and preserve the signal energy, mean, variance, and other higher-order 
moments of the instantaneous signal energy and power spectral density. The next category of properties, 
P i4 to P 18 , arises from signal-processing considerations. A TFR should have the same duration and 
bandwidth as the signal under analysis. At any given time t, the average frequency should equal the 
instantaneous frequency of the signal, while the average or center of gravity in the time direction should 
equal the group delay of the signal. These two properties have been used to analyze the distortion of 
audio systems and the complex FM sonar signals used by bats and whales for echolocation. Property P 18 
is the TFR equivalent of the duality property of Fourier transforms. The group of properties P 19 to P 24 
constitutes ideal TFR localization properties that are desirable for high resolution capabilities. Here, 8(a) 
is the Dirac function. These properties state that if a signal is perfectly concentrated in time or frequency, 
i.e., an impulse or a sinusoid, then its TFR also should be perfectly concentrated at the same time or 
frequency. Properties P 21 and P 22 state that the TFRs of linear or hyperbolic spectral FM chirp signals 
should be perfectly concentrated along the chirp signal's group delay. Property P 24 states that a signal 
modulated by a linear FM chirp should have a TFR whose instantaneous frequency has been sheared by 
an amount equal to the linear instantaneous frequency of the chirp. The last property, known as Moyal's 
formula or the unitarity property, states that TFRs should preserve the signal projections, inner products, 
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and orthonormal signal basis functions that are used frequently in signal detection, synthesis, and 
approximation theory. Table 55.3 indicates which properties are satisfied by the TFRs listed in Table 55.1. 
A TFR should be relatively easy to compute and interpret. Interpretation is greatly simplified if the 
TFR is linear, i.e., 



T k> /)=XX( f > f) fOT >W=X>»( f ) 



(55.5) 



However, energy is a quadratic function of the signal, and hence so too are many of the TFRs in Table 55.1. 
The nonlinear nature of TFRs gives rise to troublesome cross-terms. If y(i) contains N signal components 
or auto-terms x„(t) in Eq. (55.5), then a quadratic TFR of y{t) can have as many nonzero cross-terms as 
there are unique pairs of autoterms, i.e., N(N — l)/2. For many TFRs, these cross-terms are oscillatory 
and overlap with autoterms, obscuring visual analysis of TFRs. 

Two common methods used to reduce the number of cross-terms are to reduce any redundancy in 
the signal representation and to use local smoothing or averaging to reduce oscillating cross-terms. TFR 
analysis of real, bandpass signals should be carried out using the analytic signal representation, i.e., the 
signal added to J-i times its Hilbert transform, in order to remove cross-terms between the positive- 
and negative-frequency axis components of the signal's Fourier transform. As we will see in upcoming 
sections, cross-term reduction by smoothing is often achieved at the expense of significant autoterm 
distortion and loss of desirable TFR properties. 

55.3 TFR Classes 

This section will briefly review Cohen's class of shift covariant TFRs, the Affine class of affine covariant 
TFRs, the Hyperbolic class (developed for signals with hyperbolic group delay), and the Power class 
(which is useful for signals with polynomial group delay). Each class is formed by grouping TFRs that 
satisfy two properties. They provide very helpful insight as to which types of TFRs will work best in 
different situations. Within a class, each TFR is completely characterized by a unique set of TFR-dependent 
kernels which can be compared against a class-dependent list of kernel constraints in Tables 55.2 and 
55.6 to quickly determine which properties the TFR satisfies. 

Cohen's Class of TFRs 

Cohen's class consists of all quadratic TFRs that satisfy the frequency-shift and time-shift covariance 
properties, i.e., those TFRs with a check in the first two property rows in Table 55.3 [Claasen & Meck- 
lenbrauker, 1980; Cohen, 1989; Flandrin, 1993; Hlawatsch & Boudreaux-Bartels, 1992]. Time- and 
frequency-shift covariances are very useful properties in the analysis of speech, narrowband Doppler 
systems, and multipath environments. Any TFR in Cohen's class can be written in one of the four 
equivalent "normal forms": 



C x (t,f;V c ) = ^<p c (t-t',x). 



( \ 

, x 

t +- 

V 2 . 



f \ 

, X 

t — 

V 2 , 



e~' 2Kfi dt'dx 



(55.6) 
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e' 2mv df'dv 



(55.7) 
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= IT V c (t- f, f- f ) WD x (t', /') dt'd/' (55.8) 

= J]V c (t, v)aF x (x, vj^^Wdv. (55.9) 

Each normal form is characterized by one of the four kernels cp c (f, t), ^(/j v), \|/ c (r, /), and v P c (t, v) 
which are interrelated by the following Fourier transforms: 

<p c (t,x)= |T* C (/, v)e fl ^ fi+ ^dfdv= f ¥ c (x, v) e ' 2 "^v (55.10) 

Yc(*> /)= f f T c( x ' v)e i2% ^'^(kdv = f<D c (/, v)e i2nv 'dv. (55.11) 

The kernels for the TFRs in Cohen's class are given in Table 55.4. 

The four normal forms offers various computational and analysis advantages. For example, the first 
two normal forms can be computed directly from the signal x(t) or its Fourier transform X(f) via a one- 
dimensional convolution with cp c (t, t) or O c (f, v). If (p c (f, x) is of fairly short duration, then it may be 
possible to implement Eq. (55.6) on a digital computer in real time using only a small number of signal 
samples. The third normal form indicates that any TFR in Cohen's shift covariant class can be computed 
by convolving the TFR-dependent kernel \\i c ( t, f) with the Wigner distribution (WD) of the signal, defined 
in Table 55.1. Hence the WD is one of the key members of Cohen's class, and many TFRs correspond to 
smoothed WDs, as can be seen in the top of Table 55.5. Equation (55.11) and the fourth normal form 
in Eq. (55.9) indicate that the two-dimensional convolution in Eq. (55.8) transforms to multiplication 
of the Fourier transform of the kernel v|/ c (t, f) with the Fourier transform of the WD, which is the 
ambiguity function (AF) in Table 55.1. This last normal form provides an intuitive interpretation that 
the "AF domain" kernel ^P^T, v) can be thought of as the frequency response of a two-dimensional filter. 

The kernels in Eqs. (55.6) to (55.11) are signal-independent and provide valuable insight into the 
performance of each Cohen class TFR, regardless of the input signal. For good cross-term reduction and 
little autoterm distortion, each TFR kernel x l / c (x, v) given in Table 55.4 should be as close as possible to 
an ideal low-pass filter. If these kernels satisfy the constraints in the third column of Table 55.2, then the 
TFR properties in the first column are guaranteed to always hold [Claasen & Mecklenbrauker, 1980; 
Hlawatsch & Boudreaux-Bartels, 1992]. For example, the last row of Table 55.2 indicates that Moyal's 
formula is satisfied by any TFR whose AF domain kernel, listed in the third column of Table 55.4, has 
unit modulus, e.g., the Rihaczek distribution. Since the AF domain kernel of the WD is equal to 1, i.e., 
x P WD (t, v) = 1, then the WD automatically satisfies the kernel constraints in Table 55.2 for properties P 9 
to P 13 and P 16 to P n as well as Moyal's formula. However, it also acts as an all-pass filter, passing all cross- 
terms. The Choi- Williams Gaussian kernel in Table 55.4 was formulated to satisfy the marginal property 
constraints of having an AF domain kernel equal to 1 along the axes and to be a low-pass filter that 
reduces cross-terms. 

Affine Class of TFRs 

TFRs that are covariant to scale changes and time translations, i.e., properties P 2 and P 3 in Tables 55.2 
and 55.3, are members of the Affine class [Bertrand chapter in Boashash, 1991; Flandrin, 1993]. 

The scale covariance property P 3 is useful when analyzing wideband Doppler systems, signals with 
fractal structure, octave-band systems such as the cochlea of the inner ear, and detecting short-duration 
"transients." Any Affine class TFR can be written in four "normal form" equations similar to those of 
Cohen's class: 
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TABLE 55.4 Kernels of Cohen's Shift-Invariant Class of Time-Frequency Representations (TFR) 
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TABLE 55.4 (continued) Kernels of Cohen's Shift- Invariant Class of Time-Frequency Representations (TFR) 



TFR 
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Here, ufj, v; a, r, p, y) = ((r) 2 (fv) 2 )™ + (fj) 2 ) a (v) + 2r((r v)P) T ) and Q(a) is a Chebyshev polynomial of order A.. Functions 
with lowercase and uppercase letters, e.g. y(f) and r(/), indicate Fourier transform pairs. 



A x (tJ;y A ) = \f\\U A (f(f-t) y j%}x(t+T/2)x*(t-x/2)dt'ck (55.12) 

= ^®i^,j)x(f' + v/2)x*(f'-v/2)e i2 ™df'dv (55.13) 



{{vJ/f^yV^'^l^ 



JM/*.^ AF x {x,v)e j2m dx 



dv. 



(55.14) 



(55.15) 



The Affine class kernels are interrelated by the same Fourier transforms given in Eqs. (55.10) and (55.11). 
Note that the third normal form of the Affine class involves an Affine smoothing of the WD. Well-known 
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TABLE 55.5 Many TFRs Are Equivalent to Smoothed or Warped Wigner Distributions 
TFR Name TFR formulation 



Cohen's class TFR 


C,[t, /: V c ) = JJvc(f-f. /- f)WD*(f. f')dt'df 
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Spectrogram 
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Where / r >0 is a positive reference frequency, (WH)(/) = A; , e /// ' nlf r e f/f '),{°W K H^/j = \^fJ ^ '""I H j/ r sgn(/)|/// r 



^/)={i 



,1 ,1. f>0 
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members of the Affine class are the Bertrands' P distribution, the scalogram, and the Unterberger 
distributions. All are defined in Table 55.1, and their kernel forms and TFR property constraints are listed 
in Table 55.6. Because of the scale covariance property, many TFRs in the Affine class exhibit constant-Q 
behavior, permitting multiresolution analysis. 

Hyperbolic Class of TFRs 

The Hyperbolic class of TFRs consists of all TFRs that are covariant to scale changes and hyperbolic time 
shifts, i.e., properties P 3 and P 4 in Table 55.2 [Papandreou et al„ 1993]. They can be analyzed using the 
following four normal forms: 

«x(^/;n)=JJ9„(ff- c ,c)^,( c ^K ;2,l|ln(/// * dc ^ (55 - 16 > 

= ff*Jln^-fc,p f r e h x{f/^ 2 )x*{f r e h -*l 2 )e> 2 *>®dbd$ (55.17) 
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TABLE 55.6 Affine Class Kernels and Constraints 
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AAfJf', f^dt'df 



(55.18) 



=J|w h (c.p)haf4c,p), 



/2Jir«fl3-[iix(/// r )?] 



dt^p 



(55.19) 



where AM x (r, /) is the Altes distribution and HAF X {C,, (3) is the hyperbolic ambiguity function defined 
in Table 55.1, v x (c, Q is defined in Table 55.7, (W X){f) = JJ^f X{f r ef'f') is a unitary warping on the 
frequency axis of the signal, and the kernels are interrelated via the Fourier transforms in Eqs. (55.10) 
and (55.11). 

Tables 55.3 reveals that the Altes-Marinovic, the Bertrands' P , and the hyperbologram distributions 
are members of the Hyperbolic class. Their kernels are given in Table 55.7, and kernel property constraints 
are given in Table 55.2. The hyperbolic TFRs give highly concentrated TFR representations for signals with 
hyperbolic group delay. Each Hyperbolic class TFR, kernel, and property corresponds to a warped version 
of a Cohen's class TFR, kernel, and property, respectively. For example, Table 55.5 shows that the Altes 
distribution is equal to the WD after both the signal and the time-frequency axes are warped appropriately. 
The WD's perfect location of linear FM chirps (P 71 ) corresponds to the Altes distribution's perfect local- 
ization for hyperbolic FM chirps (P 22 )- This one-to-one correspondence between the Cohen and Hyper- 
bolic classes greatly facilitates their analysis and gives alternative methods for calculating various TFRs. 

Kth Power Class 

The Power class of TFRs consists of all TFRs that are scale covariant and power time-shift covariant, i.e., 



p6;\t, f )=p6; 



t-cf^(f/f r ),f\ forY(f) = e-^ /fi h(f) 



(55.20) 



TABLE 55.7 Kernels of the Hyperbolic Class of Time-Frequency Representations 
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where C, K (f) = sgn(/)|/| K , for k # [Hlawatsch et al., 1993]. Consequently, the Kth Power class perfectly 
represents group delay changes in the signal that are powers of frequency. When k = 1, the Power class 
is equivalent to the Affine class. The central member, AM^ K) in Table 55.1, is the Power class equivalent 
to the Altes-Marinovic distribution. 

55.4 Common TFRs and Their Use 
in Biomedical Applications 

This section will briefly review some of the TFRs commonly used in biomedical analysis and summarize 
their relative advantages and disadvantages. 

Wigner Distribution 

One of the oldest TFRs in Table 55.1 is the Wigner distribution (WD), which Wigner proposed in 
quantum mechanics as a two-dimensional statistical distribution relating the Fourier transform pairs of 
position and momentum of a particle. Table 55.3 reveals that the WD satisfies a large number of desirable 
TFR properties, P l to P v P 5 to P 7 , P 9 to P 2V and P 23 and P 25 . It is a member of both the Cohen and the 
Affine classes. The WD is a high-resolution TFR for linear FM chirps, sinusoids, and impulses. Since the 
WD satisfies Moyal's formula, it has been used to design optimal signal-detection and synthesis algo- 
rithms. The drawbacks of the WD are that it can be negative, it requires the signal to be known for all 
time, and it is a quadratic TFR with no implicit smoothing to remove cross-terms. 

Smoothed Wigner Distributions 

Many TFRs are related to the WD by either smoothing or a warping, e.g., see Eqs. (55.8) and (55.18) 
and Table 55.5. An intuitive understanding of the effects of cross-terms on quadratic TFRs can be obtained 
by analyzing the WD of a multicomponent signal y{ t) in Eq. (55.5) under the assumption that each signal 
component is a shifted version of a basic envelope, i.e., X„(t) = x{t— t^e'^f"*: 



IV 

N-l N 

+ 2 E E wD r{ t -w /-/M) cos ( 27r [ A -U f - Af M) ( 55 - 21 ) 

k=l q=k+l 

-A^(/-A/ M ) + A/ M ArJ) 



where Af k = f k — f is the difference or "beat" frequency and f kq = (f k + f q )/2 is the average frequency 
between the Ath and qth signal components. Similarly, At k is the difference time and t k is the average 
time. The auto-WD terms in the first summation properly reflect the fact that the WD is a member of 
Cohen's shift covariant class. Unfortunately, the cross-WD terms in the second summation occur midway 
in the time- frequency plane between each pair of signal components and oscillate with a spatial frequency 
proportional to the distance between them. The Pseudo-WD (PWD) and the smoothed Pseudo-WD 
(SPWD) defined in Table 55.1 use low-pass smoothing windows r|(t) and y(f) to reduce oscillatory cross- 
components. However, Table 55.5 reveals that the Pseudo-WD performs smoothly only in the frequency 
direction. Short smoothing windows greatly reduce the limit of integration in the Pseudo-WD and SPWD 
formulations and hence reduce computation time. However, Table 55.3 reveals that smoothing the WD 
reduces the number of desirable properties it satisfies from 18 to 7 for the Pseudo-WD and to only 3 for 
the SPWD. 
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Spectrogram 

One of the most commonly used TFRs for slowly time-varying or quasi-stationary signals is the spec- 
trogram, defined in Tables 55.1 and 55.5 [Rabiner & Schafer, 1978]. It is equal to the squared magnitude 
of the short-time Fourier transform, performing a local or "short-time" Fourier analysis by using a sliding 
analysis window y(f) to segment the signal into short sections centered near the output time t before 
computing a Fourier transformation. The spectrogram is easy to compute, using either FFTs or a parallel 
bank of filters, and it is often easy to interpret. The quadratic spectrogram smooths away all cross-terms 
except those which occur when two signal components overlap. This smoothing also distorts auto-terms. 
The spectrogram does a poor job representing rapidly changing spectral characteristics or resolving two 
closely spaced components because there is an inherent tradeoff between good time resolution, which 
requires a short analysis window, and good frequency resolution, which requires a long analysis window. 
The spectrogram satisfies only three TFR properties listed in Tables 55.2 and 55.3; i.e., it is a nonnegative 
number of Cohen's shift invariant class. 

Choi-Williams Exponential and Reduced Interference Distributions 

The Choi- Williams exponential distribution (CWD) and the reduced interference distribution (RID) in 
Table 55.1 are often used as a compromise between the high-resolution but cluttered WD versus the 
smeared but easy to interpret spectrogram [Jeong & Williams, 1992; Williams & Jeong, 1991]. Since they 
are members of both Cohen's class and the Affine class, their AF domain kernels in Table 55.4 have a 
very special form, i.e., ^(x, v) = S c {iv), called a product kernel, which is a one-dimensional kernel 
evaluated at the product of its time-frequency variables [Hlawatsch & Boudreaux-Bartels, 1992]. The 
CWD uses a Gaussian product kernel in the AF plane to reduce cross-terms, while the RID typically uses 
a classic window function that is time-limited and normalized to automatically satisfy many desirable 
TFR properties (see Table 55.3). The CWD has one scaling factor <r that allows the user to select either 
good cross-term reduction or good auto-term preservation but, unfortunately, not always both. The 
generalized exponential distribution in Table 55.1 is an extension of the CWD that permits both [Hla- 
watsch & Boudreaux-Bartels, 1992]. Because the CWD and RID product kernels have hyperbolic isoc- 
ontours in the AF plane, they always pass cross-terms between signal components that occur at either 
the same time or frequency, and they can distort auto-terms of linear FM chirp signals whose instanta- 
neous frequency has a slope close to 1. The multiform tiltable exponential distribution (MTED) [Costa 
and Boudreaux-Bartels, 1994], another extension of the CWD, works well for any linear FM chirp. 

Scalogram or Wavelet Transform Squared Magnitude 

The scalogram [Flandrin, 1993], defined in Tables 55.1 and 55.5, is the squared magnitude of the recently 
introduced wavelet transform (WT) [Daubechies, 1992; Meyer, 1993] and is a member of the Affine class. 
It uses a special sliding analysis window y(t), called the mother wavelet, to analyze local spectral infor- 
mation of the signal x(t). The mother wavelet is either compressed or dilated to give a multiresolution 
signal representation. The scalogram can be thought of as the multiresolution output of a parallel bank 
of octave band-filters. High-frequency regions of the WT domain have very good time resolution, whereas 
low- frequency regions of the WT domain have very good spectral resolution. The WT has been used to 
model the middle- to high-frequency range operation of the cochlea, to track transients such as speech 
pitch and the onset of the QRS complex in ECG signals, and to analyze fractal and chaotic signals. One 
drawback of the scalogram is its poor temporal resolution at low- frequency regions of the time -frequency 
plane and poor spectral resolution at high frequencies. Moreover, many "classic" windows do not satisfy 
the conditions needed for a mother wavelet. The scalogram cannot remove cross-terms when signal 
components overlap. Further, many discrete WT implementations do not preserve the important time- 
shift covariance property. 
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Biomedical Applications 

The electrocardiogram (ECG) signal is a recording of the time-varying electric rhythm of the heart. The 
short-duration QRS complex is the most predominant feature of the normal ECG signal. Abnormal heart 
rhythms can be identified on the ECG by detecting the QRS complex from one cycle to the next. The 
transient detection capability of the wavelet transform (WT) has been exploited for detection of the QRS 
complex by Kadambe et al. [1992] and Li and Zheng [1993]. The WT exhibits local maxima which align 
across successive (dyadic) scales at the location of transient components, such as QRS complexes. The 
advantage of using the WT is that it is robust both to noise and to nonstationarities in the QRS complex. 

Other pathologic features in the heart's electrical rhythm that appear only in high-resolution signal- 
averaged ECG signals are ventricular late potentials (VLPs). VLPs are small-amplitude, short-duration 
components that occur after the QRS complex and are precursors of dangerous, life-threatening cardiac 
arrhythmias. Tuteur [1989] used the peak of the WT at a fixed scale to identify simulated VLPs. More 
recently, Jones et al. [1992] compared different time-frequency techniques, such as the spectrogram, 
short-time spectral estimators, the smoothed WD, and the WT, in their ability to discriminate between 
normal patients and patients susceptible to dangerous arrhythmias. Morlet et al. [1993] used the transient 
detection capability of the WT to identify VLPs. 

The WT has also been applied to the ECG signal in the context of ECG analysis and compression by 
Crowe et al. [1992]. Furthermore, Crowe et al. [1992] exploited the capability of the WT to analyze 
fractal-like signals to study heart rate variability (HRV) data, which have been described as having fractal- 
like properties. 

The recording of heart sounds, or phonocardiogram (PCG) signal, has been analyzed using many time- 
frequency techniques. Bulgrin et al. [1993] compared the short- time Fourier transform and the WT for the 
analysis of abnormal PCGs. Picard et al. [1991] analyzed the sounds produced by different prosthetic valves 
using the spectrogram. The binomial RID, which is a fast approximation to the CWD, was used to analyze 
the short-time, narrow-bandwidth features of first heart sound in mongrel dogs by Wood etal. [1992]. 

TFRs have also been applied to nonstationary brain wave signals, including the electrocardiogram 
(EEG), the electrocorticogram (ECoG), and evoked potentials (EPs). Zaveri et al. [1992] used the spec- 
trogram, the WD, and the CWD to characterize the nonstationary behavior of the ECoG of epileptic 
patients. Of the three techniques, the CWD exhibited superior results. The WT was used to identify the 
onset of epileptic seizures in the EEG by Schiff and Milton [ 1993] , to extract a single EP by Bartnik et al. 
[1992], and to characterize changes in somatosensory EPs due to brain injury caused by oxygen depri- 
vation by Thakor et al. [1993]. 

Crackles are lung sounds indicative of pathologic conditions. Verreault [1989] used AR models of slices 
of the WD to discriminate crackles from normal lung sounds. 

The electrogastrogram (EGG) is a noninvasive measure of the time-varying electrical activity of the 
stomach. Promising results regarding abnormal EGG rhythms and the frequency of the EGG slow wave 
were obtained using the CWD by Lin and Chen [1994]. 

Widmalm et al. [1991] analyzed temporomandibular joint (TMI) clicking using the spectrogram, the 
WD, and the RID. The RID allowed for better time-frequency resolution of the TMJ sounds than the 
spectrogram while reducing the cross-terms associated with the WD. TMJ signals were also modeled 
using nonorthogonal Gabor logons by Brown etal. [1994]. The primary advantage of this technique, 
which optimizes the location and support of each Gabor log-on, is that only a few such logons were 
needed to represent the TMJ clicks. 

Auditory applications of TFRs are intuitively appealing because the cochlea exhibits constant-band- 
width behavior at low frequencies and constant-Q behavior at middle to high frequencies. Applications 
include a wavelet-based model of the early stages of acoustic signal processing in the auditory system 
[Yang etal., 1992], a comparison of the WD and Rihaczek distribution on the response of auditory 
neurons to wideband noise stimulation [Eggermont & Smith, 1990], and spectrotemporal analysis of 
dorsal cochlear neurons in the guinea pig [Backoff & Clopton, 1992]. 
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The importance of mammography, x-ray examination of the breast, lies in the early identification of 
tumors. Kaewlium and Longbotham [1993] used the spectrogram with a Gabor window as a texture 
discriminator to identify breast masses. Recently, a mammographic feature-enhancement technique using 
the WT was proposed by Laine et al. [1993]. The wavelet coefficients of the image are modified and then 
reconstructed to the desired resolution. This technique enhanced the visualization of mammographic 
features of interest without additional cost or radiation. 

Magnetic resonance imaging (MRI) allows for the imaging of the soft tissues in the body. Weaver et al. 
[1992] reduced the long processing time of traditional phase encoding of MRI images by WT encoding. 
Moreover, unlike phase-encoded images, Gibb's ringing phenomena and motion artifacts are localized 
in WT encoded images. 

The Doppler ultrasound signal is the reflection of an ultrasonic beam due to moving red blood cells 
and provides information regarding blood vessels and heart chambers. Doppler ultrasound signals in 
patients with narrowing of the aortic valve were analyzed using the spectrogram by Cloutier et al. [1991]. 
Guo and colleagues [1994] examined and compared the application of five different time-frequency 
representations (the spectrogram, short-time AR model, CWD, RID, and Bessel distributions) with 
simulated Doppler ultrasound signals of the femoral artery. Promising results were obtained from the 
Bessel distribution, the CWD, and the short-time AR model. 

Another focus of bioengineering applications of TFRs has concerned the analysis of biologic signals 
of interest, including the sounds generated by marine mammals, such as dolphins and whales, and the 
sonar echolocation systems used by bats to locate and identify their prey. The RID was applied to sperm 
whale acoustic signals by Williams and Jeong [1991] and revealed an intricate time-frequency structure 
that was not apparent in the original time-series data. The complicated time-frequency characteristics 
of dolphin whistles were analyzed by Tyack et al. [1992] using the spectrogram, the WD, and the RID, 
with the RID giving the best results. Flandrin [ 1988] analyzed the time-frequency structure of the different 
signals emitted by bats during hunting, navigation, and identifying prey using the smoothed Pseudo- 
WD. In addition, the instantaneous frequency of the various signals was estimated using time-frequency 
representations. Saillant et al. [1993] proposed a model of the bat's echolocation system using the 
spectrogram. The most common application of the spectrogram is the analysis and modification of quasi- 
stationary speech signals [Rabiner & Schafer, 1978]. 
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Signals recorded from the human body provide information pertaining to its organs. Their characteristic 
shape, or temporal and spectral properties, can be correlated with normal or pathological functions. In 
response to dynamical changes in the function of these organs, the signals may exhibit time-varying as 
well as nonstationary responses. Time -frequency and time-scale analysis techniques are well suited for 
such biological signals. loint time-frequency signal analysis techniques include short-term Fourier trans- 
form and Wigner-Ville distribution, and related reduced interference distribution. loint time-scale 
analysis includes continuous and discrete, orthonormal and non-orthonormal wavelets. These techniques 
find applications in the analysis of transient and time-varying events in biological signals. Examples of 
applications include cardiac signals (for detection of ischemia and reperfusion-related changes in QRS 
complex, and late potentials in ECG) and neurological signals (evoked potentials and seizure spikes). 

56.1 Introduction 

Digital signal processing uses sophisticated mathematical analysis and algorithms to extract information 
hidden in signals derived from sensors. In biomedical applications these sensors such as electrodes, 
accelerometers, optical imagers, etc., record signals from biological tissue with the goal of revealing their 
health and well-being in clinical and research settings. Refining those signal processing algorithms for 
biological applications requires building suitable signal models to capture signal features and components 
that are of diagnostic importance. As most signals of a biological origin are time-varying, there is a special 
need for capturing transient phenomena in both healthy and chronically ill states. 
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A critical feature of many biological signals is frequency domain parameters. Time localization of these 
changes is an issue for biomedical researchers who need to understand subtle frequency content changes 
over time. Certainly, signals marking the transition from severe normative to diseased states of an 
organism sometimes undergo severe changes which can easily be detected using methods such as the 
Short Time Fourier Transform (STFT) for deterministic signals and its companion, the spectrogram, for 
power signals. The basis function for the STFT is a complex sinusoid, e> 2K f', which is suitable for stationary 
analyses of narrowband signals. For signals of a biological origin, the sinusoid may not be a suitable 
analysis signal. Biological signals are often spread out over wide areas of the frequency spectrum. Also 
as Rioul and Vetterli [1] point out, when the frequency content of a signal changes in a rapid fashion, 
the frequency content becomes smeared over the entire frequency spectrum as it does in the case of the 
onset of seizure spikes in epilepsy or a fibrillating heartbeat as revealed on an ECG. The use of a 
narrowband basis function does not accurately represent wideband signals. It is preferred that the basis 
functions be similar to the signal under study. In fact, for a compact representation using as few basis 
functions as possible, it is desirable to use basis functions that have a wider frequency spread as most 
biological signals do. Wavelet theory, which provides for wideband representation of signals [2-4], is 
therefore a natural choice for biomedical engineers involved in signal processing and currently under 
intense study [5-9]. 

56.2 The Wavelet Transform: Variable Time and Frequency 
Resolution. Continuous Wavelet Transform (CWT) 

A decomposition of a signal, based on a wider frequency mapping and consequently better time resolution 
is possible with the wavelet transform. The Continuous Wavelet Transform (CWT) [3] is denned thusly 
for a continuous signal, x(t), 



CWT x (r,a) = ^Ll x(at)g* 
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y cl- 
ot with change of variable as 



dt (56.1a) 



CWr(x,a)=Vfl[x(af)^ > 
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dt (56.1b) 



where g(t) is the mother or basic wavelet, * denotes a complex conjugate, a is the scale factor, and x — a 
time shift. Typically, g(t) is a bandpass function centered around some center frequency, f . Scale a allows 
the compression or expansion of g(t) [1, 3, 10]. A larger scale factor generates the same function 
compressed in time whereas a smaller scale factor generates the opposite. When the analyzing signal is 
contracted in time, similar signal features or changes that occur over a smaller time window can be 
studied. For the wavelet transform, the same basic wavelet is employed with only alterations in this signal 
arising from scale changes. Likewise, a smaller scale function enables larger time translations or delays 
in the basic signal. 

The notion of scale is a critical feature of the wavelet transform because of time and frequency domain 
reciprocity. When the scale factor, a, is enlarged, the effect on frequency is compression as the analysis 
window in the frequency domain is contracted by the amount 1/a [ 10] . This equal and opposite frequency 
domain scaling effect can be put to advantageous use for frequency localization. Since we are using 
bandpass filter functions, a center frequency change at a given scale yields wider or narrower frequency 
response changes depending on the size of the center frequency. This is the same in the analog or digital 
filtering theories as "constant-Q or quality factor" analysis [1, 10, 11]. At a given Q or scale factor, 
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frequency translates are accompanied by proportional bandwidth or resolution changes. In this regard, 
wavelet transforms are often written with the scale factor rendered as 



/o 



(56.2) 
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(56.3) 



This is the equivalent to logarithmic scaling of the filter bandwidth or octave scaling of the filter 
bandwidth for power-of-two growth in center frequencies. Larger center frequency entails a larger band- 
width and vice versa. 

The analyzing wavelet, g(t), should satisfy the following conditions: 

(1) belong to L 2 (R), i.e., be square integmble (be of finite energy) [2]; 

(2) be analytic [G((d) = for © < 0] and thus be complex-valued. In fact many wavelets are real- 
valued; however, analytic wavelets often provide valuable phase information [3], indicative of 
changes of state, particularly in acoustics, speech, and biomedical signal processing [8]; and 

(3) be admissible. This condition was shown to enable invertibility of the transform [2, 6, 12]: 






t-1 



-dadx 



where c is a constant that depends only on g(t) and a is positive. For an analytic wavelet the 
constant should be positive and convergent: 



: ' V ' d(0 < c 



CO 



which in turn imposes an admissibility condition on g(t). For a real-valued wavelet, the integrals 
from both -co to and to +co should exist and be greater than zero. 

The admissibility condition along with the issue of reversibility of the transformation is not so 
critical for applications where the emphasis is on signal analysis and feature extraction. Instead, 
it is often more important to use a fine sampling of both the translation and scale parameters. 
This introduces redundancy which is typical for the CWT, unlike for the discrete wavelet transform, 
which is used in its dyadic, orthogonal, and invertible form. 

All admissible wavelets with g e L'(R) have no zero-frequency contribution. That is, they are 
of zero mean, 



-t-oo 

J"s(r)rfr = 



or equivalently G(co) = for co = 0, meaning that g(t) should not have non-zero DC [6, 12]. This 
condition is often being applied also to nonadmissible wavelets. 
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The complex- valued Morlet's wavelet is often selected as the choice for signal analysis using the CWT. 
Morlet's wavelet [31 is defined as 



# 



,;2jt/„t 



(56.4a) 



with its scaled version written as 
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(56.4b) 



Morlet's wavelet insures that the time-scale representation can be viewed as a time-frequency distribution 
as in Eq. (56.3). This wavelet has the best representation in both time and frequency because it is based 
on the Gaussian window. The Gaussian function guarantees a minimum time-bandwidth product, 
providing for maximum concentration in both time and frequency domains [1]. This is the best com- 
promise for a simultaneous localization in both time and frequency as the Gaussian function's Fourier 
transform is simply a scaled version of its time domain function. Also the Morlet wavelet is defined by 
an explicit function and leads to a quasi continuous discrete version [11]. A modified version of Morlet's 
wavelet leads to fixed center frequency, f a , with width parameter, o, 



?(°>f) = 
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Once again time-frequency (TF) reciprocity determines the degree of resolution available in time and 
frequency domains. Choosing a small window size, a, in the time domain, yields poor frequency reso- 
lution while offering excellent time resolution and vice versa [11, 13]. To satisfy the requirement for 
admissibility and G(0) = 0, a correction term must be added. For co > 5, this correction term becomes 
negligibly small and can be omitted. The requirements for the wavelet to be analytic and of zero mean 
is best satisfied for co = 5.3 [3]. 

Following the definition in Eq. (56.1a,b) the discrete implementation of the CWT in the time-domain 
is a set of bandpass filters with complex-valued coefficients, derived by dilating the basic wavelet by the 
scale factor, a, for each analyzing frequency. The discrete form of the filters for each a is the convolution: 
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with k = xlT s , where T s is the sampling interval. The summation is over a number of terms, n. Because 
of the scaling factor a in the denominator of the argument of the wavelet, the wavelet has to be resampled 
at a sampling interval TJa for each scale a. Should the CWT cover a wide frequency range, a computational 
problem would arise. For example, if we wish to display the CWT over 10 octaves (a change by one 
octave corresponds to changing the frequency by a factor of 2), the computational complexity (size of 
the summation) increases by a factor of 2 10 = 1024. The algorithm by Holschneider et al. [14] solves this 
problem for certain classes of wavelets by replacing the need to resample the wavelet with a recursive 
application of an interpolating filter. Since scale is a multiplicative rather than an additive parameter, 
another way of reducing computational complexity would be by introducing levels between octaves 
(voices) [15]. Voices are defined to be the scale levels between successive octaves, uniformly distributed 
in a multiplicative sense [13, 16]. Thus, the ratio between two successive voices is constant. For example, 
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if one wishes to have ten voices per octave, then the ratio between successive voices is 2 1 ' 10 . The distance 
between two levels, ten voices apart is an octave. 

The CWT can also be implemented in the frequency domain. Equation (56.1) may be formulated in 
the frequency domain as: 

CWr(x,a)=Vfl fs(co)G*(aCfl)e ; ™dco (56.5) 

where S(co) and G(co) denote the Fourier transformed s(t) and g(t), and j = (-1)" 2 . The analyzing wavelet 
g{t) generally has the following Fourier transform: 

G I(l (co)= VflG(aCo)e ;mT (56.6a) 

The Morlet wavelet [Eq. (56.4a,b)] in frequency domain is a Gaussian function: 

G m U) = ^e {a -^l 2 (56.6b) 

V2co 

From Eq. (56.6a,b) it can be seen that for low frequencies, co, (larger scales a) the width, Aco, of the 
Gaussian is smaller and vice versa. In fact, the ratio Aco/co is constant [1], i.e., Morlet wavelets may be 
considered filter banks of the constant- Q factor. 

Based on Eqs. (56.5 and 56.6a,b) the wavelet transform can be implemented in the frequency domain. 
At each scale, the Fourier image of the signal can be computed as 

y(co,a) = s(co).G m (co,fl) 

with S(ro) being the Fourier transform of the signal, G m (co, a) being the scaled Fourier image of the 
Morlet wavelet at scale a, and the operation • standing for element-by-element multiplication (windowing 
in frequency domain). The signal at each scale a will finally be obtained by applying the inverse Fourier 
transform: 



cwtU,o)=(fft} 7(co,a) 



This approach has the advantage of avoiding computationally intensive convolution of time-domain 
signals by using multiplication in the frequency domain, as well as the need of resampling the mother 
wavelet in the time domain [17, 18]. 

Note that the CWT is, in the general case, a complex-valued transformation. In addition to its 
magnitude, its phase often contains valuable information pertinent to the signal being analyzed, partic- 
ularly in instants of transients [3]. Sometimes the TF distribution of the nonstationary signal is much 
more important. This may be obtained by means of real-valued wavelets. Alternatives to the complex- 
valued Morlet wavelet are simpler, real-valued wavelets that may be utilized for the purpose of the CWT. 
For example, the early Morlet wavelet, as used for seismic signal analysis [19], had the following real- 
valued form: 

g(t) = cos(5t)e~' 2 / 2 

It had a few cycles of a sine wave tapered by a Gaussian envelope. Though computationally attractive, 
this idea contradicts the requirement for an analytic wavelet, i.e., its Fourier transform G(co) = for 
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co < 0. An analytic function is generally complex-valued in the time domain and has its real and imaginary 
parts as Hilbert transforms of each other [2, 20]. This guarantees only positive-frequency components 
of the analyzing signal. 

A variety of analyzing wavelets have been proposed in recent years for time-scale analysis of the ECG. 
For example, Senhadji etal. [21] applied a pseudo-Morlet's wavelet to bandpass filtering to find out 
whether some abnormal ECG events like extrasystoles and ischemia are mapped on specific decompo- 
sition levels: 

g(f) = c(l + cos(2Jl/ t))e _2 ' w ° f , f<l/2/ and k integer g{- 1,0, l} 

with the product kf defining the number of oscillations of the complex part, and C representing a 
normalizing constant such that llgll = 1. The above function is a modulated complex sine wave that would 
yield complex-valued CWT including phase information. However its envelope is a cosine, rather than 
a Gaussian, as in the case of the complex Morlet wavelet. It is well known that strictly the Gaussian 
function (both in time and frequency domain) guarantees the smallest possible time-bandwidth product 
which means maximum concentration in the time and frequency domains [22]. 

The STFT has the same time-frequency resolution regardless of frequency translations. The STFT can 
be written as 



STFT{%, /)= f x(f) g^t-xY^'dt 



(56.7) 



where g{t) is the time window that selects the time interval for analysis or otherwise known as the 
spectrum localized in time. Figure 56. 1 shows comparative frequency resolution of both the STFT as well 
as the wavelet transform. The STFT is often thought to be analogous to a bank of bandpass filters, each 
shifted by a certain modulation frequency, f . In fact, the Fourier transform of a signal can be interpreted 
as passing the signal through multiple bandpass filters with impulse response, giffe'^f 1 , and then using 
complex demodulation to downshift the filter output. Ultimately, the STFT as a bandpass filter rendition 
simply translates the same low pass filter function through the operation of modulation. The character- 
istics of the filter stay the same though the frequency is shifted. 

Unlike the STFT, the wavelet transform implementation is not frequency independent so that higher 
frequencies are studied with analysis filters with wider bandwidth. Scale changes are not equivalent to 
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FIGURE 56.1 Comparative frequency resolution for Short Time Fourier Transform (STFT) and Wavelet Transform 
(WT). Note that frequency resolution of STFT is constant across frequency spectrum. The WT has a frequency 
resolution that is proportional to the center frequency of the bandpass filter. 
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varying modulation frequencies that the STFT uses. The dilations and contractions of the basis function 
allow for variation of time and frequency resolution instead of uniform resolution of the Fourier trans- 
form [15]. 

Both the wavelet and Fourier transform are linear Time-Frequency Representations (TFRs) for which 
the rules of superposition or linearity apply [10]. This is advantageous in cases of two or more separate 
signal constituents. Linearity means that cross-terms are not generated in applying either the linear TF 
or time-scale operations. Aside from linear TFRs, there are quadratic TF representations which are quite 
useful in displaying energy and correlation domain information. These techniques, also described else- 
where in this volume include the Wigner-Ville distribution (WVD), smoothed WVD, the reduced infer- 
ence distribution (RID), etc. One example of the smoothed Wigner-Ville distribution is 
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where h(t), is a smoothing function. In this case the smoothing kernel for the generalized or Cohen's 
class of TFRs is 
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These methods display joint TF information in such a fashion as to display rapid changes of energy over 
the entire frequency spectrum. They are not subject to variations due to window selection as in the case 
of the STFT. A problematic area for these cases is the elimination of those cross-terms that are the result 
of the embedded correlation. 

It is to be noted that the scalogram or scaled energy representation for wavelets can be represented as 
a Wigner-Ville distribution as [1] 
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The Discrete Wavelet Transform 

In the discrete TFRs both time and scale changes are discrete. Scaling for the discrete wavelet transform 
involves sampling rate changes. A larger scale corresponds to subsampling the signal. For a given number 
of samples a larger time swath is covered for a larger scale. This is the basis of signal compression schemes 
as well [23]. Typically, a dyadic or binary scaling system is employed so that given a discrete wavelet 
function, V|/(x), is scaled by values that are binary. Thus 



Mf 2i [i) = 2'M/[2't) 



(56.10) 



where j is the scaling index and j = 0,-1, -2, In a dyadic scheme, subsampling is always decimation- 
in-time by a power of 2. Translations in time will be proportionally larger as well as for a more sizable scale. 
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It is for discrete time signals that scale and resolution are related. When the scale is increased, resolution 
is lowered. Resolution is strongly related to frequency. Subsampling means lowered frequency content. Rioul 
and Vetterli [1] use the microscope analogy to point out that smaller scale (higher resolution) helps to 
explore fine details of a signal. This higher resolution is apparent with samples taken at smaller time intervals. 

56.3 A Multiresolution Theory: Decomposition of Signals 
Using Orthogonal Wavelets 

One key result of the wavelet theory is that signals can be decomposed in a series of orthogonal wavelets. 
This is similar to the notion of decomposing a signal in terms of discrete Fourier transform components 
or Walsh or Haar functions. Orthogonality insures a unique and complete representation of the signal. 
Likewise the orthogonal complement provides some measure of the error in the representation. The 
difference in terms of wavelets is that each of the orthogonal vector spaces offers component signals with 
varying levels of resolution and scale. This is why Mallat [24] named his algorithm the multiresolution 
signal decomposition. Each stage of the algorithm generates wavelets with sequentially finer representations 
of signal content. To achieve an orthogonal wavelet representation, a given wavelet function, (j)(f), at a 
scaling index level equal to zero, is first dilated by the scale coefficient 2>, then translating it by 2~'n and 
normalizing by J2r> gives: 

v2-^ 2) (t-2-'n) 

The algorithm begins with an operator A 2 - for discrete signals that takes the projections of a signal, 
/(f) onto the orthonormal basis, V 2 ,: 



M f ) =2 ~ J LM4M M ~ 2 ~M)M r ~ r ' B ) (56 - n) 



where D defines the level of resolution. A 2 j is defined as the multi-resolution operator that approximates a 
signal at a resolution 2>. Signals at successively lower resolutions can be obtained by repeated application 
of the operator A 2 j (-/ < j < -1 ), where / specifies the maximum resolution, such that A 2 j f(x) is the closest 
approximation of function f(x) at resolution 1), Here we note that < > is simply a convolution defined thusly, 



/(m),<|> 2 .(w-2" ; m)\= f f[u\iif{u-2' j n\du (56.12) 



Here §(x) is the impulse response of the scaling function. The Fourier transforms of these functions 
are lowpass filter functions with successively smaller halfband lowpass filters. This convolution synthesizes 
the coarse signal at a resolution/scaling level j: 



C 2J f=(f(t),^ 2l (t-2-'nf) (56.13) 



Each level j generates new basis functions of the particular orthonormal basis with a given discrete 
approximation. In this case, larger j provides for decreasing resolution and increasing the scale in 
proportional fashion for each level of the orthonormal basis. Likewise, each sequentially larger j provides 
for time shift in accordance with scale changes, as mentioned above, and the convolution or inner product 
operation generates the set of coefficients for the particular basis function. A set of scaling functions at 
decreasing levels of resolution, j = 0,-1,-2, ...,-6 is given in [25]. 
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The next step in the algorithm is the expression of basis function of one level of resolution, ()) 2 j by at 
a higher resolution, (|> 2 >n. In the same fashion as above, an orthogonal representation of the basis V 2 i in 
terms of V 2 /+i is possible, or 

^ 2l [t-2->n) = 2-'- 1 Y^^ 2l [u-2->n)^^[u-2->- l k^^ 1 [t-2-'- 1 k) (56.14) 

Here the coefficients are once again the inner products between the two basis functions. A means of 
translation is possible for converting the coefficients of one basis function to the coefficients of the basis 
function at a higher resolution: 

C 2 ,f=[f(u\^,{t-2-'n) 



(56.15) 
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Mallat [24] also conceives of the filter function, h(ri), whose impulse response provides this conversion, 
namely, 

C,/=(/(4^(f-2" J «)) = 2-'- 1 £%n-fc)(/(4v( f - 2 " Hfc )) ( 56 " 16 ) 

k=-oo 

where h(n) = 2~i~ x ((|) 2 i (u — 2~i n), § 2 >* 1 (u - 2~f~ 1 k)) and h(n) = h(—n) is the impulse response of the 
appropriate mirror filter. 

Using the tools already described, Mallat [24] then proceeds to define the orthogonal complement, 
2j to the vector space V 2 i at resolution level j. This orthogonal complement to V 2 i is the error in the 
approximation of the signal in V 2 ;+i by use of basis function belonging to the orthogonal complement. 
The basis functions of the orthogonal complement are called orthogonal wavelets, \\i(x), or simply wavelet 
functions. To analyze finer details of the signal, a wavelet function derived from the scaling function is 
selected. The Fourier transform of this wavelet function has the shape of a bandpass filter in frequency 
domain. A basic property of the function \|/ is that it can be scaled according to 

M? 2 ,(t) = 2>y[2>t) 

An orthonormal basis set of wavelet functions is formed by dilating the function v|/(x) with a coefficient 
2^ and then translating it by 2~>n, and normalizing by J2~i. They are formed by the operation of convolving 
the scale function with the quadrature mirror filter 



y(co) = 



G 



CO 



CO 



where G(co) = er^ H (co + jr.) is the quadrature mirror filter transfer response and g{n) = (-1) 1 "" h(l - n) 
is the corresponding impulse response function. 

The set of scaling and wavelet functions presented here form a duality, together resolving the temporal 
signal into coarse and fine details, respectively. For a given level j then, this detail signal can once again 
be represented as a set of inner products: 
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D 2i f=(f(t),y 2i (x-2->n) 



(56.17) 



For a specific signal, f{x), we can employ the projection operator as before to generate the approxi- 
mation to this signal on the orthogonal complement. As before, the detail signal can be decomposed 
using the higher resolution basis function: 



v=(/(4^( f - 2 " J ' M ))= 

TH ^(vA t - T 'MA u - THk ))(AM2>A t - THk )Y 

or in terms of the synthesis filter response for the orthogonal wavelet 



(56.18) 



^/=(/(4v 2 -(f-2^»)) = 2- H £g(2n-fc)(/(4(t» 2 , +1 (t-2-^fc)) (56.19) 



At this point, the necessary tools are here for a decomposition of a signal in terms of wavelet compo- 
nents, coarse and detail signals. Multiresolution wavelet description provides for the analysis of a signal 
into lowpass components at each level of resolution called coarse signals through the C operators. At the 
same time the detail components through the D operator provide information regarding bandpass 
components. With each decreasing resolution level, different signal approximations are made to capture 
unique signal features. Procedural details for realizing this algorithm follow. 

Implementation of the Multiresolution Wavelet Transform: 
Analysis and Synthesis of Algorithms 

A diagram of the algorithm for the multiresolution wavelet decomposition algorithm is shown in Fig. 56.2. 
A step-by-step rendition of the analysis is as follows: 

1. Start with N samples of the original signal, x(t), at resolution level j = 0. 

2. Convolve signal with original scaling function, §(t), to find Q/as in Eq. (56.13) with j = 0. 

3. Find coarse signal at successive resolution levels, ;'= -1,-2, ...,-/ through Eq. (56.16); Keep every 
other sample of the output. 



2 



Convolve 
G* 



Keep alternate 
sample 



Convolve 
H* 



Keep alternate 
sample 



-D' jf 



C'f 



-Jij <-l 



FIGURE 56.2 Flow chart of multiresolution algorithm showing how successive coarse and detail components of 
resolution level, ;', are generated from higher resolution level, j + 1 . 



) 2000 by CRC Press LLC 



4. Find detail signals at successive resolution levels, j = -1,-2, ...,-/ through Eq. (56.19); Keep every 
other sample of the output. 

5. Decrease j and repeat steps 3 through 5 until j = —J 

Signal reconstruction details are presented in [24, 26]. 

56.4 Further Developments of the Wavelet Transform 

Of particular interest are Daubechies' orthonormal wavelets of compact support with the maximum 
number of vanishing moments [27, 28]. In these works, wavelets of very short length are obtained 
(number of coefficients 4, 6, 8, etc.) thus allowing for very efficient data compression. 

Another important development of the multiresolution decomposition is the wavelet packet approach 
[29]. Instead of the simple multiresolution three-stage filter bank (low-pass/highpass/downsampling) 
described previously in this chapter, wavelet packets analyse all subsystems of coefficients at all scale 
levels, thus yielding the full binary tree for the orthogonal wavelet transform, resulting in a completely 
evenly spaced frequency resolution. The wavelet packet system gives a rich orthogonal structure that 
allows adaptation to particular signals or signal classes. An optimal basis may be found, minimizing some 
function on the signal sequences, on the manifold of orthonormal bases. Wavelet packets allow for a 
much more efficient adaptive data compression [30]. Karrakchou and Kunt report an interesting method 
of interference cancelling using a wavelet packet approach [8]. They apply 'local' subband adaptive 
filtering by using a non-uniform filter bank, having a different filter for each subband (scale). Further- 
more, the orthonormal basis has "optimally" been adapted to the spectral contents of the signal. This 
yields very efficient noise suppression without significantly changing signal morphology. 

56.5 Applications 
Cardiac Signal Processing 

Signals from the heart, especially the ECG, are well suited for analysis by joint time-frequency and time- 
scale distributions. That is because the ECG signal has a very characteristic time-varying morphology, 
identified as the P-QRS-T complex. Throughout this complex, the signal frequencies are distributed: 
(1) low frequency — P and T waves, (2) mid to high frequency — QRS complex [31, 32]. Of particular 
diagnostic importance are changes in the depolarization (activation phase) of the myocardium, repre- 
sented by the QRS complex. These changes cause alterations in the propagation of the depolarization 
wave detectable by time-frequency analysis of one-dimensional electrocardiographic signals recorded 
from the body surface. Two examples are presented below. 

Analysis of the QRS Complex Under Myocardial Ischemia 

Ischemia-related intra-QRS changes. When the heart muscle becomes ischemic or infarcted, characteristic 
changes are seen in the form of elevation or depression of the ST-segment. Detection of these changes 
requires an extension of the signal bandwidth to frequencies down to 0.05 Hz and less, making the 
measurements susceptible to motion artifact errors. Ischemia also causes changes in conduction velocity 
and action potential duration, which results in fragmentation in the depolarization front (Fig. 56.3) and 
appearance of low-amplitude notches and slurs in the body surface ECG signals. These signal changes 
are detectable with various signal processing methods [33, 34]. Depolarization abnormalities due to 
ischemia may also cause arrhythmogenic reentry [35], which is one more reason to detect intra-QRS 
changes precisely. 

Identification of ischemia-related changes in the QRS complex is not as well known, and interpretation 
of the QRS complex would be less susceptible to artifactual errors, as compared to the ST analysis. Thus, 
time-frequency or time-scale analysis would serve a useful function in localizing the ischemia-related 
changes within the QRS complex, but would be somewhat independent of the artifactual errors. 

© 2000 by CRC Press LLC 



a) normal depolarization 



b) ischemic depolarization 




observation points 
(electrode sites) 



Left Ventricle 



observation points 
(electrode sites) 



site of LAD 
occlusion 



isochrones 




Left Ventricle 



FIGURE 56.3 Idealized example of (a) normal propagation and (b) wavefront fragmentation due to an ischemic 
zone of slow conduction. The superimposed lines are isochrones, connecting points at which the depolarization 
arrives at the same time. 

Experimental findings. In experimental animals, the response of the heart to coronary artery occlusion 
and then reperfusion was studied. The Left Anterior Descending (LAD) branch of the coronary artery 
was temporarily occluded for 20 min. Subsequent to that, the occlusion was removed, and resulting 
reperfusion more or less restored the ECG signal after 20 min. The coronary artery was occluded a second 
time for 60 min, and once again occlusion was removed and blood flow was restored. Single ECG cycles 
were analyzed using the continuous wavelet transform [33]. Figure 56.4 shows the time-scale plots for 
the ECG cycles for each of the five stages of this experiment. The three-dimensional plots give time in 
the P-QRS-T complex on one axis, the scale (or equivalent frequency) on another axis, and the normalized 
magnitude on the third axis. First occlusion results in a localized alteration around 100 ms and the 
midscale, which shows up as a bump in the three-dimensional plot or a broadening in the contour plot. 
Upon reperfusion, the time-scale plot returns to the pre-occlusion state. The second occlusion brings 
about a far more significant change in the time-scale plot, with increased response in the 0-200 ms and 
mid-scale ranges. This change is reversible. We were thus able to show, using time-scale technique, 
ischemia related changes in the QRS complex, and the effects of occlusion as well as reperfusion. 

Potential role of ischemia-related intra-QRS changes in coronary angioplasty. The above results are also 
applicable to human ECGs and clinical cardiology. For example, a fairly common disorder is the occlusion 
of coronary vessels, causing cardiac ischemia and eventually infarction. An effective approach to the 
treatment of the occlusion injury is to open the coronary blood vessels using a procedure called coronary 
angioplasty (also known as percutaneous transluminal coronary angioplasty or PTCA). Vessels may be 
opened using a balloon-type or a laser-based catheter. When reperfusion occurs following the restoration 
of the blood flow, initially a reperfusion injury is known to occur (which sometimes leads to arrhythmias) 
[35]. The ST level changes as well, but its detection is not easy due to artifacts, common in a PTCA 
setting. In a clinical study, we analyzed ischemia and reperfusion changes before and after the PTCA 
procedure. Short-term occlusion and ischemia followed by reperfusion were carried out in a cardiac 
catheterization laboratory at the Johns Hopkins Hospital in connection with PTCA) [36]. Figure 56.5 
shows time-scale plots of a patient derived from continuous wavelet transform. Characteristic midscale 
hump in the early stages of the QRS cycle is seen in the three-dimensional time-scale plot. Then, 60 min 
after angioplasty, the normal looking time-scale plot of the QRS complex is restored in this patient. This 
study suggests that time-scale analysis and resulting three-dimensional or contour plots may be usable 
in monitoring the effects of ischemia and reperfusion in experimental or clinical studies. In another study 
(4 patients, LAD) we monitored for intra-QRS changes during PTCA. Despite signal noise and availability 
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FIGURE 56.5 Time-frequency distributions of human ECG study using WT. Pre -angioplasty plot (a) shows a 
characteristic hump at about 35 Hz, which disappears as indicated by the second plot (b) taken one hour after 
angioplasty treatment. 

of recordings only from limb leads, superimposed mid-frequency components during ischemic states of 
the heart were observed, which disappeared when perfusion was restored. There was at least one lead 
that responded to changes in coronary perfusion. Figure 56.6 shows five different stages of a PTCA 
procedure as time plots (lead I) and CWT TFDs (topo-plots). Despite the presence of noise, the WT was 
able to unveil elevation of intra-QRS time-frequency components around 20 Hz during balloon inflation 
(ischemia), and a drop in the same components with reperfusion after balloon deflation. Frequency 
components 20 to 25 Hz during inflation. No substantial ST changes can be observed in the time-domain 
plot. The arrows show the zones of change in TFDs with ischemia and reperfusion. Note the representation 
of power line interference (50 Hz) as "clouds" in (b) and (d) topo plots — far from the region of interest. 

Another study analyzed ECG waveforms from patients undergoing the PTCA procedure by the mul- 
tiresolution wavelet method, decomposing the whole P-QRS-T intervals into coarse and detail compo- 
nents [26, 37], as can be seen from the analysis of one pre-angioplasty ECG cycle in Fig. 56.7 [26]. The 
PTCA procedure results in significant morphological and spectral changes within the QRS complex. It 
was found that certain detail components are more sensitive than others: in this study, the detail com- 
ponents d6 and d5 corresponding to frequency band of 2.2 to 8.3 Hz are most sensitive to ECG changes 
following a successful PTCA procedure. From this study it was concluded that monitoring the energy of 
ECG signals at different detail levels may be useful in assessing the efficacy of angioplasty procedures 
[37] . A benefit of this approach is that a real-time monitoring instrument for the cardiac catheterization 
laboratory can be envisioned (whereas currently X-ray fluroscopy is needed). 

Detection of reperfusion during thrombolytic therapy. Detecting reperfusion-related intra-QRS changes, 
along with ST changes, in the time-frequency domain would possibly find application in thrombolysis 
monitoring after myocardial infarction. At present, the ST elevation and its recovery are the main 
electrocardiographic indicators of acute coronary ischemia and reperfusion. Reports using continuous 
ST-segment monitoring have indicated that 25 to 50% of patients treated with intravenous thrombolytic 
therapy display unstable ST recovery [38], and additional reperfusion indicators are necessary. Signal 
averaged ECG and highest frequency ECG components ( 150 to 250 Hz) have been utilized as a reperfusion 
marker during thrombolysis and after angioplasty, but their utility is uncertain, since the degree of change 
of the energy values chosen does not appear to be satisfactory [39, 40]. We have analyzed the QRS of the 
vector magnitude V = Jx 2 + Y 2 + Z 2 of body surface orthogonal ECG leads X, Yand Z during thrombolytic 
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FIGURE 56.7 Detail and coarse components from one ECG cycle. The coarse components represent the lowpass 
filtered versions of the signal at successive scales. Detail components, dl and d2, consist mainly of electrical interference. 

therapy of two patients with myocardial infarction. Figure 56.8 shows how TFDs may be affected by 
reperfusion during thrombolysis. Two interesting trends may be observed on this figure: (1) a mid- 
frequency peak present during initial ischemia (a) disappears two hours after start of thrombolytic therapy 
(b) due to smoother depolarization front, and (2) high-frequency components appear with reestablished 
perfusion, possibly due to faster propagation velocity of the depolarization front. 

Analysis of Late Potentials in the ECG 

"Late potentials" are caused by fractionation of the depolarization front after myocardial infarction 
[35, 41]. They have been shown to be predictive of life threatening reentrant arrhythmias. Late potentials 
occur in the terminal portion of the QRS complex and are characterized by small amplitude and higher 
frequencies than in the normal QRS complex. The presence of late potentials may indicate underlying 
dispersion of electrical activity of the cells in the heart, and therefore may provide a substrate for 
production of arrhythmias. The conventional Fourier transform does not readily localize these features 
in time and frequency [15, 42]. STFT is more useful because the concentration of signal energy at various 
times in the cardiac cycle is more readily identified. The STFT techniques suffer from the problem of 
selecting a proper window function; for example, window width can affect whether high temporal or 
high spectral resolution is achieved [15]. Another approach sometimes considered is the Wigner-Ville 
distribution, which also produces a composite time-frequency distribution. However, the Wigner-Ville 
distribution suffers from the problem of interference from cross-terms. Comparative representations by 
smoothed Wigner-Ville, wavelet transform (scalogram), and traditional spectrogram are illustrated in 
Fig. 56.9. This problem causes high levels of signal power to be seen at frequencies not representing the 
original signal; for example, signal energy contributed at certain frequencies by the QRS complex may 
mask the signal energy contributed by the late potentials. In this regard, wavelet analysis methods provide 
a more accurate picture of the localized time-scale features indicative of the late potentials [11, 15,43-45]. 
Figure 56.10 shows that the signal energies at 60 Hz and beyond are localized in the late stage of the QRS 
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5 min thrombolysis 



120 min thrombolysis 




FIGURE 56.8 Change of the time-frequency distributions (QRS complex) of the vector magnitude of orthogonal leads 
X, Y and Z during thrombolysis: (a) 5 minutes after start, and (b) 2 hr after initiation of therapy. A mid-frequency peak 
has disappeared due to smoother depolarization, and high-frequency components appear due to faster propagation. 

and into the ST-segment. For comparison, the scalogram from a healthy person is illustrated in Fig. 56.11. 
This spreading of the high frequencies into the late cycle stages of the QRS complex is a hallmark of the 
late potentials. Time-scale analysis of late potentials may therefore serve as a noninvasive diagnostic tool 
for predicting the likelihood of life threatening arrhythmias in the heart. 

Role of Wavelets in Arrhythmia Analysis 

Detection of altered QRS morphology. Further applications to the generalized field of arrhythmia classi- 
fication can be envisaged [46]. When arrhythmias, such as premature ventricular contractions (PVCs) 
and tachycardia do occur, the P-QRS-T complex undergoes a significant morphological change. Abnormal 
beats, such as PVCs, have different time-scale signatures than normal beats. Often the QRS complex may 
widen and sometimes invert with PVCs. As the QRS complex widens, its power spectrum shows dimin- 
ished contribution at higher frequencies and these are spread out over a wider body of the signal [47, 48]. 
This empirical description of the time-domain features of the ECG signal lends itself particularly well to 
analysis by time-frequency and time-scale techniques. A more challenging problem is to distinguish 
multiform PVCs. 

Use of the orthogonal wavelet decomposition to separate dynamical activities embedded in a time series. 
Another interesting application of the discrete wavelet transform to arrhythmia research is the dynamical 
analysis of the ECG before and during ventricular tachycardia and ventricular fibrillation. Earlier works 
on heart rate variability have shown that heart rhythm becomes rigidly constant prior to the onset of 
life threatening arrhythmias, whereas the correlation dimension as a measure of randomness increases 
to values above 1 during disorganized rhythms like fast ventricular tachycardia, ventricular flutter, and 
ventricular fibrillation. This fact was used to identify patients at risk, and is promising with regard to 
prediction of arrhythmic episodes. Encouraging results were recently obtained by combining multires- 
olution wavelet analysis and dynamical analysis, in an attempt to find a decorrelated scale best projecting 
changes in low-dimensional dynamics [49]. The authors used records 115 and 207 from the MIT 
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FIGURE 56.9 Comparison of time-frequency representations of sinusoids with specific on-off times: (a) shows true 
time-frequency representation of 40 and 60 Hz sinusoids, (b) shows representation of smoothed Wigner-Ville trans- 
form, (c) spectrogram representation, (d) shows wavelet transform version of signal. 



Arrhythmia Database to study nonlinear dynamics preceding ventricular flutter. Distinct changes in the 
correlation dimension (D 2 k 3) in frequency band 45 to 90 Hz were observed before the onset of arrythmia, 
indicating the presence of underlying low- dimensional activity. 

Neurological Signal Processing 
Evoked Potentials 

Evoked potentials are the signals recorded from the brain in response to external stimulation. Evoked 
responses can be elicited by electrical stimulation (somatosensory evoked response), visual stimulation 
(visual evoked response), or auditory stimulation (brainstem auditory evoked response). Usually the 
signals are small, while the background noise, mostly the background EEG activity, is quite large. The 
low signal-to-noise ratio (SNR) necessitates use of ensemble averaging, sometimes signal averaging as 
many as a thousand responses [50]. After enhancing the SNR, one obtains a characteristic wave pattern 
that includes the stimulus artifact and an undulating pattern characterized by one or more peaks at 
specific latencies beyond the stimulus. Conventionally, the amplitude and the latency of the signal peaks 
is used in arriving at a clinical diagnosis. However, when the signals have a complex morphology, simple 
amplitude and latency analysis does not adequately describe all the complex changes that may occur as 
a result of brain injury or disease. Time-frequency and wavelet analysis have been shown to be useful in 
identifying the features localized within the waveform that are most indicative of the brain's response [51]. 
In one recent experimental study, we evaluated the somatosensory evoked response from experimental 
animals in whom injury was caused by oxygen deprivation. The evoked response signal was decomposed 



© 2000 by CRC Press LLC 




50 100 150 200 250 300 350 400 450 500 
rime Ora) 

(a) 





(c) 



W) 



FIGURE 56.10 Healthy person: (a) first recorded beat, (b) 3-D representation of the modified WT for the first beat, 
(c) contour plot of the modified WT for the first beat, and (d) contour plot of the modified WT for the second beat. 

into its coarse and detail components with the aid of the multiresolution wavelet analysis technique 
(Fig. 56.12) [25]. The magnitude of the detail components was observed to be sensitive to the cerebral 
hypoxia during its early stages. Figures 56.13a and 56.13b show a time trend of the magnitude of the 
detail components along with the trend of the amplitude and the latency of the primary peak of the 
somatosensory evoked response. The experimental animal was initially challenged by nitrous gas mixture 
with 100% oxygen (a non-injury causing event), and late by inspired air with 7 to 8% oxygen. As expected, 
the amplitude trend shows an initial rise because of the 100% oxygen, and later a gradual decline in 
response to hypoxia. The magnitude of the detail component shows a trend more responsive to injury: 
while there is not a significant change in response to the non-injury causing event, the magnitude of the 
detail component d4 drops quite rapidly when the brain becomes hypoxic. These data suggest that detail 
components of the evoked response may serve as indicators of early stages of brain injury. Evoked response 
monitoring can be useful in patient monitoring during surgery and in neurological critical care [52, 53]. 
Other applications include study of cognitive or event-related potentials in human patients for normal 
cognitive function evaluation or for assessment of clinical situations, such as a response in Alzheimer's 
disease [54]. Proper characterization of evoked responses from multiple channel recordings facilitates 
localization of the source using the dipole localization theory [55]. 

EEG and Seizures 

Electroencephalographic signals are usually analyzed by spectrum analysis techniques, dividing the EEG 
signal into various arbitrary frequency bands (a, (3, 0, 5). Conventional spectrum analysis is useful when 
these events are slowly unfolding, as when a person goes to sleep, the power in the EEG shifts from higher 
to lower frequency bands. However, when transient events such as epileptic seizures occur, there are often 
sharp spikes or a bursting series of events in the recorded waveform. This form of the signal, that is 
temporally well localized and has a spectrum that is distinctive from normal or ongoing events, lends 
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FIGURE 56.1 1 Patient with ventricular tachycardia diagnosis: (a) first beat, (b) 3-D representation of the modified 
WT for the first beat, (c) contour plot of the modified WT for the first beat, and (d) contour plot of the modified 
WT for the second beat. 



itself to wavelet analysis. A patient's EEG recorded over an extended period, preceding and following the 
seizure, was recorded and analyzed. Figure 56.14 shows a short segment of the EEG signal with a seizure 
burst. The multiresolution wavelet analysis technique was employed to identify the initiation of the seizure 
burst. Figure 56.15 shows a sudden burst onset in the magnitude of the detail components when the 
seizure event starts. The bursting subsides at the end of the seizure, as seen by a significant drop in the 
magnitude of the detail components. Wavelet analysis, thus, may be employed for the detection of onset 
and termination of seizures. Further possibilities exist in the use of this technique for discriminating 
interictal spikes and classifying them [56]. Certain seizures, like the petit mal and the grand mal epilepsy 
seizures, have very characteristic morphologies (e.g., spike and dome pattern). These waveforms would 
be expected to lend themselves very well to wavelet analysis. 

Other Applications 

Wavelet, or time-scale analysis is applicable to problems in which signals have characteristic morphologies 
or equivalently differing spectral signature attributed to different parts of the waveform, and the events 
of diagnostic interest are well localized in time and scale. The examples of such situations and applications 
are many. 

In cardiac signal processing, there are several potential applications. Well localized features of ECG 
signals, such as the P-QRS-T lend themselves well to wavelet analysis [57]. The application of wavelet 
analysis to ischemic-reperfusion injury changes and the late potentials has been illustrated above. This 
idea has been extended to the study of body surface maps recorded using numerous electrodes placed 
on the chest. In a preliminary study [58], spatio-temporal maps can been constructed and interpreted 
using time-scale analysis techniques. 
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FIGURE 56.12 Coarse (a) and detail (b) components from somatosensory evoked potentials during normal, 
hypoxic, and reoxygenation phases of experiment. 

In many situations noise and artifact result in inaccurate detection of the QRS complex. Wavelet 
analysis may prove to be helpful in removal of electrical interference from ECG [59] . Wavelet techniques 
have successfully been used in removing the noise from functional MRI data [8]. A more challenging 
application would be in distinguishing artifact from signal. Since wavelet analysis naturally decomposes 
the signals at different scales at well localized times, the artifactual events can be localized and eliminated. 
Fast computational methods may prove to be useful in real-time monitoring of ECG signal at a bedside 
or in analysis of signals recorded by Holter monitors. 

Other cardiovascular signals, such as heart sounds may be analyzed by time-frequency or time-scale 
analysis techniques. Characteristic responses to various normal and abnormal conditions along with 
sounds that are well localized in time and scale make these signals good candidates for wavelet analysis 
[60]. Normal patterns may be discriminated from pathological sound patterns, or sounds from various 
blood vessels can be identified [61]. Blood pressure waveform similarly has a characteristic pattern 
amenable to time-scale analysis. The dicrotic notch of the pressure waveform results from blood flow 
through the valves whose opening and closing affects the pressure signal pattern. The dicrotic notch can 
be detected by wavelet analysis [62]. Sounds from the chest, indicative of respiratory patterns are being 
investigated using wavelet techniques. Applications include analysis of respiratory patterns of infants [63] 
and respiration during sleep [64]. 

Two applications in neurological signal processing, evoked response and seizure detection, are 
described above. Other potential applications include detection and interpretation of signals from mul- 
tiple neurons obtained using microelectrodes [65]. Since waveforms (called spikes) from individual 
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FIGURE 56.13 (a) Amplitude and latency of major evoked potential peak during control, hypoxic, and reoxygen- 
ation portions of experiment; (b) mean amplitude of respective detail components during phases of experiment. 

neurons (called units) have different patterns because of their separation and orientation with respect 
to the recording microelectrode, multiunit spike analysis becomes a challenging problem. Time-scale 
analysis techniques may be employed to localize and analyze the responses of individual units and from 
that derive the overall activity and interrelation among these units so as to understand the behavior of 
neural networks. An analogous problem is that of detecting and discriminating signals from "motor 
units," that is, the muscle cells and fibers [66]. Signals from motor units are obtained by using small, 
micro or needle electrodes, and characteristic spike trains are obtained, which can be further analyzed 
by time-scale analysis techniques to discriminate normal and abnormal motor unit activity. 

Discussion and Conclusions 

Biological signals with their time-varying nature and characteristic morphologies and spectral signatures 
are particularly well suited for analysis and interpretation using time-frequency and time-scale analysis 
techniques. For example, the P-QRS-T complex of the ECG signal shows localized low frequencies in the 
P- and the ST-segments and high frequencies in the QRS complex. In time-scale frame, the ischemia 
related changes are seen in certain detail components of the QRS complex. The late segment of the QRS 
cycle exhibits the so-called late potentials more easily localized by means of time-scale analysis. Other 
cardiovascular signals, such as pressure waves, heart sounds, and blood flow are being analyzed by the 
newly developed wavelet analysis algorithms. Other examples of time-scale analysis include neurological 
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signals with potential applications in the analysis of single and multiunit recordings from neurons, evoked 
response, EEG, and epileptic spikes and seizures. 

The desirable requirements for a successful application of time-scale analysis to biomedical signals is 
that events are well localized in time and exhibit morphological and spectral variations within the 
localized events. Objectively viewing the signal at different scales should provide meaningful new infor- 
mation. For example, are there fine features of signal that are observable only at scales that pick out the 
detail components? Are there features of the signal that span a significant portion of the waveform so 
that they are best studied at a coarse scale. The signal analysis should be able to optimize the trade-off 
between time and scale, i.e., distinguish short lasting events and long lasting events. 

For these reasons, the signals described in this article have been found to be particularly useful models 
for data analysis. However, one needs to be cautious in using any newly developed tool or technology. Most 
important questions to be addressed before proceeding with a new application are: Is the signal well suited 
to the tool, and in applying the tool, are any errors inadvertently introduced? Does the analysis provide a 
new and more useful interpretation of the data and assist in the discovery of new diagnostic information? 

Wavelet analysis techniques appear to have robust theoretical properties allowing novel interpretation 
of biomedical data. As new algorithms emerge, they are likely to find application in the analysis of more 
diverse biomedical signals. Analogously, the problems faced in the biomedical signal acquisition and 
processing world will hopefully stimulate development of new algorithms. 
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The past 20 years witnessed an expansion of power spectrum estimation techniques, which have proved 
essential in many applications, such as communications, sonar, radar, speech/image processing, geophys- 
ics, and biomedical signal processing [32], [27], [20]. In power spectrum estimation, the process under 
consideration is treated as a superposition of statistically uncorrected harmonic components. The dis- 
tribution of power among these frequency components is the power spectrum. As such phase relations 
between frequency components are suppressed. The information in the power spectrum is essentially 
present in the autocorrelation sequence, which would suffice for the complete statistical description of 
a Gaussian process of known mean. However, there are applications where one would need to obtain 
information regarding deviations from the Gaussianity assumption and presence of nonlinearities. In 
these cases power spectrum is of little help, and a look beyond the power spectrum or autocorrelation 
domain is needed. Higher-Order Spectra (HOS) (of order greater than 2), which are defined in terms of 
higher-order cumulants of the data, do contain such information [35]. The third order spectrum is 
commonly referred to as bispectrum and the fourth-order as trispectrum. The power spectrum is also a 
member of the higher-order spectral class; it is the second-order spectrum. 

HOS consist of higher- order moment spectra, which are defined for deterministic signals, and cumu- 
lant spectra, which are defined for random processes. In general there are three motivations behind the 
use of HOS in signal processing: (1) to suppress Gaussian noise of unknown mean and variance, (2) to 
reconstruct the phase as well as the magnitude response of signals or systems, and (3) to detect and 
characterize nonlinearities in the data. 

The first motivation stems from the property of Gaussian processes to have zero higher-order spectra 
of order greater than 2). Due to this property, HOS are high signal-to-noise ratio domains in which one 
can perform detection, parameter estimation, or even signal reconstruction even if the time domain noise 
is spatially correlated. The same property of cumulant spectra can provide a means of detecting and 
characterizing deviations of the data from the Gaussian model. 
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The second motivation is based on the ability of cumulant spectra to preserve the Fourier phase of 
signals. In the modeling of time series, second-order statistics (autocorrelation) have been heavily used 
because they are the result of least-squares optimization criteria. However, an accurate phase reconstruc- 
tion in the autocorrelation domain can be achieved only if the signal is at minimum phase. Nonminimum 
phase signal reconstruction can be achieved only in the HOS domain, due to the HOS ability to preserve 
the phase. 

Being nonlinear functions of the data, HOS are quite natural tools in the analysis of nonlinear systems 
operating under a random input. General relations for stationary random data passing through an 
arbitrary linear system exist and have been studied extensively. Such expressions, however, are not 
available for nonlinear systems, where each type of nonlinearity must be studied separately. Higher-order 
correlations between input and output can detect and characterize certain nonlinearities [56], and for 
this purpose several higher-order spectra based methods have been developed. 

This chapter is organized as follows. In Section 57.2 definitions and properties of cumulants and 
higher-order spectra are introduced. In section 57.3 methods for estimation of HOS from finite length 
data are presented and the asymptotic statistics of the corresponding estimates are provided. In 
Section 57.4 methods for identification of a linear time-invariant system from HOS of the system output 
are outlined, while in Section 57.6, non-linear system estimation is considered. Section 57.6 summarizes 
the research activity related to HOS as applied to biomedical signal analysis and provides details on a 
particular application, namely the improvement of resolution of medical ultrasound images via HOS is 
discussed. 

57.2 Definitions and Properties of HOS 

In this chapter only random one-dimensional processes are considered. The definitions can be easily 
extended to the two-dimensional case [34]. 

The join moments of order r of the random variables x v ..., x„ are given by [42]: 
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where fc, +...+ k n = r, and cl)(-) is their joint characteristic function. The corresponding joint cumulants 
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For a stationary discrete time random process x(k), (k denotes discrete time), the moments of order n 
are given by: 

m*(x 1 ,x 2 ,...,V 1 ) = £{x(fc)x(fc+T I )-x(*+X B _ 1 )}, (57.3) 

where £{.} denotes expectation. The wth order cumulants are functions of the moments of order up to 
n, i.e., 

1st order cumulants: 

c* = ml = Elxik)] (mean) (57.4) 
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2nd order cumulants: 



3rd order cumulants: 
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4th order cumulants: 
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The general relationship between cumulants and moments can be found in [35]. 

Some important properties of moments and cumulants are summarized next. 

[PI] If x(k) is a jointly Gaussian process, then c* (x p x 2 , ..., x,,^) = for w > 2. In other words, all the 
information about a Gaussian process is contained in its first and second-order cumulants. This property 
can be used to suppress Gaussian noise, or as a measure for non-Gaussianity in time series. 

[P2] If x(k) is symmetrically distributed, then c\ (t,, x 2 ) = 0. Third-order cumulants suppress not only 
Gaussian processes, but also all symmetrically distributed processes, such as uniform, Laplace, and 
Bernoulli-Gaussian. 

[P3] Additivity holds for cumulants. If x(k) = s(k) + w{k), where s(k), w(k) are stationary and 
statistically independent random processes, then c* (t p T 2 , ..., T^J = c'„ (x 1; T 2 , ..., t„_[) + c w n {i v x 2 , ..., 
i n _i)- It is important to note that additivity does not hold for moments. 

If w(k) is a Gaussian representing noise which corrupts the signal of interest, s(k), then by means of 
(P2) and (P3), we conclude that c* (x p x 2 , ..., !„_[) = C s n (x x , T 2> ..., x,,^), for n > 2. In other words, in 
higher-order cumulant domains the signal of interest propagates noise free. Property (P3) can also provide 
a measure of the statistical dependence of two processes. 

[P4] if x(k) has zero mean, then C x t (x v ..., x,,^) = m\ (x p ..., x n _^), for n < 3. 

Higher-order spectra are defined in terms of either cumulants (e.g., cumulant spectra) or moments 
(e.g., moment spectra). 

Assuming that the nth order cumulant sequence is absolutely summable, the nth order cumulant 
spectrum of x(k), C*(to p (0 2 , ..., (Q n _i), exists, and is defined to be the (n-l)-dimensional Fourier 
transform of the nth order cumulant sequence. In general, C\ (co p co 2 , ..., ©„_[) is complex, i.e., it has 
magnitude and phase. In an analogous manner, moment spectrum is the multidimensional Fourier 
transform of the moment sequence. 

If v(k) is a stationary non-Gaussian process with zero mean and nth order cumulant sequence 



<(v--->vi)=y;; 5 (v--->vi)> 
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where 8(.) is the delta function, v{k) is said to be wth order white. Its nth order cumulant spectrum is 
then flat and equal to y x n . 

Cumulant spectra are more useful in processing random signals than moment spectra since they posses 
properties that the moment spectra do not share, i.e., properties PI, P3, and flatness of spectra corre- 
sponding to higher-order white noise. 

57.3 HOS Computation from Real Data 

The definitions of cumulants presented in the previous section are based on expectation operations, and 
they assume infinite length data. In practice we always deal with data of finite length, therefore, the 
cumulants can only be estimated. Two methods for cumulants and spectra estimation are presented next 
for the third-order case. 

Indirect Method: 

Let x(k), k = 0,..., N - 1 be the available data. 

1. Segment the data into K records of M samples each. Let x'(fc), k = 0, ..., M- 1, represent the ith 
record. 

2. Subtract the mean of each record. 

3. Estimate the moments of each segment x'{k) as follows: 



< ( (w 2 )=— L*wi l+x M l+ '**)' 

M /=/, (57.9) 

l 1 = max\Q, -Tp-tJ, l 2 =mJ«lM-l-Tp M-l-x 2 , M -l), \xl< L,\x 2 \< L,i= 1,2, ..., K. 

Since each segment has zero mean, its third-order moments and cumulants are identical, i.e., 
c%(t v t 2 ) = m%x v x 2 ). 
4. Compute the average cumulants as: 



K 
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5. Obtain the third-order spectrum (bispectrum) estimate as the two-dimensional Discrete Fourier 
Transform of size M x M of the windowed (?i(x v x 2 ), i.e., 
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where L < M-l, and w{x v x 2 ) is a two-dimensional window of bounded support, introduced to 
smooth out edge effects. The bandwidth of the final bispectrum estimate is A = 1/L. 

A complete description of appropriate windows that can be used in Eq. 57.1 1 and their properties can 
be found in [35]. A good choice of cumulant window is: 



w(x i ,X 2 )= d(x i )d(x 2 )d(x l -X 2 ), (57.12) 
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where 
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X > L 



which is known as the minimum bispectrum bias supremum [36]. 

Direct Method 

Let x(k), k = 1, ... Nbe the available data. 

1. Segment the data into K records of M samples each. Let x'(k), k = 0, ..., M - 1, represent the z'th 
record. 

2. Subtract the mean of each record. 

3. Compute the Discrete Fourier Transform X'(k) of each segment, based on M points, i.e., 

M-l 

X '{ k ) = 'Yi x '{ n ) e ~'^" k > fc=0,l,...,M-l, i = l,2,...,K. (57.14) 

n = 

4. The third-order spectrum of each segment is obtained as 

C^(\,k 2 ) = — X i {k 1 )x i {k 2 )x it {k 1 + k 2 ),i = l,...,K. (57.15) 

Due to the bispectrum symmetry properties, C% (k p k 2 ) needs to be computed only in the triangular 
region < k 2 < k v k^ +k 2 < M I 2. 

5. In order to reduce the variance of the estimate, additional smoothing over a rectangular window 
of size (M 3 x M 3 ) can be performed around each frequency, assuming that the third order spectrum 
is smooth enough, i.e., 

M 3 /2-l Mj/2-1 

C?(K>K) = ^ £ Y,C;-(k 1 + n l ,k 2 + n 2 ). (57.16) 

3 nj=-M,/2 n 2 =-M 3 /2 

6. Finally, the third-order spectrum is given as the average over all third-order spectra, i.e., 

K 

^(K'K)=^Y,^(^ (57 - 17) 

1=1 

The final bandwidth of this bispectrum estimate is A = M i IM, which is the spacing between 
frequency samples in the bispectrum domain. 

For large N, and as long as 

A^O, and A 2 N ^00 (57.18) 
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both the direct and the indirect methods produce asymptotically unbiased and consistent bispectrum 
estimates, with real and imaginary part variances [50]: 
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where V is the energy of the bispectrum window. 

From the above expressions, it becomes apparent that the bispectrum estimate variance can be reduced 
by increasing the number of records, or reducing the size of the region of support of the window in the 
cumulant domain (L), or increasing the size of the frequency smoothing window (M 3 ), etc. The relation 
between the parameters M, K, L, M 3 should be such that 57.18 is satisfied. 

57.4 Linear Processes 

Let x(k) be generated by exiting a linear, time-invariant (LTI), exponentially stable, mixed-phase system 
with frequency response H(a>) (or impulse response h(k)), with an nth order white, zero-mean, non- 
Gaussian stationary process v(k), i.e., 



x(k) = v{k)*h(k) 



where * denotes convolution. 

The output wth order cumulant equals [10]: 



(57.20) 



<(^>--Vi) = Yl^^)^ +x i)--4 fc+x «-i)' "" 3 



(57.21) 



where y v n is a scalar constant and equals the nth order spectrum of v(k). The output nth order cumulant 
spectrum is given by 



C„^((o 1) a) 2 ,... ) a)„_ 1 ) = Y;H(a) 1 )...H(a)„. 1 )jf*(co 1 + -+a)„. 1 ). 



(57.22) 



For a linear non-Gaussian random process x(k), the nth order spectrum can be factorized as in Eq. 57.22 
for every order n, while for a nonlinear process such a factorization might be valid for some orders only 
(it is always valid for n = 2). 

It can be easily shown that the cumulant spectra of successive orders are related as follows: 



c;(co 1 ,co 2 ,...,o) = C 1 (co 1 ,co 2 ,...,co„_ 2 )h(o)^. 

Y„-i 



(57.23) 



For example, the power spectrum of a non-Gaussian linear process can be reconstructed from the 
bispectrum up to a scalar factor, i.e., 



c;(co,o)=c;(co)4- 

Y 2 



(57.24) 
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While for n = 2, Eq. 57.22 provides the magnitude only of H(co), as a function of C x n (a p co 2 , ..., co,^), 
for n > 2 both the magnitude and phase of H(co) can be obtained. A significant amount of research has 
been devoted to estimating the system phase from the phase of the bispectrum or trispectrum [33], [35], 
[45], and [47]. 

Let 



(k)=x(k) + w(k) (57.25) 



y\ 

where x{k) was defined in Eq. 57.20, and w{k) is zero-mean Gaussian noise, uncorrelated to x(k). Methods 
for reconstructing h(k) from y{k) can be divided into two main categories: non-parametric and para- 
metric. 

Non-Parametric Methods 

Non-parametric methods reconstruct the system by recovering its Fourier phase and magnitude. Some 
utilize the whole bispectrum (or polyspectrum) information [33], [7], [43], [53], [41], [30], and others 
use fixed, one-dimensional polyspectrum slices [31], [13]. In [45] and [49] methods for system recovery 
from arbitrarily selected HOS slices have been proposed. A system method based on preselected polyspec- 
trum slices, allows regions where polyspectrum estimates exhibit high variance, or regions where the 
ideal polyspectrum is expected to be zero, such as in the case of bandlimited systems to be avoided. 
Assuming that there existed a mechanism to select "good" slices, it can also lead to computational savings. 
Let the bispectrum of y(k) in Eq. 57.25, i.e., C^((o v co 2 ), be discretized on a square grid of size 2n/N x 
2n/N, and let \C\(m, I), m = 0, ..., N-l}, {O'^m, I + r),m = 0,..., N-l} be two bispectral slices. In [49] 
it was shown that unique identification of the system impulse response can be performed base on two 
horizontal slices of the discretized third-order spectrum of the system output, as long as the distance 
between slices, i.e., r and N are co-prime numbers. Let 

h= logH(l),...,logH(iV-l) ((n-i)xi) (57.26) 

where H{k) is the Discrete Fourier Transform of h( n). Then h can be obtained as the solution of the system: 

Ah=b, (57.27) 

where 

fo(fc) = logC* [-k- r- 1, l)-logC h } [-k- r- 1, 1+ A + c lr 

k = 0,...,N-l (57.28) 



N-l 

^£[to g c,'(fci+r)-iogq(fczy 



'- N 

t = 

and A is a [(N— 1) x (N— 1)] sparse matrix; its first row contains a 1 at the rth column and 0s elsewhere. 
The fcth row contains a -1 at column (k - l)moduloN, and a 1 at column (k + r— l)moduloN. Although 
b(k) requires the computation of the logarithm of a complex quantity, it was shown in [49], that the log- 
bispectrum can be computed based on the principal argument of the bispectral phase, thus bypassing 
the need for two-dimensional phase unwrapping. The bispectrum needed in this method can be substi- 
tuted with a bispectral estimate, obtained with any of the methods outlined in section 57.3. Reduction 
of the HOS estimate variance can be achieved via a low-rank approach [4], [9], and [8]. 
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The above described reconstruction method can be easily extended to any order spectrum [49] . Matlab 
code for this method can be found at http://www.ece.drexel.edu/CSPL. 

Parametric Methods 

These methods fit a parametric model to the data x(k). For example, let us fit a real autoregressive moving 
average (ARMA) model to x(k), i.e., 



£4M fc-i ) = LK ; 'MM (57 - 29) 

where v(k) is a stationary zero-mean wth order white non-Gaussian process, and a(i) and b{j) represent 
the AR and MA parameters. Let y(k) be as defined in Eq. 57 '.25. For estimation of the AR parameters 
from y(k), equations analogous to the Yule- Walker equations can be derived based on third-order 
cumulants of y(k), i.e., 



J\(;)c 3 '(t-m) = 0,T><?, (57.30) 



V 

^a(i)c y 3 (x-i, ]) = -cl[%, j),x> q, (57.31) 

where it was assumed a(0) = 1. Concatenating Eq. 57.31 for % = q+ 1, ..., q + M, M > 0, and j = q — p, 
..., q, the matrix equation 

Ca=c, (57.32) 

can be formed, where C and c are a matrix and a vector, respectively, formed by third-order cumulants 
of the process according to Eq. 57.31, and the vector a contains the AR parameters. If the AR order p is 
unknown and Eq. 57.32 is formed based on an overestimate of p, the resulting matrix C always has rank p. 
In this case the AR parameters can be obtained using a low-rank approximation of C [16]. 

Using the estimated AR parameters, a(i), i = 1,..., p, a, pth order filter with transfer function A(z) = 
1 + El =1 a(;)z~' can be constructed. Based on the filtered-through A(z) process y(k), i.e., y{k), or otherwise 
known as the residual time series [16], the MA parameters can be estimated via any MA method [34], 
for example: 



/ x cUq,k) 

b[k ) = — -) — f, k=0,l,...,q (57.33) 



known as the c{q, k) formula [15]. 

Practical problems associated with the described approach are sensitivity to model order mismatch, 
and AR estimation errors that propagate in the estimation of the MA parameters. Additional parametric 
methods can be found in [5], [14], [34], [35], [44]. [63], [64]. 
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FIGURE 57.1 Second-order Volterra system. Linear and quadratic parts are connected in parallel. 



57.5 Nonlinear Processes 

Despite the fact that progress has been established in developing the theoretical properties of nonlinear 
models, only a few statistical methods exist for detection and characterization of nonlinearities from a 
finite set of observations. In this section we will consider nonlinear Volterra systems excited by Gaussian 
stationary inputs. Let y(k) be the response of a discrete time invariant, pth order, Volterra filter whose 
input is x(k). Then, 



y( k )= h o + y, y^i>-> ^ x ( k ~ x }" x ( k ~ x )' 



(57.34) 



where h i {x v ..., x,) are the Volterra kernels of the system, which are symmetric functions of their argu- 
ments; for causal systems /z^Xp... x,) = for any x, < 0. 

The output of a second-order Volterra system when the input is zero-mean stationary is 

y{ k )=K + Y. h \ x H k - % ) + Y.Y. h l^ x H k - x H k -^\ (57 - 35) 



Equation 57.35 can be viewed as a parallel connection of a linear system ^(xj and a quadratic system 
h 2 (x v x 2 ) as illustrated in Fig. 57.1. Let 



c^(x)=EUk+x)\y(k)-n 
be the cross-covariance of input and output, and 

C 3 ^(T 1 ,T 2 ) = £{x(fc+T 1 )^(fc+X 2 )[ y (fc)-<]} 

be the third-order cross-cumulant sequence of input and output. 
The system's linear part can be obtained by [61] 



(57.36) 



(57.37) 



^(-co)- 



C: 



(-)" 



(57.38) 
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and the quadratic part by 

HJ-co^-co.U ", \ ' , (57.39) 

21 2; 2C*(cd 1 )c/(co 2 ) 

where Cf(o)) and C'f ((Oy (0 2 ) are the Fourier transforms of cf (t) and (^(Tj, t 2 ), respectively. It should 
be noted that the above equations are valid only for Gaussian input signals. More general results assuming 
non-Gaussian input have been obtained in [22] and [48]. Additional results on particular nonlinear 
systems have been reported in [11] and [54]. 

An interesting phenomenon, caused by a second-order nonlinearity, is the quadratic-phase coupling. 
There are situations where nonlinear interaction between two harmonic components of a process con- 
tribute to the power of the sum and/or difference frequencies. The signal 

xlk) = Acosh, 1 k + Q l ) + Bcosh, 2 k + Q 2 ) (57.40) 

after passing through the quadratic system: 

zl k) = x( k) + ex 2 1 k\ei=0, (57.41) 

contains cosinusoidal terms in (X v 6J, (X 2 ,Q 2 ), (2X l 20j), (2X 2 , 20 2 ), (A,, + X 2 , 8j + 2 ), {X l - X 2 , Q l — 9 2 ). 
Such a phenomenon that results in phase relations that are the same as the frequency relations is called 
quadratic phase coupling [28]. Quadratic phase coupling can arise only among harmonically related 
components. Three frequencies are harmonically related when one of them is the sum or difference of 
the other two. Sometimes it is important to find out if peaks at harmonically related positions in the 
power spectrum are in fact phase coupled. Due to phase suppression, the power spectrum is unable to 
provide an answer to this problem. As an example, consider the process [51] 



xlk) = 'Y\cosb, i k+^) (57.42) 



where X l > X 2 > 0, X 4 + X 5 > 0, X 3 = X l + X 2 , X 6 = X 4 + X 5 , § v ... § 5 are all independent, uniformly 
distributed random variables over (0, 2rt), and <(> 6 = ()) 4 + <|> 5 . Among the six frequencies (X v X 2 , X^) and 
(X v X 5 , X 6 ) are harmonically related, however, only A 6 is the result of phase coupling between A 4 and X 5 . 
The power spectrum of this process consists of six impulses at X iy i, ,.., 6 (Fig. 57.2), offering no indication 
whether each frequency component is independent or a result of frequency coupling. On the other hand, 
the bispectrum of x(k), (evaluated in its principal region) is zero everywhere, except at point (X 4 , X 5 ) of 
the (Kip co 2 ) plane, where it exhibits an impulse (Fig. 57.2b). The peak indicates that only X 4 , X 5 are phase 
coupled. 

The biocherence index, defined as 

^Kco 2 )= rrnM v (57 - 43) 

has been extensively used in practical situations for the detection and quantification of quadratic phase 
coupling [28]. The value of the bicoherence index at each frequency pair indicates the degree of coupling 
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FIGURE 57.2 Quadratic phase coupling, (a) The power spectrum of the process described in Eq. 57.42 cannot 
determine what frequencies are coupled, (b) The corresponding magnitude bispectrum is zero everywhere in the 
principle region, except at points corresponding to phase coupled frequencies. 



among the frequencies of that pair. Almost all bispectral estimators can be used in Eq. 57.43. However, 
estimates obtained based on parametric modeling of the bispectrum have been shown to yield resolution 
superior [51 and 52] to the ones obtained with conventional methods. 

57.6 HOS in Biomedical Signal Processing 

The applications of HOS on biomedical signals are clustered according to the HOS property they most 
rely on, i.e., (1) the ability to describe non-Gaussian processes and preserve phase, (2) the Gaussian noise 
immunity, and (3) the ability to characterize nonlinearities. 

In the first class are the works of [ 1 ] , [3], and [60], where ultrasound imaging distortions are estimated 
from the ultrasound echo and subsequently compensated for, to improve the diagnostic quality of the 
image. HOS have been used in modeling the ultrasound radio-frequency (RF) echo [2], where schemes 
for estimation of resolvable periodicity as well as correlations among nonresolvable scatters have been 
proposed. The "tissue color," a quantity that describes the scatterer spatial correlations, and which can 
be obtained from the HOS of the RF echo, has been proposed [2] as a tissue characterization feature. 
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The skewness and kurtosis of mammogram images have been proposed in [18] as a tool for detecting 
microcalcifications in mammograms. 

In the second class are methods that process multicomponent biomedical signals, treating one com- 
ponent as the signal of interest and the rest as noise. HOS have been used in [19] to process lung sounds 
in order to suppress sound originating from the heart, and in [65] to detect human afferent nerve signals, 
which are usually observed in very poor signal-to-noise ratio conditions. 

Most of the HOS applications in the biomedical area belong to the third class, and usually investigate 
the formation and the nature of frequencies present in signals through the presence of quadratic phase 
coupling (QPC). The bispectrum has been applied in EEG signals of the rat during various vigilance states 
[37], where QPC between specific frequencies was observed. QPC changes in auditory evoked potentials 
of healthy subjects and subjects with Alzheimer's dementia has been reported [55] . Visual evoked potentials 
have been analyzed via the bispectrum in [59], [24], and [21]. Bispectral analysis of interactions between 
electrocerebral activity resulting from stimulation of the left and right visual fields revealed nonlinear 
interactions between visual fields [57]. The bispectrum has been used in the analysis of electromyogram 
recruitment [66], and in defining the pattern of summation of muscle fiber twitches in the surface 
mechanomyogram generation [40]. QPC was also observed in the EEG of humans [6] and [23]. 

In the sequel, presented in some detail are the application of HOS on improving the resolution of 
ultrasound images, a topic that has recently attracted a lot of attention. 

Ultrasonic imaging is a valuable tool in the diagnosis of tumors of soft tissues. Some of the distinctive 
features of ultrasound are its safe, non-ionizing acoustic radiation, and its wide availability as a low cost, 
portable equipment. The major drawback that limits the use of ultrasound images in certain cases, (e.g., 
breast imaging) is poor resolution. In B-Scan images, the resolution is compromised due to: (1) the finite 
bandwidth of the ultrasonic transducer, (2) the non-negligible beam width, and (3) phase aberrations 
and velocity variations arising from acoustic inhomogeneity of tissues themselves. The observed ultra- 
sound image can be considered as a distorted version of the true tissue information. Along the axial 
direction the distortion is dominated by the pulse-echo wavelet of the imaging system, while along the 
lateral direction the distortion is mainly due to finite-width lateral beam profile. Provided that these 
distortions are known in advance, or non-invasively measurable, their effects can be compensated for in 
the observed image. With propagation in tissue, however, both axial and lateral distortions change due 
to the inhomogeneities of the media and the geometry of the imaging system. They also change among 
different tissue types and individuals. Distortions measure in a simplified setting, e.g. in a water tank, 
are rarely applicable to clinical images, due to the effects of tissue-dependent components. Therefore, 
distortions must be estimated based on the backscattered RF data that lead to the B-mode image. 

Assuming a narrow ultrasound beam, linear propagation and weak scattering, the ultrasonic RF echo, 
y,(n), corresponding to the z'th axial line in the B-mode image is modeled as [1]: 

y.(n) = h t [n) * f i [n) + w i [n),i= 1,2,..., (57.44) 

where n is the discrete time; w,( n) is observation noise; f{ n) represents the underlying tissue structure 
and is referred to as tissue response; and fy(w) represents the axial distortion kernel. Let us assume that: 
(Al) y,(«) is non-Gaussian; (A2) f^n) is white, non-Gaussian random process; (A3) wf^n) is Gaussian 
noise uncorrelated with f^n); and (A4) y,(n) is a short enough segment, so that the attenuation effects 
stay constant over its duration. (Long segments of data can be analyzed by breaking them into a sequence 
of short segments.) A similar model can be assumed to hold in the lateral direction of the RF image. 

Even though the distortion kernels were known to be non-minimum phase [26], their estimation was 
mostly carried out using second-order statistics (SOS), such as autocorrelation or power spectrum, 
thereby neglecting Fourier phase information. Phase is important in preserving edges and boundaries 
in images. It is particularly important in medical ultrasound images where the nature of edges of a tumor 
provide important diagnostic information about the malignancy. Complex cepstrum-based operations 
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that take phase information into account have been used [60] to estimate distortions. HOS retain phase 
and in addition, are not as sensitive to singularities as the cepstrum. HOS were used [1] for the first time 
to estimate imaging distortions from B-scan images. It was demonstrated that the HOS-based distortion 
estimation and subsequent image deconvolution significantly improved resolution. For the case of breast 
data, in was demonstrated [3] that deconvolution via SOS-based distortion estimates was not as good 
as its HOS counterpart. 

In the following we present some results of distortion estimation followed by deconvolution of clinical 
B-scan images of human breast data. The data were obtained using a flat linear array transducer with a 
nominal center frequency of 7.5 MHz on a clinical imaging system UltraMark-9 Advanced Technology 
Laboratories. Data were sampled at a rate of 20 MHz. Figure 57.3(a) shows parts of the original image, 
where the the logarithm of the envelope has been used for display purposes. Axial and lateral distortion 
kernels were estimated from the RF data via the HOS-based non-parametric method outlined in 
Section 57.4 of this chapter [49], and also via an SOS power cepstrum based method [3]. Each kernel 
estimate was obtained from a rectangular block of RF data described by (x, y, N x , N) where (x, y) are 
the co-ordinates of the upper left corner of the block, N x is its lateral width, and N is the axial height. 
Note that y corresponds to the depth of the upper left corner of the image block from the surface of the 
transducer. In the following, all dimensions are specified in sample numbers. The size of the images used 
was 192 x 1024. Assuming that within the same block the axial distortion kernel does not significantly 
depend on the lateral location of the RF data, all axial RF data in the block can be concatenated to form 
a longer one-dimensional vector. In both HOS and SOS based axial kernel estimations, it was assumed 
that N x = 10, N = 128, N = 128. Note that the N y = 128 samples correspond to 5mm in actual tissue 
space, as is reasonable to assume that attenuation over such a small distance may be assumed constant. 
Fifty kernels were estimated from the blocks (x, 400, 10, 128), the x taking values in the range 1...50. 
Lateral kernels were estimated from the same images with parameters N v = 192, N =10, and N = 64. All 
the kernels were estimated from the blocks (1, y, 192, 10), the y taking values in the range 400... 600 
increasing each time by 5. Data from all adjacent lateral image lines in the block were concatenated to 
make a long one-dimensional vector. Note that N = 10 corresponds to a very small depth, 0.4mm in the 
real tissue space, hence he underlying assumption that the lateral distortion kernel does not vary much 
over this depth, should be reasonable. 

Figure 57.3(b-c) show that the result of lateral followed by axial deconvolution of the image of 
Fig. 57.3(a), using respectively HOS- and SOS-based distortion kernel estimates. The deconvolution was 
performed via the constrained Wiener Filter technique [62]. According to Fig. 57.3, the deconvolution 
in the RF-domain resulted in a significant reduction in speckle size. The speckle size appears smaller in 
the HOS-deconvolved image than in SOS-deconvolved ones, which is evidence of higher resolution. The 
resolution improvement can be quantified based on the width of the main lobe of the autocovariance of 
the image. The axial resolution gain for the HOS-based approach was 1.8 times that of the SOS-based 
approach. The lateral resolution gain for the HOS-based approach was 1.73 times as much. From a 
radiologist's point of view, overall, the HOS image appears to have better spatial resolution than the 
original as well as the SOS deconvolved images. Conversely, the SOS image seems to have incurred a loss 
of both axial and lateral resolution. 
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FIGURE 57.3 (a) Original image, (b) HOS-based deconvolution, (c) SOS-based deconvolution. In all cases the 
logarithm of the envelope is displayed. Autocovariance plots for (d) original (e) HOS deconvolved and (f) SOS 
deconvolved RF data. 
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Computing with neural networks (NNs) is one of the faster growing fields in the history of artificial 
intelligence (AI), largely because NNs can be trained to identify nonlinear patterns between input and 
output values and can solve complex problems much faster than digital computers. Owing to their wide 
range of applicability and their ability to learn complex and nonlinear relationships — including noisy or 
less precise information — NNs are very well suited to solving problems in biomedical engineering and, 
in particular, in analyzing biomedical signals. 

NNs have made strong advances in continuous speech recognition and synthesis, pattern recognition, 
classification of noisy data, nonlinear feature detection, and other fields. By their nature, NNs are capable 
of high-speed parallel signal processing in real time. They have an advantage over conventional technol- 
ogies because they can solve problems that are too complex — problems that do not have an algorithmic 
solution or for which an algorithmic solution is too complex to be found. NNs are trained by example 
instead of rules and are automated. When used in medical diagnosis, they are not affected by factors 
such as human fatigue, emotional states, and habituation. They are capable of rapid identification, analysis 
of conditions, and diagnosis in real time. 

The most widely used architecture of an NN is that of a multilayer perceptron (MLP) trained by an 
algorithm called backpropagation (BP). Backpropagation is a gradient-descent algorithm that tries to minimize 
the average squared error of the network. In real applications, the network is not a simple one-dimensional 
system, and the error curve is not a smooth, bowl-shaped curve. Instead, it is a highly complex, multidimen- 
sional curve with hills and valleys (for a mathematical description of the algorithm, see Chapter 182). 

BP was first developed by P. Werbos in 1974 [1], rediscovered by Parker in 1982 [2], and popularized 
later by Rummelhart et al. in 1986 [3]. There exist many variations of this algorithm, especially trying to 
improve its speed and performance in avoiding getting stuck into local minima — one of its main drawbacks. 

In my work, I use the ALOPEX algorithm developed by my colleagues and myself (see Chapter 182) 
[4-10], and my colleagues and I have applied it in a variety of world problems of considerable complexity. 
This chapter will examine several applications of NNs in biomedical signal processing. One- and two- 
dimensional signals are examined. 
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58.1 Neural Networks in Sensory Waveform Analysis 

Mathematical analysis of the equations describing the processes in NNs can establish any dependencies 
between quantitative network characteristics, the information capacity of the network, and the proba- 
bilities of recognition and retention of information. It has been proposed that electromyographic (EMG) 
patterns can be analyzed and classified by NNs [11] where the standard BP algorithm is used for 
decomposing surface EMG signals into their constituent action potentials (APs) and their firing patterns 
[12]. A system such as this may help a physician in diagnosing time-behavior changes in the EMG. 

The need for a knowledge-based system using NNs for evoked potential recognition was described in 
a paper by Bruha and Madhavan [13]. In this paper, the authors used syntax pattern-recognition algo- 
rithms as a first step, while a second step included a two-layer perceptron to process the list of numerical 
features produced by the first step. 

Myoelectric signals (MES) also have been analyzed by NNs [ 14] . A discrete Hopfield network was used 
to calculate the time-series parameters for a moving-average MES. It was demonstrated that this network 
was capable of producing the same time-series parameters as those produced by a conventional sequential 
least-squares algorithm. In the same paper, a second implementation of a two-layered perceptron was 
used for comparison. The features used were a time-series parameter and the signal power in order to 
train the perceptron on four separate arm functions, and again, the network performed well. 

Moving averages have been simulated for nonlinear processes by the use of NNs [15]. The results 
obtained were comparable with those of linear adaptive techniques. 

Moody and colleagues [16] used an adaptive approach in analyzing visual evoked potentials. This 
method is based on spectral analysis that results in spectral peaks of uniform width in the frequency 
domain. Tunable data windows were used. Specifically, the modified Bessel functions I — sin h, the gaus- 
sian, and the cosine-taper windows are compared. The modified Bessel function window proved to be 
superior in classifying normal and abnormal populations. 

Pulse-transmission NNs — networks that consist of neurons that communicate with other neurons via 
pulses rather than numbers — also have been modeled [7,17] . This kind of network is much more realistic, 
since, in biological systems, action potentials are the means of neuronal communication. Dayhoff [18] 
has developed a pulse-transmission network that can temporally integrate arriving signals and also display 
some resistance to temporal noise. 

Another method is optimal filtering, which is a variation of the traditional matched filter in noise 
[19]. This has the advantage of separating even overlapping waveforms. It also carries the disadvantage 
that the needed knowledge of the noise power spectral density and the Fourier transform of the spikes 
might not be always available. 

Principal-components analysis also has been used. Here the incoming spike waveforms are represented by 
templates given by eigenvectors from their average autocorrelation functions [20]. The authors found that 
two or three eigenvectors are enough to account for 90% of the total energy of the signal. This way each 
spike can be represented by the coordinates of its projection onto the eigenvector space. These coordinates 
are the only information needed for spike classification, which is further done by clustering techniques. 

Multineuronal Activity Analysis 

When dealing with single- or multineuron activities, the practice is to determine how many neurons (or 
units) are involved in the "spike train" evoked by some sensory stimulation. Each spike in a spike train 
represents an action potential elicited by a neuron in close proximity to the recording electrode. These 
action potentials have different amplitudes, latencies, and shape configurations and, when superimposed 
one each other, create a complex waveform — a composite spike. The dilemma that many scientists face 
is how to decompose these composite potentials into their constituents and how to assess the question 
of how many neurons their electrode is recording from. One of the most widely used methods is window 
discrimination, in which different thresholds are set, above which the activity of any given neuron is 
assigned, according to amplitude. Peak detection techniques also have been used [21]. These methods 
perform well if the number of neurons is very small and the spikes are well separated. Statistical methods 
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of different complexity also have been used [22-24] involving the time intervals between spikes. Each 
spike is assigned a unique instant of time so that a spike train can be described by a process of time 
points corresponding to the times where the action potential had occurred. Processes such as these are 
called point processes, since they are characterized by only one number. Given this, a spike train can be 
treated as a stochastic point process that may or may not be stationary. In the former case, its statistics 
do not vary in the time of observation [25]. In the second case, when nonstationarity is assumed, any 
kind of statistical analysis becomes formidable. 

Correlations between spike trains of neurons can be found because of many factors, but mostly because 
of excitatory or inhibitory interactions between them or due to a common input to both. Simulations 
on each possibility have been conducted in the past [26]. In our research, when recording from the optic 
tectum of the frog, the problem is the reversed situation of the one given above. That is, we have the 
recorded signal with noise superimposed, and we have to decompose it to its constituents so that we can 
make inferences on the neuronal circuitry involved. What one might do would be to set the minimal 
requirements on a neural network, which could behave the same way as the vast number of neurons that 
could have resulted in a neural spike train similar to the one recorded. 

This is a very difficult problem to attack with no unique solution. A method that has attracted attention 
is the one developed by Gerstein et al. [22,27]. This technique detects various functional groups in the 
recorded data by the use of the so-called gravitational clustering method. Although promising, the analysis 
becomes cumbersome due to the many possible subgroups of neurons firing in synchrony. Temporal 
patterns of neuronal activity also have been studied with great interest. Some computational methods 
have been developed for the detection of favored temporal patterns [28-31]. My group also has been 
involved in the analysis of complex waveforms by the development of a novel method, the ST-scan method 
[32], This method is based on well-known tomographic techniques, statistical analysis, and template 
matching. The method proved to be very sensitive to even small variations of the waveforms due to the 
fact that many orientations of them are considered, as it is done in tomographic imaging. Each histogram 
represents the number of times a stroke vector at a specific orientation is cutting the composite waveform 
positioned at the center of the window. These histograms were then fed to a NN for categorization. The 
histograms are statistical representations of individual action potentials. The NN therefore must be able 
to learn to recognize histograms by categorizing them with the action potential waveform that they 
represent. The NN also must be able to recognize any histogram as belonging to one of the "learned" 
patterns or not belonging to any of them [33]. In analyzing the ST-histograms, the NN must act as an 
"adaptive demultiplexer." That is, given a set of inputs, the network must determine the correspondingly 
correct output. This is a categorization procedure performed by a perceptron, originally described by 
Rosenblatt [34]. In analyzing the ST-histograms, the preprocessing is done by a perceptron, and the error 
is found either by an LMS algorithm [35] or by an ALOPEX algorithm [36,39,40]. 

Visual Evoked Potentials 

Visual evoked potentials (VEPs) have been used in the clinical environment as a diagnostic tool for many 
decades. Stochastic analysis of experimental recordings of VEPs may yield useful information that is not 
well understood in its original form. Such information may provide a good diagnostic criterion in 
differentiating normal subjects from subjects with neurological diseases as well as provide an index of 
the progress of diseases. 

These potentials are embedded in noise. Averaging is then used in order to improve the signal-to- 
noise (S/N) ratio. When analyzing these potentials, several methods have been used, such as spectral 
analysis of their properties [42], adaptive filtering techniques [43,44], and some signal enhancers, again 
based on adaptive processes [45]. In this latter method, no a priori knowledge of the signal is needed. 
The adaptive signal enhancer consists of a bank of adaptive filters, the output of which is shown to be a 
minimum mean-square error estimate of the signal. 

If we assume that the VEP represents a composite of many action potentials and that each one of these 
action potentials propagates to the point of the VEP recording, the only differences between the various 
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action potentials are their amplitudes and time delays. The conformational changes observed in the VEP 
waveforms of normal individuals and defected subjects can then be attributed to an asynchrony in the 
arrival of these action potentials at a focal point (integration site) in the visual cortex [36]. One can 
simulate this process by simulating action potentials and trying to fit them to normal and abnormal 
VEPs with NNs. 

Action potentials were simulated using methods similar to those of Moore and Ramon [37] and Bell 
and Cook [38] . Briefly, preprocessing of the VEP waveforms is done first by smoothing a five-point filter 
that performs a weighted averaging over its neighboring points: 



s(n)= F(«-2) + 2i ? («-l) + 3F(n) + 2P'(« + l)+i ; '(« + 2) 



(58.1) 



The individual signals V: are modulated so that at the VEP recording site each V: has been changed in 
amplitude am(j) and in phase ph(j). The amplitude change represents the propagation decay, and the 
phases represent the propagation delays of signals according to the equation 



v.l i\ = amiu ■ AP i—phlj\ 



(58.2) 



For a specific choice of am(J) and ph{j), j = 1,2, ... , N, the simulated VEP can be found by 



VEP=fc+F 



(58.3) 



where k is a scaling factor, b is a dc component, and a is a constant [39,40]. 

The ALOPEX process was used again in order to adjust the parameters (amplitude and phase) so that 
the cost function reaches a minimum and therefore the calculated waveform coincides with the experi- 
mental one. 

The modified ALOPEX equation is given by 



pirn = pin- 1) + yApi n\ Ar( n) + \irl n\ 



(58.4) 



where pj{n) are the parameters at iteration n, and y and (j, are scaling factors of the deterministic and 
random components, respectively, which are adjusted so that at the beginning y is small and li is large. 
As the number of iterations increases, y increases while (a, decreases. The cost function R is monitored 
until convergence has been achieved at least 80% or until a preset number of iterations has been covered. 

The results obtained show a good separation between normal and abnormal VEPs. This separation is 
based on an index X, which is defined as the ratio of two summations, namely, the summation of 
amplitudes whose ph(i) is less than 256 ms and the summation of amplitudes whose ph(i) is greater than 
256 ms. A large value of X indicates an abnormal VEP, while a small X indicates a normal VEP. 

The convergences of the process for a normal and an abnormal VEP are shown in Figs. 58.1 and 58.2, 
respectively, at different iteration numbers. The main assumption here is that in normal individuals, the 
action potentials all arrive at a focal point in the cortex in resonance, while in abnormal subjects there 
exists as asynchrony of the arrival times. Maier and colleagues [41] noted the importance of source 
localization of VEPs in humans in studying the perceptual behavior in humans. The optimization proce- 
dure used in this section can help that task, since individual neuronal responses are optimally isolated. 
One of the interesting points is that signals (action potentials) with different delay times result in composite 
signals of different forms. Thus the reverse solution of this problem, i.e., extracting individual signals from 
a composite measurement, can help resolve the problem of signal source localization. The multiunit 
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(c) 



FIGURE 58.1 Normal VEP. (a) The 
fitting at the beginning of the process, 
(b) after 500 iterations, and (c) after 
1000 iterations. Only one action poten- 
tial is repeated 1000 times. The x axis is 
xlO ms; the y axis is in millivolts. 
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(c) 



FIGURE 58.2 Abnormal VEP. ( a) Fit- 
ting at t = 3 iterations, (b) after 500 
iterations, and (c) after 2000 iterations. 
One action potential was used 1000 
times. 
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recordings presented in the early sections of this paper fall in the same category with that of decomposing 
VEPs. This analysis might provide insight as to how neurons communicate. 

58.2 Neural Networks in Speech Recognition 

Another place where NNs find wide application is in speech recognition. Tebelski et al. [46] have studied 
the performance of linked predictive NNs (LPNNs) for large-vocabulary, continuous-speech recognition. 
The authors used a six-state phoneme topology, and without any other optimization, the LPNN achieved 
an average of 90%, 58%, and 39% accuracy on tasks with perplexity of 5, 11, and 402, respectively, which 
is better than the performance of several other simpler NNs tested. These results show that the main 
advantages of predictive networks are mainly that they produce nonbinary distortion measures in a simple 
way and that they can model dynamic properties such as those found in speech. Their weakness is that 
they show poor discrimination, which may be corrected by corrective training and function work 
modeling. 

Allen and Kanam [47] designed in NN architecture that locates word boundaries and identifies words 
from phoneme sequences. They tested the model in three different regimes with a highly redundant 
corpus and a restricted vocabulary, and the NN was trained with a limited number of phonemic variations 
for the words in the corpus. These tests yielded a very low error rate. In a second experiment, the network 
was trained to identify words from expert transcriptions of speech. The error rate for correct simultaneous 
identification of words and word boundaries was 18%. Finally, they tested the use of the output of a 
phoneme classifier as the input to the word and word boundary identification network. The error rate 
increased almost exponentially to 49%. 

The best discrimination came from hybrid systems such as the use of an MLP and a hidden Markov 
model (HMM). These systems incorporate multiple sources of evidence such as features and temporal 
context without any restrictive assumptions of distributions or statistical independence. Bourland et al. 
[48] used MLPs as linear predictions for autoregressive HMMs. This approach, although more compatible 
with the HMM formalism, still suffers from several limitations. Although these authors generalized their 
approach to take into account time correlations between successive observations without any restrictive 
assumptions about the driving noise, the reputed results show that many of the tricks used to improve 
standard HMMs are also valid for this hybrid approach. 

In another study, Intrator [49] used an unsupervised NN for dimensionality reduction that seeks 
directions emphasizing multimodality and derived a new statistical insight to the synaptic modification 
equations learning as in the Bienenstock, Cooper, and Munro (BCM) neurons [50]. The speech data 
consisted of 20 consecutive time windows of 32 ms with 30-ms overlap aligned at the beginning of the 
time. For each window, a set of 22 energy levels was computed corresponding to the Zwicker critical- 
band filters [51]. The classification results were compared with those of a backpropagation network. 
These results showed that the backpropagation network does well in finding structures useful for classi- 
fication of the trained data, but these structures are more sensitive to voicing. Classification results using 
a BCM network, on the other hand, suggest that for the specified task, structures that are more sensitive 
to voicing can be extracted, even though voicing imposes several effects on the speech signal. These 
features are more speaker-invariant. 

Phan et al. [52] have attempted to solve the "cocktail party effect," which describes phenomena in 
which humans can selectively focus attention on one sound source among competing sound sources. 
This is an ability that is hampered for the hearing-impaired. A system was developed that successfully 
identifies a speaker in the presence of competing speakers for short utterances. Features used for identi- 
fication are monaural, whose feature space represent a 90% data reduction from the original data. This 
system is presently used off-line and also has been applied successfully to intraspeaker speech recognition. 
The features used in the preprocessing were obtained by wavelet analysis. This multiresolution analysis 
decomposes a signal into a hierarchical system of subspaces that are one-dimensional and are square 
integrable. Each subspace is spanned by basis functions that have scaling characteristics of either dilation 
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or compression depending on the resolution. The implementation of these basis functions is incorporated 
in a recursive pyramidal algorithm in which the discrete approximation of a current resolution is 
convolved with quadrature mirror filters in the subsequent resolution [53]. After preprocessing, speech 
waveform is analyzed by the wavelet transform. Analysis is limited to four octaves. For pattern-recognition 
input configuration, the wavelet coefficients are mapped to a vector and used as inputs to a neural network 
trained by the ALOPEX algorithm. White noise was superimposed on the features as well in order to 
establish a threshold of robustness of the algorithm. 

58.3 Neural Networks in Cardiology 

Two-dimensional echocardiography is an important noninvasive clinical tool for cardiac imaging. The 
endocardial and epicardial boundaries of the left ventricle (LV) are very useful quantitative measures of 
various functions of the heart. Cardiac structures detection in ECG images are also very important in 
recognizing the image parts. 

A lot of research in the last couple of years has taken place around these issues and the application of 
neural networks in solving them. Hunter et al. [54] have used NNs in detecting echocardiographic LV 
boundaries and the center of the LV cavity. The points detected are then linked by a "snake" energy- 
minimizing function to give the epicardial and endocardial boundaries. A snake is an energy-minimizing 
deformable curve [55] fined by an internal energy that is the controller of the differential properties in 
terms of its curvature and its metrics. The most robust results were obtained by a 9 x 9 square input 
vector with a resolution reduction of 32:1. Energy minimization is carried out by simulated annealing. 
The minimum energy solution was obtained after 1000 iterations over the entire snake. The use of NNs 
as edge detectors allows the classification of points to be done by their probability of being edges rather 
than by their edge strength, a very important factor for echocardiographic images due to their wide 
variations in edge strength. 

A complex area in electrocardiography is the differentiation of wide QRS tachycardias in ventricular 
(VT) and supraventricular tachycardia (SVT). A set of criteria for this differentiation has been proposed 
recently by Brugada et al. [56]. One important aspect of applying NNs in interpreting ECGs is to use 
parameters that not only make sense but are also meaningful for diagnosis. Dassen et al. [57] developed 
an induction algorithm that further improved the criteria set by Brugada et al., also using NNs. 

Nadal and deBossan [58] used principal-components analysis (PCA) and the relative R-wave to R-wave 
intervals of P-QRS complexes to evaluate arrhythmias. Arrhythmias are one of the risks of sudden cardiac 
arrest in coronary care units. In this study, the authors used the first 10 PCA coefficients (PCCs) to reduce 
the data of each P-QRS complex and a feedforward NN classifier that splits the vector space generated 
by the PCCs into regions that separate the different classes of beats in an efficient way. They obtained 
better classification than using other methods, with correct classification of 98.5% for normal beats, 
98.5% for ventricular premature beats, and 93.5% for fusion beats, using only the first two PCCs. When 
4 PCCs were used, these classifications improved to 99.2%, 99.1%, and 94.1%, respectively. These results 
are better than those obtained by logistic regression when the input space is composed of more than two 
classes of beats. The difficulties encountered include the elimination of redundant data in order not to 
overestimate the importance of normal beat detection by the classifier. 

In another work, Silipo et al. [59] used an NN as an autoassociator. In previous work they had proved 
that NNs had better performances than the traditional clustering and statistical methods. They considered 
beat features derived from both morphologic and prematurity information. Their classification is ade- 
quate for ventricular ectopic beats, but the criteria used were not reliable enough to characterize the 
supraventricular ectopic beat. 

A lot of studies also have used NNs for characterization of myocardial infarction (MI). Myocardial 
infarction is one of the leading causes of death in the United States. The currently available techniques 
for diagnosis are accurate enough, but they suffer from certain drawbacks, such as accurate quantitative 
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FIGURE 58.3 Selection of region of interest. The intensity pattern in the box is used as input to the NN. 



measure of severity, extent, and precise location of the infarction. Since the acoustic properties in the 
involved region are mostly changing, one can study them by the use of ultrasound. 

Baxt [60] used an NN to identify MI in patients presented to an emergency department with anterior 
chest pain. An NN was trained on clinical pattern sets retrospectively derived from patients hospitalized 
with a high likelihood of having MI, and the ability of the NN was compared with that of physicians 
caring for the same patients. The network performed with a sensitivity of 92% and a specificity of 96%. 
These figures were better than the physicians, with a sensitivity of 88% and a specificity of 71%, or any 
other computer technology (88% sensitivity, 74% specificity) [61]. 

Diagnosis of inferior MI with NNs was studied by Heden et al. [62] with sensitivity of 84% and a 
specificity of 97%, findings that are similar to those of Pahlm et al. [63]. 

Yi et al. [64] used intensity changes in an echocardiogram to detect MI. Once an echocardiogram is 
obtained, it is digitized and saved in a file as a gray-scale image (512 x 400) (Fig. 58.3). A window of the 
region of interest is then selected between the systole and diastole of the cardiac cycle and saved in a 
different file. The window can be either of a constant size or it can be adaptive (varying) in size. This 
new file can be enlarged (zoom) for examination of finer details in the image. All image artifacts are 
filtered out, and contrast enhancement is performed. This new image is then saved and serves as input 
to the NN. A traditional three-layer NN with 300 input nodes (one node for each pixel intensity in the 
input file), a varying number of hidden nodes, and two output nodes were used. The output node 
indicates the status of the patient under testing. A "one" indicates normal and a "zero" an abnormal case. 

The weights of the connections were calculated using the optimization algorithms of ALOPEX. One 
sub-ALOPEX was used for the weights between hidden nodes and input nodes, and a second sub-ALOPEX 
was used for those from the output to the hidden layer. 

The network was trained with a population of 256 patients, some with scars and some normal. These 
patients were used to obtain "templates" for each category. These templates were then used for comparison 
with the test images. None of these testing images was included in the training set. The cost function 
used for the process was the least-squares rule, which, although slow, produces reliable results. 

A similar process is used for the output layer. The noise was made adaptable. The intensities of the 
images are normalized before being submitted to the NN. A cutoff of 0.2 was used. Therefore, anything 
above 0.8 is normal, below 0.2 is scar, and all the in-between values are classified as unknown. Due to 
the fact that the scar training set was very small compared with the normals, a better classification of 
normals than scars was observed. A study was also made as to how the number of hidden nodes influences 
the results for the same standard deviation of noise. 
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In another study, Kostis et al. [65] used NNs in estimating the prognosis of acute MI. Patients who 
survive the acute phase of an MI have an increased risk of mortality persisting up to 10 years or more. 
Estimation of the probability of being alive at a given time in the future is important to the patients and 
their physicians and is usually ascertained by the use of statistical methods. The purpose of the investi- 
gation was to use an NN to estimate future mortality of patients with acute MI. The existence of a large 
database (Myocardial Infarction Data Acquisition Systems, or MIDAS) that includes MI occurring in the 
state of New Jersey and has long-term follow-up allows the development and testing of such a computer 
algorithm. Since the information included in the database does not allow the exact prediction of vital 
status (dead or alive) in all patients with 100% accuracy, the NN should be able to categorize patients 
according to the probability of dying within a given period of time. 

Since information included in the database is not sufficient to allow the exact prediction of vital status 
(dead or alive) in all patients with 100% accuracy, we developed an NN able to categorize patients 
according to the probability of dying within a given period of time rather than predicting categorically 
whether a patient will be dead or alive at a given time in the future. It was observed that there were many 
instances where two or more patients had identical input characteristics while some were dead and some 
alive at the end of the study period. For this reason, it is difficult to train a standard NN. Since there is 
no unique output value for all input cases, the network had difficulty converging to a unique set of 
solutions. To alleviate this problem, a conflict-resolution algorithm was developed. The algorithm takes 
templates with identical input vectors and averages each of their input characteristics to produce a single 
case. Their output values of vital status are averaged, producing, in effect, a percentage probability of 
mortality for the particular set of input characteristics. As each new subject template is read into the 
program, its input characteristics are compared with those of all previous templates. If no match is found, 
its input values and corresponding output value are accepted. If a match is found, the output value is 
brought together with the stored output value (percentage probability of mortality and the number of 
templates on which it is based), and a new output value is calculated, representing the percentage average 
mortality of the entire characteristic group. Since each member of the group is an identical input case, 
no input characteristic averaging is necessary, thus preserving the statistical significance of the average 
mortality with respect to that exact case. 

This new algorithm, using the two-hidden-layer perceptron optimized by ALOPEX, has converged to 
greater than 98% using several thousand input cases. In addition, 10 output nodes were used in the final 
layer, each corresponding to a range of percent chance or mortality (e.g., node 1: 0% to 10%; node 2: 
10% to 20%; etc.). The outputs of the network are designed to maximize one of the 10 potential output 
"binds," each corresponding to a decile of mortality between 0% and 100%. The output node containing 
the correct probability value was set to a value of 1.0; the others to 0.0. In this manner, the network 
should be able to provide percentage probability of mortality and also to resolve input-case conflicts. An 
SAS program was written to present the predicted probability of mortality separately in patients who are 
dead or alive at the end of the follow-up period. The NNs constructed as described above were able to 
be trained and evaluated according to several definitions of network response: matching the output value 
at every output node, identifying the correct location of which node was to contain the peak output 
value, and matching the output value at the peak output location. 

The network was tested on known and then on unknown cases. A correspondence of the observed to 
the predicted probability of being alive at a given time was observed. The categorical classifications (dead 
or alive) yielded an overall accuracy of 74%. A reciprocal relationship between sensitivity and specificity 
of the rules for determination of vital status was observed. 

58.4 Neural Networks in Neurology 

NNs found applications in neurology as well and, in particular, in characterizing memory defects, as are 
apparent in diseases such as Parkinson's and Alzheimer's. Both diseases exhibit devastating effects and 
disrupt the lives of those affected. 
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For several decades, in an attempt to further understand brain functions, computational neuroscientists 
have addressed the issue by modeling biologic neural network structure with computer simulations. A 
recent article by Stern et al. [66] reports on an important relationship of Alzheimer's disease expression 
and levels of education. A similar inverse relationship was earlier reported by Zhang et al. [67] in a 
Chinese population. This inverse relationship was attributed to a protective role of the brain reserve 
capacity [68]. Such a capacity becomes important, since in an educated person's brain more synapses 
exist, which might protect the individual in the expression of symptoms of the disease. It is not argued, 
however, that the disease will not be acquired; rather, that it will be delayed. Zahner et al. [69] have 
employed a three-layer feedforward NN trained with ALOPEX to simulate the effects of education in 
dementia, age-related changes, and in general, brain damage. Our results show that the higher the level 
of training of the NN, 50%, 60%, 70%, 80%, etc., the slower is the damage on the "brain." Damage was 
simulated by systematically adding noise on the weights of the network. Noise had a gaussian distribution 
with varying standard deviations and mean of zero. Figure 58.4 shows the results of these simulations 
as recognition rate versus standard deviation of noise added to the inputs for increasingly damaged 
weights at various levels of training. Each point corresponds to an average of 100 samples. Notice how 
much slower the drop in recognition is for the 70% and 80% curves as compared with the 50% and 60% 
training. Also notice the distances of the starting points of the curves as we follow the progress of dementia 
at different stages (Fig. 58.4, a versus b versus c). All curves demonstrate impairment with damage, but 
they also show some threshold in the training level, after which the system becomes more robust. 

58.5 Discussion 

Neural networks provide a powerful tool for analysis of biosignals. This chapter has reviewed applications 
of NNs in cardiology, neurology, speech processing, and brain waveforms. The literature is vast, and this 
chapter is by no means exhaustive. In the last 10 years, NNs have been used more and more extensively 
in a variety of fields, and biomedical engineering is not short of it. Besides the applications, a lot of 
research is still going on in order to find optimal algorithms and optimal values for the parameters used 
in these algorithms. In industry, an explosion of VLSI chip designs on NNs has been observed. The 
parallel character of NNs makes them very desirable solutions to computational bottlenecks. 
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FIGURE 58.4 Recognition rate versus standardization of noise added to the inputs. Different curves correspond 
to various learning levels with damaged weights, (a) Noise added only to the inputs, (b) Noise added to the weights 
to mimic "brain damage," a = 0.05. (c) Noise on the weights with a = 0.1. Notice how much more robust the "brain" 
is to noise with higher levels of education. 
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Complexity, a contemporary theme embraced by physical as well as social sciences, is concerned with the 
collective behavior observed in composite systems in which long-range order is induced by short-range 
interactions of the constituent parts. Complex forms and functions abound in nature. Particularly in 
biology and physiology, branched, nested, granular, or otherwise richly packed, irregular, or disordered 
objects are the rule rather than the exception. Similarly ubiquitous are distributed, broad-band phenom- 
ena that appear to fluctuate randomly. The rising science of complexity holds the promise to lead to 
powerful tools to analyze, model, process, and control the global behavior of complex biomedical systems. 

The basic tenets of the complexity theory rest on the revelation that large classes of complex systems 
(composed of a multitude of richly interacting components) are reducible to simple rules. In particular, 
the structure and dynamics of complex systems invariably exist or evolve over a multitude of spatial and 
temporal scales. Moreover, they exhibit a systematic relationship between scales. From the biomedical 
engineering standpoint, the worthwhile outcome is the ability to characterize these intricate objects and 
processes in terms of straightforward scaling and fractal concepts and measures that often can be 
translated into simple iterative rules. In this sense, the set of concepts and tools, emerging under the 
rubric of complexity, complements the prediction made by the chaos theory that simple (low-order 
deterministic) systems may generate complex behavior. 

In their many incarnations, the concepts of complexity and scaling are playing a refreshingly unifying 
role among diverse scientific pursuits; therein lie compelling opportunities for scientific discoveries and 
technical innovations. Since these advances span a host of disciplines, hence different scientific languages, 
cultures, and dissemination media, finding one's path has become confusing. One of the aims of this 
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presentation is to serve as a resource for key literature. We hope to guide the reader toward substantial 
contributions and away from figments of fascination in the popular press that have tended to stretch 
emerging concepts ahead of the rigorous examination of evidence and the scientific verification of facts. 
This chapter is organized in three mains parts. The first part is intended to serve as a primer for the 
fundamental aspects of the complexity theory. An overview of the attendant notions of scaling theories 
constitutes the core of the second part. In the third part, we illustrate the potential of the complexity 
approach by presenting an application to predict acceleration-induced loss of consciousness in pilots. 

59.1 Complex Dynamics 

There exists a class of systems in which very complex spatial and temporal behavior is produced through 
the rich interactions among a large number of local subsystems. Complexity theory is concerned with 
systems that have many degrees of freedom (composite systems), are spatially extended (systems with 
both spatial and temporal degrees of freedom), and are dissipative as well as nonlinear due to the interplay 
among local components (agents). In general, such systems exhibit emergent global behavior. This means 
that macroscopic characteristics cannot be deduced from the microscopic characteristics of the elemen- 
tary components considered in isolation. The global behavior emerges from the interactions of the local 
dynamics. 

Complexity theories draw their power from recognition that the behavior of a complex dynamic system 
does not, in general, depend on the physical particulars of the local elements but rather on how they 
interact to collectively (cooperatively or competitively) produce the globally observable behavior. The 
local agents of a complex dynamic system interact with their neighbors through a set of usually (very) 
simple rules. 

The emergent global organization that occurs through the interplay of the local agents arises without 
the intervention of a central controller. That is, there is self-organization, a spontaneous emergence of 
global order. Long-range correlations between local elements are not explicitly defined in such models, 
but they are induced through local interactions. The global organization also may exert a top-down 
influence on the local elements, providing feedback between the macroscopic and microscopic structures 
[Forrest, 1990] (Fig. 59.1). 

Overcoming the Limits of Newtonian Mathematics 

Linearity, as well as the inherent predictive ability, was an important factor in the success of Newtonian 
mechanics. If a linear system is perturbed by a small amount, then the system response will change by 
a proportionally small amount. In nonlinear systems, however, if the system is perturbed by a small 
amount, the response could be no change, a small change, a large change, oscillations (limit cycle), or 
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chaotic behavior. The response depends on the state of the system at the time it was perturbed. Since 
most of nature is nonlinear, the key to success in understanding nature lies in embracing this nonlinearity. 

Another feature found in linear systems is the property of superposition. Superposition means that the 
whole is equal to the sum of the parts. All the properties of a linear system can be understood through 
the analysis of each of its parts. This is not the case for complex systems, where the interaction among 
simple local elements can produce complex emergent global behavior. 

Complexity theory stands in stark contrast to a purely reductionist approach that would seek to explain 
global behavior by breaking down the system into its most elementary components. The reductionist 
approach is not guaranteed to generate knowledge about the behavior of a complex system, since it is 
likely that the information about the local interactions (which determine the global behavior) will not 
be revealed in such an analysis. For example, knowing everything there is to know about a single ant will 
reveal nothing about why an ant colony is capable of such complex behaviors as waging war, farming, 
husbandry, and the ability to quickly adapt to changing environmental conditions. The approach that 
complexity theory proposes is to look at the system as a whole and not merely as a collection of irreducible 
parts. 

Complexity research depends on digital computers for simulation of interactions. Cellular automata 
(one of the principal tools of complexity) have been constructed to model sand piles, earthquakes, traffic 
patterns, satellite communication networks, evolution, molecular autocatalysis, forest fires, and species 
interactions (among others) [Toffoli & Margoulis, 1987]. We note here that complexity is building on, 
and in some cases unifying, developments made in the fields of chaotic dynamics [Devaney, 1992], critical 
phenomena, phase transitions, renormalization [Wilson, 1983], percolation [Stauffer & Aharony, 1992], 
neural networks [Harvey, 1994; Simpson, 1990], genetic algorithms [Goldberg, 1989] and artificial life 
[Langton, 1989; Langton etal., 1992]. 

Critical Phenomena: Phase Transitions 

For the purpose of this discussion, a phase transition can be defined as any abrupt change between the 
physical and/or dynamic states of a system. The most familiar examples of phase transitions are between 
the fundamental stages of matter: solid, liquid, gas, and plasma. Phase transitions are also used to define 
other changes in matter, such as changes in the crystalline structure or state of magnetism. There are 
also phase transitions in the dynamics of systems from ordered (fixed-point and limit-cycle stability) to 
disordered (chaos). Determining the state of matter is not always straightforward. Sometimes the apparent 
state of matter changes when the scale of the observation (macroscopic versus microscopic) is changed. 
A critical point is a special case of phase transitions where order and disorder are intermixed at all scales 
[Wilson, 1983]. At criticality, all spatial and temporal features become scale invariant or self-similar. 
Magnetism is a good example of this phenomenon. 

All Illustration of Critical Phenomena: Magnetism 

The atoms of a ferromagnetic substance have more electrons with spins in one direction than in the 
other, resulting in a net magnetic field for the atom as a whole. The individual magnetic fields of the 
atoms tend to line up in one direction, with the result that there is a measurable level of magnetism in 
the material. At a temperature of absolute zero, all the atomic dipoles are perfectly aligned. At normal 
room temperature, however, some of the atoms are not aligned to the global magnetic field due to thermal 
fluctuations. This creates small regions that are nonmagnetic, although the substance is still magnetic. 
Spatial renormalization, or coarse graining, is the process of averaging the microscopic properties of the 
substance over a specified range in order to replace the multiple elements with a single equivalent element. 
If measurements of the magnetic property were taken at a very fine resolution (without renormaliza- 
tion), there would be some measurements that detect small pockets of nonmagnetism, although most 
measurements would indicate that the substance was magnetic. As the scale of the measurements is 
increased, i.e., spatially renormalized, the small pockets of nonmagnetism would be averaged out and 
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would not be measurable. Therefore, measurements at the larger scale would indicate that the substance 
is magnetic, thereby decreasing its apparent temperature and making the apparent magnetic state depen- 
dent on the resolution of the measurements. The situation is similar (but reversed) at high temperatures. 
That is, spatial renormalization results in apparently higher temperatures, since microscopic islands of 
magnetism are missed because of the large areas of disorder in the material. 

At the Curie temperature there is long-range correlation in both the magnetic and nonmagnetic 
regions. The distribution of magnetic and nonmagnetic regions is invariant under the spatial renormal- 
ization transform. These results are independent of the scale at which the measure is taken, and the 
apparent temperature does not change under the renormalization transform. This scale invariance (self- 
similarity) occurs at only three temperatures: absolute zero, infinity, and the Curie temperature. The 
Curie temperature represents a critical point (criticality) in the tuning parameter (temperature) that 
governs the phase transition from a magnetic to a nonmagnetic state [Pietgen & Richter, 1986]. 

A Model for Phase Transitions: Percolation 

A percolation model is created by using a simple regular geometric framework and by establishing simple 
interaction rules among the elements on the grid. Yet these models give rise to very complex structures 
and relationships that can be described by using scaling concepts such as fractals and power laws. A 
percolation model can be constructed on any regular infinite n-dimensional lattice [Stauffer & Aharony, 
1992]. For simplicity, the example discussed here will use a two-dimensional finite square grid. In site 
percolation, each node in the grid has only two states, occupied or vacant. The nodes in the lattice are 
populated-based on a uniform probability distribution, independent of the sate of any other node. The 
probability of a node being occupied is p (and thus the probability of a node being vacant is 1 — p). 
Nodes that are neighbors on the grid link together to form clusters (Fig. 59.2). 
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FIGURE 59.2 A percolation network. 
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Clusters represent connections between nodes in the lattice. Anything associated with the cluster can 
therefore travel (flow) to any node that belongs to the cluster. Percolation can describe the ability of 
water to flow through a porous medium such as igneous rock, oil fields, or finely ground Colombian 
coffee. As the occupation probability increases, the clusters of the percolation network grow from local 
connectedness to global connectedness [Feder, 1988]. At the critical occupation probability, a cluster that 
spans the entire lattice emerges. It is easy to see how percolation could be used to describe such phenomena 
as phase transitions by viewing occupied nodes as ordered matter, with vacant nodes representing 
disordered matter. Percolation networks have been used to model magnetism, forest fires, and the 
permeability of ion channels in cell membranes. 

Self- Organized Criticality 

The concept of self-organized criticality has been introduced as a possible underlying principle of 
complexity [Bak et al., 1988; Bak & Chen, 1991]. The class of self-organized critical systems is spatially 
extended, composite, and dissipative with many locally interacting degrees of freedom. These systems 
have the capability to naturally evolve (i.e., there is no explicit tuning parameter such as temperature or 
pressure) toward a critical state. 

Self-organized criticality is best illustrated by a sand pile. Start with a flat plate. Begin to add sand one 
grain at a time. The mound will continue to grow until criticality is reached. This criticality is dependent 
only on the local interactions among the grains of sand. The local slope determines what will happen if 
another grain of sand is added. If the local slope is below the criticality (i.e., flat) the new grain of sand 
will stay put and increase the local slope. If the local slope is at the criticality, then adding the new grain 
of sand will increase the slope beyond the criticality, causing it to collapse. The collapsing grains of sand 
spread to adjoining areas. If those areas are at the criticality, then the avalanche will continue until local 
areas with slopes below the criticality are reached. Long-range correlations (up to the length of the sand 
pile) may emerge from the interactions of the local elements. Small avalanches are very common, while 
large avalanches are rare. The size (and duration) of the avalanche plotted against the frequency of 
occurrence of the avalanche can be described by a power law [Bak et al., 1988]. The sand pile seeks the 
criticality on its own. The slope in the sand pile will remain constant regardless of even the largest 
avalanches. These same power laws are observed in traffic patterns, earthquakes, and many other complex 
phenomena. 

Dynamics at the Edge of Chaos 

The dynamics of systems can be divided into several categories. Dynamic systems that exhibit a fixed- 
point stability will return to their initial state after being perturbed. A periodic evolution of states will 
result from a system that exhibits a limit-cycle stability. Either of these systems may display a transient 
evolution of states before the stable regions are reached. Dynamic systems also may exhibit chaotic 
behavior. The evolution of states associated with chaotic behavior is aperiodic, well-bounded, and very 
sensitive to initial conditions and resembles noise but is completely deterministic [Tsonis & Tsonis, 1989] . 

The criticality that lies between highly ordered and highly disordered dynamics has been referred to 
as the edge of chaos [Langton, 1990] and is analogous to a phase transition between states of matter, 
where the highly ordered system can be thought of as a solid and the highly disordered system a liquid. 

The edge of chaos is the critical boundary between order and chaos. If the system dynamics are stagnant 
(fixed-point stability, highly ordered system), then there is no mechanism for change. The system cannot 
adapt and evolve because new states cannot be encoded into the system. If the system dynamics are 
chaotic (highly disordered), then the system is in a constant state of flux, and there is no memory, no 
learning, and no adaptation (some of the main qualities associated with life). Systems may exhibit 
transients in the evolution of states before settling down into either fixed-point or limit-cycle behavior. 
As the dynamics of a complex system enter the edge of chaos region, the length of these transients quickly 
grows. The chaotic region is where the length of the "transient" is infinite. At the edge of chaos (the 
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dynamic phase transition) there is no characteristic scale due to the emergence of arbitrarily long 
correlation lengths in space and time [Langton, 1990]. The self-organized criticality in the sand piles of 
Per Bak is an example of a system that exists at the edge of chaos. It is in this region that there is no 
characteristic space or time scale. A single grain of sand added to the pile could cause an avalanche that 
consists of two grains of sand, or it could cause an avalanche that spreads over the entire surface of the 
sand pile. 

59.2 Introduction to Scaling Theories 

Prior to the rise of complexity theories, the existence of a systematic relationship between scales eluded 
the mainstream sciences. As a consequence, natural structures and dynamics have been commonly 
dismissed as too irregular and complex and often rejected as monstrous formations, intractable noise, 
or artifacts. The advent of scaling concepts [Mandelbrot, 1983] has uncovered a remarkable hierarchical 
order that persists over a significant number of spatial or temporal scales. 

Scaling theories capitalize on scale-invariant symmetries exhibited by many natural broadband (i.e., 
multiscale) phenomena. According to the theory of self-organized criticality (see Section 59. 1 ), this scaling 
order is a manifestation of dilation (compression) symmetries that define the organization inherent to 
complex systems which naturally evolve toward a critical state while dissipating energies on broad ranges 
of space and time scales. Long overlooked, this symmetry is now added to the repertoire of mathematical 
modeling concepts, which had included approaches based largely on displacement invariances under 
translation and/or rotation. 

Many natural forms and functions maintain some form of exact or statistical invariance under trans- 
formations of scale and thus belong in the scaling category. Objects and processes that remain invariant 
under ordinary geometric similarity constitute the self-similar subset in this class. 

Methods to capture scaling information in the form of simple rules that relate features on different 
scales are actively developed in many scientific fields [Barnsley, 1993]. Engineers are coping with scaling 
nature of forms and functions by investigating multiscale system theory [Basseville et al., 1992], multi- 
resolution and multirate signal processing [ Akansu & Hadad, 1992; Vaidyanathan, 1993], subband coding, 
wavelets, and filter banks [Meyer, 1993], and fractal compression [Barnsley & Hurd, 1993]. 

These emerging tools empower engineers to reexamine old data and to re-formulate the question at 
the root of many unresolved inverse problems — What can small patterns say about large patterns, and 
vice versa? They also offer the possibility to establish cause-effect relationships between a given physical 
(spatial) medium and the monitored dynamic (temporal) behavior that constitutes the primary preoc- 
cupation of diagnostic scientists. 

Fractal Preliminaries 

In the broadest sense, the noun or adjective fractal refers to physical objects or dynamic processes that 
reveal new details on space or time magnification. A staple of a truly fractal object or process is therefore 
the lack of characteristic scale in time or space. Most structures in nature are broadband over a finite 
range, covering at least a number of frequency decades in space or time. Scaling fractals often consist of 
a hierarchy or heterarchy of spatial or temporal structures in cascade and are often accomplished through 
recursive replication of patterns at finer scales. If the replication rule preserves scale invariance throughout 
the entity, such fractals are recognized as self-similar in either an exact or a statistical sense. 

A prominent feature of fractals is their ability to pack structure with economy of resources, whether 
energy, space, or whatever other real estate. Fitting nearly infinite networks into finite spaces is just one 
such achievement. These types of fractals are pervasive in physiology, i.e., the branching patterns of the 
bronchi, the cardiovascular tree, and the nervous tissue [West and Goldberger, 1987], which have the 
additional feature of being "fault tolerant" [West, 1990]. 

Despite expectations heightened by the colorful publicity campaign mounted by promoters of fractal 
concepts, it is advisable to view fractals only as a starting approximation in analyzing scaling shapes and 
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fluctuations in nature. Fractal concepts are usually descriptive at a phenomenologic level without pretense 
to reveal the exact nature of the underlying elementary processes. They do not offer, for that matter, 
conclusive evidence of whatever particular collective, coupled, or isolated repetitive mechanism that 
created the fractal object. 

In many situations, the power of invoking fractal concepts resides in the fact that they bring the logic 
of constraints, whether in the form of asymmetry of motion caused by defects, traps, energy barriers, 
residual memories, irreversibility, or any other appropriate interaction or coupling mechanisms that 
hinder free random behavior. As discussed earlier, the spontaneous or forced organization and the ensuing 
divergence in correlations and coherences that emerges out of random behavior are presumably respon- 
sible for the irregular structures pervasive throughout the physical world. 

More important, the versatility of fractal concepts as a magnifying tool is rooted in the facility to 
account for scale hierarchies and/or scale invariances in an exact or statistical sense. In the role of a scale 
microscope, they suggest a fresh look, with due respect to all scales of significance, at many structural 
and dynamic problems deemed thus far anomalous or insoluble. 

Mathematical and Natural Fractals 

The history of mathematics is rife with "pathologic" constructions of the iterated kind which defy the 
euclidian dimension concepts. The collection once included an assortment of anomalous dust sets, lines, 
surfaces, volumes, and other mathematical miscellenia mostly born out of the continuous yet nondiffer- 
entiable category of functions such as the Weierstrass series. 

The feature unifying these mathematical creations with natural fractals is the fractional or integer 
dimensions distinct from the euclidian definition. Simply stated, a fractional dimension positions an 
object in between two integer dimensions in the euclidian sense, best articulated by the critical dimension 
in the Hausdorff-Besicovith derivation [Feder, 1988]. When this notion of dimension is pursued to the 
extreme and the dimension reaches an integer value, one is confronted with the counterintuitive reality 
of space-filling curves, volume-filling planes, etc. These objects can be seen readily to share intrinsic 
scaling properties with the nearly infinite networks accomplished by the branching patterns of bronchi 
and blood vessels and the intricate folding of the cortex. 

A rewarding outcome afforded by the advent of scaling concepts is the ability to characterize such 
structures in terms of straightforward "scaling" or "dimension" measures. From these, simple iterative 
rules may be deduced to yield models with maximum economy (or minimum number) or parameters 
[Barnsley, 1993]. This principle is suspected to underlie the succinct, coding adopted by nature in order 
to store extensive information needed to create complex shapes and forms. 

Fractal Measures 

The measure most often used in the diagnosis of a fractal is the basic fractal dimension, which, in the 
true spirit of fractals, has eluded a rigorous definition embracing the entire family of fractal objects. The 
guiding factor in the choice of the appropriate measures is the recognition that most fractal objects scale 
self-similarly; in other words, they can be characterized by a measure expressed in the form of a power 
factor, or scaling exponent d, that links the change in the observed dependent quantity V to the inde- 
pendent variable x as V(x) « X s [Falconer, 1990, p 36]. Clearly, d is proportional to the ratio of the 
logarithm of V{x) and x, i.e., d = log V(x)/log x. In the case of fractal objects, d is the scaling exponent 
in the fractal sense and may have a fractional value. In the final analysis, most scaling relationships can 
be cast into some form of a logarithmic dependence on the independent variable with respect to which 
a scaling property is analyzed, the latter also expressed on the logarithmic scale. A number of dimension 
formulas have been developed based on this observation, and comprehensive compilations are now 
available [Falconer, 1990; Feder, 1988]. 

One approach to formalize the concept of scale invariance utilizes the homogeneity or the renormal- 
ization principle given by /(u) = f{a\x)/b, where a and b are constants and u is the independent variable 
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[West & Goldberger, 1987]. The function /that satisfies this relationship is referred as a scaling function. 
The power-law function /(u.) a u^ is a prominent example in this category provided [3 = log fo/log a. The 
usefulness of this particular scaling function has been proven many times over in many areas of science, 
including the thermodynamics of phase transitions and the threshold behavior of percolation networks 
[Schroeder, 1991; Stauffer & Aharony, 1992; Wilson, 1983]. 

Power Law and 1// Processes 

The revived interest in power-law behavior largely stems from the recognition that a large class of noisy 
signals exhibits spectra that attenuate with a fractional power dependence on frequency [West & 
Shlesinger, 1989; Wornell, 1993]. Such behavior is often viewed as a manifestation of the interplay of a 
multitude of local processes evolving on a spectrum of time scales that collectively give rise to the so- 
called 1//P or, more generally, the l//-type behavior. As in the case of spatial fractals that lack a charac- 
teristic length scale, l//processes such as the fractional brownian motion cannot be described adequately 
within the confines of a "characteristic" time scale and hence exhibit the "fractal time" property [Man- 
delbrot, 1967]. 

Distributed Relaxation Processes 

Since the later part of the nineteenth century, the fractional power function dependence of the frequency 
spectrum also has been recognized as a macroscopic dynamic property manifested by strongly interacting 
dielectric, viscoelastic, and magnetic materials and interfaces between different conducting materials 
[Daniel, 1967]. More recently, the l//-type dynamic behavior has been observed in percolating networks 
composed of random mixtures of conductors and insulators and layered wave propagation in heteroge- 
neous media [Orbach, 1986] . In immittance (impedance or admittance) studies, this frequency dispersion 
has been analyzed conventionally to distinguish a broad class of the so-called anomalous, i.e., nonexpo- 
nential, relaxation/dispersion systems from those which can be described by the "ideal" single exponential 
form due to Debye [Daniel, 1967]. 

The fractal time or the multiplicity of times scales prevalent in distributed relaxation systems necessarily 
translates into fractional constitutive models amenable to analysis by fractional calculus [Ross, 1977] and 
fractional state-space methods [Bagley & Calico, 1991]. This corresponds to logarithmic distribution 
functions ranging in symmetry from the log-normal with even center symmetry at one extreme to single- 
sided hyperbolic distributions with diverging moments at the other. The realization that systems that do 
not possess a characteristic time can be described in terms of distributions renewed the interest in the 
field of dispersion/relaxation analysis. Logarithmic distribution functions have been used conventionally 
as means to characterize such complexity [West, 1994]. 

Multifractals 

Fractal objects and processes in nature are rarely strictly homogeneous in their scaling properties and 
often display a distribution of scaling exponents that echos the structural heterogeneities occurring at a 
myriad of length or time scales. In systems with spectra that attenuate following a pure power law over 
extended frequency scales, as in the case of Davidson-Cole dispersion [Daniel, 1967], the corresponding 
distribution of relaxation times is logarithmic and single-tailed. In many natural relaxation systems, 
however, the spectral dimension exhibits a gradual dependence on frequency, as in phenomena conven- 
tionally modeled by the Cole-Cole type dispersion. The equivalent distribution functions exhibit double- 
sided symmetries on the logarithmic relaxation time scale ranging from the even symmetry of the log- 
normal through intermediate symmetries down to strictly one-sided functions. 

The concept that a fractal structure can be composed of fractal subsets with uniform scaling property 
within the subset has gained popularity in recent years [Feder, 1988]. From this perspective, one may 
view a complicated fractal object, say, the strange attractor of a chaotic process, as a superposition of 
simple fractal subsystems. The idea has been formalized under the term multifractal. It follows that 



) 2000 by CRC Press LLC 



each individual member contributes to the overall scaling behavior according to a spectrum of scaling 
exponents or dimensions. The latter function is called the multifractal spectrum and summarizes the 
global scaling information of the complete set. 

59.3 An Example of the Use of Complexity Theory in the 

Development of a Model of the Central Nervous System 

Consciousness can be viewed as an emergent behavior arising from the interactions among a very large 
number of local agents, which, in this case, range from electrons through neurons and glial cells to 
networks of neurons. The hierarchical organization of the brain [Churchland & Sejnowski, 1992; Newell, 
1990], which exists and evolves on a multitude of spatial and temporal scales, is a good example of the 
scaling characteristics found in many complex dynamic systems. There is no master controller for this 
emergent behavior, which results from the intricate interactions among a very large number of local 
agents. 

A model that duplicates the global dynamics of the induction of unconsciousness in humans due to 
cerebral ischemia produced by linear acceleration stress (G-LOC) was constructed using some of the 
tenets of complexity [Cammarota, 1994]. It was an attempt to provide a theory that could both replicate 
historical human acceleration tolerance data and present a possible underlying mechanism. The model 
coupled the realization that an abrupt loss of consciousness could be thought of as a phase transition 
from consciousness to unconsciousness with the proposed neurophysiologic theory of G-LOC [Whinnery, 
1989]. This phase transition was modeled using a percolation network to evaluate the connectivity of 
neural pathways within the central nervous system. 

In order to construct the model, several hypotheses had to be formulated to account for the unob- 
servable interplay among the local elements of the central nervous system. The inspiration for the 
characteristics of the locally interacting elements (the nodes of the percolation lattice) was provided by 
the physiologic mechanism of arousal (the all-or-nothing aspect of consciousness), the utilization of 
oxygen in neural tissue during ischemia, and the response of neural cells to metabolic threats. The 
neurophysiologic theory of acceleration tolerance views unconsciousness as an active protective mecha- 
nism that is triggered by a metabolic threat which in this case is acceleration-induced ischemia. The 
interplay among the local systems is determined by using a percolation network that models the con- 
nectivity of the arousal mechanism (the reticular activating system). When normal neuronal function is 
suppressed due to local cerebral ischemia, the corresponding node is removed from the percolation 
network. The configuration of the percolation network varies as a function of time. When the network 
is no longer able to support arousal, unconsciousness results. 

The model simulated a wide range of human data with a high degree of fidelity. It duplicated the 
population response (measured as the time it took to lose consciousness) over a range of stresses that 
varied from a simulation of the acute arrest of cerebral circulation to a gradual application of acceleration 
stress. Moreover, the model was able to offer a possible unified explanation for apparently contradictory 
historical data. An analysis of the parameters responsible for the determination of the time of LOC 
indicated that there is a phase transition in the dynamics that was not explicitly incorporated into the 
construction of the model. The model spontaneously captured an interplay of the cardiovascular and 
neurologic systems that could not have been predicted based on existing data. 

The keys to the model's success are the reasonable assumptions that were made about the characteristics 
and interaction of the local dynamic subsystems through the integration of a wide range of human and 
animal physiologic data in the design of the model. None of the local parameters was explicitly tuned to 
produce the global (input-output) behavior. By successfully duplicating the observed global behavior of 
humans under acceleration stress, however, this model provided insight into some (currently) unobserv- 
able inner dynamics of the central nervous system. Furthermore, the model suggests new experimental 
protocols specifically aimed at exploring further the microscopic interplay responsible for the macroscopic 
(observable) behavior. 
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Defining Terms 

l//process: Signals or systems that exhibit spectra which attenuate following a fractional power depen- 
dence on frequency. 

Cellular automata: Composite discrete-time and discrete space dynamic systems defined on a regular 
lattice. Neighborhood rules determine the state transitions of the individual local elements (cells). 

Chaos: A state the produces a signal that resembles noise and is aperiodic, well-bounded, and very 
sensitive to initial conditions but is governed by a low-order deterministic differential or difference 
equation. 

Complexity: Complexity theory is concerned with systems that have many degrees of freedom (com- 
posite systems), are spatially extended (systems with both spatial and temporal degrees of freedom), 
and are dissipative as well as nonlinear due to the rich interactions among the local components 
(agents). Some of the terms associated with such systems are emergent global behavior, collective 
behavior, cooperative behavior, self-organization, critical phenomena, and scale invariance. 

Criticality: A state of a system where spatial and/or temporal characteristics are scale invariant. 

Emergent global behavior: The observable behavior of a system that cannot be deduced from the 
properties of constituent components considered in isolation and results from the collective (coop- 
erative or competitive) evolution of local events. 

Fractal: Refers to physical objects or dynamic processes that reveal new details on space or time 
magnification. Fractals lack a characteristic scale. 

Fractional brownian motion: A generalization of the random function created by the record of the 
motion of a "brownian particle" executing random walk. Brownian motion is commonly used to 
model diffusion in constraint-free media. Fractional brownian motion is often used to model 
diffusion of particles in constrained environments or anomalous diffusion. 

Percolation: A simple mathematical construct commonly used to measure the extent of connectedness 
in a partially occupied (site percolation) or connected (bond percolation) lattice structure. 

Phase transition: Any abrupt change between the physical and/or the dynamic states of a system, 
usually between ordered and disordered organization or behavior. 

Renormalization: Changing the characteristic scale of a measurement though a process of systematic 
averaging applied to the microscopic elements of a system (also referred to as coarse graining). 

Scaling: Structures or dynamics that maintain some form of exact or statistical invariance under 
transformations of scale. 

Self-organization: The spontaneous emergence of order. This occurs without the direction of a global 
controller. 

Self-similarity: A subset of objects and processes in the scaling category that remain invariant under 
ordinary geometric similarity. 
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The long anticipated "information age" is taking shape at the cross-section of multimedia signal pro- 
cessing and telecommunications-based networking. By defying the traditional concepts of space and time, 
these emerging technologies promise to affect all facets of our lives in a pervasive and profound manner 
[Mayo, 1992]. The physical constraints of location have naturally led, over the centuries, to the creation 
of the conventional patient care services and facilities. As the "information superhighway" is laid down 
with branches spanning the nation, and eventually the world via wired and wireless communication 
channels, we will come closer to a bold new era in health care delivery, namely, the era of remote 
monitoring, diagnosis, and intervention. 

Forward-looking medical industries are engaging in research and development efforts to capitalize on 
the emerging technologies. Medical institutions in particular recognize the transforming power of the 
impending revolution. A number of hospitals are undertaking pilot projects to experiment with the 
potentials of the new communication and interaction media that will constitute the foundations of 
futuristic health care systems. There is consensus among health care administrators that the agility and 
effectiveness with which an institution positions itself to fully embrace the new medical lifestyle will 
decide its viability in the next millennium. 

Although multimedia communications is yet in its infancy, recent developments foretell a bright future. 
Many agree that multimedia networking is becoming a reality thanks to advances in digital signal- 
processing research and development. Trends toward implementation of algorithms by fewer components 
are leading to decreasing hardware complexity while increasing processing functionality [Andrews, 1 994] . 
The vast and vibrant industry producing multimedia hardware and software ranging from application- 
specific digital signal processors and video chip sets to videophones and multimedia terminals heavily 
relies on digital signal processing know-how. 
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As in the case of generic digital signal processing, biomedical signal processing is expected to play a 
key role in mainstreaming patient care at a distance. Earlier in this section, emerging methods in 
biomedical signal analysis that promise major enhancements in our ability to extract information from 
vital signals were introduced. This chapter provides a glimpse of the future — when biomedical signals 
will be integrated with other patient information and transmitted via networked multimedia — by exam- 
ining trends in key communications technologies, namely, public switched-network protocols, wireless 
communications, photonics, and virtual reality. 

60.1 Public Switched Network and ATM 

The public switched network already can accommodate a wide array of networked multimedia commu- 
nications. The introduction of new standards such as the asynchronous transfer mode (ATM) is a strong 
sign that the network will evolve to handle an array of novel communication services. 

ATM is a technology based on a switched network that uses dedicated media connections [ATM 
Networking, 1994] . Each connection between users is physically established by setting up a path or virtual 
channel through a series of integrated circuits in the switch. In conventional shared media networks, 
connections are made by breaking the information into packets labeled with their destination; these 
packets then share bandwidth until they reach their destination. In switched networks, instead of sharing 
bandwidth, connections can be run in parallel, since each connection has its own pathway. This approach 
prevents degradation of the response time despite an increased number of users who are running intensive 
applications on the network, such as videoconferencing. Therefore, ATM offers consistently high perfor- 
mance to all users on the network, particularly in real-time networked multimedia applications which 
often encounter severe response time degradation on shared media networks. Also, the ATM standards 
for both local area networks (LANs) and wide area networks (WANs) are the same; this allows for seamless 
integration of LANs and WANs. 

Small-scale experiments based on ATM-based multimedia communications have already been 
launched in a number of medical centers. Early examples involve departments where doctors and staff 
can remotely work on chest x-rays, CT exams, and MRI images around the hospital over an ATM switched 
network. Since ATM integrates LANs and WANs, collaborating institutions will be able to access the same 
information in the future. Similar patient multimedia information-sharing efforts are underway which 
integrate all vital information including physiologic signals and sounds, images, and video and patient 
data and make them available remotely. In some recent experiments, the access is accomplished in real 
time such that medical conferencing, and hence remote diagnosis, becomes a possibility. 

60.2 Wireless Communication 

Wireless communication is the fastest growing sector of the telecommunications industry [Wittman, 
1994]. Wireless networking technology is rapidly coming of age with the recent passage of initial stan- 
dards, widespread performance improvements, and the introduction of personal communications net- 
works (PCNs). Pocket-sized portable "smart" terminals are combined with wireless communication to 
free users from the constraints of wired connection to networks. The progress in this direction is closely 
monitored by the health care community because the technology holds the potential to liberate ambu- 
latory patients who require long-term monitoring and processing of biomedical signals for timely inter- 
vention. Wireless and interactive access by medical personnel to physiologic multimedia information will 
no doubt be a staple of the future distributed health care delivery systems. 

60.3 Photonics 

Photonics, or lightwave, is an evolving technology with the capacity to support a wide range of high- 
bandwidth multimedia applications. In current practice, photonics plays a complementary role to 
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electronics in the hybrid technology referred to as electro-optics. Optical fibers are widely used for 
transmission of signals, from long-distance links to undersea cables to local loops linking customers with 
the central switching office. The trend in photonics is to move beyond functions limited to transmission 
toward logic operations. Recent advances in photonic logic devices suggest that optical computers may 
present characteristics more desirable than electronics in many biomedical processing applications requir- 
ing parallel tasks. A case in point is online pattern recognition, which may bring a new dimension to 
remote diagnosis. 

60.4 Virtual Reality 

Virtual reality — the ultimate networked multimedia service — is a simulated environment that enables 
one to remotely experience an event or a place in all dimensions. Virtual reality makes telepresence 
possible. The nascent technology builds on the capabilities of interactive telecommunications and is 
expected to become a consumer reality early in the next century. Applications in the field of endoscopic 
surgery are being developed. Demonstration projects in remote surgery are underway, paving the way 
for remote medical intervention. 
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Defining Terms 

ATM (asynchronous transfer mode): Technology based on a switched network that uses dedicated 
media connections. 

CT: Computer tomography. 

MRI: Magnetic resonance imaging. 

Multimedia: Technology to integrate sights, sounds, and data. The media may include audio signals 
such as speech, music, biomedical sounds, images, animation, and video signals, as well as text, 
graphics, and fax. One key feature is the common linkage, synchronization, and control of the 
various media that contribute to the overall multimedia signal. In general, multimedia services 
and products are interactive, hence real-time, as in the case of collaborative computing and 
videoconferencing. 

Photonics: Switching, computing, and communications technologies based on lightwave. 

Virtual reality: Technology that creates a simulated environment enabling one to remotely experience 
an event or a place in all dimensions. 

Wireless network: Technology based on communication between nodes such as stationary desktops, 
laptops, or personal digital assistants (PDAs) and the LAN hub or access point using a wireless 
adapter with radio circuitry and an antenna. The LAN hub has a LAN attachment on one interface 
and one or more antennas on another. 
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THE FIELD OF MEDICAL IMAGING has experienced phenomenal growth within the last century. 
Whereas imaging was the prerogative of the defense and the space science communities in the 
past, with the advent of powerful, less-expansive computers, new and expanded imaging systems 
have found their way into the medical field. Systems range from those devoted to planar imaging using 
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x-rays to technologies that are just emerging, such as virtual reality. Some of the systems, such as 
ultrasound, are relatively inexpensive, while others, such as positron emission tomography (PET) facil- 
ities, cost millions of dollars for the hardware and the employment of Ph.D.-level personnel to operate 
them. Systems that make use of x-rays have been designed to image anatomic structures, while others 
that make use of radioisotopes provide functional information. The fields of view that can be imaged 
range from the whole body obtained with nuclear medicine bone scans to images of cellular components 
using magnetic resonance (MR) microscopy. The design of transducers for the imaging devices to the 
postprocessing of the data to allow easier interpretation of the images by medical personnel are all aspects 
of the medical imaging devices field. 

Even with the sophisticated systems now available, challenges remain in the medical imaging field. 
With the increasing emphasis on health care costs, and with medical imaging systems often cited as not 
only an example of the investment that health care providers must make and consequently recover, but 
also as a factor contributing to escalating costs, there is increasing emphasis on lowering the costs of new 
systems. Researchers, for example, are trying to find alternatives to the high-cost superconducting mag- 
nets used for magnetic resonance systems. With the decreasing cost of the powerful computers that are 
currently contained within most imaging systems and with the intense competition among imaging 
companies, prices for these systems are bound to fall. Other challenges entail presentation of the imaging 
data. Often multiple modalities are used during a clinical evaluation. If both anatomic and functional 
information are required, methods to combine and present these data for medical interpretation need 
to be achieved. The use of medical image data to more effectively execute surgery is a field that is only 
starting to be explored. How can anatomic data obtained with a tomographic scan be correlated with 
the surgical field, given that movement of tissues and organs occurs during surgery? Virtual reality is 
likely to play an important role in this integration of imaging information with surgery. There also are 
imaging modalities that are just beginning to be intensively explored, such as the detection of impedance 
and magnetic field data or the use of optical sources and detectors. 

Engineers and physical scientists are involved throughout the medical imaging field. They are employed 
by both large and small companies. The names of the medical imaging giants, such as General Electric, 
Siemens, Picker, and Acuson are familiar to most. Small startup companies are prevalent. In addition to 
the medical imaging companies, Ph.D.-trained engineers and scientists are employed by departments of 
engineering, physics, and chemistry in universities and more and more by radiology departments of 
research-oriented medical centers. Whereas only a few years ago researchers working in the medical 
imaging field would submit papers to general scientific journals, such as the IEEE Transactions on 
Biomedical Engineering, now there is a journal, IEEE Transactions on Medical Imaging, devoted to the 
medical imaging field and journals dedicated to certain modalities, such as Magnetic Resonance in 
Medicine and Journal of Computer Assisted Tomography. Large scientific meetings for medical imaging, 
such as the Radiological Society of North America's annual meeting with over 20,000 attendees, are held 
each year. Modality-specific meetings, such as that of the Society for Magnetic Resonance in Imaging, 
have thousands of attendees. 

Although entire books have been written on each of the medical imaging modalities, this section will 
provide an overview of the main medical imaging devices and also highlight a few emerging systems. 
Chapter 61 describes x-ray systems, the backbone of the medical imaging field. X-ray systems are still 
quite important because of their relatively low system acquisition cost, the low cost of the diagnostic 
procedures, and the speed with which results are obtained. Chapter 62 describes computed tomographic 
(CT) systems. This technology became available in the 1970s, with current improvements focused on 
acquisition speed and data presentations. Chapter 63 highlights magnetic resonance imaging (MRI), a 
technology that first became available in the 1980s. The technology is rapidly evolving, with major 
advances recently in the areas of functional and spectroscopic MRI. Nuclear medicine, the subject of 
Chapter 64, covers both planar and single-photon emission computed tomography (SPECT) systems. 
Chapter 65 covers ultrasound, the technology that is widely used for obstetrical imaging and vascular 
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flow evaluation. The latest research on linear and two-dimensional transducers, which will be able to 
provide real-time three-dimensional ultrasound images, is also covered in this chapter. Less prevalent 
technologies are presented in Chapters 66 to 69. These include the field of MR microscopy, which requires 
high-field-strength magnets; position emission tomography (PET), which has a tremendous potential 
for functional imaging; impedance tomography, which is aimed at constructing images based on differ- 
ence in conductivity between different body tissues; and virtual reality, which provides an overview of 
the high-tech field of interactive imaging that is bound to become increasingly important as computing 
power increases. 




VR system used by the disabled. 
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61.1 X-Ray Equipment 

Robert E. Shroy, Jr. 

Conventional x-ray radiography produces images of anatomy that are shadowgrams based on x-ray 
absorption. The x-rays are produced in a region that is nearly a point source and then are directed on 
the anatomy to be imaged. The x-rays emerging from the anatomy are detected to form a two-dimensional 
image, where each point in the image has a brightness related to the intensity of the x-rays at that point. 
Image production relies on the fact that significant numbers of x-rays penetrate through the anatomy 
and that different parts of the anatomy absorb different amounts of x-rays. In cases where the anatomy 
of interest does not absorb x-rays differently from surrounding regions, contrast may be increased by 
introducing strong x-ray absorbers. For example, barium is often used to image the gastrointestinal tract. 

X-rays are electromagnetic waves (like light) having an energy in the general range of approximately 
1 to several hundred kiloelectronvolts (keV). In medical x-ray imaging, the x-ray energy typically lies 
between 5 and 150 keV, with the energy adjusted to the anatomic thickness and the type of study being 
performed. 

X-rays striking an object may either pass through unaffected or may undergo an interaction. These 
interactions usually involve either the photoelectric effect (where the x-ray is absorbed) or scattering 
(where the x-ray is deflected to the side with a loss of some energy). X-rays that have been scattered may 
undergo deflection through a small angle and still reach the image detector; in this case they reduce 
image contrast and thus degrade the image. This degradation can be reduced by the use of an air gap 
between the anatomy and the image receptor or by use of an antiscatter grid. 

Because of health effects, the doses in radiography are kept as low as possible. However, x-ray quantum 
noise becomes more apparent in the image as the dose is lowered. This noise is due to the fact that there 
is an unavoidable random variation in the number of x-rays reaching a point on an image detector. The 
quantum noise depends on the average number of x-rays striking the image detector and is a fundamental 
limit to radiographic image quality. 
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FIGURE 61.1 X-ray tube. 

The equipment of conventional x-ray radiography mostly deals with the creation of a desirable beam 
of x-rays and with the detection of a high-quality image of the transmitted x-rays. These are discussed 
in the following sections. 

Production of X-Rays 
X-Ray Tube 

The standard device for production of x-rays is the rotating anode x-ray tube, as illustrated in Fig. 61.1. 
The x-rays are produced from electrons that have been accelerated in vacuum from the cathode to the 
anode. The electrons are emitted from a filament mounted within a groove in the cathode. Emission 
occurs when the filament is heated by passing a current through it. When the filament is hot enough, 
some electrons obtain a thermal energy sufficient to overcome the energy binding the electron to the 
metal of the filament. Once the electrons have "boiled off" from the filament, they are accelerated by a 
voltage difference applied from the cathode to the anode. This voltage is supplied by a generator (see 
below). 

After the electrons have been accelerated to the anode, they will be stopped in a short distance. Most 
of the electrons' energy is converted into heating of the anode, but a small percentage is converted to 
x-rays by two main methods. One method of x-ray production relies on the fact that deceleration of a 
charged particle results in emission of electromagnetic radiation, called bremmstrahlung radiation. These 
x-rays will have a wide, continuous distribution of energies, with the maximum being the total energy 
the electron had when reaching the anode. The number of x-rays is relatively small at higher energies 
and increases for lower energies. 

A second method of x-ray production occurs when an accelerated electron strikes an atom in the 
anode and removes an inner electron from this atom. The vacant electron orbital will be filled by a 
neighboring electron, and an x-ray may be emitted whose energy matches the energy change of the 
electron. The result is production of large numbers of x-rays at a few discrete energies. Since the energy 
of these characteristic x-rays depends on the material on the surface of the anode, materials are chosen 
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partially to produce x-rays with desired energies. For example, molybdenum is frequently used in anodes 
of mammography x-ray tubes because of its 20-keV characteristic x-rays. 

Low-energy x-rays are undesirable because they increase dose to the patient but do not contribute to 
the final image because they are almost totally absorbed. Therefore, the number of low-energy x-rays is 
usually reduced by use of a layer of absorber that preferentially absorbs them. The extent to which low- 
energy x-rays have been removed can be quantified by the half-value layer of the x-ray beam. 

It is ideal to create x-rays from a point source because any increase in source size will result in blurring 
of the final image. Quantitatively, the effects of the blurring are described by the focal spot's contribution 
to the system modulation transfer function (MTF). The blurring has its main effect on edges and small 
objects, which correspond to the higher frequencies. The effect of this blurring depends on the geometry 
of the imaging and is worse for larger distances between the object and the image receptor (which 
corresponds to larger geometric magnifications). 

To avoid this blurring, the electrons must be focused to strike a small spot of the anode. The focusing is 
achieved by electric fields determined by the exact shape of the cathode. However, there is a limit to the 
size of this focal spot because the anode material will melt if too much power is deposited into too small 
an area. This limit is improved by use of a rotating anode, where the anode target material is rotated about 
a central axis and new (cooler) anode material is constantly being rotated into place at the focal spot. To 
further increase the power limit, the anode is made with an angle surface. This allows the heat to be deposited 
in a relatively large spot while the apparent spot size at the detector will be smaller by a factor of the sine 
of the anode angle. Unfortunately, this angle cannot be made too small because it limits the area that can 
be covered with x-rays. In practice, tubes are usually supplied with two (or more) focal spots of differing 
sizes, allowing choice of a smaller (sharper, lower-power) spot or a larger (more blurry, higher-power) spot. 

The x-ray tube also limits the total number of x-rays that can be used in an exposure because the 
anode will melt if too much total energy is deposited in it. This limit can be increased by using a more 
massive anode. 

Generator 

The voltages and currents in an x-ray tube are supplied by an x-ray generator. This controls the cathode- 
anode voltage, which partially defines the number of x-rays made because the number of x-rays produced 
increases with voltage. The voltage is also chosen to produce x-rays with desired energies: Higher voltages 
makes x-rays that generally are more penetrating but give a lower contrast image. The generator also 
determines the number of x-rays created by controlling the amount of current flowing from the cathode 
to anode and by controlling the length of time this current flows. This points out the two major parameters 
that describe an x-ray exposure: the peak kilovolts (peak kilovolts from the anode to the cathode during 
the exposure) and the milliampere-seconds (the product of the current in milliamperes and the exposure 
time in seconds). 

The peak kilovolts and milliampere-seconds for an exposure may be set manually by an operator based 
on estimates of the anatomy. Some generators use manual entry of kilovolts and milliamperes but 
determine the exposure time automatically. This involves sampling the radiation either before or after 
the image sensor and is referred to as phototiming. 

The anode-cathode voltage (often 15 to 150 kV) can be produced by a transformer that converts 120 or 
220 V ac to higher voltages. This output is then rectified and filtered. Use of three-phase transformers 
gives voltages that are nearly constant versus those from single-phase transformers, thus avoiding low 
kilovoltages that produce undesired low-energy x-rays. In a variation of this method, the transformer 
output can be controlled at a constant voltage by electron tubes. This gives practically constant voltages 
and, further, allows the voltage to be turned on and off so quickly that millisecond exposure times can 
be achieved. In a third approach, an ac input can be rectified and filtered to produce a nearly dc voltage, 
which is then sent to a solid-state inverter that can turn on and off thousands of times a second. This 
higher-frequency ac voltage can be converted more easily to a high voltage by a transformer. Equipment 
operating on this principle is referred to as midfrequency or high-frequency generators. 
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Image Detection: Screen Film Combinations 

Special properties are needed for image detection in radiographic applications, where a few high-quality 
images are made in a study. Because decisions are not immediately made from the images, it is not 
necessary to display them instantly (although it may be desirable). 

The most commonly used method of detecting such a radiographic x-ray image uses light-sensitive 
negative film as a medium. Because high-quality film has a poor response to x-rays, it must be used 
together with x-ray-sensitive screens. Such screens are usually made with CaWo 2 or phosphors using rare 
earth elements such as doped Gd 2 2 S or LaOBr. The film is enclosed in a light-tight cassette in contact 
with an x-ray screen or in between two x-ray screens. When a x-ray image strikes the cassette, the x-rays 
are absorbed by the screens with high efficiency, and their energy is converted to visible light. The light 
then exposes a negative image on the film, which is in close contact with the screen. 

Several properties have been found to be important in describing the relative performance of different 
films. One critical property is the contrast, which describes the amount of additional darkening caused 
by an additional amount of light when working near the center of a film's exposure range. Another 
property, the latitude of a film, describes the film's ability to create a usable image with a wide range in 
input light levels. Generally, latitude and contrast are competing properties, and a film with a large 
latitude will have a low contrast. Additionally, the modulation transfer function (MTF) of a film is an 
important property. MTF is most degraded at higher frequencies; this high-frequency MTF is also 
described by the film's resolution, its ability to image small objects. 

X-ray screens also have several key performance parameters. It is essential that screens detect and use 
a large percentage of the x-rays striking them, which is measured as the screen's quantum detection 
efficiency. Currently used screens may detect 30% of x-rays for images at higher peak kilovolts and as 
much 60% for lower peak kilovolt images. Such efficiencies lead to the use of two screens (one on each 
side of the film) for improved x-ray utilization. As with films, a good high-frequency MTF is needed to 
give good visibility of small structures and edges. Some MTF degradation is associated with blurring that 
occurs when light spreads as it travels through the screen and to the film. This leads to a compromise 
on thickness; screens must be thick enough for good quantum detection efficiency but thin enough to 
avoid excess blurring. 

For a film/screen system, a certain amount of radiation will be required to produce a usable amount 
of film darkening. The ability of the film/screen system to make an image with a small amount of radiation 
is referred to as its speed. The speed depends on a number of parameters: the quantum detection efficiency 
of the screen, the efficiency with which the screen converts x-ray energy to light, the match between the 
color emitted by the screen and the colors to which the film is sensitive, and the amount of film darkening 
for a given amount of light. The number of x-rays used in producing a radiographic image will be chosen 
to give a viewable amount of exposure to the film. Therefore, patient dose will be reduced by the use of 
a high-speed screen/film system. However, high-speed film/screen combinations gives a "noisier" image 
because of the smaller number of x-rays detected in its creation. 

Image Detection: X-Ray Image Intensifies with Televisions 

Although screen-film systems are excellent for radiography, they are not usable for fluoroscopy, where 
lower x-ray levels are produced continuously and many images must be presented almost immediately. 
Fluoroscopic images are not used for diagnosis but rather as an aid in performing tasks such as placement 
of catheters in blood vessels during angiography. For fluoroscopy, x-ray image intensifiers are used in 
conjunction with television cameras. An x-ray image intensifier detects the x-ray image and converts it 
to a small, bright image of visible light. This visible image is then transferred by lenses to a television 
camera for final display on a monitor. 

The basic structure of an x-ray image intensifier is shown in Fig. 61.2. The components are held in a 
vacuum by an enclosure made of glass and/or metal. The x-rays enter through a low- absorption window 
and then strike an input phosphor usually made of doped Csl. As in the x-ray screens described above, 
the x-rays are converted to light in the Csl. On top of the Csl layer is a photoemitter, which absorbs the 
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FIGURE 61.2 X-ray image intensifier. 



light and emits a number of low-energy electrons that initially spread in various directions. The photo- 
electrons are accelerated and steered by a set of grids that have voltages applied to them. The electrons 
strike an output phosphor structure that converts their energy to the final output image made of light. 
This light then travels through an output window to a lens system. The grid voltages serve to add energy 
to the electrons so that the output image is brighter. Grid voltages and shapes are also chosen so that 
the x-ray image is converted to a light image with minimal distortion. Further, the grids must be designed 
to take photoelectrons that are spreading from a point on the photoemitter and focus them back together 
at a point on the output phosphor. 

It is possible to adjust grid voltages on an image intensifier so that it has different fields of coverage. 
Either the whole input area can be imaged on the output phosphor, or smaller parts of the input can be 
imaged on the whole output. Use of smaller parts of the input is advantageous when only smaller parts 
of anatomy need to be imaged with maximum resolution and a large display. For example, an image 
intensifier that could cover a 12-in-diameter input also might be operated so that a 9-in-diameter or 
6-in-diameter input covers all the output phosphor. 

X-ray image intensifiers can be described by a set of performance parameters not unlike those of 
screen/film combinations. It is important that x-rays be detected and used with a high efficiency; current 
image intensifiers have quantum detection efficiencies of 60% to 70% for 59-keV x-rays. As with 
film/screens, a good high-frequency MTF is needed to image small objects and sharp edges without 
blurring. However, low-frequency MTF also must be controlled carefully in image intensifiers, since it 
can be degraded by internal scattering of x-rays, photoelectrons, and light over relatively large distances. 
The amount of intensification depends on brightness and size of the output image for a given x-ray 
input. This is described either by the gain (specified relative to a standard x-ray screen) or by conversion 
efficiency [a light output per radiation input measured in (cd/m 2 )/(mR/min)]. Note that producing a 
smaller output image is as important as making a light image with more photons because the small image 
can be handled more efficiently by the lenses that follow. Especially when imaging the full input area, 
image intensifiers introduce a pincushion distortion into the output image. Thus a square object placed 
off-center will produce an image that is stretched in the direction away from the center. 
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Although an image intensifier output could be viewed directly with a lens system, there is more 
flexibility when the image intensifier is viewed with a television camera and the output is displayed on 
a monitor. Televisions are currently used with pickup tubes and with CCD sensors. 

When a television tube is used, the image is focused on a charged photoconducting material at the 
tube's input. A number of materials are used, including SbS„ PbO, and SeTeAs. The light image discharges 
regions of the photoconductor, converting the image to a charge distribution on the back of the photo- 
conducting layer. Next, the charge distribution is read by scanning a small beam of electrons across the 
surface, which recharges the photoconductor. The recharging current is proportional to the light intensity 
at the point being scanned; this current is amplified and then used to produce an image on a monitor. 
The tube target is generally scanned in an interlaced mode in order to be consistent with broadcast 
television and allow use of standard equipment. 

In fluoroscopy, it is desirable to use the same detected dose for all studies so that the image noise is 
approximately constant. This is usually achieved by monitoring the image brightness in a central part of 
the image intensifier's output, since brightness generally increases with dose. The brightness may be 
monitored by a photomultiplier tube that samples it directly or by analyzing signal levels in the television. 
However, maintaining a constant detected dose would lead to high patient doses in the case of very 
absorptive anatomy. To avoid problems here, systems are generally required by federal regulations to have 
a limit on the maximum patient dose. In those cases where the dose limit prevents the image intensifier 
from receiving the usual dose, the output image becomes darker. To compensate for this, television 
systems are often operated with automatic gain control that gives an image on the monitor of a constant 
brightness no matter what the brightness from the image intensifier. 

Image Detection: Digital Systems 

In both radiography and fluoroscopy, there are advantages to the use of digital images. This allows image 
processing for better displayed images, use of lower doses in some cases, and opens the possibility for digital 
storage with a PACS system or remote image viewing via teleradiology. Additionally, some digital systems 
provide better image quality because of fewer processing steps, lack of distortion, or improved uniformity. 

A common method of digitizing medical x-ray images uses the voltage output from an image-inten- 
sifier/TV system. This voltage can be digitized by an analog-to-digital converter at rates fast enough to 
be used with fluoroscopy as well as radiography. 

Another technology for obtaining digital radiographs involves use of photostimulable phosphors. Here 
the x-rays strike an enclosed sheet of phosphor that stores the x-ray energy. This phorphor can then be 
taken to a read-out unit, where the phosphor surface is scanned by a small light beam of proper 
wavelength. As a point on the surface is read, the stored energy is emitted as visible light, which is then 
detected, amplified, and digitized. Such systems have the advantage that they can be used with existing 
systems designed for screen-film detection because the phosphor sheet package is the same size as that 
for screen films. 

A new method for digital detection involves use of active-matrix thin-film-transistor technology, in 
which an array of small sensors is grown in hydrogenated amorphous silicon. Each sensor element 
includes an electrode for storing charge that is proportional to its x-ray signal. Each electrode is coupled 
to a transistor that either isolates it during acquisition or couples it to digitization circuitry during read- 
out. There are two common methods for introducing the charge signal on each electrode. In one method, 
a layer of x-ray absorber (typically selenium) is deposited on the array of sensors; when this layer is biased 
and x-rays are absorbed there, their energy is converted to electron-hole pairs and the resulting charge 
is collected on the electrode. In the second method, each electrode is part of the photodiode that makes 
electron-hole pairs when exposed to light; the light is produced from x-rays by a layer of scintillator 
(such as Csl) that is deposited on the array. 

Use of a digital system provides several advantages in fluoroscopy. The digital image can be processed 
in real time with edge enhancement, smoothing, or application of a median filter. Also, frame-to-frame 
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averaging can be used to decrease image noise, at the expense of blurring the image of moving objects. 
Further, digital fluoroscopy with TV system allows the TV tube to be scanned in formats that are 
optimized for read- out; the image can still be shown in a different format that is optimized for display. 
Another advantage is that the displayed image is not allowed to go blank when x-ray exposure is ended, 
but a repeated display of the last image is shown. This last-image-hold significantly reduces doses in 
those cases where the radiologist needs to see an image for evaluation, but does not necessarily need a 
continuously updated image. 

The processing of some digital systems also allows the use of pulsed fluoroscopy, where the x-rays are 
produced in a short, intense burst instead of continuously. In this method the pulses of x-rays are made 
either by biasing the x-ray tube filament or by quickly turning on and off the anode-cathode voltage. 
This has the advantage of making sharper images of objects that are moving. Often one x-ray pulse is 
produced for every display frame, but there is also the ability to obtain dose reduction by leaving the 
x-rays off for some frames. With such a reduced exposure rate, doses can be reduced by a factor of two 
or four by only making x-rays every second or fourth frame. For those frames with no x-ray pulse, the 
system repeats a display of the last frame with x-rays. 

Defining Terms 

Antiscatter grid: A thin structure made of alternating strips of lead and material transmissive to x-rays. 

Strips are oriented so that most scattered x-rays go through lead sections and are preferentially 

absorbed, while unscattered x-rays go through transmissive sections. 
Focal spot: The small area on the anode of an x-ray tube from where x-rays are emitted. It is the place 

where the accelerated electron beam is focused. 
Half-value layer (HVL): The thickness of a material (often aluminum) needed to absorb half the x-ray 

in a beam. 
keV: A unit of energy useful with x-rays. It is equal to the energy supplied to an electron when 

accelerated through 1 kilovolt. 
Modulation transfer function (MTF): The ratio of the contrast in the output image of a system to the 

contrast in the object, specified for sine waves of various frequencies. Describes blurring (loss of 

contrast) in an imaging system for different-sized objects. 
Quantum detection efficiency: The percentage of incident x-rays effectively used to create an image. 
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Further Information 

Medical Physics is a monthly scientific and informational journal published for the American Association 
of Physicists in Medicine. Papers here generally cover evaluation of existing medical equipment and 
development of new systems. For more information, contact the American Association of Physicists in 
Medicine, One Physics Ellipse, College Park, MD 20740-3846. 

The Society of Photo-Optical Instrumentation Engineers (SPIE) sponsors numerous conferences and 
publishes their proceedings. Especially relevant is the annual conference on Medical Imaging. Contact 
SPIE, P.O. Box 10, Bellham, WA 98277-0010. 
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Several corporations active in medical imaging work together under the National Electrical Manufac- 
turers Association to develop definitions and testing standards for equipment used in the field. Informa- 
tion can be obtained from NEMA, 2101 L Street, N.W., Washington, DC 20037. 

Critical aspects of medical x-ray imaging are covered by rules of the Food and Drug Administration, 
part of the Department of Health and Human Services. These are listed in the Code of Federal Regulations, 
Title 21. Copies are for sale by the Superintendent of Documents, U.S. Government Printing Office, 
Washington, DC 20402. 



61.2 X-Ray Projection Angiography 

Michael S. Van Lysel 

Angiography is a diagnostic and, increasingly, therapeutic modality concerned with diseases of the 
circulatory system. While many imaging modalities (ultrasound, computed tomography, magnetic res- 
onance imaging, angioscopy) are now available, either clinically or for research, to study vascular struc- 
tures, this section will focus on projection radiography. In this method, the vessel of interest is opacified 
by injection of a radiopaque contrast agent. Serial radiographs of the contrast material flowing through 
the vessel are then acquired. This examination is performed in an angiographic suite, a special procedures 
laboratory, or a cardiac catheterization laboratory. 

Contrast material is needed to opacify vascular structures because the radiographic contrast of blood 
is essentially the same as that of soft tissue. Contrast material consists of an iodine-containing (Z = 53) 
compound, with maximum iodine concentrations of about 350 mg/cm 3 . Contrast material is injected 
through a catheter ranging in diameter roughly from 1 to 3 mm, depending on the injection flow rate 
to be used. Radiographic images of the contrast-filled vessels are recorded using either film or video. 

Digital imaging technology has become instrumental in the acquisition and storage of angiographic 
images. The most important application of digital imaging is digital subtraction angiography (DSA). 
Temporal subtraction is a DSA model in which a preinjection image (the mask) is acquired, the injection 
of contrast agent is then performed, and the sequential images of the opacified vessel(s) are acquired and 
subtracted from the mask. The result, ideally, is that the fixed anatomy is canceled, allowing contrast 
enhancement (similar to computed tomographic windowing and leveling) to provide increased contrast 
sensitivity. 

An increasingly important facet of the angiographic procedure is the use of transluminal interventional 
techniques to effect a therapeutic result. These techniques, including angioplasty, atherectomy, laser 
ablation, and intraluminal stents, rely on digital angiographic imaging technology to facilitate the precise 
catheter manipulations necessary for a successful result. In fact, digital enhancement, storage, and retrieval 
of fluoroscopic images have become mandatory capabilities for digital angiographic systems. 

Figure 61.3 is a schematic representation of an angiographic imaging system. The basic components 
include an x-ray tube and generator, image intensifier, video camera, cine camera (optional for cardiac 
imaging), and digital image processor. 

X-ray Generation 

Angiographic systems require a high-power, sophisticated x-ray generation system in order to produce 
the short, intense x-ray pulses needed to produce clear images of moving vessels. Required exposure 
times range from 100 to 200 ms for cerebral studies to 1 to 10 ms for cardiac studies. Angiographic 
systems use either a constant potential generator, or, increasingly, a medium/high-frequency inverter 
generator. Power ratings for angiographic generators are generally greater than or equal to 80 kW at 
100 kW and must be capable of producing reasonably square x-ray pulses. In most cases (pediatric 
cardiology being an exception), pulse widths of 5 ms or greater are necessary to keep the x-ray tube 
potential in the desirable, high-contrast range of 70 to 90 kVp. 
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FIGURE 61.3 Schematic diagram of an image intensifier-based digital angiographic and cine imaging system. Solid 
arrows indicate image signals, and dotted arrows indicate control signals. 

The x-ray tube is of the rotating-anode variety. Serial runs of high-intensity x-ray pulses result in high 
heat loads. Anode heat storage capacities of 1 mega-heat units (MHU) or greater are required, especially 
in the case of cine angiography. Electronic "heat computers," which calculate and display the current heat 
load of the x-ray tube, are very useful in preventing damage to the x-ray tube. In a high-throughput 
angiographic suite, forced liquid cooling of the x-ray tube housing is essential. Angiographic x-ray tubes 
are of multifocal design, with the focal spot sizes tailored for the intended use. A 0.6-mm (50-kW loading), 
1.2-mm (100-kW loading) bifocal insert is common. The specified heat loads for a given focal spot are 
for single exposures. When serial angiographic exposures are performed, the focal spot load limit must 
be derated (reduced) to account for the accumulation of heat in the focal spot target during the run. 
Larger focal spots (e.g., 1.8 mm) can be obtained for high-load procedures. A smaller focal spot [e.g., 
0.3 and 0.1 mm (bias)] is needed for magnification studies. Small focal spots have become increasingly 
desirable for high-resolution fluoroscopic interventional studies performed with 1000-line video systems. 

Image Formation 
Film 

From the 1950s through to the 1980s, the dominant detector system used for recording the angiographic 
procedure was film/screen angiography using a rapid film changer [Amplatz, 1997]. However, digital 
angiography, described below, has almost completely replaced this technology. Film provides a higher 
spatial resolution than digital. However, as the spatial resolution of digital continues to improve, the 
overwhelming advantages of ease of use and image processing algorithms have prevailed. While film 
changers can still be found in use, few new film/screen systems are being purchased. 

One application where film use is still common is cardiac imaging. Cine angiography is performed 
with a 35-mm motion picture camera optically coupled to the image intensifier output phosphor 
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FIGURE 61.4 Schematic diagram of the optical distributor used to couple the image intensifier (I.I.) output 
phosphor to a video and cine camera. (From Van Lysel MS. Digital angiography. In S Baum (ed), Abrams' Angio- 
graphy, 4th ed. Boston, Little, Brown, with permission.) 

(Fig. 61.4). The primary clinical application is coronary angiography and ventriculography. During cine 
angiography, x-ray pulses are synchronized both with the cine camera shutter and the vertical retrace of 
the system video camera. To limit motion blurring, it is desirable to keep the x-ray pulses as short as 
possible, but no longer than 10 msec. 

Imaging runs generally last from 5 to 10 sec in duration at a frame rate of 30 to 60 frames/sec (fps). 
Sixty fps is generally used for pediatric studies, where higher heart rates are encountered, while 30 fps is 
more typical for adult studies ("digital cine," discussed below, generally is performed at 15 fps). Some 
cine angiographic installations provide biplane imaging in which two independent imaging chains can 
acquire orthogonal images of the injection sequence. The eccentricity of coronary lesions and the asym- 
metric nature of cardiac contraction abnormalities require that multiple x-ray projections be acquired. 
Biplane systems allow this to be done with a smaller patient contrast load. For this reason, biplane systems 
are considered a requirement for pediatric cath labs. They are relatively uncommon for adult cath labs, 
however, where the less complicated positioning of a single plane system is often valued over the reduced 
number of injections possible with a biplane system. The AP and lateral planes of a biplane system are 
energized out of phase, which results in the potential for image degradation due to detection of the 
radiation scattered from the opposite plane. Image intensifier blanking, which shuts down the accelerating 
voltage of the non-imaging plane, is used to eliminate this problem. 

While film remains in common use for cardiac imaging, here too the medium is being replaced by 
digital. Digital angiographic systems (described below) were integrated into the cardiac catheterization 
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imaging system beginning in the 1980s, to provide short term image storage and replay. However, the 
high resolution and frame rate required for cardiac imaging precluded the use of digital hardware for 
long-term (greater than one day) storage. Film and cine are acquired simultaneously using the beam 
splitting mirror in the optical distributor (Fig. 61.4). This situation is rapidly changing, as high-speed 
video servers and the recordable compact disc (CD-R) now provide acceptable performance for this 
application. Most new cardiac imaging systems are purchased without a cine film camera. 

Image Intensifier/Video Camera 

The image intensifier (II) is fundamental to the modern angiographic procedure. The purpose of the 
image intensifier is ( 1 ) to produce a light image with sufficient brightness to allow the use of video and 
film cameras and (2) to produce an output image of small enough size to allow convenient coupling to 
video and film cameras. The image intensifier provides both real-time imaging capability (fluoroscopy), 
which allows patient positioning and catheter manipulation, and recording of the angiographic injection 
(digital angiography, analog video recording, photospot, cine). 

Image intensifier output phosphors are approximately 25 mm in diameter, although large (e.g., 50 to 
60 mm) output phosphor image intensifiers have been developed to increase spatial resolution. The 
modulation transfer function (MTF) of the image intensifier is determined primarily by the input and 
output phosphor stages, so mapping a given input image to a larger output phosphor will improve the 
MTF of the system. The input phosphor of a modern image intensifier is cesium iodide (Csl). The largest 
currently available image intensifier input phosphors are approximately 16 in. The effective input phos- 
phor diameter is selectable by the user. For example, an image intensifier designated 9/7/5 allows the 
user to select input phosphor diameters of 9, 7, or 5 in. These selections are referred to as image intensifier 
modes. The purpose of providing an adjustable input phosphor is to allow the user to trade off between 
spatial resolution and field of view. A smaller mode provides higher spatial resolution both because the 
MTF of the image intensifier improves and because it maps a smaller field of view to the fixed size of 
the video camera target. Generally speaking, angiographic suites designed exclusively for cardiac cathe- 
terization use 9-in. image intensifiers, neuroangiography suites use 12-in. intensifiers, while suites that 
must handle pulmonary, renal, or peripheral angiography require the larger (i.e., 14 to 16 in.) intensifiers. 

The brightness gain of the image intensifier derives from two sources: (1) the increase in electron 
energy produced by the accelerating potential (the flux gain) and (2) the decrease in size of the image as 
it is transferred from the input to the output phosphor (the minification gain). The product of these two 
factors can exceed 5000. However, since the minification gain is a function of the area of the input 
phosphor exposed to the radiation beam (i.e., the image intensifier mode), the brightness gain drops as 
smaller image intensifier modes are selected. This is compensated for by a combination of increasing 
x-ray exposure to maintain the image intensifier light output and opening the video camera aperture. 
Image intensifier brightness gain declines with age and must be monitored to allow timely replacement. 
The specification used for this purpose is the image intensifier conversion factor, defined as the light 
output of the image intensifier per unit x-ray exposure input. Modern image intensifiers have a conversion 
factor of 100 cd/m 2 /mR/s or more for the 9-in. mode. 

With the increasing emergence of digital angiography as the primary angiographic imaging modality, 
image intensifier performance has become increasingly important. In the field, the high-spatial-frequency 
of an image intensifier is assessed by determining the limiting resolution [in the neighborhood of 4 to 
5 line-pairs/mm (lp/mm) in the 9-in mode] , while the low-spatial- frequency response is assessed using 
the contrast ratio (in the neighborhood of 15:1 to 30:1). The National Electrical Manufacturers Associ- 
ation (NEMA) has defined test procedures for measuring the contrast ratio [NEMA, 1992]. The detective 
quantum efficiency (DQE), which is a measure of the efficiency with which the image intensifier utilizes 
the x-ray energy incident on it, is in the neighborhood of 65% (400 urn phosphor thickness, 60 keV). A 
tabulation of the specifications of several commercially available image intensifiers has been published 
bySiedband [1994]. 
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Optical Distributor 

The image present at the image intensifier output phosphor is coupled to the video camera, and any film 
camera present (e.g., cine), by the optical distributor. The components of the distributor are shown in 
Fig. 61.4. There is an aperture for each camera to allow the light intensity presented to each camera to 
be adjusted independently. The video camera aperture is usually a motor-driven variable iris, while the 
film camera aperture is usually fixed. It is important to realize that while the aperture does ensure that 
the proper light level is presented to the camera, more fundamentally, the aperture determines the x-ray 
exposure input to the image intensifier. As a result, both the patient exposure and the level of quantum 
noise in the image are set by the aperture. The noise amplitude in a fluoroscopic or digital angiographic 
image is inversely proportional to the f-number of the optical system. Because the quantum sink of a 
properly adjusted fluorographic system is at the input of the image intensifier, the aperture diameter is 
set, for a given type of examination, to provide the desired level of quantum mottle present in the image. 
The x-ray exposure factors are then adjusted for each patient, by an automatic exposure control (AEC) 
system, to produce the proper postaperture light level. However, some video systems do provide for 
increasing the video camera aperture during fluoroscopy when the maximum entrance exposure does 
not provide adequate light levels on a large patient. 

The beam-splitting mirror was originally meant to provide a moderate-quality video image simulta- 
neous with cine recording in order to monitor the contrast injection during cardiac studies. More recently, 
as the importance of digital angiography has mushroomed, precision-quality mirrors with higher trans- 
mission have been used in order to provide simultaneous diagnostic-quality cine and video. The latest 
development has been the introduction of cine-less digital cardiac systems, in which the video image is 
the sole recording means. The introduction of these systems has sometimes been accompanied by the 
claim that a cine-less system requires less patient exposure due to the fact that light does not have to be 
provided to the cine camera. However, a conventional cine system operates with an excess of light (i.e., 
the cine camera aperture is stopped down). Because the image intensifier input is the quantum sink of 
the system, exposure is determined by the need to limit quantum mottle, not to maintain a given light 
level at the image intensifier output. Therefore, the validity of this claim is dubious. It should be noted, 
however, that because of the difference in spatial resolution capabilities of cine and video, the noise power 
spectrum of images acquired with equal exposure will be different. It is possible that observers accept a 
lower exposure in a video image than in the higher-resolution film image. 

Video System 

The video system in an angiographic suite consists of several components, including the camera head, 
camera control unit (CCU), video monitors, and video recording devices. In addition, a digital image 
processor is integrated with the video system. 

The video camera is responsible for signal generation. Traditionally, pickup-tube based cameras (dis- 
cussed below) have been used. Recently, high resolution (1024 2 ), high frame rate (30 frame/see) charge- 
coupled device (CCD) video cameras have become available and are replacing tube-based cameras in 
some angiographic installations. This trend will probably continue. The image quality ramifications are 
a matter of current research [Blume, 1998]. The advantages of CCD cameras include low-voltage oper- 
ation, reliability, little required setup tuning, and freedom from geometric distortions. Frame-charge 
transfer is the preferred CCD read-out scheme, due to the higher optical fill factor of this configuration. 

Video camera pickup tubes used for angiography are of the photoconductive vidicon-style of construc- 
tion. This type of tube uses a low- velocity scanning beam and generates a signal from the recharge of 
the target by the scanning electron beam. There are several types of vidicon-style tubes in use (Plumbicon, 
Primicon, Saticon, Newvicon) which differ from each other in the material and configuration used for 
target construction. There is an unfortunate confusion in terminology because the original vidicon-style 
tube is referred to simply as a vidicon. The original vidicon has an antimony trisulfide target (Sb 2 S 3 ) and 
exhibits such a high degree of lag (image retention) that it is not useful for angiographic work. Even with 
low-lag angiographic cameras, the residual lag can result in artifacts in subtraction images. Light bias is 
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often used to further reduce lag by ensuring that the target surface is not driven to a negative potential 
by energetic electrons in the scanning beam [Sandrik, 1984]. 

Image noise in a well-designed system is due to x-ray quantum fluctuations and noise related to signal 
generation in the video camera. When used for digital angiographic purposes, it is important that the 
video camera exhibit a high signal-to-noise ratio (at least 60 dB) so that video camera noise does not 
dominate the low-signal (dark) portions of the image. In order to achieve this, the pickup tube must be 
run at high beam currents (2 to 3 (J.A). Because long-term operation of the tube at these beam currents 
can result in damage to the target, beam current is usually either blanked when imaging is not being 
performed, or the current is held at a low level (e.g., 400 nA) and boosted only when high-quality 
angiographic images are required. All pickup tubes currently in use for angiographic imaging exhibit a 
linear response to light input (i.e., y = 1, where the relationship between signal current / and image 
brightness B is described by a relationship of the form II I = (B/BJ"'). This has the disadvantage that the 
range in image brightness presented to the camera often exceeds the camera's dynamic range when a 
highly transmissive portion of the patient's anatomy (e.g., lung) or unattenuated radiation is included 
in the image field, either saturating the highlights, forcing the rest of the image to a low signal level, or 
both. To deal with this problem, it is desirable for the operator to mechanically bolus the bright area 
with metal filters, saline bags, etc. Specially constructed filters are available for commonly encountered 
problems, such as the transmissive region between the patient's legs during runoff studies of the legs, and 
most vendors provide a controllable metal filter in the x-ray collimator that the operator can position 
over bright spots with a joystick. 

In addition to mechanical bolusing performed by the laboratory staff, most system vendors have 
incorporated some form of gamma curve modification into the systems. Usually performed using analog 
processing in the CCU, the technique applies a nonlinear transfer curve to the originally linear data. 
There are two advantages to this technique. First, the CRT of the display monitor has reduced gain at 
low signal, so imposing a transfer function with y ~ 0.5 via gamma-curve modification provides a better 
match between the video signal and the display monitor. Second, if the modification is performed prior 
to digitization, a y < 1 results in a more constant ratio between the ADC step size and the image noise 
amplitude across the full range of the video signal. This results in less contouring in the dark portions 
of the digital image, especially in images that have been spatially filtered. It is important to note, however, 
that gamma-curve modification does not eliminate the desirable effects of mechanically bolusing the 
image field prior to image acquisition. This is so because bolusing allows more photon flux to be selectively 
applied to the more attenuating regions of the patient, which decreases both quantum and video noise 
in those regions. Bolusing is especially important for subtraction imaging. 

The video system characteristic most apparent to the user is the method employed in scanning the 
image. Prior to the advent of the digital angiography, EIA RS-170 video (525-line, 30 Hz frames, 2:1 
interlace) was the predominant standard used for fluoroscopic systems in the United States. However, 
this method of scanning has definite disadvantages for angiography, including low resolution and image 
artifacts related to the interlaced format. The inclusion of a digital image processor in the imaging chain, 
functioning as an image buffer, allows the scanning mode to be tailored to the angiographic procedure. 
Many of the important video scanning modes used for angiographic work are dependent on the ability 
of image processors to perform scan conversion operations. Two typical scan conversion operations are 
progressive-to-interlaced conversion and upscanning. Progressive-to-interlaced scan conversion allows 
progressive scanning (also referred to as sequential scanning) to be used for image acquisition and 
interlaced scanning for image display. Progressive scanning is a noninterlaced scan mode in which all 
the lines are read out in a single vertical scan. Progressive scanning is especially necessary when imaging 
moving arteries, such as during coronary angiography [Seibert et al., 1984]. In noncardiac work, pro- 
gressive scan acquisition is usually combined with beam blanking. Beam blanking refers to the condition 
in which the pickup tube beam current is blanked (turned off) for one or more integer number of frames. 
This mode is used in order to allow the image to integrate on the camera target prior to readout (Fig. 61.5). 
In this way, x-ray pulses shorter than one frame period can be acquired without scanning artifacts, and 
x-ray pulses longer than one frame period can be used in order to increase the x-ray quantum statistics 
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FIGURE 61.5 Timing diagram for image acquisition using the pulsed-progressive mode. (From Van Lysel MS. 
Digital angiography. In S Baum (ed), Abrams' Angiography, 4th ed. Boston, Little, Brown, with permission.) 

of an image. Upscanning refers to the acquisition of data at a low line rate (e.g., 525 lines) and the display 
of that data at a higher line rate (e.g., 1023 lines) [Holmes et al., 1989]. The extra lines are produced by 
either replication of the actual data or, more commonly, by interpolation (either linear or spline). 
Upscanning also can be performed in the horizontal direction as well, but this is less typical. Upscanning 
is used to decrease the demands on the video camera, system bandwidth, and digital storage requirements 
while improving display contrast and decreasing interfield flicker. 

The image buffering capability of a digital system can provide several operations aimed at reducing 
patient exposure. If fluoroscopy is performed with pulsed rather than continuous x-rays, then the operator 
has the freedom to choose the frame rate. During pulsed-progressive fluoroscopy, the digital system 
provides for display refresh without flicker. Frame rates of less than 30 frames per second can result in 
lower patient exposure, though; because of the phenomenon of eye integration, the x-ray exposure per 
pulse must be increased as the frame rate drops in order to maintain low contrast detectability [Aufrichtig 
et al., 1994]. Last image hold, which stores and displays the last acquired fluoroscopic frame, also can 
result in an exposure reduction. Combining last image hold with graphic overlays allows collimator 
shutters and bolus filters to be positioned with the x-rays turned off. 

Digital Angiography 

Digital imaging technology has quickly replaced film-based recording for most angiographic procedures. 
Digital image processing provides the ability to manipulate the contrast and spatial-frequency characteristics 
of the angiographic image, as well as providing immediate access to the image data during the procedure. 
The rapid application of digital imaging technology to the angiographic procedure was facilitated by 
the fact that the image intensifier and video camera imaging chain was already in use when digital imaging 
appeared. It is a relatively simple matter to digitize the video camera output. Theoretically, additional 
noise is added to the image due to quantitization errors associated with the digitization process. This 
additional noise can be kept insignificantly small by using a sufficient number of digital levels so that 
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FIGURE 61.6 Detector modulation transfer function, including limits due to the image intensifier (II), video camera 
(TV), and sampling, including the antialiasing filter (pixels). Figures are for the 6.5-in image intensifier mode for 
512- and 1024-pixel matrices. In both cases, the total detector MTF is given by the solid line. Data used for this 
figure from Verhoeven [1985]. 

the amplitude of one digital level is approximately equal to the amplitude of the standard deviation in 
image values associated with the noise (x-ray quantum and electronic noise) in the image prior to 
digitization [Kruger et al., 1981]. To meet this condition, most digital angiographic systems employ a 
10-bit (1024-level) analog-to-digital convertor (ADC). Those systems which are designed to digitize high- 
noise (i.e., low x-ray exposure) images exclusively can employ an 8-bit (256-level) ADC. Such systems 
include those designed for cardiac and fluoroscopic applications. 

The spatial resolution of a digital angiographic image is determined by several factors, including the 
size of the x-ray tube focal spot, the modulation transfer function of the image intensifier-video camera 
chain, and the size of the pixel matrix. The typical image matrix dimensions for non-cardiac digital 
angiographic images is 1024 x 1024. Cardiac systems use 512 x 512, 1024(H) x 512(V), or 1024 x 1024. 
Sampling rates required to digitize 512 2 and 1024 2 matrices in the 33-ms frame period of conventional 
video are 10 and 40 MHz, respectively. Figure 61.6 shows an example of the detector MTF of a digital 
angiographic system. The product of the image intensifier and video system MTF constitute the presa- 
mpling MTF [Fujita et al., 1985]. It is seen that the effect of sampling with a 512-pixel matrix is to truncate 
the high-frequency tail of the presampling MTF, while a 1024-pixel matrix imposes few additional 
limitations beyond that associated with the analog components (especially for small image intensifier 
modes) [Verhoeven, 1985]. However, because of blurring associated with the x-ray tube focal spot, the 
full advantage of the higher density pixel matrix can rarely be realized, clinically (Fig. 61.7). 

Digital Image Processor 

The digital image processor found in a modern angiographic suite is a dedicated device designed specially 
to meet the demands of the angiographic procedure. Hardware is structured as a pipeline processor to 
perform real-time processing at video rates. Image-subtraction, integration, spatial-filtration, and tem- 
poral-filtration algorithms are hardwired to meet this requirement. Lookup tables (LUTs) are used to 
perform intensity transformations (e.g., contrast enhancement and logarithmic transformation). A more 
general-purpose host computer is used to control the pipeline processor and x-ray generator, respond to 
user input, and perform non-real-time image manipulations. 

The most clinically important image-processing algorithm is temporal subtraction (DSA). Subtraction 
imaging is used for most vascular studies. Subtraction allows approximately a factor of 2 reduction in 
the amount of injected contrast material. As a result, DSA studies can be performed with less contrast 
load and with smaller catheters than film/screen angiography. The primary limitation of DSA is a 
susceptibility to misregistration artifacts resulting from patient motion. Some procedures that are par- 
ticularly susceptible to motion artifacts are routinely performed in a nonsubtracted mode. Unsubtracted 
digital angiographic studies are usually performed with the same amount of contrast material as 
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FIGURE 61.7 Experimental determination of the limiting resolution (high-contrast object and high x-ray exposure) 
for 512- and 1024-pixel matrices, focal spots of actual dimensions 1.0 and 1.5 mm, and a 6.5-in image intensifier 
mode, as a function of geometric magnification. (From Mistretta CA, Peppier WW. 1988. Digital cardiac x-ray 
imaging: Fundamental principles. Am ) Cardiac Imaging 2:26, with permission.) 

film/screen studies. Cardiac angiography is one procedure that is generally performed without subtrac- 
tion, although in any particular study, if patient and respiratory motion are absent, it is possible to obtain 
high-quality time subtractions using phase-matched mask subtractions in which the preinjection mask 
and postinjection contrast images are matched with respect to cardiac phase. In order to do this efficiently, 
it is necessary to digitize the ECG signal along with the image data. 

Additional examples of unsubtracted digital angiographic studies are those in uncooperative patients 
(e.g., trauma) and digital runoff studies of the vessels in the leg. In a digital runoff study it is necessary 
to follow the bolus down the legs by moving the patient (or the image gantry). This motion causes 
difficulties with both mask registration and uniform exposure intensities between the mask and contrast 
images. The high contrast sensitivity of DSA is valuable for runoff studies, however, because the small 
vessels and slow flow in the lower extremities can make vessel visualization difficult. Recently, making 
use of programmed table (or gantry) motion and pixel-shifting strategies, x-ray vendors have begun to 
offer a viable digital subtraction runoff mode. 

In addition to subtraction angiography, two filtration algorithms have become clinically important. 
The first is high-pass spatial filtration for the purposes of providing edge enhancement. Real-time edge 
enhancement of fluoroscopy is especially important for interventional procedures, such as angioplasty, 
where the task requires visualization of high-frequency objects such as vessel edges and guidewires. 
Because increasing the degree of edge enhancement also increases image noise, operator control of the 
degree of enhancement (accomplished by adjusting the size and weighing of the convolution kernel) is 
an important feature. 
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The second filtration algorithm often available from a digital angiographic system is low-pass temporal 
filtration (recursive filtering) [Rowlands, 1992]. Temporal filtration is used to reduce quantum noise 
levels in fluoroscopic images without increasing patient exposure. Recursive filtering is a more desirable 
method than simple image integration because it requires only two memory planes and because it is a 
simple matter to turn the filtering algorithm off, on a pixel-by-pixel basis, when motion is detected. 
Motion-detection circuits monitor the frame-to-frame change in pixel values and assume that an object 
has moved into or out of the pixel if the change exceeds a preset threshold. 

Image Storage 

Storage needs during the angiographic procedure are easily met by modern hard drives. These often are 
configured to provide real-time (i.e., video rate) storage. The immediate access to images provided by 
real-time disk technology is one of the major advantages of digital angiography over film. Not only is it 
unnecessary to wait for film to be developed, but review of the image data after the patient's procedure 
is completed is facilitated by directories and specialized review software. For example, a popular image 
menu feature is the presentation to users of a low-resolution collage of available images from which they 
may select a single image or entire run for full-resolution display. 

While online storage of recent studies is a strength of digital angiography, long-term (archival) storage 
is a weakness. Archival devices provided by vendors are generally proprietary devices that make use of 
various recording media. There is an established communications protocol (ACR-NEMA) [NEMA, 1993] 
for network transfer of images. For the time being, while large institutions and teaching hospitals are 
investing in sophisticated digital picture archiving and communications systems (PACS), archival needs 
at most institutions are met by storage of hardcopy films generated from the digital images. Hardcopy 
devices include multiformat cameras (laser or video) and video thermal printers. Laser cameras, using 
either an analog or a digital interface to the digital angiographic unit, can provide diagnostic-quality, 
large-format, high-contrast, high-resolution films. Multiformat video cameras, which expose the film 
with a CRT, are a less expensive method of generating diagnostic-quality images but are also more 
susceptible to drift and geometric distortions. Thermal printer images are generally used as convenient 
method to generate temporary hardcopy images. 

Cardiac angiography labs have long employed a similar method, referred to as parallel cine, in which 
both digital and cine images are recorded simultaneously by use of the semitransparent mirror in the 
image intensifier optical distributor. Immediate diagnosis would be performed off the digital monitor 
while the 35-mm film would provide archival storage and the ability to share the image data with other 
institutions. However, due to the promulgation of an image transfer standard using the recordable CD 
(CD-R), many laboratories are abandoning cine film. While CD-R cannot replay images at full real-time 
rates, the read-rate (>1 MB/sec) is sufficient to load old data quickly from CD-R to a workstation hard 
drive for review. Low volume laboratories can use CD-R as an archival storage method. Higher volume 
labs are installing networked video file servers to provide online or near-line access to archived patient 
studies. 

Summary 

For decades, x-ray projection film/screen angiography was the only invasive modality available for the 
diagnosis of vascular disease. Now several imaging modalities are available to study the cardiovascular 
system, most of which are less invasive than x-ray projection angiography. However, conventional angiog- 
raphy also has changed dramatically during the last decade. Digital angiography has replaced film/screen 
angiography in most applications. In addition, the use and capabilities of transluminal interventional 
techniques have mushroomed, and digital angiographic processor modes have expanded significantly in 
support of these interventional procedures. As a consequence, while less invasive technologies, such as 
MR angiography, make inroads into conventional angiography's diagnostic applications, it is likely that 
x-ray projection angiography will remain an important clinical modality for many years to come. 
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Defining Terms 

Bolus: This term has two, independent definitions: (1) material placed in a portion of the x-ray beam 
to reduce the sense dynamic range and (2) the injection contrast material. 

Digital subtraction angiography (DSA): Methodology in which digitized angiographic images are 
subtracted in order to provide contrast enhancement of the opacified vasculature. Clinically, 
temporal subtraction is the algorithm used, though energy subtraction methods also fall under 
the generic term DSA. 

Parallel cine: The simultaneous recording of digital and cine-film images during cardiac angiography. 
In this mode, digital image acquisition provides diagnostic-quality images. 

Picture archiving and communications systems (PACS): Digital system or network for the electronic 
storage and retrieval of patient images and data. 

Progressive scanning: Video raster scan method in which all horizontal lines are read out in a single 
vertical scan of the video camera target. 

Pulsed-progressive fluoroscopy: Method of acquiring fluoroscopic images in which x-rays are pro- 
duced in discrete pulses coincident with the vertical retrace period of the video camera. The video 
camera is then read out using progressive scanning. This compares with the older fluoroscopic 
method of producing x-rays continuously, coupled with interlaced video camera scanning. 

Temporal subtraction: Also known as time subtraction or mask-mode subtraction. A subtraction mode 
in which an unopacified image (the mask, usually acquired prior to injection) is subtracted from 
an opacified image. 

Upscanning: Scan conversion method in which the number of pixels or video lines displayed is higher 
(usually by a factor of 2) than those actually acquired from the video camera. Extra display data 
are produced by either replication or interpolation of the acquired data. 
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61.3 Mammography 

Martin J. Yaffe 

Mammography is an x-ray imaging procedure for examination of the breast. It is used primarily for the 
detection and diagnosis of breast cancer, but also for pre-surgical localization of suspicious areas and in 
the guidance of needle biopsies. 

Breast cancer is a major killer of women. Approximately 179,000 women were diagnosed with breast 
cancer in the U.S. in 1998 and 43,500 women died of this disease [Landis, 1998]. Its cause is not currently 
known; however, it has been demonstrated that survival is greatly improved if disease is detected at an 
early stage [Tabar, 1993; Smart, 1993]. Mammography is at present the most effective means of detecting 
early stage breast cancer. It is used both for investigating symptomatic patients (diagnostic mammogra- 
phy) and for screening of asymptomatic women in selected age groups. 

Breast cancer is detected on the basis of four types of signs on the mammogram: 

1. The characteristic morphology of a tumor mass. 

2. Certain presentations of mineral deposits as specks called microcalcifications. 

3. Architectural distortion of normal tissue patterns caused by the disease. 

4. Asymmetry between corresponding regions of images of the left and right breast. 

Principles of Mammography 

The mammogram is an x-ray shadowgram formed when x-rays from a quasi-point source irradiate the 
breast and the transmitted x-rays are recorded by an image receptor. Because of the spreading of the 
x-rays from the source, structures are magnified as they are projected onto the image receptor. The signal 
is a result of differential attenuation of x-rays along paths passing through the structures of the breast. 
The essential features of image quality are summarized in Fig. 61.8. This is a one-dimensional profile 
of x-ray transmission through a simplified computer model of the breast [Fahrig, 1992], illustrated in 
Fig. 61.9. A region of reduced transmission corresponding to a structure of interest such as a tumor, a 
calcification, or normal fibroglandular tissue is shown. The imaging system must have sufficient spatial 
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FIGURE 61.8 Profile of a simple x-ray projection image, illustrating the role of contrast, spatial resolution, and 
noise in mammographic image quality. 
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FIGURE 61.9 Simplified computer model of the mammographic image acquisition process. 

resolution to delineate the edges of fine structures in the breast. Structural detail as small as 50 |im must 
be adequately resolved. Variation in x-ray attenuation among tissue structures in the breast gives rise to 
contrast. The detectability of structures providing subtle contrast is impaired, however, by an overall 
random fluctuation in the profile, referred to as mottle or noise. Because the breast is sensitive to ionizing 
radiation, which at least for high doses is known to cause breast cancer, it is desirable to use the lowest 
radiation dose compatible with excellent image quality. The components of the imaging system will be 
described and their design will be related to the imaging performance factors discussed in this section. 

Physics of Image Formation 

In the model of Fig. 61.9, an "average" breast composed of 50% adipose tissue and 50% fibroglandular 
tissue is considered. For the simplified case of monoenergetic x-rays of energy, E, the number of x-rays 
recorded in a fixed area of the image is proportional to 
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in the shadow of the lesion or other structure of interest. In Eqs. (61.1) and (61.2), N (E) is the number 
of x-rays that would be recorded in the absence of tissue in the beam, u and u' are the attenuation 
coefficients of the breast tissue and the lesion, respectively, T is the thickness of the breast, and t is the 
thickness of the lesion. 

The difference in x-ray transmission gives rise to subject contrast which can be defined as: 



N„+N, 



(61.3) 



For the case of monoenergetic x-rays and temporarily ignoring scattered radiation, 



1-e 



(61.4) 



1+e 



i.e., contrast would depend only on the thickness of the lesion and the difference between its attenuation 
coefficient and that of the background material. These are not valid assumptions and in actuality contrast 
also depends on u and T 

Shown in Fig. 61.10 are x-ray attenuation coefficients measured vs. energy on samples of three types 
of materials found in the breast: adipose tissue, normal fibroglandular breast tissue, and infiltrating ductal 
carcinoma (one type of breast tumor) [Johns, 1987]. Both the attenuation coefficients themselves and 
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FIGURE 61.10 Measured x-ray linear attenuation coefficients of breast fibroglandular tissue, breast fat, and infil- 
trating ductal carcinoma plotted vs. x-ray energy. 
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FIGURE 61.11 Dependence of mammographic subject contrast on x-ray energy. 

their difference (u' - |i) decrease with increasing E. As shown in Fig. 61.11, which is based on Eq. (61.4), 
this causes C s to fall as x-ray energy increases. Note that the subject contrast of even small calcifications 
in the breast is greater than that for a tumor because of the greater difference in attenuation coefficient 
between calcium and breast tissue. 

For a given image recording system (image receptor), a proper exposure requires a specific value of 
x-ray energy transmitted by the breast and incident on the receptor, i.e., a specific value of N B . The breast 
entrance skin exposure 1 (ESE) required to produce an image is, therefore, proportional to 



N„ = NB\ 



(4 



WT 



(61.5) 



Because u decreases with energy, the required exposure for constant signal at the image receptor, N B , will 
increase if E is reduced to improve image contrast. A better measure of the risk of radiation-induced 
breast cancer than ESE is the mean glandular dose (MGD) [BEIRV, 1990]. MGD is calculated as the 
product of the ESE and a factor, obtained experimentally or by Monte Carlo radiation transport calcu- 
lations, which converts from incident exposure to dose [Wu, 1991, 1994]. The conversion factor increases 
with E so that MGD does not fall as quickly with energy as does entrance exposure. The trade-off between 
image contrast and radiation dose necessitates important compromises in establishing mammographic 
operating conditions. 

Equipment 

The mammography unit consists of an x-ray tube and an image receptor mounted on opposite sides of 
a mechanical assembly or gantry. Because the breast must be imaged from different aspects and to 
accommodate patients of different height, the assembly can be adjusted in a vertical axis and rotated 
about a horizontal axis as shown in Fig. 61.12. 

Most general radiography equipment is designed such that the image field is centered below the x-ray 
source. In mammography, the system's geometry is arranged as in Fig. 61.13a where a vertical line from 
the x-ray source grazes the chest wall of the patient and intersects orthogonally with the edge of the 
image receptor closest to the patient. If the x-ray beam were centered over the breast as in Fig. 61.13b, 



'Exposure is expressed in Roentgens (R) (which is not an SI unit) or in Coulombs of ionization collected per 
kilogram of air. 
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FIGURE 61.12 Schematic diagram of a dedicated mammography machine. 

some of the tissue near the chest wall would be projected inside of the patient where it could not be 
recorded. 

Radiation leaving the x-ray tube passes through a metallic spectral-shaping filter, a beam-defining 
aperture, and a plate which compresses the breast. Those rays transmitted through the breast are incident 
on an anti-scatter "grid" and then strike the image receptor where they interact and deposit most of their 
energy locally. A fraction of the x-rays pass through the receptor without interaction and impinge upon 
a sensor which is used to activate the automatic exposure control mechanism of the unit. 

X-Ray Source 

Practical monoenergetic x-ray sources are not available and the x-rays used in mammography arise from 
bombardment of a metal target by electrons in a hot-cathode vacuum tube. The x-rays are emitted from 
the target over a spectrum of energies, ranging up to the peak kilovoltage applied to the x-ray tube. 
Typically, the x-ray tube employs a rotating anode design in which electrons from the cathode strike the 
anode target material at a small angle (0 to 16°) from normal incidence (Fig. 61.14). Over 99% of the 
energy from the electrons is dissipated as heat in the anode. The angled surface and the distribution of 
the electron bombardment along the circumference of the rotating anode disk allows the energy to be 
spread over a larger area of target material while presenting a much smaller effective focal spot as viewed 
from the imaging plane. On modern equipment, the typical "nominal" focal spot size for normal contact 
mammography is 0.3 mm, while the smaller spot used primarily for magnification is 0.1 mm. The 
specifications for x-ray focal spot size tolerance, established by NEMA (National Electrical Manufacturers 
Association) or the IEC (International Electrotechnical Commission) allow the effective focal spot size to 
be considerably larger than these nominal sizes. For example, the NEMA specification allows the effective 
focal spot size to be 0.45 mm in width and 0.65 mm in length for a nominal 0.3 mm spot and 0.15 mm 
in each dimension for a nominal 0.1 mm spot. 

The nominal focal spot size is defined relative to the effective spot size at a "reference axis". As shown 
in Fig. 61.14, this reference axis, which may vary from manufacturer to manufacturer, is normally 
specified at some mid-point in the image. The effective size of the focal spot will monotonically increase 
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FIGURE 61.13 Geometric arrangement of system components in mammography, (a) Correct alignment provides 
good tissue coverage, (b) incorrect alignment causes tissue near the chest wall not to be imaged. 

from the anode side to the cathode side of the imaging field. Normally, x-ray tubes are arranged such 
that the cathode side of the tube is adjacent to the patient's chest wall, since the highest intensity of x-rays 
is available at the cathode side, and the attenuation of x-rays by the patient is generally greater near the 
chest wall of the image. 

The spatial resolution capability of the imaging system is partially determined by the effective size of 
the focal spot and by the degree of magnification of the anatomy at any plane in the breast. This is illustrated 
in Fig. 61.15 where, by similar triangles, the unsharpness region due to the finite size of the focal spot is 
linearly related to the effective size of the spot and to the ratio of OID to SOD, where SOD is the source- 
object distance and OID is the object-image receptor distance. Because the breast is a three-dimensional 
structure, this ratio and, therefore, the unsharpness will vary for different planes within the breast. 

The size of the focal spot determines the heat loading capability of the x-ray tube target. For smaller 
focal spots, the current through the x-ray tube must be reduced, necessitating increased exposure times 
and the possibility of loss of resolution due to motion of anatomical structures. Loss of geometric 
resolution can be controlled in part by minimizing OID/SOD, i.e., by designing the equipment with 
greater source-breast distances, by minimizing space between the breast and the image receptor, and by 
compressing the breast to reduce its overall thickness. 

Magnification is often used intentionally to improve the signal-to-noise ratio of the image. This is 
accomplished by elevating the breast above the image receptor, in effect reducing SOD and increasing 
OID. Under these conditions, resolution is invariably limited by focal spot size and use of a small spot 
for magnification imaging (typically a nominal size of 0.1 mm) is critical. 
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FIGURE 61.14 Angled-target x-ray source provides improved heat loading but causes effective focal spot size to 
vary across the image. 

Since monoenergetic x-rays are not available, one attempts to define a spectrum providing energies 
which give a reasonable compromise between radiation dose and image contrast. The spectral shape can 
be controlled by adjustment of the kilovoltage, choice of the target material, and the type and thickness 
of metallic filter placed between the x-ray tube and the breast. 

Based on models of the imaging problem in mammography, it has been suggested that the optimum 
energy for imaging lies between 18 and 23 keV, depending on the thickness and composition of the breast 
[Beaman, 1982]. It has been found that for the breast of typical thickness and composition, the charac- 
teristic x-rays from molybdenum at 17.4 and 19.6 keV provide good imaging performance. For this 
reason, molybdenum target x-ray tubes are used on the vast majority of mammography machines. 

Most mammography tubes use beryllium exit windows between the evacuated tube and the outside 
world since glass or other metals used in general purpose tubes would provide excessive attenuation of 
the useful energies for mammography. Figure 61.16 compares tungsten target and molybdenum target 
spectra for beryllium window x-ray tubes. Under some conditions, tungsten may provide appropriate 
image quality for mammography; however, it is essential that the intense emission of L radiation from 
tungsten be filtered from the beam before it is incident upon the breast, since extremely high doses to 
the skin would result from this radiation without useful contribution to the mammogram. 

Filtration of the X-Ray Beam 

In conventional radiology, filters made of aluminum or copper are used to provide selective removal of 
low x-ray energies from the beam before it is incident upon the patient. In mammography, particularly 
when a molybdenum anode x-ray tube is employed, a molybdenum filter 20 to 35 um thick is generally 
used. This filter attenuates x-rays both at low energies and those above its own K-absorption edge allowing 
the molybdenum characteristic x-rays from the target to pass through the filter with relatively high 
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FIGURE 61.15 Dependence of focal spot unsharpness on focal spot size and magnification factor. 

efficiency. As illustrated in Fig. 61.17, this K edge filtration results in a spectrum enriched with x-ray 
energies in the range of 17 to 20 keV. 

Although this spectrum is relatively well suited for imaging the breast of average attenuation, slightly 
higher energies are desirable for imaging dense thicker breasts. Because the molybdenum target spectrum 
is so heavily influenced by the characteristic x-rays, an increase in the kilovoltage alone does not sub- 
stantially change the shape of the spectrum. The beam can be "hardened", however, by employing filters 
of higher atomic number than molybdenum. For example, rhodium (atomic no. 45) has a K absorption 
edge at 23 keV, providing strong attenuation both for x-rays above this energy and for those at substantially 
lower energies. Used with a molybdenum target x-ray tube and slightly increased kV, it provides a 
spectrum with increased penetration (reduced dose) compared to the Mo/Mo combination. 

It is possible to go further in optimizing imaging performance, by "tuning" the effective spectral energy 
by using other target materials in combination with appropriate K-edge filters [Jennings, 1993]. One 
manufacturer employs an x-ray tube incorporating both molybdenum and rhodium targets, where the 
electron beam can be directed toward one or the other of these materials [Heidsieck, 1991]. With this 
system, the filter material (rhodium, molybdenum, etc.) can be varied to suit the target that has been 
selected. Similarly, work has been reported on K-edge filtration of tungsten spectra [Desponds, 1991], 
where the lack of pronounced K characteristic peaks provides more flexibility in spectral shaping with 
filters. 

Compression Device 

There are several reasons for applying firm (but not necessarily painful) compression to the breast during 
the examination. Compression causes the different tissues to be spread out, minimizing superposition 
from different planes and thereby improving a conspicuity of structures. As will be discussed later, 
scattered radiation can degrade contrast in the mammogram. The use of compression decreases the ratio 
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of scattered to directly transmitted radiation reaching the image receptor. Compression also decreases 
the distance from any plane within the breast to the image receptor (i.e., OID) and in this way reduces 
geometric unsharpness. The compressed breast provides lower overall attenuation to the incident x-ray 
beam, allowing the radiation dose to be reduced. The compressed breast also provides more uniform 
attenuation over the image. This reduces the exposure range which must be recorded by the imaging 
system, allowing more flexibility in choice of films to be used. Finally, compression provides a clamping 
action which reduces anatomical motion during the exposure reducing this source of image unsharpness. 
It is important that the compression plate allows the breast to be compressed parallel to the image 
receptor, and that the edge of the plate at the chest wall be straight and aligned with both the focal spot 
and image receptor to maximize the amount of breast tissue which is included in the image (see 
Fig. 61.13). 

Anti-Scatter Grid 

Lower x-ray energies are used for mammography than for other radiological examinations. At these 
energies, the probability of photoelectric interactions within the breast is significant. Nevertheless, the 
probability of Compton scattering of x-rays within the breast is still quite high. Scattered radiation 
recorded by the image receptor has the effect of creating a quasi-uniform haze on the image and causes 
the subject contrast to be reduced to 

C s = C ° (61.6) 

1 + SPR 

where C is the contrast in the absence of scattered radiation, given by Eq. 61.4 and SPR is the scatter- 
to-primary (directly transmitted) x-ray ratio at the location of interest in the image. In the absence of 
an anti-scatter device, 37 to 50% of the total radiation incident on the image receptor would have 
experienced a scattering interaction within the breast, i.e., the scatter- to-primary ratio would be 0.6:1.0. 
In addition to contrast reduction, the recording of scattered radiation uses up part of the dynamic range 
of the image receptor and adds statistical noise to the image. 

Anti-scatter grids have been designed for mammography. These are composed of linear lead (Pb) septa 
separated by a rigid interspace material. Generally, the grid septa are not strictly parallel but focused 
(toward the x-ray source). Because the primary x-rays all travel along direct lines from the x-ray source 
to the image receptor, while the scatter diverges from points within the breast, the grid presents a smaller 
acceptance aperture to scattered radiation than to primary radiation and thereby discriminates against 
scattered radiation. Grids are characterized by their grid ratio (ratio of the path length through the 
interspace material to the interseptal width) which typically ranges from 3.5:1 to 5:1. When a grid is 
used, the SPR is reduced typically by a factor of about 5, leading in most cases to a substantial improve- 
ment in image contrast [Wagner, 1991]. 

On modern mammography equipment, the grid is an integral part of the system, and during x-ray 
exposure is moved to blur the image of the grid septa to avoid a distracting pattern in the mammogram. 
It is important that this motion be uniform and of sufficient amplitude to avoid non-uniformities in the 
image, particularly for short exposures that occur when the breast is relatively lucent. 

Because of absorption of primary radiation by the septa and by the interspace material, part of the 
primary radiation transmitted by the patient does not arrive at the image receptor. In addition, by 
removing some of the scattered radiation, the grid causes the overall radiation fluence to be reduced 
from that which would be obtained in its absence. To obtain a radiograph of proper optical density, the 
entrance exposure to the patient must be increased by a factor known as the Bucky factor to compensate 
for these losses. Typical Bucky factors are in the range of 2 to 3. 

A linear grid does not provide scatter rejection for those quanta traveling in planes parallel to the 
septa. Recently a crossed grid that consists of septa that run in orthogonal directions has been introduced 
for this purpose. The improved scatter rejection is accomplished at doses comparable to those required 
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with a linear grid, because the interspace material of the crossed grid is air rather than solid. To avoid 
artifacts, the grid is moved in a very precise way during the exposure to ensure a uniform blurring of 
the image of the grid itself. 

Image Receptor 

Fluorescent Screens 

When first introduced, mammography was carried out using direct exposure radiographic film in order 
to obtain the high spatial resolution required. Since the mid-1970s, high resolution fluorescent screens 
have been used to convert the x-ray pattern from the breast into an optical image. These screens are used 
in conjunction with single-coated radiographic film, and the configuration is shown in Fig. 61.18. With 
this arrangement, the x-rays pass through the cover of a light-tight cassette and the film to impinge upon 
the screen. Absorption is exponential, so that a large fraction of the x-rays are absorbed near the entrance 
surface of the screen. The phosphor crystals which absorb the energy produce light in an isotropic 
distribution. Because the film emulsion is pressed tightly against the entrance surface of the screen, the 
majority of the light quanta have only a short distance to travel to reach the film. Light quanta traveling 
longer distances have an opportunity to spread laterally (see Fig. 61.18), and in this way degrade the 
spatial resolution. To discriminate against light quanta which travel along these longer oblique paths, the 
phosphor material of the screen is generally treated with a dye which absorbs much of this light, giving 
rise to a sharper image. A typical phosphor used for mammography is gadolinium oxysulphide (Gd 2 2 S). 
Although the K-absorption edge of gadolinium occurs at too high an energy to be useful in mammog- 
raphy, the phosphor material is dense (7.44 g/cm 3 ) so that the quantum efficiency (the fraction of incident 
x-rays which interact with the screen), is good (about 60%). Also, the conversion efficiency of this 
phosphor (fraction of the absorbed x-ray energy converted to light) is relatively high. The light emitted 
from the fluorescent screen is essentially linearly dependent upon the total amount of energy deposited 
by x-rays within the screen. 
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FIGURE 61.19 Characteristic curve of a mammographic screen-film image receptor. 

Film 

The photographic film emulsion for mammography is designed with a characteristic curve such as that 
shown in Fig. 61.19, which is a plot of the optical density (blackness) provided by the processed film vs. 
the logarithm of the x-ray exposure to the screen. Film provides non-linear input-output transfer char- 
acteristics. The local gradient of this curve controls the display contrast presented to the radiologist. 
Where the curve is of shallow gradient, a given increment of radiation exposure provides little change 
in optical density, rendering structures imaged in this part of the curve difficult to visualize. Where the 
curve is steep, the film provides excellent image contrast. The range of exposures over which contrast is 
appreciable is referred to as the latitude of the film. Because the film is constrained between two optical 
density values — the base + fog density of the film, where no intentional x-ray exposure has resulted, and 
the maximum density provided by the emulsion — there is a compromise between maximum gradient 
of the film and the latitude that it provides. For this reason, some regions of the mammogram will 
generally be underexposed or overexposed, i.e., rendered with sub-optimal contrast. 

Film Processing 

Mammography film is processed in an automatic processor similar to that used for general radiographic 
films. It is important that the development temperature, time, and rate of replenishment of the developer 
chemistry be compatible with the type of film emulsion used and be designed to maintain good contrast 
of the film. 

Noise and Dose 

Noise in mammography results primarily from two sources — the random absorption of x-rays in the 
detector and the granularity associated with the screen and the film emulsion. The first, commonly known 
as quantum noise, is governed by Poisson statistics so that for a given area of image the standard deviation 
in the number of x-rays recorded is equal to the square root of the mean number recorded. In other 
words, the noise in the image is dependent on both the amount of radiation which strikes the imaging 
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system per unit area, and the quantum efficiency of the imaging system. The quantum efficiency is related 
to the attenuation coefficient of the phosphor material and the thickness of the screen. In order to 
maintain high spatial resolution, the screen must be made relatively thin to avoid lateral diffusion of 
light. The desirability of maintaining a relatively low quantum noise level in the image mandates that 
the conversion efficiency of the screen material and the sensitivity of the film not be excessively high. 
With very high conversion efficiency, the image could be produced at low dose but with an inadequate 
number of quanta contributing to the image. Similarly, film granularity increases as more sensitive films 
are used, so that again film speed must be limited to maintain high image quality. For current high 
quality mammographic imaging employing an anti-scatter grid, with films exposed to a mean optical 
density of at least 1.6, the mean glandular dose to a 5-cm thick compressed breast consisting of 50% 
fibroglandular and 50% adipose tissue is in the range of 1 to 2 milligray [Conway, 1992]. 

Automatic Exposure Control 

It is difficult for the technologist to estimate the attenuation of the breast by inspection and, therefore, 
modern mammography units are equipped with automatic exposure control (AEC). The AEC radiation 
sensors are located behind the image receptor so that they do not cast a shadow on the image. The sensors 
measure the x-ray fluence transmitted through both the breast and the receptor and provide a signal 
which can be used to discontinue the exposure when a certain preset amount of radiation has been 
received by the image receptor. The location of the sensor must be adjustable so that it can be placed 
behind the appropriate region of the breast in order to obtain proper image density. AEC devices must 
be calibrated so that constant image optical density results are independent of variations in breast 
attenuation, kilovoltage setting, or field size. With modern equipment, automatic exposure control is 
generally microprocessor-based so that relatively sophisticated corrections can be made during the expo- 
sure for the above effects and for reciprocity law failure of the film. 

Automatic Kilovoltage Control 

Many modern mammography units also incorporate automatic control of the kilovoltage or target/fil- 
ter/kilovoltage combination. Penetration through the breast depends on both breast thickness and com- 
position. For a breast that is dense, it is possible that a very long exposure time would be required to 
achieve adequate film blackening. This results in high dose to the breast and possibly blur due to 
anatomical motion. It is possible to sense the compressed breast thickness and the transmitted exposure 
rate and to employ an algorithm to automatically choose the x-ray target and/or beam filter as well as 
the kilovoltage. 

Quality Control 

Mammography is one of the most technically demanding radiographic procedures, and in order to obtain 
optimal results, all components of the system must be operating properly. Recognizing this, the American 
College of Radiology implemented and administers a Mammography Accreditation Program [ McClelland, 
1991], which evaluates both technical and personnel-related factors in facilities applying for accreditation. 
In order to verify proper operation, a rigorous quality control program should be in effect. In fact, 
the U.S. Mammography Quality Standards Act stipulates that a quality control program must be in place 
in all facilities performing mammography. A program of tests (summarized in Table 61.1) and methods 
for performing them are contained in the quality control manuals for mammography published by the 
American College of Radiology [Hendrick, 1999]. 

Stereotactic Biopsy Devices 

Stereoscopic x-ray imaging techniques are currently used for the guidance of needle "core" biopsies. These 
procedures can be used to investigate suspicious mammographic or clinical findings without the need 
for surgical excisional biopsies, resulting in reduced patient risk, discomfort, and cost. In these stereotactic 
biopsies, the gantry of a mammography machine is modified to allow angulated views of the breast 
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TABLE 61.1 Mammographic Quality Control Minimum Test Frequencies 

Test Performed By: Minimum Frequency 

Darkroom cleanliness Radiologic technologist Daily 

Processor quality control Daily 

Screen cleanliness Weekly 

Viewboxes and viewing conditions Weekly 

Phantom images Weekly 

Visual check list Monthly 

Repeat analysis Quarterly 

Analysis of fixer retention in film Quarterly 

Darkroom fog Semi-annually 

Screen-film contact Semi-annually 

Compression Semi-annually 

Mammographic unit assembly evaluation Medical physicist Annually 

Collimation assessment Annually 

Focal spot size performance Annually 

kVp Accuracy/reproducibility Annually 

Beam quality assessment (half-value-layer) Annually 

Automatic exposure control (AEC) system performance assessment Annually 

Uniformity of screen speed Annually 

Breast entrance exposure and mean glandular dose Annually 

Image quality — Phantom evaluation Annually 

Artifact assessment Annually 

Radiation output rate Annually 

Viewbox luminance and room illuminance Annually 

Compression release mechanism Annually 

Source: Hendrick, R.E. et al., 1999. Mammography Quality Control Manuals (radiologist, radiologic technologist, 
medical physicist). American College of Radiology. Reston, VA. With permission. 

(typically ±15° from normal incidence) to be achieved. From measurements obtained from these images, 
the three-dimensional location of a suspicious lesion is determined and a needle equipped with a spring- 
loaded cutting device can be accurately placed in the breast to obtain tissue samples. While this procedure 
can be performed on an upright mammography unit, special dedicated systems have recently been 
introduced to allow its performance with the patient lying prone on a table. The accuracy of sampling 
the appropriate tissue depends critically on the alignment of the system components and the quality of 
the images produced. A thorough review of stereotactic imaging, including recommended quality control 
procedures is given by Hendrick and Parker [Hendrick, 1994]. 

Digital Mammography 

There are several technical factors associated with screen-film mammography which limit the ability to 
display the finest or most subtle details, and produce images with the most efficient use of radiation dose 
to the patient. In screen-film mammography, the film must act as an image acquisition detector as well 
as a storage and display device. Because of its sigmoidal shape, the range of x-ray exposures over which 
the film display gradient is significant, i.e., the image latitude, is limited. If a tumor is located in either 
a relatively lucent or more opaque region of the breast, then the contrast displayed to the radiologist may 
be inadequate because of the limited gradient of the film. This is particularly a concern in patients whose 
breasts contain large amounts of fibroglandular tissue, the so-called dense breast. 

Another limitation of film mammography is the effect of structural noise due to the granularity of 
the film emulsion used to record the image. This impairs the detectibility of microcalcifications and other 
fine structures within the breast. While Poisson quantum noise is unavoidable, it should be possible to 
virtually eliminate structural noise by technical improvements. Existing screen-film mammography also 
suffers because of the inefficiency of grids in removing the effects of scattered radiation and with 
compromises in spatial resolution vs. quantum efficiency inherent in the screen-film image receptor. 



© 2000 by CRC Press LLC 



DIGITAL MAMMOGRAPHY 




Breast 



X-ray 
Source 




Electronic 
Detector and 
Digitizer 




Display 
Device 




Acquisition 
Computer 



FIGURE 61.20 Schematic representation of a digital mammography system. 

Many of the limitations of conventional mammography can be effectively overcome with a digital 
mammography imaging system (Fig. 61.20), in which image acquisition, display, and storage are per- 
formed independently, allowing optimization of each. For example, acquisition can be performed with 
low noise, highly linear x-ray detectors, while since the image is stored digitally, it can be displayed with 
contrast independent of the detector properties and defined by the needs of the radiologist. Whatever 
image processing techniques are found useful, ranging from simple contrast enhancement to histogram 
modification and spatial frequency filtering, could conveniently be applied. 

The challenges in creating a digital mammography system with improved performance are mainly 
related to the x-ray detector and the display device. There is active development of high resolution display 
monitors and hard copy devices to meet the demanding requirements (number of pixels, luminance, 
speed, multi-image capacity) of displaying digital mammography images, and suitable systems for this 
purpose should be available in the near future. The detector should have the following characteristics: 

1. Efficient absorption of the incident radiation beam. 

2. Linear response over a wide range of incident radiation intensity. 

3. Low intrinsic noise. 

4. Spatial resolution on the order of 10 cycles/mm (50 um sampling). 

5. Can accommodate at least an 18 x 24 cm and preferably a 24 x 30 cm field size. 

6. Acceptable imaging time and heat loading of the x-ray tube. 

Two main approaches have been taken in detector development — area detectors and slot detectors. In 
the former, the entire image is acquired simultaneously, while in the latter only a portion of the image 
is acquired at one time and the full image is obtained by scanning the x-ray beam and detector across 
the breast. Area detectors offer convenient fast image acquisition and could be used with conventional 
x-ray machines, but may still require a grid, while slot systems are slower and require a scanning x-ray 
beam, but use relatively simple detectors and have excellent intrinsic efficiency at scatter rejection. 
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FIGURE 61.21 (a) Small-format detector system for biopsy imaging, (b) full-breast detector incorporating 12 
detector modules, (c) slot detector for a full-breast scanning digital mammography system. 

At the time of writing, small-format (5x5 cm) digital systems for guidance of stereotactic breast 
biopsy procedures are in widespread use. These use a lens or a fiberoptic taper to couple a phosphor to 
a CCD whose format is approximately square and typically provides 1 x IK images with 50 um pixels 
(Fig. 61.21a). Adjustment of display contrast enhances the localization of the lesion while the immediate 
display of images (no film processing is required) greatly accelerates the clinical procedure. 

Four designs of full breast digital mammography systems are undergoing clinical evaluation. Various 
detector technologies are being developed and evaluated for use in digital mammography. In three of the 
systems, x-rays are absorbed by a cesium iodide (Csl) phosphor layer and produce light. In one system, the 
phosphor is deposited directly on a matrix of about 2000 2 photodiodes with thin film transistor switches 
fabricated on a large area amorphous silicon plate. The electronic signal is read out on a series of data lines 
as the switches in each row of the array are activated. In another system, the light is coupled through 
demagnifying fiberoptic tapers to a CCD readout. A mosaic of 3 x 4 detector modules is formed (Fig. 61.21b) 
to obtain a detector large enough to cover the breast [Cheung, 1998]. A third system also uses fiberoptic 
coupling of Csl to CCDs; however, the detector is in a slot configuration and is scanned beneath the breast 
in synchrony with a fan beam of x-rays to acquire the transmitted signal (Fig. 61.21c). The fourth system 
employs a plate formed of a photostimulable phosphor material. When exposed to x-rays, traps in the 
phosphor are filled with electrons, the number being related to x-ray intensity. The plate is placed in a 
reader device and scanned with a red HeNe laser beam which stimulates the traps to release the electrons. 
The transition of these electrons through energy levels in the phosphor crystal result in the formation of 
blue light, which is measured as a function of the laser position on the plate to form the image signal. 

Other materials in which x-ray energy is directly converted to charge are under development for digital 
mammography. These materials include lead iodide, amorphous selenium, zinc cadmium telluride, and 
thallium bromide. A review of the current status of digital mammography is given in [Yaffe, 1994; Pisano, 
1998] and of detectors for digital x-ray imaging in [Yaffe, 1997]. 
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Summary 

Mammography is a technically demanding imaging procedure which can help reduce mortality from 
breast cancer. To be successful at this purpose, both the technology and technique used for imaging must 
be optimized. This requires careful system design and attention to quality control procedures. Imaging 
systems for mammography are still evolving and, in the future, are likely to make greater use of digital 
acquisition and display methods. 

Defining Terms 

Conversion efficiency: The efficiency of converting the energy from x-rays absorbed in a phosphor 
material into that of emitted light quanta. 

Fibroglandular tissue: A mixture of tissues within the breast composed of the functional glandular 
tissue and the fibrous supporting structures. 

Focal spot: The area of the anode of an x-ray tube from which the useful beam of x-rays is emitted. 
Also known as the target. 

Grid: A device consisting of evenly spaced lead strips which functions like a Venetian blind in prefer- 
entially allowing x-rays traveling directly from the focal spot without interaction in the patient to 
pass through, while those whose direction has been diverted by scattering in the patient strike the 
slats of the grid and are rejected. Grids improve the contrast of radiographic images at the price 
of increased dose to the patient. 

Image receptor: A device that records the distribution of x-rays to form an image. In mammography, 
the image receptor is generally composed of a light-tight cassette containing a fluorescent screen, 
which absorbs x-rays and produces light, coupled to a sheet of photographic film. 

Quantum efficiency: The fraction of incident x-rays which interact with a detector or image receptor. 

Screening: Examination of asymptomatic individuals to detect disease. 

Survival: An epidemiological term giving the fraction of individuals diagnosed with a given disease 
alive at a specified time after diagnosis, e.g., "10-year survival". 
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62.1 Instrumentation 

Ian A. Cunningham 

The development of computed tomography (CT) in the early 1970s revolutionized medical radiology. 
For the first time, physicians were able to obtain high-quality tomographic (cross-sectional) images of 
internal structures of the body. Over the next 10 years, 18 manufacturers competed for the exploding 
world CT market. Technical sophistication increased dramatically, and even today, CT continues to 
mature, with new capabilities being researched and developed. 

Computed tomographic images are reconstructed from a large number of measurements of x-ray 
transmission through the patient (called projection data). The resulting images are tomographic "maps" 
of the x-ray linear attenuation coefficient. The mathematical methods used to reconstruct CT images 
from projection data are discussed in the next section. In this section, the hardware and instrumentation 
in a modern scanner are described. 

The first practical CT instrument was developed in 1971 by DR. G. N. Hounsfield in England and was 
used to image the brain [Hounsfield, 1980]. The projection data were acquired in approximately 
5 minutes, and the tomographic image was reconstructed in approximately 20 minutes. Since then, CT 
technology has developed dramatically, and CT has become a standard imaging procedure for virtually 
all parts of the body in thousands of facilities throughout the world. Projection data are typically acquired 
in approximately 1 second, and the image is reconstructed in 3 to 5 seconds. One special-purpose scanner 
described below acquires the projection data for one tomographic image in 50 ms. A typical modern CT 
scanner is shown in Fig. 62.1, and typical CT images are shown in Fig. 62.2. 

The fundamental task of CT systems is to make an extremely large number (approximately 500,000) 
of highly accurate measurements of x-ray transmission through the patient in a precisely controlled 
geometry. A basic system generally consists of a gantry, a patient table, a control console, and a computer. 
The gantry contains the x-ray source, x-ray detectors, and the data-acquisition system (DAS). 

Data-Acquisition Geometries 

Projection data may be acquired in one of several possible geometries described below, based on the 
scanning configuration, scanning motions, and detector arrangement. The evolution of these geometries 
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FIGURE 62. 1 Schematic drawing of a typical CT scanner installation, consisting of ( 1 ) control console, (2 ) gantry 
stand, (3) patient table, (4) head holder, and (5) laser imager. (Courtesy of Picker International, Inc.) 

is descried in terms of "generations," as illustrated in Fig. 62.3, and reflects the historical development 
[Newton and Potts, 1981; Seeram, 1994]. Current CT scanners use either third-, fourth-, or fifth- 
generation geometries, each having their own pros and cons. 

First Generation: Parallel-Beam Geometry 

Parallel-beam geometry is the simplest technically and the easiest with which to understand the important 
CT principles. Multiple measurements of x-ray transmission are obtained using a single highly collimated 
x-ray pencil beam and detector. The beam is translated in a linear motion across the patient to obtain a 
projection profile. The source and detector are then rotated about the patient isocenter by approximately 
1 degree, and another projection profile is obtained. This translate-rotate scanning motion is repeated 
until the source and detector have been rotated by 180 degrees. The highly collimated beam provides 
excellent rejection of radiation scattered in the patient; however, the complex scanning motion results 
in long (approximately 5-minute) scan times. This geometry was used by Hounsfield in his original 
experiments [Hounsfield, 1980] but is not used in modern scanners. 

Second Generation: Fan Beam, Multiple Detectors 

Scan times were reduced to approximately 30 s with the use of a fan beam of x-rays and a linear detector 
array. A translate-rotate scanning motion was still employed; however, a larger rotate increment could 
be used, which resulted in shorter scan times. The reconstruction algorithms are slightly more compli- 
cated than those for first-generation algorithms because they must handle fan-beam projection data. 

Third Generation: Fan Beam, Rotating Detectors 

Third-generation scanners were introduced in 1976. A fan beam of x-rays is rotated 360 degrees around 
the isocenter. No translation motion is used; however, the fan beam must be wide enough to completely 
contain the patient. A curved detector array consisting of several hundred independent detectors is 
mechanically coupled to the x-ray source, and both rotate together. As a result, these rotate-only motions 
acquire projection data for a single image in as little as 1 s. Third-generation designs have the advantage 
that thin tungsten septa can be placed between each detector in the array and focused on the x-ray source 
to reject scattered radiation. 
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FIGURE 62.2 Typical CT images of (a) brain, (b) head showing orbits, (c) chest showing lungs, and (d) abdomen. 

Fourth Generation: Fan Beam, Fixed Detectors 

In a fourth-generation scanner, the x-ray source and fan beam rotate about the isocenter, while the 
detector array remains stationary. The detector array consists of 600 to 4800 (depending on the manu- 
facturer) independent detectors in a circle that completely surrounds the patient. Scan times are similar 
to those of third-generation scanners. The detectors are no longer coupled to the x-ray source and hence 
cannot make use of focused septa to reject scattered radiation. However, detectors are calibrated twice 
during each rotation of the x-ray source, providing a self-calibrating system. Third-generation systems 
are calibrated only once every few hours. 

Two detector geometries are currently used for fourth-generation systems: (1) a rotating x-ray source 
inside a fixed detector array and (2) a rotating x-ray source outside a nutating detector array. Figure 62.4 
shows the major components in the gantry of a typical fourth-generation system using a fixed-detector 
array. Both third- and fourth-generation systems are commercially available, and both have been highly 
successful clinically. Neither can be considered an overall superior design. 

Fifth Generation: Scanning Electron Beam 

Fifth-generation scanners are unique in that the x-ray source becomes an integral part of the system 
design. The detector array remains stationary, while a high-energy electron beams is electronically swept 
along a semicircular tungsten strip anode, as illustrated in Fig. 62.5. X-rays are produced at the point 
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FIGURE 62.3 Four generations of CT scanners illustrating the parallel- and fan-beam geometries [Robb, 1982]. 

where the electron beam hits the anode, resulting in a source of x-rays that rotates about the patient with 
no moving parts [Boyd et al., 1979]. Projection data can be acquired in approximately 50 ms, which is 
fast enough to image the beating heart without significant motion artifacts [Boyd and Lipton, 1983]. 

An alternative fifth-generation design, called the dynamic spatial reconstructor (DSR) scanner, is in use 
at the Mayo Clinic [Ritman, 1980, 1990]. This machine is a research prototype and is not available 
commercially. It consists of 14 x-ray tubes, scintillation screens, and video cameras. Volume CT images 
can be produced in as little as 10 ms. 

Spiral/Helical Scanning 

The requirement for faster scan times, and in particular for fast multiple scans for three-dimensional 
imaging, has resulted in the development of spiral (helical) scanning systems [Kalendar et al., 1990]. Both 
third- and fourth-generation systems achieve this using self-lubricating slip-ring technology (Fig. 62.6) to 
make the electrical connections with rotating components. This removes the need for power and signal 
cables which would otherwise have to be rewound between scans and allows for a continuous rotating 
motion of the x-ray fan beam. Multiple images are acquired while the patient is translated through the 
gantry in a smooth continuous motion rather than stopping for each image. Projection data for multiple 
images covering a volume of the patient can be acquired in a single breath hold at rates of approximately 
one slice per second. The reconstruction algorithms are more sophisticated because they must accommo- 
date the spiral or helical path traced by the x-ray source around the patient, as illustrated in Fig. 62.7. 

X-Ray System 

The x-ray system consists of the x-ray source, detectors, and a data-acquisition system. 

X-Ray Source 

With the exception of one fifth-generation system described above, all CT scanners use bremsstrahlung 
x-ray tubes as the source of radiation. These tubes are typical of those used in diagnostic imaging and 
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FIGURE 62.4 The major internal components of a fourth-generation CT gantry are shown in a photograph with 
the gantry cover removed (upper) and identified in the line drawing (lower), (Courtesy of Picker International, Inc.) 



produce x-rays by accelerating a beam of electrons onto a target anode. The anode area from which 
x-rays are emitted, projected along the direction of the beam, is called the focal spot. Most systems have 
two possible focal spot sizes, approximately 0.5 x 1.5 mm and 1.0 x 2.5 mm. A collimator assembly is 
used to control the width of the fan beam between 1.0 and 10 mm, which in turn controls the width of 
the imaged slice. 

The power requirements of these tubes are typically 120 kV at 200 to 500 mA, producing x-rays with 
an energy spectrum ranging between approximately 30 and 120 keV. All modern systems use high- 
frequency generators, typically operating between 5 and 50 kHz [Brunnett et al., 1990]. Some spiral 
systems use a stationary generator in the gantry, requiring high-voltage (120-kV) slip rings, while others 
use a rotating generator with lower-voltage (480-V) slip rings. Production of x-rays in bremsstrahlung 
tubes is an inefficient process, and hence most of the power delivered to the tubes results in heating of 
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FIGURE 62.5 Schematic illustration of a fifth-generation ultrafast CT system. Image data are acquired in as little 
as 50 ms, as an electron beam is swept over the strip anode electronically. (Courtesy of Imatron, Inc.) 

the anode. A heat exchanger on the rotating gantry is used to cool the tube. Spiral scanning, in particular, 
places heavy demands on the heat-storage capacity and cooling rate of the x-ray tube. 

The intensity of the x-ray beam is attenuated by absorption and scattering processes as it passes through 
the patient. The degree of attenuation depends on the energy spectrum of the x-rays as well as on the 
average atomic number and mass density of the patient tissues. The transmitted intensity is given by 



..i e -M* 



(62.1) 



where I and I t are the incident and transmitted beam intensities, respectively; L is the length of the x-ray 
path; and (j,(x) is the x-ray linear attenuation coefficient, which varies with tissue type and hence is a 
function of the distance x through the patient. The integral of the attenuation coefficient is therefore 
given by 



J o V(x)^ = -iln(7 ( //J 



(62.2) 



The reconstruction algorithm requires measurements of this integral along many paths in the fan beam 
at each of many angles about the isocenter. The value of L is known, and I is determined by a system 
calibration. Hence values of the integral along each path can be determined from measurements of /,. 

X-Ray Detectors 

X-ray detectors used in CT systems must (a) have a high overall efficiency to minimize the patient 
radiation dose, have a large dynamic range, (b) be very stable with time, and (c) be insensitive to 
temperature variations within the gantry. Three important factors contributing to the detector efficiency 
are geometric efficiency, quantum (also called capture) efficiency, and conversion efficiency [Villafanaet 
et al., 1987]. Geometric efficiency refers to the area of the detectors sensitive to radiation as a fraction of 
the total exposed area. Thin septa between detector elements to remove scattered radiation, or other 
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FIGURE 62.6 Photograph of the slip rings used to pass power and control signals to the rotating gantry. (Courtesy 
of Picker International, Inc.) 

insensitive regions, will degrade this value. Quantum efficiency refers to the fraction of incident x-rays 
on the detector that are absorbed and contribute to the measured signal. Conversion efficiency refers to 
the ability to accurately convert the absorbed x-ray signal into an electrical signal (but is not the same 
as the energy conversion efficiency). Overall efficiency is the product of the three, and it generally lies 
between 0.45 and 0.85. A value of less than 1 indicates a nonideal detector system and results in a required 
increase in patient radiation dose if image quality is to be maintained. The term dose efficiency sometimes 
has been used to indicate overall efficiency. 

Modern commercial systems use one of two detector types: solid-state or gas ionization detectors. 

Solid-State Detectors. Solid-state detectors consist of an array of scintillating crystals and photodiodes, 
as illustrated in Fig. 62.8. The scintillators generally are either cadmium tungstate (CdW0 4 ) or a ceramic 
material made of rare earth oxides, although previous scanners have used bismuth germanate crystals 
with photomultiplier tubes. Solid-state detectors generally have very high quantum and conversion 
efficiencies and a large dynamic range. 

Gas Ionization Detectors. Gas ionization detectors, as illustrated in Fig. 62.9, consist of an array of 
chambers containing compressed gas (usually xenon at up to 30 atm pressure). A high voltage is applied 
to tungsten septa between chambers to collect ions produced by the radiation. These detectors have 
excellent stability and a large dynamic range; however, they generally have a lower quantum efficiency 
than solid-state detectors. 
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FIGURE 62.7 Spiral scanning causes the focal spot to follow a spiral path around the patient as indicated. (Courtesy 
of Picker International, Inc.) 
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FIGURE 62.8 (a) A solid-state detector consists of a scintillating crystal and photodiode combination, (b) Many 
such detectors are placed side by side to form a detector array that may contain up to 4800 detectors. 



Data-Acquisition System 

The transmitted fraction IJI in Eq. (62.2) through an obese patient can be less than 10 -4 . Thus it is the 
task of the data-acquisition system (DAS) to accurately measure /, over a dynamic range of more than 
10 4 , encode the results into digital values, and transmit the values to the system computer for reconstruc- 
tion. Some manufacturers use the approach illustrated in Fig. 62.10, consisting of precision preamplifiers, 
current-to-voltage converters, analog integrators, multiplexers, and analog-to-digital converters. Alter- 
natively, some manufacturers use the preamplifier to control a synchronous voltage-to-frequency con- 
verter (SVFC), replacing the need for the integrators, multiplexers, and analog-to-digital converters 
[Brunnett, et al., 1990]. The logarithmic conversion required in Eq. (62.2) is performed with either an 
analog logarithmic amplifier or a digital lookup table, depending on the manufacturer. 
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FIGURE 62.9 Gas ionization detector arrays consist of high-pressure gas in multiple chambers separated by thin 
septa. A voltage is applied between alternating septa. The septa also act as electrodes and collect the ions created by 
the radiation, converting them into an electrical signal. 



Detector 
element 



Current- 
voltage 1 — | Integrator 
transducer 




to computer 

■o 



Sustained data transfer rates to the computer are as 
high as 10 Mbytes/s for some scanners. This can be 
accomplished with a direct connection for systems 
having a fixed detector array. However, third-genera- 
tion slip-ring systems must use more sophisticated 
techniques. At least one manufacturer uses optical 
transmitters on the rotating gantry to send data to 
fixed optical receivers [Siemens, 1989]. 

Computer System 

Various computer systems are used by manufacturers 
to control system hardware, acquire the projection 
data, and reconstruct, display, and manipulate the 

tomographic images. A typical system is illustrated in Fig. 62.11, which uses 12 independent processors 
connected by a 40-Mbyte/s multibus. Multiple custom array processors are used to achieve a combined 
computational speed of 200 MFLOPS (million floating-point operations per second) and a reconstruction 
time of approximately 5 s to produce an image on a 1024 x 1024 pixel display. A simplified UNIX 
operating system is used to provide a multitasking, multiuser environment to coordinate tasks. 



FIGURE 62.10 The data-acquisition system con- 
verts the electrical signal produced by each detector 
to a digital value for the computer. 



Patient Dose Considerations 

The patient dose resulting from CT examinations is generally specified in terms of the CT dose index 
(CTDI) [Felmlee et al., 1989; Rothenberg and Pentlow, 1992], which includes the dose contribution from 
radiation scattered from nearby slices. A summary of CTDI values, as specified by four manufacturers, 
is given in Table 62.1. 

Summary 

Computed tomography revolutionized medical radiology in the early 1 970s. Since that time, CT technology 
has developed dramatically, taking advantage of developments in computer hardware and detector tech- 
nology. Modern systems acquire the projection data required for one tomographic image in approximately 
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FIGURE 62.11 The computer system controls the gantry motions, acquires the x-ray transmission measurements, 
and reconstructs the final image. The system shown here uses 12 68000-family CPUs. (Courtesy of Picker Interna- 
tional, Inc.) 



TABLE 62.1 Summary of the CT Dose Index (CTDI) Values at Two Positions (Center 
of the Patient and Near the Skin) as Specified by Four CT Manufacturers for Standard 
Head and Body Scans. 













CTDI, center 


CTDI, skin 


Manufacturer 


Detector 


kVp 


mA 


Scan Time (s) 


(mGy) 


(mGy) 


A, head 


Xenon 


120 


170 


2 


50 


48 


A, body 


Xenon 


120 


170 


2 


14 


25 


A, head 


Solid state 


120 


170 


2 


40 


40 


A, body 


Solid state 


120 


170 


2 


11 


20 


B, head 


Solid state 


130 


80 


2 


37 


41 


B, body 


Solid state 


130 


80 


2 


15 


34 


C, head 


Solid state 


120 


500 


2 


39 


50 


C, body 


Solid state 


120 


290 


1 


12 


28 


D, head 


Solid state 


120 


200 


2 


78 


78 


D, body 


Solid state 


120 


200 


2 


9 


16 



1 s and present the reconstructed image on a 1024 x 1024 matrix display within a few seconds. The 
images are high-quality tomographic "maps" of the x-ray linear attenuation coefficient of the patient 
tissues. 

Defining Terms 

Absorption: Some of the incident x-ray energy is absorbed in patient tissues and hence does not 
contribute to the transmitted beam. 

Anode: A tungsten bombarded by a beam of electrons to produce x-rays. In all but one fifth-generation 
system, the anode rotates to distribute the resulting heat around the perimeter. The anode heat- 
storage capacity and maximum cooling rate often limit the maximum scanning rates of CT systems. 
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Attenuation: The total decrease in the intensity of the primary x-ray beam as it passes through the 

patient, resulting from both scatter and absorption processes. It is characterized by the linear 

attenuation coefficient. 
Computed tomography (CT): A computerized method of producing x-ray tomographic images. Pre- 
vious names for the same thing include computerized tomographic imaging, computerized axial 

tomography (CAT), computer-assisted tomography (CAT), and reconstructive tomography (RT). 
Control console: The control console is used by the CT operator to control the scanning operations, 

image reconstruction, and image display. 
Cormack, Dr. Allan MacLeod: A physicist who developed mathematical techniques required in the 

reconstruction of tomographic images. Dr. Cormack shared the Nobel Prize in Medicine and 

Physiology with Dr. G. N. Hounsfield in 1979 [Cormack, 1980]. 
Data-acquisition system (DAS): Interfaces the x-ray detectors to the system computer and may consist 

of a preamplifier, integrator, multiplexer, logarithmic amplifier, and analog-to-digital converter. 
Detector array: An array of individual detector elements. The number of detector elements varies 

between a few hundred and 4800, depending on the acquisition geometry and manufacturer. Each 

detector element functions independently of the others. 
Fan beam: The x-ray beam is generated at the focal spot and so diverges as it passes through the patient 

to the detector array. The thickness of the beam is generally selectable between 1.0 and 10 mm 

and defines the slice thickness. 
Focal spot: The region of the anode where x-rays are generated. 
Focused septa: Thin metal plates between detector elements which are aligned with the focal spot so 

that the primary beam passes unattenuated to the detector elements, while scattered x-rays which 

normally travel in an altered direction are blocked. 
Gantry: The largest component of the CT installation, containing the x-ray tube, collimators, detector 

array, DAS, other control electronics, and the mechanical components required for the scanning 

motions. 
Helical scanning: The scanning motions in which the x-ray tube rotates continuously around the 

patient while the patient is continuously translated through the fan beam. The focal spot therefore 

traces a helix around the patient. Projection data are obtained which allow the reconstruction of 

multiple contiguous images. This operation is sometimes called spiral, volume, or three-dimensional 

CT scanning. 
Hounsfield, Dr. Godfrey Newbold: An engineer who developed the first practical CT instrument in 

1971. Dr. Hounsfield received the McRobert Award in 1972 and shared the Nobel Prize in Medicine 

and Physiology with Dr. A. M. Cormack in 1979 for this invention [Hounsfield, 1980]. 
Image plane: The plane through the patient that is imaged. In practice, this plane (also called a slice) 

has a selectable thickness between 1.0 and 10 mm centered on the image plane. 
Pencil beam: A narrow, well-collimated beam of x-rays. 

Projection data: The set of transmission measurements used to reconstruct the image. 
Reconstruct: The mathematical operation of generating the tomographic image from the projection 

data. 
Scan time: The time required to acquire the projection data for one image, typically 1.0 s. 
Scattered radiation: Radiation that is removed from the primary beam by a scattering process. This 

radiation is not absorbed but continues along a path in an altered direction. 
Slice: See Image plane. 
Spiral scanning: See Helical scanning. 
Three-dimensional imaging: See Helical scanning. 
Tomography: A technique of imaging a cross-sectional slice. 
Volume CT: See Helical scanning. 
X-ray detector: A device that absorbs radiation and converts some or all of the absorbed energy into 

a small electrical signal. 
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X-ray linear attenuation coefficient u: Expresses the relative rate of attenuation of a radiation beam 
as it passes through a material. The value of u depends on the density and atomic number of the 
material and on the x-ray energy. The units of |i are cm -1 . 

X-ray source: The device that generates the x-ray beam. All CT scanners are rotating-anode 
bremsstrahlung x-ray tubes except one-fifth generation system, which uses a unique scanned 
electron beam and a strip anode. 

X-ray transmission: The fraction of the x-ray beam intensity that is transmitted through the patient 
without being scattered or absorbed. It is equal to IJI in Eq. (62.2), can be determined by mea- 
suring the beam intensity both with (I t ) and without (7 ) the patient present, and is expressed as 
a fraction. As a rule of thumb, n 2 independent transmission measurements are required to recon- 
struct an image with an n x n sized pixel matrix. 
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Further Information 

A recent summary of CT instrumentation and concepts is given by E. Seeram in Computed Tomography: 
Physical Principles, Clinical Applications and Quality Control. The author summarizes CT from the 
perspective of the nonmedical, nonspecialist user. A summary of average CT patient doses is described 
by Rothenberg and Pentlow [1992] in Radiation Dose in CT. Research papers on both fundamental and 
practical aspects of CT physics and instrumentation are published in numerous journals, including 
Medical Physics, Physics in Medicine and Biology, Journal of Computer Assisted Tomography, Radiology, 
British Journal of Radiology, and the IEEE Press. A comparison of technical specifications of CT systems 
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provided by the manufacturers is available from ECRI to help orient the new purchaser in a selection 
process. Their Product Comparison System includes a table of basic specifications for all the major 
international manufactures. 



62.2 Reconstruction Principles 



Philip F. Judy 

Computed tomography (CT) is a two-step process: (1) the transmission of an x-ray beam is measured 
through all possible straight-line paths as in a plane of an object, and (2) the attenuation of an x-ray 
beam is estimated at points in the object. Initially, the transmission measurements will be assumed to 
be the results of an experiment performed with a narrow monoenergetic beam of x-rays that are confined 
to a plane. The designs of devices that attempt to realize these measurements are described in the 
preceding section. One formal consequence of these assumptions is that the logarithmic transformation 
of the measured x-ray intensity is proportional to the line integral of attenuation coefficients. In order 
to satisfy this assumption, computer processing procedures on the measurements of x-ray intensity are 
necessary even before image reconstruction is performed. These linearization procedures will reviewed 
after background. 

Both analytical and iterative estimations of linear x-ray attenuation have been used for transmission 
CT reconstruction. Iterative procedures are of historic interest because an iterative reconstruction pro- 
cedure was used in the first commercially successful CT scanner [EMI, Mark I, Hounsfield, 1973]. They 
also permit easy incorporation of physical processes that cause deviations from the linearity. Their 
practical usefulness is limited. The first EMI scanner required 20 minutes to finish its reconstruction. 
Using the identical hardware and employing an analytical calculation, the estimation of attenuation values 
was performed during the 4.5-minute data acquisition and was made on a 160 x 160 matrix. The original 
iterative procedure reconstructed the attenuation values on an 80 x 80 matrix and consequently failed 
to exploit all the spatial information inherent in transmission data. 

Analytical estimation, or direct reconstruction, uses a numerical approximation of the inverse Radon 
transform [Radon, 1917]. The direct reconstruction technique (convolution-backprojection) presently 
used in x-ray CT was initially applied in other areas such as radio astronomy [Bracewell and Riddle, 
1967] and electron microscopy [Crowther et al., 1970; Ramachandran and Lakshminarayana, 1971]. 
These investigations demonstrated that the reconstructions from the discrete spatial sampling of band- 
limited data led to full recovery of the cross-sectional attenuation. The random variation (noise) in x-ray 
transmission measurements may not be bandlimited. Subsequent investigators [e.g., Chesler and Riederer, 
1975; Herman and Roland, 1973; Shepp and Logan, 1974] have suggested various bandlimiting windows 
that reduce the propagation and amplification of noise by the reconstruction. These issues have been 
investigated by simulation, and investigators continue to pursue these issues using a computer phantom 
[e.g., Guedon and Bizais, 1994, and references therein] described by Shepp and Logan. The subsequent 
investigations of the details of choice of reconstruction parameters has had limited practical impact 
because real variation of transmission data is bandlimited by the finite size of the focal spot and radiation 
detector, a straightforward design question, and because random variation of the transmission tends to 
be uncorrected. Consequently, the classic precedures suffice. 

Image Processing: Artifact and Reconstruction Error 

An artifact is a reconstruction defect that is obviously visible in the image. The classification of an image 
feature as an artifact involves some visual criterion. The effect must produce an image feature that is 
greater than the random variation in image caused by the intrinsic variation in transmission measure- 
ments. An artifact not recognized by the physician observer as an artifact may be reported as a lesion. 
Such false-positive reports could lead to an unnecessary medical procedure, e.g., surgery to remove an 
imaginary tumor. A reconstruction error is a deviation of the reconstruction value from its expected value. 
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Reconstruction errors are significant if the application involves a quantitative measurement, not a com- 
mon medical application. The reconstruction errors are characterized by identical material at different 
points in the object leading to different reconstructed attenuation values in the image which are not 
visible in the medical image. 

Investigators have used computer simulation to investigate artifact [Herman, 1980] because image 
noise limits the visibility of their visibility. One important issue investigated was required spatial sampling 
of transmission slice plane [Crawford and Kak, 1979; Parker et al., 1982]. These simulations provided a 
useful guideline in design. In practice, these aliasing artifacts are overwhelmed by random noise, and 
designers tend to oversample in the slice plane. A second issue that was understood by computer 
simulation was the partial volume artifact [Glover and Pelc, 1980]. This artifact would occur even for 
mononergetic beams and finite beam size, particularly in the axial dimension. The axial dimension of 
the beams tend to be greater (about 10 mm) than their dimensions in the slice plane (about 1 mm). The 
artifact is created when the variation of transmission within the beam varies considerably, and the 
exponential variation within the beam is summed by the radiation detector. The logarithm transforma- 
tion of the detected signal produces a nonlinear effect that is propagated throughout the image by the 
reconstruction process. Simulation was useful in demonstrating that isolated features in the same cross- 
section act together to produce streak artifacts. Simulations have been useful to illustrate the effects of 
patient motion during the data-acquisition streaks off high-contrast objects. 

Projection Data to Image: Calibrations 

Processing of transmission data is necessary to obtain high-quality images. In general, optimization of 
the projection data will optimize the reconstructed image. Reconstruction is a process that removes the 
spatial correlation of attenuation effects in the transmitted image by taking advantage of completely 
sampling the possible transmissions. Two distinct calibrations are required: registration of beams with 
the reconstruction matrix and linearization of the measured signal. 

Without loss of generalization, a projection will be considered a set of transmissions made along 
parallel lines in the slice plane of the CT scanner. Without loss of generalization means that essential 
aspects of all calibration and reconstruction procedures required for fan-beam geometries are captured 
by the calibration and reconstruction procedures described for parallel projections. One line of each 
projection is assumed to pass through the center of rotation of data collection. Shepp et al. [1979] showed 
that errors in the assignment of that center-of- rotation point in the projections could lead to considerable 
distinctive artifacts and that small errors (0.05 mm) would produce these effects. The consequences of 
these errors have been generalized to fan-beam collection schemes, and images reconstructed from 180- 
degree projection sets were compared with images reconstructed from 360-degree data sets [Kijewski 
and Judy, 1983]. A simple misregistration of the center of rotation was found to produce blurring of 
image without the artifact. These differences may explain the empirical observation that most commercial 
CT scanners collect a full 360-degree data set even though 180 degrees of data will suffice. 

The data-acquisition scheme that was designed to overcome the limited sampling inherent in third- 
generation fan-beam systems by shifting detectors a quarter sampling distance while opposite 180-degree 
projection is measured, has particularly stringent registration requirements. Also, the fourth-generation 
scanner does not link the motion of the x-ray tube and the detector; consequently, the center of rotation 
is determined as part of a calibration procedure, and unsystematic effects lead to artifacts that mimic 
noise besides blurring the image. 

Misregistration artifacts also can be mitigated by feathering. This procedure requires collection of 
redundant projection data at the end of the scan. A single data set is produced by linearly weighting the 
redundant data at the beginning and end of the data collection [Parker et al., 1982]. These procedures 
have be useful in reducing artifacts from gated data collections [Moore et al., 1987]. 

The other processing necessary before reconstruction of project data is linearization. The formal require- 
ment for reconstruction is that the line integrals of some variable be available; this is the variable that 
ultimately is reconstructed. The logarithm of x-ray transmission approximates this requirement. There 
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are physical effects in real x-ray transmissions that cause deviations from this assumption. X-ray beams 
of sufficient intensity are composed of photons of different energies. Some photons in the beam interact 
with objects and are scattered rather than absorbed. The spectrum of x-ray photons of different attenuation 
coefficients means the logarithm of the transmission measurement will not be proportional to the line 
integral of the attenuation coefficient along that path, because an attenuation coefficient cannot even be 
defined. An effective attenuation coefficient can only be defined uniquely for a spectrum for a small mass 
of material that alters that intensity. It has to be small enough not to alter the spectrum [McCullough, 1979] . 

A straightforward approach to this nonunique attenuation coefficient error, called hardening, is to 
assume that the energy dependence of the attenuation coefficient is constant and that differences in 
attenuation are related to a generalized density factor that multiplies the spectral dependence of attenu- 
ation. The transmission of an x-ray beam then can be estimated for a standard material, typically water, 
as a function of thickness. This assumption is that attenuations of materials in the object, the human 
body, differ because specific gravities of the materials differ. Direct measurements of the transmission of 
an actual x-ray beam may provide initial estimates that can be parameterized. The inverse of this function 
provides the projection variable that is reconstructed. The parameters of the function are usually modified 
as part of a calibration to make the CT image of a uniform water phantom flat. 

Such a calibration procedure does not deal completely with the hardening effects. The spectral depen- 
dence of bone differs considerably from that of water. This is particularly critical in imaging of the brain, 
which is contained within the skull. Without additional correction, the attenuation values of brain are 
lower in the center than near the skull. 

The detection of scattered energy means that the reconstructed attenuation coefficient will differ from 
the attenuation coefficient estimated with careful narrow-beam measurements. The x-rays appear more 
penetrating because scattered x-rays are detected. The zero-ordered scatter, a decrease in the attenuation 
coefficient by some constant amount, is dealt with automatically by the calibration that treats hardening. 
First-order scattering leads to a widening of the x-ray beam and can be dealt with by a modification of 
the reconstruction kernel. 

Projection Data to Image: Reconstruction 

The impact of CT created considerable interest in the formal aspects of reconstruction. There are many 
detailed descriptions of direct reconstruction procedures. Some are presented in textbooks used in 
graduate courses for medical imaging [Barrett and Swindell, 1981; Cho et al., 1993]. Herman (1980) 
published a textbook that was based on a two-semester course that dealt exclusively with reconstruction 
principles, demonstrating the reconstruction principles with simulation. 

The standard reconstruction method is called convolution-backprojection. The first step in the proce- 
dure is to convolve the projection, a set of transmissions made along parallel lines in the slice plane, with 
a reconstruction kernel derived from the inverse Radon transform. The choice of kernel is dictated by 
bandlimiting issues [Chesler and Riederer, 1975; Herman and Roland, 1973; Shepp and Logan, 1974]. It 
can be modified to deal with the physical aperture of the CT system [Bracewell, 1977], which might 
include the effects of scatter. The convolved projection is then backprojected onto a two-dimensional 
image matrix. Backprojection is the opposite of projection; the value of the projection is added to each 
point along the line of the projection. This procedure makes sense in the continuous description, but in 
the discrete world of the computer, the summation is done over the image matrix. 

Consider a point of the image matrix; very few, possibly no lines of the discrete projection data intersect 
the point. Consequently, to estimate the projection value to be added to that point, the procedure must 
interpolate between two values of sampled convolve projection. The linear interpolation scheme is a 
significant improvement over nearest project nearest to the point. More complex schemes get confounded 
with choices of reconstruction kernel, which are designed to accomplish standard image processing in 
the image, e.g., edge enhancement. 

Scanners have been developed to acquire a three-dimensional set of projection data [Kalender et al., 
1990]. The motion of the source defines a spiral motion relative to the patient. The spiral motion defines 
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an axis. Consequently, only one projection is available for reconstruction of the attenuation values in the 
plane. This is the back-projection problem just discussed; no correct projection value is available from 
the discrete projection data set. The solution is identical: a projection value is interpolated from the 
existing projection values to estimate the necessary projections for each plane to be reconstructed. This 
procedure has the advantage that overlapping slices can be reconstructed without additional exposure, 
and this eliminates the risk that a small lesion will be missed because it straddles adjacent slices. This 
data- collection scheme is possible because systems that continuously rotate have been developed. The 
spiral scan motion is realized by moving the patient through the gantry. Spiral CT scanners have made 
possible the acquisition of an entire data set in a single breath hold. 
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63.1 Acquisition and Processing 

Steven Conolly, Albert Macovski, 
and John Pauly 

Magnetic resonance imaging (MRI) is a clinically important med- 
ical imaging modality due to its exceptional soft-tissue contrast. 
MRI was invented in the early 1970s [1], The first commercial 
scanners appeared about 10 years later. Noninvasive MRI studies 
are now supplanting many conventional invasive procedures. A 
1990 study [2] found that the principal applications for MRI are 
examinations of the head (40%), spine (33%), bone and joints 
(17%), and the body (10%). The percentage of bone and joint 
studies was growing in 1990. 

Although typical imaging studies range from 1 to 10 minutes, 
new fast imaging techniques acquire images in less than 50 ms. 
MRI research involves fundamental tradeoffs between resolution, 
imaging time, and signal-to-noise ratio (SNR). It also depends 
heavily on both gradient and receiver coil hardware innovations. 

In this section we provide a brief synopsis of basic nuclear mag- 
netic resonance (NMR) physics. We then derive the k-space analysis 
of MRI, which interprets the received signal as a scan of the Fourier 
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transform of the image. This powerful formalism is used to analyze the most important imaging 
sequences. Finally, we discuss the fundamental contrast mechanisms for MRI. 

Fundamentals of MRI 

Magnetic resonance imaging exploits the existence of induced nuclear magnetism in the patient. Materials 
with an odd number of protons or neutrons possess a weak but observable nuclear magnetic moment. 
Most commonly protons ('H) are imaged, although carbon ( 13 C), phosphorous ( 31 P), sodium ( 23 Na), 
and fluorine ( 19 F) are also of significant interest. The nuclear moments are normally randomly oriented, 
but they align when placed in a strong magnetic field. Typical field strengths for imaging range between 
0.2 and 1.5 T, although spectroscopic and functional imaging work is often performed with higher field 
strengths. The nuclear magnetization is very weak; the ratio of the induced magnetization to the applied 
fields is only 4 x 10~ 9 . The collection of nuclear moments is often referred to as magnetization or spins. 
The static nuclear moment is far too weak to be measured when it is aligned with the strong static 
magnetic field. Physicists in the 1940s developed resonance techniques that permit this weak moment 
to be measured. The key idea is to measure the moment while it oscillates in a plane perpendicular to 
the static field [3,4]. First one must tip the moment away from the static field. When perpendicular to 
the static field, the moment feels a torque proportional to the strength of the static magnetic field. The 
torque always points perpendicular to the magnetization and causes the spins to oscillate or precess in 
a plane perpendicular to the static field. The frequency of the rotation co is proportional to the field: 

where y, the gyromagnetic ratio, is a constant specific to the nucleus, and B is the magnetic field strength. 
The direction of B defines the z axis. The precession frequency is called the Larmor frequency. The 
negative sign indicates the direction of the precession. 

Since the precessing moments constitute a time-varying flax, they produce a measurable voltage in a 
loop antenna arranged to receive the x and y components of induction. It is remarkable that in MRI we 
are able to directly measure induction from the precessing nuclear moments of water protons. 

Recall that to observe this precession, we first need to tip the magnetization away from the static field. 
This is accomplished with a weak rotating radiofrequency (RF) field. It can be shown that a rotating RF 
field introduces a fictitious field in the z direction of strength <o/y. By tuning the frequency of the RF 
field to (a , we effectively delete the B field. The RF slowly nutates the magnetization away from the z 
axis. The Larmor relation still holds in this "rotating frame," so the frequency of the nutation is yB v 
where B x is the amplitude of the RF field. Since the coils receive x and y (transverse) components of 
induction, the signal is maximized by tipping the spins completely into the transverse plane. This is 
accomplished by a nil RF pulse, which requires yB{i = nil, where x is the duration of the RF pulse. 
Another useful RF pulse rotates spins by n radians. This can be used to invert spins. It also can be used 
to refocus transverse spins that have dephased due to B field inhomogeneity. This is called a spin echo 
and is widely used in imaging. 

NMR has been used for decades in chemistry. A complex molecule is placed in a strong, highly uniform 
magnetic field. Electronic shielding produces microscopic field variations within the molecule so that 
geometrically isolated nuclei rotate about distinct fields. Each distinct magnetic environment produces 
a peak in the spectra of the received signal. The relative size of the spectral peaks gives the ratio of nuclei 
in each magnetic environment. Hence the NMR spectrum is extremely useful for elucidating molecular 
structure. 

The NMR signal from a human is due predominantly to water protons. Since these protons exist in 
identical magnetic environments, they all resonate at the same frequency. Hence the NMR signal is simply 
proportional to the volume of the water. They key innovation for MRI is to impose spatial variations on 
the magnetic field to distinguish spins by their location. Applying a magnetic field gradient causes each 
region of the volume to oscillate at a distinct frequency. The most effective nonuniform field is a linear 
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gradient where the field and the resulting frequencies vary linearly with distance along the object being 
studied. Fourier analysis of the signal obtains a map of the spatial distribution of spins. This argument 
is formalized below, where we derive the powerful fc-space analysis of MRI [5,6]. 

/c-Space Analysis of Data Acquisition 

In MRI, we receive a volume integral from an array of oscillators. By ensuring that the phase, "signature" 
of each oscillator is unique, one can assign a unique location to each spin and thereby reconstruct an 
image. During signal reception, the applied magnetic field points in the z direction. Spins precess in the 
xy plane at the Larmor frequency. Hence a spin at position r = {x,y,z) has a unique phase that describes 
its angle relative to the y axis in the xy plane: 



3(r,f) = -y| B z (r,x)dx (63.1) 



where B z (r,t) is the z component of the instantaneous, local magnetic flux density. This formula assumes 
there are no x and y field components. 

A coil large enough to receive a time-varying flux uniformly from the entire volume produces an EMF 
proportional to 



s ( f )ocAJ M ( r ) e -' e M dr (6 3.2) 



where M(r) represents the equilibrium moment density at each point r. 

The key idea for imaging is to superimpose a linear field gradient on the static field B This field points 
in the direction z, and its magnitude varies linearly with a coordinate direction. For example, an x gradient 
points in the z direction and varies along the coordinate x. This is described by the vector field xG x x, 
where z is the unit vector in the z direction. In general, the gradient is (xG x + yG + zG z ) z, which can 
be written compactly as the dot product G-r z. These gradient field components can vary with time, so 
the total z field is 

B z (r,f) = 5 + G(f)-r (63.3) 

In the presence of this general time-varying gradient, the received signal is 

s( t )ocAJ e -W M ( r ) e -< G M^ r (634) 

The center frequency yB is always much larger than the bandwidth of the signal. Hence the derivative 
operation is approximately equivalent to multiplication by -!C0 . The signal is demodulated by the 
waveform e'"' B «' to obtain the "baseband" signal: 

Jt) oc -io) | Miry 1 ' 11 G ' T ' rrfT dr (63.5) 

It will be helpful to define the term k(f): 



<M = yJ" o G(t) 



dx (63.6) 
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FIGURE 63.1 The drawing on the left illustrates the scanning pattern of the 2D Fourier transform imaging sequence. 
On the right is a plot of the gradient and RF waveforms that produce this pattern. Only four of the N y horizontal 
fc-space lines are shown. The phase-encode period initiates each acquisition at a different fc, and at -fc x (max). Data 
are collected during the horizontal traversals. After all N y fc-space lines have been acquired, a 2D FFT reconstructs 
the image. Usually 128 or 256 k lines are collected, each with 256 samples along k x . The RF pulse and the z gradient 
waveform together restrict the image to a particular slice through the subject. 

Then we can rewrite the received baseband signal as 



s(f)oc f M^y^'dr 



(63.7) 



which we can now identify as the spatial Fourier transform o/M(r) evaluated atk(f). That is, S(t) scans 
the spatial frequencies of the function M(r). This can be written explicitly as 



S(t)-M(k(t)) 



(63.8) 



where 7ft(k) is the three-dimensional Fourier transform of the object distribution M(r). Thus we can 
view MRI with linear gradients as a "scan" of fc-space or the spatial Fourier transform of the image. After 
the desired portion of fc-space is scanned, the image M(r) is reconstructed using an inverse Fourier 
transform. 

2D Imaging. Many different gradient waveforms can be used to scan fc-space and to obtain a desired 
image. The most common approach, called two-dimensional Fourier transform imaging (2D FT), is to 
scan through /c-space along several horizontal lines covering a rectilinear grid in 2D fc-space. See Fig. 63.1 
for a schematic of the fc-space traversal. The horizontal grid lines are acquired using 128 to 256 excitations 
separated by a time TR, which is determined by the desired contrast, RF flip angle, and the Xj of the 
desired components of the image. The horizontal-line scans through fc-space are offset in fc by a variable 
area y-gradient pulse, which happens before data acquisition starts. These variable offsets in fc are called 
phase encodes because they affect the phase of the signal rather than the frequency. Then for each fc phase 
encode, signal is acquired while scanning horizontally with a constant x gradient. 

Resolution and Field of View. The fundamental image characteristics of resolution and field of view 
(FOV) are completely determined by the characteristics of the fc-space scan. The extent of the coverage 
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of fc-space determines the resolution of the reconstructed image. The resolution is inversely proportional 
to the highest spatial frequency acquired: 

1 fc *( maX ) yGT lr , 

— =— '-=^^- (63.9) 

Ax % 2% 



(63.10) 



where G x is the readout gradient amplitude and T is the readout duration. The time T. ' hase is the duration 
of the phase-encode gradient G . For proton imaging on a 1.5-T imaging system, a typical gradient 
strength is G x = 1 G/cm. The signal is usually read for about 8 ms. For water protons, y = 26,751 rad/s/G, 
so the maximum excursion in k x is about 21 rad/mm. Hence we cannot resolve an object smaller than 
0.3 mm in width. From this one might be tempted to improve the resolution dramatically using very 
strong gradients or very long readouts. But there are severe practical obstacles, since higher resolution 
increases the scan time and also degrades the image SNR. 

In the phase-encode direction, the fc-space data are sampled discretely. This discrete sampling in fc-space 
introduces replication in the image domain [7]. If the sampling in fc-space is finer than 1/FOV, then the 
image of the object will not fold back on itself. When the fc-space sampling is coarser than 1/FOV, the 
image of the object does fold back over itself. This is termed aliasing. Aliasing is prevented in the readout 
direction by the sampling filter. 

Perspective. For most imaging systems, diffraction limits the resolution. That is, the resolution is limited 
to the wavelength divided by the angle subtended by the receiver aperture, which means that the ultimate 
resolution is approximately the wavelength itself. This is true for imaging systems based on optics, 
ultrasound, and x-rays (although there are other important factors, such as quantum noise, in x-ray). 

MRI is the only imaging system for which the resolution is independent of the wavelength. In MRI, 
the wavelength is often many meters, yet submillimeter resolution is routinely achieved. The basic reason 
is that no attempt is made to focus the radiation pattern to the individual pixel or voxel (volume element), 
as is done in all other imaging modalities. Instead, the gradients create spatially varying magnetic fields 
so that individual pixels emit unique waveform signatures. These signals are decoded and assigned to 
unique positions. An analogous problem is isolating the signals from two transmitting antenna towers 
separated by much less than a wavelength. Directive antenna arrays would fail because of diffraction 
spreading. However, we can distinguish the two signals if we use the a priori knowledge that the two 
antennas transmit at different frequencies. We can receive both signals with a wide-angle antenna and 
then distinguish the signals through frequency-selective filtering. 

SNR Considerations. The signal strength is determined by the EMF induced from each voxel due to 
the processing moments. The magnetic moment density is proportional to the polarizing field B a . Recall 
that the EMF is proportional to the rate of change of the coil flux. The derivative operation multiples 
the signal by the Larmor frequency, which is proportional to B , so the received signal is proportional to 
B 2 times the volume of the voxel V v . 

In a well-designed MRI system, the dominant noise source is due to thermally generated currents 
within the conductive tissues of the body. These currents create a time-varying flux which induces noise 
voltages in the receiver coil. Other noise sources include the thermal noise from the antenna and from 
the first amplifier. These subsystems are designed so that the noise is negligible compared with the noise 
from the patient. The noise received is determined by the total volume seen by the antenna pattern V n 
and the effective resistivity and temperature of the conductive tissue. One can show [8] that the standard 
deviation of the noise from conductive tissue varies linearly with B . The noise is filtered by an integration 
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over the total acquisition time T , which effectively attenuates the noise standard deviation by It 
Therefore, the SNR varies as 



SNR< 
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(63.11) 



The noise volume V„ is the effective volume based on the distribution of thermally generated currents. 
For example, when imaging a spherical object of radius r, the noise standard deviation varies as r 5 ' 2 [9]. 
The effective resistance depends strongly on the radius because currents near the outer radius contribute 
more to the noise flux seen by the receiver coil. 

To significantly improve the SNR, most systems use surface coils, which are simply small coils that are 
just big enough to see the desired region of the body. Such a coil effectively maximizes the voxel-volume 
to noise-volume ratio. The noise is significantly reduced because these coils are sensitive to currents from 
a smaller part of the body. However, the field of view is somewhat limited, so "phased arrays" of small 
coils are now being offered by the major manufacturers [10]. In the phased array, each coil sees a small 
noise volume, while the combined responses provide the wide coverage at a greatly improved SNR. 

Fast Imaging. The 2D FT scan of fc-space has the disadvantage that the scan time is proportional to 
the number of phase encodes. It is often advantageous to trade off SNR for a shorter scan time. This is 
especially true when motion artifacts dominate thermal noise. To allow for a flexible tradeoff of SNR for 
imaging time, more than a single line in fc-space must be covered in a single excitation. The most popular 
approach, called echo-planar imaging (EPI), traverses fc-space back and forth on a single excitation pulse. 
The fc-space trajectory is drawn in Fig. 63.2. 

It is important that the tradeoff be flexible so that you can maximize the imaging time given the 
motion constraints. For example, patients can hold their breath for about 12 seconds. So a scan of 
12 seconds' duration gives the best SNR given the breath-hold constraint. The EPI trajectory can be 
interleaved to take full advantage of the breath-hold interval. If each acquisition takes about a second, 
12 interleaves can be collected. Each interleaf acquires every twelfth line in fc-space. 
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FIGURE 63.2 Alternative methods for the rapid traversal of k space. On the left is the echo planar trajectory. Data 
are collected during the horizontal traversals. When all N v horizontal lines in k space have been acquired, the data 
are sent to a 2D FFT to reconstruct the image. On the right is an interleaved spiral trajectory. The data are interpolated 
to a 2D rectilinear grid and then Fourier transformed to reconstruct the image. These scanning techniques allow for 
imaging within a breathhold. 
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Another trajectory that allows for a flexible tradeoff between scan time and SNR is the spiral trajectory. 
Here the trajectory starts at the origin in fc-space and spirals outward. Interleaving is accomplished by 
rotating the spirals. Figure 63.2 shows two interleaves in a spiral format. Interleaving is very helpful for 
reducing the hardware requirements (peak amplitude, peak slew rate, average dissipation, etc.) for the 
gradients amplifiers. For reconstruction, the data are interpolated to a 2D rectilinear grid and then 
Fourier-transformed. Our group has found spiral imaging to be very useful for imaging coronary arteries 
within a breath-hold scan [11]. The spiral trajectory is relatively immune to artifacts due to the motion 
of blood. 

Contrast Mechanisms 

The tremendous clinical utility of MRI is due to the great variety of mechanisms that can be exploited 
to create image contrast. If magnetic resonance images were restricted to water density, MRI would be 
considerably less useful, since most tissues would appear identical. Fortunately, many different MRI 
contrast mechanisms can be employed to distinguish different tissues and disease processes. 

The primary contrast mechanisms exploit relaxation of the magnetization. The two types of relaxations 
are termed spin-lattice relaxation, characterized by a relaxation time X p and spin-spin relaxation, char- 
acterized by a relaxation time T, . 

Spin-lattice relaxation describes the rate of recovery of the z component of magnetization toward 
equilibrium after it has been disturbed by RF pulses. The recovery is given by 



M 



(t)=M (l-e- f / r ') + M z (o) e -^ 



(63.12) 



where M is the equilibrium magnetization. Differences in the Xj time constant can be used to produce 
image contrast by exciting all magnetization and then imaging before full recovery has been achieved. 
This is illustrated on the left in Fig. 63.3. An initial ti/2 RF pulse destroys all the longitudinal magneti- 
zation. The plots show the recovery of two different T r components. The short Xj component recovers 
faster and produces more signal. This gives a X r weighted image. 
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FIGURE 63.3 The two primary MRI contrast mechanisms, T, and T 2 . T p illustrated on the left, describes the rate 
at which the equilibrium M 7k magnetization is restored after it has been disturbed. T, contrast is produced by imaging 
before full recovery has been obtained. T 2 , illustrated on the right, describes the rate at which the MRI signal decays 
after it has been created. T 2 contrast is produced by delaying data acquisition, so shorter T 2 components produce 
less signal. 
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FIGURE 63.4 

right. 



Example images of a normal volunteer demonstrating T, contrast on the left and T 2 contrast on the 



Spin-spin relaxation describes the rate at which the NMR signal decays after it has been created. The 
signal is proportional to the transverse magnetization and is given by 
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Image contrast is produced by delaying the data acquisition. The decay of two different T 2 species is 
plotted on the right in Fig. 63.3. The signal from the shorter T 2 component decays more rapidly, while 
that of the longer T 2 component persists. At the time of data collection, the longer T 2 component produces 
more signal. This produces a X 2 -weighted image. 

Figure 63.4 shows examples of these two basic types of contrast. These images are of identical axial 
sections through the brain of a normal volunteer. The image of the left was acquired with an imaging 
method that produces T l contrast. The very bright ring of subcutaneous fat is due to its relatively short 
T l . White matter has a shorter T l than gray matter, so it shows up brighter in this image. The image on 
the right was acquired with an imaging method that produces T 2 contrast. Here the cerebrospinal fluid 
in the ventricles is bright due to its long T 2 . White matter has a shorter T 2 than gray matter, so it is darker 
in this image. 

There are many other contrast mechanisms that are important in MRI. Different chemical species 
have slightly different resonant frequencies, and this can be used to image one particular component. It 
is possible to design RF and gradient waveforms so that the image shows only moving spins. This is of 
great utility in MR angiography, allowing the noninvasive depiction of the vascular system. Another 
contrast mechanism is called T 2 . This relaxation parameter is useful for functional imaging. It occurs 
when there is a significant spread of Larmor frequencies within a voxel. The superposition signal is 
attenuated faster than T 2 due to destructive interference between the different frequencies. 

In addition to the intrinsic tissue contrast, artificial MRI contrast agents also can be introduced. These 
are usually administered intravenously or orally. Many different contrast mechanisms can be exploited, 
but the most popular agents decrease both T l and T 2 . One agent approved for clinical use is gadolinium 
DPTA. Decreasing T l causes faster signal recovery and a higher signal on a T t -weighted image. The 
contrast-enhanced regions then show up bright relative to the rest of the image. 
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Defining Terms 

Gyromagnetic ratio y: An intrinsic property of a nucleus. It determines the Larmor frequency through 
the relation ro = —yB . 

fc-space: The reciprocal of object space, fc-space describes MRI data acquisition in the spatial Fourier 
transform coordinate system. 

Larmor frequency co : The frequency of precession of the spins. It depends on the product of the 
applied flux density B and on the gyromagnetic ratio y. The Larmor frequency is co = —yB . 

Magnetization M: The macroscopic ensemble of nuclear moments. The moments are induced by an 
applied magnetic field. At body temperatures, the amount of magnetization is linearly proportional 
(M = 4 x 1(T 9 H ) to the applied magnetic field. 

Precession: The term used to describe the motion of the magnetization about an applied magnetic 
field. The vector locus traverses a cone. The precession frequency is the frequency of the magne- 
tization components perpendicular to the applied field. The precession frequency is also called the 
Larmor frequency <o . 

Spin echo: The transverse magnetization response to a n RF pulse. The effects of field inhomogeneity 
are refocused at the middle of the spin echo. 

Spin-lattice relaxation T,: The exponential rate constant describing the decay of the z component of 
magnetization toward the equilibrium magnetization. Typical values in the body are between 300 
and 3000 ms. 

Spin-Spin relaxation T 2 : The exponential rate constant describing the decay of the transverse compo- 
nents of magnetization (M x and M ). 

Spins M: Another name for magnetization. 

2D FT: A rectilinear trajectory through fc-space. This popular acquisition scheme requires several 
(usually 128 to 256) excitations separated by a time TR, which is determined by the desired contrast, 
RF flip angle, and the T 1 of the desired components of the image. 
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63.2 Hardware/Instrumentation 



John Schenck 

This section describes the basic components and the operating principles of MRI scanners. Although 
scanners capable of making diagnostic images of the human internal anatomy through the use of magnetic 
resonance imaging (MRI) are now ubiquitous devices in the radiology departments of hospitals in the 
U.S. and around the world, as recently as 1980 such scanners were available only in a handful of research 
institutions. Whole-body superconducting magnets became available in the early 1980s and greatly 
increased the clinical acceptance of this new imaging modality. Market research data indicate that between 
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1980 and 1996 more than 100 million clinical MRI scans were performed worldwide. By 1996 more than 
20 million MRI scans were being performed each year. 

MRI scanners use the technique of nuclear magnetic resonance (NMR) to induce and detect a very 
weak radio frequency signal that is a manifestation of nuclear magnetism. The term nuclear magnetism 
refers to weak magnetic properties that are exhibited by some materials as a consequence of the nuclear 
spin that is associated with their atomic nuclei. In particular, the proton, which is the nucleus of the 
hydrogen atom, possesses a nonzero nuclear spin and is an excellent source of NMR signals. The human 
body contains enormous numbers of hydrogen atoms — especially in water (H 2 0) and lipid molecules. 
Although biologically significant NMR signals can be obtained from other chemical elements in the body, 
such as phosphorous and sodium, the great majority of clinical MRI studies utilize signals originating 
from protons that are present in the lipid and water molecules within the patient's body. 

The patient to be imaged must be placed in an environment in which several different magnetic fields 
can be simultaneously or sequentially applied to elicit the desired NMR signal. Every MRI scanner utilizes 
a strong static field magnet in conjunction with a sophisticated set of gradient coils and radiofrequency 
coils. The gradients and the radiofrequency components are switched on and off in a precisely timed 
pattern, or pulse sequence. Different pulse sequences are used to extract different types of data from the 
patient. MR images are characterized by excellent contrast between the various forms of soft tissues 
within the body. For patients who have no ferromagnetic foreign bodies within them, MRI scanning 
appears to be perfectly safe and can be repeated as often as necessary without danger [Shellock and Kanal, 
1998]. This provides one of the major advantages of MRI over conventional X-ray and computed 
tomographic (CT) scanners. The NMR signal is not blocked at all by regions of air or bone within the 
body, which provides a significant advantage over ultrasound imaging. Also, unlike the case of nuclear 
medicine scanning, it is not necessary to add radioactive tracer materials to the patient. 

Fundamentals of MRI Instrumentation 

Three types of magnetic fields — main fields or static fields (B 2 ), gradient fields, an radiofrequency (RF) 
fields (Bj) — are required in MRI scanners. In practice, it is also usually necessary to use coils or magnets 
that produce shimming fields to enhance the spatial uniformity of the static field B . Most MRI hardware 
engineering is concerned with producing and controlling these various forms of magnetic fields. The 
ability to construct NMR instruments capable of examining test tube-sized samples has been available 
since shortly after World War II. The special challenge associated with the design and construction of 
medical scanners was to develop a practical means of scaling these devices up to sizes capable of safely 
and comfortably accommodating an entire human patient. Instruments capable of human scanning first 
became available in the late 1970s. The successful implementation of MRI requires a two-way flow of 
information between analog and digital formats (Fig. 63.5). The main magnet, the gradient and RF coils, 
and the gradient and RF power supplies operate in the analog domain. The digital domain is centered 
on a general-purpose computer (Fig. 63.6) that is used to provide control information (signal timing 
and amplitude) to the gradient and RF amplifiers, to process time-domain MRI signal data returning 
from the receiver, and to drive image display and storage systems. The computer also provides miscella- 
neous control functions, such as permitting the operator to control the position of the patient table. 

Static Field Magnets 

The main field magnet [Thomas, 1993] is required to produce an intense and highly uniform, static 
magnetic field over the entire region to be imaged. To be useful for imaging purposes, this field must be 
extremely uniform in space and constant in time. In practice, the spatial variation of he main field of a 
whole-body scanner must be less than about 1 to 10 parts per million (ppm) over a region approximately 
40 cm in diameter. To achieve these high levels of homogeneity requires careful attention to magnet 
design and to manufacturing tolerances. The temporal drift of the field strength is normally required to 
be less than 0.1 ppm/h. 
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FIGURE 63.5 Digital and analog domains for MRI imaging. MRI involves the flow of data and system commands 
between these two domains (Courtesy of WM Leue. Reprinted with permission from Schenck and Leue, 1991). 
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FIGURE 63.6 Block diagram for an MRI scanner. A general-purpose computer is used to generate the commands 
that control the pulse sequence and to process data during MR scanning (Courtesy of WM Leue. Reprinted with 
permission from Schenck and Leue, 1991.) 

Two units of magnetic field strength are now in common use. The gauss (G) has a long historical 
usage and is firmly embedded in the older scientific literature. The tesla (T) is a more recently adopted 
unit, but is a part of the SI system of units and, for this reason, is generally preferred. The tesla is a much 
larger unit than the gauss — 1 T corresponds to 10,000 G. The magnitude of the earth's magnetic field is 
about .05 mT (5000 G). The static magnetic fields of modern MRI scanners arc most commonly in the 
range of from 0.5 to 1.5 T; useful scanners, however, have been built using the entire range from 0.02 to 
8 T. The signal-to-noise ration (SNR) is the ratio of the NMR signal voltage to the ever-present noise 
voltages that arise within the patient and within the electronic components of the receiving system. The 
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SNR is one of the key parameters that determine the performance capabilities of a scanner. The maximum 
available SNR increases linearly with field strength. The improvement in SNR as the field strength is 
increased is the major reason that so much effort has gone into producing high-field magnets for MRI 
systems. 

Magnetic fields can be produced by using either electric currents or permanently magnetized materials 
as sources. In either case, the field strength falls off rapidly away from the source, and it is not possible 
to create a highly uniform magnetic field on the outside of a set of sources. Consequently, to produce 
the highly uniform field required for MRI, it is necessary to more or less surround the patient with a 
magnet. The main field magnet must be large enough, therefore, to effectively surround the patient; in 
addition, it must meet other stringent performance requirements. For these reasons, the main field 
magnet is the most important determinant of the cost, performance, and appearance of an MRI scanner. 
Four different classes of main magnets — (1) permanent magnets, (2) electromagnets, (3) resistive mag- 
nets, and (4) superconducting magnets — have been used in MRI scanners. 

Permanent Magnets and Electromagnets 

Both these magnet types use magnetized materials to produce the field that is applied to the patient. In 
a permanent magnet, the patient is placed in the gap between a pair of permanently magnetized pole 
faces. Electromagnets use a similar configuration, but the pole faces are made of soft magnetic materials, 
which become magnetized only when subjected to the influence of electric current coils that are wound 
around them. Electromagnets, but not permanent magnets, require the use of an external power supply. 
For both types of magnets, the magnetic circuit is completed by use of a soft iron yoke connecting the 
pole faces to one another (Fig. 63.7). The gap between the pole faces must be large enough to contain 
the patient as well as the gradient and RF coils. The permanent magnet materials available for use in 
MRI scanners include high-carbon iron, alloys such as Alnico, ceramics such as barium ferrite, and rare 
earth alloys such as samarium cobalt. 

Permanent magnet scanners have some advantages: They produce a relatively small fringing field and 
do not require power supplies. However, they tend to be very heavy (up to 100 tons) can produce only 
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FIGURE 63.7 Permanent magnet. The figure shows a schematic cross-section of a typical permanent magnet 
cinfiguration. Electromagnets have a similar construction but are energized by current-carrying coils wound around 
the iron yoke. Soft magnetic shims are used to enhance the homogeneity of the field. (Reprinted with permission 
from Schenck and Leue, 1991.) 
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relatively low fields — on the order of 0.3 T or less. They are also subject to temporal field drift caused 
by temperature changes. If the pole faces are made from an electrically conducting material, eddy currents 
induced in the pole faces by the pulsed gradient fields can limit performance as well. A recently introduced 
alloy of neodymium, boron, and iron (usually referred to as neodymium iron) has been used to make 
lighter-weight permanent magnet scanners. 

Resistive Magnets 

The first whole-body scanners, manufactured in the late 1970s and early 1980s, used four to six large 
coils of copper or aluminum wire surrounding the patient. These coils are energized by powerful (40 to 
100 kW) direct-current (dc) power supplies. The electrical resistance of the coils leads to substantial joule 
heating, and the use of cooling water flowing through the coils is necessary to prevent overheating. The 
heat dissipation increases rapidly with field strength, and it is not feasible to build resistive magnets 
operating at fields much higher than 0.15 to 0.3 T. At present, resistive magnets are seldom used except 
for very low field strength (0.02 to 0.06 T) applications. 

Superconducting Magnets 

Since the early 1980s, the use of cryogenically cooled superconducting magnets [Wilson, 1983] has been 
the most satisfactory solution to the problem of producing the static magnet field for MRI scanners. The 
property of exhibiting absolutely no electrical resistance near absolute zero has been known as an exotic 
property of some materials since 1911. Unfortunately, the most common of these materials, such as lead, 
tin, and mercury, exhibit a phase change back to the normal state at relatively low magnetic field strengths 
and cannot be used to produce powerful magnetic fields. In the 1950s, a new class of materials (type II 
superconductors) was discovered. These materials retain the ability to carry loss-free electric currents in 
very high fields. One such material, an alloy of niobium and titanium, has been used in most of the 
thousands of superconducting whole-body magnets that have been constructed for use in MRI scanners 
(Fig. 63.8). The widely publicized discovery in 1986 of another class of materials which remain super- 
conducting at much higher temperatures than any previously known material has not yet lead to any 
material capable of carrying sufficient current to be useful in MRI scanners. 




FIGURE 63.8 Superconducting magnet. This figure shows a 1.5-T whole-body superconducting magnet. The 
nominal warm bore diameter is 1 m. The patient to be imaged, as well as the RF and gradient coils, are located 
within this bore. (Courtesy of General Electric Medical Systems. Reprinted with permission from Schenck and Leue, 
1991.) 
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FIGURE 63.9 Schematic drawing of a superconducting magnet. The main magnet coils and the superconducting 
shim coils are maintained at liquid helium temperature. A computer-controlled table is used to advance the patient 
into the region of imaging. (Reprinted with permission from Schenck and Leue, 1991.) 



Figure 63.9 illustrates the construction of a typical superconducting whole-body magnet. In this case, 
six coils of superconducting wire are connected in a series and carry an intense current — on the order 
of 200 A — to produce the 1.5-T magnetic field at the magnet's center. The diameter of the coils is about 
1.3 m, and the total length of wire is about 65 km (40 miles). The entire length of this wire must be 
without any flaws — such as imperfect welds — that would interrupt the superconducting properties. If 
the magnet wire has no such flaws, the magnet can be operated in the persistent mode — that is, once 
the current is established, the terminals may be connected together, and a constant persistent current 
flow indefinitely so long as the temperature of the coils is maintained below the superconducting 
transition temperature. This temperature is about 10 K for niobium-titanium wire. The coils are kept 
at this low temperature by encasing them in a double-walled cryostat (analogous to a Thermos bottle) 
that permits them to be immersed in liquid helium at a temperature of 4.2 K. The gradual boiling of 
liquid helium caused by inevitable heat leaks into the cryostat requires that the helium be replaced on a 
regular schedule. Many magnets now make use of cryogenic refrigerators that reduce or eliminate the 
need for refilling the liquid helium reservoir. The temporal stability of superconducting magnets operating 
in the persistent mode is truly remarkable — magnets have operated for years completely disconnected 
from power supplies and maintained their magnetic field constant to within a few parts per million. 
Because of their ability to achieve very strong and stable magnetic field strengths without undue power 
consumption, superconducting magnets have become the most widely used source of the main magnetic 
fields for MRI scanners. 

Magnetic Field Homogeneity 

The necessary degree of spatial uniformity of the field can be achieved only by carefully placing the coils 
at specific spatial locations. It is well known that a single loop of wire will produce, on its axis, a field 
that is directed along the coil axis and that can be expressed as a sum of spherical harmonic fields. The 
first term in this sum is constant in space and represents the desired field that is completely independent 
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of position. The higher-order terms represent contaminating field inhomogeneities that spoil the field 
uniformity. More than a century ago, a two-coil magnet system — known as the Helmholtz pair — was 
developed which produced a much more homogeneous field at its center than is produced by a single 
current loop. This design is based on the mathematical finding that when two coaxial coils of the same 
radius are separated by a distance equal to their radius, the first nonzero contaminating term in the 
harmonic expansion is of the fourth order. This results in an increased region of the field homogeneity, 
which, although it is useful in many applications, is far too small to be useful in MRI scanners. However, 
the principle of eliminating low-order harmonic fields can be extended by using additional coils. This is 
the method now used to increase the volume of field homogeneity to values that are useful for MRI. For 
example, in the commonly used six-coil system, it is possible to eliminate all the error fields through the 
twelfth order. 

In practice, manufacturing tolerances and field perturbations caused b extraneous magnetic field 
sources — such as steel girders in the building surrounding the magnet — produce additional inhomoge- 
neity in the imaging region. These field imperfections are reduced by the use of shimming fields. One 
approach — active shimming — uses additional coils (either resistive coils, superconducting coils, or some 
of each) which are designed to produce a magnetic field corresponding to a particular term in the spherical 
harmonic expansion. When the magnet is installed, the magnetic field is carefully mapped, and the 
currents in the shim coils are adjusted to cancel out the terms in the harmonic expansion to some 
prescribed high order. The alternative approach — passive shimming — utilizes small permanent magnets 
that are placed at the proper locations along the inner walls of the magnet bore to cancel out contami- 
nating fields. If a large object containing magnetic materials — such as a power supply — is moved in the 
vicinity of superconducting magnets, it may be necessary to reset the shimming currents or magnet 
locations to account for the changed pattern of field inhomogeneity. 

Fringing Fields 

A large, powerful magnet produces a strong magnetic field in the region surrounding it as well as in its 
interior. This fringing field can produce undesirable effects such as erasing magnetic tapes (and credit 
cards). It is also a potential hazard to people with implanted medical devices such as cardiac pacemakers. 
For safety purposes, it is general practice to limit access to the region where the fringing field becomes 
intense. A conventional boundary for this region is the "5-gaussline," which is about 10 to 12 m from 
the center of an unshielded 1.5-T magnet. Magnetic shielding — in the form of iron plates (passive 
shielding) or external coils carrying current in the direction opposite to the main coil current (active 
shielding) — is frequently used to restrict the region in which the fringing field is significant. 

Gradient Coils 

Three gradient fields, one each for the x, y, and z directions of a Cartesian coordinate system, are used 
to code position information into the MRI signal and to permit the imaging of thin anatomic slices 
[Thomas, 1993]. Along with their larger size, it is the use of these gradient coils that distinguishes MRI 
scanners from the conventional NMR systems such as those used in analytical chemistry. The direction 
of the static field, along the axis of the scanner, is conventionally taken as the z direction, and it is only 
the Cartesian component of the gradient field in this direction that produces a significant contribution 
to the resonant behavior of the nuclei. Thus, the three relevant gradient fields are B z = GJi, B z = G y y, 
and B z = G Z Z. MRI scans are carried out by subjecting the spin system to a sequence of pulsed gradient 
and RF fields. Therefore, it is necessary to have three separate coils — one for each of the relevant gradient 
fields — each with its own power supply and under independent computer control. Ordinarily, the most 
practical method for constructing the gradient coils is to wind them on a cylindrical coil form that 
surrounds the patient and is located inside the warm bore of the magnet. The z gradient field can be 
produced by sets of circular coils wound around the cylinder with the current direction reversed for coils 
on the opposite sides of the magnet center (z= 0).To reduce deviations from a perfectly linear B z gradient 
field, a spiral winding can be used with the direction of the turns reversed at z = and the spacing 
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FIGURE 63.10 Z-gradient coil. The photograph shows a spiral coil wound on a cylindrical surface with an over- 
widing near the end of the coil (Courtesy of R. J. Dobberstein, General Electric Medical Systems. Reprinted with 
permission from Schenck and Leue, 1991.) 




FIGURE 63.11 Transverse gradient coil. The photograph shows the outer coil pattern of an actively shielded 
transverse gradient coil. (Courtesy of R. J. Dobberstien, General Electric Medical Systems. Reprinted with permission 
from Schenck and Leue, 1991). 

between windings decreasing away from the coil center (Fig. 63.10). A more complex current pattern is 
required to produce the transverse (x and y) gradients. As indicated in Fig. 63.11, transverse gradient 
fields are produced by windings which utilize a four-quadrant current pattern. 

The generation of MR images requires that a rapid sequence of time-dependent gradient fields (on all 
three axes) be applied to the patient. For example, the commonly used technique of spin-warp imaging 
[Edelstein et al., 1980] utilizes a slice-selection gradient pulse to select the spins in a thin (3 tolO mm) 
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slice of the patient and then applies readout and phase-encoding gradients in the two orthogonal 
directions to encode two-dimensional spatial information into the NMR signal. This, in turn, requires 
that the currents in the three gradient coils be rapidly switched by computer-controlled power supplies. 
The rate at which gradient currents can be switched is an important determinant of the imaging capa- 
bilities of a scanner. In typical scanners, the gradient coils have an electrical resistance of about 1Q and 
an inductance of about 1 mH, and the gradient field can be switched from to 10 mT/m (1 G/cm) in 
about 0.5 ms. The current must be switched from to about 100 A in this interval, and the instantaneous 
voltage on the coils, L di/dt, is on the order of 200 V. The power dissipation during the switching interval 
is about 20 kW. In more demanding applications, such as are met in cardiac MRI, the gradient field may 
be as high as 4 to 5 mT/m and switched in 0.2 ms or less. In this case, the voltage required during gradient 
switching is more than 1 kV. In many pulse sequences, the switching duty cycle is relatively low, and coil 
heating is not significant. However, fast-scanning protocols use very rapidly switched gradients at a high 
duty cycle. This places very strong demands on the power supplies, and it is often necessary to use water 
cooling to prevent overheating the gradient coils. 

Radiofrequency Coils 

Radiofrequency (RF) coils are components of every scanner and are used for two essential pur- 
poses — transmitting and receiving signals at the resonant frequency of the protons within the patient 
[Schenck, 1993]. The precession occurs at the Larmor frequency of the protons, which is proportional 
to the static magnetic field. At IT this frequency is 42.58 MHz. Thus in the range of field strengths 
currently used in whole-body scanners, 0.02 to 4 T, the operating frequency ranges from 0.85 to 
170.3 MHz. For the commonly used 1.5-T scanners, the operating frequency is 63.86 MHz. The frequency 
of MRI scanners overlaps the spectral region used for radio and television broadcasting. As an example, 
the frequency of a 1.5-T scanner is within the frequency hand 60 to 66 MHz, which is allocated to 
television channel 3. Therefore, it is not surprising that the electronic components in MRI transmitter 
and receiver chains closely resemble corresponding components in radio and television circuitry. An 
important difference between MRI scanners and broadcasting systems is that the transmitting and 
receiving antennas of broadcast systems operate in the far field of the electromagnetic wave. These 
antennas are separated by many wavelengths. On the other hand, MRI systems operate in the near field, 
and the spatial separation of the sources and receivers is much less than a wavelength. In far- field systems, 
the electromagnetic energy is shared equally between the electric and magnetic components of the wave. 
However, in the near field of magnetic dipole sources, the field energy is almost entirely in the magnetic 
component of the electromagnetic wave. This difference accounts for the differing geometries that are 
most cost effective for broadcast and MRI antenna structures. 

Ideally, the RF field is perpendicular to the static field, which is in the z direction. Therefore, the RF 
field can be linearly polarized in either the x or y direction. However, the most efficient RF field results 
from quadrature excitation, which requires a coil that is capable of producing simultaneous x and y fields 
with a 90-degree phase shift between them. Three classes of RF coils — body coils, head coils, and surface 
coils — are commonly used in MRI scanners. These coils are located in the space between the patient and 
the gradient coils. Conducting shields just inside the gradient coils are used to prevent electromagnetic 
coupling between the RF coils and the rest of the scanner. Head and body coils are large enough to 
surround the legion being imaged and are designed to produce an RF magnetic field that is uniform 
across the region to be imaged. Body coils are usually constructed on cylindrical coil forms and have a 
large enough diameter (50 to 60 cm) to entirely surround the patient's body. Coils are designed only for 
head imaging (Fig. 63.12) have a smaller diameter (typically 28 cm). Surface coils are smaller coils 
designed to image a restricted region of the patient's anatomy. They come in a wide variety of shapes 
and sizes. Because they can be closely applied to the region of interest, surface coils can provide SNR 
advantages over head and body coils for localized regions, but because of their asymmetric design, they 
do not have uniform sensitivity. 
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FIGURE 63.12 Birdcage resonator. This is a head coil designed to perate in a 4-T scanner at 170 MHz. Quadrature 
excitation and receiver performance are achieved by using two adjacent ports with a 90-degree phase shift between 
them. (Reprinted with permission from Schenck and Leue, 1991.) 



A common practice is to use separate coils for the transmitter and receiver functions. This permits the 
use of a large coil — such as the body coil — with a uniform excitation pattern as the transmitter and a small 
surface coil optimized to the anatomic region — such as the spine — being imaged. When this two-coil 
approach is used, it is important to provide for electronically decoupling of the two coils because they 
are tuned at the same frequency and will tend to have harmful mutual interactions. 

Digital Data Processing 

A typical scan protocol calls for a sequence of tailored RF and gradient pulses with duration controlled 
in steps of 0.1 ms. To achieve sufficient dynamic range in control of pulse amplitudes, 12- to 16-bit 
digital-to-analog converters are used. The RF signal at the Larmor frequency (usually in the range from 
1 to 200 MHz) is mixed with a local oscillator to produce a baseband signal which typically has a 
bandwidth of 16 to 32 kHz. The data-acquisition system must digitize the baseband signal at the Nyquist 
rate, which requires sampling the detected RF signal at a rate one digital data point every 5 to 20 ms. 
Again, it is necessary to provide sufficient dynamic range. Analog-to-digital converters with 16 tol8 bits 
are used to produce the desired digitized signal data. During the data acquisition, information is acquired 
at a rate on the order of 800 kilobytes per second, and each image can contain up to a megabyte of digital 
data. The array processor (AP) is a specialized computer that is designed for the rapid performance of 
specific algorithms, such as the fast Fourier transform (FFT), which are used to convert the digitized 
time-domain data to image data. Two-dimensional images are typically displayed as 256 x 128, 256 x 
256, or 512 x 512 pixel arrays. The images can be made available for viewing within about 1 sec after 
data acquisition. Three-dimensional imagining data, however, require more computer processing, and 
this results in longer delays between acquisition and display. 
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A brightness number, typically containing 16 bits of gray-scale information, is calculated for each pixel 
element of the image, and this corresponds to the signal intensity originating in each voxel of the object. 
To make the most effective use of the imaging information, sophisticated display techniques, such as 
multi-image displays, rapid sequential displays (cine loop), and three-dimensional renderings of anatomic 
surfaces, are frequently used. These techniques are often computationally intensive and require the use 
of specialized computer hardware. Interfaces to microprocessor-based workstations are frequently used 
to provide such additional display and analysis capabilities. MRI images are available as digital data; 
therefore, there is considerable utilization of local arena networks (LANs) to distribute information 
throughout the hospital, and long-distance digital transmission of the images can be used for purposes 
of teleradiology. 

Current Trends in MRI 

At present, there is a substantial effort directed at enhancing the capabilities and cost-effectiveness of 
MR imagers. The activities include efforts to reduce the cost of these scanners, improve image quality, 
reduce scan times, and increase the number of useful clinical applications. Examples of these efforts 
include the development of high-field scanners, the advent of MRI-guided therapy, and the development 
of niche scanners that are designed for specific anatomical and clinical applications. Scanners have been 
developed that are dedicated to low-field imaging of the breast and other designs are dedicated to 
orthopedic applications such as the knees, wrists, and elbows. Perhaps the most promising incipient 
application of MRI is to cardiology. Scanners are now being developed to permit studies of cardiac wall 
motion, cardiac perfusion, and the coronary arteries in conjunction with cardiac stress testing. These 
scanners emphasize short magnet configurations to permit close monitoring and interaction with the 
patient, and high strength rapidly switched gradient fields. 

Conventional spin-warp images typically require several minutes to acquire. The fast spin echo (FSE) 
technique can reduce this to the order of 20 s, and gradient-echo techniques can reduce this time to a 
few seconds. The echo-planar technique (EPI) [Cohen and Weisskoff, 1991; Wehrli, 1990] requires 
substantially increased gradient power and receiver bandwidth but can produce images in 40 to 60 ms. 
Scanners with improved gradient hardware that are capable of handling higher data-acquisition rates are 
now available. 

For most of the 1980s and 1990s, the highest field strength commonly used in MRI scanners was 1.5 T. 
To achieve better SNRs, higher-field scanners, operating at fields up to 4 T, were studied experimentally. 
The need for very high-field scanners has been enhanced by the development of functional brain MRI. 
This technique utilizes magnetization differences between oxygenated and deoxygenated hemoglobin, 
and this difference is enhanced at higher field strengths. It has now become possible to construct 3- and 
4-T and even 8-T [Robitaille et al., 1998], whole-body scanners of essentially the same physical size (or 
footprint) as conventional 1.5-T systems. Along with the rapidly increasing clinical interest in functional 
MRI, this is resulting in a considerable increase in the use of high-field systems. 

For the first decade or so after their introduction, MRI scanners were used almost entirely to provide 
diagnostic information. However, there is now considerable interest in systems capable of performing 
image-guided, invasive surgical procedures. Because MRI is capable of providing excellent soft-tissue 
contrast and has the potential for providing excellent positional information with submillimeter accuracy, 
it can be used for guiding biopsies and stereotactic surgery. The full capabilities of MRI-guided procedures 
can only be achieved if it is possible of provide surgical access to the patient simultaneously with the 
MRI scanning. This has lead to the development of new system designs, including the introduction of a 
scanner with a radically modified superconducting magnet system that permits the surgeon to operate 
at the patient's side within the scanner (Fig. 63.13) [Schenck et al., 1995; Black et al., 1997]. These systems 
have lead to the introduction of magnetic field- compatible surgical instruments, anesthesia stations, and 
patient monitoring equipment [Schenck, 1996]. 
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FIGURE 63.13 Open magnet for MEI-guided therapy. This open-geometry superconducting magnet provides a 
surgion with direct patient access and the ability to interactively control the MRI scanner. This permits imaging to 
be performed simultaneously with surgical interventions. 



Defining Terms 

Bandwidth: The narrow frequency range, approximately 32 kHz, over which the MRI signal is trans- 
mitted. The bandwidth is proportional to the strength of the readout gradient field. 

Echo-planar imaging (EPI): A pulse sequence used to produce very fast MRI scans. EPI times can be 
as short as 50 ms. 

Fast Fourier transform (FFT): A mathematical technique used to convert data sampled from the MRI 
signal into image data. This version of the Fourier transform can be performed with particular 
efficiency on modern array processors. 

Gradient coil: A coil designed to produce a magnetic field for which the field component B: varies 
linearly with position. Three gradient coils, one each for the x, y, and z directions, are required 
MRI. These coils are used to permit slice selection and to encode position information into the 
MRI signal. 

Larmor frequency: The rate at which the magnetic dipole moment of a particle precesses in an applied 
magnetic field. It is proportional to the field strength and is 42.58 MHz for protons in a 1-T 
magnetic field. 

Magnetic resonance imaging (MRI): A technique for obtaining images of the internal anatomy based 
on the use of nuclear magnetic resonance signals. During the 1980s, it became a major modality 
for medical diagnostic imaging. 

Nuclear magnetic resonance (NMR): A technique for observing and studying nuclear magnetism. It 
is based on partially aligning the nuclear spins by use of a strong, static magnetic field, stimulating 
these spins with a radiofrequency field oscillating at the Larmor frequency, and detecting the signal 
that is induced at this frequency. 

Nuclear Magnetism: The magnetic properties arising from the small magnetic dipole moments pos- 
sessed by the atomic nuclei of some materials. This form of magnetism is much weaker than the 
more familiar form that originates from the magnetic dipole moments of the atomic electrons. 

Pixel: A single element or a two-dimensional array of image data. 



© 2000 by CRC Press LLC 



Pulse sequence: A series of gradient and radiofrequency pulses used to organize the nuclear spins into 
a pattern that encodes desired imaging information into the NMR signal. 

Quadrature excitation and detection: The use of circularly polarized, rather than linearly polarized, 
radio frequency fields to excite an detect the NMR signal. It provides a means of reducing the 
required excitation power by 1/2 and increasing the signal-to-noise ration by 2. 

Radiofrequency (RF) coil: A coil designed to excite and/or detect NMR signals. These coils can usually 
be tuned to resonate at the Larmor frequency of the nucleus being studied. 

Spin: The property of a particle, such as an electron or nucleus, that leads to the presence of an intrinsic 
angular momentum and magnetic moment. 

Spin-warp imagining: The pulse sequence used in the most common method of MRI imaging. It uses 
a sequence of gradient field pulses to encode position information into the NMR signal and applies 
Fourier transform mathematics to this signal to calculate the image intensity value for each pixel. 

Static magnetic field: The field of the main magnet that is used to magnetize the spins and to drive 
their Larmor precession. 

Voxel: The volume element associated with a pixel. The voxel volume is equal to the pixel area multiplied 
by the slice thickness. 
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Further Information 

There are several journals devoted entirely to MR imaging. These include Magnetic Resonance in Medicine, 
JMRI — Journal of Magnetic Resonance Imaging, and NMR in Biomedicine, all three of which are published 
by Wiley- Liss, 605 Third Avenue, New York, NY 10158. Another journal dedicated to this field is Magnetic 
Resonance Imaging (Elsevier Publishing, 655 Avenue of the Americas, New York, NY 10010). The clinical 
aspects of MRI are covered extensively in Radiology (Radiological Society of North America, 2021 Spring 
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Road, Suite 600, Oak Brook, IL 60521), The American Journal of Radiology (American Roentgen Ray 
Society, 1891 Preston White Drive, Reston, VA 20191), as well as in several other journals devoted to the 
practice of radiology. There is a professional society, now known as the International Society for Magnetic 
Resonance in Medicine (ISMRM), devoted to the medical aspects of magnetic resonance. The main offices 
of this society are at 2118 Milvia, Suite 201, Berkeley, CA 94704. This society holds an annual meeting 
that includes displays of equipment and the presentation of approximately 2800 technical papers on new 
developments in the field. The annual Book of Abstracts of this meeting provides an excellent summary 
of current activities in the field. Similarly, the annual meeting of the Radiological Society of North America 
(RSNA) provides extensive coverage of MRI that is particularly strong on the clinical applications. The 
RSNA is located at 2021 Spring Road, Suite 600, Oak Brook, IL 60521. 

Several book-length accounts of MRI instrumentation and techniques are available. Biomedical Mag- 
netic Resonance Technology (Adam Higler, Bristol, 1989) by Chen and D. I. Hoult, The Physics of MRI 
(Medical Physics Monograph 21, American Institute of Physics, Woodbury, NY, 1993), edited by M. J. 
Bronskill and P. Sprawls, and Electromagnetic Analysis and Design in Magnetic Resonance Imaging (CRC 
Press, Boca Raton, FL, 1998) by J.M. Jin each contain thorough accounts of instrumentation and the 
physical aspects of MRI. There are many books that cover the clinical aspects of MRI. Of particular 
interest are Magnetic Resonance Imaging, 3rd edition (Mosby, St. Louis, 1999), edited by D. D. Stark and 
W. G. Bradley, Jr., and Magnetic Resonance Imaging of the Brain and Spine, 2nd ed. (Lipincott-Raven, 
Philadelphia, 1996), edited by S. W Atlas. 

63.3 Functional MRI 

Kenneth K. Kwong and David A. Cheslei 

Functional magnetic resonance imaging (fMRI), a technique that images intrinsic blood signal change 
with magnetic resonance (MR) imagers, has in the last 3 years become one of the most successful tools 
used to study blood flow and perfusion in the brain. Since changes in neuronal activity are accompanied 
by focal changes in cerebral blood flow (CBF), blood volume (CBV), blood oxygenation, and metabolism, 
these physiologic changes can be used to produce functional maps of mental operations. 

There are two basic but completely different techniques used in fMRI to measure CBF. The first one 
is a classic steady-state perfusion technique first proposed by Detre et al. [1], who suggested the use of 
saturation or inversion of incoming blood signal to quantify absolute blood flow [1-5]. By focusing on 
blood flow change and not just steady-state blood flow, Kwong et al. [6] were successful in imaging brain 
visual functions associated with quantitative perfusion change. There are many advantages in studying 
blood flow change because many common baseline artifacts associated with MRI absolute flow techniques 
can be subtracted out when we are interested only in changes. And one obtains adequate information in 
most functional neuroimaging studies with information of flow change alone. 

The second technique also looks at change of a blood parameter — blood oxygenation change during 
neuronal activity. The utility of the change of blood oxygenation characteristics was strongly evident in 
Turner's work [7] with cats with induced hypoxia. Turner et al. found that with hypoxia, the MRI signal 
from the cats' brains went down as the level of deoxyhemoglobin rose, a result that was an extension of 
an earlier study by Ogawa et al. [8,9] of the effect of deoxyhemoglobin on MRI signals in animals' veins. 
Turner's new observation was that when oxygen was restored, the cats' brain signals climbed up and went 
above their baseline levels. This was the suggestion that the vascular system overcompensated by bringing 
more oxygen, and with more oxygen in the blood, the MRI signal would rise beyond the baseline. 

Based on Turner's observation and the perfusion method suggested by Detre et al., movies of human 
visual cortex activation utilizing both the perfusion and blood oxygenation techniques were successfully 
acquired in May of 1991 (Fig. 63.14) at the Massachusetts General Hospital with a specially equipped 
superfast 1.5-T system known as an echo-planar imaging (EPI) MRI system [10]. fMRI results using 
intrinsic blood contrast were first presented in public at the Tenth Annual Meeting of the Society of 
Magnetic Resonance in Medicine in August of 1991 [6,11]. The visual cortex activation work was carried 
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FIGURE 63.14 Functional MR image demonstrating activation of the primary visual cortex (VI). Image acquired 
on May 9, 1991 with a blood oxygenation-sensitive MRI gradient-echo (GE) technique. 

out with flickering goggles, a photic stimulation protocol employed by Belliveau et al. [12] earlier to 
acquire the MRI functional imaging of the visual cortex with the injection of the contrast agent gado- 
linium-DTPA. The use of an external contrast agent allows the study of change in blood volume. The 
intrinsic blood contrast technique, sensitive to blood flow and blood oxygenation, uses no external 
contrast material. Early model calculation showed that signal due to blood perfusion change would only 
be around 1% above baseline, and the signal due to blood oxygenation change also was quite small. It 
was quite a pleasant surprise that fMRI results turned out to be so robust and easily detectable. 

The blood oxygenation-sensitive MRI signal change, coined blood oxygenation level dependent (BOLD) 
by Ogawa et al. [8,9,13], is in general much larger than the MRI perfusion signal change during brain 
activation. Also, while the first intrinsic blood contrast fMRI technique was demonstrated with a superfast 
EPI MRI system, most centers doing fMRI today are only equipped with conventional MRI systems, 
which are really not capable of applying Detre's perfusion method. Instead, the explosive growth of MR 
functional neuroimaging [14-33] in the last three years relies mainly on the measurement of blood 
oxygenation change, utilizing a MR parameter called T' 2 . Both high speed echo planar (EPI) and con- 
ventional MR have now been successfully employed for functional imaging in MRI systems with magnet 
field strength ranging from 1.5 to 4.0 T 

Advances in Functional Brain Mapping 

The popularity of fMRI is based on many factors. It is safe and totally noninvasive. It can be acquired 
in single subjects for a scanning duration of several minutes, and it can be repeated on the same subjects 
as many times as necessary. The implementation of the blood oxygenation sensitive MR technique is 
universally available. Early neuroimaging, work focused on time-resolved MR topographic mapping of 
human primary visual (VI) (Figs. 63.15, 63.16), motor (MI), somatosensory (SI), and auditory (Al) 
cortices during task activation. Today, with BOLD technique combined with EPI, one can acquire 20 or 
more contiguous brain slices covering the whole head (3x3 mm in plane and 5 mm slice thickness) 
every 3 seconds for a total duration of several minutes. Conventional scanners can only acquire a couple 
of slices at a time. The benefits of whole-head imaging are many. Not only can researchers identify and 
test their hypotheses on known brain activation centers, they can also search for previous unknown or 
unsuspected sites. High resolution work done with EPI has a resolution of 1.5 x 1.5 mm in plane and a 
slice thickness of 3 mm. Higher spatial resolution has been reported in conventional 1.5-T MR systems 
[34]. 
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FIGURE 63.15 Movie of fMRI mapping of primary visual cortex (VI) activation during visual stimulation. Images 
are obliquely aligned along the calcarie fissures with the occipital pole at the bottom. Images were acquired at 3-s 
intervals using a blood oxygenation-sensitive MRI sequence (80 images total). A baseline image acquired during 
darkness ( upper left) was subtracted from subsequent images. Eight of these subtraction images are displayed, chosen 
when the image intensities reached a steady-state signal level during darkness (OFF) and during 8-Hz photic 
stimulation (ON). During stimulation, local increases in signal intensity are detected in the posteromedial regions 
of the occipital lobes along the calcarine fissures. 

Of note with Fig. 63.16 is that with blood oxygenation-sensitive MR technique, one observers an 
undershoot [6,15,35] in signal in VI when the light stimulus is turned off. The physiologic mechanism 
underlying the undershoot is still not well understood. 

The data collected in the last 3 years have demonstrated that fMRI maps of the visual cortex correlate 
well with known retinotopic organization [24,36]. Higher visual regions such as V5/MT [37] and motor- 
cortex organization [6,14,27,38] have been explored successfully. Preoperative planning work (Fig. 63.17) 
using motor stimulation [21,39,40] has helped neurosurgeons who attempt to preserve primary areas 
from tumors to be resected. For higher cognitive functions, several fMRI language studies have already 
demonstrated known language-associated regions [25,26,41,42] (Fig. 63.18). There is more detailed 
modeling work on the mechanism of functional brain mapping by blood-oxygenation change [43-46]. 
Postprocessing techniques that would help to alleviate the serious problem of motion/displacement 
artifacts are available [47]. 

Mechanism 

Flow- sensitive images show increased perfusion with stimulation, while blood oxygenation-sensitive 
images show changes consistent with an increase in venous blood oxygenation. Although the precise 
biophysical mechanisms responsible for the signal changes have yet to be determined, good hypotheses 
exist to account for our observations. 

Two fundamental MRI relaxation rates, T x and T' 2 , are used to describe the fMRI signal. T i is the rate 
at which the nuclei approach thermal equilibrium, and perfusion change can be considered as an 
additional Xj change. T' 2 represents the rate of the decay of MRI signal due to magnetic field inhomo- 
geneities, and the change of T' 2 is used to measure blood-oxygenation change. 
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Time Course of Blood Oxygenation Sensitive Images 
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FIGURE 63.16 Signal intensity changes for a region of interest (-60 mm 2 ) within the visual cortex during darkness 
and during 8-Hz photic stimulation. Results using oxygenation-sensitive ( top graph) and flow-sensitive ( bottom graph) 
techniques are shown. The flow-sensitive data were collected once every 3.5 s, and the oxygenation-sensitive data 
were collected once every 3 s. Upon termination of photic stimulation, an undershoot in the oxygenation-sensitive 
signal intensity is observed. 

T' 2 changes reflect the interplay between changes in cerebral blood flow, volume, and oxygenation. As 
hemoglobin becomes deoxygenated, it becomes more paramagnetic than the surrounding tissue [48] 
and thus creates a magnetically inhomogeneous environment. The observed increased signal on 
T' 2 -weighted images during activation reflects a decrease in deoxyhemoglobin content, i.e., an increase 
in venous blood oxygenation. Oxygen delivery, cerebral blood flow, and cerebral blood volume all increase 
with neuronal activation. Because CBF (and hence oxygen-delivery) changes exceed CBV changes by 2 to 
4 times [49], while blood-oxygen extraction increases only slightly [50,51], the total paramagnetic blood 
deoxyhemoglobin content within brain tissue voxels will decrease with brain activation. The resulting 
decrease in the tissue-blood magnetic susceptibility difference leads to less intravoxel dephasing within 
brain tissue voxels and hence increased signal on Tj-weighted images [6,14,15,17]. These results inde- 
pendently confirm PET observations that activation-induced changes in blood flow and volume are 
accompanied by little or no increases in tissue oxygen consumption [50,51,52]. 

Since the effect of volume susceptibility difference A% is more pronounced at high field strength [53], 
higher- field imaging magnets [17] will increase the observed T' 2 changes. 

Signal changes can also be observed on T, -weighted MR images. The relationship between Tj and 
regional blood flow was characterized by Detre et al. [1]: 
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FIGURE 63.17 Functional MRI mapping of motor cortex for preoperative planning. This three-dimensional ren- 
dering of the brain represents fusion of functional and structural anatomy. Brain is viewed from the top. A tumor 
is shown in the left hemisphere, near the midline. The other areas depict sites of functional activation during 
movement of the right hand, right foot, and left foot. The right foot cortical representation is displaced by tumor 
mass effect from its usual location. (Courtesy of Dr. Brad Buchbinder.) 
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where M is tissue magnetization and M b is incoming blood signal. M is proton density, /is the flow in 
ml/gm/unit time, and A, is the brain-blood partition coefficient of water (-0.95 ml/gm). From this 
equation, the brain tissue magnetization M relaxes with an apparent T, time constant T t given by 



/. 



A. T 



lapp 



1 



(63.15) 



where the T x is the observed (apparent) longitudinal relaxation time with flow effects included. Xj is 
the true tissue longitudinal relaxation time in the absence of flow. If we assume that the true tissue T x 
remains constant with stimulation, a change in blood flow A/will lead to a change in the observed T t : 
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Thus the MRI signal change can be used to estimate the change in blood flow. 
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FIGURE 63.18 Left hemisphere surface rendering of functional data (EPI, gradient-echo, 10 oblique coronal slices 
extending to posterior sylvian fissure) and high-resolution anatomic image obtained on a subject (age 33 years) during 
performance of a same-different (visual matching) task of pairs of words or nonwords (false font strings). Foci of greatest 
activation for this study are located in dominant perisylvian cortex, i.e., inferior frontal gyrus (Broca's area), superior 
temporal gyrus (Wernicke's area), and inferior parietal lobule (angular gyrus). Also active in this task are sensorimotor 
cortex and prefrontal cortex. The perisylvian sites of activation are known to be key nodes in a left hemisphere language 
network. Prefrontal cortex probably plays a more general, modulatory role in attentional aspects of the task. Sensorimotor 
activation is observed in most language studies despite the absence of overt vocalization. (Courtesy of Dr. Randall Benson.) 



From Eq. (63.14), if the magnetization of blood and tissue always undergoes a similar T l relaxation, 
the flow effect would be minimized. This is a condition that can be approximated by using a flow- 
nonsensitive T l technique inverting all the blood coming into the imaged slice of interest. This flow- 
nonsensitive sequence can be subtracted from a flow-sensitive Xj technique to provide an index of CBF 
without the need of external stimulation [54,55] (Fig. 63.19). Initial results with tumor patients show 
that such flow-mapping techniques are useful for mapping out blood flow of tumor regions [55]. 

Other flow techniques under investigation include the continuous inversion of incoming blood at the 
carotid level [ 1 ] or the use of a single inversion pulse at the carotid level (EPIstar) inverting the incoming 
blood [56,57]. Compared with the flow-nonsensitive and flow-sensitive methods, the blood-tagging 
techniques at the carotid level are basically similar concepts except that the MRI signal of tagged blood 
is expected to be smaller by a factor that depends on the time it takes blood to travel from the tagged 
site to the imaged slice of interest [55]. The continuous-inversion technique also has a significant problem 
of magnetization transfer [ 1 ] that contaminates the flow signal with a magnetization transfer signal that 
is several times larger. On the other hand, the advantage of the continuous inversion is that it can under 
optimal conditions provide a flow contrast larger than all the other methods by a factor of e [55]. 

Problem and Artifacts in fMRI: The Brain- Vein Problem? 
The Brain-Inflow Problem? 

The artifacts arising from large vessels pose serious problems to the interpretation of oxygenation sensitive 
fMRI data. It is generally believed that microvascular changes are specific to the underlying region of 
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FIGURE 63.19 Functional MRI cerebral blood flow (CBF) index (right) of a low-flow brain tumor (dark region 
right of the midline) generated by the subtraction of a flow-nonsensitive image from a flow-sensitive image. This 
low-flow region matches well with a cerebral blood volume (CBV) map (left) of the tumor region generated by the 
injection of a bolus of MRI contrast agent Gd-DTPA, a completely different and established method to measure 
hemodynamics with MRI. 

neuronal activation. However, MRI gradient echo (GE) is sensitive to vessels of all dimensions [46,58], 
and there is concern that macrovascular changes distal to the site of neuronal activity can be induced 
[20]. This has been known as the brain-vein problem. For laboratories not equipped with EPI, gradient 
echo (GE) sensitive to variations in T' 2 and magnetic susceptibility are the only realistic sequences available 
for fMRI acquisition, so the problem is particularly acute. 

In addition, there is a non-deoxyhemoglobin-related problem, especially acute in conventional MRI. 
This is the inflow problem of fresh blood that can be time-locked to stimulation [28,29,59]. Such 
nonparenchymal and macrovascular responses can introduce error in the estimate of activated volumes. 

Techniques to Reduce the Large Vessel Problems 

In dealing with the inflow problems, EPI has special advantages over conventional scanners. The use of 
long repetition times (2 to 3 s) in EPI significantly reduces the brain-inflow problem. Small-flip-angle 
methods in conventional MRI scanners can be used to reduce inflow effect [59]. Based on inflow 
modeling, one observes that at an angle smaller than the Ernst angle [60], the inflow effect drops much 
faster than the tissue signal response to activation. Thus one can effectively remove the inflow artifacts 
with small-flip-angle techniques. 

A new exciting possibility is to add small additional velocity-dephasing gradients to suppress slow in- 
plane vessel flow [60,61]. Basically, moving spins lose signals, while stationary spins are unaffected. The 
addition of these velocity-dephasing gradients drops the overall MRI signal (Fig. 63.20). The hypothesis 
that large vessel signals are suppressed while tissue signals remain intact is a subject of ongoing research. 

Another advantage with EPI is that another oxygenation-sensitive method such as the EPI X 2 -weighted 
spin-echo (T2SE) is also available. T2SE methods are sensitive to the MRI parameter T 2 , which is affected 
by microscopic susceptibility and hence blood oxygenation. Theoretically, T2SE methods are far less 
sensitive to large vessel signals [1,6,46,58]. For conventional scanners, T2SE methods take too long to 
perform and therefore are not practical options. 

The flow model [1] based on T l -weighted sequences and independent of deoxyhemoglobin is also not 
so prone to large vessel artifacts, since the T\ model is a model of perfusion at the tissue level. 

Based on the study of volunteers, the average T' 2 -weighted GE signal percentage change at VI was 
2.5 ± 0.8%. The average oxygenation-weighted T2SE signal percentage change was 0.7 ± 0.3%. The 
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FIGURE 63.20 The curves represent time courses of MRI response to photic stimulation (off-on-off-on •••) with 
different levels of velocity-dephasing gradients turned on to remove MRI signals coming from the flowing blood of 
large vessels. The top curve had no velocity-dephasing gradients turned on. The bottom curve was obtained with 
such strong velocity-dephasing gradients turned on that all large vessel signals were supposed to have been eliminated. 
The middle curve represents a moderate amount of velocity-dephasing gradients, a tradeoff between removing large 
vessel signals and retaining a reasonable amount of MRI signal to noise. 

average perfusion-weighted and Tj-weighted MRI signal percentage change was 1.5 ± 0.5%. These results 
demonstrate that T2SE and T, methods, despite their ability to suppress large vessels, are not competitive 
with T' 2 effect at 1.5 T. However, since the microscopic effect detected by T2SE scales up with field strength 
[62], we expect the T2SE to be a useful sequence at high field strength such as 3 or 4 T. Advancing field 
strength also should benefit T, studies due to better signal-to-noise and to the fact that T, gets longer at 
higher field strength. 

While gradient-echo sequence has a certain ambiguity when it comes to tissue versus vessels, its 
sensitivity at current clinical field strength makes it an extremely attractive technique to identify activation 
sites. By using careful paradigms that rule out possible links between the primary activation site and 
secondary sites, one can circumvent many of the worries of "signal from the primary site draining down 
to secondary sites." A good example is as follows: Photic stimulation activates both the primary visual 
cortex and the extrastriates. To show that the extrastriates are not just a drainage from the primary cortex, 
one can utilize paradigms that activate the primary visual cortex but not the extrastriate, and vice versa. 
There are many permutations of this [37] . This allows us to study the higher-order functions umambig- 
uously even if we are using gradient-echo sequences. 

The continuous advance of MRI mapping techniques utilizing intrinsic blood-tissue contrast promises 
the development of a functional human neuroanatomy of unprecedented spatial and temporal resolution. 
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63.4 Chemical-Shift Imaging: An Introduction 
to Its Theory and Practice 

Xiaoping Hu, Wei Chen, Maqbool Patel, and Kamil Ugurbil 

Over the past two decades, there has been a great deal of development in the application of nuclear 
magnetic resonance (NMR) to biomedical research and clinical medicine. Along with the development 
of magnetic resonance imaging [1], in vivo magnetic resonance spectroscopy (MRS) is becoming a 
research tool for biochemical studies of humans as well as a potentially more specific diagnostic tool, 
since it provides specific information on individual chemical species in living systems. Experimental 
studies in animals and humans have demonstrated that MRS can be used to study the biochemical basis 
of disease and to follow the treatment of disease. 

Since biologic subjects (e.g., humans) are heterogeneous, it is necessary to spatially localize the spec- 
troscopic signals to a well-defined volume or region of interest (VOI or ROI, respectively) in the intact 
body. Toward this goal, various localization techniques have been developed (see ref. [2] for a recent 
review). Among these techniques, chemical-shift imaging (CSI) or spectroscopic imaging [3-6] is an 
attractive technique, since it is capable of producing images reflecting the spatial distribution of various 
chemical species of interest. Since the initial development of CSI in 1982 [3], further developments have 
been made to provide better spatial localization and sensitivity, and the technique has been applied to 
numerous biomedical problems. 

In this section we will first present a qualitative description of the basic principles of chemical-shift 
imaging and subsequently present some practical examples to illustrate the technique. Finally, a summary 
is provided in the last subsection. 

General Methodology 

In an NMR experiment, the subject is placed in a static magnetic field B . Under the equilibrium 
condition, nuclear spins with nonzero magnetic moment are aligned along B , giving rise to an induced 
bulk magnetization. To observe the bulk magnetization, it is tipped to a direction perpendicular to B 
(transverse plane) with a radiofrequency (RF) pulse that has a frequency corresponding to the resonance 
frequency of the nuclei. The resonance frequency is determined by the product of the gyromagnetic ratio 
of the nucleus y and the strength of the static field, i.e., yB , and is called the Larmor frequency. The 
Larmor frequency also depends on the chemical environment of the nuclei, and this dependency gives 
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rise to chemical shifts that allow one to identify different chemical species in an NMR spectrum. Upon 
excitation, the magnetization in the transverse plane (perpendicular to the main B field direction) 
oscillates with the Larmor frequencies of all the different chemical species and induces a signal in a 
receiving RF coil; the signal is also termed the free induction decay (FID). The FID can be Fourier 
transformed with respect to time to produce a spectrum in frequency domain. 

In order to localize an NMR signal from an intact subject, spatially selective excitation and/or spatial 
encoding are usually utilized. Selective excitation is achieved as follows: In the excitation, an RF pulse 
with a finite bandwidth is applied in the presence of a linear static magnetic field gradient. With the 
application of the gradient, the Larmor frequency of spins depends linearly on the spatial location along 
the direction of the gradient. Consequently, only the spins in a slice whose resonance frequency falls into 
the bandwidth of the RF pulse are excited. 

The RF excitation rotates all or a portion of the magnetization to the transverse plane, which can be 
detected by a receiving RF coil. Without spatial encoding, the signal detected is the integral of the signals 
over the entire excited volume. In CSI based on Fourier imaging, spatial discrimination is achieved by 
phase encoding. Phase encoding is accomplished by applying a gradient pulse after the excitation and 
before the data acquisition. During the gradient pulse, spins precess at Larmor frequencies that vary 
linearly along the direction of the gradient and accrue a phase proportional to the position along the 
phase-encoding gradient as well as the strength and the duration of the gradient pulse. This acquired 
spatially encoded phase is typically expressed as k- r =\y g(t)-r dt, where y is the gyromagnetic ratio; r 
is the vector designating spatial location; g(t) defines the magnitude, the direction, and the time depen- 
dence of the magnetic field gradient applied during the phase-encoding; and the integration is performed 
over time when the phase-encoding gradient is on. Thus, in one-dimensional phase encoding, if the 
phase encoding is along, for example, the y axis, the phase acquired becomes k x y = } yg(t) x y dt. The 
acquired signal S(t) is the integral of the spatially distributed signals modulated by a spatially dependent 
phase, given by the equation 

S(f)= fp(?,f)e^ "Vr (63.17) 

where p is a function that describes the spatial density and the time evolution of the transverse magne- 
tization of all the chemical species in the sample. This signal mathematically corresponds to a sample of 
the Fourier transform along the direction of the gradient. The excitation and detection process is repeated 
with various phase-encoding gradients to obtain many phase-encoded signals that can be inversely 
Fourier-transformed to resolve an equal number of pixels along this direction. Taking the example of 
one-dimensional phase-encoding along the y axis to obtain a one-dimensional image along this direction 
of n pixels, the phase encoding gradient is incremented n times so that n FIDs are acquired, each of which 
is described as 

S(t,n)=[p*(y,t)e^dy (63.18) 

where p* is already integrated over the x and z directions, and k is the phase-encoding increment; the 
latter is decided on using the criteria that the full field of view undergo a 360-degree phase difference 
when n = 1, as dictated by the sampling theorem. The time required for each repetition (TR), which is 
dictated by the longitudinal relaxation time, is usually on the order of seconds. 

In CSI, phase encoding is applied in one, two, or three dimensions to provide spatial localization. 
Meanwhile, selective excitation also can be utilized in one or more dimensions to restrict the volume to 
be resolved with the phase encodings. For example, with selective excitation in two dimensions, CSI in 
one spatial dimension can resolve voxels within the selected column. In multidimensional CSI, all the 
phase-encoding steps along one dimension need to be repeated for all the steps along the others. Thus, 
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for three dimensions with M, N, and L number of phase encoding steps, one must acquire M x N x L 
number of FIDS: 



Slt,m,n,l\= \plr,t\e K ' a r (63.19) 



where m, n, and I must step through M, N, and L in integer steps, respectively. As a result, the time 
needed for acquiring a chemical-shift image is proportional to the number of pixels desired and may be 
very long. In practice, due to the time limitation as well as the signal-to-noise ratio (SNR) limitation, 
chemical-shift imaging is usually performed with relatively few spatial encoding steps, such as 16 x 16 
or 32 x 32 in a two-dimensional experiment. 

The data acquired with the CSI sequence need to be properly processed before the metabolite infor- 
mation can be visualized and quantitated. The processing consists of spatial reconstruction and spectral 
processing. Spatial reconstruction is achieved by performing discrete inverse Fourier transformation, for 
each of the spatial dimensions, with respect to the phase-encoding steps. The spatial Fourier transform 
is applied for all the points of the acquired FID. For example, for a data set from a CSI in two spatial 
dimensions with 32 x 32 phase-encoding steps and 1024 sampled data points for each FID, a 32 x 32 
two-dimensional inverse Fourier transform is applied to each of the 1024 data points. Although the 
nominal spatial resolution achieved by the spatial reconstruction is determined by the number of phase- 
encoding steps and the field of view (FOV), it is important to note that due to the limited number of 
phase-encoding steps used in most CSI experiments, the spatial resolution is severely degraded by the 
truncation artifacts, which results in signal "bleeding" between pixels. Various methods have been devel- 
oped to reduce this problem [7-14]. 

The localized FIDs derived from the spatial reconstruction are to be further processed by spectral 
analysis. Standard procedures include Fourier transformation, filtering, zero-filling, and phasing. The 
localized spectra can be subsequently presented for visualization or further processed to produce quan- 
titative metabolite information. The presentation of the localized spectra in CSI is not a straightforward 
task because there can be thousands of spectra. In one-dimensional experiments, localized spectra are 
usually presented in a stack plot. In two-dimensional experiments, localized spectra are plotted in small 
boxes representing the extent of the pixels, and the plots can be overlaid on corresponding anatomic 
image for reference. Spectra from three-dimensional CSI experiments are usually presented slice by slice, 
each displaying the spectra as in the two-dimensional case. 

To derive metabolite maps, peaks corresponding to the metabolites of interest need to be quantified. 
In principle, the peaks can be quantified using the standard methods developed for spectral quantification 
[15-17]. The most straightforward technique is to calculate the peak areas by integrating the spectra over 
the peak of interest if it does not overlap with other peaks significantly. In integrating all the localized 
spectra, spectral shift due to B inhomogeneity should be taken into account. A more robust approach 
is to apply spectral fitting programs to each spectrum to obtain various parameters of each peak. The 
fitted area for the peak of interest can then be used to represent the metabolite signal. The peak areas 
are then used to generate metabolite maps, which are images with intensities proportional to the localized 
peak area. The metabolite map can be displayed by itself as a gray-scale image or color-coded image or 
overlaid on a reference anatomic image. 

Practical Examples 

To illustrate the practical utility of CSI, we present two representative CSI studies in this section. The 
sequence for the first study is shown in Fig. 63.21. This is a three-dimensional sequence in which phase 
encoding is applied in all three directions and no slice selection is used. Such a sequence is usually used 
with a surface RF coil whose spatial extent of sensitivity defines the field of view. In this sequence, the 
FID is acquired immediately after the application of the phase-encoding gradient to minimize the decay 
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FIGURE 63.21 Sequence diagram for a three-dimensional chemical shift imaging sequence using a nonselective 
RF pulse. 



of the transverse magnetization, and the sequence is suitable for imaging metabolites with short transverse 
relaxation time (e.g., ATP). 

With the sequence shown in Fig. 63.21, aphosphorus-31 CSI study of the human brain was conducted 
using a quadrature surface coil. A nonselective RF pulse with an Ernest angle (40 degrees) optimized for 
the repetition time was used for the excitation. Phase-encoding gradients were applied for a duration of 
500 lis; the phase-encoding gradients were incremented according to a FOV of 25 x 25 x 20 cm 3 . Phase- 
encoded FIDs were acquired with 1024 complex data points over a sampling window of 204.8 ms; the 
corresponding spectral width was 5000 Hz. To reduce intervoxel signal contamination, a technique that 
utilizes variable data averaging to introduce spatial filtering during the data acquisition for optimal signal- 
to-noise ratio is employed [7-10], resulting in spherical voxels with diameter of 3 cm (15 cc volume). 
The data were acquired with a TR of 1 s, and the total acquisition time was approximately 28 minutes. 

The acquired data were processed to generate three-dimensional voxels, each containing a localized 
phosphorus spectrum, in a 17 x 13 x 17 matrix. In Fig. 63.22a-c, spectra in three slices of the three- 
dimensional CSI are presented; these spectra are overlaid on the corresponding anatomic images obtained 
with a T r weighted imaging sequence. One representative spectrum of the brain is illustrated in 
Fig. 63.22<i, where the peaks corresponding to various metabolites are labeled. It is evident that the 
localized phosphorus spectra contain a wealth of information about several metabolites of interest, 
including adenosine triphosphate (ATP), phosphocreatine (PCr), phosphomonoester (PME), inorganic 
phosphate (P;), and phosphodiester (PDE). In pathologic cases, focal abnormalities in phosphorus 
metabolites have been detected in patients with tumor, epilepsy, and other diseases [18-25]. 

The second study described below is performed with the sequence depicted in Fig. 63.23. This is a 
two-dimensional spin-echo sequence in which a slice is selectively excited by a 90-degree excitation pulse. 
The 180-degree refocusing pulse is selective with a slightly broader slice profile. Here the phase-encoding 
gradients are applied before the refocusing pulse; they also can be placed after the 180-degree pulse or 
split to both sides of the 180-degree pulse. This sequence was used for a proton CSI experiment. In proton 
CSI, a major problem arises from the strong water signal that overwhelms that of the metabolites. In order 
to suppress the water signal, many techniques have been devised [26-29]. In this study, a three-pulse 
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FIGURE 63.22 (A-C) Boxed plot of spectra in three slices from the three-dimensional 31 P CSI experiment overlaid 
on corresponding anatomic images. The spectral extent displayed is from 10 to -20 ppm. A 20-Hz line broadening 
is applied to all the spectra. (D) Representative spectrum from the three-dimensional 31 P CSI shown in (B). Metabolite 
peaks are labeled. 



CHESS [26] technique was applied before the application of the excitation pulse as shown in Fig. 63.23. 
The CSI experiment was performed on a 1.4-cm slice with 32 x 32 phase encodings over a 22 x 22 cm 2 
FOV. The second half of the spin-echo was acquired with 512 complex data points over a sampling 
window of 256 ms, corresponding to a spectral width of 2000 Hz. Each phase-encoding FID was acquired 
twice for data averaging. The repetition time (TR) and the echo time (TE) used were 1.2 s and 136 ms, 
respectively. The total acquisition time was approximately 40 minutes. 
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FIGURE 63.22 (continued) 

Another major problem in proton CSI study of the brain is that the signal from the subcutaneous 
lipid usually is much stronger than those of the metabolites, and this strong signal leaks into pixels within 
the brain due to truncation artifacts. To avoid lipid signal contamination, many proton CSI studies of 
the brain are performed within a selected region of interest excluding the subcutaneous fat [30-34]. 
Recently, several techniques have been proposed to suppress the lipid signal and consequently suppress 
the lipid signal contamination. These include the use of WEFT [27] and the use of outer- volume signal 
suppression [34]. In the example described below, we used a technique that utilizes the spatial location 
of the lipid to extrapolate data in the fc-space to reduce the signal contamination due to truncation [35]. 

In Fig. 63.24, the results from the proton CSI study are presented. In panel (a), the localized spectra 
are displayed. Note that the spectra in the subcutaneous lipid are ignored because they are all off the 
scale. The nominal spatial resolution is approximately 0.66 cc. A spectrum from an individual pixel in 
this study is presented in Fig. 63.24fo with metabolite peaks indicated. Several metabolite peaks, such as 
those corresponding to the N- acetyl aspartate (NAA), creatine/phosphocreatine (Cr/PCr), and choline 
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FIGURE 63.23 A two-dimensional spin-echo CSI sequence with chemical selective water suppression (CHESS) for 
proton study. 



(Cho), are readily identified. In addition, there is still a noticeable amount of residual lipid signal 
contamination despite the use of the data extrapolation technique. Without the lipid suppression tech- 
nique, the brain spectra would be severely contaminated by the lipid signal, making the detection of the 
metabolite peaks formidable. The peak of NAA in these spectra is fitted to generate the metabolite map 
shown in panel (c). Although the metabolite map is not corrected for coil sensitivity and other factors 
and only provides a relative measure of the metabolite concentration in the brain, it is a reasonable 
measure of the NAA distribution in the brain slice. The spatial resolution of the CSI study can be 
appreciated from the brain structure present in the map. In biomedical research, proton CSI is potentially 
the most promising technique, since it provides best sensitivity and spatial resolution. Various in vivo 
applications of proton spectroscopy can be found in the literature [36]. 

Summary 

CSI is a technique for generating localized spectra that provide a wealth of biochemical information that 
can be used to study the metabolic activity of living system and to detect disease associated biochemical 
changes. This section provides an introduction to the technique and illustrates it by two representative 
examples. More specific topics concerning various aspects of CSI can be found in the literature. 
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FIGURE 63.24 (a) Boxed plot of spectra for the two-dimensional proton study overlaid on the anatomic image. A 
spectral range of 1.7 to 3.5 ppm is used in the plot to show Cho, PCr/Cr, and NAA. A 5-Hz line broadening is applied 
in the spectral processing, (b) A representative spectrum from the two-dimensional CSI in panel (a). Peaks corre- 
sponding to Cho, PCr/Cr, and NAA are indicated, (c) A map of the area under the NAA peak obtained by spectral 
fitting. The anatomic image is presented along with the metabolite map for reference. The spatial resolution of the 
metabolite image can be appreciated from the similarities between the two images. The lipid suppression technique 
has successfully eliminated the signal contamination from the lipid in the skull. 
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Barbara Y. Croft 

Nuclear medicine can be defined as the practice of making patients radioactive for diagnostic and ther- 
apeutic purposes. The radioactivity is injected intravenously, rebreathed, or ingested. It is the internal 
circulation of radioactive material that distinguishes nuclear medicine from diagnostic radiology and 
radiation oncology in most of its forms. This section will examine only the diagnostic use and will 
concentrate on methods for detecting the radioactivity from outside the body without trauma to the 
patient. Diagnostic nuclear medicine is successful for two main reasons: (1) It can rely on the use of very 
small amounts of materials (picomolar concentrations in chemical terms) thus usually not having any 
effect on the processes being studied, and (2) The radionuclides being used can penetrate tissue and be 
detected outside the patient. Thus the materials can trace processes or "opacify" organs without affecting 
their function. 

Parameters for Choices in Nuclear Medicine 

Of the various kinds of emanations from radioactive materials, photons alone have a range in tissue great 
enough to escape so that they can be detected externally. Electrons or beta-minus particles of high energy 
can create bremsstrahlung in interactions with tissue, but the radiation emanates from the site of the 
interaction, not the site of the beta ray's production. Positrons or beta-plus particles annihilate with 
electrons to create gamma rays so that they can be detected (see Chap. 67). For certain radionuclides, 
the emanation being detected is x-rays, in the 50- to 100-KeV energy range. 

The half-lives of materials in use in nuclear medicine range from a few minutes to weeks. The half- 
life must be chosen with two major points in mind: the time course of the process being studied and the 
radiation dose to the target organ, i.e., that organ with the highest concentration over the longest time 
(the cumulated activity or area underneath the activity versus time curve). In general, it is desired to stay 
under 5 rad to the target organ. 

The choice of the best energy range to use is also based on two major criteria: the energy that will 
penetrate tissue but can be channeled by heavy metal shielding and collimation and that which will 
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TABLE 64.1 Gamma Ray Detection 



Type of Sample Activity 

Patient samples, e.g., blood, urine 0.001 uCi 



Small organ function <30 cm field of 5-200 uCi 

view at 60 cm distance 
Static image of body part, e.g., liver, 0.2-30 mCi 50-650 keV 

lung 
Dynamic image of body part, e.g., 2-30 mCi 

xenon in airways 
Static tomographic image of body part See Sec. 64.1 



Energy Type of Instrument 

20-5000 keV Gamma counter with annular Nal(TI) 

detector, 1 or 2 PMTs, external Pb shielding 

20-1500 keV 2-4-in. Nal(TI) detector with flared Pb 
collimator 
Rectilinear scanner with focused Pb collimator 



80-300 keV Anger camera and parallel-hole Pb collimator 



interact in the detector to produce a pulse. Thus the ideal energy is dependent on the detector being 
used and the kind of examination being performed. Table 64. 1 describes the kinds of gamma-ray detec- 
tion, the activity and energy ranges, and an example of the kind of information to be gained. The lesser 
amounts of activity are used in situations of lesser spatial resolution and/or of greater sensitivity. Positron 
imaging is omitted because it is treated elsewhere. 

Radiation dose is affected by all the emanations of a radionuclide, not just the desirable ones, thus 
constricting the choice of nuclide further. There can be no alpha radiation used in diagnosis; the use of 
materials with primary beta radiation should be avoided because the beta radiation confers a radiation 
dose without adding to the information being gained. For imaging, in addition, even if there is a primary 
gamma ray in the correct energy window for the detector, there should be no large amount of radiation, 
either of primary radiation of higher energy, because it interferes with the image collimation, or of 
secondary radiation of a very similar energy, because it interferes with the perception of the primary 
radiation emanating from the site of interest. 

For imaging using heavy-metal collimation, the energy range is constrained to be that which will 
emanate from the human body and which the collimation can contain, or about 50 to 500 keV. 

Detectors must be made from materials that exhibit some detectable change when ionizing radiation 
is absorbed and that are of a high enough atomic number and density to make possible stopping large 
percentages of those gamma rays emanating (high sensitivity). In addition, because the primary gamma 
rays are not the only rays emanating from the source — a human body and therefore a distributed source 
accompanied by an absorber — there must be energy discrimination in the instrument to prevent the 
formation of an image of the scattered radiation. To achieve pulse size proportional to energy, and 
therefore to achieve identification of the energy and source of the energy, the detector must be a 
proportional detector. This means that Geiger-Muller detection, operating in an all-or-none fashion, is 
not acceptable. 

Gaseous detectors are not practical because their density is not great enough. Liquid detectors (in 
which any component is liquid) are not practical because the liquid can spill when the detector is 
positioned; this problem can be compensated for if absolutely necessary, but it is better to consider it 
from the outset. Another property of a good detector is its ability to detect large numbers of gamma rays 
per time unit. With detection capabilities to separate 100,000 counts per second or a dead time of 2 lis, 
the system is still only detecting perhaps 1000 counts per square centimeter per second over a 10 x 10 cm 
area. The precision of the information is governed by Poisson statistics, so the imprecision in information 
collected for 1 s in a square centimeter is ±3% at the 1 standard deviation level. Since we would hope 
for better spatial resolution than 1 cm 2 , the precision is obviously worse than this. This points to the 
need for fast detectors, in addition to the aforementioned sensitivity. The more detector that surrounds 
the patient, the more sensitive the system will be. Table 64.2 lists in order from least sensitive to most 
sensitive some of the geometries used for imaging in nuclear medicine. This generally is also a listing 
from the older methods to the more recent. 

For the purposes of this section, we shall consider that the problems of counting patient and other 
samples and of detecting the time course of activity changes in extended areas with probes are not our 
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TABLE 64.2 Ways of Imaging Using Lead Collimation 

Moving probe; rectilinear scanner 

Array of multiple crystals; autofluoroscope, "fly-eye" camera 

Two moving probes: dual-head rectilinear scanner 

Large single-crystal system: Anger camera 

Two crystals on opposite sides of patient for two views using Anger logic 

Large multiple-crystal systems using Anger logic SPECT 

Other possibilities 

topic and confine ourselves to the attempts made to image distributions of gamma-emitting radionuclides 
in patients and research subjects. The previous section treats the three-dimensional imaging of these 
distributions; this section will treat detection of the distribution in a planar fashion or the image of the 
projection of the distribution onto a planar detector. 

Detection of Photon Radiation 

Gamma rays are detected when atoms in a detector are ionized and the ions are collected either directly 
as in gaseous or semiconductor systems or by first conversion of the ionized electrons to light and 
subsequent conversion of the light to electrons in a photomultiplier tube (P-M tube or PMT). In all cases 
there is a voltage applied across some distance that causes a pulse to be created when a photon is absorbed. 

The gamma rays are emitted according to Poisson statistics because each decaying nucleus is indepen- 
dent of the others and has an equal probability of decaying per unit time. Because the uncertainty in the 
production of gamma rays is therefore on the order of magnitude of the square root of the number of 
gamma rays, the more gamma rays that are detected, the less the proportional uncertainty will be. Thus 
sensitivity is a very important issue for the creation of images, since the rays will be detected by area. To 
get better resolution, one must have the numbers of counts and the apparatus to resolve them spatially. 
Having the large numbers of counts also means the apparatus must resolve them temporally. 

The need for energy resolution carries its own burden. Depending on the detector, the energy resolution 
maybe easily achieved or not (Table 64.3). In any case, the attenuation and scattering inside the body means 
that there will be a range of gamma rays emitted, and it will be difficult to tell those scattered through very 
small angles from those not scattered at all. This affects the spatial resolution of the instrument. 

The current practice of nuclear medicine has defined the limits of the amount of activity that can be 
administered to a patient by the amount of radiation dose. Since planar imaging with one detector allows 
only 2 pi detection at best and generally a view of somewhat less because the source is in the patient and 
the lead collimation means that only rays that are directed from the decay toward the crystal will be 
detected, it is of utmost importance to detect every ray possible. To the extent that no one is ever satisfied 
with the resolution of any system and always wishes for better, there is the need to be able to get spatial 
resolution better than the intrinsic 2 mm currently achievable. Some better collimation system, such as 

TABLE 64.3 Detector Substances and Size Considerations, Atomic Number 
of the Attenuator, Energy Resolution Capability 

PMT connected 

Nal(TI): up to 50 cm across; 63; 5-10% 

Plastic scintillators: unlimited; 6; only Compton absorption for gamma rays used in imaging 

CsI(TI): <3 cm x 3 cm; 53, 55; poorer than Nal(TI) 

BiGermanate: <3 cm x 3 cm; 83; poorer than Nal(TI) 
Semiconductors: Liquid nitrogen operation and liquid nitrogen storage 

GeLi: <3 cm x 3 cm; 32; <1% 

SiLi: <3 cm x 3 cm; 14; <1% 
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TABLE 64.4 Calculation of Number of Counts Achieved with Anger Camera 





cpm 


cps 


Activity 


0.001 




mCi/cm 3 






counts/sec 




3.7 x 10 7 


counts/min 


2.22 x 10 9 




271 geometry 


1.11 x 10 9 


1.85 x 10 7 


Attenuated by tissue of 0.12/cm attenuation 






and 3 cm thick 


7.44 x 10 8 


1.29 x 10 7 


X Camera efficiency of 0.0006 


4.64 x 10 5 


7744 


Good uptake in liver = 5 mCi/1000 g = 






0.005 mCi/g 


2.32 x 10" 


3.8 x 10" 


Thyroid uptake of Tc-99m = (2 mCi/37 g) * 






2% = 0.001 mCi/G 


4.6 x 10 5 


7.7 x 10 3 



envisioned in a coincidence detection system like that used in PET, might make it possible to avoid 
stopping so many of the rays with the collimator. 

We have now seen that energy resolution, sensitivity and resolving time of the detector are all bound 
up together to produce the spatial resolution of the instrument as well as the more obvious temporal 
resolution. The need to collimate to create an image rather than a blush greatly decreases the numbers 
of counts and makes Poisson statistics a major determinant of the appearance of nuclear medical images. 

Table 64.4 shows a calculation for the NaI(TI)-based Anger camera showing 0.06% efficiency for the 
detection system. Thus the number of counts per second is not high and so is well within the temporal 
resolving capabilities of the detector system. The problem is the 0.06% efficiency, which is the effect of 
both the crystal thickness being optimized for imaging rather than for stopping all the gamma rays, and 
the lead collimation. Improvements in nuclear medicine imaging resolution can only come if both these 
factors are addressed. 

Various Detector Configurations 

The detectors in clinical nuclear medicine are Nal(TI) crystals. In research applications, other substances 
are employed, but the engineering considerations for the use of other detectors are more complex and 
have been less thoroughly explored (Table 64.3). 

The possibilities for configuring the detectors have been increasing, although the older methods tend 
to be discarded as the new ones are exploited (see Table 64.2). This is in part because each laboratory 
cannot afford to have one of every kind of instrument, although there are tasks for which each one is 
ideally suited. 

The first instruments possible for plane-projection imaging consisted of a moving single crystal probe, 
called a rectilinear scanner. The probe consisted of a detector [beginning with Nal(TI) but later incor- 
porating small semiconductors] that was collimated by a focused lead collimator of appreciable thickness 
(often 2 in. of lead or more) with hole sizes and thicknesses of septa consonant with the intended energy 
and organ size and depth to be imaged. The collimated detector was caused to move across the patient 
at a constant speed; the pulses from the detector were converted to visible signals either by virtue of 
markings on a sheet or of light flashes exposing a film. This detector could see only one spot at a time, 
so only slow temporal changes in activity could be appreciated. A small organ such as the thyroid could 
be imaged in this fashion very satisfactorily. Bone imaging also could be done with the later versions of 
this instrument. 

To enlarge the size of the detector, several probes, each with its own photomultiplier tube and collimator, 
could be used. Versions of this idea were used to create dual-probe instruments to image both sides of the 
patient simultaneously, bars of probes to sweep down the patient and create a combined image, etc. 

To go further with the multiple crystals to create yet larger fields of view, the autofluoroscope (Fig. 64. 1 ) 
combined crystals in a rectangular array. For each to have its own photomultiplier tube required too 
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FIGURE 64.1 The Bender- Blau autofluoroscope is a multicrystal imager with a rectangular array of crystals con- 
nected to PMTs by plastic light guides. There is a PMT for each row of crystals and a PMT for each column of 
crystals, so an N by M array would have (N + M) PMTs. 

many PMTs, so the instrument was designed with a light pipe to connect each crystal with a PMT to 
indicate its row and a second one to indicate its column. The crystals are separated by lead septa to 
prevent scattered photons from one crystal affecting the next. Because of the large number of crystals 
and PMTs, the instrument is very fast, but because of the size of the crystals, the resolution is coarse. To 
improve the resolution, the collimator is often jittered so that each crystal is made to see more than one 
field of view to create a better resolved image. For those dynamic examinations in which temporal 
resolution is more important than spatial resolution, the system has a clear advantage. It has not been 
very popular for general use, however. In its commercial realization, the field of view was not large enough 
to image either lungs or livers or bones in any single image fashion. 

As large Nal(TI) crystals became a reality, new ways to use them were conceived. The Anger camera 
(Fig. 64.2 ) is one of the older of these methods. The idea is to use a single crystal of diameter large enough 
to image a significant part of the human body and to back the crystal by an array of photomultiplier 
tubes to give positional sensitivity. Each PMT is assigned coordinates (Fig. 64.3). When a photon is 
absorbed by the crystal, a number of PMTs receive light and therefore emit signals. The X and Y signal 
values for the emanation are determined by the strength of the signal from each of the tubes and its x 
and y position, and the energy of the emanation (which determines if it will be used to create the image) 
is the sum of all the signals (the .Z pulse). If the discriminator passes the Zpulse, then the Xand Y signals 
are sent to whatever device is recording the image, be it an oscilloscope and film recording system or the 
analog-to-digital (A/D) converters of a computer system. More recently, the A/D conversion is done 
earlier in the system so that the X and Y signals are themselves digital. The Anger camera is the major 
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FIGURE 64.2 Anger camera detector design. This figure shows a cross section through the camera head. The active 
surface is pointed down. Shielding surrounds the assembly on the sides and top. 
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FIGURE 64.3 An array of PMTs in the Anger camera showing the geometric connection between the PMTs and 
the X and Y output. 

instrument in use in nuclear medicine today. It has been optimized for use with the 140-keV radiation 
from Tc-99m, although collimators have been designed for lower and higher energies, as well as optimized 
for higher sensitivity and higher resolution. The early systems used circular crystals, while the current 
configuration is likely to be rectangular or square. 

A combination of the Anger positional logic and the focused collimator in a scanner produced the 
PhoCon instrument, which, because of the design of its collimators, had planar tomographic capabilities 
(the instrument could partially resolve activity in different planes, parallel to the direction of movement 
of the detector). 

Ancillary Electronic Equipment for Detection 

The detectors used in nuclear medicine are attached to preamplifiers, amplifiers, and pulse shapers to 
form a signal that can be examined for information about the energy of the detected photon (Fig. 64.4). 
The energy discriminator has lower and upper windows that are set with reference radionuclides so that 
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FIGURE 64.4 Schematic drawing of a generalized detector system. There would be a high-voltage power supply 
for the detector in an NaI(TI)-PMT detector system. 

typically the particular nuclide in use can be dialed in along with the width of the energy window. A 
photon with an energy that falls in the selected range will cause the creation of a pulse of a voltage that 
falls in between the levels; all other photon energies will cause voltages either too high or too low. If only 
gross features are being recorded, any of the instruments may be used as probe detectors and the results 
recorded on strip-chart recordings of activity versus time. 

The PMT "multiplies" photons (Fig. 64.5) because it has a quartz entrance window which is coated 
to release electrons when it absorbs a light photon and there is a voltage drop; the number of electrons 
released is proportional to the amount of light that hits the coating. The electrons are guided through a 
hole and caused to hit the first dynode, which is coated with a special substance to allow it to release 
electrons when it is hit by an electron. There are a series of dynodes each with a voltage that pulls the 
electrons from the last dynodes toward it. The surface coating not only releases electrons but also 
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FIGURE 64.5 Schematic drawing of a photomultiplier tube (PMT). Each of the dynodes and the anode is connected 
to a separate pin in the tube socket. The inside of the tube is evacuated of all gas. Dynodes are typically copper with 
a special oxidized coating for electron multiplication. 
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multiplies the electron shower. In a cascade through 10 to 12 dynodes, there is a multiplication of 
approximately 10 6 , so that pulses of a few electrons become currents of the order of 10~ 12 amps. The 
PMTs must be protected from other influences, such as stray radioactivity or strong magnetic fields, 
which might cause extraneous electron formation or curves in the electron path. Without the voltage 
drop from one dynode to the next, there is no cascade of electrons and no counting. 

For imaging, the x and y positions of those photons in the correct energy range will be recorded in 
the image because they have a Z pulse. Once the pulse has been accepted and the position determined, 
that position may be recorded to make an image either in analog or digital fashion; a spot may be made 
on an oscilloscope screen and recorded on film or paper, or the position may be digitized and stored in 
a computer file for later imaging on an oscilloscope screen and/or for photography. In general, the 
computers required are very similar to those used for other imaging modalities, except for the hardware 
that allows the acceptance of the pulse. The software is usually specifically created for nuclear medicine 
because of the unique needs for determination of function. 

The calibration of the systems follows a similar pattern, no matter how simple or complex the 
instrument. Most probe detectors must be "peaked," which means that the energy of the radioactivity 
must be connected with some setting of the instrument, often meant to read in kiloelectronvolts. This 
is accomplished by counting a sample with the instrument, using a reasonably narrow energy window, 
while varying the high voltage until the count rate reading is a maximum. The window is then widened 
for counting samples to encompass all the energy peak being counted. The detector is said to be liner if 
it can be set with one energy and another energy can be found where it should be on the kiloelectronvolt 
scale. 

To ensure that the images of the radioactivity accurately depict the distribution in the object being 
imaged, the system must be initialized correctly and tested at intervals. The several properties that must 
be calibrated and corrected are sensitivity, uniformity, energy pulse shape, and linearity. 

These issues are addressed in several ways. The first is that all the PMTs used in an imaging system 
must be chosen to have matched sensitivities and energy spectra. Next, during manufacture, and at 
intervals during maintenance, the PMTs' response to voltage is matched so that the voltage from the 
power supply causes all the tubes to have maximum counts at the same voltage. The sensitivities of all 
the tubes are also matched during periodic maintenance. Prior to operation, usually at the start of each 
working day, the user will check the radioactive peak and then present the instrument with a source of 
activity to give an even exposure over the whole crystal. This uniform "flood" is recorded. The image 
may be used by the instrument for calibration; recalibration is usually performed at weekly intervals. 
The number of counts needed depends on the use the instrument is to be put to, but generally the 
instrument must be tested and calibrated with numbers of counts at least equal to those being emitted 
by the patients and other objects being imaged. 

Because the PMT placement means that placement of the x and y locations is not perfect over the face 
of the crystal but has the effect of creating wiggly lines that may be closer together over the center of the 
PMT and farther apart at the interstices between tubes, the image may suffer from spatial nonlinearity. 
This can be corrected for by presenting the system with a lead pattern in straight-line bars or holes in 
rows and using a hard-wired or software method to position the X and Y signals correctly. This is called 
a linearity correction. In addition, there may be adjustments of the energy spectra of each tube to make 
them match each other so that variations in the number of kiloelectronvolts included in the window 
(created by varying the discriminator settings) will not create variations in sensitivity. This is called an 
energy correction. 

Place of Planar Imaging in Nuclear Medicine Today: 
Applications and Economics 

There are various ways of thinking about diagnostic imaging and nuclear medicine. If the reason for 
imaging the patient is to determine the presence of disease, then there are at least two possible strategies. 
One is to do the most complicated examination that will give a complete set of results on all patients. 
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The other is to start with a simple examination and hope to categorize patients, perhaps into certain 
abnormals and all others, or even into abnormals, indeterminates, and normals. Then a subsequent, 
more complex examination is used to determine if there are more abnormals, or perhaps how abnormal 
they are. If the reason for imaging the patient is to collect a set of data that will be compared with results 
from that patient at a later time and with a range of normal results from all patients, then the least 
complex method possible for collecting the information should be used in a regular and routine fashion 
so that the comparison are possible. 

In the former setting, where the complexity of the examination may have to be changed after the initial 
results are seen, in order to take full advantage of the dose of radioactive material that has been given to 
the patient, it is sensible to have the equipment available that will be able to perform the more complex 
examination and not to confine the equipment available to that capable of doing only the simple first 
examination. For this reason and because for some organs, such as the brain, the first examination is a 
SPECT examination, the new Anger cameras being sold today are mostly capable of doing rotating SPECT. 
The added necessities that SPECT brings to the instrument specifications are of degree: better stability, 
uniformity, and resolution. Thus they do not obviate the use of the equipment for planar imaging but 
rather enhance it. In the setting of performing only the examination necessary to define the disease, the 
Anger SPECT camera can be used for plane projection imaging and afterward for SPECT to further refine 
the examination. There are settings in which a planar camera will be purchased because of the simplicity 
of all the examinations (as in a very large laboratory that can have specialized instruments, a thyroid 
practice, or in the developing countries), but in the small- to medium-sized nuclear medicine practice, 
the new cameras being purchased are all SPECT-capable. 

Nuclear medicine studies are generally less expensive than x-ray computed tomography or magnetic 
resonance imaging and more so than planar x-ray or ultrasound imaging. The general conduct of a 
nuclear medicine laboratory is more complex than these others because of the radioactive materials and 
the accompanying regulations. The specialty is practice both in clinics and in hospitals, but again, the 
complication imposed by the presence of the radioactive materials tips the balance of the practices toward 
the hospital. In that setting the practitioners may be imagers with a broad range of studies offered or 
cardiologists with a concentration on cardiac studies. Thus the setting also will determine what kind of 
instrument is most suitable. 

Defining Terms 

Energy resolution: Full width at half maximum of graph of detected counts versus energy, expressed 
as a percentage of the energy. 

Poisson statistics: Expresses probability in situations of equal probability for an event per unit of time, 
such as radioactive decay or cosmic-ray appearance. The standard deviation of a mean number of 
counts is the square root of the mean number of counts, which is a decreasing fraction of the 
number of counts when expressed as a fraction of the number of counts. 

rad: The unit of radiation energy absorption (dose) in matter, defined as the absorption of 100 ergs 
per gram of irradiated material. The unit is being replaced by the gray, an SI unit, where 1 gray 
(Gy) = 100 rad. 

Further Information 

A good introduction to nuclear medicine, written as a text for technologists, is Nuclear Medicine Tech- 
nology and Techniques, edited by D. R. Bernier, J. K. Langan, and L. D. Wells. A treatment of many of the 
nuclear medicine physics issues is given in L. E. Williams' Nuclear Medicine Physics, published in three 
volumes. Journals that publish nuclear medicine articles include the monthly Journal of Nuclear Medicine, 
the European Journal of Nuclear Medicine, Clinical Nuclear Medicine, IEEE Transactions in Nuclear Science, 
IEEE Transactions in Medical Imaging, and Medical Physics. Quarterly and annual publications include 
Seminars in Nuclear Medicine, Yearbook of Nuclear Medicine, and Nuclear Medicine Annual. 
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The Society of Nuclear Medicine holds an annual scientific meeting that includes scientific papers, 
continuing education which could give the novice a broad introduction, poster sessions, and a large 
equipment exhibition. Another large meeting, devoted to many radiologic specialties is the Radiologic 
Society of North America's annual meeting, held just after Thanksgiving. 

64.2 SPECT (Single-Photon Emission 
Computed Tomography) 

Benjamin M. W. Tsui 

During the last three decades, there has been much excitement in the development of diagnostic radiology. 
The development is fueled by inventions and advances made in a number of exciting new medical imaging 
modalities, including ultrasound (US), x-ray CT (computed tomography), PET (positron emission 
tomography), SPECT (single-photon emission computed tomography), and MRI (magnetic resonance 
imaging). These new imaging modalities have revolutionized the practice of diagnostic radiology, result- 
ing in substantial improvement in patient care. 

Single-photon emission computed tomography (SPECT) is a medical imaging modality that combines 
conventional nuclear medicine (NM) imaging techniques and CT methods. Different from x-ray CT, 
SPECT uses radioactive-labeled pharmaceuticals, i.e., radiopharmaceuticals, that distribute in different 
internal tissues or organs instead of an external x-ray source. The spatial and uptake distributions of the 
radiopharmaceuticals depend on the biokinetic properties of the pharmaceuticals and the normal or 
abnormal state of the patient. The gamma photons emitted from the radioactive source are detected by 
radiation detectors similar to those used in conventional nuclear medicine. The CT method requires 
projection (or planar) image data to be acquired from different views around the patient. These projection 
data are subsequently reconstructed using image reconstruction methods that generate cross-sectional 
images of the internally distributed radiopharmaceuticals. The SPECT images provide much improved 
contrast and detailed information about the radiopharmaceutical distribution as compared with the 
planar images obtained from conventional nuclear medicine methods. 

As an emission computed tomographic (ECT) method, SPECT differs from PET in the types of 
radionuclides used. PET uses radionuclides such as C-ll, N-13, 0-15, and F-18 that emit positrons with 
subsequent emission of two coincident 511 keV annihilation photons. These radionuclides allow studies 
of biophysiologic functions that cannot be obtained from other means. However, they have very short 
half-lives, often requiring an on-site cyclotron for their production. Also, detection of the annihilation 
photons requires expensive imaging systems. SPECT uses standard radionuclides normally found in 
nuclear medicine clinics and which emit individual gamma-ray photons with energies that are much 
lower than 511 keV. Typical examples are the 140-keV photons from Tc-99m and the ~70-keV photons 
from TI-201. Subsequently, the costs of SPECT instrumentation and of performing SPECT are substan- 
tially less than PET. 

Furthermore, substantial advances have been made in the development of new radiopharmaceuticals, 
instrumentation, and image processing and reconstruction methods for SPECT. The results are much 
improved quality and quantitative accuracy of SPECT images. These advances, combined with the 
relatively lower costs, have propelled SPECT to become an increasingly more important diagnostic tool 
in nuclear medicine clinics. 

This section will present the basic principles of SPECT and the instrumentation and image processing 
and reconstruction methods that are necessary to reconstruct SPECT images. Finally, recent advances and 
future development that will continue to improve the diagnostic capability of SPECT will be discussed. 

Basic Principles of SPECT 

Single-photon emission computed tomography (SPECT) is a medical imaging technique that is based 
on the conventional nuclear medicine imaging technique and tomographic reconstruction methods. 
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FIGURE 64.6 The conventional nuclear medicine imaging process. Gamma-ray photons emitted from the internally 
distributed radioactivity may experience photoelectric (a) or scatter (b) interactions. Photons that are not traveling 
in the direction within the acceptance analog of the collimator ( c) will be intercepted by the lead collimator. Photons 
that experience no interaction and travel within the acceptance angle of the collimator will be detected (d). 

General review of the basic principles, instrumentation, and reconstruction technique for SPECT can be 
found in a few review articles [Barrett, 1986; Jaszczak et al., 1980; Jaszczak and Coleman, 1985a; Jaszczak 
and Tsui, 1994]. 

The SPECT Imaging Process 

The imaging process of SPECT can be simply depicted as in Fig. 64.6. Gamma-ray photons emitted from 
the internal distributed radiopharmaceutical penetrate through the patient's body and are detected by a 
single or a set of collimated radiation detectors. The emitted photons experience interactions with the 
intervening tissues through basic interactions of radiation with matter [Evans, 1955]. The photoelectric 
effect absorbs all the energy of the photons and stops their emergence from the patient's body. The other 
major interaction is Compton interaction, which transfers part of the photon energy to free electrons. 
The original photon is scattered into a new direction with reduced energy that is dependent on the scatter 
angle. Photons that escape from the patient's body include those that have not experienced any interac- 
tions and those which have experienced Compton scattering. For the primary photons from the com- 
monly used radionuclides in SPECT, e.g., 140-keV of TC-99m and ~70-keV of TI-201, the probability 
of pair production is zero. 

Most of the radiation detectors used in current SPECT systems are based on a single or multiple 
Nal(TI) scintillation detectors. The most significant development in nuclear medicine is the scintillation 
camera (or Anger camera) that is based on a large-area (typically 40 cm in diameter) Nal(TI) crystal 
[Anger, 1958, 1964]. An array of photomultiplier tubes (PMTs) is placed at the back of the scintillation 
crystal. When a photon hits and interacts with the crystal, the scintillation generated will be detected by 
the array of PMTs. An electronic circuitry evaluates the relative signals from the PMTs and determines 
the location of interaction of the incident photon in the scintillation crystal. In addition, the scintillation 
cameras have built-in energy discrimination electronic circuitry with finite energy resolution that pro- 
vides selection of the photons that have not been scattered or been scattered within a small scattered 
angle. The scintillation cameras are commonly used in commercial SPECT systems. 

Analogous to the lens in an optical imaging system, a scintillation camera system consists of a collimator 
placed in front of the Nal(TI) crystal for the imaging purpose. The commonly used collimator is made 
of a large number of parallel holes separated by lead septa [Anger, 1964; Keller, 1968; Tsui, 1988]. The 
geometric dimensions, i.e., length, size, and shape of the collimator apertures, determine the directions 
of photons that will be detected by the scintillation crystals or the geometric response of the collimator. 
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The width of the geometric response function increases (or the spatial resolution worsens) as the source 
distance from the collimator increases. Photons that do not pass through the collimator holes properly 
will be intercepted and absorbed by the lead septal walls of the collimator. In general, the detection 
efficiency is approximately proportional to the square of the width of the geometric response function 
of the collimator. This tradeoff between detection efficiency and spatial resolution is a fundamental 
property of a typical SPECT system using conventional collimators. 

The amount of radioactivity that is used in SPECT is restricted by the allowable radiation dose to the 
patient. Combined with photon attenuation within the patient, the practical limit on imaging time, and 
the tradeoff between detection efficiency and spatial resolution of the collimator, the number of photons 
that are collected by a SPECT system is limited. These limitations resulted in SPECT images with relatively 
poor spatial resolution and high statistical noise fluctuations as compared with other medical imaging 
modalities. For example, currently a typical brain SPECT image has a total of about 500K counts per image 
slice and a spatial resolution in the order of approximately 8 mm. A typical myocardial SPECT study using 
TI-201 has about 150K total count per image slice and a spatial resolution of approximately 15 mm. 

In SPECT, projection data are acquired from different views around the patient. Similar to x-ray CT, 
image processing and reconstruction methods are used to obtain transaxial or cross-sectional images 
from the multiple projection data. These methods consist of preprocessing and calibration procedures 
before further processing, mathematical algorithms for reconstruction from projections, and compen- 
sation methods for image degradation due to photon attenuation, scatter, and detector response. 

The biokinetics of the radiopharmaceutical used, anatomy of the patient, instrumentation for data 
acquisition, preprocessing methods, image reconstruction techniques, and compensation methods have 
important effects on the quality and quantitative accuracy of the final SPECT images. A full understanding 
of SPECT cannot be accomplished without clear understanding of these factors. The biokinetics of 
radiopharmaceuticals and conventional radiation detectors have been described in the previous section 
on conventional nuclear medicine. The following subsections will present the major physical factors that 
affect SPECT and a summary review of the instrumentation, image reconstruction techniques, and 
compensation methods that are important technological and engineering aspects in the practice of 
SPECT. 

Physical and Instrumentation Factors that Affect SPECT Images 

There are several important physical and instrumentation factors that affect the measured data and 
subsequently the SPECT images. The characteristics and effects of these factors can be found in a few 
review articles [Jaszczak et al., 1981; laszczak and Tsui, 1994; Tsui et al., 1994a, 1994fo]. As described 
earlier, gamma-ray photons that emit from an internal source may experience photoelectric absorption 
within the patient without contributing to the acquired data, Compton scattering with change in direction 
and loss of energy, or no interaction before exiting the patient's body. The exiting photons will be further 
selected by the geometric response of the collimator-detector. The photoelectric and Compton interac- 
tions and the characteristics of the collimator-detector have significant effects on both the quality and 
quantitative accuracy of SPECT image. 

Photon attenuation is defined as the effect due to photoelectric and Compton interactions resulting in 
a reduced number of photons that would have been detected without them. The degree of attenuation 
is determined by the linear attenuation coefficient, which is a function of photon energy and the amount 
and types of materials contained in the attenuating medium. For example, the attenuation coefficient 
for the 140-keV photon emitted from the commonly used Tc-99m in water or soft tissue is 0.15 cm -1 . 
This gives rise to a half-valued-layer, the thickness of material that attenuates half the incident photons, 
or 4.5 cm H 2 for the 140-keV photon. Attenuation is the most important factor that affects the 
quantitative accuracy of SPECT images. 

Attenuation effect is complicated by the fact that within the patient the attenuation coefficient can be 
quite different in various organs. The effect is most prominent in the thorax, where the attenuation 
coefficients range from as low as 0.05 cnr 1 in the lung to as high as 0.18 cirr 1 in the compact bone for the 
140-keV photons. In x-ray CT, the attenuation coefficient distribution is the target for image reconstruction. 
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In SPECT, however, the wide range of attenuation coefficient values and the variations of attenuation 
coefficient distributions among patients are major difficulties in obtaining quantitative accurate SPECT 
images. Therefore, compensation for attenuation is important to ensure good image quality and quan- 
titatively high accuracy in SPECT. Review of different attenuation methods that have been used in SPECT 
is a subject of discussion later in this chapter. 

Photons that have been scattered before reaching the radiation detector provide misplaced spatial 
information about the origin of the radioactive source. The results are inaccurate quantitative information 
and poor contrast in the SPECT images. For radiation detectors with perfect energy discrimination, 
scattered photons can be completely rejected. In a typical scintillation camera system, however, the energy 
resolution is in the order of 10% at 140 keV. With this energy resolution, the ratio of scattered to scattered 
total photons detected by a typical scintillation detector is about 20% to 30% in brain and about 30% 
to 40% in cardiac and body SPECT studies for 140-keV photons. Furthermore, the effect of scatter 
depends on the distribution of the radiopharmaceutical, the proximity of the source organ to the target 
organ, and the energy window used in addition to the photon energy and the energy resolution of the 
scintillation detector. The compensation of scatter is another important aspect of SPECT to ensure good 
image quality and quantitative accuracy. 

The advances in SPECT can be attributed to simultaneous development of new radiopharmaceuticals, 
instrumentation, reconstruction methods, and clinical applications. Most radiopharmaceuticals that are 
developed for conventional nuclear medicine can readily be used in SPECT, and review of these devel- 
opments is beyond the scope of this chapter. Recent advances include new agents that are labeled with 
iodine and technetium for blood perfusion for brain and cardiac studies. Also, the use of receptor agents 
and labeled antibiotics is being investigated. These developments have resulted in radiopharmaceuticals 
with improved uptake distribution, biokinetics properties, and potentially new clinical applications. The 
following subsections will concentrate on the development of instrumentation and image reconstruction 
methods that have made substantial impact on SPECT. 

SPECT Instrumentation 

Review of the advances in SPECT instrumentation can be found in several recent articles [Jaszczak et al., 
1980; Rogers and Ackermann, 1992; Jaszczak and Tsui, 1994] . A typical SPECT system consists of a single 
or multiple units of radiation detectors arranged in a specific geometric configuration and a mechanism 
for moving the radiation detector(s) or specially designed collimators to acquire data from different 
projection views. In general, SPECT instrumentation can be divided into three general categories: 
(1) arrays of multiple scintillation detectors, (2) one or more scintillation cameras, and (3) hybrid 
scintillation detectors combining the first two approaches. In addition, special collimator designs have 
been proposed for SPECT for specific purposes and clinical applications. The following is a brief review 
of these SPECT systems and special collimators. 

Multidetector SPECT System 

The first fully functional SPECT imaging acquisition system was designed and constructed by Kuhl and 
Edwards [Kuhl and Edwards, 1963, 1964, 1968] in the 1960s, well before the conception of x-ray CT As 
shown in Fig. 64.7a, the MARK IV brain SPECT system consisted of four linear arrays of eight discrete 
Nal(TI) scintillation detectors assembled in a square arrangement. Projection data were obtained by 
rotating the square detector array around the patient's head. Although images from the pioneer MARK 
IV SPECT system were unimpressive without the use of proper reconstruction methods that were 
developed in later years, the multidetector design has been the theme of several other SPECT systems 
that were developed. An example is the Gammatom-1 developed by Cho et al. [1982]. The design concept 
also was used in a dynamic SPECT system [Stokely et al., 1980] and commercial multidetector SPECT 
systems marketed by Medimatic, A/S (Tomomatic-32). Recently, the system design was extended to a 
multislice SPECT system with the Tomomatic-896, consisting of 8 layers of 96 scintillation detectors. 
Also, the system allows both body and brain SPECT imaging by varying the aperture size. 
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FIGURE 64.7 Examples of multidetector-based SPECT systems, (a) The MARK IV system consists of four arrays 
of eight individual Nal(TI) detectors arranged in a square configuration. (&) The Headtome-II system consists of a 
circular ring of detectors. A set of collimator vanes that swings in front of the discrete detector is used to collect 
projection data from different views, (c) A unique Cleon brain SPECT system consists of 12 detectors that scan both 
radially and tangentially. 

Variations of the multiple-detectors arrangement have been proposed for SPECT system designs. 
Figure 64.7 b shows the Headtome-II system by Shimadzu Corporation [Hirose et al., 1982], which 
consists of a stationary array of scintillation detectors arranged in a circular ring. Projection data are 
obtained by a set of collimator vanes that swings in front of the discrete detectors. A unique Cleon brain 
SPECT system (see Fig. 64.7c), originally developed by Union Carbide Corporation in the 1970s, consists 
of 12 detectors that scan both radially and tangentially [Stoddart and Stoddart, 1979]. Images from the 
original system were unimpressive due to inadequate sampling, poor axial resolution, and a reconstruc- 
tion algorithm that did not take full advantage of the unique system design and data acquisition strategy. 
A much improved version of the system with a new reconstruction method [Moore et al., 1984] is 
currently marketed by Strichman Corporation. 

The advantages of multidetector SPECT systems are their high sensitivity per image slice and high 
counting rate capability resulting from the array of multidetectors fully surrounding the patient. However, 
disadvantages of multidetector SPECT systems include their ability to provide only one or a few non- 
contiguous cross-sectional image slices. Also, these systems are relatively more expensive compared with 
camera-based SPECT systems described in the next subsection. With the advance of multicamera SPECT 
systems, the disadvantages of multidetector SPECT systems outweigh their advantages. As a result, they 
are less often found in nuclear medicine clinics. 

Camera-Based SPECT Systems 

The most popular SPECT systems are based on single or multiple scintillation cameras mounted on a 
rotating gantry. The successful design was developed almost simultaneously by three separate groups 
[Budinger and Gullberg, 1977; Jaszczak et al, 1977; Keyes et al., 1977]. In 1981, General Electric Medical 
Systems offered the first commercial SPECT system based on a single rotating camera and brought SPECT 
to clinical use. Today, there are over 10 manufacturers (e.g., ADAC, Elscint, General Electric, Hitachi, Picker, 
Siemens, Sopha, Toshiba, Trionix) offering an array of commercial SPECT systems in the marketplace. 

An advantage of camera-based SPECT systems is their use of off-the-shelf scintillation cameras that 
have been widely used in conventional nuclear medicine. These systems usually can be used in both 
conventional planar and SPECT imaging. Also, camera-based SPECT systems allow truly three-dimen- 
sional (3D) imaging by providing a large set of contiguous transaxial images that cover the entire organ 
of interest. They are easily adaptable for SPECT imaging of the brain or body by simply changing the 
radius of rotation of the camera. 

A disadvantage of a camera-based SPECT system is its relatively low counting rate capability. The dead 
time of a typical state-of-the-art scintillation camera gives rise to a loss of 20% of its true counts at about 
80K counts per second. A few special high- count-rate systems give the same count rate loss at about 
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FIGURE 64.8 Examples of camera-based SPECT systems, (a) Single-camera system, (b) Dual-camera system with 
the two cameras placed at opposing sides of patient during rotation, (c) Dual-camera system with the two cameras 
placed at right angles, (d) Triple-camera system, (e) Quadruple-camera system. 

150K counts per second. For SPECT systems using a single scintillation camera, the sensitivity per image 
slice is relative low compared with a typical multidetector SPECT system. 

Recently, SPECT systems based on multiple cameras became increasingly more popular. Systems with 
two [Jaszczak et al., 1979a], three [Lim et al., 1980, 1985], and four cameras provide increased sensitivity 
per image slice that is proportional to the number of cameras. Figure 64.8 shows the system configurations 
of these camera-based SPECT systems. The dual-camera systems with two opposing cameras (Fig. 64.8fc) 
can be used for both whole-body scanning and SPECT, and those with two right-angled cameras 
(Fig. 64.8c) are especially useful for 180-degree acquisition in cardiac SPECT. The use of multicameras 
has virtually eliminated the disadvantages of camera-based SPECT systems as compared with multide- 
tector SPECT systems. The detection efficiency of camera-based SPECT systems can be further increased 
by using converging-hole collimators such as fan, cone, and astigmatic collimators at the cost of a smaller 
field of view. The use of converging-hole collimators in SPECT will be described in a later subsection. 

Novel SPECT System Designs 

There are several special SPECT systems designs that do not fit into the preceding two general categories. 
The commercially available CERESPECT (formerly known as ASPECT) [Genna and Smith, 1988] is a 
dedicated brain SPECT system. As shown in Fig. 64.9a, it consists of a single fixed-annular Nal(TI) crystal 
that completely surrounds the patient's head. Similar to a scintillation camera, an array of PMTs and 
electronics circuitry are placed behind the crystal to provide positional and energy information about 
photons that interact with the crystal. Projection data are obtained by rotating a segmented annular 
collimator with parallel holes that fits inside the stationary detector. A similar system is also being 
developed by Larsson et al. [1991] in Sweden. 




FIGURE 64.9 Examples of novel SPECT system designs, (a) The CERESPECT brain SPECT system consists of a 
single fixed annular Nal(TI) crystal and a rotating segmented annular collimator, (b) The SPRINT II brain SPECT 
system consists of 1 1 detector modules and a rotating lead ring with slit opening. 
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Several unique SPECT systems are currently being developed in research laboratories. They consist of 
modules of small scintillation cameras that surround the patient. The hybrid designs combine the 
advantage of multidetector and camera-based SPECT systems with added flexibility in system configu- 
ration. An example is the SPRINT II brain SPECT system developed at the University of Michigan [Rogers 
et al., 1988]. As shown in Fig. 64.9b, the system consists of 11 detector modules arranged in a circular 
ring around the patient's head. Each detector module consists of 44 one-dimensional bar Nal(TI) 
scintillation cameras. Projection data are required through a series of narrow slit openings on a rotating 
lead ring that fits inside the circular detector assemblies. A similar system was developed at the University 
oflowa [Chang et al., 1990] with 22 detector modules, each consisting of 4 bar detectors. A set of rotating 
focused collimators is used to acquire projection data necessary for image reconstruction. At the Uni- 
versity of Arizona, a novel SPECT system is being developed that consists of 20 small modular scintillation 
cameras [Milster et al., 1990] arranged in a hemispherical shell surrounding the patient's head [Rowe et 
al., 1992]. Projection data are acquired through a stationary hemispherical array of pinholes that are 
fitted inside the camera array. Without moving parts, the system allows acquisition of dynamic 3D SPECT 
data. 

Special Collimator Designs for SPECT Systems 

Similar to conventional nuclear medicine imaging, parallel-hole collimators (Fig. 64.10a) are commonly 
used in camera-based SPECT systems. As described earlier, the tradeoff between detection efficiency and 




(a) 






FIGURE 64.10 Collimator designs used in camera-based SPECT systems, (a) The commonly used parallel-hole 
collimator. ( h) The fan-beam collimator, where the collimator holes are converged to a line that is parallel to the axis 
of rotation, (c) The cone-beam collimator, where the collimator holes are converged to a point, (d) A varifocal 
collimator, where the collimator holes are converged to various focal points. 
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spatial resolution of parallel-hole collimator is a limitating factor for SPECT. A means to improve SPECT 
system performance is to improve the tradeoff imposed by the parallel-hole collimation. 

To achieve this goal, converging-hole collimator designs that increase the angle of acceptance of 
incoming photons without sacrificing spatial resolution have been developed. Examples are fan-beam 
[Jaszczak et al., 1979fo; Tsui et al., 1986], cone-beam [Jaszczak et al., 1987], astigmatic [Hawman and 
Hsieh, 1986], and more recently varifocal collimators. As shown in Fig. 64A0b-d, the collimator holes 
converge to a line that is oriented parallel to the axis of rotation for a fan-beam collimator, to a point 
for a cone-beam collimator, and to various points for a varifocal collimator, respectively. The gain in 
detection efficiency of a typical fan-beam and cone-beam collimator is about 1.5 and 2 times of that of 
a parallel-hole collimator with the same spatial resolution. The anticipated gain in detection efficiency 
and corresponding decrease in image noise are the main reasons for the interest in applying converging- 
hole collimators in SPECT. 

Despite the advantage of increased detection efficiency, the use of converging-hole collimators in 
SPECT poses special problems. The tradeoff for increase in detection efficiency as compared with parallel- 
hole collimators is a decrease in field of view (see Fig. 64.10). Consequently, converging-hole collimators 
are restricted to imaging small organs or body parts such as the head [Jaszczak et al., 1979fo; Tsui et al., 
1986] and heart [Gullberg et al., 1991]. In addition, the use of converging-hole collimators requires 
special data-acquisition strategies and image reconstruction algorithms. For example, for cone-beam 
tomography using a conventional single planar orbit, the acquired projection data become increasingly 
insufficient for reconstructing transaxial image sections that are further away from the central plane of 
the cone-beam geometry. Active research is underway to study special rotational orbits for sufficient 
projection data acquisition and 3D image reconstruction methods specific for cone-beam SPECT. 

Reconstruction Methods 

As discussed earlier, SPECT combines conventional nuclear medicine image techniques and methods for 
image reconstruction from projections. Aside from radiopharmaceuticals and instrumentation, image 
reconstruction methods are another important engineering and technological aspect of the SPECT 
imaging technique. 

In x-ray CT, accurate transaxial images can be obtained through the use of standard algorithms for 
image reconstruction from projections. The results are images of attenuation coefficient distribution of 
various organs within the patient's body. In SPECT, the goal of image reconstruction is to determine the 
distribution of administered radiopharmaceutical in the patient. However, the presence of photon atten- 
uation affects the measured projection data. If conventional reconstruction algorithms are used without 
proper compensation for the attenuation effects, inaccurate reconstructed images will be obtained. Effects 
of scatter and the finite collimator-detector response impose additional difficulties on image reconstruc- 
tion in SPECT. 

In order to achieve quantitatively accurate images, special reconstruction methods are required for 
SPECT. Quantitatively accurate image reconstruction methods for SPECT consist of two major compo- 
nents. They are the standard algorithms for image reconstruction from projections and methods that 
compensate for the image-degrading effects described earlier. Often, image reconstruction algorithms 
are inseparable from the compensation methods, resulting in a new breed of reconstruction method not 
found in other tomographic medical imaging modalities. The following subsections will present the 
reconstruction problem and a brief review of conventional algorithms for image reconstruction from 
projections. Then quantitative SPECT reconstruction methods that include additional compensation 
methods will be described. 

Image Reconstruction Problem 

Figure 64. 11 shows a schematic diagram of the two-dimensional (2D) image reconstruction problem. 
Let/(x, y) represent a 2D object distribution that is to be determined. A one-dimension (ID) detector 
array is oriented at an angle with respect to the x axis of the laboratory coordinates system (x, y). The 
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FIGURE 64.11 Schematic diagram of the two-dimensional image reconstruction problem. The projection data are 
line integrals of the object distribution along rays that are perpendicular to the detector. A source point {x, y) is 
projected onto a point p (t, 9), where t is a position along the projection and 9 is the projection angle. 

data collected into each detector element at location t, called the projection data p(t, 0), is equal to the 
sum of f(x, y) along a ray that is perpendicular to the detector array and intersects the detector at 
position t; that is, 



p[t,e) = c^j(x,y)ds 



(64.1) 



where (5, t) represents a coordinate system with 5 along the direction of the ray sum and t parallel to the 
ID detector array, and c is the gain factor of the detection system. The angle between the s and x axes is 
0. The relationship between the source position (x, y), the projection angle 0, and the position of detection 
on the ID detector array is given by 



t = y cos 9 — x sin 9 



(64.2) 



In 2D tomographic imaging, the ID detector array rotates around the object distribution f(x, y) and 
collects projection data from various projection data from various projection angles 0. The integral 
transform of the object distribution to its projections given by Eq. (64.1) is called the Radon transform 
[Radon, 1917]. The goal of image reconstruction is to solve the inverse Radon transform. The solution 
is the reconstructed image estimate f(x, y) of the object distribution f(x, y). 

In x-ray CT, the measured projection data is given by 



p'[t, 9j = c t I o exp - \l[x, y) ds 



(64.3) 



where I is the intensity of the incident x-ray, \i(x, y) is the 2D attenuation coefficient, and c t is the gain 
factor which transforms x-ray intensity to detected signals. The reconstruction problem can be rewritten as 



p(f, 9) = In 





p'(t, 9) 



|i(x, y\ ds 



(64.4) 
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with the goal to solve for the attenuation coefficient distribution |i(x, y). Also, in x-ray CT, if parallel 
rays are used, the projection data at opposing views are the same, i.e., p(t, 0) = p(Q + n), and projection 
data acquired over 180 degrees will be sufficient for reconstruction. The number of linear samples along 
the ID projection array and angular samples, i.e., the number of projection views, over 180 degrees must 
be chosen carefully to avoid aliasing error and resolution loss in the reconstructed images. 

In SPECT, if the effects of attenuation, scatter, and collimator-detector response are ignored, the 
measured projection data can be written as the integral of radioactivity along the projection rays; that is, 



p(t,Q)=c[ p(x, y)ds (64.5) 



where p(x, y) is the radioactivity concentration distribution of the object, and c e is the gain factor which 
transforms radioactivity concentration to detected signals. Equation (64.5) fits in the form of the Radon 
transform, and similar to x-ray CT, the radioactivity distribution can be obtained by solving the inverse 
Radon transform problem. 

If attenuation is taken into consideration, the attenuated Radon transform [Gullberg, 1979] can be 
written as 



p( f ' e ,) = c J p(x,yjexp - a(u,v)ds 



ds (64.6) 



where \x(u, v) is the 2D attenuation coefficient distribution, and ] {x y) [i{u, v) ds is the attenuation factor 
for photons that originate from (x, y), travel along the direction perpendicular to the detector array, and 
are detected by the collimator-detector. A major difficulty in SPECT image reconstruction lies in the 
attenuation factor, which makes the inverse problem given by EQ. (64.6) difficult to solve analytically. 
However, the solution is important in cardiac SPECT, where the widely different attenuation coefficients 
are found in various organs within the thorax. Also, due to the attenuation factor, the projection views 
at opposing angles are different. Hence full 360-degree projection data are usually necessary for image 
reconstruction in SPECT. 

Different from x-ray CT, small differences in attenuation coefficient are not as important in SPECT. 
When the attenuation coefficient in the body region can be considered constant, the attenuated Radon 
transform given by Eq. (64.6) can be written as [Tretiak and Metz, 1980] 



p[t, e) = c I p(x, y) exp -|x/(x, y) 

*J— ft * 



ds (64.7) 



where [i is the constant attenuation coefficient in the body region, and l(x, y) is the path length between 
the point (x, y) and the edge of the attenuator (or patient's body) along the direction of the projection 
ray. The solution of the inverse problem with constant attenuator has been a subject of several investi- 
gations. It forms the basis for analytical methods for compensation of uniform attenuation described 
later in this chapter. 

When scatter and collimator-detector response are taken into consideration, the assumption that the 
projection data can be represented by line integrals given by Eqs. (64.1) to (64.7) will no longer be exactly 
correct. Instead, the integration will have to include a wider region covering the field of view of the 
collimator-detector (or the collimator-detector response function). The image reconstruction problem 
is further complicated by the nonstationary properties of the collimator-detector and scatter response 
functions and their dependence on the size and composition of the patient's body. 

Algorithms for Image Reconstruction from Projections 

The application of methods for image reconstruction from projections was a major component in the 
development of x-ray CT in the 1970s. The goal was to solve for the inverse Radon transform problem 
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given in Eq. (64.1). There is an extensive literature on these reconstruction algorithms. Reviews of the 
applications of these algorithms to SPECT can be found in several articles [Barrett, 1986; Brooks and Di 
Chiro, 1975, 1976; Budinger and Gullberg, 1974]. 

Simple Backprojection. An intuitive image reconstruction method is simple backprojection. Here, the 
reconstructed image is formed simply by spreading the values of the measured projection data uniformly 
along the projection ray into the reconstructed image array. By backprojecting the measured projection 
data from all projection views, an estimate of the object distribution can be obtained. Mathematically, 
the simple backproject operation is given by 



fix, y\ = V Ay cos 9 ; - x sin 9 ; ,9 ; )A9 (64.8) 



where 9 is the jth projection angle, m is the number of projection views, and A9 is the angular spacing 
between adjacent projections. The simple backprojected image j{x, y) is a poor approximation of the 
true object distribution fix, y). It is equivalent to the true object distribution blurred by a blurring 
function in the form of 1/r. 

There are two approaches for accurate image reconstruction, and both have been applied to SPECT. 
The first approach is based on direct analytical methods and is widely used in commercial SPECT systems. 
The second approach is based on statistical criteria and iterative algorithms. They have been found useful 
in reconstruction methods that include compensation for the image-degrading effects. 

Analytical Reconstruction Algorithms: Filtered Backprojection. The most widely used analytical 
image-reconstruction algorithm is the filtered backprojection (FBP) method, which involves backproject- 
ing the filtered projections [Bracewell and Riddle, 1967; Ramachandran and Lakshminarayanan, 1971]. 
The algorithm consists of two major steps: 

1. Filter the measured projection data at different projection angles with a special function. 

2. Backproject the filtered projection data to form the reconstructed image. 

The first step of the filtered backprojection method can be implemented in two different ways. In the 
spatial domain, the filter operation is equivalent to convolving the measured projection data using a 
special convolving function h{t); that is, 



p'(f,e)=p(f,9)©fc(f) (64.8a) 



where © is the convolution operation. With the advance of fast Fourier transform (FFT) methods, the 
convolution operation can be replaced by a more efficient multiplication in the spatial frequency domain. 
The equivalent operation consists of three steps: 

1. Fourier- transform the measured projection data into spatial frequency domain using the FFT 
method, i.e., P(v, 9) = FT{p(t, 9)}, where FT is the Fourier transform operation. 

2. Multiply the Fourier-transformed projection data with a special function that is equal to the 
Fourier transform of the special function used in the convolution operation described above, i.e., 
P'(v, 9) = P(v, Q)-H{v), where H(v) = FT{/i(x)} is the Fourier transform of h{x). 

3. Inverse Fourier transform the product P'(v, 9) into spatial domain. 

Again, the filtered projections from different projection angles are backprojected to form the recon- 
structed images. 

The solution of the inverse Radon transform given in Eq. (64.1) specifies the form of the special 
function. In the spatial domain, the special function h(x) used in the convolution operation in Eq. (64.8) 
is given by 
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where Ax is the linear sampling interval, and sinc(z) = [sin(z)]/z. The function h(x) consists of a narrow 
central peak with high magnitude and small negative side lobes. It removes the blurring from the 1/r 
function found in the simple backprojected images. 

In the frequency domain, the special function H(v) is equivalent to the Fourier transform of h(x) and 
is a truncated ramp function given by 



where |v| is the ramp function, and 



HI v\= v • recti v) 



(64.10) 
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that is, the rectangular function rect(v) has a value of 1 when the absolute value of v is less than the 
Nyquist frequency at 0.5 cycles per pixel. 

For noisy projection data, the ramp function tends to amplify the high-frequency noise. In these 
situations, an additional smoothing filter is often applied to smoothly roll off the high-frequency response 
of the ramp function. Examples are Hann and Butterworth filters [Huesman et al., 1977]. Also, decon- 
volution filters have been used to provide partial compensation of spatial resolution loss due to the 
collimator- detection response and noise smoothing. Examples are the Metz and Wiener filters (see below). 

Iterative Reconstruction Algorithms. Another approach to image reconstruction is based on statistical 
criteria and iterative algorithms. They were investigated for application in SPECT before the development 
of analytical image reconstruction methods [Gilbert, 1972; Goitein, 1972; Gordon et al., 1970; Kuhl and 
Edwards, 1968]. The major drawbacks of iterative reconstruction algorithms are the extensive computa- 
tions and long processing time required. For these reasons, the analytical reconstruction methods have 
gained widespread acceptance in clinical SPECT systems. In recent years, there has been renewed interest 
in the use of iterative reconstruction algorithms in SPECT to achieve accurate quantitation by compen- 
sating for the image-degrading effects. 

A typical iterative reconstruction algorithm starts with an initial estimate of the object source distribution. 
A set of projection data is estimated from the initial estimate using a projector that models the imaging 
process. The estimated projection data are compared with the measured projection data at the same 
projection angles, and their differences are calculated. Using an algorithm derived from specific statistical 
criteria, the differences are used to update the initial image estimate. The updated image estimate is then 
used to recalculate a new set of estimated projection data that are again compared with the measured 
projection data. The procedure is repeated until the difference between the estimated and measured pro- 
jection data are smaller than a preselected small value. Statistical criteria that have been used in formulating 
iterative reconstruction algorithms include the minimum mean squares error (MMSE) [Budinger and 
Gullberg, 1977], weighted least squares (WLS) [Huesman et al., 1977], maximum entropy (ME) [Minerbo, 
1979], maximum likelihood (ML) [Shepp andVardi, 1982], and maximum a posteriori approaches [Barrett, 
1986; Geman and McClure, 1985; Johnson et al, 1991; Levitan and Herman, 1987; Liang and Hart, 1987]. 
Iterative algorithms that have been used in estimating the reconstructed images include the conjugate 
gradient (CG) [Huesman et al., 1977] and expectation maximization (EM) [Lange and Carson, 1984]. 

Recently, interest in the application of iterative reconstruction algorithms in SPECT has been revital- 
ized. The interest is sparked by the need to compensate for the spatially variant and/or nonstationary 
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image-degrading factors in the SPECT imaging process. The compensation can be achieved by modeling 
the imaging process that includes the image-degrading factors in the projection and backprojection 
operations of the iterative steps. The development is aided by advances made in computer technology 
and custom-dedicated processors. The drawback of long processing time in using these algorithms is 
substantially reduced. Discussion of the application of iterative reconstruction algorithms in SPECT will 
be presented in a later subsection. 

Compensation Methods 

For a typical SPECT system, the measured projection data are severely affected by attenuation, scatter, and 
collimator-detector response. Direct reconstruction of the measured projection data without compensation 
of these effects produces images with artifacts, distortions, and inaccurate quantitation. In recent years, 
substantial efforts have been made to develop compensation methods for these image-degrading effects. 
This development has produced much improved quality and quantitatively accurate reconstructed images. 
The following subsections will present a brief review of some of these compensation methods. 

Compensation for Attenuation. Methods for attenuation compensation can be grouped into two 
categories: (1) methods that assume the attenuation coefficient is uniform over the body region, and 
(2) methods that address situations of nonuniform attenuation coefficient distribution. The assumption 
of uniform attenuation can be applied to SPECT imaging of the head and abdomen regions. The 
compensation methods seek to solve for the inverse of the attenuated Radon transform given in Eq. (64.7). 
For cardiac and lung SPECT imaging, nonuniform attenuation compensation methods must be used 
due to the very different attenuation coefficient values in various organs in the thorax. Here, the goal is 
to solve the more complicated problem of the inverse of the attenuated Radon transform in Eq. (64.6). 

There are several approximate methods for compensating uniform attenuation. They include methods 
that preprocess the projection data or postprocess the reconstructed image. The typical preprocess methods 
are those which use the geometric or arithmetic mean [Sorenson, 1974] of projections from opposing 
views. These compensation methods are easy to implement and work well with a single, isolated source. 
However, they are relatively inaccurate for more complicated source configurations. Another method 
achieves uniform attenuation compensation by processing the Fourier transform of the sinogram [Bellini 
et al., 1979]. The method provides accurate compensation even for complicated source configurations. 

A popular compensation method for uniform attenuation is that proposed by Chang [1978]. The 
method requires knowledge of the body contour. The information is used in calculating the average 
attenuation factor at each image point from all projection views. The array of attenuation factors is used 
to multiply the reconstructed image obtained without attenuation compensation. The result is the 
attenuation-compensated image. An iterative scheme also can be implemented for improved accuracy. 
In general, the Chang method performs well for uniform attenuation situations. However, the noise level 
in the reconstructed images increases with iteration number. Also, certain image features tend to fluctuate 
as a function of iteration. For these reasons, no more than one or two iterations are recommended. 

Another class of methods for uniform attenuation compensation is based on analytical solution of the 
inverse of the attenuation Radon transform given in Eq. (64.7) for a convex-shaped medium [Gullberg 
and Budinger, 1981; Tretiak and Metz, 1980]. The resultant compensation method involves multiplying 
the projection data by an exponential function. Then the FBP algorithm is used in the image reconstruc- 
tion except that the ramp filter is modified such that its value is zero in the frequency range between 
and \i/2n, where u is the constant attenuation coefficient. The compensation method is easy to imple- 
ment and provides good quantitative accuracy. However, it tends to amplify noise in the resulting image, 
and smoothing is required to obtain acceptable image quality [Gullberg and Budinger, 1981]. 

An analytical solution for the more complicated inverse attenuated Radon transform with nonuniform 
attenuation distribution [Eq. (64.6)] has been found difficult [Gullberg, 1979]. Instead, iterative 
approaches have been used to estimate a solution of the problem. The application is especially important 
in cardiac and lung SPECT studies. The iterative methods model the attenuation distribution in the 
projection and backprojection operations [Manglos et al., 1987; Tsui et al., 1989]. The ML criterion with 
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the EM algorithm [Lange and Carson, 1984] has been used with success [Tsui et al., 1989]. The compen- 
sation method requires information about the attenuation distribution of the region to be imaged. Recently, 
transmission CT methods are being developed using existing SPECT systems to obtain attenuation dis- 
tribution from the patient. The accurate attenuation compensation of cardiac SPECT promises to provide 
much improved quality and quantitative accuracy in cardiac SPECT images [Tsui et al., 1989, 1994a]. 

Compensation for Scatter. As described earlier in this chapter, scattered photons carry misplaced 
positional information about the source distribution resulting in lower image contrast and inaccurate 
quantitation in SPECT images. Compensation for scatter will improve image contrast for better image 
quality and images that will more accurately represent the true object distribution. Much research has 
been devoted to develop scatter compensation methods that can be grouped into two general approaches. 
In the first approach, various methods have been developed to estimate the scatter contribution in the 
measured data. The scatter component is then subtracted from the measured data or from the recon- 
structed images to obtain scatter-free reconstructed images. The compensation method based on this 
approach tends to increase noise level in the compensated images. 

One method estimates the scatter contribution as a convolution of the measured projection data with 
an empirically derived function [Axelsson et al., 1984]. Another method models the scatter component 
as the convolution of the primary (or unscattered) component of the projection data with an exponential 
function [Floyd et al., 1985]. The convolution method is extended to 3D by estimating the 2D scatter 
component [Yanch et al., 1988]. These convolution methods assume that the scatter response function 
is stationary, which is only an approximation. 

The scatter component also has been estimated using two energy windows acquisition methods. One 
method estimates the scatter component in the primary energy window from the measured data obtained 
from a lower and adjacent energy window [Jaszczak et al., 1984, 1985b]. In a dual photopeak window 
(DPW) method, two nonoverlapping windows spanning the primary photopeak window are used [King 
et al., 1992]. This method provides more accurate estimation of the scatter response function. 

Multiple energy windows also have been used to estimate the scatter component. One method uses 
two satellite energy windows that are placed directly above and below the photopeak window to estimate 
the scatter component in the center window [ Ogawa et al. , 1 99 1 ] . In another method, the energy spectrum 
detected at each image pixel is used to predict the scatter contribution [Koral et al., 1988]. An energy- 
weighted acquisition (EWA) technique acquires data from multiple energy windows. The images recon- 
structed from these data are weighted with energy- dependent factors to minimize scatter contribution 
to the weighted image [DeVito et al., 1989; DeVito and Hamill, 1991]. Finally, the holospectral imaging 
method [Gagnon et al., 1989] estimates the scatter contribution from a series of eigenimages derived 
from images reconstructed from data obtained from a series of multiple energy windows. 

In the second approach, the scatter photons are utilized in estimating the true object distribution. 
Without subtracting the scatter component, the compensated images are less noisy than those obtained 
from the first approach. In one method, an average scatter response function can be combined with the 
geometric response of the collimator-detector to form the total response of the imaging system [Gilland 
et al., 1988; Tsui et al., 1994a]. The total response function is then used to generate a restoration filter 
for an approximate geometric and scatter response compensation (see below). 

Another class of methods, characterizes the exact scatter response function and incorporates it into 
iterative reconstruction algorithms for accurate compensation for scatter [Floyd et al., 1985; Frey and 
Tsui, 1992]. Since the exact scatter response functions are nonstationary and are asymmetric in shape, 
implementation of the methods requires extensive computations. However, efforts are being made to 
parameterize the scatter response function and to optimize the algorithm for substantial reduction in 
processing time [Frey et al., 1993; Frey and Tsui, 1991]. 

Compensation for Collimator-Detector Response. As described earlier, for a typical collimator-detector, 
the response function broadens as the distance from the collimator face increases. The effect of the 
collimator-detector response is loss of spatial resolution and blurring of fine detail in SPECT images. 
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Also, the spatially variant detector response function will cause nonisotropic point response in SPECT 
images [Knesaureket al., 1989; Maniawski et al., 1991]. The spatially variant collimator- detector response 
is a major difficulty in its exact compensation. 

By assuming an average and stationary collimator-detector response function, restoration filters can 
be used to provide partial and approximate compensation for the effects of the collimator-detector. 
Examples are the Metz [King et al., 1984, 1986] and Wiener [Penney et al., 1990] filters, where the inverse 
of the average collimator-detector response function is used in the design of the restoration filters. Two- 
dimensional (2D) compensation is achieved by applying the ID restoration filters to the ID projection 
data, and 3D compensation by applying the 2D filters to the 2D projection images [Tsui et al., 1994&]. 

Analytical methods have been developed for compensation of the spatially variant detector response. 
A spatially variant filtering method has been proposed which is based on the frequency distance principle 
(FDP) [Edholm et al., 1986; Lewitt et al., 1989]. The method has been shown to provide an isotropic 
point response function in phantom SPECT images [Glick et al., 1993]. 

Iterative reconstruction methods also have been used to accurately compensate for both nonuniform 
attenuation and collimator-detector response by modeling the attenuation distribution and spatially 
variant detector response function in the projection and backprojection steps. The compensation meth- 
ods have been applied in 2D reconstruction [Formiconi et al., 1990; Tsui et al., 1988], and more recently 
in 3D reconstruction [Tsui et al., 1994b; Zeng et al., 1991]. It has been found that the iterative recon- 
struction methods provide better image quality and more accurate quantitation when compared with 
the conventional restoration filtering techniques. Furthermore, 3D compensation outperforms 2D com- 
pensation at the expense of more extensive computations [Tsui et al., 1994b]. 

Sample SPECT Images 

This subsection presents sample SPECT images to demonstrate the performance of various reconstruction 
and compensation methods. Two data sets were used. The first set was acquired from a 3D physical 
phantom that mimics a human brain perfusion study. The phantom study provided knowledge of the 
true radioactivity distribution for evaluation purposes. The second data set was obtained from a patient 
myocardial SPECT study using thallium-201. 

Figure 64.12a shows the radioactivity distribution from a selected slice of a 3D brain phantom man- 
ufactured by the Data Spectrum Corporation. The phantom design was based on PET images from a 
normal patient to simulate cerebral blood flow [Hoffman et al., 1990] . The phantom was filled with water 
containing 74 mBq of Tc-99m. A single-camera-based GE 400AC/T SPECT system fitted with a high- 
resolution collimator was used for data collection. The projection data were acquired into 128 x 128 
matrices at 128 views over 360 degrees. Figure 64.12fe shows the reconstructed image obtained from the 
FBP algorithm without any compensation. The poor image quality is due to statistical noise fluctuations, 
effects of attenuation (especially at the central portion of the image), loss of spatial resolution due to the 
collimator-detector response, and loss of contrast due to scatter. 

Figure 64.12c shows the reconstructed image obtained with the application of a noise-smoothing filter 
and compensation for the uniform attenuation and scatter. The resulting image has lower noise level, 
reduced attenuation effect, and higher contrast as compared with the image shown in Fig. 64.12k 
Figure 64.12a' is similar to Fig. 64.12c except for an additional application of a Metz filter to partially 
compensate for the collimator-detector blurring. Figure 64.12e shows the reconstructed image obtained 
from the iterative ML-EM algorithm that accurately modeled the attenuation and spatially variant 
detector response. The much superior image quality is apparent. 

Figure 64.13a shows a selected FBP reconstructed transaxial image slice from a typical patient myocardial 
SPECT study using thallium-201. Figure 64.13fo shows the reconstructed image obtained from the Chang 
algorithm for approximate nonuniform attenuation compensation and 2D processing using a Metz filter 
for approximate compensation for collimator-detector response. Figure 64.13c shows the reconstructed 
image obtained from the iterative ML-EM algorithm using a measured transmission CT image for 
accurate attenuation compensation and a 2D model of the collimator-detector response for accurate 
collimator-detector response compensation. The reconstructed image in Fig. 64.13d is similar to that in 
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FIGURE 64.12 Sample images from a phantom SPECT study, (a) Radioactivity distribution from a selected slice 
of a 3D brain phantom, (b) Reconstructed image obtained from the FBP algorithm without any compensation. 
(c) Reconstructed image obtained with the application of noise-smoothing filter and compensation for uniform 
attenuation and scatter, (d) Similar to (c) except for an additional application of a Metz filter to partially compensate 
for the collimator-detector blurring, (e) Reconstructed image similar to that obtained from the iterative ML-EM 
algorithm that accurately models the attenuation and spatially variant detector response. (From Tsui BMW, Frey EC, 
Zhao X-D, et al. 1994. Reprinted with permission.) 
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FIGURE 64. 1 3 Sample images from a patient myocardial SPECT study using TI-20 1 . ( a) A selected transaxial image 
slice from a typical patient myocardial SPECT study using TI-20 1. The reconstructed image was obtained with the 
FBP algorithm without any compensation, (h) Reconstructed image obtained from the Chang algorithm for approx- 
imate nonuniform attenuation compensation and 2D processing using a Metz filter for approximate compensation 
for collimator-detector response, (c) Reconstructed image obtained from the iterative ML-EM algorithm using a 
measured transmission CT image for accurate attenuation compensation and 2D model of the collimator-detector 
response for accurate collimator-detector response compensation, (d) Similar to (b) except that the Metz filter was 
implemented in 3D. (e) Similar to (c) except that a 3D model of the collimator-detector response is used. 
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Fig. 64. 13fo except that the Metz filter was implemented in 3D. Finally, the reconstructed image in Fig. 64.13e 
is similar to that in Fig. 64.13c except that a 3D model of the collimator-detector response is used. The 
superior image quality obtained from using an accurate 3D model of the imaging process is evident. 

Discussion 

The development of SPECT has been a combination of advances in radiopharmaceuticals, instrumenta- 
tion, image processing and reconstruction methods, and clinical applications. Although substantial 
progress has been made during the last decade, there are many opportunities for contributions from 
biomedical engineering in the future. 

The future SPECT instrumentation will consist of more detector area to fully surround the patient 
for high detection efficiency and multiple contiguous transaxial slice capability. Multicamera SPECT 
systems will continue to dominate the commercial market. The use of new radiation detector materials 
and detector systems with high spatial resolution will receive increased attention. Continued research is 
needed to investigate special converging-hole collimator design geometries, fully 3D reconstruction 
algorithms, and their clinical applications. 

To improve image quality and to achieve quantitatively accurate SPECT images will continue to be 
the goals of image processing and image reconstruction methods for SPECT. An important direction of 
research in analytical reconstruction methods will involve solving the inverse Radon transform, which 
includes the effects of attenuation, the spatially variant collimator-detector response function, and scatter. 
The development of iterative reconstruction methods will require more accurate models of the complex 
SPECT imaging process, faster and more stable iterative algorithms, and more powerful computers and 
special computational hardware. 

These improvements in SPECT instrumentation and image reconstruction methods, combined with 
newly developed radiopharmaceuticals, will bring SPECT images with increasingly higher quality and 
more accurate quantitation to nuclear medicine clinics for improved diagnosis and patient care. 
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65.1 Transducers 



Richard L. Goldberg and Stephen W. Smith 

An ultrasound transducer generates acoustic waves by converting magnetic, thermal, and electrical energy 
into mechanical energy. The most efficient technique for medical ultrasound uses the piezoelectric effect, 
which was first demonstrated in 1880 by Jacques and Pierre Curie [Curie and Curie, 1880] . They applied 
a stress to a quartz crystal and detected an electrical potential across opposite faces of the material. The 
Curies also discovered the inverse piezoelectric effect by applying an electric field across the crystal to 
induce a mechanical deformation. In this manner, a piezoelectric transducer converts an oscillating 
electric signal into an acoustic wave, and vice versa. 

Many significant advances in ultrasound imaging have resulted from innovation in transducer tech- 
nology. One such instance was the development of linear-array transducers. Previously, ultrasound 
systems had made an image by manually moving the transducer across the region of interest. Even the 
faster scanners had required several seconds to generate an ultrasound image, and as a result, only static 
targets could be scanned. On the other hand, if the acoustic beam could be scanned rapidly, clinicians 
could visualize moving targets such as a beating heart. In addition, real-time imaging would provide 
instantaneous feedback to the clinician of the transducer position and system settings. 

To implement real-time imaging, researchers developed new types of transducers that rapidly steer 
the acoustic beam. Piston-shaped transducers were designed to wobble or rotate about a fixed axis to 
mechanically steer the beam through a sector-shaped region. Linear sequential arrays were designed to 
electronically focus the beam in a rectangular image region. Linear phased-array transducers were designed 
to electronically steer and focus the beam at high speed in a sector image format. 

This section describes the application of piezoelectric ceramics to transducer arrays for medical 
ultrasound. Background is presented on transducer materials and beam steering with phased arrays. 
Array performance is described, and the design of an idealized array is presented. 
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TABLE 65.1 Maerial Properties of Linear-Array Elements Made 
of PZT-5H 



Parameter 


Symbol 


Value 


Units 


Density 


P 


7500 


kg/m 3 


Speed of sound 


c 


3970 


m/s 


Acoustic impedance 


Z 


29.75 


MRayls 


Relative dielectric constant 


e/e 


1475 


None 


Electromechanical coupling coefficient 


k 


0.698 


None 


Mechanical loss tangent 


tan 8„ 


0.015 


None 


Electrical loss tangent 


tan 8 e 


0.02 


None 



Transducer Materials 

Ferroelectric materials strongly exhibit the piezoelectric effect, and they are ideal materials for medical 
ultrasound. For many years, the ferroelectric ceramic lead-zirconate-titanate (PZT) has been the standard 
transducer material for medical ultrasound, in part because of its high electromechanical conversion 
efficiency and low intrinsic losses. The properties of PZT can be adjusted by modifying the ratio of 
zirconium to titanium and introducing small amounts of other substances, such as lanthanum [Berlin- 
court, 1971]. Table 65.1 shows the material properties of linear-array elements made from PZT-5H. 

PZT has a high dielectric constant compared with many piezoelectric materials, resulting is favorable 
electrical characteristics. The ceramic is mechanically strong, and it can be machined to various shapes 
and sizes. PZT can operate at temperatures up to 100°C or higher, and it is stable over long periods of time. 

The disadvantages of PZT include its high acoustic impedance (Z = 30 MRayls) compared with body 
tissue (Z = 1.5 MRayls) and the presence of lateral modes in array elements. One or more acoustic 
matching layers can largely compensate for the acoustic impedance mismatch. The effect of lateral modes 
can be diminished by choosing the appropriate element dimensions or by subdicing the elements. 

Other piezoelectric materials are used for various applications. Composites are made from PZT 
interspersed in an epoxy matrix [Smith, 1992]. Lateral modes are reduced in a composite because of its 
inhomogeneous structure. By combining the PZT and epoxy in different ratios and spatial distributions, 
one can tailor the composite's properties for different applications. Polyvinylidene difluoride (PVDF) is 
a ferroelectric polymer that has been used effectively in high-frequency transducers [Sherar and Foster, 
1989]. The copolymer of PVDF with trifluoroethylene has an improved electromechanical conversion 
efficiency. Relaxor ferroelectric materials, such as lead-magnesium-niobate (PMN), become piezoelectric 
when a large direct-current (dc) bias voltage is applied [Takeuchi et al., 1990]. They have a very large 
dielectric constant (e > 20,000e ), resulting in higher transducer capacitance and a lower electrical 
impedance. 

Scanning with Array Transducers 

Array transducers use the same principles as acoustic lenses to focus an acoustic beam. In both cases, 
variable delays are applied across the transducer aperture. With a sequential or phased array, however, 
the delays are electronically controlled and can be changed instantaneously to focus the beam in different 
regions. Linear arrays were first developed for radar, sonar, and radio astronomy [Allen, 1964; Bobber, 
1970], and they were implemented in a medical ultrasound system by Somer in 1968 [Somer, 1968]. 

Linear-array transducers have increased versatility over piston transducers. Electronic scanning 
involves no moving parts, and the focal point can be changed dynamically to any location in the scanning 
plane. The system can generate a wide variety of scan formats, and it can process the received echoes for 
other applications, such as dynamic receive focusing [von Ramm and Thurstone, 1976], correction for 
phase aberrations [Flax and O'Donnell, 1988; Trahey et al., 1990], and synthetic aperture imaging [Nock 
andTrahey, 1992]. 



© 2000 by CRC Press LLC 



array 

excitation signal elements 





ii 


1 | 




n 








2 | 










n 


3 | 










n 


4 | 










n 


__5_| 










n 


6 | 







(a) 



time 




summation 



(b) 



delayed signal delays 



-V- 



-V- 



-V- 



-V- 



-V- 



2 



*4 



echo signal 
V- 



array 
elements 



-%- 



-V- 



-V- 



-v- 



-v- 




receive 
focus #2 



FIGURE 65.1 Focusing and steering an acoustic beam using a phased array. A 6-element linear array is shown 
(a) in the transmit mode and (b) in the receive mode. Dynamic focusing in receive allows the scanner focus to track 
the range of returning echoes. 

The disadvantages of linear arrays are due to the increased complexity and higher cost of the trans- 
ducers and scanners. For high-quality ultrasound images, many identical array elements are required 
(currently 128 and rising). The array elements are typically less than a millimeter on one side, and each 
has a separate connection to its own transmitter and receiver electronics. 

The widespread use of array transducers for many applications indicates that the advantages often 
outweigh the disadvantages. In addition, improvement in transducer fabrication techniques and inte- 
grated circuit technology have led to more advanced array transducers and scanners. 

Focusing and Steering with Phased Arrays 

This subsection describes how a phased-array transducer can focus and steer an acoustic beam along a 
specific direction. An ultrasound image is formed by repeating this process over 100 times to interrogate 
a two- (2D) or three-dimensional (3D) region of the medium. 

Figure 65.1a illustrates a simple example of a six-element linear array focusing the transmitted beam. 
One can assume that each array element is a point source that radiates a spherically shaped wavefront 
into the medium. Since the top element is farthest from the focus in this example, it is excited first. The 
remaining elements are excited at the appropriate time intervals so that the acoustic signals from all the 
elements reach the focal point at the same time. According to Huygens' principle, the net acoustic signal 
is the sum of the signals that have arrived from each source. At the focal point, the contributions from 
every element add in phase to produce a peak in the acoustic signal. Elsewhere, at least some of the 
contributions add out of phase, reducing the signal relative to the peak. 

For receiving an ultrasound echo, the phased array works in reverse. Fig. 65.1& shows an echo origi- 
nating from focus 1. The echo is incident on each array element at a different time interval. The received 
signals are electronically delayed so that the delayed signals add in phase for an echo originating at the 
focal point. For echoes originating elsewhere, at least some of the delayed signals will add out of phase, 
reducing the receive signal relative to the peak at the focus. 
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In the receive mode, the focal point can be dynamically adjusted so that it coincides with the range 
of returning echoes. After transmission of an acoustic pulse, the initial echoes return from targets near 
the transducer. Therefore, the scanner focuses the phased array on these targets, located at focus 1 in 
Fig. 65.1b. As echoes return from more distant targets, the scanner focuses at a greater depth (focus 2 in 
the figure). Focal zones are established with adequate depth of field so that the targets are always in focus 
in receive. This process is called dynamic receive focusing and was first implemented by von Ramm and 
Thurstone in 1976 [von Ramm and Thurstone, 1976]. 

Array-Element Configurations 

An ultrasound image is formed by repeating the preceding process many times to scan a 2D or 3D region 
of tissue. For a 2D image, the scanning plane is the azimuth dimension; the elevation dimension is 
perpendicular to the azimuth scanning plane. The shape of the region scanned is determined by the 
array-element configuration, described in the paragraph below. 

Linear Sequential Arrays. Sequential liner arrays have as many as 512 elements in current commercial 
scanners. A subaperture of up to 128 elements is selected to operate at a given time. As shown in Fig. 65.2a, 
the scanning lines are directed perpendicular to the face of the transducer; the acoustic beam is focused 
but not steered. The advantage of this scheme is that the array elements have high sensitivity when the 
beam is directed straight ahead. The disadvantage is that the field of view is limited to the rectangular 
region directly in front of the transducer. Linear-array transducers have a large footprint to obtain an 
adequate field of view. 

Curvilinear Arrays. Curvilinear or convex arrays have a different shape than sequential linear arrays, 
but they operate in the same manner. In both cases, the scan lines are directed perpendicular to the 
transducer face. A curvilinear array, however, scans a wider field of view because of its convex shape, as 
shown in Fig. 65.2b. 

Linear Phased Arrays. The more advanced linear phased arrays have 128 elements. All the elements 
are used to transmit and receive each line of data. As shown in Fig. 65.2c, the scanner steers the ultrasound 
beam through a sector-shaped region in the azimuth plane. Phased arrays scan a region that is significantly 
wider than the footprint of the transducer, making them suitable for scanning through restricted acoustic 
windows. As a result, these transducers are ideal for cardiac imaging, where the transducer must scan 
through a small window to avoid the obstructions of the ribs (bone) and lungs (air). 

1.5D Arrays. The so-called 1.5D array is similar to a 2D array in construction but a ID array in 
operation. The 1.5D array contains elements along both the azimuth and elevation dimensions. Features 
such dynamic focusing and phase correction can be implemented in both dimensions to improve image 
quality. Since a 1.5D array contains a limited number of elements in elevation (e.g., 3 to 9 elements), 
steering is not possible in that direction. Figure 65. 2d illustrates a B-scan made with a 1.5D phased array. 
Linear sequential scanning is also possible with 1.5D arrays. 

2D Phased Arrays. A 2D phased-array has a large number of elements in both the azimuth and elevation 
dimensions. Therefore, 2D arrays can focus and steer the acoustic beam in both dimensions. Using 
parallel receive processing [Shattuck et al., 1984], a 2D array can scan a pyramidal region in real time to 
produce a volumetric image, as shown in Fig. 65. 2e [von Ramm and Smith, 1990]. 

Linear-Array Transducer Performance 

Designing an ultrasound transducer array involves many compromises. Ideally, a transducer has high 
sensitivity or SNR, good spatial resolution, and no artifacts. The individual array elements should have 
wide angular response in the steering dimensions, low cross-coupling, and an electrical impedance 
matched to the transmitter. 
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FIGURE 65.2 Array-element configurations and the region scanned by the acoustic beam, (a) A sequential linear 
array scans a rectangular region; (b) a curvilinear array scans a sector-shaped region; (c) a linear phased array scans 
a sector-shaped region; (d) a 1.5D array scans a sector-shaped region; (e) a 2D array scans a pyramidal-shaped region. 

Figure 65. 3 a illustrates the connections to the transducer assembly. The transmitter and receiver 
circuits are located in the ultrasound scanner and are connected to the array elements through 1 to 2 m 
of coaxial cable. Electrical matching networks can be added to tune out the capacitance of the coaxial 
cable and/or the transducer element and increase the signal-to-noise ratio (SNR). 

A more detailed picture of six- transducer elements is shown in Fig. 65.3 k Electrical leads connect to 
the ground and signal electrodes of the piezoelectric material. Acoustically, the array elements are loaded 
on the front side by one or two quarter-wave matching layers and the tissue medium. The matching 
layers may be made from glass or epoxy. A backing material, such as epoxy, loads the back side of the 
array elements. The faceplate protects the transducer assembly and also may act as an acoustic lens. 
Faceplates are often made from silicone or polyurethane. 

The following subsections describe several important characteristics of an array transducer. 
Figure 65.3c shows a six-element array and its dimensions. The element thickness, width, and length are 
labeled as t, a, and b, respectively. The interelement spacing is d, and the total aperture size is D in 
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FIGURE 65.3 (a) The connections between the ultrasound scanner and the transducer assembly for two elements 
of an array, (b) A more detailed picture of the transducer assembly for six elements of an array, (c) Coordinate system 
and labeling used to describe an array transducer. 

azimuth. The acoustic wavelength in the load medium, usually human tissue, is designated as X, while 
the wavelength in the transducer material is X t . 

Examples are given below for a 128-element linear array operating at 5 MHz. The array is made of 
PZT-5H with element dimensions of 0.1 x 5 x 0.3 mm. The interelement spacing is d = 0.15 mm in 
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FIGURE 65.3 (continued) 

azimuth, and the total aperture is D = 128-0.15 mm = 19.3 mm. See Table 65.1 for the piezoelectric 
material characteristics. The elements have an epoxy backing of Z = 3.25 MRayls. For simplicity, the 
example array does not contain a X/4 matching layer. 

Axial Resolution 

Axial resolution determines the ability to distinguish between targets aligned in the axial direction (the 
direction of acoustic propagation). In pulse-echo imaging, the echoes off of two targets separated by r/2 
have a path length difference of r. If the acoustic pulse length is r, then echoes off the two targets are just 
distinguishable. As a result, the axial resolution is often denned as one-half the pulse length [Christensen, 
1988]. A transducer with a high resonant frequency and a broad bandwidth has a short acoustic pulse 
and good axial resolution. 

Radiation Pattern 

The radiation pattern of a transducer determines the insonified region of tissue. For good lateral reso- 
lution and sensitivity, the acoustic energy should be concentrated in a small region. The radiation pattern 
for a narrow-band or continuous-wave (CW) transducer is described by the Rayleigh-Sommerfeld dif- 
fraction formula [Goodman, 1986]. For a pulse-echo imaging system, this diffraction formula is not 
exact due to the broadband acoustic waves used. Nevertheless, the Rayleigh-Sommerfeld formula is a 
reasonable first-order approximation to the actual radiation pattern. 

The following analysis considers only the azimuth scanning dimension. Near the focal point or in the 
far field, the Fraunhofer approximation reduces the diffraction formula to a Fourier transform formula. 
For a circular or rectangular aperture, the far field is at a range of 



z>- 



4X 



(65.1) 



Figure 65.3c shows the coordinate system used to label the array aperture and its radiation pattern. 
The array aperture is described by 
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where the rect(x) function is a rectangular pulse of width x, and the comb(x) function is a delta function 
repeated at intervals of x. The diffraction pattern is evaluated in the x Q plane at a distance z from the 
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FIGURE 65.4 Radiation pattern of Eq. (65.3) for a 16-element array with a = X, d= 2X, and D = 32A.. The angular 
response, the first term of Eq. (65.3), is also shown as a dashed line. 

transducer, and Q x is the angle of the point x from the normal axis. With the Fraunhofer approximation, 
the normalized diffraction pattern is given by 
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in azimuth, where the Fourier transform of Eq. (65.2) has been evaluated at the spatial frequency 
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Figure 65.4 shows a graph of Eq. (65.3) for a 16-element array with a = X, d = 2X, and D = 32X. In 
the graph, the significance of each term is easily distinguished. The first term determines the angular 
response weighting, the second term determines the location of grating lobes off-axis, and the third term 
determines the shape of the main lobe and the grating lobes. The significance of lateral resolution, 
angular response, and grating lobes is seen from the CW diffraction pattern. 

Lateral resolution determines the ability to distinguish between targets in the azimuth and elevation 
dimensions. According to the Rayleigh criterion [Goodman, 1986], the lateral resolution can be defined 
by the first null in the main lobe, which is determined from the third term of Eq. (65.3). 
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in the azimuth dimension. A larger aperture results in a more narrow main lobe and better resolution. 
A broad angular response is desired to maintain sensitivity while steering off-axis. The first term of 
Eq. (65.3) determines the one-way angular response. The element is usually surrounded by a soft baffle, 
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such as air, resulting in an additional cosine factor in the radiation pattern [Selfridge et al., 1980]. 
Assuming transmit/receive reciprocity, the pulse-echo angular response for a single element is 



P 



sin 2 [ita/k ■ sin 8 I 

>9 = K — ^-cos'e (65.6) 

A x ) I / \ 2 x 

\%a/A ■ sin 9 1 



in the azimuth dimension. As the aperture size becomes smaller, the element more closely resembles a 
point source, and the angular response becomes more broad. Another useful indicator is the -6-dB 
angular response, defined as the full-width half-maximum of the angular response graph. 

Grating lobes are produced at a location where the path length difference to adjacent array elements 
is a multiple of a wavelength (the main lobe is located where the path length difference is zero). The 
acoustic contributions from the elements constructively interfere, producing off-axis peaks. The term 
grating lobe was originally used to describe the optical peaks produced by a diffraction grating. In 
ultrasound, grating lobes are undesirable because they represent acoustic energy steered away from the 
main lobe. From the Comb function in Eq. (65.3), the grating lobes are located at 

=sin _1 — i = l,2, 3, ... (65.7) 

d 

in azimuth. 

If d is a wavelength, then grating lobes are centered at ±90 degrees from the steering direction in that 
dimension. Grating lobes at such large angles are less significant because the array elements have poor 
angular response in those regions. If the main lobe is steered at a large angle, however, the grating lobes 
are brought toward the front of the array. In this case, the angular response weighting produces a relatively 
weak main lobe and a relatively strong grating lobe. To eliminate grating lobes at all steering angles, the 
interelement spacing is set to X/2 or less [Steinberg, 1967]. 

Figure 65.5 shows the theoretical radiation pattern of the 128-element example. For this graph, the 
angular response weighting of Eq. (65.6) was substituted into Eq. (65.3). The lateral resolution, as defined 
by Eq. (65.7), 8^ = 0.9 degrees at the focal point. The -6-dB angular response is ±40 degrees from Eq. (65.6). 

Electrical Impedance 

The electric impedance of an element relative to the electrical loads has a significant impact on transducer 
signal-to-noise ratio (SNR). At frequencies away from resonance, the transducer has electrical charac- 
teristics of a capacitor. The construction of the transducer is a parallel-plate capacitor with clamped 
capacitance of 

C = e s — (65.8) 

where e s is the clamped dielectric constant. 

Near resonance, equivalent circuits help to explain the impedance behavior of a transducer. The 
simplified circuit of Fig. 65.6a is valid for transducers operating at series resonance without losses and 
with low acoustic impedance loads [Kino, 1987]. The mechanical resistance R m represents the acoustic 
loads as seen from the electrical terminals: 
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FIGURE 65.5. Radiation pattern of the example array element with a = 0.1 mm, d - 0.15 mm, D = 19.2 mm, and 
X = 0.3 mm. The angular response of Eq. (65.6) was substituted into Eq. (65.3) for this graph. 
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FIGURE 65.6 Simplified equivalent circuits for a piezoelectric transducer: (a) near-series resonance and (b) near- 
parallel resonance. 

where k is the electromechanical coupling coefficient of the piezoelectric material, Z c is the acoustic 
impedance of the piezoelectric material, Z x is the acoustic impedance of the transducer backing, and Z 2 
is the acoustic impedance of the load medium (body tissue). The power dissipated through R m corre- 
sponds to the acoustic output power from the transducer. 

The mechanical inductance L m and mechanical capacitance C m are analogous to the inductance and 
capacitance of a mass-spring system. At the series resonant frequency of 



1 



f= ^= (65.10) 

the impedances of these components add to zero, resulting in a local impedance minimum. 
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FIGURE 65.7 Complex electrical impedance of the example array element. Series resonance is located at 5.0 MHz, 
and parallel resonance is located at 6.7 MHz. 

The equivalent circuit of Fig. 65.6a can be redrawn in the form shown in Fig. 65.6b. In this circuit, C 
is the same as before, but the mechanical impedances have values of L m ', C m ', and R a . The resistive 
component R a is 



R 



4fc 2 
ncoCL 



z 1 + z 2 



(65.11) 



The inductor and capacitor combine to form an open circuit at the parallel resonant frequency of 



(65.12) 



27t A ;L' C 

V m m 



The parallel resonance, which is at a slightly higher frequency than the series resonance, is indicated by 
a local impedance maximum. 

Figure 65.7 shows a simulated plot of magnitude and phase versus frequency for the example array 
element described at the beginning of this subsection. The series resonance frequency is immediately 
identified at 5.0 MHz with an impedance minimum of \Z\ = 350 Q. Parallel resonance occurs at 6.7 MHz 
with an impedance maximum of \Z\ = 4000 Q. Note the capacitive behavior (approximately -90-degree 
phase) at frequencies far from resonance. 

Designing a Phased-Array Transducer 

In this subsection the design of an idealized phased-array transducer is considered in terms of the 
performance characteristics described above. Criteria are described for selecting array dimensions, acous- 
tic backing and matching layers, and electrical matching networks. 

Choosing Array Dimensions 

The array element thickness is determined by the parallel resonant frequency. For X/2 resonance, the 
thickness is 
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t=^- = -$- (65.13) 

2 2/, 

where C ( is the longitudinal speed of sound in the transducer material. 

There are three constraints for choosing the element width and length: (1) a nearly square cross-section 
should be avoided so that lateral vibrations are not coupled to the thickness vibration; as a rule of thumb 
[Kino and DeSilets, 1979], 

a/t<0.6 or a/t>W (65.14) 

(2) a small width and length are also desirable for a wide angular response weighting function; and (3) an 
interelement spacing of X/2 or less is necessary to eliminate grating lobes. 

Fortunately, these requirements are consistent for PZT array elements. For all forms of PZT, c, > 2c, 
where c is the speed of sound in body tissue (an average of 1540 m/s). At a given frequency, then X t > 
2X. Also, Eq. (65.13) states that X t = 2f at a frequency of f . By combining these equations, t > X for PZT 
array elements operating at a frequency of f p . If d = X/2, then a < X/2 because of the finite kerf width 
that separates the elements. Given this observation, then a < t/2. This is consistent with Eq. (65.14) to 
reduce lateral modes. 

An element having d = X/2 also has adequate angular response. For illustrative purposes, one can 
assume a zero kerf width so that a = X/2. In this case, the — 6-dB angular response is 0^ = ±35 degrees 
according to Eq. (65.6). 

The array dimensions determine the transducer's lateral resolution. In the azimuth dimension, if d = 
X/2, then the transducer aperture is D = nX/2, where n is the number of elements in a fully sampled 
array. From Eq. (65.5), the lateral resolution in azimuth is 

9 =sm' 2 (65.15) 

n 

Therefore, the lateral resolution is independent of frequency in a fully sampled array with d = X/2 . For 
this configuration, the lateral resolution is improved by increasing the number of elements. 

Acoustic Backing and Matching Layers 

The backing and matching layers affect the transducer bandwidth and sensitivity. While a lossy, matched 
backing improves bandwidth, it also dissipates acoustic energy that could otherwise be transmitted into 
the tissue medium. Therefore, a low-impedance acoustic backing is preferred because it reflects the 
acoustic pulses toward the front side of the transducer. In this case, adequate bandwidth is maintained 
by acoustically matching the transducer to the tissue medium using matching layers. 

Matching layers are designed with a thickness of XI A at the center frequency and an acoustic impedance 
between those of the transducer Z T and the load medium Z L . The ideal acoustic impedances can be 
determined from several different models [Hunt et al., 1983]. Using the KLM equivalent circuit model 
[Desilets et al., 1978], the ideal acoustic impedance is 

Z x = ^Z r Z\ (65.16) 

for a single matching layer. For matching PZT-5H array elements (Z T = 30 MRayls) to a water load (Z L = 
1.5 MRayls), a matching layer of Z l = 4.1 MRayls should be chosen. If two matching layers are used, they 
should have acoustic impedances of 



A/ 
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Z x = 4Z^Zl (65.17a) 
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FIGURE 65.8 A transducer of real impedance _R f being excited by a transmitter with source impedance R and 
source voltage V in . 
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(65.17b) 



In this case, Z l = 8.3 MRayls and Z 2 = 2.3 MRayls for matching PZT-5H to a water load. 

When constructing a transducer, a practical matching layer material is not always available, with the 
ideal acoustic impedance [Eq. (65.16) or (65.17)]. Adequate bandwidth is obtained by using materials 
that have an impedance close to the ideal value. With a single matching layer, for example, conductive 
epoxy can be used with Z = 5.1 MRayls. 

Electrical Impedance Matching 

Signal-to-noise ratio and bandwidth are also improved when electrical impedance of an array element 
is matched to that of the transmit circuitry. Consider the simplified circuit in Fig. 65.8 with a transmitter 
of impedance R a and a transducer of real impedance _R r The power output is proportional to the power 
dissipated in R t , as expressed as 



V 2 R 

p = ^sl where V = ( — V 

am ^ ou, ^ + ^ m 



(65.18) 



The power available from the transmitter is 



- a, 

into a matched load. From the two previous equations, the power efficiency is 



(65.19) 



out 

p. 



(Ro+r,) 2 



(65.20) 



For a fixed-source impedance, the maximum efficiency is obtained by taking the derivative of 
Eq. (65.20) with respect to _R f and setting it to zero. Maximum efficiency occurs when the source imped- 
ance is matched to the transducer impedance, R = R t . 

In practice, the transducer has a complex impedance of R m in parallel with C (see Fig. 65.6), which 
is excited by a transmitter with a real impedance of 50 Q. The transducer has a maximum efficiency 
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FIGURE 65.9 ( a) Conventional single-layer ceramic; ( V) five-layer ceramic of the same overall dimensions. The layers 
are electrically in parallel and acoustically in series. The arrows indicate the piezoelectric poling directions of each layer. 

when the imaginary component is tuned out and the real component is 50 Q. This can be accomplished 
with electrical matching networks. 

The capacitance C is tuned out in the frequency range near co using an inductor of 



<c 



(65.21) 



for an inductor in shunt, or 



»oC„ + vX 2 ,c 



(65.22) 



for an inductor in series. The example array elements described in the preceding subsection have C = 
22 pF and R m = 340 D. at series resonance of 5.0 MHz. Therefore, tuning inductors of L = 46 uH or 
L[ = 2.4 uH should be used. 

A shunt inductor also raises the impedance of the transducer, as seen from the scanner, while a series 
inductor lowers the terminal impedance [Hunt et al., 1983]. For more significant changes in terminal 
impedance, transformers are used. 

A transformer of turns ratio 1:N multiplies the terminal impedance by 1/N 2 . In the transmit mode, 
N can be adjusted so that the terminal impedance matches the transmitter impedance. In the receive 
mode, the open-circuit sensitivity varies as 1/Nbecause of the step-down transformer. The lower terminal 
impedance of the array element, however, provides increased ability to drive an electrical load. 

More complicated circuits can be used for better electrical matching across a wide bandwidth [Hunt 
et al., 1983]. These circuits can be either passive, as above, or active. Inductors also can be used in the 
scanner to tune out the capacitance of the coaxial cable that loads the transducer on receive. 

Another alternative for electrical matching is to use multilayer piezoelectric ceramics [Goldberg and 
Smith, 1994]. Figure 65.9 shows an example of a single layer and a five-layer array element with the same 
overall dimensions of a, b, and t. Since the layers are connected electrically in parallel, the clamped 
capacitance of a multilayer ceramic (MLC) element is 
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C = N-e s --^=N 2 -C smik (65.23) 

where C sin le is the capacitance of the single-layer element (Eq. 65.8). As a result, the MLC impedance is 
reduced by a factor of N 2 . Acoustically, the layers of the MLC are in series so the X/2 resonant thickness 
is t, the stack thickness. 

To a first order, an N-layer ceramic has identical performance compared with a 1:N transformer, but 
the impedance is transformed within the ceramic. MLCs also can be fabricated in large quantities more 
easily than hand-wound transformers. While MLCs do not tune out the reactive impedance, they make 
it easier to tune a low capacitance array element. By lowering the terminal impedance of an array element, 
MLCs significantly improve transducer SNR. 

Summary 

The piezoelectric transducer is an important component in the ultrasound imaging system. The trans- 
ducer often consists of a liner array that can electronically focus an acoustic beam. Depending on the 
configuration of array elements, the region scanned may be sector shaped or rectangular in two dimen- 
sions or pyramidal shaped in three dimensions. 

The transducer performance large determines the resolution and the signal-to-noise ratio of the 
resulting ultrasound image. The design of an array involves many compromises in choosing operating 
frequency and array-element dimensions. Electrical matching networks and quarter- wave matching layers 
may be added to improve transducer performance. 

Further improvements in transducer performance may result from several areas of research. Newer 
materials, such as composites, are gaining widespread use in medical ultrasound. In addition, 1.5D arrays 
or 2D arrays may be employed to control the acoustic beam in both azimuth and elevation. Problems 
in fabrication and electrical impedance matching must be overcome to implement these arrays in an 
ultrasound system. 

Defining Terms 

Acoustic impedance: In an analogy to transmission line impedance, the acoustic impedance is the ratio 

of pressure to particle velocity in a medium; more commonly, it is defined as Z = pc, where p = 

density and c = speed of sound in a medium [the units are kg/(m 2 -sec) or Rayls]. 
Angular response: The radiation pattern versus angle for a single element of an array. 
Axial resolution: The ability to distinguish between targets aligned in the axial direction (the direction 

of acoustic propagation). 
Azimuth dimension: The lateral dimension that is along the scanning plane for an array transducer. 
Electrical matching networks: Active or passive networks designed to tune out reactive components 

of the transducer and/or match the transducer impedance to the source and receiver impedance. 
Elevation dimension: The lateral dimension that is perpendicular to the scanning plane for an array 

transducer. 
Grating lobes: Undesirable artifacts in the radiation pattern of a transducer; they are produced at a 

location where the path length difference to adjacent array elements is a multiple of a wavelength. 
Lateral modes: Transducer vibrations that occur in the lateral dimensions when the transducer is 

excited in the thickness dimension. 
Lateral resolution: The ability to distinguish between targets in the azimuth and elevation dimensions 

(perpendicular to the axial dimension). 
Quarter- wave matching layers: One or more layers of material placed between the transducer and the 

load medium (water or human tissue); they effectively match the acoustic impedance of the 

transducer to the load medium to improve the transducer bandwidth and signal-to-noise ratio. 
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Further Information 

A good overview of linear array design and performance is contained in von OT Ramm and S.W. Smith 
(1983), Beam steering with linear arrays, IEEE Trans Biomed Eng 30:438. The same issue contains a more 
general article on transducer design and performance: J.W. Hunt, M. Arditi, and F.S. Foster (1983), 
Ultrasound transducers for pulse-echo medical imaging, IEEE Trans Biomed Eng 30:453. 
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The journal IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control frequently contains 
articles on medical ultrasound transducers. For subscription information, contact IEEE Service Center, 
445 Hoes Lane, P.O. Box 1331, Piscataway, NJ 08855-1331, phone (800) 678-IEEE. 

Another good source is the proceedings of the IEEE Ultrasonics Symposium, published each year. 
Also, the proceedings from New Developments in Ultrasonics Transducers and Transducer Systems, edited 
by EL. Lizzi, was published by SPIE, Vol. 1733, in 1992. 

65.2 Ultrasonic Imaging 

Jack G. Mottley 

It was recognized long ago that the tissues of the body are inhomogeneous and that signals sent into 
them, like pulses of high-frequency sound, are reflected and scattered by those tissues. Scattering, or 
redirection of some of an incident energy signal to other directions by small particles, is why we see the 
beam of a spotlight in fog or smoke. That part of the scattered energy that returns to the transmitter is 
called the backscatter. 

Ultrasonic imaging of the soft tissues of the body really began in the early 1970s. At that time, the 
technologies began to become available to capture and display the echoes backscattered by structures 
within the body as images, at first as static compound images and later as real-time moving images. The 
development followed much the same sequence (and borrowed much of the terminology) as did radar 
and sonar, from initial crude single-line-of-sight displays (A-mode) to recording these side by side to 
build up recordings over time to show motion (M-mode), to finally sweeping the transducer either 
mechanically or electronically over many directions and building up two-dimensional views (B-mode or 
2D). 

Since this technology was intended for civilian use, applications had to wait for the development of 
inexpensive data handling, storage, and display technologies. A-mode was usually shown on oscilloscopes, 
M-modes were printed onto specially treated light-sensitive thermal paper, and B-mode was initially 
built up as a static image in analog scan converters and shown on television monitors. Now all modes 
are produced in real time in proprietary scan converters, shown on television monitors, and recorded 
either on commercially available videotape recorders (for organs or studies in which motion is a part of 
the diagnostic information) or as still frames on photographic film (for those cases in which organ 
dimensions and appearance are useful, but motion is not important). 

Using commercial videotape reduces expenses and greatly simplifies the review of cases for quality 
control and training, since review stations can be set up in offices or conference rooms with commonly 
available monitors and videocassette recorders, and tapes from any imaging system can be played back. 
Also, the tapes are immediately available and do not have to be chemically processed. 

Since the earliest systems were mostly capable of showing motion, the first applications were in studying 
the heart, which must move to carry out its function. A-mode and M-mode displays (see Figs. 65.10 
through 65.12) were able to demonstrate the motion of valves, thickening of heart chamber walls, 
relationships between heart motion and pressure, and other parameters that enabled diagnoses of heart 
problems that had been difficult or impossible before. For some valvular diseases, the preferred display 
format for diagnosis is still the M-mode, on which the speed of valve motions can be measured and the 
relations of valve motions to the electrocardiogram (ECG) are easily seen. 

Later, as 2D displays became available, ultrasound was applied more and more to imaging of the soft 
abdominal organs and in obstetrics (Fig. 65.13). In this format, organ dimensions and structural relations 
are seen more easily, and since the images are now made in real time, motions of organs such as the 
heart are still well appreciated. These images are used in a wide variety of areas from obstetrics and 
gynecology to ophthalmology to measure the dimensions of organs or tissue masses and have been widely 
accepted as a safe and convenient imaging modality. 
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FIGURE 65.10 Schematic representation of the signal received from along a single line of sight in a tissue. The 
rectified voltage signals are displayed for A-mode. 



Fundamentals 

Strictly speaking, ultrasound is simply any sound wave whose frequency is above the limit of human 
hearing, which is usually taken to be 20 kHz. In the context of imaging of the human body, since frequency 
and wavelength (and therefore resolution) are inversely related, the lowest frequency of sound commonly 
used is around 1 MHz, with a constant trend toward higher frequencies in order to obtain better 
resolution. Axial resolution is approximately one wavelength, and at 1 MHz, the wavelength is 1.5 mm 
in most soft tissues, so one must go to 1.5 MHz to achieve 1-mm resolution. 

Attenuation of ultrasonic signals increases with frequency in soft tissues, and so a tradeoff must be 
made between the depth of penetration that must be achieved for a particular application and the highest 
frequency that can be used. Applications that require deep penetration (e.g., cardiology, abdominal, 
obstetrics) typically use frequencies in the 2- to 5-MHz range, while those applications which only require 
shallow penetration but high resolution (e.g., ophthalmology, peripheral vascular, testicular) use fre- 
quencies up to around 20 MHz. Intra-arterial imaging systems, requiring submillimeter resolution, use 
even higher frequencies of 20 to 50 MHz, and laboratory applications of ultrasonic microscopy use 
frequencies up to 100 or even 200 MHz to examine structures within individual cells. 

There are two basic equations used in ultrasonic imaging. One relates the (one-way) distance d of an 
object that caused an echo from the transducer to the (round-trip) time delay t and speed of sound in 
the medium c: 



d=-tc 

2 



(65.24) 



The speed of sound in soft body tissues lies in a fairly narrow range from 1450 to 1520 m/s. For rough 
estimates of time of flight, one often uses 1500 m/s, which can be converted to 1.5 mm/u.s, a more 
convenient set of units. This leads to delay times for the longest-range measurements (20 cm) of 270 u.s. 
To allow echoes and reverberations to die out, one needs to wait several of these periods before launching 
the next interrogating pulse, so pulse repetition frequencies of about a kilohertz are possible. 
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FIGURE 65.11 Example of M-mode imaging of a heart at two points during the cardiac cycle, (a) Upper panel 
shows heart during diastole (relaxation) with a line of sight through it and the corresponding A-line converted to 
an M-line. (b) The lower panel shows the same heart during systole (contraction) and the A- and M-lines. Note the 
thicker walls and smaller ventricular cross-section during systole. 

The other equation relates the received signal strength S(t) to the transmitted signal T(t), the trans- 
ducer's properties B{t), the attenuation of the signal path to and from the scatterer A(t), and the strength 
of the scatterer r\{t): 



s(f) = r(t)®5(t)®A(i)®r|(i) 



(65.25) 



where ® denotes time-domain convolution. Using the property of Fourier transforms that a convolution 
in the time domain is a multiplication in the frequency domain, this is more often written in the frequency 
domain as 
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FIGURE 65.12 Completed M-mode display obtained by showing the M-lines of Fig. 65.11 side by side. The motion 
of the heart walls and their thickening and thinning are well appreciated. Often the ECG or heart sounds are also 
shown in order to coordinate the motions of the heart with other physiologic markers. 
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FIGURE 65.13 Schematic representation of a heart and how a 2D image is constructed by scanning the transducer. 
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s(/) = t(/)b(/)a(/)ti(/) (65.26) 



where each term is the Fourier transform of the corresponding term in the time-domain expression 
(65.25) and is written as a function of frequency/ 

The goal of most imaging applications is to measure and produce an image based on the local values 
of the scattering strength, which requires some assumptions to be made concerning each of the other 
terms. The amplitude of the transmitted signal T(f) is a user-adjustable parameter that simply adds a 
scale factor to the image values, unless it increases the returned signal to the point of saturating the 
receiver amplifier. Increasing the transmit power increases the strength of return from distant or faint 
echoes simply by increasing the power that illuminates them, like using a more powerful flashlight lets 
you see farther at night. Some care must be taken to not turn the transmit power up too high, since very 
high power levels are capable of causing acoustic cavitation or local heating of tissues, both of which can 
cause cellular damage. Advances in both electronics and transducers make it possible to transmit more 
and more power. For this reason, new ultrasonic imaging systems are required to display an index value 
that indicates the transmitted power. If the index exceeds established thresholds, it is possible that damage 
may occur, and the examiner should limit the time of exposure. 

Most imaging systems are fairly narrow band, so the transducer properties B(f) are constant and 
produce only a scale factor to the image values. On phased-array systems it is possible to change the 
depth of focus on both transmit and receive. This improves image quality and detection of lesions by 
matching the focusing characteristics of the transducer to best image the object in question, like focusing 
a pair of binoculars on a particular object. 

As the ultrasonic energy travels along the path from transmitter to scatterer and back, attenuation 
causes the signal to decrease with distance. This builds up as a line integral from time to time t as 



A\ 



(/.»)■ 



-J»' 



An average value of attenuation can be corrected for electronically by increasing the gain of the imaging 
system as a function of time [variously called time gain compensation (TGC) or depth gain compensation 
(DGC)]. In addition, some systems allow for lateral portions of the image region to have different 
attenuation by adding a lateral gain compensation in which the gain is increased to either side of the 
center region of the image. 

Time gain compensation is usually set to give a uniform gray level to the scattering along the center 
of the image. Most operators develop a "usual" setting on each machine, and if it becomes necessary to 
change those settings to obtain acceptable images on a patient, then that indicates that the patient has a 
higher attenuation or that there is a problem with the electronics, transducer, or acoustic coupling. 

Applications and Example Calculations 

As an example of calculating the time of flight of an ultrasonic image, consider the following. 

Example 1. A tissue has a speed of sound c = 1460 m/s, and a given feature is 10 cm deep within. 
Calculate the time it will take an ultrasonic signal to travel from the surface to the feature and back. 

Answer: t = 2 x (10 cm)/(1460 m/s) = 137 us, where the factor of 2 is to account for the round trip 
the signal has to make (i.e., go in and back out). 

Example 2. Typical soft tissues attenuate ultrasonic signals at a rate of 0.5 dB/cm/MHz. How much 
attenuation would be suffered by a 3-MHz signal going through 5 cm of tissue and returning? 

Answer: a = 3 MHz x (0.5 dB/cm/MHz) / (8.686 dB/neper) = 0.173 neper/cm, A(3 MHz, 5 cm) = 

g(-0. 173 neper/cm) x (5 cm)x 2 — A lyy 
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Economics 

Ultrasonic imaging has many economic advantages over other imaging modalities. The imaging systems 
are typically much less expensive than those used for other modalities and do not require special 
preparations of facilities such as shielding for x-rays or uniformity of magnetic field for MRI. Most 
ultrasonic imaging systems can be rolled easily from one location to another, so one system can be shared 
among technicians or examining rooms or even taken to patients' rooms for critically ill patients. 

There are minimal expendables used in ultrasonic examinations, mostly the coupling gel used to couple 
the transducer to the skin and videotape or film for recording. Transducers are reusable and amortized 
over many examinations. These low costs make ultrasonic imaging one of the least expensive modalities, 
far preferred over others when indicated. The low cost also means these systems can be a part of private 
practices and used only occasionally. 

As an indication of the interest in ultrasonic imaging as an alternative to other modalities, in 1993, 
the Wall Street Journal reported that spending in the United States on MRI units was approximately 
$520 million, on CT units $800 million, and on ultrasonic imaging systems $1000 million, and that sales 
of ultrasound systems was growing at 15% annually [1]. 

Defining Terms 

A-mode: The original display of ultrasound measurements, in which the amplitude of the returned 
echoes along a single line is displayed on an oscilloscope. 

Attenuation: The reduction is signal amplitude that occurs per unit distance traveled. Some attenuation 
occurs in homogeneous media such as water due to viscous heating and other phenomena, but 
that is very small and is usually taken to be negligible over the 10- to 20-cm distances typical of 
imaging systems. In inhomogeneous media such as soft tissues, the attenuation is much higher 
and increases with frequency. The values reported for most soft tissues lie around 0.5 dB/cm/MHz. 

Backscatter: That part of a scattered signal that goes back toward the transmitter of the energy. 

B-mode or 2D: The current display mode of choice. This is produced by sweeping the transducer from 
side to side and displaying the strength of the returned echoes as bright spots in their geometrically 
correct direction and distance. 

Compound images: Images built up by adding, or compounding, data obtained from a single trans- 
ducer or multiple transducers swept through arcs. Often these transducers were not fixed to a 
single point of rotation but could be swept over a surface of the body like the abdomen in order 
to build up a picture of the underlying organs such as the liver. This required an elaborate position- 
sensing apparatus attached to the patient's bed or the scanner and that the organ in question be 
held very still throughout the scanning process, or else the image was blurred. 

M-mode: Followed A-mode by recording the strength of the echoes as dark spots on moving light- 
sensitive paper. Objects that move, such as the heart, caused standard patterns of motion to be 
displayed, and a lot of diagnostic information such as valve closure rates, whether valves opened 
or closed completely, and wall thickness could be obtained from M-mode recordings. 

Real-time images: Images currently made on ultrasound imaging systems by rapidly sweeping the 
transducer through an arc either mechanically or electronically. Typical images might have 120 scan 
lines in each image, each 20 cm long. Since each line has a time of flight of 267 (is, a single frame 
takes 120 x 267 |j,s = 32 ms. It is therefore possible to produce images at standard video frame 
rates (30 frames/sec, or 33.3 msec/frame). 

Reflection: Occurs at interfaces between large regions (much larger than a wavelength) of media with 
differing acoustic properties such as density or compressibility. This is similar to the reflection of 
light at interfaces and can be either total, like a mirror, or partial, like a half-silvered mirror or the 
ghostlike reflection seen in a sheet of glass. 

Scattering: Occurs when there are irregularities or inhomogeneities in the acoustic properties of a medium 
over distances comparable with or smaller than the wavelength of the sound. Scattering from objects 
much smaller than a wavelength typically increases with frequency (the blue-sky law in optics), while 
that from an object comparable to a wavelength is constant with frequency (why clouds appear white). 
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Further Information 

There are many textbooks that contain good introduction to ultrasonic imaging. Physical Principles of 
Ultrasonic Diagnosis, by P. N. Wells, is a classic, and there is a new edition of another classic, Diagnostic 
Ultrasound: Principles, Instruments and Exercises, 4th ed., by Frederick Kremkau. Books on medical 
imaging that contain introductions to ultrasonic imaging include Medical Imaging Systems, by Albert 
Macovski; Principles of Medical Imaging, by Kirk Shung, Michael Smith, and Benjamin Tsui; and Foun- 
dations of Medical Imaging, by Zang-Hee Cho, Joie P. Jones, and Manbir Singh. 

The monthly journals IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control and IEEE 
Transactions on Biomedical Engineering often contain information and research reports on ultrasonic 
imaging. For subscription information, contact IEEE Service Center, 445 Hoes Lane, P.O. Box 1331, 
Piscataway, NJ 08855-1331, phone (800) 678-4333. Another journal that often contains articles on 
ultrasonic imaging is the Journal of the Acoustical Society of America. For subscription information, contact 
AIP Circulation and Fulfillment Division, 500 Sunnyside Blvd., Woodbury, NY 11797-2999, phone (800) 
344-6908; e-mail: elecprod\@pinet.aip.org. 

There are many journals that deal with medical ultrasonic imaging exclusively. These include Ultrasonic 
Imaging, the Journal of Ultrasound in Medicine, American Institute of Ultrasound of Medicine (AIUM), 
14750 Sweitzer Lane, Suite 100, Laurel, MD 20707-5906, and the Journal of Ultrasound in Medicine and 
Biology, Elsevier Science, Inc., 660 White Plains Road, Tarrytown, NY 10591-5153, e-mail: 
esuk.usa@elsevier.com. 

There are also specialty journals for particular medical areas, e.g., the Journal of the American Society 
of Echocardiography, that are available through medical libraries and are indexed in Index Medicus, 
Current Contents, Science Citation Index, and other databases. 

65.3 Blood Flow Measurement Using Ultrasound 

K. Whittaker Ferrara 

In order to introduce the fundamental challenges of blood velocity estimation, a brief description of the 
unique operating environment produced by the ultrasonic system, intervening tissue, and the scattering 
of ultrasound by blood is provided. In providing an overview of the parameters that differentiate this 
problem from radar and sonar target estimation problems, an introduction to the fluid dynamics of the 
cardiovascular system is presented, and the requirements of specific clinical applications are summarized. 
An overview of blood flow estimation systems and their performance limitations is then presented. Next, 
an overview of the theory of moving target estimation, with its roots in radar and sonar signal processing, 
is provided. The application of this theory to blood velocity estimation is then reviewed, and a number 
of signal processing strategies that have been applied to this problem are considered. Areas of new research 
including three-dimensional (3D) velocity estimation and the use of ultrasonic contrast agents are 
described in the final section. 

Fundamental Concepts 

In blood velocity estimation, the goal is not simply to estimate the mean target position and mean target 
velocity. The goal instead is to measure the velocity profile over the smallest region possible and to repeat 
this measurement quickly and accurately over the entire target. Therefore, the joint optimization of 
spatial, velocity, and temporal resolution is critical. In addition to the mean velocity, diagnostically useful 
information is contained in the volume of blood flowing through various vessels, spatial variations in 

© 2000 by CRC Press LLC 



Transducer 



Returned signal 




Vessel 



Layers of 



Intervening 
Tissue 



Blood Flow Beam 



FIGURE 65.14 Operating environment for the estimation of blood velocity. 

the velocity profile, and the presence of turbulence. While current methods have proven extremely 
valuable in the assessment of the velocity profile over an entire vessel, improved spatial resolution is 
required in several diagnostic situations. Improved velocity resolution is also desirable for a number of 
clinical applications. Blood velocity estimation algorithms implemented in current systems also suffer 
from a velocity ambiguity due to aliasing. 

Unique Features of the Operating Environment 

A number of features make blood flow estimation distinct from typical radar and sonar target estimation 
situations. The combination of factors associated with the beam formation system, properties of the 
intervening medium, and properties of the target medium lead to a difficult and unique operating 
environment. Figure 65.14 summarizes the operating environment of an ultrasonic blood velocity esti- 
mation system, and Table 65.2 summarizes the key parameters. 

Beam Formation— Data Acquisition System. The transducer bandwidth is limited. Most current trans- 
ducers are limited to a 50% to 75% fractional bandwidth due to their finite dimensions and a variety of 
electrical and mechanical properties. This limits the form of the transmitted signal. The transmitted pulse 
is typically a short pulse with a carrier frequency, which is the center frequency in the spectrum of the 
transmitted signal. 

Federal agencies monitor four distinct intensity levels. The levels are TASA, TASP, TPSA, and TPSP, where 
T represents temporal, S represents spatial, A represents average, and P represents peak. Therefore, the 

TABLE 65.2 Important Parameters 



Typical transducer center frequency 

Maximum transducer fractional bandwidth 

Speed of sound c 

Acoustic wavelength (c = 1540) 

Phased-array size 

Sample volume size 

Blood velocity 

Vessel wall echo/blood echo 

Diameter of a red blood cell 

Thickness of a red blood cell 

Volume of a red blood cell 

Volume concentration of cells (hematocrit) 

Maximum concentration without cell deformation 58% 



2-10 MHz 

50-75% 

1500-1600 m/s 

0.154-1.54 mm 

>32 -wavelength 

mm 3 

Normal; up to 1 m/s 

Pathological: up to 8 m/s 

20-40 dB 

8.5 pm 

2.4 pm 

87 ± 6 pm 3 

45% 
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use of long bursts requires a proportionate reduction in the transmitted peak power. This may limit the 
signal-to-noise ratio (SNR) obtained with a long transmitted burst due to the weak reflections from the 
complex set of targets within the body. 

Intervening Medium. Acoustic windows, which are locations for placement of a transducer to success- 
fully interrogate particular organs, are limited in number and size. Due to the presence of bone and air, 
the number of usable acoustic windows is extremely limited. The reflection of acoustic energy from bone 
is only 3 dB below that of a perfect reflector [Wells, 1977]. Therefore, transducers cannot typically 
surround a desired imaging site. In many cases, it is difficult to find a single small access window. This 
limits the use of inverse techniques. 

Intervening tissue produces acoustic refraction and reflection. Energy is reflected at unpredictable 
angles. 

The clutter-to-signal ratio is very high. Clutter is the returned signal from stationary or slowly moving 
tissue, which can be 40 dB above the returned signal from blood. Movement of the vessel walls and valves 
during the cardiac cycle introduces a high-amplitude, low-frequency signal. This is typically considered 
to be unwanted noise, and a high-pass filter is used to eliminate the estimated wall frequencies. 

The sampling rate is restricted. The speed of sound in tissue is low (approximately 1540 m/s), and 
each transmitted pulse must reach the target and return before the returned signal is recorded. Thus the 
sampling rate is restricted, and the aliasing limit is often exceeded. 

The total observation time is limited (due to low acoustic velocity). In order to estimate the velocity 
of blood in all locations in a 2D field in real time, the estimate for each region must be based on the 
return from a limited number of pulses because of the low speed of sound. 

Frequency-dependent attenuation affects the signal. Tissue acts as a low-pass transmission filter; the 
scattering functions as a high-pass filter. The received signal is therefore a distorted version of the 
transmitted signal. In order to estimate the effective filter function, the type and extent of each tissue 
type encountered by the wave must be known. Also, extension of the bandwidth of the transmitted signal 
to higher frequencies increases absorption, requiring higher power levels that can increase health concerns. 

Target Scattering Medium (Red Blood Cells). Multiple groups of scatterers are present. The target 
medium consists of multiple volumes of diffuse moving scatterers with velocity vectors that vary in 
magnitude and direction. The target medium is spread in space and velocity. The goal is to estimate the 
velocity over the smallest region possible. 

There is a limited period of statistical stationarity. The underlying cardiac process can only be con- 
sidered to be stationary for a limited time. This time was estimated to be 10 ms for the arterial system 
by Hatle and Angelsen [1985]. If an observation interval greater than this period is used, the average 
scatterer velocity cannot be considered to be constant. 

Overview of Ultrasonic Flow Estimation Systems 

Current ultrasonic imaging systems operate in a pulse-echo (PE) or continuous-wave (CW) intensity 
mapping mode. In pulse-echo mode, a very short pulse is transmitted, and the reflected signal is analyzed. 
For a continuous-wave system, a lower-intensity signal is continuously transmitted into the body, and 
the reflected energy is analyzed. In both types of systems, an acoustic wave is launched along a specific 
path into the body, and the return from this wave is processed as a function of time. The return is due 
to reflected waves from structures along the line of sight, combined with unwanted noise. Spatial selec- 
tivity is provided by beam formation performed on burst transmission and reception. Steering of the 
beam to a particular angle and creating a narrow beam width at the depth of interest are accomplished 
by an effective lens applied to the ultrasonic transducer. This lens may be produced by a contoured 
material, or it may be simulated by phased pulses applied to a transducer array. The spatial weighting 
pattern will ultimately be the product of the effective lens on transmission and reception. The returned 
signal from the formed beam can be used to map the backscattered intensity into a two-dimensional 
gray-scale image, or to estimate target velocity. We shall focus on the use of this information to estimate 
the velocity of red blood cells moving through the body. 
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Single Sample Volume Doppler Instruments. One type of system uses the Doppler effect to estimate 
velocity in a single volume of blood, known as the sample volume, which is designated by the system 
operator. The Doppler shift frequency from a moving target can be shown to equal 2f c v/c, where f c is the 
transducer center frequency in Hertz, c is the speed of sound within tissue, and v is the velocity component 
of the blood cells toward or away from the transducer. These "Doppler" systems transmit a train of long 
pulses with a well-defined carrier frequency and measure the Doppler shift in the returned signal. The 
spectrum of Doppler frequencies is proportional to the distribution of velocities present in the sample 
volume. The sample volume is on a cubic millimeter scale for typical pulse-echo systems operating in 
the frequency range of 2 to 10 MHz. Therefore, a thorough cardiac or peripheral vascular examination 
requires a long period. In these systems, 64 to 128 temporal samples are acquired for each estimate. The 
spectrum of these samples is typically computed using a fast Fourier transform (FFT) technique [Kay 
and Marple, 1981]. The range of velocities present within the sample volume can then be estimated. The 
spectrum is scaled to represent velocity and plotted on the vertical axis. Subsequent spectral estimates 
are then calculated and plotted vertically adjacent to the first estimate. 

Color Flow Mapping. In color flow mapping, a pseudo-color velocity display is overlaid on a 2D gray- 
scale image. Simultaneous amplitude and velocity information is thus available for a 2D sector area of 
the body. The clinical advantage is a reduction in the examination time and the ability to visualize the 
velocity profile as a 2D map. Figure 65.15 shows a typical color flow map of ovarian blood flow combined 
with the Doppler spectrum of the region indicated by the small graphic sample volume. The color flow 
map shows color-encoded velocities superimposed on the gray-scale image with the velocity magnitude 
indicated by the color bar on the side of the image. Motion toward the transducer is shown in yellow 
and red, and motion away from the transducer is shown in blue and green, with the range of colors 
representing a range of velocities to a maximum of 6 cm/s in each direction. Velocities above this limit 
would produce aliasing for the parameters used in optimizing the instrument for the display of ovarian 
flow. A velocity of m/s would be indicated by black, as shown at the center of the color bar. Early 
discussions of the implementation of color flow mapping systems can be found in Curry and White 
[1978] and Nowicki and Reid [1981]. 

The lower portion of the image presents an intensity-modulated display of instantaneous Doppler 
components along the vertical axis. As time progresses, the display is translated along the horizontal axis 
to generate a Doppler time history for the selected region of interest [provided by Acuson Corporation, 
Mountain View, Calif.]. 




FIGURE 65.15 Flow map and Doppler spectrum for ovarian blood flow. 
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Limitations of color flow instruments result in part from the transmission of a narrowband (long) 
pulse that is needed for velocity estimation but degrades spatial resolution and prevents mapping of the 
spatial-velocity profile. Due to the velocity gradient in each blood vessel, the transmission of a long pulse 
also degrades the velocity resolution. This is caused by the simultaneous examination of blood cells 
moving at different velocities and the resulting mixing of regions of the scattering medium, which can 
be distinctly resolved on a conventional B-mode image. Since the limited speed of acoustic propagation 
velocity limits the sampling rate, a second problem is aliasing of the Doppler frequency. Third, informa- 
tion regarding the presence of velocity gradients and turbulence is desired and is not currently available. 
Finally, estimation of blood velocity based on the Doppler shift provides only an estimate of the axial 
velocity, which is the movement toward or away from the transducer, and cannot be used to estimate 
movement across the transducer beam. It is the 3D velocity magnitude that is of clinical interest. 

For a color flow map, the velocity estimation technique is based on estimation of the mean Doppler 
shift using signal-processing techniques optimized for rapid (real-time) estimation of velocity in each 
region of the image. The transmitted pulse is typically a burst of 4 to 8 cycles of the carrier frequency. 
Data acquisition for use in velocity estimation is interleaved with the acquisition of information for the 
gray-scale image. Each frame of acquired data samples is used to generate one update of the image display. 
An azimuthal line is a line that describes the direction of the beam from the transducer to the target. A 
typical 2D ultrasound scanner uses 128 azimuthal lines per frame and 30 frames per second to generate 
a gray-scale image. Data acquisition for the velocity estimator used in color flow imaging requires an 
additional 4 to 18 transducer firings per azimuthal line and therefore reduces both the number of 
azimuthal lines and the number of frames per second. If the number of lines per frame is decreased, 
spatial undersampling or a reduced examination area results. If the number of frames per second is 
decreased, temporal undersampling results, and the display becomes difficult to interpret. 

The number of data samples available for each color flow velocity estimate is reduced to 4 to 18 in 
comparison with the 64 to 128 data samples available to estimate velocity in a single sample volume 
Doppler mode. This reduction, required to estimate velocity over the 2D image, produces a large increase 
in the estimator variance. 

Fluid Dynamics and the Cardiovascular System 

In order to predict and adequately assess blood flow profiles within the body, the fluid dynamics of the 
cardiovascular system will be briefly reviewed. The idealized case known as Poiseuille flow will be con- 
sidered first, allowed by a summary of the factors that disturb Poiseuille flow. 

A Poiseuille flow model is appropriate in a long rigid circular pipe at a large distance from the entrance. 
The velocity in this case is described by the equation vlv = 1 - (rid) 1 , where v represents the velocity 
parallel to the wall, v represents the center-line velocity, r is the radial distance variable, and a is the 
radius of the tube. In this case, the mean velocity is half the center-line velocity, and the volume flow 
rate is given by the mean velocity multiplied by the cross-sectional area of the vessel. 

For the actual conditions within the arterial system, Poiseuille flow is only an approximation. The 
actual arterial geometry is tortuous and individualistic, and the resulting flow is perturbed by entrance 
effects and reflections. Reflections are produced by vascular branches and the geometric taper of the 
arterial diameter. In addition, spatial variations in vessel elasticity influence the amplitude and wave 
velocity of the arterial pulse. Several parameters can be used to characterize the velocity profile, including 
the Reynolds number, the Womersly number, the pulsatility index, and the resistive index. The pulsatility 
and resistive indices are frequently estimated during a clinical examination. 

The Reynolds number is denoted Re and measures the ratio of fluid inertia to the viscous forces acting 
on the fluid. The Reynolds number is defined by Re = Dv'/[i k , where v' is the average cross-sectional 
velocity u t is the kinematic viscosity, and D is the vessel diameter. Kinematic viscosity is defined as the 
fluid viscosity divided by the fluid density. When the Reynolds number is high, fluid inertia dominates. 
This is true in the aorta and larger arteries, and bursts of turbulence are possible. When the number is 
low, viscous effects dominate. 
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The Womersly number is used to describe the effect introduced by the unsteady, pulsatile nature of 
the flow. This parameter, defined by a(ro/u t ) 1/2 , where co represents radian frequency of the wave, governs 
propagation along an elastic, fluid-filled tube. When the Womersly number is small, the instantaneous 
profile will be parabolic in shape, the flow is viscous dominated, and the profile is oscillatory and Poiseuille 
in nature. When the Womersly number is large, the flow will be blunt, inviscid, and have thin wall layers 
[Nicholas and O'Rourke, 1990]. 

The pulsatility index represents the ratio of the unsteady and steady velocity components of the flow. 
This shows the magnitude of the velocity changes that occur during acceleration and deceleration of 
blood constituents. Since the arterial pulse decreases in magnitude as it travels, this index is maximum 
in the aorta. The pulsatility index is given by the difference between the peak systolic and minimum 
diastolic values divided by the average value over one cardiac cycle. The Pourcelot, or resistance, index 
is the peak-to-peak swing in velocity from systole to diastole divided by the peak systolic value [Nichols 
and O'Rourke, 1990]. 

Blood Velocity Profiles. Specific factors that influence the blood velocity profile include the entrance 
effect, vessel curvature, skewing, stenosis, acceleration, secondary flows, and turbulence. These effects 
are briefly introduced in this subsection. 

The entrance effect is a result of fluid flow passing from a large tube or chamber into a smaller tube. 
The velocity distribution at the entrance becomes blunt. At a distance known as the entry length, the 
fully developed parabolic profile is restored, where the entry length is given by 0.06Re-(2a) [Nerem, 
1985]. Distal to this point the profile is independent of distance. 

If the vessel is curved, there will also be an entrance effect. The blunt profile in this case is skewed, 
with the peak velocity closer to the inner wall of curvature. When the fully developed profile occurs 
downstream, the distribution will again be skewed, with the maximal velocity toward the outer wall of 
curvature. Skewing also occurs at a bifurcation where proximal flow divides into daughter vessels. The 
higher- velocity components, which occurred at the center of the parent vessel, are then closer to the flow 
divider, and the velocity distribution in the daughter vessels is skewed toward the divider. 

Stenosis, a localized narrowing of the vessel diameter, dampens the pulsatility of the flow and pressure 
waveforms. The downstream flow profile depends on the shape and degree of stenosis. Acceleration adds 
a flat component to the velocity profile. It is responsible for the flat profile during systole, as well as the 
negative flat component near the walls in the deceleration phase. 

Secondary flows are swirling components which are superimposed on the main velocity profile. These 
occur at bends and branches, although regions of secondary flow can break away from the vessel wall 
and are then known as separated flow. These regions reattach to the wall at a point downstream. 

One definition of turbulent flow is flow that demonstrates a random fluctuation in the magnitude 
and direction of velocity as a function of space and time. The intensity of turbulence is calculated using 
the magnitude of the fluctuating velocities. The relative intensity of turbulence is given by I t = u rms /u mmn , 
where w rms represents the root-mean-square value of the fluctuating portion of the velocity, and w mean 
represents the nonfluctuating mean velocity [Hinze, 1975]. 

Clinical Applications and Their Requirements 

Blood flow measurement with ultrasound is used in estimating the velocity and volume of flow within 
the heart and peripheral arteries and veins. Normal blood vessels vary in diameter up to a maximum of 
2 cm, although most vessels examined with ultrasound have a diameter of 1 t 10 mm. Motion of the 
vessel wall results in a diameter change of 5% to 10% during a cardiac cycle. 

Carotid Arteries (Common, Internal, External). The evaluation of flow in the carotid arteries is of 
great clinical interest due to their importance in supplying blood to the brain, their proximity to the 
skin, and the wealth of experience that has been developed in characterizing vascular pathology through 
an evaluation of flow. The size of the carotid arteries is moderate; they narrow quickly from a maximum 
diameter of 0.8 cm. The shape of carotid flow waveforms over the cardiac cycle can be related to the 
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pathophysiology of the circulation. Numerous attempts have been made to characterize the parameters 
of carotid waveforms and to compare these parameters in normal and stenotic cases. A number of indices 
have been used to summarize the information contained in these waveforms. The normal range of the 
Pourcelot index is 0.55 to 0.75. Many researchers have shown that accurate detection of a minor stenosis 
requires accurate quantitation of the entire Doppler spectrum and remains very difficult with current 
technology. The presence of a stenosis causes spectral broadening with the introduction of lower fre- 
quency or velocity components. 

Cardiology. Blood velocity measurement in cardiology requires analysis of information at depths up 
to 18 cm. A relatively low center frequency (e.g., 2.5 to 3.5 MHz) typically is used in order to reduce 
attenuation. Areas commonly studied and the maximum rate of flow include the following [Hatle, 1985] : 

Normal Adult Maximal Velocity (m/s) 



Mitral flow 


0.9 


Tricuspid flow 


0.5 


Pulmonary artery 


0.75 


Left ventricle 


0.9 



Aorta. Aortic flow exhibits a blunt profile with entrance region characteristics. The entrance length is 
approximately 30 cm. The vessel diameter is approximately 2 cm. The mean Reynolds number is 2500 
[Nerem, 1985], although the peak Reynolds number in the ascending aorta can range from 4300 to 8900, 
and the peak Reynolds number in the abdominal aorta is in the range of 400 to 1100 [Nichols and 
O'Rourke, 1990]. The maximal velocity is on the order of 1.35 m/s. The flow is skewed in the aortic arch 
with a higher velocity at the inner wall. The flow is unsteady and laminar with possible turbulent bursts 
at peak systole. 

Peripheral Arteries [Hatsukami et al., 1992]. The peak systolic velocity in centimeters per second and 
standard deviation of the velocity measurement technique are provided below for selected arteries. 



Artery 


Peak Systolic Velocity (cm/s) 


Standard Deviation 


Proximal external iliac 


99 


22 


Distal external iliac 


96 


13 


Proximal common femoral 


89 


16 


Distal common femoral 


71 


15 


Proximal popliteal 


53 


9 


Distal popliteal 


53 


24 


Proximal peroneal 


46 


14 


Distal peroneal 


44 


12 



Nearly all the vessels above normally show some flow reversal during early diastole. A value of the 
pulsatility index of 5 or more in a limb artery is considered to be normal. 

Velocity Estimation Techniques 

Prior to the basic overview of theoretical approaches to target velocity estimation, it is necessary to 
understand a few basic features of the received signal from blood scatterers. It is the statistical correlation 
of the received signal in space and time that provides the opportunity to use a variety of velocity estimation 
strategies. Velocity estimation based on analysis of the frequency shift or the temporal correlation can 
be justified by these statistical properties. 

Blood velocity mapping has unique features due to the substantial viscosity of blood and the spatial 
limitations imposed by the vessel walls. Because of these properties, groups of red blood cells can be 
tracked over a significant distance. Blood consists of a viscous incompressible fluid containing an average 
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FIGURE 65.16 Random concentration of red blood cells within a vessel at times T Y and T 2 , where the change in 
depth from D i to D 2 would be used to estimate velocity. 

volume concentration of red blood cells of 45%, although this concentration varies randomly through 
the blood medium. The red blood cells are primarily responsible for producing the scattered wave, due 
to the difference in their acoustic properties in comparison with plasma. Recent research into the 
characteristics of blood has led to stochastic models for its properties as a function of time and space 
[Angelson, 1980; Atkinson and Berry, 1974; Mo and Cobbold, 1986; Shung et al., 1976, 1992]. The 
scattered signal from an insonified spatial volume is a random process that varies with the fluctuations 
in the density of scatterers in the insonified area, the shear rate within the vessel, and the hematocrit 
[Atkinson and Berry, 1974; Ferrara and Algazi, 1994a, 1994£>; Mo and Cobbold, 1986]. 

Since the concentration of cells varies randomly through the vessel, the magnitude of the returned 
signal varies when the group of scatterers being insonified changes. The returned amplitude from one 
spatial region is independent of the amplitude of the signal from adjacent spatial areas. As blood flows 
through a vessel, it transports cells whose backscattered signals can be tracked to estimate flow velocities. 

Between the transmission of one pulse and the next, the scatterers move a small distance within the 
vessel. As shown in Fig. 65. 1 6, a group of cells with a particular concentration which are originally located 
at depth D l at time T l move to depth D 2 at time T 2 . The resulting change in axial depth produces a 
change in the delay of the signal returning to the transducer from each group of scatterers. This change 
in delay of the radiofrequency (RF) signal can be estimated in several ways. As shown in Fig. 65.17, the 
returned signal from a set of sequential pulses then shows a random amplitude that can be used to 
estimate the velocity. Motion is detected using signal-processing techniques that estimate the shift of the 
signal between pulses. 

Clutter. In addition to the desired signal from the blood scatterers, the received signal contains clutter 
echoes returned from the surrounding tissue. An important component of this clutter signal arises from 
slowly moving vessel walls. The wall motion produces Doppler frequency shifts typically below 1 kHz, 
while the desired information from the blood cells exists in frequencies up to 15 kHz. Due to the smooth 
structure of the walls, energy is scattered coherently, and the clutter signal can be 40 dB above the scattered 
signal from blood. High-pass filters have been developed to remove the unwanted signal from the 
surrounding vessel walls. 
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FIGURE 65.17 Received RF signal from three transmitted pulses, with a random amplitude which can be used to 
estimate the axial movement of blood between pulses. Motion is shown by the shift in the signal with a recognizable 
amplitude. 

Classic Theory of Velocity Estimation. Most current commercial ultrasound systems transmit a train 
of long pulses with a carrier frequency of 2 to 10 MHz and estimate velocity using the Doppler shift of 
the reflected signal. The transmission of a train of short pulses and new signal-processing strategies may 
improve the spatial resolution and quality of the resulting velocity estimate. In order to provide a basis 
for discussion and comparison of these techniques, the problem of blood velocity estimation is considered 
in this subsection from the view of classic velocity estimation theory typically applied to radar and sonar 
problems. 

Important differences exist between classic detection and estimation for radar and sonar and the 
application of such techniques to medical ultrasound. The Van Trees [1971] approach is based on joint 
estimation of the Doppler shift and position over the entire target. In medical ultrasound, the velocity 
is estimated in small regions of a large target, where the target position is assumed to be known. While 
classic theories have been developed for estimation of all velocities within a large target by Van Trees and 
others, such techniques require a model for the velocity in each spatial region of interest. For the case 
of blood velocity estimation, the spatial variation in the velocity profile is complex, and it is difficult to 
postulate a model that can be used to derive a high-quality estimate. The theory of velocity estimation 
in the presence of spread targets is also discussed by Kennedy [1969] and Price [1968] as it applies to 
radar astronomy and dispersive communication channels. 

It is the desire to improve the spatial and velocity resolution of the estimate of blood velocity that has 
motivated the evaluation of alternative wideband estimation techniques. Narrowband velocity estimation 
techniques use the Doppler frequency shift produced by the moving cells with a sample volume that is 
fixed in space. Wideband estimation techniques incorporate the change in delay of the returned pulse 
due to the motion of the moving cells. Within the classification of narrowband techniques are a number 
of estimation strategies to be detailed below. These include the fast Fourier transform (FFT), finite 
derivative estimation, the autocorrelator, and modern spectral estimation techniques, including autore- 
gressive strategies. Within the classification of wideband techniques are cross-correlation strategies and 
the wideband maximum likelihood estimator (WMLE). 

For improving the spatial mapping of blood velocity within the body, the transmission of short pulses 
is desirable. Therefore, it is of interest to assess the quality of velocity estimates made using narrowband 
and wideband estimators with transmitted signals of varying lengths. If {2v/c)BT«l, where v represents 
the axial velocity of the target, c represents the speed of the wave in tissue, B represents the transmitted 
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FIGURE 65.18 Comparison of the axial distance traveled and the effective length of the transmitted pulse. 

signal bandwidth, and T represents the total time interval used in estimating velocity within an individual 
region, then the change in delay produced by the motion of the red blood cells can be ignored [Van 
Trees, 1971]. 

This inequality is interpreted for the physical conditions of medical ultrasound in Fig. 65.18. As shown 
in Fig. 65.18, the value vT represents the axial distance traveled by the target while it is observed by the 
transducer beam, and cl(2B) represents the effective length of the signal that is used to observe the moving 
cells. If vT « c/(2B), the shift in the position of a group of red blood cells during their travel though the 
ultrasonic beam is not a detectable fraction of the signal length. This leads to two important restrictions 
on estimation techniques. First, under the "narrowband" condition of transmission of a long (narrowband) 
pulse, motion of a group of cells through the beam can only be estimated using the Doppler frequency 
shift. Second, if the inequality is not satisfied and therefore the transmitted signal is short (wideband), 
faster-moving red blood cells leave the region of interest, and the use of a narrowband estimation technique 
produces a biased velocity estimate. Thus two strategies can be used to estimate velocity. A long (narrow- 
band) pulse can transmitted, and the signal from a fixed depth then can be used to estimate velocity. 
Alternatively, a short (wideband) signal can be transmitted in order to improve spatial resolution, and the 
estimator used to determine the velocity must move along with the red blood cells. 

The inequality is now evaluated for typical parameters. When the angle between the axis of the beam 
and the axis of the vessel is 45 degrees, the axial distance traveled by the red blood cells while they cross 
the beam is equivalent to the lateral beam width. Using an axial distance vT of 0.75 mm, which is a 
reasonable lateral beam width, and an acoustic velocity of 1540 m/s, the bandwidth of the transmitted 
pulse must be much less than 1.026 MHz for the narrowband approximation to be valid. 

Due to practical advantages in the implementation of the smaller bandwidth required by baseband 
signals, the center frequency of the signal is often removed before velocity estimation. The processing 
required for the extraction of the baseband signal is shown in Fig. 65.19. The returned signal from the 
transducer is amplified and coherently demodulated, through multiplication by the carrier frequency, and 
then a low-pass filter is applied to remove the signal sideband frequencies and noise. The remaining signal 
is the complex envelope. A high-pass filter is then applied to the signal from each fixed depth to remove 
the unwanted echoes from stationary tissue. The output of this processing is denoted as I k (t) for the in- 
phase signal from the fcth pulse as a function of time and Q k (t) for the quadrature signal from the fcth pulse. 

Narrowband Estimation 

Narrowband estimation techniques that estimate velocity for blood at a fixed depth are described in this 
subsection. Both the classic Doppler technique, which frequently is used in single-sample volume systems, 
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FIGURE 65.19 Block diagram of the system architecture required to generate the baseband signal used by several 
estimation techniques. 

and the autocorrelator, which frequently is used in color flow mapping systems, are included, as well as 
a finite derivative estimator and an autoregressive estimator, which have been the subject of previous 
research. The autocorrelator is used in real-time color flow mapping systems due to the ease of imple- 
mentation and the relatively small bias and variance. 

Classic Doppler Estimation. If the carrier frequency is removed by coherently demodulating the signal, 
the change in delay of the RF signal becomes a change in the phase of the baseband signal. The Doppler 
shift frequency from a moving target equals 2f c v/c. With a center frequency of 5 MHz, sound velocity of 
1540 m/s, and blood velocity of 1 m/s, the resulting frequency shift is 6493.5 Hz. For the estimation of 
blood velocity, the Doppler shift is not detectable using a single short pulse, and therefore, the signal 
from a fixed depth and a train of pulses is acquired. 

A pulse-echo Doppler processing block diagram is shown in Fig. 65.20. The baseband signal, from 
Figure 65.19, is shown as the input to this processing block. The received signal from each pulse is 
multiplied by a time window that is typically equal to the length of the transmitted pulse and integrated 
to produce a single data sample from each pulse. The set of data samples from a train of pulses is then 
Fourier-transformed, with the resulting frequency spectrum related to the axial velocity using the Doppler 
relationship. 

Estimation of velocity using the Fourier transform of the signal from a fixed depth suffers from the 
limitations of all narrowband estimators, in that the variance of the estimate increases when a short pulse 
is transmitted. In addition, the velocity resolution produced using the Fourier transform is inversely 
proportional to the length of the data window. Therefore, if 64 pulses with a pulse repetition frequency 
of 5 kHz are used in the spectral estimate, the frequency resolution is on the order of 78.125 Hz (5000/64). 
The velocity resolution for a carrier frequency of 5 MHz and speed of sound of 1540 m/s is then on the 
order of 1.2 cm/s, determined from the Doppler relationship. Increasing the data window only improves 
the velocity resolution if the majority of the red blood cells have not left the sample volume and the flow 
conditions have not produced a decorrelation of the signal. It is this relationship between the data window 
and velocity resolution, a fundamental feature of Fourier transform techniques, that has motivated the 
use of autoregressive estimators. The frequency and velocity resolution are not fundamentally constrained 
by the data window using these modern spectral estimators introduced below. 

Autoregressive Estimation (AR). In addition to the classic techniques discussed previously, higher- 
order modern spectral estimation techniques have been used in an attempt to improve the velocity 
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FIGURE 65.20 Block diagram of the system architecture required to estimate the Doppler spectrum from a set of 
baseband samples from a fixed depth. 
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resolution of the estimate. These techniques are again narrowband estimation techniques, since the data 
samples used in computing the estimate are obtained from a fixed depth. The challenges encountered in 
applying such techniques to blood velocity estimation include the selection of an appropriate order which 
adequately models the data sequence while providing the opportunity for real-time velocity estimation 
and determination of the length of the data sequence to be used in the estimation process. 

The goal in autogressive velocity estimation is to model the frequency content of the received signal 
by a set of coefficients which cold be used to reconstruct the signal spectrum. The coefficients a(m) 
represent the AR parameters of the AR(p) process, where p is the number of poles in the model for the 
signal. Estimation of the AR parameters has been accomplished using the Burg and Levinson-Durban 
recursion methods. The spectrum P(f) is then estimated using the following equation: 



p(f)=k 



1 + 



i 1 
> aim) exp — i2nmf\ 



The poles of the AR transfer function which lie within the unit circle can then be determined based on 
these parameters, and the velocity associated with each pole is determined by the Doppler equation. 

Both autoregressive and autoregressive moving-average estimation techniques have been applied to 
single-sample-volume Doppler estimation. Order selection for single-sample-volume AR estimators is 
discussed in Kaluzinski [1989]. Second-order autoregressive estimation has been applied to color flow 
mapping by Lo upas and McDicken [1990] and Ahn and Park [1991]. Although two poles are not sufficient 
to model the data sequence, the parameters of a higher-order process cannot be estimated in real time. 
In addition, the estimation of parameters of a higher-order process using the limited number of data 
points available in color flow mapping produces a large variance. Loupas and McDicken have used the 
two poles to model the signal returned from blood. Ahn and Park have used one pole to model the 
received signal from blood and the second pole to model the stationary signal from the surrounding tissue. 

While AR techniques are useful in modeling the stationary tissue and blood and in providing a high- 
resolution estimate of multiple velocity components, several problems have been encountered in the 
practical application to blood velocity estimation. First, the order required to adequately model any 
region of the vessel can change when stationary tissue is present in the sample volume or when the range 
of velocity components in the sample volume increases. In addition, the performance of an AR estimate 
degrades rapidly in the presence of white noise, particularly with a small number of data samples. 

Autocorrelator. Kasai et al. [1985] and Barber et al. [1985] discussed a narrowband mean velocity 
estimation structure for use in color flow mapping. The phase of the signal correlation at a lag of one 
transmitted period is estimated and used in an inverse tangent calculation of the estimated mean Doppler 
shift f mtm of the returned signal. A block diagram of the autocorrelator is shown in Fig. 65.21. The 
baseband signal is first integrated over a short depth window. The phase of the correlation at a lag of 
one pulse period is then estimated as the inverse tangent of the imaginary part of the correlation divided 
by the real part of the correlation. The estimated mean velocity v mem of the scattering medium is then 
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FIGURE 65.21 Block diagram of the system architecture required to estimate the mean Doppler shift for each 
depth location using the autocorrelator. 
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determined by scaling the estimated Doppler shift by several factors, including the expected center 
frequency of the returned signal. 

The autocorrelator structure can be derived from the definition of instantaneous frequency, from the 
phase of the correlation at a lag of one period, or as the first-order autoregressive estimate of the mean 
frequency of a baseband signal. The contributions of uncorrelated noise should average to zero in both 
the numerator and denominator of the autocorrelator. This is an advantage because the autocorrelation 
estimate is unbiased when the input signal includes the desired flow signal and noise. Alternatively, in 
the absence of a moving target, the input to the autocorrelator may consist only of white noise. Under 
these conditions, both the numerator and denominator can average to values near zero, and the resulting 
output of the autocorrelator has a very large variance. This estimation structure must therefore be used 
with a power threshold that can determine the presence or absence of a signal from blood flow and set 
the output of the estimator to zero when this motion is absent. 

The variance of the autocorrelation estimate increases with the transmitted bandwidth, and therefore, 
the performance is degraded by transmitting a short pulse. 

Finite Derivative Estimator (FDE). A second approach to mean velocity or frequency estimation is 
based on a finite implementation of a derivative operator. The finite derivative estimator is derived based 
on the first and second moments of the spectrum. The basis for this estimator comes from the definition 
of the spectral centroid: 



ftos(to) 

w 



cka 

(65.27) 
d(S> 



The mean velocity is given by v mean , which is a scaled version of the mean frequency, where the scaling 
constant is given by k' and S(ro) represents the power spectral density. Letting R r {-) represent the complex 
signal correlation and x represent the difference between the two times used in the correlation estimate, 
Eq. (65.27) is equivalent to 
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Writing the baseband signal as the sum 7(f) + jQ(t) and letting E indicate the statistical expectation, 
Brody and Meindl [1974] have shown that the mean velocity estimate can be rewritten as 



mHww 



v ™ = ^ Ul 2 ~M1 " (65 - 29) 

The estimate of this quantity requires estimation of the derivative of the in-phase portion /( t) and 
quadrature portion Q(t) of the signal. For an analog, continuous-time implementation, the bias and 
variance were evaluated by Brody and Meindl [1974]. The discrete case has been studied by Kristoffersen 
[1986]. The differentiation has been implemented in the discrete case as a finite difference or as a finite 
impulse response differentiation filter. The estimator is biased by noise, since the denominator represents 
power in the returned signal. Therefore, for nonzero noise power, the averaged noise power in the 
denominator will not be zero mean and will constitute a bias. The variance of the finite derivative 
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estimator depends on the shape and bandwidth of the Doppler spectrum, as well as on the observation 
interval. 

Wideband Estimation Techniques 

It is desirable to transmit a short ultrasonic pulse in order to examine blood flow in small regions 
individually. For these short pulses, the narrowband approximation is not valid, and the estimation 
techniques used should track the motion of the red blood cells as they move to a new position over time. 
Estimation techniques that track the motion of the red blood cells are known as wideband estimation 
techniques and include cross-correlation techniques, the wideband maximum likelihood estimator and 
high time bandwidth estimation techniques. A thorough review of time-domain estimation techniques 
to estimate tissue motion is presented in Hein and O'Brien [1993]. 

Cross-Correlation Estimator. The use of time shift to estimate signal parameters has been studied 
extensively in radar. If the transmitted signal is known, a maximum likelihood (ML) solution for the 
estimation of delay has been discussed by Van Trees [1971] and others. If the signal shape is not known, 
the use of cross-correlation for delay estimation has been discussed by Helstrom [1968] and Knapp and 
Carter [1976]. If information regarding the statistics of the signal and noise are available, an MLE based 
on cross-correlation has been proposed by Knapp and Carter [1976] known as the generalized correlation 
method for the estimation of time delay. 

Several researchers have applied cross-correlation analysis to medical ultrasound. Bonnefous and 
Pesque [1986], Embree and O'Brien [1986], Foster et al. [1990], and Trahey et al. [1987] have studied 
the estimation of mean velocity based on the change in delay due to target movement. This analysis has 
assumed the shape of the transmitted signal to be unknown, and a cross-correlation technique has been 
used to estimate the difference in delay between successive pulses. This differential delay has then been 
used to estimate target velocity, where the velocity estimate is now based on the change in delay of the 
signal over an axial window, by maximizing the cross-correlation of the returned signal over all possible 
target velocities. Cross-correlation processing is typically performed on the radiofrequency (RF) signal, 
and a typical cross-correlation block diagram is shown in Fig. 65.22. A high-pass filter is first applied to 
the signal from a fixed depth to remove the unwanted return from stationary tissue. One advantage of 
this strategy is that the variance is now inversely proportional to bandwidth of the transmitted signal 
rather than proportional. 

Wideband Maximum Likelihood Estimator (WMLE). Wideband maximum likelihood estimation is a 
baseband strategy with performance properties that are similar to cross-correlation. The estimate of the 
velocity of the blood cells is jointly based on the shift in the signal envelope and the shift in the carrier 
frequency of the returned signal. This estimator can be derived using a model for the signal that is 
expected to be reflected from the moving blood medium after the signal passes through intervening 
tissue. The processing of the signal can be interpreted as a filter matched to the expected signal. A diagram 
of the processing required for the wideband maximum likelihood estimator is shown in Fig. 65.23 [Ferrara 
and Algazi, 1991]. Assume that P pulses were transmitted. Required processing involves the delay of the 
signal from the (P - fc)th pulse by an amount equal to IvIckT, which corresponds to the movement of 
the cells between pulses for a specific v, followed by multiplication by a frequency which corresponds to 
the expected Doppler shift frequency of the baseband returned signal. The result of this multiplication 
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FIGURE 65.22 Block diagram of the system architecture required to estimate the velocity at each depth using a 
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FIGURE 65.23 Block diagram of the system architecture required to estimate the velocity at each depth using the 
wideband MLE. 

is summed for all pulses, and the maximum likelihood velocity is then the velocity which produces the 
largest output from this estimator structure. 

Estimation Using High-Time-Bandwidth Signals. Several researchers have also investigated the use of 
long wideband signals including "chirp" modulated signals and pseduo-random noise for the estimation 
of blood velocity. These signals are transmitted continuously (or with a short "flyback" time). Since these 
signals require continual transmission, the instantaneous power level must be reduced in order to achieve 
safe average power levels. 

Bertram [1979] concluded that transmission of a "chirp" appears to give inferior precision for range 
measurement and inferior resolution of closely spaced multiple targets than a conventional pulse-echo 
system applied to a similar transducer. Multiple targets confuse the analysis. Using a simple sawtooth 
waveform, it is not possible to differentiate a stationary target at one range from a moving target at a 
different range. This problem could possibly be overcome with increasing and decreasing frequency 
intervals. Axial resolution is independent of the modulation rate, dependent only on the spectral fre- 
quency range (which is limited). 

The limitations of systems that have transmitted a long pulse of random noise and correlated the 
return with the transmitted signal include reverberations from outside the sample volume which degrade 
the signal-to-noise ratio (the federally required reduction in peak transmitted power also reduces SNR), 
limited signal bandwidth due to frequency-dependent attenuation in tissue, and the finite transducer 
bandwidth [Bendick and Newhouse, 1974; Cooper and McGillem, 1972]. 

New Directions 

Areas of research interest, including estimation of the 3D velocity magnitude, volume flow estimation, 
the use of high-frequency catheter-based transducers, mapping blood flow within malignant tumors, a 
new display mode known as color Doppler energy, and the use of contrast agents, are summarized in this 
subsection. 

Estimation of the 3D Velocity Magnitude and Beam Vessel Angle. Continued research designed to 
provide an estimate of the 3D magnitude of the flow velocity includes the use of crossed-beam Doppler 
systems [Overbeck et al., 1992; Wang and Yao, 1982] and tracking of speckle in two and three dimensions 
[Trahey et al., 1987]. Mapping of the velocity estimate in two and three dimensions, resulting in a 3D 
color flow map has been described by Carson et al.. [1992],Picot et al. [1993], and Cosgrove etal. [1990]. 

Volume Flow Estimation. Along with the peak velocity, instantaneous velocity profile, and velocity 
indices, a parameter of clinical interest is the volume of flow through vessels as a function of time. 
Estimation strategies for the determination of the volume of flow through a vessel have been described 
by Embree and O'Brien [1990], Gill [1979], Hottinger and Meindl [1979], and Uematsu [1981]. 

Intravascular Ultrasound. It has been shown that intravascular ultrasonic imaging can provide infor- 
mation about the composition of healthy tissue and atheroma as well as anatomic data. A number of 
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researchers have now shown that using frequencies of 30 MHz or above, individual layers and tissue 
types can be differentiated [de Kroon et al., 1991a, 1991 fo; Lockwood et al., 1991]. Although obvious 
changes in the vessel wall, such as dense fibrosis and calcification, have been identified with lower- 
frequency transducers, more subtle changes have been difficult to detect. Recent research has indicated 
that the character of plaque may be a more reliable predictor of subsequent cerebrovascular symptoms 
than the degree of vessel narrowing or the presence of ulceration [Merritt et al., 1992]. Therefore, the 
recognition of subtle differences in tissue type maybe extremely valuable. One signal-processing challenge 
in imaging the vascular wall at frequencies of 30 MHz or above is the removal of the unwanted echo 
from red blood cells, which is a strong interfering signal at high frequencies. 

Vascular Changes Associated with Tumors. Three-dimensional color flow mapping of the vascular 
structure is proposed to provide new information for the differentiation of benign and malignant masses. 
Judah Folkman and associates first recognized the importance of tumor vascularity in 1971 [Folkman et 
al., 1971]. They hypothesized that the increased cell population required for the growth of a malignant 
tumor must be preceded by the production of new vessels. Subsequent work has shown that the walls 
of these vessels are deficient in muscular elements, and this deficiency results in a low impedance to flow 
[Gammill et al., 1976]. This change can be detected by an increase in diastolic flow and a change in the 
resistive index. 

More recently, Less et al. [1991] have shown that the vascular architecture of solid mammary tumors 
has several distinct differences from normal tissues, at least in the microvasculature. A type of network 
exists that exhibits fluctuations in both the diameter and length of the vessel with increasing branch 
order. Current color flow mapping systems with a center frequency of 5 MHz or above have been able 
to detect abnormal flow with varying degrees of clinical sensitivity from 40% to 82% [Balu-Maestro et 
al., 1991; Belcaro et al., 1988; Luska et al., 1992]. Researchers using traditional Doppler systems have also 
reported a range of clinical sensitivity, with a general reporting of high sensitivity but moderate to low 
specificity. Burns et al., [1982] studied the signal from benign and malignant masses with 10-MHz CW 
Doppler. They hypothesized, and confirmed through angiography, that the tumors under study were fed 
by multiple small arteries, with a mean flow velocity below 10 cm/s. Carson et al. [1992] compared 
10-MHz CW Doppler to 5- and 7.5-MHz color flow mapping and concluded that while 3D reconstruction 
of the vasculature could provide significant additional information, color flow mapping systems must 
increase their ability to detect slow flow in small vessels in order to effectively map the vasculature. 

Ultrasound Contrast Agents. The introduction of substances that enhance the ultrasonic echo signal 
from blood primarily through the production of microbubbles is of growing interest in ultrasonic flow 
measurement. The increased echo power may have a significant impact in contrast echocardiography, 
where acquisition of the signal from the coronary arteries has been difficult. In addition, such agents 
have been used to increase the backscattered signal from small vessels in masses that are suspected to be 
malignant. Contrast agents have been developed using sonicated albumen, saccharide microbubbles, and 
gelatin-encapsulated microbubbles. 

Research to improve the sensitivity of flow measurement systems to low-velocity flow and small 
volumes of flow, with the goal of mapping the vasculature architecture, includes the use of ultrasonic 
contrast agents with conventional Doppler signal processing [Hartley et al., 1993] , as well as the detection 
of the second harmonic of the transducer center frequency [Shrope and Newhouse, 1993]. 

Color Doppler Energy. During 1993, a new format for the presentation of the returned signal from the 
blood scattering medium was introduced and termed color Doppler energy (CDE) or color power imaging 
(CPI). In this format, the backscattered signal is filtered to remove the signal from stationary tissue, and 
the remaining energy in the backscattered signal is color encoded and displayed as an overlay on the 
gray-scale image. The advantage of this signal-processing technique is the sensitivity to very low flow 
velocities. 
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Defining Terms 

Baseband signal: The received signal after the center frequency component (carrier frequency) has 
been removed by demodulation. 

Carrier frequency: The center frequency in the spectrum of the transmitted signal. 

Clutter: An unwanted fixed signal component generated by stationary targets typically outside the 
region of interest (such as vessel walls). 

Complex envelope: A signal expressed by the product of the carrier, a high-frequency component, and 
other lower-frequency components that comprise the envelope. The envelope is usually expressed 
in complex form. 

Maximum likelihood: A statistical estimation technique that maximizes the probability of the occur- 
rence of an event to estimate a parameter. ML estimate is the minimum variance, unbiased estimate. 
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Visualization of internal structures of opaque biologic objects is essential in many biomedical studies. 
Limited by the penetration depth of the probing sources (photons and electrons) and the lack of 
endogenous contrast, conventional forms of microscopy such as optical microscopy and electron micros- 
copy require tissues to be sectioned into thin slices and stained with organic chemicals or heavy-metal 
compounds prior to examination. These invasive and destructive procedures, as well as the harmful 
radiation in the case of electron microscopy, make it difficult to obtain three-dimensional information 
and virtually impossible to study biologic tissues in vivo. 

Magnetic resonance (MR) microscopy is a new form of microscopy that overcomes the aforementioned 
limitations. Operating in the radiofrequency (RF) range, MR microscopy allows biologic samples to be 
examined in the living state without bleaching or damage by ionizing radiation and in fresh and fixed 
specimens after minimal preparation. It also can use a number of endogenous contrast mechanisms that 
are directly related to tissue biochemistry, physiology, and pathology. Additionally, MR microscopy is 
digital and three-dimensional; internal structures of opaque tissues can be quantitatively mapped out in 
three dimensions to accurately reveal their histopathologic status. These unique properties provide new 
opportunities for biomedical scientists to attack problems that have been difficult to investigate using 
conventional techniques. 

Conceptually, MR microscopy is an extension of magnetic resonance imaging (MRI) to the microscopic 
domain, generating images with spatial resolution better than 100 urn [Lauterbur, 1984]. As such, MR 
microscopy is challenged by a new set of theoretical and technical problems [Johnson et al., 1992]. For 
example, to improve isotropic resolution from 1 mm to 10 um, signal-to-noise ratio (SNR) per voxel 
must be increased by a million times to maintain the same image quality. In order to do so, almost every 
component of hardware must be optimized to the fullest extent, pulse sequences have to be carefully 
designed to minimize any potential signal loss, and special software and dedicated computation facilities 
must be involved to handle large image arrays (e.g., 256 3 ). Over the past decade, development of MR 
microscopy has focused mainly on these issues. Persistent efforts by many researchers have recently lead 
to images with isotropic resolution of the order of ~10 um [Cho et al., 1992; Jacobs and Fraser, 1994; 
Johnson et al., 1992; Zhou and Lauterbur, 1992]. The significant resolution improvement opens up a 
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broad range of applications, from histology to cancer biology and from toxicology to plant biology 
[Johnson et al., 1992]. In this chapter we will first discuss the basic principles of MR microscopy, with 
special attention to such issues as resolution limits and sensitivity improvements. Then we will give an 
overview of the instrumentation. Finally, we will provide some examples to demonstrate the applications. 

66.1 Basic Principles 

Spatial Encoding and Decoding 

Any digital imaging systems involve two processes. First, spatially resolved information must be encoded 
into a measurable signal, and second, the spatially encoded signal must be decoded to produce an image. 
In MR microscopy, the spatial encoding process is accomplished by acquiring nuclear magnetic resonance 
(NMR) signals under the influence of three orthogonal magnetic field gradients. There are many ways 
that a gradient can interact with a spin system. If the gradient is applied during a frequency-selective RF 
pulse, then the NMR signal arises only from a thin slab along the gradient direction. Thus a slice is 
selected from a three-dimensional (3D) object. If the gradient is applied during the acquisition of an 
NMR signal, the signal will consist of a range of spatially dependent frequencies given by 



G)(r) = yB + yG- r (66.1) 



where y is gyromagnetic ratio, B is the static magnetic field, G is the magnetic field gradient, and r is 
the spatial variable. In this way, the spatial information along G direction is encoded into the signal as 
frequency variations. This method of encoding is called frequency encoding, and the gradient is referred 
to as a frequency-encoding gradient (or read-out gradient). If the gradient is applied for a fixed amount 
of time t before the signal acquisition, then the phase of the signal, instead of the frequency, becomes 
spatially dependent, as given by 



<|>(r)= [ Pe co(r)df = (t) + f "yG-r dt (66.2) 



where <|> is the phase originated from the static magnetic field. This encoding method is known as phase 
encoding, and the gradient is called a phase-encoding gradient. 

Based on the three basic spatial encoding approaches, many imaging schemes can be synthesized. For 
two-dimensional (2D) imaging, a slice-selection gradient is first applied to confine the NMR signal in a 
slice. Spatial encoding within the slice is then accomplished by frequency encoding and/or by phase encod- 
ing. For 3D imaging, the slice-selection gradient is replaced by either a frequency-encoding or a phase- 
encoding gradient. If all spatial directions are frequency-encoded, the encoding scheme is called projection 
acquisition, and the corresponding decoding method is called projection reconstruction [Lai and Lauterbur, 
1981; Lauterbur, 1973]. If one of the spatial dimensions is frequency encoded while the rest are phase 
encoded, the method is known as Fourier imaging, and the image can be reconstructed simply by a 
multidimensional Fourier transform [Edelstein et al., 1980; Kumar et al., 1975]. Although other methods 
do exist, projection reconstruction and Fourier imaging are the two most popular in MR microscopy. 

Projection reconstruction is particularly useful for spin systems with short apparent T 2 values, such as 
protons in lung and liver. Since the T 2 of most tissues decreases as static magnetic field increases, the advantage 
of projection reconstruction is more obvious at high magnetic fields. Another advantage of projection 
reconstruction is its superior SNR to Fourier imaging. This advantage has been theoretically analyzed and 
experimentally demonstrated in a number of independent studies [Callaghan and Eccles, 1987; Gewalt et al., 
1993; Zhou and Lauterbur, 1992]. Recently, it also has been shown that projection reconstruction is less 
sensitive to motion and motion artifacts can be effectively reduced using sinograms [Glover and Noll, 1993; 
Glover and Pauly, 1992; Gmitro and Alexander, 1993]. Unlike projection reconstruction, data acquisition in 
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TABLE 66. 1 Choice of Acquisition Parameters for Different Image Contrasts 
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S SE: spin-echo; GE: gradient echo; FSE: fast spin echo. 

'Subscripts min, max and avg stand for minimum, maximum and average values, 
respectively. 

f A pair of diffusion weighting gradients must be used. 

Fourier imaging generates Fourier coefficients of the image in a cartesian coordinate. Since multidimen- 
sional fast Fourier transform algorithms can be applied directly to the raw data, Fourier imaging is 
computationally more efficient than projection reconstruction. This advantage is most evident when 
reconstructing 3D images with large arrays (e.g., 256 3 ). In addition, Fourier transform imaging is less 
prone to image artifacts arising from various off-resonance effects and is more robust in applications such 
as chemical shift imaging [Brown et al., 1982] and flow imaging [Moran, 1982]. 

Image Contrast 

A variety of contrast mechanisms can be exploited in MR microscopy, including spin density (p), spin- 
spin relaxation time (Tj), spin-lattice relaxation time (X 2 ), apparent T 2 relaxation time (Tj), diffusion 
coefficient (D), flow and chemical shift (8). One of the contrasts can be highlighted by varying data- 
acquisition parameters or by choosing different pulse sequences. Table 66.1 summarizes the pulse 
sequences and data-acquisition parameters to obtain each of the preceding contrasts. 

In high-field MR microscopy (>1.5 T), T 2 and diffusion contrast are strongly coupled together. An 
increasing number of evidences indicate that the apparent T 2 contrast observed in high-field MR micros- 
copy is largely due to microscopic magnetic susceptibility variations [Majumdar and Gore, 1988; Zhong 
and Gore, 1991]. The magnetic susceptibility difference produces strong local magnetic field gradients. 
Molecular diffusion through the induced gradients causes significant signal loss. In addition, the large 
external magnetic field gradients required for spatial encoding further increase the diffusion-induced 
signal loss. Since the signal loss has similar dependence on echo time ( TE) to T 2 -related loss, the diffusion 
contrast mechanism is involved in virtually all X 2 -weighted images. This unique contrast mechanism 
provides a direct means to probe the microscopic tissue heterogeneities and forms the basis for many 
histopathologic studies [Benveniste et al, 1992; Zhou et al., 1994]. 

Chemical shift is another unique contrast mechanism. Changes in chemical shift can directly reveal 
tissue metabolic and histopathologic stages. This mechanism exists in many spin systems such as 'H, 31 P, 
and 13 C. Recently, Lean et al. [1993] showed that based on proton chemical shifts, MR microscopy can 
detect tissue pathologic changes with superior sensitivity to optical microscopy in a number of tumor 
models. A major limitation for chemical-shift MR microscopy is the rather poor spatial resolution, since 
most spin species other than water protons are of considerably low concentration and/or sensitivity. In 
addition, the long data-acquisition time required to resolve both spatial and spectral information also 
appears as an obstacle. 

66.2 Resolution Limits 
Intrinsic Resolution Limit 

Intrinsic resolution is defined as the width of the point-spread function originated from physics laws. In 
MR microscopy, the intrinsic resolution arises from two sources: natural linewidth broadening and 
diffusion [Callaghan and Eccles, 1988; Cho et al., 1988; House, 1984]. 
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In most conventional pulse sequences, natural linewidth broadening affects the resolution limit only 
in the frequency-encoding direction. In some special cases, such as fast spin echo [Hennig et al., 1986] 
and echo planar imaging [Mansfield and Maudsley, 1977], natural linewidth broadening also imposes 
resolution limits in the phase-encoding direction [Zhou et al., 1993]. The natural linewidth resolution 
limit, defined by 
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(66.3) 



is determined by the T 2 relaxation time and can be improved using a stronger gradient G. To obtain 
1-um resolution from a specimen with T 2 = 50 ms, the gradient should be at least 14.9 G/cm. This 
gradient requirement is well within the range of most MR microscopes. 

Molecular diffusion affects the spatial resolution in a number of ways. The bounded diffusion is 
responsible for many interesting phenomena known as edge enhancements [Callaghan et al., 1993; Hills 
et al., 1990; Hyslop and Lauterbur, 1991; Putz et al., 1991]. They are observable only at the microscopic 
resolution and are potentially useful to detect microscopic boundaries. The unbounded diffusion, on the 
other hand, causes signal attenuation, line broadening, and phase misregistration. All these effects 
originate from an incoherent and irreversible phase-dispersion. The root-mean-square value of the phase 
dispersion is 
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where f and tare pulse-sequence-dependent time variables defined by Ahn and Cho [1989]. Because of 
the phase uncertainty, an intrinsic resolution limit along the phase encoding direction arises: 
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For a rectangularly shaped phase-encoding gradient, the preceding equation can be reduced to a very 
simple form: 
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This simple result indicates that the diffusion resolution limit in the phase-encoded direction is deter- 
mined only by the phase-encoding time t (D is a constant for a chosen sample). This is so because the 
phase uncertainty is introduced only during the phase-encoding period. Once the spins are phase- 
encoded, they always carry the same spatial information no matter where they diffuse to. In the frequency- 
encoding direction, diffusion imposes resolution limits by broadening the point-spread function. Unlike 
natural linewidth broadening, broadening caused by diffusion is pulse-sequence-dependent. For the 
simplest pulse sequence, a 3D projection acquisition using free induction decays, the full width at half 
maximum [Callaghan and Eccles, 1988; McFarland, 1992; Zhou, 1992] is 
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Compared with the case of the natural linewidth broadening (Eq. 66.3), the resolution limit caused by 
diffusion varies slowly with the frequency-encoding gradient G fr . Therefore, to improve resolution by a 
same factor, a much larger gradient is required. With the currently achievable gradient strength, the 
diffusion resolution limit is estimated to be 5 to 10 um. 

Digital Resolution Limit 

When the requirements imposed by intrinsic resolution limits are satisfied, image resolution is largely 
determined by the voxel size, provided that SNR is sufficient and the amplitude of physiologic motion 
is limited to a voxel. The voxel size, also known as digital resolution, can be calculated from the following 
equations: 

Frequency-encoding direction: Ax = — — = (66.8 
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Phase-encoding direction: Ay = — — = (66.9) 



where L is the field of view, N is the number of data points or the linear matrix size, G is the gradient 
strength, Av is the receiver bandwidth, A<|) is the phase range of the phase-encoding data (e.g., if the data 
cover a phase range from —n to +71, then A(|) = 2n) and the subscripts x and y represent frequency- and 
phase-encoding directions, respectively. To obtain a high digital resolution, L should be kept minimal, 
while N maximal. In practice, the minimal field of view and the maximal data points are constrained by 
other experimental parameters. In the frequency-encoding direction, decreasing field of view results in 
an increase in gradient amplitude at a constant receiver bandwidth or a decrease in the bandwidth for a 
constant gradient (Eq. 66.8). Since the receiver bandwidth must be large enough to keep the acquisition 
of NMR signals within a certain time window, the largest available gradient strength thus imposes the 
digital resolution limit. In the phase-encoding direction, A(|) is fixed at 2tc in most experiments, and the 
maximum t pe value is refrained by the echo time. Thus digital resolution is also determined by the 
maximum available gradient, as indicated by Eq. (66.9). It has been estimated that in order to achieve 
1-um resolution with a phase-encoding time of 4 ms, the required gradient strength is as high as 
587 G/cm. This gradient requirement is beyond the range of current MR microscopes. Fortunately, the 
requirement is fully relaxed in projection acquisition where no phase encoding is involved. 

Practical Resolution Limit 

The intrinsic resolution limits predict that MR microscopy can theoretically reach the micron regime. 
To realize the resolution, one must overcome several technical obstacles. These obstacles, or practical 
resolution limits, include insufficient SNR, long data-acquisition times, and physiologic motion. At the 
current stage of development, these practical limitations are considerably more important than other 
resolution limits discussed earlier and actually determine the true image resolution. 

SNR is of paramount importance in MR microscopy. As resolution improves, the total number of 
spins per voxel decreases drastically, resulting in a cubic decrease in signal intensity. When the voxel signal 
intensity becomes comparable with noise level, structures become unresolvable even if the digital reso- 
lution and intrinsic resolution are adequate. 

SNR in a voxel depends on many factors. The relationship between SNR and common experimental 
variables is given by 

SNR oc , B A^ n (6610) 
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where B l is the RF magnetic field, B is the static magnetic field, n is the number of average, T is the 
temperature, Av is the bandwidth, k is the Boltzmann constant, and R coil and -R sam p| e are the coil and 
sample resistance, respectively. When small RF coils are used, -R sample is negligible. Since R coi] is proportional 
to JBq due to skin effects, the overall SNR increases as B 7l g. This result strongly suggests that MR 
microscopy be performed at high magnetic field. Another way to improve SNR is to increase the B l field. 
This is accomplished by reducing the size of RF coils [McFarland and Mortara, 1992; Peck et al., 1990; 
Schoeniger et al., 1991; Zhou and Lauterbur, 1992]. Although increasing B and B l is the most common 
approach to attacking the SNR problem, other methods such as signal averaging, pulse-sequence opti- 
mization, and post data processing are also useful in MR microscopy. For example, diffusion-induced 
signal loss can be effectively minimized using diffusion-reduced-gradient (DRG) echo pulse sequences 
[Cho et al., 1992]. Various forms of projection acquisition techniques [Gewalt et al., 1993; Hedges, 1984; 
McFarland and Mortara, 1992; Zhou and Lauterbur, 1992], as well as new A:-space sampling schemes 
[Zhou et al., 1993], also have proved useful in SNR improvements. Recently, Black et al. used high- 
temperature superconducting materials for coil fabrication to simultaneously reduce coil resistance _R coil 
and coil temperature T [Black et al., 1993]. This novel approach can provide up to 70-fold SNR increase, 
equivalent to the SNR gain by increasing the magnetic field strength 1 1 times. 

Long data- acquisition time is another practical limitation. Large image arrays, long repetition times 
(TR), and signal averaging all contribute to the overall acquisition time. For instance, a X 2 -weighted 
image with a 256 3 image array requires a total acquisition time of more than 18 hours (assuming TR = 
500 ms and n = 2). Such a long acquisition time is unacceptable for most applications. To reduce the 
acquisition time while still maintaining the desired contrast, fast-imaging pulse sequences such as echo- 
planar imaging (EPI) [Mansfield and Maudsley, 1977], driven equilibrium Fourier transform (DEFT) 
[Maki et al., 1988], fast low angle shot (FLASH) [Haase et al., 1986], gradient refocused acquisition at 
steady state (GRASS) [Karis et al., 1987], and rapid acquisition with relaxation enhancement (RARE) 
[Hennig et al., 1986] have been developed and applied to MR microscopy. The RARE pulse sequence, or 
fast spin-echo (FSE) [Mulkern et al., 1990], is particularly useful in high-field MR microscopy because of 
its insensitivity to magnetic susceptibility effects as well as the reduced diffusion loss [Zhou et al., 1993]. 
Using fast spin-echo techniques, a 256 3 image has been acquired in less than 2 hours [Zhou et al., 1993]. 

For in vivo studies, the true image resolution is also limited by physiologic motion [Hedges, 1984; 
Wood and Henkelman, 1985]. Techniques to minimize the motion effects have been largely focused on 
pulse sequences and post data processing algorithms, including navigator echoes [Ehman and Felmlee, 
1989], motion compensation using even echo or moment nulling gradients, projection acquisition 
[Glover and Noll, 1993], and various kinds of ghost-image decomposition techniques [Xiang and Hen- 
kelman, 1991]. It should be noted, however, that by refining animal handling techniques and using 
synchronized data acquisition, physiologic motion effects can be effectively avoided and very high quality 
images can be obtained [Hedlund et al., 1986; Johnson et al., 1992]. 

66.3 Instrumentation 

An MR microscope consists of a high-field magnet (>1.5 T), a set of gradient coils, an RF coil (or RF 
coils), and the associated RF systems, gradient power supplies, and computers. Among these components, 
RF coils and gradient coils are often customized for specific applications in order to achieve optimal 
performance. Some general guidelines to design customized RF coils and gradient coils are presented below. 

Radiofrequency Coils 

Many types of RF coils can be used in MR microscopy (Fig. 66.1). The choice of a particular coil 
configuration is determined by specific task and specimen size. If possible, the smallest coil size should 
always be chosen in order to obtain the highest SNR. For ex-vivo studies of tissue specimens, solenoid 
coils are a common choice because of their superior B 1 field homogeneity, high sensitivity, as well as 
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FIGURE 66.1 A range of radio frequency coils are used in MR microscopy, (a) A quadrature birdcage coil scaled 
to the appropriate diameter for rats (6 cm) is used for whole-body imaging, (b) Resonant coils have been constructed 
on microwave substrate that can be surgically implanted to provide both localization and improved SNR. (c) MR 
microscopy of specimens is accomplished with a modified Helmholz coil providing good filling factors, high B 
homogeneity, and ease of access. 
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simplicity in fabrication. Using a 2.9-mm solenoid coil (5 turn), Zhou and Lauterbur [1992] have achieved 
the highest ever reported spatial resolution at 6.4 u.m 3 . Solenoid coil configurations are also used by 
others to obtain images with similar resolution (~10 urn) [Cho et al., 1988; Hedges, 1984; McFarland 
and Mortara, 1992; Schoeniger et al., 1991]. Recently, several researchers began to develop microscopic 
solenoid coils with a size of a few hundred microns [McFarland and Mortara, 1992; Peck et al., 1990]. 
Fabrication of these microcoils often requires special techniques, such as light lithography and electron- 
beam lithography. 

The direction of the B l field generated by a solenoid coil prevents the coil from being coaxially placed 
in the magnet. Thus accessing and positioning samples are difficult. To solve this problem, Banson et al. 
[1992] devised a unique Helmholtz coil that consists of two separate loops. Each loop is made from a 
microwave laminate with a dielectric material sandwiched between two copper foils. By making use of 
the distributed capacitance, the coil can be tuned to a desired frequency. Since the two loops of the coil 
are mechanically separated, samples can be easily slid into the gap between the loops without any 
obstruction. Under certain circumstances, the Helmholtz coil can outperform an optimally designed 
solenoid coil with similar dimensions. 

For in vivo studies, although volume coils such as solenoid coils and birdcage coils can be employed, 
most high-resolution experiments are carried out using local RF coils, including surface coils [Banson 
et al, 1992; Rudin, 1987] and implanted coils [Farmer et al, 1989; Hollett et al, 1987; Zhou et al, 1994]. 
Surface coils can effectively reduce coil size and simultaneously limit the field of view to a small region 
of interest. They can be easily adaptable to the shape of samples and provide high sensitivity in the surface 
region. The problem of inhomogeneous B x fields can be minimized using composite pulses [Hetherington 
et al., 1986] or adiabatic pulses [Ugurbil et al., 1987]. To obtain high-resolution images from regions 
distant from the surface, surgically implantable coils become the method of choice. These coils not only 
give better SNR than optimized surface coils [Zhou et al., 1992] but also provide accurate and consistent 
localization. The latter advantage is particularly useful for time-course studies on dynamic processes such 
as the development of pathology and monitoring the effects of therapeutic drugs. 

The recent advent of high-temperature superconducting (HTS) RF coils has brought new excitement 
to MR microscopy [Black et al., 1993]. The substantial improvement, as discussed earlier, makes signal 
averaging unnecessary. Using these coils, the total imaging time will be solely determined by the efficiency 
to traverse the k space. Although much research is yet to be done in this new area, combination of the 
HTS coils with fast-imaging algorithms will most likely provide a unique way to fully realize the potential 
of MR microscopy and eventually bring the technique into routine use. 

Magnetic Field Gradient Coils 

As discussed previously, high spatial resolution requires magnetic field gradients. The gradient strength 
increases proportional to the coil current and inversely proportional to the coil size. Since increasing 
current generates many undesirable effects (overheating, mechanical vibrations, eddy current, etc.), 
strong magnetic field gradient is almost exclusively achieved by reducing the coil diameter. 

Design of magnetic field gradient coils for MR microscopy is a classic problem. Based on Maxwell 
equations, ideal surface current density can be calculated for a chosen geometry of the conducting surface. 
Two conducting surfaces are mostly used: a cylindrical surface parallel to the axis of the magnet and a 
cylindrical surface perpendicular to the axis. The ideal surface current density distributions for these two 
geometries are illustrated in Fig. 66.2 [Suits and Wilken, 1989]. After the ideal surface current density 
distribution is obtained, design of gradient coils is reduced to a problem of using discrete conductors with 
a finite length to approximate the continuous current distribution function. The error in the approximation 
determines the gradient linearity. Recent advancements in computer-based fabrication and etching tech- 
niques have made it feasible to produce complicated current density distributions. Using these techniques, 
nonlinear terms up to the eleventh order can be eliminated over a predefined cylindrical volume. 

Another issue in gradient coil design involves minimizing the gradient rise time so that fast-imaging 
techniques can be implemented successfully and short echo times can be achieved to minimize signal 
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FIGURE 66.2 (Left) Ideal surface current distributions to generate linear magnetic field gradients when the gradient 
coil former is parallel to the magnet bore: (b) for x gradient and (c) for z gradient. The analytical expressions for 
the current density distribution functions are J x = KG X [R cos G(sin 9i - cos Gj)-zsin Gfc] and J z = KG z Z[sin Gi - cos 
9/]. The y gradient can be obtained by rotating J x 90 degrees. (Right) Ideal surface current distributions to generate 
linear magnetic field gradients when the gradient coil former is perpendicular to the magnet bore: (e for x gradient 
and (f) for y gradient. The analytical expressions for the current density distribution functions are: J x = KG x R(s'm 
2Qj) and J = KG r (- R cos 2 6i + y sin / + 0.5R sin 2Gfc). The z gradient can be obtained by rotating ] x 45 degrees. (The 
graphs are adapted based on Suits and Wilken [1989]). 

loss for short T 2 specimens. The gradient rise time relies on three factors: the inductance over resistance 
ratio of the gradient coil, the time constant of the feedback circuit of the gradient power supply, and the 
decay rate of eddy current triggered by gradient switching. The time constant attributed to inductive 
resistance (LIR) is relatively short (<100 (j,s) for most microscopy gradient coils, and the inductive 
resistance from the power supply can be easily adjusted to match the time constant of the coil. However, 
considering the high magnetic field gradient strength used in MR microscopy, eddy currents can be a 
serious problem. This problem is even worsened when the gradient coils are closely placed in a narrow 
magnet bore. To minimize the eddy currents, modern design of gradient coils uses an extra set of coils 
so that the eddy currents can be actively canceled [Mansfield and Chapman, 1986] . Under close to optimal 
conditions, a rise time of <150 (j,s can be achieved with a maximum gradient of 82 G/cm in a set of 8-cm 
coils [Johnson et al., 1992]. 

Although the majority of microscopic MR images are obtained using cylindrical gradient coils, surface 
gradient coils have been used recently in several studies [Cho et al., 1992]. Similar to surface RF coils, 
surface gradient coils can be easily adapted to the shape of samples and are capable of producing strong 
magnetic field gradient in limited areas. The surface gradient coils also provide more free space in the 
magnet, allowing easy access to samples. A major problem with surface gradient coils is the gradient 
nonlinearity. But when the region of interest is small, high-quality images can still be obtained with 
negligible distortions. 
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FIGURE 66.3 Selected sections of 3D isotropic images of a sheep heart with experimentally induced infarct show 
the utility of MRM in pathology studies. A 3D FSE sequence has been designed to allow rapid acquisition of either 
(a) r r weighted or (b) T 2 -weighted images giving two separate "stains" for the pathology. Arrows indicate areas of 
necrosis 



66.4 Applications 



MR microscopy has a broad range of applications. We include here several examples. Figure 66.3 illustrates 
an application of MR microscopy in ex vivo histology. In this study, a fixed sheep heart with experimen- 
tally-induced infarct is imaged at 2 T using 3D fast-spin-echo techniques with T l (Fig. 66.3a) and T 2 
(Fig. 66.3b) contrasts. The infarct region is clearly detected in both images. The region of infarct can be 
segmented from the rest of the tissue and its volume can be accurately measured. Since the image is 
three-dimensional, the tissue pathology can be examined in any arbitrary orientation. The nondestructive 
nature of MR microscopy also allows the same specimen to be restudied using other techniques as well 
as using other contrast mechanisms of MR microscopy. Obtaining this dimension of information from 
conventional histologic studies would be virtually impossible. 

The in vivo capability of MR microscopy for toxicologic studies is illustrated in Fig. 66.4. In this study 
[Farmer et al., 1989], the effect of mercuric chloride in rat kidney is monitored in a single animal. 
Therefore, development of tissue pathology over a time period is directly observed without the unnec- 
essary interference arising from interanimal variabilities. Since the kidney was the only region of interest 
in this study, a surgically implanted coil was chosen to optimize SNR and to obtain consistent localization. 
Figure 66.4 shows four images obtained from the same animal at different time points to show the 
progression and regression of the HgCl 2 -induced renal pathology. Tissue damage is first observed 24 hours 
after the animal was treated by the chemical, as evident by the blurring between the cortex and the outer 
medulla. The degree of damage is greater in the image obtained at 48 hours. Finally, at 360 hours, the 
blurred boundary between the two tissue regions completely disappeared, indicating full recovery of the 
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FIGURE 66.4 Implanted RF coils allow in vivo studies of deep structures with much higher spatial resolution by 
limiting the field of view during excitation and by increasing the SNR over volume coils. An added benefit is the 
ability to accurately localize the same region during a time course study. Shown here is the same region of a kidney 
at four different time points following exposure to mercuric chloride. Note the description of boundaries between 
the several zones of the kidney in the early part of the study followed by regeneration of the boundaries upon repair. 

organ. The capability to monitor tissue pathologic changes in vivo, as illustrated in this example, bodes 
well for a broad range of applications in pharmacology, toxicology, and pathology. 

Development biology is another area where MR microscopy has found an increasing number of 
applications, Jacobs and Fraser [1994] used MR microscopy to follow cell movements and lineages in 
developing frog embryos. In their study, 3D images of the developing embryo were obtained on a time 
scale faster than the cell division time and analyzed forward and backward in time to reconstruct full 
cell divisions and cell movements. By labeling a 16-cell embryo with an exogenous contrast agent (Gd- 
DTPA), they successfully followed the progression from early cleavage and blastula stage through gas- 
trulation, neurulation, and finally to tail bud stage. More important, they found that external ectodermal 
and internal mesodermal tissues extend at different rates during amphibian gastrulation and neurulation. 
This and many other key events in vertebrate embryogenesis would be very difficult to observe with 
optical microscopy. Another example in developmental biology is given in Fig. 66.5. Using 3D high-field 
(9.4-T) MR microscopy with large image arrays (256 3 ), Smith et al. [1993] studied the early development 
of the circulatory system of mouse embryos. With the aid of a T l contrast agent made from bovine serum 
albumin and Gd-DTPA, vasculature such as ventricles, atria, aorta, cardinal sinuses, basilar arteries, and 
thoracic arteries are clearly identified in mouse embryos at between 9.5 and 12.5 days of gestation. The 
ability to study embryonic development in a noninvasive fashion provided great opportunities to explore 
many problems in transgenic studies, gene targeting, and in situ hybridization. 
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FIGURE 66.5 Isotropic 3D images of fixed mouse embryos at three stages of development have been volume 
rendered to allow visualization of the developing vascular anatomy. 

In less than 10 years, MR microscopy has grown from a scientific curiosity to a tool with a wide range 
of applications. Although many theoretical and experimental problems still exist at the present time, 
there is no doubt that MR microscopy will soon make a significant impact in many areas of basic research 
and clinical diagnosis. 
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Further Information 

A detailed description of the physics of NMR and MRI can be found in Principles of Magnetic Resonance, 
by C. S. Slichter (3rd edition, Springer- Verlag, 1989), in NMR Imaging in Biology and Medicine, by 
P. Morris (Clarendon Press, 1986), and in Principles of Magnetic Resonance Microscopy, by P. T. Callaghan 
(Oxford Press, 1991). The latter two books also contain detailed discussions on instrumentation, data 
acquisition, and image reconstruction for conventional and microscopic magnetic resonance imaging. 

Magnetic Resonance Microscopy, edited by Blumich and Kuhn (VCH, 1992), is particularly helpful to 
understand various aspects of MR microscopy, both methodology and applications. Each chapter of the 
book covers a specific topic and is written by experts in the field. 

Proceedings of the Society of Magnetic Resonance (formerly Society of Magnetic Resonance in Medicine, 
Berkeley, California), published annually, documents the most recent developments in the field of MR 
microscopy. Magnetic Resonance in Medicine, Journal of Magnetic Resonance Imaging, and Magnetic 
Resonance Imaging, all monthly journals, contain original research articles and are good sources for up- 
to-date developments. 
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67.1 Radiopharmaceuticals 



Thomas F. Budinger and Henry F. VanBrocklin 

Since the discovery of artificial radioactivity a half century ago, radiotracers, radionuclides, and radio- 
nuclide compounds have played a vital role in biology and medicine. Common to all is radionuclide 
(radioactive isotope) production. This section describes the basic ideas involved in radionuclide produc- 
tion and gives examples of the applications of radionuclides. The field of radiopharmaceutical chemistry 
has fallen into subspecialties of positron-emission tomography (PET) chemistry and general radiophar- 
maceutical chemistry, including specialists in technetium chemistry, taking advantage of the imaging 
attributes of technetium-99m. 

The two general methods of radionuclide production are neutron addition (activation) from neutron 
reactors to make neutron-rich radionuclides which decay to give off electrons and gamma rays and 
charged-particle accelerators (linacs and cyclotrons) which usually produce neutron-deficient isotopes 
that decay by electron capture and emission of x-rays, gamma rays, and positrons. The production of 
artificial radionuclides is governed by the number of neutrons or charged particles hitting an appropriate 
target per time, the cross section for the particular reaction, the number of atoms in the target, and the 
half-life of the artificial radionuclide: 
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where A(t) is the produced activity in number of atoms per second, N is the number of target nuclei, a 
is the cross section (probability that the neutron or charged particles will interact with the nucleus to 
form the artificial radioisotope) for the reaction, § is the flux of charged particles, and T ll2 is the half- 
life of the product. Note that N is the target mass divided by the atomic weight and multiplied by 
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FIGURE 67.1 (a) High specific activity neutron-excess radionuclides are produced usually through the (n, y), (n, p) 
or (n, a) reactions. The product nuclides usually decay p~ followed by y. Most of the reactor produced radionuclides 
are produced by the (n, y) reaction. (6) Cyclotrons and linear accelerators (linacs) are sources of beams of protons, 
deuterons, or helium ions which bombard targets to produce neutron-deficient radionuclides. 

Avogadro's number (60.24 x 10 23 ) and a is measured in cm 2 . The usual flux is about 10 14 neutrons per 
second or, for charged particles, 10 to 100 uA, which is equivalent to 6.25 x 10 13 to 6.25 x 10 14 charged 
particles per second. 

Nuclear Reactor-Produced Radionuclides. 

Thermal neutrons of the order of 10 14 neutrons/s/cm 2 are produced in a nuclear reactor usually during 
a controlled nuclear fission of uranium, though thorium or plutonium are also used. High specific activity 
neutron-rich radionuclides are produced usually through the (n, y), {n, p),or (n, a) reactions (Fig. 67. la). 
The product nuclides usually decay by |3~ followed by y. Most of the reactor-produced radionuclides are 
produced by the (n, y) reaction. The final step in the production of a radionuclide consists of the 
separation of the product nuclide from the target container by chemical or physical means. 

An alternative method for producing isotopes from a reactor is to separate the fission fragments from 
the spent fuel rods. This is the leading source of "Mo for medical applications. The following two methods 
of "Mo production are examples of carrier-added and no-carrier-added radionuclide synthesis, respec- 
tively. In Fig. 67.1a, the "Mo is production from 98 Mo. Only a small fraction of the 98 Mo nuclei will be 
converted to "Mo. Therefore, at the end of neutron bombardment, there is a mixture of both isotopes. 
These are inseparable by conventional chemical separation techniques, and both isotopes would partic- 
ipate equally well in chemical reactions. The "Mo from fission of 238 U would not contain any other 
isotopes of Mo and is considered carrier-free. Thus radioisotopes produced by any means having the 
same atomic number as the target material would be considered carrier-added. Medical tracer techniques 
obviate the need for carrier-free isotopes. 

Accelerator-Produced Radionuclides 

Cyclotrons and linear accelerators (linacs) are sources of beams of protons, deuterons, or helium ions 
that bombarded targets to produce neutron-deficient (proton-rich) radionuclides (Fig. 67.1b). The neu- 
tron-deficient radionuclides produced through these reactions are shown in Table 67.1. These product 
nuclides (usually carrier-free) decay either by electron capture or by positron emission tomography or 
both, followed by y emission. In Table 67.2, most of the useful charged-particle reactions are listed. 

The heat produced by the beam current on the target material can interfere with isotope production 
and requires efficient heat-removal strategies using extremely stable heat-conducting target materials 
such as metal foils, electroplates metals, metal powders, metal oxides, and salts melted on duralmin plate. 
All the modern targets use circulating cold deionized water and/or chilled helium gas to aid in cooling 
the target body and window foils. Cyclotrons used in medical studies have U C, 13 N, 15 0, and 18 F production 
capabilities that deliver the product nuclides on demand through computer-executed commands. The 
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TABLE 67.1 Radionuclides Used in Biomedicine 



Application(s) 

A positron emitting chemical analog of phosphorus 

Parent in the generator system for producing the positron emitting 128 Cs, a potassium analog 

Berylliosis studies 

Radioimmunotherapy 

Used in metabolic studies and studies of estrogen receptor content 

Positron emitter for metabolism imaging 

Calibration of imaging instruments 

Heart perfusion 

Used as a clinical diagnostic agent for cancer and metabolic disorders 

Radioimmunotherapy 

Used to assess red blood cell survival 

Positron emitter used in glucose analogs uptake and neuroreceptor imaging. 

Required in calibrating PET tomographs. Potential antibody level 

Parent in the generator system for producing the positron emitting 68 Ga 

Radioimmunotherapy 

Positron emitter for blood flow studies 

SPECT brain imaging agent 

Radioimmunotherapy, positron emitter 

Used as a potential cancer therapeutic agent 

Used to diagnose and treat thyroid disorders including cancer 

Used as an iron tracer, positron emitter for bone-marrow imaging 

Magnesium tracer which decays to 2.3 in aluminum-28 

Flow tracer for heart muscle 

Parent in the generator system for producing 195m Au, which is used in cardiac blood pool 

studies 
Used to produce technetium-99m, the most commonly used radioisotope in clinical nuclear 

medicine 
Positron emitter used as ,3 NH for heart perfusion studies 
Decays to iridium- 191 used for cardiac studies 
Positron emitter used for blood flow studies as H 5 2 
Used in the treatment of prostate cancer 
Used in cancer treatment, cell metabolism and kinetics, molecular biology, genetics research, 

biochemistry, microbiology, enzymology, and as a starter to make many basic chemicals 

and research products 
Used for treatment of medullary thyroid carcinoma and alleviation of pain in bone 

metastases 
Positron emitter used for heart perfusion studies 
Hepatobiliary function, tumor and inflammation localization 
Treatment of ocular cancer 
Used to radiolabel various molecules as cancer therapeutic agents and to alleviate bone 

cancer pain 
Radioimmunotherapy 
Used in the therapy of cancer 
Parent in the generator system for producing the positron emitting 82 Rb, a potassium 

analogue 
Used to study bone formation metabolism 
Used to alleviate metastatic bone pain 
Used in studies of cell metabolism and kinetics, molecular biology, genetics research, 

biochemistry, microbiology, enzymology, and as a start to make many basic chemicals and 

research products 
The most widely used radiopharmaceutical in nuclear medicine and produced from 

molybdenum-99 
Cardiac imaging agent 
Palliative treatment of bone cancer pain 
Used to make tritiated water which is used as a starter for thousands of different research 

products and basic chemicals; used for life science and drug metabolism studies to ensure 

the safety of potential new drugs 



Radionuclide 


Half-Life 


Arsenic-74* 


17.9 d 


Barium- 128* 


2.4 d 


Berylium-7* 


53.37 d 


Bromine-77 


57 h 


Bromin-82 


35.3 h 


Carbon-11* 


20.3 min 


Cobalt-57* 


270 d 


Copper-62 


9.8 min 


Copper-64 


12.8 h 


Copper-67 


58.5 h 


Chromium-51 


27.8 d 


Fluorine- 18 


109.7 min 


Gallium-68 


68 min 


Germanium-68* 


287 d 


Indium-Ill* 


2.8 d 


Iodine-122 


3.76 min 


Iodine- 123* 


13.3 h 


Iodine- 124* 


4.2 d 


Iodine- 125 


60.2 d 


Iodine- 131 


8.1 d 


lron-52* 


8.2 h 


Magnesium-28* 


21.2 h 


Magnese-52m 


5.6 d 


Mercury- 195m* 


40 h 


Molybdenum-99 


67 h 


Nitrogen-13* 


9.9 min 


Osmium-191 


15 d 


Oxygen- 15* 


123 s 


Palladium-103 


17 d 


Phosphorus-32 


14.3 d 


Rhenium- 188 


17 h 


Rubidium-82* 


1.2 min 


Ruthemiun-97* 


2.9 d 


Samarium-145 


340 d 


Samarium- 153 


46.8 h 


Scandium-47 


3.4 d 


Scandium-47* 


3.4 d 


Strontium-82* 


64.0 d 


Strontium-85 


64 d 


Strontium-89 


52 d 


Sulfur-35 


87.9 d 


Technetium-99m 


6h 


Thalium-201* 


74 h 


Tin- 11 7m 


14.0 d 


Tritium 


12.3 yr 


(hydrogen-3) 
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TABLE 67.1 (continued) Radionuclides Used in Biomedicine 



Radionuclide 


Half-Lii 


Tungsten-178* 


21.5 d 


Tungsten-188 


69 d 


Vanadium-48* 


16.0 d 


Xenon-122* 


20 h 


Xenon- 127* 


36.4 d 


Xenon-133 


5.3 d 


Yttrium-88* 


106.6 d 


Yttrium-90 


64 h 


Zinc-62* 


9.13 h 


Zirconium-89* 


78.4 h 



Application(s) 

Parent in generator system for producing 178 Ta, short lived scanning agent 

Decays to rhenium- 188 for treatment of cancer and rheumatoid arthritis 

Nutrition and environmental studies 

Parent in the generator system for producing the positron emitting 122 I 

Used in lung ventilation studies 

Used in lung ventilation and perfusion studies 

Radioimmunotherapy 

Used to radiolabel various molecules as cancer therapeutic agents 

Parent in the generator system for producing the positron emitting 62 Cu 

Radioimmunotherapy, positron emitter 



* Produced by accelerated charged particles. Others are produced by neutron reactors. 

radionuclide is remotely transferred into a lead-shielded hot cell for processing. The resulting radionu- 
clides are manipulated using microscale radiochemical techniques: small-scale synthetic methodology, 
ion-exchange chromatography, solvent extraction, electrochemical synthesis, distillation, simple filtra- 
tion, paper chromatography, and isotopic carrier precipitation. Various relevant radiochemical techniques 
have been published in standard texts. 

Generator-Produced Radionuclides 

If the reactor, cyclotron, or natural product radionuclide of long half-life decays to a daughter with 
nuclear characteristics appropriate for medical application, the system is called a medical radionuclide 
generator. There are several advantages afforded by generator-produced isotopes. These generators rep- 
resent a convenient source of short-lived medical isotopes without the need for an on-site reactor or 
particle accelerator. Generators provide delivery of the radionuclide on demand at a site remote from 
the production facility. They are a source of both gamma- and positron-emitting isotopes. 

The most common medical radionuclide generator is the "Mo — > 99m Tc system, the source of 99m Tc, 
a gamma-emitting isotope currently used in 70% of the clinical nuclear medicine studies. The "Mo has 
a 67-hour half-life, giving this generator a useful life of about a week. Another common generator is the 
68 Ge — » 68 Ga system. Germanium (half-life is 287 days) is accelerator-produced in high-energy accelerators 
(e.g., BLIP, LAMPF, TRIUMF) through the alpha-particle bombardment of 66 Zn. The 68 Ge decays to 68 Ga, 
a positron emitter, which has a 68-minute half-life. Gallium generators can last for several months. 

The generator operation is fairly straightforward. In general, the parent isotope is bound to a solid 
chemical matrix (e.g., alumina column, anionic resin, Donux resin). As the parent decays, the daughter 
nuclide grows in. The column is then flushed ("milked") with a suitable solution (e.g., saline, hydrochloric 
acid) that elutes the daughter and leaves the remaining parent absorbed on the column. The eluent may 
be injected directly or processed into a radiopharmaceutical. 



Radiopharmaceuticals 

99m Tc is removed from the generator in the form of Tc0 4 ~ (pertechnetate). 
This species can be injected directly for imaging or incorporated into a 
variety of useful radiopharmaceuticals. The labeling of 99m Tc usually involves 
reduction complexation/chelation. 99m Tc-Sestamibi (Fig. 67.2) is a radiop- 
harmaceutical used to evaluate myocardial perfusion or in the diagnosis of 
cancer. There are several reduction methods employed, including Sn(II) 
reduction in NaHC0 3 at pH of 8 and other reduction and complexation 
reactions such as S 2 3 + HC1, FeCl 3 + ascorbic acid, LiBH 4 , Zn + HC1, HC1, 
Fe(II), Sn(II)F 2 , Sn(II) citrate, and Sn(II) tartrate reduction and complex- 
ation, electrolytic reduction, and in vivo labeling of red cells following Sn(II) 



TABLE 67.2 Important 
Reactions for Cyclotron- 
Produced Radioisotopes 



1. 


p, n 


7. 


p,pn 


2. 


p,2n 


8. 


p,2p 


3. 


d, n 


9. 


tl, p 


4. 


d, 2« 


10. 


d.'He 


5. 


p, "He 


11. 


p, d 


6. 


p, "He« 


12. 


4 He, n 






13. 


3 He,p 



) 2000 by CRC Press LLC 



pyrophosphate or Sn(II) DTPA administration. 131 I, 
125 I, and 123 I labeling requires special reagents or con- 
ditions such as chloramine-T, widely used for protein 
labeling at 7.5 pH; peroxidase + H 2 2 widely used for 
radioassay tracers; isotopic exchange for imaging trac- 
ers; excitation labeling as in 123 Xe — » 123 I diazotization 
plus iodination for primary amines; conjugation label- 
ing with Bolton Hunter agent (N-succinimidyl 3- [4- 
hydroxy 5-( 131 ' 125 ' 123 I)iodophenyl] propionate); 
hydroboration plus iodination; electrophilic destanny- 
lation; microdiffusion with fresh iodine vapor; and 
other methods. Radiopharmaceuticals in common use 
for brain perfusion studies are N-isopropyl-p [ 123 I] 
iodoamphetamine and 99m Tc-labeled hexamethyl- 
propyleneamine. 

PET Radionuclides 



B Tc0 4 



Sestamibi 

+ [(CH 3 ) 2 C(OMe)CH 2 NC] 4 CuBF 4 
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FIGURE 67.2 " m Tc-Sestamibi is a radiopharma- 
ceutical used to evaluate myocardial perfusion or in 
the diagnosis of cancer using both computed tomog- 
raphy and scintigraphy techniques. 



For n C, 13 N, 15 0, and 18 F, the modes of production of 

short-lived positron emitters can dictate the chemical 

form of the product, as shown in Table 67.3. Online 

chemistry is used to make various PET agents and 

precursors. For example, n C cyanide, an important precursor for synthesis of other labeled compounds, 

is produced in the cyclotron target by first bombarding N 2 + 5% H 2 gas target with 20-MeV protons. 

The product is carbon-labeled methane, U CH 4 , which when combined with ammonia and passed over 

a platinum wool catalyst at 1000°C becomes U CN~, which is subsequently trapped in NaOH. 

Molecular oxygen is produced by bombarding a gas target of 14 N 2 + 2% 2 with deuterons (6 to 
8 MeV). A number of products (e.g., 15 2 , C 15 2 , N 15 2 , 15 3 , and Fi 2 15 0) are trapped by soda lime 
followed by charcoal to give 15 2 as the product. However, if an activated charcoal trap at 900°C is used 
before the soda lime trap, the 15 2 will be converted to C I5 0. The specific strategies for other online PET 
agents is given in Rayudu [1990]. 

Fluorine- 18 ( 18 F) is a very versatile positron emitting isotope. With a 2-hour half- life and two forms 
(F + and F~), one can develop several synthetic methods for incorporating 18 F into medically useful 
compounds [Kilbourn, 1990]. Additionally, fluorine forms strong bonds with carbon and is roughly the 
same size as a hydrogen atom, imparting metabolic and chemical stability of the molecules without 
drastically altering biologic activity. The most com- 
monly produced 18 F radiopharmaceutical is 2-deoxy-2- 
[ 18 F]fluoroglucose (FDG). This radiotracer mimics part 
of the glucose metabolic pathway and has shown both 
hypo- and hypermetabolic abnormalities in cardiology, 
oncology, and neurology. The synthetic pathway for the 
production of FDG is shown in Fig. 67.3. 

The production of positron radiopharmaceuticals 
requires the rapid incorporation of the isotope into the 
desired molecule. Chemical techniques and synthetic 
strategies have been developed to facilitate these reac- 
tions. Many of these synthetic manipulations require 
hands-on operations by a highly trained chemist. Addi- 
tionally, since positrons give off 2- to 511-keV gamma 
rays upon annihilation, proximity to the source can 
increase one's personal dose. The demand for the routine 



TABLE 67.3 Major Positron-Emitting 
Radionuclides Produced by Accelerated Protons 



Radionuclide 


Half-Life 


Reaction 


Carbon- 11 


20 min 


12 C (p, pn) "C 
14 N (p, a) "C 


Nitrogen- 13 


10 min 


"O (p, a) "N 
13 C (p, n) 13 N 


Oxygen- 1 5 


2 min 


15 N (p, n) ls O 
14 N (d, n) ls O 


Fluorine- 18 


110 min 


ls O (p, n) 18 F 
20 Ne (d, a) 18 F 



Note: A(x, y)B: A is target, x is the bombarding 
particle, y is the radiation product, and B is the 
isotope produced. 
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Synthesis of 18 F-FIuorodeoxygIucose ( 18 F-FDG) 
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FIGURE 67.3 Schematic for the chemical production of deoxyglucose labelled with fluorine- 18. Here K222 refers 
to Kryptofix and C18 denotes a reverse-phase high-pressure liquid chromatography column. 

production of positron radiopharmaceuticals such as FDG has led to the development of remote synthetic 
devices. These devices can be human-controlled (i.e., flipping switches to open air-actuated valves), 
computer-controlled, or robotic. A sophisticated computer-controlled chemistry synthesis unit has been 
assembled for the fully automated production of 18 FDG [Padgett, 1989]. A computer-controlled robot 
has been programmed to produce 6a-[ 18 F]fluoroestradiol for breast cancer imaging [Brodack, 1986; 
Mathias, 1987]. Both these types of units increase the availability and reduce the cost of short-lived 
radiopharmaceutical production through greater reliability and reduced need for a highly trained staff. 
Additionally, these automated devices reduce personnel radiation exposure. These and other devices are 
being designed with greater versatility in mind to allow a variety of radiopharmaceuticals to be produced 
just by changing the programming and the required reagents. 

These PET radionuclides have been incorporated into a wide variety of medically useful radiophar- 
maceuticals through a number of synthetic techniques. The development of PET scanner technology has 
added a new dimension to synthetic chemistry by challenging radiochemists to devise labeling and 
purification strategies that proceed on the order of minutes rather than hours or days as in conventional 
synthetic chemistry. To meet this challenge, radiochemists are developing new target systems to improve 
isotope production and sophisticated synthetic units to streamline routine production of commonly 
desired radiotracers as well as preparing short-lived radiopharmaceuticals for many applications. 
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67.2 Instrumentation 

Thomas F. Budingei 

Background 

The history of positron-emission tomography (PET) can be traced to the early 1950s, when workers in 
Boston first realized the medical imaging possibilities of a particular class of radioactive substances. It was 
recognized then that the high-energy photons produced by annihilation of the positron from positron- 
emitting isotopes could be used to describe, in three dimensions, the physiologic distribution of "tagged" 
chemical compounds. After two decades of moderate technological developments by a few research centers, 
widespread interest and broadly based research activity began in earnest following the development of 
sophisticated reconstruction algorithms and improvements in detector technology. By the mid-1980s, PET 
had become a tool for medical diagnosis and for dynamic studies of human metabolism. 

Today, because of its million-fold sensitivity advantage over magnetic resonance imaging (MRI) in 
tracer studies and its chemical specificity, PET is used to study neuroreceptors in the brain and other 
body tissues. In contrast, MRI has exquisite resolution for anatomic (Fig. 67.4) and flow studies as well 
as unique attributes of evaluating chemical composition of tissue but in the millimolar range rather than 
the nanomolar range of much of the receptor proteins in the body. Clinical studies include tumors of 
the brain, breast, lungs, lower gastrointestinal tract, and other sites. Additional clinical uses include 
Alzheimer's disease, Parkinson's disease, epilepsy, and coronary artery disease affecting heart muscle 
metabolism and flow. Its use has added immeasurably to our current understanding of flow, oxygen 
utilization, and the metabolic changes that accompany disease and that change during brain stimulation 
and cognitive activation. 

PET Theory 

PET imaging begins with the injection of a metabolically active tracer — a biologic molecule that carries 
with it a positron-emitting isotope (e.g., U C, 13 N, 15 0, or 18 F). Over a few minutes, the isotope accumulates 
in an area of the body for which the molecule has an affinity. As an example, glucose labeled with U C, 
or a glucose analogue labeled with 18 F, accumulates in the brain or tumors, where glucose is used as the 
primary source of energy. The radioactive nuclei then decay by positron emission. In positron (positive 
electron) emission, a nuclear proton changes into a positive electron and a neutron. The atom maintains 
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MRI 



PET 





FIGURE 67.4 The MRI image shows the arteriovenous malformation (AVM) as an area of signal loss due to blood 
flow. The PET image shows the AVM as a region devoid of glucose metabolism and also shows decreased metabolism 
in the adjacent frontal cortex. This is a metabolic effect of the AVM on the brain and may explain some of the 
patient's symptoms. 

its atomic mass but decreases its atomic number by 1. The ejected positron combines with an electron 
almost instantaneously, and these two particles undergo the process of annihilation. The energy associated 
with the masses of the positron and electron particles is 1.022 MeV in accordance with the energy E to 
mass m equivalence E = mc 2 , where c is the velocity of light. This energy is divided equally between two 
photons that fly away from one another at a 180-degree angle. Each photon has an energy of 511 keV. 
These high-energy gamma rays emerge from the body in opposite directions, to be detected by an array 
of detectors that surround the patient (Fig. 67.5). When two photons are recorded simultaneously by a 
pair of detectors, the annihilation event that gave rise to them must have occurred somewhere along the 
line connecting the detectors. Of course, if one of the photons is scattered, then the line of coincidence 
will be incorrect. After 100,000 or more annihilation events are detected, the distribution of the positron- 
emitting tracer is calculated by tomographic reconstruction procedures. PET reconstructs a two-dimen- 
sional (2D) image from the one-dimensional projections seen at different angles. Three-dimensional 
(3D) reconstructions also can be done using 2D projections from multiple angles. 

PET Detectors 

Efficient detection of the annihilation photons from positron emitters is usually provided by the com- 
bination of a crystal, which converts the high-energy photons to visible-light photons, and a photomul- 
tiplier tube that produces an amplified electric current pulse proportional to the amount of light photons 
interacting with the photocathode. The fact that imaging system sensitivity is proportional to the square 
of the detector efficiency leads to a very important requirement that the detector be nearly 100% efficient. 
Thus other detector systems such as plastic scintillators or gas-filled wire chambers, with typical individual 
efficiencies of 20% or less, would result in a coincident efficiency of only 4% or less. 

Most modern PET cameras are multilayered with 15 to 47 levels or transaxial layers to be reconstructed 
(Fig. 67.6). The lead shields prevent activity from the patient from causing spurious counts in the 
tomograph ring, while the tungsten septa reject some of the events in which one (or both) of the 511-keV 
photons suffer a Compton scatter in the patient. The sensitivity of this design is improved by collection 
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, Positron annihilation photons 
180° ± 0.25° 




FIGURE 67.5 The physical basis of positron-emission tomography. Positrons emitted by "tagged" metabolically 
active molecules annihilate nearby electrons and give rise to a pair of high-energy photons. The photons fly off in 
nearly opposite directions and thus serve to pinpoint their source. The biologic activity of the tagged molecule can 
be used to investigate a number of physiologic functions, both normal and pathologic. 



Multi - Layer Detector 



Scintillator 




Cross plane 



Tungsten Septum 



FIGURE 67.6 Most modern PET cameras are multilayered with 15 to 47 levels or transaxial layers to be recon- 
structed. The lead shields prevent activity from the patient from causing spurious counts in the tomograph ring, 
while the tungsten septa reject some of the events in which one (or both) of the 51 1-keV photons suffer a Compton 
scatter in the patient. The sensitivity of this design is improved by collection of data from cross-planes. 

of data from cross-planes (Fig. 67.6). The arrangement of scintillators and phototubes is shown in 
Fig. 67.7. 

The "individually coupled" design is capable of very high resolution, and because the design is very 
parallel (all the photomultiplier tubes and scintillator crystals operate independently), it is capable of 
very high data throughput. The disadvantages of this type of design are the requirement for many expensive 
photomultiplier tubes and, additionally, that connecting round photomultiplier tubes to rectangular 
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FIGURE 67.7 The arrangement of scintillators and phototubes is shown. The "individually coupled" design is 
capable of very high resolution, and because the design is very parallel (all the photomultiplier tubes and scintillator 
crystals operate independently), it is capable of very high data throughput. A block detector couples several photo- 
multiplier tubes to a bank of scintillator crystals and uses a coding scheme to determine the crystal of interaction. 
In the two-layer block, five photomultiplier tubes are coupled to eight scintillator crystals. 

scintillation crystals leads to problems of packing rectangular crystals and circular phototubes of suffi- 
ciently small diameter to form a solid ring. 

The contemporary method of packing many scintillators for 511 keV around the patient is to use what 
is called a block detector design. A block detector couples several photomultiplier tubes to a bank of 
scintillator crystals and uses a coding scheme to determine the crystal of interaction. In the two-layer 
block (Fig. 67.7), five photomultiplier tubes are coupled to eight scintillator crystals. Whenever one of 
the outside four photomultiplier tubes fires, a 51 1-keV photon has interacted in one of the two crystals 
attached to that photomultiplier tube, and the center photomultiplier tube is then used to determine 
whether it was the inner or outer crystal. This is known as a digital coding scheme, since each photo- 
multiplier tube is either "hit" or "not hit" and the crystal of interaction is determined by a "digital" 
mapping of the hit pattern. Block detector designs are much less expensive and practical to form into a 
multilayer camera. However, errors in the decoding scheme reduce the spatial resolution, and since the 
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FIGURE 67.8 Factors contributing to the resolution of the PET tomograph. The contribution most accessible to 
further reduction is the size of the detector crystals. 

entire block is "dead" whenever one of its member crystals is struck by a photon, the dead time is worse 
than with individual coupling. The electronics necessary to decode the output of the block are straight- 
forward but more complex than that needed for the individually coupled design. 

Most block detector coding schemes use an analog coding scheme, where the ratio of light output is 
used to determine the crystal of interaction. In the example above, 4 photomultiplier tubes are coupled 
to a block of BGO that has been partially sawed through to form 64 "individual" crystals. The depth of 
the cuts are critical; that is, deep cuts tend to focus the scintillation light onto the face of a single 
photomultiplier tube, while shallow cuts tend to spread the light over all four photomultiplier tubes. 
This type of coding scheme is more difficult to implement than digital coding, since analog light ratios 
place more stringent requirements on the photomultiplier tube linearity and uniformity as well as 
scintillator crystal uniformity. However, most commercial PET cameras use an analog coding scheme 
because it is much less expensive due to the lower number of photomultiplier tubes required. 

Physical Factors Affecting Resolution 

The factors that affect the spatial resolution of PET tomographs are shown in Fig. 67.8. The size of the 
detector is critical in determining the system's geometric resolution. If the block design is used, there is 
a degradation in this geometric resolution by 2.2 mm for BGO. The degradation is probably due to the 
limited light output of BGO and the ratio of crystals (cuts) per phototube. 

The angle between the paths of the annihilation photons can deviated from 180 degrees as a result of 
some residual kinetic motion (Fermi motion) at the time of annihilation. The effect on resolution of this 
deviation increases as the detector ring diameter increases so that eventually this factor can have a 
significant effect. 
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FIGURE 67.9 The evolution of resolution. Over the past decade, the resolving power of PET has improved from 
about 9 to 2.6 mm. This improvement is graphically illustrated by the increasing success with which one is able to 
resolve "hot spots" of an artificial sample that are detected and imaged by the tomographs. 

The distance the positron travels after being emitted from the nucleus and before annihilation causes 
a deterioration in spatial resolution. This distance depends on the particular nuclide. For example, the 
range of blurring for 18 F, the isotope used for many of the current PET studies, is quite small compared 
with that of the other isotopes. Combining values for these factors for the PET-600 tomograph, we can 
estimate a detector-pair spatial resolution of 2.0 mm and a reconstructed image resolution of 2.6 mm. 
The measured resolution of this system is 2.6 mm, but most commercially available tomographs use a 
block detector design (Fig. 67.7), and the resolution of these systems is above 5 mm. The evolution of 
resolution improvement is shown in Fig. 67.9. 

The resolution evolutions discussed above pertain to results for the center or axis of the tomograph. 
The resolution at the edge of the object (e.g., patient) will be less by a significant amount due to two 
factors. First, the path of the photon from an "off-center" annihilation event typically traverses more 
than one detector crystal, as shown in Fig. 67.10. This results in an elongation of the resolution spread 
function along the radius of the transaxial plane. The loss of resolution is dependent on the crystal density 
and the diameter of the tomograph detector ring. For a 60-cm diameter system, the resolution can 
deteriorate by a factor of 2 from the axis to 10 cm. 

The coincidence circuitry must be able to determine coincident events with 10- to 20-ns resolution 
for each crystal-crystal combination (i.e., chord). The timing requirement is set jointly by the time of 
flight across the detector ring (4 ns) and the crystal-to-crystal resolving time (typically 3 ns). The most 
stringent requirement, however, is the vast number of chords in which coincidences must be determined 
(over 1.5 million in a 24-layer camera with septa in place and 18 million with the septa removed). 

It is obviously impractical to have an individual coincidence circuit for each chord, so tomograph 
builders use parallel organization to solve this problem. A typical method is to use a high-speed clock 
(typically 200 MHz) to mark the arrival time of each 511-keV photon and a digital coincidence processor 
to search for coincident pairs of detected photons based on this time marker. This search can be done 
extremely quickly by having multiple sorters working in parallel. 

The maximum event rate is also quite important, especially in septaless systems. The maximum rate 
in a single detector crystals is limited by the dead time due to the scintillator fluorescent lifetime (typically 
1 us per event), but as the remainder of the scintillator crystals are available, the instrument has much 
higher event rates (e.g., number of crystals x 1 lis). Combining crystals together to form contiguous 
blocks reduces the maximum event rate because the fluorescent lifetime applies to the entire block and 
a fraction of the tomograph is dead after each event. 
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FIGURE 67.10 Resolution astigmatism in detecting off-center events. Because annihilation photons can penetrate 
crystals to different depths, the resolution is not equal in all directions, particularly at the edge of the imaging field. 
This problem of astigmatism will be taken into account in future PET instrumentation. 

Random Coincidences 

If two annihilation events occurs within the time resolution of the tomograph (e.g., 10 ns), then random 
coincident "events" add erroneous background activity to the tomograph and are significant at high event 
rates. These can be corrected for on a chord-by-chord basis. The noncoincidence event rate of each crystal 
pair is measured by observing the rate of events beyond the coincident timing window. The random rate 
for the particular chord R tj corresponding to a crystal pair is 



R = r. x r. x 2% 



R. = R.. 



(67.2) 



where r i and r are the event rates of crystal i and crystal j, and T is the coincidence window width. As 
the activity in the subject increases, the event rate in each detector increases. Thus the random event rate 
will increase as the square of the activity. 

Tomographic Reconstruction 

Before reconstruction, each projection ray or chord receives three corrections: crystal efficiency, attenu- 
ation, and random efficiency. The efficiency for each chord is computed by dividing the observed count 
rate for that chord by the average court rate for chords with a similar geometry (i.e., length). This is 
typically done daily using a transmission source without the patient or object in place. Once the patient 
is in position in the camera, a transmission scan is taken, and the attenuation factor for each chord is 
computed by dividing its transmission count rate by its efficiency count rate. The patient is then injected 
with the isotope, and an emission scan is taken, during which time the random count rate is also measured. 
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For each chord, the random event rate is subtracted from the emission rate, and the difference is divided 
by the attenuation factor and the chord efficiency. (The detector efficiency is divided twice because two 
separate detection's measurements are made — transmission and emission). The resulting value is recon- 
structed, usually with the filtered backprojection algorithm. This is the same algorithm used in x-ray 
computed tomography (CT) and in projection MRI. The corrected projection data are formatted onto 
parallel- or fan-beam data sets for each angle. These are modified by a high-pass filter and backprojected. 
The process of PET reconstruction is linear and shown by operators successively operating on the 
projection P: 



(P) (67.3) 



A=> BPF~ l RE 



where A is the image, F is the Fourier transform, _R is the ramp-shaped high-pass filter, i 7-1 is the inverse 
Fourier transform, BP is the backprojection operation, and e denotes the superposition operation. 

The alternative class of reconstruction algorithms involves iterative solutions to the classic inverse 
problem: 

P=FA (67.4) 

where P is the projection matrix, A is the matrix of true data being sought, and F is the projection 
operation. The inverse is 

A=F~ 1 P 

which is computed by iteratively estimating the data A' and modifying the estimate by comparison of 
the calculated projection set P' with the true observed projections P. The expectation-maximization 
algorithm solves the inverse problem by updating each pixel value a, in accord with 

i 

where P is the measured projection, L is the probability a source at pixel i will be detected in projection 
detector j, and k is the iteration. 

Sensitivity 

The sensitivity is a measure of how efficiently the tomograph detects coincident events and has units of 
count rate per unit activity concentration. It is measured by placing a known concentration of radionu- 
clide in a water-filled 20-cm-diameter cylinder in the field of view. This cylinder, known as a phantom, 
is placed in the tomograph, and the coincidence event rate is measured. High sensitivity is important 
because emission imaging involves counting each event, and the resulting data are as much as 1000 times 
less than experienced in x-ray CT. Most tomographs have high individual detection efficiency for 51 1-keV 
photons impinging on the detector (>90%), so the sensitivity is mostly determined by geometric factors, 
i.e., the solid angle subtended by the tomograph: 



A£ 2 y X 3.7 X 10 4 



levents/s) /ImCi/cc) (67.6) 
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where: r = radius of tomograph 

A = area of detector material seen by each point in the object (2nr x axial aperture) 
£ = efficiency of scintillator 
y = attenuation factor 

For a single layer, the sensitivity of a tomograph of 90 cm diameter (2-cm axial crystals) will be 15,000 
events/s/uCi/ml for a disk of activity 20 cm in diameter and 1 cm thick. For a 20-cm-diameter cylinder, 
the sensitivity will be the same for a single layer with shields or septa that limit the off-slice activity from 
entering the collimators. However, modern multislice instruments use septa that allow activity from 
adjacent planes to be detected, thus increasing the solid angle and therefore the sensitivity. This increase 
comes at some cost due to increase in scatter. The improvement in sensitivity is by a factor of 7, but after 
correction for the noise, the improvement is 4. The noise equivalent sensitivity S NE is given by 

true events) 

S NE = '— (67.7) 

true x scatter x random 

Statistical Properties of PET 

The ability to map quantitatively the spatial distribution of a positron-emitting isotope depends on 
adequate spatial resolution to avoid blurring. In addition, sufficient data must be acquired to allow a 
statistically reliable estimation of the tracer concentration. The amount of available data depends on the 
biomedical accumulation, the imaging system sensitivity, and the dose of injected radioactivity. The 
propagation of errors due to the reconstruction process results in an increase in the noise over that 
expected for an independent signal (e.g., ^signal) by a factor proportional to the square root of the 
number of resolution elements (true pixels) across the image. The formula that deals with the general 
case of emission reconstruction (PET or SPECT) is 

1." 

% uncertainty = -, — - — (67.8) 



.2 x 100 (total no. of events) 
(total no. of events) 



The statistical requirements are closely related to the spatial resolution, as shown in Fig. 67.11. 

For a given accuracy or a signal-to-noise ratio for a uniform distribution, the ratio of the number of 
events needed in a high-resolution system to that needed in a low-resolution system is proportional to 
the 3/2 power of the ratio of the number of effective resolution elements in the two systems. Equation (67.8) 
and Fig. 67. 1 1 should be used not with the total pixels in the image but with the effective resolution cells. 
The number of effective resolution cells is the sum of the occupied resolution elements weighted by the 
activity within each element. Suppose, however, that the activity is mainly in a few resolution cells (e.g., 
100 events per cell) and the remainder of the 10,000 cells have a background of 1 event per cell. The 
curves of Fig. 67.11 would suggest unacceptable statistics; however, in this case, the effective number of 
resolution cells is below 100. The relevant equation for this situation is 



/ • v 1 / 4 

1.2 x 100 (no. of resolution cells) 
(avg no. of events per resolution cell in target) 



% uncertainty = j- (67.9) 



The better resolution gives improved results without the requirement for a drastic increase in the 
number of detected events is that the improved resolution increases contrast. (It is well known that the 
number of events needed to detect an object is inversely related to the square of the contrast.) 
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FIGURE 67.11 Statistical requirements and spatial resolution. The general relationship between the detected num- 
ber of events and the number of resolution elements in an image is graphed for various levels of precision. These 
are relations for planes of constant thickness. 
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68.1 The Electrical Impedance of Tissue 

The specific conductance (conductivity) of human tissues varies from 15.4 mS/cm for cerebrospinal fluid 
to 0.06 mS/cm for bone. The difference in the value of conductivity is large between different tissues 
(Table 68.1). Cross-sectional images of the distribution of conductivity, or alternatively specific resistance 
(resistivity), should show good contrast. The aim of electrical impedance tomography (EIT) is to produce 
such images. It has been shown [Kohn and Vogelius, 1984a, 1984b; Sylvester and Uhlmann, 1986] that 
for reasonable isotropic distributions of conductivity it is possible in principle to reconstruct conductivity 
images from electrical measurements made on the surface of an object. Electrical impedance tomography 
(EIT) is the technique of producing these images. In fact, human tissue is not simply conductive. There 
is evidence that many tissues also demonstrate a capacitive component of current flow and therefore, it 
is appropriate to speak of the specific admittance (admittivity) or specific impedance (impedivity) of 
tissue rather than the conductivity; hence the use of the world impedance in electrical impedance 
tomography. 

68.2 Conduction in Human Tissues 

Tissue consists of cells with conducting contents surrounded by insulating membranes embedded in a 
conducting medium. Inside and outside the cell wall is conducting fluid. At low frequencies of applied 
current, the current cannot pass through the membranes, and conduction is through the extracellular 
space. At high frequencies, current can flow through the membranes, which act as capacitors. A simple 
model of bulk tissue impedance based on this structure, which was proposed by Cole and Cole [1941], 
is shown by Fig. 68.1. 

Clearly, this model as it stands is too simple, since an actual tissue sample would be better represented 
as a large network of interconnected modules of this form. However, it has been shown that this model 
fits experimental data if the values of the components, especially the capacitance, are made a power 
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function of the applied frequency co. An equation which 
describes the behavior of tissue impedance as a function 
of frequency reasonably well is 



TABLE 68.1 Values of Specific Conductance 
for Human Tissues 



Z = Z +- 



(68.1) 



1 + 



fj 



Tissue 


Conductivity, mS/cm 


Cerebrospinal fluid 


15.4 


Blood 


6.7 


Liver 


2.8 


Skeletal muscle 


8.0 (longitudinal) 




0.6 (transverse) 


Cardiac muscle 


6.3 (longitudinal 




2.3 (transverse) 


Neural tissue 


1.7 


Gray matter 


3.5 


White matter 


1.5 


Lung 


1.0 (expiration) 




0.4 (inspiration) 


Fat 


0.36 


Bone 


0.06 



where Z and Z„ are the (complex) limiting values of 
tissue impedance low and high frequency and f c is a 
characteristic frequency. The value of a allows for the 
frequency dependency of the components of the model 
and is tissue dependent. Numerical values for in-vivo 
human tissues are not well established. 

Making measurements of the real and imaginary com- 
ponents of tissue impedivity over a range of frequencies 

will allow the components in this model to be extracted. Since it is known that tissue structure alters in 
disease and that R, S, C are dependent on structure, it should be possible to use such measurements to 
distinguish different types of tissue and different disease conditions. It is worth noting that although maxi- 
mum accuracy in the determination of the model components can be obtained if both real and imaginary 
components are available, in principle, knowledge of the resistive component alone should enable the values 
to be determined, provided an adequate range of frequencies is used. This can have practical consequences 
for data collection, since accurate measurement of the capacitive component can prove difficult. 

Although on a microscopic scale tissue is almost certainly 
electrically isotropic, on a macroscopic scale this is not so for 
some tissues because of their anisotropic physical structure. 
Muscle tissue is a prime example (see Table 68.1), where the bulk 
conductivity along the direction of the fibers is significantly 
higher than across the fibers. Although unique solutions for 
conductivity are possible for isotropic conductors, it can be 
shown that for anisotropic conductors unique solutions for con- 
ductivity do not exist. There are sets of different anisotropic 
conductivity distributions that give the same surface voltage dis- 
tributions and which therefore cannot be distinguished by these 
measurements. It is not yet clear how limiting anisotropy is to 
electrical impedance tomography. Clearly, if sufficient data could 
be obtained to resolve down to the microscopic level (this is not 

possible practically), then tissue becomes isotropic. Moreover, the tissue distribution of conductivity, 
including anisotropy, often can be modeled as a network of conductors, and it is known that a unique 
solution will always exist for such a network. In practice, use of some prior knowledge about the 
anisotropy of tissue may remove the ambiguities of conductivity distribution associated with anisotropy. 
The degree to which anisotropy might inhibit useful image reconstruction is still an open question. 



R 



FIGURE 68.1 The Cole-Cole model of 
tissue impedance. 



68.3 Determination of the Impedance Distribution 

The distribution of electrical potential within an isotropic conducting object through which a low- 
frequency current is flowing is given by 



7 (oV<j)) = 







(68.2) 
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where (|> is the potential distribution within the object and a is the distribution of conductivity (generally 
admittivity) within the object. If the conductivity is uniform, this reduces to Laplace's equation. Strictly 
speaking, this equation is only correct for direct current, but for the frequencies of alternating current 
used in EIT (up to 1 MHz) and the sizes of objects being imaged, it can be assumed that this equation 
continues to describe the instantaneous distribution of potential within the conducting object. If this 
equation is solved for a given conductivity distribution and current distribution through the surface of 
the object, the potential distribution developed on the surface of the object may be determined. The 
distribution of potential will depend on several things. It will depend on the pattern of current applied 
and the shape of the object. It will also depend on the internal conductivity of the object, and it is this 
that needs to be determined. In theory, the current may be applied in a continuous and nonuniform 
pattern at every point across the surface. In practice, current is applied to an object through electrodes 
attached to the surface of the object. Theoretically, potential may be measured at every point on the 
surface of the object. Again, voltage on the surface of the object is measured in practice using electrodes 
(possibly different from those used to apply current) attached to the surface of the object. There will be 
a relationship, the forward solution, between an applied current pattern j p the conductivity distribution 
a, and the surface potential distribution (j), which can be formally represented as 



f . = r(; ; ,ct) . (68.3) 



If a and ] i are known, (|), can be computed. For one current pattern j ; , knowledge of (j), is not in general 
sufficient to uniquely determine a. However, by applying a complete set of independent current patterns, 
it becomes possible to obtain sufficient information to determine c, at least in the isotropic case. This 
is the inverse solution. In practice, measurements of surface potential or voltage can only be made at a 
finite number of positions, corresponding to electrodes placed on the surface of the object. This also 
means that only a finite number of independent current patterns can be applied. For N electrodes, N-l 
independent current patterns can be defined and N(N-l)/2 independent measurements made. This latter 
number determines the limit of image resolution achievable with N electrodes. In practice, it may not 
be possible to collect all possible independent measurements. Since only a finite number of current 
patterns and measurements is available, the set of equations represented by Eq. (68.3) can be rewritten as 

v = A c c (68.4) 

where v is now a concatenated vector of all voltage values for all current patterns, c is a vector of 
conductivity values, representing the conductivity distribution divided into uniform image pixels, and 
A c a matrix representing the transformation of this conductivity vector into the voltage vector. Since A c 
depends on the conductivity distribution, this equation is nonlinear. Although formally the preceding 
equation can be solved for c by inverting A c , the nonlinear nature of this equation means that this cannot 
be done in a single step. An iterative procedure will therefore be needed to obtain c. 

Examination of the physics of current flow shows that current tends to take the easiest path possible 
in its passage through the object. If the conductivity at some point is changed, the current path redis- 
tributes in such a way that the effects of this change are minimized. The practical effect of this is that it 
is possible to have fairly large changes in conductivity within the object which only produce relatively 
small changes in voltage at the surface of the object. The converse of this is that when reconstructing the 
conductivity distribution, small errors on the measured voltage data, both random and systematic, can 
translate into large errors in the estimate of the conductivity distribution. This effect forms, and will 
continue to form, a limit to the quality of reconstructed conductivity images in terms of resolution, 
accuracy, and sensitivity. 

Any measurement of voltage must always be referred to a reference point. Usually this is one of the 
electrodes, which is given the nominal value of V. The voltage on all other electrodes is determined by 
measuring the voltage difference between each electrode and the reference electrode. Alternatively, voltage 
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FIGURE 68.2 Idealized electrode positions around a conducting object with typical drive and measurement elec- 
trode pairs indicated. 

differences may be measured between pairs of electrodes. A common approach is to measure the voltage 
between adjacent pairs of electrodes (Fig. 68.2). Clearly, the measurement scheme affects the form of A c . 
Choice of the pattern of applied currents and the voltage measurement scheme used can affect the 
accuracy with which images of conductivity can be reconstructed. 

Electrical impedance tomography (EIT) is not a mature technology. However, it has been the subject 
of intensive research over the past few years, and this work is still continuing. Nearly all the research 
effort has been devoted to exploring the different possible ways of collecting data and producing images 
of tissue resistivity, with the aim of optimizing image reconstruction in terms of image accuracy, spatial 
resolution, and sensitivity. 

Very few areas of medical application have been explored in any great depth, although in a number of 
cases preliminary work has been carried out. Although most current interest is in the use of EIT for medical 
imaging, there is also some interest in its use in geophysical measurements and some industrial uses. A 
recent detailed review of the state of Electrical Impedance Tomography is given in Boone et al. [1997]. 

Data Collection 
Basic Requirements 

Data are collected by applying a current to the object through electrodes connected to the surface of the 
object and then making measurements of the voltage on the object surface through the same or other 
electrodes. Although conceptually simple, technically this can be difficult. Great attention must be paid 
to the reduction of noise and the elimination of any voltage offsets on the measurements. The currents 
applied are alternating currents usually in the range 10 kHz to 1 MHz. Since tissue has a complex 
impedance, the voltage signals will contain in-phase and out-of-phase components. In principle, both 
of these can be measured. In practice, measurement of the out-of-phase (the capacitive) component is 
significantly more difficult because of the presence of unwanted (stray) capacitances between various 
parts of the voltage measurement system, including the leads from the data- collection apparatus to the 
electrodes. These stray capacitances can lead to appreciable leakage currents, especially at the higher 
frequencies, which translate into systematic errors on the voltage measurements. The signal measured 
on an electrode, or between a pair of electrodes, oscillates at the same frequency as the applied current. 
The magnitude of this signal (usually separated into real and imaginary components) is determined, 
typically by demodulation and integration. The frequency of the demodulated signal is much less than 
the frequency of the applied signal, and the effects of stray capacitances on this signal are generally 
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negligible. This realization has led some workers to propose that the signal demodulation and detection 
system be mounted as close to the electrodes as possible, ideally at the electrode site itself, and some 
systems have been developed that use this approach, although none with sufficient miniaturization of 
the electronics to be practical in a clinical setting. This solution is not in itself free of problems, but this 
approach is likely to be of increasing importance if the frequency range of applied currents is to be 
extended beyond 1 MHz, necessary if the value of the complex impedance is to be adequately explored 
as a function of frequency. 

Various data-collection schemes have been proposed. Most data are collected from a two-dimensional 
(2D) configuration of electrodes, either from 2D objects or around the border of a plane normal to the 
principal axis of a cylindrical (in the general sense) object where that plane intersects the object surface. 
The simplest data- collection protocol is to apply a current between a pair of electrodes (often an adjacent 
pair) and measure the voltage difference between other adjacent pairs (see Fig. 68.2). Although in 
principle voltage could be measured on electrodes though which current is simultaneously flowing, the 
presence of an electrode impedance, generally unknown, between the electrode and the body surface 
means that the voltage measured is not actually that on the body surface. Various means have been 
suggested for either measuring the electrode impedance in some way or including it as an unknown in 
the image-reconstruction process. However, in many systems, measurements from electrodes through 
which current is flowing are simply ignored. Electrode impedance is generally not considered to be a 
problem when making voltage measurements on electrodes through which current is not flowing, pro- 
vided a voltmeter with sufficiently high input impedance is used, although, since the input impedance 
is always finite, every attempt should be made to keep the electrode impedance as low as possible. Using 
the same electrode for driving current and making voltage measurements, even at different times in the 
data collection cycle, means that at some point in the data- collection apparatus wires carrying current 
and wires carrying voltage signals will be brought close together in a switching system, leading to the 
possibility of leakage currents. There is a good argument for using separate sets of electrodes for driving 
and measuring to reduce this problem. Paulson et al. [1992] have also proposed this approach and also 
have noted that it can aid in the modeling of the forward solution (see Image Reconstruction). Brown 
et al. [1994] have used this approach in making multi-frequency measurements. 

Clearly, the magnitude of the voltage measured will depend on the magnitude of the current applied. 
If a constant-current drive is used, this must be able to deliver a known current to a variety of input 
impedances with a stability of better than 0.1%. This is technically demanding. The best approach to 
this problem is to measure the current being applied, which can easily be done to this accuracy. These 
measurements are then used to normalize the voltage data. 

The current application and data-collection regime will depend on the reconstruction algorithm used. 
Several EIT systems apply current in a distributed manner, with currents of various magnitudes being 
applied to several or all of the electrodes. These optimal currents (see Image Reconstruction) must be 
specified accurately, and again, it is technically difficult to ensure that the correct current is applied at 
each electrode. Although there are significant theoretical advantages to using distributed current patterns, 
the increased technical problems associated with this approach, and the higher noise levels associated 
with the increase in electronic complexity, may outweigh these advantages. 

Although most EIT at present is 2D in the sense given above, it is intrinsically a three-dimensional 
(3D) imaging procedure, since current cannot be constrained to flow in a plane through a 3D object. 
3D data collection does not pose any further problems apart from increased complexity due to the need 
for more electrodes. Whereas most data-collection systems to date have been based on 16 or 32 electrodes, 
3D systems will require four times or more electrodes distributed over the surface of the object if adequate 
resolution is to be maintained. Technically, this will require "belts" or "vests" of electrodes that can be 
rapidly applied [McAdams et al., 1994]. Some of these are already available, and the application of an 
adequate number of electrodes should not prove insuperable provided electrode-mounted electronics 
are not required. Metherell et al. [1996] describe a three-dimensional data collection system and recon- 
struction algorithm and note the improved accuracy of three-dimensional images compared to two- 
dimensional images constructed using data collected from three-dimensional objects. 
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Performance of Existing Systems 

Several research groups have produced EIT systems for laboratory use [Brown and Seagar, 1987; Smith 
et al, 1990; Rigaud et al, 1990; Jossinet and Trillaud, 1992; Lidgey et al, 1992; Riu et al., 1992; Gisser 
et al., 1991; Cook et al, 1994; Brown et al., 1994; Zhu et al, 1994; Cusick et al., 1994] and some clinical 
trials. The complexity of the systems largely depends on whether current is applied via a pair of electrodes 
(usually adjacent) or whether through many electrodes simultaneously. The former systems are much 
simpler in design and construction and can deliver higher signal-to-noise ratios. The Sheffield Mark II 
system (Smith et al., 1990) used 16 electrodes and was capable of providing signal-to-noise ratios of up 
to 68 dB at 25 datasets per second. The average image resolution achievable across the image was 15% 
of the image diameter. In general, multi-frequency systems have not yet delivered similar performance, 
but are being continuously improved. 

Spatial resolution and noise levels are the most important constraints on possible clinical applications 
of EIT. As images are formed through a reconstruction process the values of these parameters will depend 
critically on the quality of the data collected and the reconstruction process used. However practical 
limitations to the number of electrodes which can be used and the ill-posed nature of the reconstruction 
problem make it unlikely that high quality images, comparable to other medical imaging modalities, can 
be produced. The impact of this on diagnostic performance still needs to be evaluated. 

Image Reconstruction 
Basics of Reconstruction 

Although several different approaches to image reconstruction have been tried [Wexler, 1985; Yorkey, et 
al, 1986; Barber and Seagar, 1987, Kim et al, 1987; Hua et al, 1988; Cheny et al., 1990; Breckon, 1990; 
Zadecoochak et al., 1991; Kotre et al., 1992; Bayford et al., 1994; Morucci et al., 1994], the most accurate 
approaches are based broadly on the following algorithm. For a given set of current patterns, a forward 
transform is set up for determining the voltages v produced form the conductivity distribution c 
(Eq. 68.4). A c is dependent on c, so it is necessary to assume an initial starting conductivity distribution c . 
This is usually taken to be uniform. Using A c , the expected voltages v are calculated and compared with 
the actual measured voltages v m . Unless c is correct (which it will not be initially), v and v m will differ. 
It can be shown that an improved estimate of c is given by 



Ac = (s:S c p>:(v -v m ) (68.5) 



c„ + Ac (68.6) 



where S c is the differential of A c with respect to c, the sensitivity matrix and S' is the transpose of S c . The 
improved value of c is then used in the next iteration to compute an improved estimate of v m , i.e., v r . 
This iterative process is continued until some appropriate endpoint is reached. Although convergence is 
not guaranteed, in practice, convergence to the correct c in the absence of noise can be expected, provided 
a good starting value is chosen. Uniform conductivity seems to be a reasonable choice. In the presence 
of noise on the measurements, iteration is stopped when the difference between v and v m is within the 
margin of error set by the known noise on the data. 

There are some practical difficulties associated with this approach. One is that large changes in c may 
only produce small changes in v, and this will be reflected in the structure of S c , making S' C S C very difficult 
to invert reliably. Various methods of regularization have been used, with varying degrees of success, to 
achieve stable inversion of this matrix although the greater the regularization applied the poorer the 
resolution that can be achieved. A more difficult practical problem is that for convergence to be possible 
the computed voltages v must be equal to the measured voltages \ m when the correct conductivity values 
are used in the forward calculation. Although in a few idealized cases analytical solutions of the forward 
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problem are possible, in general, numerical solutions must be used. Techniques such as the finite-element 
method (FEM) have been developed to solve problems of this type numerically. However, the accuracy 
of these methods has to be carefully examined [Paulson et al., 1992] and, while they are adequate for 
many applications, may not be adequate for the EIT reconstruction problem, especially in the case of 
3D objects. Accuracies of rather better than 1% appear to be required if image artifacts are to be 
minimized. Consider a situation in which the actual distortion of conductivity is uniform. Then the 
initial v should be equal to the v m to an accuracy less than the magnitude of the noise. If this is not the 
case, then the algorithm will alter the conductivity distribution from uniform, which will clearly result 
in error. While the required accuracies have been approached under ideal conditions, there is only a 
limited amount of evidence at present to suggest that they can be achieved with data taken from human 
subjects. 

Optimal Current Patterns 

So far little has been said about the form of the current patterns applied to the object except that a set 
of independent patterns is needed. The simplest current patterns to use are those given by passing current 
into the object through one electrode and extracting current through a second electrode (a bipolar 
pattern) . This pattern has the virtue of simplicity and ease of application. However, other current patterns 
are possible. Current can be passed simultaneously through many electrodes, with different amounts 
passing through each electrode. Indeed, an infinite number of patterns are possible, the only limiting 
condition being that the magnitude of the current flowing into the conducting object equals the mag- 
nitude of the current flowing out of the object. Isaacson [1986] has shown that for any conducting object 
there is a set of optimal current patterns and has provided an algorithm to compute them even if the 
conductivity distribution is initially unknown. Isaacson showed that by using optimal patterns, significant 
improvements in sensitivity could be obtained compared with simpler two-electrode current patterns. 
However, the additional computation and hardware required to use optimal current patterns compared 
with fixed, nonoptimal patterns is considerable. 

Use of suboptimal patterns close to optimum also will produce significant gains. In general, the optimal 
patterns are very different from the patterns produced in the simple two-electrode case. The optimal 
patterns are often cosine-like patterns of current amplitude distributed around the object boundary 
rather than being localized at a pair of points, as in the two-electrode case. Since the currents are passed 
simultaneously through many electrodes, it is tempting to try and use the same electrodes for voltage 
measurements. This produces two problems. As noted above, measurement of voltage on an electrode 
through which an electric current is passing is compromised by the presence of electrode resistance, 
which causes a generally unknown voltage drop across the electrode, whereas voltage can be accurately 
measured on an electrode through which current is not flowing using a voltmeter of high input imped- 
ance. In addition, it has proved difficult to model current flow around an electrode through which current 
is flowing with sufficient accuracy to allow the reliable calculation of voltage on that electrode, which is 
needed for accurate reconstruction. It seems that separate electrodes should be used for voltage mea- 
surements with distributed current systems. 

Theoretically, distributed (near-) optimal current systems have some advantages. As each of the optimal 
current patterns is applied, it is possible to determine if the voltage patterns produced contain any useful 
information or if they are simply noise. Since the patterns can be generated and applied in order of 
decreasing significance, it is possible to terminate application of further current patterns when no further 
information can be obtained. A consequence of this is that SNRs can be maximized for a given total 
data- collection time. With bipolar current patterns this option is not available. All patterns must be 
applied. Provided the SNR in the data is sufficiently good and only a limited number of electrodes are 
used, this may not be too important, and the extra effort involved in generating the optimal or near- 
optimal patterns may not be justified. However, as the number of electrodes is increased, the use of 
optimal patterns becomes more significant. It also has been suggested that the distributed nature of the 
optimal patterns makes the forward problem less sensitive to modeling errors. Although there is currently 
no firm evidence for this, this seems a reasonable assertion. 
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Three-Dimensional Imaging 

Most published work so far on image reconstruction has concentrated on solving the 2D problem. 
However, real medical objects, i.e., patients, are three-dimensional. Theoretically, as the dimensionality 
of the object increases, reconstruction should become better conditioned. However, unlike 3D x-ray 
images, which can be constructed from a set of independent 2D images, EIT data from 3D objects cannot 
be so decomposed and data from over the whole surface of the object is required for 3D reconstruction. 
The principles of reconstruction in 3D are identical to the 2D situation although practically the problem 
is quite formidable, principally because of the need to solve the forward problem in three dimensions. 
Some early work on 3D imaging was presented by Goble and Isaacson [1990]. More recently Metherall 
et al. [1996] have shown images using data collected from human subjects. 

Single-Step Reconstruction 

The complete reconstruction problem is nonlinear and requires iteration. However, each step in the 
iterative process is linear. Images reconstructed using only the first step of iteration effectively treat image 
formation as a liner process, an assumption approximately justified for small changes in conductivity 
from uniform. In the case the functions A c and S c often can be precomputed with reasonable accuracy 
because they usually are computed for the case of uniform conductivity. Although the solution cannot 
be correct, since the nonlinearity is not taken into account, it may be useful, and first-step linear 
approximations have gained some popularity. Cheney et al. [1990] have published some results from a 
first-step process using optimal currents. Most, if not all, of the clinical images produced to date have 
used a single-step reconstruction algorithm [Barber and Seagar, 1987; Barber and Brown, 1990]. Although 
this algorithm uses very nonoptimal current patterns, this has not so far been a limitation because of 
the high quality of data collected and the limited number of electrodes used (16). With larger numbers 
of electrodes, this conclusion may need to be revised. 

Differential Imaging 

Ideally, the aim of EIT is to reconstruct images of the absolute distribution of conductivity (or admit- 
tivity). These images are known as absolute (or static) images. However, this requires that the forward 
problem can be solved to an high degree of accuracy, and this can be difficult. The magnitude of the 
voltage signal measured on an electrode or between electrodes will depend on the body shape, the 
electrode shape and position, and the internal conductivity distribution. The signal magnitude is in fact 
dominated by the first two effects rather than by conductivity. However, if a change in conductivity occurs 
within the object, then it can often be assumed that the change in surface voltage is dominated by this 
conductivity change. In differential (or dynamic) imaging, the aim is to image changes in conductivity 
rather than absolute values. If the voltage difference between a pair of (usually adjacent) electrodes before 
a conductivity change occurs is g l and the value after change occurs is g 2 , then a normalized data value 
is defined as 

Ag n =2^L^ = ^- (68.7) 

ol &2 &mean 

Many of the effects of body shape (including interpreting the data as coming from a 2D object when in 
fact it is from a 3D object) and electrode placing at least partially cancel out in this definition. The values 
of the normalized data are determined largely by the conductivity changes. It can be argued that the 
relationship between the (normalized) changes in conductivity Ac n and the normalized changes in 
boundary data Ag n is given by 

Ag„=FAc n (68.8) 

where F is a sensitivity matrix which it can be shown is much less sensitive to object shape end electrode 
positions than the sensitivity matrix of Eq. 68.5. Although images produced using this algorithm are not 
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FIGURE 68.3 A differential conductivity image representing the changes in conductivity in going from maximum 
inspiration (breathing in) to maximum expiration (breathing out). Increasing blackness represents increasing conduc- 
tivity. 

completely free of artifact that is the only algorithm which has reliably produced images using data taken 
from human subjects. Metherall et al. [1996] used a version of this algorithm for 3D imaging and Brown 
et al. [1994] for multi-frequency imaging, in this case imaging changes in conductivity with frequency. 
The principle disadvantage of this algorithm is that it can only image changes in conductivity, which 
must be either natural or induced. Figure 68.3 shows a typical differential image. This represents the 
changes in the conductivity of the lungs between expiration and inspiration. 

Multifrequency Measurements 

Differential algorithms can only image changes in conductivity. Absolute distributions of conductivity 
cannot be produced using these methods. In addition, any gross movement of the electrodes, either 
because they have to be removed and replaced or even because of significant patient movement, make 
the use of this technique difficult for long-term measurements of changes. As an alternative to changes 
in time, differential algorithms can image changes in conductivity with frequency. Brown et al. [1994] 
have shown that if measurements are made over a range of frequencies and differential images produced 
using data from the lowest frequency and the other frequencies in turn, these images can be used to 
compute parametric images representing the distribution of combinations of the circuit values in Fig. 68.1. 
For example, images representing the ratio of S to R, a measure of the ratio of intracellular to extracellular 
volume, can be produced, as well as images of f a = l/2n(RC + SC), the tissue characteristic frequency. 
Although not images of the absolute distribution of conductivity, they are images of absolute tissue 
properties. Since these properties are related to tissue structure, they should produce images with useful 
contrast. Data sufficient to reconstruct an image can be collected in a time short enough to preclude 
significant patient movement, which means these images are robust against movement artifacts. Changes 
of these parameters with time can still be observed. 

68.4 Areas of Clinical Application 

There is no doubt that the clinical strengths of EIT relate to its ability to be considered as a functional 
imaging modality that carries no hazard and can therefore be used for monitoring purposes. The best 
spatial resolution that might become available will still be much worse than anatomic imaging methods 
such as magnetic resonance imaging and x-ray computed tomography. However, EIT is able to image 
small changes in tissue conductivity such as those associated with blood perfusion, lung ventilation, and 
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fluid shifts. Clinical applications seek to take advantage of the ability of EIT to follow rapid changes in 
physiologic function. 

There are several areas in clinical medicine where electrical impedance tomography might provide 
advantages over existing techniques. These have been received elsewhere [Brown et al., 1985; Dawids, 
1987; Holder and Brown, 1990; Boone et al, 1997]. 

Possible Biomedical Applications 

Gastrointestinal System 

A priori, it seems likely that EIT could be applied usefully to measurement of motor activity in the gut. 
Electrodes can be applied with ease around the abdomen, and there are no large bony structures likely 
to seriously violate the assumption of constant initial conductivity. During motor activity, such as gastric 
emptying or peristalsis, there are relatively large movements of the conducting fluids within the bowel. 
The quantity of interest is the timing of activity, e.g., the rate at which the stomach empties, and the 
absolute impedance change and its exact location in space area of secondary importance. The principal 
limitations of EIT of poor spatial resolution and amplitude measurement are largely circumvented in 
this application [Avill et al., 1987]. There has also been some interest in the measurement of other aspects 
of gastro-intestinal function such as oesophageal activity [Erol et al., 1995]. 

Respiratory System 

Lung pathology can be imaged by conventional radiography, x-ray computed tomography, or magnetic 
resonance imaging, but there are clinical situations where it would be desirable to have a portable means 
of imaging regional lung ventilation which could, if necessary, generate repeated images over time. 
Validation studies have shown that overall ventilation can be measured with good accuracy [Harris et 
al., 1988]. More recent work [Hampshire et al., 1995] suggests that multifrequency measurements may 
be useful in the diagnosis of some lung disorders. 

EIT Imaging of Changes in Blood Volume 

The conductivity of blood is about 6.7 mS/cm, which is approximately three times that of most intratho- 
racic tissues. It therefore seems possible that EIT images related to blood flow could be accomplished. 
The imaged quantity will be the change in conductivity due to replacement of tissue by blood (or vice 
versa) as a result of the pulsatile flow through the thorax. This may be relatively large in the cardiac 
ventricles but will be smaller in the peripheral lung fields. 

The most interesting possibility is that of detecting pulmonary embolus (PE). If a blood clot is present 
in the lung, the lung beyond the clot will not be perfused, and under favorable circumstances, this may 
be visualized using a gated blood volume image of the lung. In combination with a ventilation image, 
which should show normal ventilation in this region, pulmonary embolism could be diagnosed. Some 
data [Leathard et al., 1994] indicate that PE can be visualized in human subjects. Although more sensitive 
methods already exist for detecting PE, the noninvasiveness and bedside availability of EIT mean that 
treatment of the patient could be monitored over the period following the occurrence of the embolism, 
an important aim, since the use of anticoagulants, the principal treatment for PE, needs to be minimized 
in postoperative patients, a common class of patients presenting with this complication. 

68.5 Summary and Future Developments 

EIT is still an emerging technology. In its development, several novel and difficult measurement and 
image-reconstruction problems have had to be addressed. Most of these have been satisfactorily solved. 
The current generation of EIT imaging systems are multifrequency, with some capable of 3D imaging. 
These should be capable of greater quantitative accuracy and be less prone to image artifact and are likely 
to find a practical role in clinical diagnosis. Although there are still many technical problems to be 
answered and many clinical applications to be addressed, the technology may be close to coming of age. 
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Defining Terms 

Absolute imaging: Imaging the actual distribution of conductivity. 

Admittivity: The specific admittance of an electrically conducting material. For simple biomedical 
materials such as saline with no reactive component of resistance, this is the same as conductivity. 

Anisotropic conductor: A material in which the conductivity is dependent on the direction in which 
it is measured through the material. 

Applied current pattern: In EIT, the electric current is applied to the surface of the conducting object 
via electrodes placed on the surface of the object. The spatial distribution of current flow through 
the surface of the object is the applied current pattern. 

Bipolar current pattern: A current pattern applied between a single pair of electrodes. 

Conductivity: The specific conductance of an electrically conducting material. The inverse of resistivity. 

Differential imaging: An EIT imaging technique that specifically images changes in conductivity. 

Distributed current: A current pattern applied through more than two electrodes. 

Dynamic imaging: The same as differential imaging. 

EIT: Electrical impedance tomography. 

Forward transform or problem or solution: The operation, real or computational, that maps or 
transforms the conductivity distribution to surface voltages. 

Impedivity: The specific impedance of an electrically conducting material. The inverse of admittivity. 
For simple biomedical materials such as saline with no reactive component of resistance, this is 
the same as resistivity. 

Inverse transform or problem or solution: The computational operation that maps voltage measure- 
ments on the surface of the object to the conductivity distribution. 

Optimal current: One of a set of a current patterns computed for a particular conductivity distribution 
that produce data with maximum possible SNR. 

Pixel: The conductivity distribution is usually represented as a set of connected piecewise uniform 
patches. Each of these patches is a pixel. The pixel may take any shape, but square or triangular 
shapes are most common. 

Resistivity: The specific electrical resistance of an electrical conducting material. The inverse of con- 
ductivity. 

Static imaging: The same as absolute imaging. 
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69.2 VR Application Examples 
VR Applications 

69.3 Current Status of Virtual Reality Technology 

69.4 Overview of Medical Applications of Virtual 
Reality Technology 

Surgical Training and Surgical Planning ■ Medical Education, 
Modeling, and Non-Surgical Training • Anatomical Imaging 
and Medical Image Fusion ■ Ergonomics, Rehabilitation, and 
Disabilities ■ Telesurgery and Telemedicine ■ Behavioral 
Evaluation and Intervention 

69.5 Summary 



Virtual Reality (VR) is the term commonly used to describe a novel human-computer interface that 
enables users to interact with computers in a radically different way. VR consists of a computer-generated, 
multi-dimensional environment and interface tools that allow users to: 

1. immerse themselves in the environment, 

2. navigate within the environment, and 

3. interact with objects and other inhabitants in the environment. 

The experience of entering this environment — this computer-generated virtual world — is compelling. 
To enter, the user usually dons a helmet containing a head-mounted display (HMD) that incorporates 
a sensor to track the wearer's movement and location. The user may also wear sensor-clad clothing that 
likewise tracks movement and location. The sensors communicate position and location data to a 
computer, which updates the image of the virtual world accordingly. By employing this garb, the user 
"breaks through" the computer screen and becomes completely immersed in this multi-dimensional 
world. Thus immersed, one can walk through a virtual house, drive a virtual car, or run a marathon in 
a park still under design. Recent advances in computer processor speed and graphics make it possible 
for even desk-top computers to create highly realistic environments. The practical applications are far 
reaching. Today, using VR, architects design office buildings, NASA controls robots at remote locations, 
and physicians plan and practice difficult operations. 
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FIGURE 69.1 A complete VR system. 

Virtual reality is quickly finding wide acceptance in the medical community as researchers and clini- 
cians alike become aware of its potential benefits. Several pioneer research groups have already demon- 
strated improved clinical performance using VR imaging, planning, and control techniques. 

69.1 Overview of Virtual Reality Technology 

The term "Virtual Reality" describes the experience of interacting with data from within the computer- 
generated data set. The computer-generated data set may be completely synthetic or remotely sensed, 
such as X-ray, MRI, PET, etc. images. Interaction with the data is natural and intuitive and occurs from 
a first-person perspective. From a system perspective, VR technology can be segmented as shown in 
Fig. 69.1. 

The computer-generated environment, or virtual world content consists of a 3D graphic model, typically 
implemented as a spatially organized, object-oriented database; each object in the database represents 
an object in the virtual world. 

A separate modeling program is used to create the individual objects for the virtual world. For greater 
realism, texture maps are used to create visual surface detail. 

The data set is manipulated using a real-time dynamics generator that allows objects to be moved within 
the world according to natural laws such as gravity and inertia, or according to other variables such as spring- 
rate and flexibility that are specified for each particular experience by application- specific programming. 

The dynamics generator also tracks the position and orientation of the user's head and hand using 
input peripherals such as a head tracker and DataGlove. 

Powerful tenderers are applied to present 3D images and 3D spatialized sound in real-time to the 
observer. 

The common method of working with a computer (the mouse/keyboard/monitor paradigm), based 
as it is, on a two-dimensional desk-top metaphor, is inappropriate for the multi-dimensional virtual 
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FIGURE 69.2 The DataGlove™, a VR control device. 

world. Therefore, one long-term challenge for VR developers has been to replace the conventional 
computer interface with one that is more natural, intuitive, and allows the computer — not the user — to 
carry a greater proportion of the interface burden. Not surprisingly, this search for a more practical 
multi-dimensional metaphor for interacting with computers and computer-generated artificial environ- 
ments has spawned the development of a new generation of computer interface hardware. Ideally, the 
interface hardware should consist of two components: (1) sensors for controlling the virtual world and 
(2) effectors for providing feedback to the user. To date, three new computer-interface mechanisms, not 
all of which manifest the ideal sensor/effector duality, have evolved: Instrumented Clothing, the Head 
Mounted Display (HMD), and 3D Sound Systems. 

Instrumented Clothing 

The DataGlove™ and DataSuit™ use dramatic new methods to measure human motion dynamically in 
real time. The clothing is instrumented with sensors that track the full range of motion of specific activities 
of the person wearing the Glove or Suit, for example as the wearer bends, moves, grasps, or waves. 

The DataGlove is a thin lycra glove with bend-sensors running along its dorsal surface. When the 
joints of the hand bend, the sensors bend and the angular movement is recorded by the sensors. These 
recordings are digitized and forwarded to the computer, which calculates the angle at which each joint 
is bent. On screen, an image of the hand moves in real time, reflecting the movements of the hand in 
the DataGlove and immediately replicating even the most subtle actions. 

The DataGlove is often used in conjunction with an absolute position and orientation sensor that 
allows the computer to determine the three-space coordinates, as well as the orientation of the hand and 
fingers. A similar sensor can be used with the DataSuit and is nearly always used with an HMD. 

The DataSuit is a customized body suit fitted with the same sophisticated bend-sensors found in the 
DataGlove. While the DataGlove is currently in production as both a VR interface and as a data-collection 
instrument, the DataSuit is available only as a custom device. As noted, DataGlove and DataSuit are 
utilized as general purpose computer interface devices for VR. There are several potential applications 
of this new technology for clinical and therapeutic medicine. 
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Head-Mounted Display (HMD) 

The best-known sensor/effector system in VR is a head-mounted display (HMD). It supports first-person 
immersion by generating an image for each eye, which, in some HMDs, may provide stereoscopic vision. 
Most lower cost HMDs ($6000 range) use LCD displays; others use small CRTs. The more expensive 
HMDs ($60,000 and up) use optical fibers to pipe the images from remotely mounted CRTs. An HMD 
requires a position/orientation sensor in addition to the visual display. Some displays, for example the 
BOOM System [Bolas, 1994], may be head-mounted or may be used as a remote window into a virtual 
world. 

3D Spatialized Sound 

The impression of immersion within a virtual environment is greatly enhanced by inclusion of 3D 
spatialized sound [Durlach, 1994; Hendrix and Barfield, 1996]. Stereo-pan effects alone are inadequate 
since they tend to sound as if they are originating inside the head. Research into 3D audio has shown 
the importance of modeling the head and pinea and using this model as part of the 3D sound generation. 
A Head Related Transfer Function (HRTF) can be used to a generate the proper acoustics [Begault and 
Wenzel, 1992; Wenzel, 1994]. A number of problems remain, such as the "cone of confusion" wherein 
sounds behind the head are perceived to be in front of the head [Wenzel, 1992]. 

Other VR Interface Technology 

A sense of motion can be generated in a VR system by a motion platform. These have been used in flight 
simulators to provide cues that the mind integrates with visual and spatialized sound cues to generate 
perception of velocity and acceleration. 

Haptics is the science of touch. Haptic interfaces generate perception of touch and resistance in VR. 
Most systems to date have focused on providing force feedback to enable users to sense the inertial 
qualities of virtual objects, and/or kinesthetic feedback to specify the location of a virtual object in the 
world [Salisbury and Srinivasan, 1996, 1997]. A few prototype systems exist that generate tactile stimu- 
lation, which allow users to feel the surface qualities of virtual objects [Minsky and Lederman, 1996]. 
Many of the haptic systems developed thus far consist of exo-skeletons that provide position sensing as 
well as active force application [Burdea et al., 1992]. 

Some preliminary work has been conducted on generating the sense of temperature in VR. Small 
electrical heat pumps have been developed that produce sensations of heat and cold as part of the 
simulated environment. (See, for example, [Caldwell and Gosney, 1993; Ino et al., 1993].) 

Olfaction is another sense that provides important cues in the real world. Consider, for example, a 
surgeon or dentist examining a patient for a potential bacterial infection. Inflammation and swelling 
may be present, but a major deciding factor is the odor of the lesion. Very early in the history of virtual 
reality, Mort Heilig patented his Sensorama Simulator, which incorporated olfactory, as well as visual, 
auditory, and motion cues (U.S. Patent 3850870, 1961). Recently, another pioneer of virtual reality, Myron 
Kreuger [Kreuger, 1994, 1995a, 1995b], has been developing virtual olfaction systems for use in medical 
training applications. The addition of virtual olfactory cues to medical training systems should greatly 
enhance both the realism and effectiveness of training. 

69.2 VR Application Examples 

Virtual reality had been researched for years in government laboratories and universities, but because of 
the enormous computing power demands and associated high costs, applications have been slow to 
migrate from the research world to other areas. Continual improvements in the price/performance ratio 
of graphic computer systems, however, have made VR technology more affordable and, thus, used more 
commonly in a wider range of application areas. In fact, there is even a strong "Garage VR" move- 
ment — groups of interested parties sharing information on how to build extremely low-cost VR systems 
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using inexpensive off-the-shelf components [Jacobs, 1994]. These home-made systems are often ineffi- 
cient, uncomfortable to use (sometimes painful), and slow, but they exist as a strong testament to a 
fervent interest in VR technology. 

VR Applications 

Applications today are diverse and represent dramatic improvements over conventional visualization and 
planning techniques: 

Public Entertainment: VR made its first major inroads in the area of public entertainment, with 
ventures ranging from shopping mall game simulators to low-cost VR games for the home. Major 
growth continues in home VR systems, partially as a result of 3D games on the Internet. 

Computer-Aided Design: Using VR to create "virtual prototypes" in software allows engineers to test 
potential products in the design phase, even collaboratively over computer networks, without 
investing time or money for conventional hard models. All of the major automobile manufacturers 
and many aircraft manufacturers rely heavily on virtual prototyping. 

Military: With VR, the military's solitary cab-based systems have evolved to extensive networked 
simulations involving a variety of equipment and situations. There is an apocryphal story that 
General Norman Schwartzcopf was selected to lead the Desert Storm operation on the basis of 
his extensive experience with simulations of the Middle East battle arena. 

Architecture/Construction: VR allows architects and engineers and their clients to "walk through" 
structural blueprints. Designs may be understood more clearly by clients who often have difficulty 
comprehending them even with conventional cardboard models. Atlanta, Georgia credits its VR 
model for winning it the site of the 1996 Olympics, while San Diego is using a VR model of a 
planned convention center addition to compete for the next convention of the Republican Party. 

Data Visualization: By allowing navigation through an abstract "world" of data, VR helps users rapidly 
visualize relationships within complex, multi- dimensional data structures. This is particularly 
important in financial-market data, where VR supports faster decision-making. 

VR is commonly associated with exotic "fully immersive applications" because of the over-dramatized 
media coverage on helmets, body suits, entertainment simulators, and the like. Equally important are 
the "Window into World" applications where the user or operator is allowed to interact effectively with 
"virtual" data, either locally or remotely. 

69.3 Current Status of Virtual Reality Technology 

The commercial market for VR, while taking advantage of advances in VR technology at large, is 
nonetheless contending with the lack of integrated systems and the frequent turnover of equipment 
suppliers. Over the last few years, VR users in academia and industry have developed different strategies 
for circumventing these problems. In academic settings, researchers buy peripherals and software from 
separate companies and configure their own systems to maintain the greatest application versatility. In 
industry, however, expensive, state-of-the-art VR systems are vertically integrated to address problems 
peculiar to the industry. 

Each solution is either too costly or too risky for most medical organizations. What is required is a 
VR system tailored to the needs of the medical community. Unfortunately, few companies offer integrated 
systems that are applicable to the VR medical market. This situation is likely to change in the next few 
years as VR-integration companies develop to fill this void. 

At the same time, the nature of the commercial VR medical market is changing as the price of high- 
performance graphics systems continues to decline. High-resolution graphics monitors are becoming 
more cost-effective even for markets that rely solely on desktop computers. Technical advances are also 
occurring in networking, visual photo-realism, tracker latency through predictive algorithms, and vari- 
able-resolution image generators. Improved database access methods are underway. Hardware advances, 
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such as eye gear that provide an increased field of view with high-resolution, untethered VR systems and 
inexpensive intuitive input devices, e.g., DataGloves, have lagged behind advances in computational, 
communications, and display capabilities. 

69.4 Overview of Medical Applications 
of Virtual Reality Technology 

Within the medical community, the first wave of VR development efforts have evolved into six key 
categories: 

1. Surgical Training and Surgical Planning 

2. Medical Education, Modeling, and Non-Surgical Training 

3. Anatomical Imaging and Medical Image Fusion 

4. Ergonomics, Rehabilitation, and Disabilities 

5. Telesurgery and Telemedicine 

6. Behavioral Evaluation and Intervention 

The potential of VR through education and information dissemination indicates there will be few 
areas of medicine not taking advantage of this improved computer interface. However, the latent potential 
of VR lies in its capacity to be used to manipulate and combine heterogeneous multi-dimensional data 
sets from many sources, for example, MRI, PET, and X-ray images. This feature is most significant and 
continues to transform the traditional applications environment. 

Surgical Training and Surgical Planning 

Various projects are underway to utilize VR and imaging technology to plan, simulate, and customize 
invasive (as well as minimally invasive) surgical procedures. One example of a VR surgical-planning and 
training system is the computer-based workstation developed by Cine-Med of Woodbury, Connecticut 
[McGovern, 1994] . The goal was to develop a realistic, interactive training workstation that helps surgeons 
make a more seamless transition from surgical simulation to the actual surgical event. 

Cine-Med focused on television-controlled endosurgical procedures because of the intensive training 
required for endosurgery and the adaptability of endosurgery to high quality imaging. Surgeons can gain 
clinical expertise by training on this highly realistic and functional surgical simulator. Cine-Med's com- 
puter environment includes life-like virtual organs that react much like their real counterparts, and 
sophisticated details such as the actual surgical instruments that provide system input/output (I/O). To 
further enhance training, the simulator allows the instructor to adapt clinical instruction to advance the 
technical expertise of learners. Surgical anomalies and emergency situations can be replicated to allow 
practicing surgeons to experiment and gain technical expertise on a wide range of surgical problems 
using the computer model before using an animal model. Since the steps of the procedure can be repeated 
and replayed at a later time, the learning environment surpasses other skills-training modalities. 

The current prototype simulates the environment of laparoscopic cholecystectomy for use as a surgical 
training device. Development began with the creation of an accurate anatomic landscape, including the 
liver, the gallbladder, and related structures. Appropriate surgical instruments are used for the system 
I/O and inserted into a fiberglass replica of a human torso. Four surgical incisional ports are assigned 
for three scissors grip instruments and camera zoom control. The instruments, retrofitted with switching 
devices, read and relay the opening and closing of the tips, with position trackers located within the 
simulator. The virtual surgical instruments are graphically generated on a display monitor where they 
interact with fully textural, anatomically correct, three-dimensional virtual organs. The organs are created 
as independent objects and conform to object-oriented programming. 

To replicate physical properties, each virtual organ must be assigned appropriate values to dictate its 
reaction when contacted by a virtual surgical instrument. Collision algorithms are established to define 
when the virtual organ is touched by a virtual surgical instrument. Additionally, with the creation of 
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spontaneous objects resulting from the dissection of a virtual organ, each new object is calculated to 
have independent physical properties using artificial intelligence (AI) subroutines. Collision algorithms 
drive the programmed creation of spontaneous objects. 

To reproduce the patient's physiological reactions during the surgical procedure, the simulation 
employs an expert system. This software sub-system generates patient reactions and probable outcomes 
derived from surgical stimuli, for example, bleeding control, heart rate failure, and, in the extreme, a 
death outcome. The acceptable value ranges of these factors are programmed to be constantly updated 
by the expert system while important data is displayed on the monitor. 

Three-dimensional graphical representation of a patient's anatomy is a challenge for accurate surgical 
planning. Technological progress has been seen in the visualization of bone, brain, and soft tissue. 
Heretofore, three-dimensional modeling of soft tissue has been difficult and often inaccurate owing to 
the intricacies of the internal organ, its vasculature, ducts, volume, and connective tissues. 

As an extension of this surgical simulator, a functional surgical planning device using VR technology 
is under development that will enable surgeons to operate on an actual patient, in virtual reality, prior 
to the actual operation. With the advent of technological advances in anatomic imaging, the parallel 
development of a surgical planning device incorporating real-time interaction with computer graphics 
that mimic a patient's anatomy is possible. Identification of anatomical structures to be modeled consti- 
tutes the initial phase for development of the surgical planning device. A spiral CAT scanning device 
records multiple slices of the anatomy during a single breath inhalation by the patient. Pin-registered 
layers of the anatomy are thus provided for the computer to read. 

Individual anatomical structures are defined at the scan level according to gray scale. Once the 
anatomical structures are identified and labeled, the scans are stacked and connected. The result is a 
volumetric polygonal model of the patient's actual anatomy. A polygon-reduction program is initiated 
to create a wire frame that can be successfully texture-mapped and interacted with in real time. As each 
slice of the CAT scan is stacked and linked together, the result is a fully volumetric, graphic representation 
of the human anatomy. Since the model, at this point, is unmanageable by the graphics workstation 
because of the volume polygons, a polygon-reduction program is initiated to eliminate excessive polygons. 

Key to the planning device is development of a user-friendly software program that will allow the 
radiologist to define anatomical structures, incorporate them into graphical representations, and assign 
physiological parameters to the anatomy. Surgeons will then be able to diagnose, plan and prescribe 
appropriate therapy to their patients using a trial run of a computerized simulation. 

Several VR-based systems currently under development allow real-time tracking of surgical instru- 
mentation and simultaneous display and manipulation of three-dimensional anatomy corresponding to 
the simulated procedure [Hon, 1994; McGovern, 1994; Edmond et al., 1997]. With this design surgeons 
can practice procedures and experience the possible complications and variations in anatomy encountered 
during surgery. Ranging from advanced imaging technologies for endoscopic surgery to routine hip 
replacements, these new developments will have a tremendous impact on improving surgical morbidity 
and mortality. According to Merril [1993, 1994], studies show that doctors are more likely to make errors 
performing their first few to several dozen diagnostic and therapeutic surgical procedures. Merril claims 
that operative risk could be substantially reduced by the development of a simulator that allows trans- 
ference of skills from the simulation to the actual point of patient contact. 

A preliminary test of Meril's claim was achieved by Taffinder and colleagues [1998]. Following the 
observations by McDougall et al. [ 1996] that 2D representations of anatomic imagery are insufficient to 
develop the eye-hand coordination of surgeons, Taffinder et al. conducted randomized, controlled studies 
of psychomotor skills developed either through the use of the MIST VR Laparoscopic Simulator or 
through a standard laparoscopic-surgery training course. They used the MIST VR Laparoscopic Simulator 
to compare the psychomotor skills of experienced surgeons, surgeon trainees, and non-surgeons. When 
task speed and the number of correctional movements were compared across subjects, experienced 
surgeons surpassed trainees and non-surgeons. Taffinder and colleagues also noted that among trainees, 
training on the VR system improved efficiency and reduced errors, but did not increase the speed of the 
laparoscopic procedures. 

© 2000 by CRC Press LLC 



To increase the realism of medical simulators, software tools have been developed to create "virtual 
tissues" that reflect the physical characteristics of physiological tissues. This technology operates in real- 
time using three-dimensional graphics, on a high speed computer platform. Recent advances in creating 
virtual tissues have occurred as force feedback is incorporated into more VR systems and the computa- 
tional overhead of finite-element analysis is reduced (see, for example, [Grabowski, 1998; Mclnerny and 
Terzopoulos, 1996]). Bro-Nielsen and Cotin [1996] have been developing tissue models that incorporate 
appropriate real-time volumetric deformation in response to applied forces. Their goal was to provide 
surgeons with the same "hands-on" feel in VR that they experience in actual surgery. They used mass- 
spring systems [Bro-Nielsen, 1997] to simplify finite-element analysis, significantly reduced computa- 
tional overhead, and thereby achieving near- real-time performance. While advancing the state of the art 
considerably toward the real-time, hands-on objective, much more work remains to be done. 

In another study of the effectiveness of force-feedback in surgical simulations, Baur et al. [1998] 
reported on the use of VIRGY, a VR endoscopic surgery simulator. VIRGY provides both visual and force 
feedback to the user, through the use of the Pantoscope [Baumann, 1996]. Because the Pantoscope is a 
remote-center-of-motion force-reflecting system, Baur et al. were able to hide the system beneath the 
skin of the endoscope mannequin. Force reflection of tissue interactions, including probing and cutting, 
was controlled by nearest neighbor propagation of look-up table values to circumvent the computational 
overhead inherent in finite element analysis. Surgeons were asked to assess the tissue feel achieved through 
this simulation. 

Evaluation of another surgical simulator system was carried out by Weghorst et al. [1998]. This 
evaluation involved the use of the Madigan Endoscopic Sinus Surgery (ESS) Simulator (see Edmond 
et al. [1997] for a full description), developed jointly by the U.S. Army, Lockheed-Martin, and the Human 
Interface Technology Laboratory at the University of Washington. The ESS Simulator, which was designed 
to train otolaryngologists to perform paranasal endoscopic procedures, incorporates 3D models of the 
naso -pharyngeal anatomy derived from the National Library of Medicine's Virtual Human Data Base. It 
employs two 6-DOF force-reflecting instruments developed by the Immersion Corporation to simulate 
the endoscope and 1 1 surgical instruments used in paranasal surgery. 

Weghorst and colleagues pitted three groups of subjects against each other in a test of the effectiveness 
of the ESS Simulator: non-MDs, non-ENT MDs, and ENTs with various levels of experience. Training 
aids, in the form of circular hoops projected onto the simulated nasal anatomy, identified the desired 
endoscope trajectory; targets identified anatomical injection sites; and text labels identified anatomical 
landmarks. Successive approximations to the actual task involved training first on an abstract, geometric 
model of the nasal anatomy with superimposed training aids (Level 1); next on real nasal anatomy with 
superimposed training aids (Level 2); and finally, on real nasal anatomy without training aids (Level 3). 

Results of the evaluation indicated that, in general, ENTs performed better than non-ENTs on Level 1 
and 2 tasks, and that among the ENTs, those with more ESS experience scored higher than those who 
had performed fewer ESS procedures. The researchers also found that deviations from the optimum 
endoscope path differentiated ENT and non-ENT subjects. In post-training questionnaires, ENTs rated 
the realism of the simulation as high and confirmed the validity and usefulness of training on the ESS 
Simulator. 

While the ESS Simulator described above requires a 200 MHz Pentium PC and a 4-CPU SGI Oxyx 
with Reality Engine 2 graphics to synthesize the virtual nasal cavity, a current trend in surgical simulation 
is towards lower-end simulation platforms. As computational power of PCs continues to escalate, sim- 
ulations are increasingly being ported to or developed on relatively inexpensive computers. Tseng et al. 
[1998] used an Intel-based PC as the basis of their laparoscopic surgery simulator. This device incorpo- 
rates a 5-DOF force-reflecting effector, a monitor for viewing the end-effector, and a speech-recognition 
system for changing viewpoint. The system detects collisions between the end effector and virtual tissues, 
and is capable of producing tissue deformation through one of several models including a finite element 
model, a displacement model, or a transmission line model. Computational power is provided by dual 
Pentium Pro 200 MHz CPUs and a Realizm Graphics Accelerator Board. 
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Medical Education, Modeling, and Non-Surgical Training 

Researchers at the University of California-San Diego are exploring the value of hybridizing elements of 
VR, multimedia (MM), and communications technologies into a unified educational paradigm [Hoff- 
man, 1994; Hoffman et al. 1995; Hoffman et al. 1997]. The goal is to develop powerful tools that extend 
the flexibility and effectiveness of medical teaching and promote lifelong learning. To this end, they have 
undertaken a multi-year initiative, named the "VR-MM Synthesis Project." Based on instructional design 
and user-need (rather than technology per se), they have planned a linked 3-computer array representing 
the Data Communications Gateway, the Electronic Medical Record System, and the Simulation Environ- 
ment. This system supports medical students, surgical residents, and clinical faculty running applications 
ranging from full surgical simulation to basic anatomical exploration and review, all via a common 
interface. The plan also supports integration of learning and telecommunications resources (such as 
interactive MM libraries, online textbooks, databases of medical literature, decision support systems, 
electronic mail, and access to electronic medical records). 

The first application brought to fruition in the VR-MM Synthesis Project is an anatomical instruc- 
tional-aid system called Anatomic VisualizeR [Hoffman et al. 1997] that uses anatomical models derived 
from the Visible Human Project of the National Library of Medicine. Using the "Guided Lessons" 
paradigm of Anatomical VisualizeR, the student enters a 3D workspace that contains a "Study Guide." 
The student uses the study guide to navigate through the 3D workspace, downloading anatomical models 
of interest, as well as supporting resources like diagrammatic representations, photographic material, 
and text. Manipulation of the anatomical models is encouraged to provide the student with an intuitive 
understanding of anatomical relationships. The study guide also incorporates other instructional material 
necessary for the completion of a given lesson. 

Another advanced medical training system based on VR technology is VMET — the Virtual Medical 
Trainer — jointly developed by the Research Triangle Institute and Advanced Simulation Corp. [Kizakevich 
et al., 1998]. VMET was developed to provide training in trauma care. The VMET Trauma Patient 
Simulator (VMET-TPS), which was designed to comply with civilian guidelines for both Basic Trauma 
Life Support and Advanced Trauma Life Support, incorporates models of (1) physiological systems and 
functions, (2) dynamic consequences of trauma to these physiological systems, and (3) medical interven- 
tion effects on these physiological systems. 

VMET provides a multisensory simulation of a trauma patient, including visible, audible, and behav- 
ioral aspects of the trauma patient. To maintain cost-effectiveness, VMET is constrained to providing 
"virtual spaces" at the site of several injuries, including a penetrating chest wound, a penetrating arm 
wound, laceration to the arm, and a thigh contusion. Within the virtual spaces, the trainee can experience 
the physiological effects of the injury and of his intervention in real time, as the physiological engine 
updates the condition of the wound. 

VMET-TPS was designed to train military personnel in trauma care, so it places emphasis on the 
medical technology currently and foreseeably available to the military. However, VMET-TPS will find 
applications in civilian teaching hospitals, as well. Partially because of the cost-constraint requirements 
of the project, the price of VMET should be within reach of many trauma centers. 

In another task-specific program at the Fraunhofer Institute, researchers have been developing an 
ultrasonic probe training system based on VR components called the UltraTrainer [Stallkamp and Walper, 
1998]. UltraTrainer uses a Silicon Graphics workstation and monitor to present images drawn from a 
database of real ultrasonograms, a Polhemus tracker to determine the position of the user's hand, a 
joystick representation of the ultrasound probe, and an ultrasound phantom to provide probe posi- 
tion/orientation feedback to the trainee. In using the UltraTrainer, the trainee moves the ultrasound 
probe against the phantom, which is used in this system as a representation of the virtual patient's body. 
On the monitor, the trainee sees an image of the probe in relation to the phantom, showing the trainee 
the position and orientation of the probe with respect to the virtual patient. 

The Polhemus tracker determines and records the real-space position of the virtual ultrasound probe, 
and this tracker information is then used to extract stored ultrasonograms from a database of images. 
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FIGURE 69.3 VR-based rehabilitation workstation. 



An ultrasonic image of the appropriate virtual scanfield is presented on the monitor in accordance with 
the position of the virtual probe on the phantom. As the probe is moved on the phantom, new virtual 
scanfields are extracted from the database and presented on the monitor. The UltraTrainer is able to 
present sequential virtual scanfields rapidly enough for the trainee to perceive the virtual ultrasonography 
as occurring in real time. 

Planned improvements to the UltraTrainer include a larger database of stored sonograms that can be 
customized for different medical specialties, and a reduction in cost by porting the system to a PC and by 
using a less expensive tracking system. These improvements should allow the UltraTrainer to be accessed 
by a broader range of users and to move from the laboratory to the classroom, or even to the home. 

Three groups of researchers have taken different approaches to developing VR-based training systems 
for needle insertion, each based on feedback to a different sensory system. At Georgetown University, 
Lathan and associates [1998] have produced a spine biopsy simulator based on visual feedback; a team 
from Ohio State University and the Ohio Supercomputer Center have demonstrated an epidural needle 
insertion simulator based on force feedback [Heimenz et al., 1998] ; and Computer Aided Surgery in New 
York has developed a blind needle biopsy placement simulator that uses 3D auditory feedback [Wenger 
and Karron, 1998]. Each of these innovative systems draws on technological advances in virtual reality. 
In the aggregate, they disclose that, given appropriate cognitive constructs, humans are capable of using 
diverse sensory input to learn very demanding tasks. Imagine how effective virtual reality could be in 
training complex tasks if all of the sensory information could be integrated in a single system. That is 
currently one of the goals of the virtual reality community. 



Anatomical Imaging and Medical Image Fusion 

An anatomically keyed display with real-time data fusion is currently in use at NYU Medical Center's 
Department of Neurosurgery. The system allows both pre-operative planning and real-time tumor 
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FIGURE 69.4 DataSuit™ for ergonomic and sports medicine applications. 

visualization [Kelly, 1994; Kail, 1994] . The technology offers a technique for surgeons to plan and simulate 
the surgical procedure beforehand in order to reach deep-seated or centrally located brain tumors. The 
imaging method (volumetric stereotaxis) gathers, stores, and reformats imaging-derived, three-dimen- 
sional volumetric information that defines an intracranial lesion (tumor) with respect to the surgical field. 

Computer-generated information is displayed intraoperatively on computer monitors in the operating 
room and on a "heads up" display mounted on the operating microscope. These images provide surgeons 
with a CT (computed tomography) and MRI defined map of the surgical field area scaled to actual size 
and location. This guides the surgeon in finding and defining the boundaries of brain tumors. The 
computer-generated images are indexed to the surgical field by means of a robotics-controlled stereotactic 
frame which positions the patient's tumor within a defined targeting area. Simulated systems using VR 
models are being advocated for high risk techniques, such as the alignment of radiation sources to treat 
cancerous tumors. Where registration of virtual and real anatomical features is not an issue, other display 
technologies can be employed. Two recent examples, based on totally different display modalities, are 
described below. 

Parsons and Rolland [1998] developed a non-intrusive display technique that projects images of virtual 
inclusions such as tumors, cysts, or abscesses, or other medical image data into the surgeon's field of 
view on demand. The system relies on retroreflective material, perhaps used as a backing for the surgeon's 
glove or on a surgical instrument, to provide an imaging surface upon which can be presented images 
extracted from 2D data sources, for example MRI scans. The surgeon can access the data by placing the 
retroreflective screen, e.g., his gloved hand, in the path of an imaging beam. The image of the virtual 
MRI scan, for example, will appear superimposed along the surgeon's line of sight. Although image 
registration is not currently possible with this real-time display system, the system does provide a means 
for the clinician to access critical information without turning to view a monitor or other screen. 

For displaying volumetric data, Senger [1998] devised a system based on the FakeSpace Immersive 
Workbench™ and position-tracked StereoGraphics CrystalEyes™ that presents stereoscopic images 
derived from CT, MRI, and the Visible Human data sets. The viewer can interact with the immersive 
data structure through the use of a position/orientation sensed probe. The probe can be used first to 
identify a region of interest within the anatomical data set and then to segment the data so that particular 
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anatomical features are highlighted. The probe can also be used to "inject" digital stain into the data set 
and to direct the propagation of the stain into structures that meet predetermined parameters, such as 
voxel density, opacity, etc. Through the use of the probe, the vasculature, fascia, bone, etc. may be 
selectively stained. This imaging system takes advantage of the recent advances in VR hardware and 
software, the advent of programs such as the National Library of Medicine's Visible Human Project, and 
significant cost reductions in computational power to make it feasible for medical researchers to create 
accurate, interactive 3D human anatomical atlases. 

Suzuki and colleagues [1998] have pushed virtual anatomical imaging a step further by including the 
temporal domain. In 1987, Suzuki, Itou, and Okamura developed a 3D human atlas based on serial MRI 
scans. Originally designed to be resident on a PC, the 3D Human Atlas has been upgraded during the last 
decade to take advantage of major advances in computer graphics. The latest atlas is based on composite 
super-conducting MRI scans at 4-mm intervals at 4-mm pitch of young male and female humans. In 
addition to volume and surface rendering, this atlas is also capable of dynamic cardiac imaging. 

Rendered on a Silicon Graphics Onyx Reality Engine, this atlas presents volumetric images at approx- 
imately 10 frames per second. The user interface allows sectioning of the anatomical data along any plane, 
as well as extracting organ surface information. Organs can be manipulated individually and rendered 
transparent to enable the user to view hidden structures. By rendering the surface of the heart transparent, 
it is possible to view the chambers of the heart in 4D at approximately 8 frames per second. Suzuki and 
colleagues plan to distribute the atlas over the internet so that users throughout the world will be able 
to access it. 

Distribution of volumetric medical imagery over the Internet through the World Wide Web (WWW) 
has already been examined by Hendin and colleagues [1998]. These authors relied on the Virtual Reality 
Modeling Language (VRML) and VRMLscript to display and interact with the medial data sets, and they 
used a Java graphical user interface (GUI) to preserve platform independence. They conclude that it is 
currently feasible to share and interact with medical image data over the WWW. 

Similarly, Silverstein et al. [1998] have demonstrated a system for interacting with 3D radiologic image 
data over the Internet. While they conclude that the Web-based system is effective for transmitting useful 
imagery to remote colleagues for diagnosis and pretreatment planning, it does not supplant the require- 
ment that a knowledgeable radiologist examine the 2D tomographic images. This raises the question of 
the effectiveness of the display and interaction with virtual medical images. 

The perceived effectiveness of VR-based anatomical imaging systems similar to those described above, 
has been assessed by Oyama and colleagues [1998]. They created virtual environments consisting of 3D 
images of cancerous lesions and surrounding tissue, derived from CT or MRI scans, and presented these 
environments to clinicians in one of three modes. The cancers were from the brain, breast, lung, stomach, 
liver, and colon, and the presentation modes were either as surface-rendered images, real-time volume- 
rendered images, or editable real-time volume-rendered images. Clinicians had to rate the effectiveness 
of the three modes in providing them with information about ( 1 ) the location of the cancer, (2) the shape 
of the cancer, (3) the shape of fine vessels, (4) the degree of infiltration of surrounding organs, and (5) the 
relationship between the cancer and normal organs. In each of the five categories, the clinicians rated 
the editable real-time volume-rendered images as superior to the other modes of presentation. From the 
study, the authors conclude that real-time surface rendering, while applicable to many areas of medicine, 
is not suitable for use in cancer diagnosis. 

Ergonomics, Rehabilitation, and Disabilities 

VR offers the possibility to better shape a rehabilitative program to an individual patient. Greenleaf et al. 
[1994] have theorized that the rehabilitation process can be enhanced through the use of VR technology. 
The group is currently developing a VR-based rehabilitation workstation that will be used to: 
(1) decompose rehabilitation into small, incremental functional steps to facilitate the rehabilitation 
process; and (2) make the rehabilitation process more realistic and less boring, thus enhancing motivation 
and recovery of function. 
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FIGURE 69.5 VR system used as a disability solution. 

DataGlove and DataSuit technologies were originally developed as control devices for VR, but through 
improvements they are now being applied to the field of functional evaluation of movement and to 
rehabilitation in a variety of ways. One system, for example, uses a glove device coupled with a force 
feedback system — The Rutgers Master (RM-I) — to rehabilitate a damaged hand or to diagnose a range 
of hand problems [Burdea et al., 1995; Burdea et al, 1997]. The rehabilitation system developed by Burdea 
and colleagues uses programmable force feedback to control the level of effort required to accomplish 
rehabilitative tasks. This system measures finger-specific forces while the patient performs one of several 
rehabilitative tasks, including ball squeezing, DigiKey exercises, and peg insertion. 

Another system under development — The RM II — incorporates tactile feedback to a glove system to 
produce feeling in the fingers when virtual objects are "touched" [Burdea, 1994]. In order to facilitate 
accurate goniometric assessment, improvements to the resolution of the standard DataGlove have been 
developed [Greenleaf, 1992]. The improved DataGlove allows highly accurate measurement of dynamic 
range of motion of the fingers and wrist and is in use at research centers such as Johns Hopkins and 
Loma Linda University to measure and analyze functional movements. 

Adjacent to rehabilitation evaluation systems are systems utilizing the same measurement technology 
to provide ergonomic evaluation and injury prevention. Workplace ergonomics has already received a 
boost from new VR technologies that enable customized workstations tailored to individual requirements 
[Greenleaf, 1994]. In another area, surgical room ergonomics for medical personnel and patients is 
projected to reduce the hostile and complicated interface among patients, health care providers, and 
surgical spaces [Kaplan, 1994]. 

Motion Analysis Software (MAS) can assess and analyze upper extremity function from dynamic 
measurement data acquired by the improved DataGlove. This technology not only provides highly 
objective measurement, but also ensures more accurate methods for collecting data and performing 
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quantitative analyses for physicians, therapists, and ergonomics specialists involved in job site evaluation 
and design. The DataGlove/MAS technology is contributing to a greater understanding of upper extremity 
biomechanics and kinesiology. Basic DataGlove technology coupled with VR media will offer numerous 
opportunities for the rehabilitation sector of the medical market, not the least of which is the positive 
implication for enhancing patient recovery by making the process more realistic and participatory. 

One exciting aspect of VR technology is the inherent ability to enable individuals with physical 
disabilities to accomplish tasks and have experiences otherwise denied them. The strategies currently 
employed for disability- related VR research include head-mounted displays; position/orientation sensing; 
tactile feedback; eye tracking; 3D sound systems; data input devices; image generation; and optics. For 
physically disabled persons, VR will provide a new link to capabilities and experiences heretofore unat- 
tainable, such as: 

• An individual with cerebral palsy who is confined to a wheelchair can operate a telephone switch- 
board, play hand ball, and dance [Greenleaf, 1994] within a virtual environment. 

• Patients with spinal cord injury or CNS dysfunction can relearn limb movements through adaptive 
visual feedback in virtual environments [Steffin, 1997]. 

• Disabled individuals can be in one location while their "virtual being" is in a totally different 
location. This opens all manner of possibilities for participating in work, study, or leisure activities 
anywhere in the world without leaving home. 

• Physically disabled individuals could interact with real world activities through robotic devices 
they control from within the virtual world. 

• Blind persons could practice and plan in advance navigating through or among buildings if the 
accesses represented in a virtual world were made up of 3-dimensional sound images and tactile 
stimuli [Max and Gonzalez, 1997]. 

One novel application of VR technology to disabilities has been demonstrated by Inman [1994a,b; 
1996] who trained handicapped children to operate wheelchairs without the inherent dangers of such 
training. Inman trained children with cerebral palsy or orthopedic impairments to navigate virtual worlds 
that contained simulated physical conditions that they would encounter in the real world. By training in 
a virtual environment, the children acquired an appreciation of the dynamics of their wheelchairs and 
of the dangers that they might encounter in the real world, but in a safe setting. The safety aspect of VR 
training is impossible to achieve with other training technologies. 

VR will also enable persons with disabilities to experience situations and sensations not accessible in a 
physical world. Learning and working environments can be tailored to specific needs with VR. For example, 
since the virtual world can be superimposed over the real world, a learner could move progressively from 
a highly supported mode of performing a task in the virtual world, through to performing it unassisted 
in the real world. One project, "Wheelchair VR" [Trimble, 1992], is a highly specialized architectural 
software being developed to aid in the design of barrier- free building for persons with disabilities. 

VR also presents unique opportunities for retraining persons who have incurred neuropsychological 
deficits. The cognitive deficits associated with neurophysiological insults are often difficult to assess and 
for this and other reasons, not readily amenable to treatment. Several groups have begun to investigate 
the use of VR in the assessment and retraining of persons with neuropsychological cognitive impairments. 
Pugnetti and colleagues [1995a,b; 1996] have begun to develop VR-based scenarios based on standardized 
tests of cognitive function, for example card-sorting tasks. Because the tester has essentially absolute control 
over the cognitive environment in which the test is administered, the effects of environmental distractors 
can be both assessed and controlled. Initial tests by the Italian group suggests that VR-based cognitive tests 
offer great promise for both evaluation and retraining of brain-injury-associated cognitive deficits. 

Other groups have been developing cognitive tests based on mental rotation of virtual objects [Buch- 
walter and Rizzo, 1997; Rizzo and Buchwalter, 1997] or spatial memory of virtual buildings [Attree et al., 
1996]. Although VR technology shows promise in both the assessment and treatment of persons with 
acquired cognitive deficits, Buchwalter and Rizzo find that the technology is currently too cumbersome, 
too unfamiliar, and has too many side effects to allow many patients to benefit from its use. 
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Attree and associates, however, have begun to develop a taxonomy of virtual reality applications in 
cognitive rehabilitation. Their study examined the differences between active and passive navigation of 
virtual environments. They report that active navigation improves spatial memory of the route through 
the virtual environment, while passive navigation improves object recognition for landmarks within the 
virtual environment. This report sends a clear message that VR is capable of improving cognitive function, 
but that much more work is required to understand how the technology interacts with the human psyche. 

One researcher who is actively pursuing this understanding is Dorothy Strickland [1995, 1996], who 
has developed a virtual environment for working with autistic children. Strickland's premise is that VR 
technology allows precise control of visual and auditory stimuli within virtual environments. Frequently, 
autistic children find the magnitude of environmental stimuli overwhelming, so it is difficult for them 
to focus. Strickland has developed a sparse virtual environment into which stimuli can be introduced as 
the autistic child becomes capable of handling more environmental stimulation. Essentially, she produces 
successive approximations to a real environment at a rate that the autistic child can tolerate. Her work 
has shown that autistic children can benefit greatly by exposure to successively richer environments and 
that the advances that they make in virtual environments transfer to the real world. 

Max and Burke [1998], also using VR with autistic children, report that contrary to expectations, their 
patients were able to focus on the virtual environment, rather than "fidgeting" as they are prone to do 
in other environments. These researchers also found that in the absence of competing acoustic stimuli, 
autistic children were drawn to loud music and shunned soft choral music. These findings may provide 
important insights into the etiology of autism that could not be observed in the "real world" setting. 

Telesurgery and Telemedicine 

Telepresence is the "sister field" of VR. Classically defined as the ability to act and interact in an off-site 
environment by making use of VR technology, telepresence is emerging as an area of development in its 
own right. Telemedicine (the telepresence of medical experts) is being explored as a way to reduce the 
cost of medical practice and to bring expertise into remote areas [Burrow, 1994; Rosen, 1994; Rovetta 
etal., 1997; Rissam et al, 1998]. 

Telesurgery is a fertile area for development. On the verge of realization, telesurgery (remote surgery) 
will help resolve issues that can complicate or compromise surgery, among them: 

• The patient is too ill or injured to be moved for surgery. 

• A specialist surgeon is located at some distance from the patient requiring specialized attention. 

• Accident victims may have a better chance of survival if immediate, on-the-scene surgery can be 
performed remotely by an emergency room surgeon at a local hospital. 

• Soldiers wounded in battle could undergo surgery on the battlefield by a surgeon located elsewhere. 

The surgeon really does operate — on flesh, not a computer animation. And while the distance aspect 
of remote surgery is a provocative one, telepresence is proving an aid in non-remote surgery as well. It 
can help surgeons gain dexterity over conventional methods of manipulation. This is expected to be 
particularly important in laparoscopic surgery. For example, suturing and knot-tying will be as easy to 
see in laparoscopic surgery as it is in open surgery because telepresence enables the surgery to look and 
feel like open surgery. 

As developed at SRI International [Satava 1992; Hill et al., 1995, 1998], telepresence not only offers a 
compelling sense of reality for the surgeon, but also allows the surgeon to perform the surgery according 
to the usual methods and procedures. There is nothing new to learn. Hand motions are quick and precise. 
The visual field, instrument motion, and force feedback can all be scaled to make microsurgery easier 
than it would be if the surgeon were at the patient's side. While the current technology has been 
implemented in prototype, SRI and Telesurgical Corporation, based in Redwood City, California, are 
collaborating to develop a full system based on this novel concept. 

The system uses color video cameras to image the surgical field in stereo, which the remote surgeon 
views with stereo shutter glasses. The remote surgeon grasps a pair of force-reflecting 6-DOF remote 
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manipulators linked to a slaved pair of end effectors in the surgical field. Coordinated visual, auditory, 
and haptic feedback from the end effectors provides the remote surgeon with a compelling sense of 
presence at the surgical site. 

Because the remote manipulators and end effectors are only linked electronically, the gain between 
them can be adjusted to provide the remote surgeon with microsurgically precise movement of the slaved 
instruments with relatively gross movement of the master manipulators. Researchers at SRI have dem- 
onstrated the microsurgery capabilities of this system by performing microvascular surgery on rats. 

Again, because only an electronic link exists between manipulator and end effector, separation of the 
remote surgeon and the surgical field can be on a global scale. To demonstrate this global remote surgery 
capability, Rovetta and colleagues [1998] established Internet and ISDN networks between Monterey, 
California, and Milan, Italy, and used these networks to remotely control a surgical robot. By manipulating 
controls in Monterey, Professor Rovetta was able to perform simulated biopsies on liver, prostate, and 
breasts in his laboratory in Milan, Italy. Using both an Internet and ISDN network facilitated the 
transmission of large amounts of image data and robotic control signals in a timely manner. 

Prior to the telesurgery demonstration between Monterey and Milan, Rovetta and colleagues [1997] 
established a link between European hospitals and hospitals in Africa as part of the Telehealth In Africa 
Program of the European Collaboration Group for Telemedicine. Among the objectives of this program 
was a demonstration of the feasibility of transmitting large amounts of medical data including X-ray and 
other diagnostic images, case histories, and diagnostic analyses. Once established, the data link could be 
used for teleconsulting and teletraining. 

Teleconsulting is becoming commonplace. Within the last year, the U.S. Department of Veteran Affairs 
has established a network to link rural and urban Vet Centers. This network, which will initially link 
20 Vet Centers, will provide teleconsultation for chronic disease screening, trauma outreach, and psy- 
chosocial care. In Europe, where telemedicine is making great strides, the Telemedical Information Society 
has been established. (See [Marsh, 1998], for additional information.) At the International Telemedical 
Information Society '98 Conference held in Amsterdam in April 1998, a number of important issues 
pertaining to teleconsultation were addressed. Among these were discussions about the appropriate 
network for telemedicine, e.g., the World Wide Web, ATM, and privacy and liability issues. 

Privacy, that is security of transmitted medical information, will be a continuing problem as hackers 
hone their ability to circumvent Internet security [Radesovich, 1997]. Asian and colleagues [1998] have 
examined the practicality of using 128-bit encryption for securing the transmission of medical images. 
Using software called Photomailer™, Asian at Duke Medical Center and his colleagues at Boston Uni- 
versity Medical Center and Johns Hopkins encrypted 60 medical images and transmitted them over the 
Internet. They then attempted to access the encrypted images without Photomailer™ installed on the 
receiving computer. They reported complete success and insignificant increases in processing time with 
the encrypting software installed. 

Behavioral Evaluation and Intervention 

VR technology has been successfully applied to a number of behavioral conditions over the last few years. 
Among the greatest breakthoughs attained through the use of this technology is the relief of akinesia, a 
symptom of Parkinsonism wherein a patient has progressively greater difficulty initiating and sustaining 
walking. The condition can be mitigated by treatment with drugs such as L-dopa, a precursor of the 
natural neural transmitters dopamine, but usually not without unwanted side effects. Now, collaborators 
at the Human Interface Technology Laboratory at the University of Washington, along with the Univer- 
sity's Department of Rehabilitation Medicine and the San Francisco Parkinson's Institute, are using virtual 
imagery to simulate an effect called kinesia paradoxa, or the triggering of normal walking behavior in 
akinetic Parkinson's patients [Weghorst 1994]. 

Using a commercial, field-multiplexed, "heads-up" video display, the research team has developed an 
approach that elicits near-normal walking by presenting collimated virtual images of objects and abstract 
visual cues moving through the patient's visual field at speeds that emulate normal walking. The 
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combination of image collimation and animation speed reinforces the illusion of space-stabilized visual 
cues at the patient's feet. This novel, VR- assisted technology may also prove to be therapeutically useful 
for other movement disorders. 

In the area of phobia intervention, Lamson [Lamson, 1994, 1997; Lamson and Meisner, 1994] has 
investigated the diagnostic and treatment possibilities of VR immersion on anxiety, panic, and phobia 
of heights. By immersing both patients and controls in computer-generated situations, the researchers 
were able to expose the subjects to anxiety provoking situations (such as jumping from a height) in a 
controlled manner. Experimental results indicated a significant subject habituation and desensitization 
through this approach, and the approach appears clinically useful. 

The effectiveness of VR as a treatment for acrophobia has been critically examined by Rothbaum and 
associates [1995a,b]. They used a procedure in which phobic patients were immersed in virtual environ- 
ments that could be altered to present graded fear-inducing situations. Patient responses were monitored 
and used to modify the virtual threats accordingly. Rothbaum et al. report that the ability to present 
graded exposure to aversive environments through the use of VR technology provides a very effective 
means of overcoming acrophobia. Transfer to the real world was quite good. 

The use of VR technology in the treatment of phobias has expanded in recent years to include fear of 
flying [Rothbaum et al., 1996; North et al., 1997], arachnophobia [Carlin et al, 1997], and sexual dys- 
function [Optale et al., 1998]. In each of these areas, patients attained a significant reduction in their 
phobias through graded exposure to fear-provoking virtual environments. In the treatment of arachno- 
phobia, Carlin and associates devised a clever synthesis of real and virtual environments to desensitize 
patients to spiders. Patients viewed virtual spiders while at the same time touching the simulated fuzziness 
of a spider. The tactile aspect of the exposure was produced by having the patient touch a toupe. 

Before leaving the subject of VR treatment of phobias, a word of caution is necessary. Bloom [1997] 
has examined the application of VR technology to the treatment of psychiatric disorders and sounds a 
note of warning. Bloom notes the advances made in the treatment of anxiety disorders, but also points 
out that there are examples of physiological and psychological side effects from immersion in VR. While 
the major side effects tend to be physiological, mainly simulator sickness and Sopite Syndrome (malaise, 
lethargy), psychological maladaptations, such as "...unpredictable modifications in perceptions of social 
context..." and "...fragmentation of self" [Bloom, 1997, p 12] reportedly occur. 

69.5 Summary 

VR tools and techniques are being developed rapidly in the scientific, engineering, and medical areas. 
This technology will directly affect medical practice. Computer simulation will allow physicians to 
practice surgical procedures in a virtual environment in which there is no risk to patients, and where 
mistakes can be recognized and rectified immediately by the computer. Procedures can be reviewed from 
new, insightful perspectives that are not possible in the real world. 

The innovators in medical VR will be called upon to refine technical efficiency and increase physical 
and psychological comfort and capability, while keeping an eye on reducing costs for health care. The 
mandate is complex, but like VR technology itself, the possibilities are very exciting. While the possibil- 
ities — and the need — for medical VR are immense, approaches and solutions using new VR-based 
applications require diligent, cooperative efforts among technology developers, medical practitioners, 
and medical consumers to establish where future requirements and demand will lie. 

Defining Terms 

For an excellent treatment of the state-of-the-art of VR and its taxonomy, see the ACM SIGGRAPH 
publication "Computer Graphics," Vol. 26, #3, August 1992. It covers the U.S. Government's National 
Science Foundation invitational workshop on Interactive Systems Program, March 23-24, 1992, which 
served to identify and recommend future research directions in the area of virtual environments. A more 
in-depth exposition of VR taxonomy can be found in the MIT Journal "Presence", Vol. 1 #2. 
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NOT TOO LONG AGO, the term medical instrument stood for simple hand-held instruments 
used by physicians for observing patients, examining organs, making simple measurements, or 
administering medication. These small instruments, such as stethoscopes, thermometers, tongue 
depressors, and a few surgical tools, typically fit into a physician's hand bag. Today's medical instruments 
are considerably more complicated and diverse, primarily because they incorporate electronic systems 
for sensing, transducing, manipulating, storing, and displaying data or information. Furthermore, med- 
ical specialists today request detailed and accurate measurements of a vast number of physiologic param- 
eters for diagnosing illnesses and prescribe complicated procedures for treating these. As a result, the 
number of medical instruments and devices has grown from a few hundred a generation ago to more 
than 10,000 today, and the complexity of these instruments has grown at the same pace. The description 
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of all these instruments and devices would fill an entire handbook by itself; however, due to the limited 
space assigned to this topic, only a selected number are described. 

While medical instruments acquire and process information and data for monitoring patients and 
diagnosing illnesses, medical devices use electrical, mechanical, chemical, or radiation energy for achiev- 
ing a desired therapeutic purpose, maintaining physiologic functions, or assisting a patient's healing 
process. To mention only a few functions, medical devices pump blood, remove metabolic waste products, 
destroy kidney stones, infuse fluids and drugs, stimulate muscles and nerves, cut tissue, administer 
anesthesia, alleviate pain, restore function, or warm tissue. Because of their complexity, medical devices 
are used mostly in hospitals and medical centers by trained personnel, but some also can be found in 
private homes operated by patients themselves or their caregivers. 

This section on medical instruments and devices neither replaces a textbook on this subject nor presents 
the material in a typical textbook manner. The authors assume the reader to be interested in but not 
knowledgeable on the subject. Therefore, each chapter begins with a short introduction to the subject 
material, followed by a brief description of current practices and principles, and ends with recent trends 
and developments. Whenever appropriate, equations, diagrams, and pictures amplify and illustrate the 
topic, while tables summarize facts and data. The short reference section at the end of each chapter points 
toward further resource materials, including books, journal articles, patents, and company brochures. 

The chapters in the first half of this section cover the more traditional topics of bioinstrumentation, 
such as biopotential amplifiers and noninvasive blood pressure, blood flow, and respiration monitors, 
while those of the second half focus more on recently developed instruments and devices such as pulse 
oximeters or home-care monitoring devices. Some of this latter material is new or hard to find elsewhere. 
A few traditional bioinstrumentation or electroencephalography have been omitted entirely because most 
textbooks on this subject give excellent introductions and reviews. Transducers, biosensors, and electrodes 
are covered in other sections of this Handbook. Thus this section provides an overview, albeit an incom- 
plete one, of recent developments in the field of medical instruments and devices. 




A typical IV infusion system. 
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Biosignals are recorded as potentials, voltages, and electrical field strengths generated by nerves and 
muscles. The measurements involve voltages at very low levels, typically ranging between 1 uV and 
100 mV, with high source impedances and superimposed high level interference signals and noise. The 
signals need to be amplified to make them compatible with devices such as displays, recorders, or A/D 
converters for computerized equipment. Amplifiers adequate to measure these signals have to satisfy very 
specific requirements. They have to provide amplification selective to the physiological signal, reject 
superimposed noise and interference signals, and guarantee protection from damages through voltage 
and current surges for both patient and electronic equipment. Amplifiers featuring these specifications 
are known as biopotential amplifiers. Basic requirements and features, as well as some specialized systems, 
will be presented. 

70.1 Basic Amplifier Requirements 

The basic requirements that a biopotential amplifier has to satisfy are: 

■ the physiological process to be monitored should not be influenced in any way by the amplifier 

• the measured signal should not be distorted 

• the amplifier should provide the best possible separation of signal and interferences 

■ the amplifier has to offer protection of the patient from any hazard of electrical shock 

■ the amplifier itself has to be protected against damages that might result from high input voltages 
as they occur during the application of defibrillators or electrosurgical instrumentation 

A typical configuration for the measurement of biopotentials is shown in Fig. 70.1. Three electrodes, 
two of them picking up the biological signal and the third providing the reference potential, connect the 
subject to the amplifier. The input signal to the amplifier consists of five components: (1) the desired 
biopotential, (2) undesired biopotentials, (3) a power line interference signal of 60 Hz (50 Hz in some 
countries) and its harmonics, (4) interference signals generated by the tissue/electrode interface, and 
(5) noise. Proper design of the amplifier provides rejection of a large portion of the signal interferences. 
The main task of the differential amplifier as shown in Fig. 70. 1 is to reject the line frequency interference 
that is electrostatically or magnetically coupled into the subject. The desired biopotential appears as a 
voltage between the two input terminals of the differential amplifier and is referred to as the differential 
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FIGURE 70.1 Typical configuration for the measurement of biopotentials. The biological signal V biol appears 
between the two measuring electrodes at the right and left arm of the patient, and is fed to the inverting and the 
non-inverting inputs of the differential amplifier. The right leg electrode provides the reference potential for the 
amplifier with a common mode voltage V c as indicated. 

signal. The line frequency interference signal shows only very small differences in amplitude and phase 
between the two measuring electrodes, causing approximately the same potential at both inputs, and 
thus appears only between the inputs and ground and is called the common mode signal. Strong rejection 
of the common mode signal is one of the most important characteristics of a good biopotential amplifier. 
The common mode rejection ratio (or CMRR) of an amplifier is defined as the ratio of the differential 
mode gain over the common mode gain. As seen in Fig. 70.1, the rejection of the common mode signal 
in a biopotential amplifier is both a function of the amplifier CMRR and the source impedances Z 1 and 
Z 2 . For the ideal biopotential amplifier with Z l = Z 2 and infinite CMRR of the differential amplifier, the 
output voltage is the pure biological signal amplified by G D , the differential mode gain: V out = G D -V biol . 
With finite CMRR, the common mode signal is not completely rejected, adding the interference term 
G D -V C /CMRR to the output signal. Even in the case of an ideal differential amplifier with infinite CMRR, 
the common mode signal will not completely disappear unless the source impedances are equal. The 
common mode signal V c causes currents to flow through Z l and Z 2 . The related voltage drops show a 
difference if the source impedances are unequal, thus generating a differential signal at the amplifier 
input which, of course, is not rejected by the differential amplifier. With amplifier gain G D and input 
impedance Z in , the output voltage of the amplifier is: 



G V 

V =G n V,.,+ D c +G n V 

out D bio! -n, /nri D c 



CMRR 



Z +Z,-Z, 



(70.1) 



The output of a real biopotential amplifier will always consist of the desired output component due 
to a differential biosignal, an undesired component due to incomplete rejection of common mode 
interference signals as a function of CMRR, and an undesired component due to source impedance 
unbalance allowing a small proportion of a common mode signal to appear as a differential signal to the 
amplifier. Since source impedance unbalances of 5,000 to 10,000 Q, mainly caused by electrodes, are not 
uncommon, and sufficient rejection of line frequency interferences requires a minimum CMRR of 100 dB, 
the input impedance of the amplifier should be at least 10 9 Q at 60 Hz to prevent source impedance 
unbalances from deteriorating the overall CMRR of the amplifier. State-of-the-art biopotential amplifiers 
provide a CMRR of 120 to 140 dB. 

In order to provide optimum signal quality and adequate voltage level for further signal processing, 
the amplifier has to provide a gain of 100 to 50,000 and needs to maintain the best possible signal-to- 
noise ratio. The presence of high level interference signals not only deteriorates the quality of the 
physiological signals, but also restricts the design of the biopotential amplifier. Electrode half-cell poten- 
tials, for example, limit the gain factor of the first amplifier stage since their amplitude can be several 
orders of magnitude larger than the amplitude of the physiological signal. To prevent the amplifier from 
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FIGURE 70.2 Schematic design of the main stages of a biopotential amplifier. Three electrodes connect the patient 
to a preamplifier stage. After removing dc and low-frequency interferences, the signal is connected to an output low- 
pass filter through an isolation stage which provides electrical safety to the patient, prevents ground loops, and 
reduces the influence of interference signals. 

going into saturation, this component has to be eliminated before the required gain can be provided for 
the physiological signal. 

A typical design of the various stages of a biopotential amplifier is shown in Fig. 70.2. The electrodes 
which provide the transition between the ionic flow of currents in biological tissue and the electronic 
flow of current in the amplifier, represent a complex electrochemical system that is described elsewhere 
in this handbook. The electrodes determine to a large extent the composition of the measured signal. 
The preamplifier represents the most critical part of the amplifier itself since it sets the stage for the 
quality of the biosignal. With proper design, the preamplifier can eliminate, or at least minimize, most 
of the signals interfering with the measurement of biopotentials. 

In addition to electrode potentials and electromagnetic interferences, noise — generated by the amplifier 
and the connection between biological source and amplifier — has to be taken into account when designing 
the preamplifier. The total source resistance R s , including the resistance of the biological source and all 
transition resistances between signal source and amplifier input, causes thermal voltage noise with a root 
mean square (rms) value of: 



"V 



4kTRB 



(Volt) 
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where k = Boltzmann constant, T = absolute temperature, R s = resistance in Q, and B = bandwidth in Hz. 
Additionally, there is the inherent amplifier noise. It consists of two frequency-dependent components, 
the internal voltage noise source e n and the voltage drop across the source resistance R s caused by an 
internal current noise generator i n . The total input noise for the amplifier with a bandwidth of B = f 2 - 
fj is calculated as the sum of its three independent components: 
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High signal-to-noise ratios thus require the use of very low noise amplifiers and the limitation of 
bandwidth. Current technology offers differential amplifiers with voltage noise of less than 10 nV/ Vtiz 
and current noise less than 1 pA/ jHz . Both parameters are frequency dependent and decrease approx- 
imately with the square root of frequency. The exact relationship depends on the technology of the 
amplifier input stage. Field effect transistor (FET) preamplifiers exhibit about five times the voltage noise 
density compared to bipolar transistors but a current noise density that is about 100 times smaller. 
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The purpose of the high pass and low pass filters in Fig. 70.2 is to eliminate interference signals like 
electrode half-cell potentials and preamplifier offset potentials and to reduce the noise amplitude by the 
limitation of the amplifier bandwidth. Since the biosignal should not be distorted or attenuated, higher 
order sharp-cutting linear phase filters have to be used. Active Bessel filters are preferred filter types due 
to their smooth transfer function. Separation of biosignal and interference is in most cases incomplete 
due to the overlap of their spectra. 

The isolation stage serves the galvanic decoupling of the patient from the measuring equipment and 
provides safety from electrical hazards. This stage also prevents galvanic currents from deteriorating the 
signal-to-noise ratio especially by preventing ground loops. Various principles can be used to realize the 
isolation stage. Analog isolation amplifiers use either transformer, optical, or capacitive couplers to 
transmit the signal through the isolation barrier. Digital isolation amplifiers use a voltage/frequency 
converter to digitize the signal before it is transmitted easily by optical or inductive couplers to the output 
frequency/voltage converter. The most important characteristics of an isolation amplifier are low leakage 
current, isolation impedance, isolation voltage (or mode) rejection (IMR), and maximum safe isolation 
voltage. 

Interferences 

The most critical point in the measurement of biopotentials is the contact between electrodes and 
biological tissue. Both the electrode offset potential and the electrode/tissue impedance are subject to 
changes due to relative movements of electrode and tissue. Thus, two interference signals are generated 
as motion artifacts: the changes of the electrode potential and motion-induced changes of the voltage 
drop caused by the input current of the preamplifier. These motion artifacts can be minimized by 
providing high input impedances for the preamplifier, usage of non-polarized electrodes with low half- 
cell potentials such as Ag/AgCl electrodes, and by reducing the source impedance by use of electrode gel. 
Motion artifacts, interferences from external electromagnetic fields, and noise can also be generated in 
the wires connecting electrodes and amplifier. Reduction of these interferences is achieved by using 
twisted pair cables, shielded wires, and input guarding. 

Recording of biopotentials is often done in an environment that is equipped with many electrical 
systems which produce strong electrical and magnetic fields. In addition to 60 Hz power line frequency 
and some strong harmonics, high frequency electromagnetic fields are encountered. At power line 
frequency, the electric and magnetic components of the interfering fields can be considered separately. 
Electrical fields are caused by all conductors that are connected to power, even with no flow of current. 
A current is capacitively coupled into the body where it flows to the ground electrode. If an isolation 
amplifier is used without patient ground, the current is capacitively coupled to ground. In this case, the 
body potential floats with a voltage of up to 100 V towards ground. Minimizing interferences requires 
increasing the distance between power lines and the body, use of isolation amplifiers, separate grounding 
of the body at a location as far away from the measuring electrodes as possible, and use of shielded 
electrode cables. 

The magnetic field components produce eddy currents in the body. The amplifier, the electrode cable, 
and the body form an induction loop that is subject to the generation of an interference signal. Minimizing 
this interference signal requires increasing the distance between the interference source and patient, 
twisting the connecting cables, shielding of the magnetic fields, and relocating the patient to a place and 
orientation that offers minimum interference signals. In many cases, an additional narrow band-rejection 
filter (notch filter) is implemented as an additional stage in the biopotential amplifier to provide sufficient 
suppression of line frequency interferences. 

In order to achieve optimum signal quality, the biopotential amplifier has to be adapted to the specific 
application. Based on the signal parameters, both appropriate bandwidth and gain factor are chosen. 
Figure 70.3 shows an overview of the most commonly measured biopotentials and specifies the normal 
ranges for amplitude and bandwidth. 



S) 2000 by CRC Press LLC 



1 - 

-1 
10 - 

-2 

10 - 

> 

I10- 3 - 

Q- 

E 
< 

-4 

10 - 

-5 

10 - 














Efectrodt potentials 










Cell potentials - 

resting and 
action potentials 




Cerebral potentials 

intracranially 

recorded 


Electrocardiogram 






,. ...,„,.„. 








Electroencephalogram 










Electroretinogram 




slecfromyogram 




| 




I ' I t I ' l i 

-10 12 3 4 

10 10 10 10 10 10 
Frequency [Hz] 



FIGURE 70.3 Amplitudes and spectral ranges of some important biosignals. The various biopotentials completely 
cover the area from lO -6 V to almost 1 V and from dc to 10 kHz. 

A final requirement for biopotential amplifiers is the need for calibration. Since the amplitude of the 
biopotential often has to be determined very accurately, there must be provisions to easily determine the 
gain or the amplitude range referenced to the input of the amplifier. For this purpose, the gain of the 
amplifier must be well calibrated. In order to prevent difficulties with calibrations, some amplifiers that 
need to have adjustable gain use a number of fixed gain settings rather than providing a continuous gain 
control. Some amplifiers have a standard signal source of known amplitude built in that can be momen- 
tarily connected to the input by the push of a button to check the calibration at the output of the 
biopotential amplifier. 

70.2 Special Circuits 
Instrumentation Amplifier 

An important stage of all biopotential amplifiers is the input preamplifier which substantially contributes 
to the overall quality of the system. The main tasks of the preamplifier are to sense the voltage between 
two measuring electrodes while rejecting the common mode signal, and minimizing the effect of electrode 
polarization overpotentials. Crucial to the performance of the preamplifier is the input impedance which 
should be as high as possible. Such a differential amplifier cannot be realized using a standard single 
operational amplifier (op-amp) design since this does not provide the necessary high input impedance. 
The general solution to the problem involves voltage followers, or noninverting amplifiers, to attain high 
input impedances. A possible realization is shown in Fig. 70.4a. The main disadvantage of this circuit is 
that it requires high CMRR both in the followers and in the final op-amp. With the input buffers working 
at unity gain, all the common-mode rejection must be accomplished in the output amplifier, requiring 
very precise resistor matching. Additionally, the noise of the final op-amp is added at a low signal level, 
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FIGURE 70.4 Circuit drawings for three different realizations of instrumentation amplifiers for biomedical appli- 
cations. Voltage follower input stage (a), improved, amplifying input stage (b), and 2-op-amp version (c). 



decreasing the signal-to-noise ratio unnecessarily. The circuit in Fig. 70.4b eliminates this disadvantage. 
It represents the standard instrumentation amplifier configuration. The two input op-amps provide high 
differential gain and unity common-mode gain without the requirement of close resistor matching. The 
differential output from the first stage represents a signal with substantial relative reduction of the 
common-mode signal and is used to drive a standard differential amplifier which further reduces the 
common-mode signal. CMRR of the output op-amp as well as resistor matching in its circuit are less 
critical than in the follower type instrumentation amplifier. Offset trimming for the whole circuit can 
be done at one of the input op-amps. Complete instrumentation amplifier integrated circuits based on 
this standard instrumentation amplifier configuration are available from several manufacturers. All 
components except R p which determines the gain of the amplifier, and the potentiometer for offset 
trimming are contained on the integrated circuit chip. Figure 70.4c shows another configuration that 
offers high input impedance with only two op-amps. For good CMRR, however, it requires precise resistor 
matching. 

In applications where DC and very low frequency biopotentials are not to be measured, it would be 
desirable to block those signal components at the preamplifier inputs by simply adding a capacitor 
working as a passive high-pass filter. This would eliminate the electrode offset potentials and permit a 
higher gain factor for the preamplifier and thus a higher CMRR. A capacitor between electrodes and 
amplifier input would, however, result in charging effects from the input bias current. Due to the difficulty 
of precisely matching capacitors for the two input leads, they would also contribute to an increased source 
impedance unbalance and thus reduce CMRR. Avoiding the problem of charging effects by adding a 
resistor between the preamplifier inputs and ground as shown in Fig. 70.5a also results in a decrease of 
CMRR due to the diminished and mismatched input impedance. A 1% mismatch for two 1-MQ resistors 
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FIGURE 70.5 AC coupled instrumentation amplifier designs. The classical design using an RC high-pass filter at 
the inputs (a), and a high CMRR "quasi-high-pass" amplifier as proposed by Lu (b). 
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can already create a -60 dB loss in CMRR. The loss 
in CMRR is much greater if the capacitors are mis- 
matched, which cannot be prevented in real systems. 
Nevertheless, such realizations are used where the 
specific situation allows. In some applications, a fur- 
ther reduction of the amplifier to a two-electrode 
amplifier configuration would be convenient, even 
at the expense of some loss in the CMRR. Figure 70.6 
shows a preamplifier design working with two elec- 
trodes and providing AC coupling as proposed by 
Pallas- Areny [1990]. 

A third alternative of eliminating DC and low 
frequencies in the first amplifier stage is a directly 
coupled quasi-high-pass amplifier design, which 
maintains the high CMRR of DC coupled high input 
impedance instrumentation amplifiers [Song et al., 
1998]. In this design, the gain determining resistor 
Rj (Fig. 70.5a) is replaced by a first order high-pass 
filter consisting of R l and a series capacitor C f . The 
signal gain of the amplifier is 



G=l + 



(-) 




-en — cz5- 



c. 



FIGURE 70.6 Composite instrumentation amplifier 
based on an ac-coupled first stage. The second stage is 
based on a one op-amp differential amplifier which can 
be replaced by an instrumentation amplifier. 
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Thus, DC gain is 1, while the high frequency gain remains at G = 1 + 2R 2 /Rj. A realization using an off- 
the-shelf instrumentation amplifier (Burr-Brown INA 118) operates at low power (0,35 mA) with low 
offset voltage (11 uV typical) and low input bias current (1 nA typical), and offers a high CMRR of 
118 dB at a gain of G = 50. The very high input impedance (10 GQ) of the instrumentation amplifier 
renders it insensitive to fluctuations of the electrode impedance. Therefore, it is suitable for bioelectric 
measurements using pasteless electrodes applied to unprepared, i.e., high impedance skin. 

The preamplifier, often implemented as a separate device which is placed close to the electrodes or 
even directly attached to the electrodes, also acts as an impedance converter which allows the transmission 
of even weak signals to the remote monitoring unit. Due to the low output impedance of the preamplifier, 
the input impedance of the following amplifier stage can be low, and still the influence of interference 
signals coupled into the transmission lines is reduced. 

Isolation Amplifier and Patient Safety 

Isolation amplifiers can be used to break ground loops, eliminate source ground connections, and provide 
isolation protection to patient and electronic equipment. In a biopotential amplifier, the main purpose 
of the isolation amplifier is the protection of the patient by eliminating the hazard of electric shock 
resulting from the interaction among patient, amplifier, and other electric devices in the patient's envi- 
ronment, specifically defibrillators and electrosurgical equipment. It also adds to the prevention of line 
frequency interferences. 

Isolation amplifiers are realized in three different technologies: transformer isolation, capacitor isola- 
tion, and opto-isolation. An isolation barrier provides a complete galvanic separation of the input side, 
i.e., patient and preamplifier, from all equipment on the output side. Ideally, there will be no flow of 
electric current across the barrier. The isolation-mode voltage is the voltage which appears across the 
isolation barrier, i.e., between the input common and the output common (Fig. 70.7). The amplifier has 
to withstand the largest expected isolation voltages without damage. Two isolation voltages are specified 
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FIGURE 70.7 Equivalent circuit of an isolation amplifier. The differential amplifier on the left transmits the signal 
through the isolation barrier by a transformer, capacitor, or an opto-coupler. 

for commercial isolation amplifiers: (1) the continuous rating and (2) the test voltage. To eliminate the 
need for longtime testing, the device is tested at about two times the rated continuous voltage. Thus, for 
a continuous rating of 2000 V, the device has to be tested at 4000 to 5000 V for a reasonable period of time. 
Since there is always some leakage across the isolation barrier, the isolation mode rejection ratio 
(IMRR) is not infinite. For a circuit as shown in Fig. 70.7, the output voltage is: 



V 



R G1 + R G2 + R IN 



v D + 
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CM 
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where G is the amplifier gain, V D , V CM , and V ISO are differential, common mode, and isolation voltages, 
respectively, and CMRR is the common mode rejection ratio for the amplifier [Burr-Brown, 1994]. 

Typical values of IMRR for a gain of 10 are 140 dB at DC, and 120 dB at 60 Hz with a source unbalance 
of 5000 Q. The isolation impedance is approximately 1.8 pF || 10 12 Q. 

Transformer coupled isolation amplifiers perform on the basis of inductive transmission of a carrier 
signal that is amplitude modulated by the biosignal. A synchronous demodulator on the output port 
reconstructs the signal before it is fed through a Bessel response low-pass filter to an output buffer. A 
power transformer, generally driven by a 400 to 900 kHz square wave, supplies isolated power to the 
amplifier. 

Optically coupled isolation amplifiers can principally be realized using only a single LED and photo- 
diode combination. While useful for a wide range of digital applications, this design has fundamental 
limitations as to its linearity and stability as a function of time and temperature. A matched photodiode 
design, as used in the Burr-Brown 3650/3652 isolation amplifier, overcomes these difficulties [Burr- 
Brown, 1994]. Operation of the amplifier requires an isolated power supply to drive the input stages. 
Transformer coupled low leakage current isolated DC/DC converters are commonly used for this purpose. 
In some particular applications, especially in cases where the signal is transmitted over a longer distance 
by fiber optics, e.g., ECG amplifiers used for gated magnetic resonance imaging, batteries are used to 
power the amplifier. Fiber optic coupling in isolation amplifiers is another option that offers the advantage 
of higher flexibility in the placement of parts on the amplifier board. 

Biopotential amplifiers have to provide sufficient protection from electrical shock to both user and 
patient. Electrical- safety codes and standards specify the minimum safety requirements for the equip- 
ment, especially the maximum leakage currents for chassis and patient leads, and the power distribution 
system [Webster, 1992; AAMI, 1993]. 

Special attention to patient safety is required in situations where biopotential amplifiers are connected 
to personal computers which are more and more often used to process and store physiological signals 
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and data. Due to the design of the power supplies used in standard PCs permitting high leakage cur- 
rents — an inadequate situation for a medical environment — there is a potential risk involved even when 
the patient is isolated from the PC through an isolation amplifier stage or optical signal transmission 
from the amplifier to the computer. This holds especially in those cases where, due to the proximity of 
the PC to the patient, an operator might touch patient and computer at the same time, or the patient 
might touch the computer. It is required that a special power supply with sufficient limitation of leakage 
currents is used in the computer, or that an additional, medical grade isolation transformer is used to 
provide the necessary isolation between power outlet and PC. 

Surge Protection 

The isolation amplifiers described in the preceding paragraph are primarily used for the protection of 
the patient from electric shock. Voltage surges between electrodes as they occur during the application 
of a defibrillator or electrosurgical instrumentation also present a risk to the biopotential amplifier. 
Biopotential amplifiers should be protected against serious damage to the electronic circuits. This is also 
part of the patient safety since defective input stages could otherwise apply dangerous current levels to 
the patient. To achieve this protection, voltage limiting devices are connected between each measuring 
electrode and electric ground. Ideally, these devices do not represent a shunt impedance and thus do not 
lower the input impedance of the preamplifier as long as the input voltage remains in a range considered 
safe for the equipment. They appear as an open circuit. As soon as the voltage drop across the device 
reaches a critical value V b , the impedance of the device changes sharply and current passes through it to 
such an extent that the voltage cannot exceed V b due to the voltage drop across the series resistor R as 
indicated in Fig. 70.8. 

Devices used for amplifier protection are diodes, Zener diodes, and gas-discharge tubes. Parallel silicon 
diodes limit the voltage to approximately 600 mV. The transition from nonconducting to conducting 
state is not very sharp, and signal distortion begins at about 300 mV which can be within the range of 
input voltages depending on the electrodes used. The breakdown voltage can be increased by connecting 
several diodes in series. Higher breakdown voltages are achieved by Zener diodes connected back to back. 
One of the diodes will be biased in the forward direction and the other in the reverse direction. The 
breakdown voltage in the forward direction is approximately 600 mV, but the breakdown voltage in the 
reverse direction is higher, generally in the range of 3 to 20 V, with a sharper voltage-current characteristic 
than the diode circuit. 
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FIGURE 70.8 Protection of the amplifier input against high- voltage transients. The connection diagram for voltage- 
limiting elements is shown in panel (a) with two optional resistors R' at the input. A typical current-voltage 
characteristic is shown in panel (b). Voltage-limiting elements shown are the anti-parallel connection of diodes (c), 
anti-parallel connection of Zener diodes (d), and gas-discharge tubes (e). 
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A preferred voltage-limiting device for bio- 
potential amplifiers is the gas-discharge tube. 
Due to its extremely high impedance in the 
nonconducting state, this device appears as an 
open circuit until it reaches its breakdown volt- 
age. At the breakdown voltage which is in the 
range of 50 to 90 V, the tube switches to the 
conducting state and maintains a voltage that 
is usually several volts less than the breakdown 
voltage. Though the voltage maintained by the 
gas-discharge tube is still too high for some 
amplifiers, it is low enough to allow the input 
current to be easily limited to a safe value by 
simple circuit elements such as resistors like the 
resistors R' indicated in Fig. 70.8a. Preferred gas 
discharge tubes for biomedical applications are 
miniature neon lamps which are very inexpen- 
sive and have a symmetric characteristic. 
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Input Guarding 

The common mode input impedance and thus the CMRR of an amplifier can be greatly increased by 
guarding the input circuit. The common mode signal can be obtained by two averaging resistors con- 
nected between the outputs of the two input op-amps of an instrumentation amplifier as shown in 
Fig. 70.9. The buffered common-mode signal at the output of op-amp 4 can be used as guard voltage to 
reduce the effects of cable capacitance and leakage. 

In many modern biopotential amplifiers, the reference electrode is not grounded. Instead, it is con- 
nected to the output of an amplifier for the common mode voltage, op-amp 3 in Fig. 70. 10, which works 
as an inverting amplifier. The inverted common mode voltage is fed back to the reference electrode. This 
negative feedback reduces the common-mode voltage to a low value [Webster, 1992] . Electrocardiographs 
based on this principle are called driven-right-leg systems replacing the right leg ground electrode of 
ordinary electrocardiographs by an actively driven electrode. 




FIGURE 70.10 Driven-right-leg circuit reducing common-mode interference. 
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Dynamic Range and Recovery 

With an increase of either the common mode or differential input voltage there will be a point where 
the amplifier will overload and the output voltage will no longer be representative for the input voltage. 
Similarly, with a decrease of the input voltage, there will be a point where the noise components of the 
output voltage cover the output signal to a degree that a measurement of the desired biopotential is no 
longer possible. The dynamic range of the amplifier, i.e., the range between the smallest and largest 
possible input signal to be measured, has to cover the whole amplitude range of the physiological signal 
of interest. The required dynamic range of biopotential amplifiers can be quite large. In an application 
like fetal monitoring for example, two signals are recorded simultaneously from the electrodes which are 
quite different in their amplitudes: the fetal and the maternal ECG. While the maternal ECG shows an 
amplitude of up to 10 mV, the fetal ECG often does not reach more than 1 uV. Assuming that the fetal 
ECG is separated from the composite signal and fed to an analog/digital converter for digital signal 
processing with a resolution of 10 bit (signed integer), the smallest voltage to be safely measured with 
the biopotential amplifier is 1/512 uV or about 2 nV vs. 10 mV for the largest signal, or even up to 
300 mV in the presence of an electrode offset potential. This translates to a dynamic range of 134 dB for 
the signals alone and 164 dB if the electrode potential is included in the consideration. Though most 
applications are less demanding, even such extreme requirements can be realized through careful design 
of the biopotential amplifer and the use of adequate components. The penalty for using less expensive 
amplifiers with diminished performance would be a potentially severe loss of information. 

Transients appearing at the input of the biopotential amplifier, like voltage peaks from a cardiac 
pacemaker or a defibrillator, can drive the amplifier into saturation. An important characteristic for the 
amplifier is the time it takes to recover from such overloads. The recovery time depends on the charac- 
teristics of the transient, like amplitude and duration, the specific design of the amplifier, like bandwidth, 
and the components used. Typical biopotential amplifiers may take several seconds to recover from severe 
overload. The recovery time can be reduced by disconnecting the amplifier inputs at the discovery of a 
transient using an electronic switch. 

Passive Isolation Amplifiers 

Increasingly, biopotentials have to be measured within implanted devices and need to be transmitted to 
an external monitor or controller. Such applications include cardiac pacemakers transmitting the intra- 
cardiac ECG and functional electrical stimulation where, e.g., action potentials measured at one eyelid 
serve to stimulate the other lid to restore the physiological function of a damaged lid at least to some 
degree. In these applications, the power consumption of the implanted biopotential amplifier limits the 
lifespan of the implanted device. The usual solution to this problem is an inductive transmission of power 
into the implanted device that serves to recharge an implanted battery. In applications where the size of 
the implant is of concern, it is desirable to eliminate the need for the battery and the related circuitry 
by using a quasi passive biopotential amplifer, i.e., an amplifier that does not need a power supply. 

The function of passive telemetric amplifiers for biopotentials is based on the ability of the biological 
source to drive a low power device such as a FET and the sensing of the biopotentials through inductive 
or acoustic coupling of the implanted and external devices [Nagel, 1982]. In an inductive system, a FET 
serves as a load to an implanted secondary LC-circuit which is stimulated inductively by an extracorporal 
oscillator (Fig. 70.11). Depending on the special realization of the system, the biopotential is available in 
the external circuit from either an amplitude or frequency-modulated carrier-signal. The input impedance 
of the inductive transmitter as a function of the secondary load impedance Z 2 is given by: 

(com) 2 

Z 1 = jC0l 1 + - v '— (70.6) 

Z 2 + j(x>L 2 
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FIGURE 70.11 The passive isolation amplifier can be operated without the need for an isolated power supply. The 
biological source provides the power to modulate the load impedance of an inductive transformer. As an easy 
realization shown in panel (b), a FET can be directly connected to two electrodes. The source-drain resistance changes 
as a linear function of the biopotential which is then reflected by the input impedance of the transformer. 

In an amplitude-modulated system, the resistive part of the input-impedance Zj must change as a 
linear function of the biopotential. The signal is obtained as the envelope of the carrier signal, measured 
across a resistor R m . A frequency-modulated system is realized when the frequency of the signal generator 
is determined at least in part by the impedance Z, of the inductive transmitter. In both cases, the signal- 
dependent changes of the secondary impedance Z 2 can be achieved by a junction-FET. Using the field 
effect transistor as a variable load resistance changing its resistance in proportion to the source-gate 
voltage which is determined by the electrodes of this two-electrode amplifier, the power supplied by the 
biological source is sufficient to drive the amplifier. The input impedance can be in the range of 10 10 Q. 

Optimal transmission characteristics are achieved with AM systems. Different combinations of external 
and implanted resonance circuits are possible to realize in an AM system, but primary parallel with 
secondary serial resonance yields the best characteristics. In this case, the input impedance is given by: 
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jcoq 



v 2 



«z 



(70.7) 



The transmission factor (L[/M) 2 is optimal since the secondary inductivity, i.e., the implanted induc- 
tivity, can be small, only the external inductivity determines the transmission factor and the mutual 
inductivity should be small, a fact that favors the loose coupling that is inherent to two coils separated 
by skin and tissue. There are, of course, limits to M which cannot be seen from Eq. (70.7). In a similar 
fashion, two piezoelectric crystals can be employed to provide the coupling between input and output. 

This 2-lead isolation amplifier design is not limited to telemetric applications. It can also be used in 
all other applications where its main advantage lies in its simplicity and the resulting substantial cost 
savings as compared to other isolation amplifiers which require additional amplifier stages and an 
additional isolated power supply. 
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Digital Electronics 

The ever increasing density of integrated digital circuits together with their extremely low power con- 
sumption permits digitizing and preprocessing of signals already on the isolated patient-side of the 
amplifiers, thus improving signal quality and eliminating the problems normally related to the isolation 
barrier, especially those concerning isolation voltage interferences and long-term stability of the isolation 
amplifiers. Digital signal transmission to a remote monitoring unit, a computer system, or computer 
network can be achieved without any risk of picking up transmission line interferences, especially when 
implemented with fiberoptical cables. 

Digital techniques also offer an easy means of controlling the front-end of the amplifier. Gain factors 
can be easily adapted, and changes of the electrode potential resulting from electrode polarization or 
from interferences which might drive the differential amplifier into saturation can easily be detected and 
compensated. 

Summary 

Biopotential amplifiers are a crucial component in many medical and biological measurements, and 
largely determine the quality and information content of the measured signals. The extremely wide range 
of necessary specifications with regard to bandwidth, sensitivity, dynamic range, gain, CMRR, and patient 
safety leaves only little room for the application of general purpose biopotential amplifiers, and mostly 
requires the use of special purpose amplifiers. 

Defining Terms 

Common Mode Rejection Ratio (CMRR): The ratio between the amplitude of a common mode signal 
and the amplitude of a differential signal that would produce the same output amplitude or as the 
ratio of the differential gain over the common-mode gain: CMRR = G D /G CM . Expressed in decibels, 
the common mode rejection is 20 log 10 CMRR. The common mode rejection is a function of 
frequency and source-impedance unbalance. 

Isolation Mode Rejection Ratio (IMRR): The ratio between the isolation voltage, V ISO , and the ampli- 
tude of the isolation signal appearing at the output of the isolation amplifier, or as isolation voltage 
divided by output voltage V OUT in the absence of differential and common mode signal: IMRR = 

* iso ' "out ■ 
Operational Amplifier (op-amp): A very high gain dc-coupled differential amplifier with single-ended 
output, high voltage gain, high input impedance, and low output impedance. Due to its high open- 
loop gain, the characteristics of an op-amp circuit only depend on its feedback network. Therefore, 
the integrated circuit op-amp is an extremely convenient tool for the realization of linear amplifier 
circuits [Horowitz and Hill, 1980]. 
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The beginnings of noninvasive arterial pulse recording can be traced to the Renaissance. At that time, 
the Polish scientist, Strus [1555] had proposed that the arterial pulse possesses a waveform. Although 
instrumentation that he used was simple, he suggested that changes in the arterial pulse shape and 
strength may be related to disease conditions. Today, even though the technology is more advanced, 
noninvasive arterial blood pressure measurement still remains a challenge. Rigorous methods for extract- 
ing functional cardiovascular information from noninvasive pressure have been limited. 

In this chapter, the most standard of noninvasive methods for arterial pressure measurements will be 
reviewed and future trends will be proposed. Two types of methods for noninvasive arterial pressure 
measurement may be defined; those that periodically sample and those that continuously record the 
pulse waveform. The sampling methods typically provide systolic and diastolic pressure and sometimes 
mean pressure. These values are collected over different heart beats during the course of 1 min. The 
continuous recording methods provide beat-to-beat measurements and often, the entire waveform. Some 
continuous methods only provide pulse pressure waveform and timing information. 

The knowledge of systolic and diastolic pressure is fundamental for the evaluation of basic cardiovas- 
cular function and identifying disease. The choice of method depends on the type of study. For example, 
high blood pressure is a known precursor to many other forms of cardiovascular disease. A noninvasive 
method that samples blood pressure over the time course of months is usually adequate to study the 
progression of hypertension. The occlusive cuff-based methods fall into this category. These methods 
have been automated with recent instruments designed for ambulatory use [Graettinger et al., 1988]. 
Twenty- four to 48 h ambulatory monitors have been applied to monitor the diurnal variation of a patient's 
blood pressure. This type of monitoring can alleviate the problem of "white coat hypertension", i.e., the 
elevation of blood pressure associated with a visit to the physician's office [Pickering et al., 1988]. 
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Short-term hemodynamic information obtained from the noninvasive arterial pulse waveform is a 
virtually untapped arena. While a great deal of knowledge has been gathered on the physics of the arterial 
pulse [Noordergraaf, 1978], it has been lacking in application because continuous pressure waveform 
monitors have not been available. A review of methods for continuous pulse monitoring that fill this gap 
will be provided. 

Some applications for pulse monitoring can be found. In the recording time span of less than 1 min, 
the importance of the pressure waveform dominates, as well as beat-to-beat variations. This type of 
monitoring is critical in situations where blood pressure can alter quickly, such as due to trauma or 
anesthesia. Other applications for acute monitoring have been in aerospace, biofeedback, and lie detec- 
tion. Moreover, pulse dynamics information becomes available such as wave reflection [Li, 1986]. Kelly 
et al. [1989] have shown that elevated systolic pressure can be attributed to pulse waveform changes due 
to a decrease in arterial compliance and increased wave reflection. Lastly, information on the cardiovas- 
cular control process can be obtained from pulse pressure variability [Omboni et al., 1993]. 

It has become increasingly popular to provide simultaneous recording of such variables as noninvasive 
blood pressure, oxygen saturation via pulse oximetry, body temperature, etc., in a single instrument. It 
is apparent that the advances in computer technology impinge on this practice, making it a clear trend. 
While this practice is likely to continue, the forefront of this approach will be those instruments that 
provide more than just a mere marriage of technology in a single design. It will be possible to extract 
functional information in addition to just pressure. As an example of this, a method for noninvasive 
measurement of the arterial pressure-lumen area curve will be provided. 

71.2 Long-Term Sampling Methods 
Vascular Unloading Principle 

The vascular unloading principle is fundamental to all occlusive cuff-based methods of determining 
arterial blood pressure. It is performed by applying an external compression pressure or force to a limb 
such that it is transmitted to the underlying vessels. It is usually assumed that the external pressure and 
the tissue pressure (stress) are in equilibrium. The underlying vessels are then subjected to altered 
transmural pressure (internal minus external pressure) by varying the external pressure. It is further 
assumed [Marey, 1885] that the tension within the wall of the vessel is zero when transmural pressure 
is zero. Hence, the term vascular unloading originated. 

Various techniques have been developed that attempt to detect vascular unloading. These generally 
rely on the fact that once a vessel is unloaded, further external pressure will cause it to collapse. In 
summary, 

If P a > P c => Lumen open 

or ifP <P =^- Lumen closed 

a c 

where P a is the arterial pressure and P c is the cuff pressure. Most methods that employ the occlusive arm 
cuff rely on this principle and differ in the means of detecting whether the artery is open or closed. 
Briefly, some approaches are the skin flush, palpatory, Korotkoff (auscultatory), oscillometric, and ultra- 
sound methods [Drzewiecki et al., 1987]. Of these, the methods of Korotkoff and oscillometry are in 
most common use and will be reviewed here. 

The idea of using lumen opening and closure as an indication of blood pressure has survived since its 
introduction by Marey. This simple concept is complicated by the fact that lumen closure may not 
necessarily occur at zero transmural pressure. Instead, transmural pressure must be negative by 5 to 
20 mm Hg for complete closure. In vitro and in vivo human measurements have revealed that vessel 
buckling more closely approximates zero transmural pressure [Drzewiecki et al., 1997; Drzewiecki and 
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FIGURE 71.1 Application of arterial buckling to the continuous measurement of arterial pressure. The record 
shown tracks the subject's mean pressure in the brachial artery. The initial portion of the record illustrates the system 
as it locates the buckling point and the subject's blood pressure. The subject was directed to perform a brief Valsalva 
maneuver mid-way through the recording. 

Pilla, in press]. Buckling may be denned as the point at which the vessel switches from wall stretch to 
wall bending as a means of supporting its pressure load. The vessel is maximally compliant at this point 
and is approximately 25% open. 

The validity of employing the buckling concept to blood pressure determination was tested. Sheth and 
Drzewiecki [1998] employed feedback control to regulate the pressure in a flexible diaphragm tonometer 
(see the section on flexible diaphragm tonometer). The buckling point was determined from the volume 
pulse. Thus, to detect vessel buckling, the derivative of the volume pulse with respect to mean pressure 
was computed. This was performed on a beat-to-beat basis. The feedback system was then used to adjust 
pressure such that this derivative was maximized, indicating the point of greatest instability. Thus, the 
underlying blood vessel was maintained in a constant state of buckling. According to the buckling theory, 
the transmural pressure should be nearly zero and tonometer pressure is equal to arterial pressure. A 
sample 2-min recording of noninvasive arterial pressure by this approach is shown (Fig. 71.1). The 
method was capable of tracking the subject's blood pressure in response to a Valsalva maneuver in this 
same record. This test demonstrates the feasibility of employing buckling to measure blood pressure. 
This example also illustrates that beat-to-beat pressure can be obtained by this method without the 
necessity to occlude blood flow. This approach should be useful for pressure variability studies. 

Occlusive Cuff Mechanics 

The occlusive arm cuff has evolved more out of convenience than engineering design. As such, its 
mechanical properties are relatively unknown. The current version of the cuff is designed to encircle the 
upper arm. It consists of a flat rubber bladder connected to air supply tubing. The bladder is covered 
externally by cloth material with Velcro fasteners at either end for easy placement and removal. While 
the cuff encircles the entire arm, the bladder extends over approximately half the circumference. The 
bladder is pressurized with air derived from a hand pump, release valve, and manometer connected to 
the air supply tubing. 

The cuff should accurately transmit pressure down to the tissue surrounding the brachial artery. A 
mechanical analysis revealed that the length of the cuff is required to be a specific fraction of the arm's 
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FIGURE 71.2 Top. Pressure-volume data obtained from two different occlusive arm cuffs. Inner surface of the cuff 
was fixed in this case to isolate cuff mechanics from that of the arm. Bottom. Derivative of the cuff pressure with 
respect to volume obtained from pressure-volume data of both cuffs. These curves indicate the pressure response 
of the cuff to volume change and are useful for plethysmography. Solid curves in both figures are the results of the 
occlusive cuff model. 

circumference for pressure equilibrium [Alexander et al., 1977]. A narrow cuff resulted in the greatest 
error in pressure transmission and, thus, the greatest error in blood pressure determination. Geddes and 
Whistler [1978] experimentally examined the effect of cuff size on blood pressure accuracy for the 
Korotkoff method. Their measurements confirmed that a cuff-width-to-arm circumference ratio of 0.4 
should be maintained. Cuff manufacturers, therefore, supply a range of cuff sizes appropriate for pediatric 
use up to large adults. 

Another aspect of the cuff is its pressure response due to either internal air volume change or that of 
the limb. In this sense, the cuff can be thought of as a plethysmograph. It was examined by considering 
its pressure-volume characteristics. In an experiment, the cuff properties were isolated from that of the 
arm by applying it to a rigid cylinder of similar diameter [Drzewiecki et al.,1993]. Pressure-volume data 
were then obtained by injecting a known volume of air and noting the pressure. This was performed for 
a standard bladder cuff (13 cm width) over a typical range of blood pressure. 

The cuff pressure-volume results for pressures less than 130 mm Hg are nonlinear (Fig. 71.2). For 
higher pressures, the data asymptotically approach a linear relationship. The cuff volume sensitivity, i.e., 
its derivative with respect to volume, increased with cuff pressure (Fig. 71.2). Above 130 mm Hg, the 
cuff responded with nearly constant sensitivity. 

A cuff mechanics theory was developed to explain the above cuff experiment [Drzewiecki et al., 1993]. 
Cuff mechanics was theorized to consist of two components. The first consists of the compressibility of 
air within the cuff. This was modeled using Boyle's gas law. The second component consists of elastic 
and geometric deformation of the cuff bladder. Cuff shape deformation proceeds at low pressures until 
the bladder reaches its final geometry, rendering a curved pressure-volume relationship. Then, elastic 
stretch of the rubber bladder takes over at high pressures, resulting in a nearly linear relationship. 
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Solutions for this model are shown in comparison with the data in Fig. 71.2 for two cuffs; a standard 
cuff and the Critikon Dura-cuf. This model was useful in linearizing the cuff for use as a plethysmograph 
and for application to oscillometry (below). 

Method of Korotkoff 

The auscultatory method or method of Korotkoff was introduced by the Russian army physician N. 
Korotkoff [1905]. In his experiments, Korotkoff discovered that sound emitted distally from a partially 
occluded limb. He realized that this sound was indicative of arterial flow and that together with the 
occlusive cuff could be used to determine blood pressure. The method, as employed today, utilizes a 
stethoscope placed distal to an arm cuff over the brachial artery at the antecubital fossa. The cuff is 
inflated to about 30 mm Hg above systolic pressure and then allowed to deflate at a rate of 2 to 3 mm 
Hg/s. With falling cuff pressure, sounds begin and slowly change their characteristics. The initial "tapping" 
sounds are referred to as Phase I Korotkoff sound and denote systolic pressure. The sounds increase in 
loudness during Phase II. The maximum intensity occurs in Phase III, where the tapping sound may be 
followed by a murmur due to turbulence. Finally, Phase IV Korotkoff sound is identified as muffled 
sound, and Phase V is the complete disappearance of sound. Phase IV is generally taken to indicate 
diastolic arterial pressure. But, Phase V has also been suggested to be a more accurate indication of 
diastolic pressure. This matter is a continued source of controversy. The long history of the Korotkoff 
sound provides much experimental information. For example, the frequency spectrum of sound, spatial 
variation along the arm, filtering effects, and timing are reviewed [Drzewiecki et al., 1989]. 

It is a long-held misconception that the origin of the Korotkoff sound is flow turbulence. Turbulence 
is thought to be induced by the narrowing of the brachial artery under an occlusive cuff as it is forced 
to collapse. There are arguments against this idea. First, the Korotkoff sounds do not sound like turbu- 
lence, i.e., a murmur. Second, the Korotkoff sound can occur in low blood flow situations, while turbu- 
lence cannot. And, last, Doppler ultrasound indicates that peak flow occurs following the time occurrence 
of Korotkoff sound. An alternative theory suggests that the sound is due to nonlinear distortion of the 
brachial pulse, such that sound is introduced to the original pulse. This is shown to arise from flow 
limitation under the cuff in addition to curvilinear pressure-area relationship of the brachial artery 
[Drzewiecki et al., 1989]. Strong support for this theory comes from its ability to predict many of the 
Korotkoff sound's observable features. 

The accuracy of the Korotkoff method is well known. London and London [1967] find that the 
Korotkoff method underestimates systolic pressure by 5 to 20 mm Hg and overestimates diastolic pressure 
by 12 to 20 mm Hg. However, certain subject groups, such as hypertensives or the elderly, can compound 
these errors [Spence et al., 1978]. In addition, it has been shown that the arm blood flow can alter the 
Korotkoff sound intensity and, thus, the accuracy of blood pressure measurement [Rabbany et al., 1993]. 
Disappearance of Korotkoff sound occurs early in Phase III for some subjects and is referred to as the 
auscultatory gap. This causes an erroneous indication of elevated diastolic pressure. The auscultatory 
gap error can be avoided by simply allowing cuff pressure to continue to fall, where the true Phase IV 
sounds return. This is particularly critical for automatic instruments to take into account. In spite of 
these errors, the method of Korotkoff is considered a documented noninvasive blood pressure standard 
by which other noninvasive methods may be evaluated [White et al., 1993]. 

The Korotkoff method is applicable to other vessels besides the brachial artery of the arm. For example, 
the temporal artery has been employed [Shenoy etal., 1993]. In this case, a pressure capsule is applied 
over the artery on the head in place of an occlusive cuff, to provide external pressure. This approach has 
been shown to be accurate and is applicable to aerospace. Pilots' cerebral vascular pressure often falls in 
high acceleration maneuvers, so that temporal artery pressure is a better indicator of this response. 

Oscillometry 

Oscillometric measurement of blood pressure predates the method of Korotkoff. The French physiologist 
Marey [1885] placed the arm within a compression chamber and observed that the chamber pressure 
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FIGURE 71.3 Sample recording of cuff pressure during oscillometric blood pressure measurement. Bottom panel 
shows oscillations in cuff pressure obtained by high pass filtering above 1/2 Hz. 



fluctuated with the pulse. He also noted that the amplitude of pulsation varied with chamber pressure. 
Marey believed that the maximum pulsations or the onset of pulsations were associated with equality of 
blood pressure and chamber pressure. At that time, he was not certain what level of arterial pressure the 
maximum pulsations corresponded with. Recently, it has been demonstrated theoretically that the vari- 
ation in cuff pressure pulsation is primarily due to the brachial artery buckling mechanics [Drzewiecki 
etal., 1994]. 

Today, oscillometry is performed using a standard arm cuff together with an in-line pressure sensor. 
Due to the requirement of a sensor, oscillometry is generally not performed manually, but, rather, with 
an automatic instrument. The recorded cuff pressure is high-pass-filtered above 1 Hz to observe the 
pulsatile oscillations as the cuff slowly deflates (Fig. 71.3). It has been determined only recently that the 
maximum oscillations actually correspond with cuff pressure equal to mean arterial pressure (MAP) 
[Posey et al., 1969; Ramsey, 1979], confirming Marey 's early idea. Systolic pressure is located at the point 
where the oscillations, O s , are a fixed percentage of the maximum oscillations, O m [Geddes et al., 1983]. 
In comparison with the intra-arterial pressure recordings, the systolic detection ratio is O s /O m = 0.55. 
Similarly, the diastolic pressure can be found as a fixed percentage of the maximum oscillations, as 



H /0„ 



0.85. 



Derivative Oscillometry 

The use of the oscillometric detection ratios to find systolic and diastolic pressure is an empirical 
approach. That is, the ratios are statistically valid over the population of subjects that form the total 
sample. This is a distinctly different approach than measuring blood pressure by detecting an event, such 
as the maximum in cuff pressure oscillations or the occurrence of Korotkoff sound. The event is con- 
strained by the physics of arterial collapse under the cuff [Drzewiecki et al., 1994]. Therefore, it is more 
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Pressure (mmHg) 

FIGURE 71.4 Method of derivative oscillometry. The derivative of cuff pressure oscillations data with respect to 
cuff pressure is shown from a single subject. The maximum and minimum values denote diastolic and systolic 
pressure, respectively. A zero derivative indicates MAP in this plot. 

likely to be accurate under different conditions and, more importantly, for subjects outside of the sample 
population. In fact, the oscillometric determination of MAP is more accurate than systolic and diastolic 
pressures. 

Drzewiecki et al. [1994] employed a model of oscillometry to evaluate the derivative of the oscillation 
amplitude curve (Fig. 71.3) with respect to cuff pressure. When this derivative is plotted against cuff 
pressure, it was found that it reaches a maximum positive value. This occurred when cuff pressure equals 
diastolic pressure. Additionally, the minimum negative value was found to occur at systolic pressure. A 
measurement performed in our lab on a single subject illustrates the approach (Fig. 71.4). The specific 
advantage offered is that the empirically based systolic and diastolic ratios are not necessary [Link, 1987]. 
This method may be referred to as derivative oscillometry. 

Derivative oscillometry was evaluated experimentally in our lab. The values of systolic and diastolic 
pressures obtained by derivative oscillometry were compared with those obtained by the method of 
Korotkoff. Thirty recordings were obtained on normal subjects (Fig. 71.5). The results indicated a high 
correlation of 0.93 between the two methods. Systolic mean error was determined to be 9% and diastolic 
mean error was -6%. Thus, derivative oscillometry was found to compare well with the method of 
Korotkoff in this preliminary evaluation. Before adopting derivative oscillometry, a more complete 
evaluation needs to be performed using a greater and more diverse subject population. 



71.3 Pulse Dynamics Methods 



R-Wave Time Interval Technique 

One of the basic characteristics of pressure and flow under an occlusive cuff is that the pulse is apparently 
delayed with increasing cuff pressure. The R-wave of the ECG is often employed as a time reference. 
Arzbaecher et al. [1973] measured the time delay of the Korotkoff sound relative to the R-wave. They 
suggested that the curve obtained by plotting this data represents the rising portion of the arterial pulse 
waveform. 
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FIGURE 71.5 Experimental evaluation of derivative oscillometry using systolic and diastolic pressure from the 
method of Korotkoff as reference. The line indicates the result of linear regression to the data. 

A Korotkoff sound model [Drzewiecki et al., 1989] was employed to investigate the cuff delay effect. 
Time delay of the Korotkoff sound was computed relative to the proximal arterial pulse. Since the arterial 
pulse waveform was known in this calculation, the pressure-RK interval curve can be compared directly. 
The resemblance to a rising arterial pulse was apparent but some deviation was noted, particularly in 
the early portion of the RK interval curve. The model indicates that the pulse occurs earlier and with 
higher derivative than the true pulse waveform. In particular, the increased derivative that occurs at the 
foot of the pulse may mislead any study of wave propagation. 

Recently, Sharir et al. [1993] performed comparisons of Doppler flow pulse delay and direct arterial 
pressure recordings. Their results confirm a consistent elevation in pulse compared with intra-arterial 
recording in the early portions of the wave. The average deviation was 10 mm Hg in value. Hence, if 
accuracy is not important, this method can provide a reasonable estimate of blood pressure and its 
change. The commercial pulse watch has been a recent application of this approach. 

Continuous Vascular Unloading 

Penaz [1973] reasoned that if the cuff pressure could be continuously adjusted to equal the arterial 
pressure, the vessels would be in a constant state of vascular unloading. He employed mechanical feedback 
to continuously adjust the pressure in a finger chamber to apply this concept. The vascular volume was 
measured using photoplethysmography. When feedback was applied such that the vascular volume was 
held constant and at maximum pulsatile levels, the chamber pressure waveform was assumed to be equal 
to the arterial pressure. 

Recent applications of the Penaz method have been developed by Wesseling et al. [ 1 978 ] and Yamakoshi 
et al. [1980]. One instrument is commercially available as the FINAPRES (Ohmeda, Finapres, Englewood, 
CO). These instruments have been evaluated in comparison with intra-arterial pressure recordings 
[Omboni et al., 1993]. Good waveform agreement has been obtained in comparison with intra-arterial 
measurements from the radial artery. But, it should be clear that the Penaz method employs finger 
pressure, which is a peripheral vascular location. This recording site is prone to pulse wave reflection 
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FIGURE 71.6 Left. Illustration of volume pulse method. Right. Illustration of pressure pulse method and arterial 
tonometry. 

effects and is therefore sensitive to the vascular flow resistance. It is anticipated that the technique would 
be affected by skin temperature, vasoactive drugs, and anesthetics. Moreover, mean pressure differences 
between the finger pulse and central aortic pressure should be expected. 

Pulse Sensing 

Pulse sensors attempt to measure the arterial pulse waveform from either the arterial wall deflection or 
force at the surface of the skin above a palpable vessel. Typically, these sensors are not directly calibrated 
in terms of pressure, but ideally respond proportionately to pressure. As such, they are primarily useful 
for dynamic information. While there are many designs available for this type of sensor, they generally 
fall into two categories. The first category is that of the volume sensor (Fig. 71.6 [left] ). This type of sensor 
relies on the adjacent tissues surrounding the vessel as a non-deflecting frame of reference as, for example, 
a photoplethysmograph. Skin deflections directly above the vessel are then measured relative to a reference 
frame to represent the arterial pressure. Several different transduction methods may be employed such as 
capacitive, resistive, inductive, optical, etc. Ideally, this type of sensor minimally restricts the motion of 
the skin so that contact force is zero. The drawback to pulse volume sensing is that the method responds 
to volume distention and indirectly to pressure. The nonlinear and viscoelastic nature of the vascular wall 
result in complex waveform alterations for this type of sensor that are difficult to correct in practice. 

The second category is that of the pressure pulse sensor (Fig. 71.6 [right] ) . This type of sensor measures 
stress due to arterial pressure transmitted though the skin above the pulse artery. The pressure pulse 
sensor requires that surface deflections are zero, as opposed to the volume sensor. Thus, the contact forces 
are proportionate to arterial pressure at the skin surface. 

The differences in pulse waveforms that are provided by the above pulse recording techniques are clear 
in comparison with intra-arterial recordings [Van der Hoeven and Beneken, 1970]. In all cases, the 
pressure pulse method was found to provide superior waveform accuracy, free of the effects of vascular 
nonlinear viscoelasticity. Alternatively, the stiffness of the sensor can best characterize its pulse accuracy. 
High stiffness relative to the artery and surrounding tissue is required to best approximate the pressure 
pulse method. 

Arterial pulse recording is performed while the subject is stationary and refrains from moving the 
pulse location. But, it has become of interest to acquire ambulatory records. Without restraint, pulse 
recording is quite difficult due to motion artifact. For example, hand or finger motion can appear in 
recordings of the radial artery. A change in sensor positioning or acceleration can result in other types 
of artifact. The artifacts are often comparable in magnitude and frequency to the pulse, rendering simple 
filtering methods useless. Recently though, artifact cancellation techniques that employ sensor arrays 
have been applied with good success [Ciaccio et al., 1989]. 

Arterial Tonometry 

Pulse sensing methods do not provide calibrated pressure. Arterial tonometry [Pressman and Newgard, 
1963] is a pressure pulse method that can noninvasively record calibrated pressure in superficial arteries 
with sufficient bony support, such as the radial artery. 
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A tonometer is applied by first centering a contact stress sensor over the vessel. This is accomplished 
by repositioning the device until the largest pulse is detected. An array of sensors [Weaver et al., 1978] 
has been used to accomplish this electronically. Then, the tonometer is depressed towards the vessel. This 
leads to applanation of the vessel wall (Fig. 71.6). If the vessel is not flattened sufficiently, the tonometer 
measures forces due to arterial wall tension and bending of the vessel. As depression is continued, the 
arterial wall is applanated further, but not so much as to occlude blood flow. At this intermediate position, 
wall tension becomes parallel to the tonometer sensing surface. Arterial pressure is then the remaining 
stress perpendicular to the surface and is measured by the sensor. This is termed the contact stress due 
to pressure. Ideally, the sensor should not measure skin shear (frictional) stresses. The contact stress is 
equal in magnitude to the arterial pressure when these conditions are achieved. The details of arterial 
tonometer calibration and design were analyzed by Drzewiecki et al. [1983, 1987]. In summary, tonometry 
requires that the contact stress sensor be flat, stiffer than the tissues, and small relative to the vessel 
diameter. Proper calibration can be attained either by monitoring the contact stress distribution (using 
a sensor array) or the maximum in measured pulse amplitude. 

Recent research in tonometry has focused on miniaturization of semiconductor pressure sensor arrays 
[Weaver et al., 1978]. Alternatively, fiber optics have been employed by Drzewiecki [1985] and Moubarak 
etal. [1989], allowing extreme size reduction of the contact stress sensor. Commercial technology has 
been available (Jentow, Colin Electronics, Japan) and has been evaluated against intra-arterial records. 
Results indicate an average error of -5.6 mm Hg for systolic pressure and -2.4 mm Hg for diastole. 
Excellent pulse waveform quality is afforded by tonometry [Sato et al., 1993] , making it a superior method 
for noninvasive pulse dynamics applications. 

Flexible Diaphragm Tonometry 

As an alternative to the high resolution tonometers under development, a new low resolution technology 
is introduced here. The basic advantage becomes one of cost, ease of positioning, and patient comfort, 
which is critical for long-term applications. In addition, while most tonometers have employed only the 
radial artery of the wrist for measurement, this technology is suitable for other superficial vessels and 
conforms to skin surface irregularities. 

The flexible tonometer design applies the fact that tissue is incompressible in the short term [Bansal 
etal., 1994]. The concept is shown in Fig. 71.7 using three tonometer volume compartments. These 
compartments are not physically separated and fluid can move between them. They are coupled to the 
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FIGURE 71.7 Concept of flexible diaphragm tonometry. PI illustrates inappropriate level of pressure to provide 
arterial applanation tonometry. P2 indicates an increase in chamber volume so that the relative compartment volumes, 
V, are equal. In practice, relative compartment volumes are maintained constant via feedback control and applanation 
is continuous. Compartment pressure equals arterial pressure in P2. 
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FIGURE 71.8 Design of a flexible diaphragm tonometer in accordance with the concept shown in Fig. 71.7. Com- 
partment volumes are obtained by means of impedance plethysmography. Electrode positions define the compart- 
ment boundaries. 

skin and artery by means of the flexible diaphragm. When the arterial pressure exceeds that in the 
tonometer, the volume of the artery expands into Vb. Note also, that Va and Vc must increase to take 
up this expansion since water is incompressible. To restore the tonometer to a flat surface, the total 
volume of the tonometer is increased (Fig. 71.7). In response, the tonometer pressure increases and the 
artery flattens. At this point, the volume in each compartment is equal, the diaphragm is flat, and the 
tonometer pressure is equal to arterial pressure. Thus, by maintaining the equilibrium of the relative 
volume compartments, applanation tonometry can be accomplished with a flexible diaphragm rather 
than a rigid one. In practice, instrumentation continuously adjusts the relative compartment volumes as 
arterial pressure changes. 

A flexible diaphragm tonometer was machined from plexiglass (Fig. 71.8). A rectangular channel was 
designed to contain saline. The front of the channel was sealed with a polyurethane sheet of 0.004 in. 
thickness. Two stainless steel electrodes were placed at each end of the channel. These were used to inject 
a current along the channel length. Near the center of the channel, four measuring electrodes were placed 
at equal spacing. Each pair of electrodes defined a volume compartment and the voltage across each pair 
was calibrated in terms of volume using impedance plethysmography. External to the tonometer, a saline- 
filled catheter was used to connect the channel to an electro-mechanical volume pump. The dynamics 
of the system were designed to possess appropriate frequency response. 

Several beats of data were plotted from the volume pulse recordings of the flexible tonometer (Fig. 71.9) 
for two adjacent compartments. The pulse volume is shown as a volume deviation from the mean 
compartment volume given a constant average tonometer pressure. The waveform shown illustrates the 
ability to provide calibrated noninvasive arterial volume information. 

71.4 Noninvasive Arterial Mechanics 

The flexible diaphragm tonometer and the occlusive cuff are capable of measuring volume in addition 
to pressure. Combining noninvasive measurements can extend the information provided by a single 
instrument, beyond that of blood pressure alone. Furthermore, functional information about the vascu- 
lature requires the simultaneous determination of pressure as well as geometry. In this section, the 
standard occlusive arm cuff will be employed as a plethysmograph to illustrate this concept. 
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FIGURE 71.9 Sample of radial arterial volume recording from tonometer of Fig. 71.8. Chamber pressure was fixed 
at 25 mm Hg. Calibrated volume change is shown relative to the mean compartment volume. 

The transmural pressure vs. lumen area (P-A) graph has been a convenient approach to analyze vessel 
function [Drzewiecki et al., 1997]. This curve can be applied to hemodynamics and portrays vessel 
function in a single graph. Unfortunately, its use has been limited to invasive studies or studies of isolated 
vessels where pressure can be experimentally controlled. The occlusive arm cuff offers a noninvasive 
solution to this problem by allowing the transmural pressure to be varied by altering the external pressure 
applied to a blood vessel. This approach has been used to noninvasively measure the transmural pressure 
vs. compliance relationship of the brachial artery [Mazhbich et al., 1983]. 

Drzewiecki and Pilla [1998] furthered the idea of noninvasively varying the transmural pressure by 
developing a cuff-based instrument that measures the P-A relationship. To perform the measurement, 
an occlusive arm cuff was pressurized by means of a diaphragm air pump. Cuff pressure was measured 
using a pressure sensor that provides pressure as an electronic signal. The pressure signal was input to a 
computer system via an analog-to-digital convertor for analysis. Two valves were used to control the flow 
of air. Initially, the valves were operated so that the pump inflated the cuff to well above the subject's 
systolic pressure. At that point, cuff air was recirculated through the pump. Under this condition, the 
mean cuff pressure remained constant, but the stroke volume of the pump resulted in cuff pressure 
oscillations at the frequency of the pump. Since the pump volume is a known quantity, it was used as a 
volume calibration source. The pump volume was divided by the cuff pressure oscillations due to the 
pump to yield the cuff compliance. With cuff compliance known, the cuff pressure arterial pulse was 
converted to a volume pulse by multiplying by cuff compliance. The pump was designed to operate at 
approximately 40 Hz, well above the heart rate frequency components. This allowed the arterial pulse 
and pump pulse to be easily separated from the cuff pressure recording by using low pass and high pass 
digital filters with a cutoff frequency of 25 Hz. 

The above procedure was repeated at every level of cuff pressure. The cuff pressure was released in 
steps of 5 to 10 mm Hg until it was zero by briefly opening the release valve. The subject's pulse volume 
was divided by their pressure pulse (systolic minus diastolic pressures) to obtain the arterial compliance 
at every value of cuff pressure. The compliance per unit length was obtained by further dividing by the 
effective cuff length. The systolic, diastolic, and mean arterial pressures were evaluated by oscillometry 
and the Korotkoff method. Arterial compliance was then plotted for every value of transmural pressure 
(Fig. 71.10). The compliance was then numerically integrated to obtain the corresponding brachial artery 
P-A curve. Since the vessel was collapsed for large negative transmural pressures, the initial constant of 
integration was chosen to be zero lumen area. 
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FIGURE 71.10 Arterial compliance obtained noninvasively by employing the occlusive arm cuff as a plethysmo- 
graph. Points are measured data. The solid curve is the model result obtained from Eq. (71.1). 

The arterial compliance curve possesses a maximum near zero transmural pressure. This point cor- 
responds with the onset of vessel buckling. On either side of the buckling point, the vessel supports its 
pressure load differently. On the negative pressure side, the vessel partially collapses and undergoes wall 
bending. The compliance rapidly approaches zero during collapse. This occurs as the opposite walls of 
the vessel begin to contact each other and close the lumen. On the positive pressure side, the vessel 
supports its pressure load by wall stretch. The compliance slowly decreases with increasing pressure 
because of nonlinear wall elasticity. That is, the wall becomes stiffer with increasing stretch. The s-shaped 
lumen area curve is a consequence of these two different mechanisms. Preliminary studies of several 
subjects revealed that the shape of the noninvasive P-A curve is consistent for all subjects studied [ Whitt 
and Drzewiecki, 1997; Drzewiecki and Pilla, 1998]. 

A mathematical model was developed that incorporates the fundamental physical and material prop- 
erties that contribute to the collapsible P-A relationship [Drzewiecki et al., 1997; Drzewiecki and Pilla, 
1998]. The basic form of the model is summarized by the following equation for transmural pressure: 



[,-j 



+ P b +a 



, b (H 



(71.1) 



where a and b are arterial elastance constants for distension, E is vessel elastance during collapse, and n 
is a constant that determines the rate of change of pressure with respect to change in area during collapse. 
The quantity X , was defined as the extension ratio and is evaluated from the lumen area divided by the 
lumen area at the buckling point, A/A b . The first hyperbolic term is the pressure due to collapse and wall 
bending. The second term is the buckling pressure and is found when A= A b . The third exponential 
term represents the pressure due to wall stretch. Some overlap in the contribution of each term may 
occur near the buckling pressure. Depending on the vessel type and material, Eq. (71.1) can be improved 
by limiting the extension ratio and its inverse to unity. 

Equation (71.1) was employed to analyze the brachial artery P-A data from each subject. The constants 
A b and P b were measured directly from the data as the point on the P-A curve that corresponds with 
maximum compliance or buckling. Their values were inserted into Eq. (71.1) for each subject, leaving 
the remaining constants to be found by nonlinear least squares regression (Marquardt-Levenberg algo- 
rithm). The model was evaluated for the subject shown in Fig. 71.10 and corresponds with the solid line 
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curve. Similar results were obtained for all subjects studied (N = 10), with the mean error of estimate 
less than 3 mm Hg. The above study suggests that no other physical properties need to be added to model 
vascular collapse. Thus, it can be considered a valid model for further studies of vascular properties and 
blood pressure determination. 

While the other terms of the model were found to vary from subject to subject, the buckling pressure 
was discovered to be relatively constant (10 mmHg ±11). Hence, buckling may be the important vascular 
feature that permits noninvasive blood pressure measurement to be feasible and may be the common 
thread that links all noninvasive methods. This idea was first examined in our theoretical examination 
of oscillometry above [Drzewiecki et al., 1994] . The successful use of buckling itself as a method to find 
arterial pressure was also described here [Sheth and Drzewiecki, 1998]. The phenomenon of buckling is 
a general effect that occurs independent of the technique used to find arterial pressure. It will also be 
independent of the quantitative differences in the P-A relationship of each specific subject. 

Summary 

Future research is open to noninvasive studies of how cardiovascular disease can alter blood vessel 
mechanics and the accuracy of blood pressure determination. The use of methods, such as occlusive cuff 
plethysmography together with blood pressure measurement and vascular mechanical modeling pre- 
sented here, offers a means for noninvasive detection of the early stages of cardiovascular disease. 
Additionally, pulse dynamics and pressure variation offered by noninvasive pulse recording can provide 
new information about cardiovascular control. 

Marey originally proposed that vessel closure is the important event that permits noninvasive blood 
pressure measurement. From the work presented here, this early concept is refocused to the instability 
of arterial buckling, or the process of closure, as the basic mechanism that enables noninvasive blood 
pressure determination. 
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Cardiac output is the amount of blood pumped by the right or left ventricular per unit of time. It is 
expressed in liters per minute (L/min) and normalized by division by body surface area in square meters 
(m 2 ). The resulting quantity is called the cardiac index. Cardiac output is sometimes normalized to body 
weight, being expressed as mL/min per kilogram. A typical resting value for a wide variety of mammals 
is 70 mL/min per kg. 

With exercise, cardiac output increases. In well-trained athletes, cardiac output can increase five-fold 
with maximum exercise. During exercise, heart rate increases, venous return increases, and the ejection 
fraction increases. Parenthetically, physically fit subjects have a low resting heart rate, and the time for 
the heart rate to return to the resting value after exercise is less than that for subjects who are not physically 
fit. 

There are many direct and indirect (noninvasive) methods of measuring cardiac output. Of equal 
importance to the number that represents cardiac output is the left-ventricular ejection fraction (stroke 
volume divided by diastolic volume), which indicates the ability of the left ventricle to pump blood. 

72.1 Indicator-Dilution Method 

The principle underlying the indicator-dilution method is based on the upstream injection of a detectable 
indicator and on measuring the downstream concentration-time curve, which is called a dilution curve. 
The essential requirement is that the indicator mixes with all the blood flowing through the central 
mixing pool. Although the dilution curves in the outlet branches may be slightly different in shape, they 
all have the same area. 

Figure 72.1a illustrates the injection of m g of indicator into an idealized flowing stream having the 
same velocity across the diameter of the tube. Figure 72. 1 b shows the dilution curve recorded downstream. 
Because of the flow-velocity profile, the cylinder of indicator and fluid becomes teardrop in shape, as 
shown in Fig. 72.1c. The resulting dilution curve has a rapid rise and an exponential fall, as shown in 
Fig. 72. Id. However, the area of the dilution curve is the same as that shown in Fig. 72. la. Derivation of 
the flow equation is shown in Fig. 72.1, and the flow is simply the amount of indicator (m gm) divided 
by the area of the dilution curve (gm/mL x s), which provides the flow in mL/s. 
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Indicators 

Before describing the various indicator-dilu- 
tion methods, it is useful to recognize that 
there are two types of indicator, diffusible and 
nondiffusible. A diffusible indicator will leak 
out of the capillaries. A nondiffusible indica- 
tor is retained in the vascular system for a 
time that depends on the type of indicator. 
Whether cardiac output is overestimated with 
a diffusible indicator depends on the location 
of the injection and measuring sites. 
Table 72.1 lists many of the indictors that 
have been used for measuring cardiac output 
and the types of detectors used to obtain the 
dilution curve. It is obvious that the indicator 
selected must be detectable and not alter the 
flow being measured. Importantly, the indi- 
cator must be nontoxic and sterile. 

When a diffusible indicator is injected into 
the right heart, the dilution curve can be 
detected in the pulmonary artery, and there 
is no loss of indicator because there is no 
capillary bed between these sites; therefore 
the cardiac output value will be accurate. 

Thermal Dilution Method 
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FIGURE 72.1 Genesis of the indicator- dilution curve. 



Chilled 5% dextrose in water (D5W) or 0.9% NaCl can be used as indicators. The dilution curve represents 
a transient reduction in pulmonary artery blood temperature following injection of the indicator into 
the right atrium. Figure 72.2 illustrates the method and a typical thermodilution curve. Note that the 
indicator is really negative calories. The thermodilution method is based on heat exchange measured in 
calories, and the flow equation contains terms for the specific heat (C) and the specific gravity (S) of the 
indicator (z) and blood (b). The expression employed when a #7F thermistor- tipped catheter is used and 
chilled D5W is injected into the right atrium is as follows: 



CO = 



TABLE 72.1 Indicators 




Material 

Evans blue (T1824) 
Indocyanine green 
Coomassie blue 
Saline (5%) 
Albumin l 131 
Na 24 , K 42 , D 2 0, DHO 
Hot-cold solutions 



Detector 

Photoelectric 640 mu. 
Photoelectric 800 mu. 
Photoelectric 585-600 mu. 
Conductivity cell 
Radioactive 
Radioactive 
Thermodetector 



Retention Data 

50% loss in 5 days 

50% loss in 10 minutes 

50% loss in 15-20 minutes 

Diffusible* 

50% loss in 8 days 

Diffusible* 

Diffusible* 



* It is estimated that there is about 1 5% loss of diffusible indicators during the 
first pass through the lungs. 
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FIGURE 72.2 The thermodilution method (a) and a typical dilution curve (b). 

■■ Volume of indicator injected in mL 

: Temperature (average of pulmonary artery blood in (°C) 

: Temperature of the indicator (°C) 

: Multiplier required to convert mL/s into mL/min 

: Area under the dilution curve in (seconds x °C) 

: Specific gravity of indicator (i) and blood (b) 

■■ Specific heat of indicator (i) and blood (b) 

= 1.08 for 5% dextrose and blood of 40% packed- cell volume 

: Empiric factor employed to correct for heat transfer through the injection catheter (for 
a #7F catheter, F = 0.825 [2]). 



Entering these factors into the expression gives 



VI 



co = 



(n-rj 



53.46 



A 



where CO = cardiac output in mL/min 
53.46 = 60 x 1.08 x 0.825 

To illustrate how a thermodilution curve is processed, cardiac output is calculated below using the 
dilution curve shown in Fig. 72.2. 

V = 5 ml of 5% dextrose in water 

T b = 37°C 

T, = 0°C 

A = 1.59°C s 



CO 



5(37-0) 



53.46 



6220 mL/min 



1.59 
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Although the thermodilution method is the standard in clinical medicine, it has a few disadvantages. 
Because of the heat loss through the catheter wall, several series 5-mL injections of indicator are needed 
to obtain a consistent value for cardiac output. If cardiac output is low, i.e., the dilution curve is very 
broad, it is difficult to obtain an accurate value for cardiac output. There are respiratory-induced 
variations in PA blood temperature that confound the dilution curve when it is of low amplitude. 
Although room-temperature D5W can be used, chilled D5W provides a better dilution curve and a more 
reliable cardiac output value. Furthermore, it should be obvious that if the temperature of the indicator 
is the same as that of blood, there will be no dilution curve. 

Indicator Recirculation 

An ideal dilution curve shown in Fig. 72.2 consists of a steep rise and an exponential decrease in indicator 
concentration. Algorithms that measure the dilution-curve area have no difficulty with such a curve. 
However, when cardiac output is low, the dilution curve is typically low in amplitude and very broad. 
Often the descending limb of the curve is obscured by recirculation of the indicator or by low-amplitude 
artifacts. Figure 72.3a is a dilution curve in which the descending limb is obscured by recirculation of 
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FIGURE 72.3 Dilution curve obscured by recirculation (a) and a semilogarithmic plot of the descending limb (b). 
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the indicator. Obviously it is difficult to determine the practical end of the curve, which is often specified 
as the time when the indicator concentration has fallen to a chosen percentage (e.g., 1%) of the maximum 
amplitude (C max ). Because the descending limb represents a good approximation of a decaying exponen- 
tial curve (e~ kt ), fitting the descending limb to an exponential allows reconstruction of the curve without 
a recirculation error, thereby providing a means for identifying the end for what is called the first pass 
of the indicator. 

In Fig. 72.3b, the amplitude of the descending limb of the curve in Fig. 72.3a has been plotted on 
semilogarithmic paper, and the exponential part represents a straight line. When recirculation appears, 
the data points deviate from the straight line and therefore can be ignored, and the linear part (repre- 
senting the exponential) can be extrapolated to the desired percentage of the maximum concentration, 
say 1% of C max . The data points representing the extrapolated part were replotted on Fig. 72.3a to reveal 
the dilution curve undistorted by recirculation. 

Commercially available indicator-dilution instruments employ digitization of the dilution curve. Often 
the data beyond about 30% of C max are ignored, and the exponential is computed on digitally extrapolated 
data. 

72.2 Fick Method 

The Fick method employs oxygen as the indicator and the increase in oxygen content of venous blood as 
it passes through the lungs, along with the respiratory oxygen uptake, as the quantities that are needed 
to determine cardiac output, CO = 2 uptake/A- V0 2 difference). Oxygen uptake (mL/min) is measured 
at the airway, usually with an oxygen-filled spirometer containing a C0 2 absorber. The A — V0 2 difference 
is determined from the oxygen content (mL/100 mL blood) from any arterial sample and the oxygen 
content (mL/100 mL) of pulmonary arterial blood. The oxygen content of blood used to be difficult to 
measure. However, the new blood-gas analyzers that measure, pH, p0 2 , pC0 2 , hematocrit, and hemo- 
globin provide a value for 2 content by computation using the oxygen-dissociation curve. 

There is a slight technicality involved in determining the oxygen uptake because oxygen is consumed 
at body temperature but measured at room temperature in the spirometer. Consequently, the volume of 
2 consumed per minute displayed by the spirometer must be multiplied by a factor, F. Therefore the 
Fick equation is 

O, uptake/ mini 7 1 

CO = y — L 

A-V0 2 difference 

Figure 72.4 is a spirogram showing a tidal volume riding on a sloping baseline that represents the 
resting expirating level (REL). The slope identifies the oxygen uptake at room temperature. In this subject, 
the uncorrected oxygen consumption was 400 mL/min at 26°C in the spirometer. With a barometric 
pressure of 750 mmHg, the conversion factor F to correct this volume to body temperature (37°C) and 
saturated with water vapor is 

„ 273 + 37 P, -PH,0 
F = x -s ^— 



273+ T P. -47 

s b 

Where T i is the spirometer temperature, P b is the barometric pressure, and PH 2 at T s is obtained 
from the water-vapor table (Table 72.2). 

A sample calculation for the correction factor F is given in Fig. 72.4, which reveals a value for F of 
1.069. However, it is easier to use Table 72.3 to obtain the correction factor. For example, for a spirometer 
temperature of 26°C and a barometric pressure of 750 mmHg, F = 1.0691. 
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Subject; 36- Year old male 
Height: 5'10'/i" 
Weight: 175 1b" 

Oody sur face area : A - w om X M ' 25 x 71.84 
where IV = weight in kg 

II = height in cm 

A = area in cm 2 

A- 1.98 m 2 
0, Uptake at 26°C: 400 mL/min 
Barometer: 750 mm Kg P ltj0 at 26"C: 25.2 mm Kg 
; Uptake correction factor: /•' 
_ mi 37 .. 750-25.2 



273 i 2fi 750 --17 

f » 1.059 

F from Table 4.6 » 1.069 

0, Uptake BTPS: 400 X 1.0S9 - 427.6 
Blood 2 : 

Arterial = 20 V% 
Mixed vonnits = 15 v% 
Cardiac otitput: 

f ''°"i^o-' 8652mL/min 

Cardiac index: 
... 8.552 

' rxr- - in2 i./min/ m j 




FIGURE 72.4 Measurement of oxygen uptake with a spirometer ( right) and the method used to correct the measured 
volume {left). 



TABLE 72.2 Vapor 


Pressure 


of Water 






Temp. °C 


0.0 


0.2 


0.4 


0.6 


0.8 


15 


12.788 


12.953 


13.121 


13.290 


13.461 


16 


13.634 


13.809 


13.987 


14.166 


14.347 


17 


14.530 


14.715 


14.903 


15.092 


15.284 


18 


15.477 


15.673 


15.871 


16.071 


16.272 


19 


16.477 


16.685 


16.894 


17.105 


17.319 


20 


17.535 


17.753 


17.974 


18.197 


18.422 


21 


18.650 


18.880 


19.113 


19.349 


19.587 


22 


19.827 


20.070 


20.316 


20.565 


20.815 


23 


21.068 


21.324 


21.583 


21.845 


22.110 


24 


22.377 


22.648 


22.922 


23.198 


23.476 


25 


23.756 


24.039 


24.326 


24.617 


24.912 


26 


25.209 


25.509 


25.812 


26.117 


26.426 


27 


26.739 


27.055 


27.374 


27.696 


28.021 


28 


28.349 


28.680 


29.015 


29.354 


29.697 


29 


30.043 


30.392 


30.745 


31.102 


31.461 


30 


31.825 


32.191 


32.561 


32.934 


33.312 


31 


33.695 


34.082 


34.471 


34.864 


35.261 


32 


35.663 


36.068 


36.477 


36.891 


37.308 


33 


37.729 


38.155 


38.584 


39.018 


39.457 


34 


39.898 


40.344 


40.796 


41.251 


41.710 


35 


42.175 


42.644 


43.117 


43.595 


44.078 


36 


44.563 


45.054 


45.549 


46.050 


46.556 


37 


47.067 


47.582 


48.102 


48.627 


49.157 


38 


49.692 


50.231 


50.774 


51.323 


51.879 


39 


42.442 


53.009 


53.580 


54.156 


54.737 


40 


55.324 


55.910 


56.510 


57.110 


57.720 


41 


58.340 


58.960 


59.580 


60.220 


60.860 
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Note that the correction factor Fin this case is only 6.9%. The error encountered by not including it 
may be less than the experimental error in making all other measurements. 

The example selected shows that the A - V0 2 difference is 20 - 15 mL/100 mL blood and that the 
corrected 2 uptake is 400 x 1.069; therefore the cardiac output is: 



CO = 



400x1.069 



( 2 °- 15 )/ 



= 8552 mL/min 



100 



The Fick method does not require the addition of a fluid to the circulation and may have value in 
such a circumstance. However, its use requires stable conditions because an average oxygen uptake takes 
many minutes to obtain. 

72.3 Ejection Fraction 

The ejection fraction (EF) is one of the most convenient indicators of the ability of the left (or right) 
ventricle to pump the blood that is presented to it. Let v be the stroke volume (SV) and V be the end- 
diastolic volume (EDV); the ejection fraction is v/Vor SV/EDV. 

Measurement of ventricular diastolic and systolic volumes can be achieved radiographically, ultrason- 
ically, and by the use of an indicator that is injected into the left ventricle where the indicator concentration 
is measured in the aorta on a beat-by-beat basis. 

Indicator-Dilution Method for Ejection Fraction 

Holt [1] described the method of injecting an indicator into the left ventricular during diastole and 
measuring the stepwise decrease in aortic concentration with successive beats (Fig. 72.5). From this 
concentration-time record, end-diastolic volume, stroke volume, and ejection fraction can be calculated. 
No assumption need be made about the geometric shape of the ventricle. The following describes the 
theory of this fundamental method. 




12 3 4 5 
BEAT NUMBER 



FIGURE 72.5 The saline method of measuring ejection fraction, involving injection of m gm of NaCl into the left 
ventricle and detecting the aortic concentration (C) on a beat-by-beat basis. 
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FIGURE 72.6 Stepwise decrease in indicator concentration (C) versus beat number for ejection fraction (vIV) of 
0.5 and 0.2. 

Let Vbe the end-diastolic ventricular volume. Inject m gm of indicator into this volume during diastole. 
The concentration (Q) of indicator in the aorta for the first beat is mlV. By knowing the amount of 
indicator (m) injected and the calibration for the aortic detector, C l is established, and ventricular end- 
diastolic volume V= mlC l . 

After the first beat, the ventricle fills, and the amount of indicator left in the left ventricle is m - mv/V. 
The aortic concentration (C 2 ) for the second beat is therefore m — mV/V = m{\ - v/V). Therefore the 
aortic concentration (C 2 ) for the second beat is 





m 




V 


L 2 = 




1- 




V 




V 



By continuing the process, it is easily shown that the aortic concentration (C„) for the wth beat is 





m 




V 


<; 




1- 






V 




V 



Figure 72.6 illustrates the stepwise decrease in aortic concentration for ejection fractions (v/V) of 0.2 
and 0.5, i.e., 20% and 50%. 

It is possible to determine the ejection fraction from the concentration ratio for two successive beats. 
For example, 
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FIGURE 72.7 Ejection fraction (v/V) versus the ratio of concentrations for successive beats (C ll+1 /C„). 



from which 



V =1 C n+1 

V c 

n 

where v/V is the ejection fraction and C n+l /C n is the concentration ratio for two successive beats, e.g., 
C 2 /C l or C 3 /C 2 - Figure 72.7 illustrates the relationship between the ejection fraction v/V and the ratio of 
C n+l /C n . Observe that the detector need not be calibrated as long as there is a linear relationship between 
detector output and indicator concentration in the operating range. 
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Bioelectric tissue impedance measurements to determine or infer biological information have a long 
history dating back to before the turn of the century. The start of modern clinical applications of 
bioelectric impedance (BEI) measurements can be attributed in large part to the reports by Nyboer 
[1970]. BEI measurements are commonly used in apnea monitoring, especially for infants, and in the 
detection of venous thrombus. Many papers report the use of the BEI technique for peripheral blood 
flow, cardiac stroke volume, and body composition. Commercial equipment is available for these latter 
three applications, although the reliability, validity, and accuracy of these applications have been ques- 
tioned and, therefore, have not received widespread acceptance in the medical community. 

BEI measurements can be classified into two types. The first and most common application is in the 
study of the small pulsatile impedance changes associated with heart and respiratory action. The goal of 
this application is to give quantitative and qualitative information on the volume changes (plethysmo- 
graphy) in the lung, heart, peripheral arteries, and veins. The second application involves the determi- 
nation of body characteristics such as total body fluid volume, inter- and extra-cell volume, percent body 
fat, and cell and tissue viability. In this application, the total impedance is used and in some cases measured 
as a function of frequency, which is referred to as impedance spectroscopy. 

73.1 Measurement Methods 

Most single frequency BEI measurements are in the range of 50 to 100 kHz (at these frequencies no 
significant electrical shock hazard exists) using currents from 0.5 to 4 mA RMS. Currents at these levels 
are usually necessary to obtain a good signal-to-noise ratio when recording the small pulsatile changes 
that are in the range of .1 to 1% of the total impedance. The use of higher frequencies creates instru- 
mentation design problems due to stray capacity. 

BEI measurements in the 50 to 100 kHz range have typical skin impedance values 2 to 10 times the 
value of the underlying body tissue of interest depending on electrode area. In order to obtain BEI values 
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dZ/dt 



control signal 
FIGURE 73.1 The four electrode impedance measurement technique and the associated instrumentation. 

that can be used to give quantitative biological information, the skin impedance contribution must be 
eliminated. This is accomplished by using the four electrode impedance measurement method shown in 
Fig. 73.1, along with other signal processing blocks used in typical impedance plethysmographs. 

Z bo is the internal section of tissue we wish to measure. If we used two electrodes to make the 
measurement, we would include two skin impedances (i.e., Z skl and Z sk4 ) and two internal tissue imped- 
ances (i.e., Z bI and Z b2 ) which would make it impossible to estimate an accurate value for Z bo . 

A constant current source supplies current, I , to the outside two electrodes 1 and 4. This current flows 
through the skin and body tissue independent of tissue and skin impedance values. The voltage V is 
measured across Z bo with a voltage amplifier using electrodes 2 and 3. Assuming the output impedance 
of the current source is >> Z skl 



Zbi + Z bo + Z b2 + Z sk4 and the input impedance of the voltage amplifier 



is » Z„, 



then 



1 +AZ, Z =V I , and AZ = AV I 



(73.1) 



where Z is the non-time-varying portion of the impedance and AZ is the impedance change typically- 
associated with the pulsation of blood in the region of measurement. 

The output from the voltage pick-up amplifier (Fig. 73.1) is connected to the amplitude detector and 
low pass filter which removes the high frequency carrier signal, which results in an output voltage 
proportional to Z bo . Z bo has a large steady part which is proportional to the magnitude of the tissue 
impedance (Z ) and a small (.1 to 1%) part, AZ, that represents the change due to respiratory or cardiac 
activity. In order to obtain a signal representing AZ , Z must be removed from Z bo and the signal amplified. 
This can be accomplished by capacity coupling or by subtracting a constant that represents Z . The latter 
is usually done because many applications require near dc response. The output of the AZ amplifier will 
be a waveform oscillating around zero volts. The output from the AZ amplifier controls the sample and 
hold circuit. When the AZ output exceeds a given value, usually plus or minus a few tenths of an ohm, 
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to represent the maximal 
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Z(t)=AZ + Z n 



-C 



FIGURE 73.2 Parallel column model. 

the sample and hold circuit updates its value of Z . The output from the sample and hold circuit is 
subtracted from Z bo by the AZ amplifier. The derivative of Z bo is frequently obtained in instruments 
intended for cardiac use. 

73.2 Modeling and Formula Development 

To relate the AZ obtained on the thorax or peripheral limbs to the pulsatile blood volume change, the 
parallel column model, first described by Nyboer [197o], is frequently used (Fig. 73.2). The model consists 
of a conducting volume with impedance Z in parallel with a time-varying column with resistivity p , 
length L, and a time-varying cross-sectional area which oscillates from to a finite value. At the time in 
the cardiac cycle when the pulsatile volume is at a minimum, all of the conducting tissues and fluids are 
represented by the volume labeled Z . This volume can be a heterogeneous mixture of all of the non- 
time-varying tissues such as fat, bone, muscle, etc. in the region under measurement. The only informa- 
tion needed about this volume is its impedance Z and that it is electrically in parallel with the small 
time varying column. During the cardiac cycle, the volume change in the right column starts with a zero 
cross-sectional area and increases in area until its volume equals the blood volume change. If the 
impedance of this volume is much greater than Z , then the following relation holds: 



av = p (l 2 /z o 2 )az 



(73.2) 



where AV = the pulsatile volume change with resistivity p 

p = the resistivity of the pulsatile volume in Q-cm (typically the resistivity of blood) 

L = the length of the cylinder 

Z = the impedance measured when the pulsatile volume is at a minimum 

AZ = the magnitude of the pulsatile impedance change. 

The resistivity of blood, p in Q-cm, is a function of hematocrit (H) expressed as a percentage and can 
be calculated as p = 67.919 exp(0.0247 H) [Mohapatra et al., 1977]. The typical value used for blood is 
150 Q-cm. 

73.3 Respiration Monitoring and Apnea Detection 

If the BEI is measured across the thorax, a variation of approximately 1 to 2 ohms per liter of lung volume 
change is observed, which increases with inspiration. The most common position of the electrodes for 
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Respiration 



ECG ' ! ' ' 

FIGURE 73.3 Example of BEI respiration signal and ECG. 

respiratory measurements is on each side of the thorax along the midaxillary line. The largest signal is 
generally obtained at the level of the xiphsternal joint although a more linear signal is obtained higher 
up near the axilla [Geddes and Baker, 1989]. 

The problems encountered with the quantitative use of BEI for respiration volume are movement 
artifacts and the change in the response depending on whether diaphragmatic or intercostal muscles are 
used. For most applications, the most serious problem is body movement and positional changes artifacts 
which can cause impedance changes significantly larger than the change caused by respiration. 

The determination of apnea or whether respiration has stopped [Neuman, 1988] in infants is one of 
the most widely used applications of BEI. For convenience and due to the lack of space on the thorax of 
infants, only two electrodes are used. These are placed at the mid-thoracic level along the midaxillary 
line and are also used to obtain the ECG. No effort is usually made to quantitate the volume change. 
Filtering is used to reduce movement artifacts and automatic gain controls and adaptive threshold 
detection is used in the breath detection circuits. Due to movement artifacts, the normal breath detection 
rate in infants is not highly reliable. When respiration stops, body movement ceases which eliminates 
the movement artifacts and then apnea can be detected. Ventation detection problems can occur if the 
airway is obstructed and the infant makes inspiratory movement efforts or cardiac-induced impedance 
changes are interpreted as a respiratory signal. Figure 73.3 shows a typical impedance measurement 
during an apneic period. 

73.4 Peripheral Blood Flow 

BEI measurements are made on limbs to determine arterial blood flow into the limb or for the detection 
of venous thrombosis. In both applications, an occluding cuff is inflated above venous pressure to prevent 
outflow for a short period of time. 

Figure 73.4 shows the typical electrode arrangement on the leg and the position of the occluding cuff. 
The cuff is rapidly inflated to 40 to 50 mmHg, which prevents venous outflow without significantly 
changing arterial inflow. The arterial inflow causes an increase in the volume of the limb. The slope of 
the initial impedance change as determined by the first three or four beats is used to measure the arterial 
flow rate. Equation (73.2) is used to calculate the volume change from the impedance change. The flow 
(the slope of the line in Fig. 73.4) is determined by dividing the volume change by the time over which 
the impedance change was measured. The volume change that occurs after the impedance change reaches 
a plateau is a measure of the compliance of the venous system. 

After the volume change of the leg has stabilized, the cuff is quickly deflated which results in an 
exponential decrease in volume. If a thrombosis exists in the veins, the time constant of the outflow 
lengthens. The initial slope, the time constant, and percentage change at 3 sec after cuff deflation have 
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FIGURE 73.4 The measurement of arterial inflow and venous outflow. 

been used to quantitate the measurement. The percentage change at 3 sec has been reported to show the 
best agreement with venograms. The determination of deep venous thrombus is frequently made by 
combining the maximal volume change with the outflow rate. The agreement with a venogram is 94% 
for the detection of deep venous thrombus proximal to the knee [Anderson, 1988]. 

73.5 Cardiac Measurements 

The measurements of chest impedance changes due to cardiac activity have been reported starting in 
1930s. One of the most popular techniques, first reported by Patterson et al. [1964], for quantitative 
measurements uses band electrodes around the ends of the thorax as shown in Fig. 73.5. Each heart beat 
causes a pulsatile decrease in impedance of 0.1 to 0.2 ohms (decreasing AZ and negative dZ/dt are shown 
in an upward direction). The empirical formula for stroke volume based on this method follows from 
Eq. (73.2): 



AV = p(li/Z o 2 ) 



TdZ . /dt 



(73.3) 



where AV 

P 
L 

Z„ 
dZ m 



,/dt 



cardiac stroke volume (mL) 

resistivity of blood (Q-cm) 

distance between the inner band electrodes (cm) 

base impedance (Q) 

the magnitude of the largest negative derivative of the impedance change occurring 

during systole (Q/s) 

systolic ejection time (seconds) 



Many studies have been conducted comparing the impedance technique with other standard methods 
of measuring stroke volume and cardiac output. In general, the correlation coefficient in subjects without 
valve problems or heart failure and with a stable circulatory system is 0.8 to 0.9. In patients with a failing 
circulatory system or valvular or other cardiovascular problems, the correlation coefficient may be less 
than 0.6 [Patterson, 1989]. 

Experimental physiological studies and computer modeling show that multiple sources contribute to 
the impedance signal. The anatomical regions that make significant contributions are the aorta, lungs, 
and atria. Recent studies have reported that the blood resistivity change with flow, pulsatile changes in 
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FIGURE 73.5 Impedance cardiographic waveforms. 

the neck region, and the movement of the heart significantly contribute to the signal [Patterson et al., 
1991, Wang and Patterson, 1995]. It appears that a number of different sources of the thoracic impedance 
change combine in a fortuitous manner to allow for a reasonable correlation between BEI measured 
stroke volume and other accepted techniques. However, in patients with cardiac problems where the 
contributions of the different sources may vary, the stroke volume calculated from BEI measurements 
may have significant error. 

73.6 Body Composition (Single Frequency Measurement) 

The percentage of body fat has been an important parameter in sports medicine, physical conditioning, 
weight loss programs, and predicting optimum body weight. To determine body composition, the body 
is configured as two parallel cylinders similar to the model described earlier. One cylinder is represented 
by fat and the other as fat-free body tissue. Since the resistivity of fat is much larger than muscle and 
other body fluids, the volume determined from the total body impedance measurement is assumed to 
represent the fat-free body volume. Studies have been conducted that calculate the fat-free body mass 
by determining the volume of the fat-free tissue cylinder using the impedance measured between the 
right hand and right foot, and using the subject's height as a measure of the cylinder's length with an 
empirical constant used to replace p [Lukaski, 1985]. Knowing the total weight and assuming body 
density factors, the percentage of body fat can be calculated as the difference between total weight and 
the weight of the fat-free tissue. Many studies have reported the correlation of the percentage of body 
fat calculated from BEI with other accepted standard techniques from approximately 0.88 to 0.98 [Schoe- 
ller and Kushner, 1989]. The physical model used for the equation development is a poor approximation 
to the actual body because it assumes a uniform cross-sectional area for the body between the hand the 
foot. Patterson [1989] pointed out the main problem with the technique: the measured impedance 
depends mostly on the characteristics of the arms and legs and not of the trunk. Therefore, determination 
of body fat with this method may often be inaccurate. 
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FIGURE 73.6 Typical impedance spectroscopy data, model, and the equation used to fit the data. 

73.7 Impedance Spectroscopy 

By measuring BEI over a range of frequencies (typically between 1 kHz to 1 MHz) , the material properties 
of the tissues can be determined [Ackmann and Seitz, 1984]. Figure 73.6 shows the typical complex plane 
plot of the real and imaginary part of impedance and the model used to fit the data. R E represents the 
extra-cellular space, R, the intra-cellular space, and C m the cell membrane. The parameter a is propor- 
tional to the angle of the suppression of the semicircle. It exists to account for the distribution of time 
constants in the tissue. At low frequencies, the current flows in the extra-cellular space and at high 
frequencies the current is capacitively passed across the cell membrane while the extra and intra cell 
spaces are in parallel. 

Using these model parameters, studies have shown positive results in determining intra- and extra- 
cellular fluid volumes [Kanai et al., 1987], body fat [De Lorenzo et al., 1997], tissue ischemica [Cinca 
et al., 1997] and canceous tissues [Jossient, 1998]. 

73.8 Summary 

Electrical impedance instrumentation is not relatively costly, which has encouraged its possible applica- 
tion in many different areas. The impedance measurement is influenced by many different factors 
including geometry, tissue conductivity, and blood flow. Because of this complexity, it is difficult to 
reliably measure an isolated physiological parameter, which has been the principle factor limiting its use. 
The applications that are widely used in clinical medicine are apnea monitoring and the detection of 
venous thrombosis. The other applications described above will need more study before becoming a 
reliable and useful measurement. 

Defining Terms 

Apnea: A suspension of respiration. 

Compliance: The volume change divided by the pressure change. The higher the compliance, the more 

easily the vessel will expand as pressure increases. 
Plethysmography: The measurement of the volume change of an organ or body part. 
Thrombosis: The formation of a thrombus. 
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Thrombus: A clot of blood formed within the heart or vessel. 

Venogram: An X-ray image of the veins using an injected radiopaque contrast material. 
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Further Information 

The book by Nyboer, Electrical Impedance Plethysmography contains useful background information. The 
Encyclopedia of Medical Devices and Instrumentation edited by J.G. Webster and Principles of Applied 
Biomedical Instrumentation by L.A. Geddes and L.E. Baker give a more in-depth description of many 
applications and describe some usual measurements. 
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74. 1 Lung Volumes 

74.2 Pulmonary Function Tests 

Dynamic Tests • The Pneumotachograph • The Nitrogen- 
Leslie A. Geddes Washout Method for Measuring FRC 

Purdue University 74.3 Physiologic Dead Space 

74.1 Lung Volumes 

The amount of air flowing into and out of the lungs with each breath is called the tidal volume (TV). 
In a typical adult this amounts to about 500 mL during quiet breathing. The respiratory system is capable 
of moving much more air than the tidal volume. Starting at the resting expiratory level (REL in Fig. 74.1), 
it is possible to inhale a volume amounting to about seven times the tidal volume; this volume is called 
the inspiratory capacity (IC). A measure of the ability to inspire more than the tidal volume is the 
inspiratory reserve volume (IRV), which is also shown in Fig. 74.1. Starting from REL, it is possible to 
forcibly exhale a volume amounting to about twice the tidal volume; this volume is called the expiratory 
reserve volume (ERV). However, even with the most forcible expiration, it is not possible to exhale all 
the air from the lungs; a residual volume (RV) about equal to the expiratory reserve volume remains. The 
sum of the expiratory reserve volume and the residual volume is designated the functional residual capacity 
(FRC). The volume of air exhaled from a maximum inspiration to a maximum expiration is called the 
vital capacity (VC). The total lung capacity (TLC) is the total air within the lungs, i.e., that which can be 
moved in a vital-capacity maneuver plus the residual volume. All except the residual volume can be 
determined with a volume-measuring instrument such as a spirometer connected to the airway. 

74.2 Pulmonary Function Tests 

In addition to the static lung volumes just identified, there are several time-dependent volumes associated 
with the respiratory act. The minute volume (MV) is the volume of air per breath (tidal volume) multiplied 
by the respiratory rate (R), i.e., MV = (TV) R. It is obvious that the same minute volume can be produced 
by rapid shallow or slow deep breathing. However, the effectiveness is not the same, because not all the 
respiratory air participates in gas exchange, there being a dead space volume. Therefore the alveolar 
ventilation is the important quantity which is defined as the tidal volume (TV) minus the dead space 
(DS) multiplied by the respiratory rate R, i.e., alveolar ventilation = (TV - DS) R. In a normal adult 
subject, the dead space amounts to about 150 mL, or 2 mL/kg. 

Dynamic Tests 

Several timed respiratory volumes describe the ability of the respiratory system to move air. Among these 
are forced vital capacity (FVC), forced expiratory volume in f seconds (FEV f ), the maximum ventilatory 
volume (MW), which was previously designated the maximum breathing capacity (MBC), and the peak 



© 2000 by CRC Press LLC 





6000 


























- 3500 


f 




5500 . 








UJ 

| 

o 

> 


► 3000 






5000 . 






£ 


UI 

> 

UI 


■ 2500 






Woo • 




<u 


(A 




1 








< 


fl£ 


* 2000 










IX 


*J 


>* 




1 




4000 , 




< 




o 




j 








y 


>T 


» 1500 


1 






fc 




DC 

o 

1— 


s 






a: 

UJ 


3500 ■ 


I 


a 


s 

a. 

as 


a. 


■ 1000 




1- 


3000 ■ 




5! 










3 




3 






TIDAL 


500 s "A i 


\\ 


X 


2500 • 








VOLUME 


n - iiy 


y 






< 








* ""* "™*J ** ** 






2000 . 








& 

O UJ 

I- Si UJ 

— «S _1 

a. w o 


wJ 

• 500 E 






1500 • 








X « > 

tit 


• 1000 














1000 - 




. -J 

Sua 

c* x: 


«_> — < 

z </> a. 


-J 
< UJ 

3 X 


• 1500 






300 • 




— 3 

(A -J 
til O 


ess 


T -J 

to o 


- 2000 






















*■ 2300 
FIGURE 74.1 Lung volumes. 

flow (PF). These quantities are measured with a spirometer without valves and C0 2 absorber or with a 
pneumotachograph coupled to an integrator. 

Forced Vital Capacity 

Forced vital capacity (FVC) is shown in Fig. 74.2 and is measured by taking the maximum inspiration 
and forcing all of the inspired air out as rapidly as possible. Table 74.1 presents normal values for males 
and females. 

Forced Expiratory Volume 

Forced expiratory volume in t seconds (FEV f ) is shown in Fig. 74.2, which identifies FEV 5 and FEVj 0> 
and Table 74.1 presents normal values for FEVj . 

Maximum Voluntary Ventilation 

Maximum voluntary ventilation (M W) is the volume of air moved in 1 minute when breathing as deeply 
and rapidly as possible. The test is performed for 20 s and the volume scaled to a 1-min value; Table 74.1 
presents normal values. 

Peak Flow 

Peak flow (PF) in L/min is the maximum flow velocity attainable during an FEV maneuver and represents 
the maximum slope of the expired volume-time curve (Fig. 74.2); typical normal values are shown in 
Table 74.1. 

The Water-Sealed Spirometer 

The water- sealed spirometer was the traditional device used to measure the volume of air moved in 
respiration. The Latin word spirare means to breathe. The most popular type of spirometer consists of 
a hollow cylinder closed at one end, inverted and suspended in an annular space filled with water to 
provide an air-tight seal. Figure 74.3 illustrates the method of suspending the counterbalanced cylinder 
(bell), which is free to move up and down to accommodate the volume of air under it. Movement of the 
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FIGURE 74.2 The measurement of timed forced expiratory volume (FEV t ) and forced vital capacity (FVC). 



TABLE 74. 1 Dynamic Volumes 



Males 

FVC (L) = 0.133H - 0.022A - 3.60 (SEE = 0.58)* 

FEV1 (L) = 0.094H - 0.028A - 1.59 (SEE = 0.52)* 

MVV (L/min) = 3.39H - 1.26A - 21.4 (SEE = 29)* 

PF (L/min) = (10.03 - 0.038A)H f 
Females 

FVC (L) = 0.111H - 0.015A - 3.16 (SD = 0.42) s 

FEV1 (L) = 0.068H - 0.023A - 0.92 (SD = 0.37)' 

MVV (L/min) = 2.05H - 0.57A - 5.5 (SD = 10.7) s 

PF (L/min) = (7.44 - 0.0183A)H* 

H = height in inches, A = age in years, L = liters, L/min = liters per minute, 
SEE = standard error of estimate, SD = standard deviation 
*Kory, Callahan, Boren, Syner. 1961. Am J Med 30:243. 
'Leiner, Abramowitz, Small, Stenby, Lewis. 1963. Amer Rev Resp Dis 88:644. 
•Lindall, Medina, Grismer. 1967. Amer Rev Resp Dis 95:1061. 

bell, which is proportional to volume, is usually recorded by an inking pen applied to a graphic record 
which is caused to move with a constant speed. Below the cylinder, in the space that accommodates the 
volume of air, are inlet and outlet breathing tubes. At the end of one or both of these tubes is a check 
valve designed to maintain a unidirectional flow of air through the spirometer. Outside the spirometer 
the two breathing tubes are brought to a Y tube which is connected to a mouthpiece. With a pinch clamp 
placed on the nose, inspiration diminishes the volume of air under the bell, which descends, causing the 
stylus to rise on the graphic record. Expiration produces the reverse effect. Thus, starting with the 
spirometer half-filled, quiet respiration causes the bell to rise and fall. By knowing the "bell factor," the 
volume of air moved per centimeter excursion of the bell, the volume change can be quantitated. Although 



e 2000 by CRC Press LLC 




RECORDING 



COUNTERBALANCING 
WEIGHT 



FIGURE 74.3 The simple spirometer. 

a variety of flowmeters are now used to measure respiratory volumes, the spirometer with a C0 2 absorber 
is ideally suited to measure oxygen uptake. 

Oxygen Uptake 

A second and very important use for the water-filled spirometer is measurement of oxygen used per unit 
of time, designated the 2 uptake. This measurement is accomplished by incorporating a soda-lime, 
carbon-dioxide absorber into the spirometer as shown in Fig. 74.4a. Soda-lime is a mixture of calcium 
hydroxide, sodium hydroxide, and silicates of sodium and calcium. The exhaled carbon dioxide combines 
with the soda-lime and becomes solid carbonates. A small amount of heat is liberated by this reaction. 
Starting with a spirometer filled with oxygen and connected to a subject, respiration causes the bell 
to move up and down (indicating tidal volume) as shown in Fig. 74.5. With continued respiration the 
baseline of the recording rises, reflecting disappearance of oxygen from under the bell. By measuring the 
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FIGURE 74.4 The spirometer with C0 2 absorber (Fig. 74.5) and a record of oxygen uptake. 
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slope of the baseline on the spirogram, the volume 
of oxygen consumed per minute can be determined. 
Figure 74.5 presents a typical example along with 
calculation. 

The Dry Spirometer 

The water-sealed spirometer was the most popular 
device for measuring the volumes of respiratory 
gases; however, it is not without its inconveniences. 
The presence of water causes corrosion of the metal 
parts. Maintenance is required to keep the device in 
good working order over prolonged periods. To 
eliminate these problems, manufacturers have 
developed dry spirometers. The most common type 
employs a collapsible rubber or plastic bellows, the 
expansion of which is recorded during breathing. 
The earlier rubber models had a decidedly undesir- 
able characteristic which caused their abandon- 
ment. When the bellows was in its mid-position, the 
resistance to breathing was a minimum; when fully 
collapsed, it imposed a slight negative resistance; 
and when fully extended it imposed a slight positive 
resistance to breathing. Newer units with compliant 
plastic bellows minimize this defect. 

The Pneumotachograph 

The pneumotachograph is a device which is placed 

directly in the airway to measure the velocity of air flow. The volume per breath is therefore the integral 
of the velocity-time record during inspiration or expiration. Planimetric integration of the record, or 
electronic integration of the velocity- time signal, yields the tidal volume. Although tidal volume is perhaps 
more easily recorded with the spirometer, the dynamics of respiration are better displayed by the pneu- 
motachograph, which offers less resistance to the air stream and exhibits a much shorter response 
time — so short in most instruments that cardiac impulses are often clearly identifiable in the velocity- 
time record. 

If a specially designed resistor is placed in a tube in which the respiratory gases flow, a pressure drop 
will appear across it. Below the point of turbulent flow, the pressure drop is linearly related to air-flow 
velocity. The resistance may consist of a wire screen or a series of capillary tubes; Fig. 74.6 illustrates both 
types. Detection and recording of this pressure differential constitutes a pneumotachogram; Fig. 74.7 
presents a typical air-velocity record, along with the spirogram, which is the integral of the flow signal. 
The small-amplitude artifacts in the pneumotachogram are cardiac impulses. 

For human application, linear flow rates up to 200 L/min should be recordable with fidelity. The 
resistance to breathing depends upon the flow rate, and it is difficult to establish an upper limit of tolerable 
resistance. Silverman and Whittenberger [1950] stated that a resistance of 6 mm H 2 is perceptible to 
human subjects. Many of the high-fidelity pneumotachographs offer 5-10 mm H 2 resistance at 100 and 
200 L/min. It would appear that such resistances are acceptable in practice. 

Response times of 15-40 ms seem to be currently in use. Fry and coworkers [1957] analyzed the 
dynamic characteristics of three types of commercially available, differential-pressure pneumotacho- 
graphs which employed concentric cylinders, screen mesh, and parallel plates for the air resistors. Using 
a high-quality, differential-pressure transducer with each, they measured total flow resistance ranging 
from 5-15 cm H 2 0. Frequency response curves taken on one model showed fairly uniform response to 
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FIGURE 74.7 Velocity (A) and volume changes (B) during normal, quiet breathing; B is the integral of A. 

40 Hz; the second model showed a slight increase in response at 50 Hz, and the third exhibited a slight 
drop in response at this frequency. 

The Nitrogen-Washout Method for Measuring FRC 

The functional residual capacity (FRC) and the residual volume (RV) are the only lung compartments 
that cannot be measured with a volume-measuring device. Measuring these requires use of the nitrogen 
analyzer and application of the dilution method. 
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FIGURE 74.8 Arrangement of equipment for the nitrogen-washout technique. Valve V allows the subject to breathe 
room air until the test is started. The test is started by operating valve V at the end of a normal breath, i.e. the subject 
starts breathing 100% O, through the inspiratory valve (I) and exhales the N, and 2 mixture into a collecting 
spirometer via the expiratory valve EX. 

Because nitrogen does not participate in respiration, it can be called a diluent. Inspired and expired 
air contain about 80% nitrogen. Between breaths, the FRC of the lungs contains the same concentration 
of nitrogen as in environmental air, i.e., 80%. By causing a subject to inspire from a spirometer filled 
with 100% oxygen and to exhale into a second collecting spirometer, all the nitrogen in the FRC is 
replaced by oxygen, i.e., the nitrogen is "washed out" into the second spirometer. Measurement of the 
concentration of nitrogen in the collecting spirometer, along with a knowledge of its volume, permits 
calculation of the amount of nitrogen originally in the functional residual capacity and hence allows 
calculation of the FRC, as now will be shown. 

Figure 74.8 illustrates the arrangement of equipment for the nitrogen-washout test. Note that two 
check values, (I, EX) are on both sides of the subject's breathing tube, and the nitrogen meter is connected 
to the mouthpiece. Valve V is used to switched the subject from breathing environmental air to the 
measuring system. The left-hand spirometer contains 100% oxygen, which is inhaled by the subject via 
valve I. Of course, a nose clip must be applied so that all the respired gases flow through the breathing 
tube connected to the mouthpiece. It is in this tube that the sampling inlet for the nitrogen analyzer is 
located. Starting at the resting expiratory level, inhalation of pure oxygen causes the nitrogen analyzer 
to indicate zero. Expiration closes valve I and opens valve EX. The first expired breath contains nitrogen 
derived from the FRC (diluted by the oxygen which was inspired); the nitrogen analyzer indicates this 
percentage. The exhaled gases are collected in the right-hand spirometer. The collecting spirometer and 
all the interconnecting tubing was first flushed with oxygen to eliminate all nitrogen. This simple 
procedure eliminates the need for applying corrections and facilitates calculation of the FRC. With 
continued breathing, the nitrogen analyzer indicates less and less nitrogen because it is being washed out 
of the FRC and is replaced by oxygen. Figure 74.9 presents a typical record of the diminishing concentration 
of expired nitrogen throughout the test. In most laboratories, the test is continued until the concentration 
of nitrogen falls to about 1%. The nitrogen analyzer output permits identification of this concentration. 
In normal subjects, virtually all the nitrogen can be washed out of the FRC in about 5 minutes. 

If the peaks on the nitrogen washout record are joined, a smooth exponential decay curve is obtained 
in normal subjects. A semilog of N 2 versus time provides a straight line. In subjects with trapped air, or 
poorly ventilated alveoli, the nitrogen-washout curve consists of several exponentials as the multiple 
poorly ventilated regions give up their nitrogen. In such subjects, the time taken to wash out all the 
nitrogen usually exceeds 10 min. Thus, the nitrogen concentration-time curve provides useful diagnostic 
information on ventilation of the alveoli. 

If it is assumed that all the collected (washed-out) nitrogen was uniformly distributed within the lungs, 
it is easy to calculate the FRC. If the environmental air contains 80% nitrogen, then the volume of nitrogen 
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in the functional residual capacity is 0.8 (FRC). 
Because the volume of expired gas in the col- 
lecting spirometer is known, it is merely neces- 
sary to determine the concentration of nitrogen 
in this volume. To do so requires admitting 
some of this gas to the inlet valve of the nitrogen 
analyzer. Note that this concentration of nitro- 
gen (F m ) exists in a volume which includes the 
volume of air expired (V E ) plus the original 
volume of oxygen in the collecting spirometer 
( V„) at the start of the test and the volume of 
the tubing ( V t ) leading from the expiratory col- 
lecting valve. It is therefore advisable to start 
with an empty collecting spirometer (V = 0). 
Usually the tubing volume (V f ) is negligible 

with respect to the volume of expired gas collected in a typical washout test. In this situation the volume 
of nitrogen collected is V E F N2 , where -F N2 is the fraction of nitrogen within the collected gas. Thus, 0.80 
(FRC) = F m (V E ). Therefore 
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FRC = 



F V 

N2 E 

0.80 



It is important to note that the value for FRC so obtained is at ambient temperature and pressure and 
is saturated with water vapor (ATPS). In respiratory studies, this value is converted to body temperature 
and saturated with water vapor (BTPS). 

In the example shown in Fig. 74.9, the washout to 1% took about 44 breaths. With a breathing rate 
of 12/min, the washout time was 220 s. The volume collected (V E ) was 22 L and the concentration of 
nitrogen in this volume was 0.085 (-F N2 ); therefore 



0.085x22000 
FRC = = 2337mL 



0.80 



74.3 Physiologic Dead Space 



The volume of ventilated lung that does not participate in gas exchange is the physiologic dead space 
(V d ). It is obvious that the physiologic dead space includes anatomic dead space, as well as the volume 
of any alveoli that are not perfused. In the lung, there are theoretically four types of alveoli, as shown in 
Fig. 74.10. The normal alveolus (A) is both ventilated and perfused with blood. There are alveoli that 
are ventilated but not perfused (B); such alveoli contribute significantly to the physiologic dead space. 
There are alveoli that are not ventilated but perfused (C); such alveoli do not provide the exchange of 
respiratory gases. Finally, there are alveoli that are both poorly ventilated and poorly perfused (D); such 
alveoli contain high C0 2 and N 2 and low 2 . These alveoli are the last to expel their CO, and N 2 in 
washout tests. 

Measurement of physiologic dead space is based on the assumption that there is almost complete 
equilibrium between alveolar pCO, and pulmonary capillary blood. Therefore, the arterial pCO, repre- 
sents mean alveolar pC0 2 over many breaths when an arterial blood sample is drawn for analysis of 
pC0 2 . The Bohr equation for physiologic dead space is 
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FIGURE 74.10 The four types of alveoli. 

In this expression, paC0 2 is the partial pressure in the arterial blood sample which is withdrawn slowly 
during the test; pEC0 2 is the partial pressure of C0 2 in the volume of expired air; V E is the volume of 
expired air per breath (tidal volume). 

In a typical test, the subject would breathe in room air and exhale into a collapsed (Douglas) bag. The 
test is continued for 3 min or more, and the number of breaths is counted in that period. An arterial 
blood sample is withdrawn during the collection period. The pC0 2 in the expired gas is measured, and 
then the volume of expired gas is measured by causing it to flow into a spirometer or flowmeter by 
collapsing the collecting bag. 

In a typical 3-min test, the collected volume is 33 L, and the pC0 2 in the expired gas is 14.5 mmHg. 
During the test, the pC0 2 in the arterial blood sample was 40 mmHg. The number of breaths was 60; 
therefore, the average tidal volume was 33000/60 = 550 mL. The physiologic dead space (V d ) is: 



v., 



40-14.5 
40 



550 = 350mL 



It is obvious that an elevated physiological dead space indicates lung tissue that is not perfused with 
blood. 
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The purpose of the clinical laboratory is to analyze body fluids and tissues for specific substances of 
interest and to report the results in a form which is of value to clinicians in the diagnosis and treatment 
of disease. A large range of tests has been developed to achieve this purpose. Four terms commonly used 
to describe tests are accuracy, precision, sensitivity, and specificity. An accurate test, on average, yields true 
values. Precision is the ability of a test to produce identical results upon repeated trials. Sensitivity is a 
measure of how small an amount of substance can be measured. Specificity is the degree to which a test 
measures the substance of interest without being affected by other substances which may be present in 
greater amounts. 

The first step in many laboratory tests is to separate the material of interest from other substances. 
This may be accomplished through extraction, filtration, and centrifugation. Another step is derivatiza- 
tion, in which the substance of interest is chemically altered through addition of reagents to change it 
into a substance which is easily measured. For example, one method for measuring glucose is to add 
o-toluidine which, under proper conditions, forms a green-colored solution with an absorption maxi- 
mum at 630 nm. Separation and derivatization both improve the specificity required of good tests. 

75.1 Separation Methods 

Centrifuges are used to separate materials on the basis of their relative densities. The most common use 
in the laboratory is the separation of cells and platelets from the liquid part of the blood. This requires 
a relative centrifugal force (RCF) of roughly 1000 g (1000 times the force of gravity) for a period of 
10 minutes. Relative centrifugal force is a function of the speed of rotation and the distance of the sample 
from the center of rotation as stated in Eq. (75.1) 
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RCF = (l.l2xl(T 5 ] r(rpm) (75.1) 



where RCF = relative centrifugal force in g, and r = radius in cm. 

Some mixtures require higher g-loads in order to achieve separation in a reasonable period of time. 
Special rotors contain the sample tubes inside a smooth container, which minimizes air resistance to 
allow faster rotational speeds. Refrigerated units maintain the samples at a cool temperature throughout 
long high-speed runs which could lead to sample heating due to air friction on the rotor. Ultracentrifuges 
operate at speeds on the order of 100,000 rpm and provide relative centrifugal forces of up to 600,000 g. 
These usually require vacuum pumps to remove the air which would otherwise retard the rotation and 
heat the rotor. 

75.2 Chromatographic Separations 

Chromatographic separations depend upon the different rates at which various substances moving in a 
stream (mobile phase) are retarded by a stationary material (stationary phase) as they pass over it. The 
mobile phase can be a volatilized sample transported by an inert carrier gas such as helium or a liquid 
transported by an organic solvent such as acetone. Stationary phases are quite diverse depending upon 
the separation being made, but most are contained within a long, thin tube (column). Liquid stationary 
phases may be used by coating them onto inert packing materials. When a sample is introduced into a 
chromatographic column, it is carried through it by the mobile phase. As it passes through the column, 
the substances which have greater affinity for the stationary phase fall behind those with less affinity. The 
separated substances may be detected as individual peaks by a suitable detector placed at the end of the 
chromatographic column. 

75.3 Gas Chromatography 

The most common instrumental chromatographic method used in the clinical laboratory is the gas-liquid 
chromatograph. In this system the mobile phase is a gas, and the stationary phase is a liquid coated onto 
either an inert support material, in the case of a packed column, or the inner walls of a very thin tube, in 
the case of a capillary column. Capillary columns have the greatest resolving power but cannot handle 
large sample quantities. The sample is injected into a small heated chamber at the beginning of the column, 
where it is volatilized if it is not already a gaseous sample. The sample is then carried through the column 
by an inert carrier gas, typically helium or nitrogen. The column is completely housed within an oven. 
Many gas chromatographs allow for the oven temperature to be programmed to slowly increase for a set 
time after the sample injection is made. This produces peaks which are spread more uniformly over time. 

Four detection methods commonly used with gas chromatography are thermal conductivity, flame 
ionization, nitrogen/phosphorous, and mass spectrometry. The thermal conductivity detector takes advan- 
tage of variations in thermal conductivity between the carrier gas and the gas being measured. A heated 
filament immersed in the gas leaving the chromatographic column is part of a Wheatstone bridge circuit. 
Small variations in the conductivity of the gas cause changes in the resistance of the filament, which are 
recorded. The flame ionization detector measures the current between two plates with a voltage applied 
between them. When an organic material appears in the flame, ions which contribute to the current are 
formed. The NP detector, or nitrogen/phosphorous detector, is a modified flame ionization detector (see 
Fig. 75.1) which is particularly sensitive to nitrogen- and phosphorous-containing compounds. 

Mass spectrometry (MS) provides excellent sensitivity and selectivity. The concept behind these devices 
is that the volatilized sample molecules are broken into ionized fragments which are then passed through 
a mass analyzer that separates the fragments according to their mass/charge ( mlz) ratios. A mass spectrum, 
which is a plot of the relative abundance of the various fragments versus m/z, is produced. The mass 
spectrum is characteristic of the molecule sampled. The mass analyzer most commonly used with gas 
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FIGURE 75.1 Flame ionization detector. Organic compounds in the column effluent are ionized in the flame, 
producing a current proportional to the amount of the compound present. 

chromatographs is the quadrupole detector, which consists of four rods that have dc and RF voltages 
applied to them. The mlz spectrum can be scanned by appropriate changes in the applied voltages. The 
detector operates in a manner similar to that of a photomultiplier tube except that the collision of the 
charged particles with the cathode begins the electron cascade, resulting in a measurable electric pulse 
for each charged particle captured. The MS must operate in a high vacuum, which requires good pumps 
and a porous barrier between the GC and MS that limits the amount of carrier gas entering the MS. 

75.4 High-Performance Liquid Chromatography 

In liquid chromatography, the mobile phase is liquid. High-performance liquid chromatography (HPLC) 
refers to systems which obtain excellent resolution in a reasonable time by forcing the mobile phase at 
high pressure through a long thin column. The most common pumps used are pistons driven by 
asymmetrical cams. By using two such pumps in parallel and operating out of phase, pressure fluctuations 
can be minimized. Typical pressures are 350-1500 psi, though the pressure may be as high as 10,000 psi. 
Flow rates are in the 1-10 mL/min range. 

A common method for placing a sample onto the column is with a loop injector, consisting of a loop 
of tubing which is filled with the sample. By a rotation of the loop, it is brought in series with the column, 
and the sample is carried onto the column. A UV/visible spectrophotometer is often used as a detector 
for this method. A mercury arc lamp with the 254-nm emission isolated is useful for detection of aromatic 
compounds, while diode array detectors allow a complete spectrum from 190 nm to 600 nm in 10 msec. 
This provides for detection and identification of compounds as they come off the column. Fluorescent, 
electrochemical, and mass analyzer detectors are also used. 

75.5 Basis for Spectral Methods 

Spectral methods rely on the absorption or emission of electromagnetic radiation by the sample of 
interest. Electromagnetic radiation is often described in terms of frequency or wavelength. Wavelengths 
are those obtained in a vacuum and may be calculated with the formula 
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1 = c/v (75.2) 

where X = wavelength in meters 

c = speed of light in vacuum (3 x 10 8 m/s) 
v = frequency in Hz 

The frequency range of interest for most clinical laboratory work consists of the visible (390-780 nm) 
and the ultraviolet or UV (180-390 nm) ranges. Many substances absorb different wavelengths prefer- 
entially. When this occurs in the visible region, they are colored. In general, the color of a substance is 
the complement of the color it absorbs, e.g., absorption in the blue produces a yellow color. For a given 
wavelength or bandwidth, transmittance is denned as 

T = ^ (75.3) 

J. 

where T= transmittance ratio (often expressed as %) 
J, = incident light intensity 
I t = transmitted light intensity 



Absorbance is defined as 



A=log 10 l/T (75.4) 



Under suitable conditions, the absorbance of a solution with an absorbing compound dissolved in it is 
proportional to the concentration of that compound as well as the path length of light through it. This 
relationship is expressed by Beer's law: 

A=abc (75.5) 

where A = absorbance 
a = a constant 
b = path length 
c = concentration 

A number of situations may cause deviations from Beer's law, such as high concentration or mixtures of 
compounds which absorb at the wavelength of interest. From an instrumental standpoint, the primary 
causes are stray light and excessive spectral bandwidth. Stray light refers to any light reaching the detector 
other than light from the desired pass-band which has passed through sample. Sources of stray light may 
include room light leaking into the detection chamber, scatter from the cuvette, and undesired fluorescence. 

A typical spectrophotometer consists of a light source, some form of wavelength selection, and a 
detector for measuring the light transmitted through the samples. There is no single light source that 
covers the entire visible and UV spectrum. The source most commonly used for the visible part of the 
spectrum is the tungsten-halogen lamp, which provides continuous radiation over the range of 360 to 
950 nm. The deuterium lamp has become the standard for much UV work. It covers the range from 
220-360 nm. Instruments which cover the entire UV/visible range use both lamps with a means for 
switching from one lamp to the other at a wavelength of approximately 360 nm (Fig. 75.2). 

Wavelength selection is accomplished with filters, prisms, and diffraction gratings. Specially designed 
interference filters can provide bandwidths as small as 5 nm. These are useful for instruments which do 
not need to scan a range of wavelengths. Prisms produce a nonlinear dispersion of wavelengths with the 
longer wavelengths closer together than the shorter ones. Since the light must pass through the prism 
material, they must be made of quartz for UV work. Diffraction gratings are surfaces with 1000-3000 
grooves/mm cut into them. They may be transmissive or reflective; the reflective ones are more popular 
since there is no attenuation of light by the material. They produce a linear dispersion. By proper selection 
of slit widths, pass bands of 0.1 nm are commonly achieved. 
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FIGURE 75.2 Dual-beam spectrophotometer. The diffraction grating is rotated to select the desired wavelength. 
The beam splitter consists of a half-silvered mirror which passes half the light while reflecting the other half. A 
rotating mirror with cut-out sections (chopper) alternately directs one beam and then the other to the detector. 
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FIGURE 75.3 Photomultiplier tube. Incident photons cause the photocathode to emit electrons which collide with 
the first dynode which emits additional electrons. Multiple dynodes provide sufficient gain to produce an easily 
measurable electric pulse from a single photon. 

The most common detector is the photomultiplier tube, which consists of a photosensitive cathode 
that emits electrons in proportion to the intensity of light striking it (Fig. 75.3). A series of 10-15 dynodes, 
each at 50-100 volts greater potential than the preceding one, produce an electron amplification of 
4-6 per stage. Overall gains are typically a million or more. Photomultiplier tubes respond quickly and 
cover the entire spectral range. They require a high voltage supply and can be damaged if exposed to 
room light while the high voltage is applied. 



75.6 Fluorometry 



Certain molecules absorb a photon's energy and then emit a photon with less energy (longer wavelength). 
When the reemission occurs in less than 10~ 8 s, the process is known as fluorescence. This physical process 
provides the means for assays which are 10-100 times as sensitive as those based on absorption mea- 
surements. This increase in sensitivity is largely because the light measured is all from the sample of 
interest. A dim light is easily measured against a black background, while it may be lost if added to an 
already bright background. 

Fluorometers and spectrofluorometers are very similar to photometers and spectrophotometers but with 
two major differences. Fluorometers and spectrofluorometers use two monochrometers, one for excitation 
light and one for emitted light. By proper selection of the bandpass regions, all the light used to excite the 
sample can be blocked from the detector, assuring that the detector sees only fluorescence. The other 
difference is that the detector is aligned off-axis, commonly at 90°, from the excitation source. At this angle, 
scatter is minimal, which helps ensure a dark background for the measured fluorescence. Some spectrof- 
luorometers use polarization filters both on the input and output light beams, which allows for fluorescence 
polarization studies (Fig. 75.4). An intense light source in the visible-to-UV range is desirable. A common 
source is the xenon or mercury arc lamps, which provide a continuum of radiation over this range. 
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FIGURE 75.4 Spectrofluorometer. Fluorescence methods can be extremely sensitive to the low background inter- 
ference. Since the detector is off-axis from the incident light and a second monochromator blocks light of wavelengths 
illuminating the sample, virtually no signal reaches the detector other than the desired fluorescence. 

75.7 Flame Photometry 

Flame photometry is used to measure sodium, potassium, and lithium in body fluids. When these 
elements are heated in a flame they emit characteristic wavelengths of light. The major emission lines 
are 589 nm (yellow) for sodium, 767 nm (violet) for potassium, and 671 nm (red) for lithium. An 
atomizer introduces a fine mist of the sample into a flame. For routine laboratory use, a propane and 
compressed air flame is adequate. High-quality interference filters with narrow pass bands are often used 
to isolate the major emission lines. The narrow band pass is necessary to maximize the signal-to-noise 
ratio. Since it is impossible to maintain stable aspiration, atomization, and flame characteristics, it is 
necessary to use an internal standard of known concentration while making measurements of unknowns. 
In this way the ratio of the unknown sample's emission to the internal standard's emission remains stable 
even as the total signal fluctuates. An internal standard is usually an element which is found in very low 
concentration in the sample fluid. By adding a high concentration of this element to the sample, its 
concentration can be known to a high degree of accuracy. Lithium, potassium, and cesium all may be 
used as internal standards depending upon the particular assay being conducted. 

75.8 Atomic Absorption Spectroscopy 

Atomic absorption spectroscopy is based on the fact that just as metal elements have unique emission 
lines, they have identical absorption lines when in a gaseous or dissociated state. The atomic absorption 
spectrometer takes advantage of these physical characteristics in a clever manner, producing an instru- 
ment with approximately 100 times the sensitivity of a flame photometer for similar elements. The sample 
is aspirated into a flame, where the majority of the atoms of the element being measured remain in the 
ground state, where they are capable of absorbing light at their characteristic wavelengths. An intense 
source of exactly these wavelengths is produced by a hollow cathode lamp. These lamps are constructed 
so that the cathode is made from the element to be measured, and the lamps are filled with a low pressure 
of argon or neon gas. When a current is passed through the lamp, metal atoms are sputtered off the 
cathode and collide with the argon or neon in the tube, producing emission of the characteristic wave- 
lengths. A monochromator and photodetector complete the system. 

Light reaching the detector is a combination of that which is emitted by the sample (undesirable) and 
light from the hollow cathode lamp which was not absorbed by the sample in the flame (desirable). By 
pulsing the light from the lamp either by directly pulsing the lamp or with a chopper, and using a detector 
which is sensitive to ac signals and insensitive to dc signals, the undesirable emission signal is eliminated. 
Each element to be measured requires a lamp with that element present in the cathode. Multielement 
lamps have been developed to minimize the number of lamps required. Atomic absorption spectropho- 
tometers may be either single beam or double beam; the double-beam instruments have greater stability. 
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FIGURE 75.5 Nephelometer. Light scattered by large molecules is measured at an angle a away from the axis of 
incident light. The filters select the wavelength range desired and block undesired fluorescence. When a = 0, the 
technique is known as turbidimetry. 

There are various flameless methods for atomic absorption spectroscopy in which the burner is 
replaced with a method for vaporizing the element of interest without a flame. The graphite furnace 
which heats the sample to 2700° consists of a hollow graphite tube which is heated by passing a large 
current through it. The sample is placed within the tube, and the light beam is passed through it while 
the sample is heated. 

75.9 Turbidimetry and Nephelometry 

Light scattering by particles in solution is directly proportional to both concentration and molecular weight 
of the particles. For small molecules the scattering is insignificant, but for proteins, immunoglobulins, 
immune complexes, and other large particles, light scattering can be an effective method for the detection 
and measurement of particle concentration. For a given wavelength X of light and particle size d, scattering 
is described as Raleigh {d < A./10), Raleigh-Debye {d ~ X), or Mie (d > 10X). For particles that are small 
compared to the wavelength, the scattering is equal in all directions. However, as the particle size becomes 
larger than the wavelength of light, it becomes preferentially scattered in the forward direction. Light- 
scattering techniques are widely used to detect the formation of antigen-antibody complexes in immu- 
noassays. 

When light scattering is measured by the attenuation of a beam of light through a solution, it is called 
turbidimetry. This is essentially the same as absorption measurements with a photometer except that a 
large pass-band is acceptable. When maximum sensitivity is required a different method is used — direct 
measurement of the scattered light with a detector placed at an angle to the central beam. This method 
is called nephelometry. A typical nephelometer will have a light source, filter, sample cuvette, and detector 
set at an angle to the incident beam (Fig. 75.5). 

Defining Terms 

Accuracy: The degree to which the average value of repeated measurements approximate the true value 

being measured. 
Fluorescence: Emission of light by an atom or molecule following absorption of a photon by greater 

energy. Emission normally occurs within 10~ 8 of absorption. 
Nephelometry: Measurement of the amount of light scattered by particles suspended in a fluid. 
Precision: A measure of test reproducibility. 

Sensitivity: A measure of how small an amount or concentration of an analyte can be detected. 
Specificity: A measure of how well a test detects the intended analyte without being "fooled" by other 

substances in the sample. 
Turbidimetry: Measurement of the attenuation of a light beam due to light lost to scattering by particles 

suspended in a fluid. 
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76.1 Particle Counting and Identification 

The Coulter principle was the first major advance in automating blood cell counts. The cells to be counted 
are drawn through a small aperture between two fluid compartments, and the electric impedance between 
the two compartments is monitored (see Fig. 76.1). As cells pass through the aperture, the impedance 
increases in proportion to the volume of the cell, allowing large numbers of cells to be counted and sized 
rapidly. Red cells are counted by pulling diluted blood through the aperture. Since red cells greatly 
outnumber white cells, the contribution of white cells to the red cell count is usually neglected. White 
cells are counted by first destroying the red cells and using a more concentrated sample. 

Modern cell counters using the Coulter principle often use hydrodynamic focusing to improve the 
performance of the instrument. A sheath fluid is introduced which flows along the outside of a channel 
with the sample stream inside it. By maintaining laminar flow conditions and narrowing the channel, 
the sample stream is focused into a very thin column with the cells in single file. This eliminates problems 
with cells flowing along the side of the aperture or sticking to it and minimizes problems with having 
more than one cell in the aperture at a time. 

Flow cytometry is a method for characterizing, counting, and separating cells which are suspended in 
a fluid. The basic flow cytometer uses hydrodynamic focusing to produce a very thin stream of fluid 
containing cells moving in single file through a quartz flow chamber (Fig. 76.2). The cells are characterized 
on the basis of their scattering and fluorescent properties. This simultaneous measurement of scattering 
and fluorescence is accomplished with a sophisticated optical system that detects light from the sample 
both at the wavelength of the excitation source (scattering) as well as at longer wavelengths (fluorescence) 
at more than one angle. Analysis of these measurements produces parameters related to the cells' size, 
granularity, and natural or tagged fluorescence. High-pressure mercury or xenon arc lamps can be used 
as light sources, but the argon laser (488 nm) is the preferred source for high-performance instruments. 

One of the more interesting features of this technology is that particular cells may be selected at rates 
that allow collection of quantities of particular cell types adequate for further chemical testing. This is 
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FIGURE 76.1 Coulter method. Blood cells are surrounded by an insulating membrane, which makes them non- 
conductive. The resistance of electrolyte-filled channel will increase slightly as cells flow through it. This resistance 
variation yields both the total number of cells which flow through the channel and the volume of each cell. 
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FIGURE 76.2 Flow cytometer. By combining hydrodynamic focusing, state-of-the-art optics, fluorescent labels, 
and high-speed computing, large numbers of cells can be characterized and sorted automatically. 

accomplished by breaking the outgoing stream into a series of tiny droplets using piezoelectric vibration. 
By charging the stream of droplets and then using deflection plates controlled by the cell analyzer, the 
cells of interest can be diverted into collection vessels. 

The development of monoclonal antibodies coupled with flow cytometry allows for quantitation of 
T and B cells to assess the status of the immune system as well as characterization of leukemias, lym- 
phomas, and other disorders. 



3 2000 by CRC Press LLC 




indicator 
electrode 



Ag/AgCl 
electrode 



reference 
electrode 



liquid junction 



FIGURE 76.3 Electrochemical cell. 



76.2 Electrochemical Methods 

Electrochemical methods are increasingly popular in the clinical laboratory, for measurement not only 
of electrolytes, blood gases, and pH but also of simple compounds such as glucose. Potentiometry is a 
method in which a voltage is developed across electrochemical cells as shown in Fig. 76.3. This voltage 
is measured with little or no current flow. 

Ideally, one would like to measure all potentials between the reference solution in the indicator 
electrode and the test solution. Unfortunately there is no way to do that. Interface potentials develop 
across any metal-liquid boundary, across liquid junctions, and across the ion-selective membrane. The 
key to making potentiometric measurements is to ensure that all the potentials are constant and do not 
vary with the composition of the test solution except for the potential of interest across the ion-selective 
membrane. By maintaining the solutions within the electrodes constant, the potential between these 
solutions and the metal electrodes immersed in them is constant. The liquid junction is a structure which 
severely limits bulk flow of the solution but allows free passage of all ions between the solutions. The 
reference electrode commonly is filled with saturated KC1, which produces a small, constant liquid- 
junction potential. Thus, any change in the measured voltage (V) is due to a change in the ion concen- 
tration in the test solution for which the membrane is selective. 

The potential which develops across an ion-selective membrane is given by the Nernst equation: 



V- 



zF 



liA 



(76.1) 



where R = gas constant = 8.314 J/K-mol 
T = temperature in K 
z = ionization number 
F = Faraday constant = 9.649 x 10 4 C/Mol 
a n = activity of ion in solution n 

When one of the solutions is a reference solution, this equation can be rewritten in a convenient form as 



N 



(76.2) 



where V = a constant voltage due to reference solution 

N = Nernst slope ~ 59 mV/decade at room temperature 
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The actual Nernst slope is usually slightly less than the theoretical value. Thus, the typical pH meter has 
two calibration controls. One adjusts the offset to account for the value of V , and the other adjusts the 
range to account for both temperature effects and deviations from the theoretical Nernst slope. 

76.3 Ion-Specific Electrodes 

Ion-selective electrodes use membranes which are permeable only to the ion being measured. To the 
extent that this can be done, the specificity of the electrode can be very high. One way of overcoming a 
lack of specificity for certain electrodes is to make multiple simultaneous measurement of several ions 
which include the most important interfering ones. A simple algorithm can then make corrections for 
the interfering effects. This technique is used in some commercial electrolyte analyzers. A partial list of 
the ions that can be measured with ion-selective electrodes includes H + (pH), Na + , K + , Li + , Ca ++ , Ch, F", 
NH + 4, and C0 2 . 

NH + 4 , and C0 2 are both measured with a modified ion-selective electrode. They use a pH electrode 
modified with a thin layer of a solution (sodium bicarbonate for C0 2 and ammonium chloride for NH^) 
whose pH varies depending on the concentration of ammonium ions or C0 2 it is equilibrated with. A 
thin membrane holds the solution against the pH glass electrode and provides for equilibration with the 
sample solution. Note that the C0 2 electrode in Fig. 76.4 is a combination electrode. This means that 
both the reference and indicating electrodes have been combined into one unit. Most pH electrodes are 
made as combination electrodes. 

The Clark electrode measures p0 2 by measuring the current developed by an electrode with an applied 
voltage rather than a voltage measurement. This is an example of amperometry. In this electrode a voltage 
of approximately -0.65 V is applied to a platinum electrode relative to a Ag/AgCl electrode in an electrolyte 
solution. The reaction 



2 +2H + +2e~ 



H,0 2 



proceeds at a rate proportional to the partial pressure of oxygen in the solution. The electrons involved 
in this reaction form a current which is proportional to the rate of the reaction and thus to the p0 2 in 
the solution. 
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FIGURE 76.4 Clark electrode. 



© 2000 by CRC Press LLC 



76.4 Radioactive Methods 

Isotopes are atoms which have identical atomic number (number of protons) but different atomic mass 
numbers (protons + neutrons). Since they have the same number of electrons in the neutral atom, they 
have identical chemical properties. This provides an ideal method for labeling molecules in a way that 
allows for detection at extremely low concentrations. Labeling with radioactive isotopes is extensively 
used in radioimmunoassays where the amount of antigen bound to specific antibodies is measured. The 
details of radioactive decay are complex, but for our purposes there are three types of emission from 
decaying nuclei: alpha, beta, and gamma radiation. Alpha particles are made up of two neutrons and two 
protons (helium nucleus). Alpha emitters are rarely used in the clinical laboratory. Beta emission consists 
of electrons or positrons emitted from the nucleus. They have a continuous range of energies up to a 
maximum value characteristic of the isotope. Beta radiation is highly interactive with matter and cannot 
penetrate very far in most materials. Gamma radiation is a high-energy form of electromagnetic radiation. 
This type of radiation may be continuous, discrete, or mixed depending on the details of the decay 
process. It has greater penetrating ability than beta radiation. (See Fig. 76.5.) 

The kinetic energy spectrum of emitted radiation is characteristic of the isotope. The energy is 
commonly measured in electron volts (eV). One electron volt is the energy acquired by an electron falling 
through a potential of 1 volt. The isotopes commonly used in the clinical laboratory have energy spectra 
which range from 18 keV-3.6 MeV. 

The activity of a quantity of radioactive isotope is defined as the number of disintegrations per second 
which occur. The usual units are the curie (Ci), which is defined as 3.7 x 10 10 dps, and the becquerel 
(Bq), defined as 1 dps. Specific activity for a given isotope is defined as activity per unit mass of the isotope. 

The rate of decay for a given isotope is characterized by the decay constant X, which is the proportion of 
the isotope which decays in unit time. Thus, the rate of loss of radioactive isotope is governed by the equation 



dN 
dt 
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(76.3) 



where Nis the amount of radioactive isotope present at time t. The solution to this differential equation is: 



N=N e~ 



(76.4) 
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FIGURE 76.5 Gamma counted. The intensity of the light flash produced when a gamma photon interacts with a 
scintillator is proportional to the energy of the photon. The photomultiplier tube converts these light flashes into 
electric pulses which can be selected according to size (gamma energy) and counted. 
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It can easily be shown that the amount of radioactive isotope present will be reduced by half after time 

t/,=— — (76.5) 



'1/2 



X 



This is known as the half-life for the isotope and can vary widely; for example, carbon- 14 has a half-life 
of 5760 years, and iodine-131 has a half-life of 8.1 days. 

The most common method for detection of radiation in the clinical laboratory is by scintillation. This 
is the conversion of radiation energy into photons in the visible or near-UV range. These are detected 
with photomultiplier tubes. 

For gamma radiation, the scintillating crystal is made of sodium iodide doped with about 1% thallium, 
producing 20 to 30 photons for each electron-volt of energy absorbed. The photomultiplier tube and 
amplifier circuit produce voltage pulses proportional to the energy of the absorbed radiation. These 
voltage pulses are usually passed through a pulse-height analyzer which eliminates pulses outside a preset 
energy range (window). Multichannel analyzers can discriminate between two or more isotopes if they 
have well-separated energy maxima. There generally will be some spill down of counts from the higher- 
energy isotope into the lower-energy isotope's window, but this effect can be corrected with a simple 
algorithm. Multiple well detectors with up to 64 detectors in an array are available which increase the 
throughput for counting systems greatly. Counters using the sodium iodide crystal scintillator are referred 
to as gamma counters or well counters. 

The lower energy and short penetration ability of beta particles requires a scintillator in direct contact 
with the decaying isotope. This is accomplished by dissolving or suspending the sample in a liquid fluor. 
Counters which use this technique are called beta counters or liquid scintillation counters. 

Liquid scintillation counters use two photomultiplier tubes with a coincidence circuit that prevents 
counting of events seen by only one of the tubes. In this way, false counts due to chemiluminescence and 
noise in the phototube are greatly reduced. Quenching is a problem in all liquid scintillation counters. 
Quenching is any process which reduces the efficiency of the scintillation counting process, where 
efficiency is defined as 

Efficiency = counts per minute/ decays per minute (76.6) 

A number of techniques have been developed that automatically correct for quenching effects to produce 
estimates of true decays per minute from the raw counts. Currently there is a trend away from beta- 
emitting isotopic labels, but these assays are still used in many laboratories. 

76.5 Coagulation Timers 

Screening for and diagnosis of coagulation disorders is accomplished by assays that determine how long 
it takes for blood to clot following initiation of the clotting cascade by various reagents. A variety of 
instruments have been designed to automate this procedure. In addition to increasing the speed and 
throughput of such testing, these instruments improve the reproducibility of such tests. All the instru- 
ments provide precise introduction of reagents, accurate timing circuits, and temperature control. They 
differ in the method for detecting clot formation. One of the older methods still in use is to dip a small 
metal hook into the blood sample repeatedly and lift it a few millimeters above the surface. The electric 
resistance between the hook and the sample is measured, and when fibrin filaments form, they produce 
a conductive pathway which is detected as clot formation. Other systems detect the increase in viscosity 
due to fibrin formation or the scattering due to the large polymerized molecules formed. Absorption 
and fluorescence spectroscopy can also be used for clot detection. 
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76.6 Osmometers 

The colligative properties of a solution are a function of the number of solute particles present regardless 
of size or identity. Increased solute concentration causes an increase in osmotic pressure and boiling 
point and a decrease in vapor pressure and freezing point. Measuring these changes provides information 
on the total solute concentration regardless of type. The most accurate and popular method used in 
clinical laboratories is the measurement of freezing point depression. With this method, the sample is 
supercooled to a few degrees below 0°C while being stirred gently. Freezing is then initiated by vigorous 
stirring. The heat of fusion quickly brings the solution to a slushy state where an equilibrium exists 
between ice and liquid, ensuring that the temperature is at the freezing point. This temperature is 
measured. A solute concentration of 1 osmol/kg water produces a freezing point depression of 1.858°C. 
The measured temperature depression is easily calibrated in units of milliosmols/kg water. 

The vapor pressure depression method has the advantage of smaller sample size. However, it is not as 
precise as the freezing point method and cannot measure the contribution of volatile solutes such as 
ethanol. This method is not used as widely as the freezing point depression method in clinical laboratories. 

Osmolality of blood is primarily due to electrolytes such as Na + and Ch. Proteins with molecular weights 
of 30,000 or more atomic mass units (amu) contribute very little to total osmolality due to their smaller 
numbers (a single Na + ion contributes just as much to osmotic pressure as a large protein molecule). 
However, the contribution to osmolality made by proteins is of great interest when monitoring conditions 
leading to pulmonary edema. This value is known as colloid osmotic pressure, or oncotic pressure, and 
is measured with a membrane permeable to water and all molecules smaller than about 30,000 amu. By 
placing a reference saline solution on one side and the unknown sample on the other, an osmotic pressure 
is developed across the membrane. This pressure is measured with a pressure transducer and can be related 
to the true colloid osmotic pressure through a calibration procedure using known standards. 

76.7 Automation 

Improvements in technology coupled with increased demand for laboratory tests as well as pressures to 
reduce costs have led to the rapid development of highly automated laboratory instruments. Typical 
automated instruments contain mechanisms for measuring, mixing, and transport of samples and 
reagents, measurement systems, and one or more microprocessors to control the entire system. In addition 
to system control, the computer systems store calibration curves, match test results to specimen IDs, and 
generate reports. Automated instruments are dedicated to complete blood counts, coagulation studies, 
microbiology assays, and immunochemistry, as well as high-volume instruments used in clinical chem- 
istry laboratories. The chemistry analyzers tend to fall into one of four classes: continuous flow, centrif- 
ugal, pack-based, and dry-slide-based systems. The continuous flow systems pass successive samples and 
reagents through a single set of tubing, where they are directed to appropriate mixing, dialyzing, and 
measuring stations. Carry-over from one sample to the next is minimized by the introduction of air 
bubbles and wash solution between samples. 

Centrifugal analyzers use plastic rotors which serve as reservoirs for samples and reagents and also as 
cuvettes for optical measurements. Spinning the plastic rotor mixes, incubates, and transports the test solution 
into the cuvette portion of the rotor, where the optical measurements are made while the rotor is spinning. 

Pack-based systems are those in which each test uses a special pack with the proper reagents and 
sample preservation devices built-in. The sample is automatically introduced into as many packs as tests 
required. The packs are then processed sequentially. 

Dry chemistry analyzers use no liquid reagents. The reagents and other sample preparation methods 
are layered onto a slide. The liquid sample is placed on the slide, and after a period of time the color 
developed is read by reflectance photometry. Ion-selective electrodes have been incorporated into the 
same slide format. 
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There are a number of technological innovations found in many of the automated instruments. One 
innovation is the use of fiberoptic bundles to channel excitation energy toward the sample as well as 
transmitted, reflected, or emitted light away from the sample to the detectors. This provides a great deal 
of flexibility in instrument layout. Multiwavelength analysis using a spinning filter wheel or diode array 
detectors is commonly found. The computers associated with these instruments allow for innovative 
improvements in the assays. For instance, when many analytes are being analyzed from one sample, the 
interference effects of one analyte on the measurement of another can be predicted and corrected before 
the final report is printed. 

76.8 Trends in Laboratory Instrumentation 

Predicting the future direction of laboratory instrumentation is difficult, but there seem to be some clear 
trends. Decentralization of the laboratory functions will continue with more instruments being located 
in or around ICUs, operating rooms, emergency rooms, and physician offices. More electrochemistry- 
based tests will be developed. The flame photometer is already being replaced with ion-selective electrode 
methods. Instruments which analyze whole blood rather than plasma or serum will reduce the amount 
of time required for sample preparation and will further encourage testing away from the central labo- 
ratory. Dry reagent methods increasingly will replace wet chemistry methods. Radioimmunoassays will 
continue to decline with the increasing use of methods for performing immunoassays that do not rely 
upon radioisotopes such as enzyme-linked fluorescent assays. 

Defining Terms 

Alpha radiation: Particulate radiation consisting of a helium nucleus emitted from a decaying anucleus. 
Amperometry: Measurements based on current flow produced in an electrochemical cell by an applied 

voltage. 
Beta radiation: Particulate radiation consisting of an electron or positron emitted from a decaying 

nucleus. 
Colligative properties: Physical properties that depend on the number of molecules present rather 

than on their individual properties. 
Gamma radiation: Electromagnetic radiation emitted from an atom undergoing nuclear decay. 
Hydrodynamic focusing: A process in which a fluid stream is first surrounded by a second fluid and 

then narrowed to a thin stream by a narrowing of the channel. 
Isotopes: Atoms with the same number of protons but differing numbers of neutrons. 
Plasma: The liquid portion of blood. 
Potentiometry: Measurement of the potential produced by electrochemical cells under equilibrium 

conditions with no current flow. 
Scintillation: The conversion of the kinetic energy of a charged particle or photon to a flash of light. 
Serum: The liquid portion of blood remaining after clotting has occurred. 
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The practical use of an implantable device for delivering a controlled, rhythmic electric stimulus to 
maintain the heartbeat is relatively recent: Cardiac pacemakers have been in clinical use only slightly 
more than 30 years. Although devices have gotten steadily smaller over this period (from 250 grams in 
1960 to 25 grams today), the technological evolution goes far beyond size alone. Early devices provided 
only single-chamber, asynchronous, nonprogrammable pacing coupled with questionable reliability and 
longevity. Today, advanced electronics afford dual-chamber multiprogrammability, diagnostic functions, 
rate response, data collection, and exceptional reliability, and lithium-iodine power sources extend 
longevity to upward of 10 years. Continual advances in a number of clinical, scientific, and engineering 
disciplines have so expanded the use of pacing that it now provides cost-effective benefits to an estimated 
350,000 patients worldwide each year. 

The modern pacing system is comprised of three distinct components: pulse generator, lead, and pro- 
grammer (Fig. 77.1). The pulse generator houses the battery and the circuitry which generates the stimulus 
and senses electrical activity. The lead is an insulated wire that carries the stimulus from the generator to 
the heart and relays intrinsic cardiac signals back to the generator. The programmer is a telemetry device 
used to provide two-way communications between the generator and the clinician. It can alter the therapy 
delivered by the pacemaker and retrieve diagnostic data that are essential for optimally titrating that therapy. 
Ultimately, the therapeutic success of the pacing prescription rests on the clinician's choice of an appropriate 
system, use of sound implant technique, and programming focused on patient outcomes. 

This chapter discusses in further detail the components of the modern pacing system and the significant 
evolution that has occurred since its inception. Our focus is on system design and operations, but we 
also briefly overview issues critical to successful clinical performance. 

77.1 Indications 

The decision to implant a permanent pacemaker for bradyarrhythmias usually is based on the major 
goals of symptom relief (at rest and with physical activity), restoration of functional capacity and quality 
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FIGURE 77.1 The pacing systems comprise a programmer, pulse generator, and lead. There are two programmers 
pictured above; one is portable, and the other is an office-based unit. 

of life, and reduced mortality. As with other healthcare technologies, appropriate use of pacing is the 
intent of indications guidelines established by Medicare and other third-party payors. 

In 1 984 and again in 1 99 1 , a joint commission of the American College of Cardiology and the American 
Heart Association established guidelines for pacemaker implantation [Committee on Pacemaker Implan- 
tation, 1991]. In general, pacing is indicated when there is a dramatic slowing of the heart rate or a failure 
in the connection between the atria and ventricles resulting in decreased cardiac output manifested by 
such symptoms as syncope, light-headedness, fatigue, and exercise intolerance. Failure of impulse for- 
mation and/or conduction is the overriding theme of all pacemaker indications. There are four categories 
of pacing indications: 

1. Heart block (e.g., complete heart block, symptomatic 2° AV block) 

2. Sick sinus syndrome (e.g., symptomatic bradycardia, sinus arrest, sinus exit block) 

3. Myocardial infarction (e.g., conduction disturbance related to the site of infarction) 

4. Hypersensitive carotid sinus syndrome (e.g., recurrent syncope) 

Within each of these four categories the ACC/AHA provided criteria for classifying a condition as group I 
(pacing is considered necessary), group II (pacing may be necessary), or group III (pacing is considered 
inappropriate). 

New indications for cardiac pacing are being evaluated under the jurisdiction of the Food and Drug 
Administration. For example, hypertrophic obstructive cardiomyopathy (HOCM) is one of these new 
potential indications, with researchers looking at dual-chamber pacing as a means of reducing left 
ventricular outflow obstruction. Though efforts in these areas are ongoing and expanding, for now they 
remain unapproved as standard indications for pacing. 

77.2 Pulse Generators 

The pulse generator contains a power source, output circuit, sensing circuit, and a timing circuit 
(Fig. 77.2). A telemetry coil is used to send and receive information between the generator and the 
programmer. Rate-adaptive pulse generators include the sensor components along with the circuit to 
process the information measured by the sensor. 

Modern pacemakers use CMOS circuit technology. One to 2 kilobytes of read-only memory (ROM) 
are used to direct the output and sensing circuits; 16-512 bytes of random-access memory (RAM) are 
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used to store diagnostic data. Some manufacturers offer 
fully RAM-based pulse generators, providing greater 
storage of diagnostic data and the flexibility for chang- 
ing feature sets after implantation. 

All components of the pulse generator are housed in 
a hermetically sealed titanium case with a connector 
block that accepts the lead(s). Because pacing leads are 
available with a variety of different connector sites and 
configurations, the pulse generator is available with an 
equal variety of connectors. The outer casing is laser- 
etched with the manufacturer, name, type (e.g., single- 
versus dual-chamber), model number, serial number, 
and the lead connection diagram for each identifica- 
tion. Once implanted, it may be necessary to use an 
x-ray to reveal the identity of the generator. Some man- 
ufacturers use radiopaque symbols and ID codes for 
this purpose, whereas others give their generators char- 
acteristic shapes. 




FIGURE 77.2 Internal view of pulse generator. 



Sensing Circuit 

Pulse generators have two basic functions, pacing and sensing. Sensing refers to the recognition of an 
appropriate signal by the pulse generator. This signal is the intrinsic cardiac depolarization from the 
chamber or chambers in which the leads are placed. It is imperative for the sensing circuit to discriminate 
between these intracardiac signals and unwanted electrical interference such as far-field cardiac events, 
diastolic potentials, skeletal muscle contraction, and pacing stimuli. An intracardiac electrogram 
(Fig. 77.3) shows the waveform as seen by the pacemaker; it is typically quite different from the corre- 
sponding event as shown on the surface ECG. 

Sensing (and pacing) is accomplished with one of two configurations, bipolar and unipolar. In bipolar, 
the anode and cathode are close together, with the anode at the tip of the lead and the cathode a ring 
electrode about 2 cm proximal to the tip. In unipolar, the anode and cathode may be 5-10 cm apart. The 
anode is at the lead tip and the cathode is the pulse generator itself (usually located in the pectoral region). 



CHART SPEED 25.0 mm/s 




A-Pace x V-Sense 



FIGURE 77.3 The surface ECG (ECG LEAD II) represents the sum total of the electrical potentials of all depolarizing 
tissue. The intracardiac electrogram (V EGM) shows only the potentials measured between the lead electrodes. This 
allows the evaluation of signals that may be hidden within the surface ECG. 
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FIGURE 77.4 This is a conceptual depiction of the bandpass filter demonstrating the typical filtering of unwanted 
signals by discriminating between those with slew rates that are too low and/or too high. 

In general, bipolar and unipolar sensing configurations have equal performance. A drawback of the 
unipolar approach is the increased possibility of sensing noncardiac signals: The large electrode separation 
may, for example, sense myopotentials from skeletal muscle movement, leading to inappropriate inhibi- 
tion of pacing. Many newer pacemakers can be programmed to sense or pace in either configuration. 

Once the electrogram enters the sensing circuit, it is scrutinized by a bandpass filter (Fig. 77.4). The 
frequency of an R-wave is 10 to 30 Hz. The center frequency of most sensing amplifiers is 30 Hz. T- waves 
are slower, broad signals that are composed of lower frequencies (approximately 5 Hz or less). Far-field 
signals are also lower-frequency signals, whereas skeletal muscle falls in the range of 10-200 Hz. 

At the implant, the voltage amplitude of the P-wave (and the P-wave, in the case of dual-chamber 
pacing) is measured to ensure the availability on an adequate signal. P-wave amplitudes are typically 
5-25 mV, and P-wave amplitudes are 2-6 mV. The signals passing through the sense amplifier are 
compared to an adjustable reference voltage called the sensitivity. Any signal below the reference voltage 
is not sensed, and those above it are sensed. Higher-sensitivity settings (high-reference voltage) may lead 
to substandard sensing, and a lower reference voltage may result in oversensing. A minimum 2:1 safety 
margin should be maintained between the sensitivity setting and the amplitude of the intracardiac signal. 
The circuit is protected from extremely high voltages by a Zener diode. 

The slope of the signal is also surveyed by the sensing circuit and is determined by the slew rate (the 
time rate of change in voltage). A slew rate that is too flat or too steep may be eliminated by the bandpass 
filter. On the average, the slew rate measured at implant should be between 0.75 and 2.50 V/s. 

The last line of defense in an effort to remove undesirable signals is to "blind" the circuit at specific 
times during the cardiac cycle. This is accomplished with blanking and refractory periods. Some of these 
periods are programmable. During the blanking period the sensing circuit is turned off, and during the 
refractory period the circuit can see the signal but does not initiate any of the basic timing intervals. 
Virtually all paced and sensed events begin concurrent blanking and refractory periods, typically ranging 
from 10-400 ms. These are especially helpful in dual-chamber pacemakers where there exists the potential 



(D 2000 by CRC Press LLC 



for the pacing output of the atrial side to inhibit the ventricular pacing output, with dangerous conse- 
quences for patients in complete heart block. 

Probably the most common question asked by the general public about pacing systems is the effect 
of electromagnetic interference (EMI) on their operation. EMI outside of the hospital is an infrequent 
problem, though patients are advised to avoid such sources of strong electromagnetic fields as arc welders, 
high-voltage generators, and radar antennae. Some clinicians suggest that patients avoid standing near 
antitheft devices used in retail stores. Airport screening devices are generally safe, though they may detect 
a pacemaker's metal case. Microwave ovens, ham radio equipment, video games, computers, and office 
equipment rarely interfere with the operation of modern pacemakers. A number of medical devices and 
procedures may on occasion do so, however; electrocautery, cardioversion and defibrillation, MRI, 
lithotripsy, diathermy, TENS units, and radiation therapy. 

Pacemakers affected by interference typically respond with temporary loss of output or temporary 
reversion to asynchronous pacing (pacing at a fixed rate, with no inhibition from intrinsic cardiac events). 
The usual consequence for the patient is a return of the symptoms that originally led to the pacemaker 
implant. 

Output Circuit 

Pacing is the most significant drain on the pulse generator power source. Therefore, current drain must 
be minimized while maintaining an adequate safety margin between the stimulation threshold and the 
programmed output stimulus. Modern permanent pulse generators use constant voltage. The voltage 
remains at the programmed value while current fluctuates in relation to the source impedance. 

Output energy is controlled by two programmable parameters, pulse amplitude and pulse duration. 
Pulse amplitudes range from 0.8-5 V and, in some generators, can be as high as 10 V (used for trouble- 
shooting or for pediatric patients). Pulse duration can range from 0.05-1.5 ms. The prudent selection 
of these parameters will greatly influence the longevity of the pulse generator. 

The output pulse is generated from the discharge of a capacitor charged by the battery. Most modern 
pulse generators contain a 2.8 V battery. The higher voltages are achieved using voltage multipliers 
(smaller capacitors used to charge the large capacitor). The voltage can be doubled by charging two 
smaller capacitors in parallel, with the discharge delivered to the output capacitor in series. Output pulses 
are emitted at a rate controlled by the timing circuit; output is commonly inhibited by sensed cardiac 
signals. 

Timing Circuit 

The timing circuit regulates such parameters as the pacing cycle length, refractory and blanking periods, 
pulse duration, and specific timing intervals between atrial and ventricular events. A crystal oscillator 
generating frequencies in the kHz range sends a signal to a digital timing and logic control circuit, which 
in turn operates internally generated clocks at divisions of the oscillatory frequency. 

A rate-limiting circuit is incorporated into the timing circuit to prevent the pacing rate from exceeding 
an upper limit should a random component failure occur (an extremely rare event). This is also referred 
to as "runaway" protection and is typically 180-200 ppm. 

Telemetry Circuit 

Today's pulse generators are capable of both transmitting information from an RF antenna and receiving 
information with an RF decoder. This two-way communication occurs between the pulse generator and 
the programmer at approximately 300 Hz. Real-time telemetry is the term used to describe the ability 
of the pulse generator to provide information such as pulse amplitude, pulse duration, lead impedance, 
battery impedance, lead current, charge, and energy. The programmer, in turn, delivers coded messages 
to the pulse generator to alter any of the programmable features and to retrieve diagnostic data. Coding 
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requirements reduce the likelihood of inappropriate programming alterations by environmental sources 
of radiofrequency and magnetic fields. It is also prevents the improper use of programmers from other 
manufacturers. 

Power Source 

Over the years, a number of different battery technologies have been tried, including mercury-zinc, 
rechargeable silver-modified-mercuric-oxide-zinc, rechargeable nickel-cadmium, radioactive plutonium 
or promefhium, and lithium with a variety of different cathodes. Lithium-cupric-sulfide and mercury- 
zinc batteries were associated with corrosion and early failure. Mercury-zinc produced hydrogen gas as 
a by-product of the battery reaction; the venting required made it impossible to hermetically seal the 
generator. This led to fluid infiltration followed by the risk of sudden failure. 

The longevity of very early pulse generators was measured in hours. With the lithium-iodide technology 
now used, longevity has been reported as high as 15 years. The clinical desire to have a generator that is 
small and full-featured yet also long-lasting poses a formidable challenge to battery designers. One 
response by manufacturers has been to offer different models of generators, each offering a different 
balance between therapy, size, and longevity. Typical battery capacity is in the range of 0.8-3.0 amp-hours. 

Many factors affect longevity, including pulse amplitude and duration, pacing rate, single- versus dual- 
chamber pacing, degree to which the patient uses the pacemaker, lead design, and static current drain 
from the sensing circuits. Improvements in lead design are often overlooked as a factor in improving 
longevity, but electrodes used in 1960 required a pulse generator output of 675 uj for effective stimulation, 
whereas the electrodes of the 1990s need only 3-6 uj. 

Another important factor in battery design lies in the electrolyte that separates the anode and the 
cathode. The semisolid layer of lithium iodide that is used gradually thickens over the life of the cell, 
increasing the internal resistance of the battery. The voltage produced by lithium-iodine batteries is 
inversely related to this resistance and is linear from 2.8 V to approximately 2.4 V, representing about 
90% of the usable battery life. It then declines exponentially to 1.8 V as the internal battery resistance 
increases from 10,000 Q to 40,000 Q (Fig. 77.5). 

When the battery reaches between 2.0 and 2.4 V (depending on the manufacturer), certain functions 
of the pulse generator are altered so as to alert the clinician. These alterations are called the elective- 
replacement indicators (ERI). They vary from one pulse generator to another and include signature 
decreases in rate, a change to a specific pacing mode, pulse duration stretching, and the telemetered 
battery voltage. When the battery voltage reaches 1.8 V, the pulse generator may operate erratically or 
cease to function and is said to have reached "end of life." The time period between appearance of the 
ERI and end-of-life status averages about 3 to 4 months. 
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FIGURE 77.5 The initial decline in battery voltage is slow and then more rapid after the battery reaches the ERI 
voltage. An important aspect of battery design is the predictability of this decline so that timely generator replacement 
is anticipated. 
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FIGURE 77.6 The four major lead components. 



77.3 Leads 



Implantable pacing leads must be designed not only for consistent performance within the hostile 
environment of the body but also for easy handling by the implanting physician. Every lead has four 
major components (Fig. 77.6): the electrode, the conductor, the insulation, and the connector pin(s). 

The electrode is located at the tip of the lead and is in direct contact with the myocardium. Bipolar 
leads have a tip electrode and a ring electrode (located about 2 cm proximal to the tip); unipolar leads 
have tip electrodes only. A small-radius electrode provides increased current density resulting in lower 
stimulation thresholds. The electrode also increases resistance at the electrode-myocardial interface, thus 
lowering the current drain further and improving battery longevity. The radius of most electrodes is 
6-8 mm 2 , though there are clinical trials underway using a "high-impedance" lead with a tip radius as 
low as 1.5 mm 2 . 

Small electrodes, however, historically have been associated with inferior sensing performance. Lead 
designers were able to achieve both good pacing and good sensing by creating porous-tip electrodes 
containing thousands of pores in the 20-100 urn range. The pores allow the ingrowth of tissue, resulting 
in the necessary increase in effective sensing area while maintaining a small pacing area. Some commonly 
used electrode materials include platinum-iridium. Elgiloy (an alloy of cobalt, iron, chromium, molyb- 
denum, nickel, and manganese), platinum coated with platinized titanium, and vitreous or pyrolytic 
carbon coating a titanium or graphite core. 

Another major breakthrough in lead design is the steroid- eluting electrode. About 1 mg of a corticos- 
teroid (dexamethasone sodium phosphate) is contained in a silicone core that is surrounded by the 
electrode material (Fig. 77.7). The "leaking" of the steroid into the myocardium occurs slowly over several 
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FIGURE 77.7 The steroid elution electrode. 
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years and reduces the inflammation that results from the lead placement. It also retards the growth of 
the fibrous sack that forms around the electrode which separates it from viable myocardium. As a result, 
the dramatic rise in acute thresholds that is seen with nonsteroid leads over the 8 to 16 weeks postimplant 
is nearly eliminated. This makes it possible to program a lower pacing output, further extending longevity. 

Once a lead has been implanted, it must remain stable (or fixated). The fixation device is either active 
or passive. The active fixation leads incorporate corkscrew mechanisms, barbs, or hooks to attach them- 
selves to the myocardium. The passive fixation leads are held into place with tines that become entangled 
into the netlike lining (trabeculae) of the heart. Passive leads generally have better acute pacing and 
sensing performance but are difficult to remove chronically. Active leads are easier to remove chronically 
and have the advantage of unlimited placement sites. Some implanters prefer to use active-fixation leads 
in the atrium and passive-fixation leads in the ventricle. 

The conductor carries electric signals to the pulse generator and delivers the pacing pulses to the heart. 
It must be strong and flexible to withstand the repeated flexing stress placed on it by the beating heart. 
The early conductors were a single, straight wire that was vulnerable to fracturing. They have evolved 
into coiled (for increased flexibility) multifilar (to prevent complete failure with partial fractures) con- 
ductors. The conductor material is a nickel alloy called MP35N. Because of the need for two conductors, 
bipolar leads are usually larger in diameter than unipolar leads. Current bipolar leads have a coaxial 
design that has significantly reduced the diameter of bipolar leads. 

Insulation materials (typically silicone and polyurethane) are used to isolate the conductor. Silicone 
has a longer history and the exclusive advantage of being repairable. Because of low tear strength, however, 
silicone leads tend to be thicker than polyurethane leads. Another relative disadvantage of silicone is its 
high coefficient of friction in blood, which makes it difficult for two leads to pass through the same vein. 
A coating applied to silicone leads during manufacturing has diminished this problem. 

A variety of generator-lead connector configurations and adapters are available. Because incompati- 
bility can result in disturbed (or even lost) pacing and sensing, an international standards (IS-1) has 
been developed in an attempt to minimize incompatibility. 

Leads can be implanted epicardially and endocardially. Epicardial leads are placed on the outer surface 
of the heart and require the surgical exposure of a small portion of the heart. They are used when venous 
occlusion makes it impossible to pass a lead transvenously, when abdominal placement of the pulse 
generator is needed (as in the case of radiation therapy to the pectoral area), or in children (to allow for 
growth). Endocardialleads are more common and perform better in the long term. These leads are passed 
through the venous system and into the right side of the heart. The subclavian or cephalic veins in the 
pectoral region are common entry sites. Positioning is facilitated by a thin, firm wire stylet that passes 
through the central lumen of the lead, stiffening it. Fluoroscopy is used to visualize lead positioning and 
to confirm the desired location. 

Manufacturers are very sensitive to the performance reliability of the leads. Steady improvements in 
materials, design, manufacturing, and implant technique have led to reliability well in excess of 99% over 
3-year periods. 

77.4 Programmers 

Noninvasive reversible alteration of the functional parameters of the pacemaker is critical to ongoing 
clinical management. For a pacing system to remain effective throughout its lifetime, it must be able to 
adjust to the patient's changing needs. The programmer is the primary clinical tool for changing settings, 
for retrieving diagnostic data, and for conducting noninvasive tests. 

The pacing rate for programmable pacemakers of the early 1960s was adjusted via a Keith needle 
manipulated percutaneously into a knob on the side of the pacemaker; rotating the needle changed the 
pacing rate. Through the late 1960s and early 1970s, magnetically attuned reed switches in the pulse 
generator made it possible to noninvasively change certain parameters such as rate, output, sensitivity, 
and polarity. The application of a magnet could alter the parameters which were usually limited to only 
one of two choices. It wasn't until the late 1970s, when radiofrequency energy was incorporated as the 
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transmitter of information, that programmability began to realize its full potential. Radiofrequency 
transmission is faster, provides bidirectional telemetry, and decreases the possibility of unintended pro- 
gramming from inappropriate sources. 

Most manufacturers today are moving away from a dedicated proprietary instrument and toward a 
PC-based design. The newer designs are generally more flexible, more intuitive to use, and more easily 
updated when new devices are released. Manufacturers and clinicians alike are becoming more sensitive 
to the role that time-efficient programming can play in the productivity of pacing clinics, which may 
provide follow-up for as many as 500-1000 patients a year. 

77.5 System Operation 

Pacemakers have gotten steadily more powerful over the last three decades, but at the cost of steadily 
greater complexity. Manufacturers have come to realize the challenge that this poses for busy clinicians 
and have responded with a variety of interpretive aids (Fig. 77.8). 

Much of the apparent complexity of the timing rules that determine pacemaker operation is due to a 
design goal of mimicking normal cardiac function without interfering with it. One example is the dual- 
chamber feature that provides sequential stimulation of the atrium before the ventricle. 

Another example is rate response, designed for patients who lack the normal ability to increase their 
heart rate in response to a variety of physical conditions (e.g., exercise). Introduced in the mid-1980s, 
rate-responsive systems use some sort of sensor to measure the change in a physical variable correlated 
to heart rate. The sensor output is signal-processed and then used by the output circuit to specify a target 
pacing rate. The clinician controls the aggressiveness of the rate increase through a variety of parameters 
(including a choice of transfer function); pacemaker- resident diagnostics provide data helpful in titrating 
the rate-response therapy. 

The most common sensor is the activity sensor, which uses piezoelectric materials to detect vibrations 
caused by body movement. Systems using a transthoracic-impedance sensor to estimate pulmonary 
minute ventilation are also commercially available. Numerous other sensors (e.g., stroke volume, blood 



Paced events 
Sensed events 



AV timing 



Refractory sensed events 
Activity events 



Refractory period (open) 
Blanking period (closed) 



PVC PVC 
Safe Safety pacing 
S 



"X Scheduled A pace 



VA timing 



Scheduled V pace 




FIGURE 77.8 The Marker Channel Diagram is just one tool that makes interpretation of the ECG strip faster and 
more reliable for the clinician. It allows quick checking of the timing operations of the system. 
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TABLE 77.1 The NASPE/NPEG Code 



Position 


I 


11 


III 


IV 


V 


Category 


Chamber(s) paced 


Chamber(s) 


Response to 


Programmability rate 


Antitachyarrhythmia 






sensed 


sensing 


modulation 


function(s) 




O = None 


O = None 


O = None 


O = None 


O = None 




A = Atrium 


A = Atrium 


T = Triggered 


P = Simple programmable 


P = Packing 




V = Ventricle 


V = Ventricle 


I = Inhibited 


M = Multiprogrammable 


S = Shock 




D = Dual (A+V) 


D = Dual (A+V) 


D = Dual (T+I) 


C = Communicating 
R = Rate modulation 


D = Dual (P+S) 


Manufacturers' 


S = Single (A or V) 


S = Single (A or 








designation 




V) 








only 













Note: Positions I through III are used exclusively for antibradyarrhythmia function. (From Bernstein AD, et al., PACE, Vol. 10, 
July-Aug. 1987.) 

temperature or pH, oxygen saturation, preejection interval, right ventricular pressure) are in various 
stages of clinical research or have been market released outside the United States. Some of these systems 
are dual-sensor, combining the best features of each sensor in a single pacing system. 

To make it easier to understand the gross-level system operation of modern pacemakers, a five-letter 
code has been developed by the North American Society of Pacing and Electrophysiology and the British 
Pacing and Electrophysiology Group [Bernstein et al., 1987]. The first letter indicates the chamber (or 
chambers) that are paced. The second letter reveals those chambers in which sensing takes place, and the 
third letter describes how the pacemaker will respond to a sensed event. The pacemaker will "inhibit" 
the pacing output when intrinsic activity is sensed or will "trigger" a pacing output based on a specific 
previously sensed event. For example, in DDD mode: 

D: Pacing takes place in the atrium and the ventricle. 

D: Sensing takes place in the atrium and the ventricle. 

D: Both inhibition and triggering are the response to a sensed event. An atrial output is inhibited with 

an atrial-sensed event, whereas a ventricular output is inhibited with a ventricular-sensed event; 

a ventricular pacing output is triggered by an atrial-sensed event (assuming no ventricular event 

occurs during the A-V interval). 

The fourth letter in the code is intended to reflect the degree of programmability of the pacemaker 
but is typically used to indicate that the device can provide rate response. For example, a DDDR device 
is one that is programmed to pace and sense in both chambers and is capable of sensor-driven rate 
variability. The fifth letter is reserved specifically for antitachycardia functions (Table 77.1). 

Readers interested in the intriguing details of pacemaker timing operations are referred to the works 
listed at the end of this chapter. 

77.6 Clinical Outcomes and Cost Implications 

The demonstrable hemodynamic and symptomatic benefits provided by rate-responsive and dual-cham- 
ber pacing have led U.S. physicians to include at least one of these features in over three-fourths of 
implants in recent years. Also, new prospective data [Andersen et al., 1993] support a hypothesis inves- 
tigated retrospectively since the mid-1980s: namely, that pacing the atrium in patients with sinus node 
dysfunction can dramatically reduce the incidence of such life-threatening complications as congestive 
heart failure and stroke associated with chronic atrial fibrillation. Preliminary analysis of the cost impli- 
cations suggest that dual-chamber pacing is significantly cheaper to the U.S. healthcare system than is 
single-chamber pacing over the full course of therapy, despite the somewhat higher initial cost of implant- 
ing the dual-chamber system. 
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77.7 Conclusion 

Permanent cardiac pacing is the beneficiary of three decades of advances in a variety of key technologies: 
biomaterials, electrical stimulation, sensing of bioelectrical events, power sources, microelectronics, trans- 
ducers, signal analysis, and software development. These advances, informed and guided by a wealth of 
clinical experience acquired during that time, have made pacing a cost-effective cornerstone of cardiac 
arrhythmia management. 

Defining Terms 

Atrial fibrillation: An atrial arrhythmia resulting in chaotic current flow within the atria. The effective 

contraction of the atria is lost, allowing blood to pool and clot, leading to stroke if untreated. 
Battery capacity: Given by the voltage and the current delivery. The voltage is a result of the battery 

chemistry, and current delivery (current x time) is measured in ampere hours and is related to 

battery size. 
CMOS circuit: Abbreviation for complementary metallic oxide semiconductor, which is a form of 

semiconductor often used in pacemaker technology. 
Congestive heart failure: The pathophysiologic state in which an abnormality of cardiac function is 

responsible for the failure of the heart to pump blood at a rate commensurate with the requirements 

of the body. 
Endocardium: The inner lining of the heart. 
Epicardium: The outer lining of the heart. 
Hermeticity: The term, as used in the pacemaker industry, refers to a very low rate of helium gas 

leakage from the sealed pacemaker container. This reduces the chance of fluid intruding into the 

pacemaker generator and causing damage. 
Hypertrophic obstructive cardiomyopathy: A disease of the myocardium characterized by thickening 

(hypertrophy) of the interventricular septum, resulting in the partial obstruction of blood from 

the left ventricle. 
Minute ventilation: Respiratory rate x tidal volume (the amount of air taken in with each breath) = 

minute ventilation. This parameter is used as a biologic indicator for rate-adaptive pacing. 
Mode: The type of pacemaker response to the patient's intrinsic heartbeat. The three commonly used 

modes are asynchronous, demand, and triggered. 
Programmable: The ability to alter the pacemaker settings noninvasively. A variety of selections exist, 

each with its own designation. 
Rate-adaptive: The ability to change the pacemaker stimulation interval caused by sensing a physiologic 

function other than the intrinsic atrial rhythm. 
Sensitivity: A programmable setting that adjusts the reference voltage to which signals entering the 

sensing circuit are compared for filtering. 
Stimulation threshold: The minimum output energy required to consistently "capture" (cause depo- 
larization) of the heart. 
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Further Information 

A good basic introduction to pacing from a clinical perspective is the third edition of A Practical Guide 
to Cardiac Pacing by H. Weston Moses, Joel Schneider, Brain Miller, and George Taylor (Little, Brown, 
1991). 

Cardiac Pacing (Blackwell Scientific, 1992), edited by Kenneth Ellenbogen, is an excellent intermediate 
treatment of pacing. The treatments of timing cycles and troubleshooting are especially good. 

In-depth discussion of a wide range of pacing topics is provided by the third edition of A Practice of 
Cardiac Pacing by Seymour Furman, David Hayes, and David Holmes (Futura, 1993), and by New 
Perspectives in Cardiac Pacing 3, edited by Serge Barold and Jacques Mugica (Futura, 1993). 

Detailed treatment of rate-responsive pacing is given in Rate-Adaptive Cardiac Pacing: Single and Dual 
Chamberbj Chu-Pak Lau (Futura, 1993), and in Rate-Adaptive Pacing, edited by David Benditt (Blackwell 
Scientific, 1993). 

The Foundations of Cardiac Pacing, Part Iby Richard Sutton and Ivan Bourgeois (Futura, 1991) contains 
excellent illustrations of implantation techniques. 

Readers seeking a historical perspective may wish to consult "Pacemakers, Pastmakers, and the Paced: 
An Informal History from A to Z," by Dwight Harken in the July/ August 1991 issue of Biomedical 
Instrumentation and Technology. 

PACE is the official journal of the North American Society of Pacing and Electrophysiology (NASPE) 
and of the International Cardiac Pacing and Electrophysiology Society. It is published monthly by Futura 
Publishing (135 Bedford Road, PO Box 418, Armonk, NY 10504 USA). 
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78.1 Functional Electrical Stimulation 



Implantable stimulators for neuromuscular control are the technologically most advanced versions of 
functional electrical stimulators. Their function is to generate contraction of muscles, which cannot be 
controlled volitionally because of the damage or dysfunction in the neural paths of the central nervous 
system (CNS). Their operation is based on the electrical nature of conducting information within nerve 
fibers, from the neuron cell body (soma), along the axon, where a travelling action potential is the carrier 
of excitation. While the action potential is naturally generated chemically in the head of the axon, it may 
also be generated artificially by depolarizing the neuron membrane with an electrical pulse. A train of 
electrical impulses with certain amplitude, width, and repetition rate, applied to a muscle innervating 
nerve (a motor neuron) will cause the muscle to contract, very much like in natural excitation. Similarly, 
a train of electrical pulses applied to the muscular tissue close to the motor point will cause muscle 
contraction by stimulating the muscle through the neural structures at the motor point. 

78.2 Technology for Delivering Stimulation Pulses 
to Excitable Tissue 

A practical system used to stimulate a nerve consists of three components: ( 1 ) a pulse generator to generate 
a train of pulses capable of depolarizing the nerve, (2) a lead wire, the function of which is to deliver the 
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pulses to the stimulation site, and (3) an electrode, which delivers the stimulation pulses to the excitable 
tissue in a safe and efficient manner. 

In terms of location of the above three components of an electrical stimulator, stimulation technology 
can be described in the following terms: 

Surface or transcutaneous stimulation, where all three components are outside the body and the 
electrodes are placed on the skin above or near the motor point of the muscle to be stimulated. 
This method has been used extensively in medical rehabilitation of nerve and muscle. Therapeu- 
tically, it has been used to prevent atrophy of paralyzed muscles, to condition paralyzed muscles 
before the application of functional stimulation, and to generally increase the muscle bulk. As a 
functional tool, it has been used in rehabilitation of plegic and paretic patients. Surface systems 
for functional stimulation have been developed to correct drop-foot condition in hemiplegic 
individuals [Liberson, 1961], for hand control [Rebersek, 1973], and for standing and stepping in 
individuals with paralysis of the lower extremities [Kralj and Bajd, 1989]. This fundamental 
technology was commercialized by Sigmedics, Inc. [Graupe, 1998]. The inability of surface stim- 
ulation to reliably excite the underlying tissue in a repeatable manner and to selectively stimulate 
deep muscles has limited the clinical applicability of surface stimulation. 

Percutaneous stimulation employs electrodes which are positioned inside the body close to the struc- 
tures to be stimulated. Their lead wires permanently penetrate the skin to be connected to the 
external pulse generator. State of the art embodiments of percutaneous electrodes utilize a small- 
diameter insulated stainless steel lead that is passed through the skin. The electrode structure is 
formed by removal of the insulation from the lead and subsequent modification to ensure stability 
within the tissue. This modification includes forming barbs or similar anchoring mechanisms. 
The percutaneous electrode is implanted using a hypodermic needle as a trochar for introduction. 
As the needle is withdrawn, the anchor at the electrode tip is engaged into the surrounding tissue 
and remains in the tissue. A connector at the skin surface, next to the skin penetration point, joins 
the percutaneous electrode lead to the hardwired external stimulator. The penetration site has to 
be maintained and care must be taken to avoid physical damage of the lead wires. In the past, this 
technology has helped develop the existing implantable systems, and it may be used for short and 
long term, albeit not permanent, stimulation applications [Memberg, 1993; Marsolais, 1986]. 

The term implantable stimulation refers to stimulation systems in which all three components, pulse 
generator, lead wires, and electrodes, are permanently surgically implanted into the body and the 
skin is solidly closed after the implantation procedure. Any interaction between the implantable part 
and the outside world is performed using telemetry principles in a contact-less fashion. This chapter 
is focused on implantable neuromuscular stimulators, which will be discussed in more detail. 

78.3 Stimulation Parameters 

In functional electrical stimulation, the typical stimulation waveform is a train of rectangular pulses. 
This shape is used because of its effectiveness as well as relative ease of generation. All three parameters 
of a stimulation train, i.e., frequency, amplitude, and pulse-width, have effect on muscle contraction. 
Generally, the stimulation frequency is kept as low as possible, to prevent muscle fatigue and to conserve 
stimulation energy. The determining factor is the muscle fusion frequency at which a smooth muscle 
response is obtained. This frequency varies; however, it can be as low as 12 to 14 Hz and as high as 50 Hz. 
In most cases, the stimulation frequency is kept constant for a certain application. This is true both for 
surface as well as implanted electrodes. 

With surface electrodes, the common way of modulating muscle force is by varying the stimulation 
pulse amplitude at a constant frequency and pulse width. The stimulation amplitudes may be as low as 
25 V at 200 us for the stimulation of the peroneal nerve and as high as 120 V or more at 300 (is for 
activation of large muscles such as the gluteus maximus. 
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In implantable stimulators and electrodes, the stimulation parameters greatly depend on the implan- 
tation site. When the electrodes are positioned on or around the target nerve, the stimulation amplitudes 
are on the order of a few milliamperes or less. Electrodes positioned on the muscle surface (epimysial 
electrodes) or in the muscle itself (intramuscular electrodes), employ up to ten times higher amplitudes. 
For muscle force control, implantable stimulators rely either on pulse-width modulation or amplitude 
modulation. For example, in upper extremity applications, the current amplitude is usually a fixed 
paramter set to 16 or 20 mA, while the muscle force is modulated with pulse-widths within to 200 us. 

78.4 Implantable Neuromuscular Stimulators 

Implantable stimulation systems use an encapsulated pulse generator that is surgically implanted and has 
subcutaneous leads that terminate at electrodes on or near the desired nerves. In low power consumption 
applications such as the cardiac pacemaker, a primary battery power source is included in the pulse 
generator case. When the battery is close to depletion, the pulse generator has to be surgically replaced. 

Most implantable systems for neuromuscular application consist of an external and an implanted 
component. Between the two, an inductive radio-frequency link is established, consisting of two tightly 
coupled resonant coils. The link allows transmission of power and information, through the skin, from 
the external device to the implanted pulse generator. In more advanced systems, a back-telemetry link is 
also established, allowing transmission of data outwards, from the implanted to the external component. 

Ideally, implantable stimulators for neuromuscular control would be stand alone, totally implanted 
devices with an internal power source and integrated sensors detecting desired movements from the 
motor cortex and delivering stimulation sequences to appropriate muscles, thus bypassing the neural 
damage. At the present developmental stage, they still need a control source and an external controller 
to provide power and stimulation information. The control source may be either operator driven, 
controlled by the user, or triggered by an event such as the heel-strike phase of the gait cycle. Figure 78.1 
depicts a neuromuscular prosthesis developed at the Case Western Reserve University (CWRU) and 
Cleveland Veterans Affairs Medical Center for the restoration of hand functions using an implantable 
neuromuscular stimulator. In this application, the patient uses the shoulder motion to control opening 
and closing of the hand. 

The internal electronic structure of an implantable neuromuscular stimulator is shown in Fig. 78.2. 
It consists of receiving and data retrieval circuits, power supply, data processing circuits, and output stages. 
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FIGURE 78.1 Implanted FES hand grasp system. 
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FIGURE 78.2 Block diagram of an implantable neuromuscular stimulator. 

Receiving Circuit 

The stimulator's receiving circuit is an LC circuit tuned to the resonating frequency of the external 
transmitter, followed by a rectifier. Its task is to provide the raw DC power from the received rf signal and 
at the same time allow extraction of stimulation information embedded in the rf carrier. There are various 
encoding schemes allowing simultaneous transmission of power and information into an implantable 
electronic device. They include amplitude and frequency modulation with different modulation indexes 
as well as different versions of digital encoding such as Manchester encoding where the information is 
hidden in a logic value transition position rather than the logic value itself. Synchronous and asynchronous 
clock signals may be extracted from the modulated carrier to drive the implant's logic circuits. 

The use of radiofrequency transmission for medical devices is regulated and in most countries limited 
to certain frequencies and radiation powers. (In the U.S., the use of the rf space is regulated by the Federal 
Communication Commission [FCC]). Limited rf transmission powers as well as conservation of power 
in battery operated external controllers dictate high coupling efficiencies between the transmitting and 
receiving antennas. Optimal coupling parameters cannot be uniformly defined; they depend on appli- 
cation particularities and design strategies. 

Power Supply 

The amount of power delivered into an implanted electronic package depends on the coupling between 
the transmitting and the receiving coil. The coupling is dependent on the distance as well as the alignment 
between the coils. The power supply circuits must compensate for the variations in distance for different 
users as well as for the alignment variations due to skin movements and consequent changes in relative 
coil-to-coil position during daily usage. The power dissipated on power supply circuits must not raise 
the overall implant case temperature. 

In implantable stimulators that require stimulation voltages in excess of the electronics power supply 
voltages (20 to 30 V), the stimulation voltage can be provided directly through the receiving coil. In that 
case, voltage regulators must be used to provide the electronics supply voltage (usually 5 V), which heavily 
taxes the external power transmitter and increases the implant internal power dissipation. 

Data Retrieval 

Data retrieval technique depends on the data-encoding scheme and is closely related to power supply 
circuits and implant power consumption. Most commonly, amplitude modulation is used to encode the 
in-going data stream. As the high quality factor of resonant LC circuits increases the efficiency of power 
transmission, it also effectively reduces the transmission bandwidth and therefore the transmission data 
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rate. Also, high quality circuits are difficult to amplitude modulate since they tend to continue oscillating 
even with power removed. This has to be taken into account when designing the communication link 
in particular for the start-up situation when the implanted device does not use the power for stimulation 
and therefore loads the transmitter side less heavily, resulting in narrower and higher resonant curves. 
The load on the receiving coil may also affect the low pass filtering of the received rf signal. 

Modulation index (m) or depth of modulation affects the overall energy transfer into the implant. At 
a given rf signal amplitude, less energy is transferred into the implanted device when 100% modulation 
is used (m = 1) as compared to 10% modulation (m = 0.053). However, retrieval of 100% modulated 
signal is much easier than retrieval of a 10% modulated signal. 

Data Processing 

Once the information signal has been satisfactorily retrieved and reconstructed into logic voltage levels, 
it is ready for logic processing. For synchronous data processing a clock signal is required. It can be 
generated locally within the implant device, reconstructed from the incoming data stream, or can be 
derived from the rf carrier. A crystal has to be used with a local oscillator to assure stable clock frequency. 
Local oscillator allows for asynchronous data transmission. Synchronous transmission is best achieved 
using Manchester data encoding. Decoding of Manchester encoded data recovers the original clock signal, 
which was used during data encoding. Another method is using the downscaled rf carrier signal as the 
clock source. In this case, the information signal has to be synchronized with the rf carrier. Of course, 
100% modulation scheme cannot be used with carrier-based clock signal. Complex command structure 
used in multichannel stimulators requires intensive data decoding and processing and consequently 
extensive electronic circuitry. Custom-made, application specific circuits (ASIC) are commonly used to 
minimize the space requirements and optimize the circuit performance. 

Output Stage 

The output stage forms stimulation pulses and defines their electrical characteristics. Even though a mere 
rectangular pulse can depolarize a nervous membrane, such pulses are not used in clinical practice due 
to their noxious effect on the tissue and stimulating electrodes. These effects can be significantly reduced 
by charge balanced stimulating pulses where the cathodic stimulation pulse is followed by an anodic 
pulse containing the same electrical charge, which reverses the electrochemical effects of the cathodic 
pulse. Charge balanced waveforms can be assured by capacitive coupling between the pulse generator 
and stimulation electrodes. Charge balanced stimulation pulses include symmetrical and asymmetrical 
waveforms with anodic phase immediately following the cathodic pulse or being delayed by a short, 20 to 
60 us interval. 

The output stages of most implantable neuromuscular stimulators have constant current characteris- 
tics, meaning that the output current is independent on the electrode or tissue impedance. Practically, 
the constant current characteristics ensure that the same current flows through the excitable tissues 
regardless of the changes that may occur on the electrode-tissue interface, such as the growth of fibrous 
tissue around the electrodes. Constant current output stage can deliver constant current only within the 
supply voltage — compliance voltage. In neuromuscular stimulation, with the electrode impedance being 
on the order of 1 kD, and the stimulating currents in the order of 20 mA, the compliance voltage must 
be above 20 V. Considering the voltage drops and losses across electronic components, the compliance 
voltage of the output stage may have to be as high as 33 V. 

The stimulus may be applied through either monopolar or bipolar electrodes. The monopolar electrode 
is one in which a single active electrode is placed near the excitable nerve and the return electrode is 
placed remotely, generally at the implantable unit itself. Bipolar electrodes are placed at the stimulation 
site, thus limiting the current paths to the area between the electrodes. Generally, in monopolar stimu- 
lation the active electrode is much smaller than the return electrode, while bipolar electrodes are the 
same size. 
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78.5 Packaging of Implantable Electronics 

Electronic circuits must be protected from the harsh environment of the human body. The packaging of 
implantable electronics uses various materials, including polymers, metals, and ceramics. The encapsu- 
lation method depends somewhat on the electronic circuit technology. Older devices may still use discrete 
components in a classical form, such as leaded transistors and resistors. The newer designs, depending 
on the sophistication of the implanted device, may employ application-specific integrated circuits (ASICs) 
and thick film hybrid circuitry for their implementation. Such circuits place considerable requirements 
for hermeticity and protection on the implanted circuit packaging. 

Epoxy encapsulation was the original choice of designers of implantable neuromuscular stimulators. 
It has been successfully used with relatively simple circuits using discrete, low impedance components. 
With epoxy encapsulation, the receiving coil is placed around the circuitry to be "potted" in a mold, 
which gives the implant the final shape. Additionally, the epoxy body is coated with silicone rubber that 
improves the biocompatibility of the package. Polymers do not provide an impermeable barrier and 
therefore cannot be used for encapsulation of high density, high impedance electronic circuits. The 
moisture ingress ultimately will reach the electronic components, and surface ions can allow electric 
shorting and degradation of leakage-sensitive circuitry and subsequent failure. 

Hermetic packaging provides the implant electronic circuitry with a long-term protection from the 
ingress of body fluids. Materials that provide hermetic barriers are metals, ceramics, and glasses. Metallic 
packaging generally uses a titanium capsule machined from a solid piece of metal or deep-drawn from 
a piece of sheet metal. Electrical signals, such as power and stimulation, enter and exit the package 
through hermetic feedthroughs, which are hermetically welded onto the package walls. The feedthrough 
assembly utilizes a ceramic or glass insulator to allow one or more wires to exit the package without 
contact with the package itself. During the assembly procedures, the electronic circuitry is placed in the 
package and connected internally to the feedthroughs, and the package is then welded closed. Tungsten 
Inert Gas (TIG), electron beam, or laser welding equipment is used for the final closure. Assuming 
integrity of all components, hermeticity with this package is ensured. This integrity can be checked by 
detecting gas leakage from the capsule. Metallic packaging requires that the receiving coil be placed 
outside the package to avoid significant loss of rf signal or power, thus requiring additional space within 
the body to accommodate the volume of the entire implant. Generally, the hermetic package and the 
receiving antenna are jointly imbedded in an epoxy encapsulant, which provides electric isolation for the 
metallic antenna and stabilizes the entire implant assembly. Figure 78.3 shows such an implantable 




FIGURE 78.3 Photograph of a multichannel implantable stimulator telemeter. Hybrid circuit in titanium package 
is shown exposed. Receiving coil (left) is imbedded in epoxy resin together with titanium case. Double feedthroughs 
are seen penetrating titanium capsule wall on the right. 
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stimulator designed and made by the CWRU/Veterans Administration Program. The hermetic package 
is open, displaying the electronic hybrid circuit. More recently, alumina-based ceramic packages have 
been developed that allow hermetic sealing of the electronic circuitry together with enclosure of the 
receiving coil [Strojnik, 1994] . This is possible due to the rf transparency of ceramics. The impact of this 
type of enclosure is still not fully investigated. The advantage of this approach is that the volume of the 
implant can be reduced, thus minimizing the biologic response, which is a function of volume. Yet, an 
unexplored issue of this packaging method is the effect of powerful electromagnetic fields on the implant 
circuits, lacking the protection of the metal enclosure. This is a particular concern with high gain (EMG, 
ENG, or EKG sensing) amplifiers, which in the future may be included in the implant package as part 
of back-telemetry circuits. Physical strength of ceramic packages and their resistance to impact will also 
require future investigation. 

78.6 Leads and Electrodes 

Leads connect the pulse generator to the electrodes. They must be sufficiently flexible to move across the 
joints while at the same time sufficiently sturdy to last for the decades of the intended life of the device. 
They must also be stretchable to allow change of distance between the pulse generator and the electrodes, 
associated with body movements. Ability to flex and to stretch is achieved by coiling the lead conductor 
into a helix and inserting the helix into a small-diameter silicone tubing. This way, both flexing move- 
ments and stretching forces exerted on the lead are attenuated, while translated into torsion movements 
and forces exerted on the coiled conductor. Using multi- strand rather than solid conductors further 
enhances the longevity. Several individually insulated multi-strand conductors can be coiled together, 
thus forming a multiple conductor lead wire. Most lead configurations include a connector at some point 
between the implant and the terminal electrode, allowing for replacement of the implanted receiver or 
leads in the event of failure. The connectors used have been either single pin in-line connectors located 
somewhere along the lead length or a multiport/multilead connector at the implant itself. Materials used 
for lead wires are stainless steels, MP35N (Co, Cr, Ni alloy), and noble metals and their alloys. 

Electrodes deliver electrical charge to the stimulated tissues. Those placed on the muscle surface are 
called epimysial, while those inserted into the muscles are called intramuscular. Nerve stimulating elec- 
trodes are called epineural when placed against the nerve, or cuff electrodes when they encircle the nerve. 
Nerve electrodes may embrace the nerve in a spiral manner individually, or in an array configuration. 
Some implantable stimulation systems merely use exposed lead-wire conductor sutured to the epineu- 
rium as the electrode. Generally, nerve electrodes require approximately one-tenth of the energy for 
muscle activation as compared to muscle electrodes. However, they require more extensive surgery and 
may be less selective, but the potential for neural damage is greater than, for example, nerve encircling 
electrodes. 

Electrodes are made of corrosion resistant materials, such as noble metals (platinum or iridium) and 
their alloys. For example, a platinum-iridium alloy consisting of 10% iridium and 90% platinum is 
commonly used as an electrode material. Epimysial electrodes developed at CWRU use 04 mm Pt90IrlO 
discs placed on Dacron reinforced silicone backing. CWRU intramuscular electrodes employ a stainless 
steel lead-wire with the distal end de-insulated and configured into an electrode tip. A small, umbrella- 
like anchoring barb is attached to it. With this arrangement, the diameter of the electrode tip does not 
differ much from the lead wire diameter and this electrode can be introduced into a deep muscle with 
a trochar-like insertion tool. Figure 78.4 shows enlarged views of these electrodes. 

78.7 Safety Issues of Implantable Stimulators 

The targeted lifetime of implantable stimulators for neuromuscular control is the lifetime of their users, 
which is measured in tens of years. Resistance to premature failure must be assured by manufacturing 
processes and testing procedures. Appropriate materials must be selected that will withstand the working 



) 2000 by CRC Press LLC 




FIGURE 78.4 Implantable electrodes with attached lead wires. Intramuscular electrode (top) has stainless steel tip 
and anchoring barbs. Epimysial electrode has Ptlr disk in the center and is backed by silicone-impregnated Dacron 
mesh. 

environment. Protection against mechanical and electrical hazards that may be encountered during the 
device lifetime must be incorporated in the design. Various procedures are followed and rigorous tests 
must be performed during and after its manufacturing to assure the quality and reliability of the device. 

• Manufacturing and testing — Production of implantable electronic circuits and their encapsula- 
tion in many instances falls under the standards governing production and encapsulation of 
integrated circuits. To minimize the possibility of failure, the implantable electronic devices are 
manufactured in controlled clean-room environments, using high quality components and strictly 
defined manufacturing procedures. Finished devices are submitted to rigorous testing before being 
released for implantation. Also, many tests are carried out during the manufacturing process itself. 
To assure maximum reliability and product confidence, methods, tests, and procedures defined 
by military standards, such as MIL-STD-883, are followed. 

• Bio-compatibility — Since the implantable stimulators operate surgically implanted in living tis- 
sue, an important part of their design has to be dedicated to biocompatibility, i.e., their ability to 
dwell in living tissue without disrupting the tissue in its functions, creating adverse tissue response, 
or changing its own properties due to the tissue environment. Elements of biocompatibility include 
tissue reaction to materials, shape, and size, as well as electrochemical reactions on stimulation 
electrodes. There are known biomaterials used in the making of implantable stimulators. They 
include stainless steels, titanium and tantalum, noble metals such as platinum and iridium, as well 
as implantable grades of selected epoxy and silicone-based materials. 

• Susceptibility to electromagnetic interference (EMI) and electrostatic discharge (ESD) — Elec- 
tromagnetic fields can disrupt the operation of electronic devices, which may be lethal in situations 
with life support systems, but they may also impose risk and danger to users of neuromuscular 
stimulators. Emissions of EMI may come from outside sources; however, the external control unit 
is also a source of electromagnetic radiation. Electrostatic discharge shocks are not uncommon 
during the dry winter season. These shocks may reach voltages as high as 15 kV and more. Sensitive 
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electronic components can easily be damaged by these shocks unless protective design measures 
are taken. The electronic circuitry in implantable stimulators is generally protected by the metal 
case. However, the circuitry can be damaged through the feedthroughs either by handling or 
during the implantation procedure by the electrocautery equipment. ESD damage may happen 
even after implantation when long lead-wires are utilized. There are no standards directed specif- 
ically towards implantable electronic devices. The general standards put in place for electromedical 
equipment by the International Electrotechnical Commission provide guidance. The specifications 
require survival after 3 kV and 8 kV ESD discharges on all conductive and nonconductive accessible 
parts, respectively. 

78.8 Implantable Stimulators in Clinical Use 
Peripheral Nerve Stimulators 

• Manipulation — Control of complex functions for movement, such as hand control, requires the 
use of many channels of stimulation. At the Case Western Reserve University and Cleveland VAMC, 
an eight-channel stimulator has been developed for grasp and release [Smith, 1987]. This system 
uses eight channels of stimulation and a titanium-packaged, thick- film hybrid circuit as the pulse 
generator. The implant is distributed by the Neurocontrol Corporation (Cleveland, OH) under 
the name of Freehand®. It has been implanted in approximately 150 patients in the U.S., Europe, 
Asia, and Australia. The implant is controlled by a dual-microprocessor external unit carried by 
the patient with an input control signal provided by the user's remaining volitional movement. 
Activation of the muscles provides two primary grasp patterns and allows the person to achieve 
functional performance that exceeds his or her capabilities without the use of the implanted system. 
This system received pre-market approval from the FDA in 1998. 

• Locomotion — The first implantable stimulators were designed and implanted for the correction 
of the foot drop condition in hemiplegic patients. Medtronic's Neuromuscular Assist (NMA) 
device consisted of an rf receiver implanted in the inner thigh and connected to a cuff electrode 
embracing the peroneal nerve just beneath the head of fibula at the knee [McNeal, 1977; Waters 
1984]. The Ljubljana peroneal implant had two versions [Vavken, 1976; Strojnik,1987] with the 
common feature that the implant-rf receiver was small enough to be implanted next to the 
peroneal nerve in the fossa poplitea region. Epineural stimulating electrodes were an integral part 
of the implant. This feature and the comparatively small size make the Ljubljana implant a 
precursor of the micro-stimulators described in Section 78.9. Both NMA and the Ljubljana 
implants were triggered and synchronized with gait by a heel switch. 

The same implant used for hand control and developed by the CWRU has also been implanted 
in the lower extremity musculature to assist incomplete quadriplegics in standing and transfer 
operations [Triolo, 1996]. Since the design of the implant is completely transparent, it can generate 
any stimulation sequence requested by the external controller. For locomotion and transfer- related 
tasks, stimulation sequences are preprogrammed for individual users and activated by the user by 
means of pushbuttons. The implant (two in some applications) is surgically positioned in the 
lower abdominal region. Locomotion application uses the same electrodes as the manipulation 
system; however, the lead wires have to be somewhat longer. 

• Respiration — Respiratory control systems involve a two-channel implantable stimulator with 
electrodes applied bilaterally to the phrenic nerve. Most of the devices in clinical use were developed 
by Avery Laboratories (Dobelle Institute) and employed discrete circuitry with epoxy encapsula- 
tion of the implant and a nerve cuff electrode. Approximately 1000 of these devices have been 
implanted in patients with respiratory disorders such as high-level tetraplegia [Glenn, 1986]. 
Activation of the phrenic nerve results in contraction of each hemidiaphragm in response to 
electrical stimulation. In order to minimize damage to the diaphragms during chronic use, alter- 
nation of the diaphragms has been employed, in which one hemidiaphragm will be activated for 
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several hours followed by the second. A review of existing systems was given by Creasy et al. [1996]. 
Astrotech of Finland also recently introduced a phrenic stimulator. More recently, DiMarco [1997] 
has investigated use of CNS activation of a respiratory center to provide augmented breathing. 

• Urinary control — Urinary control systems have been developed for persons with spinal cord 
injury. The most successful of these devices has been developed by Brindley [1982] and is man- 
ufactured by Finetech, Ltd. (England). The implanted receiver consists of three separate stimulator 
devices, each with its own coil and circuitry, encapsulated within a single package. The sacral roots 
(S2, S3, and S4) are placed within a type of encircling electrode, and stimulation of the proper 
roots will generate contraction of both the bladder and the external sphincter. Cessation of 
stimulation results in faster relaxation of the external sphincter than of the bladder wall, which 
then results in voiding. Repeated trains of pulses applied in this manner will eliminate most urine, 
with only small residual amounts remaining. Approximately 1500 of these devices have been 
implanted around the world. This technology also has received FDA pre-market approval and is 
currently distributed by NeuroControl Corporation. 

• Scoliosis treatment — Progressive lateral curvature of the adolescent vertebral column with simul- 
taneous rotation is known as idiopathic scoliosis. Electrical stimulation applied to the convex side 
of the curvature has been used to stop or reduce its progression. Initially rf powered stimulators 
have been replaced by battery powered totally implanted devices [Bobechko, 1979; Herbert, 1989] . 
Stimulation is applied intermittently, stimulation amplitudes are under 10.5 V (510 Q), and fre- 
quency and pulsewidth are within usual FES parameter values. 

Stimulators of Central Nervous System 

Some stimulation systems have electrodes implanted on the surface of the central nervous system or in 
its deep areas. They do not produce functional movements; however, they "modulate" a pathological 
motor brain behavior and by that stop unwanted motor activity or abnormality. Therefore, they can be 
regarded as stimulators for neuromuscular control. 

• Cerebellar stimulation — Among the earliest stimulators from this category are cerebellar stimu- 
lators for control of reduction of effects of cerebral palsy in children. Electrodes are placed on the 
cerebellar surface with the leads penetrating cranium and dura. The pulse generator is located 
subcutaneously in the chest area and produces intermittent stimulation bursts. There are about 
600 patients using these devices [Davis, 1997]. 

• Vagal stimulation — Intermittent stimulation of the vagus nerve with 30 sec on and five min off 
has been shown to reduce frequency of epileptic seizures. A pacemaker-like device, developed by 
Cyberonics, is implanted in the chest area with a bipolar helical electrode wrapped around the 
left vagus nerve in the neck. The stimulation sequence is programmed (most often parameter 
settings are 30 Hz, 500 us, 1.75 mA); however, patients have some control over the device using 
a hand-held magnet [Terry, 1991]. More than 3000 patients have been implanted with this device, 
which received the pre-marketing approval (PMA) from the FDA in 1997. 

• Deep brain stimulation — Recently, in 1998, an implantable stimulation device (Activa by 
Medtronic) was approved by the FDA that can dramatically reduce uncontrollable tremor in 
patients with Parkinson's disease or essential tremor [Koller, 1997]. With this device, an electrode 
array is placed stereotactically into the ventral intermediate nucleus of thalamic region of the brain. 
Lead wires again connect the electrodes to a programmable pulse generator implanted in the chest 
area. Application of high frequency stimulation (130 Hz, 60 to 210 (is, 0.25 to 2.75 V) can imme- 
diately suppress the patient's tremor. 
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FIGURE 78.5 Microstimulator developed at A.E. Mann Foundation. Dimensions are roughly 2x16 mm. Electrodes 
at the ends are made of tantalum and iridium, respectively. 

78.9 Future of Implantable Electrical Stimulators 
Distributed Stimulators 

One of the major concerns with multichannel implantable neuromuscular stimulators is the multitude 
of leads that exit the pulse generator and their management during surgical implantation. Routing of 
multiple leads virtually increases the implant size and by that the burden that an implant imposes on 
the tissue. A solution to that may be distributed stimulation systems with a single outside controller and 
multiple single-channel implantable devices implanted throughout the structures to be stimulated. This 
concept has been pursued both by the Alfred E. Mann Foundation [Strojnik, 1992; Cameron, 1997] and 
the University of Michigan [Ziaie, 1997]. Micro-injectable stimulator modules have been developed that 
can be injected into the tissue, into a muscle, or close to a nerve through a lumen of a hypodermic needle. 
A single external coil can address and activate a number of these devices located within its field, on a 
pulse-to-pulse basis. A glass-encapsulated microstimulator developed at the AEMF is shown in Fig. 78.5. 

Sensing of Implantable Transducer-Generated 
and Physiological Signals 

External command sources such as the shoulder-controlled joystick utilized by the Freehand® system 
impose additional constraints on the implantable stimulator users, since they have to be donned by an 
attendant. Permanently implanted control sources make neuro-prosthetic devices much more attractive 
and easier to use. An implantable joint angle transducer (IJAT) has been developed at the CWRU that 
consists of a magnet and an array of magnetic sensors implanted in the distal and the proximal end of 
a joint, respectively [Smith, 1998]. The sensor is connected to the implantable stimulator package, which 
provides the power and also transmits the sensor data to the external controller, using a back-telemetry 
link. Figure 78.6 shows a radiograph of the IJAT implanted in a patient's wrist. Myoelectric signals (MES) 
from muscles not affected by paralysis are another attractive control source for implantable neuromus- 
cular stimulators. Amplified and bin-integrated EMG signal from uninvolved muscles, such as the sterno- 
cleido-mastoid muscle, has been shown to contain enough information to control an upper extremity 
neuroprosthesis [Scott, 1996]. EMG signal is being utilized by a multichannel stimulator-telemeter 
developed at the CWRU, containing 12 stimulator channels and 2 MES channels integrated into the same 
platform [Strojnik, 1998]. 
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FIGURE 78.6 Radiograph of the joint angle transducer (IJAT) implanted in the wrist. The magnet is implanted in 
the lunate bone (left) while the magnetic sensor array is implanted in the radius. Leads going to the implant case 
can be seen as well as intramuscular and epimysial electrodes with their individual lead wires. 

78.10 Summary 

Implantable stimulators for neuromuscular control are an important tool in rehabilitation of paralyzed 
individuals with preserved neuro-muscular apparatus, as well as in the treatment of some neurological 
disorders that result in involuntary motor activity. Their impact on rehabilitation is still in its infancy; 
however, it is expected to increase with further progress in microelectronics technology, development of 
smaller and better sensors, and with improvements of advanced materials. Advancements in neuro- 
physiological science are also expected to bring forward wider utilization of possibilities offered by 
implantable neuromuscular stimulators. 
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Defining Terms 

Biocompatibility: Ability of a foreign object to coexist in a living tissue. 

Electrical stimulation: Diagnostic, therapeutic, and rehabilitational method used to excite motor 

nerves with the aim of contracting the appropriate muscles and obtain limb movement. 
EMG activity: Muscular electrical activity associated with muscle contraction and production of force. 
Feedthrough: Device that allows passage of a conductor through a hermetic barrier. 
Hybrid circuit: Electronic circuit combining miniature active and passive components on a single 

ceramic substrate. 
Implantable stimulator: Biocompatible electronic stimulator designed for surgical implantation and 

operation in a living tissue. 
Lead wire: Flexible and strong insulated conductor connecting pulse generator to stimulating electrodes. 
Paralysis: Loss of power of voluntary movement in a muscle through injury to or disease to its nerve 

supply. 
Stimulating electrode: Conductive device that transfers stimulating current to a living tissue. On its 

surface, the electric charge carriers change from electrons to ions or vice versa. 
rf-radiofrequency: Pertaining to electromagnetic propagation of power and signal in frequencies above 

those used in electrical power distribution. 
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Further Information 

Additional references on early work in FES which augment peer review publications can be found in 
Proceedings from Conferences in Dubrovnik and Vienna. These are the External Control of Human 
Extremities and the Vienna International Workshop on Electrostimulation, respectively. 
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Defibrillators are devices used to supply a strong electric shock (often referred to as a countershock) to a 
patient in an effort to convert excessively fast and ineffective heart rhythm disorders to slower rhythms 
that allow the heart to pump more blood. External defibrillators have been in common use for many 
decades for emergency treatment of life-threatening cardiac rhythms as well as for elective treatment of 
less threatening rapid rhythms. Figure 79.1 shows an external defibrillator. 

Cardiac arrest occurs in more than 500,000 people annually in the United States, and more than 70% 
of the out-of-hospitals are due to cardiac arrhythmia treatable with defibrillators. The most serious 
arrhythmia treated by a defibrillator is ventricular fibrillation. Without rapid treatment using a defibril- 
lator, ventricular fibrillation causes complete loss of cardiac function and death within minutes. Atrial 
fibrillation and the more organized rhythms of atrial flutter and ventricular tachycardia can be treated 
on a less emergent basis. Although they do not cause immediate death, their shortening of the interval 
between contractions can impair filling of the heart chambers and thus decrease cardiac output. Con- 
ventionally, treatment of ventricular fibrillation is called defibrillation, whereas treatment of the other 
tachycardias is called cardioversion. 

79.1 Mechanism of Fibrillation 

Fibrillation is chaotic electric excitation of the myocardium and results in loss of coordinated mechanical 
contraction characteristic of normal heart beats. Description of mechanisms leading to, and maintaining, 
fibrillation and other rhythm disorders are reviewed elsewhere [1] and are beyond the scope of this 
chapter. In summary, however, these rhythm disorders are commonly held to be a result of reentrant 
excitation pathways within the heart. The underlying abnormality that leads to the mechanism is the 
combination of conduction block of cardiac excitation plus rapidly recurring depolarization of the 
membranes of the cardiac cells. This leads to rapid repetitive propagation of a single excitation wave or 
of multiple excitatory waves throughout the heart. If the waves are multiple, the rhythm may degrade 
into total loss of synchronization of cardiac fiber contraction. Without synchronized contraction, the 
chamber affected will not contract, and this is fatal in the case of ventricular fibrillation. The most 
common cause of these conditions, and therefore of these rhythm disorders, is cardiac ischemia or 
infarction as a complication of atherosclerosis. Additional relatively common causes include other cardiac 
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FIGURE 79.1 Photograph of a trans-chest defibrillator (provided by Physio -Control Corporation with permission). 

disorders, drug toxicity, electrolyte imbalances in the blood, hypothermia, and electric shocks (especially 
from alternating current). 

79.2 Mechanism of Defibrillation 

The corrective measure is to extinguish the rapidly occurring waves of excitation by simultaneously 
depolarizing most of the cardiac cells with a strong electric shock. The cells then can simultaneously 
repolarize themselves, and thus they will be back in phase with each other. 

Despite years of intensive research, there is still no single theory for the mechanism of defibrillation 
that explains all the phenomena observed. However, it is generally held that the defibrillating shock must 
be adequately strong and have adequate duration to affect most of the heart cells. In general, longer 
duration shocks require less current than shorter duration shocks. This relationship is called the strength- 
duration relationship and is demonstrated by the curve shown in Fig. 79.2. Shocks of strength and 
duration above and to the right of the current curve (or above the energy curve) have adequate strength 
to defibrillate, whereas shocks below and to the left do not. From the exponentially decaying current 
curve an energy curve can also be determined (also shown in Fig. 79.2), which is high at very short 
durations due to high current requirements at short durations, but which is also high at longer durations 
due to additional energy being delivered as the pulse duration is lengthened at nearly constant current. 
Thus, for most electrical waveforms there is a minimum energy for defibrillation at approximate pulse 
durations of 3-8 ms. A strength-duration charge curve can also be determined as shown in Fig. 79.2, 
which demonstrates that the minimum charge for defibrillation occurs at the shortest pulse duration 
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FIGURE 79.2 Strength-duration curves for current, energy, and charge. Adequate current shocks are above and to 
the right of the current curve. (Modified from Tacker WA, Geddes LA, 1980. Electrical Defibrillation, Boca Raton, 
Fla, CRC Press, with permission.) 

tested. Very-short-duration pulses are not used, however, since the high current and voltage required is 
damaging to the myocardium. It is also important to note that excessively strong or long shocks may 
cause immediate refibrillation, thus failing to restore the heart function. 

In practice, for a shock applied to electrodes on the skin surface of the patient's chest, durations are 
on the order of 3-10 milliseconds and have an intensity of a few thousand volts and tens of amperes. 
The energy delivered to the subject by these shocks is selectable by the operator and is on the order of 
50-360 joules for most defibrillators. The exact shock intensity required at a given duration of electric 
pulse depends on several variables, including the intrinsic characteristics of the patient (such as the 
underlying disease problem or presence of certain drugs and the length of time the arrhythmia has been 
present), the techniques for electrode application, and the particular rhythm disorder being treated (more 
organized rhythms require less energy than disorganized rhythms). 

79.3 Clinical Defibrillators 

Defibrillator design has resulted from medical and physiologic research and advances in hardware tech- 
nology. It is estimated that for each minute that elapses between onset of ventricular fibrillation and the 
first shock application, survival to leave hospital decreases by about 10%. The importance of rapid 
response led to development of portable, battery-operated defibrillators and more recently to automatic 
external defibrillators (AEDs) that enable emergency responders to defibrillate with minimal training. 

All clinical defibrillators used today store energy in capacitors. Desirable capacitor specifications 
include small size, light weight, and capability to sustain several thousands of volts and many charge- 
discharge cycles. Energy storage capacitors account for at least one pound and usually several pounds of 
defibrillator weight. Energy stored by the capacitor is calculated from 



W = - CE 2 

s 2 



(79.1) 
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FIGURE 79.3 Block diagram of a typical defibrillator. (From Feinberg B. 1980. Handbook Series in Clinical Labo- 
ratory Science, vol 2, Boca Raton, Fla, CRC Press, with permission.) 

where W s = stored energy in joules, C = capacitance in farads, and E = voltage applied to the capacitor. 
Delivered energy is expressed as 



W d =W s x 



f R A 



(79.2) 



where W d = delivered energy, W s = stored energy, _R = subject resistance, and .R, = device resistance. 

Figure 79.3 shows a block diagram for defibrillators. Most have a built-in monitor and synchronizer 
(dashed lines in Fig. 79.3). Built-in monitoring speeds up diagnosis of potentially fatal arrhythmias, 
especially when the ECG is monitored through the same electrodes that are used to apply the defibrillating 
shock. The great preponderance of defibrillators for trans-chest defibrillation deliver shocks with either 
a damped sinusoidal waveform produced by discharge of an RCL circuit or a truncated exponential decay 
waveform (sometimes called trapezoidal). Basic components of exemplary circuits for damped sine 
waveform and trapezoidal waveform defibrillators are shown in Figs. 79.4 and 79.5. The shape of the 
waveforms generated by RCL defibrillators depend on the resistance of the patient as well as the energy 
storage capacitance and resistance and inductance of the inductor. When discharged into a 50-Q load 
(to stimulate the patient's resistance), these defibrillators produce either a critically damped sine waveform 
or a slightly underdamped sine waveform (i.e., having a slight reversal of waveform polarity following 
the main waveform) into the 50-fi load. 




FIGURE 79.4 Resister-capacitor-inductor defibrillator. The patient is represented by R. (Modified from Feinberg 
1980. Handbook Series in Clinical Laboratory Science, vol 2, Boca Raton, Fla, CRC Press, with permission.) 
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FIGURE 79.5 Trapezoidal wave defibrillator. The patient is represented by R. (Modified from Feinberg B. 1980. 
Handbook Series in Clinical Laboratory Science, vol 2, Boca Raton, Fla, CRC Press, with permission.) 

The exact waveform can be determined by application of Kirkchoff 's voltage law to the circuit 



r di 

L — + 
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0H4J 
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(79.3) 



where L = inductance in H, i = instantaneous current in amperes, t = time in seconds, R t = device 
resistance, R = subject resistance, and C = capacitance. From this, the second-order differential equation 
describes the RCL defibrillator. 
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Trapezoidal waveform (actually, these are truncated exponential decay waveform) defibrillators are 
also used clinically. The circuit diagram in Fig. 79.4 is exemplary of one design for producing such a 
waveform. Delivered energy calculation for this waveform is expressed as 



W, = 0.5 I. R 
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where W d = delivered energy, I, = initial current in amperes, L = final current, _R = resistance of the 
patient, and d = pulse duration in seconds. Both RCL and trapezoidal waveforms defibrillate effectively. 
Implantable defibrillators now use alternative waveforms such as a biphasic exponential decay waveform, 
in which the polarity of the electrodes is reversed part way through the shock. Use of the biphasic 
waveform has reduced the shock intensity required for implantable defibrillators but has not yet been 
extended to trans-chest use except on an experimental basis. 

RCL defibrillators are the most widely available. They store up to about 440 joules and deliver up to 
about 360 joules into a patient with 50-ohm impedance. Several selectable energy intensities are available, 
typically from 5-360 J, so that pediatric patients, very small patients, or patients with easily converted 
arrhythmias can be treated with low-intensity shocks. The pulse duration ranges from 3-6 ms. Because 
the resistance (_R) varies between patients (25-150 ohms) and is part of the RCL discharge circuit, the 
duration and damping of the pulse also varies; increasing patient impedance lengthens and dampens the 
pulse. Figure 79.6 shows waveforms from RCL defibrillators with critically damped and with under- 
damped pulses. 

79.4 Electrodes 

Electrodes for external defibrillation are metal and from 70-100 cm 2 in surface area. They must be coupled 
to the skin with an electrically conductive material to achieve low impedance across the electrode-patient 
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FIGURE 79.6 The damped sine wave. The interval O-D represents a duration for the critically and overdamped 
since waves. By time D, more than 99% of the energy has been delivered. O—U is taken as the duration for an 
underdamped sine wave. (Modified from Tacker WA, Geddes LA. 1980. Electrical Defibrillation, Boca Raton, Fla, 
CRC Press, with permission.) 

interface. There are two types of electrodes: hand-held (to which a conductive liquid or solid gel is applied) 
and adhesive, for which an adhesive conducting material holds the electrode in place. Hand-held electrodes 
are reusable and are pressed against the patient's chest by the operator during shock delivery. Adhesive 
electrodes are disposable and are applied to the chest before the shock delivery and left in place for reuse 
if subsequent shocks are needed. Electrodes are usually applied with both electrodes on the anterior chest 
as shown in Fig. 79.7 or in anterior-to-posterior (front-to-back) position, as shown in Fig. 79.8. 

79.5 Synchronization 

Most defibrillators for trans-chest use have the feature of synchronization, which is an electronic sensing 
and triggering mechanism for application of the shock during the QRS complex of the ECG. This is 
required when treating arrhythmias other than ventricular fibrillation, because inadvertent application 
of a shock during the T wave of the ECG often produces ventricular fibrillation. Selection by the operator 
of the synchronized mode of defibrillator operation will cause the defibrillator to automatically sense the 
QRS complex and apply the shock during the QRS complex. Furthermore, on the ECG display, the timing 
of the shock on the QRS is graphically displayed so the operator can be certain that the shock will not 
fall during the Twave (see Fig. 79.9). 

79.6 Automatic External Defibrillators 

Automatic external defibrillators (AEDs) are defibrillators that automatically or semiautomatically rec- 
ognize and treat rapid arrhythmias, usually under emergency conditions. Their operation requires less 
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Anterior-Anterior 

FIGURE 79.7 Cross-sectional view of the chest showing position for standard anterior wall (precordial) electrode 
placement. Lines of presumed current flow are shown between the electrodes on the skin surface. (Modified from Tacker 
WA (ed). 1994. Defibrillation of the Heart: ICDs, AEDs and Manual, St. Louis, Mosby-Year Book, with permission.) 




L-Anterior-Posterior 

FIGURE 79.8 Cross-sectional view of the chest showing position for front-to-back electrode placement. Lines of 
presumed current flow are shown between the electrodes on the skin surface. (Modified from Tacker WA (ed). 1994. 
Defibrillation of the Heart: ICDs, AEDs and Manual, St. Louis, Mosby-Year Book, with permission.) 

training than operation of manual defibrillators because the operator need not know which ECG wave- 
forms indicate rhythms requiring a shock. The operator applies adhesive electrodes from the AED to the 
patient and turns on the AED, which monitors the ECG and determines by built-in signal processing 
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FIGURE 79.9 Timing mark (M) as shown on a synchronized defibrillator monitor. The M designates when in the 
cardiac cycle a shock will be applied. The T wave must be avoided, since a shock during the vulnerable period (V.P.) 
may fibrillate the ventricles. This tracing shows atrial fibrillation as identified by the irregular wavy baseline of the 
ECG. (Modified from Feinberg B. 1980. Handbook Series in Clinical Laboratory Science, vol 2, Boca Raton, Fla, CRC 
Press, with permission.) 



whether or not and when to shock the patient. In a completely automatic mode, the AED does not have 
a manual control as shown in Fig. 79.3 but instead has an automatic control. In semiautomatic mode, 
the operator must confirm the shock advisory from the AED to deliver the shock. AEDs have substantial 
potential for improving the chances of survival from cardiac arrest because they enable emergency 
personnel, who typically reach the patient before paramedics do, to deliver defibrillating shocks. Fur- 
thermore, the reduced training requirements make feasible the operation of AEDs in the home by a 
family member of a patient at high risk of ventricular fibrillation. 



79.7 Defibrillator Safety 



Defibrillators are potentially dangerous devices because of their high electrical output characteristics. 
The danger to the patient of unsynchronized shocks has already been presented, as has the synchronization 
design to prevent inadvertent precipitation of fibrillation by a cardioversion shock applied during the X 
wave. 

There are other safety issues. Improper technique may result in accidental shocking of the operator 
or other personnel in the vicinity, if someone is in contact with the electric discharge pathway. This may 
occur if the operator is careless in holding the discharge electrodes or if someone is in contact with the 
patient or with a metal bed occupied by the subject when the shock is applied. Proper training and 
technique is necessary to avoid this risk. 

Another safety issue is that of producing damage to the patient by application of excessively strong or 
excessively numerous shocks. Although cardiac damage has been reported after high-intensity and repet- 
itive shocks to experimental animals and human patients, it is generally held that significant cardiac 
damage is unlikely if proper clinical procedures and guidelines are followed. 

Failure of a defibrillator to operate correctly may also be considered a safety issue, since inability of a 
defibrillator to deliver a shock in the absence of a replacement unit means loss of the opportunity to 
resuscitate the patient. A recent review of defibrillator failures found that operator errors, inadequate 
defibrillator care and maintenance, and, to a lesser extent, component failure accounted for the majority 
of defibrillator failures [7]. 
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Further Information 

Detailed presentation of material on defibrillator waveforms, algorithms for ECG analysis, and automatic 
defibrillation using AED's, electrodes, design, clinical use, effects of drugs on shock strength required to 
defibrillate, damage due to defibrillator shocks, and use of defibrillators during open-thorax surgical 
procedures or trans-esophageal defibrillation are beyond the scope of this chapter. Also, the historical 
aspects of defibrillation are not presented here. For more information, the reader is referred to the 
publications at the end of this chapter [1—3]. For detailed description of specific defibrillators with 
comparisons of features, the reader is referred to articles from Health Devices, a monthly publication of 
ECRI, 5200 Butler Pike, Plymouth Meeting, Pa USA. For American, Canadian, and European defibrillator 
standards, the reader is referred to published standards [3-6] and Charbonnier's discussion of standards 
[11. 
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The implantable cardioverter defibrillator (ICD) is a therapeutic device that can detect ventricular 
tachycardia or fibrillation and automatically deliver high-voltage (750 V) shocks that will restore normal 
sinus rhythm. Advanced versions also provide low-voltage (5-10 V) pacing stimuli for painless termina- 
tion of ventricular tachycardia and for management of bradyarrhythmias. The proven efficacy of the 
automatic implantable defibrillator has placed it in the mainstream of therapies for the prevention of 
sudden arrhythmic cardiac death. 

The implantable defibrillator has evolved significantly since first appearing in 1980. The newest devices 
can be implanted in the patient's pectoral region and use electrodes that can be inserted transvenously, 
eliminating the traumatic thoracotomy required for placement of the earlier epicardial electrode systems. 
Transvenous systems provide rapid, minimally invasive implants with high assurance of success and 
greater patient comfort. Advanced arrhythmia detection algorithms offer a high degree of sensitivity with 
reasonable specificity, and extensive monitoring is provided to document performance and to facilitate 
appropriate programming of arrhythmia detection and therapy parameters. Generator longevity can now 
exceed 4 years, and the cost of providing this therapy is declining. 

80.1 Pulse Generators 

The implantable defibrillator consists of a primary battery, high-voltage capacitor bank, and sensing and 
control circuitry housed in a hermetically sealed titanium case. Commercially available devices weigh 
between 197 and 237 grams and range in volume from 113 to 145 cm 3 . Clinical trials are in progress on 
devices with volumes ranging from 178 cm 3 to 60 cm 3 and weights between 275 and 104 grams. Further 
size reductions will be achieved with the introduction of improved capacitor and integrated circuit 
technologies and lead systems offering lower pacing and defibrillation thresholds. Progress should parallel 
that made with antibradycardia pacemakers that have evolved from 250-gram, nonprogrammable, VOO 
units with 600-uJ pacing outputs to 26-gram, multiprogrammable, DDDR units with dual 25-liJ outputs. 
Implantable defibrillator circuitry must include an amplifier, to allow detection of the millivolt-range 
cardiac electrogram signals; noninvasively programmable processing and control functions, to evaluate 
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the sensed cardiac activity and to direct generation and delivery of the therapeutic energy; high-voltage 
switching capability; dc-dc conversion functions to step up the low battery voltages; random access 
memories, to store appropriate patient and device data; and radiofrequency telemetry systems, to allow 
communication to and from the implanted device. Monolithic integrated circuits on hybridized substrates 
have made it possible to accomplish these diverse functions in a commercially acceptable and highly 
reliable form. 

Defibrillators must convert battery voltages of approximately 6.5 V to the 600-750 V needed to 
defibrillate the heart. Since the conversion process cannot directly supply this high voltage at current 
strengths needed for defibrillation, charge is accumulated in relatively large (s85-120u.F effective capac- 
itance) aluminum electrolytic capacitors that account for 20-30% of the volume of a typical defibrillator. 
These capacitors must be charged periodically to prevent their dielectric from deteriorating. If this is not 
done, the capacitors become electrically leaky, yielding excessively long charge times and delay of therapy. 
Early defibrillators required that the patient return to the clinic periodically to have the capacitors 
reformed, whereas newer devices do this automatically at preset or programmable times. Improved 
capacitor technology, perhaps ceramic or thin-film, will eventually offer higher storage densities, greater 
shape variability for denser component packaging, and freedom from the need to waste battery capacity 
performing periodic reforming charges. Packaging density has already improved from 0.03 J/cm 3 for 
devices such as the early cardioverter to 0.43 J/cm 3 with some investigational ICDs. Capacitors that allow 
conformal shaping could readily increase this density to more than 0.6 J/cm 3 . 

Power sources used in defibrillators must have sufficient capacity to provide 50-400 full energy charges 
(~34 J) and 3 to 5 years of bradycardia pacing and background circuit operation. They must have a very low 
internal resistance in order to supply the relatively high currents needed to charge the defibrillation capacitors 
in 5-15 s. This generally requires that the batteries have large surface area electrodes and use chemistries that 
exhibit higher rates of internal discharge than those seen with the lithium iodide batteries used in pacemakers. 
The most commonly used defibrillator battery chemistry is lithium silver vanadium oxide. 



80.2 Electrode Systems ("Leads") 



Early implantable defibrillators utilized patch electrodes 
(typically a titanium mesh electrode) placed on the surface 
of the heart, requiring entry through the chest (Fig. 80.1). 
This procedure is associated with approximately 3-4% peri- 
operative mortality, significant hospitalization time and 
complications, patient discomfort, and high costs. Although 
subcostal, subxiphoid, and thoracoscopic techniques can 
minimize the surgical procedure, the ultimate solution has 
been development of fully transvenous lead systems with 
acceptable defibrillation thresholds. 

Currently available transvenous leads are constructed 
much like pacemaker leads, using polyurethane or silicone 
insulation and platinum-iridium electrode materials. 
Acceptable thresholds are obtained in 67-95% of patients, 
with mean defibrillation thresholds ranging from 10.9-18.1 J. 
These lead systems use a combination of two or more elec- 
trodes located in the right ventricular apex, the superior 
vena cava, the coronary sinus, and sometimes, a subcuta- 
neous patch electrode is placed in the chest region. These 
leads offer advantages beyond the avoidance of major sur- 
gery. They are easier to remove should there be infections 
or a need for lead system revision. The pacing thresholds 
of current transvenous defibrillation electrodes are typically 




FIGURE 80.1 Epicardial ICD systems typi- 
cally use two or three large defibrillating patch 
electrodes placed on the epicardium of the left 
and right ventricles and a pair of myocardial 
electrodes for detection and pacing. The gen- 
erator is usually placed in the abdomen. (Copy- 
right Medtronic, Inc. Used with permission.) 
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FIGURE 80.2 The latest transvenous fibrillation systems employ a single catheter placed in the right ventricular 
apex. In panel a, a single transvenous catheter provides defibrillation electrodes in the superior vena cava and in the 
right ventricle. This catheter provides a single pace/sense electrode which is used in conjunction with the right 
ventricular high-voltage defibrillation electrode for arrhythmia detection and antibradycardia/antitachycardia pacing 
(a configuration that is sometimes referred to as integrated bipolar). With pulse generators small enough to be placed 
in the pectoral region, defibrillation can be achieved by delivering energy between the generator housing and one 
high-voltage electrode in the right ventricle (analogous to unipolar pacing) as is shown in panel b. This catheter 
provided bipolar pace/sense electrodes for arrhythmia detection and antibradycardia/antitachycardia pacing. (Copy- 
right Medtronic, Inc. Used with permission.) 

0.96 ± 0.39 V, and the electrogram amplitudes are on the order of 16.4 ± 6.4 mV. The eventual application 
of steroid-eluting materials in the leads should provide increased pacing efficiency with transvenous lead 
systems, thereby reducing the current drain associated with pacing and extending pulse generator lon- 
gevity. 

Lead systems are being refined to simplify the implant procedures. One approach is the use of a single 
catheter having a single right ventricular low-voltage electrode for pacing and detection, and a pair of 
high-voltage defibrillation electrodes spaced for replacement in the right ventricle and in the superior 
vena cava (Fig. 80. 2a). A more recent approach parallels that used for unipolar pacemakers. A single 
right-ventricular catheter having bipolar pace/sense electrodes and one right ventricular high-voltage 
electrode is used in conjunction with a defibrillator housing that serves as the second high-voltage 
electrode (Fig. 80.2b). Mean biphasic pulse defibrillation thresholds with the generator- electrode placed 
in the patient's left pectoral region are reported to be 9.8 + 6.6 J (« = 102). This approach appears to be 
practicable only with generators suitable for pectoral placement, but such devices will become increasingly 
available. 



80.3 Arrhythmia Detection 



Most defibrillator detection algorithms rely primarily on heart rate to indicate the presence of a treatable 
rhythm. Additional refinements sometimes include simple morphology assessments, as with the proba- 
bility density function, and analysis of rhythm stability and rate of change in rate. 

The probability density function evaluates the percentage of time that the filtered ventricular electrogram 
spends in a window centered on the baseline. The rate-of-change-in-rate or onset evaluation discriminates 
sinus tachycardia from ventricular tachycardia on the basis of the typically gradual acceleration of sinus 
rhythms versus the relatively abrupt acceleration of many pathologic tachycardias. The rate stability 
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function is designed to bar detection of tachyarrhythmias as long as the variation in ventricular rate 
exceeds a physician-programmed tolerance, thereby reducing the likelihood of inappropriate therapy 
delivery in response to atrial fibrillation. This concept appears to be one of the more successful detection 
algorithm enhancements. 

Because these additions to the detection algorithm reduce sensitivity, some defibrillator designs offer 
a supplementary detection mode that will trigger therapy in response to any elevated ventricular rate of 
prolonged duration. These extended-high-rate algorithms bypass all or portions of the normal detection 
screening, resulting in low specificity for rhythms with prolonged elevated rates such as exercise-induced 
sinus tachycardia. Consequently, use of such algorithms generally increases the incidence of inappropriate 
therapies. 

Improvements in arrhythmia detection specificity are desirable, but they must not decrease the excel- 
lent sensitivity offered by current algorithms. The anticipated introduction of defibrillators incorporating 
dual-chamber pacemaker capability will certainly help in this quest, since it will then be possible to use 
atrial electrograms in the rhythm classification process. It would also be desirable to have a means of 
evaluating the patient's hemodynamic tolerance of the rhythm, so that the more comfortable pacing 
sequences could be used as long as the patient was not syncopal yet branch quickly to a definitive shock 
should the patient begin to lose consciousness. 

Although various enhanced detection processes have been proposed, many have not been tested 
clinically, in some cases because sufficient processing power was not available in implantable systems, 
and in some cases because sensor technology was not yet ready for chronic implantation. Advances in 
technology may eventually make some of these very elegant proposals practicable. Examples of proposed 
detection enhancements include extended analyses of cardiac event timing (PR and RR stability, AV 
interval variation, temporal distribution of atrial electrogram intervals and of ventricular electrogram 
intervals, timing differences and/or coherency of multiple ventricular electrograms, ventricular response 
to a provocative atrial extrastimuli), electrogram waveform analyses (paced depolarization integral, 
morphology analyses of right ventricular or atrial electrograms), analyses of hemodynamic parameters 
(right-ventricular pulsatile pressure, mean right atrial and mean right ventricular pressures, wedge 
coronary sinus pressure, static right ventricular pressure, right atrial pressure, right ventricular stroke 
volume, mixed venous oxygen saturation and mixed venous blood temperature, left ventricular imped- 
ance, intramyocardial pressure gradient, aortic and pulmonary artery flow), and detection of physical 
motion. 

Because defibrillator designs are intentionally biased to overtreat in preference to the life-threatening 
consequences associated with failure to treat, there is some incidence of inappropriate therapy delivery. 
Unwarranted therapies are usually triggered by supraventricular tachyarrhythmias, especially atrial fibril- 
lation, or sinus tachycardia associated with rates faster than the ventricular tachycardia detection rate 
threshold. Additional causes include nonsustained ventricular tachycardia, oversensing of T waves, double 
counting of R waves and pacing stimuli from brady pacemakers, and technical faults such as loose lead- 
generator connections or lead fractures. 

Despite the bias for high detection sensitivity, undersensing does occur. It has been shown to result 
from inappropriate detection algorithm programming, such as an excessively high tachycardia detection 
rate; inappropriate amplifier gain characteristics; and electrode designs that place the sensing terminals 
too close to the high-voltage electrodes with a consequent reduction in electrogram amplitude following 
shocks. Undersensing can also result in the induction of tachycardia should the amplifier gain control 
algorithm result in undersensing of sinus rhythms. 

80.4 Arrhythmia Therapy 

Pioneering implantable defibrillators were capable only of defibrillation shocks. Subsequently, synchro- 
nized cardioversion capability was added. Antibradycardia pacing had to be provided by implantation 
of a standard pacemaker in addition to the defibrillator, and, if antitachycardia pacing was prescribed, it 
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was necessary to use an antitachycardia pacemaker. Several currently marketed implantable defibrillators 
offer integrated ventricular demand pacemaker function and tiered antiarrhythmia therapy (pacing/car- 
dioversion/ defibrillation). Various burst and ramp antitachycardia pacing algorithms are offered, and 
they all seem to offer comparably high success rates. These expanded therapeutic capabilities improve 
patient comfort by reducing the incidence of shocks in conscious patients, eliminate the problems and 
discomfort associated with implantation of multiple devices, and contribute to a greater degree of success, 
since the prescribed regimens can be carefully tailored to specific patient needs. Availability of devices 
with antitachy pacing capability significantly increases the acceptability of the implantable defibrillator 
for patients with ventricular tachycardia. 

Human clinical trials have shown that biphasic defibrillation waveforms are more effective than 
monophasic waveforms, and newer devices now incorporate this characteristic. Speculative explanations 
for biphasic superiority include the large voltage change at the transition from the first to the second 
phase or hyperpolarization of tissue and reactivation of sodium channels during the initial phase, with 
resultant tissue conditioning that allows the second phase to more readily excite the myocardium. 

Antitachycardia pacing and cardioversion are not uniformly successful. There is some incidence of 
ventricular arrhythmia acceleration with antitachycardia pacing and cardioversion, and it is also not 
unusual for cardioversion to induce atrial fibrillation that in turn triggers unwarranted therapies. An 
ideal therapeutic solution would be one capable of preventing the occurrence of tachycardia altogether. 
Prevention techniques have been investigated, among them the use of precisely timed subthreshold 
stimuli, simultaneous stimulation at multiple sites, and pacing with elevated energies at the site of the 
tachycardia, but none has yet proven practical. 

The rudimentary WI antibradycardia pacing provided by current defibrillators lacks rate responsive- 
ness and atrial pacing capability. Consequently, some defibrillator patients require implantation of a 
separate dual-chamber pacemaker for hemodynamic support. It is inevitable that future generations of 
defibrillators will offer dual-chamber pacing capabilities. 

Atrial fibrillation, occurring either as a consequence of defibrillator operation or as a natural progres- 
sion in many defibrillator patients, is a major therapeutic challenge. It is certainly possible to adapt 
implantable defibrillator technology to treat atrial fibrillation, but the challenge is to do so without 
causing the patient undue discomfort. Biphasic waveform defibrillation of acutely induced atrial fibril- 
lation has been demonstrated in humans with an 80% success rate at 0.4 J using epicardial electrodes. 
Stand-alone atrial defibrillators are in development, and, if they are successful, it is likely that this 
capability would be integrated into the mainstream ventricular defibrillators as well. However, most 
conscious patients find shocks above 0.5 J to be very unpleasant, and it remains to be demonstrated that 
a clinically acceptable energy level will be efficacious when applied with transvenous electrode systems 
to spontaneously occurring atrial fibrillation. Moreover, a stand-alone atrial defibrillator either must 
deliver an atrial shock with complete assurance of appropriate synchronization to ventricular activity or 
must restrict the therapeutic energy delivery to atrial structures in order to prevent inadvertent induction 
of a malignant ventricular arrhythmia. 

80.5 Implantable Monitoring 

Until recently, defibrillator data recording capabilities were quite limited, making it difficult to verify the 
adequacy of arrhythmia detection and therapy settings. The latest devices record electrograms and 
diagnostic channel data showing device behavior during multiple tachyarrhythmia episodes. These 
devices also include counters (number of events detected, success and failure of each programmed therapy, 
and so on) that present a broad, though less specific, overview of device behavior (Fig. 80.3). Monitoring 
capability in some of the newest devices appears to be the equivalent of 32 Kbytes of random access 
memory, allowing electrogram waveform records of approximately 2-min duration, with some oppor- 
tunity for later expansion by judicious selection of sampling rates and data compression techniques. 
Electrogram storage has proven useful for documenting false therapy delivery due to atrial fibrillation, 
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FIGURE 80.3 Typical data recorded by an implantable defibrillator include stored intracardiac electrograms with 
annotated markers indicating cardiac intervals, paced and sensed events, and device classification of events (TF = 
fast tachycardia; TP = antitachy pacing stimulus; VS = sensed nontachy ventricular event). In the example, five rapid 
pacing pulses convert a ventricular tachycardia with a cycle length of 340 ms into sinus rhythm with a cycle length 
of 830 ms. In the lower portion of the figure is an example of the summary data collected by the ICD, showing 
detailed counts of the performance of the various therapies (Rx) for ventricular tachycardia (VT), fast ventricular 
(VTF), and ventricular (VF). (Copyright Medtronic, Inc. Used with permission.) 

lead fractures, and sinus tachycardia, determining the triggers of arrhythmias; documenting rhythm 
accelerations in response to therapies; and demonstrating appropriate device behavior when treating 
asymptomatic rhythms. 

Electrograms provide useful information by themselves, yet they cannot indicate how the device 
interpreted cardiac activity. Increasingly, electrogram records are being supplemented with event markers 
that indicate how the device is responding on a beat-by-beat basis. These records can include measure- 
ments of the sensed and paced intervals, indication as to the specific detection zone an event falls in, 
indication of charge initiation, and other device performance data. 



80.6 Follow-up 



Defibrillator patients and their devices require careful follow-up. In one study of 241 ICD patients with 
epicardial lead systems, 53% of the patients experienced one or more complications during an average 
exposure of 24 months. These complications included infection requiring device removal in 5%, post- 
operative respiratory complications in 11%, postoperative bleeding and/or thrombosis in 4%, lead system 
migration or disruption in 8%, and documented inappropriate therapy delivery, most commonly due to 
atrial fibrillation, in 22%. A shorter study of eighty patients with transvenous defibrillator systems 
reported no postoperative pulmonary complications, transient nerve injury (1%), asymptomatic subcla- 
vian vein occlusion (2.5%), pericardial effusion (1%), subcutaneous patch pocket hematoma (5%), pulse 
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generator pocket infection (1%), lead fracture (1%), and lead system dislodgement (10%). During a 
mean follow-up period of 1 1 months, 7.5% of the patients in this series experienced inappropriate therapy 
delivery, half for atrial fibrillation and the rest for sinus tachycardia. 

Although routine follow-up can be accomplished in the clinic, detection and analysis of transient 
events depends on the recording capabilities available in the devices or on the use of various external 
monitoring equipment. 

80.7 Economics 

The annual cost of ICD therapy is dropping as a consequence of better longevity and simpler implantation 
techniques. Early generators that lacked programmability, antibradycardia pacing capability, and event 
recording had 62% survival at 18 months and 2% at 30 months. Some recent programmable designs 
that include VVI pacing capability and considerable event storage exhibit 96.8% survival at 48 months. 
It has been estimated that an increase in generator longevity from 2-5 years would lower the cost per 
life-year saved by 55% in a hypothetical patient population with a 3-year sudden mortality of 28%. More 
efficient energy conversion circuits and finer line-width integrated circuit technology with smaller, more 
highly integrated circuits and reduced current drains will yield longer-lasting defibrillators while con- 
tinuing the evolution to smaller volumes. 

Cost of the implantation procedure is clearly declining as transvenous lead systems become common- 
place. Total hospitalization duration, complication rates, and use of costly hospital operating rooms and 
intensive care facilities all are reduced, providing significant financial benefits. One study reported 
requiring half the intensive care unit time and a reduction in total hospitalization from 26 to 15 days 
when comparing transvenous to epicardial approaches. Another center reported a mean hospitalization 
stay of 6 days for patients receiving transvenous defibrillation systems. 

Increasing sophistication of the implantable defibrillators paradoxically contributes to cost efficacy. 
Incorporation of single-chamber brady pacing capability eliminates the cost of a separate pacemaker and 
lead for those patients who need one. Eventually even dual-chamber pacing capability will be available. 
Programmable detection and therapy features obviate the need for device replacement that was required 
when fixed parameter devices proved to be inappropriately specified or too inflexible to adapt to a patient's 
physiologic changes. 

Significant cost savings may be obtained by better patient selection criteria and processes, obviating 
the need for extensive hospitalization and costly electrophysiologic studies prior to device implantation 
in some patient groups. One frequently discussed issue is the prophylactic role that implantable defibril- 
lators will or should play. Unless a means is found to build far less expensive devices that can be placed 
with minimal time and facilities, the life-saving yield for prophylactic defibrillators will have to be high 
if they are to be cost-effective. This remains an open issue. 

80.8 Conclusion 

The implantable defibrillator is now an established and powerful therapeutic tool. The transition to 
pectoral implants with biphasic waveforms and efficient yet simple transvenous lead systems is simplifying 
the implant procedure and drastically reducing the number of unpleasant VF inductions required to 
demonstrate adequate system performance These advances are making the implantable defibrillator easier 
to use, less costly, and more acceptable to patients and their physicians. 
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An electrosurgical unit (ESU) passes high-frequency electric currents through biologic tissues to achieve 
specific surgical effects such as cutting, coagulation, or desiccation. Although it is not completely under- 
stood how electrosurgery works, it has been used since the 1920s to cut tissue effectively while at the 
same time controlling the amount of bleeding. Cutting is achieved primarily with a continuous sinusoidal 
waveform, whereas coagulation is achieved primarily with a series of sinusoidal wave packets. The surgeon 
selects either one of these waveforms or a blend of them to suit the surgical needs. An electrosurgical 
unit can be operated in two modes, the monopolar mode and the bipolar mode. The most noticeable 
difference between these two modes is the method in which the electric current enters and leaves the 
tissue. In the monopolar mode, the current flows from a small active electrode into the surgical site, 
spreads through the body, and returns to a large dispersive electrode on the skin. The high current density 
in the vicinity of the active electrode achieves tissue cutting or coagulation, whereas the low current 
density under the dispersive electrode causes no tissue damage. In the bipolar mode, the current flows 
only through the tissue held between two forceps electrodes. The monopolar mode is used for both 
cutting and coagulation. The bipolar mode is used primarily for coagulation. 

This chapter beings with the theory of operation for electrosurgical units, outlines various modes of 
operation, and gives basic design details for electronic circuits and electrodes. It then describes how 
improper application of electrosurgical units can lead to hazardous situations for both the operator and 
the patient and how such hazardous situations can be avoided or reduced through proper monitoring 
methods. Finally, the chapter gives an update on current and future developments and applications. 

81.1 Theory of Operation 

In principle, electrosurgery is based on the rapid heating of tissue. To better understand the thermody- 
namic events during electrosurgery, it helps to know the general effects of heat on biologic tissue. Consider 
a tissue volume that experiences a temperature increase from normal body temperature to 45°C within 
a few seconds. Although the cells in this tissue volume show neither microscopic nor macroscopic changes, 
some cytochemical changes do in fact occur. However, these changes are reversible, and the cells return 
to their normal function when the temperature returns to normal values. Above 45°C, irreversible changes 
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take place that inhibit normal cell functions and lead to cell death. First, between 45°C and 60°C, the 
proteins in the cell lose their quaternary configuration and solidify into a glutinous substance that 
resembles the white of a hard-boiled egg. This process, termed coagulation, is accompanied by tissue 
blanching. Further increasing the temperature up to 100°C leads to tissue drying; that is, the aqueous 
cell contents evaporate. This process is called desiccation. If the temperature is increased beyond 100°C, 
the solid contents of the tissue reduce to carbon, a process referred to as carbonization. Tissue damage 
depends not only on temperature, however, but also on the length of exposure to heat. Thus, the overall 
temperature-induced tissue damage is an integrative effect between temperature and time that is expressed 
mathematically by the Arrhenius relationship, where an exponential function of temperature is integrated 
over time [ 1 ] . 

In the monopolar mode, the active electrode either touches the tissue directly or is held a few 
millimeters above the tissue. When the electrode is held above the tissue, the electric current bridges the 
air gap by creating an electric discharge arc. A visible arc forms when the electric field strength exceeds 
1 kV/mm in the gap and disappears when the field strength drops below a certain threshold level. 

When the active electrode touches the tissue and the current flows directly from the electrode into the 
tissue without forming an arc, the rise in tissue temperature follows the bioheat equation 

T-T B = — ft (81.1) 

ape 

where Tand T are the final and initial temperatures (K), a is the electrical conductivity (S/m), p is the 
tissue density (kg/m 3 ), c is the specific heat of the tissue (Jkg^'Kr 1 ), J is the current density (A/m 2 ), and 
(is the duration of heat applications [1]. The bioheat equation is valid for short application times where 
secondary effects such as heat transfer to surrounding tissues, blood perfusion, and metabolic heat can 
be neglected. According to Eq. (81.1), the surgeon has primarily three means of controlling the cutting 
or coagulation effect during electrosurgery: the contact area between active electrode and tissue, the 
electrical current density, and the activation time. In most commercially available electrosurgical gener- 
ators, the output variable that can be adjusted is power. This power setting, in conjunction with the 
output power vs. tissue impedance characteristics of the generator, allow the surgeon some control over 
current. Table 81.1 lists typical output power and mode settings for various surgical procedures. Table 81.2 
lists some typical impedance ranges seen during use of an ESU in surgery. The values are shown as ranges 
because the impedance increases as the tissue dries out, and at the same time, the output power of the 
ESU decreases. The surgeon may control current density by selection of the active electrode type and size. 

81.2 Monopolar Mode 

A continuous sinusoidal waveform cuts tissue with very little hemostasis. This waveform is simply called 
cut or pure cut. During each positive and negative swing of the sinusoidal waveform, a new discharge arc 
forms and disappears at essentially the same tissue location. The electric current concentrates at this 
tissue location, causing a sudden increase in temperature due to resistive heating. The rapid rise in 
temperature then vaporizes intracellular fluids, increases cell pressure, and ruptures the cell membrane, 
thereby parting the tissue. This chain of events is confined to the vicinity of the arc, because from there 
the electric current spreads to a much larger tissue volume, and the current density is no longer high 
enough to cause resistive heating damage. Typical output values for ESUs, in cut and other modes, are 
shown in Table 81.3. 

Experimental observations have shown that more hemostasis is achieved when cutting with an inter- 
rupted sinusoidal waveform or amplitude modulated continuous waveform. These waveforms are typi- 
cally called blend or blended cut. Some ESUs offer a choice of blend waveforms to allow the surgeon to 
select the degree of hemostasis desired. 
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Table 81.1 Typical ESU Power Settings for Various Surgical Procedures 



Power-Level Range 

Low power 
<30 W cut 
<30 W coag 



Medium power 
30 W-150 W cut 
30 W-70 W coag 



High power 
>150Wcut 
>70 W coag 



Procedures 



Neurosurgery 

Dermatology 

Plastic surgery 

Oral surgery 

Laparoscopic sterilization 

Vasectomy 

General surgery 

Laparotomies 

Head and neck surgery (ENT) 

Major orthopedic surgery 

Major vascular surgery 

Routine thoracic surgery 

Polypectomy 

Transurethral resection procedures (TURPs) 

Thoracotomies 

Ablative cancer surgery 

Mastectomies 



Note: Ranges assume the use of a standard blade electrode. Use of a needle 
electrode, or other small current-concentrating electrode, allows lower settings 
to be used; users are urged to use the lowest setting that provides the desired 
clinical results. 



TABLE 81.2 Typical Impedance Ranges Seen During Use 
of an ESU in Surgery 



Cut Mode Application 



Impedance Range (Q) 



Prostate tissue 
Oral cavity 
Liver tissue 
Muscle tissue 
Gall bladder 
Skin tissue 
Bowel tissue 
Periosteum 
Mesentery 
Omentum 
Adipose tissue 
Scar tissue 
Adhesions 

Coag Mode Application 



400-1700 
1000-2000 



1500-2400 
1700-2500 
2500-3000 

3000-4200 

3500-4500 



Contact coagulation to stop bleeding 



100-1000 



When a continuous or interrupted waveform is used in contact with the tissue and the output voltage 
current density is too low to sustain arcing, desiccation of the tissue will occur. Some ESUs have a distinct 
mode for this purpose called desiccation or contact coagulation. 

In noncontact coagulation, the duty cycle of an interrupted waveform and the crest factor (ratio of 
peak voltage to rms voltage) influence the degree of hemostasis. While a continuous waveform reestab- 
lishes the arc at essentially the same tissue location concentrating the heat there, an interrupted waveform 
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TABLE 81.3 Typical Output Characteristics of ESUs 













Crest Factor 






Output Voltage Range 


Output Power 




m 






Open 


Circuit, V peal . p(!al , V 


Range, W 


Frequency, kHz 


Duty Cycle 


Monopolar modes 














Cut 




200-5000 


1-400 


300-1750 


1.4-2.1 


100% 


Blend 




1500-5800 


1-300 


300-1750 


2.1-6.0 


25-80% 


Desiccate 




400-6500 


1-200 


240-800 


3.5-6.0 


50-100% 


Fulgurate/spray 




6000-12000 


1-200 


300-800 


6.0-20.0 


10-70% 


Bipolar mode 














Coagulate/desiccate 




200-1000 


1-70 


300-1050 


1.6-12.0 


25-100% 



causes the arc to reestablish itself at different tissue locations. The arc seems to dance from one location 
to the other raising the temperature of the top tissue layer to coagulation levels. These waveforms are 
called fulgumtion or spray. Since the current inside the tissue spreads very quickly from the point where 
the arc strikes, the heat concentrates in the top layer, primarily desiccating tissue and causing some 
carbonization. During surgery, a surgeon can easily choose between cutting, coagulation, or a combina- 
tion of the two by activating a switch on the grip of the active electrode or by use of a footswitch. 



81.3 Bipolar Mode 



The bipolar mode concentrates the current flow between the two electrodes, requiring considerably less 
power for achieving the same coagulation effect than the monopolar mode. For example, consider 
coagulating a small blood vessel with 3-mm external diameter and 2-mm internal diameter, a tissue 
resistivity of 360 Qcm, a contract area of 2 x 4 mm, and a distance between the forceps tips of 1 mm. 
The tissue resistance between the forceps is 450 D. as calculated from R = pL/A, where p is the resistivity, 
L is the distance between the forceps, and A is the contact area. Assuming a typical current density of 
200 mA/cm 2 , then a small current of 16 mA, a voltage of 7.2 V, and a power level of 0.12 W suffice to 
coagulate this small blood vessel. In contrast, during monopolar coagulation, current levels of 200 mA 
and power levels of 100 W or more are not uncommon to achieve the same surgical effect. The temper- 
ature increase in the vessel tissue follows the bioheat equation, Eq. (81.1). If the specific heat of the vessel 
tissue is 4.2 Jg _1 K _1 and the tissue density is 1 g/cm 3 , then the temperature of the tissue between the 
forceps increases from 37°C to 57°C in 5.83 s. When the active electrode touches the tissue, less tissue 
damage occurs during coagulation, because the charring and carbonization that accompanies fulguration 
is avoided. 

81.4 ESU Design 

Modern ESUs contain building blocks that are also found in other medical devices, such as micropro- 
cessors, power supplies, enclosures, cables, indicators, displays, and alarms. The main building blocks 
unique to ESUs are control input switches, the high-frequency power amplifier, and the safety monitor. 
The first two will be discussed briefly here, and the latter will be discussed later. 

Control input switches include front panel controls, footswitch controls, and handswitch controls. In 
order to make operating an ESU more uniform between models and manufacturers, and to reduce the 
possibility of operator error, the ANSI/AAMI HF-18 standard [5] makes specific recommendations 
concerning the physical construction and location of these switches and prescribes mechanical and 
electrical performance standards. For instance, front panel controls need to have their function identified 
by a permanent label and their output indicated on alphanumeric displays or on graduated scales; the 
pedals of foot switches need to be labeled and respond to a specified activation force; and if the active 
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electrode handle incorporates two finger switches, their position has to correspond to a specific function. 
Additional recommendations can be found in Reference [5]. 

Four basic high-frequency power amplifiers are in use currently; the somewhat dated vacuum 
tube/spark gap configuration, the parallel connection of a bank if bipolar power transistors, the hybrid 
connection of parallel bipolar power transistors cascaded with metal oxide silicon field effect transistors 
(MOSFETs), and the bridge connection of MOSFETs. Each has unique properties and represents a stage 
in the evolution of ESUs. 

In a vacuum tube/spark gap device, a tuned-plate, tuned-grid vacuum tube oscillator is used to generate 
a continuous waveform for use in cutting. This signal is introduced to the patient by an adjustable 
isolation transformer. To generate a waveform for fulguration, the power supply voltage is elevated by a 
step-up transformer to about 1600 V rms which then connects to a series of spark gaps. The voltage 
across the spark gaps is capacitively coupled to the primary of an isolation transformer. The RLC circuit 
created by this arrangement generates a high crest factor, damped sinusoidal, interrupted waveform. One 
can adjust the output power and characteristics by changing the turns ratio or tap on the primary and/or 
secondary side of the isolation transformer, or by changing the spark gap distance. 

In those devices that use a parallel bank of bipolar power transistors, the transistors are arranged in 
a Class A configuration. The bases, collectors, and emitters are all connected in parallel, and the collective 
base node is driven through a current-limiting resistor. A feedback RC network between the base node 
and the collector node stabilizes the circuit. The collectors are usually fused individually before the 
common node connects them to one side of the primary of the step-up transformer. The other side of 
the primary is connected to the high-voltage power supply. A capacitor and resistor in parallel to the 
primary create a resonance tank circuit that generates the output waveform at a specific frequency. 
Additional elements may be switched in and out of the primary parallel RLC to alter the output power 
and waveform for various electrosurgical modes. Small-value resistors between the emitters and ground 
improve the current sharing between transistors. This configuration sometimes requires the use of 
matched sets of high-voltage power transistors. 

A similar arrangement exists in amplifiers using parallel bipolar transistors cascaded with a power 
MOSFET This arrangement is called a hybrid cascode amplifier. In this type of amplifier, the collectors 
of a group if bipolar transistors are connected, via protection diodes, to one side of the primary of the 
step-up output transformer. The other side of the primary is connected to the high-voltage power supply. 
The emitters of two or three bipolar transistors are connected, via current limiting resistors, to the drain 
of an enhancement mode MOSFET. The source of the MOSFET is connected to ground, and the gate of 
the MOSFET is connected to a voltage-snubbing network driven by a fixed amplitude pulse created by 
a high-speed MOS driver circuit. The bases of the bipolar transistors are connected, via current control 
RC networks, to a common variable base voltage source. Each collector and base is separately fused. In 
cut modes, the gate drive pulse is a fixed frequency, and the base voltage is varied according to the power 
setting. In the coagulation modes, the base voltage is fixed and the width of the pulses driving the MOSFET 
is varied. This changes the conduction time of the amplifier and controls the amount of energy imparted 
to the output transformer and its load. In the coagulation modes and in high-power cut modes, the 
bipolar power transistors are saturated, and the voltage across the bipolar/MOSFET combination is low. 
This translates to high efficiency and low power dissipation. 

The most common high-frequency power amplifier in use is a bridge connection of MOSFETs. In this 
configuration, the drains of a series of power MOSFETs are connected, via protection diodes, to one side 
of the primary of the step-up output transformer. The drain protection diodes protect the MOSFETs 
against the negative voltage swings of the transformer primary. The other side of the transformer primary 
is connected to the high-voltage power supply. The sources of the MOSFETs are connected to ground. 
The gate of each MOSFET has a resistor connected to ground and one to its driver circuitry. The resistor 
to ground speeds up the discharge of the gate capacitance when the MOSFET is turned on while the gate 
series resistor eliminates turn-off oscillations. Various combinations of capacitors and/or LC networks 
can be switched across the primary of the step-up output transformer to obtain different waveforms. In 
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the cut mode, the output power is controlled by varying the high-voltage power supply voltage. In the 
coagulation mode, the output power is controlled by varying the on time of the gate drive pulse. 

81.5 Active Electrodes 

The monopolar active electrode is typically a small flat blade with symmetric leading and trailing edges 
that is embedded at the tip of an insulated handle. The edges of the blade are shaped to easily initiate 
discharge arcs and to help the surgeon manipulate the incision; the edges cannot mechanically cut tissue. 
Since the surgeon holds the handle like a pencil, it is often referred to as the "pencil." Many pencils 
contain in their handle one or more switches to control the electrosurgical waveform, primarily to switch 
between cutting and coagulation. Other active electrodes include needle electrodes, loop electrodes, and 
ball electrodes. Needle electrodes are used for coagulating small tissue volumes like in neurosurgery or 
plastic surgery. Loop electrodes are used to resect nodular structures such as polyps or to excise tissue 
samples for pathologic analysis. An example would be the LLETZ procedure where the transition zone 
of the cervix is excised. Electrosurgery at the tip of an endoscope or laparoscope requires yet another set 
of active electrodes and specialized training of the surgeon. 

81.6 Dispersive Electrodes 

The main purpose of the dispersive electrode is to return the high-frequency current to the electrosurgical 
unit without causing harm to the patient. This is usually achieved by attaching a large electrode to the 
patient's skin away from the surgical site. The large electrode area and a small contact impedance reduce 
the current density to levels where tissue heating is minimal. Since the ability of a dispersive electrode 
to avoid tissue heating and burns is of primary importance, dispersive electrodes are often characterized 
by their heating factor. The heating factor describes the energy dissipated under the dispersive electrode 
per Q of impedance and is equal to Pt, where I is the rms current and t is the time of exposure. During 
surgery a typical value for the heating factor is 3 A 2 s, but factors of up to 9 A 2 s may occur during some 
procedures [2], 

Two types of dispersive electrodes are in common use today, the resistive type and the capacitive type. 
In disposable form, both electrodes have a similar structure and appearance. A thin, rectangular metallic 
foil has an insulating layer on the outside, connects to a gel-like material on the inside, and may be 
surrounded by an adhesive foam. In the resistive type, the gel-like material is made of an adhesive 
conductive gel, whereas in the capacitive type, the gel is an adhesive dielectric nonconductive gel. The 
adhesive foam and adhesive gel layer ensure that both electrodes maintain good skin contact to the 
patient, even if the electrode gets stressed mechanically from pulls on the electrode cable. Both types 
have specific advantages and disadvantages. Electrode failures and subsequent patient injury can be 
attributed mostly to improper application, electrode dislodgment, and electrode defects rather than to 
electrode design. 

81.7 ESU Hazards 

Improper use of electrosurgery may expose both the patient and the surgical staff to a number of hazards. 
By far the most frequent hazards are electric shock and undesired burns. Less frequent are undesired 
neuromuscular stimulation, interference with pacemakers or other devices, electrochemical effects from 
direct currents, implant heating, and gas explosions [1,3]. 

Current returns to the ESU through the dispersive electrode. If the contact area of the dispersive 
electrode is large and the current exposure time short, then the skin temperature under the electrode 
does not rise above 45°C, which has been shown to be the maximum safe temperature [4]. However, to 
include a safety margin, the skin temperature should not rise more than 6°C above the normal surface 
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temperature of 29-33°C. The current density at any point under the dispersive electrode has to be 
significantly below the recognized burn threshold of 100 mA/cm 2 for 10 seconds. 

To avoid electric shock and burns, the American National Standard for Electrosurgical Devices [5] 
requires that "any electrosurgical generator that provides for a dispersive electrode and that has a rated 
output power of greater than 50 W shall have at least one patient circuit safety monitor." The most 
common safety monitors are the contact quality monitor for the dispersive electrode and the patient 
circuit monitor. A contact quality monitor consists of a circuit to measure the impedance between the 
two sides of a split dispersive electrode and the skin. A small high-frequency current flows from one 
section of the dispersive electrode through the skin to the second section of the dispersive electrode. If 
the impedance between these two sections exceeds a certain threshold, or changes by a certain percentage, 
an audible alarm sounds, and the ESU output is disabled. 

Patient circuit monitors range from simple to complex. The simple ones monitor electrode cable 
integrity while the complex ones detect any abnormal condition that could result in electrosurgical current 
flowing in other than normal pathways. Although the output isolation transformer present in most 
modern ESUs usually provides adequate patient protection, some potentially hazardous conditions may 
still arise. If a conductor to the dispersive electrode is broken, undesired arcing between the broken 
conductor ends may occur, causing fire in the operating room and serious patient injury. Abnormal 
current pathways may also arise from capacitive coupling between cables, the patient, operators, enclo- 
sures, beds, or any other conductive surface or from direct connections to other electrodes connected to 
the patient. The patient circuit monitoring device should be operated from an isolated power source 
having a maximum voltage of 12 V rms. The most common device is a cable continuity monitor. Unlike 
the contact quality monitor, this monitor only checks the continuity of the cable between the ESU and 
the dispersive electrode and sounds an alarm if the resistance in that conductor is greater than 1 kO. 
Another implementation of a patient circuit monitor measures the voltage between the dispersive elec- 
trode connection and ground. A third implementation functions similarly to a ground fault circuit 
interrupter (GFCI) in that the current in the wire to the active electrode and the current in the wire to 
the dispersive electrode are measured and compared with each other. If the difference between these 
currents is greater than a preset threshold, the alarm sounds and the ESU is disconnected. 

There are other sources of undesired burns. Active electrodes get hot when they are used. After use, 
the active electrode should be placed in a protective holster, if available, or on a suitable surface to isolate 
it from the patient and surgical staff. The correct placement of an active electrode will also prevent the 
patient and/or surgeon from being burned if an inadvertent activation of the ESU occurs (e.g., someone 
accidentally stepping on a foot pedal). Some surgeons use a practice called buzzing the hemostat in which 
a small bleeding vessel is grasped with a clamp or hemostat and the active electrode touched to the clamp 
while activating. Because of the high voltages involved and the stray capacitance to ground, the surgeon's 
glove may be compromised. If the surgical staff cannot be convinced to eliminate the practice of buzzing 
hemostats, the probability of burns can be reduced by use of a cut waveform instead of a coagulation 
waveform (lower voltage), by maximizing contact between the surgeon's hand and the clamp, and by not 
activating until the active electrode is firmly touching the clamp. 

Although it is commonly assumed that neuromuscular stimulation ceases or is insignificant at fre- 
quencies above 10 kHz, such stimulation has been observed in anesthetized patients undergoing certain 
electrosurgical procedures. This undesirable side effect of electrosurgery is generally attributed to non- 
linear events during the electric arcing between the active electrode and tissue. These events rectify the 
high-frequency current leading to both dc and low-frequency current components. These current com- 
ponents can reach magnitudes that stimulate nerve and muscle cells. To minimize the probability of 
unwanted neuromuscular stimulation, most ESUs incorporate in their output circuit a high-pass filter 
that suppresses dc and low-frequency current components. 

The use of electrosurgery means the presence of electric discharge arcs. This presents a potential fire 
hazard in an operating room where oxygen and flammable gases may be present. These flammable gases 
may be introduced by the surgical staff (anesthetics or flammable cleaning solutions), or may be generated 
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within the patients themselves (bowel gases). The use of disposable paper drapes and dry surgical gauze 
also provides a flammable material that may be ignited by sparking or by contact with a hot active 
electrode. Therefore, prevention of fires and explosions depends primarily on the prudence and judgment 
of the ESU operator. 

81.8 Recent Developments 

Electrosurgery is being enhanced by the addition of a controlled column of argon gas in the path between 
the active electrode and the tissue. The flow of argon gas assists in clearing the surgical site of fluid and 
improves visibility. When used in the coagulation mode, the argon gas is turned into a plasma allowing 
tissue damage and smoke to be reduced, and producing a thinner, more flexible eschar. When used with 
the cut mode, lower power levels may be used. 

Many manufacturers have begun to include sophisticated computer-based systems in their ESUs that 
not only simplify the use of the device but also increase the safety of patient and operator [7] . For instance, 
in a so-called soft coagulation mode, a special circuit continuously monitors the current between the 
active electrode and the tissue and turns the ESU output on only after the active electrode has contacted 
the tissue. Furthermore, the ESU output is turned off automatically, once the current has reached a 
certain threshold level that is typical for coagulated and desiccated tissue. This feature is also used in a 
bipolar mode termed autobipolar. Not only does this feature prevent arcing at the beginning of the 
procedure, but it also keeps the tissue from being heated beyond 70°C. Some devices offer a so-called 
power-peak-system that delivers a very short power peak at the beginning of electrosurgical cutting to 
start the cutting arc. Other modern devices use continuous monitoring of current and voltage levels to 
make automatic power adjustments in order to provide for a smooth cutting action from the beginning 
of the incision to its end. Some manufacturers are developing waveforms and instruments designed to 
achieve specific clinical results such as bipolar cutting tissue lesioning, and vessel sealing. With the growth 
and popularity of laparoscopic procedures, additional electrosurgical instruments and waveforms tailored 
to this surgical specialty should also be expected. 

Increased computing power, more sophisticated evaluation of voltage and current waveforms, and the 
addition of miniaturized sensors will continue to make ESUs more user-friendly and safer. 

Defining Terms 

Active electrode: Electrode used for achieving desired surgical effect. 

Coagulation: Solidification of proteins accompanied by tissue whitening. 

Desiccation: Drying of tissue due to the evaporation of intracellular fluids. 

Dispersive electrode: Return electrode at which no electrosurgical effect is intended. 

Fulguration: Random discharge of sparks between active electrode and tissue surface in order to achieve 

coagulation and/or desiccation. 
Spray: Another term for fulguration. Sometimes this waveform has a higher crest factor than that used 

for fulguration. 
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This chapter presents an overview of the structure and function of mechanical ventilators. Mechanical 
ventilators, which are often also called respirators, are used to artificially ventilate the lungs of patients 
who are unable to naturally breathe from the atmosphere. In almost 100 years of development, many 
mechanical ventilators with different designs have been developed [Mushin et al., 1980Philbeam, 1998]. 
The very early devices used bellows that were manually operated to inflate the lungs. Today's respirators 
employ an array of sophisticated components such as microprocessors, fast response servo valves, and 
precision transducers to perform the task of ventilating the lungs. The changes in the design of ventilators 
have come about as the result of improvements in engineering the ventilator components and the advent 
of new therapy modes by clinicians. A large variety of ventilators are now available for short-term 
treatment of acute respiratory problems as well as long-term therapy for chronic respiratory conditions. 

It is reasonable to broadly classify today's ventilators into two groups. The first and indeed the largest 
group encompasses the intensive care respirators used primarily in hospitals to support patients following 
certain surgical procedures or assist patients with acute respiratory disorders. The second group includes 
less complicated machines that are primarily used at home to treat patients with chronic respiratory 
disorders. 

The level of engineering design and sophistication for the intensive care ventilators is higher than the 
ventilators used for chronic treatment. However, many of the engineering concepts employed in designing 
intensive care ventilators can also be applied in the simpler chronic care units. Therefore, this presentation 
focuses on the design of intensive care ventilators; the terms respirator, mechanical ventilator, or ventilator 
that will be used from this point on refer to the intensive care unit respirators. 

At the beginning, the designers of mechanical ventilators realized that the main task of a respirator 
was to ventilate the lungs in a manner as close to natural respiration as possible. Since natural inspiration 
is a result of negative pressure in the pleural cavity generated by distention of the diaphragm, designers 
initially developed ventilators that created the same effect. These ventilators are called negative-pressure 
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FIGURE 82.1 A simplified illustration of a negative-pres- 
sure ventilator. 



ventilators. However, more modern ventilators use pressures greater than atmospheric pressures to ven- 
tilate the lungs; they are known as positive-pressure ventilators. 

82.2 Negative-Pressure Ventilators 

The principle of operation of a negative-pres- 
sure respirator is shown in Fig. 82.1. In this 
design, the flow of air to the lungs is created by 
generating a negative pressure around the 
patient's thoracic cage. The negative pressure 
moves the thoracic walls outward expanding 
the intra-thoracic volume and dropping the 
pressure inside the lungs. The pressure gradient 
between the atmosphere and the lungs causes 
the flow of atmospheric air into the lungs. The 
inspiratory and expiratory phases of the respi- 
ration are controlled by cycling the pressure 
inside the body chamber between a sub-atmo- 
spheric level (inspiration) and the atmospheric 
level (exhalation). Flow of the breath out of the 
lungs during exhalation is caused by the recoil 
of thoracic muscles. 

Although it may appear that the negative- 
pressure respirator incorporates the same principles as natural respiration, the engineering implemen- 
tation of this concept has not been very successful. A major difficulty has been in the design of a chamber 
for creating negative pressure around the thoracic walls. One approach has been to make the chamber 
large enough to house the entire body with the exception of the head and neck. Using foam rubber 
around the patient's neck, one can seal the chamber and generate a negative pressure inside the chamber. 
This design configuration, commonly known as the iron lung, was tried back in the 1920s and proved 
to be deficient in several aspects. The main drawback was that the negative pressure generated inside the 
chamber was applied to the chest as well as the abdominal wall, thus creating a venous blood pool in 
the abdomen and reducing cardiac output. 

More recent designs have tried to restrict the application of the negative pressure to the chest walls by 
designing a chamber that goes only around the chest. However, this has not been successful because 
obtaining a seal around the chest wall (Fig. 82.1) is difficult. 

Negative-pressure ventilators also made the patient less accessible for patient care and monitoring. 
Further, synchronization of the machine cycle with the patient's effort was has been difficult and they 
are also typically noisy and bulky [McPherson, & Spearman 1990]. These deficiencies of the negative- 
pressure ventilators have led to the development of the positive-pressure ventilators. 

82.3 Positive-Pressure Ventilators 

Positive-pressure ventilators generate the inspiratory flow by applying a positive pressure (greater than 
the atmospheric pressure) to the airways. Figure 82.2 shows a simplified block diagram of a positive- 
pressure ventilator. During inspiration, the inspiratory flow delivery system creates a positive pressure 
in the tubes connected to the patient airway, called patient circuit, and the exhalation control system 
closes a valve at the outlet of the tubing to the atmosphere. When the ventilator switches to exhalation, 
the inspiratory flow delivery system stops the positive pressure and the exhalation system opens the valve 
to allow the patient's exhaled breath to flow to the atmosphere. The use of a positive pressure gradient 
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Controller 




FIGURE 82.2 A simplified diagram of the functional blocks of a positive-pressure ventilator. 

in creating the flow allows treatment of patients with high lung resistance and low compliance. As a 
result, positive-pressure ventilators have been very successful in treating a variety of breathing disorders 
and have become more popular than negative-pressure ventilators. 

Positive-pressure ventilators have been employed to treat patients ranging from neonates to adults. 
Due to anatomical differences between various patient populations, the ventilators and their modes of 
treating infants are different than those for adults. Nonetheless, their fundamental design principles are 
similar and adult ventilators comprise a larger percentage of ventilators manufactured and used in clinics. 
Therefore, the emphasis here is on the description of adult positive-pressure ventilators. Also, the concepts 
presented will be illustrated using a microprocessor-based design example, as almost all modern venti- 
lators use microprocessor instrumentation. 

82.4 Ventilation Modes 

Since the advent of respirators, clinicians have devised a variety of strategies to ventilate the lungs based 
on patient conditions. For instance, some patients need the respirator to completely take over the task 
of ventilating their lungs. In this case, the ventilator operates in mandatory mode and delivers mandatory 
breaths. On the other hand, some patients are able to initiate a breath and breathe on their own, but 
may need oxygen-enriched air flow or slightly elevated airway pressure. When a ventilator assists a patient 
who is capable of demanding a breath, the ventilator delivers spontaneous breaths and operates in 
spontaneous mode. In many cases, it is first necessary to treat the patient with mandatory ventilation 
and as the patient's condition improves spontaneous ventilation is introduced; it is used primarily to 
wean the patient from mandatory breathing. 

Mandatory Ventilation 

Designers of adult ventilators have employed two rather distinct approaches for delivering mandatory 
breaths: volume controlled ventilation and pressure controlled ventilation. Volume controlled ventila- 
tion, which presently is more popular, refers to delivering a specified tidal volume to the patient during 
the inspiratory phase. Pressure controlled ventilation, however, refers to raising the airway pressure to a 
level, set by the therapist, during the inspiratory phase of each breath. Regardless of the type, a ventilator 
operating in mandatory mode must control all aspects of breathing such as tidal volume, respiration 
rate, inspiratory flow pattern, and oxygen concentration of the breath. This is often labeled as controlled 
mandatory ventilation (CMV). 

Figure 82.3 shows the flow and pressure waveforms for a volume controlled ventilation (CMV). In 
this illustration, the inspiratory flow waveform is chosen to be a half sinewave. In Fig. 82.3a, t i is the 
inspiration duration, t e is the exhalation period, and Q, is the amplitude of inspiratory flow. The ventilator 
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FIGURE 82.3 (a) Inspiratory flow for a controlled mandatory volume controlled ventilation breath, (b) airway 
pressure resulting from the breath delivery with a non-zero PEEP. 



delivers a tidal volume equal to the area under the flow waveform in Fig. 82.3a at regular intervals (f ; + 
t e ) set by the therapist. The resulting pressure waveform is shown in Fig. 82.3b. It is noted that during 
volume controlled ventilation, the ventilator delivers the same volume irrespective of the patient's respi- 
ratory mechanics. However, the resulting pressure waveform such as the one shown in Fig. 82.3b, will 
be different among patients. Of course, for safety purposes, the ventilator limits the maximum applied 
airway pressure according to the therapist's setting. 

As can be seen in Fig. 82.3b, the airway pressure at the end of exhalation may not end at atmospheric 
pressure (zero gauge). The positive end expiratory pressure (PEEP) is sometimes used to keep the alveoli 
from collapsing during expiration [Norwood, 1990]. In other cases, the expiration pressure is allowed 
to return to the atmospheric level. 

Figure 82.4 shows a plot of the pressure and flow during a mandatory pressure controlled ventilation. 
In this case, the respirator raises and maintains the airway pressure at the desired level independent of 
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FIGURE 82.4 (a) Inspiratory pressure pattern for a controlled mandatory pressure controlled ventilation breath, 
(b) airway flow pattern resulting from the breath delivery. Note that PEEP is zero. 
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patient airway compliance and resistance. The level of pressure during inspiration, P p is set by the 
therapist. While the ventilator maintains the same pressure trajectory for patients with different respi- 
ratory resistance and compliance, the resulting flow trajectory, shown in Fig. 82.4b, will depend on the 
respiratory mechanics of each patient. 

In the following, the presentation will focus on volume ventilators, as they are more common. Further, 
in a microprocessor-based ventilator, the mechanism for delivering mandatory volume and pressure 
controled ventilation have many similar main components. The primary difference lies in the control 
algorithms governing the delivery of breaths to the patient. 

Spontaneous Ventilation 

An important phase in providing respiratory therapy to a recovering pulmonary patient is weaning the 
patient from the respirator. As the patient recovers and gains the ability to breathe independently, the 
ventilator must allow the patient to initiate a breath and control the breath rate, flow rate, and the tidal 
volume. Ideally, when a respirator is functioning in the spontaneous mode, it should let the patient take 
breaths with the same ease as breathing from the atmosphere. This, however, is difficult to achieve because 
the respirator does not have an infinite gas supply or an instantaneous response. In practice, the patient 
generally has to exert more effort to breathe spontaneously on a respirator than from the atmosphere. 
However, patient effort is reduced as the ventilator response speed increases [McPherson, 1990]. Spon- 
taneous ventilation is often used in conjunction with mandatory ventilation since the patient may still 
need breaths that are delivered entirely by the ventilator. Alternatively, when a patient can breathe 
completely on his own but needs oxygen-enriched breath or elevated airway pressure, spontaneous 
ventilation alone may be used. 

As in the case of mandatory ventilation, several modes of spontaneous ventilation have been devised 
by therapists. Two of the most important and popular spontaneous breath delivery modes are described 
below. 

Continuous Positive Airway Pressure (CPAP) in Spontaneous Mode 

In this mode, the ventilator maintains a positive pressure at the airway as the patient attempts to inspire. 
Figure 82.5 illustrates a typical airway pressure waveform during CPAP breath delivery. The therapist sets 
the sensitivity level lower than PEEP. When the patient attempts to breathe, the pressure drops below 
the sensitivity level and the ventilator responds by supplying breathable gases to raise the pressure back 
to the PEEP level. Typically, the PEEP and sensitivity levels are selected such that the patient will be 
impelled to exert effort to breathe independently. As in the case of the mandatory mode, when the patient 
exhales the ventilator shuts off the flow of gas and opens the exhalation valve to allow the exhaled gases 
to flow into the atmosphere. 

Pressure Support in Spontaneous Mode 

This mode is similar to the CPAP mode with the exception that during the inspiration the ventilator 
attempts to maintain the patient airway pressure at a level above PEEP. In fact, CPAP may be considered 
a special case of pressure support ventilation in which the support level is fixed at the atmospheric level. 
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FIGURE 82.5 Airway pressure during a CPAP spontaneous breath delivery. 
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FIGURE 82.6 Airway pressure during a pressure support spontaneous breath delivery. 

Figure 82.6 shows a typical airway pressure waveform during the delivery of a pressure support breath. 
In this mode, when the patient's airway pressure drops below the therapist-set sensitivity line, the 
ventilator inspiratory breath delivery system raises the airway pressure to the pressure support level 
(>PEEP), selected by the therapist. The ventilator stops the flow of breathable gases when the patient 
starts to exhale and controls the exhalation valve to achieve the set PEEP level. 

82.5 Breath Delivery Control 

Figure 82.7 shows a simplified block diagram for delivering mandatory or spontaneous ventilation. 
Compressed air and oxygen are normally stored in high pressure tanks (S1400 kPa) that are attached to 
the inlets of the ventilator. In some ventilators, an air compressor is used in place of a compressed air 
tank. Manufacturers of mechanical respirators have designed a variety of blending and metering devices 
[McPherson, 1990]. The primary mission of the device is to enrich the inspiratory air flow with the 
proper level of oxygen and to deliver a tidal volume according to the therapist's specifications. With the 
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introduction of microprocessors for control of metering devices, electromechanical valves have gained 
popularity [Puritan-Bennett, 1987]. In Fig. 82.7, the air and oxygen valves are placed in closed feedback 
loops with the air and oxygen flow sensors. The microprocessor controls each the valves to deliver the 
desired inspiratory air and oxygen flows for mandatory and spontaneous ventilation. During inhalation, 
the exhalation valve is closed to direct all the delivered flows to the lungs. When exhalation starts, the 
microprocessor actuates the exhalation valve to achieve the desired PEEP level. The airway pressure 
sensor, shown on the right side of Fig. 82.7, generates the feedback signal necessary for maintaining the 
desired PEEP (in both mandatory and spontaneous modes) and airway pressure support level during 
spontaneous breath delivery. 

Mandatory Volume Controlled Inspiratory Flow Delivery 

In a microprocessor-controlled ventilator (Fig. 82.7), the electronically actuated valves open from a closed 
position to allow the flow of blended gases to the patient. The control of flow through each valve depends 
on the therapist's specification for the mandatory breath. That is, the clinician must specify the following 
parameters for delivery of CMV breaths: (1) respiration rate; (2) flow waveform; (3) tidal volume; 
(4) oxygen concentration (of the delivered breath); (5) peak flow; and (6) PEEP, as shown in the lower 
left side of Fig. 82.7. It is noted that the PEEP selected by the therapist in the mandatory mode is only 
used for control of exhalation flow; that will be described in the following section. The microprocessor 
utilizes the first five of the above parameters to compute the total desired inspiratory flow trajectory. To 
illustrate this point, consider the delivery of a tidal volume using a half sinewave as shown in Fig. 82.3. 
If the therapist selects a tidal volume of V, (L), a respiration rate of n breaths per minute (bpm), the 
amplitude of the respirator flow, Q,(L/s), then the total desired inspiratory flow, Q d {t), for a single breath, 
can be computed from the following equation: 



Qd(t)-- 



Q sin — o<t<t. 
' t. ' (82.1) 



t.<t<t 

i e 

where f, signifies the duration of inspiration and is computed from the following relationship: 



V 
£. = —*-. (82.2) 

' 2Q 



The duration of expiration in seconds is obtained from 



60 -t r (82.3) 



The ratio of inspiratory to expiratory periods of a mandatory breath is often used for adjusting the 
respiration rate. This ratio is represented by I:E (ratio) and is computed as follows. First, the inspiratory 
and expiratory periods are normalized with respect to f ; . Hence, the normalized inspiratory period 
becomes unity and the normalized expiratory period is given by R = tjt- r Then, the I:E ratio is simply 
expressed as 1:R. 

To obtain the desired oxygen concentration in the delivered breath, the microprocessor computes the 
discrete form of Q d (t) as Q d (k) where k signifies the fcth sample interval. Then, the total desired flow, 
Q d (k), is partitioned using the following relationships: 



S) 2000 by CRC Press LLC 



, , (l-m)Q d (k) 
Q ^ k r L_\ (82 " 4) 



and 



(l-c) 

where k signifies the sample interval, Q da {k)is the desired air flow (the subscript da stands for desired 
air), Q dx {k) is the desired oxygen flow (the subscript dx stands for desired oxygen), m is the desired 
oxygen concentration, and c is the oxygen concentration of the ventilator air supply. 

A number of control design strategies may be appropriate for the control of the air and oxygen flow 
delivery valves. A simple controller is the proportional plus integral controller that can be readily 
implemented in a microprocessor. For example, the controller for the air valve has the following form: 



where E(k) and A(k) are given by 



I^=K p E^+KA(k) (82.6) 

E(k)=Q da (k)-Q sa (k) (82.7) 

A(fc)=A(fc-l) + £(fc) (82.8) 

where I(k) is the input (voltage or current) to the air valve at the Ath sampling interval, E(k) is the error 
in the delivered flow, Q dl ,(k) is the desired air flow, Q sa {k) is the sensed or actual air flow (the subscript 
sa stands for sensed air flow), A(k) is the integral (rectangular integration) part of the controller, and K„ 
and K t are the controller proportionality constants. It is noted that the above equations are applicable to 
the control of either the air or oxygen valve. For control of the oxygen flow valve, Q dx (k) replaces Q da (k) 
and Q sx {k) replaces Q M (fc) where Q ix (k) represents the sensed oxygen flow (the subscript sx stands for 
sensed oxygen flow). 

The control structure shown in Fig. 82.7 provides the flexibility of quickly adjusting the percentage of 
oxygen in the enriched breath gases. That is, the controller can regulate both the total flow and the 
percent oxygen delivered to the patient. Since the internal volume of the flow control valve is usually 
small (< 50 ml), the desired change in the oxygen concentration of the delivered flow can be achieved 
within one inspiratory period. In actual clinical applications, rapid change of percent oxygen from one 
breath to another is often desirable, as it reduces the waiting time for the delivery of the desired oxygen 
concentration. A design similar to the one shown in Fig. 82.7 has been successfully implemented in a 
microprocessor-based ventilator [Behbehani, 1984] and is deployed in hospitals around the world. 

Pressure Controlled Inspiratory Flow Delivery 

The therapist entry for pressure-controlled ventilation is shown in Fig. 82.7 (lower left-hand side). In 
contrast to the volume-controlled ventilation where Q d (t) was computed directly from the operator's 
entry [Eq. (82.1) through Eq. (82.23)], the total desired flow is generated by a closed loop controller 
labeled as Airway Pressure Controller in Fig. 82.7. This controller uses the therapist-selected inspiratory 
pressure, respiration rate, and the I:E ratio to compute the desired inspiratory pressure trajectory. The 

e 2000 by CRC Press LLC 



trajectory serves as the controller reference input. The controller then computes the flow necessary to 
make the actual airway pressure track the reference input. Assuming a proportional-plus-integral con- 
troller, the governing equations are 

Qj( k ) =c p E p( k ) +c i A p( k ) (82 - 9) 

where Q d is the computed desired flow, C. and C, are the proportionality constants, k represents the 
sample interval, and EJk) and A{k) are computed using the following equations: 

E M)= p «( k )- p { k ) (82 - l0) 

A p( k )= A p( k - i ) +E p( k ) ( 82 - n ) 

where EJk) is the difference between the desired pressure trajectory, P d (k), and the sensed airway pressure, 
P s {k), the parameter A (k) represents the integral portion of the controller. Using Q d from Eq. (82.9), the 
control of air and O, valves is accomplished in the same manner as in the case of volume-controlled 
ventilation described earlier [Eq. (82.4) through Eq. (82.8)]. 

Expiratory Pressure Control in Mandatory Mode 

It is often desirable to keep the patient's lungs inflated at the end of expiration at a pressure greater than 
atmospheric level [Norwood, 1990]. That is, rather than allowing the lungs to deflate during the exha- 
lation, the controller closes the exhalation valve when the airway pressure reaches the PEEP level. When 
expiration starts, the ventilator terminates flow to the lungs; hence, the regulation of the airway pressure 
is achieved by controlling the flow of patient exhaled gases through the exhalation valve. 

In a microprocessor-based ventilator, an electronically actuated valve can be employed that has ade- 
quate dynamic response (=20 ms rise time) to regulate PEEP. For this purpose, the pressure in the patient 
breath delivery circuit is measured using a pressure transducer (Fig. 82.7). The microprocessor will 
initially open the exhalation valve completely to minimize resistance to expiratory flow. At the same time, 
it will sample the pressure transducer's output and start to close the exhalation valve as the pressure 
begins to approach the desired PEEP level. Since the patient's exhaled flow is the only source of pressure, 
if the airway pressure drops below PEEP, it cannot be brought back up until the next inspiratory period. 
Hence, an overrun (i.e., a drop to below PEEP) in the closed-loop control of PEEP cannot be tolerated. 

Spontaneous Breath Delivery Control 

The small diameter (=5 mm) pressure sensing tube, shown on the right side of Fig. 82.7, pneumatically 
transmits the pneumatic pressure signal from the patient airway to a pressure transducer placed in the 
ventilator. The output of the pressure transducer is amplified, filtered, and then sampled by the micro- 
processor. The controller receives the therapist's inputs regarding the spontaneous breath characteristics 
such as the PEEP, sensitivity, and oxygen concentration, as shown on the lower right-hand side of Fig. 82.7. 
The desired airway pressure is computed from the therapist entries of PEEP, pressure support level, and 
sensitivity. The multiple-loop control structure shown in Fig. 82.7 is used to deliver a CPAP or a pressure 
support breath. The sensed proximal airway pressure is compared with the desired airway pressure. The 
airway pressure controller computes the total inspiratory flow level required to raise the airway pressure 
to the desired level. This flow level serves as the reference input or total desired flow for the flow control 
loop. Hence, in general, the desired total flow trajectory for the spontaneous breath delivery may be 
different for each inspiratory cycle. If the operator has specified oxygen concentration greater than 2 1.6% 
(the atmospheric air oxygen concentration of the ventilator air supply), the controller will partition the 
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total required flow into the air and oxygen flow rates using Eqs. (82.4) and (82.5). The flow controller 
then uses the feedback signals from air and oxygen flow sensors and actuates the air and oxygen valves 
to deliver the desired flows. 

For a microprocessor-based ventilator, the control algorithm for regulating the airway pressure can 
also be a proportional plus integral controller [Behbehani, 1984; Behbehani and Watanabe, 1986]. In 
this case, the governing equations are identical to Eqs. (82.9) through (82.11). 

If a non-zero PEEP level is specified, the same control strategy as the one described for mandatory 
breath delivery can be used to achieve the desired PEEP. 

82.6 Summary 

Today's mechanical ventilators can be broadly classified into negative-pressure and positive-pressure 
ventilators. Negative-pressure ventilators do not offer the flexibility and convenience that positive-pres- 
sure ventilators provide; hence, they have not been very popular in clinical use. Positive-pressure venti- 
lators have been quite successful in treating patients with pulmonary disorders. These ventilators operate 
in either mandatory or spontaneous mode. When delivering mandatory breaths, the ventilator controls 
all parameters of the breath such as tidal volume, inspiratory flow waveform, respiration rate, and oxygen 
content of the breath. Mandatory breaths are normally delivered to the patients that are incapable of 
breathing on their own. In contrast, spontaneous breath delivery refers to the case where the ventilator 
responds to the patient's effort to breathe independently. Therefore, the patient can control the volume 
and the rate of the respiration. The therapist selects the oxygen content and the pressure at which the 
breath is delivered. Spontaneous breath delivery is typically used for patients who are on their way to 
full recovery, but are not completely ready to breathe from the atmosphere without mechanical assistance. 

Defining Terms 

Continuous positive airway pressure (CPAP): A spontaneous ventilation mode in which the ventilator 

maintains a constant positive pressure, near or below PEEP level, in the patient's airway while the 

patient breathes at will. 
I:E ratio: The ratio of normalized inspiratory interval to normalized expiratory interval of a mandatory 

breath. Both intervals are normalized with respect to the inspiratory period. Hence, the normalized 

inspiratory period is always unity. 
Mandatory mode: A mode of mechanically ventilating the lungs where the ventilator controls all breath 

delivery parameters such as tidal volume, respiration rate, flow waveform, etc. 
Patient circuit: A set of tubes connecting the patient airway to the outlet of a respirator. 
Positive end expiratory pressure (PEEP): A therapist-selected pressure level for the patient airway at 

the end of expiration in either mandatory or spontaneous breathing. 
Pressure controlled ventilation: A mandatory mode of ventilation where during the inspiration phase 

of each breath, a constant pressure is applied to the patient's airway independent of the patient's 

airway resistance and/or compliance. respiratory mechanics. 
Pressure support: A spontaneous breath delivery mode during which the ventilator applies a positive 

pressure greater than PEEP to the patient's airway during inspiration. 
Pressure support level: Refers to the pressure level, above PEEP, that the ventilator maintains during 

the spontaneous inspiration. 
Spontaneous mode: A ventilation mode in which the patient initiates and breathes from the ventilator- 
supplied gas at will. 
Volume controlled ventilation: A mandatory mode of ventilation where the volume of each breath is 

set by the therapist and the ventilator delivers that volume to the patient independent of the patient's 

airway resistance and/or compliance. respiratory mechanics. 
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Summary 



The circulatory system is the body's primary pathway for both the distribution of oxygen and other 
nutrients and the removal of carbon dioxide and other waste products. Since the entire blood supply in 
a healthy adult completely circulates within 60 seconds, substances introduced into the circulatory system 
are distributed rapidly. Thus intravenous (IV) and intraarterial access routes provide an effective pathway 
for the delivery of fluid, blood, and medicants to a patient's vital organs. Consequently, about 80% of 
hospitalized patients receive infusion therapy. Peripheral and central veins are used for the majority of 
infusions. Umbilical artery delivery (in neonates), enteral delivery of nutrients, and epidural delivery of 
anesthetics and analgesics comprise smaller patient populations. A variety of devices can be used to 
provide flow through an intravenous catheter. An intravenous delivery system typically consists of three 
major components: (1) fluid or drug reservoir, (2) catheter system for transferring the fluid or drug from 
the reservoir into the vasculature through a venipuncture, and (3) device for regulation and/or generating 
flow (see Fig. 83.1). 

This chapter is separated into five sections. The first describes the clinical needs associated with 
intravenous drug delivery that determine device performance criteria. The second section reviews the 
principles of flow through a tube; the third section introduces the underlying electromechanical principles 
for flow regulation and/or generation and their ability to meet the clinical performance criteria. The 
fourth section reviews complications associated with intravenous therapy, and the fifth section concludes 
with a short list of articles providing more detailed information. 

83.1 Performance Criteria for Intravenous Infusion Devices 

The intravenous pathway provides an excellent route for continuous drug therapy. The ideal delivery 
system regulates drug concentration in the body to achieve and maintain a desired result. When the 
drug's effect cannot be monitored directly, it is frequently assumed that a specific blood concentration 
or infusion rate will achieve the therapeutic objective. Although underinfusion may not provide sufficient 
therapy, overinfusion can produce even more serious toxic side effects. 
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FIGURE 83.1 A typical IV infusion system. 

The therapeutic range and risks associated with under- and overinfusion are highly drug and patient 
dependent. Intravenous delivery of fluids and electrolytes often does not require very accurate regulation. 
Low-risk patients can generally tolerate well infusion rate variability of ±30% for fluids. In some situa- 
tions, however, specifically for fluid-restricted patients, prolonged under- or overinfusion of fluids can 
compromise the patient's cardiovascular and renal systems. 

The infusion of many drugs, especially potent cardioactive agents, requires high accuracy. For example, 
post-coronary-artery-bypass-graft patients commonly receive sodium nitroprusside to lower arterial 
blood pressure. Hypertension, associated with underinfusion, subjects the graft sutures to higher stress 
with an increased risk for internal bleeding. Hypotension associated with overinfusion can compromise 
the cardiovascular state of the patient. Nitroprusside's potency, short onset delay, and short half-life 
(30-180 s) provide for very tight control, enabling the clinician to quickly respond to the many events 
that alter the patient's arterial pressure. The fast response of drugs such as nitroprusside creates a need 
for short-term flow uniformity as well as long-term accuracy. 

The British Department of Health employs Trumpet curves in their Health Equipment Information 
reports to compare flow uniformity of infusion pumps. For a prescribed flow rate, the trumpet curve is 
the plot of the maximum and minimum measured percentage flow rate error as a function of the 
accumulation interval (Fig. 83.2). Flow is measured gravimetrically in 30-s blocks for 1 hour. These blocks 
are summed to produce 120-s, 300-s, and other longer total accumulation intervals. Though the 120-s 
window may not detect flow variations important in delivery of the fastest acting agents, the trumpet 
curve provides a helpful means for performance comparison among infusion devices. Additional statis- 
tical information such as standard deviations may be derived from the basic trumpet flow measurements. 

The short half-life of certain pharmacologic agents and the clotting reaction time of blood during 
periods of stagnant flow require that fluid flow be maintained without significant interruption. Specifi- 
cally, concern has been expressed in the literature that the infusion of sodium nitroprusside and other 
short half-life drugs occur without interruption exceeding 20 s. Thus, minimization of false alarms and 
rapid detection of occlusions are important aspects of maintaining a constant vascular concentration. 
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FIGURE 83.2 Trumpet curve for several representative large volume infusion pumps operated a 5 mL/hr. Note that 
peristaltic pumps were designed for low risk patients. 

Accidental occlusions of the IV line due to improper positioning of stopcocks or clamps, kinked tubing, 
and clotted catheters are common. 

Occlusions between pump and patient present a secondary complication in maintaining serum drug 
concentration. Until detected, the pump will infuse, storing fluid in the delivery set. When the occlusion 
is eliminated, the stored volume is delivered to the patient in a bolus. With concentrated pharmaceutic 
agents, this bolus can produce a large perturbation in the patient's status. 

Occlusions of the pump intake also interrupt delivery. If detection is delayed, inadequate flow can 
result. During an intake occlusion, in some pump designs removal of the delivery set can produce abrupt 
aspiration of blood. This event may precipitate clotting and cause injury to the infusion site. 

The common practice of delivering multiple drugs through a single venous access port produces an 
additional challenge to maintaining uniform drug concentration. Although some mixing will occur in 
the venous access catheter, fluid in the catheter more closely resembles a first-in/first-out digital queue: 
During delivery, drugs from the various infusion devices mix at the catheter input, an equivalent fluid 
volume discharges from the outlet. Rate changes and flow nonuniformity cause the mass flow of drugs 
at the outlet to differ from those at the input. Consider a venous access catheter with a volume of 2 mL 
and a total flow of 10 mL/hr. Due to the digital queue phenomenon, an incremental change in the intake 
flow rate of an individual drug will not appear at the output for 12 min. In addition, changing flow rates 
for one drug will cause short-term marked swings in the delivery rate of drugs using the same access 
catheter. When the delay becomes significantly larger than the time constant for a drug that is titrated 
to a measurable patient response, titration becomes extremely difficult leading to large oscillations. 

As discussed, the performance requirements for drug delivery vary with multiple factors: drug, fluid 
restriction, and patient risk. Thus the delivery of potent agents to fluid-restricted patients at risk require 
the highest performance standards defined by flow rate accuracy, flow rate uniformity, and ability to 
minimize risk of IV-site complications. These performance requirements need to be appropriately bal- 
anced with the device cost and the impact on clinician productivity. 

83.2 Flow Through an IV Delivery System 

The physical properties associated with the flow of fluids through cylindrical tubes provide the foundation 
for understanding flow through a catheter into the vasculature. Hagen-Poiseuille's equation for laminar 
flow of a Newtonian fluid through a rigid tube states 

8-T| ■ L 

where Q is the flow; P, and P 2 are the pressures at the inlet and outlet of the tube, respectively; L and r 
are the length and internal radius of the tube, respectively; and r\ is fluid viscosity. Although many drug 
delivery systems do not strictly meet the flow conditions for precise application of the laminar flow 
equation, it does provide insight into the relationship between flow and pressure in a catheter. The fluid 
analog of Ohms law describes the resistance to flow under constant flow conditions 
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TABLE 83.1 Resistance Measurements for Catheter Components Used for 
Infusion 







Flow Resistance, 






Fluid Ohm, 


Component 


Length, cm 


mmHg/(L/hr) 


Standard administration set 


91-213 


4.3-5.3 


Extension tube for CVP monitoring 


15 


15.5 


19-gauge epidural catheter 


91 


290.4-497.1 


18-gauge needle 


6-9 


14.1-17.9 


23-gauge needle 


2.5-9 


165.2-344.0 


25-gauge needle 


1.5-4.0 


525.1-1412.0 


Vicra Quick-Cath Catheter 18-gauge 


5 


12.9 


Extension set with 0.22 micron air-eliminating filter 




623.0 


0.2 micron filter 




555.0 



Note: Mean values are presented over a range of infusions (100, 200, and 300 mL/hr) 
and sample size (n= 10). 



P-P 



Thus, resistance to flow through a tube correlates directly with catheter length and fluid viscosity and 
inversely with the fourth power of catheter diameter. For steady flow, the delivery system can be modeled 
as a series of resistors representing each component, including administration set, access catheter, and 
circulatory system. When dynamic aspects of the delivery system are considered, a more detailed model 
including catheter and venous compliance, fluid inertia, and turbulent flow is required. Flow resistance 
maybe defined with units of mmHg/(L/hr),so that 1 fluid ohm = 4.8 x KT 11 Pas/m 3 . Studies determining 
flow resistance for several catheter components with distilled water for flow rates of 100, 200, and 
300 mL/hr appear in Table 83.1. 

83.3 Intravenous Infusion Devices 

From Hagen-Poiselluie's equation, two general approaches to intravenous infusion become apparent. 
First, a hydrostatic pressure gradient can be used with adjustment of delivery system resistance controlling 
flow rate. Complications such as partial obstructions result in reduced flow which may be detected by 
an automatic flow monitor. Second, a constant displacement flow source can be used. Now complications 
may be detected by monitoring elevated fluid pressure and/or flow resistance. At the risk of overgener- 
alization, the relative strengths of each approach will be presented. 



Gravity Flow/Resistance Regulation 

The simplest means for providing regulated flow employs gravity as the driving force with a roller clamp 
as controlled resistance. Placement of the fluid reservoir 60-100 cm above the patient's right atrium 
provides a hydrostatic pressure gradient P h equal to 1.34 mmHg per cm of elevation. The modest 
physiologic mean pressure in the veins, P„ minimally reduces the net hydrostatic pressure gradient. The 
equation for flow becomes 



Q- 



h y__ 

R ,+R 

mfr n 



where R mfr and R n are the resistance to flow through the mechanical flow regulator and the remainder of 
the delivery system, respectively. Replacing the variables with representative values for an infusion of 5% 
saline solution into a healthy adult at 100 mL/hr yields 
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FIGURE 83.3 Drift in flow rate (mean ± standard deviation) over a four-hour period for three mechanical flow 
regulators at initial flow rates of 10, 60, and 120 mL/hr with distilled water at constant hydrostatic pressure gradient. 



100ml/hr = 
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Gravity flow cannot be used for arterial infusions since the higher vascular pressure exceeds available 
hydrostatic pressure. 

Flow stability in a gravity infusion system is subject to variations in hydrostatic and venous pressure 
as well as catheter resistance. However, the most important factor is the change in flow regulator resistance 
caused by viscoelastic creep of the tubing wall (see Fig. 83.3). Caution must be used in assuming that a 
preset flow regulator setting will accurately provide a predetermined rate. The clinician typically estimates 
flow rate by counting the frequency of drops falling through an in-line drip-forming chamber, adjusting 
the clamp to obtain the desired drop rate. The cross-sectional area of the drip chamber orifice is the 
major determinant of drop volume. Various manufacturers provide minidrip sets designed for pediatric 
(e.g., 60 drops/mL) and regular sets designed for adult (10-20 drops/mL) patients. Tolerances on the 
drip chamber can cause a 3% error in minidrip sets and a 17% error in regular sets at 125 mL/hr flow 
rate with 5% dextrose in water. Mean drop size for rapid rates increased by as much as 25% over the 
size of drops which form slowly. In addition, variation in the specific gravity and surface tension of fluids 
can provide an additional large source of drop size variability. 

Some mechanical flow regulating devices incorporate the principle of a Starling resistor. In a Starling 
device, resistance is proportional to hydrostatic pressure gradient. Thus, the device provides a negative 
feedback mechanism to reduce flow variation as the available pressure gradient changes with time. 

Mechanical flow regulators comprise the largest segment of intravenous infusion systems, providing 
the simplest means of operation. Patient transport is simple, since these devices require no electric power. 
Mechanical flow regulators are most useful where the patient is not fluid restricted and the acceptable 
therapeutic rate range of the drug is relatively wide with minimal risk of serious adverse sequelae. The 
most common use for these systems is the administration of fluids and electrolytes. 

Volumetric Infusion Pumps 

Active pumping infusion devices combine electronics with a mechanism to generate flow. These devices 
have higher performance standards than simple gravity flow regulators. The Association for the Advance- 
ment of Medical Instrumentation (AAMI) recommends that long-term rate accuracy for infusion pumps 
remain within ±10% of the set rate for general infusion and, for the more demanding applications, that 
long-term flow remain within ±5%. Such requirements typically extend to those agents with narrow 
therapeutic indices and/or low flow rates, such as the neonatal population or other fluid-restricted 
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patients. The British Department of Health has established three main categories for hospital-based 
infusion devices: neonatal infusions, high-risk infusions, and low-risk infusions. Infusion control for 
neonates requires the highest performance standards, because their size severely restricts fluid volume. 
A fourth category, ambulatory infusion, pertains to pumps worn by patients. 

Controllers 

These devices automate the process of adjusting the mechanical flow regulator. The most common 
controllers utilize sensors to count the number of drops passing through the drip chamber to provide 
flow feedback for automatic rate adjustment. Flow rate accuracy remains limited by the rate and viscosity 
dependence of drop size. Delivery set motion associated with ambulation and improper angulation of 
the drip chamber can also hinder accurate rate detection. 

An alternative to the drop counter is a volumetric metering chamber. A McGaw Corporation controller 
delivery set uses a rigid chamber divided by a flexible membrane. Instrument-controlled valves allow fluid 
to fill one chamber from the fluid reservoir, displacing the membrane driving the fluid from the second 
chamber toward the patient. When inlet and outlet valves reverse state, the second chamber is filled while 
the first chamber delivers to the patient. The frequency of state change determines the average flow rate. 
Volumetric accuracy demands primarily on the dimensional tolerances of the chamber. Although volu- 
metric controllers may provide greater accuracy than drop-counting controllers, their disposables are 
inherently more complex, and maximum flow is still limited by head height and system resistance. 

Beyond improvements in flow rate accuracy, controllers should provide an added level of patient safety 
by quickly detecting IV-site complications. The IVAC Corporation has developed a series of controllers 
employing pulsed modulated flow providing for monitoring of flow resistance as well as improved 
accuracy. 

The maximum flow rate achieved by gravimetric based infusion systems can become limited by R n 
and by concurrent infusion from other sources through the same catheter. In drop-counting devices, 
flow rate uniformity suffers at low flow rates from the discrete nature of the drop detector. 

In contrast with infusion controllers, pumps generate flow by mechanized displacement of the contents 
of a volumetric chamber. Typical designs provide high flow rate accuracy and uniformity for a wide rate 
range (0.1-1000.0 mL/hr) of infusion rates. Rate error correlates directly with effective chamber volume, 
which, in turn, depends on both instrument and disposable repeatability, precision, and stability under 
varying load. Stepper or servo-controlled dc motors are typically used to provide the driving force for 
the fluid. At low flow rates, dc motors usually operate in a discrete stepping mode. On average, each step 
propels a small quanta of fluid toward the patient. Flow rate uniformity therefore is a function of both 
the average volume per quanta and the variation in volume. Mechanism factors influencing rate unifor- 
mity include: stepping resolution, gearing and activator geometries, volumetric chamber coupling geom- 
etry, and chamber elasticity. When the quanta volume is not inherently uniform over the mechanism's 
cycle, software control has been used to compensate for the variation. 

Syringe Pumps 

These pumps employ a syringe as both reservoir and volumetric pumping chamber. A precision leadscrew 
is used to produce constant linear advancement of the syringe plunger. Except for those ambulatory 
systems that utilize specific microsyringes, pumps generally accept syringes ranging in size from 
5-100 mL. Flow rate accuracy and uniformity are determined by both mechanism displacement charac- 
teristics and tolerance on the internal syringe diameter. Since syringe mechanisms can generate a specified 
linear travel with less than 1% error, the manufacturing tolerance on the internal cross-sectional area of 
the syringe largely determines flow rate accuracy. Although syringes can be manufactured to tighter 
tolerances, standard plastic syringes provide long-term accuracy of ±5%. Flow rate uniformity, however, 
can benefit from the ability to select syringe size (see Fig. 83.4). Since many syringes have similar stroke 
length, diameter variation provides control of volume. Also the linear advancement per step is typically 



(D 2000 by CRC Press LLC 




Observation Window (minutes) 

FIGURE 83.4 Effect of syringe type on Trumpet curve of a syringe pump at 1 mL/hr. 

fixed. Therefore selection of a lower-volume syringe provides smaller-volume quanta. This allows 
tradeoffs among drug concentration, flow rate, and duration of flow per syringe. Slack in the gear train 
and drive shaft coupling as well as plunger slip cause rate inaccuracies during the initial stages of delivery 
(see Fig. 83.5a). 

Since the syringe volumes are typically much smaller than reservoirs used with other infusion devices, 
syringe pumps generally deliver drugs in either fluid-restricted environments or for short duration. With 
high-quality syringes, flow rate uniformity in syringe pumps is generally superior to that accomplished 
by other infusion pumps. With the drug 
reservoir enclosed within the device, 
syringe pumps manage patient transport 
well, including the operating room envi- 
ronment. 

Cassette pumps conceptually mimic the 
piston type action of the syringe pump but 
provide an automated means of repeatedly 
emptying and refilling the cassette. The 
process of refilling the cassette in single pis- 
ton devices requires an interruption in flow 
(see Fig. 83.5b). The length of interruption 
relative to the drug's half-life determines 
the impact of the refill period on hemody- 
namic stability. To eliminate the interrup- 
tion caused by refill, dual piston devices 
alternate refill and delivery states, provid- 
ing nearly continuous output. Others 
implement cassettes with very small vol- 
umes which can refill in less than a second 
(see Fig. 83.2). Tight control of the internal 
cross-sectional area of the pumping cham- 
ber provides exceptional flow rate accuracy. 
Manufacturers have recently developed 
remarkably small cassette pumps that can 
still generate the full spectrum of infusion 
rate (0.1-999.0 mL/hr). These systems 
combine pumping chamber, inlet and out- 
let valving, pressure sensing, and air detec- 
tion into a single complex component. 
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FIGURE 83.5 Continuous flow pattern for a representative, (a) 
syringe, (b) cassette, and (c) linear peristaltic pump at 10 mL/hr. 
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FIGURE 83.6 Impact of 5 variables on flow rate accuracy in 4 different infusion pumps. Variables tested included 
Solution: Distilled water and 25% dextrose in water; Back Pressure: -100 mmHg and 300 mmHg; Pumping Segment 
Filling Pressure: -30 inches of water and +30 inches of water; Temperature: 10°C and 40°C; and Infusion Rate: 
5 mL/hr and 500 mL/hr. Note: First and second peristaltic mechanism qualified for Low Risk patients, while the third 
peristaltic device qualified for high-risk patients. 

Peristaltic pumps operate on a short segment of the IV tubing. Peristaltic pumps can be separated 
into two subtypes. Rotary peristaltic mechanisms operate by compressing the pumping segment against 
the rotor housing with rollers mounted on the housing. With rotation, the rollers push fluid from the 
container through the tubing toward the patient. At least one of the rollers completely occludes the tubing 
against the housing at all times precluding free flow from the reservoir to the patient. During a portion 
of the revolution, two rollers trap fluid in the intervening pumping segment. The captured volume 
between the rollers determines volumetric accuracy. Linear peristaltic pumps hold the pumping segment 
in a channel pressed against a rigid backing plate. An array of cam-driven actuators sequentially occlude 
the segment starting with the section nearest the reservoir forcing fluid toward the patient with a 
sinusoidal wave action. In a typical design using uniform motor step intervals, a characteristic flow wave 
resembling a positively biased sine wave is produced (see Fig. 83.5c). 

Infusion pumps provide significant advantages over both manual flow regulators and controllers in 
several categories. Infusion pumps can provide accurate delivery over a wide range of infusion rates 
(0. 1-999.0 mL/hr). Neither elevated system resistance nor distal line pressure limit the maximum infusion 
rate. Infusion pumps can support a wider range of applications including arterial infusions, spinal and 
epidural infusions, and infusions into pulmonary artery or central venous catheters. Flow rate accuracy 
of infusion pumps is highly dependent on the segment employed as the pumping chamber (see Fig. 83.2). 
Incorporating special syringes or pumping segments can significantly improve flow rate accuracy (see 
Fig. 83.6). Both manufacturing tolerances and segment material composition significantly dictate flow 
rate accuracy. Time- and temperature-related properties of the pumping segment further impact long- 
term drift in flow rate. 

83.4 Managing Occlusions of the Delivery System 

One of the most common problems in managing an IV delivery system is the rapid detection of occlusion 
in the delivery system. With a complete occlusion, the resistance to flow approaches infinity. In this 
condition, gravimetric-based devices cease to generate flow. Mechanical flow regulators have no mech- 
anism for adverse event detection and thus must rely on the clinician to identify an occlusion as part of 
routine patient care. Electronic controllers sense the absence of flow and alarm in response to their 
inability to sustain the desired flow rate. 

The problem of rapidly detecting an occlusion in an infusion pump is more complex. Upstream 
occlusions that occur between the fluid reservoir and the pumping mechanism impact the system quite 
differently than downstream occlusions which occur between the pump and the patient. When an 
occlusion occurs downstream from an infusion pump, the pump continues to propel fluid into the section 
of tubing between the pump and the occlusion. The time rate of pressure rise in that section increases 
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in direct proportion to flow rate and inversely with tubing compliance (compliance, C, is the volume 
increase in a closed tube per mmHg pressure applied). The most common approach to detecting down- 
stream occlusion requires a pressure transducer immediately below the pumping mechanism. These 
devices generate an alarm when either the mean pressure or rate of change in pressure exceeds a threshold. 
For pressure-limited designs, the time to downstream alarm (TTA) may be estimated as 



n 'irm at 

flow rate 



mrri . alarm delivery-set 



Using a representative tubing compliance of 1 uL/mmHg, flow rate of 1 mL/hr, and a fixed alarm threshold 
set of 500 mmHg, the time to alarm becomes 

500 „ • lOOOml/mmHg 

TTA = "^ T-i = 30 min 

1 mL/hr 

where TTA is the time from occlusion to alarm detection. Pressure-based detection algorithms depend 
on accuracy and stability of the sensing system. Lowering the threshold on absolute or relative pressure 
for occlusion alarm reduces the TTA, but at the cost of increasing the likelihood of false alarms. Patient 
movement, patient-to-pump height variations, and other clinical circumstances can cause wide pertur- 
bations in line pressure. To optimize the balance between fast TTA and minimal false alarms, some 
infusion pumps allow the alarm threshold to be set by the clinician or be automatically shifted upward 
in response to alarms; other pumps attempt to optimize performance by varying pressure alarm thresh- 
olds with flow rate. 

A second approach to detection of downstream occlusions uses motor torque as an indirect measure 
of the load seen by the pumping mechanism. Although this approach eliminates the need for a pressure 
sensor, it introduces additional sources for error including friction in the gear mechanism or pumping 
mechanism that requires additional safety margins to protect against false alarms. In syringe pumps, 
where the coefficient of static friction of the syringe bunge (rubber end of the syringe plunger) against 
the syringe wall can be substantial, occlusion detection can exceed 1 hr at low flow rates. 

Direct, continuous measurement of downstream flow resistance may provide a monitoring modality 
which overcomes the disadvantages of pressure-based alarm systems, especially at low infusion rates. 
Such a monitoring system would have the added advantage of performance unaffected by flow rate, 
hydrostatic pressure variations, and motion artifacts. 

Upstream occlusions can cause large negative pressures as the pumping mechanism generates a vacuum 
on the upstream tubing segment. The tube may collapse and the vacuum may pull air through the tubing 
walls or form cavitation bubbles. A pressure sensor situated above the mechanism or a pressure sensor 
below the mechanism synchronized with filling of the pumping chamber can detect the vacuum associated 
with an upstream occlusion. Optical or ultrasound transducers, situated below the mechanism, can detect 
air bubbles in the catheter, and air-eliminating filters can remove air, preventing large air emboli from 
being introduced into the patient. 

Some of the most serious complications of IV therapy occur at the venipuncture site; these include 
extravasation, postinfusion phlebitis (and thrombophlebitis), IV-related infections, ecchymosis, and 
hematomas. Other problems that do not occur as frequently include speed shock and allergic reactions. 

Extravasation (or infiltration) is the inadvertent perfusion of infusate into the interstitial tissue. 
Reported percentage of patients to whom extravasation has occurred ranges from 10% to over 25%. 
Tissue damage does not occur frequently, but the consequences can be severe, including skin necrosis 
requiring significant plastic and reconstructive surgery and amputation of limbs. The frequency of 
extravasation injury correlates with age, state of consciousness, and venous circulation of the patient as 
well as the type, location, and placement of the intravenous cannula. Drugs that have high osmolality, 
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vessicant properties, or the ability to induce ischemia correlate with frequency of extravasation injury. 
Neonatal and pediatric patients who possess limited communication skills, constantly move, and have 
small veins that are difficult to cannulate require superior vigilance to protect against extravasation. 

Since interstitial tissue provides a greater resistance to fluid flow than the venous pathway, infusion 
devices with accurate and precise pressure monitoring systems have been used to detect small pressure 
increases due to extravasation. To successfully implement this technique requires diligence by the clinician, 
since patient movement, flow rate, catheter resistance, and venous pressure variations can obscure the 
small pressure variations resulting from the extravasation. Others have investigated the ability of a 
pumping mechanism to withdraw blood as indicative of problems in a patent line. The catheter tip, 
however, may be partially in and out of the vein such that infiltration occurs yet blood can be withdrawn 
from the patient. A vein might also collapse under negative pressure in a patent line without successful 
blood withdrawal. Techniques currently being investigated which monitor infusion impedance (resistance 
and compliance) show promise for assisting in the detection of extravasation. 

When a catheter tip wedges into the internal lining of the vein wall, it is considered positional. With 
the fluid path restricted by the vein wall, increases in line resistance may indicate a positional catheter. 
With patient movement, for example wrist flexation, the catheter may move in and out of the positional 
state. Since a positional catheter is thought to be more prone toward extravasation than other catheters, 
early detection of a positional catheter and appropriate adjustment of catheter position may be helpful 
in reducing the frequency of extravasation. 

Postinfusion phlebitis is acute inflammation of a vein used for IV infusion. The chief characteristic is 
a reddened area or red streak that follows the course of the vein with tenderness, warmth, and edema at 
the venipuncture site. The vein, which normally is very compliant, also hardens. Phlebitis positively 
correlates with infusion rate and with the infusion of vesicants. 

Fluid overload and speed shock result from the accidental administration of a large fluid volume over 
a short interval. Speed shock associates more frequently with the delivery of potent medications, rather 
than fluids. These problems most commonly occur with manually regulated IV systems, which do not 
provide the safety features of instrumented lines. Many IV sets designed for instrumented operation will 
free-flow when the set is removed from the instrument without manual clamping. To protect against this 
possibility, some sets are automatically placed in the occluded state on disengagement. Although an 
apparent advantage, reliance on such automatic devices may create a false sense of security and lead to 
manual errors with sets not incorporating these features. 

83.5 Summary 

Intravenous infusion has become the mode of choice for delivery of a large class of fluids and drugs both 
in hospital and alternative care settings. Modern infusion devices provide the clinician with a wide array 
of choices for performing intravenous therapy. Selection of the appropriate device for a specified appli- 
cation requires understanding of drug pharmacology and pharmacokinetics, fluid mechanics, and device 
design and performance characteristics. Continuing improvements in performance, safety, and cost of 
these systems will allow even broader utilization of intravenous delivery in a variety of settings. 
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Peter Glass provides a strong rationale for intravenous therapy including pharmacokinetic and pharma- 
codynamic bases for continuous delivery. Clinical complications around intravenous therapy are well 
summarized by MacCara [1983] and Bohony [1993]. The AAMI Standard for Infusion Devices provides 
a comprehensive means of evaluating infusion device technology, and the British Department of Health 
OHEI Report #198 provides a competitive analysis of pumps and controllers. 
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The intent of this chapter is to provide an introduction to the practice of anesthesiology and to the 
technology currently employed. Limitations on the length of this work and the enormous size of the 
topic require that this chapter rely on other elements within this Handbook and other texts cited as 
general references for many of the details that inquisitive minds desire and deserve. 

The practice of anesthesia includes more than just providing relief from pain. In fact, pain relief can 
be considered a secondary facet of the specialty. In actuality, the modern concept of the safe and efficacious 
delivery of anesthesia requires consideration of three fundamental tenets, which are ordered here by 
relative importance: 

1. maintenance of vital organ function 

2. relief of pain 

3. maintenance of the "internal milieu" 

The first, maintenance of vital organ function, is concerned with preventing damage to cells and organ 
systems that could result from inadequate supply of oxygen and other nutrients. The delivery of blood 
and cellular substrates is often referred to as perfusion of the cells or tissues. During the delivery of an 
anesthetic, the patient's "vital signs" are monitored in an attempt to prevent inadequate tissue perfusion. 
However, the surgery itself, the patient's existing pathophysiology, drugs given for the relief of pain, or 
even the management of blood pressure may compromise tissue perfusion. Why is adequate perfusion 
of tissues a higher priority than providing relief of pain for which anesthesia is named? A rather obvious 
extreme example is that without cerebral perfusion, or perfusion of the spinal cord, delivery of an 
anesthetic is not necessary. Damage to other organ systems may result in a range of complications from 
delaying the patient's recovery to diminishing their quality of life to premature death. 
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In other words, the primary purpose of anesthesia care is to maintain adequate delivery of required 
substrates to each organ and cell, which will hopefully preserve cellular function. The second principle 
of anesthesia is to relieve the pain caused by surgery. Chronic pain and suffering caused by many disease 
states is now managed by a relatively new sub-specialty within anesthesia, called Pain Management. 

The third principle of anesthesia is the maintenance of the internal environment of the body, for 
example, the regulation of electrolytes (sodium, potassium, chloride, magnesium, calcium, etc.), acid- 
base balance, and a host of supporting functions on which cellular function and organ system commu- 
nications rest. 

The person delivering anesthesia maybe an Anesthesiologist (physician specializing in anesthesiology), 
an Anesthesiology Physician Assistant (a person trained in a medical school at the masters level to 
administer anesthesia as a member of the care team lead by an Anesthesiologist), or a nurse anesthetist 
(a nurse with Intensive Care Unit experience that has additional training in anesthesia provided by 
advanced practice nursing programs). There are three major categories of anesthesia provided to patients: 
(1) general anesthesia; (2) conduction anesthesia; and (3) monitored anesthesia care. General anesthesia 
typically includes the intravenous injection of anesthetic drugs that render the patient unconscious and 
paralyze their skeletal muscles. Immediately following drug administration a plastic tube is inserted into 
the trachea and the patient is connected to an electropneumatic system to maintain ventilation of the 
lungs. A liquid anesthetic agent is vaporized and administered by inhalation, sometimes along with 
nitrous oxide, to maintain anesthesia for the surgical procedure. Often, other intravenous agents are used 
in conjunction with the inhalation agents to provide what is called a balanced anesthetic. 

Conduction anesthesia refers to blocking the conduction of pain and possibly motor nerve impulses 
traveling along specific nerves or the spinal cord. Common forms of conduction anesthesia include spinal 
and epidural anesthesia, as well as specific nerve blocks, for example, axillary nerve blocks. In order to 
achieve a successful conduction anesthetic, local anesthetic agents such as lidocaine, are injected into the 
proximity of specific nerves to block the conduction of electrical impulses. In addition, sedation may be 
provided intravenously to keep the patient comfortable while he/she is lying still for the surgery. 

Monitored anesthesia care refers to monitoring the patient's vital signs while administering sedatives 
and analgesics to keep the patient comfortable, and treating complications related to the surgical proce- 
dure. Typically, the surgeon administers topical or local anesthetics to alleviate the pain. 

In order to provide the range of support required, from the paralyzed mechanically ventilated patient 
to the patient receiving monitored anesthesia care, a versatile anesthesia delivery system must be available 
to the anesthesia care team. Today's anesthesia delivery system is composed of six major elements: 

1. The primary and secondary sources of gases (0 2 , air, N 2 0, vacuum, gas scavenging, and possibly 
C0 2 and helium). 

2. The gas blending and vaporization system. 

3. The breathing circuit (including methods for manual and mechanical ventilation). 

4. The excess gas scavenging system that minimizes potential pollution of the operating room by 
anesthetic gases. 

5. Instruments and equipment to monitor the function of the anesthesia delivery system. 

6. Patient monitoring instrumentation and equipment. 

The traditional anesthesia machine incorporated elements 1, 2, 3, and more recently 4. The evolution 
to the anesthesia delivery system adds elements 5 and 6. In the text that follows, references to the "anesthesia 
machine" refer to the basic gas delivery system and breathing circuit as contrasted with the "anesthesia 
delivery system" which includes the basic "anesthesia machine" and all monitoring instrumentation. 

84.1 Gases Used During Anesthesia and their Sources 

Most inhaled anesthetic agents are liquids that are vaporized in a device within the anesthesia delivery 
system. The vaporized agents are then blended with other breathing gases before flowing into the 
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breathing circuit and being administered to the patient. The most commonly administered form of 
anesthesia is called a balanced general anesthetic, and is a combination of inhalation agent plus intrave- 
nous analgesic drugs. Intravenous drugs often require electromechanical devices to administer an appro- 
priately controlled flow of drug to the patient. 

Gases needed for the delivery of anesthesia are generally limited to oxygen (0 2 ), air, nitrous oxide 
(N 2 0), and possibly helium (He) and carbon dioxide (C0 2 ). Vacuum and gas scavenging lines are also 
required. There needs to be secondary sources of these gases in the event of primary failure or questionable 
contamination. Typically, primary sources are those supplied from a hospital distribution system at 
345 kPa (50 psig) through gas columns or wall outlets. The secondary sources of gas are cylinders hung 
on yokes on the anesthesia delivery system. 

Oxygen 

Oxygen provides an essential metabolic substrate for all human cells, but it is not without dangerous 
side effects. Prolonged exposure to high concentrations of oxygen may result in toxic effects within the 
lungs that decrease diffusion of gas into and out of the blood, and the return to breathing air following 
prolonged exposure to elevated 2 may result in a debilitating explosive blood vessel growth in infants. 
Oxygen is usually supplied to the hospital in liquid form (boiling point of-183°C), stored in cryogenic 
tanks, and supplied to the hospital piping system as a gas. The efficiency of liquid storage is obvious 
since 1 liter of liquid becomes 860 liters of gas at standard temperature and pressure. The secondary 
source of oxygen within an anesthesia delivery system is usually one or more E cylinders filled with 
gaseous oxygen at a pressure of 15.2 MPa (2200 psig). 

Air (78% N 2 , 21% 2 , 0.9% Ar, 0.1% Other Gases) 

The primary use of air during anesthesia is as a diluent to decrease the inspired oxygen concentration. 
The typical primary source of medical air (there is an important distinction between "air" and "medical 
air" related to the quality and the requirements for periodic testing) is a special compressor that avoids 
hydrocarbon based lubricants for purposes of medical air purity. Dryers are employed to rid the com- 
pressed air of water prior to distribution throughout the hospital. Medical facilities with limited need 
for medical air may use banks of H cylinders of dry medical air. A secondary source of air may be available 
on the anesthesia machine as an E cylinder containing dry gas at 15.2 MPa. 

Nitrous Oxide 

Nitrous oxide is a colorless, odorless, and non-irritating gas that does not support human life. Breathing 
more than 85% N 2 may be fatal. N 2 is not an anesthetic (except under hyperbaric conditions), rather 
it is an analgesic and an amnestic. There are many reasons for administering N 2 during the course of 
an anesthetic including: enhancing the speed of induction and emergence from anesthesia; decreasing 
the concentration requirements of potent inhalation anesthetics (i.e., halothane, isoflurane, etc.); and as 
an essential adjunct to narcotic analgesics. N 2 is supplied to anesthetizing locations from banks of 
H cylinders that are filled with 90% liquid at a pressure of 5.1 MPa (745 psig). Secondary supplies are 
available on the anesthesia machine in the form of E cylinders, again containing 90% liquid. Continual 
exposure to low levels of N 2 in the workplace has been implicated in a number of medical problems 
including spontaneous abortion, infertility, birth defects, cancer, liver and kidney disease, and others. 
Although there is no conclusive evidence to support most of these implications, there is a recognized 
need to scavenge all waste anesthetic gases and periodically sample N 2 levels in the workplace to maintain 
the lowest possible levels consistent with reasonable risk to the operating room personnel and cost to 
the institution [Dorsch and Dorsch, 1998] . Another gas with analgesic properties similar to N 2 is xenon, 
but its use is experimental, and its cost is prohibitive at this time. 
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TABLE 84.1 Physical Properties of Gases Used During Anesthesia 







Density 


Viscosity 


Specific Heat 


GAS 


Molecular Wt. 


(g/L) 


(cp) 


(KJ/Kg°C) 


Oxygen 


31.999 


1.326 


0.0203 


0.917 


Nitrogen 


28.013 


1.161 


0.0175 


1.040 


Aii- 


28.975 


1.200 


0.0181 


1.010 


Nitrous oxide 


44.013 


1.836 


0.0144 


0.839 


Carbon dioxide 


44.01 


1.835 


0.0148 


0.850 


Helium 


4.003 


0.1657 


0.0194 


5.190 



TABLE 84.2 Physical Properties of Currently Available Volatile Anesthetic Agents 



Agent 


Boiling Point 


Vapor Pressure 


Liquid Density 


MAC* 


Generic Name 


(°C at 760 mmHg) 


(mmHg at 


20°C) 


(g/ml) 


(%) 


Halothane 


50.2 


243 




1.86 


0.75 


Enflurane 


56.5 


175 




1.517 


1.68 


Isoflurane 


48.5 


238 




1.496 


1.15 


Desflurane 


23.5 


664 




1.45 


6.0 


Sevoflurane 


58.5 


160 




1.51 


2.0 



* Minimum Alveolar Concentration is the percent of the agent required to provide 
surgical anesthesia to 50% of the population in terms of a cummulative dose response 
curve. The lower the MAC, the more potent the agent. 

Carbon Dioxide 

Carbon dioxide is colorless and odorless, but very irritating to breathe in higher concentrations. C0 2 is 
a byproduct of human cellular metabolism and is not a life-sustaining gas. C0 2 influences many physi- 
ologic processes either directly or through the action of hydrogen ions by the reaction C0 2 + H,0 <-> 
H 2 C0 3 <-> H + + HC0 3 ~. Although not very common in the U.S. today, in the past C0 2 was administered 
during anesthesia to stimulate respiration that was depressed by anesthetic agents and to cause increased 
blood flow in otherwise compromised vasculature during some surgical procedures. Like N 2 0, C0 2 is 
supplied as a liquid in H cylinders for distribution in pipeline systems or as a liquid in E cylinders that 
are located on the anesthesia machine. 

Helium 

Helium is a colorless, odorless, and non-irritating gas that will not support life. The primary use of 
helium in anesthesia is to enhance gas flow through small orifices as in asthma, airway trauma, or tracheal 
stenosis. The viscosity of helium is not different from other anesthetic gases (refer to Table 84.1) and is 
therefore of no benefit when airway flow is laminar. However, in the event that ventilation must be 
performed through abnormally narrow orifices or tubes which create turbulent flow conditions, helium 
is the preferred carrier gas. Resistance to turbulent flow is proportional to the density rather than viscosity 
of the gas and helium is an order of magnitude less dense than other gases. A secondary advantage of 
helium is that it has a large specific heat relative to other anesthetic gases and therefore can carry the 
heat from laser surgery out of the airway more effectively than air, oxygen, or nitrous oxide. 

84.2 Gas Blending and Vaporization System 

The basic anesthesia machine utilizes primary low pressure gas sources of 345 kPa (50 psig) available 
from wall or ceiling column outlets, and secondary high pressure gas sources located on the machine as 
pictured schematically in Fig. 84. 1 . Tracing the path of oxygen in the machine demonstrates that oxygen 
comes from either the low pressure source, or from the 15.2 Mpa (2200 psig) high pressure yokes via 
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FIGURE 84.1 Schematic diagram of gas piping within a simple two-gas (oxygen and nitrous oxide) anesthesia 
machine. 

cylinder pressure regulators and then branches to service several other functions. First and foremost, the 
second stage pressure regulator drops the 2 pressure to approximately 110 kPa (16 psig) before it enters 
the needle valve and the rotameter type flowmeter. From the flowmeter 2 mixes with gases from other 
flowmeters and passes through a calibrated agent vaporizer where specific inhalation anesthetic agents 
are vaporized and added to the breathing gas mixture. Oxygen is also used to supply a reservoir canister 
that sounds a reed alarm in the event that the oxygen pressure drops below 172 kPa (25 psig). When the 
oxygen pressure drops to 172 kPa or lower, then the nitrous oxide pressure sensor shutoff valve closes 
and N 2 is prevented from entering its needle valve and flowmeter and is therefore eliminated from the 
breathing gas mixture. In fact, all machines built in the U.S. have pressure sensor shutoff valves installed 
in the lines to every flowmeter, except oxygen, to prevent the delivery of a hypoxic gas mixture in the 
event of an oxygen pressure failure. Oxygen may also be delivered to the common gas outlet or machine 
outlet via a momentary normally closed flush valve that typically provides a flow of 65 to 80 liters of 2 
per minute directly into the breathing circuit. Newer machines are required to have a safety system for 
limiting the minimum concentration of oxygen that can be delivered to the patient to 25%. The flow 
paths for nitrous oxide and other gases are much simpler in the sense that after coming from the high 
pressure regulator or the low pressure hospital source, gas is immediately presented to the pressure sensor 
shutoff valve from where it travels to its specific needle valve and flowmeter to join the common gas line 
and enter the breathing circuit. 
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FIGURE 84.2 Schematic diagram of a calibrated in-line vaporizer that uses the flow-over technique for adding 
anesthetic vapor to the breathing gas mixture. 

Currently all anesthesia machines manufactured in the U.S. use only calibrated flow-through vapor- 
izers, meaning that all of the gases from the various flowmeters are mixed in the manifold prior to 
entering the vaporizer. Any given vaporizer has a calibrated control knob that, once set to the desired 
concentration for a specific agent, will deliver that concentration to the patient. Some form of interlock 
system must be provided such that only one vaporizer may be activated at any given time. Figure 84.2 
schematically illustrates the operation of a purely mechanical vaporizer with temperature compensation. 
This simple flow-over design permits a fraction of the total gas flow to pass into the vaporizing chamber 
where it becomes saturated with vapor before being added back to the total gas flow. Mathematically 
this is approximated by: 



Q 



p * 



* p 

VC JA 



(Qvc + Qg)" 



P *0 

1 A ^G 



where F A is the fractional concentration of agent at the outlet of the vaporizer, Q G is the total flow of gas 
entering the vaporizer, Q vc is the amount of Q G that is diverted into the vaporization chamber, P A is the 
vapor pressure of the agent, and P B is the barometric pressure. 

From Fig. 84.2, the temperature compensator would decrease Q vc as temperature increased because 
vapor pressure is proportional to temperature. The concentration accuracy over a range of clinically 
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expected gas flows and temperatures is approximately ± 15%. Since vaporization is an endothermic 
process, anesthetic vaporizers must have sufficient thermal mass and conductivity to permit the vapor- 
ization process to proceed independent of the rate at which the agent is being used. 

84.3 Breathing Circuits 

The concept behind an effective breathing circuit is to provide an adequate volume of a controlled 
concentration of gas to the patient during inspiration, and to carry the exhaled gases away from the 
patient during exhalation. There are several forms of breathing circuits which can be classified into 2 
basic types; (1) open circuit, meaning no rebreathing of any gases and no C0 2 absorber present; and 
(2) closed circuit, indicating presence of C0 2 absorber and some rebreathing of other gases. Figure 84.3 
illustrates the Lack modification of a Mapleson open circuit breathing system. There are no valves and 
no C0 2 absorber. There is a great potential for the patient to rebreath their own exhaled gases unless the 
fresh gas inflow is 2 to 3 times the patient's minute volume. Figure 84.4 illustrates the most popular form 
of breathing circuit, the circle system, with oxygen monitor, circle pressure gage, volume monitor (spirom- 
eter), and airway pressure sensor. The circle is a closed system, or semi-closed when the fresh gas inflow 
exceeds the patient's requirements. Excess gas evolves into the scavenging device, and some of the exhaled 
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FIGURE 84.3 An example of an open circuit breathing system that does not use unidirectional flow valves or 
contain a carbon dioxide absorbent. 
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FIGURE 84.4 A diagram of a closed circuit circle breathing system with unidirectional valves, inspired oxygen 
sensor, pressure sensor, and C0 2 absorber. 
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gas is rebreathed after having the C0 2 removed. The inspiratory and expiratory valves in the circle system 
guarantee that gas flows to the patient from the inspiratory limb and away from the patient through the 
exhalation limb. In the event of a failure of either or both of these valves, the patient will rebreath exhaled 
gas that contains C0 2 , which is a potentially dangerous situation. 

There are two forms of mechanical ventilation used during anesthesia: (1) volume ventilation, where 
the volume of gas delivered to the patient remains constant regardless of the pressure that is required; 
and (2) pressure ventilation, where the ventilator provides whatever volume to the patient that is required 
to produce some desired pressure in the breathing circuit. Volume ventilation is the most popular since 
the volume delivered remains theoretically constant despite changes in lung compliance. Pressure venti- 
lation is useful when compliance losses in the breathing circuit are high relative to the volume delivered 
to the lungs. 

Humidification is an important adjunct to the breathing circuit because it maintains the integrity of 
the cilia that line the airways and promote the removal of mucus and particulate matter from the lungs. 
Humidification of dry breathing gases can be accomplished by simple passive heat and moisture exchang- 
ers inserted into the breathing circuit at the level of the endotracheal tube connectors, or by elegant dual 
servo electronic humidifiers that heat a reservoir filled with water and also heat a wire in the gas delivery 
tube to prevent rain-out of the water before it reaches the patient. Electronic safety measures must be 
included in these active devices due to the potential for burning the patient and the fire hazard. 

84.4 Gas Scavenging Systems 

The purpose of scavenging exhaled and excess anesthetic agents is to reduce or eliminate the potential 
hazard to employees who work in the environment where anesthetics are administered, including oper- 
ating rooms, obstetrical areas, special procedures areas, physician's offices, dentist's offices, and veteri- 
narian's surgical suites. Typically more gas is administered to the breathing circuit than is required by 
the patient, resulting in the necessity to remove excess gas from the circuit. The scavenging system must 
be capable of collecting gas from all components of the breathing circuit, including adjustable pressure 
level valves, ventilators, and sample withdrawal type gas monitors, without altering characteristics of the 
circuit such as pressure or gas flow to the patient. There are two broad types of scavenging systems as 
illustrated in Fig. 84.5: the open interface is a simple design that requires a large physical space for the 
reservoir volume, and the closed interface with an expandable reservoir bag and which must include 
relief valves for handling the cases of no scavenged flow and great excess of scavenged flow. 

Trace gas analysis must be performed to guarantee the efficacy of the scavenging system. The National 
Institutes of Occupational Safety and Health (NIOSH) recommends that trace levels of nitrous oxide be 
maintained at or below 25 parts per million (ppm) time weighted average and that halogenated anesthetic 
agents remain below 2 ppm. 

84.5 Monitoring the Function of the Anesthesia 
Delivery System 

The anesthesia machine can produce a single or combination of catastrophic events, any one of which 
could be fatal to the patient: 

1. delivery of a hypoxic gas mixture to the patient; 

2. the inability to adequately ventilate the lungs by not producing positive pressure in the patient's 
lungs, by not delivering an adequate volume of gas to the lungs, or by improper breathing circuit 
connections that permit the patient's lungs to receive only rebreathed gases; 

3. the delivery of an overdose of an inhalational anesthetic agent. 

The necessary monitoring equipment to guarantee proper function of the anesthesia delivery system 
includes at least: 
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FIGURE 84.5 Examples of open and closed gas scavenger interfaces. The closed interface requires relief valves in 
the event of scavenging flow failure. 

• Inspired Oxygen Concentration monitor with absolute low level alarm of 19%. 

• Airway Pressure Monitor with alarms for: 

1. low pressure indicative of inadequate breathing volume and possible leaks 

2. sustained elevated pressures that could compromise cardiovascular function 

3. high pressures that could cause pulmonary barotrauma 

4. subatmospheric pressure that could cause collapse of the lungs 

• Exhaled Gas Volume Monitor. 

• Carbon Dioxide Monitor (capnography). 

• Inspired and Exhaled Concentration of anesthetic agents by any of the following: 

1. mass spectrometer 

2. Raman spectrometer 

3. infrared or other optical spectrometer 

A mass spectrometer is a very useful cost-effective device since it alone can provide capnography, 
inspired and exhaled concentrations of all anesthetic agents, plus all breathing gases simultaneously (0 2 , 
N 2 , C0 2 , N 2 0, Ar, He, halothane, enflurane, isoflurane, desflurane, and suprane). The mass spectrometer 
is unique in that it may be tuned to monitor an assortment of exhaled gases while the patient is asleep, 
including: (1) ketones for detection of diabetic ketoacidosis; (2) ethanol or other marker in the irrigation 
solution during transurethral resection of the prostate for early detection of the TURP syndrome, which 
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results in a severe dilution of blood electrolytes; and (3) pentanes during the evolution of a heart attack, 
to mention a few. 

Sound monitoring principles require. (1) earliest possible detection of untoward events (before they 
result in physiologic derangements); and (2) specificity that results in rapid identification and resolution 
of the problem. An extremely useful rule to always consider is "never monitor the anesthesia delivery 
system performance through the patient's physiologic responses". That is, never intentionally use a device 
like a pulse oximeter to detect a breathing circuit disconnection since the warning is very late and there 
is no specific information provided that leads to rapid resolution of the problem. 



84.6 Monitoring the Patient 



The anesthetist's responsibilities to the patient include: providing relief from pain and preserving all 
existing normal cellular function of all organ systems. Currently the latter obligation is fulfilled by 
monitoring essential physiologic parameters and correcting any substantial derangements that occur 
before they are translated into permanent cellular damage. The inadequacy of current monitoring meth- 
ods can be appreciated by realizing that most monitoring modalities only indicate damage after an insult 
has occurred, at which point the hope is that it is reversible or that further damage can be prevented. 

Standards for basic intraoperative monitoring of patients undergoing anesthesia, that were developed 
and adopted by the American Society of Anesthesiologists, became effective in 1990. Standard I concerns 
the responsibilities of anesthesia personnel, while Standard II requires that the patient's oxygenation, 
ventilation, circulation, and temperature be evaluated continually during all anesthetics. The following 
list indicates the instrumentation typically available during the administration of anesthetics. 

Electrocardiogram Non-Invasive or Invasive Blood Pressure 

Pulse Oximetry Temperature 

Urine Output Nerve Stimulators 

Cardiac Output Mixed Venous Oxygen Saturation 

Electroencephalogram (EEG) Transesophageal Echo Cardiography (TEE) 

Evoked Potentials Coagulation Status 

Blood gases and electrolytes (Po 2 , Pco 2 , pH, BE, Na + , K + , Cl~, Ca ++ , and glucose) 

Mass Spectrometry, Raman Spectrometry or Infrared Breathing Gas Analysis 

Control of Patient Temperature 

Anesthesia alters the thresholds for temperature regulation and the patient becomes unable to maintain 
normal body temperature. As the patient's temperature falls even a few degrees toward room temperature, 
several physiologic derangements occur: (1) drug action is prolonged; (2) blood coagulation is impaired; 
and (3) post-operative infection rate increases. On the positive side, cerebral protection from inadequate 
perfusion is enhanced by just a few degrees of cooling. Proper monitoring of core body temperature and 
forced hot air warming of the patient is essential. 

Monitoring the Depth of Anesthesia 

There are two very unpleasant experiences that patients may have while undergoing an inadequate 
anesthetic: (1) the patient is paralyzed and unable to communicate their state of discomfort, and they 
are feeling the pain of surgery and are aware of their surroundings; (2) the patient may be paralyzed, 
unable to communicate, and is aware of their surroundings, but is not feeling any pain. The ability to 
monitor the depth of anesthesia would provide a safeguard against these unpleasant experiences. However, 
despite numerous instruments and approaches to the problem it remains elusive. Brain stem auditory 
evoked responses have come the closest to depth of anesthesia monitoring, but it is difficult to perform, 
is expensive, and is not possible to perform during many types of surgery. A promising new technology, 
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called bi-spectral index (BIS monitoring) is purported to measure the level of patient awareness through 
multivariate analysis of a single channel of the EEC 

Anesthesia Computer-Aided Record Keeping 

Conceptually, every anesthetist desires an automated anesthesia record keeping system. Anesthesia care 
can be improved through the feedback provided by correct record keeping, but today's systems have an 
enormous overhead associated with their use when compared to standard paper record keeping. No 
doubt that automated anesthesia record keeping reduces the drudgery of routine recording of vital signs, 
but to enter drugs and drips and their dosages, fluids administered, urine output, blood loss, and other 
data requires much more time and machine interaction than the current paper system. Despite attempts 
to use every input/output device ever produced by the computer industry from keyboards to bar codes 
to voice and handwriting recognition, no solution has been found that meets wide acceptance. Tenants 
of a successful system must include: 

1. The concept of a user transparent system, which is ideally defined as requiring no communication 
between the computer and the clinician (far beyond the concept of user friendly), and therefore 
that is intuitively obvious to use even to the most casual users. 

2. Recognition of the fact that educational institutions have very different requirements from private 
practice institutions. 

3. Real time hard copy of the record produced at the site of anesthetic administration that permits 
real time editing and notation. 

4. Ability to interface with a great variety of patient and anesthesia delivery system monitors from 
various suppliers. 

5. Ability to interface with a large number of hospital information systems. 

6. Inexpensive to purchase and maintain. 

Alarms 

Vigilance is the key to effective risk management, but maintaining a vigilant state is not easy. The practice 
of anesthesia has been described as moments of shear terror connected by times of intense boredom. 
Alarms can play a significant role in redirecting one's attention during the boredom to the most important 
event regarding patient safety, but only if false alarms can be eliminated, alarms can be prioritized, and 
all alarms concerning anesthetic management can be displayed in a single clearly visible location. 

Ergonomics 

The study of ergonomics attempts to improve performance by optimizing the relationship between people 
and their work environment. Ergonomics has been defined as a discipline which investigates and applies 
information about human requirements, characteristics, abilities, and limitations to the design, devel- 
opment, and testing of equipment, systems, and jobs [Loeb, 1993]. This field of study is only in its infancy 
and examples of poor ergonomic design abound in the anesthesia workplace. 

Simulation in Anesthesia 

Complete patient simulators are hands-on realistic simulators that interface with physiologic monitoring 
equipment to simulate patient responses to equipment malfunctions, operator errors, and drug therapies. 
There are also crisis management simulators. Complex patient simulators, which are analogous to flight 
simulators, are currently being marketed for training anesthesia personnel. The intended use for these 
complex simulators is currently being debated in the sense that training people to respond in a prepro- 
grammed way to a given event may not be adequate training. 



© 2000 by CRC Press LLC 



Reliability 

The design of an anesthesia delivery system is unlike the design of most other medical devices because 
it is a life support system. As such, its core elements deserve all of the considerations of the latest fail- 
safe technologies. Too often in today's quest to apply microprocessor technology to everything, trade- 
offs are made among reliability, cost, and engineering elegance. The most widely accepted anesthesia 
machine designs continue to be based upon simple ultra-reliable mechanical systems with an absolute 
minimum of catastrophic failure modes. The replacement of needle valves and rotameters, for example, 
with microprocessor controlled electromechanical valves can only introduce new catastrophic failure 
modes. However, the inclusion of microprocessors can enhance the safety of anesthesia delivery if they 
are implemented without adding catastrophic failure modes. 
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Approximately 20 years ago the C0 2 laser was introduced into surgical practice as a tool to photothermally 
ablate, and thus to incise and to debulk, soft tissues. Subsequently, three important factors have led to 
the expanding biomedical use of laser technology, particularly in surgery. These factors are: (1) the 
increasing understanding of the wave-length selective interaction and associated effects of ultraviolet- 
infrared (UV-IR) radiation with biologic tissues, including those of acute damage and long-term healing, 
(2) the rapidly increasing availability of lasers emitting (essentially monochromatically) at those wave- 
lengths that are strongly absorbed by molecular species within tissues, and (3) the availability of both 
optical fiber and lens technologies as well as of endoscopic technologies for delivery of the laser radiation 
to the often remote internal treatment site. Fusion of these factors has led to the development of currently 
available biomedical laser systems. 

This chapter briefly reviews the current status of each of these three factors. In doing so, each of the 
following topics will be briefly discussed: 

1. The physics of the interaction and the associated effects (including clinical efforts) of UV-IR 
radiation on biologic tissues. 

2. The fundamental principles that underlie the operations and construction of all lasers. 

3. The physical properties of the optical delivery systems used with the different biomedical lasers 
for delivery of the laser beam to the treatment site. 

4. The essential physical features of those biomedical lasers currently in routine use ranging over a 
number of clinical specialties, and brief descriptions of their use. 

5. The biomedical uses of other lasers used surgically in limited scale or which are currently being 
researched for applications in surgical and diagnostic procedures and the photosensitized inacti- 
vation of cancer tumors. 
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In this review, effort is made in the text and in the last section to provide a number of key references 
and sources of information for each topic that will enable the reader's more in-depth pursuit. 

85.1 Interaction and Effects of UV-IR Laser Radiation 
on Biologic Tissues 

Electromagnetic radiation in the UV-IR spectral range propagates within biologic tissues until it is either 
scattered or absorbed. 

Scattering in Biologic Tissue 

Scattering in matter occurs only at the boundaries between regions having different optical refractive 
indices and is a process in which the energy of the radiation is conserved [Van de Hulst, 1957]. Since 
biologic tissue is structurally inhomogeneous at the microscopic scale, e.g., both subcellular and cellular 
dimensions, and at the macroscopic scale, e.g., cellular assembly (tissue) dimensions, and predominantly 
contains water, proteins, and lipids, all different chemical species, it is generally regarded as a scatterer 
of UV-IR radiation. The general result of scattering is deviation of the direction of propagation of 
radiation. The deviation is strongest when wavelength and scatterer are comparable in dimension (Mie 
scattering) and when wavelength greatly exceeds particle size (Rayleigh scattering) [Van de Hulst, 1957]. 
This dimensional relationship results in the deeper penetration into biologic tissues of those longer 
wavelengths which are not absorbed appreciably by pigments in the tissues. This results in the relative 
transparency of nonpigmented tissues over the visible and near-IR wavelength ranges. 

Absorption in Biologic Tissue 

Absorption of UV-IR radiation in matter arises from the wavelength-dependent resonant absorption of 
radiation by molecular electrons of optically absorbing molecular species [Grossweiner, 1989]. Because 
of the chemical inhomogeneity of biologic tissues, the degree of absorption of incident radiation strongly 
depends upon its wavelength. The most prevalent or concentrated UV-IR absorbing molecular species 
in biologic tissues are listed in Table 85.1 along with associated high-absorbance wavelengths. These 
species include the peptide bonds; the phenylalanine, tyrosine, and tryptophan residues of proteins, all 
of which absorb in the UV range; oxy- and deoxyhemoglobin of blood which absorb in the visible to 
near-IR range; melanin, which absorbs throughout the UV to near-IR range, which decreasing absorption 
occurring with increasing wavelength; and water, which absorbs maximally in the mid-IR range [Hale & 
Querry, 1973; Miller & Veitch, 1993; White et al., 1968]. Biomedical lasers and their emitted radiation 
wavelength values also are tabulated also in Table 85.1. The correlation between the wavelengths of 
clinically useful lasers and wavelength regions of absorption by constituents of biological tissues is evident. 
Additionally, exogenous light- absorbing chemical species may be intentionally present in tissues. These 
include: 

1. Photosensitizers, such as porphyrins, which upon excitation with UV- visible light initiate photo- 
chemical reactions which are cytotoxic to the cells of the tissue, e.g., a cancer which concentrates 
the photosensitizer relative to surrounding tissues [Spikes, 1989]. 

2. Dyes such as indocyanine green which, when dispersed in a concentrate fibrin protein gel can be 
used to localize 810 nm GaAlAs diode laser radiation and the associated heating to achieve localized 
thermal denaturation and bonding of collagen to effect joining or welding of tissue [Bass et al., 
1992; Oz et al, 1989]. 

3. Tattoo pigments including graphite (black) and black, blue, green, and red organic dyes [Fitz- 
patrick, 1994; McGillis et al, 1994]. 
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TABLE 85.1 UV-IR- Radiation-Absorbing Constituent of Biological Tissues and Biomedical Laser Wavelengths 





Tissue Type 


Optical Absorption 


Laser Type 




Constituent 


Wavelength*, nm 


Relativef Strength 


Wavelength, nm 


Proteins 


All 










Peptide bond 




<220 (r) 


+++++++ 


Art' 


193 


Amino acid 












Residues 












Tryptophan 




220-290 (r) 


+ 






Tyrosine 




220-290 (r) 


+ 






Phenylalanine 




220-2650 (r) 


+ 






Pigments 












Oxyhemoglobin 


Blood 


414 (p) 


+++ 


Ar ion 


488-514.5 




vascular 


537 (p) 


++ 


Frequency 


532 




tissues 


575 (p) 


++ 


Doubled 








970 (p) 


+ 


Nd:YAG 








(690-1100) (r) 




Diode 
Nd:YAG 


810 
1064 


Deoxyhemoglobin 


Blood 


431 (p) 


+++ 


Dye 


400-700 




vascular 


554 (p) 


++ 


Nd:YAG 


1064 




tissues 










Melanin 


Skin 


220-1000 (r) 


++++ 


Ruby 


693 


Water 


All 


2.1 (p) 


+++ 


Ho:YAG 


2100 






3.02 (p) 


+++++++ 


Er:YAG 


2940 






>2.94 (r) 


++++ 


co 2 


10,640 



* (p): Peak absorption wavelength; (r): wavelength range. 

f The number of + signs qualitatively ranks the magnitude of the optical absorbtion. 



85.2 Penetration and Effects of UV-IR Laser Radiation 
into Biologic Tissue 

Both scattering and absorption processes affect the variations of the intensity of radiation with propa- 
gation into tissues. In the absence of scattering, absorption results in an exponential decrease of radiation 
intensity described simply by Beers law [Grossweiner, 1989]. With appreciable scattering present, the 
decrease in incident intensity from the surface is no longer monotonic. A maximum in local internal 
intensity is found to be present due to efficient back-scattering, which adds to the intensity of the incoming 
beam as shown, for example, by Miller and Veitch [1993] for visible light penetrating into the skin and 
by Rastegar and coworkers [1992] for 1.064 urn Nd:YAG laser radiation penetrating into the prostate 
gland. Thus, the relative contributions of absorption and scattering of incident laser radiation will 
stipulate the depth in a tissue at which the resulting tissue effects will be present. Since the absorbed 
energy can be released in a number of different ways including thermal vibrations, fluorescence, and 
resonant electronic energy transfer according to the identity of the absorber, the effects on tissue are in 
general different. Energy release from both hemoglobin and melanin pigments and from water is by 
molecular vibrations resulting in a local temperature rise. Sufficient continued energy absorption and 
release can result in local temperature increases which, as energy input increases, result in protein 
denaturation (41-65°C), water evaporation and boiling (up to = 300°C under confining pressure of 
tissue), thermolysis of proteins, generation of gaseous decomposition products and of carbonaceous 
residue or char (> 300°C). The generation of residual char is minimized by sufficiently rapid energy input 
to support rapid gasification reactions. The clinical effect of this chain of thermal events is tissue ablation. 
Much smaller values of energy input result in coagulation of tissues due to protein denaturation. 

Energy release from excited exogenous photosensitizing dyes is via formation of free-radical species 
or energy exchange with itinerant dissolved molecular oxygen [Spikes, 1989]. Subsequent chemical 
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reactions following free-radical formation or formation of an activated or more reactive form of molecular 
oxygen following energy exchange can be toxic to cells with takeup of the photosensitizer. 

Energy release following absorption of visible (VIS) radiation by fluorescent molecular species, either 
endogenous to tissue or exogenous, is predominantly by emission of longer wavelength radiation [Lakow- 
icz, 1983]. Endogenous fluorescent species include tryptophan, tyrosine, phenylalanine, flavins, and 
metal-free porphyrins. Comparison of measured values of the intensity of fluorescence emission from 
hyperplastic (transformed precancerous) cervical cells to cancerous cervical cells with normal cervical 
epithelial cells shows a strong potential for diagnostic use in the automated diagnosis and staging of 
cervical cancer [Mahadevan et al., 1993]. 

85.3 Effects of Mid-IR Laser Radiation 

Because of the very large absorption by water of radiation with wavelength in the IR range > 2.0 urn, 
the radiation of Ho:YAG, Er: YAG, and C0 2 lasers is absorbed within a very short distance of the tissue 
surface, and scattering is essentially unimportant. Using published values of the water absorption coef- 
ficient [Hale & Querry, 1973] and assuming an 80% water content and that the decrease in intensity is 
exponential with distance, the depth in the "average" soft tissue at which the intensity has decreased to 
10% of the incident value (the optical penetration depth) is estimated to be 619, 13, and 170 micrometers, 
respectively, for Ho:YAG, Er:YAG, and C0 2 laser radiation. Thus, the absorption of radiation from these 
laser sources and thermalization of this energy results essentially in the formation of a surface heat source. 
With sufficient energy input, tissue ablation through water boiling and tissue thermolysis occur at the 
surface. Penetration of heat to underlying tissues is by diffusion alone; thus, the depth of coagulation of 
tissue below the surface region of ablation is limited by competition between thermal diffusion and the 
rate of descent of the heated surface impacted by laser radiation during ablation of tissue. Because of 
this competition, coagulation depths obtained in soft biologic tissues with use of mid-IR laser radiation 
are typically < 205-500 um, and the ability to achieve sealing of blood vessels leading to hemostatic 
("bloodless") surgery is limited [Judy et al., 1992; Schroder et al., 1987]. 

85.4 Effects of Near-IR Laser Radiation 

The 810-nm and 1064-um radiation, respectively, of the GaAlAs diode laser and Nd:YAG laser penetrate 
more deeply into biologic tissues than the radiation of longer- wavelength IR lasers. Thus, the resulting 
thermal effects arise from absorption at greater depth within tissues, and the depths of coagulation and 
degree of hemostasis achieved with these lasers tend to be greater than with the longer-wavelength IR 
lasers. For example, the optical penetration depths (10% incident intensity) for 810-nm and 1.024-um 
radiation are estimated to be =4.6 and =8.6 mm respectively in canine prostate tissue [Rastegar et al., 
1992]. Energy deposition of 3600 J from each laser onto the urethral surface of the canine prostate results 
in maximum coagulation depths of =8 and 12 mm respectively using diode and Nd:YAG lasers [Mota- 
medi et al., 1993]. Depths of optical penetration and coagulation in porcine liver, a more vascular tissue 
than prostate gland, of =2.8 and 9.6 mm, respectively, were obtained with a Nd:YAG laser beam, and of 
7 and 12 mm respectively with an 810-nm diode laser beam [Rastegar et al., 1992]. The smaller pene- 
tration depth obtained with 810-nm diode radiation in liver than in prostate gland reflects the effect of 
greater vascularity (blood content) on near-IR propagation. 

85.5 Effects of Visible-Range Laser Radiation 

Blood and vascular tissues very efficiently absorb radiation in the visible wavelength range due to the 
strong absorption of hemoglobin. This absorption underlies, for example, the use of: 
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1. The argon ion laser (488-514.5 nm) in the localized heating and thermal coagulation of the 
vascular choroid layer and adjacent retina, resulting in the anchoring of the retina in treatment 
of retinal detachment [Katoh and Peyman, 1988]. 

2. The argon ion laser (488-514.5 nm), frequency-doubled Nd:YAG laser (532 nm), and dye laser 
radiation (585 nm) in the coagulative treatment of cutaneous vascular lesions such as port wine 
stains [Mordon et al., 1993]. 

3. The argon ion (488-514.5 nm) and frequency-doubled Nd:YAG lasers (532 nm) in the ablation 
of pelvic endometrial lesions which contain brown iron-containing pigments [Keye et al., 1983]. 

Because of the large absorption by hemoglobin and iron-containing pigments, the incident laser 
radiation is essentially absorbed at the surface of the blood vessel or lesion, and the resulting thermal 
effects are essentially local [Miller and Veitch, 1993]. 

85.6 Effects of UV Laser Radiation 

Whereas exposure of tissue to IR and visible-light-range laser energy result in removal of tissue by thermal 
ablation, exposure to argon fluoride (ArF) laser radiation of 193-nm wavelength results predominantly 
in ablation of tissue initiated by a photochemical process [Garrison and Srinivasan, 1985]. This ablation 
arises from repulsive forces between like-charged regions of ionized protein molecules that result from 
ejection of molecular electrons following UV photon absorption [Garrison and Srinivasan, 1985]. Because 
the ionization and repulsive processes are extremely efficient, little of the incident laser energy escapes 
as thermal vibrational energy, and the extent of thermal coagulation damage adjacent to the site of 
incidence is very limited [Garrison and Srinivasan, 1985]. This feature and the ability to tune very finely 
the fluence emitted by the ArF laser so that micrometer depths of tissue can be removed have led to 
ongoing clinical trials to investigate the efficiency of the use of the ArF laser to selectively remove tissue 
from the surface of the human cornea for correction of short-sighted vision to eliminate the need for 
corrective eyewear [Van Saarloos and Constable, 1993]. 

85.7 Effects of Continuous and Pulsed IR- Visible Laser 
Radiation and Association Temperature Rise 

Heating following absorption of IR-visible laser radiation arises from molecular vibration during loss of 
the excitation energy and initially is manifested locally within the exposed region of tissue. If incidence 
of the laser energy is maintained for a sufficiently long time, the temperature within adjacent regions of 
biologic tissue increases due to heat diffusion. The mean squared distance <X 2 > over which appreciable 
heat diffusion and temperature rise occur during exposure time t can be described in terms of the thermal 
diffusion time x by the equation: 

'x 2 ) = xt (85.1) 



where x is defined as the ratio of the thermal conductivity to the product of the heat capacity and density. 
For soft biologic tissues x is approximately 1 x 10 3 cm 2 s _1 [Meijering et al., 1993]. Thus, with continued 
energy input, the distance over which thermal diffusion and temperature rise occurs increases. Conversely, 
with use of pulsed radiation, the distance of heat diffusion can be made very small; for example, with 
exposure to a l-(j,s pulse, the mean thermal diffusion distance is found to be approximately 0.3 urn, or 
about 3-10% of a biologic cell diameter. If the laser radiation is strongly absorbed and the ablation of 
tissues is efficient, then little energy diffuses away from the site of incidence, and lateral thermally induced 
coagulation of tissue can be minimized with pulses of short duration. The effect of limiting lateral thermal 
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damage is desirable in the cutting of cornea [Hibst et al., 1992] and sclera of the eye [Hill et al., 1993], 
and joint cartilage [Maes and Sherk, 1994], all of which are avascular (or nearly so, with cartilage), and 
the hemostasis arising from lateral tissue coagulation is not required. 

85.8 General Description and Operation of Lasers 

Lasers emit a beam of intense electromagnetic radiation that is essentially monochromatic or contains 
at most a few nearly monochromatic wavelengths and is typically only weakly divergent and easily focused 
into external optical systems. These attributes of laser radiation depend on the key phenomenon which 
underlies laser operation, that of light amplification by stimulated emission of radiation, which in turn 
gives rise to the acronym LASER. 

In practice, a laser is generally a generator of radiation. The generator is constructed by housing a 
light- emitting medium within a cavity defined by mirrors which provide feedback of emitted radiation 
through the medium. With sustained excitation of the ionic or molecular species of the medium to give 
a large density of excited energy states, the spontaneous and attendant stimulated emission of radiation 
from these states by photons of identical wavelength (a lossless process), which is amplified by feedback 
due to photon reflection by the cavity mirrors, leads to the generation of a very large photon density 
within the cavity. With one cavity mirror being partially transmissive, say 0.1 to 1%, a fraction of the 
cavity energy is emitted as an intense beam. With suitable selection of a laser medium, cavity geometry, 
and peak wavelengths of mirror reflection, the beam is also essentially monochromatic and very nearly 
collimated. 

Identity of the lasing molecular species or laser medium fixes the output wavelength of the laser. Laser 
media range from gases within a tubular cavity, organic dye molecules dissolved in a flowing inert liquid 
carrier and heat sink, to impurity-doped transparent crystalline rods (solid state lasers) and semicon- 
ducting diode junctions [Lengyel, 1971]. The different physical properties of these media in part deter- 
mine the methods used to excite them into lasing states. 

Gas-filled, or gas lasers are typically excited by dc or rf electric current. The current either ionizes and 
excites the lasing gas, e.g., argon, to give the electronically excited and lasing Ar+ ion, or ionizes a gaseous 
species in a mixture also containing the lasing species, e.g., N 2 , which by efficient energy transfer excites 
the lasing molecular vibrational states of the C0 2 molecule. 

Dye lasers and so-called solid-state lasers are typically excited by intense light from either another laser 
or from a flash lamp. The excitation light wavelength range is selected to ensure efficient excitation at 
the absorption wavelength of the lasing species. Both excitation and output can be continuous, or the 
use of a pulsed flashlamp or pulsed exciting laser to pump a solid-state or dye laser gives pulsed output 
with high peak power and short pulse duration of 1 (is to 1 ms. Repeated excitation gives a train of 
pulses. Additionally, pulses of higher peak power and shorter duration of approximately 10 ns can be 
obtained from solid lasers by intracavity Q-switching [Lengyel, 1971]. In this method, the density of 
excited states is transiently greatly increased by impeding the path between the totally reflecting and 
partially transmitting mirror of the cavity interrupting the stimulated emission process. Upon rapid 
removal of the impeding device (a beam-interrupting or -deflecting device), stimulated emission of the 
very large population of excited lasing states leads to emission of an intense laser pulse. The process can 
give single pulses or can be repeated to give a pulse train with repetition frequencies typically ranging 
from 1 Hz to 1kHz. 

Gallium aluminum (GaAlAs) lasers are, as are all semiconducting diode lasers, excited by electrical 
current which creates excited hole-electron pairs in the vicinity of the diode junction. Those carrier pairs 
are the lasing species which emit spontaneously and with photon stimulation. The beam emerges parallel 
to the function with the plane of the function forming the cavity and thin-layer surface mirrors providing 
reflection. Use of continuous or pulsed excitation current results in continuous or pulsed output. 
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85.9 Biomedical Laser Beam Delivery Systems 

Beam delivery systems for biomedical lasers guide the laser beam from the output mirror to the site of 
action on tissue. Beam powers of up to 100 W are transmitted routinely. All biomedical lasers incorporate 
a coaxial aiming beam, typically from a HeNe laser (632.8 nm) to illuminate the site of incidence on tissue. 

Usually, the systems incorporate two different beam-guiding methods, either (1) a flexible fused silica 
(Si0 2 ) optical fiber or light guide, generally available currently for laser beam wavelengths between 
=400 nm and —2.1 u.m, where Si0 2 is essentially transparent and (2) an articulated arm having beam- 
guiding mirrors for wavelengths greater than circa 2.1 um (e.g., C0 2 lasers), for the EnYAG and for 
pulsed lasers having peak power outputs capable of causing damage to optical fiber surfaces due to 
ionization by the intense electric field (e.g., pulsed ruby). The arm comprises straight tubular sections 
articulated together with high-quality power-handling dielectric mirrors at each articulation junction to 
guide the beam through each of the sections. Fused silica optical fibers usually are limited to a length of 
1-3 m and to wavelengths in the visible-to-low midrange IR (<2.1 urn), because longer wavelengths of 
IR radiation are absorbed by water impurities (<2.9 urn) and by the SiO, lattice itself (wavelengths > 5 
urn), as described by Levi [1980]. 

Since the flexibility, small diameter, and small mechanical inertia of optical fibers allow their use in 
either flexible or rigid endoscopes and offer significantly less inertia to hand movement, fibers for use at 
longer IR wavelengths are desired by clinicians. Currently, researchers are evaluating optical fiber materials 
transparent to longer IR wavelengths. Material systems showing promise are fused A1 2 3 fibers in short 
lengths for use with near-3-micrometer radiation of the Er:YAG laser and Ag halide fibers in short lengths 
for use with the C0 2 laser emitting at 10.6 urn [Merberg, 1993]. A flexible hollow Teflon waveguide 
1.6 mm in diameter having a thin metal film overlain by a dielectric layer has been reported recently to 
transmit 10.6 urn C0 2 radiation with attenuation of 1.3 and 1.65 dB/m for straight and bent (5-mm 
radius, 90-degree bend) sections, respectively [Gannot et al., 1994]. 

Optical Fiber Transmission Characteristics 

Guiding of the emitted laser beam along the optical fiber, typically of uniform circular cross-section, is 
due to total internal reflection of the radiation at the interface between the wall of the optical fiber core 
and the cladding material having refractive index n x less than that of the core n 2 [Levi, 1980]. Total 
internal reflection occurs for any angle of incidence of the propagating beam with the wall of the fiber 
core such that 9 > 8,. where 



sin 



«, 



(85.2) 



or in terms of the complementary angle a r 
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For a focused input beam with apical angle a m incident upon the flat face of the fiber as shown in 
Fig. 85.1, total internal reflection and beam guidance within the fiber core will occur [Levi, 1980] for 



NA=sin(a m /2): 
where NA is the numerical aperture of the fiber. 



n\ - n\ 



(85.4) 
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cladding 



FIGURE 85.1 Critical reflection and propagation within an optical fiber. 

This relationship ensures that the critical angle of incidence of the interface is not exceeded and that 
total internal reflection occurs [Levi, 1980]. Typical values of NA for fused Si0 2 fibers with polymer 
cladding are in the range of 0.36-0.40. The typical values of a,„ = 14 degrees used to insert the beam of 
the biomedical laser into the fiber is much smaller than those values ( = 21-23 degrees) corresponding 
to typical NA values. The maximum value of the propagation angle a typically used in biomedical laser 
systems is =4.8 degrees. 

Leakage of radiation at the core-cladding interface of the fused Si0 2 fiber is negligible, typically being 
=0.3 dB/m at 400 nm and 0.01 dB/m at 1.064 urn. Bends along the fiber length always decrease the angle 
of the incidence at the core cladding interface. Bends do not give appreciable losses for values of the 
bending radius sufficiently large that the angle of incidence 8 of the propagating beam in the bent core 
does not becomes less than Q c at the core-cladding interface [Levi, 1980]. The relationship given by Levi 
[1980] between the bending radius r b , the fiber core radius r , the ratio {n 2 ln^) of fiber core to cladding 
refractive indices, and the propagation angle a in Fig. 85.1 which ensures that the beam does not escape is 



n 1-p 

-J- > — — cos ot 

n. 



1+p 



(85.5) 



where p = (r /r t ). The inequality will hold for all a < a c provided that 



« L < 1 ZP 



1 + p 



(85.6) 



Thus, the critical bending radius r bc is the value of r b such that Eq. (85.6) is an equality. Use of Eq. (85.6) 
predicts that bends with radii > 12, 18, and 30 mm, respectively, will not result in appreciable beam 
leakage from fibers having 400-, 600-, and 1000-micron diameter cores, which are typical in biomedical 
use. Thus, use of fibers in flexible endoscopes usually does not compromise beam guidance. 

Because the integrity of the core-cladding interface is critical to beam guiding, the clad fiber is encased 
typically in a tough but flexible protective fluoropolymer buffer coat. 

Mirrored Articulated Arm Characteristics 

Typically two or three relatively long tubular sections or arms of 50-80 cm length make up the portion 
of the articulated arm that extends from the laser output fixturing to the handpiece, endoscope, or 
operating microscope stage used to position the laser beam onto the tissue proper. Mirrors placed at the 
articulation of the arms and within the articulated handpiece, laparoscope, or operating microscope stage 
maintain the centration of the trajectory of the laser beam along the length of the delivery system. 
Dielectric multilayer mirrors [Levi, 1980] routinely are used in articulated devices. Their low high 
reflectivity < 99.9 + % and power-handling capabilities ensure efficient power transmission down the 
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arm. Mirrors in articulated devices typically are held in kinetically adjustable mounts for rapid stable 
alignment to maintain beam concentration. 

Optics for Beam Shaping on Tissues 

Since the rate of heating on tissue, and hence rates of ablation and coagulation, depends directly on 
energy input per unit volume of tissue, selection of ablation and coagulation rates of various tissues is 
achieved through control of the energy density ( J/cm 2 or W • s/cm 2 ) of the laser beam. This parameter is 
readily achieved through use of optical elements such as discrete focusing lenses placed in the handpiece 
or rigid endoscope which control the spot size upon the tissue surface or by affixing a so-called contact 
tip to the end of an optical fiber. These are conical or spherical in shape with diameters ranging from 
300-1200 urn and with very short focal lengths. The tip is placed in contact with the tissue and generates 
a submillimeter- sized focal spot in tissue very near the interface between the tip and tissue. One advantage 
of using the contact tip over a focused beam is that ablation proceeds with small lateral depth of attendant 
coagulation [Judy et al„ 1993a]. This is because the energy of the tightly focused beam causes tissue 
thermolysis essentially at the tip surface and because the resulting tissue products strongly absorb the 
beam resulting in energy deposition and ablation essentially at the tip surface. This contrasts with the 
radiation penetrating deeply into tissue before thermolysis which occurs with a less tightly focused beam 
from a free lens or fiber. An additional advantage with the use of contact tips in the perception of the 
surgeon is that the kinesthetics of moving a contact tip along a resisting tissue surface more closely 
mimics the "touch" encountered in moving a scalpel across the tissue surface. 

Recently a class of optical fiber tips has been developed which laterally directs the beam energy from 
a silica fiber [Judy et al., 1993b] . These tips, either a gold reflective micromirror or an angled refractive 
prism, offer a lateral angle of deviation ranging from 35-105 degrees from the optical fiber axis (unde- 
viated beam direction). The beam reflected from a plane micromirror is unfocused and circular in cross- 
section, whereas the beam from a concave mirror and refractive devices is typically elliptical in shape, 
fused with distal diverging rays. Fibers with these terminations are currently finding rapidly expanding, 
large-scale application in coagulation (with 1.064-um Nd:YAG laser radiation) of excess tissue lining the 
urethra in treatment of benign prostatic hypertrophy [Costello et al., 1992]. The capability for lateral 
beam direction may offer additional utility of these terminated fibers in other clinical specialties. 

Features of Routinely Used Biomedical Lasers 

Currently four lasers are in routine large-scale clinical biomedical use to ablate, dissect, and to coagulate 
soft tissue. Two, the carbon dioxide (C0 2 ) and argon ion (Ar-ion) lasers, are gas-filled lasers. The other 
two employ solid-state lasing media. One is the Neodymium-yttrium-aluminum-garnet (Nd:YAG) laser, 
commonly referred to as a solid-state laser, and the other is the gallium-aluminum arsenide (GaAlAs) 
semiconductor diode laser. Salient features of the operating characteristics and biomedical applications 
of those lasers are listed in Tables 85.2 to 85.5. The operational descriptions are typical of the lasers 
currently available commercially and do not represent the product of any single manufacturer. 

Other Biomedical Lasers 

Some important biomedical lasers have smaller-scale use or currently are being researched for biomedical 
application. The following four lasers have more limited scales of surgical use: 

The Ho:YAG (Holmium:YAG) laser, emitting pulses of 2.1 urn wavelength and up to 4 J in energy, 
used in soft tissue ablation in arthroscopic (joint) surgery (FDA approved). 

The Q-switched Ruby (Cr:Al 2 3 ) laser, emitting pulses of 694-nm wavelength and up to 2 J in energy 
is used in dermatology to disperse black, blue, and green tattoo pigments and melanin in pig- 
mented lesions (not melanoma) for subsequent removal by phagocytosis by macrophages (FDA 
approved). 
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TABLE 85.2 Operating Characteristics of Principal Biomedical Lasers 



Characteristics 



Ar Ion Laser 



CO, Laser 



Cavity medium 
Lasing species 
Excitation 
Electric input 
Wall plug efficiency 



Argon gas, 133 Pa 

Ar+ ion 

Electric discharge, continuous 

208 V AC , 60 A 

=0.06% 



10% C0 2 10% Ne, 80% He; 1330 Pa 

C0 2 molecule 

Electric discharge, continuous, pulsed 

110 V AC> 15 A 

= 10% 



Characteristics 

Cavity medium 
Lasing species 
Excitation 
Electric input 

Wall plug efficiency 



Nd:YAG Laser 

Nd-doped YAG 
Nd3t in YAG lattice 
Flashlamp, continuous, pulsed 
208/240 V AC , 30 A continuous 

110 V AC , 10 A pulsed 
= 1% 



GaAlAs Diode Laser 

n-p junction, GaAlAs diode 
Hole-electron pairs at diode junction 
Electric current, continuous pulsed 
110V AC , 15 A 

=23% 



TABLE 85.3 Output Beam Characteristics of Ar-Ion and C0 2 Biomedical Lasers 



Output Characteristics 

Output power 
Wavelength(s) 
Electromagnetic wave 

propagation mode 
Beam guidance, shaping 



Argon Laser 

2-8 W, continuous 

Multiple lines (454.6-528.7 nm), 4 

TEM„ 



S, 514.5 dominant 



Fused silica optical fiber with contact tip or flat-ended 
for beam emission, lensed handpiece. Slit lamp with 
ocular lens 



C0 2 Laser 

1-100 W, continuous 

10.6 um 

TEM„ 

Flexible articulated arm with 
mirrors; lensed handpiece or 
mirrored microscope platen 



TABLE 85.4 Output Beam Characteristics of Nd:YAG and GaAlAs Diode Biomedical Lasers 



Output 
Characteristics 

Output power 



Wavelength(s) 
Electromagnetic wave 

propagation modes 
Beam guidance and 

shaping 



Nd:YAG Lasers 

1-100 W continuous at 1.064 millimicron 
1-36 W continuous at 532 nm (frequency 
doubled with KTP) 
1.064 um/532 nm 
Mixed modes 

Fused Si0 2 optical fiber with contact tip 
directing mirrored or refracture tip 



GaAlAs Diode Laser 



1-25 W continuous 



810 nm 
Mixed modes 



Fused Si0 2 optical fiber with contact tip or 
laterally directing mirrored or refracture tip 



TABLE 85.5 Clinical Uses of Principal Biomedical Lasers 



Ar-ion laser: Pigmented (vascular) soft-tissue ablation in 
gynecology; general and oral surgery; otolaryngology; vascular 
lesion coagulation in dermatology; retinal coagulation in 
ophthalmology 

Nd:YAG laser: Soft-tissue, particularly pigmented vascular tissue, 
ablation — dissection and bulk tissue removal — in dermatology; 
gastroenterology; gynecology; general, arthroscopic, neuro-, 
plastic, and thoracic surgery; urology; posterior capsulotomy 
(ophthalmology) with pulsed 1.064 millimicron and ocular lens 



C0 2 laser: Soft-tissue ablation — dissection and 
bulk tissue removal in dermatology; gynecology; 
general, oral, plastic, and neurosurgery; 
otolaryngology; podiatry; urology 

GaAlAs diode laser: Pigmented (vascular) soft- 
tissue ablation — dissection and bulk removal in 
gynecology; gastroenterology, general surgery, 
and urology; FDA approval for otolaryngology 
and thoracic surgery pending 
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The flashlamp pumped pulsed dye laser emitting 1- to 2-J pulses at either 577- or 585-nm wavelength 
(near the 537-577 absorption region of blood) is used for treatment of cutaneous vascular lesions 
and melanin pigmented lesions except melanoma. Use of pulsed radiation helps to localize the 
thermal damage to within the lesions to obtain low damage of adjacent tissue. 

The following lasers are being investigated for clinical uses. 

The EnYAG laser, emitting at 2.94 urn near the major water absorption peak (OH stretch), is currently 
being investigated for ablation of tooth enamel and dentin [Li et al., 1992]. 

Dye lasers emitting at 630-690 nm are being investigated for application as light sources for exciting 
dihematoporphyrin ether or benzoporphyrin derivatives in investigation of the efficacy of these 
photosensitives in the treatment of esophageal, bronchial, and bladder carcinomas for the FDA 
approved process. 

Defining Terms 

Biomedical Laser Radiation Ranges 

Infrared (IR) radiation: The portion of the electromagnetic spectrum within the wavelength range 

760 nm-1 mm, with the regions 760 nm-1.400 urn and 1.400-10.00 urn, respectively, called the 

near- and mid-IR regions. 
Ultraviolet (UV) radiation: The portion of the electromagnetic spectrum within the wavelength range 

100-400 nm. 
Visible (VIS) radiation: The portion of the electromagnetic spectrum within the wavelength range 

400-760 nm. 

Laser Medium Nomenclature 

Argon fluoride (ArF): Argon fluoride eximer laser (an eximer is a diatomic molecule which can exist 

only in an excited state). 
Ar ion: Argon ion. 
C0 2 : Carbon dioxide. 
Cr:Al 2 3 : Ruby laser. 

Er:YAG: Erbium yttrium aluminum garnet. 
GaAlAs: Gallium aluminum laser. 
HeNe: Helium neon laser. 
Ho:YAG: Holmium yttrium aluminum garnet. 
Nd:YAG: Neodymium yttrium aluminum garnet. 

Optical Fiber Nomenclature 

Ag halide: Silver halide, halide ion, typically bromine (Br) and chlorine (CI). 

Fused silica: Fused Si0 2 . 
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Further Information 

Current research on the optical, thermal, and photochemical interactions of radiation and their effect 
on biologic tissues, are published routinely in the journals: Laser in Medicine and Surgery, Lasers in the 
Life Sciences, and Photochemistry Photobiology and to a lesser extent in Applied Optics and Optical 
Engineering. 

Clinical evaluations of biomedical laser applications appear in Lasers and Medicine and Surgery and 
in journals devoted to clinical specialties such as Journal of General Surgery, Journal of Urology, Journal 
of Gastroenterological Surgery. 

The annual symposium proceedings of the biomedical section of the Society of Photo-Optical Instru- 
mentation Engineers (SPIE) contain descriptions of new and current research on application of lasers 
and optics in biomedicine. 

The book Lasers (a second edition by Bela A. Lengyel), although published in 1971, remains a valuable 
resource on the fundamental physics of lasers — gas, dye solid-state, and semiconducting diode. A more 
recent book, The Laser Guidebook by Jeffrey Hecht, published in 1992, emphasizes the technical charac- 
teristics of the gas, diode, solid-state, and semiconducting diode lasers. 

The Journal of Applied Physics, Physical Review Letters, and Applied Physics Letters carry descriptions 
of the newest advances and experimental phenomena in lasers and optics. 

The book Safety with Lasers and Other Optical Sources by David Sliney and Myron Wolbarsht, published 
in 1980, remains a very valuable resource on matters of safety in laser use. 

Laser safety standards for the United States are given for all laser uses and types in the American 
National Standard (ANSI) Z136.1-1993, Safe Use of Lasers. 
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Optical measures of physiologic status are attractive because they can provide a simple, noninvasive, yet 
real-time assessment of medical condition. Noninvasive optical monitoring is taken here to mean the 
use of visible or near-infrared light to directly assess the internal physiologic status of a person without 
the need of extracting a blood of tissue sample or using a catheter. Liquid water strongly absorbs 
ultraviolet and infrared radiation, and thus these spectral regions are useful only for analyzing thin surface 
layers or respiratory gases, neither of which will be the subject of this review. Instead, it is the visible and 
near-infrared portions of the electromagnetic spectrum that provide a unique "optical window" into the 
human body, opening new vistas for noninvasive monitoring technologies. 

Various molecules in the human body possess distinctive spectral absorption characteristics in the 
visible or near-infrared spectral regions and therefore make optical monitoring possible. The most 
strongly absorbing molecules at physiologic concentrations are the hemoglobins, myoglobins, cyto- 
chromes, melanins, carotenes, and bilirubin (see Fig. 86.1 for some examples). Perhaps less appreciated 
are the less distinctive and weakly absorbing yet ubiquitous materials possessing spectral characteristics 
in the near-infrared: water, fat, proteins, and sugars. Simple optical methods are now available to quan- 
titatively and noninvasively measure some of these compounds directly in intact tissue. The most suc- 
cessful methods to date have used hemoglobins to assess the oxygen content of blood, cytochromes to 
assess the respiratory status of cells, and possibly near-infrared to assess endogenous concentrations of 
metabolytes, including glucose. 

86.1 Oximetry and Pulse Oximetry 

Failure to provide adequate oxygen to tissues — hypoxia — can in a matter of minutes result in reduced 
work capacity of muscles, depressed mental activity, and ultimately cell death. It is therefore of consid- 
erable interest to reliably and accurately determine the amount of oxygen in blood or tissues. Oximetry 
is the determination of the oxygen content of blood of tissues, normally by optical means. In the clinical 
laboratory the oxygen content of whole blood can be determined by a bench-top cooximeter or blood 
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FIGURE 86.1 Absorption spectra of some endogenous biologic materials (a) hemoglobins, (b) cytochrome aa3, 
(c) myoglobins, and (d) melanin. 

gas analyzer. But the need for timely clinical information and the desire to minimize the inconvenience 
and cost of extracting a blood sample and later analyze it in the lab has led to the search for alternative 
noninvasive optical methods. Since the 1930s, attempts have been made to use multiple wavelengths of 
light to arrive at a complete spectral characterization of a tissue. These approaches, although somewhat 
successful, have remained of limited utility owing to the awkward instrumentation and unreliable results. 
It was not until the invention of pulse oximetry in the 1970s and its commercial development and 
application in the 1980s that noninvasive oximetry became practical. Pulse oximetry is an extremely easy- 
to-use, noninvasive, and accurate measurement of real-time arterial oxygen saturation. Pulse oximetry 
is now used routinely in clinical practice, has become a standard of care in all U.S. operating rooms, and 
is increasingly used wherever critical patients are found. The explosive growth of this new technology 
and its considerable utility led John Severinghaus and Poul Astrup [1986] in an excellent historical review 
to conclude that pulse oximetry was "arguably the most significant technological advance ever made in 
monitoring the well-being and safety of patients during anesthesia, recovery and critical care." 
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FIGURE 86.2 Hemoglobin oxygen dissociation curve showing the sigmoidal relationship between the partial 
pressure of oxygen and the oxygen saturation of blood. The curve is given approximately by %Sa0 2 = 100%/[1 
+ P 50 /pO 2 "], with n = 2.8 and P 50 = 26 mm Hg. 

Background 

The partial pressure of oxygen (p0 2 ) in tissues need only be about 3 mmHg to support basic metabolic 
demands. This tissue level, however, requires capillary p0 2 to be near 40 mmHg, with a corresponding 
arterial p0 2 of about 95 mmHg. Most of the oxygen carried by blood is stored in red blood cells reversibly 
bound to hemoglobin molecules. Oxygen saturation (Sa0 2 ) is defined as the percentage of hemoglobin- 
bound oxygen compared to the total amount of hemoglobin available for reversible oxygen binding. The 
relationship between the oxygen partial pressure in blood and the oxygen saturation of blood is given 
by the hemoglobin oxygen dissociation curve as shown in Fig. 86.2. The higher the p0 2 in blood, the 
higher the Sa0 2 . But due to the highly cooperative binding of four oxygen molecules to each hemoglobin 
molecule, the oxygen binding curve is sigmoidal, and consequently the Sa0 2 value is particularly sensitive 
to dangerously low p0 2 levels. With a normal arterial blood p0 2 above 90 mmHg, the oxygen saturation 
should be at least 95%, and a pulse oximeter can readily verify a safe oxygen level. If oxygen content falls, 
say to a p0 2 below 40 mmHg, metabolic needs may not be met, and the corresponding oxygen saturation 
will drop below 80%. Pulse oximetry therefore provides a direct measure of oxygen sufficiency and will 
alert the clinician to any danger of imminent hypoxia in a patient. 

Although endogenous molecular oxygen is not optically observable, hemoglobin serves as an oxygen- 
sensitive "dye" such that when oxygen reversibly binds to the iron atom in the large heme prosthetic 
group, the electron distribution of the heme is shifted, producing a significant color change. The optical 
absorption of hemoglobin in its oxygenated and deoxygenated states is shown in Fig. 86.1. Fully oxygen- 
ated blood absorbs strongly in the blue and appears bright red; deoxygenated blood absorbs through the 
visible region and is very dark (appearing blue when observed through tissue due to light scattering 
effects). Thus the optical absorption spectra of oxyhemaglobin (0 2 Hb) and "reduced" deoxyhemoglobin 
(RHb) differ substantially, and this difference provides the basis for spectroscopic determinations of the 
proportion of the two hemoglobin states. In addition to these two normal functional hemoglobins, there 
are also dysfunctional hemoglobins — carboxyhemoglobin, methemoglobin, and sulhemoglobin — which 
are spectroscopically distinct but do not bind oxygen reversibly. Oxygen saturation is therefore defined 
in Eq. (86.1) only in terms of the functional saturation with respect to 2 Hb and RHb: 
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Cooximeters are bench-top analyzers that accept whole blood samples and utilize four or more 
wavelengths of monochromatic light, typically between 500 and 650 nm, to spectroscopically determine 
the various individual hemoglobins in the sample. If a blood sample can be provided, this spectroscopic 
method is accurate and reliable. Attempts to make an equivalent quantitative analysis noninvasively 
through intact tissue have been fraught with difficulty. The problem has been to contend with the wide 
variation in scattering and nonspecific absorption properties of very complex heterogeneous tissue. One 
of the more successful approaches, marketed by Hewlett-Packard, used eight optical wavelengths trans- 
mitted through the pinna of the ear. In this approach a "bloodless" measurement is first obtained by 
squeezing as much blood as possible from an area of tissue; the arterial blood is then allowed to flow 
back, and the oxygen saturation is determined by analyzing the change in the spectral absorbance 
characteristics of the tissue. While this method works fairly well, it is cumbersome, operator dependent, 
and does not always work well on poorly perfused or highly pigmented subjects. 

In the early 1970s, Takuo Aoyagi recognized that most of the interfering nonspecific tissue effects could 
be eliminated by utilizing only the change in the signal during an arterial pulse. Although an early 
prototype was built in Japan, it was not until the refinements in implementation and application by Biox 
(now Ohmeda) and Nellcor Incorporated in the 1980s that the technology became widely adopted as a 
safety monitor for critical care use. 

Theory 

Pulse oximetry is based on the fractional change in light transmission during an arterial pulse at two 
different wavelengths. In this method the fractional change in the signal is due only to the arterial blood 
itself, and therefore the complicated nonpulsatile and highly variable optical characteristics of tissue are 
eliminated. In a typical configuration, light at two different wavelengths illuminating one side of a finger 
will be detected on the other side, after having traversed the intervening vascular tissues (Fig. 86.3). The 
transmission of light at each wavelength is a function of the thickness, color, and structure of the skin, 
tissue, bone, blood, and other material through which the light passes. The absorbance of light by a 
sample is defined as the negative logarithm of the ratio of the light intensity in the presence of the sample 
(7) to that without (Z ): A = -\og(I/I ). According to the Beer-Lambert law, the absorbance of a sample 
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FIGURE 86.3 Typical pulse oximeter sensing configuration on a finger. Light at two different wavelengths is emitted 
by the source, diffusely scattered through the finger, and detected on the opposite side by a photodetector. 
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at a given wavelength with a molar absorptivity (e) is directly proportional to both the concentration (c) 
and pathlength (I) of the absorbing material: A = eel. (In actuality, biologic tissue is highly scattering, 
and the Beer- Lambert law is only approximately correct; see the references for further elaboration). Visible 
or near-infrared light passing through about one centimeter of tissue (e.g., a finger) will be attenuated 
by about one or two orders of magnitude for a typical emitter-detector geometry, corresponding to an 
effective optical density (OD) of 1-2 OD (the detected light intensity is decreased by one order of 
magnitude for each OD unit). Although hemoglobin in the blood is the single strongest absorbing 
molecule, most of the total attenuation is due to the scattering of light away from the detector by the 
highly heterogeneous tissue. Since human tissue contains about 7% blood, and since blood contains 
typically about 14 g/dL hemoglobin, the effective hemoglobin concentration in tissue is about 1 g/dL 
(-150 uM). At the wavelengths used for pulse oximetry (650-950 nm), the oxy- and deoxyhemoglobin 
molar absorptivities fall in the range of 100-1000 M _I cm _1 , and consequently hemoglobin accounts for 
less than 0.2 OD of the total observed optical density. Of this amount, perhaps only 10% is pulsatile, 
and consequently pulse signals of only a few percent are ultimately measured, at times even one-tenth 
of this. 

A mathematical model for pulse oximetry begins by considering light at two wavelengths, X l and X z , 
passing through tissue and being detected at a distant location as in Fig. 86.3. At each wavelength the 
total light attenuation is described by four different component absorbances: oxyhemoglobin in the blood 
(concentration c„, molar absorptivity e o , and effective pathlength l ), "reduced" deoxyhemoglobin in the 
blood (concentration c r , molar absorptivity e,, and effective pathlength l T ), specific variable absorbances 
that are not from the arterial blood (concentration c x , molar absorptivity e v , and effective pathlength l x ), 
and all other non-specific sources of optical attenuation, combined as A , which can include light 
scattering, geometric factors, and characteristics of the emitter and detector elements. The total absor- 
bance at the two wavelengths can then be written: 



A =e cl + e cl + e cl + A 
I A =e c/+e cl + e c I +A 

I A, °2 o ° r 2 r r x 2 x x y 2 



The blood volume change due to the arterial pulse results in a modulation of the measured absorbances. 
By taking the time rate of change of the absorbances, the two last terms in each equation are effectively 
zero, since the concentration and effective pathlength of absorbing material outside the arterial blood 
do not change during a pulse [d(c x l x )/dt = 0], and all the nonspecific effects on light attenuation are also 
effectively invariant on the time scale of a cardiac cycle {dA Idt = 0). Since the extinction coefficients are 
constant, and the blood concentrations are constant on the time scale of a pulse, the time-dependent 
changes in the absorbances at the two wavelengths can be assigned entirely to the change in the blood 
pathlength {dljdt and dljdt). With the additional assumption that these two blood pathlength changes 
are equivalent (or more generally, their ratio is a constant), the ratio R of the time rate of change of the 
absorbance at wavelength 1 to that at wavelength 2 reduces to the following: 

R _ d \/ dt _ -d^(ljl )/dt _ (A/.A) _ v> + Vr ^^ 

d\/dt -dlog(iji )/dt (m 2 /i 2 ) V° + V' 

Observing that functional oxygen saturation is given by S = c l{c + c r ), and that (1— S) = C r /(c a + c r ), the 
oxygen saturation can then be written in terms of the ratio R as follows 

e , — e ,_R 

S = -, '^j^ r- (86.4) 

( e r l- e „lH e ,2- e 2J£ 



S) 2000 by CRC Press LLC 



100% 



Beer-Lambert 
Model 




1.5 
Ratio (R) 

FIGURE 86.4 Relationship between the measured ratio of fractional changes in light intensity at two wavelengths, 
R, and the oxygen saturation S. Beer-Lambert model is from Eq. (86.4) with e 0l = 100, e o2 = 300, e rl = 800, and e r2 = 
200. Empirical calibration is based on %S = 100% x (a - hR)l(c- dR) with a = 1000, b = 550, c = 900, and d = 350, 
with a linear extrapolation below 70%. 

Equation (86.4) provides the desired relationship between the experimentally determined ratio R and 
the clinically desired oxygen saturation S. In actual use, commonly available LEDs are used as the light 
sources, typically a red LED near 660 nm and a near-infrared LED selected in the range 890-950 nm. 
Such LEDs are not monochromatic light sources, typically with bandwidths between 20 and 50 nm, and 
therefore standard molar absorptivities for hemoglobin cannot be used directly in Eq. (86.4). Further, 
the simple model presented above is only approximately true; for example, the two wavelengths do not 
necessarily have the exact same pathlength changes, and second-order scattering effects have been ignored. 
Consequently the relationship between S and R is instead determined empirically by fitting the clinical 
data to a generalized function of the form S = (a — bR)l{c — dR). The final empirical calibration will 
ultimately depend on the details of an individual sensor design, but these variations can be determined 
for each sensor and included in unique calibration parameters. A typical empirical calibration for _R 
versus S is shown in Fig. 86.4, together with the curve that standard molar absorptivities would predict. 
In this way the measurement of the ratio of the fractional change in signal intensity of the two LEDs 
is used along with the empirically determined calibration equation to obtain a beat-by-beat measurement 
of the arterial oxygen saturation in a perfused tissue — continuously, noninvasively, and to an accuracy 
of a few percent. 



Application and Future Directions 

Pulse oximetry is now routinely used in nearly all operating rooms and critical care areas in the United 
States and increasingly throughout the world. It has become so pervasive and useful that it is now being 
called the "fifth" vital sign (for an excellent review of practical aspects and clinical applications of the 
technology see Kelleher [1989]). 

The principal advantages of pulse oximetry are that it provides continuous, accurate, and reliable 
monitoring of arterial oxygen saturation on nearly all patients, utilizing a variety of convenient sensors, 
reusable as well as disposable. Single-patient-use adhesive sensors can easily be applied to fingers for 
adults and children and to arms for legs or neonates. Surface reflectance sensors have also been developed 
based on the same principles and offer a wider choice for sensor location, though they tend to be less 
accurate and prone to more types of interference. 
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Limitations of pulse oximetry include sensitivity to high levels of optical or electric interference, errors 
due to high concentrations of dysfunctional hemoglobins (methemoglobin or carboxyhemoglobin) or 
interference from physiologic dyes (such as methylene blue). Other important factors, such as total 
hemoglobin content, fetal hemoglobin, or sickle cell trait, have little or no effect on the measurement 
except under extreme conditions. Performance can also be compromised by poor signal quality, as may 
occur for poorly perfused tissues with weak pulse amplitudes or by motion artifact. 

Hardware and software advances continue to provide more sensitive signal detection and filtering 
capabilities, allowing pulse oximeters to work better on more ambulatory patients. Already some pulse 
oximeters incorporate ECG synchronization for improved signal processing. A pulse oximeter for use in 
labor and delivery is currently under active development by several research groups and companies. A 
likely implementation may include use of a reflectance surface sensor for the fetal head to monitor the 
adequacy of fetal oxygenation. This application is still in active development, and clinical utility remains 
to be demonstrated. 

86.2 Nonpulsatile Spectroscopy 
Background 

Nonpulsatile optical spectroscopy has been used for more than half a century for noninvasive medical 
assessment, such as in the use of multiwavelength tissue analysis for oximetry and skin reflectance 
measurement for bilirubin assessment in jaundiced neonates. These early applications have found some 
limited use, but with modest impact. Recent investigations into new nonpulsatile spectroscopy methods 
for assessment of deep-tissue oxygenation (e.g., cerebral oxygen monitoring), for evaluation of respiratory 
status at the cellular level, and for the detection of other critical analytes, such as glucose, may yet prove 
more fruitful. The former applications have led to spectroscopic studies of cytochromes in tissues, and 
the latter has led to considerable work into new approaches in near-infrared analysis of intact tissues. 

Cytochrome Spectroscopy 

Cytochromes are electron-transporting, heme-containing proteins found in the inner membranes of 
mitochondria and are required in the process of oxidative phosphorylation to convert metabolytes and 
oxygen into C0 2 and high-energy phosphates. In this metabolic process the cytochromes are reversibly 
oxidized and reduced, and consequently the oxidation-reduction states of cytochromes c and flfl 3 in 
particular are direct measures of the respiratory condition of the cell. Changes in the absorption spectra 
of these molecules, particularly near 600 nm and 830 nm for cytochrome aa 3 , accompany this shift. By 
monitoring these spectral changes, the cytochrome oxidation state in the tissues can be determined (see, 
for example, Jobsis [1977] and Jobsis et al. [1977]). As with all nonpulsatile approaches, the difficulty is 
to remove the dependence of the measurements on the various nonspecific absorbing materials and 
highly variable scattering effects of the tissue. To date, instruments designed to measure cytochrome 
spectral changes can successfully track relative changes in brain oxygenation, but absolute quantitation 
has not yet been demonstrated. 

Near-Infrared Spectroscopy and Glucose Monitoring 

Near- infrared (NIR), the spectral region between 780 nm and 3000 nm, is characterized by broad and 
overlapping spectral peaks produced by the overtones and combinations of infrared vibrational modes. 
Figure 86.5 shows typical NIR absorption spectra of fat, water, and starch. Exploitation of this spectral 
region for in vivo analysis has been hindered by the same complexities of nonpulsatile tissue spectroscopy 
described above and is further confounded by the very broad and indistinct spectral features characteristic 
of the NIR. Despite these difficulties, NIR spectroscopy has garnered considerable attention, since it may 
enable the analysis of common analytes. 



S) 2000 by CRC Press LLC 















i 

Water Fat , ' 
\ f\'< 1 1 


CD 
O 

c 

CD 
-Q 
i_ 
O 
0) 
XI 

< 








Starch 




\ r hi 1 4 / 

\, ^: / 
V ' / 



> 

CD 
<U 

a. 




fl 
J 


r 
1 
1 


\ 

N 

, ' '» / 


./ 


A ; / 






-* 


' A 
v ' J 


** -. 





1000 



1500 2000 
Wavelength (nm) 



2500 



FIGURE 86.5 Typical near-infrared absorption spectra of several biologic materials. 

Karl Norris and coworkers pioneered the practical application of NIR spectroscopy, using it to evaluate 
water, fat, and sugar content of agricultural products (see Osborne et al. [1993] and Burns and Cuirczak 
[1992]). The further development of sophisticated multivariate analysis techniques, together with new 
scattering models (e.g., Kubelka-Munk theory) and high-performance instrumentation, further extended 
the application of NIR methods. Over the past decade, many research groups and companies have touted 
the use of NIR techniques for medical monitoring, such as for determining the relative fat, protein, and 
water content of tissue, and more recently for noninvasive glucose measurement. The body composition 
analyses are useful but crude and are mainly limited to applications in nutrition and sports medicine. 
Noninvasive glucose monitoring, however, is of considerable interest. 

More than 2 million diabetics in the United States lance their fingers three to six times a day to obtain 
a drop of blood for chemical glucose determination. The ability of these individuals to control their 
glucose levels, and the quality of their life generally, would dramatically improve if a simple, noninvasive 
method for determining blood glucose levels could be developed. Among the noninvasive optical methods 
proposed for this purpose are optical rotation, NIR analysis, and raman spectroscopy. The first two have 
received the most attention. Optical rotation methods aim to exploit the small optical rotation of polarized 
light by glucose. To measure physiologic glucose levels in a 1-cm thick sample to an accuracy of 25 mg/dL 
would require instrumentation that can reliably detect an optical rotation of at least 1 millidegree. Finding 
an appropriate in vivo optical path for such measurements has proved most difficult, with most 
approaches looking to use either the aqueous humor or the anterior chamber of the eye [Cote et al., 
1992; Rabinovitch et al., 1982]. Although several groups have developed laboratory analyzers that can 
measure such a small effect, so far in vivo measurement has not been demonstrated, due both to unwanted 
scattering and optical activity of biomaterials in the optical path and to the inherent difficulty in devel- 
oping a practical instrument with the required sensitivity. 

NIR methods for noninvasive glucose determination are particularly attractive, although the task is 
formidable. Glucose has spectral characteristics near 1500 nm and in the 2000-2500 nm band where 
many other compounds also absorb, and the magnitude of the glucose absorbance in biologic samples 
is typically two orders of magnitude lower than those of water, fat, or protein. The normal detection 
limit for NIR spectroscopy is on the order of one part in 10 3 , whereas a change of 25 mg/dL in glucose 
concentration corresponds to an absorbance change of 10~ 4 to 10~ 5 . In fact, the temperature dependence 
of the NIR absorption of water alone is at least an order of magnitude greater than the signal from glucose 
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in solution. Indeed, some have suggested that the apparent glucose signature in complex NIR spectra 
may actually be the secondary effect of glucose on the water. 

Sophisticated chemometric (particularly multivariate analysis) methods have been employed to try to 
extract the glucose signal out of the noise (for methods reviews see Martens and Naes [1989] and Haaland 
[1992]). Several groups have reported using multivariate techniques to quantitate glucose in whole blood 
samples, with encouraging results [Haaland et al., 1992]. And despite all theoretical disputations to the 
contrary, some groups claim the successful application of these multivariate analysis methods to nonin- 
vasive in vivo glucose determination in patients [Robinson et al., 1992]. Yet even with the many groups 
working in this area, much of the work remains unpublished, and few if any of the reports have been 
independently validated. 

Time-Resolved Spectroscopy 

The fundamental problem in making quantitative optical measurements through intact tissue is dealing 
with the complex scattering phenomena. This scattering makes it difficult to determine the effective 
pathlength for the light, and therefore attempts to use the Beer-Lambert law, or even to determine a 
consistent empirical calibration, continue to be thwarted. Application of new techniques in time-resolved 
spectroscopy may be able to tackle this problem. Thinking of light as a packet of photons, if a single 
packet from a light source is sent through tissue, then a distant receiver will detected a photon distribution 
over time — the photons least scattered arriving first and the photons most scattered arriving later. In 
principle, the first photons arriving at the detector passed directly through the tissue. For these first 
photons the distance between the emitter and the detector is fixed and known, and the Beer-Lambert 
law should apply, permitting determination of an absolute concentration for an absorbing component. 
The difficulty in this is, first, that the measurement time scale must be on the order of the photon transit 
time (subnanosecond), and second, that the number of photons getting through without scattering will 
be extremely small, and therefore the detector must be exquisitely sensitive. Although these considerable 
technical problems have been overcome in the laboratory, their implementation in a practical instrument 
applied to a real subject remains to be demonstrated. This same approach is also being investigated for 
noninvasive optical imaging, since the unscattered photons should produce sharp images (see Chance 
etal., [1988], Chance [1991], and Yoo and Alfano [1989]). 

86.3 Conclusions 

The remarkable success of pulse oximetry has established noninvasive optical monitoring of vital phys- 
iologic functions as a modality of considerable value. Hardware and algorithm advances in pulse oximetry 
are beginning to broaden its use outside the traditional operating room and critical care areas. Other 
promising applications of noninvasive optical monitoring are emerging, such as for measuring deep tissue 
oxygen levels, determining cellular metabolic status, or for quantitative determination of other important 
physiologic parameters such as blood glucose. Although these latter applications are not yet practical, 
they may ultimately impact noninvasive clinical monitoring just as dramatically as pulse oximetry. 

Defining Terms 

Beer-Lambert law: Principle stating that the optical absorbance of a substance is proportional to both 

the concentration of the substance and the pathlength of the sample. 
Cytochromes: Heme-containing proteins found in the membranes of mitochondria and required for 

oxidative phosphorylation, with characteristic optical absorbance spectra. 
Dysfunctional hemoglobins: Those hemoglobin species that cannot reversibly bind oxygen (carboxy- 

hemoglobin, methemoglobin, and sulfhemoglobin). 
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Functional saturation: The ratio of oxygenated hemoglobin to total nondysfunctional hemoglobins 

(oxyhemoglobin plus deoxyhemoglobin). 
Hypoxia: Inadequate oxygen supply to tissues necessary to maintain metabolic activity. 
Multivariate analysis: Empirical models developed to relate multiple spectral intensities from many 

calibration samples to known analyte concentrations, resulting in an optimal set of calibration 

parameters. 
Oximetry: The determination of blood or tissue oxygen content, generally by optical means. 
Pulse oximetry: The determination of functional oxygen saturation of pulsatile arterial blood by 

ratiometric measurement of tissue optical absorbance changes. 
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Further Information 

Two collections of papers on pulse oximetry include a book edited by J. P. Payne and J. W. Severinghaus, 
Pulse Oximetry (New York, Springer- Verlag, 1986), and a journal collection — International Anesthesiol- 
ogy Clinics [25(4), 1987] . For technical reviews of pulse oximetry, see J. A. Pologe's 1987 "Pulse Oximetry" 
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K. Kelleher [1989] and John Severinghaus and Joseph F. Kelleher [1992]. John Severinghaus andYoshiyuki 
Honda have written several excellent histories of pulse oximetry [1987a, 1987b]. 

For an overview of applied near-infrared spectroscopy, see Donald A. Burns and Emil W. Ciurczak 
[1992] and B. G. Osborne, T. Fearn, and P. H. Hindle [1993]. For a good overview of multivariate 
methods, see Harald Martens and Tormod Naes [19891. 



© 2000 by CRC Press LLC 



Bowman, B. R., Schuck, E. "Medical Instruments and Devices Used in the Home." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



87 





87.1 




87.2 


Bruce R. Bowman 




EdenTec Corporation 




Edward Schuck 


87.3 


EdenTec Corporation 


87.4 



Medical Instruments 

and Devices Used 

in the Home 



Scope of the Market for Home Medical Devices 

Unique Challenges to the Design and Implementation 

of High-Tech Homecare Devices 

The Device Must Provide a Positive Clinical Outcome • The 

Device Must be Safe to Use • The Device Must be Designed 

So That It Will be Used 

Infant Monitor Example 

Conclusions 



87.1 Scope of the Market for Home Medical Devices 

The market for medical devices used in the home and alternative sites has increased dramatically in the 
last 10 years and has reached an overall estimated size of more than $1.6 billion [FIND/SVP, 1992]. In 
the past, hospitals have been thought of as the only places to treat sick patients. But with the major 
emphasis on reducing healthcare costs, increasing numbers of sicker patients move from hospitals to 
their homes. Treating sicker patients outside the hospital places additional challenges on medical device 
design and patient use. Equipment designed for hospital use can usually rely on trained clinical personnel 
to support the devices. Outside the hospital, the patient and/or family members must be able to use the 
equipment, requiring these devices to have a different set of design and safety features. This chapter will 
identify some of the major segments using medical devices in the home and discuss important design 
considerations associated with home use. 

Table 87.1 outlines market segments where devices and products are used to treat patients outside the 
hospital [FIND/SVP, 1992]. The durable medical equipment market is the most established market 
providing aids for patients to improve access and mobility. These devices are usually not life supporting 
or sustaining, but in many cases they can make the difference in allowing a patient to be able to function 
outside a hospital or nursing or skilled facility. Other market segments listed employ generally more 
sophisticated solutions to clinical problems. These will be discussed by category of use. 

The incontinence and ostomy area of products is one of the largest market segments and is growing 
in direct relationship to our aging society. Whereas sanitary pads and colostomy bags are not very "high- 
tech," well-designed aids can have a tremendous impact on the comfort and independence of these 
patients. Other solutions to incontinence are technically more sophisticated, such as use of electric 
stimulation of the sphincter muscles through an implanted device or a miniature stimulator inserted as 
an anal or vaginal plug to maintain continence [Wall et al., 1993]. 
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TABLE 87.1 Major Market Segments Outside Hospitals 





Estimated 




Equipment Size 


Market Segment 


1991 


Durable medical equipment 


$373 M* 


Incontinence and ostomy products 


$600 M* 


Respiratory equipment 


$180 M* 


Drug infusion, drug measurement 


$300 M 


Pain control and functional stimulation 


$140 M 



Device Examples 



Specialty beds, wheelchairs, toilet aids, ambulatory aids 
Sanitary pads, electrical stimulators, colostomy bags 
Oxygen therapy, portable ventilators, nasal CPAP, 

monitors, apnea monitors 
Infusion pumps, access ports, patient-controlled analgesia 

(PCA), glucose measurement, implantable pumps 
Transcutaneous electrical nerve stimulation (TENS), 

functional electrical nerve stimulation (FES) 



*Source: FIND/SVP [1992]. 

Many forms of equipment are included in the Respiratory segment. These devices include those that 
maintain life support as well as those that monitor patients' respiratory function. These patients, with 
proper medical support, can function outside the hospital at a significant reduction in cost and increased 
patient comfort [Pierson, 1994]. One area of this segment, infant apnea monitors, provides parents or 
caregivers the cardio/respiratory status of an at-risk infant so that intervention (CPR etc.) can be initiated 
if the baby has a life-threatening event. The infant monitor shown in Fig. 87.1 is an example of a patient 
monitor designed for home use and will be discussed in more detail later in this chapter. Pulse oximetry 
monitors are also going home with patients. They are used to measure noninvasively the oxygen level of 
patients receiving supplemental oxygen or ventilator-dependent patients to determine if they are being 
properly ventilated. 

Portable infusion pumps are an integral part of providing antibiotics, pain management, chemother- 
apy, and parenteral and enteral nutrition. The pump shown in Fig. 87.2 is an example of technology that 
allows the patient to move about freely while receiving sometimes lengthy drug therapy. Implantable 
drug pumps are also available for special long-term therapy needs. 




FIGURE 87.1 Infant apnea monitor used in a typical home setting (photo courtesy of EdenTec Corporation). 
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FIGURE 87.2 Portable drug pump used throughout the day (photo courtesy of Pharmacia Deltec Inc.). 

Pain control using electric stimulation in place of drug therapy continues to be an increasing market. 
The delivery of small electric impulses to block pain is continuing to gain medical acceptance for treatment 
outside the hospital setting. A different form of electric stimulation called functional electric stimulation 
(FES) applies short pulses of electric current to the nerves that control weak or paralyzed muscles. This 
topic is covered as a separate chapter in this book. 

Growth of the homecare business has created problems in overall healthcare costs since a corresponding 
decrease in hospital utilization has not yet occurred. In the future, however, increased homecare will 
necessarily result in reassessment and downsizing in the corresponding hospital segment. There will be 
clear areas of growth and areas of consolidation in the new era of healthcare reform. It would appear, 
however, that homecare has a bright future of continued growth. 

87.2 Unique Challenges to the Design and Implementation 
of High-Tech Homecare Devices 

What are some of the unique requirements of devices that could allow more sophisticated equipment to 
go home with ordinary people of varied educational levels without compromising their care? Even though 
each type of clinical problem has different requirements for the equipment that must go home with the 
patient, certain common qualities must be inherent in most devices used in the home. Three areas to 
consider when equipment is used outside of the hospital are that the device (1) must provide a positive 
clinical outcome, (2) must be safe and easy to use, and (3) must be user-friendly enough so that it will 
be used. 

The Device Must Provide a Positive Clinical Outcome 

Devices cannot be developed any longer just because new technology becomes available. They must solve 
the problem for which they were intended and make a significant clinical difference in the outcome or 
management of the patient while saving money. These realities are being driven by those who reimburse 
for devices, as well as by the FDA as part of the submission for approval to market a new device. 
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The Device Must Be Safe to Use 

Homecare devices may need to be even more reliable and even safer than hospital devices. We often think 
of hospitals as having the best quality and most expensive devices that money can buy. In addition to 
having the best equipment to monitor patients, hospitals have nurses and aids that keep an eye on patients 
so that equipment problems may be quickly discovered by the staff. A failure in the home may go 
unnoticed until it is too late. Thus systems for home use really need extra reliability with automatic 
backup systems and/or early warning signals. 

Safety issues can take on a different significance depending on the intended use of the device. Certain 
safety issues are important regardless of whether the device is a critical device such as an implanted 
cardiac pacemaker or a noncritical device such as a bed-wetting alarm. No device should be able to cause 
harm to the patient regardless of how well or poorly it may be performing its intended clinical duties. 
Devices must be safe when exposed to all the typical environmental conditions to which the device could 
be exposed while being operated by the entire range of possible users of varied education and while 
exposed to siblings and other untrained friends or relatives. For instance, a bed-wetting alarm should 
not cause skin burns under the sensor if a glass of water spills on the control box. This type of safety 
issue must be addressed even when it significantly affects the final cost to the consumer. 

Other safety issues are not obviously differentiated as to being actual safety issues or simply nuisances 
or inconveniences to the user. It is very important for the designer to properly define these issues; although 
some safety features can be included with little or no extra cost, other safety features may be very costly 
to implement. It may be a nuisance for the patient using a TENS pain control stimulator to have the 
device inadvertently turned off when its on/off switch is bumped while watching TV. In this case, the 
patient only experiences a momentary cessation of pain control until the unit is turned back on. But it 
could mean injuries or death to the same patient driving an automobile who becomes startled when his 
TENS unit inadvertently turns on and he causes an accident. 

Reliability issues can also be mere inconveniences or major safety issues. Medical devices should be 
free of design and materials defects so that they can perform their intended functions reliably. Once 
again, reliability does not necessarily need to be expensive and often can be obtained with good design. 
Critical devices, i.e., devices that could cause death or serious injury if they stopped operating properly, 
may need to have redundant systems for backup, which likely will increase cost. 

The Device Must be Designed So That It Will Be Used 

A great deal of money is being spent in healthcare on devices for patients that end up not being used. 
There are numerous reasons for this happening including that the wrong device was prescribed for the 
patient's problem in the first place; the device works, but it has too many false alarms; the device often 
fails to operate properly; it is cumbersome to use or difficult to operate or too uncomfortable to wear. 

Ease of Use 

User- friendliness is one of the most important features in encouraging a device to be used. Technological 
sophistication may be just as necessary in areas that allow ease of use as in attaining accuracy and reliability 
in the device. The key is that the technologic sophistication be transparent to the user so that the device 
does not intimidate the user. Transparent features such as automatic calibration or automatic sensitivity 
adjustment may help allow successful use of a device that would otherwise be too complicated. 

Notions of what makes a device easy to use, however, need to be thoroughly tested with the patient 
population intended for the device. Caution needs to be taken in defining what "simple" means to different 
people. A VCR may be simple to the designer because all features can be programmed with one button, 
but it may not be simple to users if they have to remember that it takes two long pushes and one short 
to get into the clock-setting program. 

Convenience for the user is also extremely important in encouraging use of a device. Applications that 
require devices to be portable must certainly be light enough to be carried. Size is almost always important 
for anything that must fit within the average household. Either a device must be able to be left in place 
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in the home or it must be easy to set up, clean, and put away. Equipment design can make the difference 
between the patient appropriately using the equipment or deciding that it is just too much hassle to bother. 

Reliability 

Users must also have confidence in the reliability of the device being used and must have confidence that 
if it is not working properly, the device will tell them that something is wrong. Frequent breakdowns or 
false alarms will result in frustration and ultimately in reduced compliance. Eventually patients will stop 
using the device altogether. Most often, reliability can be designed into a product with little or no extra 
cost in manufacturing, and everything that can be done at no cost to enhance reliability should be done. 
It is very important, however, to understand what level of additional reliability involving extra cost is 
necessary for product acceptance. Reliability can always be added by duplicated backup systems, but the 
market or application may not warrant such an approach. Critical devices which are implanted, such as 
cardiac pacemakers, have much greater reliability requirements, since they involve not only patient 
frustration but also safety. 

Cost Reimbursement 

Devices must be paid for before the patient can realize the opportunity to use new, effective equipment. 
Devices are usually paid for by one of two means. First, they are covered on an American Medical 
Association Current Procedural Terminology Code (CPT-code) which covers the medical, surgical, and 
diagnostic services provided by physicians. The CPT-codes are usually priced out by Medicare to establish 
a baseline reimbursement level. Private carriers usually establish a similar or different level of reimburse- 
ment based on regional or other considerations. Gaining new CPT-codes for new devices can take a great 
deal of time and effort. The second method is to cover the procedure and device under a capitated fee 
where the hospital is reimbursed a lump sum for a procedure including the device, hospital, homecare, 
and physician fees. 

Every effort should be made to design devices to be low cost. Device cost is being scrutinized more 
and more by those who reimburse. It is easy to state, however, that a device needs to be inexpensive. 
Unfortunately the reality is that healthcare reforms and new regulations by the FDA are making medical 
devices more costly to develop, to obtain regulatory approvals for [FDA, 1993], and to manufacture. 

Professional Medical Service Support 

The more technically sophisticated a device is, the more crucial that homecare support and education 
be a part of a program. In fact, in many cases, such support and education are as important as the device 
itself. 

Medical service can be offered by numerous homecare service companies. Typically these companies 
purchase the equipment instead of the patient, and a monthly fee is charged for use of the equipment 
along with all the necessary service. The homecare company then must obtain reimbursement from 
third-party payers. Some of the services offered by the homecare company include training on how to 
use the equipment, CPR training, transporting the equipment to the home, servicing/repairing equip- 
ment, monthly visits, and providing on-call service 24 hours a day. The homecare provider must also be 
able to provide feedback to the treating physician on progress of the treatment. This feedback may include 
how well the equipment is working, the patient's medical status, and compliance of the patient. 

87.3 Infant Monitor Example 

Many infants are being monitored in the home using apnea monitors because they have been identified 
with breathing problems [Kelly, 1992]. These include newborn premature babies who have apnea of 
prematurity [Henderson-Smart, 1992; NIH, 1987], siblings of babies who have died of sudden infant 
death syndrome (SIDS) [Hunt, 1992; NIH, 1987], or infants who have had an apparent life-threatening 
episode (ALTE) related to lack of adequate respiration [Kahnetal., 1992; NIH, 1987]. Rather than keeping 
infants in the hospital for a problem that they may soon outgrow (1-6 months), doctors often discharge 
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them from the hospital with an infant apnea monitor that measures the duration of breathing pauses 
and heart rate and sounds an alarm if either parameter crosses limits prescribed by the doctor. 

Infant apnea monitors are among the most sophisticated devices used routinely in the home. These 
devices utilize microprocessor control, sophisticated breath- direction and artifact rejection firmware 
algorithms, and internal memory that keeps track of use of the device as well as recording occurrence 
of events and the physiologic waveforms associated with the events. The memory contents can be 
downloaded directly to computer or sent via modem remotely where a complete 45-day report can be 
provided to the referring physician (see Fig. 87.3). 

Most apnea monitors measure breathing effort through impedance pneumography. A small 
(100-200 uA) high-frequency (25-100 kHz) constant-current train of pulses is applied across the chest 
between a pair of electrodes. The voltage needed to drive the current is measured, and thereby the effective 
impedance between the electrodes can be calculated. Impedance across the chest increases as the chest 
expands and decreases as the chest contracts with each breath. The impedance change with each breath 
can be as low as 0.2 ohms on top of an electrode base impedance of 2000 ohms, creating some interesting 
signal-to-noise challenges. Furthermore, motion artifact and blood volume changes in the heart and 
chest can cause impedance changes of 0.6 ohms or more that can look just like breathing. Through the 
same pair of electrodes, heart rate is monitored by picking up the electrocardiogram (ECG) [AAMI, 1988] . 

Because the impedance technique basically measures the motion of the chest, this technique can only 
be used to monitor central apnea or lack of breathing effort. Another less common apnea in infants 
called obstructive apnea results when an obstruction of the airway blocks air from flowing in spite of 
breathing effort. Obstructive apnea can not be monitored using impedance pneumography [Kelly, 1992] . 

There is a very broad socioeconomic and educational spectrum of parents or caregivers who may be 
monitoring their infants with an apnea monitor. This creates an incredible challenge for the design of 
the device so that it is easy enough to be used by a variety of caregivers. It also puts special requirements 
on the homecare service company that must be able to respond to these patients within a matter of 
minutes, 24 hours a day. 

The user-friendly monitor shown in Fig. 87.1 uses a two-button operation, the on/off switch, and a 
reset switch. The visual alarm indicators are invisible behind a back-lit panel except when an actual alarm 
occurs. A word describing the alarm then appears. By not showing all nine possible alarm conditions 
unless an alarm occurs, parent confusion and anxiety is minimized. Numerous safety features are built 
into the unit, some of which are noticeable but many of which are internal to the operation of the 
monitor. One useful safety feature is the self-check. When the device is turned on, each alarm LED lights 
in sequence, and the unit beeps once indicating that the self-check was completed successfully. This gives 
users the opportunity to confirm that all the alarm visual indicators and the audible indicator are working 
and provides added confidence for users leaving their baby on the monitor. A dual-level battery alarm 
gives an early warning that the battery will soon need charging. The weak battery alarm allows users to 
reset the monitor and continue monitoring their babies for several more hours before depleting the 
battery to the charge battery level where the monitor must be attached to the ac battery charger/adapter. 
This allows parents the freedom to leave their homes for a few hours knowing that their child can continue 
to be monitored. 

A multistage alarm reduces the risk of parents sleeping through an alarm. Most parents are sleep- 
deprived with a new baby. Consequently, it can be easy for parents in a nearby room to sleep through a 
monitor alarm even when the monitor sounds at 85 dB. A three-stage alarm helps to reduce this risk. 
After 10 seconds of sounding at 1 beep per second, the alarm switches to 3 beeps per second for the next 
10 seconds. Finally, if an alarm has not resolved itself after 20 seconds, the alarm switches to 6 beeps per 
second. Each stage of alarm sounds more intense than the previous one and offers the chance of jolting 
parents out of even the deepest sleep. 

The physician always prescribes what alarm settings should be used by the homecare service company 
when setting up the monitor. As a newborn matures, these settings may need to be adjusted. Sometimes 
the parents can be relied upon for making these setting changes. To allow both accessibility to these 
switches as well as to keep them safe from unauthorized tampering from a helping brother or sister, a 
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FIGURE 87.3 Infant apnea monitor with memory allows data to be sent by modem to generate physician report 
(drawing courtesy of EdenTec Corporation). 

special tamper-resistant-adjustment procedure is utilized. Two simultaneous actions are required in order 
to adjust the alarm limit settings. The reset button must be continually pressed on the front of the unit 
while changing settings on the back of the unit. Heart rate levels are set in beats per minute, and apnea 
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duration is set in single-second increments. Rather than using easy-to-set push-button switches, "pen- 
set" switches are used which require a pen or other sharp implement to make the change. If the proper 
switch adjustment procedure is not followed, the monitor alarms continuously and displays a switch 
alarm until the settings are returned to their original settings. A similar technique is used for turning the 
monitor off. The reset button must first be pressed and then the on/off switch turned to the off position. 
Violation of this procedure will result in a switch alarm. 

Other safety features are internal to the monitor and are transparent to the user. The monitor's alarm 
is designed to be normally on from the moment the device is turned on. Active circuitry controlled by 
the microprocessor turns the alarm off when there are no active alarm conditions. If anything hangs up 
the processor or if any of a number of components fail, the alarm will not turn off and will remain on 
in a fail-safe mode. This "alarm on unless turned off" technique is also used in a remote alarm unit for 
parents with their baby in a distant room. If a wire breakage occurs between the monitor and the remote 
alarm unit, or a connector pulls loose, or a component fails, the remote alarm no longer is turned off 
by the monitor and it alarms in a fail-safe condition. 

Switches, connectors, and wires are prone to fail. One way to circumvent this potential safety issue is 
use of switches with a separate line for each possible setting. The monitor continuously polls every switch 
line of each switch element to check that "exactly" one switch position is making contact. This guards 
against misreading bad switch elements, a switch inadvertently being set between two positions, or a bad 
connector or cable. Violation of the "exactly one contact condition" results in a switch alarm. 

It is difficult to manage an apnea monitoring program in rural areas where the monitoring family 
may be a hundred miles or more away from the homecare service company. There are numerous ways 
to become frustrated with the equipment and stop using the monitor. Therefore, simplicity of use and 
reliability are important. Storing occurrence of alarms and documenting compliance in internal memory 
in the monitor help the homecare service company and the remote family cope with the situation. The 
monitor shown in Fig. 87.1 stores in digital memory the time, date, and duration of (1) each use of the 
monitor; (2) occurrence of all equipment alarms; and (3) all physiologic alarms including respiratory 
waveforms, heart rate, and ECG for up to a 45-day period. These data in the form of a report (see 
Fig. 87.3) can be downloaded to a laptop PC or sent via modem to the homecare service company or 
directly to the physician. 

87.4 Conclusions 

Devices that can provide positive patient outcomes with reduced overall cost to the healthcare system 
while being safe, reliable, and user-friendly will succeed based on pending healthcare changes. Future 
technology in the areas of sensors, communications, and memory capabilities should continue to increase 
the potential effectiveness of homecare management programs by using increasingly sophisticated devices. 
The challenge for the medical device designer is to provide cost-effective, reliable, and easy-to-use 
solutions that can be readily adopted by the multidisciplinary aspects of homecare medicine while meeting 
FDA requirements. 

Defining Terms 

Apnea: Cessation of breathing. Apnea can be classified as central, obstructive, or mixed, which is a 

combination. 
Apnea of prematurity: Apnea in which the incidence and severity increases with decreasing gestational 

age attributable to immaturity of the respiratory control system. The incidence has increased due 

to improved survival rates for very-low-birth-weight premature infants. 
Apparent life-threatening episode (ALTE): An episode characterized by a combination of apnea, color 

change, muscle tone change, choking, or gagging. To the observer it may appear the infant has died. 
Capitated fee: A fixed payment for total program services versus the more traditional fee for service 

in which each individual service is charged. 
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Cardiac pacemaker: A device that electrically stimulates the heart at a certain rate used in absence of 

normal function of the heart's sino-atrial node. 
Central apnea: Apnea secondary to lack of respiratory or diaphragmatic effort. 
Chemotherapy: Treatment of disease by chemical agents. Term popularly used when fighting cancer 

chemically. 
Colostomy: The creation of a surgical hole as an alternative opening of the colon. 
CPR (cardiopulmonary resuscitation): Artificially replacing heart and respiration function through 

rhythmic pressure on the chest. 
CPT-code (current procedural terminology code): A code used to describe specific procedures/tests 

developed by the AMA. 
Electrocardiogram (ECG): The electric potential recorded across the chest due to depolarization of 

the heart muscle with each heartbeat. 
Enteral nutrition: Chemical nutrition injected intestinally. 
Food and Drug Administration (FDA): Federal agency that oversees and regulates foods, drugs, and 

medical devices. 
Functional electrical stimulation (FES): Electric stimulation of peripheral nerves or muscles to gain 

functional, purposeful control over partially or fully paralyzed muscles. 
Incontinence: Loss of voluntary control of the bowel or bladder. 
Obstructive apnea: Apnea in which the effort to breath continues but airflow ceases due to obstruction 

or collapse of the airway. 
Ostomy: Surgical procedure that alters the bladder or bowel to eliminate through an artificial passage. 
Parenteral nutrition: Chemical nutrition injected subcutaneously, intramuscular, intrasternally, or 

intravenously. 
Sphincter: A band of muscle fibers that constricts or closes an orifice. 
Sudden infant death syndrome (SIDS): The sudden death of an infant which is unexplained by history 

or postmortem exam. 
Transcutaneous electrical nerve stimulation (TENS): Electrical stimulation of sensory nerve fibers 

resulting in control of pain. 
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88.1 Overview 

A Revolution — Graphical Programming and Virtual Instrumentation 

Over the last decade, the graphical programming revolution has empowered engineers to develop cus- 
tomized systems, the same way the spreadsheet has empowered business managers to analyze financial 
data. This software technology has resulted in another type of revolution — the virtual instrumentation 
revolution, which is rapidly changing the instrumentation industry by driving down costs without 
sacrificing quality. 

Virtual Instrumentation can be defined as: 

A layer of software and/or hardware added to a general-purpose computer in such a fashion that users 
can interact with the computer as though it were their own custom-designed traditional electronic 
instrument. 

Today, computers can serve as the engine for instrumentation. Virtual instruments utilize the open 
architecture of industry-standard computers to provide the processing, memory, and display capabilities; 
while the off-the-shelf, inexpensive interface boards plugged into an open bus, standardized communi- 
cations bus provides the vehicle for the instrument's capabilities. As a result, the open architecture of 
PCs and workstations allow the functionality of virtual instruments to be user defined. In addition, the 
processing power of virtual instruments is much greater than stand-alone instruments. This advantage 
will continue to accelerate due to the rapid technology evolution of PCs and workstations that results 
from the huge investments made in this industry. 

The major benefits of virtual instrumentation include increased performance and reduced costs. In 
addition, because the user controls the technology through software, the flexibility of virtual instrumentation 
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is unmatched by traditional instrumentation. The modular, hierarchical programming environment of 
virtual instrumentation is inherently reusable and reconfigurable. 

88.2 Virtual Instrumentation and Biomedical Engineering 

Virtual Instrumentation applications have encompassed nearly every industry including the telecommu- 
nications, automotive, semiconductor, and biomedical industries. In the fields of healthcare and biomed- 
ical engineering, virtual instrumentation has empowered developers and end-users to conceive of, 
develop, and implement a wide variety of research-based biomedical applications and executive infor- 
mation tools. These applications fall into several categories including: clinical research, equipment testing 
and quality assurance, data management, and performance improvement. 

In a collaborative approach, physicians, researchers, and biomedical and software engineers at Hartford 
Hospital (Hartford, CT) and Premise Development Corporation (Avon, CT) have developed various data 
acquisition and analysis systems that successfully integrate virtual instrumentation principles in a wide 
variety of environments. These include: 

• "The EndoTester™" a patented quality assurance system for fiberoptic endoscopes 

• a Non-Invasive Pulmonary Diffusion and Cardiac Output Measurement System 

• a Cardiovascular Pressure-Dimension Analysis System 

• "BioBench™" a powerful turnkey application for physiological data acquisition and analysis 

• "piviT™", a Performance Indicator Virtual Instrument Toolkit to manage and forecast financial 
data 

• a "Virtual Intelligence Program" to manage the discrete components within the continuum of care 

• "BabySave™" an analysis and feedback system for apnea interruption via vibratory stimulation 

This chapter will describe several of these applications and describe how they have allowed clinicians 
and researchers to gain new insights, discover relationships that may not have been obvious, and test 
and model hypotheses based on acquired data sets. In some cases, these applications have been developed 
into commercial products to address test and measurement needs at other healthcare facilities throughout 
the world. 

Example 1: BioBench™ — A Virtual Instrument Application for Data 
Acquisition and Analysis of Physiological Signals 

The biomedical industry is an industry that relies heavily on the ability to acquire, analyze, and display 
large quantities of data. Whether researching disease mechanisms and treatments by monitoring and 
storing physiological signals, researching the effects of various drugs interactions, or teaching students 
in labs where students study physiological signs and symptoms, it was clear that there existed a strong 
demand for a flexible, easy-to-use, and cost-effective tool. In a collaborative approach, biomedical engi- 
neers, software engineers and clinicians, and researchers created a suite of virtual instruments called 
BioBench™. 

BioBench™ (National Instruments, Austin, TX) is a new software application designed for physiolog- 
ical data acquisition and analysis. It was built with Lab VIEW™, the world's leading software development 
environment for data acquisition, analysis, and presentation. 1 Coupled with National Instruments data 
acquisition (DAQ) boards, BioBench integrates the PC with data acquisition for the life sciences market. 

Many biologists and physiologists have made major investments over time in data acquisition hardware 
built before the advent of modern PCs. While these scientists cannot afford to throw out their investment 
in this equipment, they recognize that computers and the concept of virtual instrumentation yield 
tremendous benefits in terms of data analysis, storage, and presentation. In many cases, traditional 



'BioBench™ was developed for National Instruments (Austin, TX) by Premise Development Corporation (Avon, 
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FIGURE 88.1 A typical biomedical application using BioBench (courtesy of National Instruments). 

medical instrumentation may be too expensive to acquire and/or maintain. As a result, researchers and 
scientists are opting to create their own PC-based data monitoring systems in the form of virtual 
instruments. 

Other life scientists, who are just beginning to assemble laboratory equipment, face the daunting task 
of selecting hardware and software needed for their application. Many manufacturers for the life sciences 
field focus their efforts on the acquisition of raw signals and converting these signals into measurable 
linear voltages. They do not concentrate on digitizing signals or the analysis and display of data on the 
PC. BioBench™ is a low-cost turnkey package that requires no programming. BioBench is compatible 
with any isolation amplifier or monitoring instrument that provides an analog output signal. The user 
can acquire and analyze data immediately because BioBench automatically recognizes and controls the 
National Instruments DAQ hardware, minimizing configuration headaches. 

Some of the advantages of PC-Based Data Monitoring include: 

• Easy-to-use-software applications 

• Large memory and the PCI bus 

• Powerful processing capabilities 

• Simplified customization and development 

• More data storage and faster data transfer 

• More efficient data analysis 

Figure 88.1 illustrates a typical setup of a data acquisition experiment using BioBench. BioBench also 
features pull-down menus through which the user can configure devices. Therefore, those who have 
made large capital investments can easily migrate their existing equipment into the computer age. 
Integrating a combination of old and new physiological instruments from a variety of manufacturers is 
an important and straightforward procedure. In fact, within the clinical and research setting, it is a 
common requirement to be able to acquire multiple physiological signals from a variety of medical devices 
and instruments which do not necessarily communicate with each other. Often times, this situation is 
compounded by the fact that end-users would like to be able to view and analyze an entire waveform 
and not just an average value. In order to accomplish this, the end-user must acquire multiple channels 
of data at a relatively high sampling rate and have the ability to manage many large data files. BioBench 
can collect up to 16 channels simultaneously at a sampling rate of 1000 Hz per channel. Files are stored 
in an efficient binary format which significantly reduces the amount of hard disk and memory require- 
ments of the PC. During data acquisition, a number of features are available to the end-user. These 
features include: 
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Data Logging: Logging can be enabled prior to or during an acquisition. The application will either 
prompt the user for a descriptive filename or it can be configured to automatically assign a filename 
for each acquisition. Turning the data logging option on and off creates a log data event record 
that can be inspected in any of the analysis views of BioBench. 

Event Logging: The capacity to associate and recognize user commands associated with a data file 
may be of significant value. BioBench has been designed to provide this capability by automatically 
logging user-defined events, stimulus events, and file logging events. With user-defined events, the 
user can easily enter and associate date and time-stamped notes with user actions or specific 
subsets of data. Stimulus events are also data and time-stamped and provide the user information 
about whether a stimulus has been turned on or off. File logging events note when data has been 
logged to disk. All of these types of events are stored with the raw data when logging data to file 
and they can be searched for when analyzing data. 

Alarming: To alert the user about specific data values and thresholds, BioBench incorporates user- 
defined alarms for each signal which is displayed. Alarms appear on the user interface during data 
acquisition and notify the user that an alarm condition has occurred. 

Figure 88.2 is an example of the Data Acquisition mode of BioBench. Once data has been acquired, 
BioBench can employ a wide array of easy-to-use analysis features. The user has the choice of importing 
recently acquired data or opening a data file that had been previously acquired for comparison or teaching 
purposes. Once a data set has been selected and opened, BioBench allows the user to simply select and 
highlight a region of interest and choose the analysis options to perform a specific routine. 

BioBench implements a wide array of scalar and array analyses. For example, scalar analysis tools will 
determine the minimum, maximum, mean, integral, and slope of a selected data set, while the array 
analysis tools can employ Fast Fourier Transforms (FFTs), peak detection, histograms, and X vs. Y plots. 

The ability to compare multiple data files is very important in analysis and BioBench allows the user 
to open an unlimited number of data files for simultaneous comparison and analysis. All data files can 
be scanned using BioBench's search tools in which the user can search for particular events that are 
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FIGURE 88.2 BioBench acquisition mode with alarms enabled. 
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FIGURE 88.3 BioBench analysis mode. 

associated with areas of interest. In addition, BioBench allows the user to employ filters and transforma- 
tions to their data sets and all logged data can be easily exported to a spreadsheet or database for further 
analysis. Finally, any signal acquired with BioBench can be played back, thus taking lab experience into 
the classroom. Figure 88.3 illustrates the analysis features of BioBench. 

Example 2: A Cardiovascular Pressure-Dimension Analysis System 
Introduction 

The intrinsic contractility of the heart muscle (myocardium) is the single most important determinant 
of prognosis in virtually all diseases affecting the heart (e.g., coronary artery disease, valvular heart disease, 
and cardiomyopathy). Furthermore, it is clinically important to be able to evaluate and track myocardial 
function in other situations, including chemotherapy (where cardiac dysfunction may be a side effect of 
treatment) and liver disease (where cardiac dysfunction may complicate the disease). 

The most commonly used measure of cardiac performance is the ejection fraction. Although it does 
provide some measure of intrinsic myocardial performance, it is also heavily influenced by other factors 
such as heart rate and loading conditions (i.e., the amount of blood returning to the heart and the 
pressure against which the heart ejects blood). 

Better indices of myocardial function based on the relationship between pressure and volume through- 
out the cardiac cycle (pressure-volume loops) exist. However, these methods have been limited because 
they require the ability to track ventricular volume continuously during rapidly changing loading con- 
ditions. While there are many techniques to measure volume under steady state situations, or at end- 
diastole and end-systole (the basis of ejection fraction determinations), few have the potential to record 
volume during changing loading conditions. 

Echocardiography can provide online images of the heart with high temporal resolution (typically 30 
frames per second). Since echocardiography is radiation-free and has no identifiable toxicity, it is ideally 
suited to pressure-volume analyses. Until recently however, its use for this purpose has been limited by the 
need for manual tracing of the endocardial borders, an extremely tedious and time-consuming endeavor. 
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The System 

Biomedical and software engineers at Premise Development Corporation (Avon, CT), in collaboration 
with physicians and researchers at Hartford Hospital, have developed a sophisticated research application 
called the "Cardiovascular Pressure-Dimension Analysis (CPDA) System". The CPDA system acquires 
echocardiographic volume and area information from the acoustic quantification (AQ) port, in conjunc- 
tion with vetricular pressure(s) and ECG signals to rapidly perform pressure-volume and pressure-area 
analyses. This fully automated system allows cardiologists and researchers to perform online pressure- 
dimension and stroke work analyses during routine cardiac catheterizations and open-heart surgery. The 
system has been designed to work with standard computer hardware. Analog signals for ECG, pressure, 
and area/volume (AQ) are connected to a standard BNC terminal board. Automated calibration routines 
ensure that each signal is properly scaled and allows the user to immediately collect and analyze pressure- 
dimension relationships. 

The CPDA can acquire up to 16 channels of data simultaneously. Typically, only three physiological 
parameters, ECG, pressure, and the AQ signals are collected using standard data acquisition hardware. 
In addition, the software is capable of running on multiple operating systems including Macintosh, 
Windows 95/98/NT, and Solaris. The CPDA also takes advantage of the latest hardware developments 
and form-factors and can be used with either a desktop or a laptop computer. 

The development of an automated, online method of tracing endocardial borders (Hewlett Packard's 
AQ Technology) (Hewlett-Packard Medical Products Group, Andover, MA) has provided a method for 
rapid online area and volume determinations. Figure 88.4 illustrates this AQ signal from a Hewlett 
Packard Sonos Ultrasound Machine. This signal is available as an analog voltage (-1 to +1 volts) through 
the Sonos Dataport option (BNC connector). 

Data Acquisition and Analysis 

Upon launching this application, the user is presented with a dialog box that reviews the license agreement 
and limited warranty. Next, the Main Menu is displayed, allowing the user to select from one of six 
options as shown in Fig. 88.5. 

Clinical Significance 

Several important relationships can be derived from this system. Specifically, a parameter called the End- 
Systolic Pressure-Volume Relationship (ESPVR) describes the line of best fit through the peak-ratio (max- 
imum pressure with respect to minimum volume) coordinates from a series of pressure-volume loops 




FIGURE 88.4 The Acoustic Quantification (AQ) signal (Hewlett Packard). 
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FIGURE 88.5 Cardiovascular pressure-dimension analysis main menu. 
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FIGURE 88.6 The data selection front panel. 

generated under varying loading conditions. The slope of this line has been shown to be a sensitive index 
of myocardial contractility that is independent of loading conditions. In addition, several other analyses, 
including time varying elastance (E llmx ) and stroke work, are calculated. Time-varying elastance is measured 
by determining the maximum slope of a regression line through a series of isochronic pressure-volume 
coordinates. Stroke work is calculated by quantifying the area of each pressure-volume loop. Statistical 
parameters are also calculated and displayed for each set of data. Figure 88.7 illustrates the pressure- 
dimension loops and each of the calculated parameters along with the various analysis options. Finally, 
the user has the ability to export data sets into spreadsheet and database files and export graphs and 
indicators into third-party presentation software packages such as Microsoft PowerPoint®. 
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FIGURE 88.7 The cardiac cycle analysis front panel. 



Summary 



Virtual Instrumentation allows the development and implementation of innovative and cost-effective 
biomedical applications and information management solutions. As the healthcare industry continues 
to respond to the growing trends of managed care and capitation, it is imperative for clinically useful, 
cost-effective technologies to be developed and utilized. As application needs will surely continue to 
change, virtual instrumentation systems will continue to offer users flexible and powerful solutions 
without requiring new equipment or traditional instruments. 
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Historical Perspectives 3: 

Recording of 
Action Potentials 



Nerve Action Potential 

Action Potentials in Afferent Fibers 
Leslie A. Geddes Code of the Nervous System 

Purdue University True Form of the Nerve Action Potential 

Nerve Action Potential 

The question for the form and nature of the nerve action potential used instruments that we would now 
call primitive. Yet, in skilled hands, these instruments led directly to discovery of the code used by the 
nervous system to transmit information. In fact, the code was discovered before the true form of the 
nerve action potential was known. 

Using a slow-speed galvanometer ballistically to measure time [Hoff & Geddes, 1960], Helmholtz 
[1850, 1851, 1853] measured the velocity of the frog sciatic nerve action potential and determined it to 
be 30 m/s. This was far below the speed of electricity and caused much controversy among physiologists 
of the day. Using the rheotome, a slowly responding galvanometer, and an induction coil stimulator 
[Geddes et al., 1989], Bernstein [1868] reconstructed the action potential of the frog sciatic nerve [Hoff & 
Geddes, 1957]. The sampling time was 0.3 ms, and the action potential that he obtained is shown in 
Fig. HP3.1. Not only did Bernstein chart the time course of this 0.5626- to 0.8041-ms action potential, 
he measured its propagation velocity, obtaining an average of 28.718 m/s, agreeing with the speed obtained 
by Helmholtz. Thus, with primitive electromechanical instruments, the time course and velocity of the 
nerve action potential were determined accurately. 

When the capillary electrometer appeared in 1876, it was thought that it might be possible to use it 
to record the nerve action potential. Many attempts were made with variable results because the concept 
of response time was not fully appreciated. Gotch and Horsley [1888] in the United Kingdom stimulated 
peripheral nerves in the cat using induction-coil shocks, and with one recording electrode over intact 
nerve and one in an area of injury, they recorded the action potential. The type of record that they 
obtained is shown in Fig. HP3.2a. Note that the response (action potential) was in the same direction 
for the break (b) and make (m) shocks. Recall that the break shock from an induction coil is stronger 
than the make shock. They knew that the make and break shocks were of opposite polarity and proved 
it by recording them, as shown in Fig. HP3.2&. After a long discussion about the nerve response being 
always in the same direction, irrespective of the polarity of the stimulus, they concluded that they had 
recorded single action potentials in response to single-induction-coil stimuli. They wrote: 

There is thus no doubt that the movement [of the mercury contour] that we obtained and photo- 
graphed was due to the electromotive change which was due to the electromotive change which 
accompanies the propagation of an excitatory state along the mammalian nerve when this state is 
evoked by the application of a single stimulus. 



© 2000 by CRC Press LLC 




FIGURE HP3.1 Bernstein's recon- 
struction of the action potential of the 
frog sciatic nerve. From his experi- 
ments, the mean duration was 0.6833 
ms. This reconstruction was made 
with a slow-speed galvanometer and 
the rheotome. (From Bernstein, 
) 



Like Gotch and Horsley [1888], others who attempted recording 
nerve action potentials with the capillary electrometer were 
rewarded with a small-amplitude record. When Einthoven's string 
galvanometer appeared in 1903, it was also used to record nerve 
action potentials with the same result [Einthoven, 1903]. In 1907 
and 1908, De Forest patented the triode (audion), and the vacuum- 
tube amplifier could be constructed. Therefore it appeared logical 
to use a triode amplifier to enlarge nerve action potentials and display 
them with the string galvanometer and capillary electrometer. 

The first to put the vacuum-tube amplifier to work in electro- 
physiology were Forbes and Thacher [1920], who coupled a triode 
to a string galvanometer and recorded frog nerve and human mus- 
cle action potentials. Their paper is essentially a tutorial that 
describes three ways of coupling the triode to the string galvanom- 
eter so that the delicate string would not be damaged. The first 
method placed the vacuum tube in one arm of a Wheatstone bridge. 
The string galvanometer was in the detector position, and a resistor 
constituted the arm adjacent to the triode. The second method 
employed transformer coupling, and the third method employed 
capacitor (C) coupling, as shown in Fig. HP3.3«. Of the three meth- 
ods, the capacitive-coupling method was preferred. Figure HP3.3& 
is a record obtained by Forbes and Thacher showing the amplified 
and directly recorded frog sciatic nerve action potential. The timing signal (bottom) is 100 Hz. 

Desirous of displaying repetitive nerve action potentials with the string galvanometer, Gasser and 
Newcomer [1921] employed a two-stage resistance-capacity-coupled amplifier connected to the string 
galvanometer. They chose the phrenic nerve as the object of their study because of its spontaneous 
periodic activity in causing the diaphragm to contract tetanically and produce inspiration. The frequency 
of the phrenic nerve action potentials ranged from 71 to 105 per second, noting that their amplitude 
appeared to be largest at the peak of inspiration. They also reported a one-to-one correspondence between 
phrenic nerve and diaphragm action potentials. Their important observations were later to become 
recognized as the two ways that intensity is signaled in the nervous system. 

The addition of amplification to the string galva- 
nometer did not improve its ability to respond to rap- 
idly rising, short-duration action potentials. It 
required the cathode-ray tube and a multistage ampli- 
fier to solve this problem. Gasser and Erlanger [1922] 
were the first to show the form of the nerve action 
potential recorded extracellularly using the cathode- 
ray tube and triode amplifier that they had previously 
developed for use with the string galvanometer. Not 
only did they achieve their goal, but they made the 
fundamental discovery that nerve propagation veloc- 
ity was proportional to nerve fiber diameter. 

Although rapidly responding, the cathode-ray 
tube is quite insensitive, requiring about 20 to 50 V 
for a 1-cm deflection of the spot made by the elec- 
tron beam on the face of the tube. Therefore, con- 
siderable amplification was needed to display the 
millivolt nerve action potentials detected with elec- 
trodes on the surface of a nerve trunk. Gasser and 




FIGURE HP3.2 The first nerve action potentials in 
response to single stimuli, recorded with the capillary 
electrometer in response to a break (b) and make (m) 
induction-oil stimulus (top). (Bottom) The capillary 
electrometer record of the break (b) and make (m) 
stimuli from the induction coil; note that they are of 
opposite polarity and that of the nerve responses (top) 
are of the same polarity. (From Gotch & Horsley, 1 
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FIGURE HP3.3 Use of the capacitively coupled ( C) triode to enlarge action potentials from a nerve (N) and display 
them on the string galvanometer (G). (b) (upper) An amplified action potential and (lower) the unamplified action 
potential. At the bottom is a record from a 100-Hz tuning fork [Forbes & Thacher 1920]. 

Erlanger [1922] built a three-stage amplifier to display nerve action potentials on the cathode-ray tube 
screen. Figure HP3.4 is the circuit diagram of their equipment. The cathode-ray tube was of the low- 
voltage (300) type and was provided by J. B. Johnson and E. B. Craft of the Western Electric Company. 
The tube contained a little argon gas. The fluorescent screen was green with a long persistence (5 to 10 
s). The deflection sensitivity was 10 to 20 V/cm. 

Before describing the Gasser and Erlanger Nobel prize-winning research, it is of interest to examine 
how the nerve was stimulated and the action potentials were recorded. Figure HP3.4 shows the nerve 
(N) on the left with the stimulating electrodes (T) and the recording electrodes (E). The nerve was 
crushed under the electrode connected to the grid (G) of the first vacuum-tube amplifier. Therefore, the 
nerve action potential appeared under the electrode on the uninjured nerve. Three stages of amplification 
were used to enlarge the nerve action potential 7000 to 8000 times, which caused the beam of the cathode- 
ray tube to be deflected vertically. 

On the right of Fig. HP3.4 is a rheotome that (1) delivered the stimulus to an induction coil (PS) 
connected to the nerve stimulating electrodes (T), (2) started the cathode-ray tube beam moving across 
the face of the tube to provide a time axis by starting the 1-mF (nowadays uE) capacitor to charge (the 
voltage on this capacitor was connected to the horizontal deflecting plates in the cathode-ray tube), and 
(3) later discharged the 1-uF capacitor so that the cycle could be restarted. In this way, the stimulus was 
delivered to the nerve at the instant when the spot started horizontally across the face of the cathode- 
ray tube. Typically, 30 stimuli were delivered per second to produce a clear (standing wave) action 
potential on the cathode-ray tube. Commenting on the action potential recorded from frog sciatic nerve, 
Gasser and Erlanger [1922] wrote: 

The action current has a gradual ascent, a steep smooth anacrotic limb [rising phase] and a more 
gradual catacrotic limb [falling phase]. The latter like the former shows a period of great initial 
acceleration so that the crest is situated near the anacrotic side. In frog nerve and some mammalian 
nerves there are secondary waves on the catacrotic limb n. Suggestions are made as to the cause of 
these waves. 

Soon an explanation was given for the secondary waves; it came in 1924 when Erlanger and Gasser [ 1924] 
published their classic report which will now be described. 

Having no camera, Gasser and Erlanger either placed tissue paper on the cathode-ray tube face and 
traced the waveform with a pencil or pressed photographic paper against the face of the tube and obtained 
a contact print. Figure HP3.5 is such an illustration of the action potential of the bullfrog sciatic nerve. 
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FIGURE HP3.5 Contact print from the cathode-ray tube showing the action potential of the bullfrog sciatic nerve 
recorded by Erlanger & Gasser [1924]. 

Erlanger and Gasser knew that their oscilloscope time base was exponential and corrected the oscil- 
lograms accordingly. A favorite method employed transilluminating a contact print and tracing the action 
potential on the back with a pencil to obtain a positive image that was then replotted on semilogarthmic 
paper. This step was essential because they desired a true temporal display of the compound action 
potential from a nerve trunk. By varying the distance over which the action potential was propagated, 
they were able to reveal that the nerve trunk contained groups of fibers that propagated with different 
velocities. Commenting on their results, they stated: 

Each of the waves of these compound action currents as it progresses along the nerve changes its form 
just as does the simple action current in the phrenic nerve. It is suggested that these changes are due, 
in part, at least, to slight differences in the propagation rate in individual, or in many small groups of 
fibers, whose action currents therefore get slightly out of phase as they progress. 

Finally, Erlanger and Gasser [1937] summarized their work in a monograph entitled Electrical Signs of 
Nervous Activity. In it they not only showed how the compound action potential of nerve depends on 
the velocities of propagation of the different bundles of nerve fibers in a trunk but also provided 
histograms (fiber maps) of the diameters of different-sized fibers in a nerve trunk. Figure HP3.6 shows 
the action potentials of the A, B, and C fibers. For this pioneering work Erlanger and Gasser received 
the Nobel prize in physiology and medicine in 1944. 

Meanwhile, in the United Kingdom, Adrian was conducting experiments that showed that a nerve 
fiber responds in an all-or-none manner and that intensity is signaled in a single nerve fiber by the 
frequency of action potentials. In addition, he showed that intensity is also signaled by the number of 
nerve fibers carrying messages. These remarkable discoveries were made with the capillary electrometer, 
which even then was considered to be a primitive instrument. Adrian [1928] defended his use of the 
capillary electrometer in the following way: 

The ideal instrument for recording nerve action currents is undoubtedly the cathode-ray oscillograph 
devised by Erlanger and Gasser, for in this the moving system is a stream of cathode rays, the inertia 
of which is completely negligible. At present, however, the intensity of the illumination from the ray 
is far too small to allow photographs to be made from a single excursion, and similar excursions must 
be repeated many times over before the plate or the eye is affected. As a result, the cathode-ray 
oscillograph can only be used in experiments where the same sequence of action currents can be 
repeated over and over again, and it is not suitable for recording an irregular series of action currents 
such as are produced by the activity of the central nervous system. Another instrument in which the 
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FIGURE HP3.6 Action potential components of the A, B, and C fibers of nerve. (From Erlanger J, Gasser HS. 
Electrical Signs of Nervous Activity. Philadelphia, University of Pennsylvania Press, 1937. With permission.) 

inertia factor is extremely small is the capillary electrometer. This has fallen out of favor with the 
majority of physiologists because its records need analysis and because of its low sensitivity compared 
with that of the string galvanometer. These objections have now become of little importance. With 
the advent of reliable valve [vacuum-tube] amplifiers a low sensitivity in the recording instrument is 
no drawback at all, and the analysis of capillary electrometer records can be made in a few moments 
by the machine designed by Keith Lucas. As will be seen, the combination of value amplifier and 
[capillary] electrometer gives us an instrument of such range and precision that it promises access to 
fields of investigation which are as yet almost unexplored. 

In Adrian's first studies [1928] he used the three-valve (vacuum-tube) amplifier shown in Fig. HP3.7 
to record action potentials in different nerve fibers. In his next studies with Bronk [Adrian & Bronk, 
1928], he used a single-stage amplifier (Fig. HP3.8) that drove a capillary electrometer across which could 
be connected earphones or another amplifier (5) that drove a loudspeaker {LS) to enable listening to the 
nerve action potentials. Adrian stated: 




Ct 



FIGURE HP3.7 The three-stage valve (vacuum-tube) amplifier and capillary electrometer (CE) used by Adrian to 
record action potentials in afferent nerves. The dashed lines represent shielding. (Redrawn from [Adrian, 1926].) 
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FIGURE HP3.8 Amplifier and capillary electrometer used by Adrian to record nerve action potentials along with 
headphones, amplifier (B), and loudspeaker {LS) used to aurally monitor their frequency. (From Adrian & Bronk, 
1928.) 

The three-valve amplifier owes much to the great kindness of Prof. Gasser, who supplied me with 
details of the amplifier used by him in America, and to the staff of Messers W. G. Pye and Co. of 
Cambridge, who redesigned an instrument on the same general line and planned the very compact 
and well shielded lay-out of the apparatus. 

Adrian provided details on his capillary electrometer by stating, "The capillary tube at present has a 
diameter of 0.03 mm at its working part, and a pressure of 140 mmHg is needed to bring the mercury 
to this point." The working part refers to the sulfuric acid-mercury interface, the contour of which 
changes when current traverses it. Adrian stated that he could detect a voltage as small as 0.01 mV when 
his electrometer was connected to the three-stage amplifier. 

Adrian knew that a photographic recording of the change in contour of the meniscus of the capillary 
electrometer was not a true representation of the applied voltage; this fact had been pointed out by Burch 
[ 1 892 ] . Keith Lucas, a collaborator, devised a mechanical instrument for correcting capillary electrometer 
records. However, Adrian was less interested in waveform than in the presence or absence of action 
potentials. 

Action Potentials in Afferent Fibers 

In the paper that described the three-valve (vacuum-tube) amplifier and capillary electrometer, Adrian 
[1928] presented numerous examples of the nature of the action potentials in afferent nerve fibers in the 
frog sciatic nerve when the gastrocnemius muscle was stretched with known weights. He found that the 
frequency of the action potentials was related to the weight. Fig. HP3.9 shows his corrected capillary 
electrometer records of this experiment. 

After applying strict criteria to test the validity of his results, Adrian then recorded trains of afferent 
impulses in the saphenous nerve of a decapitated cat when a forcep was used to pinch the skin of the 
foot. The same result was obtained with a pin prick. He then recorded afferent impulses in the vagus 
nerve in the spinal and decerebrate cat and in the anesthetized rabbit. He also recorded trains of action 
potentials in the vagus nerve that were synchronous with the heartbeat and with respiration. Commenting 
on the latter he wrote: "The striking result is the absence of any sign of renewed discharge of impulses 
at the moment when the lungs are most deflated." 

In the summary to his paper, Adrian stated: 
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FIGURE HP3.9 Corrected capillary electrometer records of afferent action potentials in a frog sciatic nerve when 
weights were applied to the gastrocnemius muscle, (c) shows the action potentials from 10 g applied for 10 s; (d) shows 
the same weight applied for 24 s; and (e) shows the action potentials for 100 g weight applied for 10 s. Note the 
higher frequency of action potentials with 100-g weight. (From Adrian, 1928.) 

It is probable that many of the oscillations represent action currents in a single nerve fibre, and these 
same general form and the same general time relations (allowing for temperature differences) in all 
sensory nerves in which they can be isolated sufficiently for measurement. There is no evidence that 
an increase in the stimulus increases the size of the action currents in single fibres, but the frequency 
of the impulses in the nerve trunk increases and leads to interference and overlapping of impulses in 
different fibres. 

He continued: 

More detailed analysis of the results is postponed until experiments have been made on preparations 
containing a known number of sensory endings, if possibly only one. 

It was not long before Adrian succeeded in recording action potentials in a single nerve fiber. Adrian 
and Bronk [1928] stated: 

To be sure of what is happening in the single nerve fibre we have to devise a method which will put 
out of action all the other fibres in the nerve. The recording of impulses in a single fibre presents no 
difficulty, but the problem of isolating the fibre seemed much more formidable. Fortunately this has 
turned out not nearly so difficult as we had imagined. 

Adrian and Bronk [1928] used the cervical rabbit phrenic nerve and dissected single fibers free with 
a needle and viewed the dissection with a binocular microscope. Teasing out a selected single fiber and 
sectioning it, electrodes were placed at the distal end. In this way, only afferent information was recorded. 
The equipment that they used in Fig. HP3.8. Commenting on use of the headphones or a loudspeaker, 
they stated: 

The amplified action currents can be photographed with the capillary electrometer, but until the final 
stages are reached, it is usually more convenient to lead them to a telephone or loudspeaker and 
estimate the character of the discharged by the ear instead of the eye. 

They continued by showing how aural monitoring aided in the experiment: 

When only a few fibres are in action the electrometer excursions may be too small to detect on a 
screen, but they produce a series of faint clicks in the loudspeaker, and it is thus possible to control 
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FIGURE HP3.10 Capillary electrometer records from the phrenic nerve of the decerebrate cat breathing sponta- 
neously, (a) The airway is open. (6) The trachea was clamped. (From Adrian and Bronk, 1928.) 

the dissection, to expose a plate at the moment when the discharge is at its height, etc., without the 
inconvenience of wearing telephones. 

Adrian and Bronk [1928] investigated the frequency of action potentials in an intact phrenic nerve of 
the decerebrate cat breathing spontaneously. The single-state amplifier (Fig. HP3.8) was used. 
Figure HP3. 10a illustrates the phrenic nerve action potentials during spontaneous breathing, and 
Fig. HP3.10& shows the recordings with the trachea clamped, the breathing becoming labored. 

Having shown that the frequency of the action potentials in the phrenic nerve increased with depth 
of breathing, Adrian and Bronk [1928] set themselves the task of investigating how intrapulmonic pressure 
was related to the frequency of stimuli applied to the phrenic nerve. Using a rabbit in which the central 
circulation was occluded above CI, they connected a manometer to the trachea that could be clamped 
so that the manometer read the negative intrapulmonic pressure when the C4 phrenic nerve root was 
stimulated. They used a coreless induction coil connected to a rotary switch (rheotome) to enable delivery 
of stimuli at any desired frequency. By delivering bursts of stimuli of different frequencies, they plotted 
the negative intrapulmonic pressure as a function of frequency, clearly demonstrating the dependence 
of the amplitude of inspiration on the frequency of phrenic nerve stimuli. 

Adrian published two monographs on his work; one was entitled, The Mechanism of Nervous Action, 
and the other, The Basis of Sensation. Interestingly, neither publication contains illustrations obtained 
with the cathode-ray oscilloscope; however, there are string galvanometer and capillary electrometer 
recordings. Obviously, Adrian had brought the capillary electrometer to a high degree of perfection, but 
it is important to remember that Adrian's interest lay in the frequency of the action potentials, not the 
intimate details of their waveforms. 

Perhaps the best summary of Adrian's contributions appears in a single sentence in The Mechanism 
of Nervous Action. The section entitled, "Gradation of Activity," states: 

There is certainly no evidence to suggest that the impulses are graded in size, for the fact that sensory 
messages may produce a small or large effect according to the intensity of the stimulus is naturally 
explained by the varying number of fibres in action and by the varying frequency of the discharge in 
each fibre. 

In 1932, Adrian shared the Nobel prize in physiology and medicine with Sherrington. The citation read, 
"For their discoveries regarding the function of the neuron." 
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FIGURE HP3.11 The transmembrane potential of the giant axon of the squid at rest {left) and during activity. 
Time marks 2 ms. (From Hodgkin and Huxley, 1939. With permission.) 



Code of the Nervous System 

From the impressive research performed by Gasser and Erlanger and by Adrian, the code of the nervous 
system was discovered. Stated simply, ( 1 ) intensity is signaled by the frequency of action potentials in a 
single axon, the action potentials all being the same, and (2) intensity is also signaled by the number of 
axons transmitting the information. In modern terms, the nervous system is a communications system 
that is binary and frequency modulated. 

True Form of the Nerve Action Potential 

Although the studies by Bernstein, Gasser, Erlanger, and Adrian provided information on the nerve action 
potential, its true form could not be established until the micropipet electrode was used to measure the 
transmembrane potential. Hodgkin and Huxley [1939] obtained the true waveform of the nerve action 
potential and thereby ushered in the modern era of electrophysiology. In a paper published in Nature 
on October 21, 1939, they reported that J. Z.Young [1930] called their attention to the giant axon (500 urn 
in diameter) of the squid, which was ideal for electrophysiologic studies. 

The first micropipet electrode used by Hodgkin and Huxley consisted of a glass tube, 100 urn in 
diameter, that was slipped into the cut end of the giant axon. The electrode was mounted vertically and 
filled with seawater, and a silver-silver chloride electrode was inserted. The axon was then dipped into 
a container of seawater in which a second chlorided silver electrode was placed. The transmembrane 
potential was found to be -45 mV at 20°C. When the axon was stimulated, the action potential was 
90 mV. Figure HP3.11 is a reproduction of this historic record. 

The Hodgkin and Huxley experiment clearly demonstrated two important phenomena: (1) there is a 
measurable resting transmembrane potential, and (2) the action potential is larger than the resting 
transmembrane potential, the latter indicating that activity represents more than a mere disappearance 
of the transmembrane potential. The same team later provided the explanation with their classic papers 
on ion fluxes. Today, the action of many drugs is explained on the basis of ion fluxes. 
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Noninvasive inductive coupling device used to apply low field intensities. 
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THE PURPOSE OF BIOMEDICAL ENGINEERING is to develop and employ the best available 
technology for the benefit of human health. Implicitly, the discussion of biologic effects of 
nonionizing electromagnetic fields in a biomedical engineering handbook will then be concerned 
with beneficial medical applications of such fields. However, these applications must be based to as large 
an extent as possible on scientific knowledge. This section therefore begins with a description of exper- 
imentally established dielectric properties of biological materials (Chapter 89). This is followed by a 
general discussion of biologic effects — adverse as well as beneficial — of low-frequency magnetic fields 
(Chapter 90). This chapter is restricted to low-frequency magnetic — rather than electric and mag- 
netic — fields because the succeeding chapter (Chapter 91), which deals with therapeutic applications, 
indicates that time-varying magnetic fields are of greater interest. At the present time, it is not known 
whether observed biologic effects of low-frequency magnetic fields (particularly of those having modest 
amplitude) are due to the electric fields and currents which they induce in tissue as a consequence of 
Faraday's law or are due to an as yet unknown mechanism of direct magnetic field-tissue interaction. 

Since the fundamental equations that characterize field-tissue interaction are very different in the 
power to audiofrequency range from those which describe biologic effects at radio and microwave 
frequencies, separate chapters are devoted to the different frequency domains. Thus Chapter 92 is devoted 
to discussing biologic effects, again not necessarily beneficial, of RF fields, and Chapter 93 describes 
methods for their principal medical application, which is the production of desired hyperthermia. 

While Chapters 89 to 92 describe primarily measurements, effects, and applications of relatively low- 
intensity fields, Chapter 94 is devoted to the fundamentals of the increasingly important area of the 
interaction of short, high-intensity electric field pulses with biologic cells. It is shown that understanding 
of "electroporation" is not only important for understanding the nature of electric injury, but that this 
phenomenon is becoming a very important tool in biotechnology for transferring genetic material 
between different cells, as well as being applied for local drug delivery to tissue. 

The division of this section along the indicated lines is mandated by the observations that high- and 
low-intensity fields have different biologic effects — to consider extremes, electrocution versus healing of 
fractured bones — and that different frequencies of electromagnetic energy interact differently with mate- 
rials of all types. Of particular importance is the relation between the wavelength X of electromagnetic 
waves and the magnitudes L of field-generating and field-absorbing structures of their distances. Since 



k = - (IX.l) 

where /is the frequency and c is the propagation velocity (3.0 x 10 8 m/s in free space), it is obvious that 
L « X at a "low" frequency, even as high as 1 kHz, where X = 300 km, when L represents dimensions 
that are of interest for interaction with animals, humans, or cell cultures of power transmission lines, 
motors, or laboratory instruments. When L « X, all the "quasi-static" approximations discussed in 
Chapter 90 are applicable, which implies that time retardation effects, i.e., the wave nature of electro- 
magnetic energy, can essentially be disregarded. At low frequencies, the field amplitude most often 
decreases as 1/r 2 or 1/r 3 , where r is the distance from the source, rather than as 1/r, as it does in an 
electromagnetic wave in free space. Furthermore, electric and magnetic fields are effectively separable at 
extremely low frequencies (ELF) in the sense that the field from a particular source in the "near-field 
region" (i.e., when L « X) is either "primarily" electric (E » H) or "primarily" magnetic (H » E). 
Here E is the electric and H is the magnetic field intensity. AT RFs or microwaves, on the other hand, 
one is generally interested in the "far field" or "radiation field" region, where the ratio between E and H 
is equal to the "wave impedance" r\. It is given in a loss-free medium by r| = Vji/e, where u and e are, 
respectively, the magnetic permeability and dielectric permittivity of the medium in which the wave is 
propagating. (The magnetic flux density B is given by B = U.H.) 
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TABLE IX. 1 Biologic Effects of ELF and Pulsed Electric Fields 



V/m 

Inside Tissue 

or Fluid 



Electroporation (transient or permanent, depending on amplitude and duration of pulse) 10 5 

Cell rotation in insulating fluid 10 4 

Nerve/muscle stimulation 

Initiation of firing 10 3 

Alteration of firing rate 10 

Subtle "long-term" (f > 10 min) effects (bone/soft tissue repair, Ca-efflux) 10~ 3 to 10~ 4 

The only precaution that is needed in the application of quasi-statics at ELF is to remember the relation 
between E and B when the sources involved produce predominantly a magnetic field (such as the 
application coils in a bone therapy apparatus): 

r)F> 

VxE = --#0 (IX.2) 

at 

Thus in the tissue, the curl of the electric field, i.e., the circulation of the electric field per unit area, is 
not zero, as it is when the applied quasi-static field comes directly from an electric field source. As a 
consequence, current distributions in tissue due to such different sources are likely to be very different 
even if current magnitudes are possibly, in a particular case, identical. 

As already mentioned, Chapters 89 to 92 deal primarily (although not exclusively) with low-intensity 
fields while Chapter 94 describes only high-field-intensity effects. It is of interest here to summarize the 
known biologic effects of various intensities of dc and low-frequency electric fields (Table IX.l). 

The basic laws that give forces (F) and torques (T) due to electric and magnetic fields are listed on 
Table IX.2. In this table, q = electric charge, v = charge velocity, p = electric dipole moment (magnitude 
of p = charge x separation distance between opposite charges), and m = magnetic moment (magnitude 
of m = magnitude of circulating current x area enclosed by current; direction of m is perpendicular to 
the plane of the circulating current). The derivatives with respect to the distance x are a one-dimensional 
representation of spatial field gradients. The equations listed in Table IX.2, as well as Maxwell's equations 
(or the appropriate applicable approximations) and the known laws of classic and/or quantum mechanics, 
subject to the boundary conditions characteristic for living tissue, must provide a quantitative explanation 
for the observed biologic effects of electric and magnetic fields. Biologic tissue and cells are obviously 
extremely complex media; they are not only extremely inhomogeneous and anisotropic, but they are also 
not in thermodynamic equilibrium (unless dead). Thus the application of physical laws to the explanation 
of field-tissue interactions becomes a very complex problem, and the physicist and engineer must be 
careful not to provide "explanations" or to set limits on what should be "possible" or "impossible" based 
on physical models that are very far from even an approximate representation of biologic conditions. 

It is not surprising, therefore, that only relatively 

gross effects, such as cell damage in electroporation TABLE IX.2 Forces and Torques on Electric 

or heating by radiofrequency fields of sufficiently Charges and Electric and Magnetic Dipole Moments 

high intensity, are reasonably well (although not F _ aE F = a (v x B) 

completely) understood. Some "nonthermal" E _ u dE dB 

microwave effects (Chapter 92) and several low- dx dx 

T=pxE T=mxB 



frequency effects of magnetically induced electric 

fields of less than lO" 3 V/m (Table IX. 1 and Chapters Noninvasive inductive coupling device used to apply 

90 and 91) or alternating magnetic fields of less low field intensities. 
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than about 100 uT are presently not understood. The interested reader may want to consult references 
given in the following chapters and/or discussions by Adair [1991], Adey [1990], Blanchard and Blackman 
[1994], Douglass etal. [1993] Kaiser [1993], Kirschvinket al. [1992], Lednev [1991, 1993],Luben [1993], 
Polk [1992, 1994], Tenforde [1992b], Weaver and Astumian [1990], or additional references listed by 
these authors. There are numerous, experimentally confirmed in vitro biologic effects at low ELF field 
intensities, e.g. Harland and Liburdy [1997]. These effects may be beneficial, may be adverse, or may be 
of no consequence in entire animals or humans. Some epidemiologic data (Chapter 90) suggest the 
possible existence of adverse health effects in Portier and Wolfe [1998]. The experimental evidence existing 
at the present time is insufficient, however, to decide whether any of the more promising physical models 
that are discussed in the given references can provide an adequate explanation for any of the observed 
biologic effects. 

Two additional observations appear to be appropriate. The first is that the biologic environment is 
much more complex than the man-made inanimate world, where many different electric and magnetic 
field effects are used in devices as diverse as large electric motors, Xerox machines, feedback controllers, 
or television sets. It would therefore be surprising if identical electric or magnetic field "mechanisms" 
would be responsible for effects of different frequencies or field intensities on different biologic systems 
(e.g., immune cells versus bone tissue). The second observation concerns the existence and amplitude 
of the ambient, natural electric and magnetic fields. It is sometimes pointed out — correctly — that humans 
have been exposed for eons to the natural geomagnetic field of about 50 uT and to the natural dc electric 
field, which at the earth's surface is about 100 V/m and can be several thousand volts per meter under 
a thunder cloud. Without conducting contact, dc electric fields of this magnitude have indeed not been 
implicated in any biologic effect, nor have dc magnetic fields of about 50 uT, without the simultaneous 
presence of an alternating field, been shown to have significant biologic consequences [Tenforde, 1992a; 
Frankel and Liburdy, 1995]. Virtually all the reported biologic effects of low-intensity electric and 
magnetic fields on living organisms, including cells, involve time variation, particularly at frequencies 
below 100 Hz (or microwaves amplitude modulated at these frequencies). The low-frequency effects on 
animals living outside water appear also to be primarily due to time-varying magnetic fields. Clearly, the 
natural ambient magnetic field of about 50 uT is a static field that exhibits diurnal slow variations (i.e., 
with frequency components well below 1 Hz) only of the order of 10~ 3 uT and with "large" variations 
(taking minutes to hours for one cycle) during magnetic storms occurring a few times each year, rarely 
as large as 0.5 uT [Matsushita and Campbell, 1967]. The natural background of magnetic fields with 
"higher" frequencies (10 to 100 Hz) is on the order of 10 pT [Polk, 1982]. Thus the geomagnetic field 
is clearly a static field that cannot induce an electric field into a stationary object (as a consequence of 
Faraday's law) anywhere near the magnitude of that induced by a 60-Hz magnitude field even as low as 
0. 1 |iT. Uniform linear motion of an object, such as that of a walking human, in a nearly uniform magnetic 
field of about 50 uT will produce an induced "Lorentz" electric field proportional to the product of 
velocity and flux density. However, that field cannot produce circulating electric currents as long as the 
total magnetic flux integrated over the cross-sectional area of the object (animal or human), }] B ds, does 
not change. Only tumbling motion, such as linear motion by means of somersaults, through the earth's 
magnetic field could produce induced electric currents comparable in magnitude to those induced by a 
1-uT, 60-Hz field. Thus there is no basis for statements, sometimes made, that biologic effects of weak 
ELF magnetic fields are impossible, because animals or humans have been exposed throughout their 
development to the geomagnetic field. 
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The bulk electrical properties of tissues and cell suspensions have been of interest for many reasons for 
over a century. These properties determine the pathways of current flow through the body. This gives 
them fundamental importance in studies of biologic effects of electromagnetic fields, in measurements 
of physiologic parameters using impedance, and in basic and applied studies in electrocardiography, 
muscle contraction, nerve transmission, and numerous other fields. 

I will briefly define the quantities used to characterize the bulk electrical properties of tissues and give 
some of the background information needed to interpret the data. A more extensive review is presented 
elsewhere [1]. Other reviews of tissue properties are by Schwan [2], Pethig [3], Grant et al. [4], Schanne 
and P.-Ceretti [5], and Duck [6]. Other tabulations of tissue properties are by Schwan [2], Geddes and 
Baker [7], and Stuchly and Stuchly [8]; Schwan [9] has published an extensive review of practical 
measurement techniques. 

89.1 Definitions and Basic Phenomena 

The dielectric permittivity ee and conductivity a of a material are, respectively, the dipole and current 
densities induced in response to an applied electric field of unit amplitude. 1 The significance of these 
quantities can be illustrated by considering an ideal parallel-plate capacitor, whose plates have surface 
area A and separation d. The capacitance C and conductance G of the capacitor are then 

d (89.1) 

d 

(This neglects the effects of fringing fields and applies at low frequencies where propagation effects can 
be neglected.) At radian frequency co, the admittance Y of the capacitor can be written 



'*In MKS units, the permittivity and conductivity have units of farads per meter and Siemens per meter, respec- 
tively. For convenience, we write the permittivity as e (the relative permittivity) times e , the permittivity of vacuum. 
e = 8.85 (10- 12 F/m). The resistivity p = l/o. 
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Y 



= (g+;'coc) 



,A 



= (a+;coee ) d 

(89.2) 

d 

a 

where a* = a + jcoee is the complex conductivity, and e* = e - ja/coe is the complex permittivity. 2 In 
the usual notation, e* = e' - je", where e" is the loss and tan (e"/e') is the loss tangent. Typically, for soft 
tissues at low frequencies, 

» COee 

o 

and the tissue can, for many purposes, be approximated adequately by considering it to be a pure 
conductor and neglecting the permittivity entirely. 

For tissues, both e and a are strong functions of frequency (Fig. 89.1). This frequency dependence 
(dispersion) arises from several mechanisms. These mechanisms are discussed, with reference to simple 
biophysical models, in Foster and Schwan [1]. For a typical soft tissue, different mechanisms dominate 
at different frequency ranges: 

• At low frequencies (typically below several hundred kilohertz), the conductivity of the tissue is 
dominated by conduction in the electrolytes in the extracellular space. The bulk conductivity of 
the tissue is then a sensitive function of the volume fraction of extracellular space and the con- 
ductivity of the extracellular medium. 

• At low frequencies, the tissue exhibits a dispersion (the alpha dispersion), centered in the low- 
kilohertz range, due to several physical processes. These include polarization of counterions near 
charged surfaces in the tissue and possibly the polarization of large membrane-bound structures 
in the tissue. At frequencies below the alpha dispersion, the relative permittivity of tissue reaches 
very high values, in the tens of millions. The alpha dispersion is very apparent in the permittivity 
but hardly noticeable in the conductivity of the tissue. 

• At radiofrequencies, the tissue exhibits a dispersion (the beta dispersion), centered in the range 
0.1 to 10 MHz, due to the charging of cell membranes through the intracellular and extracellular 
media. Above the beta dispersion, the cell membranes have negligible impedance, and the current 
passes through both the extracellular and intracellular media. The beta dispersion is apparent in 
both the permittivity and conductivity of the tissue. 

• At microwave frequencies (above 1 GHz), the tissue exhibits a dispersion (the gamma dispersion) 
due to rotational relaxation of tissue water. This dispersion is centered at 20 GHz and is the same 
as that found in pure liquid water. 

In addition to these three major dispersions, other smaller dispersions occur due to rotational relax- 
ation of bound water or tissue proteins, charging of membranes of intracellular organelles, and other 
effects. These dispersions overlap in frequency and lead to a broad and often featureless dielectric 
dispersion in tissue. 



2 The term dielectric constant is used, often in the chemical literature, to indicate the relative permittivity of pure 
liquids at low frequencies, where e is essentially independent of frequency. 
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FIGURE 89.1 Data from liver tissue (the composite of several sets of data, from the Table 89.1). (•) Relative permittivity 
e, (A) conductivity a, (I) coee . When a » coee , the tissue may be regarded for many purposes as a pure conductor. 
The major dispersion regions (a,p\y) are indicated on the figure. The lines are regression lines through the data. 

These dispersions do not affect the permittivity and conductivity in the same way. For a single-time- 
constant dispersion centered at frequency f c , the change in permittivity Ae is related to the change in 
conductivity Aa : 



Ao= 27l/ f Aee 

Thus the alpha dispersion (at kilohertz frequencies) is associated with a small (usually imperceptible) 
increase in tissue conductivity but a very large decrease in permittivity. By contrast, the beta dispersion 
represents a large decrease in the permittivity (from several thousand to less than 100) and a large increase 
in conductivity (by a factor of 10 or so). 

89.2 In Vivo Versus In Vitro Properties 

The dielectric properties of the tissues that are summarized in the Table 89.1 pertain, for the most part, 
to excised tissues. The relation between these properties and the dielectric properties of tissue in vivo is 
a complicated matter. 

At low frequencies (below about 0.1 MHz), electric current largely passes through the extracellular 
space, and the tissue conductivity is a sensitive function of the extracellular volume fraction. Any changes 
in the fluid distribution between intracellular and extracellular compartments can lead to a pronounced 
change in the low-frequency conductivity of the tissue. 

For example, substantial (twofold) decrease in the conductivity of rat kidney [10] and sheep myocar- 
dium [11] have been reported within a few minutes after death of the animal or after experimentally 



© 2000 by CRC Press LLC 



S3 



o 



c 
o 






« S 2 



^3 

2 



u 



PQ - 



CD 



3 a 



-^5 o 



u 



5 8 

E- o 



-< PL, 



s 
-o 
a 
o 
U 



o d 



i— i i-h <n en 



■— I (N 



--t i-H CN 



O O O O O 



O O O O O 



<N en CN 



cn^oinoococoooin 

" o ^' ^ 



cp cp T Cj. <p Cf 



+l T HH 

in en m ctn 

o in r-> cn o o in 



i-h CN <N 



r^^cNcNOOoo in 



p -* oq oq <-; 

CN fN <N LT> i-H 







m 

00 


en 


so 






0> 


o 


X 






O 


o 


o 


o 


o 


o 


o 


o 


+1 


o 


*H 




\o in 


GO 


X 


CO 

m 


\0 

X 


r-j 








CC 


en 



OOOOOOOO^h^h^h 



+1 

r^ co en 

cn cn od 



x 
% 



X 

% 



X 






A 


X 


X 


o 


o 


o 



X 

o 



MMffi^iT^NCOmO 



cn m ^ m ^o rv 



» 2000 by CRC Press LLC 



r-> VO \0 LD 

III I 

cn oo eo co m 

r^ m \o ^o m 



rN — < ^o 



■— I (N >-h 



I 



T 




m 


(-N1 


' ' 


1 

o 


T 


t 


+ 1 


m 


^ 


ON 


^ 


ID 


■^ 


■— i 


GO 


1/1 


X 



-— I <N 

rN in 

m en 



I 



I o 

O O rN 
in in in 



3V ^H ^H ^H 



m o m r-^. 
m co (N i>* 



t^. rN m cn ** i>- 

■* -t in 't ro fo 



o o 
o o 
m oo 



2 i 



m in 



m 



CN CN 

I I 

O O 

^ o rt 

^ °1 S 



r O ^ -h 



u^ i_> uu _ , 

H (N H O] ^ O 






OO'ONOOTfNOOOO 



^o >o ^o in in ■* 



cn ^o o o. in 

m -^ -* fn m 



X 

2 









X 

o 



Hrj^i^^^NwaOrHtN^^m^Nco 



© 2000 by CRC Press LLC 



c 

3 



o 
Q 



b£ ^ 



oo m "qqq?w 



M m pq 
M <"<"<;" w a w & 



co --h ^ [^ 



'Tn H 



r- 




3 




S 




E 




^ 


6 


"rt 


s 


<U 


CJ 




cu 




■— 


QJ 


J-l 


2 

c/1 


3 

-.'■: 


CD 


3 


DC 


<U 


tu 


O 

T-i 


6 


^ 


B 



is G 



O 

Q 



c 
Q 



.3 K 

2 U 







u 

o 




6 


CD 

J3 






r\ 




IT) 






s-^ 




in 


O 










Q 


P- 






jg 




c/3 














X 






X! 




+1 


tU 










js 








3 




I 


o 








o 


> 










o. 


.3 


rt 






n 


d 




<U 








1 

r-j 

d 


(A' 


3 

2 


rO 




■■■- 


C/i 


t« 




j-T 




s 

3 
tu 

U 


<D 


— ' 


u! 


g 


o 


'0 


V-: 
<-> 

O 


C 


c 


<D 


M 




eti 




a 


cs 


m 




> 





ffi K K _, 

d d d u u u u n' « m n 



pq pa pq 
< < < 









O 



c 
Q 





G 






J3 

s 


4_i 


fi 


o 


TO 













1 — ' 


n 





.a I 

o ™ 

Q 



< <; <; 



pacQPQpacQPQpqcQPQ 

< «f <J < «f <J < «f <J U U U U U U U O w ft ft ft ft ft ft ft 



K K K K 

o cj d d w w 



ft 

i^ O CO 
CN CO CO 



i 2000 by CRC Press LLC 





u 


u 


N 




rt 








3 


rj 


O 




_^ 


rt 


o 




i~ 


in 


GO 






es 


';•' 


U 


i-T 






m 


C 


rj 


<N 


rt 








C 


;' 


o 




OJ 


MD 










u 


x" 





X 


td 


cd 


r*~i 


OJ 


B 


..'V 


■J 


_c 





T3 

_2 


DC 

C 


0> 
■— 

% 


3 


T3 


■j~. 



I 3 s 





— ' 


-- 


X 


a 

S-H 


n 


■■ i 


o 


.2 


<u 

e 


o 



u 



z > 



u 









k-,^^OUWW^-^ 



cq X X X X X 
_, _, «? _ d d d ci d 






te 









u 



B « 



'•> 






a 








o 














^ 


an 










o 


U5 


u 





J> 


•" ■& 


u 


u .a 


h- 




m 


„ c 




^ o 


i- 


3 --^ 


.> 




CD 

O 


s ^ 


Q 


rt 



pq m 

r--. o m oo •— i ■* n o m \o (^ M 

(NfO(NMfniNMfnf**lM(NiTi 



a o 



OO ^H ^H 



© 2000 by CRC Press LLC 



induced ischemia. These changes are almost certainly a result of changes in fluid distribution within the 
tissue associated with cell swelling. These changes are likely to be much less pronounced in the permittivity 
and in the conductivity above about 0.1 MHz. 

Most of the data in Table 89.1 were taken from excised tissues within minutes to hours after the death 
of the animal and may well represent systematic under-estimates of the conductivity of tissues in vivo. 
This calls for caution in their use. 



89.3 Temperature Coefficients 



The dielectric properties of tissues change with temperature. Below about 44 to 45°C these changes are 
generally reversible. They principally reflect the changes in conductivity of electrolyte with temperature, 
either directly (in the conductivity of tissue) or indirectly, as they affect the dielectric dispersion. 

At higher temperatures, thermal damage will result in irreversible changes in the dielectric properties 
of tissue. The extent of such changes depends on the tissue type, duration of heating, and other factors. 
For canine skeleton, such changes occur above 44.5°C. 

Figure 89.2 shows the temperature coefficient, defined in the figure, for the conductivity and permit- 
tivity of canine muscle and brain. These data pertain to reversible changes only. 

89.4 Dielectric Data: Tabulated 

Table 89.1 presents selected permittivity and conductivity data from various tissues. Where available, the 
table presents up to three values for the tissues, including measurements performed on excised tissues 
of various species and in vivo. The aim is to present primarily new data; in some cases I have included 
earlier data. Sources are References 15 to 31. The large variability in reported properties of these tissues 
is illustrated in the table and is due in part to biological variability and in part to differences in condition 
of the tissues. All data pertain to tissues at body temperature (37 to 38°C). 

Another extensive tabulation of dielectric data has recently been prepared by Gabriel et al. [32]. These 
data are available (as of time of publication) on the Internet at http://www.brooks.af.mil/AFRL/HED/ 
hedr/reports/dielectric/home.html and several other sites. 
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FIGURE 89.2 Fractional change with reciprocal temperature of dielectric properties of ( a) dog muscles and ( b) hrain 
near 37°C (from Foster and Schwan, ref. 1). In most cases, the temperature coefficients have been calculated from 
two measurements at 25 to 28 and 37°C. 



© 2000 by CRC Press LLC 



References 

1. Foster KR, Schwan HP. 1994. Dielectric properties of tissues. In C Polk, E Postow (eds), Handbook 
of Biological Effects of Electromagnetic Fields, 2d ed. Boca Raton, Fla, CRC Press. 

2. Schwan HP. 1957. Electrical properties of tissue and cell suspensions. In Advances in Biological 
and Medical Physics, vol 5, p 47. New York, Academic Press. 

3. Pethig R. 1979. Dielectric and Electronic Properties of Biological Materials. New York, Wiley. 

4. Grant EH, Sheppard RJ, South GP. 1978. Dielectric Behavior of Biological Molecules in Solution. 
Oxford, Oxford University Press. 

5. Schanne OF, P.-Ceretti ER. 1978. Impedance Measurements in Biological Cells. New York, Wiley. 

6. Duck FA. 1990. Physical Properties of Tissue. New York, Academic Press. 

7. Geddes LA, Baker LE. 1967. The specific resistance of biological material — A compendium of data 
for the biomedical engineer and physiologist. Med Biol Eng 5:271. 

8. Stuchly MA, Stuchly SS. 1980. Dielectric properties of biological substances — Tabulated. J Micro- 
wave Power 15:19. 

9. Schwan HP. 1963. Determination of biological impedances. In G Oster et al (eds), Physical Tech- 
niques in Biological Research, vol 6, p 323. New York, Academic Press. 

10. Lofgren B. 1951. The electrical impedance of a complex tissue and its relation to changes in volume 
and fluid distribution. Acta Physiol Scand 23(suppl 81):1. 

11. Fallert MA, Mirotznik MS, Bogen DK, et al. 1993. Myocardial electrical impedance mapping of 
ischemic sheep hearts and healing aneurysms. Circulation 87:188. 

12. McRae DA, Esrick MA. 1992. The dielectric parameters of excised EMT-6 tumors and their change 
during hyperthermia. Phys Med Biol 37:2045. 

13. McRae DA, Esrick MA. 1993. Changes in electrical impedance of skeletal muscle measured during 
hyperthermia. Int J Hyperthermia 9:247. 

14. Esrick MA, McRae DA. 1994. The effect of hyperthermia-induced tissue conductivity changes on 
electrical-impedance temperature mapping so physics in medicine and biology. Phys Med Biol 
39:133. 

15. Fricke H, Curtis HJ. 1935. The electric impedance of hemolyzed suspensions of mammalian 
erythrocytes. J Gen Physiol 18:821. 

16. Herrick JF, lelatis DG, Lee GM. 1950. Dielectric properties of tissues important in microwave 
diathermy. Fed Proc Fed Am Soc Exp Biol 9:60 (abstract only; data summarized in ref. 2). 

17. England TS, Sharpies NA. 1949. Dielectric properties of the human body in the microwave region 
of the spectrum. Nature 163:487. 

18. Schwan HP. 1941. Uber die Niederfrequenzleitfahigkeit von Bluten und Blutserum bei ver- 
schiedenen Temperaturen. Z Ges Exp Med 109:531. 

19. Epstein BR, Foster KR. 1983. Anisotropy in the dielectric properties of skeletal muscle. Med Biol 
Eng Comput 21:51. 

20. Stoy RD, Foster KR, Schwan HP. 1982. Dielectric properties of mammalian tissues from 0.1 to 
100 MHz: A summary of recent data. Phys Med Biol 27:501. 

21. Schepps JL, Foster KR. 1980. The UHF and microwave dielectric properties of normal and tumor 
tissues: Variation in dielectric properties with tissue water content. Phys Med Biol 25:1149. 

22. Schepps JL. 1980. The measurement and analysis of the dielectric properties of normal and tumor 
tissues at UHF and microwave frequencies, Ph.D. dissertation, University of Pennsylvania, Phila- 
delphia. 

23. Schwan HP, Kay CF. 1957. The conductivity of living tissues. Ann NY Acad Sci 65:1007. 

24. Stuchly MA, Athey TW, Stuchly SS, et al. 1981. Dielectric properties of animal tissues in vivo at 
frequencies 10 MHz-1 GHz. Bioelectromagnetics 2:93. 

25. Brady MM, Symonds SA, Stuchly SS. 1981. Dielectric behavior of selected animal tissues in vitro 
at frequencies from 2 to 4 GHz. IEEE Trans Biomed Eng BME-28:305. 



3 2000 by CRC Press LLC 



26. Schwan HP, Li K. 1953. Capacity and conductivity of body tissues at ultrahigh frequencies. Proc 
IRE 41:1735. 

27. Kraszewski A, Stuchly MA, Stuchly SS, Smith AM. 1982. In vivo and in vitro dielectric properties 
of animal tissues at radiofrequencies. Bioelectromagnetics 3:421. 

28. Kosterich JD, Foster KR, Pollack SR. 1983. Dielectric permittivity and electrical conductivity of 
fluid saturated bone. IEEE Trans Biomed Eng BME-30:81. 

29. Nightingale NRV, Goodridge VD, Sheppard RJ, Christie JL. 1983. The dielectric properties of 
cerebellum, cerebrum, and brain stem of mouse brain at radiowave and microwave frequencies. 
Phys Med Biol 28:897. 

30. Pfutzner H. 1984. Dielectric analysis of blood by means of a raster- electrode technique. Med Biol 
Eng Computing 22:142. 

31. Foster KR, Schepps JL, Stoy RD, Schwan HP. 1979. Dielectric properties of brain tissue between 
0.01 and 10 GHz. Phys Med Biol 24:1177. 

32. Gabriel S, Lau RW, Gabriel C. 1996. The dielectric properties of biological tissues. 2. Measurements 
in the frequency range 10 Hz to 20 GHz. Phys Med Biol 41:2251. 



© 2000 by CRC Press LLC 



Stuchly, M. A. "Low-Frequency Magnetic Fields:Dosimetry, Cellular, and Animal Effects." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



90 



Maria A. Stuchly 

University of Victoria 

90.1 Introduction 



901. 
90.2 



90.3 



90.4 



Low-Frequency 

Magnetic Fields: 

Dosimetry, Cellular, 

and Animal Effects 



Introduction 

Physical Interactions — Dosimetry 

Basic Principles • People and Animals • Cells and Cell 

Assemblies 

Biological Effects 

Human Data • Laboratory Animals • Cellular Systems 

Concluding Remarks 



Low frequency magnetic fields are of interest to a biomedical engineer for at least two reasons. They have 
found applications in a few diagnostic procedures and their therapeutic applications have been growing. 
The diagnostic applications include the use of strong pulses of magnetic fields for neural stimulation. 
Another well-established diagnostic use is in magnetic resource imaging. Therapeutic applications are 
reviewed in Chapter 91. Information on other applications can be found in a review by Stuchly [1990]. 
The other reason for the importance of low frequency magnetic fields is their potential impact on human 
health. This is the area of considerable research activities, public concern, and a subject of this section. 
Low frequency magnetic fields, for the purpose of this review, mostly refer to extremely low frequency 
(ELF) fields, but the techniques and interactions extend to higher frequencies up to a few kilohertz. 

The impetus for the inquiry into interactions of ELF magnetic fields with living systems has been 
provided by epidemiological reports (for a summary of these reports see NAS [1996] andNIEHS [1998]). 
The epidemiological studies have not provided convincing evidence that ELF, or more specifically the 
power line frequency magnetic fields are associated with increases in cancer rates. They have shown some 
supportive evidence for possible increases in childhood leukemia and some adult leukemia in those 
exposed occupationally. In an overview that follows, attention is focused on physical interactions of ELF 
magnetic fields with living systems and reported effects in cellular and animal studies. One general 
conclusion that can be drawn is that low frequency magnetic fields at moderate to low levels are 
biologically active, i.e., interact with various cells or systems of the animal body. What, however, is not 
known are the biophysical mechanisms responsible for the interactions observed, and parameters of the 
field and biosystem that are essential in eliciting the interactions. It has also not been resolved what 
strengths of the field are required to elicit an effect. Several reviews and reports provide detailed infor- 
mation on the subject [NAS, 1996; NIEHS, 1998; Moulder, 1998]. 
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90.2 Physical Interactions — Dosimetry 
Basic Principles 

ELF magnetic fields interact with conductive bodies such as tissues by inducing electric fields and currents 
inside them. While in general, all four Maxwell's equations have to be satisfied, in the case of ELF fields, 
certain simplifications apply. As discussed in the preceding chapter, tissues are conductive and nonmag- 
netic (their bulk magnetic permeability is equal to that of free space). Consideration of the dimensions 
of biologic objects (e.g., people) in comparison with wavelengths at ELF, the tissue conductivity as 
compared with the dielectric constant, lead to the following conclusions. Provided that: 

0/coe»l (90.1) 

and 

ftl oL 2 « 1 (90.2) 

the quasi-static approximation can be applied [Polk, 1986, 1990] . This means that the secondary magnetic 
field induced by the eddy currents produced by the original magnetic field can be neglected. As a result, 
to determine the induced current and electric field, it suffices to consider Faraday's law and Laplace's 
equation rather than coupled Maxwell's equations. Faraday's law, 



§E-M = -\ 



— ■ds (90.3) 

dt 



gives the electric field vector E over a closed contour c due to the time change of the magnetic flux density 
vector B integrated over the surfaces encompassed by c. Note the scalar products, i.e., only the component 
of B normal to the surface defined by the contour, are contributing to the induction of the electric field. 
For harmonic (sinusoidal) fields and a circular contour in a homogeneous infinite conductor, the mag- 
nitude of the electric field is expressed by a simple relationship 

E = %fBr (90.4) 

where /is the frequency and r is the radius. 

In media that are finite in size and consist of parts having different dielectric properties, the boundary 
conditions at interfaces have to be satisfied, and Laplace's equation needs to be solved in the region of 
interest (subject to the boundary conditions): 

V 2 V = (90.5) 

and 

E = -VV (90.6) 

The induced current density is related to the electric field as: 

J=&E (90.7) 
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With the exception of a few simple cases where Eq. (90.4) can be used to determine the induced electric 
field, a more complex analysis, often numerical, has to be performed. An example of a valid use of Eq. (90.4) 
is a conductive solution in a cylindrical dish in a uniform magnetic field parallel to the cylinder axis. 

There is no clear indication that the induced electric field and current in tissues, cells, or their parts are 
responsible for all observed biological effects. At least some of the effects clearly depend on these parameters 
[NAS, 1996, NIEHS, 1998] . Other mechanisms are postulated but not satisfactorily proven to be dependent 
on the ELF magnetic field itself or its combination with a static magnetic field. Despite this limitation, it 
appears reasonable to quantify the exposure conditions for in vitro and animal studies in terms of dosi- 
metric measures that are well based in physics and refer to the biological system, namely the induced 
electric field and current. Evaluations of internal fields and currents are useful in comparing experimental 
exposures for various cell preparations and species. Their parameters can also be used in scaling the 
exposures from one animal to another or to human beings. Furthermore, there are well-known physiologic 
effects that are related to the induced electric fields and currents in humans, which need to be considered 
for occupational exposures to strong fields. The induced currents are also used as a benchmark for 
establishing exposure limits in health protection guidelines [Bailey et al., 1997, ICNIRP, 1998]. 

People and Animals 

Numerous evaluations of the induced electric fields and currents have been performed for simplified or 
semi-realistic models of the human body exposed to either a uniform electric or magnetic field, e.g., Shiau 
and Valentino [1981], Kaune and McCreary [1985], Spiegel [1981], Chiba et al. [1984], Chen et al. [1986], 
Dimbylow [1987], Xi et al. [1994], Stuchly and Zhao [1996]. Actual measurements have also been per- 
formed of the short circuit current in people or the models exposed to a uniform electric field [Deno and 
Zaffanella, 1982; Kaune and Forsythe, 1985; Kaune et al., 1987]. These measurements have been used to 
verify the correctness of computations for realistic models of the human body [Gandhi and Chen, 1992; 
Dawson et al., 1997]. Today, extensive data are available for realistic models of the human body in a 
uniform electric or magnetic field [Gandhi and Chen, 1992; Hart and Gandhi, 1998; Dawson et al., 1997, 
1998a-d; Dimbylow, 1998; Furse and Gandhi, 1998]. Typically, average and maximum values of induced 
average and voxel maximum induced quantities are given for various organs and tissues for voxel sites of 
3 to 7 mm. Tables 90.1 and 90.2 provide some representative data for exposure to the electric and magnetic 
field, respectively. In both cases, the human model resolution is 3.6 mm [Dawson et al., 1997]. 

Similarly comprehensive data are not available for animals, but for reference and possible scaling, 
Table 90.3 gives a comparison of induced quantities in a heterogeneous human model with 1.3 cm cubic 
voxels and in homogeneous rodents. In all the cases, the magnetic field orientation is selected to give the 
maximum values of induced current densities. That means the magnetic field is directed front-to-back 
(and vice versa), translating into a horizontal magnetic field for a person, and a vertical field for rodents. 

TABLE 90.1 Induced Electric Field and Current Density 
in the Human Body in Contact with a Perfect Ground 
in a Vertical Uniform Electric Field of 1 kV/m at 60 Hz 







Electric Field 


Current Density 








(mV/m) 


(mA/m 2 ) 


Organ 


Averag 


;e Maximum 


Average 


Maximum 


Whole 


body 


1.8 


63 


0.3 


13.5 


Blood 




1.2 


19 


0.9 


13.5 


Brains 




0.8 


3.4 


0.07 


0.34 


Heart 




1.3 


3.2 


0.13 


0.32 


Spleen 




1.6 


2.8 


0.16 


0.28 


Testes 




0.42 


1.5 


0.17 


0.58 



Source: Based on Dawson et al. [1998c], 
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TABLE 90.2 Induced Electric Field and Current Density in the 
Human Body in a Uniform Magnetic Field of 1 uT at 60 Hz 
Oriented Front-to-Back 







Electric Field 


Current 


Density 






(H.V/ 


m) 


(uA/m 2 ) 


Organ 


Average 


Maximum 


Average 


Maximum 


Whole 


body 


18.9 


369 


3.4 


76 


Blood 




6.9 


82 


4.8 


57 


Brains 




10.6 


73 


0.9 


7.3 


Heart 




13.5 


48 


1.4 


4.8 


Spleen 




41 


91 


4.1 


9.1 


Testes 




15 


74 


5.9 


29 



Source: Based on Dawson et al. [1998a] 



TABLE 90.3 Induced Currents and Fields from 60 Hz Uniform 
Magnetic Field of 1 uT 

Current Density Electric Field 

(uA/m 2 ) (uV/m) 

Subject Average Maximum Average Maximum 



Human, 1.7 m, 70 kg 


1.3-1.9" 


8 (20) b 


14-17.7 


161 (296) b 


Rat, 0.3 kg 


0.3 


1.3 


4.4 


17.7 


Mouse, 0.02 kg 


0.12 


0.4 


1.7 


5.7 



" The average current density depends on the electrical properties used 
for the muscle tissue. 

B Values in brackets obtained from the analysis with an improved reso- 
lution of 0.65 cm instead of 1.3 cm. 

Source: Based on Xi et al. [1994] and Xi and Stuchly [1994]. 

The interspecies scaling values for Table 90.3 are different from those frequently used and based on the 
estimated maximal current paths for various species. For instance, comparing the maximum currents 
one gets a ratio of 1:9 for humans vs. rat from the modeling results and 1:6 from the weight/volume 
ratios (maximal current path). 

It is worthwhile to note that the induced currents and fields in people exposed to power-line (50 or 
60 Hz) magnetic fields in the environment are very weak. Typical levels of magnetic fields at homes are 
of the order of 0.1 uT, and in the proximity of appliances 1 to 10 uT. Some occupational exposures (e.g., 
work at power substations) are higher, up to even a few mT. 

Cells and Cell Assemblies 

Evaluation of induced current and electric fields is also very important in quantification and interpreta- 
tion of results of in vitro laboratory studies. This is especially the case when the question is asked: is the 
biological effect observed due to the magnetic field or the electric currents and field induced in the test 
sample by the magnetic field. Also, when results of a study in one laboratory are not corroborated by 
other data, evaluation of induced fields may prove useful in gaining an insight into hidden differences 
in apparently identical experiments. For low density of biological cells in a conductive medium, the 
induced current density can be computed based solely on the geometry of the medium contained in the 
exposure dish and the magnetic field characteristics [Misakian et al., 1993]. Methods of calculation for 
several dish shapes including an annular ring have been published [McLeod et al., 1983; Misakian and 
Kaune, 1990; Misakian, 1991; Misakian et al., 1993; Wang et al., 1993]. Some dish configurations and 
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magnetic field orientations facilitate obtaining the same current density in most of the medium volume 
occupied by cells. However, even at low densities, the presence of biological cells, because of their low 
conductivity membranes, affects the spatial pattern of the induced currents and fields [Polk, 1990]. The 
effect of cell density becomes much more pronounced at higher densities and when cells form a confluent 
monolayer [Stuchly and Xi, 1994]. 

Many biological cells are connected by gap junctions. These are small regions in the cell membrane 
that have different conductivity from that of the membrane. Furthermore, the conductivity can change 
depending on the biologic processes. Assemblies of cells connected by gap junctions when placed in an 
electric field are expected to have higher membrane potentials induced because of their bigger corporate 
size. Detailed evaluation of various chains and other assemblies of gap-connected cells have recently been 
performed [Fear and Stuchly, 1998a-c]. Influences of the cell size and shape, and size, shape, and electrical 
properties of the gap junctions have been evaluated. It has also been shown that while the membrane 
potential increases with the size of the cell chain or assembly, it shows the relaxation behavior, and the 
relaxation frequency decreases with the size [Fear and Stuchly, 1998b]. Induced potentials of the order 
of 1 mV have been shown unlikely to be achieved. However, the analyses have been done considering 
only linear properties of gap junctions. On the other hand, the simplified circuit-based model [Fear and 
Stuchly, 1998c] of cell assembly with gap junctions is suitable for non-linear analysis. 

90.3 Biological Effects 

Human Data 

Apart from the epidemiological reports and clinical reports on therapeutic applications, there are inter- 
esting laboratory observations of effects on people of ELF magnetic fields. Even intense magnetic fields 
are not detected by people in the absence of artifactual cues. Exposure to 20 uT and 9 kV/m 60 Hz fields 
of healthy male volunteers resulted in statistically significant changes in the heart rate (slowing), com- 
ponents of event-related brain potentials, and errors on a choice reaction-time task. The effects were 
greatest soon after the field was switched either on or off [Cooket al., 1991]. The fields were not perceived 
by the subjects. The performance measures such as time estimation, vigilance, digit span, reaction time, 
and event-related brain potentials tasks were unaffected by the field exposure. Neither were subjective 
measures such as mood or sleepiness affected. 

Evidence on suppression of melatonin is mixed, with some studies reporting significant decreases of 
the nighttime melatonin, while others, even in the same laboratory, are not able to observe it [NIEHS, 
1998]. A well-established phenomenon of strong magnetic fields, above 10 mT at frequencies of tens of 
Hz, is induction of magnetophosphenes, i.e., visual sensations [Tenforde, 1986]. Results of studies of 
various other effects observed but not well established are summarized in a report by NIEHS [1998]. 

Laboratory Animals 

Numerous laboratory investigations of animals, mostly rodents, exposed to magnetic fields have been 
conducted. Detailed reviews can be found elsewhere (e.g., WHO, 1987, NAS, 1996, NIEHS, 1998]), only 
a few representative studies are outlined here. Effects of magnetic fields on cancer development have 
been of primary concern because of the suggestive indications from epidemiological studies. Typical 
animal screening studies that are used in testing carcinogenic potential of various chemicals have been 
recognized as of relatively limited value in the case of magnetic fields. The main reason is the difficulty 
in applying a high "toxic" dose, since what constitutes the dose has not been established. Large scale, 
long-term studies of rats and mice that followed the National Toxicology Program protocols have not 
shown any statistically significant carcinogenic effects [NIEHS, 1998]. 

Action of time-varying magnetic fields as promoters has been investigated by a few groups. A classical 
and relatively easy to manipulate model of the mouse skin tumor has been used by several investigators. 
Exposure magnetic flux densities up to 2 mT have been used. Although some initial studies indicated 
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acceleration of tumor development, all the studies taken together do not support the promotional or co- 
promotional effect in this model [NAS, 1996, NIEHS, 1998]. Effects on mammary tumor development 
have been reported in several experiments conducted in one laboratory, however, very similar studies in 
another laboratory have not been able to find any statistically significant effects of magnetic fields on the 
mammary tumor development in the same species [NIEHS]. 

Another area of potential health concern is reproductive and teratological effects. Two comprehensive 
reviews address this question [Chernoff et al., 1992; Brent et al., 1993]. Most studies of mammalian 
species have not reported any significant differences in reproduction and development of exposed animals 
as compared to sham exposed. The conclusions drawn by two large working groups indicate that repro- 
ductive effects due to exposure to low or moderate strength magnetic fields are unlikely [NAS, 1996; 
NIEHS, 1998]. 

There is rather weak evidence indicating that ELF magnetic fields affect nocturnal melatonin produc- 
tion in some animals [NAS, 1996; NIEHS, 1998]. 

Overall, studies performed thus far have not unambiguously identified any gross detrimental effects 
at exposures even to relatively high magnetic flux densities of the order of 0.1 to 1 ml 

Cellular Systems 

Interactions ofELF magnetic fields have been reviewed by Luben [1991], NAS [1996], and NIEHS [1998]. 
A general picture that has been emerging indicates that the fields do not cause genetic damage directly 
even at relatively high levels [McCann et al., 1993]. The mostly likely site of interaction is the cell 
membrane, through the interactions with signal transduction pathways [Luben, 1991]. The possible 
effects on signal transduction include stimulation or inhibition of receptors or ion channels, alteration 
of ligand binding, alteration of gene expression, induction of oncogenes, effects in cross-regulation of 
pathways, shifts of dose-response curves. Effects of electromagnetic fields resulting from alterations in 
signal transduction can be synergistic (or antagonistic) with other agents. The calcium ion plays an 
important role in biosignal transduction. 

The ornithine decarboxylase (ODC) is an enzyme involved in DNA replications, cell growth, and cell 
proliferation. Its activity is mediated by cell surface receptor events. ODC activity was affected by 
magnetic fields producing induced electric fields of 0.01 to 0.1 V/m in three types of normal or trans- 
formed cells [Byus et al., 1987]. These findings were corroborated by other laboratories [NAS, 1996; 
NIEHS, 1998]. 

90.4 Concluding Remarks 

Low frequency magnetic fields induce the electric field and current in biological systems. Patterns of these 
fields and currents are highly non-uniform in cells, tissues, and whole bodies. Very little is known about 
the actual induced fields on the cellular and subcellular level, and how they may interact with various 
components of biological systems. At the same time, laboratory experiments have shown that moderately 
intense fields can interact with various biological systems. These interactions depend on numerous con- 
ditions of the exposure and biological system that are not fully identified. Understanding of underlying 
biophysical mechanisms of the interactions and quantification of parameters and conditions responsible 
are important in resolving both the scientific uncertainty and potential public health issue. 
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91.1 Bone and Cartilage Repair with PEMF and Other 
Signals 
Charles Polk 91.2 Soft-Tissue Repair and Nerve Regeneration 

University of Rhode Island 91.3 Mechanisms of Field-Tissue Interaction 



It has been known for more than 30 years that electric potential differences appear across both living 
and dead bone subjected to mechanical stress [Fukada and Yasuda, 1957; Bassett and Becker, 1963]. 
C. A. L. Bassett and R. O. Becker observed that these stress-generated electrical signals decayed very slowly 
in comparison with similarly initiated signals in piezoelectric crystals and concluded that piezoelectric 
phenomena "while probably present, were not the sole cause of these potentials." Later analysis and 
experiments established that the observed signals were primarily due to ion displacement within the 
porous regions and multiple fluid-filled channels present in all bone [Gross and Williams, 1982]. 

Having shown that application of a dc electric field to nonexcitable connective tissue cells can produce 
effects similar to those elicited by mechanical stress, C. A. L. Bassett and his coworkers realized that 
clinical exploitation of these phenomena would require surgical implantation of electrodes with attendant 
danger of infection. They proceeded therefore to explore whether noninvasive inductive coupling that 
gave waveform similar to those produced endogenously by mechanical stress could lead to beneficial 
bone development, and they obtained favorable results with pulsed electromagnetic fields on dogs [Bassett 
et al., 1974]. Signals of this type have generally been identified as PEMF in the orthopedics/electrical 
stimulation community for the last 20 years and have been applied successfully in a large number of 
cases for the repair of nonunions [Gossling et al., 1992]. More recently it was found that simultaneous 
application of dc and sinusoidally time-varying extremely low frequency (ELF) magnetic fields with 
intensities below 100 uT also could be used for this purpose [Liboff et al., 1990; Ryaby et al., 1993; Zoltan 
andRyaby, 1992]. 

Although the noninvasive time-varying magnetic field treatment for nonunions (fractures that fail to 
heal) became — at least in the United States — the most widely used clinical application of subradiofrequency 
fields, several investigators pursued the application of dc electric fields through implanted electrodes and 
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the application of higher-frequency currents through electrode contacts placed on the skin surface to 
enhance bone repair [Brighton et al., 1979]. At the same time, laboratory investigations, in vitro and on 
animals, explored the application of all three modalities — PEMF, implanted dc electrodes, and higher- 
frequency coupling through skin electrodes — to produce blood vessel regeneration (angiogenesis), soft- 
tissue healing, nerve repair or regeneration, and regression of tumors. The use of time-varying magnetic 
fields for the treatment of arthritis was also explored [Trock et al., 1993]. 

91.1 Bone and Cartilage Repair with PEMF and Other Signals 

In the United States, medical devices are approved for clinical use only after they have been shown to 
the satisfaction of the U.S. Food and Drug Administration (FDA) to be not only safe but also effective. 
The devices listed in Table 91.1 are currently (March 1994) approved for one of three applications: the 
treatment of nonunions (fractures that have failed to heal after standard treatment involving setting and 
stabilization with casts) and congenital pseudoarthroses [Bassett, 1984] and the promotion of spinal 
fusion. Although many animal experiments (and possibly a few human trials, especially in Europe) have 
evaluated the application of electric or magnetic fields for acceleration of fresh fracture healing, for 
reversal of osteoporosis, and for the treatment of arthritis, no devices are currently approved in the United 
States for these purposes. 



TABLE 91.1 Electrical Bone Growth Stimulators Approved by the U.S. FDA as of March 1994 



Manufacturer Device 

Electro-Biology, Inc. EBI Bone Healing System* 



American Medical 
Electronics, Inc. 



American Medical 
Electronics, Inc. 



Physio -Stim 



Spinal-Stim 



Orthologic Corp. 



Bioelectron, Inc. 



Orthologic 1000 



Orthopak BGS System 



Electro-Biology, Inc. Orthogen/Osteogen 



Electro-Biology, Inc. 
Bioelectron, Inc. 



Sp F-4 (2) Implantable BGS f 
Zimmer direct current bone 

growth stimulator 

(DCGBS) 



Approved for 

Nonunion, 

congenital 

pseudoarthroses, 

failed fusions 
Nonunions 

(excluding 

vertebrate and flat 

bones) 
To promote spinal 

fusion as an adjunct 

to surgery or as 

nonoperative 

treatment when 9 

mos. Have elapsed 

since the last surgery 
Nonunion (excluding 

vertebrate and flat 

bones) 
Nonunions 

(excluding 

vertebrate and flat 

bones) 
Nonunion of long 

bones 
Spinal fusion adjunct 
Nonunion fractures 



Technology 



Date 



Noninvasive dc + 

sinusoidal 

magnetic fields 
Noninvasive/capac. Feb. 1986 

Coupled 



Implantable dc 

Implantable dc 
Semi-invasive dc 

with percutaneous 

pins 



Ref. 



Noninvasive pulsed Nov. 1979 A J 

e.m. field (PEMF) 



Noninvasive PEMF Feb. 1986 C 



Noninvasive PEMF Feb. 1990 D 



March 1994 I 



Jan. 1980 F 

April 1987 G 
Nov. 1979 H 



* Also known as Bi-Osteogen Systems 204. 
f Also known as BGS-Osteostim HS 11. 
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FIGURE 91.1 A typical PEMF signal (signal A of Table 91.1). Signal consists of 15 pulse bursts per second. Each 
burst is 4.5 ms long and contains 20 magnetic field pulses. In each pulse the magnetic field increases from to 
approx. 2 mT during 200 us, decreases to again during 23 us, and is equal to for 2 us before the next 225 us 
sequence begins, (a) Magnetic field versus time. (6) Electric field [x/(8B/dt)] versus time. 



Classified by electrical and mechanical characteristics, the devices in Table 91.1 are either 

Noninvasive: 

1. Generating time-varying magnetic fields applied by coils to the affected body part (I and PEMF 
devices A, B, C, and D). 

2. Generating time-varying electric fields applied through skin-surface electrodes (capacitively cou- 
pled)(E). 

Invasive or semi-invasive: dc applied from an implanted battery (F, G) or (semi-invasive) dc applied 
with percutaneous pins (H). 

A signal typical for some PEMF (A, C, D) devices is illustrating in Fig. 91.1. Figure 91.1a shows the 
magnetic field versus time, and Fig. 91. lb the corresponding electric field induced into a linear, isotropic 
medium. The waveform shown in Fig. 91.1b can be measured by a probe coil having a sufficiently large 
number of turns. The frequency spectrum of the electric field is shown in Fig. 91.2a and b. Signals used 
by the different manufacturers are protected by patents, and FDA approved is for particular signal 
parameters within specified tolerances on time and amplitude. The pseudoarthrosis signal used by 
Electrobiology, Inc. (EBI) (B in Table 91.1) consists of single pulses repeated at a rate of 72 pulses per 
second rather than the pulse bursts illustrated in Figs. 91.1 and 91.2. Each magnetic field pulse increases 
from to 3.5 mT in 380 lis and then decreases slowly to in approximately 4.5 ms. The signals that are 
now in use have evolved considerably from those employed in the initial studies, and some have little 
resemblance to the endogenous electrical signals elicited by mechanical stress. 

The PEMF signals employed by the various manufacturers in the United States and Europe can have 
several different pulse shapes, rise and decay times, pulse widths, pulse repetition rates, and amplitudes. 
Since it has been shown that all these variables can have a profound effect on the biologic action of a 
particular signal, it is essential that reports on effectiveness or lack of effectiveness of PEMF give an exact 
description of the signal that was used. Unfortunately, the medical literature is replete with examples 
where this information is either incomplete or completely absent. It is particularly important that a PEMF 
signal not simply be described by its "frequency," when what is meant is the pulse repetition frequency. 
Referring to the legend of Fig. 91.1 and to Fig. 91.2, it is clear that this particular signal should not be 



) 2000 by CRC Press LLC 




500 Hz 



mV 


ft 








1 








8 










































0.6 






































































0.4 " 
























0.2 






















i 


( 


s 




. 


1 
1 


I 


\ 


A 




| 




, 


Vw^ 


A/wyWy 


M 


A-w^V" ^t^r^f* ^~^s / 


j Uu 


uu 



B 



10 



20 



30 kHz 



FIGURE 91.2 Electric field spectrum \E(w)\ of signal in Fig. 91.1 as measured by the output from an air-core coil 
(self-inductance 0.147 mH, resistance 2.13 fi, mean radius 1.5 cm), (a) 10 Hz to 1 kHz (frequency resolution: 3 Hz). 
100 uV corresponds to 0.6 mT/sHz in a 4.5-ms pulse burst, (b) 1 to 10 kHz (frequency resolution: 10 Hz). 1 mV 
corresponds to 6 mT/sHz in a 4.5-ms pulse burst. 

identified as a "15-Hz" signal, since it has significant frequency content well into the kilohertz region. 
Only the spacing between the many different spectral lines is 15 Hz, as is apparent from part a of Fig. 91.2. 
As a further illustration of the importance of differentiating between simply frequency and pulse 
repetition frequency, consider the hypothetical signal of Fig. 91.3, which is, however, similar to that of 
some therapeutic signals. This signal would be correctly described as having a pulse repetition frequency 
of 20 Hz. However, it should not simply be called a "20-Hz" signal or an "ELF" signal because very little 
of its energy is at 20 Hz or even below 1000 Hz. Fourier integral analysis of such a pulse burst [Papoulis, 
1962] gives 



<(<o) = 2tt£ 



BZ 



sin^co-cojTj sin^co + coJTj 

(o-eoji; n (ffl+cojr, 



5(co -mco ) 



(91.1) 



where the delta function 5(co -nco ) indicates discrete spectral lines at multiples of the pulse repetition 
frequency (cd /27i), which is 20 Hz in the selected example. The amplitudes of the spectral lines are given 
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FIGURE 91.3 Biphasic pulse burst repeated at frequency (co /27i) = 20 Hz. Time f in seconds. 



n|B(C0)l 




CO, 



CO 



FIGURE 91.4 Spectral envelope of the signal of Fig. 91.3. If a single burst is replaced by bursts repeated at frequency 
f , the continuous spectrum becomes the envelope of spectral lines spaced at intervals f . 



2T 2 B 




FIGURE 91.5 DC pulse {a) and Fourier spectrum (h). 



by the expression in brackets. The spectral peak for positive frequencies occurs, as illustrated in Fig. 91.4, 
where [sin(co - (a l )T 1 ]l(& - coJTJ is equal to 1, which occurs when co = (i) l or at the frequency of the 
sine wave within each individual burst. In the present example this is (10 6 /200) = 5 kHz. The duration 
of 2T 2 of the pulse burst determines the widths of the spectral lobes in Fig. 91.4, as indicated by Eq. (91.1). 
The spectrum of a unidirectional (dc) pulse (Fig. 91.5a) is shown in Fig. 91.5b. Only if such a pulse is 
combined with the biphasic signal of Fig. 91.3, so as to give the signal of Fig. 91.6, will the Fourier 
spectrum have significant energy content at low frequencies. The example serves to illustrate the difference 
in frequency content — rather than only dc component — between unidirectional and biphasic pulsed 
signals. 

Details of shape, orientation, and location of the application coil or coils are also important, since 
these parameters, together with pulse amplitude and shape, determine the nature of the magnetic and 
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FIGURE 91.6 Unidirectional pulse and spectral envelope. If burst is repeated at frequency^, the spectrum consists 
of discrete spectral lines at f intervals. The illustrated continuous spectrum then becomes the envelope indicating 
relative amplitudes of the spectral lines. 

electric field at the location of the injured tissue. If the amplitude of the axially directed magnetic flux 
density B is constant over some region of radius _R within the cross section of a circular cylinder, the 
induced electric field is 



dB 
dt 



(91.2) 



provided the material of the cylinder is electrically homogeneous and isotropic. In Eq. (91.2), r < R is 
the distance from the center of the cylinder, and q> a unit vector in the circumferential direction. For 
magnetic fields varying sinusoidally as B cos cof, E = co5 (r/2) sin cof (|). Since most biologic objects are 
neither homogeneous nor isotropic, the actual induced electric fields at various points in the tissue or 
cells may deviate substantially from the values given by Eq. (91.2) [Polk and Song, 1990; Van Amelsfort, 
1990]. Equation (91.2) is useful only for estimating the spatial average value of the induced electric field, 
which depends in the bone environment, on the point-to-point variation of the electrical properties of 
muscle, fat, cartilage, periosteum (outer bone membrane), and bone marrow. 

Current pulses of the PEMF devices are usually produced by the discharge of capacitor banks controlled 
by a timing network. The applicator coil cannot be interchanged among different devices because its 
inductance and resistance are a part of the discharge network. While earlier bone growth stimulators 
employed Helmholtz coil-pair arrangements, most present devices have single coils which can be custom- 
shaped for particular limbs. Most commercial units are driven by rechargeable batteries (Fig. 91.7), and 




A/VVU_ 



FIGURE 91.7 PEMF applicator and control unit and implantable dc stimulating device (battery with wire electrode) 
manufactured by Electrobiology, Inc. Systems A and G of Table 91.1. (Photograph courtesy of Electrobiology, Inc.) 
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FIGURE 91.8 Applicator of the Orthologic 1000 dc/ac equipment system I of Table 91.1. (Photograph courtesy of 
Orthologic, Inc.) 

control boxes usually include an elapsed-time clock to measure the total time of stimulation of the fracture 
being treated. A typical treatment time with PEMF devices can be between 2 and 10 hours per day over 
a period of 6 months and 30 minutes per day with the newer low-field-intensity system (Fig. 91.8, I in 
Table 91.1). 

The so-called capacitively coupled device (E on Table 91.1) generates a continuous sine wave at a 
frequency of 60 kHz. The total current through the skin contains a not negligible conduction component, 
since conductive contact is made between the applicator electrodes and the skin that represents a very 
"leaky" capacitor. Electric fields produced at the tissue level by this device are between 1 and 50 V/m 
[Pollack and Brighton, 1989]. These levels are very much higher than the average amplitude of the electric 
fields produced in tissue by PEMF devices and also higher than the instantaneous peak values produced 
by some of the PEMF systems. It is interesting to note here that in vitro experiments with truly capacitively 
coupled fields showed enhancement of bone cell proliferation at very much lower amplitude (10~ 5 V/m) 
when the frequency of the continuous sine wave was 10 Hz rather than 60 kHz [Fitzsimmons et al., 1989]. 

A typical invasive (implantable) dc device (F, G in Table 91.1) consists of a small (approximately 4 x 
2 x 0.5 cm) titanium case and two or four titanium wires. These wires act as the cathode, and the case, 
which contains the battery that is connected to it, forms the anode. The amplitude of the continuous 
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current is between 5 uA (for some spinal fusion applications) and 20 uA (for nonunion of long bones). 
The cathodes are placed at the location where bone growth is to be enhanced, e.g., at the vertebrae that 
are surgically fused, while the case is placed in a convenient location some distance from the bone. 
Treatment details and success rates, in comparison with surgical procedures without use of dc stimulation, 
are discussed in the medical literature [Nerubay et al., 1986]. 

Although not used or approved for use in the United States, the German Magnetodyn system [Kraus, 
1984] is interesting not only because it employs sinusoidal magnetic fields between 2 and 20 Hz but also 
because it relies on metallic implants that are used for fixation of the bone to act as the "secondary" of 
a transformer whose primary is the external applicator. Sometimes an implanted pickup coil (secondary) 
is connected to fixation bars or to screws (electrically insulated from the bars) on each side of a pseudoar- 
throtic gap. With this system, peak electric fields of 40 V/m and current densities of 5 A/m 2 have been 
produced in the gap. 

A PEMF signal (similar to B in Table 91.1) also has been used experimentally to arrest osteonecrosis 
(bone death, possibly due to vascular impairment or toxic agents) of the femoral head. Application for 
8 hours per day over 12 months gave substantially better results than the standard surgical (core decom- 
pression) treatment [Aaron and Steinberg, 1991]. 

91.2 Soft-Tissue Repair and Nerve Regeneration 

No electric or magnetic system to aid nerve regeneration or soft-tissue repair is approved by the FDA at 
the present time for nonexperimental therapy. However, considerable animal and in vitro experimentation 
in this country and abroad suggests the clinical usefulness of electric currents for soft-tissue repair 
[Canady and Lee, 1991] and possibly also to enhance repair of nerve fibers that have sustained crush or 
transsection injury [Ito and Bassett, 1983; Siskin et al., 1990]. Since there is a great variety of soft-tissue 
pathologies that could respond to electric or magnetic fields, the volume of application in this area could 
in the future become larger than in orthopedics, provided field-tissue and field-cell interactions become 
better understood and clinical benefits for specific injuries and diseases are established. 

Beneficial effects of time-varying electric fields in wound healing are most likely related to promotion 
of angiogenesis. Wound healing consists of several stages, the first being inflammation, when changes in 
vascular permeability occur, infiltration of leukocytes and macrophages takes place, and cells migrate, 
synthesize granulation tissue, collagen, and proteoglycans, and initiate formation of capillaries. This is 
followed by transitional repair and remodeling phases. Electric currents are probably only important in 
the first two stages, and the experimental clinical trials performed thus far involve therapy for inflam- 
mation [Binder et al, 1984; Ieran et al, 1990]. 

Very soon after some types of bone injury and pathology were first treated with PEMF, the effects of 
PEMF on peripheral nerve regeneration became the subject of investigation. Improved neural function 
that appeared as an unintended "side effect" in the clinical treatment of nonunions lead to a systematic 
investigation of PEMF effects on neural regeneration in rats [Kort et al., 1980]. Other investigators 
employed other animal models and also compared PEMF with dc current as agents for neural regener- 
ation. More recent work [Sisken et al., 1990] employed a PEMF signal consisting of 20-ms pulses at a 
repetition rate of 2 pulses per second with exponential rise and decay times of, respectively, 0.5 and 1 ms. 
Amplitudes were 0.3 mT for experiments with crush lesions of the sciatic nerve in rats and 0.05 mT for 
stimulation in vitro of neurite outgrowth in dorsal root ganglia. Estimated values for the induced electric 
field pulses were 5 mV/m in the animal experiments and 0.7 mV/m in the 60-mm-diameter culture 
dishes of the in vitro work. Stimulation for 2 hours per day of the in vitro cultures produced approximately 
50% enhancement of neurite outgrowth in comparison with controls after 2 days. A report on a very 
limited (13 subjects) clinical trial [Ellis, 1987] of spinal nerve stimulation in para- and quadriplegics 
employing pulsed electric current introduced by needle electrodes produced "encouraging results" in 
terms of increased sensory perception and motor function. 



S) 2000 by CRC Press LLC 



91.3 Mechanisms of Field-Tissue Interaction 

It is known that electrokinetic or streaming potentials rather than piezoelectricity make the principal 
contribution to electric potentials generated by mechanically stressed bone [Gross and Williams, 1982; 
Lavine and Grodzinsky, 1987] . Thus potential differences appear when mechanical loading displaces fluid 
that contains "counterions" that normally reside opposite ions fixed to cell or intercellular matrix surfaces. 
These potentials are likely to play a role in intracellular signaling and in bone as well as cartilage and 
soft-tissue development. While the original intent of electrical bone therapy was to simply mimic endog- 
enously generated fields, a much wider range of signals was found to be clinically useful. Furthermore, 
it was found later that some weak ionic currents (~5 x 10 2 A/m 2 ) appear endogenously without mechanical 
stress and that extremely weak sinusoidal electric fields can produce profound effects on cells in vitro. It 
has been claimed that transcription (information transfer from DNA to RNA) and translation (formation 
of new proteins initiated by RNA) have been modified by electric fields estimated at 10~ 3 V/m that were 
generated inductively by 60 Hz magnetic fields of 5.7 uT [Goodman and Henderson, 1991]. However 
these results are controversial [Saffer and Thurston, 1995]. 

It is likely that the mechanism involved when dc currents are directly applied to injured bone (or other 
tissue) differs from the transduction sequence that must be acting when low-intensity alternating fields 
are employed. Even the 5-uA continuous current of the implantable devices (F, G in Table 91.1) when 
distributed over an (estimated) 5-cm 2 area corresponds to a steady electric field of 1 V/m in bone tissue 
of 0.01 S/m conductivity. This value is large compared with the average (but not the peak) fields induced 
in tissue by some PEMF devices [Rubin et al., 1989] or the millivolts per meter ELF sinusoidal fields that 
affect cartilage and bone development [McLeod and Rubin, 1990]. For example, if one assumes a mean 
radius of 4 cm for a particular human bone fracture, one obtains from Eq. (91.2) the electric field values 
between 0.18 and 1.74 V/m shown on Table 91.2. 

To clarify the sequence of biologic events that occur when PEMF signals are applied to developing 
bone, the following in vivo experiment was performed [Aaron, 1989]. Twenty- five milligrams of dem- 
ineralized rat bone matrix in powdered form was implanted along the thoracic musculature of immature 
rats. This powder recruits cells from the surrounding tissue leading to formation of cartilage within 6 to 
10 days; thereafter, progressive calcification occurs, leading to formation of fibrous particles by days 12 to 
14 and formation of a small bone (ossicle). These developments were compared in a large number of 
paired rats, with equal numbers unexposed and exposed (8 hours per day) to the PEMF signal illustrated 
in Figs. 91.1 and 91.2 (A in Table 91.1). Estimates of the mean electric fields in the exposed tissues give 
values equal to about one-fourth those listed on line 1 in Table 91.2. Chemical and histologic analysis of 
ossicles harvested from animals, sacrificed on every second day, showed that exposure to this PEMF signal 
at the applied level significantly increased both rate and quantity of cartilage formation and enhanced 
maturation of the subsequent bone. The experimenters concluded that field exposure either enhanced 
recruitment or proliferation of cartilage precursor cells, increased differentiation of precursor to cartilage 
cells, or accelerated maturation of cartilage cells. 

Getting closer to fundamental events at the cellular level, both devices A and B (Table 91.1) were used 
to expose cultured mouse bone cells and mouse skin fibroblasts, as well as explanted mouse pineal cells 

TABLE 91.2 Electric Field (V/m) Induced by Bone Therapy Signals at a Radius of 4 cm into an Electrically 
Uniform Medium (B perpendicular to plane in which radius is denned) 



PEMF device A (Table 91.1, Figs. 91.1, 91.2, and 91.7) 
PEMF device B (Table 91.1)* 
AC-DC device I (Table 91.1, Fig. 91.8) 

* Signal parameters given on p. 91-3 ("pseudoarthrosis" signal). 
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Positive Peak 


Negative Peak 


Average of 
Rectified Signal 


0.2 
0.18 
76.6-Hz sinusoid, peak 


1.74 

0.015 

±(1.9)10- 4 


0.024 
(1.4)10-' 
(1.2)10^ 



in organ culture [Luben, 1993] . In all three cases, various chemical procedures were employed to examine 
beta-adrenergic receptors. These are cell surface protein strands that span the cell membrane and emerge 
from it; they mediate cell response to agents such as epinephrine (adrenaline) and norepinephrine 
through so-called G-proteins that act essentially as molecular amplifiers at the interior surface of the cell. 
Other G-proteins are involved in the response to growth factors. The arrangement of the exposure coils 
and culture plates was such as to give mean electric fields equal to about one-tenth the values shown in 
Table 91.2. Exposures were of 4 hours duration. Specific types of G-proteins were stimulated by the A 
signal; others by the B signal. The total number of binding sites on the cells was not affected, but the 
affinity of the receptors for specific hormones was changed, suggesting a change in receptor conformation. 
It is interesting to compare this work, which employed peak values on the order of 10~ 2 V/m and time 
averaged values not greater than 2(10~ 3 ) V/m, with other in vitro experiments showing effects on enzyme 
activity at the cell surface by ELF sinusoidal fields between 5(10~ 4 ) V/m and 30 V/m [Blank, 1992]. 

How such weak, low-frequency electric and magnetic fields are translated into biochemical activity is 
at the present time not known. Field energies are orders of magnitude below levels that would produce 
thermal effects, and in most cases, potential differences produced across cell membranes also appear to 
be below thermal equilibrium ("Johnson") noise. References to several possible models of field-to- 
biochemical process transduction are given in the introduction to Section IX and in Chapter 90. One of 
the devices (I in Table 91.1) approved by the FDA for the treatment of nonunions is based on either the 
cyclotron resonance [Liboff and McLeod, 1988; Liboff et al., 1990] or the parametric resonance model 
[Lednev, 1991, 1993]. In this device, a dc magnetic field B of 20 uT is used to obtain an ion cyclotron 
frequency f c or an integral multiple of it. This frequency is given by 

f c = ^- (91.3) 

2nm 

where Q is the charge of the ion and m its mass. The fifth harmonic of f c for 40 Ca 2+ and the third harmonic 
for 243 Mg 2+ are equal to 76.6 Hz when B a = 20 uT An alternating 76.6-Hz magnetic field of 40 uT is 
applied through the same coil as the dc field. This parallel field combination has been reported to increase 
the concentration of the important growth factor IGF-II in a culture medium containing the osteosarcoma 
cell line TE-85 after 30 minutes of exposure [Fitzsimmons et al., 1993], to affect articular cartilage 
metabolism [Ryaby et al., 1993a], and to prevent bone loss due to castration in rats [Ryaby et al., 1993b]. 

Defining Terms 

Angiogenesis: Formation of blood vessels. 

Biphasic signals (in the context of bone stimulation): An electric or magnetic field signal with both 
positive and negative components (as opposed to a unidirectional pulse signal). 

Capacitively coupled device: An electrical stimulating device (usually for enhancement of bone 
growth) which employs metallic plates for application of an electric signal to the body part (usually 
a limb) under treatment. In the 60-kHz device (E) listed in Table 91.1, conductive contact is made 
with the skin; therefore, current flow consists of both conductive and displacement components. 

Cell line: A cell culture derived from a single neoplastic progenitor cell and therefore with homogeneous 
genetic constitution. 

Cyclotron resonance: The condition under which an electrically charged particle continuously gains 
energy (provided collision damping is not excessive) as it moves in a circular orbit of increasing 
radius under the simultaneous influence of a dc magnetic field and a circumferential electric field 
whose frequency is given by Eq. (91.3). This electric field can be generated by an alternating 
magnetic field whose direction is parallel to that of the dc field. 

Extremely low frequency (ELF): A frequency between 3 and 300 Hz. 

Ion cyclotron frequency: The frequency given by Eq. (91.3) for an ion of charge Q (Q is equal to the 
electronic charge, 1.602 x 10~ 19 C, or the integral multiple thereof appropriate for the particular ion). 
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Nonunion: A bone fracture that fails to heal under conventional treatment within the normal expected 
time. 

Parametric resonance: A resonance condition involving Zeeman splitting of a vibrational energy level 
and consequent changes in the transition probability whose frequency is given by Eq. (91.3). 
Vibrational levels of an ion within a protein molecule would be at ELF with dc magnetic fields 
below 100 uT (for details see Lednev, [1991, 1993] and Blanchard and Blackman, [1994]). 

Pseudoarthrosis: A false joint formed on the shaft of a long bone. Either congenital or formed at the 
site of a fracture that failed to fuse. 

Pulsed electromagnetic field (PEMF): A term used frequently in the electrical stimulation literature 
for a pulsed magnetic field that is capable of inducing an electric field and current as a consequence 
of Faraday's law. Depending on pulse width and pulse shape, the spectrum of PEMF bone stimu- 
lation signals may contain components in the ELF and/or VLF (up to 100 kHz) range. 

Spinal fusion: The surgical fusion of two or more vertebral segments in order to eliminate motion 
between them (sometimes performed to mitigate pain resulting from degenerative bone disease). 
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Radiofrequency radiation (RFR), defined by the Institute of Electrical and Electronics Engineers (IEEE) 
as extending from 3 kHz to 300 GHz, is absorbed by biologic systems, which are water-dominated 
dielectrics richly endowed with electrolytes and intricately packaged polar and nonpolar molecules. At 
sufficiently high RF intensities, thermal energy is generated that can quickly produce morbidity and, after 
thermoregulatory mechanisms are overwhelmed, mortality. In recent years, work has focused on possible 
effects at "nonthermal" levels or under conditions in which physiologic temperature can be maintained 
by regulatory mechanisms of living systems. 

The human perception of microwave heating has been studied [1,2]. Although the human is the 
ultimate "test object," the bulk of research has been on animals. This requires an understanding and 
appreciation of biophysical principles and comparative medicine. Such studies require interspecies "scal- 
ing," [3-7] the selection of biomedical parameters that reflect basic physiologic functions, and differen- 
tiation of adaptational or compensatory changes from pathologic manifestations. In comparing results 
of experiments performed in the same or different laboratories, standardization of conditions is important 
and, unfortunately, all too often not attained. 

Even by using approaches where absorbed energy patterns in a test animal are set to approximate as 
closely as possible the patterns that may exist in humans under certain exposure conditions, the intrinsic 
physical and physiologic dissimilarities between species further confound the problem of extrapolating 
between animals and humans. In addition to the obvious external geometric differences, the differences 
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in internal vascular anatomy and mechanisms of heat dissipation in fur-bearing animals compared with 
humans must be taken into considerations. 

Johnson [8] has described factors that affect absorption of electromagnetic energy in animals. This 
absorption is dependent on the size and geometry of the animal relative to the wavelength and polariza- 
tion. The wavelength-to-animal size relationship {XI a, where a is the longest axis dimension of the body 
and the electric field vector is parallel to the longest axis) is a critical factor in the relative absorption 
cross section (RAC). The RAC is the ratio of the absorbed power to the power incident on the geometric 
cross-sectional area of the animal [9]. This produces the immediate result that at a given frequency and 
power density, the specific absorption rate (SAR) is vastly different in animals depending on size. To 
produce an identical whole-body average SAR, one must scale from one frequency to another. It is of 
practical importance to realize that experiments on biologic effects at 2.45 GHz on small animals, such 
as mice and rats, do not scale to humans at 2.45 GHz but to effects on humans at approximately 100 MHz. 
In recent years, much modeling and experimental measurement has been done on the variation of 
localized absorption in animals and humans [10]. 



92.1 Cellular and Molecular Biology 



The mutagenic potential of microwave energy has been evaluated by various techniques, including point 
mutations in bacterial assays [11], the dominant lethal test in mammalian systems [12], and genetic trans- 
mission in Drosophila [13], with inconsistent results. Two studies reported effects of RFR on sister chromatid 
exchange [14,15]. Such changes as were seen appeared to be related to temperature elevations rather than 
RF exposure. From many studies there is little evidence that exposure to RF radiation induces mutations in 
bacteria, yeasts, fruit flies, or mammals. One study reported cytogenetic effects at 200 W/m 2 [16], but its 
finding was contradicted by another study [17] that failed to find any cytogenetic effects at 2 to 5 kW/m 2 . 

There is a consensus among students of the subject that RF radiation does not initiate carcinogenesis 
by inflicting direct damage on the genome by any mechanism resembling the effect of ionizing radiation. 
This is evident from the low photon energies involved. RF radiation as part of a compound stress on 
intact animals has been studied with conflicting results. Fibroblasts in tissue culture were exposed to 
benzpyrene and the promoter 12-o-tetradecanoylphorbol with and without 2.45-GHz microwaves, SAR 
4.4 W/kg. Microwave exposure produced a several-fold increase in transformation frequency [18]. 

In a study of long-term, low-level microwave exposure of rats, Chou et al. [19] compared 100 rats 
exposed to 2.45-GHz pulsed microwaves, average SAR 0.15 to 0.4 W/kg, 21 hours per day for 25 months 
with suitable controls. No adverse effects were found. An increase in malignancies of specific type in 
exposed animals was found, but a significant increase in malignancies of all types was not found. This 
was interpreted as minimal evidence of "carcinogenic action." 

92.2 Reproduction, Growth, and Development 

The effect of microwaves on the testes has been studied extensively [20-22]. Exposure of the scrotal area 
at high power densities (>500 W/m 2 ) results in varying degrees of testicular damage, such as edema, 
necrosis of seminiferous tubules, and atrophy, at 2.45, 3, and 10 GHz. 

Lebovitz et al. [23] compared pulse-modulated 1.3-GHz microwaves with conventional heating in rats. 
The authors stated that all damage could be explained by heating. Although the general consensus is that 
damage is thermally mediated, Cleary et al. [24] reported decreases in sperm function in vitro utilizing 
suspension of mouse sperm exposed to SARs of 50 W/kg or greater but with stringent temperature control. 

Effects on embryonic development have been studied. Thermal stress appears to be the primary 
mechanism by which RF energy absorption exerts a teratogenic action. Chernovetz et al. [25] and others 
have pointed out evidence that indicates that increases in mortality and resorption are probably related 
to peak body temperature and its duration regardless of the method by which the temperature elevation 
is elicited. Rugh and McManaway [26] were able to prevent the increase in incidence of teratogenic 
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activity, which they had previously reported, by lowering the maternal body temperature through con- 
trolled use of pentobarbital anesthesia. The most common result from many studies [27] appears to be 
a reduced or retarded gain of body mass, thermally mediated. 

92.3 Effects on the Nervous System 

Guy and Chou [28] studied response thresholds in rats for short, high-intensity microwave pulses. Rats 
were exposed to single 915-MHz RF pulses of l-(j.s to 300-ms duration. No reaction occurred until the 
SAR rose to 29 kj/kg, which correlated with a temperature rise in the brain of 8°C. Seizures occurred 
followed by unconsciousness for 4 to 5 minutes. Complete recovery ensued. Postmortem examination 
revealed some demyelination of neurons 1 day following exposure and focal gliosis 1 month after 
exposure. More intense exposures could produce a fatal outcome. Brown et al. [29] looked for response 
thresholds at lower energies, with particular attention to the onset of an involuntary, generalized muscular 
contraction that did not appear to be injurious. 

Studies of low-level chronic effects, mostly dating to the 1960s and 1970s, have yielded inconsistent 
results. Problems of dosimetric measurement and quantitation of reproducible biologic endpoints have 
left uncertain whether extended exposure to low-intensity or nonthermal levels of radiofrequency energy 
produces effects, adverse or not. Bawin et al. [30] reported that electromagnetic energy of 147 MHz, 
amplitude modulated, at brain wave frequencies (8 to 16 Hz), influenced spontaneous and conditioned 
EEG patterns in the cat at 10 W/m 2 . No effects were seen at other modulation frequencies. 

At high field intensities, when death is a result of hyperthermia, pathologic changes are identical to 
those of hyperthermia. At lower levels of exposure, there are no changes that are specific to RF radiation. 
From the work of Albert and DeSantis [31], there appears to be a threshold for permanent histologic 
damage to the brain for exposures lasting several hours per day for up to 3 weeks at between 100 and 
250 W/m 2 at 2.45 GHz CW. 

An historically controversial issue has been the effect of radiofrequency radiation on the blood-brain 
barrier (BBB). This poorly understood functional "barrier" provides resistance to movements of large- 
molecular- weight, fat-insoluble substances from the blood vessels of the brain into brain tissue, presum- 
ably to protect the brain from invasion by various blood-borne pathogens and toxic substances. Early 
reports asserted that RF radiation reduces the barrier, allowing many substances normally barred from 
brain tissue to enter including many drugs. It does appear that gross hyperthermia (brain tissue elevated 
to 43°C) compromises the barrier [32]. But there is evidence that the level of a drug entering the brain 
might actually be reduced at "moderate" levels of hyperthermia [32]. 

A number of studies on isolated nerve preparations have been performed. There is no direct production 
of a nerve impulse or action potential by CW or PW microwaves, but conduction velocity and amplitude 
can be changed, mediated by temperature elevations of at least 1°C in the solution. Similar changes can 
be produced by ambient changes in temperature [33]. 

92.4 Behavioral Effects 

Studies have been conducted on the effects of RFR on performance of trained tasks or operant behavior 
by rats and rhesus and squirrel monkeys [34,35]. All the studies indicated that the exposure would result 
in suppressed performance of the trained task and that an energy/dose threshold for achieving the 
suppression existed. Depending on duration and other parameters of exposure, the threshold power 
density for affecting trained behavior ranged between 50 and 500 W/m 2 . Lebovitz [36] and Akyel et al. 
[37] were unable to find any specific effect attributed to pulse power and noted that interference with 
behavior appeared to be of thermal origin. Raslearetal. [39] and Akyel et al. [38] later found performance 
deficits at very high peak, short-pulse powers at low repetition rates, such that no measurable temperature 
changes occurred in the brains of rats. Peak brain SAR was reported to be 7 MW/kg for 80-ns-wide 
pulses with an average brain SAR of 0.07 W/kg. 
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92.5 Effects on the Cardiovascular 
and Hematopoietic Systems 



At nonthermalizing levels of exposure, both bradycardia and tachycardia have been found in different 
studies of different animals and with inconsistent results in the same animal. The specific conditions of 
exposure, biologic variability, and other sources of error could account for the findings. 

Hyperthermia of RFR or non-RFR origin produces tachycardia and a decrease in total peripheral 
resistance caused by vasodilation, a heat dissipating response to the thermal burden. Variations from this 
general principle have been found in unusual circumstances, including the application of localized pulse 
power to the head, neck, and thoracic region [40]. 

Changes in the concentrations of circulating white blood cells have been observed in a number of 
animal species exposed to microwave energy. The changes were not consistent within or between species 
and depended on exposure conditions and thermal changes in tissue. A number of mechanisms have 
been proposed to explain changes in cellular dynamics, including stimulation of synthesis at thermal 
levels, recirculation of sequestered cells, and increased hypothalamic-hypophysial-adrenal function fol- 
lowing thermal stress. Such changes may be related to changes in immune function noted by a number 
of investigators [41]. 

92.6 Auditory and Ocular Effects 

Microwave hearing, the perception of a clicking or buzzing in the presence of pulsed microwave energy 
(but not CW) at low power densities [42], has been attributed to thermoelastic transduction of pulsed 
microwaves in the head, with detection by the sensory epithelium of the cochlea. The process, familiar 
to occupationally exposed workers, appears not to be harmful at energies commonly encountered [43]. 
A threshold for cataract production in the lens of the rabbit eye has been found for 2.45-GHz 
microwaves at 1.5 kW/m 2 for 100 minutes. An intraocular temperature elevated to at least 45°C appears 
to be required [44] . For exposures lower than this there does not appear to be a cumulative effect from 
microwave exposure, i.e., no pathologic damage following many exposures with time, for which each 
individual exposure produces no detectable damage. 

92.7 Conclusion 

Elucidation of the biologic effects of RF exposure requires study of the available literature. Evaluating 
the research is a difficult task even for scientists specializing in the field. The possible sources of error of 
the biologic sciences are coupled with the sources of error associated with RF engineering and dosimetry. 
It is often difficult to make meaningful comparisons between studies. The issue of effects or even hazards 
at "nonthermal" levels of exposure stimulates continuing debate and research that may affect existing 
consensus safety standards. Whether existing standards are adequate frequently rests on a determination 
of whether documented effects constitute actual hazards at energy levels not producing morbidity. The 
research of the future will continue to affect the issue of safe levels of exposure. 
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Defining Terms 

Blood-brain barrier: An anatomical and physiologic barrier to the movement of large-molecular- 
weight, fat-insoluble substances from the blood vessels of the brain into brain tissue. 

Bradycardia: An abnormal slowness of the heartbeat, as evidenced by slowing of the pulse rate to 60 or 
less. 

Radiofrequency radiation (RFR): Defined by the Institute of Electrical and Electronics Engineers 
(IEEE) as that part of the electromagnetic spectrum extending from 3 kHz to 300 GHz. 

Relative absorption cross section (RAC): The ratio of the absorbed power to the power incident on 
the geometrical cross-sectional area of an animal. 

Tachycardia: An excessive rapidity in the action of the heart, usually applied to a pulse rate above 
100 per minute. 
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93.1 Introduction 

Cancer hyperthermia is a treatment to raise the tissue temperature either locally or whole body to a 
therapeutic level to eradicate tumors. Over the last three decades, much has been learned about the effects 
of heat on cells and the interactions between heat and radiation and chemotherapeutic agents [1-4]. The 
scientific rationale for its use either alone or combined with other methods is multifactorial, and new 
justifications for its use are continuously being identified. For example, heat may be directly cytotoxic 
to tumor cells or inhibit repair of both sublethal and potentially lethal damage after radiation. Hyper- 
thermia has been used in combination with chemotherapy because heating increases membrane perme- 
ability and the potency of some drugs. The synergism of radiation and hyperthermia is accomplished by 
the thermal killing of hypoxic and S phase (DNA syntheses) cells which are resistive to radiation alone. 
Although the biologic rationale is strong, and hyperthermia has been studied in phase I-III trials, the 
results of phase III clinical trials are controversial and the use of hyperthermia is still a developing project 
in the U.S. [4-8]. 

Numerous factors could affect the results of hyperthermia. The foremost problem in hyperthermia, 
however, is the generation and control of heat in tumors. Current heating methods include whole body 
heating, using hot wax, hot air, hot water suits, or infrared radiation, and partial body heating, utilizing 
ultrasound, heated blood, fluid perfusion, radio frequency (RF) fields, or microwaves. The effective 
temperature range of hyperthermia is very small: 42 to 45°C. At lower temperatures, the effect is minimal. 
At temperatures higher than 45°C, normal cells are damaged. Due to this narrow temperature range, the 
response rate of the tumor is highly dependent on how much of it is heated to a therapeutic level. The 
clinical use of hyperthermia has been hampered by a lack of adequate equipment to effectively deliver 
heat to deep-seated and even large superficial lesions and by a lack of thermometry techniques that 
provide reliable information on heat distribution in the target tissues. In RF hyperthermia, the final 
temperature of tumors is mainly dependent on energy deposition. When electromagnetic (EM) heating 
methods are used, the energy deposition is a complex function of the frequency, intensity, and polarization 
of the applied fields, the applicator's size and geometry, as well as the size, depth, geometry, and dielectric 
property of the tumor [9,10]. The material, thickness, and construction of a cooling bolus also influence 
the amount of energy deposition. In this chapter, the main methods of RF hyperthermia will be reviewed. 
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93.2 Methods of EM Heating 

The EM energy used in hyperthermia is usually classified by frequency as either microwave energy or RF 
energy. Microwaves occupy the EM frequency band between 300 MHz and 300 GHz. Strictly speaking, 
RF is between 3 kHz and 300 GHz, but for hyperthermia it generally refers to frequencies below the 
microwave range. The most commonly used microwave frequencies in hyperthermia are 433, 915, and 
2450 MHz, which are the designated ISM (industrial, scientific, and medical) frequencies in the U.S. and 
Europe. Common RF frequencies are 13.56 and 27.12 MHz, which have also been widely used in 
diathermy. Frequencies higher than 2450 MHz have no practical value due to their limited penetrations. 
At lower frequencies field penetration is deeper, but the applicator must be larger and focusing is difficult. 
Despite these limitations, EM heating methods have been developed for local, regional, and whole body 
hyperthermia. 

Local Heating 
External 

The superficial cooling mechanism of tissue makes deep heating difficult by conductive methods. Two 
RF methods have been used to provide subcutaneous heating. For the first method, tissues are placed 
between two capacitor plates and heated by displacement currents. This method is simple, but overheating 
of the fat, caused by the perpendicular electric field, remains a major problem for obese patients. In 
planar tissue models, the rate of temperature rise is about 17 times greater in fat than in muscle, due to 
the large differences in their dielectric properties and specific heats [11,12]. Additionally, blood flow in 
fat is significantly less than that in muscle. Therefore, the final fat temperature is much higher than the 
muscle temperature, and a water bolus is necessary to minimize the fat heating. 

The second RF method uses solenoidal loops or "pancake" magnetic coils to generate a magnetic field. 
This field then produces heat in tissue by inducing eddy currents. Since the induced electric fields are 
parallel to the tissue interface, heating is maximized in muscle rather than in fat. However, the heating 
pattern is generally toroidal with a null at the center of the coil. Fujita et al. [13] described a paired- 
aperture type inductive applicator to produce deep heating in phantom. 

In the microwave frequency range, energy is coupled into tissues by waveguides, dipoles, microstrips, 
or other radiating devices. The shorter wavelengths of microwaves, as compared to RF wavelengths, 
provide the capability to direct and focus energy into tissues by direct radiation from a small applicator. 
Many applicators of various sizes operate over a frequency range of 300 to 1000 MHz [14-16]. Most of 
them are dielectrically loaded and have a water bolus for surface cooling. Low profile, lightweight 
microstrip applicators, which are easier to use clinically, have also been reported [17,18]. High permittivity 
dielectric materials, electric wall boundaries, and magnetic materials have been used to reduce applicator 
size and weight. For the most part, these applicators are used for treatment of tumors a few centimeters 
below the skin. Toxicities usually associated with treatment are pain and thermal blistering. 

Intracavitary 

Certain tumor sites at hollow visceras or cavities may be treated by intracavitary techniques. The advan- 
tages of intracavitary hyperthermia over external hyperthermia include better power deposition due to 
the proximity of the applicators to the tumors and the reduction of normal tissue exposure. There have 
been clinical and research studies on hyperthermia and radiation or chemotherapy of the esophagus, 
rectum, cervix, prostate, nasopharyngeal, and bladder cancers. 

Chou et al. [19] designed an esophageal applicator for intracavitary 915 MHz hyperthermia and intra- 
cavitary radiation. In a phantom test, the result showed that the effective heating length (>50% peak 
heating rate) is more than 12.0 cm. This applicator was used for a pilot study. The esophageal lesion was 
circumferentially and uniformly heated [20]. A microwave applicator with an integrated cooling system 
for intracavitary hyperthermia of vaginal carcinoma was recently developed in Sweden [21 ] . The advantage 
of this applicator is that it can lead to a highly targeted heating of tumors and a reduced risk of unwanted 
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heating of normal tissues. Li et al. [22] developed a 915 MHz applicator for cervical and upper vaginal 
cancers and a 2450 MHz applicator for uterine cancer. The average temperature distributions in the upper 
vagina, cervix, and uterus was 46 ± 2.1°C. In Italy, a computerized 915 MHz microwave system was 
designed to deliver simultaneously local bladder hyperthermia and intravesical chemotherapy for super- 
ficial bladder cancer [23]. The bladder walls could be directly heated by using a transurethral catheter. A 
cooled microwave transrectal applicator was used for prostate treatment [24] . The applicator was found 
capable of effectively heating a tissue volume extending radially 3 to 25 mm from the applicator surface. 
An intracavitary microwave antenna array system has also been developed for prostate cancer hyperthermia 
[25] . In addition, special applicators for nasopharyngeal cancer [26] and rectal cancer [27] were developed. 
Both microwave and RF energy have been used for intracavitary hyperthermia. The main problem, 
however, is that the tumor temperature is unknown. Most temperatures have been measured on the 
surface of the applicators, which can be very different from those in the tumor. Furthermore, many 
investigators have used thermocouples or thermistors to measure temperatures, not knowing the pertur- 
bation problem caused by the metallic sensors [28]. One solution to this problem is to measure tissue 
temperature in animals and then extrapolate to humans [19]. Recently, the MR thermography was 
developed for noninvasive monitoring of temperature distributions of deep-seated tumors [29]. 

Interstitial 

Interstitial techniques for radiation implants as primary or boost treatments have been practiced suc- 
cessfully by radiation oncologists for many years. When hyperthermia was learned to be cytotoxic and 
synergistic with radiation, it was natural to consider this combination with conventional interstitial 
radioactive implantation. The advantages of this technique over external hyperthermia include confined 
treatment volume, better sparing of normal tissue, accessibility of deeper tumors, more homogeneous 
therapeutic temperature distribution, and better control and evaluation of thermal parameters [30]. 
Interstitial hyperthermia has been used for various site tumors; however, a phase III study of interstitial 
thermoradiotherapy for recurrent or persistent human tumors did not show any additional beneficial 
effects over interstitial radiation alone. The delivery of hyperthermia remains a major obstacle [6, 31-34]. 
Methods such as resistive heating, the microwave technique, or ferromagnetic seed implants can be used 
for interstitial hyperthermia. 

With resistive heating, multi-electrodes inserted in plastic tubes were implanted in the treatment volume. 
The mean power deposition of the individual electrodes was controlled by varying the duty cycle of the 
RF signal applied to the electrodes [35]. To prevent excitation of nerve action potentials, an operating 
frequency greater than 100 kHz should be used. The microwave technique utilizes small microwave 
antennas inserted into hollow plastic tubing to produce interstitial heat. In the U.S., 915 MHz is a 
commonly used frequency for this technique. However, satisfactory heating patterns can be produced 
between 300 and 2450 MHz [36]. A small coaxial antenna can irradiate a volume of approximately 60 cc. 
It is necessary to use an array of microwave antennas to heat a large tumor. In a brain hyperthermia study, 
the array of four dipole antennas spaced 2.0 cm apart were capable of heating a volume of 5.9 cm x 2.8 
cm x 2.8 cm [37]. As in RF resistive hyperthermia, the degree of control of microwave power radiating 
from these antennas is important in order to achieve homogeneous heating. Since the antennas couple to 
each other, the spacing, phasing, and insertion depth affect the heating patterns of array applicators [38,39] . 

Interstitial heat can also be produced by using ferromagnetic seed implants. This technique is applicable 
for delivering thermal energy to deep-seated tumors. The seed characteristics and implant geometry must 
be determined prior to the treatment [40]. When exposed to RF magnetic fields (-100 kHz), the implants 
absorb power and become heated until they reach the Curie point. Here, the implants become non- 
ferromagnetic and no longer produce heat. The surrounding tissues are then heated by thermal conduc- 
tion. The influence of blood flow and tissue inhomogeneities of the tumor, which may affect the tem- 
perature distribution, can be compensated by the self-regulation of the implants. It is therefore possible 
to maintain a temperature close to the Curie point [41,42]. Another method, which exposes magnetic 
fluid in a tumor to an RF magnetic field (0.3 to 80 MHz), has also been shown to be feasible for inducing 
selective heating [43]. Cetas et al. [44] developed an alterative form of ferromagnetic seed. A nearly 
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lossless ceramic ferrite core (FC) is surrounded by an electrically conductive metallic sheath (MS). The 
authors indicated that the FC/MS configuration solves many of the technical problems that have hindered 
the clinical implementation of thermally regulating ferromagnetic implants for thermal therapies. 

Regional Heating 
Electric Field 

Heating deep-seated tumors is difficult. RF energy can be deposited into the center of the body, but a 
large region is affected. Differential increases of blood flow in normal and tumor tissues may result in 
higher temperatures in the tumor than in normal organs. However, this temperature differential cannot 
be assured. Strohbehn [45] used the term "dump and pray" to describe the situation of putting large 
amounts of EM energy into the region and hoping for satisfactory results. In Japan, the 8 MHz Ther- 
motron system uses a capacitive electric field to heat deep tumors and a water-cooled bolus to minimize 
the heating of fat tissue. The sizes of two electrodes are adjusted to control the heating patterns in patients. 

Most other electric field heating systems generate electric fields parallel to the body surface. These 
include the annular phased array systems, the CDRH helix system, the coaxial TEM cell, the ring electrode, 
the segmented cylindrical array, the toroidal inductor, and the loosely coupled TEM system. The APAS, 
made by the BSD Company (Salt Lake City, Utah), radiates 16 RF fields in phase toward the patient. 
These systems, with variations in phase, frequency, amplitude, and orientation of the applied fields, can 
add more dimensions to the control of heating patterns during treatment. To determine the excitation 
phases of an array for heating an inhomogeneous medium, the retro-focusing technique was applied 
[46]. A small probe was first inserted into a tumor. A signal was radiated from the probe and received 
by the array of applicators outside the patient. By the reciprocity theory, conjugate fields were radiated 
from the applicators and focused at the tumor. The technique was demonstrated experimentally in a 
water tank. A significant power increase at the desired focus was observed. 

In general, superficial heating and hot spots in normal tissues are the limiting factors of treatment 
effectiveness in the existing systems. Invasive techniques using interstitial hyperthermia have been shown 
to solve some of the deep heating problems. However, no adequate deep EM heating system is available. 
Scanned ultrasound provides an alternative method [47]. 

Magnetic Field 

Magnetic fields heat tissue by induced eddy currents. The magnetic-loop applicators of the Magnetrode 
unit (Henry Radio, Los Angeles, CA) are self-resonant, non-contact cylindrical coils with built-in imped- 
ance matching circuitry; they operate at 13.56 MHz with a maximum power output of 1000 W. The RF 
current in the coil creates strong magnetic fields that are parallel to the center axis of the coil where the 
body or limb of a patient is located. Since the magnitude of the induced eddy current is a function of 
the radius of the exposed object, there is no energy deposition at the center of the exposed tissue. However, 
Storm et al. [48] showed that the heating of tumors deep in the body was possible as demonstrated in 
live dogs and humans, with no injury of surface tissue. This was apparently due to the redistribution of 
the thermal energy by blood flow. Nevertheless, the FDA has forbidden the use of the Magnetrode. 

Whole Body Heating 

During the last 20 years, hyperthermia has been used primarily for treating localized tumors. However, 
tumors that are resistant to conventional therapy tend to be metastatic. For these patients, local and 
regional hyperthermia can only be palliative. For disseminated disease, whole body hyperthermia ( WBH) 
in conjunction with chemotherapy and radiation has been studied by many groups [49,50]. Preclinical 
studies are consistent with the concept that 41.8°C whole body hyperthermia can enhance the therapeutic 
index of irradiation and specific chemotherapeutic agents without a commensurate increase in normal 
tissue toxicity, induce elevated plasma levels of granulcyte-colony stimulating tumor necrosis factor- 
alpha, and other cytokine factors [51,52]. 
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Methods of WBH include hot wax, water blankets, water suits, radiant heat, and extracorporeal blood 
heating. To heat the whole body uniformly with EM energy is impossible. However, it is possible to heat 
the body regionally so that the blood flow will redistribute the heat to the whole body. In the past, 
applications of 434 and 468 MHz microwaves have been explored in Europe for WBH, but the results 
did not produce any significant impact [53]. Several regional RF systems have been attempted for WBH. 
The BSD dipole system and the CDRH helix system require that the body be inserted into a tunnel 
applicator. The BSD system also requires a water bolus in contact with the patient to provide better 
energy coupling and skin cooling. For hour-long use, the bolus is very heavy and uncomfortable. The 
RF electric field system, designed by the UCLA group, uses three electrode plates to heat deep-seated 
tumors in the torso. The patient lies on a table and the top plate is swung over the abdomen to heat the 
thoracic region. No water bolus is needed and there is no contact with the patient. It is very simple to 
use. After two years of extensive phantom and animal studies, it was found to be a very promising system 
for regional and WBH [54]. 

93.3 Conclusion 

There has been much progress in the application of EM energy for clinical hyperthermia. Based on 
medical demands, many technological advances have been made. As a result, new forms of treatment 
equipment have been developed, and existing methods have been improved. It is impossible for a single 
piece of equipment to fulfill all of the clinical requirements for patient treatments. Depending on the 
location and vascularity of the tumor and adjacent tissues and the general physical condition of the patient, 
the hyperthermia practitioners should have the option of choosing the most appropriate equipment. 

Hyperthermia is a complicated technique and should be applied only by individuals well trained in 
its use. Due to the complexity of EM energy coupling to human tumors, careful heating pattern studies 
should be performed on all exposure geometries and contingencies prior to treatment to assure the best 
treatment conditions for the patient. Since hyperthermia in combination with high energy radiotherapy 
cannot be repeated after the tumor receives a maximal dosage of ionizing radiation, the physician must 
try to reach the critical tumor temperatures in optimal conjunction with radiotherapy. Accurate ther- 
mometry is particularly important in all phases of clinical hyperthermia, especially when the patient is 
anesthetized. The benefit of a good treatment outweighs minor risks. If there is no other choice, it would 
be more beneficial for the patient to have an effective treatment with a few blisters rather than a safe but 
ineffective treatment. It is easier to treat the burns than the cancer. 
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94.1 Background 



Bioelectric phenomena are well-established topics in biomedical engineering. Electroporation is of inter- 
est because of its ability to rapidly and locally deliver molecules across biological barriers comprised of 
lipid bilayer membranes. 1 6 Many applications use in vitro cellular conditions, and focus on DNA intro- 
duction. In vivo tissue electroporation can provide local "drug delivery" that can be electrically controlled. 
Electroporation allows reversible or irreversible alteration of cell membranes, and also other lipid-based 
barriers in tissues such as human skin. With these barriers, ionic and molecular transport is reduced 
within microseconds by several orders of magnitude. Simultaneously, the local electric field across the 
barrier drives molecular transport across the reduced barrier. The necessary transient voltage across a 
single bilayer membrane is about 0.5 to 1 V (depending on pulse duration) so that electroporation is 
mild at the molecular level, and can result in negligible damage. 

94.2 Biological Barriers Based on Lipids 

A biological cell has intra- and extracellular compartments separated by a lipid-based membrane, which 
defines the spatial extent of the cell. The membrane presents a chemical boundary, which supports 
significant chemical concentration gradients. Although the membrane contains variable amounts and 
types of specific membrane proteins, the barrier-defining element is a thin (~6 nm) region of low 
dielectric constant (K m ~ 2) lipid. The ability of a thin fluid sheet of low dielectric constant material to 
exclude ions and charged molecules is impressive, and can be understood in terms of a "Born energy" 
barrier AW Bom . 7 Briefly, in the case of ion transport AWg om is the electrostatic energy increase due to 
electric charge moving from a polarizable liquid medium (here water, dielectric constant K^ ~ 80) to a 
relatively non-polarizable fluid lipid region within the membrane (dielectric constant K m ~ 2). Formally, 
A W Born is the electrostatic energy associated with assembly of a particular configuration of charge, and 
is expressed in terms of the electric field, E, and the permittivity of the surrounding medium, e = Xe 
(K is the dielectric constant and e = 8.85 x 10~ 12 F nr 1 ' 8 ) 
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W R = f -e£ 2 dV. (94.1) 

Born o 

all space but ion 

The electrostatic exclusion of ions and charged molecules can be understood by considering the change, 
AW Bom , as an ion is moved into the membrane. The maximum value, AW Bornmax , is the electrostatic 
contribution to the barrier. 

Because W Bom increases rapidly as the ion enters the membrane, the membrane barrier size can be 
estimated by considering the simple problem of moving a charge from water into lipid. An ion of type 
"s" is treated as a charged sphere of radius r s and charge q s = z s e with z s = ±1 and e = 1.6 x 10~ 19 C. 
Initially the sphere is surrounded by water, far from the membrane ( W Borni ) and is subsequently moved 
to the center of the membrane ( W Bornf ). The difference in these Born energies is the barrier height. A 
simple estimate of A W Bom is made by noting that a small ion diameter (2r s * 0.4 nm) is significantly less 
than a typical membrane thickness (h m ~ 3 to 6 nm; smaller values for some artificial planar bilayer 
membranes, larger for cell membranes). This allows A W Born to be estimated by neglecting the membrane 
thickness, and instead considering bulk lipid. This is justified because the greatest contribution to the 
electric field is in the volume near the ion. This estimate yields 
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= 100kT (94.2) 



where the relevant temperature is T = 37°C = 310 K. Numerical solutions to the electrostatic problem 
for a thin low dielectric constant sheet immersed in water yields A W Born « 65 kT A barrier of this size is 
surmounted at a negligible rate by thermal fluctuations (spontaneous ion movement). Moreover, a 
transmembrane voltage, U diKa , which is much larger than physiological values, would be needed to 
provide this energy. Uncharged molecules that can partition into the membrane and then cross the 
membrane by diffusion are not significantly affected by AW Bom . Instead, their transport is governed by 
a passive permeability due to the combined effect of dissolution and diffusion. But large uncharged 
molecules will not readily cross the membrane either because their combined solubility and effective 
diffusion constant within the membrane are small. 

94.3 Decrease of the Membrane Barrier to Transport 

Two types of membrane structural "defects" can greatly reduce AW Bornmax : (1) a mobile aqueous cavity 
(shuttling carrier), or (2) an aqueous perforation (a "pore"). Functionally, both structures provide 
aqueous transport pathways, so that charged molecules can cross the membrane. Indeed, biological 
systems have specific carriers and channels to regulate ionic and molecular transport. Electroporation 
based on hydrophilic pores (Fig. 94.1), and their constraints on membrane electrical behavior and 
molecular transport have been reviewed in detail. 7 

At the membrane level, structural re-arrangements such as those shown in Fig. 94.1 are hypothesized 
to generate a rapidly changing, heterogeneous population of pores in the membrane. 7 These drawings 
are hypothetical constructs, because visualization of the "primary pores" of electroporation is likely to 
be impossible. As the transmembrane voltage, U m (t), increases, membrane conformational changes 
involving entry of water into the membrane increase nonlinearly in their frequency-of-occurrence. 
Moreover, pore population evolution is highly interactive. As pores appear and expand, the conductance 
of the membrane increases tremendously, and quickly becomes so large that U m (t) cannot rise further, 
and rapidly decays when the applied pulse ceases. 

Formation of pores in lipid bilayer membrane was first proposed as a purely spontaneous event, due 
solely to thermal fluctuations. 7 An important concept is membrane rupture, a destructive mechanical 
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FIGURE 94.1 Illustrations of hypothetical structures of both transient and metastable membrane conformations 
that may be involved in electroporation. Hypothetical bilayer membrane structures related to electroporation. 
A: membrane free volume fluctuation. B: aqueous protrusion into the membrane ("dimple"). C: Hydrophobic pore 
proposed by Chizmadzhev and co-workers.' D: Hydrophilic pore proposed by Litster and by Taupin and co-workers, 
usually regarded as the "primary pores" through which ion and molecules pass during electroporation. E: Composite 
pore with one or more proteins at the pore's inner edge. F: Composite pore with "foot-in-the-door" charged 
macromolecule inserted into a hydrophilic pore. Although the actual transitions are not known, the transient aqueous 
pore model assumes that transitions from A — > B — > C or D occur with increasing frequency as U m increases. Type 
E may form by entry of a tethered macromolecule while U m is large, and then persist after U m has decayed by pore 
conduction. These hypothetical structures have not been directly observed, but are consistent with a variety of 
experimental observations and with theoretical models. Reproduced with permission from Weaver J, Cellular Biochem 
51:426-435 (1993). 

event caused by emergence of a very large pore. A "cookie cutter" model provides a simple way of 
imagining pore creation and motivates the mathematical form of the pore formation energy, W (r). Pore 
formation requires supplying an "edge energy," 2jir y (y is the edge energy per length) while simulta- 
neously removing an area-related energy, Ttr 2 T, (r is the energy per area of a flat, pore-free membrane) 
arising from loss of a circular region of membrane. The idea is simple: a pore-free membrane is envisioned, 
then a circular region is removed, and the difference in energy between these two states is identified as 
W (r). Thus, the pore creation energy at U m = is 



Aw(r) = 2nyr -TlTr 2 at [7 =0 
p\ p) ' p p m 



(94.3) 



Equation (94.3) describes a parabolic barrier, with height W ai . For typical values y ~ 2 x 10 u J m 
and r ~ 1 x 10~ 3 J itt 2 , and the "critical radius" 
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(94.4) 



Structural arguments and interpretation of experiments indicate that the minimum pore size is r min ~ 
1 nm. The large barrier height suggests that spontaneous rupture is negligible, consistent with normal 
cell stability. 
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94.4 Basis of Electroporation 

The creation of hydrophilic pores by increasing U m (t) was first suggested by Chizmadzhev and coworkers. 7 
It was hypothesized that the creation process involves a transition from spontaneously occurring hydro- 
phobic pores to hydrophilic pores (Fig. 94.1), which is governed by a rate equation with a modified pore 
barrier function, W Jr, U m ). A hydrophilic pore (hereafter simply "pore") is believed to dominate most 
electroporation phenomena. It can be modeled as a leaky, microscopic capacitor whose parallel resistance 
for small radii (r in ~ 1 nm) is extremely large (_R^ > 10 9 Q). Many pores are created, which also increase 
in size, such that their contribution to the membrane conductance, G m , results in a several order of 
magnitude drop in membrane resistance, 1^ = 1/G m . This dramatic electrical event occurs even though 
there is insufficient energy iq s U mmsix ~ 1 eV) for classic dielectric breakdown that involves ionization by 
monovalent ions. 

A quantitative model for electroporation of artificial planar bilayer membranes provides a test of this 
electroporation hypothesis. In the presence of a transmembrane voltage, the free energy of pore formation is 

W t (r> U m ) = 2%yr p - nTr 2 - 0.5C p [/ r 2 , (94.5) 

where U m is the spatially averaged transmembrane voltage. The change in specific capacitance at the site 
of a pore due to water replacing lipids is 



C] w - C 



(94.6) 



Here C = e m /ft m is the capacitance per area of a pore-free lipid bilayer of thickness h m , with e w = is^e 
and e m = _K,e the permittivities of water and lipid, respectively. Using K x ~ 2 and K^ x 80, the barrier 
AWp(r p , U m ) decreases as U m increases. 

The pore creation energy maximum, W ix (r, U m ), is a fundamentally important quantity, along with 
the corresponding critical pore radius, r . Both diminish with increasing U m . The aqueous interior of 
the pore has a much larger permittivity than the surrounding lipid. More specifically, as U m increases, it 
therefore becomes progressively more favorable for water to enter the pore. Thus, pore expansion is 
electrically driven. Qualitatively, this provides a readily visualized explanation of the rupture of planar 
bilayer membranes. As U m increases W max , decreases, so that the probability of one or more "supracrit- 
ical" pores [r > r pc ( U m )] goes up. But even one supracritical pore can cause rupture. Once a pore expands 
beyond the barrier maximum, it can expand until reaches whatever physical boundary confines the 
membrane. In the case of artificial planar bilayer membranes, this boundary is a small but macroscopic 
aperture (e.g., r ture ~ 10~ 3 m). In the case of a cell membrane, the cytoskeleton or other cellular structure 
may provide a boundary. In either case, expansion of a supracritical pore to the boundary is essentially 
irreversible. This is "irreversible breakdown." 

One significant achievement of the transient aqueous pore model is a quantitative description of 
rupture, briefly presented here. The extension of Eq. (94.4) to U m > yields, 9 

r = ^ an d a W = 7l I (94.7) 

p,c r+o.sc, u 2 p ' max r+0.5C, U 2 

Jw m Iw in 

Note that the critical pore radius decreases as U m increases, and more significantly, the corresponding 
pore energy, Wl max ([/ m ), decreases. Physical processes governed by activation energies (barriers) involve 
Boltzmann factors. In such cases, including that of pore creation, there is a nonlinear dependence on 
system parameters such as U m . This is the primary source of the nonlinear dependence of pore creation 
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on U m . As discussed below, only a moderate increase in U m is needed to significantly diminish the stability 
of a planar membrane, and can cause rupture. 

Self-consistent electroporation theories predict that pores are located randomly but widely spaced, on 
average separated by several tens of pore radii. The equipotentials outside the pore but near the membrane 
have a significant, nonlinear gradient. Thus, the current flowing through a pore results in a voltage drop 
both within the pore and within the electrolyte external to the pore but near the entrance of the pore. 
This potential drop near a pore's mouth is due to a "spreading resistance," -R d , within the bathing 
electrolyte, which is 

2a r 

e p 

The voltage drop associated with R d is external to the pore, and therefore does not contribute to the 
driving force that expands pores. 

The electrical conductivity within the smallest pores is suppressed, because of Born energy exclusion 
due to the nearby low dielectric constant lipid. Ion entry is less likely due to the energy cost of bringing 
an ion into a small pore. This leads to a reduction in electrical conductivity within a pore, a . Moreover, 
ion movement through a pore only slightly larger than the ion involves significant steric hindrance. The 
resistance within a pore can be estimated using a simple cylindrical geometry, but with a Born energy- 
reduced conductivity, a . This "interior" pore resistance is 

3*"— T- ( 94 - 9 ) 

nr a 

p p 

where a e is the bulk electrolyte conductivity. The two resistances -R spd and R int are in series, creating a 
voltage divider effect. 
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(94.10) 
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For this reason U < U m , and as a result the electrical expanding force associated with dW ' (r, U m )/dr 
in pore radius space is diminished, and pores expand more slowly than expected from using U m in 
Eq. (94.5). 

Electrical measurements on artificial planar bilayer membranes show that some small pores remain 
after U m is decreased. Cellular experiments tested the response to dye uptake after electrical pulsing, and 
these revealed that some cells have a persisting capability to take up these molecules. One hypothesis is 
that some types of complex pores are created, which may involve a more permanent portion of the cell, 
e.g., the cytoskeleton or tethered cytoplasmic molecules (Fig. 94.1). Thus, some type of long lifetime 
metastable pores may occur. This idea is consistent with experiments involving DNA uptake by cells. 
DNA is a large, highly charged molecule, so that while temporarily occupying a pore, DNA charge groups 
should inhibit pore closure by coulombic repulsion. 7 This is a "foot-in-the-door" hypothesis (Fig. 94.1), 
which is also consistent with skin electroporation experiments using heparin. 6 However, most pores 
appear to be destroyed quickly, with pore destruction independent of other pores. This independence is 
plausible, as pores are predicted to be widely spaced even when the maximum number of pores is present. 

A heterogeneous pore size is fundamentally expected, because there are two contributions of energy 
that lead to pore formation: (1) thermal fluctuations ("kT energy"; stochastic), and (2) an increased 
transmembrane voltage ("electric field energy"; deterministic). The combination is therefore stochastic, 
and leads to the expectation that a distribution of pore sizes occurs. The flux of pores in pore radius 
space is described by 
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Diffusion within "pore radius space" is contained in the term with D , the diffusion constant of the pores 
in pore radius space. For a statistical description, n = n(r , t) is the pore distribution function, in which 
n = n(r • , t) is a probability density function. Within a radical increment, Ar there are n 
pores instantaneously present with radii between r and r + Ar . 

This approach leads to a stochastic description of rupture, with average membrane lifetimes, 
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Here, A m is the planar membrane area, and c_ (the s_urface pore concentration) is the number of pores 
per area. Another simpler approach for estimating x is based on an absolute rate estimate for critical 
pore appearance. In this case, x depends on the reciprocal of a Boltzmann factor, in which W ( U m ) is an 
argument. By using an order of magnitude estimate for the prefactor, the mean membrane lifetime against 
rupture is 



1~ exp +AW 
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(94.13) 



Here v is an attempt rate density based on the collision frequency and number density of phospholipid 
molecules within the membrane. The quantity V m = h m A m is the membrane volume. By choosing a 
plausible mean lifetime, (e.g., x m ~ 1 s), the magnitude of AW. C can be estimated, and from that, L^,.. 
Because of the strong non-linear behavior of Eqs. (94.12) and (94.13), an order of magnitude difference 
in the choice of mean lifetime, e.g., 0.1 or 10 s, results in only small differences in U mc . 

Interaction of the membrane and its pore population with its environment should also be considered. 
Specifically, there is a current pathway through the bulk electrolyte by which ions are transported to the 
membrane in order to charge the membrane capacitance to U m . During electroporation, the membrane 
capacitance, C m , changes by less than 2% because only a small fraction of the membrane is occupied by 
aqueous pores. In contrast, the membrane conductance, G m (t) = 1/R m (t), increases by orders of magni- 
tude. Typically a short voltage pulse is applied, and current flows into and/or across the membrane. The 
membrane usually cannot discharge through the pulse generator. Instead, the discharge current passes 
through pores in the membrane. During a rectangular pulse, the circuit differential equation is 
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Once the pulse stops (and the pulse generator is switched out of the circuit), the transmembrane voltage 
can change only by ions moving through the membrane, and the corresponding equation begins 
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Subject to the initial condition U m (t = 0) of zero, Eqs. (94.14) and (94.15) are solved numerically to 
provide a computer stimulation of planar membrane electroporation. 



© 2000 by CRC Press LLC 



Experiments have shown that during rupture U m (t) exhibits a much longer discharge time than during 
reversible electrical breakdown (REB, see below), and that the rupture discharge results in U m (t) having 
a sigmoidal shape. A transient aqueous pore theory can account for general features of this behavior. 7 
The striking phenomenon of REB is observed for both artificial planar bilayer membranes and in cell 
membranes. However, the term "breakdown" is misleading, as REB is actually protective. During REB 
the membrane acquires a very large conductance by evolving a large pore population, and the resulting 
highly nonlinear increase in G m prevents U m from exceeding U m ~ 1.5 V. If some types of artificial planar 
bilayer membranes are challenged by short pulses, a characteristic feature of REB is observed: there is 
progressive shortening of the postpulse membrane discharge as larger and larger pulses are used. This 
behavior is consistent with a progressively smaller R m being achieved. The transition from irreversible 
behavior ("rupture") to reversible behavior (REB or incomplete reversible electrical breakdown) can be 
quantitatively described by the evolution of a dynamic pore population. If the pore population is small 
(moderate U m ), then the membrane discharge can be so slow that one or more pores evolve to rupture 
the membrane. But if the pore population is large (high U m ), then there are so many pores that the 
membrane discharges before any membrane-rupturing supracritical pores evolve. 

Irreversible electroporation of cell membranes is more complicated. A planar membrane is bounded 
by the aperture of the experimental apparatus, and for this reason has a total surface tension (both sides 
of the membrane), T, which favors pore expansion. But there may be no corresponding reservoir of 
membrane molecules in a cell, and almost certainly none in the case of a vesicle (approximately an empty 
spherical membrane). If the osmotic or hydrodynamic pressure difference across the cell membrane is 
zero, the cell membrane has T a 0, and rupture is therefore not expected. If, however, a region of a cell 
membrane is bounded by the cytoskeleton (or other cellular structure) the membrane may behave like 
a microscopic planar membrane. One or more regions of a cell membrane may rupture, creating per- 
manent openings and leading to cell death. 

94.5 Molecular Transport 

From a biomedical engineering viewpoint, electrically controlled transport of molecules across cell 
membranes and tissues is likely to be the most important feature of electroporation. There is strong 
evidence for electrophoretic transport through pores as a major contribution for charged molecules. This 
is consistent with the observations that charged molecule transport due to a single exponential pulse 
exhibits a subequilibrium plateau, in that net transport (cell uptake) is independent of field pulse 
magnitude for large pulse magnitudes. A transient aqueous pore model of electroporation uses only an 
electrophoretic drift contribution through pores, and this is consistent with a subequilibrium plateau. 

Most potential biomedical applications seek controlled molecular transport, without significant side 
effects. However, assessment of cell viability/death following electroporation is nontrivial. Convenient, 
short term viability assays based on membrane-impermanent dyes assume that membrane openings are 
themselves evidence for cell death. The fact that electroporation by definition causes openings with 
unknown recovery kinetics means that vital stains and membrane exclusion probes cannot be used 
without validation. Partly for this reason, cell death by electroporation is not well understood. 

Chemical stress resulting from nonspecific molecular transport has been hypothesized to cause cell 
death by either reversible or irreversible cell membrane electroporation. 4 The irreversible case clearly can 
lead to stress, but in principle membrane-reversible electroporation that generates large chemical 
exchange can also create stress. In either case, significant molecular exchange can occur. As already noted, 
in the case of irreversible electroporation, a region of the cell membrane may behave much like a 
microscopic planar membrane, and undergo rupture. In the case of reversible electroporation, significant 
molecular transport between the intra- and extracellular volumes could cause a significant chemical 
imbalance. If too large, the associated chemical stress (loss of essential compounds; entry of harmful 
compounds) may kill the cell. The cell's local environment should be relevant, and the ratio 
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should govern the degree of chemical stress, and may correlate with cell survival or death. If -R^^e >> 1, 
(typical of in vitro cell suspensions and anchorage dependent cell culture) cell death should be favored, 
while for the other extreme R volume << 1, (typical of in vivo tissue conditions) cell survival should be 
favored. If this relative volume hypothesis is correct, then for the same degree of electroporation, signif- 
icantly less damage may occur in tissue than for cells located in body fluids (e.g., blood or urine) or 
under most in vitro conditions. 

94.6 In Vitro Electroporation Applications 

Practical use of electroporation has initially emphasized the introduction of genetic material into cells 
under in vitro conditions. 2 A high probability of cell survival can be unimportant. A typical experiment 
can begin with a large number of cells, and only a few successful clones are needed. The widespread use 
of electroporation for DNA introduction has involved electroporation of a large number of cell types, 
including microorganisms with intact cell walls, and many mammalian cells. Because of their extremely 
thick and tough cell walls, plant cells have not been emphasized. However, protoplasts and vesicles (which 
do not have walls) of many types have also been used. In addition to DNA introduction, imaginative use 
such as introduction of fluorescent indicators, small particles (including virus), antibodies against specific 
intracellular targets, and reagents (substrates and cofactors) for in situ enzyme assays have begun to be 
used. A general application involves in vitro loading of cells with drug molecules, with the goal of 
introducing loaded cells into the body for drug delivery purposes. 10,11 Another is cellular manipulation 
involving electroinsertion of proteins in cell membranes. 12 

94.7 In Vivo Electroporation Applications 

Drug delivery, the introduction of molecules to desired sites within the body, is increasing in importance. 13 
A common problem is overcoming tissue transport barriers, many of which contain lipid-bilayer mem- 
brane barriers. Recent success in biotechnology has produced new therapeutic molecules, many of which 
have short biological lifetimes. Important biomedical engineering goals are the provision of both larger 
drug delivery rates, and also greater control over delivery. This has made the use of tissue electroporation 
for local cancer tumor therapy, 3 localized gene therapy, 14,15 and transdermal drug delivery attractive. 6 

Tissue electroporation for enhanced local chemotherapy of solid cancer tumors ("ECT") is well 
underway, with some clinical trials undertaken. Most studies used an established drug, bleomycin, which 
ordinarily crosses the membrane poorly, but is very potent. ECT is attractive for treating accessible, local 
tumors, 3 and may be extended to include gene therapy. 14,15 There is, however, a practical dilemma, which 
may prevent ECT from being as rapidly used as its potential advantages suggest. Approved drugs enter 
cells by existing pathways (passive and active). Thus, although greatly reduced systemic side effects are 
expected if local tissue electroporation is used with highly charged drug molecules, such compounds will 
generally not be approved drugs. Modification of conventional anticancer drugs could be pursued: for 
example, charge groups could be purposefully added so that intracellular enzymes can subsequently 
remove them. Then these charged analogs could be locally introduced by electroporation, but should 
not enter cells systemically in other parts of the body. The dilemma is that the charged analogs would 
presumably have to pass through the fully regulatory approval process, and this is extremely expensive. 

Gene therapy is increasingly pursued as a revolutionary medical intervention. Its potential advantage 
and initial weakness are the same: in vivo alteration of genes is the aim. Most approaches are based on 
the use of viral vectors for two purposes: (1) delivery to the cell interior, and (2) insertion of functioning 
DNA. However, there are concerns that rare recombinant events could cause the viral vector to become 



© 2000 by CRC Press LLC 



an infectious agent. If the contemplated intervention requires deletion of a cell function, then electropo- 
ration may be inappropriate, as achieving treatment of -100% of the target cells would be difficult. 
However, if it is sufficient to alter a fraction of the cells (e.g., provision of a cell function such as secretion 
of a specific protein), then local tissue electroporation may provide a general approach. 14,15 

A very different type of tissue, the human skin, may also be treatable by electroporation. 6 The main 
motivation is transdermal drug delivery. Established transdermal methods are limited, in that the skin 
has such a large barrier that relatively few drugs can be delivered across the skin at therapeutic levels. 
Generally speaking, passive transdermal drug delivery is limited to small, lipophilic molecules such as 
scapolamine, nitroglycerine, and nicatine which readily permeate the skin; the much larger number of 
hydrophilic drugs do not cross the skin in therapeutically useful amounts. Moreover, delivery kinetics 
are slow, typically with onset times of hours. 

The origin of the large barrier function of human skin is the key to understanding this problem. It is 
generally agreed that the rate limiting barrier to molecular transport is the stratum corneum (SC). This 
structure comprises the outermost layer (~20 |im thick) of the skin, and is the source of the skin's 
mechanical and chemical protection capability. The SC can be regarded as a "brick wall" in which the 
"bricks" are the corneocytes, which are flattened remnants of epidermal cells that are enclosed in an 
unusual protein-lipid bilayer membrane, the corneocyte envelope. The interior of the corneocytes is 
occupied by crosslinked keratin, and this region increases several fold in volume as the skin hydrates. 
The "mortar" surrounding these "bricks" consists of multilamellar lipids, with above five or six bilayers 
between adjacent corneocytes. It is generally believed that intercellular pathways are responsible for most 
transdermal transport. This may include pathways between corneocytes within the SC, but may also 
involve the so-called "appendages," i.e., the sweat glands and ducts, and the hair follicles that penetrate 
the SC. Creation of aqueous pathways across the lipid bilayers should, in principle, increase the transport 
of water soluble drugs through both translipid and trans-corneocyte barriers, and also across the layers 
of cells that line the appendages. This involves application of high voltage ("HV"; 50 to 100 V across the 
skin); this is reasonable because the SC has about 100 bilayer membranes in series. HV pulses should 
lead to aqueous pathway creation in both the corneocyte envelopes and in the multilamellar lipid bilayers. 
Similarly, smaller pulses should create aqueous pathways across lipid-containing barriers in the appendages. 

Although only a qualitative argument, the idea that new aqueous pathways should be created by large 
electric field pulses is supported by the fundamental tendency of lipid-containing barriers to electropo- 
rate. Experiments also support this. Studies using HV have shown that charged molecules, which ordi- 
narily do not cross the skin at significant rates, can be rapidly transported at rates that could be 
therapeutic. For this reason, there is considerable interest in pursuing the development of this and other 
tissue electroporation applications in medicine. 
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Optical Sensors in Blood Gas Analyzers 

Modern anesthesiology and intensive care, neonatal care, and emergency medical facilities depend heavily 
upon the frequent measurements of a selected group of critical care analytes in blood. Tests recognized 
as being essential for patient management include pH and blood gas (0 2 and C0 2 partial pressures) and 
electrolytes (sodium, potassium, and calcium) measurements. Clinical care clinicians use information 
from blood gas reports in a variety of ways for patient interpretation and to establish plans for patient 
therapy. The measurement of pH and PC0 2 , for example, represents any disturbances in acid-base 
balance, and the calculation of the bicarbonate concentration from the Henderson-Hasselbalch equation 
gives the most complete and consistent model of acid-base physiology for use in infants, children, and 
adults with a variety of disorders. Several efforts, with different levels of success, have been made in order 
to determine the primary variables to be measured directly from whole blood in critical and emergency 
situations. In fact, in a different way from the clinical chemistry routine, some analytes must be deter- 
mined quickly in vitro at the patient's bedside or emergency site without modifying the sample. Instru- 
mentation for these measurements should be as effective as the in vivo monitoring and should be capable 
of making the same determinations as the clinical laboratory. 

Electrochemical sensors have been demonstrated to be the most appropriate way to achieve the 
objective of the measurement of blood components without any sample treatment. In 1905, the first pH 
glass electrochemical determination was demonstrated, and the first practical pH glass electrode for 
anaerobic blood pH determination was the capillary electrode of Mclnnes and Belcher introduced in 
early 1930. Blood pH was not considered interesting from a clinical point of view until 1952, when a 
polio epidemic was seen in Copenhagen and patients required artificial ventilation. Astrup adopted the 
measurement of pH and C0 2 content for calculation of the PC0 2 to teach students how to change the 
patient's ventilation towards a more normal level. In August 1954, Stow described his concepts and 
construction of a C0 2 electrode capable of use in measuring blood PC0 2 , sensitive to C0 2 but insensitive 
to acids and alkali in fluids of known PC0 2 . In 1953, Clark presented the first polyethylene-covered 
oxygen electrode. This marked an historic turning point in blood gas analysis and all of respiratory 
physiology. In 1959, the first example of a three-electrode blood gas apparatus containing a Beck- 
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man/Clark oxygen electrode, a Stow/Severinghaus C0 2 electrode, and a Mclnnes/Belcher pH electrode 
was described. After a few years (early 1960s), Thomas Ross, the founder of Instrumentation Laboratory, 
introduced the first commercial three-electrode system in the market (IL 113). Further development 
brought an interesting level of automation to blood gas analysis, satisfying user demands at the beginning 
ofthe 1970s when Radiometer introduced the first automated blood gas analyzer in 1973, ABL 1. Corning 
Diagnostics (now Bayer), Nova Biomedical, AVL, and Eschweiler are companies that have used the blood 
gas analyzer before the introduction of the portable blood gas analyzer by Mallinckrodt Sensor Systems 
(now IL), iSTAT (now Abbot/HP) and Diametrics. The portable blood gas analyzer was developed in 
order to bring the analyzer closer to the patients, the so-called Near Patient or Point of Care testing. 

Sensor technology has notably improved in the development and manufacturing of sensors for the 
direct determination of components of diagnostic interest such as sodium, potassium, calcium, lithium, 
magnesium, and chloride for the electrolytes, and glucose, lactate, urea, and creatinine for the substrates. 
This makes it possible to not only determine the blood pH and gases, but to do this together to other 
analytes so that a complete diagnostic profile of the patient can be obtained. 

In general, electrochemical sensors transform the information arising from the concentration of 
chemical species in solution or the interaction or reaction of a substance with its substrate to an electrical 
signal of an equivalent and measurable quantity, such as a potential, current, or charge transfer. Biomed- 
ical sensors work as an interface between the biological system and the electrical one, without modifying 
the characteristics of the two systems, guaranteeing the reliability of the resulting information. This 
appendix will go over the basic concepts of blood gas instruments describing the principles for the 
measurements of the signals coming from different sensors and the main criteria for data acquisition 
and reduction. It will conclude with an overview of the new trends in sensor technologies. 

A.l pH and PCO2 Electrochemical Sensors 

Electrochemical potentiometric systems constitute the main category of sensors for the determination 
of ionic and molecular species in blood, such as pH, PC0 2 , P0 2 , and electrolytes. pH and sodium 
potentiometric sensors are based on different glass membrane compositions obtained through the mix- 
ture of metal-alkali oxides. Potassium, calcium, chloride, lithium, and magnesium potentiometric sensors 
are based on polymeric matrix membranes containing the electroactive material that makes the mem- 
brane selective for the determination of the respective ionic species in the blood. The potentiometric 
sensors obtained by incorporating these membranes in their structures are called Ion Selective Electrodes 
(ISE). They make possible the definition of an electrochemical system in which a potential difference is 
seen at the sensor (ISE) when it is in contact with the sample to be measured (electrolytic solution). The 
resulting electrical signal is dependent on the concentration, or more correctly on the ionic activity, of 
the particular ionic species in the sample. 

Figure A.l schematically illustrates the main components of a potentiometric pH sensor and its 
reference electrode. In order to be able to measure the electrical signal present at the sensor/sample 
interface, the Ion Selective Electrode (indicator electrode) should be connected to one of the two input 
terminals of the potential measuring device, the Voltmeter, together with a second electrode, the reference 
electrode, connected to the second input terminal of the Voltmeter. The structure and characteristic of 
the reference electrode that is required to complete the potentiometric measuring system guarantees that 
the change of the measured electrical signal depends only on the concentration of the ionic species in 
solution interacting with the ISE membrane surface. The relationship between the ionic concentration 
ofthe species in solution (a) and the measured electrical signal (E) is represented by the Nernst equation: 

E = E° + K log a 

where E° is the standard potential ofthe electrochemical system including the reference electrode potential 
and K is the slope or the sensitivity of the electrochemical system expressed in mV/log a. 
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FIGURE A.l pH sensor and reference electrode connected to the measuring device. 
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FIGURE A.2 PC0 2 sensor structure. 

In the actual blood gas analytical instrumentation, the determination of C0 2 partial pressure (PC0 2 ) 
is performed with a potentiometric sensor, the so-called Stowe-Severinghaus sensor. 

The experimental setup represented in Fig. A.2 shows the schematic diagram of the electrochemical 
system for the measurement of a molecular species in solution such as C0 2 . 

Only 5% of the C0 2 that enters the blood remains in plasma; the balance enters the red cell. A small 
amount of the plasma C0 2 forms carbonic acid and the balance remains as dissolved C0 2 (dCO,) which 
exerts a pressure (Henry's law) measured as PC0 2 . The dC0 2 , in equilibrium in the sample, diffuses 
through a gas permeable membrane, and changes the pH of the thin layer of bicarbonate-based electrolyte. 



dC0 2 -> CQ 2 + H 2 <-> H 2 C0 3 o H 



+ HCO, 



(A.l) 



The pH variation of the bicarbonate solution, measured by a pH sensor, is directly proportional to 
the logarithm partial pressure of the C0 2 dissolved in the sample. 

The logarithmic output of these sensors creates problems of inaccuracy because small uncertainties 
in the measurement of the potential can cause significant variations in the determination of the concen- 
tration of the species to measure. Precision and accuracy in the measurement can be reached with 
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FIGURE A.3 Schematic view of the structure of an amperometric oxygen sensor. 

appropriate calibration procedures. The calibration procedures should be designed in order to minimize 
the influence of the reference electrode and the interference of other ionic species present in the sample 
on the measured potential. pH sensors are calibrated with buffer solutions having pHs referred to as 
NIST primary standards: phosphate buffers at pH = 7.392 and 6.839 at 37°C, with an ionic strength of 
0.1 mole/Kg. Gas sensors (PC0 2 and P0 2 ) are generally calibrated with humidified gas mixtures taking 
into account the barometric pressure and temperature of each gas as well as the vapor pressure of water. 

A.2 P0 2 Electrochemical Sensors of Dissolved Oxygen 

In the majority of commercially available blood gas analyzers, the oxygen partial pressure (P0 2 ) mea- 
surement is made by an electrochemical amperometric sensor introduced by Clark in the beginning of 
the 1950s. Most of the total oxygen content in blood is bound to hemoglobin. The remaining is dissolved 
in plasma, and P0 2 (the measured quantity) is directly related to the concentration of 2 dissolved in 
plasma. Figure A.3 shows a schematic diagram of a typical amperometric oxygen sensor. 

The amperometric sensor for the determination of the partial pressure of molecular oxygen in an 
aqueous solution depends on the chemical reduction of oxygen at a metallic or other conducting surface 
(working electrode) when a potential difference between the reference electrode and the working electrode 
is applied by an external voltage source. This constitutes the difference between amperometric and 
potentiometric sensors; an external energy source drives an electrochemical reaction of the analyte 
resulting in a current that is proportional to the analyte partial pressure in the amperometric case, while 
potentiometric measurements such as pH and PC0 2 take place without any applied voltage and no 
current flow in the measuring circuit. 

The oxygen sensor measures the current originated during the reduction of oxygen according to the 
reaction: 



2 + 4e + 4H <->H 2 



(A.2) 



to a platinum electrode to which a proper voltage to support the reaction (ca. -0.7 V) vs. an Ag/AgCl 
reference/anode electrode) has been applied. 

The platinum sensor is covered by an oxygen permeable membrane that protects it from a possible 
poisoning caused by the presence of different substances present in the sample, such as the proteins, and 
controls the oxygen diffusion at the platinum surface. 

Silver/silver chloride (Ag/AgCl) is the most common reference electrode used in the amperometric 
oxygen sensor. In fact, the chloride ions contained in the electrolyte reacts with the silver anode to form 
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FIGURE A.4 Equivalent circuit of a potentiometric electrode. 

silver chloride and the electrons produced during this oxidation reaction are used for the reduction of 
oxygen at the platinum surface. The resulting electron flow can be measured as a current (generally in 
the range of to 10 nA) that is directly proportional to the P02 in the blood sample. The current is also 
proportional to the surface area of the working electrode, and small electrode surface areas are used to 
minimize oxygen consumption from the sample. 

A.3 Equivalent Electronic Circuit for a Potentiometric 
Electrochemical Cell 

An ion selective electrode, together with the reference electrode, can be represented by an equivalent 
circuit comprised of resistances, capacitances, and fixed potentials representing the different components 
of the reaction. Such a circuit for a complete ion selective electrode is shown in Fig. A.4. This represen- 
tation is useful to better understanding the requirements for the measuring device. The potential that 
has to be measured is E and it is related to a corresponding change of concentration of the measured 
ion via the Nernst equation as specified before. All other circuit components must remain constant. 

RS represents the sum of all the resistances and RI represents the input impedance of the measuring 
system (voltmeter). The input resistance of the measuring system must be much greater than the source 
resistance RS. Otherwise, the potential difference of the potentiometric electrochemical cell will be loaded 
significantly by the input resistance RI, and so the measurement will be in error. In order to maintain 
this error below 0.1%, the input impedance of the indicating instrument must be larger than the source 
resistance by a factor of 1000. For example, the impedance of a glass pH electrode is typically up to 500 
Mohm, so the input impedance of the instrument must be greater than 1012 Ohm. 

A.4 General Aspects of a Measuring Device 

In general, a blood gas analyzer including electrodes such as for pH, PC0 2 , and P0 2 can be schematized 
as in Fig. A.5. 

The electronic circuits can be divided into three main sections. The first block is the input stage for 
the potentiometric sensors that transform the high source impedance of the sensor to a low impedance. 
The second section consists of the amplification stage. In the figure the blocks are described as pH and 
PC0 2 ion differential amplifiers, but it is possible to use a single ended amplifier as well, although this 
can result in increased noise. The third section is a multiplexer that selects the channel to be measured. 
As is seen in the figure, in some measuring devices current leakage detector circuits are also included 
mainly for the P0 2 and PC0 2 sensors in order to measure the extra current related to damage of the 
membranes. Leaving these currents undetected can result in an error in the measurement. In Fig. A.6a, 
a schematic of the input circuit for the pH electrode is presented. The amplifier used is a very high 
impedance amplifier. This is a unity-gain amplifier, so the output is the same as the input, but the sensor 
is isolated. Furthermore, the bias current must be very low, otherwise it could polarize the electrode. 
The guard provides an equal potential surrounding the input for shielding and to eliminate leakage 
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FIGURE A.5 Block diagram of a blood gas analyzer including the following stages: electrodes, analog signal 
processing, signal conversion to digital form, and the microprocessor. 
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FIGURE A.6 (a) Schematic of the input circuit for a pH electrode, (b) Differential amplifier with pH electrode and 
reference electrode inputs, (c) Current/Voltage converter used for an oxygen sensor. 

current. The inputs to the differential amplifier are the signals coming from the indicator and reference 
electrodes, and the output is the difference between the two inputs multiplied by a factor G, that is 
determined by the resistance Rg (Fig. A.6b). The differential amplifier is important because it provides 
the proper isolation to the measuring circuit in order to avoid current circulating through the electrodes 
when the sample solution is grounded. Furthermore, the differential amplifier can decrease the level of 
noise that occurs in the measurement due to its high common mode rejection ratio. 

The signal coming from the P02 electrode needs to be converted from current to voltage, so the circuit 
simply consists of a current/voltage converter that also amplifies the signal (Fig. A.6c). 

The final stage of the schematic consists of a multiplexer that selects one channel at time to be converted 
into a digital signal by an A/D converter and further processed by a microprocessor. 
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In order to appreciate small variations in the signal, it is necessary to use an A/D converter with a 
large range (higher number of bits). The microprocessor and associated memory will store the data and 
subsequently interact with it. Appropriate programs will provide methods to manipulate, extract infor- 
mation, and calculate a variety of quantities from the acquired data to give the final results. 

A.5 Factors that Influence the Measurements 

In automatic electrochemical analytical equipment, such as pH blood gas analyzers, the flow channel 
where the sample is analyzed and is in contact with the sensor often has metal electrodes that detect the 
presence of the sample or flush solutions. This sensor allows the sample to be automatically pumped 
into the flow channel and minimizes the sample volume while optimizing the analytical cycles including 
the calibration. It also helps to reduce the carry-over effect between samples. Insufficient isolation of the 
reference electrode from ground and these electrodes can cause a current to flow in the measuring 
apparatus that can produce measurement errors and add noise to the measurement. This can cause the 
reference electrode to become polarized, even though almost no current is flowing through the indicator 
electrode. Thus, it must also be well isolated from ground problems with the electrochemical measure- 
ment. 

Additional problems with the electrochemical measurement can be caused by static charging phenom- 
ena. Static charge at the amplifier input produces a voltage across the capacitor formed by the cable core 
and its shielding. This charge, and hence the voltage, may take a long time to dissipate due to the high 
ohmic electrodes. Movement of the cable can change its capacitance, which can also produce a voltage 
transient and noise. In addition to the effects of the changes in the electrode cable capacitance, high 
input impedance circuits are also very sensitive to the electrostatic induction. For this reason, the 
electrodes and input circuits should have shielding that comes as close as possible to the electrode tip. 
With some of the micro ion selective electrodes, stable electrometer readings can only be obtained by 
surrounding the electrode module with a conductive shield connected to the circuit ground. Therefore, 
in summary there are different ways to reduce the noise level involved with the analog signals coming 
from the electrodes and from the instrumentation, such as: 

• shielded cable 

• low-pass filtering before the differential amplifier 

• the differential amplifier itself 

Then further noise reduction can be obtained using software techniques. 

There are other factors that can introduce significant errors in the measurements such as the temper- 
ature and the barometric pressure. In fact, it is very important that the electrodes and the sample remain 
at a constant temperature, around 37°C. In Table A.l, the effect of temperature on sample measurement 
precision for pH, PC0 2 , and P0 2 is reported. 

Table A.l Effect of a Temperature Error (±1 deg) on Sample Measurement Precision 

Temperature Precision 
Required to Meet Required 
Measurement Precision 
Analyte Sample Sensor Total Required Precision (1 SD) (°C) 



pH (pH Unit) 


0.015 


0.025 


0.010 


0.003 


0.3 


pC02 (%) 


4.0 


0.3 


4.3 


0.7 


0.2 


p02 (%) 


7.0 


1.0 


6.0 


0.7 


0.1 



The electrodes and the sample are usually heated by means of an electrical heater and water bath, and 
the temperature is continuously regulated by electronic controls. The sample has to be thermostated as 
well in order to maintain a thermic equilibrium at the sample-electrode interface. Another factor that 
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can influence the measurement is the barometric pressure. In fact, gas sensors have traditionally been 
calibrated with humidified gas mixtures, and the barometric pressure (BP) plays an important role in 
determining the partial pressure of each gas. 

The partial pressure (PX) of any gas in a mixture is determined by: 

PX = (BP-47)(mole% of X/100) 

where: 47 = partial pressure of water vapor at 37°C. 

The barometric pressure is continuously measured by means of a pressure transducer and the measured 
pressure is used to calculate the sample value. 

A.6 New Generations of pH and Blood Gas Sensors 

In blood gas instrumentation, development has been focused on discrete analyzers automatically per- 
forming those functions that are fundamental to ensure the reliability and accuracy of the result, such as: 

• automatic calibration, sample measurement and result calculation 

• detection of errors and malfunctions 

• automatic quality control and diagnosis 

Nevertheless, this kind of blood gas analyzer is most reliable when used by laboratory trained personal, 
because regardless of elevated levels of automation, complicated procedures of maintenance and quality 
control are still sometimes necessary. To reduce this problem, specific satellite laboratories close to the 
patient locations are replacing the central laboratory. These have been created with the objective of 
providing results in a very short time after having collected the sample, while maintaining maximum 
accuracy and precision. 

This arrangement has obviously created additional management expenses and has contributed to the 
development of alternative technologies for blood gas sensors more suitable to be integrated in analytical 
instrumentation to be used near the patient such as 

• discrete analyzers based on multiple or single use sensors (Near Patient or Point of Care Testing) 

• non-invasive extra-corporeal continuous monitors 

• invasive sensors (in vivo) 

In the last ten years, thick- and thin-film microfabrication technology has allowed miniaturization of 
the sensor's notable improvements. These technologies allow a high level of miniaturization of the 
electrochemical sensors, making these sensors economically interesting and particularly reliable for the 
development of small and portable analytical systems. Unfortunately, the good results derived from the 
use of these technologies are still dependent on the training level of the operator. 

Mass microfabrication techniques for sensors are focusing on: 

• improved sensor uniformity 

• simplified calibration routines 

• multi-analyte device development 

• low manufacturing costs 

• disposable sensors 

• reduction of instrument maintenance procedures 

• application closer to the patient 

• rapid turnaround time 

• ultimate use by non-laboratory personnel 
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These new technologies allow the integration of sensors with sample flow channels and front-end 
electronic systems, yet the basic principles of the fundamental sensing systems remain the same. Thus, 
the improvements are the result of precise control of sensor element geometry and reproducible pattern- 
ing on a small scale rather than any different analytical principles. 

A.7 Continuous Monitoring of Blood Gases 

Although most blood gas measurement is performed on blood samples taken from a patient and analyzed 
either in the clinical chemistry laboratory or nearby using point of care analyzers, there are circumstances 
in critical care medicine where it is important to continuously monitor a patient's blood gas status. This 
is particularly true in critical care applications such as in the operating and recovery rooms, as well as 
in various intensive care units. It is important for clinicians managing these patients to be alerted to 
changes in blood gas status as they occur so that immediate therapeutic measures can be taken. Although 
accuracy is always important in a measurement, precision is a critical factor in being able to identify 
blood gas changes as they occur. Often it is possible to make an individual determination from a blood 
sample to periodically calibrate continuous monitors when there are questions regarding their accuracy, 
but if an instrument is unable to recognize changes in blood gas status as they occur, this can lead to 
possible serious problems for the patient. 

Both invasive and noninvasive methods can be used to continuously determine blood gas status. 
Invasive methods involve the use of catheters or probes containing individual or multiple sensors. These 
sensors can either be electrochemical or optical-based devices, and they must be very small to fit in the 
blood vessels or catheters without seriously interfering with flow. Optical techniques involve the use of 
fiber optic sensors as described above. Sensors that determine oxygen, pH, and C02 have been described 
and have been used individually or together in catheters for monitoring blood gas status. Electrochemical 
sensors consist of oxygen or pH electrodes that are placed in a blood vessel or in tissue. In the case of 
oxygen, a small wire can be extruded in the catheter wall such that a cross-section of the wire is exposed 
at the tip of the catheter. This can constitute the working electrode of an amperometric oxygen sensor. 
It is also possible to incorporate a miniature Clark electrode in the wall of the catheter, but this is a much 
more difficult fabrication procedure. 

Catheter-based fiber optic sensors have the optical fibers embedded in the wall of the catheter and use 
the fluorescence or absorption techniques described above to determine analyte concentrations. 

Invasive measurements of blood gases can also be performed by minimally invading the airway. A 
small catheter introduced into the nose can be used to sample exhaled air by drawing some of it into the 
tube. The proximal end of the tube can have a mass spectrometer or infrared absorption cuvette to 
determine the amount of carbon dioxide in the sampled air. Often the concentration of carbon dioxide 
in the air at the end of exhalation is representative of the alveolar air which in turn reflects the carbon 
dioxide in the venous blood. Thus, by measuring the end tidal carbon dioxide partial pressure, one can 
infer the value for blood. 

Noninvasive blood gas sensors include devices for transcutaneous electrochemical measurements and 
transmission or reflection optical devices. In the former category we find the transcutaneous oxygen 
sensor that consists of a Clark electrode that is in contact with the skin. In addition to the oxygen 
permeable membrane on the Clark electrode, the skin serves as a "membrane" separating the electro- 
chemical sensor from the blood. By heating the skin in the region of the electrode to a temperature of 
around 44°C, the blood flow in that portion of skin is greatly increased and the blood in the dermal 
capillaries will have a similar blood gas composition to arterial blood. Thus, by heating it is possible to 
indirectly measure arterial oxygen tension noninvasively. Unfortunately, this technique is extremely 
sensitive to methods of sensor placement and skin preparation. It is also sensitive to variations in skin 
anatomy, and for that reason absolute measurements of transcutaneous oxygen tension do not agree very 
closely with simultaneous arterial oxygen tension measurements in adults. On the other hand, much 
closer agreement is obtained from neonates and infants. 
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Transcutaneous carbon dioxide sensors are similar to the transcutaneous oxygen sensor but utilize a 
Stowe-Severinghaus electrode instead of the Clark electrode. These devices not only function for neonates, 
but have been used in adults as well. Combination electrodes containing both a Clark oxygen sensor and 
a Stowe-Severinghaus carbon dioxide sensor have been developed and made available as commercial 
products. A common limitation for all transcutaneous blood gas sensors is the need to heat the sensor 
to temperatures as high as 44°C. When sensors are left in place for several hours, skin damage can result 
from the elevated temperature. It is, therefore, an accepted rule of thumb that sensors should not remain 
in one location for longer than four hours. 

One of the most popular noninvasive blood gas instruments is the pulse oximeter. This device measures 
pulsitile changes in the intensity of light transmitted or reflected through portions of the body. Usually 
two wavelengths are used, one at an isobestic point and the other at a wavelength where there are large 
differences between the optical spectrum of oxygenated and deoxygenated hemoglobin. Further details 
on pulse oximeter can be found in Chapter 50 "Optical Sensors". 

A.8 Optical Sensors in Blood Gas Analyzers 

Optical sensors are generally based on the use of optical fibers for the quantitative determination of 
species diluted in a solution. The measurement is based on the optical properties of the system such as: 

• The analyte's influence on the optical characteristics of the system 

■ Reaction of the analyte with an intermediate substance that changes its optical properties as a 
result of the reaction. 

Sensors based on optical fibers have the advantage of being easily miniaturized and are a relatively low 
cost. They do not require an external reference signal as the electrochemical sensors, but their use life is 
often limited in terms of stability. More recently, new optical methods for the determination of the partial 
pressure of oxygen have been developed. 

One technique that is frequently used involves the use of a fluorescent dye trapped or immobilized in 
a thin layer adjacent to an optical fiber. The intensity of the fluorescence emitted from the dye is a function 
of the oxygen partial pressure level of the sample in contact with the dye. The sensitivity level of these 
systems is critical, and they need light emitting sources and detection systems of very high quality and 
stability in order to have an adequate signal resolution, in particular for low levels of oxygen. 

Recently new optical methods have been developed for pH and the electrolytes based on the immo- 
bilization of indicators that have functional base-acid groups in their structure. The protonization or 
deprotonization of these functional groups modifies the absorption spectrum of these molecules which 
can then be measured directly or by means of a detection system for fluorescence similar to that described 
above for P0 2 . However, there are fundamental differences, such as the sample ionic strength, that can 
influence the accuracy of measurements and that are potential sources of error in the analytical reliability 
of these sensors. The determination of the C0 2 partial pressure can also be done by optical sensors based 
on the measurement of the equilibrium pH of the thin layer of the bicarbonate electrolyte that is in 
intimate contact with a gas permeable membrane and that separates it from the sample to measure. This 
measurement can be made either by adsorbance or fluorescence. A proper acid-base indicator with a 
pKa corresponding to the central portion of the pH range for normal values of C0 2 partial pressure is 
immobilized on a gas permeable membrane and is placed in direct contact with the distal end of a single 
or multiple optical fiber. The intensity of the adsorbance or the fluorescence changes as a function of 
the pH in the thin layer of the bicarbonate-based electrolyte, and this, in turn, is a function of the PC0 2 
level of the sample. Typical transduction chemicals used to construct optical sensors are listed in Table A.2. 

Table A.2 Typical Transduction Elements Used to Construct Optical Sensors 

Analyte Example of Sensing Chemical Sensing Scheme 

pH / PC0 2 Fluorescein (acid-base indicator) pH dependent fluorescence 
P0 2 Ruthenium complexes Quenching of fluorescence by O, 
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The intention of this section is to couple the concepts of transport phenomena with chemical 
reaction kinetics and thermodynamics to introduce the field of reaction engineering. This is 
essential information needed to design and control engineering devices, particularly flow reactor 
systems. Through extension of these concepts, combined with materials design, to mimic biological 
systems in form and function, the field of biomimicry has evolved. The development of Biomimetic 
Systems is a rapidly emerging technology with expanding applications. Specialized journals, e.g., Biomi- 
metics, devoted to exploring both the analysis of existing biological materials and processes, and the 
design and production of synthetic analogs that mimic biological properties are now emerging. These 
journals blend a biological approach with a materials/engineering science viewpoint covering topics that 
include: analysis of the design criteria used by organisms in the selection of specific biosynthetic materials 
and structures; analysis of the optimization criteria used in natural systems; development of systems 
modeled on biological analogs; and applications of "intelligent" or "smart" materials in areas such as 
biosensors, robotics, and aerospace. This biomimicry theme is prevalent throughout all the chapters in 
this section, including one specifically devoted to these concepts with explicit examples of applications 
to reacting and transport processes. 

The field of transport phenomena traditionally encompasses the subjects of momentum transport 
(viscous flow), energy transport (heat conduction, convection, and radiation), and mass transport (dif- 
fusion). In this section the media in which the transport occurs is regarded as continua; however, some 
molecular explanations are discussed. The continuum approach is of more immediate interest to engi- 
neers, but both approaches are required to thoroughly master the subject. The current emphasis in 
engineering education is on understanding basic physical principles vs. "blind" use of empiricism. 
Consequently, it is imperative that the reader seek further edification in classical transport phenomena 
texts; general [Bird et al., 1960], with chemically reactive systems [Rosner, 1986], and more specifically, 
with a biologically oriented approach [Lightfoot, 1974] . The laws (conservation principles, etc.) governing 
such transport will be seen to influence: (1) the local rates at which reactants encounter one another; 
(2) the temperature field within body regions (or compartments in pharmacokinetics modeling); (3) the 
volume (or area) needed to accomplish the desired turnover or transport rates; and (4) the amount and 
fate of species involved in mass transport and metabolic rates. For biomedical systems such as dialysis, 
in which only physical changes occur, the same general principles, usually with more simplifications, are 
used to design and analyze these devices. The transport laws governing non-reactive systems can often 
be used to make rational predictions of the behavior of "analogous" reacting systems. The importance 
of relative time scales will be discussed throughout this section by all authors. It is particularly important 
to establish orders-of-magnitude and to make realistic limiting calculations. Dimensional analysis and 
pharmacokinetic modeling techniques (see other sections of the handbook and [Enderle et al., 1999; 
Fisher, 1999] are especially attractive for these purposes; in fact, they may permit unifying the whole of 
biological transport. As the reader progresses, the importance of transport phenomena in applied biology 
becomes steadily more apparent. "In all living organisms, but most especially the higher animals, diffu- 
sional and flow limitations are of critical importance; moreover, we live in a very delicate state of balance 
with respect to these two processes" [Lightfoot, 1974], 

Each chapter in this section is somewhat self-contained. Similar concepts are brought forth and 
reinforced through applications and discussions. However, to further enhance the benefits obtained by 
the reader, it is prudent to first discuss some elementary concepts; the most relevant being control volume 
selection and flow reactors. 

When applying the conservation laws to fluid matter treated as a continuum, the question arises as to 
the amount of matter to be considered. Typically this decision is based on convenience and/or level of 
detail required. There is no single choice. Many possibilities exist that can lead to the same useful 
predictions. The conservation laws of continuum dynamics can be applied to the fluid contained in a 
volume of arbitrary size, shape, and state of motion. The volume selected is termed a "control volume." 
The simplest is one where every point on its surface moves with the local fluid velocity. It is called a 
"material" control volume since, in the absence of diffusion across its interface, it retains the material 
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originally present within its control surface. Although conceptually simple, they are not readily used since 
they move through space, change their volume, and deform. An analysis of the motion of material control 
volumes is usually termed "Legrangian" and time derivatives are termed "material" or "substantial" 
derivatives. 

Another simple class of control volumes is defined by surfaces fixed in physical space, through which 
the fluid flows. These "fixed" control volumes are termed "Eularian" and may be either macroscopic or 
differential in any or all directions. The fluid contained within a Eularian control volume is said to be, 
in thermodynamic terms, an "open" (flow) system. 

The most general type of control volume is defined by surfaces that move "arbitrarily", i.e., not related 
to the local fluid velocity. Such control volumes are used to analyze the behavior of non-material "waves" 
in fluids, as well as moving phase boundaries in the presence of mass transfer across the interface [Crank, 
1956; Fisher, 1989]. 

Characterization of the mass transfer processes in bioreactors, as used in cell or tissue culture systems, 
is essential when designing and evaluating their performance. Flow reactor systems bring together various 
reactants in a continuous fashion while simultaneously withdrawing products and excess reactants. These 
reactors generally provide optimum productivity and performance [Levenspiel, 1989; Shuler and Kargi, 
1992]. They are classified as either tank or tube type reactors. Each represents extremes in the behavior 
of the gross fluid motion. Tank type reactors are characterized by instant and complete mixing of the 
contents and are therefore called perfectly mixed, or backmixed, reactors. Tube type reactors are char- 
acterized by the lack of mixing in the flow direction and are called plug flow, or tubular, reactors. The 
performance of actual reactors, though not fully represented by these idealized flow patterns, may match 
them so closely that they can be modeled as such with negligible error. Others can be modeled as 
combinations of tank and tube type over various regions. 

An idealized backmixed reactor is analyzed as follows. Consider a stirred vessel containing a known 
fluid volume, into which multiple streams may be flowing that contain reactants, enzymes (bio-catalysts), 
nutrients, reaction medium, and so on. When all these components are brought together in the vessel, 
under properly controlled conditions such as temperature, pressure, and concentrations of each compo- 
nent, the desired reactions occur. The vessel is well mixed to promote good contacting of all components 
and hence an efficient reaction scheme can be maintained. The well-mixed state is achieved when samples 
withdrawn from different locations, including the exit, at the same instance in time are indistinguishable. 
The system is termed "lumped" vs. "distributed", as in a plug flow system where location matters. The 
response characteristics of a backmixed reactor are significantly different from those of a plug flow reactor. 
How reaction time variations affect performance and how quickly each system responds to upsets in the 
process conditions are key factors. The backmixed system is far more sluggish than the plug flow system. 
To evaluate the role of reaction time variations, the concept of residence time and how it is determined 
must be discussed. A brief analysis of a batch reactor will be useful in understanding the basic principles 
involved. 

In batch systems, there is no flow in or out. The feed (initial charge) is placed in the reactor at the 
start of the process and the products are withdrawn all at once at some later time. The time-for-reaction 
is thus readily determined. This is significant since the conversion of reactants to products is a function 
of time, and can be obtained from knowledge of the reaction rate and its dependence upon composition 
and process variables. Since these other factors are typically controlled at a constant value, the time for 
reaction is the key parameter in the reactor design process. In a batch reactor, the concentration of 
reactants change with time and, therefore, the rate of reaction as well. Conversion is now related to 
reaction time through the use of calculus. For this system, residence time, equal to the processing time, 
is the time for reaction. 

In a backmixed reactor, the conversion of reactants is controlled by the average length of time fluid 
elements remain in the reactor (their residence time). The ratio of the volume of fluid in the tank to the 
volumetric flow rate of the exit stream determines the residence time. Recall that in an ideal system of 
this type, operating at steady state, concentrations in the vessel are uniform and equal to those in the 
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exit stream and thus the rate is also constant. Conversion is determined simply by multiplying this rate 
by the residence time. With imperfect mixing, fluid elements have a distribution of residence times and 
the performance of the reactor is clearly altered. 

The plug flow reactor, which is based on the premise that there is no mixing in the flow direction, 
implies no interaction between neighboring fluid elements as they traverse the length of the reactor. This 
idealization permits the use of the prior results from batch reactor analysis. Each fluid element in the 
tubular reactor functions as a small batch reactor, undisturbed by its neighboring elements, and its 
reaction time is well defined as the ratio of the length of the tube to the fluid velocity. Concentration 
varies along the length of the reactor (also, rate), and can be simply related to reaction time. The 
mathematical analysis and prediction of performance is similar to that for a batch reactor. This analysis 
shows that the plug flow configuration yields higher conversions, given equal residence times, for the 
same processing conditions. However, the plug flow reactor responds more quickly to system upsets and 
is more difficult to control. 

Most actual reactors deviate from these idealized systems primarily because of non-uniform velocity 
profiles, channeling and bypassing of fluids, and the presence of stagnant regions caused by reactor shape 
and internal components such as baffles, heat transfer coils, and measurement probes. Disruptions to 
the flow path are common when dealing with heterogeneous systems, particularly when solids are present. 
To model these actual reactors, various regions are compartmentalized and represented as combinations 
of plug flow and backmixed elements. For illustration, a brief discussion of recycle, packed bed, and 
fluidized bed reactor systems follows. 

Recycle reactors are basically plug flow reactors with a portion of the exit stream recycled to the inlet, 
which provides multiple passes through the reactor to increase conversion. It is particularly useful in 
bio-catalytic reactor designs, in which the use of a packed bed is desired because of physical problems 
associated with contacting and subsequent separation of the phases. The recycle reactor provides an 
excellent means to obtain backmixed behavior from a physically configured tubular reactor. Multiple 
reactors of various types, combined in series and/or parallel arrangements, can improve performance 
and meet other physical requirements. Contact patterns using multiple entries into a single reactor can 
emulate these situations. A system demonstrating this characteristic is a plug flow reactor with uniform 
side entry. If these entry points are distributed along the length with no front entry, the system will 
perform as a single backmixed system. The significance is that backmixed behavior is obtained without 
continual stirring or the use of recycle pumps. Furthermore, if these side entry points are limited to only 
a portion of the reactor length, then the system functions as backmixed reactors in series with plug flow 
reactors. 

Special consideration needs to be given to heterogeneous reactors, in which interaction of the phases 
is required for the reactions to proceed. In these situations, the rate of reaction may not be the deciding 
factor in the reactor design. The rate of transport of reactants and/or products from one phase to another 
can limit the rate at which products are obtained. For example, if reactants cannot get to the surface of 
a solid catalyst faster than they would react at the surface, then the overall (observed) rate of the process 
is controlled by this mass transfer step. To improve the rate, the mass transfer must be increased. It would 
be useless to make changes that would affect only the surface reaction rate. Furthermore, if products do 
not leave the surface rapidly, they may block reaction sites and thus limit the overall rate. Efficient 
contacting patterns need to be utilized. Hence, fluidized bed reactors (two-phase backmixed emulator), 
trickle bed systems (three-phase packed bed emulator), and slurry reactors (three-phase backmixed 
emulator) have evolved as important bioreactors. They are readily simulated, designed, scaled up, and 
modified to meet specific contacting demands in cell and tissue culture systems. 

Flow reactors of all shapes, sizes, and uses are encountered in all phases of life. Examples of interest 
to bioengineers include: the pharmaceutical industry, to produce aspirin, penicillin, and other drugs; the 
biomass processing industry, to produce alcohol, enzymes and other specialty proteins, and value added 
products; and the biotechnologically important tissue and cell culture systems. The type of reactor used 
depends on the specific application and on the scale desired. The choice is based on a number of factors. 
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The primary ones are the reaction rate (or other rate-limiting process), the product distribution speci- 
fications, and the catalyst or other material characteristics, such as chemical and physical stability. 

Using living systems require more thorough discussions because, in these systems, the solid phase may 
change its dimensions as the reaction proceeds. Particles that increase in size can fall out of the reaction 
zone and change flow patterns at the inlet. This occurs when gravity overcomes fluid buoyancy and drag 
forces. In some instances, this biomass growth is the desired product; however usually, the substances 
produced by reactions catalyzed by the cellular enzymes that are the desired products. By reproducing 
themselves, more enzymes are generated and productivity increases. Note, however, that there are both 
advantages and disadvantages to using these whole cell systems vs. the enzymes directly. The cell mem- 
branes can be resistances to transport; the consumption of reactant as a nutrient reduces the efficiency 
of raw material usage; and special precautions must be taken to maintain a healthy environment and 
thus productivity. The payoff, however, is that the enzymes within their natural environment are generally 
more active and safer from poisons or other factors that could reduce their effectiveness. Interesting 
examples are microbial fermentation processes, as discussed earlier. Backmixed flow reactors are used in 
these applications. They are best suited to maintain the cell line at a particular stage in its life cycle to 
obtain the desired results. The uniform and constant environment provided for the cells minimizes the 
adjustments that they need to make concerning nutrient changes, metabolic wastes, and so forth, so that 
production can proceed at a constant rate. The term "chemostat" is used when referring to backmixed 
reactors used in biotechnology applications. These are typically tank type systems with mechanical 
agitation. All the reactor types discussed earlier, however, are applicable. Recall that mixed flow charac- 
teristics can be obtained in tubular reactors if recycle and/or side entry is employed. Thus, air lift systems 
using a vertical column with recycle and side entry ports is a popular design. 

The design, analysis, and simulation of reactors thus becomes an integral part of the bioengineering 
profession. The study of chemical kinetics, particularly when coupled with complex physical phenomena, 
such as the transport of heat, mass, and momentum, is required to determine or predict reactor perfor- 
mance. It thus becomes imperative to uncouple and unmask the fundamental phenomenological events 
in reactors and to subsequently incorporate them in a concerted manner to meet the objectives of specific 
applications. This need further emphasizes the role played by the physical aspects of reactor behavior in 
the stability and controllability of the entire process. The following chapters in this section demonstrate 
the importance of all the concepts presented in this introduction. 
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Humankind has always been fascinated by the phenomenological events, biological and physical in nature, 
revealed to us by our environment. Curiosity drives us to study these observations and understand the 
fundamental basis of the mechanisms involved. The goals have been to control them for our safety and 
comfort and/or design processes to mimic the beneficial aspects. Experience has demonstrated the 
complexity and durability of these natural processes and that adaptability with multi-functionality is a 
must for biological systems to survive. Evolution of these living systems to adapt to new environmental 
challenges is at the molecular scale. Our need to be molecular scientists and engineers is thus apparent. 
The knowledge of the molecular building blocks used in the architectural configurations of both living 
and non-living systems, along with the understanding of the design and processes used for implemen- 
tation, is but one step. To control and utilize these resources is another. The ability to mimic demonstrates 
a sufficient knowledge base to design systems for most conceivable applications requiring controlled 
functionality. To perfect this approach, a series of sensor/reporter systems must be available. A particularly 
attractive feature of living systems is their unique ability to diagnose and repair localized damages through 
a continuously distributed sensor network. Mimicry of these networks is an integral component of this 
emerging research area. The primary approach is to couple transformation and separation technology 
with detection and control systems. A major research emphasis has been toward the development of 
intelligent membranes, specifically sensor/reporter technology. 

The concept of intelligent barriers and substrates, such as membranes, arises from the coupling of this 
sensor/reporter technology with controlled chemistry and reaction engineering, selective transport phe- 
nomena, and innovative systems design. Engineered membrane mimetics are required to respond and 
adapt to environmental stresses, whether intentionally imposed or stochastic in nature. These intelligent 
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membranes may take the form of polymeric films, composite materials, ceramics, supported liquid 
membranes, or as laminates. Their important feature is specific chemical functionality, engineered to 
provide selective transport, structural integrity, controlled stability and release, and sensor/reporter 
capabilities. Applications as active transport mimics and their use in studying the consequences of 
environmental stresses, such as ultrasonic irradiation and electromagnetic fields on enzymatic functions, 
will create valuable insight into cellular mechanisms. 

95.1 Concepts of Biomimicry 

Discoveries that have emerged from a wide spectrum of disciplines, from biotechnology and genetics to 
polymer and molecular engineering, are extending the range of design and manufacturing possibilities 
for mimetic systems that was once incomprehensible. Understanding of the fundamental concepts inher- 
ent in natural processes has led to a broad spectrum of new processes and materials modeled on these 
systems [Srinivasan et al., 1991]. Natural processes, such as active transport systems functioning in living 
systems, have been successfully mimicked and useful applications in other fields, such as pollution 
prevention, have been demonstrated [Thoresen and Fisher, 1995]. It is only through understanding the 
mechanisms on the molecular scale which living systems utilize to develop their processes and implement 
designs, that success at the macro-scale can be assured. Multi-functionality, hierarchical organization, 
adaptability, reliability, self- regulation, and reparability are the key elements that living systems rely upon 
and that we must mimic to develop "intelligent systems". Successes to date have been based on the use 
of molecular engineering of thin films, neural networks, reporter/sensor technology, morphology and 
properties developments in polymer blends, transport phenomena in stationary and reacting flow sys- 
tems, cell culture and immobilization techniques, controlled release and stability mechanisms, and 
environmental stress analyses. A few examples in the bioengineering field are molecular design of sup- 
ported liquid membranes to mimic active transport of ions; non-invasive sensors to monitor in vivo 
glucose concentrations; detection of microbial contamination by bioluminescence; immuno-magnetic 
capture of pathogens; improved encapsulation systems and carrier molecules for controlled release of 
pharmaceutics; in situ regeneration of co-enzymes electro-chemically; and measurement of transport and 
failure mechanisms in "smart" composites. All these successes were accomplished through interdiscipli- 
nary approaches, essentially using three major impact themes, as follows: Morphology and Properties 
Development; Molecular Engineering of Thin Films; and Biotechnology, Bioreaction Engineering, and 
Systems Development. 

Morphology and Properties Development 

Polymer blends are the current focus of this research area since they represent the most rapidly growing 
area of polymer technology [Weiss et al., 1995]. Technology, however, has outpaced a detailed under- 
standing of many facets of the science, which impedes the development and application of new materials. 
The purpose of this research is to develop the fundamental science that influences the phase behavior, 
phase architecture, and morphology and interfacial properties of polymer blends. The main medical 
areas where polymers and composites (polymeric) have found wide use are artificial organs, the cardio- 
vascular systems, orthopedics, dentistry, ophthalmology, drugs, and drug delivery systems. Success has 
been related to the wide range of mechanical properties, transformation processes (shape possibilities), 
and low production costs. The limitation has been their interaction with living tissue. To overcome the 
biological deficiencies of synthetic polymers and enhance mechanical characteristics, a class of bio- 
artificial polymeric materials has been introduced based on blends, composites, and interpenetrating 
polymer networks of both synthetic and biological polymers. Preparations from biopolymers such as 
fibrin, collagen, and hyaluronic acid with synthetic polymers such as polyurethane, poly(acrylic acid), 
and poly( vinyl alcohol) are available [Giusti et al., 1993]. 
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Molecular Engineering of Thin Films 

The focus in this research impact area is to develop a fundamental understanding of how morphology 
can be controlled in (1) organic thin film composites prepared by Langmuir-Blodgett (LB) monolayer 
and multilayer techniques and (2) molecular design of membrane systems using ionomers and selected 
supported liquids. The ability to form controlled structures of this nature will find immediate application 
in several aspects of smart materials development, particularly in micro-sensors. 

Surfaces, interfaces, and microstructures play an important role in many research frontiers. Exploration 
of structural property relationships at the atomic and molecular level, investigating elementary chemical 
and physical transformations occurring at phase boundaries, applying modern theoretical methods for 
predicting chemical dynamics at surfaces, and integration of this knowledge into models that can be used 
in process design and evaluation are within the realm of surface and interfacial engineering. The control 
of surface functionality by proper selection of the composition of the LB films can mimic many functions 
of a biologically active membrane. An informative comparison is that between inverted erythrocyte ghosts 
[Dinno et al., 1991] and their synthetic mimics when environmental stresses are imposed on both systems. 
These model systems can assist in mechanistic studies to understand the functional alterations that result 
from ultrasound, EM fields, and UV radiation. The behavior of carrier molecules and receptor site 
functionality must be mimicked properly along with simulating disturbances in the Proton Motive Force 
of viable cells. Use of ion/electron transport ionomers in membrane-catalyst preparations is beneficial 
for programs such as electro-enzymatic synthesis [Zawalich and Fisher, 1998]. Inventing new membranes 
for artificial organs and advances in micelle reaction systems has resulted from these efforts. 

Biotechnology, Bioreaction Engineering, and Systems Development 

This research impact area is focused on: (1) sensor/receptor reporter systems and detection methods; 
(2) transport processes in biological and synthetic membranes; (3) biomedical and bio-conversion process 
development; and (4) smart film development and applications for intelligent barrier systems. These 
topics require coupling with the previously discussed areas and the use of biochemical reaction engineer- 
ing techniques. Included in all of these areas is the concept of metabolic engineering, the modification 
of the metabolism of organisms to produce useful products. Extensive research in bio-conversion pro- 
cesses is currently being directed to producing important pharmaceutics. Future efforts also need to be 
directed toward impacting the emerging field of cell and tissue engineering, i.e., the manipulation or 
reconstruction of cell and tissue function using molecular approaches. 

95.2 Biomimetic Membranes for Ion Transport 

A cell must take nutrients from its extra-cellular environment if it is to grow or maintain its metabolic 
activity. The selectivity and rate that these molecular species enter can be important in regulatory 
processes. The mechanisms involved depend upon the size of the molecules to be transported across the 
cell membrane. These biological membranes consist of a continuous double layer of lipid molecules in 
which various membrane proteins are imbedded. Individual lipid molecules are able to diffuse rapidly 
within their own monolayer; however, they rarely "flip-flop" spontaneously between the two monolayers. 
These molecules are amphoteric and assemble spontaneously into bilayers when placed in water. Sealed 
compartments are thus formed, which reseal if torn. 

The topic of membrane transport is discussed in detail in many texts (e.g., [Alberts et al., 1989; 
Lehninger, 1988]). The following discussion is limited to membrane transport of small molecules, hence 
excluding macromolecules such as polypeptides, polysaccharides, and polynucleotides. The lipid bylayer 
is a highly impermeable barrier to most polar molecules and thus prevents the loss of the water soluble 
contents of the cell interior. Consequently, cells have developed special means to transport these species 
across their membranes. Specialized transmembrane proteins accomplish this, each responsible for the 
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transfer of a specific molecule or group of closely related molecules. The mechanism can be either energy 
independent, as in passive and facilitated diffusion, or energy dependent, as in active transport and group 
translocation. 

In passive diffusion, molecules are transported with (or "down") a concentration gradient that is 
thermodynamically favorable and can occur spontaneously. Facilitated diffusion utilizes a carrier mole- 
cule, imbedded in the membrane, that can combine specifically and reversibly with the molecule to be 
transported. This carrier protein undergoes conformational changes when the target molecule binds and 
again when it releases that molecule on the transverse side of the membrane. This binding is dependent 
on favorable thermodynamics related to the concentration of free vs. bound species. An equilibrium is 
established, as in a Langmuir isotherm, on both sides of the membrane. Thus, the rate of transport is 
proportional to concentration differences maintained on each side of the membrane and the direction 
of flow is down this gradient. Active transport is similar to facilitated transport in that a carrier protein 
is necessary; however, it occurs against (up) a concentration gradient, which is thermodynamically 
unfavorable and thus requires energy. Group translocation requires chemical modification of the sub- 
stance during the process of transport. This conversion process traps the molecule on a specific side of 
the membrane due to its asymmetric nature and the essential irreversibility of the transformation. These 
complexities lead to difficulties in mimicry; thus, research in this area is slow in developing. 

Several energy sources are possible for active transport, including electrostatic or pH gradients of the 
proton motive force (PMF), secondary gradients derived from the PMF by other active transport systems, 
and by the hydrolysis of ATP. The development of these ion gradients enables the cell to store potential 
energy in the form of these gradients. 

It is essential to realize that simple synthetic lipid bylayers, i.e., protein-free, can mimic only passive 
diffusion processes since they are impermeable to ions but freely permeable to water. Thermodynamically, 
virtually any molecule should diffuse across a protein-free, synthetic lipid bylayer down its concentration 
gradient. However, it is the rate of diffusion that is of concern, which is highly dependent upon the size 
of the molecule and its relative solubility in oil (i.e., the hydrophobic interior of the bylayer). Conse- 
quently, small non-polar molecules such as 2 readily diffuse. If small enough, uncharged polar molecules 
such as C0 2 , ethanol, and urea can diffuse rapidly, whereas glycerol is more difficult and glucose is 
essentially excluded. Water, because it has such a small volume and is uncharged, diffuses rapidly even 
though it is polar and relatively insoluble in the hydrophobic phase of the bylayer. Charged particles, on 
the other hand, no matter how small, such as Na + and K + , are essentially excluded. This is due to the 
charge and the high degree of hydration preventing them from entering the hydrocarbon phase. Quan- 
titatively, water permeates the bylayer at a rate 10 3 faster than urea, 10 6 faster than glucose, and 10 9 faster 
than small ions such as K + . Thus, only non-polar molecules and small uncharged polar molecules can 
cross the cellular lipid membrane directly by simple (passive) diffusion; others require specific membrane 
transport proteins, as either carriers or channels. Synthetic membranes can be designed for specific 
biomedical applications that can mimic the above discussed transport processes [Michaels, 1990]. Mem- 
brane selectivity and transport are enhanced with the aid of highly selective complexing agents, impreg- 
nated as either fixed site or mobile carriers. To use these membranes to their full potential, the mechanism 
of this diffusion needs to be thoroughly understood. 

Active Transport Biomimetics 

Extensive theoretical and experimental work has previously been reported for supported liquid membrane 
systems (SLMS) as effective mimics of active transport of ions [Thoresen and Fisher, 1995; Stockton and 
Fisher, 1998; Cussler et al., 1989; Kalachev et al., 1992]. This was successfully demonstrated using di- 
(2 ethyl hexyl) -phosphoric acid as the mobile carrier dissolved in n-dodecane, supported in various inert 
hydrophobic micro-porous matrices (e.g., polypropylene), with copper and nickel ions as the transported 
species. The results showed that a pH differential between the aqueous feed and strip streams, separated 
by the SLMS, mimics the PMF required for the active transport process that occurred. The model for 
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transport in an SLMS is represented by a five-step resistance-in-series approach, as follows: (1) diffusion 
of the ion through a hydrodynamic boundary layer; (2) desolvation of the ion, where it expels the water 
molecules in its coordination sphere and enters the organic phase via ion exchange with the mobile 
carrier at the feed/membrane interface; (3) diffusion of the ion-carrier complex across the SLMS to the 
strip/membrane interface; (4) solvation of the ion as it enters the aqueous strip solution via ion exchange; 
and (5) transport of the ion through the hydrodynamic boundary layer to the bulk stripping solution. 
A local Peclet number is used to characterize the hydrodynamics and the mass transfer occurring at the 
fluid/SLMS interface. The SLMS itself is modeled as a heterogeneous surface with mass transfer and 
reaction occurring only at active sites; in this case, the transverse pores. Long-term stability and toxicity 
problems limit their application, as configured above, in the biomedical arena. Use in combination with 
fixed site carrier membranes as entrapping barriers has great potential and is an active research area. 
Some success has been obtained using (1) reticulated vitreous carbon as the support matrix and Nafion® 
for the thin film "active barrier"; and (2) an ethylene-acrylic acid ionomer, utilizing the carboxylic acid 
groups as the fixed site carriers. The most probable design for biomedical applications appears to be a 
laminate composite system that incorporates less toxic SLMSs and highly selective molecularly engineered 
thin film entrapping membranes. Use of fixed site carrier membranes in these innovative designs requires 
knowledge of transport characteristics. Cussler et al. [1989] have theoretically predicted a jumping mech- 
anism for these systems. Kalachev et al. [1992] have shown that this mechanism can also occur in an 
SLMS at certain carrier concentrations. This mechanism allows for more efficient transport than common 
facilitated diffusion. Stability over time and a larger range of carrier concentrations where jumping occurs 
make fixed carrier membranes attractive for biomedical applications. A brief discussion of these jumping 
mechanisms follows. 

Mechanism for Facilitated Diffusion in Fixed 
Carrier Membranes 

A theory for the mechanism of diffusion through a membrane using a fixed carrier covalently bound to 
the solid matrix, was developed previously [Cussler et al., 1989]. The concept is that the solute molecule 
jumps from one carrier to the next in sequence. Facilitated diffusion can occur only if these "chained" 
carriers are reasonably close to each other and have some limited mobility. The advantages of using a 
chained carrier in a solid matrix vs. a mobile carrier in a liquid membrane are that the stability is 
improved; there is no potential for solvent loss from the system, and the transport may actually be 
enhanced. Their theory is compared to that for the mobile carriers in the SLMS. For the fixed carrier 
(chained) system, the assumptions of fast reactions and that they take place only at the interface are also 
used. The major difference is that the complex formed cannot diffuse across the membrane since the 
carrier is covalently bound to the polymer chain in the membrane. Although the complex does not diffuse 
in the classical random walk concept, it can "jiggle" around its equilibrium position. This movement can 
bring it into contact range with an uncomplexed carrier also "jiggling," and result in a reversible inter- 
action typical to normal receptor/ligand surface motion. It is assumed that no uncomplexed solute can 
pass through the membrane; it would be immobilized and taken from the diffusion process. The diffusion 
process that is operable is best explained by viewing the chained carrier membrane as a lamella structure 
where each layer is of thickness L. Every carrier can move a distance X around its neutral position and 
is a length L away from its neighbors. Diffusion can occur only over the distance X. Therefore, there is 
a specific concentration where a solute flux is first detected, termed percolation threshold, occurring 
when L = X. This threshold concentration is estimated as C = 1/(L 3 N a ), where C is the average concen- 
tration, L is the distance between carrier molecules, and N a is Avogadro's number. In summary, the 
mechanism is that of intramolecular diffusion; each chained carrier having limited mobility within the 
membrane. A carrier at the fluid membrane interface reacts with the species to be transported and 
subsequently comes in contact with an uncomplexed carrier and reacts with it, repeating this transfer 
process across the entire width of the membrane. 
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Jumping Mechanism in Immobilized Liquid Membranes 

Facilitated diffusion was studied in immobilized liquid membranes using a system composed of a micro- 
porous nitrocellulose film impregnated with tri-n-octylamine (TOA) in n-decane [Kalachev et al., 1992]. 
Experiments were monitored by measuring the conductivity of the feed and strip streams. The transport 
of ions (cobalt and iron) from an acidic feed (HC1) to a basic strip solution (NH 4 OH) was accomplished. 
Their results suggest that there are three distinct transport regimes operable in the membrane. The first 
occurs at short times and exhibits very little ion transport. This initial time is termed the ion penetration 
time and is simply the transport time across the membrane. At long times, a rapid increase in indiscrim- 
inate transport is observed. At this critical time and beyond, there are stability problems; that is, loss of 
solvent from the pores leading to the degradation of the membrane and the formation of channels that 
compromise the ion selective nature of the system and its barrier properties. 

Recall that their experiments were for a selective transport with (not against) the ion gradient. It is 
only in the intermediate time regime that actual facilitated transport occurs. In this second region, 
experiments were conducted using a cobalt feed solution for various times and carrier concentrations; 
all experiments showed a peak in flux. The velocity of the transported species can be obtained from these 
results and the penetration time vs. carrier concentration is available. At the threshold carrier concen- 
tration these researchers claim that the mechanism of transport is by jumping as proposed earlier for 
fixed site carriers. The carrier molecules are now close enough to participate in a "bucket brigade" 
transport mechanism. The carrier molecules use local mobility, made possible by a low viscous solution 
of n-decane, to oscillate, passing the transported species from one to another. This motion results in 
faster transport than common facilitated transport, which relies on the random walk concept and occurs 
at lower TOA concentrations. It is in this low concentration region that the carrier molecules are too far 
apart to participate in the jumping scheme. At higher concentrations, well above the threshold value, the 
increased viscosity interferes with carrier mobility; the jumping is less direct or does not occur because 
of the increased bonding sites and hence removal of the species from the transport process. 

95.3 Assessing Mass Transfer Resistances 
in Biomimetic Reactors 

Uncoupling Resistances 

Characterization of mass transfer limitations in biomimetic reactors is essential when designing and 
evaluating their performance. When used in Cell Culture Analog (CCA) systems, the proper mimicry of 
the role of intrinsic kinetics and transport phenomena cannot be over-emphasized. Lack of the desired 
similitude will negate the credibility of the phenomenological observations as pertaining to toxicity and/or 
pharmaceutical efficacy. The systems must be designed to allow manipulation, and thus control, of all 
interfacial events. The majority of material transfer studies for gaseous substrates are based on the 
assumption that the primary resistance is at the gas/liquid interface. Studies examining the use of hollow 
fiber membranes to enhance gas/liquid transport have been successfully conducted [Grasso et al., 1995]. 
The liquid/cell interfacial resistance is thus uncoupled from that of the gas/liquid interface and they can 
now be examined separately to evaluate their potential impacts. A reduction in the mean velocity gradient, 
while maintaining a constant substrate flux into the liquid, resulted in a shift in the limiting resistance 
from the gas/liquid to the liquid/cell interface. This shift manifested itself as an increase in the Monad 
apparent half-saturation constant for the chemo-autotrophic methanogenic microbial system selected as 
a convenient analog. The result of these studies significantly influences the design and/or evaluation of 
reactors used in the BME research area, especially for the animal surrogate or CCA systems. Although a 
reactor can be considered as well mixed based on spatial invariance in cell density, it was demonstrated 
that significant mass transfer resistance may remain at the liquid/cellular boundary layer. 
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There are three major points to be stressed. First, the liquid/cellular interface may contribute signifi- 
cantly to mass transfer limitations. Second, when mass transfer limitations exist the intrinsic biokinetics 
parameters cannot be determined. In biochemical reactor design, intrinsic parameters are essential to 
model adequately the system performance. Furthermore, without an understanding of the intrinsic 
biokinetics, one cannot accurately study transport mechanisms across biological membranes. The deter- 
mination of passive or active transport across membranes is strongly affected by the extent of the 
liquid/cellular interfacial resistance. 

Use in Physiologically Based Pharmacokinetics Models 
and Cell Culture Analog Systems 

The potential toxicity of, and/or the action of, a pharmaceutical is tested primarily using animal studies. 
Since this technique can be problematic from both a scientific and ethical basis [Gura, 1997], alternatives 
have been sought. In vitro methods using isolated cells [Del Raso, 1993] are inexpensive, quick, and 
generally present no ethical issues. However, the use of isolated cell cultures does not fully represent the 
full range of biochemical activity as in the whole organism. Tissue slices and engineered tissues have also 
been studied but not without their inherent problems, such as the lack of interchange of metabolites 
among organs and the time dependent exposure within the animal. An alternative to both in vitro and 
animal studies is the use of computer models based on physiologically based pharmacokinetics (PBPK) 
models [Connolly and Anderson, 1991]. These models mimic the integrated, multi- compartment nature 
of animals and thus can predict the time dependent changes in blood and tissue concentrations of the 
parent chemical and its metabolites. The obvious limitations lie in that a response is based on assumed 
mechanisms; therefore, secondary and "unexpected" effects are not included, and parameter estimation 
is difficult. Consequently, the need for an animal surrogates or CCA systems is created. The pioneering 
work of M.L. Shuler's group at Cornell University [Sweeney et al., 1995; Shuler et al., 1996; Mufti and 
Shuler, 1998] and many others has led to the following approach for CCA systems. 

These CCA systems are physical representations of the PBPK structure where cells or engineered tissues 
are used in organ compartments. The fluid medium that circulates between compartments acts as a 
"blood surrogate". Small scale bioreactors housing the appropriate cell types are the physical compart- 
ments that represent organs or tissues. This concept combines attributes of PBPK and in vitro systems. 
Furthermore, it is an integrated system that can mimic dose release kinetics, conversion into specific 
metabolites from each organ, and the interchange of these metabolites between compartments. Since the 
CCA system permits dose exposure scenarios that can replicate those of animal studies, it works in 
conjunction with a PBPK as a tool to evaluate and modify proposed mechanisms. Thus, bioreactor design 
and performance evaluation testing is crucial to the success of this animal surrogate concept. 

Efficient transfer of substrates, nutrients, stimulants, etc. from the gas phase across all interfaces may 
be critical for the efficacy of certain biotransformation processes and in improving blood compatibility 
of biosensors monitoring the compartments. Gas/liquid mass transfer theories are well established for 
microbial processes [Cussler, 1984], however, biotransformation processes also involve liquid/cellular 
interfacial transport. In these bioreactor systems, a gaseous species is transported across two interfaces. 
Each could be a rate determining step and can mask intrinsic kinetics modeling studies associated with 
cellular growth and/or substrate conversion and product formation. 

A methanogenic chemo-autotrophic process was selected for study because of its relative simplicity 
and strong dependence on gaseous nutrient transport, thus establishing a firm quantitative base case 
[Grasso et al., 1995]. The primary objective was to compare the effect of fluid hydrodynamics on mass 
transfer across the liquid/cellular interface of planktonic cells and the subsequent impact upon growth 
kinetics. Standard experimental protocol to measure the gas/liquid resistance was employed [Cussler, 
1984; Grasso et al., 1995]. The determination of the liquid/cellular resistance is more complex. The 
thickness of the boundary layer was calculated under various hydrodynamic conditions and combined 
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with molecular diffusion and mass action kinetics to obtain the transfer resistance. Microbial growth 
kinetics associated with these hydrodynamic conditions can also be examined. Since Monad models are 
commonly applied to describe chemo-autotrophic growth kinetics [Ferry, 1993], the half-saturation 
constant can be an indicator of mass transfer limitations. The measured (apparent) value will be greater 
than the intrinsic value, as demonstrated in these earlier studies. 

95.4 Electro-Enzymatic Membrane Reactors as Electron 
Transfer Chain Biomimetics 

Mimicry of In Vivo Co-Enzyme Regeneration Processes 

In many biosynthesis processes, a co-enzyme is required in combination with the base enzymes to 
function as high efficiency catalysts. A regeneration system is needed to repeatedly recycle the co-enzyme 
to reduce operating costs in continuous in vitro synthesis processes, mimicking the in vivo regenerative 
process involving an electron transfer chain system. Multiple reaction sequences are initiated as in 
metabolic cycles. NAD(H) is one such co-enzyme. Because of its high cost, much effort has focused on 
improving the NAD(H) regeneration process [Chenault and Whitesides, 1987], with electrochemical 
methods receiving increased attention. The direct regeneration on an electrode has proven to be extremely 
difficult [Paxinos et al., 1991]. Either acceleration of protonation or inhibition of intermolecular coupling 
of NAD + is required. Redox mediators have permitted the coupling of enzymatic and electrochemical 
reactions; the mediator accepts the electrons from the electrode and transfers them to the co-enzyme via 
an enzymatic reaction, and thus regeneration/recycling of the co-enzyme during a biosynthesis reaction 
can be accomplished [Hoogvliet et al., 1988]. The immobilization of mediator and enzyme on electrodes 
can reduce the separation procedure, increase the selectivity, and stabilize the enzyme activity [Fry et al., 
1994]. Various viologen mediators and electrodes have been investigated for the NADH system in batch 
configurations [Kunugi et al., 1990] . The mechanism and kinetics were investigated by cyclic voltammetry, 
rotating disk electrode, and impedance measurement techniques. The performance of electrochemical 
regeneration of NADH on an enzyme immobilized electrode for the biosynthesis of lactate in a packed 
bed flow reactor [Zawalich and Fisher, 1998] is selected as a model system to illustrate an electron transfer 
chain biomimetic. 

Electro-Enzymatic Production of Lactate from Pyruvate 

The reaction scheme is composed of a three-reaction sequence: (1) the NADH-dependent enzymatic 
(lactate dehydrogenase: LDH) synthesis of lactate from pyruvate; (2) the regeneration of NADH from 
NAD + and enzymatic (lipoamide dehydrogenase: LipDH) reaction with the mediator (methyl viologen); 
and (3) the electrochemical (electrode) reaction. The methyl viologen (MV 2+ ) accepts electrons from the 
cathode and donates them to the NAD + via the LipDH reaction. The regenerated NADH in solution is 
converted to NAD + in the enzymatic (LDH) conversion of pyruvate to lactate. A key feature of this system 
is the in situ regeneration of the co-enzyme NADH. A flow-by porous reactor utilizes the immobilized 
enzyme system (LipDH and methyl viologen as a mediator) within the porous graphite cathodes, encap- 
sulated by a cation exchange membrane (Nafion® 124). The free-flowing fluid contains the pyruvate/lac- 
tate reaction mixture, the LDH, and the NADH/NAD + system. Lactate yields up to 70% have been 
obtained when the reactor system was operated in a semi-batch (i.e., recirculation) mode for 24 h, as 
compared to only 50% when operated in a simple batch mode for 200 h. The multi-pass, dynamic input 
operating scheme permitted optimization studies to be conducted on system parameters. This includes 
concentrations of all components in the free solution (initial and dynamic input values could be readily 
adjusted through recycle conditioning), flow rates, and electrode composition and their transport char- 
acteristics. By varying the flow rates through this membrane reactor system, operating regimes can be 
identified that determine the controlling mechanism for process synthesis (i.e., mass transfer vs. kinetics 
limitations). Procedures for operational map development are thus established. 
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Diffusional Processes 
and Engineering Design 



96.1 Applications of Allometry 

96.2 Flow Limited Processes 

96.3 Extracorporeal Systems 

Membrane Separators • Chromatographic Columns • Flow 
Reactors 

96.4 Useful Correlations 

E. N. LightfoOt Convective Mass Transfer • Convective Dispersion and the 

University of Wisconsin One-Dimensional Convective Diffusion Equation 

Mass transport and diffusional processes play key roles in biomedical engineering at several levels, both 
within the body and in extracorporeal circuits. The roles of mass transfer in tissue function are discussed 
in Chapter 115 (Lightfoot and Duca), and here we will concentrate on external processes and design of 
therapeutic regimens. Moreover, we will depend heavily upon major references [Bird et al., 1960; Schmidt- 
Nielsen,, 1997; Bassingthwaighte et al, 1998; Lightfoot and Lightfoot, 1997; Welling, 1997; Noble and 
Stern, 1995; Ho and Sirkar, 1992] and seek understanding rather than detailed descriptions. 

All mass transport processes, which can be defined as the technology for moving one species in a 
mixture relative to another, depend ultimately upon diffusion as the basis for the desired selective motion. 
Diffusion takes many forms, and a general description is provided in Table 1 15.7 of Chapter 115. However, 
a great deal of information can often be obtained by carefully written statements of simple constraints, 
and that of conservation of mass is the most useful for our purposes. We shall begin with examples where 
this suffices and show how one can determine the validity of such a simple approach. We then proceed 
to situations where more detailed analysis is needed. 

96.1 Applications of Allometry 

Much of biomedical engineering requires transferring information obtained from animal experiments 
to humans, and here interspecies similarities have proven quite useful. Many properties of living systems 
scale simply with body mass according to the simple allometric equation [Schmidt-Nielsen, 1997]: 

P=aM b (96.1) 

Here P is any property, M is average species mass, and a and b are species independent constants. Among 
the most important of such relations is that of basal metabolic rate of warm blooded animals, for which 
the total rate of oxygen consumption is given by 

*o„ M = 3-5M 3/4 (96.2) 
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Here R 02 is ml 2 (STP) consumed per hour, and M is body mass in grams. Other important properties, 
for example blood volume per unit mass and decrease of total oxygen content per unit volume of blood 
on passing through the arterial system, are invariant b is zero. It follows that average blood circulation 
time 



T. 



■ M 



1/4 



(96.3) 



This is an important result because it means that flow limited body processes scale with the 1/4-power 
of body mass. Moreover, T drc is the first of the time constants which we shall find to govern most diffusional 
processes. 

An important application is the prediction of human drug elimination kinetics from animal experi- 
ments. For many it is found that 



cV/m = f(t/T arc ) 



(96.4) 



where c is blood concentration of the drug, V is body volume, and m is the initial drug dose. The 
function f can be quite complex but, to a first order, it is independent of species (see for example [Dedrick 
et al., 1970; also Lightfoot, 1974, p. 353]). Moreover, V can be calculated from mass assuming a species 
independent density close to lg/ml. Success of Eq. (96.4) requires that drug distribution between blood 
and body tissues be independent of drug concentration and that elimination be assumed flow limited. 
We now ask when this last assumption can be justified. 

96.2 Flow Limited Processes 

Processes in which diffusion and reaction rates are fast relative to mean solute residence times are said 
to be flow limited because change in diffusion or kinetic rates has little effect on the process under 
investigation. Although very few biomedically interesting processes are flow limited in this sense at a 
detailed level of description, it is often found that the global behavior of a complex system can be so 
described with little error. To understand this we look briefly at Fig. 96.1a: a closed system of constant 
volume V with a single inlet and single outlet. Blood or other fluid is flowing through this system at a 
constant flow rate Q and with a decaying inlet solute concentration 



.(<)' 



->/** 



(96.5) 



We now calculate the difference between this inlet concentration and the outlet concentration C out (t), for 
two quite different situations: plug flow in our system (PF), and perfect mixing (CSTR for continuous 
stirred tank reactor). In both cases the mean residence time is T = VI Q. The results are: 




C-Cn 



FIGURE 96.1a A simple flow system. 
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FIGURE 96.1b Flow limited approximations. 



PF: Here, there is just a time delay without other change, so that 



m out 



t>T : c -c- =c n e '^ K 

m out in 
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(96.6a) 
(96.6b) 



CSTR: Here the internal concentration is uniform at the outlet level, and 



R-l 



-e- Tf ; R=T„ r T 



(96.7) 



These results are plotted in Fig. 96.1b for two time- constant ratios, T BC /T M , = 1 and T BC /T M = 10. The 
two flow conditions are seen to produce very different behavior for equal time constants, but these 
differences quickly become minor when T m <§ T BC . More specifically, the effect of our system flow 
conditions on the inlet concentration becomes insensitive to these conditions when two criteria are met: 



T <\r 

m BC 



(96.8a) 



t = T >T 

1 OBS — m 



(96.8b) 



Here T OBS is the observer time, i.e., the time before which there is no interest in the system behavior. 
These are the conditions of time constant separation, of great practical importance in all engineering 
design calculations. Usually "much less than" can be taken to be less than a third, and vice versa, and 
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FIGURE 96.2 Modeling hemodialysis. 

one tenth is almost always sufficient. Thus, using time constant separation to simplify process descriptions 
is usually referred to as an order-of-magnitude approximation. Returning to Fig. 96.1a we may now write 
a macroscopic mass balance [Bird et al., 1960, Ch. 22] of the form 



Vdc/dt»(^C in -cj + v(R 



(96.9) 



where c is both average internal and exit concentration, and (R) is the average rate of solute formation 
by chemical reaction. Here we have used the CSTR approximation as the simplest to handle mathemat- 
ically. This expression is valid only under the constraints of Eq. (96.8), but these are commonly met in 
conditions of medical interest. 

The major utility of Eq. (96.9) is in the description of networks such as the organ network of the 
human body, and applications include transient drug distribution, anaesthesia, and the study of metabolic 
processes [Welling, 1997; Bassingthwaighte et al., 1998]. Here an equation similar to Eq. (96.8) must be 
written for each element of the network, but individual organs can be combined, so long as the resulting 
element conforms to Eq. (96.8). These processes are often lumped under the heading of pharmacokinetics, 
and a large literature has developed [Welling, 1997]. An example of great economic importance is 
hemodialysis for the treatment of end stage kidney disease. Here the body can be approximated by the 
simple diagram of Fig. 96.2, and the defining equations reduce to 



V T dc T /dt- 



V B dcJdt-- 



Qb( c b- c t) 
-Qb( c b- c t)- 



+ G 



Ch 



(96.9a) 



(96.9b) 



Here the subscript B and T refer to the blood and (whole body) tissue respectively; G is the rate of toxin 
generation, and CI is the clearance of the dialyzer. Comparison of prediction with observation is shown 
for creatinine in Fig. 96.3. Here the parameters of Eq. (96.9) were determined for the first day's treatment. 
It may be seen that the pharmacokinetic approximation permits accurate extrapolation for four additional 
days. 
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FIGURE 96.3 Actual creatinine dynamics compared to prediction. 



96.3 Extracorporeal Systems 



Next we look at the problem of designing extracorporeal systems, and these can normally be classified 
into a small number of categories. We shall consider steady-state membrane separators, chromatographic 
devices, and flow reactors. 

Membrane Separators 

The purpose of these devices is to transfer solute from one flowing steam to another, and there are two 
subcategories distinguished by the ratio of transmembrane flow induced (convective) and diffusional 
solute. This ratio in turn is defined by a Peclet number, 



Pes v P 



(96.10) 



P=N. Ac 



(96.11) 



Here (v) is the observable transmembrane solvent velocity, and P is the membrane solute diffusional 
permeability. The permeability in turn is defined as the ratio of the molar flux of solute transport, 
moles/area-time, to the solute concentration difference causing this transport. The most familiar examples 
of low-Pe devices are blood oxygenators and hemodialyzers. High-Pe systems include micro-, ultra-, and 
nano-filtration and reverse osmosis. The design and operation of membrane separators is discussed in 
some detail in standard references [Noble and Stern, 1995; Ho and Sirkar, 1992], and a summary of 
useful predictions is provided in Section 96.4. 

Low-Pe devices are by far the simpler. Solute transport in dialyzers is essentially unaffected by the 
small amount of transmembrane water flow, and one may therefore use standard design techniques based 
on membrane permeabilities, usually supplied by the vendor, and mass transfer coefficients in the adjacent 
fluids. Local fluxes can be described by the simple expression 



N=K 



\c u — c I 

c I lb le I 



(96.12) 
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TABLE 96.1 Asymptotic Nusselt Numbers for Laminar Duct Flow 
(With Fully Developed Velocity Profiles) 

Constant Wall Concentration Constant Wall Mass Flux 

Thermal Entrance Region 

Plug flow: 



Parabolic velocity profile: 



1 vD 2 } xr in 



vB 1 
Bz 



'A 



* tef „., jM' 



^=9^(4/3)^] Nu '°<=^ 



Bz 



Here z is distance in the flow direction and g is the rate of change of velocity 
with distance from the wall, evaluated at the wall. Nusselt numbers are local 
values, evaluated at z. 

Fully Developed Region 

Plug flow: Nu hc = 5.783 Nu hc = 8 

Parabolic Flow: Nu = 3.656 Nu = 48/1 1 



where N, is the molar flux of the solute "i" across the membrane, moles/area-time, K c is the overall mass 
transfer coefficient, c t is molar solute concentration while the subscripts "b" and "e" refer to blood and 
the external fluid, respectively. The overall mass transfer coefficient must be calculated from the two fluid 
phase coefficients and membrane permeability. Hemodialysis solutes tend to distribute uniformly 
between blood (on a cell-free basis), and one may use the simple approximation 



l/K c = (l/k b ) + (l/p) + (l/k) (96.13) 



Here k b and k e are the mass transfer coefficients for the blood and external fluid, here dialysate, respectively. 
These phase mass transfer coefficients can usually be estimated to an acceptable accuracy from simple 
asymptotic formulas [Lightfoot and Lightfoot, 1997]. Examples of the latter are given in Table 96.1. For 
unequally distributed solutes, Eq. (96.3) must be appropriately modified [Lightfoot and Lightfoot, 1997], 
and, for blood this can be something of a problem [Popel, 1989]. 

Equipment performance for dialyzers is normally expressed in terms of clearance 

Cl=fc im Q (96.14) 

where f is the fraction of incoming solute "i" removed from the blood, q in is concentration of solute "i" 
in the entering blood, and Q is volumetric blood flow rate. Clearance is a convenient measure as it is 
independent of solute concentration, and it is easily determined experimentally. Prediction is useful for 
device design, but in operation, clearance is usually determined along with effective blood flow rates and 
tissue water volumes from numerical analysis of a test dialysis procedure. The efficiency of blood oxy- 
genators can be dominated by either membrane permeability or mass transfer in the flowing blood, and 
it is complicated by the kinetics and thermodynamics of the oxygen/hemoglobin system. These aspects 
are discussed in detail by Popel [1989]. 

High-Pe devices are dominated by transmembrane water transport, and detailed discussion must be 
left to the above cited references. However, it is important to recognize their primary function is to 
remove water and undesired solutes while retaining one solute which is desired. Rejection of the desired 
product increases toward an asymptote as water flux increases, and one should operate near this asymptote 
if at all possible. The relation between water flux and rejection is perhaps best determined experimentally. 
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However, as water flux increases the rejected solute concentration at the interface between the feed stream 
and the membrane also increases. This process, usually known as concentration polarization, typically 
produces a significant increase in osmotic pressure which acts to reduce the flow. Polarization is a complex 
process, but to a good approximation the trans-membrane water velocity is given by 



v)~lAln(cJc $0 )-(pJp )-®-k c (96.15) 



Here c s is concentration of the rejected solute, p is solution density, k c is the concentration-based mass 
transfer coefficient in the absence of water flux, and the subscripts 8 and refer to conditions at the 
membrane surface and bulk of the fed solution, respectively. The factor is a correction for variable 
viscosity and diffusivity, approximated at least for some systems by 

/ \2/3, , ,l/3 

e = (B*) (l/V*) (96.16) 



and (©*) and (1/u*) are the averages of solute diffusivity and reciprocal solution viscosity at the mem- 
brane surface and bulk solution divided by these quantities in the bulk solution. These equations are 
reasonable once appreciable polarization has occurred, if rejection is high, and they are a modification 
of earlier boundary-layer analyses of Kozinski and Lightfoot [1972]. They are more accurate than the 
more recent results made using simple film theory discussed in [Noble and Stern, 1995; Ho and Sirkar, 
1992] , both in incorporating the coefficient 1 .4 and in the corrections for variable diffusivity and viscosity. 
The coefficient of 1.4 is a correction to account for boundary-layer compression accompanying trans- 
membrane flow, not allowed for in film theory. Typical geometry insensitive boundary layer behavior is 
assumed, and these equations are not restricted to any given rejected solute or equipment configuration. 
However, in order to calculate the pressure drop required to obtain this flow, one must use the expression 

Ap=n + v/k h (96.17) 

where Ap is transmembrane pressure drop, % is solute osmotic pressure at the membrane surface, and 
k h is the hydraulic permeability of the membrane. The osmotic pressure in turn is a function of solute 
concentration at the membrane surface, and thus is different for each rejected solute. 

Chromatographic Columns 

Chromatography is very widely used in biomedical analyses and to a significant extent for extracorporeal 
processing of blood and other body fluids. Recovery of proteins from blood is of particular importance, 
and these applications can be expected to grow. Good basic texts are available for underlying dynamic 
theory [Guiochon et al., 1994] and chemistry [Snyder et al., 1997], and a series of design papers is 
recommended [Lightfoot et al., 1997; Lode et al., 1998; Yuan et al., in press; Athalye et al., 1992]. 
Differential chromatography, in which small mixed-solute pulses are separated by selective migration 
along a packed column, is the simplest, and much of the chromatographic literature is based on concepts 
developed for this basic process. 

In differential chromatography, the individual solutes do not interact, and the effluent curve for each 
is usually close to a Gaussian distribution: 



f (L,t) 



cAL,t) = c exp 



f \ 
t 



v 



\'A 



(27c) (c/i} (96.18) 



where cAL,t) is the fluid phase concentration leaving a column of length L at time f, f is the mean solute 
residence time, and O is the standard deviation of the distribution; c is the maximum effluent concentration. 
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SYMMETRICAL BINARY SEPARATION 

resolution of unity 
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FIGURE 96.4 Resolution in differential chromatography. 

Degree of separation is defined in terms of the resolution, R 12 defined as in Fig. 96.4 where separation with 
a resolution of unity is shown. Here the distance between the two mean residence times ( t r i ), 40 and 20 
in Fig. 96.4, increase in direct proportion to column length, while the standard deviations increase with 
its square root. Thus resolution is proportional to the square root of column length. Column performance 
is normally rated in terms of the number of equivalent theoretical plates, N defined by 



N: 



'(»/-)" 



(96.19) 



Methods of predicting N from column design and operating conditions are described in the above 
mentioned references (see for example [Athalye et al., 1992]). 

Flow Reactors 

Development of small flow reactors for diagnostic purposes is a fast growing field. Patient monitoring 
of blood glucose is probably the largest current example, but new applications for a wide variety of 
purposes are rapidly being developed. These are essentially miniaturized variants of industrial reactors 
and normally consist of channels and reservoirs with overall dimensions of centimeters and channel 
diameters of millimeters or even tenths of millimeters. The key elements in their description are convective 
mass transfer, dispersion, and reaction, and they differ from larger systems only in parameter magnitudes. 
In particular, flow is almost invariably laminar and Schmidt numbers 



Sc = \l pB 



(96.20) 



are always high, never less than 10 5 . Here u is solvent viscosity, p is its density, and B is effective solute 
diffusivity through the solution. These aspects are briefly described in the next section. 

96.4 Useful Correlations 

Convective Mass Transfer 

Mass transfer coefficients k c are best correlated in the dimensionless form of mass transfer Nusselt numbers 
defined as 
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NuskD/B (96.21) 

where D is any convenient characteristic length, typically diameter for tubes. For the high-Sc systems 
considered above, these Nusselt numbers are functions only of a dimensionless ratio L*, system geometry, 
and boundary conditions [Lightfoot and Lightfoot, 1997]. The scaled length L* in turn is just the ratio 
of mean solvent residence time to lateral diffusion time: 



^/(v))/^ 2 /^©) (96.22) 



For all geometries these functions have the same three characteristics: an entrance region solution for L* 
"much" less than unity, a constant asymptotic value for L* "much" greater than unity (the "fully developed" 
region), and a relatively unimportant transition region for L* close to unity. Normally the entrance region 
is of most practical importance and can be used without great error up to L* of unity. Typical results are 
shown in Table 96.1. The entrance region results are valid for both tubes (circular cross-section) and 
between parallel plates; it may be noted that the reference length D cancels. It appears to the same power 
on both sides of the equations. The correlations for laminar flow are also valid for non-Newtonian fluids 
if the appropriate wall velocity gradients are used. The solutions for fully developed flow are, however, 
only useful for round tubes and either plug or Poiseuille flow as indicated. These are much less important 
as they result in very low transfer rates. Plug and Poiseuille (parabolic) flows are limiting situations, and 
most real situations will be between them. Plug flow is approximated in ultrafiltration where the high 
solute concentrations produce a locally high viscosity. Poiseuille flow is a good approximation for dialysis. 

Convective Dispersion and the One-Dimensional 
Convective Diffusion Equation 

Non-uniform velocity profiles tend to reduce axial concentration gradients, a result which is qualitatively 
similar to a high axial diffusivity. As a result, it is common practice to approximate the diffusion equation 
for duct flow as 

dc/dt+(v\dc/dzs B cff d 2 c/dz 2 +(^ )C// ) (96.23) 

Here the overline indicates an average over the flow cross-section, usually further approximated as the 
cup-mixing or bulk concentration [Bird et al., 1960, p. 297), (v) is the flow average velocity, B e a is an 
effective diffusivity and (R^,} is the rate of solute addition per unit volume by reaction plus mass transfer 
across the wall: 

{*J} = ( R ,^ m ) + (cfs)K c (c e - c) (96.24) 

Here (R; c |, em ) is the average volumetric rate of formation of species "i" moles/volume-time, C is the duct 
circumference, S is its cross-sectional area, and c e is the solute concentration outside the duct wall, i.e., 
in the surrounding fluid. 

For Newtonian tube flow at L* greater than unity, it is reasonable to write 

/ / / \ / / \ / \ 2 

B 



effl 



D = l + (l/l92) (b(v\/b\ (96.25) 



The restrictions on this result and some additional useful approximations are widely discussed in the 
mass transfer literature (see for example [Ananthkrishnan et al., 1965]). For flow in packed beds, one 
often sees 
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B eff /D~0A(Dv 2 /B\ (96.26) 

but more accurate expressions are available [Athalye et al., 1992] . Convective dispersion is a very complex 
subject with a large literature, and successful application requires careful study [Brenner, 1962; Ananfh- 
krishnan et al., 1965; Edwards et al., 1991]. 
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97.1 Background 

Animal surrogate or cell culture analog (CCA) systems mimic the biochemical response of an animal or 
human when challenged with a chemical or drug. A true animal surrogate is a device that replicates the 
circulation, metabolism, or adsorption of a chemical and its metabolites using interconnected multiple 
compartments to represent key organs. These compartments make use of engineered tissues or cell 
cultures. Physiologically based pharmacokinetic models (PBPK) guide the design of the device. The 
animal surrogate, particularly a human surrogate, can provide important insights into toxicity and efficacy 
of a drug or chemical when it is impractical or imprudent to use living animals (or humans) for testing. 
The combination of a CCA and PBPK provides a rational basis to relate molecular mechanisms to whole 
animal response. 

Limitations of Animal Studies 

The primary method used to test the potential toxicity of a chemical or action of a pharmaceutical is to 
use animal studies, predominantly with rodents. Animal studies are problematic. The primary difficulties 
are that the results may not be meaningful to assessment of human response [Gura, 1997]. Because of 
the intrinsic complexity of a living organism and the inherent variability within a species, animal studies 
are difficult to use to identify unambiguously the underlying molecular mechanism for action of a 
chemical. The lack of a clear relationship among all of the molecular mechanisms to whole animal 
response makes extrapolation across species difficult. This factor is particularly crucial when extrapolation 
of rodent data to humans is an objective. Further, without a good mechanistic model it is difficult to 
rationally extrapolate from high doses to low doses. However, this disadvantage due to complexity can 
be an advantage; the animal is a "black box" and provides response data even when the mechanism of 
action is unknown. Further disadvantages reside in the high cost of animal studies, the long period of 
time often necessary to secure results, and the potential ethical problems in animal studies. 
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Alternatives to Animal Studies 

In vitro methods using isolated cells (e.g., [Del Raso, 1993]) are inexpensive, quick, and have almost no 
ethical constraints. Because the culture environment can be specified and controlled, the use of isolated 
cells facilitates interpretation in terms of a biochemical mechanism. Since human cells can be used as 
well as animal cells, cross-species extrapolation is facilitated. 

However, these techniques are not fully representative of human or animal response. Typical in vitro 
experiments expose isolated cells to a static dose of a chemical or drug. It is difficult to relate this static 
exposure to specific doses in a whole animal. The time-dependent change in the concentration of a 
chemical in an animal's organ cannot be replicated. If one organ modifies a chemical or prodrug which 
acts elsewhere, these situations would not be revealed by the normal in vitro test. 

A major limitation on the use of cell cultures is that isolated cells do not fully represent the full range 
of biochemical activity of the corresponding cell type when in a whole animal. Engineered tissues, 
especially co-cultures [Bhatia et al., 1998] , can provide a more "natural" environment which can improve 
(i.e., make normal) cell function. Another alternative is use of tissue slices, typically from the liver [Olinga 
et al., 1997]. Tissue slices require the sacrifice of the animal, there is intrinsic variability, and biochemical 
activities can decay rapidly after harvest. The use of isolated tissue slices also does not reproduce inter- 
change of metabolites among organs and the time-dependent exposure that occurs within an animal. 

An alternative to both animal and in vitro studies is the use of computer models based on PBPK models 
[Connolly and Anderson, 1991]. PBPK models can be applied to both humans and animals. Because PBPK 
models mimic the integrated, multicompartment nature of animals, they can predict the time-dependent 
changes in blood and tissue concentrations of a parent chemical or its metabolites. Although construction 
of a robust, comprehensive PBPK is time-consuming, once the PBPK is in place many scenarios concerning 
exposure to a chemical or treatment strategies with a drug can be run quickly and inexpensively. Since 
PBPKs can be constructed for both animals and humans, cross-species extrapolation is facilitated. There 
are, however, significant limitations in relying solely on PBPK models. The primary limitation is that a 
PBPK can only provide a response based on assumed mechanisms. Secondary and unexpected effects are 
not included. A further limitation is the difficulty in estimating parameters, particularly kinetic parameters. 

None of these alternatives satisfactorily predict human response to chemicals or drugs. 

97.2 The Cell Culture Analog Concept 

A CCA is a physical replica of the structure of a PBPK where cells or engineered tissues are used in organ 
compartments to achieve the metabolic and biochemical characteristics of the animal. Cell culture 
medium circulates between compartments and acts as a "blood surrogate". Small-scale bioreactors with 
the appropriate cell types in the physical device represent organs or tissues. 

The CCA concept combines attributes of a PBPK and other in vitro systems. Unlike other in vitro 
systems, the CCA is an integrated system that can mimic dose dynamics and allows for conversion of a 
parent compound into metabolites and the interchange of metabolites between compartments. A CCA 
system allows dose exposure scenarios that can replicate the exposure scenarios of animal studies. 

A CCA is intended to work in conjunction with a PBPK as a tool to test and refine mechanistic 
hypotheses. A molecular model can be embedded in a tissue model which is embedded in the PBPK. 
Thus, the molecular model is related to overall metabolic response. The PBPK can be made an exact 
replica of the CCA; the predicted response and measured CCA response should exactly match if the 
PBPK contains a complete and accurate description of the molecular mechanisms. In the CCA all flow 
rates, the number of cells in each compartment, and the levels of each enzyme can be measured inde- 
pendently, so no adjustable parameters are required. If the PBPK predictions and CCA results disagree, 
then the description of the molecular mechanisms is incomplete. The CCA and PBPK can be used in an 
iterative manner to test modifications in the proposed mechanism. When the PBPK is extended to describe 
the whole animal, failure to predict animal response would be due to inaccurate description of transport 
(particularly within an organ), inability to accurately measure kinetic parameters (e.g., in vivo enzyme 
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levels or activities), or the presence in vivo or metabolic activities not present in the cultured cells or 
tissues. Advances in tissue engineering will provide tissue constructs to use in a CCA that will display 
more authentic metabolism than isolated cell cultures. 

The goal is predicting human pharmacological response to drugs or assessing risk due to chemical 
exposure. A PBPK that can make an accurate prediction of both animal CCA and animal experiments 
would be "validated". If we use the same approach to constructing a human PBPK and CCA for the same 
compound, then we would have a rational basis to extrapolate animal response to predict human response 
when human experiments would be inappropriate. Also, since the PBPK is mechanistically based, it would 
provide a basis for extrapolation to low doses. The CCA/PBPK approach complements animal studies 
by potentially providing an improved basis for extrapolation to humans. 

Further, PBPKs validated as described above provide a basis for prediction of human response to 
mixtures of drugs or chemicals. Drug and chemical interactions may be synergistic or antagonistic. If a 
PBPK for compound A and a PBPK for compound B are combined, then the response to any mixture 
of A and B should be predictable since the mechanisms for response to both A and B are included. 



97.3 Prototype CCA 



A simple three-component CCA mimicking rodent response to a challenge by naphthalene has been 
tested by Sweeney et al. [1995]. While this prototype system did not fulfill the criteria for a CCA of 
physically realistic organ residence times or ratio of cell numbers in each organ, it did represent a multi- 
compartment system with fluid recirculation. The three components were liver, lung, and other perfused 
tissues. These experiments used cultured rat hepatoma (H4IIE) cells for the liver and lung (L2) cells for 
the lung compartment. No cells were required in "other tissues" in this model since no metabolic reactions 
were postulated to occur elsewhere for naphthalene or its metabolites. The H4IIE cells contained enzyme 
systems for activation of naphthalene (cytochrome P450IA1) to the epoxide form and conversion to 
dihydriol (epoxide hydrolase) and conjugation with glutathione (glutathione-S-transferase). The L2 cells 
had no enzymes for naphthalene activation. Cells were cultured in glass vessels as monolayers. Experiments 
with this system using lactate dihydrogenase release (LDH) and glutathione levels as dependent parameters 
supported a hypothesis where naphthalene is activated in the "liver" and reactive metabolites circulate to 
the "lung" causing glutathione depletion and cell death as measured by LDH release. Increasing the level 
of cytochrome p450 activity in the "liver" by increasing cell numbers or by preinducing H4IIE cells led to 
increased death of L2 cells. Experiments with "liver"-blank; "lung"-"lung" and "lung"-blank combinations 
all supported the hypothesis of a circulating reactive metabolite as the cause of L2 cell death. 

This prototype system [Sweeney et al., 1995] was difficult to operate, very non-physiologic, and made 
time course experiments very difficult. An alternative system using packed bed reactors for the "liver" 
and "lung" compartments has been tested [Shuler et al., 1996; Ghanem, 1998]. This system successfully 
allowed time course studies, was more compact and simpler to operate, and was physiological with respect 
to the ratio of "liver" to "lung" cells. While liquid residence times improved in this system, they still were 
not physiologic (i.e., 114s vs. an in vivo value of 21s in the liver and 6.6s vs. in vivo lung value of about 
1.5s) due to physical limitations on flow through the packed beds. Unlike the prototype system, no 
response to naphthalene was observed. 

This difference in response of the two CCA designs was explained through the use of PBPK models 
of each CCA. In the prototype system, the large liquid residence times in the liver (6 min) and the lung 
(2.1 min) allowed formation of large amounts of naphthol from naphthalene oxide and presumably the 
conversion of napthol into quinones that were toxic. In the packed bed system, liquid residence times 
were sufficiently small so that the predicted naphthol level was negligible. Thus, the PBPK provided a 
mechanistic basis to explain the differences in response of the two experimental configurations. 

Using a very simple CCA, Mufti and Shuler [1998] demonstrated that response of human hepatoma 
(HepG2) to exposure to dioxin (2,3,7,8-tetrachlorodibenzo-p-dioxin) is dependent on how the dose is 
delivered. The induction of cytochrome p450IAl activity was used as a model response for exposure to 
dioxin. Data were evaluated to estimate dioxin levels giving cytochrome P450IA1 activity 0.01% of 
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maximal induced activity. Such an analysis mimics the type of analysis used to estimate risk due to 
chemical exposure. The "allowable" dioxin concentration was 4 x 10~ 3 nM using a batch spinner flask, 
4 X 10~ 4 nM using a one-compartment system with continuous feed, and 1 X 10~ 5 nM using a simple 
two-compartment CCA. Further, response could be correlated to an estimate of the amount of dioxin 
bound to the cytosolic Ah receptor with a simple model for two different human hepatoma cell lines. 
This work illustrates the potential usefulness of a CCA approach in risk assessment. 

Ma et al. [1997] have discussed an in vitro human placenta model for drug testing. This was a two- 
compartment perfusion system using human trophoblast cells attached to a chemically modified poly- 
ethylene terephatholate fibrous matrix as a cell culture scaffold. This system is a CCA in the same sense 
as the two-compartment system used to estimate response to dioxin. 

The above examples are the first that attempt to mimic circulation and metabolic response to model 
an animal as an integrated system. However, others have used engineered tissues as a basis for testing 
the efficacy of drugs or toxicity of chemicals. These tissues are important in themselves and could become 
elements in an integrated CCA. 

97.4 Use of Engineered Tissues or Cells 
for Toxicity/Pharmacology 

The primary use of engineered tissues for toxicity testing has been with epithelial cells that mimic the 
barrier properties of the skin, gut, or blood brain barrier. The use of isolated cell cultures has been of 
modest utility due to artifacts introduced by dissolving test agents in medium and due to the extreme 
sensitivity of isolated cells to these agents compared to in vivo tissue. One of the first reports on the use 
of engineered cells is that by Gay et al. [ 1992] reporting on the use of a living skin equivalent as an in vitro 
dermatotoxicity model. The living skin equivalent consists of a co-culture of human dermal fibroblasts 
in a collagen-containing matrix overlaid with human keratinocytes that have formed a stratified epider- 
mis. This in vitro model used a measurement of mitochondrial function (i.e., the colormetric thiazolyl 
blue assay) to determine toxicity. Eighteen chemicals were tested. Eleven compounds classified as non- 
irritating had minimal or no effect on mitochondrial activity. For seven known human skin irritants, 
the concentration that inhibited mitochondrial activity by 50% corresponded to the threshold value for 
each of these compounds to cause irritation on human skin. However, living skin equivalents did not 
fully mimic the barrier properties of human skin; water permeability was 30-fold greater in the living 
skin equivalent than in human skin. Kriwet and Parenteau [1996] report on permeabilities of 20 different 
compounds in in vitro skin models. Skin cultures are slightly more permeable (two- or threefold) for 
highly lipophilic substances and considerably more permeable (about tenfold) for polar substances than 
human-cadaver or freshly excised human skin. 

The above tests are static. Pasternak and Miller [1996] have tested a system combining perfusion and 
a tissue model consisting of MDCK (Madin-Darby canine kidney) epithelial cells cultured on a semi- 
porous cellulose ester membrane filter. The system could be fully automated using measurement of trans- 
epithelial electrical resistance (TER) as an end point. A decrease in TER is an indicator of cell damage. 
The system was tested using nonionic surfactants and predicted the relative occular toxicity of these 
compounds. The perfusion system mimics some dose scenarios (e.g., tearing) more easily than a static 
system and provides a more consistent environment for the cultured cells. The authors cite as a major 
advantage that the TER can be measured throughout the entire exposure protocol without physically 
disturbing the tissue model and introducing artifacts in the response. 

Probably the most used cell-based assay is the Caco-2 model of the intestine to determine oral 
availability of a drug or chemical. The Caco-2 cell cultures are derived from a human colon adenocar- 
cinoma cell line. The cell line, C2Bbel, is a clonal isolate of Caco-2 cells that is more homogeneous in 
apical brush border expression than the Caco-2 cell line. These cells form a polarized monolayer with 
an apical brush border morphologically comparable to the human colon. Tight junctions around the 
cells act to restrict passive diffusion by the paracellular route mimicking the transport resistance in the 
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intestine. Hydrophobic solutes pass primarily by the transcellular route and hydrophilic compounds by 
the paracellular route. Yu and Sinko [1997] have demonstrated that the substratum (e.g., membrane) 
properties upon which the monolayer forms can become important in estimating the barrier properties 
of such in vitro systems. The barrier effects of the substratum need to be separated from the intrinsic 
property of the monolayers. Further, Anderle et al. [1998] have shown that the chemical nature of 
substratum and other culture conditions can alter transport properties. Sattler et al. [1997] provide one 
example (with hypericin) of how this model system can be used to evaluate effects of formulation (e.g., 
use of cyclodextrin or liposomes) on oral bioavailability. Another example is the application of the Caco-2 
system to transport of paclitaxel across the intestine [Walle and Walle, 1998]. Rapid passive transport 
was partially counter-balanced by an efflux pump (probably P-glycoprotein) limiting oral bioavailability. 

Another barrier of intense interest for drug delivery is the blood-brain barrier. The blood-brain barrier 
is formed by the endothelial cells of the brain capillaries. A primary characteristic is the high resistance 
of the capillary due to the presence of complex tight junctions inhibiting paracellular transport and the 
low endocytic activity of this tissue. Acceptable in vitro models have been more difficult to formulate but 
one commercial system marketed by Cellco, Inc. is available that uses a co-culture of endothelial cells 
and astrocytes to form a barrier. 

A recent example of a different in vitro system is described by Glynn and Yazdanian [1998] who used 
bovine brain microvessel endothelial cells grown on porous polycarbonate filters to compare the transport 
of nevirapine, a reverse transcriptase inhibitor to other HIV antiretroviral agents. Nevirapine was the 
most permeable antiretroviral agent and hence may have value in HIV treatment in reducing levels of 
HIV in the brain. 

These isolated cultures mimic an important aspect of cell physiology (oral uptake or transport into 
the brain). In principle, they could be combined with other tissue mimics to form a CCA that would be 
especially useful in testing pharmaceuticals. 

97.5 Future Prospects 

Significant progress is being made in the construction of tissue engineered constructs [Baldwin and Saltz- 
man, 1996] . These efforts include highly vascularized tissues, which are especially important for toxicological 
or pharmacological studies in the liver. Liver constructs, often based on co-cultures, have become increas- 
ingly more normal in behavior (see [Griffith etal., 1997]). For toxicological studies multiple test systems 
operated in parallel are desirable which suggests the need for miniaturization to conserve cells, reagents, 
and time. Bhatia et al. [1998] have described a microfabricated device for hepatocyte-fibroblast co-cultures. 
A CCA based on the concepts described here and incorporating these advanced-engineered tissues 
could become a powerful tool for preclinical testing of pharmaceuticals. While drug leads are expanding 
rapidly in number, the capacity to increase animal and human clinical studies is limited. It is imperative 
that preclinical testing and predictions for human response become more accurate. A CCA should become 
an important tool in preclinical testing. 

Defining Terms 

Animal surrogate: A physiologically based cell or tissue multi-compartmented device with fluid cir- 
culation to mimic metabolism and fate of a drug or chemical. 

Engineered tissues: Cell culture mimic of a tissue or organ; often combines a polymer scaffold and 
one or more cell types. 

Physiologically based pharmacokinetic model (PBPK): A computer model that replicates animal 
physiology by subdividing the body into a number of anatomical compartments, each compart- 
ment interconnected through the body fluid systems; used to describe the time-dependent distri- 
bution and disposition of a substance. 

Tissue slice: A living organ is sliced into thin sections for use in toxicity studies; one primary organ 
can provide material for many tests. 
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98.1 Introduction and Conceptual Challenges 

Models of microvascular heat transfer are useful for optimizing thermal therapies such as hyperthermia 
treatment, for modeling thermoregulatory response at the tissue level, for assessing environmental haz- 
ards that involve tissue heating, for using thermal means of diagnosing vascular pathologies, and for 
relating blood flow to heat clearance in thermal methods of blood perfusion measurement. For example, 
the effect of local hyperthermia treatment is determined by the length of time that the tissue is held at 
an elevated temperature, nominally 43°C or higher. Since the tissue temperature depends on the balance 
between the heat added by artificial means and the tissue's ability to clear that heat, an understanding 
of the means by which the blood transports heat is essential. This chapter outlines the general problems 
associated with such processes, while more extensive reviews and tutorials on microvascular heat transfer 
may be found elsewhere [1-4]. 

The temperature range of interest for all of these applications is intermediate between freezing and 
boiling, making only sensible heat exchange by conduction and convection as important mechanisms of 
heat transfer. At high and low temperatures, such as those present during laser ablation or electrocautery 
and cryopreservation or cryosurgery, the change of phase and accompanying mass transport present 
problems beyond the scope of this section. See, for example, [5]. 

While the equations that govern heat transport are formally similar to those that govern passive mass 
transport, the processes involve the microvasculature in fundamentally different ways because the thermal 
diffusivity of most tissues is roughly two orders of magnitude greater than the diffusivity for mass 
transport of most mobile species (1.5 x 10~ 7 m 2 /s for heat vs. 1.5 x 10~ 9 m 2 /s for 2 ). Mass transport is 
largely restricted to the smallest blood vessels, the capillaries, arterioles, and venules, whereas heat 
transport occurs in somewhat larger, so-called thermally significant blood vessels with diameters in the 
range from 80 (J.m to 1 mm. The modes of heat transport differ from those of mass transport, not simply 
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because these vessels are larger, but because they have a different geometrical arrangement than the vessels 
primarily responsible for mass transport. Many capillary beds approximate a uniformly spaced array of 
parallel vessels that can be well modeled by the Krogh cylinder model. In contrast, the thermally significant 
vessels are in a tree-like arrangement that typically undergoes several generations of branching within 
the size range of interest and are often found as countercurrent pairs in which the artery and vein may 
be separated by one vessel diameter or less. Moreover, the vascular architecture of the thermally significant 
vessels is less well characterized than that of either the primary mass exchange vessels or the larger, less 
numerous supply vessels that carry blood over large distances in the body. There are too few supply 
vessels to contribute much to the overall energy balance in the tissue, but they are often far from thermal 
equilibrium with the surrounding tissue producing large local perturbations in the tissue temperature. 
Much of the microvascular heat exchange occurs as the blood branches from the larger supply vessels 
into the more numerous and densely spaced thermally significant blood vessels. 

While the details of the vascular architecture for particular organs have been well characterized in 
individual cases, variability among individuals makes the use of such published data valid only in a 
statistical sense. Imaging technology to map the thermally significant blood vessels in any given individual 
is not readily available. 

An additional challenge arises from the spatial and temporal variability of the blood flow in tissue. 
The thermoregulatory system and the metabolic needs of tissues can change the blood perfusion rates 
in some tissues by a factor as great as 15 to 25. 

98.2 Basic Concepts 

For purposes of thermal analysis, vascular tissues are generally assumed to consist of two interacting 
subvolumes, a solid tissue subvolume and a blood subvolume which contains flowing blood. These 
subvolumes thermally interact through the walls of the blood vessels where heat, but little mass, is 
exchanged. Because the tissue subvolume can transport heat by conduction alone, it may be modeled by 
the standard heat diffusion equation [6] 

V-*>Vr ( (M) + ^(r,f) = p A — ^ (98.1) 

where T t is the local tissue temperature, k t is the thermal conductivity of the tissue, q' t " is the rate of 
volumetric heat generation from metabolism or external source, p 8 is the tissue density, and c, is the tissue 
specific heat. The properties used in Eq. (98.1) may be assumed to be bulk properties that average over 
the details of the interstitial fluid, extracellular matrix, and cellular content of the tissue. In the blood 
subvolume, heat may also be transported by advection which adds a blood velocity dependent term as 
given by [6] 

V-k b VT b (?s)-p b c b ^T,t)-VT b (7j) + q;(?j) = p b c b —^ (98.2) 

where u b is the local blood velocity and all other parameters pertain to the local properties of the blood. 
Potential energy, kinetic energy, pulsatility, and viscous dissipation effects are normally taken to be 
negligible. 

At the internal boundary on the vessel walls we expect a continuity of heat flux k b V T b ( r w , t) = fc f V T t ( r w , t) 
and temperature T b (r w , t) = T t {r w , t) where r w represents points on the vessel wall. Few attempts have 
been made to solve Eqs. (98.1) and (98.2) exactly, primarily due to the complexity of the vascular 
architecture and the paucity of data on the blood velocity field in any particular instance. The sections 
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that follow present approaches to the problem of microvascular heat transport that fall broadly into the 
categories of vascular models that consider the response of one or a few blood vessels to their immediate 
surroundings and continuum models that seek to average the effects of many blood vessels to obtain a 
single field equation that may be solved for a local average of the tissue temperature. 

98.3 Vascular Models 

Most vascular models are based on the assumption that the behavior of blood flowing in a blood vessel 
is formally similar to that of a fluid flowing steadily in a roughly circular tube (See Fig. 98.1), that is [7] 



dT °( S ). 



ds 
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(98.3) 



where T a is the mixed mean temperature of the blood for a given vessel cross section, r a is the vessel 
radius, u is mean blood speed in the vessel, q(s) is the rate at which heat conducts into the vessel per 
unit length, and s is the spatial coordinate along the vessel axis. For a vessel that interacts only with a 
cylinder of adjacent tissue, we have 



q'(s)=U'2Kr a (T t (s)-T tt (s)) 



(98.4) 



where If is the overall heat transfer coefficient between the tissue and the blood. Typically, the thermal 
resistance inside the blood vessel is much smaller than that in the tissue cylinder so we may approximate 
U'2%r a ~ kp where the conduction shape factor O relating local tissue temperature T t (s) to the blood 
temperature is given by 
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Equilibration Lengths 

One of the most useful concepts that arises from the simple vascular model presented above is the 
equilibration length L c , which may be defined as the characteristic length over which the blood changes 
temperature from an inlet temperature T ao to eventually equilibrate with tissue at a constant temperature 
T r The solution for Eqs. (98.3) and (98.4) under these conditions is given by 



(98.6) 
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where the equilibration length is given by 

L = ^^ (98.7) 

k t a 

Chen and Holmes [8] found that vessels with diameters of about 175 M.m have an anatomical length 
comparable to their thermal equilibration length, thus making vessels of this approximate size the 
dominant site of tissue-blood heat exchange. Accordingly, these vessels are known as the thermally 
significant blood vessels. Much smaller vessels, while more numerous, carry blood that has already equil- 
ibrated with the surrounding tissue. Much larger vessels, while not in equilibrium with the surrounding 
tissue, are too sparsely spaced to contribute significantly to the overall energy balance [7]. Even though 
the larger vessels do not exchange large quantities of heat with the tissue subvolume, they cannot be 
ignored because these vessels produce large local perturbations to the tissue temperature and form a 
source of blood for tissues that is at a much different temperature than the local tissue temperature. 

Countercurrent Heat Exchange 

Thermally significant blood vessels are frequently found in closely spaced countercurrent pairs. Only a 
slight modification to the preceding formulas is needed for heat exchange between adjacent arteries and 
veins with countercurrent flow [9] 

4H 4^( s H( s )) (98 - 8) 

where T v (s) is the mixed mean temperature in the adjacent vein and the conduction shape factor is given 
approximately by [6] 

2K (98.9) 



cosh 



w~ 



2r r 



where w is the distance between the vessel axes and r v is the radius of the vein. The blood temperatures 
in the artery and vein must be obtained simultaneously, but still yield an equilibration length of the form 
given in Eq. (98.7). Substitution of representative property values, blood speeds, and vessel dimensions 
reveals that countercurrent vessels have equilibration lengths that are about one third that of isolated 
vessels of similar size [9], Based on this observation, the only vessels that participate significantly in the 
overall energy balance in the tissue are those larger than about 50 (Im in diameter. 
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TABLE 98.1 Shape Factors for Various Vascular Geometries 

Geometry Ref. 

Single vessel to skin surface [9] 

Single vessel to tissue cylinder [8] 

Countercurrent vessel to vessel [9, 40] 

Countercurrent vessels to tissue cylinder [24] 

Countercurrent vessels with a thin tissue layer [41] 

Multiple parallel vessels [42] 

Vessels near a junction of vessels [43] 

Typical dimension of blood vessels are available in Chapter 1, 
Tables 1.3 and 1.4 of this handbook. 

The shape factors given above are only rough analytical approximations that do not include the effects 
of finite thermal resistance within the blood vessels and other geometrical effects. The reader is referred 
to Table 98.1 for references that address these issues. 

98.4 Heat Transfer Inside of a Blood Vessel 

A detailed analysis of the heat transfer between the blood vessel wall and the mixed mean temperature 
of the blood can be done using standard heat transfer methods 

q(s)=hndir w (s)-T b ] j (98.10) 

where d is the vessel diameter, T w (s) is the vessel wall temperature, and the convective heat transfer 
coefficient h may be found from Victor and Shah's [10] recommendation that 



_ hd 

'"~Y~ 



A/,,. = "'- = 4 + 0.155exp(l.58log 10 Gz) Gz<10 3 (98.11) 



where Gz is the Graetz number defined as 

Gz= P ^ d2 (98.12) 

k b L 

where L is the vessel length. See also Barozzi and Dumas [11]. 

98.5 Models of Perfused Tissues 
Continuum Models 

Continuum models of microvascular heat transfer are intended to average over the effects of many vessels 
so that the blood velocity field need not be modeled in detail. Such models are usually in the form of a 
modified heat diffusion equation in which the effects of blood perfusion are accounted for by one or 
more additional terms. These equations then can be solved to yield a sort of moving average of the local 
temperature that does not give the details of the temperature field around every individual vessel, but 
provides information on the broad trends in the tissue temperature. The temperature they predict may 
be defined as 
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TABLE 98.2 Representative Thermal Property Values 





Thermal 


Thermal 






Conductivity 


Diffusivity 


Perfusion 


Tissue 


(W/m-K) 


(m 2 /s) 


(m 3 /m 3 -sec) 


Aorta 


0.461 [16] 


1.25 X 10- 7 [16] 




Fat of spleen 


0.3337 [44] 


1.314 X 10- 7 [44] 




Spleen 


0.5394 [44] 


1.444 X 10- 7 [44] 


0.023 [45] 


Pancreas 


0.5417 [44] 


1.702 X 10- 7 [44] 


0.0091 [45] 


Cerebral cortex 


0.5153 [44] 


1.468 X 10- 7 [44] 


0.0067 [46] 


Renal cortex 


0.5466 [44] 


1.470 X 10- 7 [44] 


0.077 [47] 


Myocardium 


0.5367 [44] 


1.474 X 10- 7 [44] 


0.0188 [48] 


Liver 


0.5122 [44] 


1.412 X 10- 7 [44] 


0.0233 [49] 


Lung 


0.4506 [44] 


1.307 X 10- 7 [44] 




Adenocarcinoma of breast 


0.5641 [44] 


1.436 X 10- 7 [44] 




Resting muscle bone 


0.478 [50] 


1.59 X 10- 7 [50] 


0.0007 [48] 


Whole blood (21"C) 


0.492 [50] 


1.19 X 10- 7 [50] 




Plasma (21"C) 


0.570 [50] 


1.21 X 10- 7 [50] 




Water 


0.628 [6] 


1.5136 X 10- 7 [6] 





All conductivities and diffusivities are from humans at 37°C except the value for 
skeletal muscle which is from sheep at 2 1°C. Perfusion values are from various mam- 
mals as noted in the references. Significant digits do not imply accuracy. The tem- 
perature coefficient for thermal conductivity ranges from -0.000254 to 0.0039 W/m- 
K-°C with 0.001265 W/m-K-°C typical of most tissues as compared to 0.001575 W/m- 
K-°C for water [44]. The temperature coefficient for thermal diffusivity ranges from 
-4.9 x 10- 10 m 2 /s-°C to 8.4 x 10- 10 m 2 /s-°C with 5.19 x 10- 10 m 2 /s-°C typical of most 
tissues as compared to 4.73 X 10~ 10 m 2 /s-°C for water [44]. The values provided in 
this table are representative values presented for tutorial purposes. The reader is 
referred to the primary literature for values appropriate for specific design applications. 
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where 8 Vis a volume that is assumed to be large enough to encompass a reasonable number of thermally 
significant blood vessels, but much smaller than the scale of the tissue as a whole. Much of the confusion 
concerning the proper form of the bioheat equation stems from the difficulty in precisely defining such 
a length scale. Unlike a typical porous medium, such as water percolating through sand where the grains 
of sand fall into a relatively narrow range of length scales, blood vessels form a branching structure with 
length scales spaning many orders of magnitude. 

Formulations 

Pennes Heat Sink Model 

In 1948, physiologist Harry Pennes modeled the temperature profile in the human forearm by introducing 
the assumptions that the primary site of equilibration was the capillary bed and that each volume of 
tissue has a supply of arterial blood that is at the core temperature of the body. The Pennes' Bioheat 
equation has the form [12] 



v.fcvr ( (? )f ) + cb fc (7 )f )p i , Cfe (r-r t (?, f ))+q'"(?,f)= pc ^p 



3.14) 



where CO,, is taken to be the blood perfusion rate in volume of blood per unit volume of tissue per unit 
time and T a is an arterial supply temperature which is generally assumed to remain constant and equal 
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to the core temperature of the body, nominally 37°C. The other thermal parameters are taken to be 
effective values that average over the blood and tissue subvolumes. Major advantages of this formulation 
are that it is readily solvable for constant parameter values, requires no anatomical data, and in the 
absence of independent measurement of the actual blood rate and heat generation rate gives two adjust- 
able parameters (C0j,( r,t) and T a ) that can be used to fit the majority of the experimental results available. 
On the downside, the model gives no prediction of the actual details of the vascular temperatures, the 
actual blood perfusion rate is usually unknown and not exactly equal to <B 6 that best fits the thermal 
data, the assumption of constant arterial temperature is not generally valid, and, based on the equilibra- 
tion length studies presented in the previous section, thermal equilibration occurs prior to the capillary 
bed. Despite these weaknesses, the Pennes formulation is the primary choice of modelers. Equilibration 
prior to the capillary bed does not invalidate the model as long as the averaging volume is large enough 
to encompass many vessels of the size in which equilibration does occur and as long as the venous return 
does not exchange significant quantities of heat after leaving the equilibrating vessels. As long as (H b and 
T a are taken as adjustable, curve-fitting parameters rather than literally as the perfusion rate and arterial 
blood temperature, the model may be used fruitfully, provided that the results are interpreted accordingly. 

Directed Perfusion 

Some of the shortcomings of the Pennes model were addressed by Wulff [13] in a formulation that is 

essentially the same as that used for common porous media 

V-kVf t {r,t)-pcu{r,tYvf t {r,t) + (f''(r,tj = pc — ^^ (98.15) 

where u is a velocity averaged over both the tissue and blood subvolumes. Among the difficulties with 
this model are that it is valid only when the tissue and blood are in near-thermal equilibrium and when 
the averaging volume is small enough to prevent adjacent arteries and veins from canceling out their 
contributions to the average velocity, thus erroneously suggesting that the blood perfusion has no net 
effect on the tissue heat transfer. Eq. (98.15) is rarely applied in practical situations, but served as an 
important conceptual challenge to the Pennes formulation in the 1970s and 1980s. 

Effective Conductivity Model 

The oldest continuum formulation is the effective conductivity model 



-I \ I \ dT\r,t\ 

V-k^T\r,t) + q'"(r,t) = p t c t —^ 



(98.16) 



where the effective conductivity is comprised of the intrinsic thermal conductivity of the tissue and a 
perfusion dependent increment. In principle, an effective conductivity can be defined from any known 
heat flow and temperature difference, that is 



eff AT J 



K A J 



3.17) 



where / 1 — I is a function of geometry with dimensions length -1 (for example, Ax/A in a slab geometry). 
Originally introduced as an empirical quantity [14], the effective conductivity has been linked to the 
Pennes formulation in the measurement for blood perfusion rates via small, heated, implanted probes 
[15-17]. In 1985, Weinbaum and Jiji [18] theoretically related the effective conductivity to the blood flow 
and anatomy for a restricted class of tissues and heating conditions which are dominated by a closely 
spaced artery- vein architecture and which can satisfy the constraint [19] 
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at i dh + T ) 

^--- l - v > (98.18 

ds 2 ds 

Here the effective conductivity is a tensor quantity related to the flow and anatomy according to [18] 



eff t 



-2„2 2 4— 2 



1 , g PfcS" r a » cos ft 



^X 
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where the enhancement is in the direction of the vessel axes and where n is the number of artery-vein 
pairs per unit area and (|) is the angle of the vessel axes relative to the temperature gradient. The near 
equilibrium between the tissue and the blood vessels needed for this model to be valid is likely to occur 
only in the outer few millimeters near the skin and when the volumetric heat source is neither intense 
nor localized. Artery-vein pairs have been shown to act like a single highly conductive fiber even when 
the near equilibrium condition in Eq. (98.18) is violated [20]. The radius of the thermally equivalent 
fiber is given by 



\fi 



fiber 



(wr) (98.20) 



and its conductivity is given by 



p,c,M r 3 cosh ' w ■ r 
k n = ^ " ' ' ^-^ (98.21) 

fiber -I v ' 

wk t 

Under these non-equilibrium conditions, the tissue-blood system acts like a fiber-composite material, 
but cannot be well modeled as a single homogeneous material with effective properties. 

Combination 

Recognizing that several mechanisms of heat transport may be at play in tissue, Chen and Holmes [8] 

suggested the following formulation which incorporates the effects discussed above 

V-k eff (r,t)vT\r,t) + <a b (7,t)p h c b (r^ 

where T* is the temperature exiting the last artery that is individually modeled. The primary value of 
this formulation is its conceptual generality. In practice, this formulation is difficult to apply because it 
requires knowledge of a great many adjustable parameters, most of which have not been independently 
measured to date. 

Heat Sink Model with Effectiveness 

Using somewhat different approaches, Brinck and Werner [21] and Weinbaum et al. [22] have proposed 
that the shortcomings of the Pennes model can be overcome by introducing a heat transfer effectiveness 
factor e to modify the heat sink term as follows: 



^■^T ( (r ( f)^r,t)(b i (r ) f)p A jr(r ) t) ( -T,) + f(f ) f) = p A -|^ 



3.23) 
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where < £ < 1. In the Brinck and Werner formulation, £ is a curve-fitting parameter that allows the 
actual (rather than the thermally equivalent) perfusion rate to be used [21]. Weinbaum et al. provide an 
analytical result for £ that is valid for blood vessels smaller than 300 p.m diameter in skeletal muscle [22]. 
In both formulations, £ < 1 arises from the countercurrent heat exchange mechanism that shunts heat 
directly between the artery and vein without requiring the heat-carrying blood to first pass through the 
smaller connecting vessels. Typical values for the effectiveness are in the range 0.6 < £ < 0.8 [22]. 

Multi-Equation Models 

The value of the continuum models is that they do not require a separate solution for the blood 
subvolume. In each continuum formulation, the behavior of the blood vessels is modeled by introducing 
assumptions that allow solution of only a single differential equation. But by solving only one equation, 
all detailed information on the temperature of the blood in individual blood vessels is lost. Several 
investigators have introduced multi-equation models that typically model the tissue, arteries, and veins 
as three separate, but interacting, subvolumes [9,23-27]. As with the other non-Pennes formulations, 
these methods are difficult to apply to particular clinical applications, but provide theoretical insights 
into microvascular heat transfer. 

Vascular Reconstruction Models 

As an alternative to the three equation models, a more complete reconstruction of the vasculature may 
be used along with a scheme for solving the resulting flow, conduction, and advection equations [28-33]. 
Since the reconstructed vasculature is similar to the actual vasculature only in a statistical sense, these 
models provide the mean temperature as predicted by the continuum models, as well as insight into the 
mechanisms of heat transport, the sites of thermal interaction, and the degree of thermal perturbations 
produced by vessels of a given size, but they cannot provide the actual details of the temperature field 
in a given living tissue. These models tend to be computationally intensive due to the high spatial 
resolution needed to account for all of the thermally significant blood vessels. 

98.6 Parameter Values 
Thermal Properties 

The intrinsic thermal properties of tissues depend strongly on their composition. Cooper and Trezek 
[34] recommend the following correlations for thermal conductivity: 

/:=pxlO~ 3 (o.628f , +0.117/ ,. +0.231 /,, )w/m-K; (98.24) 

r \ J water J proteins J fats I I v ' 

specific heat: 



c =4,200/ , + 1,090 /„ ,. +2,300/,, J kg-K; (98.25) 

p ' J water ' J proteins ' J fats J I o ' V / 



and density: 



P = 1 1 7 kg m 3 (98.26) 

/ , /1,000+f ,. /l,540+/ f ,/815 ' 

J water I J proteins j ■'fats / 

where f water , fp rote i m > and ft ats are the mass fractions of water, proteins, and fats, respectively. 
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Thermoregulation 

Humans maintain a nearly constant core temperature through a combination of physiological and 
behavioral responses to the environment. For example, heat loss or gain at the skin surface may be 
modified by changes in the skin blood flow, the rate of sweating, or clothing. In deeper tissues, the 
dependence of the blood perfusion rate, the metabolic heat generation rate, and vessel diameters depend 
on the environmental and physiological conditions in a complex, organ-specific manner. The blood 
perfusion varies widely among tissue types and for some tissues can change dramatically depending on 
the metabolic or thermoregulatory needs of the tissue. The situation is further complicated by the 
feedback control aspects of the thermoregulatory systems that utilize a combination of central and 
peripheral temperature sensors as well as local and more distributed actuators. 

The following examples are provided to illustrate some of the considerations, not to exhaustively 
explore this complicated issue. A model of the whole body is typically needed even for a relatively local 
stimulus, especially when the heat input represents a significant fraction of the whole body heat load. 
The reader is referred to extensive handbook entries on environmental response for more information 
[35, 36]. Whole body models of the thermoregulatory system are discussed in Wissler [37]. 

Chato [1] suggests that the temperature dependence of the blood perfusion effect can be approximated 
by a scalar effective conductivity 



: 4.82-4.44833 



1.00075" 



W m-K 
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which is intended for use in Eq. (98.16). 

Under conditions of local hyperthermia, where the heated volume is small compared to the body as 
a whole, the blood perfusion rate in skin and muscle may increase by roughly nine- or tenfold as the 
tissue warms to about 44°C and then drop again for higher temperatures. In contrast, tumors, the typical 
target of local hyperthermia, thermoregulate erratically and undergo a lesser change in blood flow up to 
42°C and then may drop to near zero for higher temperatures [38]. 

The metabolic rate may also undergo thermoregulatory changes. For example, the temperature depen- 
dence of the metabolism in the leg muscle and skin may be modeled with [39] 



/ \\( T °- T ') 
170(2j L 



>W rrf 



(98.28) 



The metabolic rate and blood flow may also be linked through processes that reflect the fact that 
sustained increased metabolic activity generally requires increased blood flow. 

Clinical Heat Generation 

Thermal therapies such as hyperthermia treatment rely on local heat generation rates several orders of 
magnitude greater than that produced by metabolism. Under these circumstances, the metabolic heat 
generation is often neglected with little error. 

98.7 Solutions of Models 

The steady-state solution with spatially and temporally constant parameter values including the rate of 
heat generation for a tissue half-space with a fixed temperature on the skin T skin is given by 



r. 



;(*)■ 



■T, 



,exp 



f ■ A 1 / 2 



T + 



"W&y 



-exp 



f ■ V/ 2 



(98.29) 
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This solution reveals that perturbations to the tissue temperature decay exponentially with a charac- 
teristic length of 






1/2 



(98.30) 



which for typical values of the perfusion rate d) 6 = 0.1 X 10 3 to 3.0 x 10 3 m 3 /m 3 -sec yields L c = 6.5 x 
10- 3 to 36 x 10- 3 m (Fig. 98.2). 

The transient solution of Pennes' bioheat equation with constant perfusion rate for an initial uniform 
temperature of T , in the absence of any spatial dependence, is 



T, 



;M = rex P 



P{, 



T + 



® b Pb C b 
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I 
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Here the solution reveals a characteristic time scale 



Pf, 
®iPb c b 



(98.32) 



that has typical values in the range of t c = 300 to 10,000 sec (Fig. 98.3). This solution is valid only when 
thermoregulatory changes in the perfusion rate are small or occur over a much longer time than the 
characteristic time scale t r 

Numerical solution of the heat sink model is readily obtained by standard methods such as finite 
differences, finite element, boundary element, and Green's functions provided that the parameter values 
and appropriate boundary conditions are known. 



T t (x) 




FIGURE 98.2 One-dimensional steady-state solution of Pennes bioheat equation for constant parameter values. 
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T t (t) 




FIGURE 98.3 Transient solution of Pennes bioheat heat equation for constant parameter values in the absence of 
spatial effects. 



Defining Terms 

Conduction shape factor: Dimensionless factor used to account for the geometrical effects in steady- 
state heat conduction between surfaces at different temperatures. 

Effective conductivity: A modified thermal conductivity that includes the intrinsic thermal conduc- 
tivity of the tissue as well as a contribution from blood perfusion effects. 

Equilibration length: Characteristic length scale over which blood in a blood vessel will change tem- 
perature in response to surrounding tissue at a different temperature. 

Perfusion rate: Quantity of blood provided to a unit of tissue per unit time. 

Specific heat: Quantity of energy needed for a unit temperature increase for a unit of mass. 

Thermal conductivity: Rate of energy transfer by thermal conduction for a unit temperature gradient 
per unit of cross-sectional area. 

Thermally significant vessel: Blood vessels large enough and numerous enough to contribute signifi- 
cantly to overall heat transfer rates in tissue. 
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Traditional methods for delivering drugs to the brain are inadequate. Many drugs, particularly water- 
soluble or high molecular weight compounds, do not enter the brain following systemic administration 
because they permeate through blood capillaries very slowly. This blood-brain barrier (BBB) severely 
limits the number of drugs that are candidates for treating brain disease. 

Several strategies for increasing the permeability of brain capillaries to drugs have been tested. Since 
the BBB is generally permeable to lipid soluble compounds which can dissolve and diffuse through 
endothelial cell membranes [1, 2], a common approach for enhancing brain delivery of compounds is 
chemical modification to enhance lipid solubility [3]. Unfortunately, lipidization approaches are not 
useful for drugs with molecular weight larger than 1,000. Another approach for increasing permeability 
is the entrapment of drugs in liposomes [4], but delivery may be limited by liposome stability in the 
plasma and uptake at other tissue sites. 

Specific nutrient transport systems in brain capillaries can be used to facilitate drug entry into the 
brain. L-dopa (L-3,4-dihydroxyphenylalanine), a metabolic precursor of dopamine, is transported across 
endothelial cells by the neutral amino acid transport system [5]. L-dopa permeates through capillaries 
into the striatal tissue, where it is decarboxylated to form dopamine. Therefore, systemic administration 
of L-dopa is often beneficial to patients with Parkinson's disease. Certain protein modifications, such as 
cationization [6] and anionization [7], produce enhanced uptake in the brain. Modification of drugs 
[8, 9] by linkage to an anti-transferrin receptor antibody also appears to enhance transport into the brain. 
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This approach depends on receptor-mediated transcytosis of transferrin-receptor complexes by brain 
endothelial cells; substantial uptake also occurs in the liver. 

The permeability of brain capillaries can be transiently increased by intra-arterial injection of the 
hyperosmolar solutions, which disrupt interendothelial tight junctions [10]. But BBB disruption affects 
capillary permeability throughout the brain, enhancing permeability to all compounds in the blood, not 
just the agent of interest. Intraventricular therapy, where agents are administered directly into the CSF 
of the ventricles, results in high concentrations within the brain tissue, but only in regions immediately 
surrounding the ventricles [11, 12]. Because the agent must diffuse into the brain parenchyma from the 
ventricles, and because of the high rate of clearance of agents in the CNS into the peripheral circulation, 
this strategy cannot be used to deliver agents deep into the brain. 

Because of the difficulty in achieving therapeutic drug levels by systemic administration, methods for 
direct administration of drugs into the brain parenchyma have been tested. Drugs can be delivered directly 
into the brain tissue by infusion, implantation of a drug-releasing matrix, or transplantation of drug- 
secreting cells [ 13] . These approaches provide sustained drug delivery that can be limited to specific sites, 
localizing therapy to a brain region. Because these methods provide a localized and continuous source 
of active drug molecules, the total drug dose can be less than needed with systemic administration. With 
polymeric controlled release, the implants can also be designed to protect unreleased drug from degra- 
dation in the body and to permit localization of extremely high doses (up to the solubility of the drug) 
at precisely defined locations in the brain. Infusion systems require periodic refilling; the drug is usually 
stored in a liquid reservoir at body temperature and many drugs are not stable under these conditions. 

This chapter describes the transport of drug molecules that are directly delivered into the brain. For 
purposes of clarity, a specific example is considered: polymeric implants that provide controlled release 
of chemotherapy. The results can be extended to other modes of administration [13, 14] and other types 
of drug agents [15]. 

99.2 Implantable Controlled Delivery Systems 
for Chemotherapy 

The kinetics of drug release from a controlled release system are usually characterized in vitro, by 
measuring the amount of drug released from the matrix into a well-stirred reservoir of phosphate buffered 
water or saline at 37°C. Controlled release profiles for some representative anticancer agents are shown 
in Fig. 99.1; all of the agents selected for these studies — l,3-bis(2-chloroethyl)-l-nitrosourea (BCNU), 
4-HC, cisplatin, and taxol — are used clinically for chemotherapy of brain tumors. The controlled release 
period can vary from several days to many months, depending on properties of the drug, the polymer, 
and the method of formulation. Therefore, the delivery system can be tailored to the therapeutic situation 
by manipulation of implant properties. 

The release of drug molecules from polymer matrices can be regulated by diffusion of drug through 
the polymer matrix or degradation of the polymer matrix. In many cases (including the release of BCNU, 
cisplatin, and 4HC from the degradable matrices shown in Fig. 99.1), drug release from biodegradable 
polymers appears to be diffusion-mediated, probably because the time for polymer degradation is longer 
than the time required for drug diffusion through the polymer. In certain cases linear release, which 
appears to correlate with the polymer degradation rate, can be achieved; this might be the case for taxol 
release from the biodegradable matrix (Fig. 99.1), although the exceedingly low solubility of taxol in 
water may also contribute substantially to the slowness of release. 

For diffusion-mediated release, the amount of drug released from the polymer is proportional to the 
concentration gradient of the drug in the polymer. By performing a mass balance on drug molecules 
within a differential volume element in the polymer matrix, a conservation equation for drug within the 
matrix is obtained: 

— t = DV 2 C„ (99.1) 

dt p p 
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FIGURE 99.1 Controlled release of anticancer compounds from polymeric matrices, (a) Release of cisplatin (circles) 
from a biodegradable copolymer of fatty acid dimers and sebacic acid, p(FAD:SA), initially containing 10% drug 
(see [32, 33] for details), (b) Release of BCNU from EVAc matrices (circles), polyanhydride matrices p(CPP:SA) 
(squares), and p(FAD:SA) (triangles) matrices initially containing 20% drug, (c) Release of BCNU (squares), 4HC 
(circles), and taxol (triangles) from p(CPP:SA) matrices initially containing 20% drug. Note that panel (c) has two 
time axes: the lower axis applies to the release of taxol and the upper axis applies to the release of BCNU and 4HC. 



where C„ is the local concentration of drug in the polymer and D„ is the diffusion coefficient of the drug 
in the polymer matrix. This equation can be solved, with appropriate boundary and initial conditions, 
to obtain the cumulative mass of drug released as a function of time; the details of this procedure are 
described elsewhere [16]. A useful approximate solution, which is valid for the initial 60% of release, is: 



M =AM 



D. t 
>-p 

kL 2 



(99.2) 



where M t is the cumulative mass of drug released from the matrix, M is the initial mass of drug in the 
matrix, and L is the thickness of the implant. By comparing Eq. (99.2) to the experimentally determined 
profiles, the rate of diffusion of the agent in the polymer matrix can be estimated (Table 99.1). 



TABLE 99.1 Diffusion Coefficients for Chemotherapy Drug 
Release from Biocompatible Polymer Matrices" 



Drug 


Polymer 


Initial loading 


(%) 


Dp (cm2/sec) 


Cisplatin 


P(FAD:SA) 


10 




6.8 x 10-" 


BCNU 


EVAc 


20 




1.6 x 10- 8 


BCNU 


P(FAD:SA) 


20 




6.9 x 10- 8 


BCNU 


P(CPP:SA) 


20 




2.3 x 10- 8 (panel b) 
2.0 x 10- 8 (panel c) 


4HC 


P(CPP:SA) 


20 




3.1 x 10- 10 


Taxol 


P(CPP:SA) 


20 




n.a. 



a Diffusion coefficients were obtained by comparing the experimental 
data show in Fig. 99.1 to Eq. (99.2) and determining the best value of 
the diffusion coefficient to represent the data. Abbreviations: not appli- 
cable (n.a.). 
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99.3 Drug Transport After Release from the Implant 

Bypassing the BBB is necessary, but not sufficient for effective drug delivery. Consider the consequences 
of implanting a delivery system, such as one of the materials characterized above, within the brain. 
Molecules released into the interstitial fluid in the brain extracellular space must penetrate into the brain 
tissue to reach tumor cells distant from the implanted device. Before these drug molecules can reach the 
target site, however, they might be eliminated from the interstitium by partitioning into brain capillaries 
or cells, entering the cerebrospinal fluid, or being inactivated by extracellular enzymes. Elimination always 
accompanies dispersion; therefore, regardless of the design of the delivery system, one must understand 
the dynamics of both processes in order to predict the spatial pattern of drug distribution after delivery. 

The polymer implant is surrounded by biological tissue, composed of cells and an extracellular space 
(ECS) filled with extracellular fluid (ECF). Immediately following implantation, drug molecules escape 
from the polymer and penetrate the tissue. Once in the brain tissue, drug molecules (1) diffuse through 
the tortuous ECS in the tissue, (2) diffuse across semipermeable tissue capillaries to enter the systemic 
circulation and, therefore, are removed from the brain tissue, (3) diffuse across cell membranes by passive, 
active, or facilitated transport paths, to enter the intracellular space, (4) transform, spontaneously or by 
an enzyme-mediated pathway, into other compounds, and (5) bind to fixed elements in the tissue. Each 
of these events influence drug therapy: diffusion through the ECS is the primary mechanism of drug 
distribution in brain tissue; elimination of the drug occurs when it is removed from the ECF or trans- 
formed; and binding or internalization may slow the progress of the drug through the tissue. 

A mass balance on a differential volume element in the tissue [17] gives a general equation describing 
drug transport in the region near the polymer [18]: 



at v ' at 



(C,)-M (9,..) 



where C is the concentration of the diffusible drug in the tissue surrounding the implant (g/cm 3 tissue), 
v is the fluid velocity (cm/sec), D b is the diffusion coefficient of the drug in the tissue (cm 2 /sec), R e (C) 
is the rate of drug elimination from the ECF (g/sec-cm 3 tissue), B is the concentration of drug bound or 
internalized in cells (g/cm 3 tissue), and t is the time following implantation. 1 In deriving this equation, 
the conventions developed by Nicholson [20], based on volume-averaging in a complex medium, and 
Patlak and Fenstermacher [18] were combined. In this version of the equation, the concentrations Cand 
B and the elimination rate R e (C) are defined per unit volume of tissue. D b is an effective diffusion 
coefficient, which must be corrected from the diffusion coefficient for the drug in water to account for 
the tortuosity of the ECS. 

When the binding reactions are rapid, the amount of intracellular or bound drug can be assumed to 
be directly proportional, with an equilibrium coefficient K bind , to the amount of drug available for 
internalization or binding: 

B=K bmd C (99.4) 

Substitution of Eq. (99.4) into Eq. (99.3) yields, with some simplification: 



dC. 

a 



*- = — *—(D b V 2 C t + R(C t )-v*VC t ) (99.5) 



'An alternate form of this equation, which accounts more rigorously for transfer of drug between different phases 
in the tissue, is also available [19]., 
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The drug elimination rate, R e (C), can be expanded into the following terms: 



.(4 



Kbb 



N V c 

Y^+K£ t (99-6) 



plasma 
V ccs 



where k bbb is the permeability of the BBB (defined based on concentration in the ECS), C phsma is the 
concentration of drug in the blood plasma, V max and K m are Michaelis-Menton constants, and k ne is a first 
order rate constant for drug elimination due to non-enzymatic reactions. For any particular drug, some 
of the rate constants may be very small, reflecting the relative importance of each mechanism of drug 
elimination. If it is assumed that the permeability of the BBB is low (C„ ( <§ C) and the concentration of 
drug in the brain is sufficiently low so that any enzymatic reactions are in the first order regime ( C <§ 
K m ), Eq. (99.6) can be reduced to: 



,(4 



k V 

-JMc+^^C+k C=kC t (99.7) 



where k is a lumped first order rate constant. With these assumptions, Eq. (99.4) can be simplified by 
definition of an apparent diffusion coefficient, D*, and an apparent first order elimination constant, k*: 

7)C v»vr 

-^- = D*V 2 C+k*C-- -±- (99.8) 

dt l + K u 



hi i hi 



k D, 
where k* = — — — and D = — 



Boundary and initial conditions are required for solution of differential Eq. (99.8). If a spherical 
implant of radius R is implanted into a homogeneous region of the brain, at a site sufficiently far from 
anatomical boundaries, the following assumptions are reasonable: 

C t = 0iort = 0;r>R (99.9) 

C t = C,fort>0; r=R (99.10) 

C t = 0fort>0;r^oo (99.11) 

In many situations, drug transport due to bulk flow can be neglected. This assumption (v is zero) is 
common in previous studies of drug distribution in brain tissue [18]. For example, in a previous study 
of cisplatin distribution following continuous infusion into the brain, the effects of bulk flow were found 
to be small, except within 0.5 mm of the site of infusion [21], In the cases considered here, since drug 
molecules enter the tissue by diffusion from the polymer implant, not by pressure-driven flow of a fluid, 
no flow should be introduced by the presence of the polymer. With fluid convection assumed negligible, 
the general governing equation in the tissue, Eq. (99.8), reduces to: 

— >-=D*V 2 C f + k*C, (99.12) 

dt 

The no-flow assumption may be inappropriate in certain situations. In brain tumors, edema and fluid 
movement are significant components of the disease. In addition, some drugs can elicit cytotoxic edema. 
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Certain drug/polymer combinations can also release drugs in sufficient quantity to create density-induced 
fluid convection. 

Equation (99.12), with conditions (99.9) through (99.11), can be solved by Laplace transform tech- 
niques [13] to yield: 



C It, 



exp 



-4>(S-l)]erfc 



2a/ t 



-fVT 



+ exp 



^-l) 



erfc 






+ <|Wt 



(99.13) 



where the dimensionless variables are defined as follows: 



r D*t , „ | fc* 

— ;x = — — ;^)=i? J — 
i? £ 2 \D* 



(99.14) 



The differential equation also has a steady-state solution, which is obtained by solving Eq. (99.12) with 
the time derivative set equal to zero and subject to the boundary conditions (99.9) and (99.10): 



C. 



C 



exp 



-*(H 



(99.15) 



Figure 99.2 shows concentration profiles calculated using Eqs. (99.13) and (99.15). In this situation, 
which was obtained using reasonable values for all of the parameters, steady-state is reached approxi- 
mately 1 h after implantation of the delivery device. The time required to achieve steady-state depends 
on the rate of diffusion and elimination, as previously described [22], but will be significantly less than 
24 h for most drug molecules. 



Steady-state 




0.5 1 1.5 

Distance from implant (mm) 



0.5 1 1.5 2 

Distance from implant (mm) 



FIGURE 99.2 Concentration profiles after implantation of a spherical drug-releasing implant. (Panel a, Transient) 
Solid lines represent the transient solution to Eq. (99.12) [i.e., Eq. (99.13)] with the following parameter values: D* = 
4 x 10~ 7 cm 2 /s; R = 0.032 cm; k* = 1.9 x 10~ 4 S"" 1 (t 1/2 = 1 h). The dashed line represents the steady-state solution [i.e., 
Eq. (99.15)] for the same parameters. (Panel b, Steady-state) Solid lines in this plot represent Eq. (99.15) with the 
following parameters: D* = 4x 10~ 7 cm 2 /s; R = 0.032 cm. Each curve represents the steady-state concentration profile 
for drugs with different elimination half-lives in the brain, corresponding to different dimensionless moduli, (j>: t 1/2 = 
10 min (<|> = 1.7); 1 h (0.7); 34 h (0.12); and 190 h (0.016). 
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99.4 Application of Diffusion-Elimination Models 
to Intracranial BCNU Delivery Systems 

The preceding mathematical analysis, which assumes diffusion and first-order elimination in the tissue, 
agrees well with experimental concentration profiles obtained after implantation of controlled release 
polymers (Fig. 99.3). At 3, 7, and 14 d after implantation of a BCNU-releasing implant, the concentration 
profile at the site of the implant was very similar. The parameter values (obtained by fitting Eq. (99.15) 
to the experimental data) were consistent with parameters obtained using other methods [19], suggesting 
that diffusion and first-order elimination were sufficient to account for the pattern of drug concentration 
observed during this period. Parameter values were similar at 3, 7, and 14 d, indicating that the rates of 
drug release, drug dispersion, and drug elimination did not change during this period. This equation 
has been compared to concentration profiles measured for a variety of molecules delivered by polymer 
implants to the brain — dexamethasone [22], molecular weight fractions of dextran [23], nerve growth 
factor in rats [24, 25], BCNU in rats [19], rabbits [26], and monkeys [27]. In each of these cases, the 
steady-state diffusion-elimination model appears to capture most of the important features of drug 
transport. 

This model can be used to develop guidelines for the design of intracranial delivery systems. Table 99.2 
lists some of the important physical and biological characteristics of a few compounds that have been 
considered for interstitial delivery to treat brain tumors. When the implant is surrounded by tissue, the 
maximum rate of drug release is determined by the solubility of the drug, C 5 , and the rate of diffusive 
transport through the tissue: 



f dM^ 



dt 



3C. 



= [Maximum flux\ x [Surface area\ = —D* — '- 



471 R 2 



(99.16) 




5 6 12 3 12 

Distance from polymer/tissue interface (mm) 



FIGURE 99.3 Concentration profiles after implantation of a BCNU-releasing implant. Solutions to Eq. (99.15) were 
compared to experimental data obtained by quantitative autoradiographic techniques. The solid lines in the three 
panels labeled 3, 7, and 14 d were all obtained using the following parameters: R = 0.15 cm; <|) = 2.1; and C, = 
0.81 mM. The solid line in the panel labeled 1 d was obtained using the following parameters: R = 0.15 cm; ty = 0.7; 
and C, = 1.9 mM. Modified from [19]. 
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TABLE 99.2 Implant Design Applied to Three Chemotherapy Compounds" 









BCNU 


4HC 


Methotrexate 


Molecular weight (daltons) 






214 


293 


454 


Solubility (mM) 






12 


100 


100 


l°gio K 






1.53 


0.6 


-1.85 


k*(h) 






70 


70 


1 


D* (10- 7 cm 2 /sec) 






14 


14 


5 


Toxic concentration in culture (|XM) 






25 


10 


0.04 


Maximum release rate (mg/d) 






1.2 


14 


17 


Implant lifetime at max rate (d) 






0.85 


0.07 


0.06 


Maximum concentration in tissue for 1 -week- releasing 


implant 


(mM) 


1.5 


1.1 


1.8 


RT (mm) 






1.3 


2.5 


5 



' K is the octanol: water partition coefficient, k is the rate of elimination due to permeation through capillaries, 
D b is the diffusion coefficient of the drug in the brain. The following values are assumed, consistent with our 
results from polymer delivery to rats and rabbits: radius of spherical implant, R = 1.5 mm; mass of implant, 
M = 10 mg; drug loading in implant, Load = 10%. 

Evaluating the derivative in Eq. (99.16) from the steady-state concentration profile [Eq. (99.15)] yields: 



'dM,^ 



dt 



-8nD*CR 



(99.17) 



Regardless of the properties of the implant, it is not possible to release drug into the tissue at a rate 
faster than determined by Eq. (99.17). If the release rate from the implant is less than this maximum 
rate, Q (the concentration in the tissue immediately outside the implant) is less than the saturation 
concentration, C s . The actual concentration Q can be determined by balancing the release rate from the 
implant (dM t /dt, which can be determined from Eq. (99.2) provided that diffusion is the mechanism of 
release from the implant) with the rate of penetration into the tissue obtained by substituting C, for C s 
in Eq. (99.17): 



C* 



dM. 



dt 



1 



8nD*R 



(99.18) 



The effective region of therapy can be determined by calculating the distance from the surface of the 
implant to the point where the concentration drops below the cytotoxic level (estimated as the cytotoxic 
concentration determined from in vitro experiments). Using Eq. (99.15), and defining the radial distance 
for effective treatment as R T , yields: 



cytotoxic R _ j K 

— = — exp<-K.i — 

q r t ' \ D' 



R 



(99.19) 



Alternately, an effective penetration distance, d p , can be defined as the radial position at which the 
drug concentration has dropped to 10% of the peak concentration: 



D j i 1 t* 

0.10 = — exp^-flj— 
d B \ \D* 



R 



(99.20) 



These equations provide quantitative criteria for evaluating the suitability of chemotherapy agents for 
direct intracranial delivery (Table 99.2). 



B 2000 by CRC Press LLC 



99.5 Limitations and Extensions of the 
Diffusion-Elimination Model 



Failure of the Model in Certain Situations 

The previous section outlined one method for analysis of drug transport after implantation of a drug- 
releasing device. A simple pseudo-steady-state equation [Eq. (99.15)] yielded simple guidelines 
[Eq. (99.16 through 99.19] for device design. Because the assumptions of the model were satisfied over 
a substantial fraction of the release period (days 3 to 14, based on the data shown in Fig. 99.3), this 
analysis may be useful for predicting the effects of BCNU release from biodegradable implants. Pseudo- 
steady-state assumptions are reasonable during this period of drug release, presumably because the time 
required to achieve steady-state (which is on the order of minutes) is much less than the characteristic 
time associated with changes in the rate of BCNU release from the implant (which is on the order of days) . 
But experimental concentration profiles measured 1 d after implantation were noticeably different: 
the peak concentration was substantially higher and the drug penetration into the surrounding tissue 
was deeper (see the left-hand panel of Fig. 99.3). This behavior cannot be easily explained by the pseudo- 
steady-state models described above. For example, if the difference observed at 1 d represents transient 
behavior, the concentration observed at a fixed radial position should increase with time (Fig. 99.2); in 
contrast, the experimental concentration at any radial position on Day 1 is higher than the concentration 
measured at that same position on subsequent days. 

Effect of Drug Release Rate 

Alternately, the observed difference at 1 d might be due to variability in the rate of BCNU release from 
the polymer implant over this period, with transport characteristics in the tissue remaining constant. 
When similar BCNU-releasing implants are tested in vitro, the rate of drug release did decrease over time 
(Fig. 99.1). Equation (99.18) predicts the variation in peak concentration with release rate; the twofold 
higher concentration observed at the interface on Day 1 (as compared to Days 3 through 14) could be 
explained by a twofold higher release rate on Day 1. But the effective penetration distance, d p , does not 
depend on release rate. Experimentally measured penetration distances are ~1.4 mm on Days 3, 7, and 
14 and ~5 mm on Day 1. This difference in penetration is shown more clearly in the Day 1 panel of 
Fig. 99.3: the dashed line shows the predicted concentration profile if k* and D* were assumed equal to 
the values obtained for Days 3, 7, and 14. Changes in the rate of BCNU release are insufficient to explain 
the differences observed experimentally. 

Determinants of Tissue Penetration 

Penetration of BCNU is enhanced at Day 1 relative to penetration at Days 3, 7, and 14. For an implant 
of fixed size, penetration depends only on the ratio of elimination rate to diffusion rate: k*7D*. Increased 
penetration results from a decrease in this ratio (Fig. 99.2), which could occur because of either a 
decreased rate of elimination (smaller k*) or an increased rate of diffusion (larger D*). But there are no 
good reasons to believe that BCNU diffusion or elimination are different on Day 1 than on Days 3 through 
14. With its high lipid solubility, BCNU can diffuse readily through brain tissue. In addition, elimination 
of BCNU from the brain occurs predominantly by partitioning into the circulation; since BCNU can 
permeate the capillary wall by diffusion, elimination is not a saturable process. Perhaps the enhanced 
penetration of BCNU is due to the presence of another process for drug dispersion, such as bulk fluid 
flow, which was neglected in the previous analysis. 

The diffusion/elimination model compares favorably with available experimental data, but the assump- 
tions used in predicting concentration profiles in the brain may not be appropriate in all cases. Deviations 
from the predicted concentration profiles may occur due to extracellular fluid flows in the brain, com- 
plicated patterns of drug binding, or multistep elimination pathways. The motion of interstitial fluid in 
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the vicinity of the polymer and the tumor periphery may not always be negligible, as mentioned above. 
The interstitial fluid velocity is proportional to the pressure gradient in the interstitium; higher interstitial 
pressure in tumors — due to tumor cell proliferation, high vascular permeability, and the absence of 
functioning lymphatic vessels — may lead to steep interstitial pressure gradients at the periphery of the 
tumor [28]. As a result, interstitial fluid flows within the tumor may influence drug transport. A drug 
at the periphery of the tumor must overcome outward convection to diffuse into the tumor. Similarly, 
local edema after surgical implantation of the polymer may cause significant fluid movement in the 
vicinity of the polymer. More complete mathematical models that include the convective contribution 
to drug transport are required. 

Effect of Fluid Convection 

When bulk fluid flow is present (v # 0), concentration profiles can be predicted from Eq. (99.8), subject 
to the same boundary and initial conditions [Eqs. (99.9) through (99.11)]. In addition to Eq. (99.8), 
continuity equations for water are needed to determine the variation of fluid velocity in the radial 
direction. This set of equations has been used to describe concentration profiles during microinfusion 
of drugs into the brain [14]. Relative concentrations were predicted by assuming that the brain behaves 
as a porous medium (i.e., velocity is related to pressure gradient by Darcy's law). Water introduced into 
the brain can expand the interstitial space; this effect is balanced by the flow of water in the radial direction 
away from the infusion source and, to a lesser extent, by the movement of water across the capillary wall. 
In the presence of fluid flow, penetration of drug away from the source is enhanced (Fig. 99.4). The 
extent of penetration depends on the velocity of the flow and the rate of elimination of the drug. These 
calculations were performed for macromolecular drugs, which have limited permeability across the brain 
capillary wall. The curves indicate steady-state concentration profiles for three different proteins with 



33.5 h 




10 15 

Distance from implant (mm) 



i i i 

20 



FIGURE 99.4 Concentration profiles predicted in the absence (solid lines) and presence (dashed lines) of interstitial 
fluid flow. Solid lines were obtained from Eq. (99.15) with the following parameter values: R = 0.032 cm; D = 4 x 
10~ 7 cm 2 /sec; and k* = ln(2)/t 1/2 where t 1/2 is either 10 min, 1 h, or 33.5 h as indicated on the graph. Dashed lines 
were obtained from [14] using the same parameter values and an infusion rate of 3 uL/min. The dashed line indicating 
the interstitial flow calculation for the long-lived drug (t 1/2 = 33.5 h) was not at steady-state, but at 12 h after initiation 
of the flow. 
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TABLE 99.3 Interstitial Fluid Velocity as a Function 
of Radial Position During Microinfusion" 

Radial Position (mm) Interstitial Velocity (urn/sec) 

2 5.0 

5 0.8 

10 0.2 

20 0.05 

"Calculated by method reported in [14], 



metabolic half-lives of 10 min, 1 h, or 33.5 h. In the absence of fluid flow, drugs with longer half-lives 
penetrate deeper into the tissue [solid lines in Fig. 99.4 were obtained from Eq. (99.15)]. This effect is 
amplified by the presence of flow (dashed lines in Fig. 99.4). 

During microinfusion, drug is introduced by pressure-driven fluid flow from a small catheter. There- 
fore, pressure gradients are produced in the brain interstitial space, which lead to fluid flow through the 
porous brain microenvironment. Volumetric infusion rates of 3 pL/min were assumed in the calculations 
reproduced in Fig. 99.4. Since loss of water through the brain vasculature is small, the velocity can be 
determined as a function of radial position: 



v r = — ^— (99.21) 

47tr 2 8 

where q is the volumetric infusion rate and £ is the volume fraction of the interstitial space in the brain 
(approximately 0.20). Fluid velocity decreases with radial distance from the implant (Table 99.3); but at 
all locations within the first 20 mm of the implant site, predicted velocity was much greater than the 
velocities reported previously during edema or tumor growth in the brain. 

The profiles predicted in Fig. 99.4 were associated with the introduction of substantial volumes of 
fluid at the delivery site. Flow- related phenomena are probably less important in drug delivery by polymer 
implants. Still, this model provides useful guidelines for predicting the influence of fluid flow on local 
rates of drug movement. Clearly, the effect of flow velocity on drug distribution is substantial (Fig. 99.4). 
Even relatively low flows, perhaps as small as 0.03 pm/sec, are large enough to account for the enhance- 
ment in BCNU penetration observed at Day 1 in Fig. 99.3. 

Effect of Metabolism 

The metabolism, elimination, and binding of drug are assumed to be first order processes in our simple 
analysis. This assumption may not be realistic in all cases, especially for complex agents such as proteins. 
The metabolism of drugs in normal and tumor tissues is incompletely understood. Other cellular factors 
(e.g., the heterogeneity of tumor-associated antigen expression and multidrug resistance) that influence 
the uptake of therapeutic agents may not be accounted for by our simple first order elimination. 

Finally, changes in the brain that occur during the course of therapy are not properly considered in 
this model. Irradiation can be safely administered when a BCNU-loaded polymer has been implanted 
in monkey brains, suggesting the feasibility of adjuvant radiotherapy. However, irradiation also causes 
necrosis in the brain. The necrotic region has a lower perfusion rate and interstitial pressure than tumor 
tissue; thus, the convective interstitial flow due to fluid leakage is expected to be smaller. Interstitial 
diffusion of macromolecules is higher in tumor tissue than in normal tissue, as the tumor tissue has 
larger interstitial spaces [29]. The progressive changes in tissue properties — due to changes in tumor size, 
irradiation, and activity of chemotherapy agent — may be an important determinant of drug transport 
and effectiveness of therapy in the clinical situation. 
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99.6 New Approaches to Drug Delivery 
Suggested by Modeling 



Mathematical models, which describe the transport of drug following controlled delivery, can predict 
the penetration distance of drug and the local concentration of drug as a function of time and location. 
The calculations indicate that drugs with slow elimination will penetrate deeper into the tissue. The 
modulus (|), which represents the ratio of elimination to diffusion rates in the tissue, provides a quantitative 
criterion for selecting agents for interstitial delivery. For example, high molecular weight dextrans were 
retained longer in the brain space, and penetrated a larger region of the brain than low molecular weight 
molecules following release from an intracranial implant [23]. This suggests a strategy for modifying 
molecules to improve their tissue penetration by conjugating active drug molecules to inert polymeric 
carriers. For conjugated drugs, the extent of penetration should depend on the modulus (|) for the 
conjugated compound as well as the degree of stability of the drug-carrier linkage. 

The effects of conjugation and stability of the linkage between drug and carrier on enhancing tissue 
penetration in the brain have been studied in a model system [30]. Methotrexate (MTX)-dextran con- 
jugates with different dissociation rates were produced by linking MTX to dextran (molecular weight 
70,000) through a short-lived ester bond (half-life ~ 3 d) and a longer-lived amide bond (half-life > 20 d). 
The extent of penetration for MTX-dextran conjugates was studied in three-dimensional human brain 
tumor cell cultures; penetration was significantly enhanced for MTX-dextran conjugates and the 
increased penetration was correlated with the stability of the linkage. These results suggest that modifi- 
cation of existing drugs may increase their efficacy against brain tumors when delivered directly to the 
brain interstitium. 



99.7 Conclusion 



Controlled release polymer implants are a useful new technology for delivering drugs directly to the brain 
interstitium. This approach is already in clinical use for treatment of tumors [31], and could soon impact 
treatment of other diseases. The mathematical models described in this paper provide a rational frame- 
work for analyzing drug distribution after delivery. These models describe the behavior of chemotherapy 
compounds very well and allow prediction of the effect of changing properties of the implant or the 
drug. More complex models are needed to describe the behavior of macromolecules, which encounter 
multiple modes of elimination and metabolism and are subject to the effects of fluid flow. In addition, 
variations on this approach may be useful for analyzing drug delivery in other situations. 
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Atherosclerosis is a disease of the large arteries which involves a characteristic accumulation of high 
molecular weight lipoprotein in the arterial wall [1]. The disease tends to be localized in regions of 
curvature and branching in arteries where fluid shear stress (shear rate) is altered from its normal patterns 
in straight vessels [2]. The possible role of fluid mechanics in the localization of atherosclerosis has been 
debated for many years [3,4]. One possibility considered early on was that the blood phase resistance to 
lipid transport, which could be affected by local fluid mechanics, played a role in the focal accumulation 
of lipid in arteries. Studies by Caro and Nerem [5], however, showed that the uptake of lipid in arteries 
could not be correlated with fluid phase mass transport, leading to the conclusion that the wall (endo- 
thelium) and not the blood, was the limiting resistance to lipid transport. This suggested that fluid 
mechanical effects on macromolecular transport were the result of direct mechanical influences on the 
transport characteristics of the endothelium. 
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Blood 




Tissue 



FIGURE 100.1 Schematic diagram of arterial wall transport processes showing the concentration profile of a solute 
which is being transported from the blood, where its bulk concentration is C b> to the surface of the endothelium, 
where its concentration is C s , then across the endothelium, where the subendothelial concentration is C w , and finally 
to a minimum value within the tissue, C min . Transport of the solute in the blood phase is characterized by the mass 
transport coefficient, k L ; consumption of the solute at the endothelial surface is described by a first-order reaction 
with rate constant, k^; movement of the solute across the endothelium depends on the permeability coefficient, Pe; 
and reaction of the solute within the tissue volume is quantified by a zeroeth order consumption rate, Q. 

While the transport of large molecules such as low density lipoprotein (LDL) and other high molecular 
weight materials, which are highly impeded by the endothelium, may be limited by the wall and not the 
fluid (blood), other low molecular weight species which undergo rapid reaction on the endothelial surface 
(e.g., adenosine triphosphate — ATP) or which are consumed rapidly by the underlying tissue (e.g, oxygen) 
may be limited by the fluid phase. With these possibilities in mind, the purpose of this short review is 
to compare the rates of transport in the blood phase to the rates of reaction on the endothelial surface, 
the rates of transport across the endothelium, and the rates of consumption within the wall of several 
important biomolecules. It will then be possible to assess quantitatively the importance of fluid phase 
transport; to determine which molecules are likely to be affected by local fluid mechanics; to determine 
where in blood vessels these influences are most likely to be manifest; and finally, to speculate about the 
role of fluid phase mass transport in the localization of atherosclerosis. 

100.1 Steady-State Transport Modeling 

Reactive Surface 

Referring to Fig. 100.1, we will assume that the species of interest is transported from the blood vessel 
lumen, where its bulk concentration is C b , to the blood vessel surface, where its concentration is C s , by 
a convective-diffusive mechanism which depends on the local fluid mechanics and can be characterized 
by a fluid-phase mass transfer coefficient k L (see [6] for further background). The species flux in the 
blood phase is given by 



l 



: k L (c b 



:C. 



(100.1) 



At the endothelial surface, the species may undergo an enzyme-catalyzed surface reaction (e.g., the 
hydrolysis of ATP to ADP) which can be modeled using classical Michaelis-Menten kinetics with a rate 
given by 



B 2000 by CRC Press LLC 



V c 

y max s 

k +C 



(100.2) 



where V mM is the maximum rate (high C s ) and k m is the Michaelis constant. When C s <^ k m , as is often 
the case, then the reaction rate is pseudo-first order 



V = k C 



(100.3) 



with the rate constant for the surface reaction given by k r = V max /k m . 

At steady state, the transport to the surface is balanced by the consumption at the surface so that 



k L ( C b- c s ) = k r c s 



(100.4) 



It will be convenient to cast this equation into a dimensionless form by multiplying it by d/D, where d 
is the vessel diameter and D is the diffusion coefficient of the transported species in blood, or the media 
of interest. Equation (100.4) then becomes 



Sh(c b -C s )=Da r C s 



(100.5) 



where 



Sh.M 
D 



is the Sherwood number (dimensionless mass transfer coefficient), and 



(100.6) 



Da 



k d 

r 

D 



(100.7) 



is the Damkholer number (dimensionless reaction rate coefficient). Solving Eq. (100.5) for the surface 
concentration one finds 



c/c 



1 

1+Da /Sh 



(100. 



When Da r < Sh, 



C =C, 



(100.9) 



and the process is termed "wall-limited" or "reaction-limited". On the other hand, when Da r > Sh, 

(100.10) 



C 



'Sh^ 



v Da r y 



C„ 



and the process is termed "transport-limited"or "fluid phase-limited". It is in this transport-limited case 
that the surface concentration, and in turn the surface reaction rate, depend on the fluid mechanics 
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which determines the Sherwood number. It will therefore be useful to compare the magnitudes of Da r 
and Sh to determine whether fluid mechanics plays a role in the overall transport process of a surface 
reactive species. 

Permeable Surface 

Many species will permeate the endothelium without reacting at the luminal surface (e.g., albumin, LDL) 
and their rate of transport (flux) across the surface layer can be described by 



J s = Pe(C s -C w ) (100.11) 



where Pe is the endothelial permeability coefficient and C w is the wall concentration beneath the endot- 
helium. If the resistance to transport offered by the endothelium is significant, then it will be reasonable 
to assume 

C w ^C s (100.12) 

so that at steady state when the fluid and surface fluxes balance, 



^(c b -c s ) = 



PeC (100.13) 



Multiplying Eq. (100.13) by d/D to introduce dimensionless parameters and then solving for the surface 
concentration leads to 



C/C, = — , — (100.14) 

s/ b 1 + Da e /Sh 

where Sh was defined in Eq. (100.6) and 

Ped 

Da = (100.15) 

e D 

is a Damkholer number based on endothelial permeability. Equation (100.14) shows that when Da e <§ 
Sh, the transport process is again "wall-limited". When Da e > Sh, fluid mechanics again becomes 
important through the Sherwood number. 

Reactive Wall 

Oxygen is transported readily across the endothelium (Heliums), but unlike most proteins, is rapidly 
consumed by the underlying tissue. In this case it is fair to neglect the endothelial transport resistance 
(assume C w = C s ), and then by equating the rate of transport to the wall with the (zeroeth order) 
consumption rate within the wall we obtain 



k 



(c b -C s ) = §T (100.16) 



where Q is the tissue consumption rate and T is the tissue thickness (distance from the surface to the 
minimum tissue concentration — see Fig. 100.1). For the specific case of 2 transport, it is conventional 
to replace concentration (C) with partial pressure (P) through the Henry's law relationship C = KP, where 
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K is the Henry's law constant. Invoking this relationship and rearranging Eq. (100.16) into a convenient 
dimensionless form, we obtain 

P Da , 

-*- = l ^ (100.17) 

P b Sh 

where Sh was defined in Eq. (100.6), and Da w is another Damkholer number based on the wall consump- 
tion rate 

(°)Td 
Da = _^± a _ (100.18) 

w KDP b 

Clearly when Da„ <§ Sh, the process is wall limited. But, as Da„ —> Sh, the process becomes limited 
by transport in the fluid phase (P s — > 0), and fluid mechanics plays a role. Because we are treating the 
tissue consumption rate as a zeroeth order reaction, the case Da„ > Sh is not meaningful (P s < O). In 
reality, as Sh is reduced, the tissue consumption rate must be reduced due to the lack of oxygen supply 
from the blood. 

100.2 Damkholer Numbers for Important Solutes 

A wide range of Damkholer numbers characterize the transport of biomolecular solutes in vessel walls 
of the cardiovascular system, and in this section we focus on four important species as examples of typical 
biotransport processes: adenosine triphosphate (ATP), a species that reacts vigorously on the endothelial 
surface, albumin, and low density lipoprotein (LDL), species which are transported across a permeable 
endothelial surface; and oxygen, which is rapidly consumed within the vessel wall. Since most vascular 
disease (atherosclerosis) occurs in vessels between 3 and 10 mm in diameter, we use a vessel of 5 mm 
diameter to provide estimates of typical Damkholer numbers. 

Adenosine Triphosphate (ATP) 

ATP is degraded at the endothelial surface by enzymes (ectonucleotidases) to form adenosine diphosphate 
(ADP). The Michaelis-Menten kinetics for this reaction have been determined by Gordon et al. [7] using 
cultured porcine aortic endothelial cells: k m = 249 (J.M, V max = 22 nmol/min/10 6 cells. V majL can be con- 
verted to a molar flux by using a typical endothelial cell surface density of 1.2 x 10 5 cells/cm 2 , with the 
result that the pseudo-first order rate constant (Eq. 100.3) iskj = 1.77 X 10~ 4 cm/sec. Assuming a diffusivity 
of 5.0 x 10~ 6 cm 2 /s for ATP [8], and a vessel diameter of 5 mm, we find 

Da r = 17.7 

Albumin and LDL 

These macromolecules are transported across the endothelium by a variety of mechanisms including 
non-specific and receptor-mediated trancytosis, and paracellular transport through normal or "leaky" 
inter-endothelial junctions [9, 10]. In rabbit aortas, Truskeyet al. [11] measured endothelial permeability 
to LDL and observed values on the order of Pe = 1.0 X 10 s cm/sec in uniformly permeable regions, but 
found that permeability increased significantly in punctate regions associated with cells in mitosis to a 
level of Pe = 5 x 10~ 7 cm/sec. Using this range of values for Pe, assuming a diffusivity of 2.5 x 10~ 7 cm 2 /sec 
for LDL, and a vessel diameter of 5 mm, we find 

Da e = 0.02 - 1.0 (LDL) 
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For albumin, Truskey et al. [12] reported values of the order Pe = 4.0 x 10~ 8 cm/sec in the rabbit aorta. 
This presumably corresponded to regions of uniform permeability. They did not report values in punctate 
regions of elevated permeability. More recently, Lever et al. [13] reported Pe values of similar magnitude 
in the thoracic and abdominal aorta as well as the carotid and renal arteries of rabbits. In the ascending 
aorta and pulmonary artery, however, they observed elevated permeability to albumin on the order of 
Pe = 1.5 x 10~ 7 cm/sec. Assuming a diffusivity of 7.3 x 10~ 7 cm 2 /sec for albumin, a vessel diameter of 
5mm, and the range of Pe values described above, we obtain 

Da e = 0.027 - 0.10 (albumin) 

Oxygen 

The first barrier encountered by oxygen after being transported from the blood is the endothelial layer. 
Although arterial endothelial cells consume oxygen [14], the pseudo-first order rate constant for this 
consumption is estimated to be an order of magnitude lower than that of ATP, and it is therefore 
reasonable to neglect the endothelial cell consumption relative to the much more significant consumption 
by the underlying tissue. Liu et al. [15] measured the oxygen permeability of cultured bovine aortic and 
human umbilical vein endothelial cells and obtained values of 1.42 X 10~ 2 cm/sec for bovine cell mono- 
layers and 1.96 x 10~ 2 cm/sec for human cell monolayers. Because the endothelial permeability to oxygen 
is so high, it is fair to neglect the transport resistance of the endothelium and to direct attention to the 
oxygen consumption rate within the tissue. 

To evaluate the Damkholer number based on the tissue consumption rate [Eq. (100.17)], we turn to 
data of Buerk and Goldstick [16] for Q/(KD) measured both in vivo and in vitro in dog, rabbit, and pig 
blood vessels. The values of Q/(KD) reported by Buerk and Goldstick are based on tissue properties for 
KD. To translate these tissue values into blood values, as required in our estimates [Eq. (100.17)], we use 
the relationship (KD) tissue = Ms (KD) watl , r suggested by Paul et al. [17] and assume (KD) blood = (KD) water . 
In the thoracic aorta of dogs, Q/(KD) ranged from 1.29 x 10 5 torr/cm 2 to 5.88 X 10 5 torr/cm 2 in the 
tissue. The thickness (distance to the minimum tissue 2 concentration) of the thoracic aorta was 250 
um and the diameter is estimated to be 0.9 cm [18]. P 02 measured in the blood (P b ) was 90 torr. 
Introducing these values into Eq. (100.17) we find: 

Da w = 10.8 - 49.0 (thoracic aorta) 

In the femoral artery of dogs, Q/(KD) ranged from 35.2 x 10 5 torr/cm 2 to 46.9 x 10 5 torr/cm 2 in the 
tissue. The thickness of the femoral artery was 50 u.m and the estimated diameter is 0.4 cm [18]. P 02 
measured in the blood was about 80 torr. These values lead to the following estimates: 

Da w = 29.3 - 39.1 (femoral artery) 

100.3 Sherwood Numbers in the Circulation 
Straight Vessels 

For smooth, cylindrical tubes (a model of straight blood vessels) with well-mixed entry flow, one can 
invoke the thin concentration boundary layer theory of Leveque [6] to estimate the Sherwood number 
in the entry region of the vessel where the concentration boundary is developing. This leads to 

Sh = 1.08x* _1 ' /3 (constant wall concentration) (100.19a) 

1.30 x*~ l/3 (constant wall flux) (100.19b) 
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TABLE 100.1 Transport Characteristics in a Straight Aorta 



Species 


Sc 


X* 


Sh 


Da 


o 2 


2,900 


4.1 X 10- 5 


31.1 


10.8-49.0 


ATP 


7,000 


1.7 X 10- 5 


41.8 


17.7 


Albumin 


48,000 


2.5 X 10-" 


79.2 


0.027-0.100 


LDL 


140,000 


8.6 X 10- 7 


114 


0.02-1.00 



Note: d = 1 cm, x = 60 cm, Re = 500, V = .035 cm 2 /sec. 



where 



x/d 
Re-Sc 



(100.20) 



is a dimensionless axial distance which accounts for differing rates of concentration boundary layer 
growth due to convection and diffusion. In Eq. (100.20), Re = vd/v is the Reynolds number, Sc = V/D is 
the Schmidt number, and their product is the Peclet number. Equation (100.19) is quite accurate for 
distances from the entrance satisfying x* < .001. Sh continues to drop with increasing axial distance as 
the concentration boundary layer grows, as described by the classical Graetz solution of the analogous 
heat transfer problem [19]. When the concentration boundary layer becomes fully developed, Sh 
approaches its asymptotic minimum value, 

Sh = 3.66 (constant wall concentration) (100.21a) 

Sh = 4.36 (constant wall flux) (100.21b) 

For a straight vessel, Sh cannot drop below these asymptotic values. Equations (100.19) and (100.21) 
also indicate that the wall boundary condition has little effect on the Sherwood number. 

It is instructive to estimate Sh at the end of a straight tube having dimensions and flow rate charac- 
teristics of the human aorta (actually a tapered tube). Table 100.1 compares Sh and Da (for 2 , ATP, 
albumin, and LDL) at the end of a 60-cm long model aorta having a diameter of 1 cm and a flow 
characterized by Re = 500. 

Table 100.1 clearly reveals that for a straight aorta, transport is in the entry or Leveque regime (x* <10~ 3 ). 
For albumin and LDL, Da <§ Sh, and transport is expected to be "wall-limited". For 2 and ATP, Da ~ 
Sh, and the possibility of "fluid phase-limited" transport exists. At the lowest possible rates of wall mass 
transport in a straight vessel (Sh = 3.66 - 4.36), transport is still expected to be "wall-limited" for albumin 
and LDL, whereas it would be "fluid-phase limited" for oxygen and ATP. 

100.4 Non-Uniform Geometries Associated 
with Atherogenesis 

Sudden Expansion 

Flow through a sudden expansion (Fig. 100.2) at sufficiently high Reynolds number induces flow sepa- 
ration at the expansion point followed by reattachment downstream. This is a simple model of physio- 
logical flow separation. The separation zone is associated with low wall shear stress since this quantity 
is identically zero at the separation and reattachment points. 
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FIGURE 100.2 Schematic diagram showing the spatial distribution of the Sherwood number downstream of a 
sudden expansion. The flow separates (S) from the wall at the expansion point and reattaches (R) downstream. The 
Sherwood number is reduced near the separation point (radial velocity away from the wall) and elevated near the 
reattachment point (radial velocity toward the wall). 

An experimental study of oxygen transport in saline and blood for an area expansion ratio of 6.7 and 
Reynolds numbers in the range 160 to 850 [20] displayed the general spatial distribution of Sh displayed 
in Fig. 100.2. The minimum value of Sh was observed near the separation point and the maximum value 
appeared near the reattachment point. The maximum Sh ranged between 500 and several thousand 
depending on the conditions and the minimum value was approximately 50. A numerical study of the 
analogous heat transfer problem by Ma et al. [21] at a lower area expansion ratio (4.42) and Schmidt 
number (Sc = 105) showed the same qualitative trends indicated in Fig. 100.2, but the Sherwood numbers 
were considerably lower (between 4 and 40) due to the lower expansion ratio, lower Schmidt number, 
and different entrance conditions. In both of these studies, Sh did not drop below its fully developed 
tube flow value at any axial location. 

The sudden expansion flow field provides insight into the mechanism controlling the spatial distri- 
bution of the Sherwood number in separated flows. Near the reattachment point, the radial velocity 
component convects solute toward the wall (enhancing transport), whereas near the separation point, 
the radial velocity component convects solute away from the wall (diminishing transport). The net result 
of this radial convective transport superimposed on diffusive transport (toward the wall) is a maximum 
in Sh near the reattachment point and a minimum in Sh near the separation point. 

Stenosis 

Flow through a symmetric stenosis at sufficiently high Reynolds number (Fig. 100.3) will lead to flow 
separation at a point downstream of the throat and reattachment further downstream. Again, because 
the wall shear stress is identically zero at the separation and reattachment points, the separation zone is 
a region of low wall shear stress. Conversely, converging flow upstream of the stenosis induces elevated 
wall shear stress. 

Schneiderman et al. [22] performed numerical simulations of steady flow and transport in an axisym- 
metric, 89% area restriction stenosis with a sinusoidal axial wall contour at various Reynolds numbers 
for a Schmidt number typical of oxygen transport. Relative to a uniform tube, Sh was always elevated in 
the converging flow region upstream of the stenosis, and when Re was low enough to suppress flow 
separation, transport remained elevated downstream of the stenosis as well. At high Re (Fig. 100.3), flow 
separation produced a region of diminished transport (relative to a uniform tube) while reattachment 
induced elevated transport. The lowest value of Sh in the diminished transport regime was approximately 
10. This occurred at Re = 16, a flow state for which separation was incipient. At high Re, the minimum 
Sh was elevated above 10, and the region of diminished transport was reduced in axial extent. 
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FIGURE 100.3 Schematic diagram showing the spatial distribution of the Sherwood number around a symmetric 
stenosis. The flow converges upstream of the stenosis where the Sherwood number is elevated (radial velocity toward 
the wall). The flow separates (S) from the wall just downstream of the throat, if the Reynolds number is high enough, 
and reattaches (R) downstream. The Sherwood number is reduced near the separation point (radial velocity away 
from the wall) and elevated near the reattachment point (radial velocity toward the wall). 

More recent numerical simulations by Rappitsch and Perktold [23, 24] for a 75% stenosis in steady 
flow (Re = 448, Sc = 2,000 (oxygen)) and sinusoidal flow (Re = 300, Sc = 46,000 (albumin)) show the 
same basic trends depicted in Fig. 100.3. Again, Sh was reduced in a narrow region around the separation 
point, but did not drop below ~10. 

Moore and Ethier [25] also simulated oxygen transport in a symmetric stenosis, but they accounted 
for the binding of oxygen to hemoglobin and solved for both the oxygen and oxyhemoglobin concen- 
trations. They determined axial Sh profiles throughout the stenosis, which showed the same basic 
tendencies indicated in Fig. 100.3. However, because hemoglobin has a much higher molecular weight 
and Schmidt number than oxygen (about 100 times higher Sc), the local values of Sh were higher than 
computed on the basis of free oxygen transport alone. This is expected since simple transport consider- 
ations (Leveque solution) suggest Sh a Sc 1 ' 3 . 

The Sh profile for the stenosis (Fig. 100.3) reflects the same underlying mechanisms that were operative 
in the sudden expansion (Fig. 100.2): radial flow directed toward the wall enhances transport near the 
reattachment point, while radial flow directed away from the wall diminishes transport near the separation 
point. 

Bifurcation 

A few numerical studies of flow and transport in the carotid artery bifurcation have been reported recently 
as summarized in Fig. 100.4. The carotid artery bifurcation is a major site for the localization of athero- 
sclerosis, predominantly on the outer wall (away from the flow divider) in the flow separation zone which 
is a region of low and oscillating wall shear stress [26]. Perktold et al. [27] simulated 2 transport in a 
realistic pulsatile flow through an anatomically realistic three-dimensional carotid bifurcation geometry 
using a constant wall concentration boundary condition. Ma et al. [28] simulated oxygen transport in 
steady flow through a realistic, three-dimensional, carotid bifurcation with a constant wall concentration 
boundary condition. 

As in the sudden expansion and stenosis geometries, the bifurcation geometry can induce flow sepa- 
ration on the outer wall with reattachment downstream. Again there is a region of attenuated transport 
near the separation point and amplified transport near the reattachment point. Perktold et al. [27] 
predicted minimum Sherwood numbers close to zero in the flow separation zone. Ma et al. [28] predicted 
the same general spatial distribution, but the minimum Sherwood number was ~25. Differences in the 
minimum Sherwood number may be due to differing entry lengths upstream of the bifurcation as well 
as differences in flow pulsatility 
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FIGURE 100.4 Schematic diagram showing the spatial distribution of the Sherwood number along the outer wall 
of a bifurcation. The flow separates (S) if the Reynolds number is high enough and there is an increase in cross- 
sectional area through the bifurcation, and reattaches (R) downstream. The Sherwood number is reduced near the 
separation point (radial velocity away from the wall) and elevated near the reattachment point (radial velocity toward 
the wall) . 



Curvature 

Localization of atherosclerosis has also been associated with arterial curvature [2] . For example, the inner 
curvature of proximal coronary arteries as they bend over the curved surface of the heart has been 
associated with plaque localization [29]. Qiu [30] carried out three-dimensional, unsteady flow compu- 
tations in an elastic (moving wall) model of a curved coronary artery for 2 transport with a constant 
wall concentration boundary condition. He obtained the results shown in Fig. 100.5. Because Qiu 
assumed uniform axial velocity and concentration profiles (with no secondary flow) at the curved tube 
entrance (0 degrees), there is an entrance region (-25 degrees) where Sh essentially follows a Leveque 
boundary layer development. Eventually (-50 degrees), the secondary flow effects become manifest, and 
marked differences in transport rates between the inside wall (low transport: Sh -2) and the outside wall 
(high transport: Sh -55) develop. The regions of high and low transport in the curved vessel geometry 
cannot be associated with axial flow separation (as in the expansion, stenosis, and bifurcation) because 
flow separation does not occur at the modest curvature levels in the coronary artery simulation. 

The secondary flow (in the plane perpendicular to the axial flow — Fig. 100.5) determines, in large 
measure, the differences in transport rates between the inside and outside walls. The radial velocity is 
directed toward the wall at the outside, leading to enhancement of transport by a convective mechanism. 
At the inside of the curvature, the radial velocity is directed away from the wall, and transport is impeded 
by the convective mechanism. This secondary flow mechanism produces transport rates that are 25 times 
higher on the outside wall than the inside wall. This is to be contrasted with wall shear stress values 
which, for the coronary artery condition, are less than 2 times higher on the outside wall than on the 
inside wall [31]. Earlier studies of fully developed, steady flow and transport in curved tubes [32] are 
consistent with the above observations. 



100.5 Discussion 



The considerations of mass transport in the fluid (blood) phase and consumption by the vessel wall 
described in the preceding sections indicate that only highly reactive species such as 2 and ATP can be 
"transport-limited". Larger molecules such as albumin and LDL, which are not rapidly transformed 
within the vessel wall, are not likely to be transport-limited. 
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FIGURE 100.5 Schematic diagram showing the spatial distribution of the Sherwood number along the inner 
(I — toward the center of curvature) and outer (O — away from the center of curvature) walls of a curved vessel. In 
the entry region, before the secondary flow has developed, the Sherwood number follows a Leveque distribution. As 
the secondary flow evolves, the Sherwood number becomes elevated on the outer wall where the radial velocity of 
the secondary flow is toward the wall, and diminished on the inner wall where radial velocity of the secondary flow 
is away from the wall. 

For 2 , ATP, and other molecules characterized by large values of the Damkholer number, transport 
limitation will occur in regions of the circulation where the Sherwood number is low (Sh < Da). Localized 
regions of low Sh arise in non-uniform geometries around flow separation points (not reattachment 
points) where radial flow velocities are directed away from the wall, and in secondary flow regions where 
the secondary velocity is directed away from the wall (inside wall of a curved vessel). 

Considerable supporting experimental evidence for the above conclusions is available, and a few 
representative studies will be mentioned here. Santilli et al. [33] measured transarterial wall oxygen 
tension gradients at the dog carotid bifurcation. Oxygen tensions at the carotid sinus (outer wall in 
Fig. 100.4) were significantly decreased in the inner 40% of the artery wall compared to an upstream 
control location. Oxygen tensions at the flow divider (inner wall in Fig. 100.4) were increased significantly 
throughout the artery wall compared with control locations. These observations are consistent with fluid 
phase-limited oxygen transport at the outer wall of the carotid bifurcation. 

Dull et al. [8] measured the response of intracellular free calcium in cultured bovine aortic endothelial 
cells (BAECs) after step changes in flow rate, using media containing either ATP or a non-reactive analog 
of ATP. In the presence of ATP, which is rapidly degraded by enzymes on the endothelial cell surface, 
step changes in flow rate induced rapid changes in intracellular calcium which were not apparent when 
the inactive ATP analog was used in place of ATP. The interpretation of these experiments was that 
increases in flow increased the rate of mass transport of ATP to the cell surface and exceeded the capacity 
of the surface enzymes to degrade ATP. This allowed ATP to reach surface receptors and stimulate 
intracellular calcium. This study thus provided evidence that ATP transport to the endothelium could 
be fluid phase (transport) -limited. 
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Caro and Nerem [5] measured the uptake of labeled cholesterol bound to serum lipoprotein in excised 
dog arteries under well-defined conditions in which the Leveque solution [Eq. (100.19)] described the 
fluid phase mass transport process. If the transport of lipoprotein to the surface had been controlled by 
the fluid phase, they should have observed a decrease in uptake with distance from the vessel entrance 
following a x~ 1/3 law [Eq. (100.19)]. They did not, however, observe any significant spatial variation of 
uptake of lipoprotein over the length of the blood vessel. This observation is consistent with "wall-limited" 
transport of lipoprotein as we have suggested in the preceding sections. 

100.6 Possible Role of Blood Phase Transport 
in Atherogenesis 

Accumulation of lipid in the arterial intima is a hallmark of atherosclerosis, a disease that tends to be 
localized on the outer walls of arterial bifurcations and the inner walls of arterial curvatures [2]. The 
outer walls of bifurcations and the inner walls of curvatures may have localized regions characterized by 
relatively low blood phase transport rates (low Sh in Figs. 100.4 and 100.5). But, how can low transport 
rates lead to high accumulation of lipid in the wall? If, as we have argued in this review, LDL transport 
is really not affected by the blood phase fluid mechanics, but is limited by the endothelium, how can 
local fluid mechanics influence intimal lipid accumulation? There are several possible scenarios for a 
fluid mechanical influence which are reviewed briefly. 

Direct Mechanical Effects on Endothelial Cells 

The outerwalls of bifurcations and the inner walls of curved vessels are characterized by low mean wall 
shear stress, and significant temporal oscillations in wall shear stress direction (oscillatory shear stress) 
[2]. Endothelial cells in this low, oscillatory shear environment tend to assume a polyhedral, cobblestone 
enface morphology, whereas endothelial cells in high shear regions tend to be elongated in the direction 
of flow. It has been suggested that these altered morphologies, which represent chronic adaptive responses 
to altered fluid mechanical environments, are characterized by distinct macromolecular permeability 
characteristics [2, 10] as direct responses of the endothelial layer to altered mechanical environments. In 
addition, a number of studies have shown that fluid shear stress on the endothelial surface can have an 
acute influence on endothelial transport properties both in vitro [34, 35] and in vivo [36, 37]. 

Hypoxic Effect on Endothelial Cells 

Hypoxia (low oxygen tension), which can be induced by a blood phase transport limitation, can lead to 
a breakdown of the endothelial transport barrier either by a direct effect on the endothelial layer or by 
an indirect mechanism in which hypoxia up-regulates the production of hyperpermeabilizing cytokines 
from other cells in the arterial wall. A number of recent studies have shown that hypoxia increases macro- 
molecular transport across endothelial monolayers in culture due to metabolic stress [38-40]. These 
studies describe direct effects on the endothelial layer since other cells present in the vessel wall were not 
present in the cell culture systems. 

Hypoxia Induces VEGF 

Many cell lines express increased amounts of vascular endothelial growth factor (VEGF) when subjected 
to hypoxic conditions as do normal tissues exposed to hypoxia, functional anemia, or localized ischemia 
[41]. VEGF is a multifunctional cytokine that acts as an important regulator of angiogenesis and as a 
potent vascular permeabilizing agent [41, 42]. VEGF is believed to play an important role in the hyper- 
permeability of microvessels in tumors, the leakage of proteins in diabetic retinopathy, and other vascular 
pathologies [43]. Thus, a plausible scenario for the increase in lipid uptake in regions of poor blood 
phase mass transport is the following: Hypoxia up-regulates the production of VEGF by cells within the 
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vascular wall and the VEGF in turn permeabilizes the endothelium, allowing increased transport of lipid 
into the wall. This mechanism can be depicted schematically as follows: 

2 4^VEGFt^Pet 

In support of this view, several recent studies have shown that VEGF is enriched in human athero- 
sclerotic lesions [44, 45]. Smooth muscle cells and macrophages appear to be the predominant sources 
of VEGF in such lesions. Thus, a mechanism in which hypoxia induces VEGF and hyperpermeability is 
plausible, but at the present time it must only be considered a hypothesis relating fluid phase transport 
limitation and enhanced macromolecular permeability. 
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T 



HE TERM BIOTECHNOLOGY HAS UNDERGONE significant change over the past 50 years or 
so. During the period prior to the eighties, biotechnology referred primarily to the use of micro- 
organisms for large-scale industrial processes such as antibiotic production. Since the 1980s, with 
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the advent of recombinant DNA technology, monoclonal antibody technology, and new technologies for 
studying and handling cells and tissues, the field of biotechnology has undergone a tremendous resurgence 
in a wide range of applications pertinent to industry, medicine, and science in general. It is some of these 
new ideas, concepts, and technologies that will be covered in this section. We have assembled a set of 
chapters that covers most topics in biotechnology that might interest the practicing biomedical engineer. 
Absent by design is coverage of agricultural, bioprocess, and environmental biotechnology, which is 
beyond the scope of this Handbook. 

Chapter 101 deals with our present ability to manipulate genetic material. This capability, which 
provides the practitioner with the potential to generate new proteins with improved biochemical and 
physicochemical properties, has led to the formation of the field of protein engineering. Chapter 102 
discusses the field of monoclonal antibody production in terms of its basic technology, diverse applica- 
tions, and ways that the field of recombinant DNA technology is currently "reshaping" some of the earlier 
constructs. Chapters 103 and 104 describe applications of nucleic acid chemistry. The burgeoning field 
of antisense technology is introduced with emphasis on basic techniques and potential applications to 
AIDS and cancer, and Chapter 104 is dedicated toward identifying the computational, chemical, and 
machine tools which are being developed and refined for genome analysis. Applied virology is the implied 
heading for Chapters 105 and 106, in which viral vaccines and viral-mediated gene therapy are the main 
foci. 

Finally, Chapters 107 and 108 focus on important aspects of cell structure and function. These topics 
share a common approach toward quantitative analysis of cell behavior in order to develop the principles 
for cell growth and function. By viewing the world of biomedical biotechnology through our paradigm 
of proteins and nucleic acids to viruses to cells, today's biomedical engineer will hopefully be prepared 
to meet the challenge of participating in the greater field of biotechnology as an educated observer at 
the very least. 
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Production of recombinant adeno-associated virus (AAV), which is used in treating cystic fibrosis patients. 
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101.1 Site-Directed Mutagenesis 

101.2 Solvent Engineering 

101.3 Conclusions 



Enzymes have numerous applications in both research and industry. The conformation of proteins must 
be maintained in order for them to function at optimal activity. Protein stability is dependent on 
maintaining a balance of forces that include hydrophobic interactions, hydrogen bonding, and electro- 
static interactions. Most proteins are denatured, i.e., lose their active conformation in high-temperature 
environments (exceptions to this are found among the enzymes from thermophilic microorganisms). 
Therefore, understanding and maintaining enzyme stability are critical if enzymes are to be widely used 
in medicine and industry. Protein engineering [1] is used to construct and analyze modified proteins 
using molecular biologic, genetic engineering, biochemical, and traditional chemical methods 
(Table 101.1). The generation of proteins with improved activity and stability is now feasible. Recent 
developments in molecular biology also have enabled a rapid development of the technologies associated 
with protein engineering (Table 101.2). 

101.1 Site-Direction Mutagenesis 

Since site-directed mutagenesis was described by Hutchinson et al. [2] in 1978, it has become a powerful 
tool to study the molecular structure and function of proteins. The purpose of site-directed mutagenesis 
is to alter a recombinant protein by introducing, replacing, or deleting a specific amino acid. The 
technique enables a desired modification to be achieved with exquisite precision [3]. Site-directed 
mutagenesis has been used to change the activity and stability of enzymes, as well as substrate specificity 
and affinity. Indeed, much of the biologic detergent enzyme sold in the United States (~$200 million per 
year) is a protein-engineered variant of the native enzyme. 

The basic idea behind site-directed mutagenesis is illustrated in Fig. 101.1. The first step of site-directed 
mutagenesis involves cloning the gene for the protein to be studied into a vector. This nontrivial exercise 
is discussed elsewhere in this Handbook. Next, an oligonucleotide is designed and synthesized. The 
oligonucleotide will have a centrally located desired mutation (usually a mismatch) that is flanked by 
sequences of DNA which are complementary to a specific region of interest. Thus the oligonucleotide is 
designed to bind to a single region of the target gene. The mutation can then be introduced into the 
gene by hydridizing the oligonucleotide to the single-stranded template. Single-stranded templates of a 
target gene may be obtained by either cloning the gene to a single-stranded vector, such as bacteriophage 
M13 [4], or by using phagemids (a chimeric plasmid containing a filamentous bacteriophage replication 
origin that directs synthesis of single-stranded DNA with a helper bacteriophage) [5]. Alternatively, 
single-stranded template may be generated by digesting double-stranded DNA with exonuclease III 
following nicking of the target DNA with DNase or a restriction endonuclease [6]. 
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TABLE 101.1 Selected Methods for Modification of Proteins 

Chemical/biochemical methods Side chain/amino acid residue modification 

Immobilization 
Biologic methods Site-directed mutagenesis 

Random mutagenesis 

The second strand is synthesized with DNA polymerase using the oligonucleotide as a primer, and 
the DNA can be circularized with DNA ligase. The vector that carries the newly synthesized DNA (in 
which one strand is mutated) can be introduced into a bacterial host, where, because DNA duplicates 
in a semiconservative mode, half the newly synthesized cells containing the DNA will theoretically contain 
the mutation. In reality, premature DNA polymerization, DNA mismatch repair, and strand displacement 
synthesis result in much lower yield of mutants. Different strategies have been developed to obtain a 
higher level of mutation efficiency in order to minimize the screening of mutant. A second mutagenic 
oligonucleotide containing an active antibiotic resistance gene can be introduced at the same time as the 
site-directed oligonucleotide. Therefore, the wild-type plasmid will be eliminated in an antibiotic-selective 
medium [7] (Fig. 101.2). Alternatively, introducing a second mutagenic oligonucleotide can result in the 
elimination of a unique restriction endonuclease in the mutant strand [8]. Therefore, wild-type plasmid 
can be linearized with this unique restriction endonuclease, while the circular mutant plasmid can be 
transformed into bacteria host. Another alternative method is to perform second-strand extension in the 
presence of 5-methyl-dCTP, resulting in resistance of number of restriction enzymes, including Hpall, 
Mspl, and Sau3A I, in the mutant strand only [9]. The template DNA can then be nicked with these 
enzymes, followed by digestion with exonuclease III, to increase mutagenesis efficiency. 

The recent development of the polymerase chain reaction provides a new approach to site-direct 
mutagenesis. In 1989, Ho and colleagues [10] developed a method named overlap extension (Fig. 101.3), 
in which four oligonucleotides are used as primers. Two of the primers containing the mutant are 
complementary to each other. The two other primers are complementary to the opposite strand of the 
ends of the cloned genes. Polymerase chain reactions are performed three times. The first two polymerase 
chain reactions are carried out using one end primer and one mutant primer in each reaction. The 
products contain one double-stranded DNA from one end to the mutation point and the other double- 
stranded DNA from another end to the mutation point. In other words, the two DNA products are 
overlapped in the mutated region. The polymerase chain reaction products are purified and used as 
templates for the third polymerase chain reaction with two end primers. This generates the whole length 
of DNA with the desired mutation. The advantage of this method is that it can be done quickly with 
nearly 100% efficiency. 

After screening and selection, a mutation is generally confirmed by DNA sequencing. The mutant can 
then be subcloned into an expression vector to test the effect of mutation on the activity of the enzyme. 

TABLE 101.2 Potentially Useful Modifications to Proteins 

Stability Increased thermostability 

Increased stability at extremes of pH 

Increased stability in organic solvents 

Resistance to oxidative inactivation 

Resistance to proteolysis 
Kinetics Increased maximum velocity 

Altered affinity for substrate 

Altered substrate specificity 

Resistance to substrate/product inhibition 
Biology Altered spectrum of activity 

Altered substrate specificity 



Source: Modified from Primrose SB. 1991. In Molecular Bio- 
technology, 2d ed. New York, Blackwell Scientific. 
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FIGURE 101.1 Overview of site-directed mutagenesis. See text for details. 

Over the last decade, site-directed mutagenesis has become a somewhat trivial technical process, and the 
most challenging segment of a protein engineering research project is unquestionably the process of 
determining why a mutant protein has altered properties. The discussion of this particular enterprise lies 
beyond the scope of this brief review. 

101.2 Solvent Engineering 

An enzyme-catalyzed reaction can be simplified to its most basic components by considering it as the 
transfer of a substrate molecule from solvent to the surface of an enzyme molecule. The exchange of 
substrate-solvent and enzyme-solvent interactions for enzyme-substrate interactions then enables the 
chemistry of catalysis to take place. Protein engineering, as described above, is the process of changing 
the enzyme in a predictable and precise manner to effect a change on the catalytic process. Since the 
enzyme is only one side of the balance, however, any changes in the rest of the equation also will alter 
the catalytic process. Until the late 1980s, for example, substrate specificity could be altered by either 
protein engineering of the enzyme or by changing the substrate. Solvent engineering is now also emerging 
as a powerful tool in rational control of enzyme activity. Using solvents other than water has successfully 
led to enzymes with increased thermostability, activity against some substrates, pH dependence, and 
substrate and enantiospecificity. 

The simplicity of the solvent-engineering approach is clear. If a protein is not inactivated by a solvent 
change, then its activity will be dependent on the solvent in which the enzyme and substrate are placed. 
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FIGURE 101.2 Selective antibiotic genes as strategy for improvement of site-directed mutagenesis. See text for 
details. 

This strategy has been discussed in detail in recent reviews [11], and we will only summarize the most 
pertinent information here. 

Enzymes that have been freeze-dried and then suspended in anhydrous organic solvents, in which 
proteins are not soluble, retain their activity and specificity [12]. The enzyme powders retain approxi- 
mately a monolayer of water per molecule of enzyme during the freeze-drying process. As long as this 
monolayer of water remains associated with the enzyme in an organic environment, the structure of the 
enzyme is not disrupted [13], and hence enzyme activity in essentially anhydrous organic solvents can 
be observed. 

The physical properties of a solvent in which an enzyme is placed can influence the level of activity 
and specificity in a given reaction. For instance, alcohol dehydrogenase can catalyze oxidation-reduction 
reactions equally well in buffer and heptane, but the level of activity in heptane is sharply dependent on 
the thermodynamic activity of water in the system. In general, activity can be increased by increasing 
the hydrophobicity of the solvent used. Interestingly, however, the specificity (both stereo- and substrate) 
of an enzyme or antibody dispersed in an organic solvent is generally greater in more hydrophilic solvents 
[14]. Water is involved as a substrate in many of the chemical modification processes that lead to 
irreversible inaction of proteins. Not surprisingly, therefore, proteins in anhydrous environments are 
stable at temperatures exceeding 100°C [15]. 

Considerable attention is now being given to developing the predictability of the solvent-engineering 
approach. Elucidating the structure-function-environment relationships that govern the activity and 
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FIGURE 101.3 Site-directed mutagenesis by overlap extension. See text for details. 

specificity of an enzyme is a crucial step for learning how to apply the power of solvent engineering. In 
recent years, supercritical fluids (materials at temperatures and pressures above their critical point) have 
been used as a dispersant for enzyme-catalyzed reactions. In addition to being excellent process solvents, 
the physical properties of supercritical fluids are pressure-dependent. Thus it may be possible to detect 
which physical properties of a nonaqueous medium have a role in determining a given enzyme function 
[16]. 

101.3 Conclusions 

The goals of protein and solvent engineering are similar. Ideally, one should be able to predictably alter 
a given property of an enzyme by either changing the enzyme or its environment. In reality, both 
approaches are still somewhat unpredictable. Both protein engineering and solvent engineering have 
been used successfully to alter the protein properties that appear in Table 101.1. Further research is 
needed, however, before the results of any given attempt at such biologic engineering can be predicted. 
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Antibodies are a class of topographically homologous multidomain glycoproteins produced by the 
immune system that display a remarkably diverse range of binding specificities. The most important 
aspects of the immune system are that it is diverse and driven to produce antibodies of the highest possible 
antigen affinity. The primary repertoire of antibodies consists of about 10 9 different specificities, each of 
which can be produced by an encounter with the appropriate antigen. This diversity is known to be 
produced by a series of genetic events each of which can play a role in determining the final function of 
the antibody molecule. After the initial exposure to the antigen, additional diversity occurs by a process 
of somatic mutation so that, for any selected antigen, about 10 4 new binding specificities are generated. 
Thus the immunologic repertoire is the most diverse system of binding proteins in biology. Antibodies 
also display remarkable binding specificity. For example, it has been shown that antibodies are able to 
distinguish between ortho-, meta-, and para-forms of the same haptenic group [Landsteiner, 1945]. This 
exquisite specificity and diversity make antibodies ideal candidates for diagnostic and therapeutic agents. 
Originally, the source of antibodies was antisera, which by their nature are limited in quantity and 
heterogeneous in quality. Antibodies derived from such sera are termed conventional antibodies {polyclonal 
antibodies). Polyclonal antibody production requires methods for the introduction of immunogen into 
animals, withdrawal of blood for testing the antibody levels, and finally exsanguination for collection of 
immune sera. These apparently simple technical requirements are complicated by the necessity of choos- 
ing a suitable species and immunization protocol that will produce a highly immune animal in a short 
time. Choice of animal is determined by animal house facilities available, amount of antiserum required 
(a mouse will afford only 1.0 to 1.5 ml of blood; a goat can provide several liters), and amount of 
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immunogen available (mice will usually respond very well to 50 p.g or less of antigen; goats may require 
several milligrams). Another consideration is the phylogenic relationship between the animal from which 
the immunogen is derived and that used for antibody production. In most cases, it is advisable to 
immunize a species phylogenetically unrelated to the immunogen donor, and for highly conserved 
mammalian proteins, nonmammals (e.g., chickens) should be used for antibody production. The poly- 
clonal antibody elicited by an antigen facilitates the localization, phagocytosis, and complement-mediated 
lysis of that antigen; thus the usual polyclonal immune response has clear advantages in vivo. Unfortu- 
nately, the antibody heterogeneity that increases immune protection in vivo often reduces the efficacy of 
an antiserum for various in vitro uses. Conventional heterogeneous antisera vary from animal to animal 
and contain undesirable nonspecific or cross-reacting antibodies. Removal of unwanted specificities from 
a polyclonal antibody preparation is a time-consuming task, involving repeated adsorption techniques, 
which often results in the loss of much of the desired antibody and seldom is very effective in reducing 
the heterogeneity of an antiserum. 

After the development of hybridoma technology [Kohler and Milstein, 1975], a potentially unlimited 
quantity of homogeneous antibodies with precisely defined specificities and affinities {monoclonal anti- 
bodies) became available, and this resulted in a step change in the utility of antibodies. Monoclonal 
antibodies (mAbs) have gained increasing importance as reagents in diagnostic and therapeutic medicine, 
in the identification and determination of antigen molecules, in biocatalysis (catalytic antibodies), and 
in affinity purification and labeling of antigens and cells. 

102.1 Structure and Function of Antibodies 



Antibody molecules are essentially required to carry out two principal roles in immune defense. First, 
they recognize and bind nonself or foreign material (antigen binding). In molecular terms, this generally 
means binding to structures on the surface of the foreign material (antigenic determinants) that differ 
from those on the host. Second, they trigger the elimination of foreign material (biologic effector 
functions). In molecular terms, this involves binding of effector molecules (such as complement) to the 
antibody-antigen complex to trigger elimination mechanisms such as the complement system and phago- 
cytosis by macrophages and neutrophils, etc. 

In humans and other animals, five major immunoglobulin classes have been identified. The five classes 
include immunoglobulins G (IgG), A (IgA), M (IgM), D (IgD), and E (IgE). With the exception of IgA 
(dimer) and IgM (pentamer), all other antibody classes are monomeric. The monomeric antibody 
molecule consists of a basic four-chain structure, as shown in Fig. 102.1. There are two distinct types of 
chains: the light (L) and the heavy chains (H). The 
chains are held together by disulfide bonds. The 
light and heavy chains are held together by inter- 
chain disulfides, and the two heavy chains are held 
together by numerous disulfides in the hinge 
region of the heavy chain. The light chains have a 
molecular weight of about 25,000 Da, while the 
heavy chains have a molecular weight of 50,000 to 
77,000 Da depending on the isotope. The L chains 
can be divided into two subclasses, kappa (k) and 
lambda (k), on the basis of their structures and 
amino acid sequences. In humans, about 65% of 
the antibody molecules have K chains, whereas in 
rodents, they constitute over 95% of all antibody 
molecules. The light chain consists of two struc- 
tural domains: The carboxy-terminal half of the 
chain is constant except for certain allotypic and 




light chain 
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isotype variations and is called the C L (constant: light chain) region, whereas the amino-terminal half of 
the chain shows much sequence variability and is known as the V L (variable: light chain) region. The H 
chains are unique for each immunoglobulin class and are designated by the Greek letter corresponding 
to the capital letter designation of the immunoglobulin class (oc for the H chains of IgA, y for the H 
chains of IgG). IgM and IgA have a third chain component, the J chain, which joins the monomeric 
units. The heavy chain usually consists of four domains: The amino-terminal region (approximately 
110 amino acid residues) shows high sequence variability and is called the V H (variable: heavy chain) 
region, whereas there are three domains called C H1 , C H2 , and C H3 in the constant part of the chain. The 
C H2 domain is glycosylated, and it has been shown that glycosylation is important in some of the effector 
functions of antibody molecules. The extent of glycosylation varies with the antibody class and, to some 
extent, the method of its production. In some antibodies (IgE and IgM), there is an additional domain 
in the constant part of the heavy chain called the C H4 region. The hinge region of the heavy chain (found 
between the C H1 and C H2 domains) contains a large number of hydrophilic and proline residues and is 
responsible for the segmental flexibility of the antibody molecule. 

All binding interactions to the antigen occur within the variable domains (V H and V L ). Each variable 
domain consists of three regions of relatively greater variability, which have been termed hypervariable 
(HV) or complementarity-determining regions (CDR) because they are the regions that determine com- 
plementarity to the particular antigen. Each CDR is relatively short, consisting of from 6 to 15 residues. 
In general, the topography of the six combined CDRs (three from each variable domain) produces a 
structure that is designed to accommodate the particular antigen. The CDRs on the light chain include 
residues 24 to 34 (LI), 50 to 56 (L2), and 89 to 73 (L3), whereas those on the heavy chain include residues 
31 to 35 (HI), 50 to 65 (H2), and 95 to 102 (H3) in the Kabat numbering system. The regions between 
the hypervariable regions are called the framework regions (FR) because they are more conserved and 
play an important structural role. The effector functions are, on the other hand, mediated via the two 
C-terminal constant domains, namely, the C H2 and C H3 . 

Each variable region is composed of several different genetic elements that are separate on the germ 
line chromosome but rearrange in the B cell to produce a variable-region exon. The light chain is 
composed of two genetic elements, a variable (V) gene and a joining (J) gene. The V K gene generally 
encodes residues 1 to 95 in a kappa light chain, and the J K gene encodes residues 96 to 107, which is the 
carboxyl end of the variable region. Thus the kappa V gene encodes all the first and second hypervariable 
regions (LI and L2) and a major portion of the third hypervariable region (L3). In the human system, 
there are estimated to be about 100 different V K genes and about 5 functional J K genes, and the potential 
diversity is increased by the imprecision of the VJ joining. The heavy-chain variable region is similarly 
produced by the splicing together of genetic elements that are distant from each other in the germ line. 
The mature V H region is produced by the splicing of a variable gene segment (V) to a diversity segment (D) 
and to a joining (J) segment. There are about 300 V H genes, about 5 D H genes, and 9 functional J H genes 
[Tizard, 1992]. With respect to the CDRs of the heavy chain, HI and H2 are encoded by the V H gene, 
while the H3 region is formed by the joining of the V, D, and J genes. Additional diversity is generated 
by recombinational inaccuracies, light- and heavy-chain recombination, and N-region additions. 

The three-dimensional structure of several monoclonal antibodies has been determined via x-ray 
crystallography and has been reviewed extensively [Davies et al., 1990] . Each domain consists of a sandwich 
of (3 sheets with a layer of four and three strands making up the constant domains and a layer of four and 
five strands making up the variable domains. The CDRs are highly solvent exposed and are the loops at 
the ends of the variable region (3 strands that interact directly with the antigen. The superimposition of 
several V L and V H structures from available x-ray crystallographic coordinates using the conserved frame- 
work residues shows that the framework residues of the antibody variable domains are spatially closely 
superimposable. The CDRs varied in shape and length, as well as in sequence. Similarities in the structures 
of the CDRs from one antibody to another suggest that they are held in a fixed relationship in space, at 
least within a domain. Also, the N and C terminal of the CDRs were rigidly conserved by the framework 
residues. Among the CDRs, LI, L2, H2, and H3 show the most conformational variability. 



e 2000 by CRC Press LLC 



102.2 Monoclonal Antibody Cloning Techniques 

The original, and still dominant method for cloning murine monoclonal antibodies is through the fusion 
of antibody-producing spleen B cells and a myeloma line [Kohler and Milstein, 1975]. More recently, 
"second generation" monoclonal antibodies have been cloned without resorting to hybridoma technology 
via the expression of combinatorial libraries of antibody genes isolated directly from either immunized 
or naive murine spleens or human peripheral blood lymphocytes. In addition, display of functional 
antigen-binding antibody fragments (Fab, sFv, Fv) on the surface of filamentous bacteriophage has further 
facilitated the screening and isolation of engineered antibody fragments directly from the immunologic 
repertoire. Repertoire cloning and phage display technology are now readily available in the form of 
commercial kits [e.g., ImmunoZap and SurfZap Systems from Stratacyte, Recombinant Phage Antibody 
System (RPAS) from Pharmacia] with complete reagents and protocols. 

Hybridoma Technology 

In 1975, Kohler and Milstein [1975] showed that mouse myeloma cells could be fused to B lymphocytes 
from immunized mice resulting in continuously growing, specific monoclonal antibody-secreting somatic 
cell hybrids, or hybridoma cells. In the fused hybridoma, the B cell contributes the capacity to produce 
specific antibody, and the myeloma cell confers longevity in culture and ability to form tumors in animals. 

The number of fusions obtained via hybridoma technology is small, and hence there is a need to select 
for fused cells. The selection protocol takes advantage of the fact that in normal cells there are two 
biosynthetic pathways used by cells to produce nucleotides and nucleic acids. The de novo pathway can 
be blocked using aminopterin, and the salvage pathway requires the presence of functional hypoxanthine 
guanine phosphoribosyl transferase (HGPRT). Thus, by choosing a myeloma cell line that is deficient in 
HGPRT as the "fusion partner," Kohler and Milstein devised an appropriate selection protocol. The hybrid 
cells are selected by using the HAT (hypoxanthine, aminopterin, and thymidine) medium in which only 
myeloma cells that have fused to spleen cells are able to survive (since unfused myeloma cells are 
HGPRT-). There is no need to select for unfused spleen cells because these die out in culture. 

Briefly, experimental animals (mice or rats) are immunized by injecting them with soluble antigen or 
cells and an immunoadjuvant. Three of four weeks later, the animals receive a booster injection of the 
antigen without the adjuvant. Three to five days after this second injection, the spleens of those animals 
which produce the highest antibody titer to the antigen are excised. The spleen cells are then mixed with 
an appropriate "fusion partner," which is usually a nonsecreting hybridoma or myeloma cell (e.g., SP2/0 
with 8-azaguanine selection) that has lost its ability to produce hypoxanthine guanine phosphoribosyl 
transferase (HGPRT-). The cell suspension is then briefly exposed to a solution of polyethylene glycol 
(PEG) to induce fusion by reversibly disrupting the cell membranes, and the hybrid cells are selected for 
in HAT media. In general, mouse myelomas should yield hybridization frequencies of about 1 to 
100 clones per 10 7 lymphocytes. After between 4 days and 3 weeks, hybridoma cells are visible, and culture 
supernatants are screened for specific antibody secretion by a number of techniques (e.g., radioimmu- 
noassay, ELISA, immunoblotting). Those cultures wells which are positive for antibody production are 
then expanded and subcloned by limiting dilution or in soft agar to ensure that they are derived from a 
single progenitor cell and to ensure stability of antibody production. Reversion of hybridomas to non- 
secreting forms can occur due to loss or rearrangement of chromosomes. The subclones are, in turn, 
screened for antibody secretion, and selected clones are expanded for antibody production in vitro using 
culture flasks or bioreactors (yield about 10 to 100 p.m/liter) or in vivo as ascites in mice (1 to 10 mg/ml). 

The most commonly used protocol is the polyethylene glycol (PEG) fusion technique, which, even 
under the most efficient conditions, results in the fusion of less than 0.5% of the spleen cells, with only 
about 1 in 10 5 cells forming viable hybrids. Several methods have been developed to enhance conventional 
hybrid formation that have led to only incremental improvements in the efficiency of the fusion process 
(see Neil and Urnovitz [1988] for review). Methods that enhance conventional hybridoma formation 
include pretreatment of myeloma cells with colcemid and/or in vitro antigen stimulation of the spleen 
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cells prior to fusion, addition of DMSO or phytohemagglutinin (PHA) to PEG during fusion, and addition 
of insulin, growth factors, human endothelial culture supernatants (HECS), etc. to the growth medium 
after fusion. In addition, improvements in immunization protocols such as suppression of dominant 
responses, in vitro or intrasplenic immunization, and antigen targeting also have been developed. Sup- 
pression of dominant immune responses is used to permit the expression of antibody-producing cells 
with specificity for poor immunogens and is achieved using the selective ability of cyclophosphoamide 
to dampen the immune response to a particular antigen, followed by subsequent immunization with a 
similar antigen. In in vitro immunization, spleen cells from nonimmunized animals are incubated with 
small quantities of antigen in a growth medium that has been "conditioned" by the growth of thymocytes. 
This technique is used most commonly for the production of human hybridomas, where in vivo immu- 
nization is not feasible. Intrasplenic immunization involves direct injection of immunogen into the spleen 
and is typically used when only very small quantities of antigen are available. Other advantages include 
a shortened immunization time, ability to generate high-affinity monoclonals, and improved diversity 
in the classes of antibodies generated. Finally, in antigen targeting, the immune response is enhanced by 
targeting the antigen to specific cells of the immune system, for example, by coupling to anti-class II 
monoclonal antibodies. 

There also have been several advances in fusion techniques such as electrofusion, antigen bridging, 
and laser fusion. In electrofusion, cells in suspension are aligned using an ac current resulting in cell-cell 
contact. A brief dc voltage is applied which induces fusion and has resulted in a 30- to 100-fold increase 
in fusion frequencies in some selected cases. Further improvement in electrofusion yields have been 
obtained by an antigen bridging, wherein avidin is conjugated with the antigen and the myeloma cell 
membranes are treated with biotin. Spleen cells expressing immunoglobulins of correct specificity bind 
to the antigen-avidin complex and are in turn bound by the biotinylated cell membranes of the myeloma 
cells. Finally, laser-induced cell fusion in combination with antigen bridging has been used to eliminate 
the tedious and time-consuming screening process associated with traditional hybridoma techniques. 
Here, rather than carry out the fusion process in "bulk," preselected B cells producing antibody of desired 
specificity and affinity are fused to myeloma cells by irradiating each target cell pair (viewed under a 
microscope) with laser pulses. Each resulting hybridoma cell is then identified, subcloned, and subse- 
quently expanded. 

Despite these improvements, many of the limitations of the hybridoma technique persist. First, it is 
slow and tedious and labor- and cost-intensive. Second, only a few antibody-producing hybridoma lines 
are created per fusion, which does not provide for an adequate survey of the immunologic repertoire. 
Third, as is the case with most mammalian cell lines, the actual antibody production rate is low. Fourth, 
it is not easy to control the class or subclass of the resulting antibodies, a characteristic that often 
determines their biologic activity and therefore their usefulness in therapeutic applications. Finally, the 
production of human monoclonal antibodies by conventional hybridoma techniques has not been very 
successful due to a lack of suitable human fusion partners, problems related to immunization, difficulty 
in obtaining human lymphocytes, etc. 

Repertoire Cloning Technology 

The shortcomings of the hybridoma technology and the potential for improvement of molecular prop- 
erties of antibody molecules by a screening approach have lead to the expression of the immunologic 
repertoire in Escherichia coli (repertoire cloning). Two developments were critical to the development of 
this technology. First, the identification of conserved regions at each end of the nucleotide sequences 
encoding the variable domains enabled the use of the polymerase chain reactions (PCR) to clone antibody 
Fv and Fab genes from both spleen genomic DNA and spleen messenger RNA [Orlandi et al., 1989; 
Sastry et al., 1989]. The amplification of a target sequence via PCR requires two primers, each annealing 
to one end of the target gene. In the case of immunoglobulin variable-region genes, the J segments are 
sufficiently conserved to enable the design of universal "downstream" primers. In addition, by comparing 
the aligned sequences of many variable-region genes, it was found that 5' ends of the V H and V L genes 
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are also relatively conserved so as to enable the design of universal "upstream" primers as well. These 
primers were then used to establish a repertoire of antibody variable-region genes. Second, the successful 
expression of functional antigen-binding fragments in bacteria using a periplasmic secretion strategy 
enabled the direct screening of libraries of cloned antibody genes for antigen binding [Better et al., 1988; 
Skerra and Pluckthun, 1988]. 

The first attempt at repertoire cloning resulted in the establishment of diverse libraries of V H genes 
from spleen genomic DNA of mice immunized with either lysozyme or keyhole-limpet hemocyanin 
(KLH). From these libraries, V H domains were expressed and secreted in E. coli [Ward et al., 1989]. 
Binding activities were detected against both antigens in both libraries; the first library, immunized against 
lysozyme, yielded 21 clones with lysozyme activity and 2 with KLH activity, while the second library, 
immunized against KLH, yielded 14 clones with KLH activity and 2 with lysozyme activity. Two V H 
domains were characterized with affinities for lysozyme in the 20 nM range. The complete sequences of 
48 V H gene clones were determined and shown to be unique. The problems associated with this single- 
domain approach are (1) isolated V H domains suffer from several drawbacks such as lower selectivity 
and poor solubility and (2) an important source of diversity arising from the combination of the heavy 
and light chains is lost. 

In the so-called combinatorial approach, Huse et al. [1989] used a novel bacteriophage X vector system 
(A.-ZAP technology) to express in E. coli a combinatorial library of Fab fragments of the murine antibody 
repertoire. The combinatorial expression library was constructed from spleen mRNA isolated from a 
mouse immunized with KLH-coupled p-nitrophenyl phosphonamidate (NPN) antigen in two steps. In 
the first step, separate heavy-chain (Fd) and light-chain (k) libraries were constructed. These two libraries 
were then combined randomly, resulting in the combinatorial library in which each clone potentially 
coexpresses a heavy and a light chain. In this case, they obtained 25 million clones in the library, 
approximately 60% of which coexpressed both chains. One million of these were subsequently screened 
for antigen binding, resulting in approximately 100 antibody-producing, antigen-binding clones. The 
light- and heavy-chain libraries, when expressed individually, did not show any binding activity. In 
addition, the vector systems used also permitted the excision of a phagemid containing the Fab genes; 
when grown in E. coli, these permitted the production of functional Fab fragments in the periplasmic 
supernatants. While the study did not address the overall diversity of the library, it did establish repertoire 
cloning as a potential alternative to conventional hybridoma technology. 

Repertoire cloning via the A.-ZAP technology (now commercially available as the ImmunoZap kit from 
Stratacyte) has been used to generate antibodies to influenza virus hemagglutinin (HA) starting with 
mRNA from immunized mice [Caton and Kaprowski, 1990]. A total of 10 antigen-binding clones was 
obtained by screening 125,000 clones from the combinatorial library consisting of 25 million members. 
Partial sequence analysis of the V H and V K regions of five of the HA-positive recombinants revealed that 
all the HA-specific antibodies generated by repertoire cloning utilized a V H region derived from members 
of a single B-cell clone in conjunction with one of two light-chain variable regions. A majority of the 
HA-specific antibodies exhibited a common heavy-light-chain combination which was very similar to 
one previously identified among HA-specific hybridoma monoclonal antibodies. The relative represen- 
tation of these sequences and the overall diversity of the library also were studied via hybridization studies 
and sequence analysis of randomly selected clones. It was determined that a single functional V H sequence 
was present at a frequency of 1 in 50, while the more commonly occurring light-chain sequence was 
present at a frequency of 1 in 275. This indicates that the overall diversity of the gene family representation 
in the library is fairly limited. 

The A.-ZAP technology also has been used to produce high-affinity human monoclonal antibodies 
specific for tetanus toxoid [Mullinax et al, 1990; Persson et al., 1991]. The source of the mRNA in these 
studies was peripheral blood lymphocytes (PBLs) from human donors previously immunized with 
tetanus toxoid and boosted with the antigen. Mullinax et al. [1990] estimated that the frequency of 
positive clones in their library was about 1 in 500, and their affinity constants ranged from 10 6 to 10 9 M 
(Molar) -1 . However, the presence of a naturally occurring Sad site (one of the restriction enzymes used 
to force clone PCR-amplified light-chain genes) in the gene for human C K may have resulted in a reduction 
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in the frequency of positive clones. Persson et al. [1991] constructed three different combinatorial libraries 
using untreated PBLs, antigen-stimulated PBLs (cells cultured in the presence of the antigen), and 
antigen-panned PBLs (cells that were selected for binding to the antigen). Positive clones were obtained 
from all three libraries with frequencies if 1 in 6000, 1 in 5000, and 1 in 4000, respectively. Apparently 
binding constants were estimated to be in the range of 10 7 to 10 9 Mr 1 . Sequence analysis of a limited 
number of clones isolated from the antigen-stimulated cell library indicated a greater diversity than that 
described for HA or NPN. For example, of the eight heavy-chain CDR3 sequences examined, only two 
pairs appeared to be clonally related. The A.-ZAP technology also has been used to rescue a functional 
human antirhesus D Fab from an EBV-transformed cell line [Burton, 1991]. 

In principle, repertoire cloning would allow for the rapid and easy identification of monoclonal 
antibody fragments in a form suitable for genetic manipulation. It also provides for a much better survey 
of the immunologic repertoire than conventional hybridoma technology. However, repertoire cloning is 
not without its disadvantages. First, it allows for the production of only antibody fragments. This 
limitation can be overcome by mounting the repertoire cloned variable domains onto constant domains 
that possess the desired effector functions and using transfectoma technology to express the intact 
immunoglobulin genes in a variety of host systems. This has been demonstrated for the case of a human 
Fab fragment to tetanus toxoid, where the Fab gene fragment obtained via repertoire cloning was linked 
to an Fc fragment gene and successfully expressed in a CHO cell line [Bender et al., 1993]. The second 
limitation to the use of "immunized" repertoires, which has serious implications in the applicability of 
this technology for the production of human monoclonal antibodies. The studies reviewed above have 
all used spleen cells or PBLs from immunized donors. This has resulted in relatively high frequency of 
positive clones, eliminating the need for extensive screening. Generating monoclonal antibodies from 
"naive" donors (who have not had any exposure to the antigen) would require the screening of very large 
libraries. Third, the actual diversity of these libraries is still unclear. The studies reported above show a 
wide spectrum ranging from very limited (the HA studies) to moderate (NPN) to fairly marked diversity 
(tetanus toxoid). Finally, the combinatorial approach is disadvantageous in that it destroys the original 
pairing of the heavy and light chains selected for by immunization. Strategies for overcoming some of 
these limitations have already been developed and are reviewed below. 

Phage Display Technology 

A critical aspect of the repertoire cloning approach is the ability to screen large libraries rapidly for clones 
that possess desired binding properties, e.g., binding affinity or specificity, catalysis, etc. This is especially the 
case for "naive" human repertoires, wherein the host has not been immunized against the antigen of interest 
for ethical and/or safety reasons. In order to facilitate screening of large libraries of antibody genes, phage 
display of functional antibody fragments has been developed, which has resulted in an enormous increase 
in the utility of repertoire cloning technology. In phage display technology, functional antibody fragments 
(such as the sFv and Fab) are expressed on the surface of filamentous bacteriophages, which facilitates the 
selection of specific binders (or any other property such as catalysis, etc.) from a large pool of irrelevant 
antibody fragments. Typically, several hundreds of millions of phage particles (in a small volume of 50 to 
100 ml) can be tested for specific binders by allowing them to bind to the antigen of interest immobilized 
to a solid matrix, washing away the nonbinders, and eluting the binders using a suitable elution protocol. 

Phage display of antibody fragments is accomplished by coupling of the antibody fragment to a coat 
protein of the bacteriophage. Two different coat proteins have been used for this purpose, namely, the 
major coat protein encoded for by gene VIII and the adsorption protein encoded for by gene III. The 
system based on gene VIII displays several copies of the antibody fragment (theoretically there are 
2000 copies of gene VIII product per phage) and is used for the selection of low-affinity binders. The 
gene III product is, on the other hand, present at approximately four copies per phage particle and leads 
to the selection of high-affinity binders. However, since the native gene III product is required for 
infectivity, at least one copy on the phage has to be a native one. 

The feasibility of phage display of active antibody fragments was first demonstrated by McCafferty 
et al. [1990] when the single-chain Fv fragment (single-chain antibody) of the anti-hen egg white 
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lysozyme (HEL) antibody was cloned into an fd phage vector at the N-terminal region of the gene III 
protein. This study showed that complete active sFv domains could be displayed on the surface of 
bacteriophage fd and that rare phage displaying functional sFv (1 in 10 6 ) can be isolated. Phage that 
bound HEL were unable to bind to turkey egg white lysozyme, which differs from HEL by only seven 
residues. Similarly, active Fab fragments also have been displayed on phage surfaces using gene VIII [Kang 
et al., 1991]. In this method, assembly of the antibody Fab molecules in the periplasm occurs in concert 
with phage morphogenesis. The Fd chain of the antibody fused to the major coat protein VIII of phage 
Ml 3 was coexpressed with K chains, with both chains being delivered to the periplasmic space by the 
pelB leader sequence. Since the Fd chain is anchored in the membrane, the concomitant secretion of the 
K chains results in the assembly of the two chains and hence the display of functional Fab on the membrane 
surface. Subsequent infection with helper phage resulted in phage particles that had incorporated func- 
tional Fab along their entire surface. Functionality of the incorporated Fab was confirmed by antigen- 
specific precipitation of phage, enzyme-linked immunoassays, and electron microscopy. The production 
of soluble antibody fragments from selected phages can now be accomplished without subcloning [Hoo- 
genboom et al., 1991]. The switch from surface display to soluble antibody is mediated via the use of an 
amber stop codon between the antibody gene and phage gene protein. In a supE suppresser strain of 
E. coli, the amber codon is read as Glu and the resulting fusion protein is displayed on the surface of the 
phage. In nonsuppresser strains, however, the amber codon is read as a stop codon, resulting in the 
production of soluble antibody. 

The combination of repertoire cloning technology and phage display technology was initially used to 
screen antibody fragments from repertoires produced from immunized animals, namely, the production 
of Fv fragments specific for the hapten phenyloxazolone using immunized mice [Clackson et al., 1991], 
human Fab fragments to tetanus toxoid [Barbas et al., 1991], human Fab fragments to gpl20 using 
lymphocytes isolated from HIV-positive individuals [Burton et al., 1991], and human Fab fragments to 
hepatitis B surface antigen from vaccinated individuals [Zebedee et al., 1992]. These studies established 
the utility of phage display as a powerful screening system for functional antibody fragments. For example, 
attempts to generate human Fab fragments against gpl20 using the A.-ZAP technology failed to produce 
any binders. Phage display, on the other hand, resulted in 33 of 40 clones selected via antigen panning 
possessing clear reactivity with affinity constants of the order of 10~ 8 Mr 1 . In the case of the tetanus toxoid 
studies, phage display was used to isolate specific clones from a library that included known tetanus 
toxoid clones at a frequency of 1 in 170,000. 

Bypassing Immunization 

The next step was the application of phage display technology to generate antibodies from unimmunized 
donors (naive repertoires). Marks et al. [1991] constructed two combinatorial libraries starting from 
peripheral blood lymphocytes of unimmunized human donors, namely, an IgM library using p.-specific 
PCR primers and an IgG library using y-specific primers. The libraries were then screened using phage 
display and sFv fragments specific for nonself antigens such as turkey egg white lysozyme, bovine serum 
albumin, phenyloxazolone, and bovine thyroglobulin, as well as for self antigens such as human thyro- 
globulin, human tumor necrosis factor a, cell surface markers, carcinoembryonic antigen, and mucin, 
and human blood group antigens were isolated [Hoogenboom et al., 1992]. The binders were all isolated 
from the IgM library (with the exception of six clones for turkey egg white lysozyme isolated from the 
IgG library), and the affinities of the soluble antibody fragments were low to moderate (2 x 10 6 to 10 7 Mr 1 ). 
Both these results are typical of the antibodies produced during a primary response. 

The second stage of an immune response in vivo involves affinity maturation, in which the affinities 
of antibodies of the selected specificities are increased by a process of somatic mutation. Thus one method 
by which the affinities of antibodies generated from naive repertoires may be increased is by mimicking 
this process. Random mutagenesis of the clones selected from naive repertoires has been accomplished 
by error-prone PCR [Gram et al., 1992]. In this study, low-affinity Fab fragments (10 4 to 10 5 Mr 1 ) to 
progesterone were initially isolated from the library, and their affinities increased 13- to 30-fold via 
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random mutagenesis. An alternative approach to improving the affinities of antibodies obtained from 
naive repertoires involves the use of chain shuffling [Marks et al., 1992]. This study describes the affinity 
maturation of a low-affinity antiphenyloxazolone antibody (3 x 10 7 M _1 ). First, the existing light chain 
was replaced with a repertoire of in vivo somatically mutated light chains from unimmunized donors 
resulting in the isolation of a clone with a 20-fold increase in affinity. Next, shuffling of the heavy chain 
in a similar manner with the exception of retaining the original H3 resulted in a further increase in 
affinity of about 16-fold. The net increase in affinity (320-fold) resulted in a dissociation constant of 
1 nM, which is comparable with the affinities obtained through hybridoma technology. 

Other approaches to bypassing human immunization involve the use of semisynthetic and synthetic 
combinatorial libraries [Barbas et al., 1992] and the immunization of SCID mice that have been populated 
with human peripheral blood lymphocytes [Duchosal et al., 1992]. 

102.3 Monoclonal Antibody Expression Systems 

Several expression systems are currently available for in vitro production of antibodies such as bacteria, 
yeast, plants, baculovirus, and mammalian cells. In each of these systems, cloned antibody genes are the 
starting point for production. These are obtained either by traditional cloning techniques starting with 
preexisting hybridomas or by the more recent repertoire cloning techniques. Each of the aforementioned 
systems has its own advantages and drawbacks. For example, bacterial expression systems suffer from 
the following limitations: They cannot be used for producing intact antibodies, nor can they glycosylate 
antibodies. Unglycosylated antibodies cannot perform many of the effector functions associated with 
normal antibody molecules. Proper folding may sometimes be a problem due to difficulty in forming 
disulfide bonds, and often, the expressed antibody may be toxic to the host cells. On the other hand, 
bacterial expression has the advantage that it is cheap, can potentially produce large amounts of the 
desired product, and can be scaled up easily. In addition, for therapeutic products, bacterial sources are 
to be preferred over mammalian sources due to the potential for the contamination of mammalian cell 
lines with harmful viruses. 

Bacterial Expression 

Early attempts to express intact antibody molecules in bacteria were fairly unsuccessful. Expression of 
intact light and heavy chains in the cytoplasm resulted in the accumulation of the proteins as nonfunc- 
tional inclusion bodies. In vitro reassembly was very inefficient [Boss et al., 1984; Cabilly et al., 1984]. 
These results could be explained on the basis of the fact that the E. coli biosynthetic environment does 
not support protein folding that requires specific disulfide bond formation, post-translational modifica- 
tions such as glycosylation, and polymeric polypeptide chain assembly. 

There has been much more success obtained with antibody fragments. Bacterial expression of IgE "Fc- 
like" fragments has been reported [Kenten et al., 1984; Ishizaka et al., 1986] . These IgE fragments exhibited 
some of the biologic properties characteristic of intact IgE molecules. The fragments constituted 18% of 
the total bacterial protein content but were insoluble and associated with large inclusion bodies. Following 
reduction and reoxidation, greater than 80% of the chains formed dimers. The fragment binds to the 
IgE receptor on basophils and mast cells and, when cross-linked, elicits the expected mediator (histamine) 
release. 

Cytoplasmic expression of a Fab fragment directed against muscle-type creatinine kinase, followed by 
in vitro folding, resulted in renaturation of about 40% of the misfolded protein, with a total active protein 
yield of 80 p.g/ml at 10°C [Buchner and Rudolph, 1991]. Direct cytoplasmic expression of the so-called 
single-chain antibodies, which are novel recombinant polypeptides composed of an antibody V L tethered 
to a V H by a designed "linker" peptide that links the carboxyl terminus of the V L to the amino terminus 
of the V H or vice versa, was the next important step. Various linkers have been used to join the two 
variable domains. Bird et al. [1988] used linkers of varying lengths (14 to 18 amino acids) to join the 
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two variable domains. Huston et al. [1988] used a 15 amino acid "universal" linker with the sequence 
(GGGGS) 3 . The single-chain protein was found to accumulate in the cell as insoluble inclusion bodies 
and needed to be refolded in vitro. However, these proteins retained both the affinity and specificity of 
the native Fabs. 

In 1988, two groups reported, for the first time, the expression of functional antibody fragments (Fv 
and Fab, respectively) in E. coli [Better et al., 1988; Skerra and Pluckthun, 1988]. In both cases, the 
authors attempted to mimic in bacteria the natural assembly and folding pathway of antibody molecules. 
In eukaryotic cells, the two chains are expressed separately with individual leader sequences that direct 
their transport to the endoplasmic reticulum (ER), where the signal sequences are removed and correct 
folding, disulfide formation, and assembly of the two chains occur. By expressing the two chain fragments 
( V L and V H in the case of Fv expression and the Fd and K chains in the case of Fab expression) separately 
with bacterial signal sequences, these workers were successful in directing the two precursor chains to 
the periplasmic space, where correct folding, assembly, and disulfide formation occur along with the 
removal of the signal sequences, resulting in fully functional antibody fragments. Skerra and Pluckthun 
[1988] report the synthesis of the Fv fragment of MOPC 603 which has an affinity constant identical to 
that of the intact Ab. Better et al. [1988] report the synthesis of the Fab fragment of an Ig that binds to 
a ganglioside antigen. While Skerra and Pluckthun obtained a yield of 0.2 mg/liter after a periplasmic 
wash, Better et al. found that the Fab fragment was secreted into the culture medium with a yield of 
2 mg/liter. However, previous attempts to synthesize an active, full-size Ig in E. coli by coexpression and 
secretion mediated by procaryotic signal sequences resulted in poor synthesis and/or secretion of the 
heavy chain. 

Since these early reports, several additional reports have been published (for reviews, see Pluckthun 
[1992] and Skerra [1993]) which have established the two aforementioned strategies (namely, direct 
cytoplasmic expression of antibody fragments followed by in vitro refolding and periplasmic expression 
of functional fragments) as the two standard procedures for the bacterial expression of antibody frag- 
ments. Expression of the protein in the periplasmic space has advantages: (1) the expressed protein is 
recovered in a fully functional form, thereby eliminating the need for in vitro refolding (as is required 
in the case of cytoplasmically expressed fragments), and (2) it greatly simplifies purification. On the 
other hand, direct cytoplasmic expression may, in some cases, reduce problems arising from toxicity of 
the expressed protein and also may increase the total yield of the protein. 

Several improvements have been made in the past few years so as to simplify the expression and 
purification of antibody fragments in bacteria. These include the development of improved vectors with 
strong promoters for the high-level expression of antibody fragments, the incorporation of many different 
signal sequences, and the incorporation of cleavable "affinity" handles that simplify purification. Many 
expression vector systems are now commercially available that enable the rapid cloning, sequencing, and 
expression of immunoglobulin genes in bacteria within a matter of 2 to 3 weeks. 

Expression in Lymphoid and Nonlymphoid Systems 
(Transfectoma Technology) 

Expression of immunoglobulin genes by transfection into eukaryotic cells (transfectoma technology) such 
as myelomas and hybridomas is an alternative approach for producing monoclonal antibodies [Wright 
et al., 1992; Morrison and Oi, 1989]. Myelomas and hybridomas are known to be capable of high-level 
expression of endogenous heavy- and light-chain genes and can glycosylate, assemble, and secrete func- 
tional antibody molecules and therefore are the most appropriate mammalian cells for immunoglobulin 
gene transfection. Nonlymphoid expression in CHO and COS cells also has been examined as a potential 
improvement over expression in lymphoid cells. The biologic properties and effector functions, which 
are very important considerations for applications involving human therapy and diagnostics, are com- 
pletely preserved in this mode of expression. However, transfectoma technology still involves working 
with eukaryotic cell lines with low antibody production rates and poor scale-up characteristics. 
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Transfectoma technology provides us with the ability to genetically manipulate immunoglobulin genes 
to produce antibody molecules with novel and/or improved properties. For example, production of 
"chimeric" and "reshaped" antibodies, wherein the murine variable domains or CDRs are mounted onto 
a human antibody framework in an attempt to reduce the problem of immunogenicity in administering 
murine antibodies for in vivo diagnostic and/or therapeutic purposes, would not be possible without the 
techniques of transfectoma technology. It is also possible to change the isotype of the transfectoma 
antibodies in order to change their biologic activity. It also has enabled the fusion of antibodies with 
nonimmunoglobulin proteins such as enzymes or toxins, resulting in novel antibody reagents used in 
industrial and medicinal applications. 

The most commonly used vectors are the pSV2 vectors that have several essential features. First, they 
contain a plasmid origin of replication and a marker selectable for procaryotes. This makes it relatively 
easy to obtain large quantities of DNA and facilitates genetic manipulation. Second, they contain a marker 
expressible and selectable in eukaryotes. This consists of a eukaryotic transcription unit with an SV40 
promoter, splice, and poly A addition site. Into this eukaryotic transcription unit is placed a dominant 
selectable marker derived from procaryotes (either the neo gene or the gpt gene). 

In order to create the immunoglobulin molecules, the two genes encoding the heavy and light chains 
must be transfected and both polypeptides must be synthesized and assembled. Several methods have 
been used to achieve this objective. Both the heavy- and light-chain genes have been inserted into a single 
vector and then transfected. This approach generates large, cumbersome expression vectors, and further 
manipulation of the vector is difficult. A second approach is to transfect sequentially the heavy- and 
light-chain genes. To facilitate this, one gene is inserted into a vector with the neo gene, permitting 
selection with antibiotic G418. The other gene is placed in an expression vector containing the gpt gene, 
which confers mycophenolic acid resistance to the transfected cells. Alternatively, both genes may be 
introduced simultaneously into lymphoid cells using protoplast fusion. 

Heavy- and light-chain genes, when transfected together, produced complete, glycosylated, assembled 
tetrameric antigen-binding antibody molecules with appropriate disulfide bonds. Under laboratory con- 
ditions, these transfected cells yield about 1 to 20 mg/liter of secreted antibody. A persisting problem has 
been the expression level of the transfected immunoglobulin gene. The expression of transfected heavy- 
chain genes is frequently seen to approach the level seen in myeloma cells; however, efficient expression 
of light-chain genes is more difficult to achieve. 

Expression in Yeast 

Yeast is the simplest eukaryote capable of glycosylation and secretion and has the advantages of rapid 
growth rate and ease of large-scale fermentation. It also retains the advantage that unlike other mam- 
malian systems, it does not harbor potentially harmful viruses. 

Initial attempts to express A. and p. immunoglobulin chains specific for the hapten NP in the yeast 
Sacchromyces cerevisiae under the control of the yeast 3-phosphoglycerate kinase (PGK) promoter resulted 
in the secretion into culture medium at moderate efficiency (5% to 40%), but secreted antibodies had 
no antigen-binding activity [Wood et al., 1985]. Subsequent attempts to coexpress heavy and light chains 
with the yeast invertase signal sequence under the control of the yeast phosphoglycerate kinase (PGK) 
promoter were more successful, presumably due to differences in the efficiency of different yeast signal 
sequences in directing the secretion of mammalian proteins from yeast. Culture supernatants contained 
significant quantities of both light and heavy chains (100 and 50 to 80 mg/liter, respectively) with about 
50% to 70% of the heavy chain associated with the light chain. The yeast-derived mouse-human chimeric 
antibody L6 was indistinguishable from the native antibody in its antigen-binding properties [Horowitz 
et al., 1988; Better and Horowitz, 1989] . Furthermore, it was superior to the native antibody in mediating 
antibody-dependent cellular cytotoxicity (ADCC) but was incapable of eliciting complement-dependent 
cytolysis (CDC). Yeast-derived L6 Fab also was indistinguishable from proteolytically generated Fab as 
well as recombinant Fab generated from E. coli. 
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Expression in Baculovirus 

The baculovirus expression system is potentially a very useful system for the production of large amounts 
of intact, fully functional antibodies for diagnostic and even therapeutic applications. Foreign genes 
expressed in insect cell cultures infected with baculovirus can constitute as much as 50% to 75% of the 
total cellular protein late in viral replication. Immunoglobulin gene expression is achieved by commer- 
cially available vectors that place the gene to be expressed under the control of the efficient promoter of 
the gene encoding the viral polyhedrin protein. The levels of expression seen in this system can be as 
much as 50- to 100-fold greater per cell than in procaryotes while retaining many of the advantages of 
an eukaryotic expression system such as glycosylation and extracellular secretion. However, scale-up is 
not straightforward, since viral infection eventually results in cell death. 

There have been at least two reports of antibody secretion in a baculovirus system [Hasemann and 
Capra, 1990; Putlitz, 1990]. In both cases, the secreted antibody was correctly processed, glycosylated 
(albeit differently than hybridoma-derived antibody), and assembled into a normal functional het- 
erodimer capable of both antigen binding and complement binding. The secreted antibodies were 
obtained at a yield of about 5 mg/liter. 

Expression in Plants 

The development of techniques for plant transformation has led to the expression of a number of foreign 
genes, including immunoglobulin genes, in transgenic plants [Hiatt and Ma, 1992]. The most commonly 
used plant cell transformation protocol employs the ability of plasmid Ti of Agrobacterium tumefaciens 
to mediate gene transfer into the plant genome. 

The expression of a murine anti-phosphonate ester catalytic antibody in transgenic plants was accom- 
plished by first transforming the heavy-chain and light-chain genes individually into different tobacco 
plants [Hiatt et al., 1989]. These were then sexually crossed to obtain progeny that expressed both chains 
simultaneously. It was shown that leader sequences were necessary for the proper expression and assembly 
of the antibody molecules. The level of antibody expression was determined to be about 3 ng/mg of total 
protein, and the plant-derived antibody was comparable with ascites-derived antibody with respect to 
binding as well as catalysis. 

102.4 Genetically Engineered Antibodies 
and Their Fragments 

The domain structure of the antibody molecule allows the reshuffling of domains and the construction 
of functional antibody fragments. A schematic representation of such genetically engineered antibodies 
and their fragments is shown in Figs. 102.2 and 102.3. 

Among intact engineered constructs are chimeric, humanized, and bifunctional antibodies (see Wright 
et al. [1992] and Sandhu [1992] for reviews). A major issue in the long-term use of murine monoclonal 
antibodies for clinical applications is the immunogenicity of these molecules or the so-called HAMA 
response (human antimouse antibody response). Simple chimeric antibodies were constructed by linking 
murine variable domains to human constant domains in order to reduce immunogenicity of therapeu- 
tically administered murine monoclonals. The approach has been validated by several clinical trials that 
show that chimeric antibodies are much less likely to induce a HAMA response compared with their 
murine counterparts. In a more sophisticated approach, CDR grafting has been used to "humanize" 
murine monoclonal antibodies for human therapy by transplanting the CDRs of a murine monoclonal 
antibody of appropriate antigenic specificity onto a human framework. Humanized antibodies are, in 
some cases, even better than their chimeric counterparts in terms of the HAMA response. 

Bifunctional antibodies that contain antigen-specific binding sites with two different specificities have 
been produced via genetic engineering as well as chemical techniques [Fanger et al., 1992]. The dual 
specificity of bispecific antibodies can be used to bring together the molecules or cells that mediate the 
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FIGURE 102.2 Schematic of an antibody molecule and its antigen binding fragments, (a-e) Examples of the 
fragments expressed in Escherichia coli: (a) single-chain Fv fragment in which the linking peptide joins the light and 
heavy Fv segments in the following manner: V L (carboxyl terminus)-peptide linker-V H (amino terminus); (b) single- 
chain Fv fragment with the following connection: V L (amino terminus)-peptide linker-V H (carboxyl terminus); 
(c) disulfide-linked Fv fragment; (d) "miniantibody" comprised of two helix-forming peptides each fused to a single- 
chain Fv; (e) Fab fragments linked by helix-forming peptide fused to heavy chain; (/) Fab fragments linked by helix- 
forming peptide to light chain. 
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FIGURE 102.3 Schematic of genetically engineered intact monoclonal antibodies prepared with antibody-engi- 
neering techniques. 

desired effect. For example, a bispecific antibody that binds to target cells such as a tumor cell and to 
cytotoxic trigger molecules on host killer cells such as T cells has been used to redirect the normal immune 
system response to the tumor cells in question. Bispecific antibodies also have been used to target toxins 
to tumor cells. 
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The list of genetically engineered antibody fragments is a long and growing one [Pluckthun, 1992]. 
Of these, fragments such as the Fab, F(ab)2, Fv, and the Fc are not new and were initially produced by 
proteolytic digestion. The Fab fragment (fragment antigen-binding) consists of the entire light chain and 
the two N-terminal domains of the heavy chain (V H and C H1 , the so-called Fd chain) and was first 
generated by digestion with papain. The other fragment that is generated on papain digestion is the Fc 
fragment (fragment crystallizable), which is a dimeric unit composed of two C H2 and C H3 domains. The 
F(ab)' 2 fragment consists of two Fab arms held together by disulfide bonds in the hinge region and was 
first generated by pepsin digestion. Finally, the Fv fragment, which consists of just the two N-terminal 
variable domains, also was first generated by proteolytic digestion and is now more commonly generated 
via antigen-engineering techniques. 

The other fragments listed in Fig. 102.2 are of genetic origin. These include the sFv (single-chain 
antigen), the V H domain (single-domain antigen), multivalent sFvs (miniantibodies), and multivalent 
Fabs. The multivalent constructs can either be monospecific or bispecific. The single-chain antigen (SCA, 
sFv) consists of the two variable domains linked together by a long flexible polypeptide linker [Bird et 
al., 1988; Huston et al., 1988]. The sFv is an attempt to stabilize the Fv fragment, which is known to 
dissociate at low concentrations into its individual domains due to the low-affinity constant for the V H - 
V L interaction. Two different constructs have been made: the V L -V H construct where the linker extends 
from the C terminus of the V L domain to the N terminus of the V H domain, and the V H -V L construct, 
where the linker runs from the C terminus of the V H domain to the N terminus of the V L . The linker is 
usually about 15 amino acids long (the length required to span the distance between the two domains) 
and has no particular sequence requirements other than to minimize potential interferences in the folding 
of the individual domains. The so-called universal linker used by many workers in the area is (GGGGS) 3 . 
Other strategies to stabilize the Fv fragment include chemical cross-linking of the two domains and 
disulfide-linked domains [Glockshuber et al., 1990]. Chemical cross-linking via glutaraldehyde has been 
demonstrated to be effective in stabilizing the Fv fragment in one instance; here, the cross-linking was 
carried out in the presence of the hapten (phosphorylcholine) to avoid modification of the binding site, 
an approach that may not be feasible with protein antigens. In the disulfide-linked sFv, Cys residues are 
introduced at suitable locations in the framework region of the Fv so as to form a natural interdomain 
disulfide bond. This strategy was shown to be much more effective in stabilizing the Fv fragment against 
thermal denaturation than either the single-chain antigen approach or chemical cross-linking for the 
one case where all three approaches were tested [Glockshuber et al., 1990]. 

The single-domain antigen consists of just the V H domain and has been shown by some to possess 
antigen-binding function on its own in the absence of the V L domain [Ward et al., 1989]. There is some 
skepticism regarding this approach due to the rather high potential for nonspecific binding (the removal 
of the V L domain exposes a very hydrophobic surface), poor solubility, and somewhat compromised 
selectivity. For example, while the Fv fragment retains its ability to distinguish between related antigenic 
species, the single-domain antigen does not. 

Miniantibodies consist of sFv fragments held together by so-called dimerization handles. sFv fragments 
are fused via a flexible hinge region to several kinds of amphipathic helices, which then acts as dimer- 
ization devices [Pack and Pluckthun, 1992]. Alternately, the two sFvs can be fused via a long polypeptide 
linker, similar to the one linking the individual domains of each Fv but longer to maintain the relative 
orientation of the two binding sites. Multivalent Fabs use a somewhat similar approach with the dimer- 
ization handles comprising of zippers from the transcription factors jun and fos [Kostelny et al., 1992]. 

In addition to the constructs described above, a whole new set of genetically engineered fusion proteins 
with antibodies has been described [Wright et al., 1992]. Antibody fusion proteins are made by replacing 
a part of the antibody molecule with a fusion partner such as an enzyme or a toxin that confers a novel 
function to the antibody. In some cases, such as immunotoxins, the variable regions of the antibody are 
retained in order to retain antigen binding and specificity, while the constant domains are deleted and 
replaced by a toxin such as ricin or Pseudomonas exotoxin. Alternately, the constant regions are retained 
(thereby retaining the effector functions) and the variable regions replaced with other targeting proteins 
(such as CD4 for AIDS therapy and IL-2 for cancer therapy). 
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102.5 Applications of Monoclonal Antibodies and Fragments 

The majority of applications for which monoclonal antibodies have been used can be divided into three 
general categories: (1) purification, (2) diagnostic functions (whether for detecting cancer, analyzing for 
toxins in food, or monitoring substance abuse by athletes), and (3) therapeutic functions. From the time 
that monoclonal antibody technology was introduced almost 20 years ago, application methodologies 
using whole antibody have gradually been transformed into methodologies using antibody fragments 
such as the Fab 2 , Fab', and Fv fragments and even synthetic peptides of a CDR region. Antibody conjugates 
have come to include bound drugs, toxins, radioisotopes, lymphokines, and enzymes [Pietersz and 
McKenzie, 1992] and are largely used in the treatment of cancer. Tables 102.1 and 102.2 list some typical 
examples of monoclonal antibodies and fragments used for diagnostic and therapeutic applications. 

Thousands of murine monoclonal antibodies have been made to human carcinomas since the intro- 
duction of antibody technology, but very few, if any, of these monoclonal antibodies are entirely specific 
for malignant cells. In the vast majority of cases, these monoclonal antibodies define tumor-associated 
differentiation antigens (TADAs), which are either proteins, mucins, proteoglycans, or glycolipids (gan- 
gliosides). Examples of TADA proteins are carcinoembryonic antigen (CEA) and OC-fetoprotein (AFP), 
both well-known diagnostic markers. An anti-CEA antibody has been conjugated with the enzyme car- 
boxypeptidase, which, in turn, activates a prodrug at the site of the tumor [Bagshawe et al., 1992]. This 
strategy overcomes the inability of monoclonal antibodies conjugated with drugs to deliver a therapeutic 
dose. Examples of TADA gangliosides are those referred to as GD2 and GD3, for which the respective 
unmodified monoclonal antibodies have shown to be effective therapeutics, particularly when intralesion- 
ally administered [Irie et a., 1989]. An example of a TADA mucin is the antigens found in colorectal and 
ovarian carcinoma that react with the antibody 72.3. Chimeric monoclonal antibodies as well as fragments 
of antibody 72.3 have been constructed and tested [Khazaeli et al., 1991; King et al., 1992]. 

As mentioned above, monoclonal antibodies defining different TADAs have been used passively 
(unmodified) and as carriers of, for example, radioisotopes and enzymes. From the results brought forth 
to date, the passive mode of antibody therapy has produced relatively few remissions in patients, and in 
those cases where it has shown effects, it is likely that the ability of the antibody to mediate ADCC 
(antibody-dependent cellular cytotoxicity) and CDC (complement-dependent cytotoxicity) has contrib- 
uted to the remission. For the case of modified monoclonal antibodies (e.g., toxin and radioisotope 
conjugates), success of treatment is varied depending on the type of neoplasm. Antibody conjugate 



TABLE 102.1 

Cell Lines 



Uses of Whole MAb Derived from Hybridoma, Other Cell Fusions, and Genetically Engineered 



MAb 



Antigen 



Ab Source 



Actual or Potential Uses 



C23 Cytomegalovirus glycoprotein of 

130 and 55 kDa 

EV2-7 Cytomegalovirus protein of 82 kDa 

OKT3 CD3 antigen 

Campath 1H Lymphocyte antigen 

R24 GD2 ganglioside TADA tumor- 

associated differentiation antigen 
L72, L55 GD3 ganglioside TADA 

Digoxin 

6H4 Salmonella flagella 



Humab — fusion of human 
lymphocyte and mouse 
myeloma (p3 X 63Ag8ul) 

Trioma: human X (human x 
mouse heteromyeloma) 

Murine MAb 

"Humanized" chimeric 
rat/human MAb; murine CDRs 
Murine MAb 

Human MAb 
Murine MAb 

Murine MAb 



Prophylactic agent for viral 
infection 

Prophylactic agent for CMV 

infection 
Eradication of T lymphocytes 

involved in graft rejection 
Prevention of bone marrow 

and organ rejection 
Treatment of melanoma 

Treatment of melanoma 
Immunodiagnostic for cardiac 

glycoside digoxin 
Detection of Salmonella 

bacteria in food 
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TABLE 102.2 Immunoconjugates Having Potential for Cancer Therapy 



MAb 
A7 

30.6, 1-1 
RFB4 



Anti-CEA 

(carcinoembryonic antigen) 
Recombinant anti-CEA MAb 

BW431 
B72.3 
B72.3 
B72.3 

B72.3 (Oncoscint) 



Conjugate 
Neocarzinostatin 

N- acetylmelphalan 

Adriamycin, Mitomycin 

C 
RFB4(Fab')-Ricin A 

(aCD22(Fab')-RicinA) 
Anti-CD 19-blocked ricin 
Xomazyme-Mel 
Carboxypeptidase G2 

(DNA coded) human 
P-glucuronidase 

131T 
131T 

Chelated m In 



Antibody Source 

Murine monoclonal (from 
fusion with murine myeloma 
P3, X63.Ag8.653) 

Murine monoclonal 

Murine monoclonal 

Murine Fab' 

Murine monoclonal 
Murine monoclonal 
Murine MAb F(ab') 2 fragment 
chemically bound to enzyme 
Transfectoma 

Murine monoclonal 
Mouse-human chimeric Ab 
Mouse-human chimeric Fab' 

fragment 
Murine MAb 



Use 

Eradication of colon 
cancer 

Eradication of colon 

cancer 
Various cancers 

B-cell lymphoma 

B-cell lymphoma 
Metastatic melanoma 
Colon cancer 



Ovarian cancer 
Colon cancer 
Colon cancer 

Diagnostic imaging 
agent for colorectal 
and ovarian cancers 



treatment of leukemia and lymphoma results in a relatively greater remission rate than that found in 
treatments of malignancies having solid tumors (i.e., carcinoma of the ovary, colon, and lung). 

The rare case of complete remission for solid tumors is probably due to the inaccessibility of antibody 
to that tumor. Several barriers impeding access of antibody to cancer cells have been pointed out [Jain, 
1988]. A few of these barriers include (1) the high interstitial fluid pressure in tumor nodules, 
(2) heterogeneous or poor vascularization of tumors, and (3) the long distances extravasated monoclonal 
antibodies must travel in the interstial mesh of proteoglycans in the tumor. There also exists the possibility 
that tumor antigen shed from the surface is limiting antibody buildup. In the case of bound toxins or 
drugs, there is the added concern that organs such as the kidney and liver are quickly processing and 
eliminating the antibody conjugates. In this respect, immunoconjugates based on antibody fragments 
(such as the sFv) can be very advantageous. For example, it has been shown that the sFv exhibits rapid 
diffusion into the extravascular space, increased volume of distribution, enhance tumor penetration, and 
improved renal elimination. An assessment of solid tumor therapy with modified antibody has led 
Riethmuller et al. [1993] to recommend that current cancer therapy be directed toward minimal residual 
disease, the condition in which micrometastatic cells exist after curatively resecting solid tumors. 

With regard to purification, the research literature is replete with examples of immunoaffinity puri- 
fication of enzymes, receptors, peptides, and small organic molecules. In contrast, commercial applica- 
tions of immunoaffinity chromatography, even on industrially or clinically relevant molecules, are far 
less widespread (Table 102.3). Despite its potential utility, immunoadsorption is an expensive process. A 
significant portion of the high cost is the adsorbent itself, which is related to the cost of materials, 
preparation, and most important, the antibody. In addition, the binding capacity of the immunoadsor- 
bent declines with repeated use, and a systemic study has shown that significant deactivation can typically 
take place over 40 to 100 cycles [Antonsen et al., 1991]. A number of factors can contribute to this 
degradation, including loss of antibody, structural change of the support matrix, nonspecific absorption 
of contaminating proteins, incomplete antigen elution, and loss of antibody function. In most cases, this 
degradation is associated with repeated exposure to harsh elution conditions. Noteworthy commercial 
applications of immunoaffinity chromatography on useful molecules include the separation of factor VIII 
used to treat hemophilia A and factor IX, another coagulation factor in the blood-clotting cascade 
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TABLE 102.3 Clinically or Industrially Relevant Proteins Purified by Immunoaffinity Chromatography 

Protein Use (Actual and Potential) 

Factor VIII Treatment of hemophilia A 

Factor IX Blood coagulant 

a-Galactosidase Improve the food stabilizing properties of guar gum 

Alkaline phosphatase Purify enzyme (a particular glycoform) used as a tumor marker for diagnostic tests 

Interferon (recombinant) Immunotherapeutic 

Interleukin 2 Immunotherapeutic 

[Tharakan et al., 1992]. The immunoaffinity purification step for factor VIII was one of several additional 
steps added to the conventional preparation methodology in which plasma cryoprecipitates were heat- 
treated. The new method contains a virus-inactivation procedure that precedes the immunoaffinity 
column, followed by an additional chromatographic step (ion exchange). The latter step serves to elim- 
inate the eluting solvent and further reduce virus-inactivating compounds. The often mentioned concern 
of antibody leakage from the column matrix did not appear to be a problem. Furthermore, with the 
relatively mild elution conditions used (40% ethylene glycol), one would expect little change in the 
antibody-binding capacity over many elution cycles. 

Typical immunoaffinity matrices contain whole antibody as opposed to antibody fragments. Frag- 
mentation of antibody by enzymatic means contributes additional steps to immunoadsorbent preparation 
and adds to the overall cost of the separation and is thus avoided. However, fragmentation can lead to 
a more efficient separation by enabling the orientation of antibody-binding sites on the surface of the 
immunomatrix [Prisyazhnoy et al., 1988; Yarmush et al., 1992]. Intact antibodies are bound in a random 
fashion, resulting in a loss of binding capacity upon immobilization. Recombinant antibody fragments 
could prove to be more useful for immunoaffinity applications due to the potential for production of 
large quantities of the protein at low cost and improved immobilization characteristics and stability 
[Spitznagel and Clark, 1993]. In what one could consider the ultimate fragment of a antibody, some 
investigators have utilized a peptide based on the CDR region of one of the chains (termed minimal 
recognition units) to isolate the antigen. Welling et al. [1990] have synthesized and tested a 13-residue 
synthetic peptide having a sequence similar to one hypervariable region of an antilysozyme antibody. 

Important diagnostic uses of antibody include the monitoring in clinical laboratories of the cardiac 
glycoside digoxin and the detection of the Salmonella bacteria in foods (Table 102.1). These two examples 
highlight the fact that despite the exquisite specificity offered by monoclonal antibodies, detection is not 
failure-proof. Within digoxin immunoassays there are two possible interfering groups: endogenous 
digoxin-like substances and digoxin metabolites; moreover, several monoclonal antibodies many be 
necessary to avoid under- or overestimating digoxin concentrations. In the case of bacteria detection in 
food, at least two antibodies (MOPC 467 myeloma protein and 6H4 antibody) are needed to detect all 
strains of Salmonella. 

102.6 Summary 

The domain structure of antibodies, both at the protein and genetic levels, facilitates the manipulation 
of antibody properties via genetic engineering (antibody engineering). Antibody engineering has shown 
tremendous potential for basic studies and industrial and medical applications. It has been used to explore 
fundamental questions about the effect of structure on antigen binding and on the biologic effector 
functions of the antibody molecules. A knowledge of the rules by which the particular sequences of amino 
acids involved in the binding surface are chosen in response to a particular antigenic determinant would 
enable the production of antibodies with altered affinities and specificities. Understanding the structures 
and mechanisms involved in the effector function of antibodies is already starting to result in the 
production of antibodies with novel biologic effector functions for use as diagnostic and therapeutic 
reagents. In addition, the production of antibodies via immunoglobulin gene expression has enabled the 
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engineering of novel hybrid, chimeric, and mosaic genes using recombinant DNA techniques and the 
transfection and expression of these genetically engineered genes in a number of different systems such 
as bacteria and yeast, plant cells, myeloma or hybridoma cells, and nonlymphoid mammalian cells. 
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Antisense molecules can selectively inhibit the expression of one gene among the 100,000 present in a 
typical human cell. This inhibition is likely based on simple Watson-Crick base-pairing interactions 
between nucleic acids and makes possible, in principle, the rational design of therapeutic drugs that can 
specifically inhibit any gene with a known sequence. The intervention into disease states at the level of 
gene expression may potentially make drugs based on antisense techniques significantly more efficient 
and specific than other standard therapies. Indeed, antisense technology is already an indispensable 
research tool and may one day be an integral part of future antiviral and anticancer therapies. 

Zamecnik and Stephenson were the first to use antisense DNA to modulate gene expression. They 
constructed antisense oligonucleotides complementary to the 3' and 5' ends of Rous sarcoma virus (RSV) 
35S RNA and added them directly to a culture of chick embryo fibroblasts infected with RSV [1]. 
Remarkably, viral replication was inhibited, indicating that the cells had somehow internalized the 
antisense DNA and that the DNA was somehow interrupting the viral life cycle. 

Natural antisense inhibition was first observed in bacteria as a means of regulating the replication of 
plasmid DNA [2, 3]. RNA primers required for the initiation of replication were bound by (i.e., formed 
duplexes with) antisense RNA. The concentration of these RNA primers, and therefore the initiation of 
replication, was controlled by the formation of these duplexes. Shortly after this discovery, investigators 
developed antisense RNA constructs to control gene expression in mammalian cells. Antisense RNA, 
encoded on expression plasmids that were transfected into mouse cells, successfully blocked expression 
of target genes [4]. These early successes launched what is now a significant effort to expand the use of 
antisense molecules for research and therapeutic purposes. 

103.1 Background 

Antisense oligonucleotides are DNA or RNA molecules whose sequences are complementary to RNA 
transcribed from the target gene. These molecules block the expression of a gene by interacting with its 
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A - T Base Pair ( fl ) G . c Base Pair (fr) 

FIGURE 103.1 Watson-Crick base pairing interactions between adenosine and thymidine (A-T) and between 
guanosine and cytidine (G-C). The sugar is 2'-deoxyribose. 

RNA transcript. Antisense molecules are typically short, single stranded oligonucleotides with a sequence 
complementary to a sequence within the target RNA transcript. The oligonucleotides bind to this 
sequence via Watson-Crick base pairing (adenosine binds to thymidine (DNA) or uracil (RNA), and 
guanosine binds to cytidine), form a DNA:RNA duplex, and block translation of the RNA (see Fig. 103.1). 
Nucleic acids have been used in a variety of strategies to modulate gene expression. For example, antisense 
RNA, encoded by a plasmid transfected into the target cells, has been used to inhibit gene expression. Cells 
are transfected with a plasmid encoding the antisense RNA, the plasmid is transcribed, and the resulting 
antisense RNA transcript inhibits gene expression. Unfortunately, difficulties in controlling the expression 
of the transfected plasmid hinder the effective use of antisense RNA for therapeutic purposes. More recently, 
antisense RNA work has focused on the use of hammerhead ribozymes. These synthetic RNA oligonucle- 
otides have the ability to cleave other RNA strands and they appear more therapeutically viable. Other 
methods for using nucleic acids to block gene expression are being explored, such as: 

1. nucleic acids that competitively bind to proteins via their 3-dimensional structure (aptamers), 

2. nucleic acids designed to prevent transcription by forming a DNA triplex with the target gene 
(antigene), and 

3. nucleic acids that mimic binding sites for transcription and translation complexes (decoys). 

Although promising, these approaches are preliminary and are beyond the scope of this article. The 
interested reader is directed to reviews concerning these topics [5-7]. Only antisense DNA molecules, 
which mimic the antisense strand of the target gene and thus hybridize to RNA transcribed from the 
gene, will be discussed in detail in this article. For a recent review on antisense RNA, see Rossi [8], 

Many technical issues limit the therapeutic usefulness of current antisense oligonucleotides; for 
instance, they are highly susceptible to degradation by nucleases. Also, our understanding of the mech- 
anism of antisense inhibition must improve before the widespread development of therapeutically useful 
antisense molecules is possible. Advances in oligonucleotide chemistry have begun to resolve the issue 
of their intracellular stability; however, the reaction of natural systems to these unnatural species has not 
been fully explored. Many oligonucleotides still lack adequate specificity for in vivo use, and their delivery 
to cells is often non-specific and inefficient. The impact of these problems and approaches to solving 
them will be discussed. In addition, potential applications of antisense techniques to antiviral and 
anticancer therapies and their evaluation in animal models and clinical trials will also be highlighted. 

103.2 Mechanisms of Inhibition 

The inhibition of gene expression by antisense molecules is believed to occur by a combination of two 
mechanisms: ribonuclease H (RNase H) degradation of the RNA and steric hindrance of the processing 
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FIGURE 103.2 Several possible sequence specific sites of antisense inhibition. Antisense oligodeoxynucleotides are 
represented by black bars. Antisense oligodeoxynucleotides can interfere with (I) splicing, (II) transport of the nascent 
mRNA from the nucleus to the cytoplasm, (III) binding of initiation factors, (IV) assembly of ribosomal subunits 
at the start codon or (V) elongation of translation. Inhibition of capping and polyadenylation is also possible. (VI) 
Antisense oligodeoxynucleotides that activate RNase H (e.g., oligonucleotides with phosphodiester and phospho- 
rothioate backbones) can also inhibit gene expression by binding to their target mRNA, catalyzing the RNase H 
cleavage of the mRNA into segments that are rapidly degraded by exonucleases. 

of the RNA [9]. Some oligonucleotides, after hybridizing to the target RNA, activate RNase H, which 
specifically recognizes an RNA:DNA duplex and cleaves the RNA strand. The antisense oligonucleotide 
is not degraded and is free to bind to and catalyze the destruction of other target RNA transcripts [10]. 
Oligonucleotides that activate RNase H, therefore, are potentially capable of destroying many RNA 
transcripts in their lifetime, which suggests that lower concentrations of these oligonucleotides may be 
sufficient to significantly inhibit gene expression [9]. 

However, not all oligonucleotides activate RNase H enzymes. Some oligonucleotides inhibit gene 
expression by interfering with the RNA's normal life cycle. Newly transcribed RNA must be spliced, poly- 
adenylated, and transported to the cytoplasm before being translated into protein. In theory, virtually 
any step in an RNA's life cycle can be blocked by an oligonucleotide (see Fig. 103.2). Oligonucleotides 
that do not activate RNase H can block RNA maturation by binding to sites on the RNA important for 
post-transcriptional processing or for ribosomal assembly and translation initiation [11]. Oligonucle- 
otides designed to block the elongation of translation by binding to coding sequences downstream from 
the initiation codon region have rarely succeeded, presumably because ribosomes destabilize and read 
through DNA:RNA duplexes [11]. Inhibition by both RNase H degradation and steric hindrance was 
demonstrated in a study in which the expression of an intracellular adhesion molecule (ICAM-1) was 
blocked using two different oligonucleotides [12]. 

The mechanism of inhibition influences the specificity of antisense molecules. Oligonucleotides that 
activate RNase H can inactivate slightly mismatched RNA transcripts by transiently binding to them and 
catalyzing their cleavage and destruction by RNase H [13]. This non-specific inactivation of RNA could 
lead to unwanted side effects. In contrast, oligonucleotides that do not activate RNase H, and therefore 
can only inhibit gene expression by steric hindrance, are less likely to inactivate slightly mismatched RNA 
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transcripts. These oligonucleotides, like the ones that activate RNase H, transiently bind to RNA tran- 
scripts. However, they do not catalyze the enzymatic cleavage of the RNA by RNase H, and, since transient 
binding is unlikely to significantly impede translation of the transcript, they are less likely to reduce the 
expression of slightly mismatched non-targeted RNA transcripts. [14]. 

Choosing an effective target site in an mRNA remains largely an empirical process. Generally, several 
candidate oligonucleotides are tested for their ability to inhibit gene expression, and the most effective 
one is used for future studies [10]. Techniques such as gel-shift or oligonucleotide array assays are 
currently being used to rapidly screen multiple candidate oligonucleotides in vitro [15, 16]. Computa- 
tional predictions have also been applied to assist in the identification of susceptible sites on the target 
RNA [17, 18]. The effectiveness of an antisense oligonucleotide has been shown to depend greatly on 
the region of the RNA to which it is complementary [19, 20]. 

RNA molecules are typically folded into complex 3-dimensional structures that are formed by self- 
base pairing and by protein-RNA interactions. If an oligonucleotide is complementary to a sequence 
embedded in the 3-dimensional structure, the oligonucleotide may not be able to bind to its target site, 
and, therefore, gene expression will not be inhibited. For oligonucleotides that block gene expression 
only by steric hindrance and not by RNase H degradation, the number of potential target sites is even 
more limited. For these oligonucleotides, the site must not only be accessible for hybridization, but it 
must be a site for which steric hindrance alone is sufficient to interfere with gene expression. Binding 
sites that are often effective include the 5' cap region and the initiation (AUG) codon region [15, 21-23]. 
However, in other cases, effective inhibitors proved to be oligonucleotides targeted to segments of the 
RNA distant from these regions, e.g., in the 3'-UTR of the mRNA [20, 23]. 

103.3 Chemically Modified Oligonucleotides 

Natural oligonucleotides with a phosphodiester backbone (PO oligonucleotides) are very susceptible to 
nuclease degradation, having half-lives in the presence of purified exonucleases as short as fifteen minutes 
[11, 24, 25]. Almost no full sequence PO oligonucleotide remained following a lh incubation in human 
serum or cell extracts [26]. The need for oligonucleotides that are stable in vivo prompted the develop- 
ment of chemical derivatives of the phosphodiester backbone. The first derivatives were methylphospho- 
nate and phosphorothioate backbone oligonucleotides. Methylphosphonate oligonucleotides have a 
methyl group in place of one of the non-bridge backbone oxygens. They are highly lipophilic with a 
neutral backbone. Recently, however, little attention has been afforded to methylphosphonates as they 
appear less therapeutically viable due to their poor cellular uptake and the inability of their duplexes to 
be recognized by RNase H [10, 11]. 

Phosphorothioate oligonucleotides, on the other hand, remain prominent. Phosphorothioate (PS) 
DNA derivatives have a sulfur atom substituted for a non-bridge backbone oxygen atom. PS oligonucle- 
otides are significantly more stable than PO oligonucleotides with a half-life of > 5 h in the presence of 
purified nucleases [25]. Like PO oligonucleotides, PS oligonucleotides are believed to be internalized by 
receptor mediated endocytosis [27] . Their affinity for complementary RNA is not as high as that observed 
with PO oligonucleotides, and they are more likely to bind non-specifically to proteins. Nevertheless, PS 
oligonucleotides have been shown to efficiently inhibit gene expression [28]. One concern with PS 
oligonucleotides (and all chemically modified oligonucleotides) is their potential for toxicity. The met- 
abolic byproducts of PS oligonucleotides are not native to the cell. If these were incorporated into cellular 
DNA, they could cause mutations [29]. Studies have also shown that PS oligonucleotides activate com- 
plement, resulting in immunological complications for in vivo applications [30]. These complications 
and issues of affinity for the RNA, RNase H activity, stability, and cellular uptake have led to investigation 
of other types of modifications (e.g., N3'-P5' phosphoramidates) and chimeric oligonucleotides (see 
Fig. 103.3 and Table 103.1). 

One particularly promising modification focuses on the 2' position of the sugar ring. RNA:RNA hybrids 
are more stable than RNArDNA hybrids, presumably due to the formation of A- form helices rather than 
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FIGURE 103.3 Analogues of natural oligonucleotides. (A) Backbone modifications in which the phosphorus bridge 
atom is retained. (B) Backbone modifications in which the phosphorus bridge atom is replaced. (C) 2' ribose 
modifications. (D) Peptide nucleic acids — the entire backbone is replaced with amino acids. See Table 103.1 for 
legend. 

the A:B-form of the heteroduplex. However, with a hydroxyl at the 2' position, RNA is exceptionally 
labile to nucleases. Thus, to maintain the A-form geometry (the preferred geometry for RNA helices) 
and prevent degradation, replacement of the 2'-hydroxyl is being investigated. A study by Monia, et al. 
investigated both 2'-halide and 2'-0-alkyl substitutions, finding the affinity for the target ranked 2'-fluoro 
> 2'-0-methyl > 2'-0-propyl > 2'-0-pentyl > 2'-0-deoxy [31]. However, fully modified oligonucleotides 
were ineffective in inhibiting Ha-ras gene expression in HeLa cells unless the interior of the molecule 
contained at least 5 2'-deoxy nucleotides to initiate RNase H cleavage. Hence, chimeric oligonucleotides 
have begun to receive more attention as the next generation of antisense therapeutics. 

Chimeric oligonucleotides take advantage of the properties of multiple modifications in the same 
molecule. Earlier chimeric oligonucleotides used end caps of nuclease resistant bases with unmodified 
central bases [32]. More recently, "mixed-backbone" oligonucleotides have been synthesized with both 
backbone end modifications (primarily PS) and central sugar modifications (primarily 2'-0-methoxy) 
[33], These molecules have been shown to have improved target affinity over PS oligonucleotides while 
maintaining the ability to initiate RNase H cleavage of the RNA. 

Investigators have also replaced the phosphodiester backbone entirely, as with peptide nucleic acids 
(PNAs) in which the entire ribose-phosphodiester backbone is replaced with a polyamide (see 
Fig. 103. 3C). PNAs, with a covalently bound terminal lysine residue to prevent self-aggregation, have 
been shown to preferentially bind to complementary sequences and inhibit gene expression in vitro 
[34, 35] . PNAs have an unusually high affinity for their complementary target sequences, possibly because 
they can form a triple helix with homopurine stretches of RNA (PNA:RNA-PNA) [36]. Duplex forming 
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TABLE 103.1 The Names and Key Characteristics of Several Oligonucleotide Analogues 
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? = unknown. 

1 Chirality and charge depends on backbone structure used. 

2 Structure not drawn; The bond between the sugar and base (an N-glycosidic bond) of a-analogues have the reverse orientation 
(a-configuration) from natural (P-configuration) oligonucleotides. 

5 Typically, the C-terminus is covalently linked to a postively charged lysine residue, giving the PNA a positive charge. 
4 See Fig. 103. 3D for chemical structure. 

X, Y, Z: these columns reference Fig. 103.3; Replace the designated letter in Fig. 103.3 with the molecule indicated in the Table I 
to determine the chemical structure of the oligo. 



PNAs have shown inhibition of translation when targeted to the mRNA start codon, while triple helix 
formation is required to inhibit protein elongation [37]. The ability of PNAs to invade the duplex of 
DNA and form a stable triple helix structure remains an issue for their in vivo application. 

Other modifications to the ribose sugars (e.g., a-anomeric) and modifications to the nucleoside bases 
(e.g., 5-methyl or 5-bromo-2'-deoxycytidine and 7'-deazaguanosine and 7'-deazaadenosine oligonucle- 
otides) have also been examined [10]. Oligonucleotides covalently linked to active groups (e.g., interca- 
lators such as acridine, photoactivated crosslinking agents such as psoralens and chelating agents such 
as EDTA-Fe) are also being actively investigated as potential antisense molecules [9]. Whether for clinical 
application or for research/diagnostic tools, studies with modified oligonucleotides will provide valuable 
information regarding the mechanistic steps of antisense oligonucleotide activity. These future genera- 
tions of antisense "oligonucleotides" may bear little structural resemblance to natural oligonucleotides, 
but their inhibition of gene expression will still rely on sequence specific base pairing. The reader is 
directed to several reviews for a comprehensive treatment of all classes of chemically modified oligonu- 
cleotides [9,38-40]. 
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103.4 Oligonucleotide Synthesis and Purification 

The growth of the field of antisense therapeutics has also resulted in a need for increased production of 
oligonucleotides for in vitro and in vivo applications. As a result, new synthesis procedures are being 
explored to increase batch size and synthesis rate. Large-scale automated synthesizers have been devel- 
oped, with which batches of 100 mmol have been produced [http://www.hybridon.com]. This is a 
significant improvement given that a large batch on a standard synthesizer is 1 (imol. 

Standard procedures begin with the 3' base of the oligonucleotide attached to a solid controlled pore 
glass (CPG) support, although other supports have also been proposed [41]. The process then continues 
with deprotection, monomer introduction, activated coupling, oxidation, and capping in a cycle for the 
addition of each oligonucleotide. Depending on the type of monomer being used (e.g., (3-cyanoethyl 
phosphoramidite or hydrogen phosphonate), sulfurization to generate PS oligonucleotides occurs either 
during the synthesis cycling or following the termination of synthesis. Given the imperfections in the 
synthesis procedures, the resulting oligonucleotide product must then be purified to remove failure 
products. 

Oligonucleotides are typically purified using reverse-phase high performance liquid chromatography 
(RP-HPLC). During synthesis, a hydrophobic dimethoxy-trityl (DMT) group is removed from the phos- 
phoramidite to allow the next nucleoside to attach. Leaving this group on the final nucleoside of the 
synthesis increases the hydrophobicity of the full, correct oligo, increasing its retention time on an RP- 
HPLC column. The eluent is then fractionated to collect only the full sequence product. 

However, PS oligonucleotides, unlike PO oligonucleotides, are chiral molecules, i.e., molecules that 
are not identical to their mirror image. Current synthesis techniques do not control the orientation of 
each internucleotide bond, and the resultant oligonucleotides are therefore a mixture of 2 (nl) diastere- 
omers (where n = the number of bases in the complete oligo). The resulting sequences elute with similar 
retention times on RP-HPLC as all of the complete sequence diastereomers should have the hydrophobic 
DMT group attached. However, analysis following removal of the DMT group can be used to confirm 
or deny the presence of multiple diastereomers under higher resolution conditions. Capillary gel elec- 
trophoresis and anion-exchange HPLC are being examined as ways to enhance the selectivity of purifi- 
cation of PS oligonucleotides. Additional details on the synthesis and purification of oligonucleotides are 
available for the interested reader [42-44]. 

103.5 Specificity of Oligonucleotides 

The ability to block the expression of a single gene without undesired side effects (specificity) is the major 
advantage, in principle, of antisense based strategies. This specificity is primarily determined by the length 
(i.e., the number of bases) of the oligonucleotide. Experimental and theoretical data suggest that there 
is an optimum length at which specific inhibition of gene expression is maximized and non-specific 
effects are minimized [13, 14]. 

Affinity and specificity limit the minimum effective length of oligonucleotides. Oligonucleotides that 
are too short do not inhibit gene expression, because they do not bind with sufficient affinity to their 
substrates. The shortest oligonucleotide reported to affect gene expression in mammalian cells was 7 bases 
in length (7-mer) [45]. This oligonucleotide was highly modified, containing 5-(l-propynyl) cytosines 
and uracils as well as a phosphorothioate backbone. An oligonucleotide that is too short is less likely to 
represent a unique sequence in a given cell's genome and thus more likely to bind to a non-targeted RNA 
and inhibit its expression. Woolf et al. estimated the minimum sequence length that will be statistically 
unique in a given pool of mRNAs [13]. Since each position in a given sequence can be occupied by any 
of four nucleotides (A, C, G or U), the total number of different possible sequences of length N bases is 
4 N . Letting R equal the total number of bases in a given mRNA pool and assuming that it is a random 
and equal mixture of the four nucleotides, then the frequency (F) of occurrence in that pool of a sequence 
of length N is given by: 
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F = — (103.1) 

4 N 

For a typical human cell, which contains approximately 10 4 unique mRNA species whose average length 
is 2000 bases, R is approximately equal to 2 x 10 7 . Therefore, for a sequence to be unique (F < 1), N 
must be greater than or equal to 13 bases [13]. The minimum oligonucleotide length will also be 
constrained by the minimum binding affinity to form a stable complex. 

However, oligonucleotides cannot be made arbitrarily long, because longer oligonucleotides are more 
likely to contain internal sequences complementary to non-targeted RNA molecules. This has also been 
expressed mathematically [13]. The expected number of complementary sites (S) of length L for an 
oligonucleotide of length N in an mRNA pool with R bases is given by: 



(n-i+i) 



N-I+lxR 



(103.2) 



For example, an 18-mer (N = 18) has 6 internal 13-mers. Since a 13-mer is expected to occur 0.3 times 
in a mRNA pool containing 2 x 10 7 bases, the 18-mer is expected to match 1.8 (i.e., 6 x 0.3) 13-mers in 
the mRNA pool. Equation (103.2) also gives this result (N = 18, L = 13, and R = 2 x 10 7 ; therefore, S = 1.8). 

Woolf et al. have demonstrated that significant degradation of non-targeted mRNAs can occur [13]. 
They compared the effectiveness of three different 25-mers in suppressing the expression of fibronectin 
mRNA in Xenopus oocytes. Nearly 80% of the fibronectin mRNA was degraded after the oocytes were 
microinjected with a 25-mer in which all 25 of its bases were complementary to the mRNA. However, 
when the oocytes were microinjected with 25-mers that had only 17 or 14 complementary bases flanked 
by random sequences, greater than 30% of their fibronectin mRNA was still degraded. They also showed 
that a single mismatch in an oligonucleotide did not completely eliminate its antisense effect. Over 40% 
of the target mRNA was degraded when oocytes were treated with a 13-mer with one internal mismatch, 
though the mismatch left a 9 base consecutive complementary sequence that showed nearly the same 
activity as a 13-mer with 10 complementary bases in succession. 

Although these studies were conducted at lower temperatures and therefore under less stringent 
hybridization conditions than those found in mammalian cells, they clearly showed that complementary 
oligonucleotides flanked by unrelated sequences and even oligonucleotides with mismatched sequences 
can lead to significant degradation of RNA. The possibility of undesired inhibition of partially comple- 
mentary sequences must be considered when designing and testing any antisense oligonucleotide. Hence, 
a useful heuristic is that an oligonucleotide should be long enough to be unique and possess high affinity 
for its target but short enough to minimize side effects due to degradation of non-targeted mRNAs. 

The efforts to sequence the human genome have begun to provide more information about the actual 
specificity of oligonucleotides. It is possible to scan for sequences against all of the genomic information 
available using resources such as BLAST at the National Center for Biotechnology Information [46; 
http://www.ncbi.nlm.nih.gov]. The results will determine the uniqueness of the oligonucleotide within 
the known database sequences. As the completeness of the databases grows, computational comparisons 
of target RNA sequences against the database will provide more reliable assessments of the uniqueness 
of these target sequences within the genome. 

103.6 Oligonucleotide Delivery 

In many cases, the concentrations of antisense oligonucleotides required to achieve a biological effect are 
currently too large to be of therapeutic value. Achieving the necessary concentration in the vicinity of 
the target cells is thought to be limited at least in part by charge repulsion between the negatively-charged 
oligonucleotide and the negatively-charged cell surface. Four major methods are currently being inves- 
tigated to overcome this and other extracellular transport barriers: (1) chemical modification of the 
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oligonucleotide to increase its hydrophobicity; (2) conjugation of the oligonucleotide to a polycation; 
(3) conjugation of the oligonucleotide to a ligand specific for a cellular receptor; and (4) encapsulation 
of the oligonucleotide in a liposome. 

Increasing the hydrophobicity of oligonucleotides was first accomplished by synthesis of methylphos- 
phonate backbones. Despite their neutral backbone charge, cellular uptake of MP oligonucleotides was 
shown to be very inefficient, perhaps due in part to their inability to escape lysosomal degradation and 
gain entry into the cytoplasm. The hydrophobicity of an oligonucleotide can also be increased by 
conjugation with hydrophobic moieties such as cholesterol derivatives (chol-oligonucleotide). The 
increased hydrophobicity of chol-oligonucleotides reportedly improves their association with cell mem- 
branes and their internalization by cells [47] . Cellular accumulation of chol-oligonucleotides after 2 hours 
has been shown to occur when no accumulation was seen with unconjugated oligonucleotides [48]. The 
mechanism by which chol-oligonucleotides are more easily and rapidly taken up by cells has not been 
elucidated. Krieg [49] showed that chol-oligonucleotides with cholesterol at the 5' end could be bound 
by low density lipoprotein (LDL) and that this markedly increased the association of the oligonucleotide 
with the cell membrane and its internalization in vitro. LDL-associated chol-oligonucleotides were effec- 
tive at 8-fold lower concentration than PO oligonucleotide controls. 

Oligonucleotides conjugated with polycations such as poly-L-lysine (PLL) also have improved cellular 
uptake. Covalent conjugates can be constructed by coupling the 3' end of the oligonucleotide to the 
epsilon amino groups of the lysine residues [50]. Oligo-PLL conjugates complementary to the translation 
initiation site of the tat gene (a key HIV regulatory protein) protected cells from HIV-1 infection with 
concentrations 100-fold lower than non-conjugated oligonucleotides [51]. Low concentrations of oligo- 
PLL conjugates (100 nM), in which the 15-mer oligonucleotide was complementary to the initiation 
region of the viral N protein mRNA, also inhibited vesicular stomatitis virus (VSV) infection [50]. 

Internalization of oligonucleotides has also been improved by conjugation to ligands. One such ligand, 
transferrin, is internalized by receptor mediated endocytosis. Mammalian cells acquire iron carried by 
transferrin. Oligo-PLL complexes have been conjugated to transferrin to take advantage of this pathway. 
For example, an 18-mer complementary to c-myb mRNA (an oncogene that is responsible for the hyper- 
proliferation of some leukemia cells) was complexed with a transferrin PLL conjugate and rapidly 
internalized by human leukemia (HL-60) cells [52]. The expression of c-myb was greatly reduced and 
the uncontrolled proliferation of these cells inhibited. Because an oligo-PLL conjugate (without trans- 
ferrin) was not tested, it is not clear whether the improved antisense effects were due to the PLL moiety, 
the transferrin moiety, or a combination thereof. 

Oligo-ligand conjugates have also been used to target oligonucleotides to cells with specific cellular 
receptors, such as the asialoglycoprotein (ASGP) receptor, which is expressed uniquely by hepatocytes. 
These receptors bind and internalize ASGPs, serum glycoproteins that have exposed galactose residues 
at the termini of their glycosylation chains. Oligonucleotides complexed to ASGP are rapidly internalized 
by hepatocytes. A number of researchers have used ASGP conjugated to cationic PLL. When the ASGP:PLL 
is conjugated to oligonucleotides, their cellular uptake and antisense effectiveness in vitro is increased 
[53-55] . However, toxicity of the PLL moiety and instability of the noncovalent complexes between ASGP- 
PLL and oligonucleotides have limited their in vivo applicability [56]. One possible solution is the direct 
conjugation of the oligonucleotides to ASGP via a cleavable disulfide linkage [57]. 

Another promising means to deliver oligonucleotides is lipofection. Oligonucleotides are mixed with 
cationic lipids that condense around the negatively charged oligonucleotide forming a lipid vesicle 
(liposome). The positively charged lipids reduce the electrostatic repulsion between the negatively- 
charged oligonucleotide and the similarly charged cell surface. Bennett et al. made liposomes using 
DOTMA {N-{l-(2,3-dioleyloxy)propyl)}-N,N,N-trimethylammonium chloride} as the cationic lipid and 
an oligonucleotide complementary to the translation initiation codon of human intracellular adhesion 
molecule 1 (ICAM-1), increasing the potency of the oligonucleotide by >1000-fold [12]. A recent study 
has shown that liposomally encapsulated oligonucleotides at 0.01 nM were as active as free oligonucle- 
otides at 1.5 (J.M at inhibiting HIV-1 replication in chronically infected cells [58]. It was found that the 
oligonucleotides separate from cationic liposomes following cellular internalization [59]. Cationic lipids 
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subsequently accumulate in cytoplasmic structures and the plasma membrane, suggesting that they have 
fused with the endosomal vesicles and are being recirculated throughout the secretory pathways [59-61]. 
The increase in oligonucleotide activity suggests that cellular uptake may be a significant limitation in 
the effectiveness of antisense oligonucleotides. In vitro studies using electroporation, streptolysin O or 
OC-toxin permeabilization, and particle bombardment to enhance cellular uptake of nucleic acids support 
this hypothesis [62-65]. 

Efforts have been made to use antibodies to target liposomes containing oligonucleotides to specific 
cell types. Protein A, which binds to many IgG antibodies, is covalently bound to the liposome to form 
a liposome-oligo-protein A complex. An antibody specific for the target cell's surface is bound to the 
protein A, and these complexes are incubated with the target cells. Leonetti [66] synthesized such a 
complex to target the mouse major histocompatibility complex H-2K molecule on mouse L929 cells. 
The complex contained a 15-mer complementary to the 5' end of mRNA encoding the N protein of VSV. 
The complexes inhibited viral replication by more than 95%. Unconjugated liposomes, and liposomes 
conjugated to antibodies specific for a nonexpressed antigen, had no effect. 

103.7 Potential Applications of Antisense Oligonucleotides 

The potential applications for antisense oligonucleotides are limited only by the genetic information 
available. Antisense oligonucleotides can be developed against any target in which the inhibition of protein 
production or the inhibition of RNA processing yields the therapeutic result. Currently, clinical trials are 
underway using antisense oligonucleotides to treat rheumatoid arthritis, psoriasis, renal transplant rejec- 
tion, and inflammatory bowel disease (Crohn's disease) [http://www.phrma.org]. However, the primary 
targets remain refractory viral diseases and cancers for which the necessary target genetic information is 
typically available. 

Viral Diseases 

One of the many potential therapeutic applications of antisense technology is the treatment of infectious 
viral diseases. The use of antisense oligonucleotides as anti-viral agents is particularly promising because 
viral nucleic acid sequences are unique to the infected cell and are not found in normal healthy cells. 
The goal of antiviral therapy is to block the expression of key viral proteins that are vital to the life cycle 
of the virus. This has been achieved in vitro with several viruses including HIV, HSV, influenza and 
human papilloma virus [10]. The in vitro work with HIV is representative of other viruses and will be 
highlighted here. 

Human Immunodeficiency Virus 

Retroviruses, and HIV in particular, have high rates of mutation and genetic recombination. The effec- 
tiveness of many anti-HIV drugs has been severely reduced because drug resistant viral strains often arise 
after prolonged drug treatment. This is especially relevant to strategies that use an antisense approach that 
relies on a specific nucleotide sequence. One strategy to inhibit HIV replication is to target conserved 
sequences in key regulatory proteins such as tat and rev. Part of the rev sequence is highly conserved, with 
the known 16 isolates of HIV differing by at most one base pair in this conserved region [67]. The 
importance of targeting a highly conserved region as is found in rev was demonstrated by Lisziewicz, who 
investigated the efficacy of antisense oligonucleotides directed against various viral proteins [68]. Although 
several 28-mers initially inhibited viral replication, resistant mutant viruses developed after 25 days to all 
of the oligonucleotides with the exception of those directed at the highly conserved rev regions. 

A second concern in treating HIV and other viruses is that viral replication can restart after antisense 
treatment is stopped [67] . Can oligonucleotides be continuously administered to prevent viral replication 
without unwanted side effects? These issues and others common to all antisense based therapies (oligo- 
nucleotide stability, specificity, affinity, and delivery) must be addressed prior to the successful imple- 
mentation of antisense-based HIV therapies. 
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Cancer 

In principle, antisense technology can be used against cancer, but the target is more challenging. Onco- 
genes are typically genes that play a vital regulatory role in the growth of a cell, the mutation or 
inappropriate expression of which can result in the development of cancer. In the case of mutation, it is 
often difficult to distinguish an oncogene from its normal counterpart, because they may differ by as 
little as one base. Thus, attempts to inhibit oncogene expression might block the expression of the normal 
gene in non-cancerous cells and cause cytotoxic effects. Despite these challenges, steady progress has 
been made in the development of effective antisense oligonucleotides that have inhibited many types of 
oncogenes including bcr-abl, c-myc, c-myb, c-ras, Ha-ras, neu/erbB2 and bFGF [11, 25, 61, 69, 70]. 

Studies targeting ras oncogenes are encouraging and are representative of progress against other classes 
of oncogenes. Chang [71] targeted a rasp21 gene point mutation with an antisense oligonucleotide. Only 
the expression of the mutated genes was inhibited, suggesting that it is possible to selectively inhibit an 
oncogene that differs by only a single base from the normal gene. Using a 17-mer PS oligo, it was shown 
that -98% inhibition of Ha-ras expression could be achieved at 1 (J.M concentration [25]. 

Successes in inhibiting oncogene expression in vitro are encouraging, but many problems remain to 
be solved before antisense oligonucleotides are therapeutically useful for cancer patients. For example, 
finding a suitable oncogene to target is difficult. Even if a genetic defect is common to all cells of a given 
cancer, there is no guarantee that inhibition of that oncogene will halt cancer cell proliferation [70]. Even 
if cell growth is inhibited, tumor growth may restart after antisense treatment is stopped. Oligonucleotides 
are needed that induce cancer cell terminal differentiation or death [70]. In order to avoid toxic side 
effects by inhibiting gene expression of regulated oncogenes in normal tissues, oligonucleotides that 
specifically target cancer cells are required. Even if appropriate antisense oligonucleotides are developed, 
they will still be ineffective against solid tumors unless they can reach the interior of the tumor at 
biologically effective concentrations. 

103.8 In Vivo Pharmacology 

The first in vivo antisense studies were designed to test the biodistribution and toxicity of antisense 
oligonucleotides. These studies demonstrated that PO oligonucleotides are rapidly excreted from the 
body with PS oligonucleotides being retained longer. One study in mice noted that completely modified 
PS oligonucleotides were retained significantly longer than a chimeric 20-mer that contained 15 phos- 
phodiester bonds and only 4 phosphorothioate bonds [72]. Only 30% of the PS oligonucleotide was 
excreted in the urine after 24 hours, whereas 75% of the chimeric oligonucleotide was excreted after only 
12 hours. A similar study demonstrated that PS oligonucleotides were retained in body tissues of adult 
male rats with a half-life of 20 to 40 hours [73]. 

These studies suggested that antisense oligonucleotides may need to be administered repeatedly to 
maintain a therapeutic effect for chronic disorders. At least with PS oligonucleotides, this has proven to 
be problematic. Continuous or repeated administration of PS oligonucleotides results in immunologic 
complement activation. It has been shown that infusion of PS oligonucleotides to rhesus monkeys results 
in almost immediate (within 10 min.) decreases in white blood cell count and the fraction of neutrophils 
among the white cells [30]. At 40 min. post-infusion, the neutrophil counts had increased to higher than 
baseline levels. A recent study indicated that a plasma concentration of >50 p.g/ml results in complement 
activation [74]. These difficulties have further hastened the development of novel oligonucleotide chem- 
istries and chimeric oligonucleotides for which the immunologic properties are expected to be improved 
due to the possible reduction in the use of PS backbone linkages [33, 75]. 

The targeted delivery of oligonucleotides has been proposed as a way to improve the effectiveness of 
systemically administered antisense oligonucleotides by oligonucleotide concentration in the desired cells 
and minimizing non-specific side effects. Targeting oligonucleotides to a specific organ has been dem- 
onstrated. As previously discussed, a soluble DNA carrier system has been developed to specifically target 
hepatocytes [76]. This system was used to target a 67-mer PO oligonucleotide (complementary to the 
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5' end of rat serum albumin mRNA) to the liver [56] . Following tail vein injection into rats, the complex 
rapidly and preferentially accumulated in the liver, but the efficiency of this targeting method was limited 
by the rapid dissociation of the oligonucleotide from the ASGP:PLL complex (30% dissociated within 
7 minutes). 

103.9 Animal Models 

The effectiveness of antisense oligonucleotides at inhibiting gene expression in vivo has been demon- 
strated in several animal models. Immunodeficient mice, bearing a human myeloid leukemia cell line, 
were continuously infused with 100 (J.g/day of a 24-mer PS oligonucleotide complementary to c-myb 
mRNA [77]. Mice that received the antisense oligonucleotide survived on average 8.5 times longer than 
control mice that received no treatment and 5.7 times longer than sense-treated controls. The treated 
animals also had a significantly lower tumor burden in the ovaries and brain. 

Antisense oligonucleotides have also inhibited the growth of solid tumors in vivo. Immuno-deficient 
mice bearing a fibrosarcoma or melanoma were injected subcutaneously twice weekly with 1.4 mg of PS 
oligonucleotides complementary to the 5' end of the p65 subunit of the mRNA for the NF-KB transcrip- 
tion factor (NF-KB activates a wide variety of genes and is believed to be important in cell adhesion and 
tumor cell metastasis) [78]. Greater than 70% of antisense treated mice exhibited a marked reduction in 
their tumor size. Administration of control oligonucleotides complementary to GAPDH and jun-D had 
no effect on tumor size. 

The capability of antisense oligonucleotides to inhibit viral replication in vivo has also been studied. 
Fourteen one day old ducklings were infected with duck hepatitis B virus (DHBV), which is closely 
related to the strain of hepatitis B virus that is a major cause of chronic liver disease and cancer in humans 
[79]. Two weeks later, the ducks were injected daily, for 10 consecutive days, with 20 (J.g/gm body weight 
of an 18-mer PS oligonucleotide complementary to the start site of the Pre-S region of the DHBV genome. 
The antisense oligonucleotides blocked viral gene expression and eliminated the appearance of viral 
antigens in the serum and liver of all treated ducks. No toxic effects due to the oligonucleotides were 
noted. Unfortunately, residual amounts of DNA precursors of viral transcripts were detected in the nuclei 
of liver cells, which resulted in a slow restart of viral replication after antisense treatment was stopped. 
Further studies are needed to determine if prolonged treatment with antisense will eliminate this residual 
viral DNA. 

Though many of the oligonucleotides under investigation target viruses or cancer, other refractory 
targets are also being investigated. Antisense oligonucleotides have been evaluated in the treatment of 
restenosis, a narrowing of an artery following corrective surgery caused by mitogen-induced proliferation 
of vascular smooth muscle cells (SMC). Several studies have tested the ability of antisense oligonucleotides 
to inhibit genes whose expression is important in SMC proliferation, including c-myc, c-myb, cdc2 and 
cdk2 [80-82] . In one study with an antisense therapeutic complementary to c-myc mRNA, the formation 
of a thick intimal layer (neointima) in treated areas was significantly reduced in antisense treated rats 
when compared to control rats with maximum c-myc expression reduced by 75% relative to controls 
[82]. Dextran sodium sulfate-induced colitis in Swiss-Webster mice (as measured by disease activity 
index, DAI) was reduced by -64% following 5 days of daily subcutaneous administration of an antisense 
oligonucleotide complementary to ICAM-1 at 0.3 mg/kg/day [83]. Interestingly, in this study, higher 
oligonucleotide concentrations (> 5 mg/kg/day) were ineffective in reducing inflammation. 

103.10 Clinical Trials 

The FDA recently approved the first antisense therapeutic, fomivirsen, a treatment for cytomegalovirus 
(CMV) retinitis in AIDS patients [http://www.isip.com]. CMV infection is fairly common (-50% of 
people over 35 are seropositive); however, it results in complications only in immunocompromised 
individuals. The oligonucleotide is complementary to the immediate-early viral RNA, a region encoding 
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TABLE 103.2 A Sample of Current Clinical Trials of Antisense Oligodeoxynucleotide Therapeutics 



Compound 


Type of Oligo 


Disease Target 


Gene Target 


Phase 


ISIS13312 


2'-modified — 2nd Gen. 


CMV Retinitis 


N/A 


I/II 


ISIS2302 


PS 


Crohn's Disease 


ICAM-1 


II/Pivotal 






Psoriasis 


ICAM-1 


II 






Renal Transplant Rejection 


ICAM-1 


II 






Rheumatoid Arthritis 


ICAM-1 


II 






Ulcerative Colitis 


ICAM-1 


II 


G3139 


N/A 


Cancer 


N/A 


I 


ISIS2503 


PS 


Cancer 


Ha-ras 


I 


ISIS3521 


PS 


Cancer 


PKC-a 


II 


ISIS5132 


PS 


Cancer 


c-raf 


II 


LR-3001 


PS 


Chromic Myelogenous Leukemia 


bcr-abl 


I 


LR-3280 


PS 


Restenosis 


myc 


II 



N/A — Not available 

Potential applications of antisense technology include the treatment of cancer, infectious diseases, and 
inflammatory diseases. 

proteins that regulate virus gene expression [84]. The activity of the oligonucleotide therapeutic is 
potentiated by traditional drugs such as ganciclovir. Additional studies suggested, however, that the 
oligonucleotide acts not only via an antisense mechanism, though complementarity to the target was 
required for maximal inhibitory capacity [85]. 

Clinical trials on many other antisense oligonucleotides have advanced to Phase II. However, compli- 
cations associated with antisense oligonucleotides have also been found. An antisense compound targeted 
to the gag region of HIV was pulled from clinical trials due to the resulting decreases in platelet counts 
in 30% of the patients after 10 days of daily administration [http://www.hybridon.com]. The immuno- 
logic complications are likely related to the activation of complement discussed earlier. Perhaps, 
fomivirsen avoids these complications due to its local administration by intravitreal injection rather than 
systemic infusion. Table 103.2 lists clinical trials that are currently ongoing and the disease targets. 
Published results from these trials are not yet available. 

103.11 Summary 

Antisense oligonucleotides have the potential to selectively inhibit the expression of any gene with a 
known sequence. Although there have been some remarkable successes, realizing this potential is proving 
difficult because of problems with oligonucleotide stability, specificity, affinity, and delivery. Natural 
oligonucleotides contain a phosphodiester backbone that is highly susceptible to nuclease degradation. 
Chemically modified oligonucleotides (PS and chimeric oligonucleotides) have been synthesized that are 
more stable. However, the increased stability of PS oligonucleotides comes at a cost, in particular the 
activation of complement. Another issue for chemically modified oligonucleotides is their large scale 
production and purification. Alternative chemistries are needed that are more biologically active and 
more amenable to large scale production. Targeted delivery of antisense oligonucleotides is needed to 
minimize side effects and maximize oligonucleotide concentration in target cells. Attempts to improve 
delivery include chemical modification of the oligonucleotides to increase their cellular permeability, the 
conjugation of oligonucleotides to specific ligands to utilize more efficient receptor-mediated internal- 
ization pathways, and the use of antibody conjugated liposomes to deliver oligonucleotides to specific 
cells. 

The potential applications of antisense oligonucleotides to the treatment of disease are vast. Antisense- 
based therapies are under development for the treatment of infectious diseases such as CMV, HIV, herpes 
simplex virus, influenza, hepatitis and human papilloma virus as well as the treatment of complex genetic 
disorders like cancer. Animal models are being used to determine if ( 1 ) antisense oligonucleotides can 
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be delivered to target cells at high enough concentrations to be effective, (2) repeated treatments with 
oligonucleotides are toxic or elicit an immune response, and (3) antisense oligonucleotides directed 
against a single gene can be effective against complex genetic diseases such as cancer. Multiple clinical 
trials against viral, cancer, and other targets are now in progress with one drug having been approved 
for the clinic. Improvements in our understanding of the mechanisms of antisense inhibition, the 
pharmacology of antisense oligonucleotides in vivo and the development of chemically modified oligo- 
nucleotides with high affinity, specificity and stability are needed to realize the clinical potential of 
antisense-based strategies for the treatment of a wide variety of diseases. 

Defining Terms 

Antisense: Any DNA or RNA molecule whose sequence is complementary to the RNA transcribed 
from a target gene. 

Chiral: A molecule whose configuration is not identical with its mirror image. 

Complementary: A nucleic acid sequence is complementary to another if it is able to form a perfectly 
hydrogen-bonded duplex with it, according to the Watson-Crick rules of base pairing (A opposite 
U or T, G opposite C). 

Diastereomer: Optically active isomers that are not enantiomorphs (mirror images). 

Exonuclease: An enzyme that catalyzes the release of one nucleotide at a time, serially, from one end 
of a polynucleotide. 

In vitro: In an artificial environment, referring to a process or reaction occurring therein, as in a test 
tube or culture dish. 

In vivo: In the living body, referring to a process or reaction occurring therein. 

Lipofection: Delivery of therapeutic drugs (antisense oligonucleotides) to cells using cationic lipo- 
somes. 

Lipophilic: Capable of being dissolved in lipids (organic molecules that are the major structural ele- 
ments of biomembranes). 

Liposome: A spherical particle of lipid substance suspended in an aqueous medium. 

Plasmid: A small, circular extrachromosomal DNA molecule capable of independent replication in a 
host cell. 

Receptor mediated endocytosis: The selective uptake of extracellular proteins, oligonucleotides and 
small particles, usually into clathrin coated pits, following their binding to cell surface receptor 
proteins. 

Restenosis: A narrowing of an artery or heart valve following corrective surgery on it. 

RNase H: An enzyme that specifically recognizes RNA:DNA duplexes and cleaves the RNA portion, 
leaving the DNA portion intact. 
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Further Information 

The book Antisense Nucleic Acids and Proteins: Fundamentals and Applications, edited by Joseph N. M. 
Mol and Alexander R. van der Krol, is a collection of reviews in the use of antisense nucleic acids to 
modulate or downregulate gene expression. The book Antisense RNA and DNA, edited by James A. H. 
Murray, explores the use of antisense and catalytic nucleic acids for regulating gene expression. 

The journal Antisense and Nucleic Acid Drug Development, published by Mary Ann Liebert, Inc., 
presents original research on antisense technology. For subscription information, contact Antisense and 
Nucleic Acid Drug Development, Mary Ann Liebert, Inc., 2 Madison Avenue, Larchmont, NY 10538; 
(914) 834-3100, Fax (914) 834-3688, email: info@liebertpub.com. The biweekly journal Nucleic Acids 
Research publishes papers on physical, chemical, and biologic aspects of nucleic acids, their constituents, 
and proteins with which they interact. For subscription information, contact Journals Marketing; Oxford 
University Press, Inc., 2001 Evans Road, Cary, NC 27513; 1-800-852-7323, Fax, (919) 677-1714, email: 
jnlorders@oup-usa.org. 
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The development of sophisticated and powerful recombinant techniques for manipulating and analyzing 
genetic material has led to the emergence of a new biologic discipline, often termed molecular biology. 
The tools of molecular biology have enabled scientists to begin to understand many of the fundamental 
processes of life, generally through the identification, isolation, and structural and functional analysis of 
individual or, at best, limited numbers of genes. Biology is now at a point where it is feasible to begin a 
more ambitious endeavor — the complete genetic analysis of entire genomes. Genome analysis aims not 
only to identify and molecularly characterize all the genes that orchestrate the development of an organism 
but also to understand the complex and interactive regulatory mechanisms of these genes, their organi- 
zation in the genome, and the role of genetic variation in disease, adaptability, and individuality. Addi- 
tionally, the study of homologous genetic regions across species can provide important insight into their 
evolutionary history. 

As can be seen in Table 104.1, the genome of even a small organism consists of a very large amount 
of information. Thus the analysis of a complete genome is not simply a matter of using conventional 
techniques that work well with individual genes (comprised of perhaps 1000 to 10,000 base pairs) a 
sufficient (very large) number of times to cover the genome. Such a brute-force approach would be too 
slow and too expensive, and conventional data-handling techniques would be inadequate for the task of 
cataloging, storing, retrieving, and analyzing such a large amount of information. The amount of manual 
labor and scientific expertise required would be prohibitive. New technology is needed to provide high- 
throughput, low-cost automation and reduced reliance on expert intervention at intermediate levels in 
the processes required for large-scale genetic analysis. Novel computational tools are required to deal 
with the large volumes of genetic information produced. Individual tools must be integrated smoothly 
to produce an analytical system in which samples are tracked through the entire analytical process, 
intermediate decisions and branch points are few, a stable, reliable and routine protocol or set of protocols 
is employed, and the resulting information is presented to the biologic scientist in a useful and meaningful 
format. It is important to realize that the development of these tools requires the interdisciplinary efforts 
of biologists, chemists, physicists, engineers, mathematicians, and computer scientists. 

Genome analysis is a complex and extended series of interrelated processes. The basic processes 
involved are diagrammed in Fig. 104.1. At each stage, new biologic, chemical, physical (mechanical, 
optical), and computational tools have been developed within the last 10 years that have begun to enable 
large-scale (megabase) genetic analysis. These developments have largely been spurred by the goals of 
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TABLE 104.1 DNA Content of Various Genomes in Monomer 
Units (Base Pairs) 



Organism 

Phage T4 

Escherichia coli 

Saccharomyces 

Ambidopsis thaliana 

Caenorhabditis elegans 

Drosophila melanogaster 

Mouse 

Human 



T>T?e 

Bacteriophage (virus) 

Bacterium 

Yeast 

Plant 

Nematode 

Insect (fruit fly) 

Mammal 

Mammal 



Size 

160,000 

4,000,000 

14,000,000 

100,000,000 

100,000,000 

165,000,000 

3,000,000,000 

3,500,000,000 



Source: Adapted from [1] and [2], 

the Human Genome Project, a worldwide effort to decipher the entirety of human genetics. However, 
biologists are still a significant ways away from having a true genome analysis capability [3], and as such, 
new technologies are still emerging. 

This chapter cannot hope to describe in depth the entire suite of tools currently in use in genome 
analysis. Instead, it will attempt to present the basic principles involved and to highlight some of the 
recent enabling technological developments that are likely to remain in use in genome analysis for the 
foreseeable future. Some fundamental knowledge of biology is assumed; in this regard, an excellent 
beginning text for individuals with a minimal background in molecular biology is that by Watson et al. [ 1 ] . 



Identify target DNA (genomic, chromosomal) 



Isolate target DNA 



Clone target DNA into large insert vector for propagation and amplification 



Position clones in original target DNA (physical mapping) 



Subclone initial clones into small insert vectors suitable for DNA sequencing 



Obtain DNA sequence from small insert clones 



Assemble sequence from overlapping clones into 
contiguous sequence of target DNA 



Analyze sequence for biological information content 



Use information in biological research and medicine 



FIGURE 104.1 Basic steps in genome analysis. 
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TABLE 104.2 Enzymes Commonly Used in Genome Analysis 



Enzyme Function Common Use 

Restriction endonuclease Cleave double-stranded DNA at specific sites Mapping, cloning 

DNA polymerase Synthesize complementary DNA strand DNA sequencing, amplification 

Polynucleotide kinase Adds phosphate to 5' end of single-stranded DNA Radiolabeling, cloning 

Terminal transferase Adds nucleotides to the 3' end of single-stranded DNA Labeling 

Reverse transcriptase Makes DNA copy from RNA RNA sequencing, cDNA cloning 

DNA ligase Covalently joins two DNA fragments Cloning 



104.1 General Principles 



The fundamental blueprint for any cell or organism is encoded in its genetic material, its deoxyribonucleic 
acid (DNA). DNA is a linear polymer derived from a four-letter biochemical alphabet — A, C, G, and T. 
These four letters are often referred to as nucleotides or bases. The linear order of bases in a segment of 
DNA is termed its DNA sequence and determines its function. A gene is a segment of DNA whose sequence 
directly determines its translated protein product. Other DNA sequences are recognized by the cellular 
machinery as start and stop sites for protein synthesis, regulate the temporal or spatial expression of 
genes, or play a role in the organization of higher-order DNA structures such as chromosomes. Thus a 
thorough understanding of the DNA sequence of a cell or organism is fundamental to an understanding 
of its biology. 

Recombinant DNA technology affords biologists the capability to manipulate and analyze DNA 
sequences. Many of the techniques employed take advantage of a basic property of DNA, the molecular 
complementarity of the two strands of the double helix. This complementarity arises from the specific 
hydrogen-bonding interactions between pairs of DNA bases, A with T and C with G. Paired double 
strands of DNA can be denatured, or rendered into the component single strands, by any process that 
disrupts these hydrogens bonds — high temperature, chaotropic agents, or pH extremes. Complementary 
single strands also can be renatured into the duplex structure by reversing the disruptive element; this 
process is sometimes referred to as hybridization or annealing, particularly when one of the strands has 
been supplied from some exogenous source. 

Molecular biology makes extensive use of the DNA-modifying enzymes employed by cells during 
replication, translation, repair, and protection from foreign DNA. A list of commonly used enzymes, 
their functions, and some of the experimental techniques in which they are utilized is provided in 
Table 104.2. 

104.2 Enabling Technologies 

The following are broadly applicable tools that have been developed in the context of molecular biology 
and are commonly used in genome analysis. 

Cloning. Cloning is a recombinant procedure that has two main purposes: First, it allows one to select 
a single DNA fragment from a complex mixture, and second, it provides a means to store, manipulate, 
propagate, and produce large numbers of identical molecules having this single ancestor. A cloning vector 
is a DNA fragment derived from a microorganism, such as a bacteriophage or yeast, into which a foreign 
DNA fragment may be inserted to produce a chimeric DNA species. The vector contains all the genetic 
information necessary to allow for the replication of the chimera in an appropriate host organism. A 
variety of cloning vectors have been developed which allow for the insertion and stable propagation of 
foreign DNA segments of various sizes; these are indicated in Table 104.3. 

Electrophoresis. Electrophoresis is a process whereby nucleic acids are separated by size in a sieving 
matrix under the influence of an electric field. In free solution, DNA, being highly negatively charged by 
virtue of its phosphodiester backbone, migrates rapidly toward the positive pole of an electric field. If 
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TABLE 104.3 Common Cloning Vectors 



Vector Approximate Insert Size Range (Base Pairs) 

Bacteriophage Ml 3 100-5000 

Plasmid 100-10,000 

Bacteriophage lambda 10,000-15,000 

Cosmid 25,000-50,000 

Yeast artificial chromosome (YAC) 100,000-1,000,000 

the DNA is forced instead to travel through a molecularly porous substance, such as a gel, the smaller 
(shorter) fragments of DNA will travel through the pores more rapidly than the larger (longer) fragments, 
thus effecting separation. Agarose, a highly purified derivative of agar, is commonly used to separate 
relatively large fragments of DNA (100 to 50,000 base pairs) with modest resolution (50 to 100 base 
pairs), while cross-linked polyacrylamide is used to separate smaller fragments (10 to 1,000 base pairs) 
with single base-pair resolution. Fragment sizes are generally estimated by comparison with standards 
run in another lane of the same gel. Electrophoresis is used extensively as both an analytical and a 
preparative tool in molecular biology. 

Enzymatic DNA Sequencing. In the late 1970s, Sanger and coworkers [4] reported a procedure employ- 
ing DNA polymerase to obtain DNA sequence information from unknown cloned fragments. While 
significant improvements and modifications have been made since that time, the basic technique remains 
the same: DNA polymerase is used to synthesize a complementary copy of an unknown single-stranded 
DNA (the template) in the presence of the four DNA monomers (deoxynucleotide triphosphates, or 
dNTPs). DNA polymerase requires a double-stranded starting point, so a single-stranded DNA (the 
primer) is hybridized at a unique site on the template (usually in the vector), and it is at this point that 
DNA synthesis is initiated. Key to the sequencing process is the use of a modified monomer, a dideoxy- 
nucleotide triphosphate (ddNTP), in each reaction. The ddNTP lacks the 3'-hydroxyl functionality (it 
has been replaced by a hydrogen) necessary for phosphodiester bond formation, and its incorporation 
thus blocks further elongation of the growing chain by polymerase. Four reactions are carried out, each 
containing all four dNTPs and one of the four ddNTPs. By using the proper ratios of dNTPs to ddNTP, 
each reaction generates a nested set of fragments, each fragment beginning at exactly the same point (the 
primer) and terminating with a particular ddNTP at each base complementary to that ddNTP in the 
template sequence. The products of the reactions are then separated by electrophoresis in four lanes of 
a polyacrylamide slab gel. Since conventional sequencing procedures utilize radiolabeling (incorporation 
of a small amount of 32 P- or 35 S-labeled dNTP by the polymerase), visualization of the gel is achieved by 
exposing it to film. The sequence can be obtained from the resulting autoradiogram, which appears as 
a series of bands (often termed a ladder) in each of the four lanes. Each band is composed of fragments 
of a single size, the shortest fragments being at the bottom of the gel and the longest at the top. Adjacent 
bands represent a single base pair difference, so the sequence is determined by reading up the ladders in 
the four lanes and noting which lane contains the band with the next largest sized fragments. The 
enzymatic sequencing process is diagrammed in Fig. 104.2. It should be noted that although other 
methods exist, the enzymatic sequencing technique is currently the most commonly used DNA sequenc- 
ing procedure due to its simplicity and reliability. 

Polymerase Chain Reaction (PCR). PCR [5] is an in vitro procedure for amplifying particular DNA 
sequences up to 10 8 -fold that is utilized in an ever-increasing variety of ways in genome analysis. The 
sequence to be amplified is defined by a pair of single-stranded primers designed to hybridize to unique 
sites flanking the target sequence on opposite strands. DNA polymerase in the presence of the four dNTPs 
is used to synthesize a complementary DNA copy across the target sequence starting at the two primer 
sites. The amplification procedure is performed by repeating the following cycle 25 to 50 times (see 
Fig. 104.3). First, the double-stranded target DNA is denatured at high temperature (94 to 96°C). Second, 
the mixture is cooled, allowing the primers to anneal to their complementary sites on the target single 
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template DNA 
primer 



ACGTTTAGGCGCATAATGCACG 
TGCAAATC 



DNA polymerase 
dNTPs 
ddNTP 



ddA ddC ddG ddT 



DNA sequencing reaction mixtures 



TGCAAATCCGCGddT 

TGCAAATCCGCGTAddT 

TGCAAATCCGCGTATddT 

TGCAAATCCGCGTATTACGddT 

TGCAAATCCGCGTATTACGTGC 

"T" reaction products 



ddA ddC 




Gel 
electrophoresis 



FIGURE 104.2 Enzymatic DNA sequencing. A synthetic oligonucleotide primer is hybridized to its complementary 
site on the template DNA. DNA polymerase and dNTPs are then used to synthesize a complementary copy of the 
unknown portion of the template in the presence of a chain-terminating ddNTP (see text) . A nested set of fragments 
beginning with the primer sequence and ending at every ddNTP position is produced in each reaction (the ddTTP 
reaction products are shown). Four reactions are carried out, one for each ddNTP. The products of each reaction 
are then separated by gel electrophoresis in individual lanes, and the resulting ladders are visualized. The DNA 
sequence is obtained by reading up the set of four ladders, one base at a time, from smallest to largest fragment. 

strands. Third, the temperature is adjusted for optimal DNA polymerase activity, initiating synthesis. 
Since the primers are complementary to the newly synthesized strands as well as the original target, each 
cycle of denaturation/annealing/synthesis effectively doubles the amount of target sequence present in 
the reaction, resulting in a 2" amplification (« = number of cycles). The initial implementation of PCR 
utilized a polymerase that was unstable at the high temperatures required for denaturation, thus requiring 
manual addition of polymerase prior to the synthesis step of every cycle. An important technological 
development was the isolation of DNA polymerase from a thermophilic bacterium, Thermus aquaticus 
(Taq), which can withstand the high denaturation temperatures [6]. Additionally, the high optimal 
synthesis temperature (70 to 72°C) of Taq polymerase improves the specificity of the amplification process 
by reducing spurious priming from annealing of the primers to nonspecific secondary sites in the target. 
While PCR can be performed successfully manually, it is a tedious process, and numerous thermal 
cycling instruments have become commercially available. Modern thermal cyclers are programmable and 
capable of processing many samples at once, using either small plastic tubes or microtiter plates, and are 
characterized by accurate and consistent temperature control at all sample positions, rapid temperature 
ramping, and minimal temperature over/undershoot. Temperature control is provided by a variety of 
means (Peltier elements, forced air, water) using metal blocks or water or air baths. Speed, precise 
temperature control, and high sample throughput are the watchwords of current thermal cycler design. 
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FIGURE 104.3 The first cycle in the polymerase chain reaction. In step 1, the double-stranded target DNA is 
thermally denatured to produce single-stranded species. A pair of synthetic primers, flanking the specific region of 
interest, are annealed to the single strands to form initiation sites for DNA synthesis by polymerase (step 2). Finally, 
complementary copies of each target single strand are synthesized by polymerase in the presence of dNTPs, thus 
doubling the amount of target DNA initially present (step 3). Repetition of this cycle effectively doubles the target 
population, affording one million-fold or greater amplification of the initial target sequence. 

PCR technology is commonly used to provide sufficient material for cloning from genomic DNA 
sources, to identify and characterize particular DNA sequences in an unknown mixture, to rapidly 
produce templates for DNA sequencing from very small amounts of target DNA, and in cycle sequencing, 
a modification of the enzymatic sequencing procedure that utilizes Taq polymerase and thermal cycling 
to amplify the products of the sequencing reactions. 

Chemical Synthesis of Oligodeoxynucleotides. The widespread use of techniques based on DNA poly- 
merase, such as enzymatic DNA sequencing and the PCR, as well as of numerous other techniques 
utilizing short, defined-sequence, single-stranded DNAs in genome analysis, is largely due to the ease 
with which small oligodeoxynucleotides can be obtained. The chemical synthesis of oligonucleotides has 
become a routine feature of both individual biology laboratories and core facilities. The most widely 
used chemistry for assembling short (<100 base pair) oligonucleotides is the phosphoramidite approach 
[7], which has developed over the past 15 years or so. This approach is characterized by rapid, high-yield 
reactions and stable reagents. Like modern peptide synthesis chemistry, the approach relies on the 
tethering of the growing DNA chain to a solid support (classically glass or silica beads, more recently 
cross-linked polystyrene) and is cyclic in nature. At the end of the assembly, the desired oligonucleotide 
is chemically cleaved from the support, generally in a form that is sufficiently pure for its immediate use 
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in a number of applications. The solid phase provides two significant advantages: It allows for the use 
of large reagent excesses, driving the reactions to near completion in accord with the laws of mass action 
while reducing the removal of these excesses following the reactions to a simple matter of thorough 
washing, and it enables the reactions to be performed in simple, flow-through cartridges, making the 
entire synthesis procedure easily automatable. Indeed, a number of chemical DNA synthesis instruments 
("gene machines") are commercially available, capable of synthesizing one to several oligonucleotides at 
once. Desired sequences are programmed through a keyboard or touchpad, reagents are installed, and 
DNA is obtained a few hours later. Improvements in both chemistry and instrument design have been 
aimed at increasing synthesis throughput (reduced cycle times, increased number of simultaneous 
sequence assemblies), decreasing scale (most applications in genome analysis require subnanomole quan- 
tities of any particular oligonucleotide) , and concomitant with these two, reducing cost per oligonucleotide. 

104.3 Tools for Genome Analysis 

Physical Mapping. In the analysis of genomes, it is often useful to begin with a less complex mixture 
than an entire genome DNA sample. Individual chromosomes can be obtained in high purity using a 
technology known as chromosome sorting [8], a form of flow cytometry. A suspension of chromosomes 
stained with fluorescent dye is flowed past a laser beam. As a chromosome enters the beam, appropriate 
optics detect the scattered and emitted light. Past the beam, the stream is acoustically broken into small 
droplets. The optical signals are used to electronically trigger the collection of droplets containing 
chromosomes by electrostatically charging these droplets and deflecting them into a collection medium 
using a strong electric field. Chromosomes can be differentiated by staining the suspension with two 
different dyes that bind in differing amounts to the various chromosomes and looking at the ratio of the 
emission intensity of each dye as it passes the laser/detector. Current commercial chromosome sorting 
instrumentation is relatively slow, requiring several days to collect sufficient material for subsequent 
analysis. 

As mentioned previously, whole genomes or even chromosomes cannot yet be analyzed as intact 
entities. As such, fractionation of large nucleic acids into smaller fragments is necessary to obtain the 
physical material on which to perform genetic analysis. Fractionation can be achieved using a variety of 
techniques: limited or complete digestion by restriction enzymes, sonication, or physical shearing through 
a small orifice. These fragments are then cloned into an appropriate vector, the choice of which depends 
on the size range of fragments involved (see Table 104.3). The composite set of clones derived from a 
large nucleic acid is termed a library. In general, it is necessary to produce several libraries in different 
cloning vectors containing different-sized inserts. This is necessary because the mapping of clones is 
facilitated by larger inserts, while the sequencing of clones requires shorter inserts. 

The library-generating process yields a very large number of clones having an almost random distri- 
bution of insert endpoints in the original fragment. It would be very costly to analyze all clones in a 
library, and unnecessary as well. Instead, a subset of overlapping clones is selected whose inserts span 
the entire starting fragment. These clones must be arrayed in the linear order in which they are found 
in the starting fragment; the process for doing this is called physical mapping. The conventional method 
for physically mapping clones uses restriction enzymes to cleave each clone at enzyme-specific sites, 
separating the products of the digestion by electrophoresis, and comparing the resulting patterns of 
restriction fragment sizes for different clones to find similarities. Clones exhibiting a number of the same- 
sized fragments likely possess the same subsequence and thus overlap. Clearly, the longer the inserts 
contained in the library, the faster a large genetic region can be covered by this process, since fewer clones 
are required to span the distance. Physical mapping also provides landmarks, the enzyme cleavage sites 
in the sequence, that can be used to provide reference points for the mapping of genes and other functional 
sequences. Mapping by restriction enzyme digestion is simple and reliable to perform; however, manual 
map assembly from the digest data is laborious, and significant effort is currently being expended in the 
development of robust and accurate map assembly software. 
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Normal agarose gel electrophoresis can effectively separate DNA fragments less than 10,000 base pairs 
and fragments between 10,000 and 50,000 base pairs less effectively under special conditions. However, 
the development of very large insert cloning vectors, such as the yeast artificial chromosome [9] , neces- 
sitated the separation of fragments significantly larger than 10,000 base pairs to allow for use in physical 
mapping. In order to address this issue, a technology called pulsed-field gel electrophoresis (PFGE) was 
developed. Unlike conventional electrophoresis, in which the electric field remains essentially constant, 
homogeneous, and unidirectional during a separation, PFGE utilizes an electric field that periodically 
changes its orientation. The principle of PFGE is thought to be as follows: When DNA molecules are 
placed in an electric field, the molecules elongate in the direction of the field and then begin to migrate 
through the gel pores. When the field is removed, the molecules relax to a more random coiled state and 
stop moving. Reapplication of the field in another orientation causes the DNA to change its conformation 
in order to align in that direction prior to migration. The time required for this conformational change 
to occur has been found to be very dependent on the size of the molecules, with larger molecules 
reorienting more slowly than small ones. Thus longer DNAs move more slowly under the influence of 
the constantly switching electric field than shorter ones, and size-based separation occurs. PFGE sepa- 
rations of molecules as large as 10 million base pairs have been demonstrated. Numerous instruments 
for PFGE have been constructed, differing largely in the strategy employed to provide electric field 
switching [10]. 

Physical maps based on restriction sites are of limited long-term utility, since they require the provision 
of physical material from the specific library from which the map was derived in order to be utilized 
experimentally. A more robust landmarking approach based on the PCR has been developed recently 
[11], termed sequence-tagged site (STS) content mapping. An STS is a short, unique sequence in a genome 
that can be amplified by the PCR. Clones in a library are screened for the presence of a particular STS 
using PCR; if the STS is indeed unique in the genome, then clones possessing that STS are reliably 
expected to overlap. Physical mapping is thus reduced to choosing and synthesizing pairs of PCR primers 
that define unique sequences in the genome. Additionally, since STSs are defined by pairs of primer 
sequences, they can be stored in a database and are thus universally accessible. 

DNA Sequencing. Early in the development of tools for large-scale DNA analysis, it was recognized 
that one of the most costly and time-consuming processes was the accumulation of DNA sequence 
information. Two factors, the use of radioisotopic labels and the manual reading and recording of DNA 
sequence films, made it impossible to consider genome-scale (10 6 to 10 9 base pairs) sequence analysis 
using the conventional techniques. To address this, several groups embarked on the development of 
automated DNA sequencing instruments [12-14]. Today, automated DNA sequencing is one of the most 
highly advanced of the technologies for genome analysis, largely due to the extensive effort expended in 
instrument design, biochemical organization, and software development. 

Key to the development of these instruments was the demonstration that fluorescence could be 
employed in the place of autoradiography for detection of fragments in DNA sequencing gels and that 
the use of fluorescent labels enabled the acquisition of DNA sequence data in an automated fashion in 
real time. Two approaches have been demonstrated: the "single-color, four-lane" approach and the "four- 
color, single-lane" approach. The former simply replaces the radioisotopic label used in conventional 
enzymatic sequencing with a fluorescent label, and the sequence is determined by the order of fluorescent 
bands in the four lanes of the gel. The latter utilizes a different-colored label for each of the four sequencing 
reactions (thus A might be "blue," C, "green," G, "yellow," and T, "red"). The four base-specific reactions 
are performed separately and upon completion are combined and electrophoresed in a single lane of the 
gel, and the DNA sequence is determined from the temporal pattern of fluorescent colors passing the 
detector. For a fixed number of gel lanes (current commercial automated DNA sequencers have 24 to 
36), the four-color approach provides greater sample throughput than the single-color approach. Instru- 
ments employing the four-color technology are more widely used for genome analysis at present, and as 
such, this strategy will be discussed more fully. 



© 2000 by CRC Press LLC 



■ ID I 



DNA sequencing reactions 






Computer output 



Pholomulliplisr 



FIGURE 104.4 Schematic illustration of automated fluorescence-based DNA sequencing using the "four-color, 
single-lane" approach. The products of each of the four enzymatic sequencing reactions are "color-coded" with a 
different fluorescent dye, either through the use of dye-labeled primers or dye-terminators. The four reaction mixtures 
are then combined and the mixture separated by gel electrophoresis. The beam of an argon ion laser is mechanically 
scanned across the width of the gel near its bottom to excite the labeled fragments to fluorescence. The emitted light 
is collected through a four-color filter wheel onto a photomultiplier tube. The color of each fluorescing band is 
determined automatically by a computer from the characteristic four-point spectrum of each dye, and the order of 
colors passing the detector is subsequently translated into DNA sequence information. 

In order to utilize fluorescence as a detection strategy for DNA sequencing, a chemistry had to be 
developed for the specific incorporation of fluorophores into the nested set of fragments produced in 
the enzymatic sequencing reactions. The flexibility of chemical DNA synthesis provided a solution to 
this problem. A chemistry was developed for the incorporation of an aliphatic primary amine in the last 
cycle of primer synthesis (i.e., at the 5' terminus) using standard DNA synthesis protocols [15,16]. This 
amine was then conjugated with any of several of readily available amine-reactive fluorochromes that 
had been developed previously for the labeling of proteins to produce the desired labeled sequencing 
primers. The purified dye-primer was demonstrated to perform well in DNA sequencing, exhibiting both 
efficient extension by the polymerase and the necessary single-base resolution in the electrophoretic 
separation [12]. A set of four spectrally discriminable, amine-reactive fluorophores has been developed 
[16] for DNA sequencing. 

While dye-primers are relatively easy to obtain by this method, they are costly to prepare in small 
quantities and require sophisticated chromatographic instrumentation to obtain products pure enough 
for sequencing use. Thus they are generally prepared in large amounts and employed as vector-specific 
"universal" primers, as in situations in which a very large number of inserts cloned in a given vector 
need to be sequenced [17]. For occasions where small amounts of sample-specific primers are needed, 
as in the sequencing of products from the PCR, a simpler and more economical alternative is the use of 
dideoxynucleotides covalently coupled to fluorescent dyes (so-called dye-terminators), since these 
reagents allow the use of conventional unlabeled primers [18]. 

Special instrumentation (Fig. 104.4) has been developed for the fluorescence-based detection of nucleic 
acids in DNA sequencing gels. An argon ion laser is used to excite the fluorescent labels in order to 
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provide sufficient excitation energy at the appropriate wavelength for high-sensitivity detection. The laser 
beam is mechanically scanned across the width of the gel near its bottom in order to interrogate all lanes. 
As the labeled DNA fragments undergoing electrophoresis move through the laser beam, their emission 
is collected by focusing optics onto a photomultiplier tube located on the scanning stage. Between the 
photomultiplier tube and the gel is a rotating four-color filter wheel. The emitted light from the gel is 
collected through each of the four filters in the wheel in turn, generating a continuous four-point 
spectrum of the detected radiation. The color of the emission of the passing bands is determined from 
the characteristic four-point spectrum of each fluorophore, and the identified color is then translated 
into DNA sequence using the associated dye/base pairings. Sequence acquisition and data analysis are 
handled completely by computer; the system is sufficiently sophisticated that the operator can load the 
samples on the gel, activate the electrophoresis and the data acquisition, and return the next day for the 
analyzed data. The current commercial implementation of this technology produces about 450 to 
500 bases per lane of analyzed DNA sequence information at an error rate of a few percent in a 12- to 
14-hour period. 

The rate of data production of current DNA sequencers is still too low to provide true genome-scale 
analytical capabilities, although projects in the few hundred kilobase pair range have been accomplished. 
Improvements such as the use of gel-filled capillaries [19] or ultrathin slab gels (thicknesses on the order 
of 50 to 100 p.m as opposed to the conventional 200 to 400 (im) [20,21] are currently being explored. 
The improved heat dissipation in these thin-gel systems allows for the use of increased electric field 
strengths during electrophoresis, with a concomitant reduction in run time of fivefold or so. 

The greatly increased throughput of the automated instruments over manual techniques has resulted 
in the generation of a new bottleneck in the DNA sequencing process that will only be exacerbated by 
higher-throughput systems: the preparation of sufficient sequencing reaction products for analysis. This 
encompasses two preparative processes, the preparation of sequencing templates and the performance 
of sequencing reactions. Automation of the latter process has been approached initially through the 
development of programmable pipetting robots that operate in the 96-well microtiter plate format 
commonly used for immunoassays, since a 96-well plate will accommodate sequencing reactions for 
24 templates. Sequencing robots of this sort have become important tools for large-scale sequencing 
projects. Template preparation has proven more difficult to automate. No reliable system for selecting 
clones, infecting and culturing bacteria, and isolating DNA has been produced, although several attempts 
are in progress. It is clear that unlike in the case of the sequencing robots, where instrumentation has 
mimicked manual manipulations with programmable mechanics, successful automation of template 
preparation will require a rethinking of current techniques with an eye toward process automation. 
Furthermore, in order to obtain true genome-scale automation, the entire sequencing procedure from 
template preparation through data acquisition will need to be reengineered to minimize, if not eliminate, 
operator intervention using the principles of systems integration, process management, and feedback 
control. 

Genetic Mapping. Simply stated, genetic mapping is concerned with identifying the location of genes 
on chromosomes. Classically, this is accomplished using a combination of mendelian and molecular 
genetics called linkage analysis, a complete description of which is too complex to be fully described here 
(but see Watson et al. [1]). However, an interesting approach to physically locating clones (and the genes 
contained within them) on chromosomes is afforded by a technique termed fluorescence in situ hybrid- 
ization (FISH) [22]. Fluorescently labeled DNA probes derived from cosmid clones can be hybridized to 
chromosome spreads, and the location of the probe-chromosome hybrid can be observed using fluores- 
cence microscopy. Not only can clones be mapped to particular chromosomes in this way, but positions 
and distances relative to chromosomal landmarks (such as cytogenetic bands, telomere, or centromere) 
can be estimated to as little as 50,000 base pairs in some cases, although 1 million base pairs or larger is 
more usual. The technique is particularly useful when two or more probes of different colors are used 
to order sequences relative to one another. 
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Computation. Computation plays a central role in genome analysis at a variety of levels, and significant 
efforts has been expanded on the development of software and hardware tools for biologic applications. 
A large effort has been expended in the development of software that will rapidly assemble a large 
contiguous DNA sequence from the many smaller sequences obtained from automated instruments. This 
assembly process is computationally demanding, and only recently have good software tools for this 
purpose become readily available. Automated sequencers produce 400 to 500 base pairs of sequence per 
template per run. However, in order to completely determine the linear sequence of a 50,000-base-pair 
cosmid insert (which can be conceptually represented as a linear array of 100 adjacent 500-base-pair 
templates), it is necessary to assemble sequence from some 300 to 1000 clones to obtain the redundancy 
of data needed for a high-accuracy finished sequence, depending on the degree to which the clones can 
be preselected for sequencing based on a previously determined physical map. Currently, the tools for 
acquiring and assembling sequence information are significantly better than those for physical mapping; 
as such, most large-scale projects employ strategies that emphasize sequencing at the expense of mapping 
[23]. Improved tools for acquiring and assembling mapping data are under development, however, and 
it remains to be seen what effect they will have on the speed and cost of obtaining finished sequence on 
a genome scale relative to the current situation. 

Many software tools have been developed in the context of the local needs of large-scale projects. These 
include software for instrument control (data acquisition and signal processing), laboratory information- 
management systems, and local data-handling schemes. The development of process approaches to the 
automation of genome analysis will necessitate the continued development of tools of these types. 

The final outcome of the analysis of any genome will be a tremendous amount of sequence information 
that must be accessible to researchers interested in the biology of the organism from which it was derived. 
Frequently, the finished genomic sequence will be the aggregate result of the efforts of many laboratories. 
National and international information resources (databases) are currently being established worldwide 
to address the issues of collecting, storing, correlating, annotating, standardizing, and distributing this 
information. Significant effort is also being expended to develop tools for the rapid analysis of genome 
sequence data that will enable biologists to find new genes and other functional genetic regions, compare 
very large DNA sequences for similarity, and study the role of genetic variation in biology. Eventually, 
as the robust tools for predicting protein tertiary structure and function from primary acid sequence 
data are developed, genome analysis will extend to the protein domain through the translation of new 
DNA sequences into their functional protein products. 

104.4 Conclusions 

Genome analysis is a large-scale endeavor whose goal is the complete understanding of the basic blueprint 
of life The scale of even the smallest genomes of biologic interest is too large to be effectively analyzed 
using the traditional tools of molecular biology and genetics. Over the past 10 years, a suite of biochemical 
techniques and bioanalytical instrumentation has been developed that has allowed biologists to begin to 
probe large genetic regions, although true genome-scale technology is still in its infancy. It is anticipated 
that the next 10 years will see developments in the technology for physical mapping, DNA sequencing, 
and genetic mapping that will allow for a 10- to 100-fold increase in our ability to analyze genomes, with 
a concomitant decrease in cost, through the application of process-based principles and assembly-line 
approaches. The successful realization of a true genome analysis capability will require the close collab- 
orative efforts of individuals from numerous disciplines in both science and engineering. 
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Vaccines are biologic preparations that elicit immune system responses that protect an animal against 
pathogenic organisms. The primary component of the vaccine is an antigen, which can be a weakened 
(attenuated) version of an infectious pathogen or a purified molecule derived from the pathogen. Upon 
oral administration or injection of a vaccine, the immune system generates humoral (antibody) and 
cellular (cytotoxic, or killer T cell) responses that destroy the antigen or antigen-infected cells. When 
properly administered, the immune response to a vaccine has a long-term memory component, which 
protects the host against future infections. Vaccines often contain adjuvants to enhance immune response, 
as well as formulation agents to preserve the antigen during storage or upon administration, to provide 
proper delivery of antigen, and to minimize side reactions. 

Table 105.1 presents a simple classification scheme for vaccines according to the type of organism and 
antigen, Live, attenuated whole-organism vaccines have been favored for simplicity of manufacture and 
for the strong immune response generated when the organism creates a subclinical infection before being 
overwhelmed. These are useful when the organism can be reliably attenuated in pathogenicity (while 
maintaining immunogenicity) or when the organism is difficult to cultivate ex vivo in large quantities 
and hence large amounts of antigen cannot be prepared. Conversely, subunit antigen vaccines are used 
when it is easy to generate large amounts of the antigen or when the whole organisms is not reliably 
attenuated. Since there is no replication in vivo, subunit vaccines rely on administrating relatively large 
amounts of antigen mass and are almost always adjuvanted to try to minimize the antigen needed. Subunit 
preparations have steadily gained favor, since biologic, engineering, and analytical improvements make 
them easily manufactured and characterized to a consistent standard. Passive vaccines are antibody 
preparations from human blood serum. These substitute for the patient's humoral response for immune- 
suppressed persons, for postexposure prophylaxis of disease, and for high infection risk situations where 
immediate protection is required, such as for travelers or medical personnel. Even more so than subunit 
vaccines, large amounts of antibodies are required. These vaccines do not provide long-term immune 
memory. 

The organism and the nature of the antigen combine to determine the technologies of manufacture 
for the vaccine. Vaccine production generally involves growing the organism or its antigenic parts 
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TABLE 105.1 Examples of Vaccines Against Human Pathogens, and Their Classification 



Type of Live, Attenuated Killed, Inactivated 

Organism Cells/Particles Cells/Particles 



Virus 



Bacterium 



Measles, mumps, 
rubella, polio, 
yellow fever, 
varicella*, 
rotavirus* 

Tuberculosis 
(BCG), typhoid 



Polio, hepatitis B, 
rabies, yellow 
fever, Japanese 
encephalitis, 
influenza, 
hepatitis A* 

Pertussis, cholera, 
plague bacillus 



Subcellular/Particle Vaccines 

Virus-like particles Hepatitis B, HIV 
gpl20-vaccinia* 

Characterized single/combined antigens 
Influenza HA + NA, herpes simplex 
gD* 
Toxoids 

Tetanus, diphtheria 
Characterized single/ combined antigens 
Pertussis PT + FHA + LPF + PSA + 

69kD, 
Pneumococcus polysaccharides (23), 
Meningococcus polysaccharides 
Polysaccharide-protein conjugates 
Haemophilus b, Pneumococcus* 



Passive Immunization 

Immune serum globulin 
Rabies, hepatitis B, 
cytomegalovirus 

Monoclonal antibody Anti- 
HIV gpl20* 

Immune serum globulin 
Tetanus, Nonspecific Ig 



* Denotes vaccines in development. 



(cultivation), treating it to purify and/or detoxify the organism and antigen (downstream processing), 
and in many cases further combining the antigen with adjuvants to increase its antigenicity and improve 
storage stability (formulation). These three aspects of production will be discussed, in addition to future 
trends in vaccine technology. This chapter does not include vaccines for parasitic disease [see Barbet, 
1989], such as malaria, since such vaccines are not yet commercially available. 



105.1 Antigen Cultivation 



Microbial Cultivation 
Bacterial Growth 

As far as cultivation is concerned, the fundamental principles of cell growth are identical for the various 
types of bacterial vaccines. Detailed aspects of bacterial growth and metabolism can be found in Ingraham 
et al. [1983]. In the simplest whole-cell vaccines, obtaining cell mass is the objective of cultivation. Growth 
is an autocatalytic process where the cell duplicates by fission; growth is described by the differential 
equation 



dX 
dt 



V* 



(105.1) 



where X is the total mass concentration of cells, X v is the mass concentration of living, or viable cells, 
and p. is the specific growth rate in units of cells per cell time. In this equation, the growth rate of the 
cell is a function of the cells' physical and chemical environment: 



V 



: /(temperature, pH, dissolved oxygen, C v C 2 , C 3 , ..., C n I 



(105.2) 



All bacteria have a narrow range of temperatures permissive to growth; for human pathogens, the optimal 
temperature is usually 37°C; however, for attenuated strains, the optimal temperature may be purposefully 
lowered by mutation. At the end of culture, heat in excess of 50°C is sometimes used to kill pathogenic 
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bacteria. Dissolved oxygen concentration is usually critical, since the organism will either require it or 
be inhibited by it. Cultivation of Clostridium tetani, for example, is conducted completely anaerobically. 

In Eq. (105.2), C, is the concentration of nutrient i presented to the cells. Nutrient conditions pro- 
foundly affect cell growth rate and the ultimate cell mass concentration achievable. At a minimum, the 
cells must be presented with a carbon source, a nitrogen source, and various elements in trace quantities. 
The carbon source, usually a carbohydrate such as glucose, is used for energy metabolism and as a 
precursor for various anabolites, small chemicals used to build the cell. The nitrogen source, which can 
be ammonia, or a complex amino acid mix such as a protein hydrolysate, also can be used to produce 
energy but is mainly present as a precursor for amino acid anabolites for protein production. The 
elements K, Mg, Ca, Fe, Mn, Mo, Co, Cu, and Zn are cofactors required in trace quantities for enzymatic 
reactions in the cell. Inorganic phosphorus and sulfur are incorporated into proteins, polysaccharides, 
and polynucleotides. 

Historically, bacterial vaccines have been grown in a batchwise fashion on complex and ill-defined 
nutrient mixtures, which usually include animal and/or vegetable protein digests. An example is Bordetella 
pertussis cultivation on Wheeler and Cohen medium [World Health Organization, 1977c] , which contains 
corn starch, an acid digest of bovine milk, casein, and a dialyzed lysate of brewer's yeast cells. The starch 
serves as the carbon source. The casein digest provides amino acids as a nitrogen source. The yeast extract 
provides amino acids and vitamins, as well as some trace elements, nucleotides, and carbohydrates. Such 
complex media components make it difficult to predict the fermentation performance, since their atomic 
and molecular compositions are not easily analyzed, and since they may contain hidden inhibitors of cell 
growth or antigen production. More recently, defined media have been used to better reproduce cell 
growth and antigen yield. In a defined medium, pure sugars, amino acids, vitamins, and other antibodies 
are used. When a single nutrient concentration limits cell growth, the growth rate can be described by 
Monod kinetics 

u. = rmax ' (105.3) 

q + K 

Here (J. max is the maximum specific rate, and K is the Monod constant for growth on the nutrient, an 
empirically determined number. It is seen from this equation that at high nutrient concentrations, the 
cell grows at maximum rate, as if the substrate were not limiting. At low concentration as the nutrient 
becomes limiting, the growth rate drops to zero, and the culture ends. 

Antigen Production 

Cultivation of microbes for subunit vaccines is similar to that of whole-cell vaccines, but here one is 
concerned with the production of the antigenic protein, polysaccharide, or antigen-encoding polynucleo- 
tide as opposed to the whole cell; thus cell growth is not necessarily the main objective. The goal is to 
maintain the proper environmental and nutritional factors for production of the desired product. Similar 
to Eqs. (105.1) and (105.2) above, one can describe antigen production by 

dP 

— = q X — k,P where a and 

J. i-p v d ip 

dt (105.4) 



k d = /(temperature pH, dissolved 2 , C v ..., C\ 



where P is the product antigen concentration, q is the cell-specific productivity, and k d is a degradation 
rate constant. For example, the production of pertussis toxin occurs best at slightly alkaline pH, about 
8.0 [Jagicza et al., 1986]. Corynebacterium diphtheriae toxin production is affected by the concentration 
of iron [Reylveld, 1980] . Nutrient effects on degradation can be found for hepatitis B virus vaccine, whose 
production is actually a microbial cultivation. Here, recombinant DNA inserted into Saccharomyces 



& 2000 by CRC Press LLC 



cerevisiae yeast cells produces the hepatitis B surface antigen protein (HBsAg), which spontaneously 
assembles into virus-like particles (about 100 protein monomers per particle) within the cell. Prolonged 
starvation of the yeast cells can cause production of cellular proteases that degrade the antigen protein 
to provide nutrition. Consequently, maximum production of antigen is accomplished by not allowing 
the yeast culture to attain maximum cell mass but by harvesting the culture prior to nutrient depletion. 

Cultivation Technology 

Cultivation vessels for bacterial vaccines were initially bottles containing stagnant liquid or agar-gelled 
medium, where the bacteria grew at the liquid or agar surface. These typically resulted in low concen- 
trations of the bacteria due to the limited penetration depth of diffusion-supplied oxygen and/or due to 
the limited diffusibility of nutrients through the stagnant liquid or agar. Despite the low mass, growth 
of the bacteria at the gas-liquid interface can result in cell differentiation (pellicle formation) and 
improved production of antigen (increased q). For the past several decades, however, glass and stainless 
steel fermentors have been used to increase production scale and productivity. Fermentors mix a liquid 
culture medium via an impeller, thus achieving quicker growth and higher cell mass concentrations than 
would be achievable by diffusive supply of nutrients. For aerobic microbe cultivations, fermenters oxy- 
genate the culture by bubbling air directly into the medium. This increases the oxygen supply rate 
dramatically over diffusion supply. Due to the low solubility of oxygen in water, oxygen must be contin- 
uously supplied to avoid limitation of this nutrient. 

Virus Cultivation 

Virus cultivation is more complex than bacterial cultivation because viruses are, by themselves, nonrep- 
licating. Virus must be grown on a host cell substrate, which can be animal tissue, embryo, or ex vivo 
cells; the host substrate determines the cultivation technology. In the United States, only Japanese 
encephalitis virus vaccine is still produced from infected mature animals. Worldwide, many vaccines are 
produced in chicken embryos, an inexpensive substrate. The remainder of vaccines are produced from 
ex vivo cultivated animal cells. Some virus-like particle vaccines are made by recombinant DNA tech- 
niques in either microbial or animal cells, e.g., hepatitis B virus vaccine, which is made in yeast as 
mentioned above, or in Chinese hamster ovary cells. An interesting synopsis of the development of rabies 
vaccine technology from Pasteur's use of animal tissues to modern use of ex vivo cells can be found in 
Sureau [1987]. 

In Vivo Virus Cultivation 

Virus cultivation in vivo is straightforward, since relatively little control can be exercised on the host 
tissue. Virus is simply inoculated into the organ, and after incubation, the infected organ is harvested. 
Influenza virus is the prototypical in vivo vaccine; the virus is inoculated into the allantoic sac of 9- to 
11-day-old fertilized chicken eggs. The eggs are incubated at about 33°C for 2 to 3 days, candled for 
viability and lack of contamination from the inoculation, and then the allantoic fluid is harvested. The 
process of inoculating, incubating, candling, and harvesting hundreds of thousands of eggs can be highly 
automated [Metzgar and Newhart, 1977]. 

Ex Vivo Virus Cultivation 

Use of ex vivo cell substrates is the most recent technique in vaccine cultivation. In the case of measles 
and mumps vaccines, the cell substrate is chicken embryo cells that have been generated by trypsin 
enzyme treatment that dissociates the embryonic cells. Similarly, some rabies vaccines use cells derived 
from fetal rhesus monkey kidneys. Since the 1 960s, cell lines have been generated that can be characterized, 
banked, and cryopreserved. Cryopreserved cells have been adopted to ensure reproducibility and freedom 
from contaminating viruses and microorganisms and to bypass the ethical problem of extensive use of 
animal tissues. Examples of commonly used cell lines are WI-38 and MRC-5, both human embryonic 
lung cells, and Vero, an African green monkey kidney cell line. 
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Ex vivo cells must be cultivated in a bio reactor, in liquid nutrient medium, with aeration. The principles 
of cell growth are the same as for bacterial cells described above, with some important additions. First, 
ex vivo animal cells cannot synthesize the range of metabolites and hormones necessary for survival and 
growth and must be provided with these compounds. Second, virus production is usually fatal to the 
host cells. Cells also become more fragile during infection, a process that is to an extent decoupled from 
cell growth. Virus growth and degradation and cell death kinetics can influence the choice of process or 
bioreactor. Third, all ex vivo cells used for human vaccine manufacture require a surface to adhere to in 
order to grow and function properly. Finally, animal cells lack the cross-linked, rigid polysaccharide cell 
wall that gives physical protection to microorganisms. These last three factors combine to necessitate 
specialized bioreactors. 

Cell Growth. Supplying cells with nutrients and growth factors is accomplished by growing the cells 
in a complex yet defined medium, which is supplemented with 2% to 10% (v/v) animal blood serum. 
The complex medium will typically contain glucose and L-glutamine as energy sources, some or all of 
the 20 predominant L-amino acids, vitamins, nucleotides, salts, and trace elements. For polio virus, 
productivity is a function of energy source availability [Eagle and Habel, 1956] and may depend on other 
nutrients. For polio and other viruses, the presence of the divalent cations Ca 2+ and Mg 2+ promotes viral 
attachment and entry into cells and stabilizes the virus particles. Serum provides growth-promoting 
hormones, lipids and cholesterol, surface attachment-promoting proteins such as fibronectin, and a host 
of other functions. The serum used is unusually of fetal bovine origin, since this source is particularly 
rich in growth-promoting hormones and contains low levels of antibodies that could neutralize virus. 
After cell growth, and before infection, the medium is usually changed to serum-free or low-serum 
medium. This is done both to avoid virus neutralization and to reduce the bovine protein impurities in 
the harvested virus. These proteins are immunogenic and can be difficult to purify away, especially for 
live-virus vaccines. 

Virus Production Kinetics. An intriguing aspect of ex vivo virus cultivation is that each process depends 
on whether the virus remains cell- associated, is secreted, or lyses the host cells and whether the virus is 
stable in culture. With few exceptions, a cell produces a finite amount of virus before dying, as opposed 
to producing the virus persistently. This is because the host cell protein and DNA/RNA synthesis 
organelles are usually commandeered by virus synthesis, and the host cannot produce its own proteins, 
DNA, or RNA. The goal then becomes to maximize cell concentration, as outlined above for bacterial 
vaccines, while maintaining the cells in a competent state to produce virus. Since infection is transient 
and often rapid, cell-specific virus productivity is not constant, and productivity is usually correlated 
with the cell state at inoculation. Specific productivity can be a function of the cell growth rate, since 
this determines the available protein and nucleic acid synthetic capacity. Although virus production can 
be nutrient-limited, good nutrition is usually ensured by the medium exchange at inoculation mentioned 
above. For viruses that infect cells slowly, nutrition is supplied by exchanging the medium several times, 
batchwise or by continuous perfusion. 

For many viruses, the degradation term in Eq. (105.4) can be significant. This can be due to inherent 
thermal instability of the virus, oxidation, or proteases released from lysed cells. For an unstable virus, 
obtaining a synchronized infection can be key to maximizing titers. Here, the multiplicity of infection 
(MOI), the number of virus inoculated per cell, is an important parameter. An MOI greater than unity 
results in most cells being infected at once, giving the maximum net virus production rate. 

Cultivation Technology. The fragility of animal cells during infection and the surface attachment 
requirement create a requirement for special reactors [Prokop and Rosenberg, 1989] . To an extent, reactor 
choice and productivity are determined by the reactor surface area. Small, simple, and uncontrolled 
vessels, such as a flat-sided T-flask or Roux bottle, made of polystyrene or glass, can be used for small- 
scale culture. With these flasks the medium is stagnant, so fragile infected cells are not exposed to fluid 
motion forces during culture. Like bacterial cultures, productivity can be limited due to the slow diffusion 
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of nutrients and oxygen. To obtain larger surface areas, roller bottles or Cell Factories are used, but like 
egg embryo culture, robotic automation is required to substantially increase the scale of production. Even 
larger culture scales (>50 liters) are accommodated in glass or stainless steel bioreactors, which are actively 
supplied with oxygen and pH controlled. The growth surface is typically supplied as a fixed bed of plates, 
spheres, or fibers or, alternately, as a dispersion of about 200-M.m spherical particles known as microcarriers 
[Reuveny, 1990]. Microcarrier bioreactors are used for polio and rabies production [Montagnon et al., 
1984] . The stirred-tank microcarrier reactors are similar to bacterial fermentors but are operated at much 
lower stirring speeds and gas sparging rates so as to minimize damage to the fragile cells. Packed-bed 
reactors are used for hepatitis A virus vaccine production [Aboud et al., 1994]. These types of reactors 
can give superior performance for highly lytic viruses because they subject the cells to much lower fluid 
mechanical forces. Design considerations for animal cell reactors may be found in Aunins and Henzler 
[1993]. 

105.2 Downstream Processing 

Following cultivation, the antigen is recovered, isolated in crude form, further purified, and/or inactivated 
to give the unformulated product; these steps are referred to collectively as the downstream process. The 
complexity of a downstream process varies greatly depending on whether the antigen is the whole 
organism, a semipurified subunit, or a highly purified subunit. Although the sequence and combination 
of steps are unique for each vaccine, there is a general method to purification. The first steps reduce the 
volume of working material and provide crude separation from contaminants. Later steps typically resolve 
molecules more powerfully but are limited to relatively clean feed streams. Some manufacturing steps 
are classic small laboratory techniques, because many vaccine antigens are quite potent, requiring only 
micrograms of material; historically, manufacturing scale-up has not been a critical issue. 

Purification Principles 
Recovery 

Recovery steps achieve concentration and liberation of the antigen. The first recovery step consists of 
separating the cells and/or virus from the fermentation broth or culture medium. The objective is to 
capture and concentrate the cells for cell-associated antigens or to clarify the medium of particulates for 
extracellular antigens. In the first case, the particle separation simultaneously concentrates the antigen. 
The two methods used are centrifugation and filtration. Batch volume and feed solids content determine 
whether centrifugation or filtration is appropriate; guidelines for centrifugation are given by Datar and 
Rosen [ 1993] and Atkinson and Mavituna [ 1991 ] . Filtration is used increasingly as filter materials science 
improves to give filters that do not bind antigen. Filtration can either be dead-end or cross-flow. Dead- 
end filters are usually fibrous depth filters and are used when the particulate load is low and the antigen 
is extracellular. In cross-flow filtration, the particles are retained by a microporous (>0. 1 (Im) or ultra- 
filtration (<0.1 Jim) membrane. The feed stream is circulated tangential to the membrane surface at high 
velocity to keep the cells and other particulates from forming a filter cake on the membrane [Hanisch, 
1986]. 

If the desired antigen is subcellular and cell-associated, the next recovery step is likely to be cell lysis. 
This can be accomplished by high-pressure valve homogenization, bead mills, or chemical lysis with 
detergent or chaotropic agents, to name only a few techniques. The homogenate or cell lysate may 
subsequently be clarified, again using either centrifugation or membrane filtration. 

Isolation 

Isolation is conducted to achieve crude fractionation from contaminants that are unlike the antigen; 
precipitation and extraction are often used here. These techniques rely on large differences in charge or 
solubility between antigen and contaminants. Prior to isolation, the recovered process stream may be 
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treated enzymatically with nucleases, proteases, or lipases to remove DNA/RNA, protein, and lipids, the 
major macromolecular contaminants. Nonionic detergents such as Tween and Triton can serve a similar 
function to separate components, provided they do not denature the antigen. Subsequent purification 
steps must be designed to remove the enzyme(s) or detergent, however. 

Ammonium sulfate salt precipitation is a classic method used to concentrate and partially purify 
various proteins, e.g., diphtheria and tetanus toxins. Alcohol precipitation is effective for separating 
polysaccharides from proteins. Cohn cold alcohol precipitation is a classic technique used to fractionate 
blood serum for antibody isolation. Both techniques concentrate antigen for further treatment. 

Liquid-liquid extraction, either aqueous-organic or aqueous-aqueous, is another isolation technique 
suitable for vaccine purification. In a two-phase aqueous polymer system, the separation is based on the 
selective partitioning of the product from an aqueous liquid phase containing, e.g., polyethylene glycol 
(PEG), into a second, immiscible aqueous liquid phase that contains another polymer, e.g., dextran, or 
containing a salt [see Kelley and Hatton, 1992]. 

Final Purification 

Further purification of the vaccine product is to remove contaminants that have properties closely 
resembling the antigen. These sophisticated techniques resolve molecules with small differences in charge, 
hydrophobicity, density, or size. 

Density-gradient centrifugation, although not readily scalable, is a popular technique for final purifi- 
cation of viruses [Poison, 1993]. Either rate-zonal centrifugation, where the separation is based on 
differences in sedimentation rate, or isopycnic equilibrium centrifugation, where the separation is based 
solely on density differences, is used. Further details on these techniques can be found in Dobrota and 
Hinton [1992]. 

Finally, different types of chromatography are employed to manufacture highly pure vaccines; prin- 
ciples can be found in Janson and Ryden [1989]. Ion-exchange chromatography (IEC) is based on 
differences in overall charge and distribution of charge on the components. Hydrophobic-interaction 
chromatography (HIC) exploits differences in hydrophobicity. Affinity chromatography is based on 
specific stereochemical interactions common only to the antigen and the ligand. Size-exclusion chroma- 
tography (SEC) separates on the basis of size and shape. Often, multiple chromatographic steps are used, 
since the separation mechanisms are complementary. IEC and HIC can sometimes be used early in the 
process to gain substantial purification; SEC is typically used as a final polishing step. This technique, 
along with ultrafiltration, may be used to exchange buffers for formulation at the end of purification. 

Inactivation 

For nonattenuated whole organisms or for toxin antigens, the preparation must be inactivated to eliminate 
pathogenicity. This is accomplished by heat pasteurization, by cross-linking using formaldehyde or 
glutaraldehyde, or by alkylating using agents such as (3-propiolactone. The agent is chosen for effectiveness 
without destruction of antigenicity. For whole organisms, the inactivation abolishes infectivity. For 
antigens such as the diphtheria and tetanus toxins, formaldehyde treatment removes the toxicity of the 
antigen itself as well as killing the organism. These detoxified antigens are known as toxoids. 

The placement of inactivation in the process depends largely on safety issues. For pathogen cultures, 
inactivation traditionally has been immediately after cultivation to eliminate danger to manufacturing 
personnel. For inactivation with cross-linking agents, however, the step may be placed later in the process 
in order to minimize interference by contaminants that either foil inactivation of the organism or cause 
carryover of antigen-contaminant cross-linked entities that could cause safety problems, i.e., side-reac- 
tions in patients. 

Purification Examples 

Examples of purification processes are presented below, illustrating how the individual techniques are 
combined to create purification processes. 
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Bacterial Vaccines 

Salmonella typhi Ty21, a vaccine for typhoid fever, and BCG (bacille Calmette-Guerin, a strain of 
Mycobacterium bovis) vaccine against Myobacterium tuberculosis, are the only vaccines licensed for human 
use based on live, attenuated bacteria. Downstream processing consists of collecting the cells by contin- 
uous-flow centrifugation or using cross-flow membrane filtration. For M. bovis that are grown in a liquid 
submerged fermenter culture, Tween 20 is added to keep the cells from aggregating, and the cultures are 
collected as above. Tween 20, however, has been found to decrease virulence. In contrast, if the BCG is 
grown in stagnant bottles as a surface pellicle, the downstream process consists of collecting the pellicle 
sheet, which is a moist cake, and then homogenizing the cake using a ball mill. Milling time is critical, 
since prolonged milling kills the cells and too little milling leaves clumps of bacteria in suspension. 

Most current whooping cough vaccines are inactivated whole B. pertussis. The cells are harvested by 
centrifugation and then resuspended in buffer, which is the supernate in some cases. This is done because 
some of the filamentous hemagglutinin (FHA) and pertussis toxin (PT) antigens are released into the 
supernate. The cell concentrate is inactivated by mild heat and stored with thimerosal and/or formalde- 
hyde. The inactivation process serves the dual purpose of killing the cells and inactivating the toxins. 

C. diphtheria vaccine is typical of a crude protein toxoid vaccine. Here the 58 kDa toxin is the antigen, 
and it is converted to a toxoid with formaldehyde and crudely purified. The cells are first separated from 
the toxin by centrifugation. Sometimes the pathogen culture is inactivated with formaldehyde before 
centrifugation. The supernate is treated with formaldehyde to 0.75%, and it is stored for 4 to 6 weeks at 
37°C to allow complete detoxification [Pappenheimer, 1984]. The toxoid is then concentrated by ultra- 
filtration and fractionated from contaminants by ammonium sulfate precipitation. During detoxification 
of crude material, reactions with formaldehyde lead to a variety of products. The toxin is internally cross- 
linked and also cross-linked to other toxins, beef peptones from the medium, and other medium proteins. 
Because detoxification creates a population of molecules containing antigen, the purity of this product 
is only about 60% to70%. 

Due to the cross-linking of impurities, improved processes have been developed to purify toxins before 
formaldehyde treatment. Purification by ammonium sulfate fractionation, followed by ion-exchange 
and/or size-exclusion chromatography, is capable of yielding diphtheria or tetanus toxins with purities 
ranging from 85% to 95%. The purified toxin is then treated with formaldehyde or glutaraldehyde 
[Relyveld and Ben-Efraim, 1983] to form the toxoid. Likewise, whole-cell pertussis vaccine is being 
replaced by subunit vaccines. Here, the pertussis toxin (PT) and the filamentous hemagglutinin (FHA) 
are purified from the supernate. These two antigens are isolated by ammonium sulfate precipitation, 
followed by sucrose density-gradient centrifugation to remove impurities such as endotoxin. The FHA 
and PT are then detoxified with formaldehyde [Sato et al., 1984]. 

Bacterial components other than proteins also have been developed for use as subunit vaccines. One 
class of bacterial vaccines is based on capsular polysaccharides. Polysaccharides from Meningococcus, 
Pneumococcus, and Haemophilus influenzae type b are used for vaccines against these organisms. After 
separating the cells, the polysaccharides are typically purified using a series of alcohol precipitations. As 
described below, these polysaccharides are not antigenic in infants. As a consequence, the polysaccharides 
are chemically cross-linked, or conjugated, to a highly purified antigenic protein carrier. After conjuga- 
tion, there are purification steps to remove unreacted polysaccharide protein, and small-molecular- weight 
cross-linking reagents. Several manufacturers have introduced pediatric conjugate vaccines against 
H. influenzae type b (Hib-conjugate). 

Viral Vaccines 

Live viral vaccines have limited downstream processes. For secreted viruses such as measles, mumps, and 
rubella, cell debris is simply removed by filtration and the supernate frozen. In many cases it is necessary 
to process quickly and at refrigerated temperatures, since live virus can be unstable [Cryz and Gluck, 
1990]. 

For cell-associated virus such as herpesviruses, e.g., varicella zoster (chickenpox virus), the cells are 
washed with a physiologic saline buffer to remove medium contaminants and are harvested. This can be 
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done by placing them into a stabilizer formulation (see below) and then mechanically scraping the cells 
from the growth surface. Alternately, the cells can be harvested from the growth surface chemically or 
enzymatically. For the latter, the cells are centrifuged and resuspended in stabilizer medium. The virus 
is then liberated by disrupting the cells, usually by sonication, and the virus-containing supernate is 
clarified by dead-end filtration and frozen. 

Early inactivated influenza virus vaccines contained relatively crude virus. The allantoic fluid was 
harvested, followed by formaldehyde inactivation of the whole virus, and adsorption to aluminum 
phosphate (see below), which may have provided some purification as well. The early vaccines, however, 
were associated with reactogenicity. For some current processes, the virus is purified by rate-zonal 
centrifugation, which effectively eliminates the contaminants from the allantoic fluid. The virus is then 
inactivated with formaldehyde or (3-propiolactone, which preserves the antigenicity of both the hemag- 
glutinin (HA) and neuraminidase (NA) antigens. Undesirable side reactions have been even further 
reduced with the introduction of split vaccines, where the virus particle is disrupted by an organic solvent 
or detergent and then inactivated. By further purifying the split vaccine using a method such as zonal 
centrifugation to separate the other virion components from the HA and NA antigens, an even more 
highly purified HA + NA vaccine is available. These antigens are considered to elicit protective antibodies 
against influenza [Tyrrell, 1976]. 

Recently, an extensive purification process was developed for hepatitis A vaccine, yielding a >90% pure 
product [Aboud et al., 1994]. The intracellular virus is released from the cells by Triton detergent lysis, 
followed by nuclease enzyme treatment for removal of RNA and DNA. The virus is concentrated and 
detergent removed by ion-exchange chromatography. PEG precipitation and chloroform solvent extrac- 
tion purify away most of the cellular proteins, and final purification and polishing are achieved by ion- 
exchange and size-exclusion chromatography. The virus particle is then inactivated with formaldehyde. 
In this case, inactivation comes last for two reasons. First, the virus is attenuated, so there is no risk to 
process personnel. Second, placing the inactivation after the size-exclusion step ensures that there are no 
contaminants or virus aggregates that may cause incomplete inactivation. 

The first hepatitis B virus vaccines were derived from human plasma [Hilleman, 1993]. The virus is 
a 22-nm-diameter particle, much larger than most biologic molecules. Isolation was achieved by ammo- 
nium sulfate or PEG precipitation, followed by rate zonal centrifugation and isopycnic banding to take 
advantage of the large particle size. The preparation was then treated with pepsin protease, urea, and 
formaldehyde or heat. The latter steps ensure inactivation of possible contaminant viruses from the blood 
serum. More recently, recombinant DNA-derived hepatitis B vaccines are expressed as an intracellular 
noninfectious particle in yeast and use a completely different purification process. Here, the emphasis is 
to remove the yeast host contaminants, particularly high levels of nucleic acids and polysaccharides. 
Details on the various manufacturing processes have been described by Sitrin et al. [1993]. 

Antibody Preparations 

Antibody preparation starts from the plasma pool prepared by removing the cellular components of 
blood. Cold ethanol is added in increments to precipitate fractions of the blood proteins, and the 
precipitate containing IgG antibodies is collected. This is further redissolved and purified by ultrafiltra- 
tion, which also exchanges the buffer to the stabilizer formulation. Sometimes ion-exchange chromatog- 
raphy is used for further purification. Although the plasma is screened for viral contamination prior to 
pooling, all three purification techniques remove some virus. 

105.3 Formulation and Delivery 

Successful vaccination requires both the development of a stable dosage form for in vitro storage and 
the proper delivery and presentation of the antigen to elicit a vigorous immune response in vivo. This 
is done by adjuvanting the vaccine and/or by formulating the adjuvanted antigen. An adjuvant is defined 
as an agent that enhances the immune response against an antigen. A formulation contains an antigen 
in a delivery vehicle designed to preserve the (adjuvenated) antigen and to deliver it to a specific target 
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organ or over a desired time period. Despite adjuvanting and formulation efforts, most current vaccines 
require multiple doses to create immune memory. 

Live Organisms 

Live viruses and bacteria die relatively quickly in liquid solution (without an optimal environment) and 
are therefore usually stored in the frozen state. Preserving the infectivity of frozen live-organism vaccines 
is typically accomplished by lyophilization or freeze-drying. The freeze-drying process involves freezing 
the organism in the presence of stabilizers, followed by sublimation of both bulk water (primary drying) 
and more tightly bound water (secondary drying). The dehydration process reduces the conformational 
flexibility of the macromolecules, providing protection against thermal denaturation. Stabilizers also 
provide conformational stability and protect against other inactivating mechanisms such as amino acid 
deamidation, oxidation, and light-catalyzed reaction. 

Final water content of the freeze-dried product is the most important parameter for the drying process. 
Although low water content enhances storage stability, overdrying inactivates biologic molecules, since 
removal of tightly bound water disrupts antigen conformation. Influenza virus suspensions have been 
shown to be more stable at 1.7% (w/w) water than either 0.4% to 1% or 2.1% to 3.2% [Greiff and 
Rightsel, 1968] . Other lyophilization parameters that must be optimized pertain to heat and mass transfer, 
including (1) the rate of freezing and sublimation, (2) vial location in the freeze-drier, and (3) the type 
of vial and stopper used to cap the vial. Rates of freezing and drying affect phase transitions and 
compositions, changing the viable organism yield on lyophilization and the degradation rate of the 
remaining viable organisms on storage. 

Stabilizers are identified by trial-and-error screening and by examining the mechanisms of inactivation. 
They can be classified into four categories depending on their purpose: specific, nonspecific, competitive, 
and pharmaceutical. Specific stabilizers are ligands that naturally bind biologic macromolecules. For 
example, enzyme antigens are often stabilized by their natural substrates or closely related compounds. 
Antigen stabilizers for the liquid state also stabilize during freezing. Nonspecific stabilizers such as sugars, 
amino acids, and neutral salts stabilize proteins and virus structures via a variety of mechanisms. Sugars 
and polyols act as bound water substitutes, preserving conformational integrity without possessing the 
chemical reactivity of water. Buffer salts preserve optimal pH. Competitive inhibitors outcompete the 
organism or antigen for inactivating conditions, such as gas-liquid interfaces, oxygen, or trace-metal ions 
[Volkin and Klibanov, 1989]. Finally, pharmaceutical stabilizers may be added to preserve pharmaceutical 
elegance, i.e., to prevent collapse of the lyophilized powder during the drying cycle, which creates difficult 
redissolution. Large-molecular-weight polymers such as carbohydrates (dextrans or starch) or proteins 
such as albumin or gelatin are used for this purpose. For example, a buffered sorbitol-gelatin medium 
has been used successfully to preserve the infectivity of measles virus vaccine during lyophilized storage 
for several years at 2 to 8°C [Hilleman, 1989]. An example of live bacterium formulation to preserve 
activity on administration is typhoid fever vaccine, administered orally, S. typhi bacteria are lyophilized 
to a powder to preserve viability on the shelf, and the powder is encapsulated in gelatin to preserve 
bacterial viability when passing through the low-pH stomach. The gelatin capsule dissolves in the intestine 
to deliver the live bacteria. 

Oral polio vaccine is an exception to the general rule of lyophilization, since polio virus is inherently 
quite stable relative to other viruses. It is formulated as a frozen liquid and can be used for a limited time 
after thawing [Melnick, 1984]. In the presence of specific stabilizers such as MgCl 2 , extended 4°C stability 
can be obtained. 

Subunit Antigens 

Inactivated and/or purified viral and bacterial antigens inherently offer enhanced stability because whole- 
organism infectivity does not need to be preserved. However, these antigens are not as immunogenic as live 
organisms and thus are administered with an adjuvant. They are usually formulated as an aqueous liquid 
suspension or solution, although they can be lyophilized under the same principles as above. The major 
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adjuvant recognized as safe for human use is alum. Alum is a general term referring to various hydrated 
aluminum salts; a discussion of the different alums can be found in Shirodkar et al. [1990]. Vaccines can 
be formulated with alum adjuvants by two distinct methods: adsorption to performed aluminum precipi- 
tates or precipitation of aluminum salts in the presence of the antigen, thus adsorbing and entrapping the 
antigen. Alum's adjuvant activity is classically believed to be a "depot" effect, slowly delivering antigen over 
time in vivo. In addition, alum particles are believed to be phagocytized by macrophages. 

Alum properties vary depending on the salt used. Adjuvants labeled aluminum hydroxide are actually 
aluminum oxyhydroxide, AIO(OH). This material is crystalline, has a fibrous morphology, and has a 
positive surface charge at neutral pH. In contrast, aluminum phosphate adjuvants are networks of 
platelike particles of amorphous aluminum hydroxyphosphate and possess a negative surface charge at 
neutral pH. Finally, alum coprecipitate vaccines are prepared by mixing an acidic alum solution of 
KA1(S0 4 ) 2 -12H 2 with an antigen solution buffered at neutral pH, sometimes actively pH-controlled 
with base. At neutral pH, the aluminum forms a precipitate, entrapping and adsorbing the antigen. The 
composition and physical properties of this alum vary with processing conditions and the buffer anions. 
In general, an amorphous aluminum hydroxy(buffer anion)sulfate material is formed. 

Process parameters must be optimized for each antigen to ensure proper adsorption and storage 
stability. First, since antigen adsorption isotherms are a function of the antigen's isoelectric point and 
the type of alum used [Seeber et al., 1991], the proper alum and adsorption pH must be chosen. Second, 
the buffer ions in solution can affect the physical properties of alum over time, resulting in changes in 
solution pH and antigen adsorption. Finally, heat sterilization of alum solutions and precipitates prior 
to antigen adsorption can alter their properties. Alum is used to adjuvant virtually all the existing 
inactivated or formaldehyde-treated vaccines, as well as purified subunit vaccines such as HBsAg and 
Hib-conjugate vaccines. The exception is for some bacterial polysaccharide vaccines and for new vaccines 
under development (see below). 

An interesting vaccine development challenge was encountered with Hib-conjugate pediatric vaccines, 
which consist of purified capsular polysaccharides. Although purified, unadjuvanted polysaccharide is used 
in adults, it is not sufficiently immunogenic in children under age 2, the population is greatest risk [Ellis, 
1992; Howard, 1992]. Chemical conjugation, or cross-linking, of the PRP polysaccharide to an antigenic 
protein adjuvant elicits T-helper cell activation, resulting in higher antibody production. Variations in 
conjugation chemistry and protein carriers have been developed; example proteins are the diphtheria toxoid 
(CRM 197), tetanus toxoid, and the outer membrane protein complex of N. meningitidis [Ellis, 1992; 
Howard, 1992]. The conjugated polysaccharide is sometimes adsorbed to alum for further adjuvant action. 

105.4 Future Trends 

The reader will have noted that many production aspects for existing vaccines are quite archaic. This is so 
because most vaccines were developed before the biotechnology revolution, which is creating a generation 
of highly purified and better-characterized subunit vaccines. As such, for older vaccines "the process defines 
the product," and process improvements cannot readily be incorporated into these poorly characterized 
vaccines without extensive new clinical trials. With improved scientific capabilities, we can understand the 
effects of process changes on the physicochemical properties of new vaccines and on their behavior in vivo. 

Vaccine Cultivation 

Future cultivation methods will resemble existing methods of microbial and virus culture. Ill-defined 
medium components and cells will be replaced to enhance reproducibility in production. For bacterial and 
ex vivo cultivated virus, analytical advances will make monitoring the environment and nutritional status 
of the culture more ubiquitous. However, the major changes will be in novel product types — single-molecule 
subunit antigens, virus-like particles, monoclonal antibodies, and gene-therapy vaccines, each of which 
will incorporate novel processes. 
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Newer subunit vaccine antigens will be cultivated via recombinant DNA in microbial or animal cells. 
Several virus-like particle vaccines are under development using recombinant baculovirus (nuclear poly- 
hedrosis virus) to infect insect cells (spodoptera frugipeeda or trichoplusia ni). Like the hepatitis B 
vaccine, the viral antigens spontaneously assemble into a noninfectious capsid within the cell. Although 
the metabolic pathways of insect cells differ from vertebrates, cultivation principles are similar. Insect 
cells do not require surface attachment and are grown much like bacteria. However, they also lack a cell 
wall and are larger and hence more fragile than vertebrate cells. 

Passive antibody vaccines have been prepared up to now from human blood serum. Consequently, 
there has been no need for cultivation methods beyond vaccination and conventional harvest of antibody- 
containing blood from donors. Due to safety concerns over using human blood, passive vaccines will 
likely be monoclonal antibodies or cocktails thereof prepared in vitro by the cultivation of hybridoma 
or myeloma cell lines. This approach is under investigation for anti-HIV-1 antibodies [Emini et al., 1992]. 
Cultivation of these cell lines involves the same principles of animal cell cultivation as described above, 
with the exception that hybridomas can be less fastidious in nutritional requirements, and they do not 
require surface attachment for growth. These features will allow for defined serum-free media and simpler 
cultivation vessels and procedures. 

For the gene-therapy approach, the patient actually produces the antigen. A DNA polynucleotide 
encoding protein antigen(s) is injected intramuscularly into the patient. The muscle absorbs the DNA 
and produces the antigen, thereby eliciting an immune response [Ulmer et al., 1993]. For cultivation, 
production of the DNA plasmid is the objective, which can be done efficiently by bacteria such as 
Escherichia coli. Such vaccines are not sufficiently far along in development to generalize the factors that 
influence their production; however, it is expected that producer cells and process conditions that favor 
high cell mass, DNA replication, and DNA stability will be important. A potential beauty of this vacci- 
nation approach is that for cultivation, purification, and formulation, many vaccines can conceivably be 
made by identical processes, since the plasmids are inactive within the bacterium and possess roughly 
the same nucleotide composition. 

Downstream Processing 

Future vaccines will be more highly purified in order to minimize side effects, and future improvements 
will be to assist this goal. The use of chemically defined culture media will impact favorably on down- 
stream processing by providing a cleaner feedstock. Advances in filtration membranes and in chromato- 
graphic support binding capacity and throughput will improve ease of purification. Affinity purification 
methods that rely on specific "lock and key" interactions between a chromatographic support and the 
antigen will see greater use as well. Techniques amenable to larger scales will be more important to meet 
increased market demands and to reduce manufacturing costs. HPLC and other analytical techniques 
will provide greater process monitoring and control throughout purification. 

As seen during the evolution of diphtheria and tetanus toxoid vaccines, the trend will be to purify 
toxins prior to inactivation to reduce their cross-linking with other impurities. New inactivating agents 
such as hydrogen peroxide and ethyl dimethylaminopropyl carbodiimide have been investigated for 
pertussis toxin, which do not have problems of cross-linking or reversion of the toxoid to toxin status. 

Molecular biology is likely to have an even greater impact on purification. Molecular cloning of proteins 
allows the addition of amino acid sequences that can facilitate purification, e.g., polyhistidine or poly- 
alanine tails for metal ion, or ion-exchange chromatography. Recent efforts also have employed genetic 
manipulation to inactivate toxins, eliminating the need for the chemical treatment step. 

Vaccine Adjuvants and Formulation 

Many new subunit antigens lack the inherent immunogenicity found in the natural organism, thereby 
creating the need for better adjuvants. Concomitantly, the practical problem of enhancing worldwide 
immunization coverage has stimulated development of single-shot vaccine formulations in which 
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booster doses are unnecessary. Thus future vaccine delivery systems will aim at reducing the number of 
doses via controlled antigen release and will increase vaccine efficacy by improving the mechanism of 
antigen presentation (i.e., controlled release of antigen over time or directing of antigen to specific 
antigen-presenting cells). Major efforts are also being made to combine antigens into single-shot vaccines 
to improve immunization rates for infants, who currently receive up to 15 injections during the first 
2 years of life. Coadministration of antigens presents unique challenges to formulation as well. 

Recent advances in the understanding of in vivo antigen presentation to the immune system has 
generated considerable interest in developing novel vaccine adjuvants. The efficacy of an adjuvant is 
judged by its ability to stimulate specific antibody production and killer cell proliferation. Developments 
in biology now allow analysis of activity by the particular immune cells that are responsible for these 
processes. Examples of adjuvants currently under development include saponin detergents, muramyl 
dipeptides, and lipopolysaccharides (endotoxin), including lipid A derivatives. As well, cytokine growth 
factors that stimulate immune cells directly are under investigation. 

Emulsion and liposome delivery vehicles are also being examined to enhance the presentation of 
antigen and adjuvant to the immune system [Edelman, 1992; Allison and Byars, 1992] . Controlled-release 
delivery systems are also being developed that encapsulate antigen inside a polymer-based solid micro- 
sphere. The size of the particles typically varies between 1 and 300 p.m depending on the manufacturing 
process. Microspheres are prepared by first dissolving the biodegradable polymer in an organic solvent. 
The adjuvanted antigen, in aqueous solution or lyophilized powder form, is then emulsified into the 
solvent-polymer continuous phase. Microspheres are then formed by either solvent evaporation, phase- 
separation, or spray-drying, resulting in entrapment of antigen [Morris et al., 1994]. The most frequently 
employed biodegradable controlled-released delivery systems use FDA-approved poly(lactide-co-gly- 
colide) copolymers (PLGA), which hydrolyze in vivo to nontoxic lactic and glycolic acid monomers. 
Degradation rate can be optimized by varying the microsphere size and the monomer ratio. Antigen 
stability during encapsulation and during in vivo release from the microspheres remains a challenge. 
Other challenges to manufacturing include encapsulation process reproducibility, minimizing antigen 
exposure to denaturing organic solvents, and ensuring sterility. Methods are being developed to address 
these issues, including the addition of stabilizers for processing purposes only. It should be noted that 
microsphere technology may permit vaccines to be targeted to specific cells; they can potentially be 
delivered orally or nasally to produce a mucosal immune response. 

Other potential delivery technologies include liposomes and alginate polysaccharide and poly(dicar- 
boxylatophenoxy)phosphazene polymers. The latter two form aqueous hydrogels in the presence of 
divalent cations [Khan et al., 1994]. Antigens can thus be entrapped under aqueous conditions with 
minimal processing by simply mixing antigen and soluble aqueous polymer and dripping the mixture 
into a solution of CaCl 2 . The particles erode by Ca 2+ loss, mechanical and chemical degradation, and 
macrophage attack. For alginate polymers, monomer composition also determines the polymer's immu- 
nogenicity, and thus the material can serve as both adjuvant and release vehicle. 

For combination vaccines, storage and administration compatibility of the different antigens must be 
demonstrated. Live-organism vaccines are probably not compatible with purified antigens, since the 
former usually require lyophilization and the latter are liquid formulas. Within a class of vaccines, 
formulation is challenging. Whereas it is relatively straightforward to adjuvant and formulate a single 
antigen, combining antigens is more difficult because each has its own unique alum species, pH, buffer 
ion, and preservative optimum. Nevertheless, several combination vaccines have reached the market, and 
others are undergoing clinical trials. 

105.5 Conclusions 

Although vaccinology and manufacturing methods have come a considerable distance over the past 
40 years, much more development will occur. There will be challenges for biotechnologists to arrive at 
safer, more effective vaccines for an ever-increasing number of antigen targets. If government interference 
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and legal liability questions do not hamper innovation, vaccines will remain one of the most cost-effective 
and logical biomedical technologies of the next century, as disease is prevented rather than treated. 

Challenges are also posed in bringing existing vaccines to technologically undeveloped nations, where 
they are needed most. This problem is almost exclusively dominated by the cost of vaccine manufacture 
and the reliability of distribution. Hence it is fertile ground for engineering improvements in vaccine 
production. 

Defining Terms 

Adjuvant: A chemical or biologic substance that enhances immune response against an antigen. Used 

here as a verb, the action of combining an antigen and an adjuvant. 
Antigen: A macromolecule or assembly of macromolecules from a pathogenic organism that the 

immune system recognizes as foreign. 
Attenuation: The process of mutating an organism so that it no longer causes disease. 
Immunogen: A molecule or assembly of molecules with the ability to invoke an immune system 

response. 
Pathogen: A disease-causing organism, either a virus, mycobacterium, or bacterium. 
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A description of all the aspects of cell culture and viral vaccine manufacture may be found in Spier 
RE, Griffiths JB. 1985. Animal Cell Biotechnology, vols 1 to 3. London, Academic Press. 
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Gene therapy, the transfer of genes into cells for a therapeutic effect, is a new approach to the treatment 
of disease. The first clinically applicable system for efficiently delivering genes into mammalian cells was 
developed in the early 1980s and was based on a genetically engineered retrovirus, which, as part of its 
lifecycle, stably integrates its genome into the target cell's chromosomal DNA. Using recombinant DNA 
technology perfected in the mid-1970s, investigators replaced the viral genes with therapeutic genes and 
the resulting recombinant retrovirus shuttled these genes into the target cells. The potential applications 
of gene therapy are far reaching (Table 106.1) since there are over 4000 known human genetic diseases 
(many of which have no viable treatment) and virtually every human disease is profoundly influenced 
by genetic factors [Anderson, 1992]. In one of the first clinical applications, gene therapy was used to 
treat patients with ADA (adenosine deaminase) deficiency, a genetic defect which causes severe combined 
immune deficiency (SCID) and death at an early age [Anderson, 1992]. The patient's lymphocytes were 
isolated and transduced (insertion of a foreign gene into the genome of a cell) by a recombinant retrovirus 
encoding a functional ADA gene. These transduced cells were expanded in culture then reinfused back 
to the patient. These protocols were conducted after an exhaustive peer review process that laid the 
groundwork for future gene therapy protocols. Successful treatment of other genetic diseases is likely to 
be achieved in the future [Levine and Friedmann, 1993; Roemer and Friedmann, 1992]. In addition to 
inherited diseases, other viable targets for gene therapy include more prevalent disorders that show a 
complex genetic dependence (i.e., cancer and heart disease) as well as infectious diseases (i.e., human 
immunodeficiency virus (HIV) and applications in tissue engineering [Anderson, 1992; Morgan and 
Yarmush, 1998]. 

106.1 Background 

Gene therapy protocols conduct gene transfer in one of two settings; either ex vivo or in vivo [Mulligan, 
1993]. For ex vivo gene therapy, target cells or tissue are removed from the patient, grown in culture, 
genetically modified and then reinfused or retransplanted into the patient [Ledley, 1993]. Ex-vivo gene 
therapy is limited to those tissues which can be removed, cultured in vitro and returned to the patient 
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TABLE 106.1 Target Diseases for Gene Therapy 









Gene Delivery 


Target Disease 


Target Tissues 


Corrective Gene 


Systems Used 


Inherited 








ADA deficiency 


Hematopoietic cells 


ADA 


RV, RM 


Alpha- 1 antitrypsin deficiency 


Fibroblasts 


Alpha- 1 antitrypsin 


RV 




Hepatocytes 




RV 




Lung epithelia cells 




AV 




Peritoneal mesothelial cells 




AV 


Alzheimer's Disease 


Nervous system 


Nerve growth factor 


RV, AV, HSV 


Cystic fibrosis 


Lung epithelia cells 


CFTR 


AV.AAV, RM, L 


Diabetes 


Fibroblasts 


Human insulin 


RV 




Hepatocytes 




L 


Duchenne muscular dystrophy 


Muscle cells 


Dystrophin 


AV.DI 


Familial hypercholesterolemia 


Hepatocytes 


LDL receptor 


RV, AV, RM 


Gaucher disease 


Hematopoietic cells 


Glucocerebrosidase 


RV 




Fibroblasts 




RV 


Growth hormone deficiency 


Endothelial cells 


Human growth hormone 


RV 




Fibroblasts 




TR 




Keratinocytes 




RV 




Muscle cells 




RV 


Hemoglobinopathies 


Hematopoietic cells 


a or p-globin 


RV,AAV 


Hemophilia 


Fibroblasts 


Factor VIII, IX 


RV, Dl, L 




Keratinocytes 




RV 




Hepatocytes 




RV, AV, RM, L 




Muscle cells 




RV 


Leukocyte adhesion deficiency 


Hematopoietic cells 


CD- 18 


RV 


Parkinson's disease 


Nervous system 


Tyrosine hydroxylase 


RV, AV, HSV 


Phenylketonuria 


Hepatocytes 


Phenylalanine hydroxylase 


RV 


Purine nucleoside 


Fibroblasts 


Purine nucleoside phosphorylase 


RV 


phosphorylase deficiency 








Urea cycle disorders 


Hepatocytes 


Ornithine transcarbamylase or 
arginosuccinate synthetase 


AV 


Acquired 








Cancer 


Acute lymphoblastic leukemia 


p53 


RV 




Brain tumors 


HSV thymidine kinase 


RV 




Carcinoma 


y-interferon 


TR 




Melanoma 


Tumor necrosis factor 


RV 




Retinoblastoma 


Retinoblastoma gene 


RV 


Infectious diseases 


HIV 


Dominant negative Rev 


RV 






TAR decoy 


RV 






RRE decoy 


TR 






Diptheria toxin A 


RV 


Cardiomyopathy 


Muscle cells 


(Used reporter gene) 


Dl 


Emphysema 


Lung epithelial cells 


Alpha- 1 antitrypsin 


AV 


Local thrombosis 


Endothelial cells 


Anti-clotting factors 


RV 


Vaccines 


Muscle cells 


Various 


Dl 



Note: AAV = recombinant adeno-associated viruses 
AV = recombinant adenoviruses 
Dl = direct injection 

HSV = recombinant herpes simplex virus 
L = lipofection 
RM = receptor mediated 
RV = recombinant retroviruses 
TR = transfection 
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TABLE 106.2 Physical Characteristics of Recombinant Virions 



Characteristic 


Units 


Recombinant Retroviruses 


Recombinant Adenoviruses 


Recombinant AAV 


Genome type 




ss RNA (2 per virion) 


dsDNA 


ssDNA 


Genome size 


Bases 


8300 


36000 


4700 


Genome MW 


Daltons 


3x 10" 


20-25 x 10" 


1.2-1.8 x 10" 


Particle diameter 


run 


90-147 


65-80 


20-24 


Particle mass 


Grams 


3.6 X 10- 16 


2.9 x 10- 16 


1.0 X 10-' 7 


Composition 










DNA/RNA 


% 


2 


13 


26 


Protein 


% 


62 


87 


74 


Lipid 


% 


36 








Density 


g/cm 3 CsCl 


1.15-1.16 


1.33-1.35 


1.39-1.42 


Enveloped? 


Yes/no 


Yes 


No 


No 


Shape 




Spherical 


Icosahedral 


Icosahedral 


Surface projections 


Yes/no 


Yes 


Yes 


No 


Number 




-60-200 


12 




Length 


mn 


5 


25-30 




Max diameter 


inn 


8 


4 




Virus titer 


pfu/ml 


10»-10 7 


10 10 -10 12 


10 5 -10 7 


Integration? 




Yes, random 


No, episomal 


Yes, chromosome 19 



and cannot be applied to many important target tissues and organs such as the lungs, brain and heart. 
For in vivo gene therapy, the gene transfer agent is delivered directly to the target tissue/organ, and gene 
transfer occurs in the patient rather than in the tissue culture dish [Mulligan, 1993]. Both strategies have 
inherent advantages and disadvantages and current research is evaluating which approach can best meet 
the needs of a particular disease. 

Gene delivery systems can be classified as either viral or non-viral [Friedmann, 1989]. For viral gene 
transfer, one of several different types of viruses is engineered to deliver genes. Typically, viral genes are 
removed to prevent self-replication of the virus and to provide room for the insertion of one or more 
therapeutic genes that the recombinant virus will carry. To further ensure the safety of the recombinant 
viruses, specialized packaging cell lines have been developed to produce the recombinant viruses and 
minimize the production of infectious wild-type viruses. Some viruses are able to integrate the therapeutic 
genes into the target cell's nuclear DNA (retroviruses, adeno associated viruses), whereas others are not 
(adeno viruses) (Table 106.2). 

Non-viral gene transfer systems are based on a variety of technologies that employ physical/chemical 
means to deliver genes [Feigner and Rhodes, 1991]. These technologies include direct plasmid injection, 
bombardment with DNA coated microprojectiles, and DNA complexed with liposomes. Some non-viral 
transfection techniques are too inefficient (e.g., coprecipitation of DNA with calcium phosphate [Chen 
and Okayama, 1987], DNA complexed with DEAE-dextran [Pagano et al., 1967], electroporation [Neu- 
mann et al., 1982]) or laborious (e.g., microinjection of DNA [Capecchi, 1980]) for clinical use. Only 
those gene delivery systems (viral and non-viral) with potential for clinical application will be discussed 
in this article. The main features of these technologies (Table 106.3) will be described and specific 
examples of their applications highlighted. 

106.2 Recombinant Retroviruses 

Many of the approved clinical trials have utilized recombinant retroviruses for gene delivery. Retroviral 
particles contain two copies of identical RNA genomes that are wrapped in a protein coat and further 
encapsidated by a lipid bilayer membrane. The virus attaches to specific cell surface receptors via surface 
proteins that protrude from the viral membrane. The particle is then internalized and its genome is 
released into the cytoplasm, reverse transcribed from RNA to DNA, transported into the nucleus and 
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TABLE 106.3 Features of the Various Gene Transfer Systems 



Features 

Maximum transgene size 
Maximum concentration 

(vectors/ml) 
Transfers genes to quiescent cells 
Integrates transgene into target 

cell genome 
Persistence of gene expression 
Immunological problems 
Pre-existing host immunity 
Stability 

Ease of large scale production 
Safety concerns 



Retrovirus 




AAV 


Adenovirus 


Non-viral 


8kb 


4.7 kb 




36 kb 


8>36kb 


~10 7 


~10 12 




~10 12 


8>36kb 


No/Yes* 


Yes 




Yes 


Yes 


Yes 


Yes 




No 


No 


wks— yrs 


yrs 




wks-mos 


days— wks 


Few 


None known 


Extensive 


None 


No 


Yes 




Yes 


No 


Poor 


Good 




Good 


Good 


Difficult 


Difficult 




Easy 


Easy 



Insertional mutagenesis Inflammation toxicity Inflammation toxicity Toxicity 



* Recombinant lentiviruses, such as human immunodeficiency virus, are capable of transducing quiescent cells. 

then integrated into the cell's chromosomal DNA. The integrated viral genome has LTRs (long terminal 
repeats) at both ends which encode the regulatory sequences that drive the expression of the viral genome 
[Weiss et al., 1982]. 

Retroviruses used for gene transfer are derived from wild-type murine retroviruses. The recombinant 
viral particles are structurally identical to the wild type virus but carry a genetically engineered genome 
(retroviral vector) which encodes the therapeutic gene of interest. These recombinant viruses are inca- 
pable of self- replication but can infect and insert their genomes into a target cell's genome [Morgan et al., 
1993]. 

Recombinant retroviruses, like all other recombinant viruses, are produced by a two part system 
composed of a packaging cell line and a recombinant vector (Fig. 106.1) [Anderson, 1992; Levine and 
Friedmann, 1993]. The packaging cell line has been engineered to express all the structural viral genes 
(gag, pol and env) necessary for the formation of an infectious virus particle, gag encodes the capsid 
proteins and is necessary for encapsidation of the vector, pol encodes the enzymatic activities of the virus 
including reverse transcriptase and integrase. env encodes the surface proteins on the virus particle which 
are necessary for attachment to the target cell's receptors. 

The retroviral vector is essentially the wild-type genome with all the viral genes removed. This vector 
encodes the transgene(s) and the regulatory sequences necessary for their expression as well as a special 
packaging sequence, (l|/), that is required for encapsidation of the genome into an infectious viral particle 
[Morgan et al., 1993]. To produce recombinant retrovirus particles, the retroviral vector is transfected 
into the packaging cell line. The structural proteins expressed by the packaging cell line recognize the 
packaging sequence on RNAs transcribed from the transfected vector and encapsidate them into an 
infectious virus particle that is subsequently exocytosed by the cell and released into the culture medium. 
This medium containing infectious recombinant viruses is harvested and used to transduce target cells. 

Many different retroviral vector designs have been used (Fig. 106.2). A commonly used vector encodes 
two genes, one expressed from the TTR and the other from an internal promoter (Fig. 106.2c) [Miller, 
1992]. Often, one gene expresses a therapeutic protein and the other a selectable marker that makes the 
modified cells resistant to selective media or a drug. This allows the investigator to establish a culture 
composed solely of transduced cells by growing them under the selective conditions. Several configura- 
tions are possible, but the optimum design is often dictated by the transgene(s) being expressed and the 
cell type to be transduced. Vector configuration is crucial for maximizing viral titer and transgene 
expression [Roemer and Friedmann, 1992]. 

As with all gene transfer technologies, there are advantages and disadvantages to the use of recombinant 
retroviruses. Retroviruses can only transduce dividing cells since integration requires passage of the target 
cells through mitosis [Roe et al., 1993]. This limits the use of recombinant retroviruses for in vivo gene 
therapy, since few normal cells are actively dividing. Recently, however, new retroviral vectors based on 
lentiviruses have been developed that are capable of transducing non-dividing cells [Naldini et al., 1996]. 
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FIGURE 106.1 Packaging cell line for retrovirus. A simple retroviral vector composed of 2 LTR regions which flank 
sequences encoding the packaging sequence Q¥) and a therapeutic gene. A packaging cell line is transfected with this 
vector. The packaging cell line expresses the three structural proteins necessary for formation of a virus particle (gag, 
pol, and env). These proteins recognize the packaging sequence on the vector and form an infectious virion around 
it. Infectious virions bud from the cell surface into the culture medium. The virus-laden culture medium is filtered 
to remove cell debris and then either immediately used to transduce target cells or the virions are purified and/or 
concentrated and frozen for later use. 

These HIV based vectors were able to achieve stable gene transfer after injection into a non-dividing 
tissue (brain) in vivo. 

Other drawbacks of retroviral vectors include: (a) a limitation to the size of inserted genes 
(<8 kilobases) [Roemer and Friedmann, 1992]; (b) the particles are unstable and lose activity at 37°C 
with a half-life of 5-7 hours [Kotani et al., 1994; Le Doux et al., 1999]; and (c) virus producer cell lines 
typically produce retrovirus in relatively low titers (10 5 to 10 7 infectious particles per ml) [Paul et al., 
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FIGURE 106.2 Standard retroviral vector designs. LTRs are shown as black boxes. S D and S A represent splice donor 
and acceptor sites, respectively. *F indicates the packaging signal sequence. Internal promoters are the stippled boxes 
and transcription start sites are indicated by arrows, (a) A single-gene vector. Transcription is driven from the LTR. 
(b) A single-gene vector expressed from an internal promoter, (c) A typical two-gene vector. One gene is the 
therapeutic gene, and the other is a selectable marker gene. Gene A is driven from the LTR, and gene B is expressed 
from the internal promoter, (d) A two-gene vector in which transcription is driven from the LTR. The efficiency of 
expression of gene B is a function of the efficiency of splicing, (e) A self-inactivating vector. A deletion in 3' LTR is 
copied to the 5' LTR during reverse transcription. This eliminates any transcription from the LTR. Internal promoters 
are still active. This vector reduces the chance of insertional activation of a proto-oncogene downstream from the 3' 
LTR. 

1993] . The viral titer is a function of several factors including the producer cell line, the type of transgene 
and the vector construction. Moreover, purification and concentration of retroviruses without loss of 
infectivity is difficult [Andreadis et al., 1999]. Standard techniques such as centrifugation, column filtra- 
tion or ultrafiltration have failed [Le Doux et al., 1998; McGrath et al., 1978]. More recently, hollow fiber 
[Paul et al., 1993] and tangential flow filtration [Kotani et al., 1994] have been used with some success. 
A chimeric retrovirus in which the normal envelope proteins were replaced by VSV (Vesicular Stomatitis 
Virus) envelope proteins was readily concentrated by centrifugation [Burns et al., 1993]. 

Stability and level of expression has also been cited as a significant drawback for retrovirally transduced 
cells. Although long term expression has been documented in some systems such as bone marrow, 
fibroblasts, hepatocytes and muscle cells, there have also been reports of transient expression in retrovi- 
rally transduced cells [Roemer and Friedmann, 1992]. This unstable expression has been attributed to 
several causes including methylation of the DNA and in some cases rearrangement or loss of the proviral 
sequences [Roemer and Friedmann, 1992]. 

The use of recombinant retroviruses has raised two major safety concerns [Roemer and Friedmann, 
1992; Temin, 1990]. Older packaging cell lines occasionally produced replication competent virus by 
homologous recombination between the retroviral vector and the packaging cell line's retroviral 
sequences. New packaging cell lines have made the production of replication competent viruses essentially 
impossible [Danos and Mulligan, 1988]. The other safety concern was the possibility that the integration 
of the recombinant retrovirus would activate a proto-oncogene. The probability of this event is very low, 
and since several mutations are required for a cell to become cancerous, the risk of cellular transformation 
is extremely low and is typically outweighed by the potential therapeutic benefits [Temin, 1990]. 

106.3 Recombinant Adenoviruses 

Recombinant adenoviruses have a number of properties that make them a useful alternative to recom- 
binant retroviruses for human gene transfer. Recombinant adenoviruses are well- characterized, relatively 
easy to manipulate, can be grown and concentrated to very high titers (up to 10 13 infectious particles/ml), 
are stable particles, and can transduce a wide variety of cell types [Crystal, 1995; Hitt et al., 1997]. 
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Recombinant adenoviruses are also a relatively safe gene transfer system. Replication competent viruses, 
should they contaminate virus stocks or form by recombination with wild-type adenoviruses from latent 
infections, do not pose any significant risk since they elicit only a mild, self-limiting infection [Parks 
et al., 1996]. Furthermore, genes transferred by recombinant adenoviruses do not integrate, eliminating 
the risk of insertional mutagenesis of the chromosomal DNA of the target cell [Crystal, 1995]. Most 
important, recombinant adenoviruses efficiently transfer genes to non-dividing, as well as dividing, cells 
which makes possible the in vivo transduction of tissues composed of fully differentiated or slowly 
dividing cells such as the liver and lung [Mulligan, 1993]. 

Adenoviruses consist of a large double stranded DNA genome (about 36 kilobase pairs long) packaged 
within a non-enveloped icosahedral capsid that is primarily composed of three virus-encoded proteins 
(hexon, penton base, and fiber proteins) [Horwitz, 1990]. The fiber proteins protrude from the surface 
of the virus and mediate its attachment to target cells via a high affinity interaction with cell surface 
receptors. The virus is then internalized into endosomal vesicles via specific interactions between the 
penton base proteins and a v integrins [Wickham et al., 1993]. Adenoviruses escape these vesicles by an 
acid-induced endosomolytic activity and are transported to the nucleus, into which they enter via pores 
in the nuclear membrane [Greber et al., 1993]. 

The wild type adenovirus genome consists primarily of five early genes (E1-E5) each of which is 
expressed from their own promoters [Horwitz, 1990]. There are also five late genes (L1-L5) which are 
expressed from the major late promoter (MLP). The first generation of recombinant adenoviruses were 
based on a mutant adenovirus in which the El region (and in some cases the E3 region) was deleted. 
The El region is required for replication. Nevertheless, El minus mutants can be grown on specialized 
packaging cell lines (293 cells) which express the El gene and therefore provide the necessary functions 
for virus production [Levine and Friedmann, 1993]. 

To generate recombinant adenoviruses, a plasmid that contains the gene of interest, flanked by aden- 
ovirus sequences, is transfected into 293 cells in which adenoviruses that lack the El region are actively 
replicating (Fig. 106.3). The virus stock is screened for the rare recombinants in which the gene of interest 
has correctly recombined with the El minus mutant by homologous recombination. These recombinant 
virions are purified and grown to high titer on 293 cells [Morgan et al., 1994]. 

Recombinant adenoviruses have been successfully used for in vivo gene transfer in a number of animal 
models and in several clinical protocols, including those which delivered a functional copy of the cystic 
fibrosis transmembrane conductance regulator (CFTR) gene into the nasal epithelium and lungs of cystic 
fibrosis patients [Boucher and Knowles, 1993; Welsh, 1993; Wilmott and Whitsett, 1993]. Unfortunately, 
transgene expression was short-lived (5 to 10 days) [Verma and Somia, 1997]. Short-lived expression is 
a disadvantage for the treatment of genetic or chronic disorders because, in order to maintain the 
therapeutic effect of the transgene, recombinant adenoviruses would have to be administered to the 
patient repeatedly. Repeat administration often fails due to the presence of neutralizing antibodies that 
form in response to the first administration of the recombinant adenoviruses [Hitt et al., 1997]. Alter- 
natively, short lived high expression from adenovirus vectors might be therapeutically useful for appli- 
cations where transient expression is preferred. Such applications include the stimulation of angiogenesis 
by adenoviruses expressing vascular endothelial growth factor (VEGF) [Magovern et al., 1997; Morgan 
andYarmush, 1998]. 

Short-lived gene expression is currently one of the most significant limitations of recombinant aden- 
oviruses for use in human gene therapy. Although short-lived gene expression could be the result of 
vector cytotoxicity, promoter shutoff, or loss of the transgene DNA, most investigators believe it is 
primarily due to the elimination of transduced cells by transgene or adenovirus-specific cytotoxic 
T-lymphocytes [Dai et al, 1995; Engelhardt et al, 1994; Scaria et al, 1998; Zsengeller et al, 1995]. As a 
result, most efforts to increase the longevity of gene expression have focused on eliminating the immune 
rejection of cells transduced by recombinant adenoviruses. Two major strategies have been pursued. One 
strategy has been to construct recombinant adenoviruses that encode fewer immunogenic adenoviral 
proteins, such as by deleting portions of the E2 region or the E4 region. Such vectors have the added 
benefit of being able to accommodate the cloning of much larger inserts of therapeutic DNA (to as much 
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FIGURE 106.3 Isolation of recombinant adenovirus. A packaging cell line (293 cells) which expresses the El gene 
is infected with an El-minus mutant adenovirus. The adenovirus is derived from a plasmid encoding the wild-type 
adenovirus genome. The El region of the adenovirus genome is replaced with the therapeutic gene, and the resultant 
plasmid (the adenovirus vector) is transfected into the 293 cell line which is infected by the mutant adenovirus. Since 
the therapeutic gene is flanked by adenoviral sequences, the mutant adenovirus genome and the adenovirus vector 
will occasionally undergo homologous recombination and form an infectious recombinant adenovirus whose genome 
encodes the therapeutic gene. These rare recombinants are isolated then grown on another 293 cell line. 
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as 36 kilobases) than was possible with the first generation (El and E3 minus) recombinant adenoviruses 
which could accommodate only 7 kilobases of foreign DNA [Morsy et al., 1998; Parks et al., 1996]. 

The second strategy is to block the host immune response. For example, investigators have co-injected 
recombinant adenoviruses with immunomodulatory compounds (e.g., deoxyspergualin, IL-12, LFN- 
gamma) that transiently suppressed the immune system, eliminated the humoral response, and made 
possible repeat administration of the vector [Hitt et al., 1997]. Others have co-injected vectors that encode 
for immunosuppressive compounds that induce tolerance to recombinant adenoviruses (e.g., CTLA4Ig), 
or are developing recombinant adenoviruses whose backbones contain genes that encode for immuno- 
suppressive proteins (e.g., Ad5 E3 19-kDa protein or HSV ICP47) [Nakagawa et al., 1998; Wold and 
Gooding, 1989; York et al, 1994]. 

As a result of these and similar efforts, the persistance of gene expression in vivo has been extended 
to as long as 5 months [Kay et al., 1995; Scaria et al., 1998]. It remains to be determined, however, if 
these methods can eliminate the immune response against recombinant adenoviruses and their products, 
whether or not elimination of the immune response results in long-term gene expression, and if these 
methods will be effective in human gene therapy protocols. 

106.4 Recombinant Adeno-Associated Viruses 

Adeno-associated viruses (AAV) are another virus-based gene transfer system that has significant potential 
for use in human gene therapy. AAV are small, non-enveloped human parvoviruses that contain a single- 
stranded DNA genome (4.7 kilobases) and encode two genes required for replication, rep and cap 
[McCarty and Samulski, 1997]. AAV are stable, have a broad host range, can transduce dividing and 
non-dividing cells, and do not cause any known human disease [Verma and Somia, 1997]. AAV require 
the presence of a helper virus (typically adenovirus) to replicate [Muzyczka, 1992] . When no helper virus 
is present, AAV do not replicate but instead tend to establish a latent infection in which they permanently 
integrate into the chromosomal DNA of the target cells [Inoue and Russell, 1998]. Wild-type AAV 
preferentially integrates into a specific site in chromosome 19 due to the interaction of the virus-encoded 
protein Rep with the host cell DNA [McCarty and Samulski, 1997]. Recombinant AAV, however, are Rep 
negative and as a result integrate randomly into the chromosomal DNA of the target cell. 

To generate recombinant AAV, human 293 cells are transfected with the AAV vector, a plasmid that 
contains the therapeutic gene of interest flanked by the 145-bp AAV inverted terminal repeats (ITR) that 
are necessary for its encapsidation into a virus particle [Ferrari et al., 1997] (Fig. 106.4). The cells are 
also transfected with a helper plasmid, a plasmid that encodes for the virus proteins necessary for particle 
formation and replication (i.e., AAV rep and cap genes). The transfected cells are infected with wild-type 
adenovirus which supplies helper functions required for amplification of the AAV vector [Morgan and 
Yarmush, 1998]. Contaminating adenovirus is removed or inactivated by density gradient centrifugation 
and heat inactivation. Recombinant AAV generated by this and similar methods have been successfully 
used to achieve long term expression of therapeutic proteins in a number of cell types and tissues, 
including in the lung, muscle, liver, central nervous system, retina, and cardiac myocytes [Bartlett et al., 
1998; Flannery et al, 1997; Flotte et al., 1993; Maeda et al, 1998; Monahan et al., 1998; Xiao et al, 1998]. 
Therapeutic effects have been achieved in a number of model systems, including in a dog model of 
hemophilia and a mouse model for obesity and diabetes [Monahan et al., 1998; Murphy et al., 1997]. 

Despite these early successes, several technical issues must be addressed before recombinant AAV can 
be used for human gene therapy on a routine basis. For example, little is known about the conditions 
or factors which enhance recombinant AAV transduction efficiency in vitro or in vivo [McCarty and 
Samulski, 1997]. Nor is it well-understood what controls the efficiency with which recombinant AAV 
integrate into the chromosomal DNA of target cells or why the efficiency of integration is so low (< 1%). 
Perhaps the most significant technical issue, however, is that the current methods for producing recom- 
binant AAV (see above) are tedious, labor intensive, and not well-suited for producing clinical grade virus. 

Current methods are not adequate for producing a clinical grade virus for several reasons. First, current 
methods do not produce enough virus particles to be useful for many gene therapies. For example, it 
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FIGURE 106.4 Production of recombinant AAV. Human cells are transfected with a plasmid encoding the thera- 
peutic gene which is driven from a heterologous promoter (cross-hatched box) and flanked with AAV terminal repeats 
(stippled boxes). The cells are also transfected with a complementing plasmid encoding the AAV rep and cap genes, 
which cannot be packaged because they are not flanked by AAV terminal repeats. The rep and cap genes, whose 
products are required for particle formation, are flanked by adenovirus 5 terminal fragments (black boxes) which 
enhance their expression. The transfected cells are infected with wild-type adenovirus which supplies helper functions 
required for amplification of the AAV vector. The virus stock contains both AAV recombinant virus and adenovirus. 
The adenovirus is either separated by density gradient centrifugation or heat inactivated. 

has been estimated that 10 14 recombinant viruses will be needed to achieve systemic production of 
therapeutic levels of proteins such as factor IX or erythropoietin [Inoue and Russell, 1998]. With current 
production methods, which yield 10 8 to 10 9 viruses per ml, about 100 to 1000 liters of virus would have 
to be produced for each treatment. Second, current methods require the use of helper adenovirus which 
is a significant disadvantage because their removal is tedious and labor intensive. In addition, there is 
the risk that residual contaminating adenovirus proteins will be present and stimulate the immune 
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rejection of transduced cells [Xiao et al., 1998]. Finally, the use of transient transfection is not amenable 
to scale up and significantly increases the likelihood that replication competent AAV will be generated 
by recombinogenic events [Inoue and Russell, 1998]. 

To overcome these problems, investigators are working towards the development of stable packaging 
cell lines that produce recombinant AAV. Substantial increases in the yield of recombinant AAV have 
been achieved by increasing in the virus producer cell lines the number of copies of the AAV vector or 
of the helper plasmid. For example, one group developed an AAV packaging cell line which contained 
integrated helper and vector constructs that were linked to the simian virus 40 replication origin [Inoue 
and Russell, 1998]. These packaging cells could be induced to express SV40 T antigen, which then 
amplified the helper and vector constructs by 4 to 10 fold, resulting in 5 to 20 fold increases in recombinant 
AAV production. Progress has also been made towards the elimination of the need for helper adenovirus. 
For example, adenovirus miniplasmids have been constructed that provide all of the adenovirus helper 
functions but do not support the production of adenovirus proteins or viruses [Xiao et al., 1998]. Use 
of adenovirus miniplasmids, in conjunction with an improved helper plasmid, have improved the yield 
of recombinant AAV by up to 40-fold. Most likely a combination of strategies such as these will be 
necessary to develop a stable producer cell line that will be free of helper adenovirus and that will generate 
recombinant AAV at levels that are adequate for use in human gene therapies. 

106.5 Direct Injection of Naked DNA 

Direct injection of plasmid DNA intramuscularly, intraepidermally, and intravenously is a simple and 
direct technique for modifying cells in vivo. DNA injected intramuscularly or intraepidermally is inter- 
nalized by cells proximal to the injection site [Hengge et al., 1995; Wolff et al., 1990]. DNA injected 
intravenously is rapidly degraded in the blood (t ln < 10 min.) or retained in various organs in the body, 
preferentially in the non-parenchymal cells of the liver [Kawabata etal., 1995; Lew et al., 1995]. Gene 
expression after direct injection has been demonstrated in skeletal muscle cells of rodents and non-human 
primates, cardiac muscle cells of rodents, livers of cats and rats, and in thyroid follicular cells of rabbits 
[Hengge et al., 1995; Hickman et al., 1994; Levine and Friedmann, 1993; Sikes et al., 1994]. 

The efficiency of gene transfer by direct injection is somewhat inefficient [Doh et al., 1997]. As little 
as 60-100 myocardial cells have been reported to be modified per injection [Acsadi et al., 1991]. Higher 
efficiencies were observed when plasmids were injected into regenerating muscle cells [Wells, 1993] or 
co-injected with recombinant adenoviruses [Yoshimura et al., 1993]. Injected DNA does not integrate, 
yet gene expression can persist for as long as two months [Wolff et al., 1990]. Levels of gene expression 
are often low but can be increased by improvements in vector design [Hartikka et al., 1996]. 

Despite the low efficiency of gene transfer and expression, direct injection of plasmid DNA has many 
promising applications in gene therapy, particularly when low levels of expression are sufficient to achieve 
the desired biological effect. For example, several patients suffering from critical limb ischemia were 
successfully treated by injection of DNA encoding human vascular endothelial growth factor (VEGF), a 
potent angiogenic factor [Isner et al., 1996; Isner et al., 1996; Melillo et al, 1997; Tsurumi et al., 1996]. 
Following injection of DNA into the ischemic limbs of ten patients, VEGF expression was detected in 
their serum, new blood vessels were formed in 7 of the 10 patients, and 3 patients were able to avoid 
scheduled below-the-knee amputations [Baumgartner et al., 1998]. 

Direct injection of DNA could also be used to vaccinate patients against pathogens as evidenced by 
the effects of direct injection of plasmid DNA encoding pathogen proteins or immunomodulatory 
cytokines [Donnelly etal., 1997; Sato etal., 1996]. Direct injection may also be an effective way to 
systemically deliver therapeutic proteins [Tripathy et al., 1996]. For example, the incidence of autoim- 
mune diabetes in a mouse model of the disease was significantly reduced in mice that were injected 
intramuscularly with DNA encoding interleukin 10, an immunosuppressive cytokine [Nitta et al., 1998]. 

In contrast to viral-based delivery systems, there is little restriction on the size of the transgene that 
can be delivered by direct DNA injection. As a result, direct DNA injection is particularly well-suited for 
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treating disorders that require the delivery of a large transgene. For example, Duchenne's muscular 
dystrophy, a genetic disease of the muscle caused by a defect in the gene for dystrophin (12 kilobases) 
can potentially be treated by direct DNA injection [Acsadi et al., 1991]. 

106.6 Particle-Mediated Gene Transfer 

Particle-mediated gene transfer is an alternative method used to deliver plasmid DNA to cells. DNA 
coated gold particles are loaded onto a macro-projectile which is then accelerated through a vacuum 
chamber to high velocity by a burst of helium or a voltage discharge until it hits a stopping plate [Yang 
et al., 1996]. Upon impact, the DNA-coated microprojectiles are released through a hole in the stopping 
plate, penetrate the target tissue, and the transferred gene is expressed [Morgan and Yarmush, 1998; Yang 
et al., 1996]. Genes have been introduced and expressed in a number of cell types and tissues, including 
skin, liver, spleen, muscle, intestine, hematopoietic cells, brain, oral mucosa and epidermis, tumor 
explants, and cells of developing mouse embryos [Jiao et al., 1993; Keller et al., 1996; Mahvi et al., 1996; 
Morgan and Yarmush, 1998; Verma et al., 1998; Zelenin et al., 1993]. Similar to direct DNA injection, 
there are few constraints on the size of the DNA that can be delivered. 

The efficiency of particle-mediated gene transfer varies with tissue type, but in general it is most 
efficient in the liver, pancreas, and epidermis of the skin, and least efficient in muscle, vascular, and 
cardiac tissues [Rakhmilevich and Yang, 1997]. For example, 20 percent of bombarded epidermal cells 
of the skin, but only 1 to 3 percent of bombarded muscle cells, expressed the transferred gene [Williams 
et al., 1991; Yang et al, 1990]. 

106.7 Liposome-Mediated Gene Delivery 

Liposomes made from a mixture of neutral and cationic lipids have also been used to deliver plasmid 
DNA to cells. Liposomes are relatively easy to make, can be made with well-defined biophysical properties, 
and can accommodate virtually any size transgene [Kay et al., 1997; Tomlinson and Rolland, 1996] . Small 
unilamellar liposomes ranging from 20 to 100 nm in diameter are prepared by sonication of a mixture of 
cationic (e.g., DOTMA, N-{l-(2,3-dioleyloxy)propyl}-N,N,N-triethylammonium) and neutral (e.g., 
DOPE, dioleoyl-phosphatidylethanolamine) lipids, followed by extrusion through a porous polycarbonate 
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FIGURE 106.5 Common constituents of cationic liposomes. A typical cationic liposome is composed of a mixture 
of cationic lipids and neutral phospholipids. DOTMA {N-[l-(2,3,-dioleyloxy]-JV,N,JV-trimethylammonium chloride) 
is a synthetic cationic lipid which attaches to DNA using its positively charged head group. DOPE {l,2-Di[(ds)-9- 
octadecenoyl]-sn-glycero-3-phosphoethanolamine( is a neutral phospholipid which enhances the activity of these 
cationic liposomes. The resultant DNA-liposome complex is positively charged and reduces or eliminates the repulsive 
electrostatic forces between the negatively charged DNA and the negatively charged cell surface. 
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filter (e.g., 100 nm pore size) (Fig. 106.5) [Morgan and Yarmush, 1998; Radler et al, 1997]. DNA is added 
to the cationic liposomes, binds non-covalently to the positively charged cationic lipids, and induces a 
topological transition from liposomes to multilamellar structures composed of lipid bilayers alternating 
with DNA monolayers [Feigner and Ringold, 1989; Radler et al., 1997]. The size of the structures depends 
on their overall charge, with charged structures (negative or positive) being the smallest (about 100 nm 
in diameter) due to stabilization by electrostatic repulsion and neutral structures being the largest (greater 
than 3000 nm in diameter) due to aggregation as a result of van der Waals attractive forces [Radler et al., 
1997]. The relationship between structure and transfection efficiency is not well-understood, although 
in general a slight excess of cationic lipid is needed for optimal gene transfer [Kay et al., 1997]. 

Gene transfer is accomplished by simply mixing or applying the lipid-DNA complexes to the target 
cells or tissue. Cationic liposome-DNA complexes have been used in a number of applications including 
transfer of genes to the arterial wall, lung, skin, and systemically by intravenous injection [Morgan and 
Yarmush, 1998; Zhu et al., 1993]. Genes delivered by cationic liposomes do not integrate so gene expres- 
sion is transient and there is minimal risk of insertional mutagenesis. Liposomes do not carry any viral 
sequences or proteins, and are relatively non-toxic and non-immunogenic [Nabel et al., 1993; Zhu et al., 
1993]. 

Liposome-mediated gene transfer is somewhat inefficient, however, in part due to the failure of a large 
fraction of lipid-complexed DNA to escape degradation in cellular endosomes [Xu and Szoka, 1996; 
Zabner et al., 1995]. Several strategies have been taken to overcome this limitation, including use of 
acidotropic bases to reduce the rate of degradation by raising the pH of the endosomes, coupling of 
liposomes to endosomolytic or fusogenic virus proteins, and developing new liposome formulations that 
use pH sensitive cationic lipids that become fusogenic in the acidic cellular endosomes [Budker et al., 
1996; Legendre and Szoka, 1992; Yonemitsu et al., 1997]. Substantial improvements in the efficiency of 
liposome-mediated gene transfer will only be achieved when the fundamental mechanisms that govern 
the interactions of cationic lipid-DNA complexes with cells are better understood. 

106.8 Other Gene Transfer Methods 

Several other gene transfer technologies have been tested in clinical trials or are in various stages of 
development. These include recombinant viruses such as vaccinia virus [Qin and Chatterjee, 1996], 
herpes simplex virus [Glorioso et al., 1997], canarypox virus [Kawakita et al., 1997], and fowlpox [Wang 
et al., 1995], and non-viral vectors that mimic some of the properties of viruses, such as DNA conjugated 
to proteins that promote binding to specific cell-surface receptors, fusion, or localization to the nucleus 
[Wu et al., 1994]. These methods are not discussed here in greater detail because even though they are 
capable of transferring genes to cells they have not yet been well-developed or extensively tested in the 
clinic. 

106.9 Summary and Conclusion 

Several gene transfer systems have been developed that have successfully transferred genes to cells, and 
have elicited various biological effects. To date, nearly 300 clinical trials have been approved to test their 
safety and efficacy (Table 106.4). Each system has unique features, advantages, and disadvantages that 
determine if its use in a particular application is appropriate. One principal consideration is whether or 
not permanent or temporary genetic modification is desired. Other important considerations include 
what the setting of gene transfer will be (ex vivo or in vivo), what level of gene expression is needed, and 
whether or not the host has pre-existing immunity against the vector. No single gene transfer system is 
ideal for any particular application and it is unlikely that such a universal gene transfer system will ever 
be developed. More likely, the current gene transfer systems will be further improved and modified, and 
new systems developed, that are optimal for the treatment of specific diseases. 
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TABLE 106.4 Current Clinical Experience Using Gene Transfer 
(as of December 1, 1998) 





Numbe 




Human gene transfer protocols 


r 


Percentage 


Gene marking studies 


30 


11 


Non-therapeutic protocols 


2 


1 


Therapeutic protocols 


238 


88 


Infectious diseases 


25 


9 


Inherited diseases 


35 


13 


Cancer 


167 


62 


Other diseases 


11 


4 


Strategies for gene delivery 






Ex vivo protocols 


134 


50 


In vivo protocols 


136 


50 


Gene transfer technologies 






Permanent genetic modification 






Recombinant retroviruses 


128 


47 


Recombinant adeno associated virus 


6 


2 


Temporary genetic modification 






Recombinant adenovirus 


60 


22 


Cationic liposomes 


37 


14 


Plasmid DNA 


16 


6 


Particle mediated 


2 


1 


Other 






Viral vectors 


19 


7 


Non-viral vector 


2 


1 



*'Other' viral vectors refers to recombinant canarypox virus, vac- 
cinia virus, fowlpox, and herpes simplex virus. 'Other' non-viral vec- 
tor refers to RNA transfer and antisense delivery. 



Defining Terms 

Chimeric retrovirus: A recombinant retrovirus whose structural proteins are derived from two or more 

different viruses. 
Ex-vivo: Outside the living body, referring to a process or reaction occurring therein. 
In-vitro: In an artificial environment, referring to a process or reaction occurring therein, as in a test 

tube or culture dish. 
In-vivo: In the living body, referring to a process or reaction occurring therein. 
Liposome: A spherical particle of lipid substance suspended in an aqueous medium. 
Packaging cell line: Cells that express all the structural proteins required to form an infectious viral 

particle. 
Plasmid: A small, circular extrachromosomal DNA molecule capable of independent replication in a 

host cell, typically a bacterial cell. 
Recombinant: A virus or vector that has DNA sequences not originally (naturally) present in their DNA. 
Retrovirus: A virus that possesses RNA-dependent DNA polymerase (reverse transcriptase) which 

reverse transcribes the virus' RNA genome into DNA then integrates that DNA into the host cell's 

genome . 
Transduce: To effect transfer and integration of genetic material to a cell by infection with a recombi- 
nant retrovirus. 
Vector: A plasmid or viral DNA molecule into which a DNA sequence (typically encoding a therapeutic 

protein) is inserted. 
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Cell engineering can be defined either academically, as "application of the principles and methods of 
engineering to the problems of cell and molecular biology of both a basic and applied nature" [Nerem, 
1991], or functionally, as "manipulation of cell function by molecular approaches" [Lauffenburger and 
Aebischer, 1993]. However, defined, there is no question that cell engineering is becoming a central area 
of biomedical engineering. Applications to health care technology include design of molecular therapies 
for wound healing, cancer, and inflammatory disease, development of biomaterials and devices for tissue 
regeneration and reconstruction, introduction of gene therapies to remedy a variety of disorders, and 
utilization of mammalian cell culture technology for production of therapeutic proteins. 

Cell engineering has its roots in revolutions that have occurred in the field of molecular cell biology 
over the past few decades (see Darnell et al., [1986, Chap. 1] for an excellent historical summary). Key 
advances upon which this field was built include electron microscopy in the 1 940s, permitting intracellular 
structural features to be seen; the identification of DNA as the biochemical basis of genetics, also in the 
1940s; the elucidation of the structure and function of DNA in the 1950s; the discovery of gene regulation 
in the 1960s; and progress in areas such as molecular genetics, light microscopy, and protein biochemistry 
in the 1970s, 1980s, and 1990s. Combining the powerful set of information and techniques from molec- 
ular cell biology with the analytical and synthetic approaches of engineering for purposeful exploitation 
and manipulation of cell function has given rise to the wide range of applications listed above. Engineering 
contributions include elucidation of kinetic, transport, and mechanical effects; identification and mea- 
surement of system parameters; and production of novel materials and devices required for control of 
cell and tissue functions by molecular mechanisms. All these aspects of engineering are essential to 
development of reliable technological systems requiring reproducible performance of cells and tissues. 

The central paradigm of cell engineering is employment of an engineering perspective of understanding 
system function in terms of underlying component properties, with the cells and tissues being the system 
and the molecular mechanisms being the components. Thus cell and tissue functions must be quantified 
and related to molecular properties that serve as the key design parameters. This paradigm is made 
especially powerful by the capability derived from molecular cell biology to alter molecular properties 
intentionally, permitting rational system design based on the identified parameters. 

107.1 Basic Principles 

The fundamental aspects of cell engineering are twofold: ( 1 ) quantitative understanding of cell function 
in molecular terms and (2) ability to manipulate cell function through molecular mechanisms, whether 
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by pharmacologic or genetic intervention or by introduction of materials or devices. Elucidation of basic 
technical principles that will stand a relatively long-term test of time is difficult because the field of 
molecular cell biology continues to experience incredibly rapid and unpredictable advances. However, 
an attempt can be made to at least outline central principles along the two fundamental lines cited above. 
First, one can consider the basic categories of cell function that may be encountered in cell engineering 
applications. These include proliferation, adhesion, migration, uptake and secretion, and differentiation. 
Second, one can consider basic categories of regulation of these functions. Two major loci of regulation 
are gene expression and structural/enzymatic protein activity; these processes act over relatively long- 
term (hours) and short-term (seconds to minutes) time spans, respectively. Thus a cell can change the 
identities and quantities of the molecular components it makes as well as their functional productivity. 
A third locus of regulation, which essentially encompasses the others, is receptor-mediated signaling. 
Molecular ligands interacting with cell receptors generate intracellular chemical and mechanical signals 
that govern both gene expression and protein activity and thus essentially control cell function. 

At the present time, bioengineering efforts to manipulate cell behavior in health care technologies are 
primarily centered on development of materials, devices, and molecular or cellular therapies based on 
receptor-ligand interactions. These interactions are amenable to engineering approaches when relevant 
biochemical and biophysical properties of receptors and their ligands are identified [Lauffenburger and 
Linderman, 1993]. Examples of such efforts and the basic principles known at the present time to be 
involved are described below in three categories of current major activity: cell proliferation, cell adhesion, 
and cell migration. 

107.2 Cell Proliferation 

Proliferation of mammalian tissue and blood cells is regulated by signaling of growth factor receptors 
following ligand binding. In some cases, the resulting regulation is that of an on/off switch, in which the 
fraction of cells in a population that are stimulated to move into a proliferative state is related to the 
level of receptor occupancy; an example of this is the control of blood cell proliferation in the bone 
marrow [Kelley et al., 1993]. In other cases, the rate of progression through the cell cycle, governed by 
the rate of movement into the DNA synthesis phase, is related to the level of receptor occupancy. For 
two examples at least, epidermal growth factor stimulation of fibroblast proliferation [Knauer et al., 1984] 
and interleukin 2 stimulation of T-lymphocyte proliferation [Robb, 1982], DNA synthesis is linearly 
proportional to the number of growth factor-receptor complexes, perhaps with a minimum threshold 
level. 

In engineering terms, two key principles emerge from this relationship. One is that the details of the 
biochemical signal transduction cascade are reflected in the coefficient relating DNA synthesis to the 
number of complexes, which represents an intrinsic mitogenic sensitivity. Thus, for a specified number 
of complexes, the cell proliferation rate can be altered by modifying aspects of the signal transduction 
cascade. The second principle is that mechanisms governing the number, and perhaps location, of growth 
factor-receptor complexes can influence cell proliferation with just as great an effect as signal transduction 
mechanisms. At the present time, it is more straightforward for cell engineering purposes to manipulate 
cell proliferation rates by controlling the number of signaling complexes using a variety of methods. 
Attempts to quantitatively analyze and manipulate components of receptor signal transduction cascades 
are also beginning [Mahama and Linderman, 1994; Renner et al., 1994]. 

Engineering methods helpful for providing desired growth factor concentrations include design of 
controlled-release devices [Powell et al., 1990] and biomaterials [Cima, 1994]. Endogenous alteration of 
ligand concentration and receptor number can be accomplished using gene transfer methods to introduce 
DNA that can increase ligand production or receptor expression or antisense therapy to introduce RNA 
that can decrease ligand production or reduce receptor expression. Levels of growth factor-receptor 
binding can be tuned by rational design of ligands; these might be antagonists that block binding of the 
normal growth factor and hence decrease complex levels or super- agonist growth factor mimics that 
provide for increased levels of complex formation over that of the normal growth factor. Rates or extents 
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of receptor and ligand trafficking processes can be altered by modification of the receptor, ligand, or 
accessory components. Such alterations can affect the number of signaling complexes by increasing or 
decreasing the amount of receptor down-regulation, ligand depletion, or compartmentation of complexes 
from the cell surface to intracellular locations. 

Regulation of fibroblast proliferation by epidermal growth factor (EGF) offers an instructive example 
of some of these approaches. Under normal circumstances, fibroblasts internalize EGF-receptor com- 
plexes efficiently via ligand-induced endocytosis, leading to substantial down-regulation of the EGF- 
receptor and depletion of EGF from the extracellular medium. The latter phenomenon helps account 
for the requirement for periodic replenishment of serum in cell culture; although growth factors present 
in serum could in principle operate as catalysts generating mitogenic signals through receptor binding, 
they generally behave as pseudonutrients because of endocytic internalization and subsequent degrada- 
tion. Use of a controlled-release device for EGF delivery thus improved wound-healing responses com- 
pared with topical administration by helping compensate for growth factor depletion due to cell 
degradation as well as proteolysis in the tissue and loss into the bloodstream [Buckley et al., 1985]. 
Transfecting the gene for EGF into fibroblasts can allow them to become a self- stimulating autocrine 
system not requiring exogenous supplementation [Will et al., 1995] . Altering EGF/EGF-receptor traffick- 
ing can be similarly useful. Fibroblasts possessing variant EGF receptors for which ligand-induced 
endocytic internalization is abrogated exhibit dramatically increased proliferation rates at low EGF 
concentrations, primarily due to a diminished depletion of EGF from the medium [Reddy et al., 1994]. 
Increasing the proportion of ligands or receptors sorted in the endosome to recycling instead of degra- 
dation also reduces ligand depletion and receptor down-regulation, thus increasing the number of growth 
factor-receptor complexes. An instance of this is the difference in response of cells to transforming growth 
factor alpha compared with EGF [French et al., 1994]; the receptor-binding affinity of TGFa is diminished 
at endosomal pH, permitting most of the TGFa-receptor complexes to dissociate with consequently 
increased receptor recycling, in contrast to EGF, which remains predominantly bound to the receptor in 
the endosome, leading to receptor degradation. TGFa has been found to be a more potent wound-healing 
agent, likely due to this alteration in trafficking [Schultz et al., 1987]. Hence ligands can be designed to 
optimize trafficking processes for most effective use in wound-healing, cell culture, and tissue engineering 
applications. 

Proliferation of many tissue cell types is additionally regulated by attachment to extracellular matrices 
via interactions between adhesion receptors and their ligands. The quantitative dependence of prolifer- 
ation on adhesion receptor-ligand interactions is not understood nearly as well as that for growth factors, 
though a relationship between the extent of cell spreading and DNA synthesis has been elucidated [Ingber 
et al., 1987] and has been applied in qualitative fashion for engineering purposes [Singhvi et al., 1994]. 
It is likely that a quantitative dependence of proliferation on adhesion receptor-ligand binding, or a 
related quantity, analogous to that found for growth factors will soon be developed and exploited in 
similar fashion by design of attachment substrata with appropriate composition of attachment factors. 
Properties of the cell environment affecting molecular transport [Yarmush et al., 1992] and mechanical 
stresses [Buschmann et al., 1992] are also important in controlling cell proliferation responses. 

107.3 Cell Adhesion 

Attachment of mammalian tissue and blood cells to other cells, extracellular matrices, and biomaterials 
is controlled by members of the various families of adhesion receptors, including the integrins, selectins, 
cadherins, and immunoglobulins. Interaction of these receptors with their ligands — which are typically 
accessed in an insoluble context instead of free in solution — is responsible for both physical attachment 
and chemical signaling events. (For clarity of discussion, the term receptor will refer to the molecules on 
"free" cells, while the term ligand will refer to the complementary molecules present on a more structured 
substratum, even if that substratum is a cell layer such as blood vessel endothelium.) It is essential to 
note at the outset that biologic adhesion is a dissipative phenomenon, one which in general is nonre- 
versible; the energy of adhesion is typically less than the energy of deadhesion. Thus cell adhesion cannot 
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be analyzed solely in terms of a colloidal adhesion framework. Cell spreading following adhesion to a 
substratum is an active process requiring expenditure of energy for membrane protrusion and intracel- 
lular (cytoskeletal) motile force generation rather than being a purely passive, thermodynamic phenom- 
enon. Similarly, migration of cells over two-dimensional and through three-dimensional environments 
requires involvement of adhesion receptors not only for physical transmission of motile force via traction 
to the substratum but also perhaps for active signaling leading to generation of the cytoskeletal motile 
force. 

An engineering approach can be applied to design of cells, ligands, and ligand-bearing biomaterials 
to optimize attachment and subsequent signaling. For instance, material bearing synthetic peptides to 
which only a specific, desired cell type will adhere can be developed; an example is the immobilization 
of a unique amino acid sequence on a vascular-graft polymer to promote attachment and spreading of 
endothelial cells but not fibroblasts, smooth muscle cells, or platelets [Hubbell et al., 1991]. Moreover, 
the substratum density of a particular adhesive ligand required for proper cell spreading and leading to 
desired behavior can be quantitatively determined [Massia and Hubbell, 1991]. Design of substratum 
matrices with appropriate ligands for attachment, spreading, and proliferation in wound-healing and 
tissue-generation applications can be based on such information. These matrices also must possess 
suitable biodegradation properties along with appropriate mechanical properties for coordination with 
forces generated by the cells themselves as well as external loads [Cima et al., 1991b, Tranquillo et al., 
1992]. 

Analyses of cell-substratum adhesion phenomena have elucidated key design parameters of receptor- 
ligand interactions as well as cell and substratum mechanics. An important principle is that the strength 
of adhesion is proportional to the receptor-ligand equilibrium binding affinity in a logarithmic manner 
[Kuo and Lauffenburger, 1993], so to alter adhesion strength by an order of magnitude requires a change 
in binding affinity by a factor of roughly 10 3 . Cells possess the capability to accomplish this via covalent 
modification of proteins, leading to changes in effective affinity (i.e., avidity) either through direct effects 
on receptor-ligand binding or through indirect effects in which the state of receptor aggregation is 
modified. Aggregation of adhesion receptors into organized structures termed focal contacts leads to 
dramatic increases in adhesion strength because distractive forces are distributed over many receptor- 
ligand bonds together, as demonstrated experimentally [Lotz et al., 1989] and explained theoretically 
[Ward and Hammer, 1993]. Hence biochemical processes can lead to changes in biophysical properties. 
The fact that physical forces can affect distribution of receptor-ligand bonds and probably other protein- 
protein interactions suggests that biophysical processes can similarly alter cellular biochemistry [Ingber, 
1993]. 

Cells are able to regulate the mode of detachment from a substratum, with a variety of alternative sites 
at which the linkage can be disrupted. Between the intracellular cytoskeleton and the extracellular matrix 
are generally a number of intermolecular connections involving the adhesion receptor. The receptor can 
be simply associated with membrane lipids or can be anchored by linkages to one or more types of 
intracellular components, some of which are also linked to actin filaments. Thus cell-substratum detach- 
ment may take place by simple ligand-receptor dissociation or extraction of the intact ligand-receptor 
bond from the membrane or extraction of the bond along with some intracellularly linked components. 
Biochemical regulation by the cell itself, likely involving covalent modification of the receptor and 
associated components, may serve as a means to control the mode of detachment when that is an 
important issue for engineering purposes. It is likely that the different superfamilies of adhesion receptors 
possess distinct characteristics of their micromechanical and biochemical regulation properties leading 
to different responses to imposed stresses generated either intracellularly or extracellularly. 

A different situation involving the cell adhesion is that of cell capture by a surface from a flowing fluid. 
Instances of this include homing of white blood cells to appropriate tissues in the inflammatory and 
immune responses and metastatic spread of tumor cells, both by adhesion to microvascular endothelial 
cells, as well as isolation of particular cell subpopulations by differential cell adhesiveness to ligand-coated 
materials to facilitate cell-specific therapies. In such cases, it may be desired to express appropriate 
attachment ligands at required densities on either the target cells or capturing surfaces or to use soluble 
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receptor-binding competitors to block attachment in the circulation. This is a complicated problem 
involving application of quantitative engineering analysis of receptor-ligand binding kinetics as well as 
cell and molecular mechanics. Indeed, properties of adhesion receptors crucial to facilitating attachment 
from flowing fluid are still uncertain; they appear to include the bond-formation rate constant but also 
bond mechanical parameters such as strength (i.e., ability to resist dissociation or distraction from the 
cell membrane) and compliance (i.e., ability to transfer stress into bond-dissociating energy). 

Cell-substratum attachment during encounter in fluid flow conditions can be considered under equi- 
librium control if the strength or affinity of bonds is the controlling parameter, whereas it is under kinetic 
control when the rate of bond formation is controlling [Hammer and Lauffenburger, 1987]. Most 
physiologic situations of cell capture from the flowing bloodstream by blood vessel endothelium appear 
to be under kinetic control. That is, the crucial factor is the rate at which the first receptor-ligand bond 
can be formed; apparently the bond strength is sufficiently great to hold the cell at the vessel wall despite 
the distracting force due to fluid flow. Rolling of neutrophils along the endothelium is mediated by 
members of the selectin family, slowing the cells down for more stable adhesion and extravasation 
mediated by members of the integrin family. It is not yet clear what special property of the selectins 
accounts for their ability to yield rolling behavior; one candidate is a combination of both fast association 
and dissociation rate constants [Lawrence and Springer, 1991], whereas another is a low compliance 
permitting the dissociation rate constant to be relatively unaffected by stress [Hammer and Apte, 1992]. 
It will be important to determine what the key properties are for proper design of capture-enhancing or 
-inhibiting materials or regimens. 

107.4 Cell Migration 

Movement of cells across or through two- and three-dimensional substrata, respectively, is governed by 
receptor-ligand interactions in two distinct ways. First, active migration arising from intracellular motile 
forces requires that these forces can be effectively transmitted to the substratum as differential traction. 
Adhesion receptors, mainly of the integrin superfamily, are involved in this process. Integrin interactions 
with their extracellular matrix ligands can influence the linear cell translocation speed. Second, the 
intracellular motile forces must be stimulated under the specific conditions for which movement is 
desired. Often, the ligands that stimulate cytoskeletally generated motile forces can do so in a spatially 
dependent fashion; i.e., cells can be induced to extend membrane lamellipodia preferentially in the 
direction of higher ligand concentrations, leading to net movement up a ligand concentration gradient. 
These ligands are termed chemotactic attractants; many such attractants can additionally modulate move- 
ment speed and are thus also termed chemokinetic agents. Chemotactic and chemokinetic ligands include 
some growth factors as well as other factors seemingly more dedicated to effects on migration. 

Two different types of intracellular motile forces are involved, one that generates membrane lamellipod 
extension and one that generates cell body contraction; it is the latter force that must be transmitted to 
the substratum as traction in order for locomotion to proceed. Indeed, it must be transmitted in a spatially 
asymmetrical manner so that attachments formed at the cell front can remain adhered, while attachments 
at the cell rear are disrupted. Since the contraction force is likely to be isotropic in nature, this asymmetry 
must arise at the site of the force-transmission linkage. Migration speed of a particular cell type on a 
given ligand-coated surface thus depends on (1) generation of intracellular motile forces leading to 
lamellipod extension and to subsequent cell body contraction, stimulated by chemokinetic and/or chemo- 
tactic ligand binding to corresponding receptors or by adhesion receptor binding to appropriate soluble 
or substratum-bound ligands, (2) an appropriate balance between intracellular contractile force and 
overall level of cell-substratum traction, and (3) a significant front versus rear difference in cell-substra- 
tum traction. Speed should be roughly proportional to the first and last quantities but dependent on the 
middle quantity in biphasic fashion — maximal at an intermediate level of motile force-to-traction-force 
ratio [DiMilla et al, 1991]. 

Since many adhesion ligands stimulate intracellular motile force generation, an important parameter 
for design purposes is thus the effective density at which they are present on the substratum. Migration 
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should be maximized at an intermediate density at which the cell contraction force is roughly in balance 
with the cell-substratum adhesiveness, consistent with observations for both two-dimensional [DiMilla 
et al„ 1993] and three-dimensional [Parkhurst and Saltzman, 1992] movement environments. Since 
adhesive force is related to the ligand-receptor binding affinity, use of ligands with different affinities 
provides a means for migration under conditions of different ligand densities. Recalling also that the 
degree of receptor organization into aggregates can dramatically increase cell- substratum adhesiveness, 
use of ligands yielding different degrees of receptor aggregation can similarly influence migration speed. 
Engineering attempts to develop useful substrata for cell migration, e.g., for wound-healing scaffolding 
or tissue-regeneration biomaterials also need to consider possible effects of microarchitecture, i.e., the 
physical geometry of three-dimensional matrices through which the cells may crawl and ligands or 
nutrients may diffuse [Cima and Langer, 1993]. 

Addition of soluble receptor-binding ligand competitors into the extracellular medium is another 
approach to affecting cell migration speed by means of altered cell-substratum adhesiveness. As the 
concentration of a competitor is increased, adhesiveness should decrease. However, since migration speed 
varies with adhesiveness in biphasic manner, the result of adding the competitor can be either a decrease 
or an increase in motility, depending on whether the adhesiveness in the absence of the competitor is 
below or above optimal [Wu et al., 1994]. For example, the rate of blood capillary formation, which is 
related to the speed of microvessel endothelial cell migration, can be either diminished or enhanced by 
addition of soluble integrin-binding competitors [Nicosia and Bonnano, 1991; Gamble et al., 1993]. 

Finally, migration speed may be altered genetically, e.g., by changing the expression level of an adhesion 
receptor [Schreiner et al., 1989]. An increase in the number of receptors may lead to either an increase 
or a decrease in motility, depending on whether the change in adhesiveness is helpful or detrimental. At 
the same time, signaling for motile force generation, if it occurs through the same receptor, will provide 
an accompanying positive influence. Receptors may be altered in their ligand-binding or cytoskeleton- 
coupling capabilities as well, providing alternative approaches for affecting force transmission. 

Attempts to manipulate the direction of cell migration fall into three categories: soluble chemotactic 
ligands, immobilized haptotactic ligands, and contact guidance. Haptotaxis is commonly referred to as 
directed migration on a density gradient of an immobilized ligand, regardless of whether it is due to 
differential adhesiveness or to spatially dependent membrane extension. If the latter effect is causative, 
the phenomenon should be more precisely termed chemotaxis but merely with an immobilized ligand; 
the classic definition of haptotaxis requires an adhesion effect. Examples in which efforts have been made 
to manipulate the direction of cell migration including homing of cytotoxic lymphocytes to tumor sites, 
neovascularization of healing wound tissue, and neuron outgrowth. 

Contact guidance can be produced using either patterned substrata, in which pathways for cell migra- 
tion are delineated by boundaries of adhesiveness [Kleinfeld et al., 1988], or by a alignment of matrix 
fibers for directional orientation of cell movement [Dickinson et al., 1994]. A useful technology for 
generation of ligand concentration gradients is controlled release of chemotactic ligands from polymer 
matrices [Edelman et al., 1991]. Gradients required for effective cell guidance have been found to be 
approximately a few percent in concentration across a cell length, and they must persist for many multiples 
of the cell mean path length; this is the product of the persistence time (i.e., the mean time between 
significant direction changes) and the linear speed. Analysis has predicted that the affinity of a chemotactic 
ligand or, more precisely, the dissociation rate constant may be a key parameter in governing the sensitivity 
of a cell to a chemotactic ligand concentration gradient [Tranquillo et al., 1988]. Data on directional 
responses of neutrophil leukocytes to gradients of interleukin 8 show that these cells respond more 
sensitively to variants of IL-8 possessing lower binding affinities [Clark- Lewis et al., 1991]. Although this 
finding is consistent with theory, involvement of multiple forms of the IL-8 receptor with possibly varying 
signaling capabilities also must be considered as an alternative cause. Work with platelet-derived growth 
factor as a chemotactic attractant has elucidated features of the PDGF receptor involved in sensing ligand 
gradients, permitting approaches for pharmacologic or genetic manipulation of chemotactic receptors 
to modify directional migration responses [Kundra et al., 1994]. 
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Metabolic engineering can be defined as the modification of cellular metabolism to achieve a specific 
goal. Metabolic engineering can be applied to both prokaryotic and eukaryotic cells, although applications 
to prokaryotic organisms are much more common. The goals of metabolic engineering include improved 
production of chemicals endogenous to the host cell, alterations of the substrate required for growth and 
product synthesis, synthesis of new products foreign to the host cell, and addition of new detoxification 
activities. As metabolic engineering has evolved, some general principles and techniques have emerged. 
In particular, developments in genetic engineering enable highly specific modification of cellular bio- 
chemical pathways. However, the complexity of biologic systems and their control severely limits the 
ability to implement a rational program of metabolic engineering. Early attempts at metabolic engineer- 
ing were based on a simplistic analysis of cellular biochemistry and did not live up to expectations 
[Stephanopoulos and Sinskey, 1993]. An important realization is that metabolism must be considered 
as a network of interrelated biochemical reactions. The concept of a single rate-limiting step usually does 
not apply — instead, the control of a metabolic pathway is frequently distributed over several steps. The 
choice of specific modifications to achieve a desired goal is difficult because of this distributed nature of 
control. These issues are especially relevant in medical applications of metabolic engineering, where there 
is the added complication of interacting organ systems. 

The field of metabolic engineering is still young, and advancements are continuing, especially in the 
analysis of metabolic networks and their control. Recent advances include techniques for the experimental 
determination of kinetic or control parameters necessary to apply established mathematical formalisms 
to metabolic networks. Coupled with computer optimization and simulation, these techniques should 
lead to a greater success rate for the directed modification of cellular metabolism. In addition, the 
knowledge and experience gained from engineering cellular metabolism can be applied to individual 
organs or to the whole body to help guide the development and evaluation of new drugs and therapies. 
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108.1 Basic Principles 

Metabolic engineering is inherently multidisciplinary, combining knowledge and techniques from cellular 
and molecular biology, biochemistry, chemical engineering, mathematics, and computer science. The 
basic cycle of metabolic engineering is summarized in the flow sheet in Fig. 108.1. The first step in the 
process is to define the problem or goal to which metabolic engineering will be applied. The next step 
is to analyze the relevant biochemistry to determine the specific modifications that will be attempted. 
Recombinant DNA techniques are then used to implement the modifications, and the analysis of their 
impact completes one pass through the metabolic engineering cycle. This process is continued in an 
iterative fashion until the desired result is achieved. In the discussion that follows, each step of the cycle 
will be addressed individually. 

108.2 Problem Definition 

As mentioned above, the typical goals of metabolic engineering include improving the production of 
endogenous chemicals, altering the organism substrate requirements, synthesizing new products, and 
adding new detoxification activities. There are a variety of ways that these goals can be achieved through 
genetic engineering and mutation and selection. Through genetic engineering, heterologous enzymes 
can be added to a cell, either singly or in groups. The addition of new enzymatic activities can be used 
to meet any of the metabolic engineering goals. Mutation and selection can be used to affect the regulation 
of particular enzymes and thus are most useful for altering metabolite flow. 



Define Goal 



I 



Addition of New Activities 

The distinction between product formation, substrate utilization, and detoxification is simply a matter 
of emphasizing different aspects of a metabolic pathway. In product formation, the end of a metabolic 
pathway is the focal point. In substrate utilization and detoxification, it is the input to the pathway that 
is important. With detoxification, there is the additional requirement that the toxin is converted to less 
harmful compounds, but what they are is not important. In the discussion that follows, the addition of 
new activity for the synthesis of new products will be emphasized. However, since they all involve adding 
new activity to a metabolic pathway, the same techniques used for the synthesis of new products also 
can be used to alter substrate requirements or add new detoxification activity. 

There are several possible strategies for the synthesis 
of new products. The choice of strategy depends on the 
biochemistry present in the host organism and the spe- 
cific product to be synthesized. In many instances, path- 
way completion can be used, which only involves the 
addition of a few enzymes that catalyze the steps miss- 
ing from the existing metabolic pathway. Entire bio- 
chemical pathways also can be transferred to a 
heterologous host. The motivation for this type of 
transfer is that the host organism may be more robust, 
have a more desirable substrate requirement, or have 
inherently higher productivities than the organism that 
naturally possesses the biochemical pathway of interest. 
The transfer of multistep pathways to heterologous 
hosts has been used frequently for the production of 
antibiotics because of the clustering of the genes 
involved [Malpartida, 1990]. When the genes are clus- 
tered, they can be cloned and inserted into the host 
organism as a single unit. FIGURE 108.1 The metabolic engineering cycle. 
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The procedures described above involve the transfer of a known metabolic activity from one organism 
to another. It is also possible to create completely new products, because many enzymes can function 
with multiple substrates. If an enzyme is introduced into a cell and is thus presented with a metabolite 
similar to its normal substrate but which was not present in the donor cell, a completely new reaction 
may be catalyzed. Similarly, if a foreign enzymatic activity produces an intermediate foreign to the host 
cell, that intermediate may undergo further reactions by endogenous enzymes and produce a novel 
product. This phenomenon has been used to synthesize novel antibiotics in recombinant Streptomyces 
[McAlpine, 1987; Epp, 1989] and to engineer an Escherichia coli for the degradation of trichlorethylene 
[Winter, 1989] . Of course, the production of new products also may be an undesirable side effect of the 
introduction of a heterologous enzyme and could interfere with the production of the desired product. 
The complexity of enzyme activities makes the prediction of negative side effects extremely difficult. 

Improving Existing Metabolism 

Although a cell or organism may possess a particular metabolic pathway, the flow of material through 
that pathway (metabolic flux) may be suboptimal. Both recombinant DNA techniques and classic muta- 
tion-selection techniques can help to redirect metabolic fluxes. The principal ways to redirect metabolic 
fluxes are by increasing or decreasing the activity of specific enzymes and by changing their regulation. 
The choice of which enzymes and how to alter them can be very complicated because of the distributed 
nature of metabolic control. However, some basic concepts can be illustrated by the analysis of an idealized 
metabolic network for threonine synthesis, as shown in Fig. 108.2. Aspartate is the biosynthetic precursor 
for threonine, as well as for lysine, methionine, and isoleucine. If our goal is to maximize the production 
of threonine, then we may want to decrease the activity of the enzymes that catalyze the production of 
the other amino acids derived from aspartate. This would result in less of the carbon flow from aspartate 
being drawn off by the unwanted amino acids, and thus a larger flux to threonine might be achieved. 
An alternative way of increasing the flux to threonine is to amplify the activity of the enzymes along the 
pathway from aspartate to threonine. Finally, accumulation of the desired product, threonine, will lead 
to the reduction of flux from aspartate to ASA and from ASA to homoserine because of feedback 
inhibition. This would result in a reduction in threonine synthesis as well. If the feedback inhibition were 
removed, threonine could then accumulate without affecting its rate of synthesis. In practice, multiple 
modifications would probably be required to achieve the desired goal. 




Homoserine - 



Threonine 



-Methionine 



-► Isoleucine 



FIGURE 108.2 A simple metabolic network for threonine synthesis. The solid arrows represent enzyme catalyzed 
biochemical reactions. The dotted arrows indicate allosteric inhibition (ASA = aspartate semialdehyde). 



© 2000 by CRC Press LLC 



108.3 Analysis of Metabolic Networks 
Metabolic Flux Analysis 

The measurement or estimation of metabolic fluxes is useful for the evaluation of metabolic networks 
and their control. Direct measure of intracellular fluxes is possible with in vivo nuclear magnetic reso- 
nance (NMR) spectroscopy. However, the inherent insensitivity of NMR limits its applicability. A more 
general strategy combines material balances with carefully chosen measurements to indirectly estimate 
metabolic fluxes. In isotopic tracer methods, a material balance is performed on the labeled isotope, and 
the measured isotope distribution is used to estimate the fluxes of interest. Alternatively, a total mass or 
carbon balance can be combined with extensive measurements of the rates of change of extracellular 
metabolic concentrations to calculate intracellular fluxes. Used individually or together, these methods 
can provide a great deal of information about the state of a biochemical system. 

In isotope tracer methods, the cells to be studied are provided with a substrate specifically labeled with 
a detectable isotope (usually 14 C or 13 C). The incorporation of label into cellular material and by-products 
is governed by the fluxes through the biochemical pathways. The quantity and distribution of label are 
measured and combined with knowledge of the biochemistry to determine the intracellular fluxes. The 
choices of substrate labeling patterns, as well as which by-products to measure, are guided by careful 
analysis of the assumed biochemical network. These experiments are usually performed at isotopic steady 
state so that the flow of isotope into each atom of a metabolite equals the flux out. For the nth atom of 
the fcth metabolite, the flux balance is [Blum, 1982]: 



yiA(m)=S k (n)Y^V 0tk (108.1) 



where S^m) is the specific activity of the mth atom of the ith metabolite that contributes to the labeling 
of the nth atom of the Ath metabolite, S k (n), and V ik and V ok are the input and output fluxes of the fcth 
metabolite, respectively. The system of equations represented by Eq. (108.1) can be solved for the fluxes 
(V ik , V ok ) from measurements of the specific activities. A similar analysis can be applied to the isotope 
isomers, or isotopomers, yielding more information from a single experiment. Isotopomer analysis is 
not just concerned with the average enrichment of individual atoms. Instead, it involves quantifying the 
amounts of the different isotopomers that occur at a specific metabolic state. The isotopomer distribution 
contains more information than that obtained from positional enrichments, making it generally a more 
powerful technique. Both mass spectrometry and NMR can be used to analyze isotopomer distributions. 
Flux estimation based only on material balances is identical, in principle, to flux estimation using 
isotopic tracers and has been applied to many biochemical processes, including lysine synthesis [Vallino, 
1989] and rat heart metabolism [Safer and Williamson, 1973]. Instead of following the fate of specific 
atoms (i.e., 13 C or 14 C), all are considered equally through the measurement of the rates of change of 
substrates and products. The analysis is simplified in some ways because only the stoichiometry of the 
biochemical reactions is important, and no knowledge of the chemical mechanisms is needed. Some 
resolving power is lost, however, with the loss of the ability to determine the source of individual reactions. 
The analysis is usually formulated as a matrix equation: 

r = Ax (108.2) 

where r is a vector of metabolite rates of change, A is the matrix of stoichiometric coefficients for the 
biochemical network, and x is a vector of the metabolic fluxes. If the rates of change of the metabolites 
are measured, then Eq. (108.2) can be solved for the flux vector x by the method of least squares: 



:(A T A)~Vr 



(108.3) 
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Typically, extracellular metabolites are measured, while intracellular metabolites are assumed to be at 
pseudo-steady state and their rates of change are taken to be zero. 

Metabolic Control Analysis 

A biochemical network is a system of enzyme-catalyzed reactions that are interconnected through shared 
metabolites. At steady state, the fluxes through each pathway will be a function of the individual enzyme 
kinetic properties, as well as the network architecture. The activity of a particular enzyme will affect the 
concentration of its reactants and products and thus will influence the flux through pathways upstream 
and downstream. Metabolic control analysis (MCA) grew from the work originally presented by Kacser 
and Burns [1973] and Heinrich and Rapoport [1977]. MCA provides a framework for analyzing and 
quantifying the distributed control that enzymes exert in biochemical networks. 

In the discussion of that follows, a metabolic network consists of enzymes e, metabolites X, substrates 
S, and products P. For simplicity, we assume that the reaction rate v, is proportional to the enzyme 
concentration e,. If this is not true, then some modifications of the analysis are required [Liao and 
Delgado, 1993]. The concentrations of the products and substrates are fixed, but the metabolite concen- 
trations are free to change in order to achieve a steady-state flux /. An example of a simple, unbranched 
network is 

S -> X l -» X 2 -> X 3 -> P 



MCA is essentially a sensitivity analysis that determines how perturbations in a particular parameter 
(usually enzyme concentration) affect a variable (such as steady-state flux). The measures of the sensi- 
tivities are control coefficients defined as follows: 

, e. 3/ 9 In I/I 

C'=^- L = ^ 108.4 

e ' J de t 3 In e. 

The flux control coefficient C' ei is a measure of how the flux / changes in response to small perturbations 
in the concentration or activity of enzyme i. The magnitude of the control coefficient is a measure of 
how important a particular enzyme is in the determination of the steady-state flux. The summation 
theorem relates the individual flux control coefficients: 



I 



C' = l (108.5) 



A large value for C' d indicates that an increase in the activity of enzyme i(e) should result in a large 
change in the metabolic flux. Thus enzymes with large flux control coefficients may be good targets for 
metabolic engineering. Usually, there are several enzymes in a network with comparably large C values, 
indicating that there is no single rate-limiting enzyme. 

The challenge in analyzing a metabolic network is determination of the flux control coefficients. It is 
possible to determine them directly by "enzyme titration" combined with the measurement of the new 
steady-state flux. For in vivo systems, this would require alteration of enzyme expression through an 
inducible promoter and is not very practical for a moderately sized network. A more common method 
of altering enzyme activities involves titration with a specific inhibitor. However, this technique can be 
complicated by nonspecific effects of the added inhibitor and unknown inhibitor kinetics [Liao and 
Delgado, 1993]. 

Although the control coefficients are properties of the network, they can be related to individual 
enzyme kinetics through elasticity coefficients. If a metabolic concentration X is altered, there will be an 
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effect on the reaction rates in which X is involved. The elasticity coefficient s£' is a measure of the effect 
changes in X have on v ; , the rate of reaction catalyzed by enzyme e,: 

X dv. d In v. , 

e v ' = !- = l (108.6) 

v i 3X 31nX 

The flux control connectivity theorem relates the flux control coefficients to the elasticity coefficients: 

'cV; =0 for all it (108.7) 



E< 
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In principle, if the enzyme kinetics and steady-state metabolite concentrations are known, then it is 
possible to calculate the elasticities and through Eq. (108.7) determine the flux control coefficients. 

Flux Control Coefficients from Transient Metabolite Concentrations 

Recent efforts to address the difficulty of determining MCA parameters include the use of transient 
measurements of metabolite concentrations to give good estimates of the flux control coefficients [Del- 
gado and Liao, 1992]. The key assumption in the dynamic approach is that the reaction rates are 
reasonably linear around the steady state. When this is true and a transient condition is induced (by 
changing substrate or enzyme concentration), the transient fluxes v,(f) are constrained by the following 
equation: 



L C H')- 



(108. 



where r equals the number of reactions in the network. Measurements of transient metabolite concen- 
tration allow the determination of regression coefficients a ; from the following equation: 



£c,[x( f )-x( J 



(108.9) 



where n equals the number of metabolites measured. The flux control coefficients are then determined 
from 



C' C' ... C' 



}\a 1 a 2 ... ccJa (108.10) 



where A is the matrix of stoichiometric coefficients. Using this method requires the measurements of 
transient metabolite concentrations, which are used to calculate the a, values, and the accumulation or 
depletion of external metabolites to determine the steady-state flux /. The measurement of transient 
metabolite concentrations can be difficult but is possible with in vivo NMR or from cell extracts taken 
at different time points. 

"Top Down" MCA 

The traditional approach of MCA can be considered to be "bottom up," since all the individual enzyme 
flux control coefficients are determined in order to describe the control structure of a large network. The 
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"top down" approach makes extensive use of lumping of reactions together to determine group flux 
control coefficients [Brown, 1990]. These can give some information about the overall control of a 
metabolic network without its complete characterization. 
Consider a simple, multireaction pathway: 

produces X consumes X 
h h 

The reactions of a metabolic network are divided into two groups, those which produce a particular 
metabolite X and those which consume it. By manipulating the concentration X and measuring the 
resulting fluxes J 1 and J 2 "group" or "overall" elasticities *s of the X producers and X consumers can be 
determined. Application of the connectivity theorem (Eq. 108.7) then permits the calculation of the 
group control coefficient for both groups of reactions. Each pathway can subsequently be divided into 
smaller groups centered around different metabolites, and the process repeated. The advantage of the 
top-down approach is that useful information about the control architecture of a metabolic network can 
be obtained more quickly. This approach is particularly appropriate for highly complex systems such as 
organ or whole-body metabolism. 

Large Deviations 

One of the limitations of MCA is that it only applies when the perturbations from the steady state are 
small. Experimentally, it is much easier to induce large changes in enzyme or metabolite concentrations, 
and in terms of metabolic engineering, the desired perturbations are also likely to be large. Small and 
Kacser [1993a, 1993b] have developed an analysis based on large deviations and related it to MCA. 

In this discussion, e° is the original concentration of enzyme i, e\ = re°, where r is noninfinitesimal. 
Thus e\ represents a large perturbation to the system. ]° is the flux at the original steady state, and J r is 
the flux after the large perturbation. A deviation index D is used to characterize the change from the 
original steady state: 



D> 
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where A/ = ]' - J° and Ae, = e'— e°. By assuming that each individual enzymatic reaction rate is a linear 
function of the participating metabolites, it can be shown that the deviation index and control coefficients 
are equivalent: 

Df = cf (108.12) 

Similarly, an alternate deviation index *D can be defined as 
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and it is equivalent to the control coefficient at the new steady state: 

D r =C r (108.14) 



© 2000 by CRC Press LLC 



Thus, with a single large perturbation, the control coefficients at the original and new steady states can 
be estimated. This analysis has been extended to branched pathways, but relationships between the 
deviation indices and the flux control coefficients are more complicated and depend on the magnitude 
of the deviation (r). If the subscript a designates one branch and b another, the following relationship 
holds: 



D'*=C[° (108.15) 
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Metabolic control analysis, especially with the recent innovations for determining flux control coeffi- 
cients, is a powerful tool for the analysis of metabolic networks. It can describe the control architecture 
of a biochemical reaction network and identify which steps are the most promising targets for efforts at 
metabolic engineering. 

Pathway Synthesis 

The diversity of biochemical reactions found in nature is quite extensive, with many enzymes being 
unique to a particular organism. Through metabolic engineering, it is possible to construct a metabolic 
network that performs a specific substrate-to-product transformation not found in nature. When explor- 
ing the possibility of synthesizing new biochemical pathways, there are several key issues that must be 
addressed. Given a database of possible enzymatic activities and a choice of substrate and product, one 
must first generate a complete set of possible biochemical reactions that can perform the desired con- 
version. Once they are generated, the set of possible biochemical pathways must be checked for thermo- 
dynamic feasibility and evaluated in terms of yields, cofactor requirements, and other constraints that 
might be present. In addition, the impact of an engineered metabolic pathway on the growth and 
maintenance of the host cell is also important to evaluate. 

The problem of synthesizing a complete set of possible biochemical pathways subject to constraints 
on allowable substrates, intermediates, and by-products has been solved [Mavrovouniotis, 1990]. A 
computer algorithm allows the efficient determination of a complete and correct set of biochemical 
pathways that connect a substrate to a product. In addition, a complementary computer algorithm 
evaluates metabolic pathways for thermodynamic feasibility [Mavrovouniotis, 1993]. The key concept in 
the thermodynamic analysis is that evaluation of the feasibility of biologic reactions requires the speci- 
fication of the concentrations of the products and reactants. The standard free-energy change AG is not 
sufficient because physiologic conditions are significantly different from standard conditions. Both local 
and distributed thermodynamic bottlenecks can be determined by incorporating knowledge of the 
metabolite concentration ranges. This thermodynamic analysis of a biochemical pathway can pinpoint 
specific reactions or groups of reactions that should be modified or bypassed in order to better favor 
product formation. 

The addition of new biochemical pathways or the modification of existing pathways is likely to affect 
the rest of the cellular metabolism. The new or altered pathways may compete with other reactions for 
intermediates or cofactors. To precisely predict the impact of a manipulation of a metabolic network is 
virtually impossible, since it would require a perfect model of all enzyme kinetics and of the control of 
gene expression. However, with relatively simple linear optimization techniques, it is possible to predict 
some of the behavior of a metabolic network. The procedure involves the solution of Eq. (108.2) for the 
metabolic fluxes for networks that have more unknowns x than knowns r. An undetermined system of 
linear equations can be solved uniquely using linear optimization techniques if a "cellular objective 
function" is postulated. Examples of objective functions used in the literature include minimizing ATP 
or NADH production [Savinell, 1992] and maximizing growth or product formation [Varma, 1993]. 
Examination of the fluxes gotten from the linear optimization can indicate potential effects of a proposed 
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metabolic change. For example, the maximum growth of E. coli was found to decrease in a piecewise 
linear fashion as leucine production increased [Varma, 1993]. In principle, linear optimization also could 
be applied to whole-body metabolism to evaluate the effects of metabolically active drug or genetic 
therapies. 

108.4 Implementing Changes 

The techniques available to implement specific changes in both eukaryotic and prokaryotic cells are quite 
powerful. Through classic mutation-selection and modern genetic engineering, it is possible to amplify 
or attenuate existing enzyme activity, add completely new activities, and modify the regulation of existing 
pathways. A detailed description of all the genetic engineering techniques available is beyond the scope 
of this discussion, and only a general overview will be presented. 

Mutation-Selection 

Mutation and selection constitute a method of manipulating the phenotype of a cell. Cells are exposed 
to a mutagenic environment and then placed in a selective medium in which only those cells which have 
a desired mutation can grow. Alternatively, cells can be screened by placing them in a medium in which 
it is possible to visibly detect those colonies which possess the desired phenotype. Although it is a random 
process, careful design of selection or screening media can result in alterations in specific enzymes. For 
example, growth in a medium that contains an allosteric inhibitor may be the result of the loss of the 
allosteric inhibition. By altering metabolite concentrations or adding substrate analogues, it is possible 
to select for increased or decreased activity or for changes in regulation. The major drawback of mutation 
and selection for metabolic engineering is the lack of specificity. There may be multiple ways that a 
particular phenotype can be generated, so there is no guarantee that the enzyme that was targeted was 
affected at all. 

Recombinant DNA 

Recombinant DNA techniques are very flexible and powerful tools for implementing specific metabolic 
changes. The cloning of genes encoding a specific enzyme is relatively routine, and the subsequent 
insertion and expression are also straightforward for many host cells. With the choice of a very active 
promoter, the activity of a cloned enzyme can be greatly amplified. In addition, insertion of a heterologous 
activity into a host cell can serve to alter the regulation of a metabolic pathway, assuming the foreign 
enzyme has different kinetics than the original. Enzyme activity also can be attenuated by antisense 
sequences that produce mRNA complementary to the endogenous mRNA and thus form double-stranded 
RNA complexes that cannot be translated. Finally, heterologous recombination can be used to completely 
remove a particular gene from the host genome. 

108.5 Analysis of Changes 

The techniques used to analyze the biochemistry at the beginning of the metabolic engineering cycle are 
also applicable for assessing the effects of any attempted changes. The effect of the specific change can 
be evaluated, as well as its impact on the activity of the whole metabolic network. It is quite possible for 
a desired change to be made successfully at the enzyme level but not have the desired effect on the 
metabolic network. If a change was partially successful, then the resulting cells can be put through another 
iteration of metabolic engineering in order to make further improvements. If, on the other hand, a 
particular change was not successful, then any information which that failure gives about the regulation 
and control of the metabolic network can be used to make another attempt at implementing the desired 
change. 
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108.6 Summary 

Metabolic engineering is an evolving discipline that tries to take advantage of the advances which have 
been made in the genetic manipulation of cells. Through genetic engineering we have the capability of 
making profound changes in the metabolic activity of both prokaryotic or eukaryotic cells. However, the 
complexity of the regulation of biochemical networks makes the choice of modifications difficult. Recent 
advances in the analysis of metabolic networks via MCA or other techniques have provided some of the 
tools needed to implement a rational metabolic engineering program. Finally, metabolic engineering is 
best viewed as an iterative process, where attempted modifications are evaluated and the successful cell 
lines improved further. 

Defining Terms 

Flux: The flow of mass through a biochemical pathway. Frequently expressed in terms of moles of 

metabolite or carbon per unit time. 
Heterologous enzyme: An enzyme from a foreign cell that has been expressed in a host cell. Frequently 

used to provide activity normally not present or to alter the control structure of a metabolic 

network. 
Host: The cell that has had foreign DNA inserted into it. 
Isotopomers: Isomers of a metabolite that contain different patterns of isotopes (for metabolic studies, 

the carbon isotopes 12 C, 13 C, and 14 C are the most commonly analyzed). 
Metabolic engineering: The modification of cellular metabolism to achieve a specific goal. Usually 

performed with recombinant DNA techniques. 
Metabolic network: A system of biochemical reactions that interact through shared substrates and 

allosteric effectors. 
Pathway completion: The addition of a small number of heterologous enzymes to complete a bio- 
chemical pathway. 
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TISSUE ENGINEERING IS A NEW field that is rapidly growing in both scope and importance 
within biomedical engineering. It represents a marriage of the rapid developments in cellular and 
molecular biology on the one hand and materials, chemical, and mechanical engineering on the 
other. The ability to manipulate and reconstitute tissue function has tremendous clinical implications 
and is likely to play a major role in cell and gene therapies during the next few years in addition to 
expanding the tissue supply for transplantation therapies. 

Tissue function is complex and involves an intricate interplay of biologic and physicochemical rate 
processes. A major difficulty with selecting topics for this section was to define its scope. After careful 
consideration we focused on three goals: first, to cover some of the generic engineering issues; second, 
to cover generic cell biologic issues; and third, to review the status of tissue engineering of specific organs. 
In the first category we have two common engineering themes of material properties and development 
and the analysis of physical rate processes. The role of materials is treated in terms of two length 
scales — the molecular and the cellular size scales. On the smaller size scale, Chapter 110 deals with 
engineering the biomolecular properties of a surface, and Chapter 111 deals with protein adsorption 
onto surfaces to which cells will be exposed. Chapters 112 and 113 deal with the engineering of scaffolds 
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and templates on which cells are grown for transplantation purposes. The effects of physical rate processes 
are treated in Chapter 114 and 115. Both fluid mechanical forces and mass transfer rates influence 
important attributes of tissue function and relate to the physical constraints under which tissues operate. 
The issues associated with materials and the physical rate process represent important engineering 
challenges that need to be met to further the development of tissue engineering. 

In the second category we have common cell biologic themes: stem cell biology, cell motion, the tissue 
microenvironment, and the role of stroma in tissue function. It is now believed that many tissues contain 
stem cells from which the tissue is generated. Chapter 116 describes the basic concepts of stem cell biology. 
Cell motion is an important process in tissue function and tissue development. Basic concepts of cell 
motility are covered in Chapter 117, and the manipulation of this process is likely to become an essential 
part of tissue engineering. It is well established that the cellular microenvironment in vivo is critical to 
tissue function. This complex set of tissues is treated in Chapter 118. The specific interaction between 
cells performing a tissue-specific function and the accessory cells (or stroma) found in tissues has proved 
to be a key to reconstituting tissue function ex vivo. The important role of the stroma in tissue engineering 
is described in Chapter 119. The understanding and mastery of these cell biologic issues are essential for 
the future tissue engineer. 

Chapters in the third and last category describe progress with the engineering of specific tissues. Six 
tissues where much progress has been made were selected: bone marrow, liver, the nervous system, skeletal 
muscle, cartilage, and kidney. These chapters provide a concise review of the accomplishments to date 
and the likely clinical implications of these developments. 

By focusing on these three categories we hope that the reader will acquire a good understanding of 
the engineering and cell biological fundamentals of tissue engineering and of the progress that has been 
made to date and will develop ideas for further development of this emerging and important field. 
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Tissue engineering can be defined as the application of scientific principles to the design, construction, 
modification, growth, and maintenance of living tissues. Tissue engineering can be divided into two 
broad categories: (1) in vitro construction of bioartificial tissues from cells isolated by enzymatic disso- 
ciation of donor tissue, and (2) in vivo alteration of cell growth and function. The first category of 
applications includes bioartificial tissues (i.e., tissues which are composed of natural and synthetic 
substances) to be used as an alternative to organ transplantation. Besides their potential clinical use, 
reconstructed organs may also be used as tools to study complex tissue functions and morphogenesis 
in vitro. For tissue engineering in vivo, the objective is to alter the growth and function of cells in situ, 
an example being the use of implanted polymeric tubes to promote the growth and reconnection of 
damaged nerves. Some representative examples of applications of tissue engineering currently being 
pursued are listed in Table 109.1. 

Conceptually, bioartificial tissues involve three-dimensional structures with cell masses that are orders of 
magnitude greater than that used in traditional two-dimensional cell culture techniques. In addition, bioar- 
tificial organ technology often involves highly differentiated somatic and parenchymal cells isolated from 
normal tissues. In this chapter, we will provide an overview on tissue reconstruction in vitro with particular 
emphasis on the techniques used to control cell function and organization and to scale up bioartificial tissues. 
Other equally important issues in tissue engineering, such as cell isolation and biomaterial fabrication will 
not be presented here. A chapter on cell preservation is presented later within this section. 

109.2 Basic Principles and Considerations 
Cell Type and Source 

In tissue engineering, differentiated cells offer some advantages over tumor cell lines: (1) tumor cells 
often do not express the full spectrum of functions at the same level as the somatic cell lines they originated 
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TABLE 109.1 Representative Applications of Tissue Engineering 

Application Examples 

Implantable device Endothelialized vascular grafts 

Extracorporeal device Bone and cartilage implants 

Cell production Bioartificial skin 

In situ tissue growth and repair Bioartificial pancreatic islets 

Neurotransmitter-secreting cells 

Bioartificial liver 

Hematopoiesis in vitro 

Nerve regeneration 

Artificial skin 

from, (2) tumor cell growth can be very rapid and uncontrollable which can cause design problems (e.g., 
clogging of microchannels), (3) there is a potential risk of seeding tumor cells in the patient. Notwith- 
standing these limitations, appropraite selection procedures can be used to derive tumor cell lines which 
are easily propagated, are contact inhibited, and maintain most of the parent somatic cell features. 

Certain human cell types can be easily propagated using standard culture techniques, such as human 
keratinocytes [Parenteau, 1991]. Conversely, other cell types, such as adult hepatocytes or pancreatic islet 
cells, do not replicate to any appreciable extent in vitro. Because there is currently a shortage of human 
organs, animal sources are also considered. Success in using xenogeneic cells will largely depend on our 
ability to control the immunological response of the host to these cells as well as the proteins that they 
produce. Genetic engineering offers the possibility of creating new cells (i.e., cells which perform a new 
function) which are easy to grow (e.g., fibroblasts), however, care must be taken to verify that these cells 
also exhibit the necessary control mechanisms for responding appropriately to the host's metabolic 
changes. 

Control of Cell Function by the Extracellular Matrix 

Since the vast majority of mammalian cells are anchorage-dependent, they must attach and spread on a 
substrate to proliferate and function normally. Cell adhesion is generally mediated by certain extracellular 
matrix proteins such as fibronectin, vitronectin, laminin, and collagen as well as various glycosaminogly- 
cans. Adsorption of these proteins to surfaces and the conformation of the adsorbed proteins appear to 
be important factors influencing cell attachment and growth on synthetic substrates [Grinnell, 1981]. 
Small sequences of the cell binding region of these proteins (RGDS in fibronectin and vitronectin, YIGSR 
in laminin) can also be covalently attached to surfaces to promote cell-substrate adhesion [Massia, 1990]. 

In general, seeding density is important for normal cell function, especially if cell-cell communications 
must be established, either by direct cell-cell contacts or via the secretion of trophic factors by the cells. 
Efficient cell seeding will mainly depend on: ( 1 ) the affinity of certain cell surface proteins for extracellular 
matrix components, (2) the density of cell- substrate binding sites on the material surface, (3) the presence 
or absence of certain nutritional factors. While the first two factors can be controlled independently of 
the number of cells placed on the surface, the last issue can be problematic because attempts at seeding 
large numbers of cells can significantly deplete nutrients, with the result that less cells than expected will 
attach. This point is discussed below in the section on "metabolic requirements of cells". 

After seeding, the cells spread on the surface and reach a stable shape. The final morphology of the 
cells depends on 3 factors: (1) adherence of the substrate for the cells, which is a function of the affinity 
and number of the adhesion sites, (2) rigidity of the substrate (i.e., ability to resist cell-generated fractional 
forces, (3) cell-cell adherence. The effect of substrate adherence and compliance on cell shape and function 
has been studied on hepatocytes. Increased adherence by increasing the amount of fibronectin adsorbed 
on a surface leads to increased cell spreading, increased DNA synthesis and reduced expression of liver- 
specific functions [Mooney, 1992]. Conversely, reduced adherence or the use of a compliant substrate 
where little spreading is observed helps maintain liver-specific function but reduces DNA synthesis 
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TABLE 109.2 Effect of Extracellular Matrix (ECM) on Cell Shape and Function 



ECM Characteristics 


Hepatocytes" 




Capillary Endothelial 


Surface Chemistry 


Geometry 


Cell shape b 


Liver- 


Specific Function 


Cell Shape 1 


1 ng/cm 2 Fibronectin on 


Single surface 


Round (1.2) 




+ 




polystyrene 












1000 ng/cm 2 Fibronectin 


Single surface 


Spread, flat (5) 




— 




on polystyrene 












0.1% w/v Collagen gel 


Single surface 


Spread, flat (3.5) 




— 


Cobblestone, flat 




sandwich 


Cuboidal, flat (1.5) 




+ 


capillary network 


Heat denatured collagen 


Single surface 


Round, cell aggregates 




+ 




Basement membrane 


Single surface 


Round, cell aggregates 




+ 


Capillary network 


extract 













» Data from [Mooney, 1992; Lindblad, 1990; Ezzell, 1993]. 

b The number between brackets indicates the projected surface area relative to that of isolated hepatocytes (-1200 Ltm 2 ). 

c Data from [Montesano, 1983; Madri, 1983]. 

[Lindblad, 1991]. The quantitative difference between cell-substrate and cell-cell adhesion strength on a 
rigid substrate is also a potential factor which may dramatically affect the organization of cells on the 
substrate. A thermodynamic view of the problem suggests that the overall system (consisting of the cells 
and the extracellular support) ultimately reaches an equilibrium state when the surface free energy is 
minimized [Martz, 1974]. According to this hypothesis, the existence of large cell- substrate adhesion 
forces relative to cell-cell adhesion forces may be a sufficient condition to prevent cell-cell overlapping. 
In contrast, the opposite situation would lead to cell clumping or multilayered growth on the substrate. 
This prediction is in agreement with the observation of cellular aggregate formation when hepatocytes 
are plated on a nonadherent surface as opposed to a highly adherent surface such as type I collagen 
[Koide, 1990]. 

One of the most striking features of endothelial and epithelial cells (as opposed to connective tissue 
cells such as fibroblasts) is the organization of the membrane, which exhibits distinct basal and apical 
domains, where different proteins and receptors are found. Most epithelial cells can be cultured on a 
single surface of plastic or extracellular matrix materials which allows the basal surface to be in contact 
with the substrate and the apical surface to be exposed to the liquid medium. Changing the extracellular 
matrix configuration can induce these cells to adopt different morphologies. For example, capillary 
endothelial cells grown on a single collagen gel reorganize into branching capillary-like structures when 
overlaid with a second layer of collagen [Montesano, 1983]. Unlike most epithelial cells, which exhibit 
only two distinct membrane domains (one apical and one basal), hepatocytes possess two basolateral 
surfaces separated by a belt of apical membrane. To express and maintain this particular phenotype, 
these cells have been cultured between two layers of extracellular matrix, thereby creating a "sandwich" 
configuration which closely mimics the in vivo geometry. Examples of the effect of the extracellular matrix 
on cell shape and function are summarized in Table 109.2. 

Control of Tissue Organization 

New ways of controlling cell distribution and shape on surfaces are being developed. These methods 
provide finer control of cell distribution and creation of microstructures (e.g., microchannels) at the a 
scale that rivals that of natural organs. 

The current approach to control cell distribution at size scales down to cell size (10p.m) involves 
micropatterning techniques. A micropatterning technique based on the utilization of photoreactive cross- 
linking of hydrophobic or hydrophilic compounds to a polymeric surface has recently been described. 
Cells attach and grow along the less hydrophilic micropattened domains. Micropatterned surfaces pro- 
duced with this technique have been used to produce two-dimensional neuronal networks with neuro- 
blastoma cells [Matsuda, 1992]. Small micropatterned adherent squares of different sizes have been used 
to control the extent of spreading of hepatocytes [Bhatia, 1994; Singhvi, 1994]. 
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The control of cell orientation may be important when nonisotropic connective tissues are desired. 
For example, cells can be mixed with a chilled solution of collagen in physiological buffer, followed by 
exposure to 37°C to induce the gelation of the collagen. The result is a loose gel comprised of small 
collagen fibrils where cells and fluid are entrapped. Fibroblasts embedded in a collagen gel cause the 
contraction of the gel [Bell, 1979]. The contraction process can be controlled to a certain extent by 
mechanically restricting the motion along certain directions; this also induces a preferential alignment 
of the collagen fibers as well as the cells within it. A mathematical description of the effect of cell tractional 
forces on the deformation of collagen-cell lattices and the resulting alignment of the collagen fibers has 
been presented by Tranquillo [1994]. Other strategies to align cells have been reported: (1) alignment of 
fibroblasts in collagen gels where collagen fibrils were aligned by a magnetic field during gelation [Guido, 
1993], (2) repetitive stretching of endothelial cells cultured on an extensible surface [Banes, 1993], and 
(3) flow-induced alignment of endothelial cells [Girard, 1993]. 

Heterotypic cell systems or "co-culture" systems have been used for the production of skin grafts, in 
long-term cultures of hepatocytes, and in long-term cultures of mixed bone marrow cells. These systems 
take advantage of the trophic factors (for the most part unknown) secreted by "feeder" cells. Greater use 
of different cell types used in co-culture will enable engineered cell systems to closely mimic in vivo 
organization, with potential benefits including increased cell function and viability, and greater range of 
functions expressed by the bioartificial tissue. The organization of multicellular three-dimensional struc- 
tures may not be obvious. Provided that the adherence of homotypic and heterotypic interactions is 
known, a thermodynamic analysis similar to that used to describe the morphology of a pure cell culture 
on a surface can be used to predict how cells will organize in these systems [Wiseman, 1972]. The process 
of cell-cell sorting in multicellular systems may be altered by changing the composition of the medium 
or altering the expression of proteins mediating cell-cell adhesion via genetic engineering [Kuhlenschmidt, 
1982; Nose, 1988]. Furthermore, micropatterned surfaces can be used to control the initial interface 
between two cell types in co-culture systems [Bhatia et al., 1998]. 

Metabolic Requirements of Cells 

Oxygen is very often the limiting nutrient in reconstructed tissues. The oxygen requirements vary largely 
with cell type. Among cells commonly used in bioartificial systems, hepatocytes and pancreatic islet cells 
are particularly sensitive to the availability of oxygen. Oxygen is required for efficient cell attachment 
and spreading to planar surfaces as well as microcarriers [Foy, 1993; Rotem, 1994]. 

Based on a simple mathematical model, we can estimate the critical distance at which the oxygen 
concentration at the cell surface becomes limiting (this concentration is arbitrarily set equal to K^) if an 
unstirred aqueous layer is placed between the gas phase (air) and a confluent monolayer of cells [Yarmush, 
1992]. In the case of a confluent monolayer of hepatocytes (2 x 10 5 cells/cm 2 ), this distance is 0.95 mm. 
Thus, in a first approximation, a successful bioartificial liver system containing hepatocytes will have to 
keep the diffusional distance between the oxygen-carrying medium and the cells to below 1 mm (assuming 
a confluent cell monolayer on the surface). 

Figure 109.1 can be used to estimate the maximum half thickness of a cell mass surrounded by a 
membrane or external diffusion barrier before the nutrient concentration in the center falls below the 
Michaelis-Menten constant for nutrient uptake, a sign of nutrient limitation at the cellular level. Oxygen 
uptake rate parameters for different cell types are given by Fleischaker [1981]. Values for hepatocytes and 
pancreatic islets can be found in Yarmush [1992], Rotem [1992], and Dionne [1989]. For illustrative 
purposes, we will use cylindrical hepatocyte aggregates, as an example. We assumed the following: no 
external diffusion barrier (Rj/Rq) = 1; medium saturated with air at 37°C at the aggregate surface 
(160mmHg= 190 nmol/cm 3 ); diffusivity of oxygen in aggregates (D ) similar to that in water (2 x 
10~ 5 cm 2 /s); a packed cell mass (given a cell diameter of approximately 20 p.m, this corresponds to 1.25 x 
10 8 cells/cm 3 ). The oxygen uptake parameters for hepatocytes were LL max = 0.4 nmol/10 6 cells/s (thus 
50 nmol/cm 3 /s for the above cell concentration) and K m = 0.5 mmHg (e.g., 0.6 nmol/cm 3 ). We obtain 
Q/K™ = C /K m = 320, (Li m „ R 2 )/(D K m ) = 724, 1370, and 2010 for the slab, cylindrical, and spherical 
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FIGURE 109.1 Correlation to predict the maximum half-thickness R of a cell mass surrounded by a shell of thickness 
R[-R without nutrient limitation, assuming that diffusion is the only transport mechanism involved. A: The nutrient 
concentration at the surface of the cell mass normalized to the Michaelis-Menten constant for the nutrient uptake 
by the cells (C /K m ) is obtained from the normalized bulk nutrient concentration (C[/K m ), and the aspect ratio of 
the system (R,/R ). D , D, are the diffusivities for the nutrient within the cell mass and the external diffusion barrier, 
respectively. The partition coefficient between the cell mass and the surrounding shell is assumed to be equal to 1. 
B: The half thickness R for which C/K m = 1 in the center (R = 0) is obtained from the value of the y-axis corresponding 
to C„/K m , knowing, in addition to the parameters listed above, the maximum nutrient uptake rate by the cells (u^J. 
If K m is unknown, a zero order approximation may be used, in which case K m is set arbitrarily so that C;/K m falls in 
the linear portion of the curve in A. In B, R is obtained using the line labeled "0th order approximation", and 
corresponds to the half thickness R for which C = at the center (R = 0). 



geometries, respectively, and thus the corresponding maximum half thicknesses obtained are R = 132, 
181, and 220 (im, respectively. We now consider the case where there is a 100 p.m thick membrane around 
a cylindrical cell aggregate assuming that the diffusivity of oxygen in the membrane (Dj) is the same as 
in the cell mass. An aspect ratio (a) must be assumed and the values of C /K m , (jV^ R 2 )/(D K m ), and 
R,, are calculated. R[ is then obtained from the assumed aspect ratio. Calculations must be performed 
with several aspect ratios until the difference RrR equates the membrane thickness. Here, we found 
that a = 0.6 generates C /K m = 141, (u^ R 2 )/(D K m ) = 622, R = 122 um. For a cylinder, a = D /D 1 
ln^/Rg), and thus in this case Rj = 222 (im. Thus, the maximum half-thickness of the cell mass is 
122 (J.m, as compared to 181 [Ira in the absence of the membrane. These estimates can be used as first 
guidelines to design a bioartificial liver, and they clearly suggest that the thickness of the cell mass must 
be limited to a few hundred microns to prevent the formation of an anoxic core. 
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[Dionne, 1989] 


46.1 nmol/cm 3 /s (300 mg/dl < 
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TABLE 109.3 Maximum Oxygen Uptake Rates of Cells Used in Tissue Reconstruction 

Cell type 

S p 2/0-derived mouse hybridoma 
Hepatocytes 

Pancreatic islet cells 

* Oxygen tension for which the oxygen uptake rate equals half of the maximum. 

Tissue assembled in vitro may have different metabolic requirements than their in vivo counterparts, 
so that exposure of bioartificial tissues to biological fluids and the in vivo environment may lead to 
unexpected alterations in function that may impact on their clinical effectiveness. For example, stable 
cultures of hepatocytes suddenly exposed to heparinized plasma accumulate fat droplets and exhibit a 
progressive deterioration of hepatospecific functions [Stefanovich et al., 1996]. Thus, it is important to 
consider the response of cultured cells and bioartificial tissues to biological fluids when evaluating their 
performance after implantation. 

109.3 Reconstruction of Connective Tissues 

The most simple method to create connective tissue in vitro is to incorporate connective tissue cells within 
a loose network of extracellular matrix components. A dermal equivalent has been produced by embed- 
ding fibroblasts in collagen gels [Bell, et al., 1979]. When fibroblasts (2.5 X 10 4 /mL) were seeded in 
collagen, the collagen lattice contracted 30 to 50 fold after 4 days of culture. The result is a dense cell- 
collagen matrix which can be used to support a cultured epidermis. Another example is the production 
of a bioartificial vascular media by seeding smooth muscle cells in collagen tubes [Weinberg, 1986]. In 
the latter case, orientation of the cells along the circumference of the tube may be necessary in order to 
eventually obtain a tissue with contractile properties similar to that exhibited by native blood vessels. 
More recently, L'Heureux et al. [1998] showed that smooth muscle cells and fibroblasts cultured in the 
presence of ascorbic acid produce large amounts of extracellular matrix and generate cell-extracellular 
matrix composite membranes which can be harvested from the substrate. Vascular grafts generated by 
wrapping around these membranes had a higher burst strength than human saphenous veins commonly 
used for arterial reconstructions. 

When reconstructing connective tissue, it may be advantageous to use a relatively low seeding density 
with the expectation that cell replication together with the migration of blood vessels from the host's 
surrounding tissue would occur after implantation. This approach may not be appropriate for tissues 
that grow slowly and that are subjected to high stresses after implantation. This is the case of bioartificial 
cartilage, which would be best implanted once its mechanical properties are similar to that of the authentic 
tissue. Chondrocytes seeded at high density (10 7 cells/cm 3 ) in agarose gels and in serum-free medium 
retain their phenotype and remain viable for up to 6 months [Bruckner, 1989]. In this system, the 
deposition of type II collagen and highly charged proteoglycans can be observed over a period of 43 days 
[Buschmann, 1992]. The cartilage obtained in the latter studies had a stiffness of approximately 1/3 that 
of the cartilage explants used to isolate the cells. It should be recognized that in these experiments, the 
cell density was considerably lower than that found in the native cartilage explants (7.5 X 10 7 cells/cm 3 ). 
Higher chondrocyte seeding density in the initial gel could potentially yield a material with mechanical 
properties closer to that of native cartilage. 

Meshes made of slowly biodegradable polymers may be particularly suitable for dense connective tissue 
synthesis, such as bone and cartilage. Loose meshes made of lactic and glycolic acid copolymer seeded 
with calf chondrocytes (5 X 10 7 /mL) have been shown to harbor the production of cartilage in vitro 
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[Freed, 1993]. In the same studies, meshes seeded with cells were implanted in nude mice and recovered 
at different time following implantation. Implants excised at the 7 week time point consisted almost 
entirely of cartilage-like tissue with an overall shape similar to that of the original synthetic matrix. 
Histochemical analysis of the specimens revealed the presence of type II collagen and sulfated glycosami- 
noglycans at week 7 and later. 

Angiogenesis may be necessary if there is cell growth in the reconstructed tissue after implantation. 
Because open matrices are mainly used to reconstruct these tissues, the host's capillaries can migrate into 
the implant. The presence of angiogenic factors (e.g., heparin-binding growth factor- 1) may be used to 
improve the kinetics of implant vascularization [Thompson, 1989]. Genetically modified cells have been 
used to provide stable long-term release of angiogenic factors by the implant [Eming et al., 1995]. 

109.4 Reconstruction of Epithelial or Endothelial Surfaces 
Cells Embedded in Extracellular Matrix Materials 

A simple way to maintain certain epithelial cells in a three-dimensional matrix is to seed them in gels or 
meshes in a similar manner as connective tissues are constructed. For example, hepatocytes can be 
maintained in the same mesh-type matrices made of biodegradable materials used to create bioartificial 
cartilage [Cima, 1991]. In this configuration, hepatocytes have a tendency to aggregate; such aggregates 
(sometimes called organoids) are known to contain cells which have maintained their phenotypic stability 
[Koide, 1990] . However, this process may be somewhat limited unless the aggregate size can be controlled 
to prevent the formation of large aggregates with anoxic cores. Also, this method may not be used to 
produce continuous epithelial cell sheets. 

Culture on a Single Surface 

Bioartificial vascular grafts have been produced by seeding endothelial cells on the luminal surface of 
small diameter (6 mm or less) synthetic vascular grafts. These prostheses have given poor results because 
of inflammatory responses at the anastomotic sites and the poor retention of the seeded endothelium 
under in vivo conditions. A more intrinsically biocompatible approach to vascular graft production is to 
reproduce more closely the organization of actual blood vessels. For instance, a bioartificial media has 
been produced by embedding vascular smooth muscle cells in a collagen gel annulus, and the intima re- 
established by seeding endothelial cells on the inside surface of the gel [Weinberg, 1986]. 

A dermal equivalent consisting of fibroblasts embedded in collagen has been used to support a stratified 
layer of human keratinocytes [Parenteau, 1991]. Exposure to air induces the terminal differentiation of 
the keratinocytes near the air-liquid interface, which is characterized by the formation of tight cell-cell 
contacts and cornified envelopes. The resulting bioartificial skin has a morphological and biochemical 
organization very similar to that of real skin including a stratum corneum which exhibits a high resistance 
to chemical damage. This type of bioartificial skin has been tested successfully in animals, but acceptance 
of allogeneic skin grafts remains problematic in humans, which means that the production of grafts 
requires that both fibroblasts and keratinocytes be obtained from the patient and propagated in vitro. 

Culture in a Sandwich Configuration 

This culture technique has been successfully used to maintain hepatocyte polarity and function in vitro. 
Hepatocytes are first plated on a single gel of collagen and then a top layer of type I collagen is placed 
on the cells after one day in culture. The resulting extracellular geometry closely mimics that found 
in vivo and maintains a wide spectrum of liver-specific functions, including protein secretion (e.g., 
albumin, transferrin, and fibrinogen), detoxification (e.g., cytochrome P450-dependent pathways) for 
up to 6 to 8 weeks, [Dunn, 1991]. 
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109.5 Bioreactor Design in Tissue Engineering 

A bioreactor is a system containing a large number of cells that transform an input of reactants into an 
output of products. Bioreactors have been designed primarily for use as bioartificial liver or pancreas, 
and more recently for the production of blood cells from hematopoietic tissue. These systems require 
the maintenance of the function of a large number of cells in a small volume. For example, a hypothetical 
bioartificial liver device possessing 10% of the detoxification and protein synthesis capacity of the normal 
human liver (a rough estimate of the minimum processing and secretory capacities that can meet a 
human body's demands) would contain a total of 10 10 adult hepatocytes. Thus, to keep the total bioreactor 
volume within reasonable limits (1 L or less), 10 7 cells/mL or more are required. For comparison, the 
normal human liver contains approximately 10 s hepatocytes/mL. Two main types of bioreactor design 
have been considered in tissue engineering: (1) hollow fiber systems, and (2) microcarrier systems. 

Hollow Fiber Systems 

Hollow fiber systems consist of a shell traversed by a large number of small diameter tubes. The cells may 
be placed within the fibers in the intracapillary space or on the shell side in the extracapillary space. The 
compartment which does not contain the cells is generally perfused with culture medium or the patient's 
plasma or blood. The fiber walls may provide the attaching surface for the cells and/or act as barrier against 
the immune system of the host. Some of these systems are essentially designed to be implanted as vascular 
shunts, but may also be perfused with the patient's blood or plasma extracorporeally. 

For the selection of fiber dimensions, spacing, and reactor length, the reader is referred to previously 
published experimental and theoretical studies [Chresand, 1988; Piret, 1991]. If the pressure gradient 
across the fiber length is sufficiently high and relatively permeable fibers are used, the pressure difference 
between the inlet and outlet induces convective flow (called Starling flow) across the fiber wall and 
through the shell compartement [Kelsey, 1990; Pillarella, 1990]. The maintenance of Starling flow in 
implanted hollow fiber devices is contingent upon the prevention of a gradual decrease in fiber perme- 
ability due to protein deposition over time on the fiber walls, which may difficult to achieve with fluids 
containing high levels of proteins such as plasma. 

There are several reports in the literature describing the use of hollow fiber systems in the development 
of a bioartificial pancreas [Colton, 1991; Ramirez, 1992]. Most designs place the islets on the shell side, 
while perfusing the fibers with the animal's plasma or blood. Recent studies using implantable devices 
connected to the vascular system of the patient have given some encouraging results [Sullivan, 1991], 
but it appears that none of the pancreatic islet devices tested to date have shown long term function in 
a reproducible manner. It is interesting to note that none of the studies involving pancreatic islets 
encapsulated in hollow fiber systems have recognized that oxygen transport can be severely compromised 
within the isolated islets alone, resulting in a substantial reduction in the insulin secretion capacity of 
the islets in response to glucose changes [Dionne, 1993]. 

It may be advantageous to place cells in the lumen of small fibers because the diffusional distance 
between the shell (where the nutrient supply would be) and the cells is essentially equal to the fiber 
diameter, which is easier to control than the inter-fiber distance. In one configuration, cells have been 
suspended in a collagen solution and injected into the lumen of fibers where the collagen is allowed to 
gel. Contraction of the collagen lattice by the cells creates avoid in the intraluminal space [Scholz, 1991]. 
Such a configuration has been described for the construction of a bioartificial liver using adult hepatocytes 
[Nyberg, 1993] . It has been further proposed that the lumen be perfused with culture medium containing 
the appropriate hormones for cell maintenance while the patient's plasma would flow on the shell side. 

Microcarrier-based Systems 

Micro carriers are small beads (usually less than 500 p.m in diameter) with surfaces treated to support 
cell attachment. The surface area available per microcarrier can be increased by using porous microcarriers, 
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where cells can migrate and proliferate within the porous matrix as well as on the microcarrier surface. 
Porous ceramic beads have been used to support a rat cell line transfected with the human proinsulin 
gene [Park, 1993]; under certain conditions intraparticle flow significantly enhances mass transport 
(especially that of oxygen) to the cells in the beads [Stephanopoulos, 1989]. 

Maintaining a constant number of microcarriers while increasing the number of cells increases the 
surface coverage of the microcarriers as long as the concentration of nutrients (especially oxygen) is not 
limiting. With the adult rat hepatocyte-Cytodex 3 microcarrier system, we observed that sufficient supply 
of oxygen to the hepatocytes is required for the efficient attachment (approximately 90%) of cells to 
microcarriers. A simple oxygen diffusion-reaction model indicated that a cell surface oxygen partial 
pressure greater than 0.1 mm Hg was needed [Foy, 1993]. 

Two bioreactor configurations using microcarriers may find potential use in tissue engineering: 
(1) packed or fluidized bed, and (2) microcarriers incorporated in hollow fiber cartridges. A packed bed 
of microcarriers consists of a column filled with microcarriers with porous plates at the inlet and outlet 
of the column to allow perfusion while preventing microcarrier entrainment by the flow. Total flow rate 
is mainly dependent on cell number and the nutrient uptake rate of the cells. Reactor volume is propor- 
tional to the microcarrier diameter. From that point of view, it is therefore advantageous to reduce the 
microcarrier size as much as possible. However, packed beds with small beads may be potentially more 
prone to clogging and the cells may have tendency to accumulate in the channels between the microcarrier 
surfaces. The aspect ratio of the bed (height/diameter) is adjusted so that the magnitude of fluid mechan- 
ical forces (proportional to the aspect ratio) within the bed is below damaging levels. Given a fixed flow 
rate and microcarrier diameter, decreasing column diameter increases fluid velocity, and therefore 
increases the shear stress at the surface of the microcarriers and cells, with potential mechanical damage 
to cells. On the other hand, low aspect ratios may be difficult to perfuse evenly. Fluidized beds differ 
from packed beds in that the perfusing fluid motion maintains the microcarriers in suspension [Run- 
stadler, 1988]. 

Packed bed systems have been used to support high densities (5 X 10 8 cells/mL) of anchorage-depen- 
dent cell lines seeded on microporous microcarriers (589 to 850 p.m in diameter) [Park, 1993]. In 
addition, packed beads (1.5 mm diameter) have been used to entrap aggregates of hepatocytes. This 
system was shown to maintain a relatively stable level of albumin secretion (a liver-specific product) for 
up to 3wk [Li, 1993]. 

Microcarriers have also been used as a way to provide an attachment surface for anchorage-dependent 
cells introduced in the shell side of hollow fiber devices, as in the case of a hollow fiber bioartificial liver 
device [Demetriou, 1986]. A flat bed device, which is in principle similar to a hollow fiber system (cells 
are separated from the circulating medium by a membrane), for the maintenance of cultured human 
bone marrow at high densities on porous microcarriers has been recently described [Palsson, 1993]. 
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The interaction between cells and surfaces plays a major biologic role in cellular behavior. Cell membrane 
receptors responsible for adhesion may influence cell physiology in numerous ways. Examples of these 
interactions are the rolling and activation of leukocytes on vascular endothelium, the spatial differenti- 
ation of embryonic basement membranes in development, the extension of neurites on adhesion proteins, 
the proliferation of cells induced by mitogenic factors, and the spreading and recruitment of platelets 
onto the vascular subendothelium. These and other cell-surface interactions influence or control many 
aspects of cell physiology, such as adhesion, spreading, activation, recruitment, migration, proliferation, 
and differentiation. 

There is great interest in understanding cell-surface interactions. This understanding is paramount to 
the development of pharmaceutical compounds to enhance or inhibit cell-substrate interactions, such 
as agents to enhance cell adhesion for tissue regeneration or biomaterials integration or to reduce cell 
adhesion in metastasis or fibrosis. In the emerging area of tissue engineering, the adhesion of cells to a 
culture support is essential for many products such as biohybrid dermal dressings, synthetic articular 
cartilage, and hepatocyte scaffolds. Investigations of cell-surface interactions are central to many areas 
of biomedicine and bioengineering, including tissue engineering, biomaterials design, immunology, 
oncology, haematology, and developmental biology. 

To examine the interactions of cells with substrates and how they may influence cell behavior, simplified 
models that simulate basement membranes, cell surfaces, or extracellular matrices have been developed. 
These models involve the immobilization of biologically active ligands of natural and synthetic origin 
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onto various substrates to produce chemically defined bioactive surfaces. Ligands that have been immo- 
bilized include cell-membrane receptor fragments, antibiotics, adhesion peptides, enzymes, adhesive 
carbohydrates, lectins, membrane lipids, and glycosaminoglycan matrix components. Functionalized 
two-dimensional surfaces are important tools to elucidate the molecular mechanisms of cell-mediated 
bioadhesion. 

Techniques for preparing these ligand-functionalized two-dimensional substrates for investigation of 
cell adhesion are similar to methods for labeling ligands, preparing immunomatrices, or immobilizing 
enzymes. This brief tutorial review will focus primarily on schemes for immobilizing bioactive ligands 
specifically for purposes of investigating cell-surface interactions. Topics that will be addressed include: 
(1) general considerations of bioconjugation; (2) examples of surface bioconjugation; (3) preparation of 
surfaces for ligand immobilization; (4) examples of ligand-containing copolymers; (5) techniques for 
characterizing these surfaces; and (6) examples of applications of surface immobilized ligands in cell- 
substrate investigations. 

110.1 General Considerations 

Numerous reviews are available that describe the chemistry of ligand conjugation and immobilization, 
and the reader is encouraged to consult them for more detailed descriptions [1-12]. When immobilizing 
ligands for the preparation of cell-adhesive substrates, several factors must be considered to retain 
maximum bioactivity. Ligand surface immobilization may proceed in low yields due to sterically inac- 
cessible reactive sites on the ligand molecule; the immobilized ligand may not have optimum cell-receptor 
interactions due to physical constraints imposed by the surface or by the active site being buried; or the 
immobilized ligand may be partially denatured upon immobilization. The inclusion of spacer arms on 
the surface may help relieve these limitations [ 13] . If the ligand has a directionality, then the cell adhesive 
response may vary depending upon the orientation of the immobilized ligand [14, 15] or upon the length 
of the spacer arm [16]. 

Ligands have been physicochemically absorbed onto polymeric substrates to investigate cell-surface 
interactions. The advantage of covalently immobilizing a ligand is that a chemical bond is present to 
prevent desorption. The general chemical scheme for grafting a ligand onto a surface is shown in Fig. 110.1. 
Activation usually is a necessary step to produce highly reactive species to enable surface immobilization 
under mild conditions. Typically the surface is activated, since the chemical steps are somewhat simpler 
than for ligand activation, but each strategy may have its own advantages and limitations. 

The activation of ligands may require suitable protective schemes to prevent homopolymerization or 
cross-linking if these competing reactions are not desired. Ligands may also be activated at more than 
one site, some of which may remain unconsumed after coupling is complete and have biological activity. 
In contrast, the activation of surfaces usually does not require protective schemes; nonetheless, activated 
surfaces may contain unconsumed sites after immobilization that may require deactivation so as not to 
influence cell-mediated adhesive responses. 

Because most ligands immobilized for cell-surface studies are biologic in origin, water is often the 
only choice of solvent as the coupling medium. Water is nucleophilic, and hydrolysis can be a competing 
reaction during activation and coupling, especially at higher pH (>9— 10). To reduce hydrolysis, surfaces 
are commonly activated in nonaqueous solvents; ligands are then coupled using concentrated aqueous 
solutions (>1 mg/ml) for extended periods (>12 hr). Some ligands are soluble in polar solvents such as 
dimethylsulfoxide (DMSO), dimethylformamide (DMF), acetone, dioxane, or ethanol, or their aqueous 
cosolvents, and these combinations may be used to reduce hydrolysis during coupling. 



h activation ■ „, coupling I 
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FIGURE 110.1 General chemical scheme for grafting a ligand onto a surface. 
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110.2 Surface Bioconjugation 

The typical chemical groups involved in immobilizing ligands for cell-surface studies are hydroxyls, 
amines, carboxylic acids, and thiols. Other groups, such as amides, disulfides, phenols, guanidines, 
thioethers, indoles, and imidazoles, have also been modified, but these will not be described. 

Hydroxyls and carboxylic acids are commonly activated to produce more reactive agents for ligand 
acylation. Elimination is a possible competing reaction for secondary and aryl alcohols. Primary amines, 
present on many biologic ligands from N-amino acid termini and lysine, are good nucleophiles when 
unprotonated (pK a ~ 9); moderately basic pH (8-10) ensures their reactivity. Thiols, present on cysteine 
(pK a ~ 8.5), are stronger nucleophiles than amines, and as such they can be selectively coupled in the 
presence of amines at lower pH (5-7). Sulhydryl groups often exist as their disulfide form and may be 
reduced to free thiols using mild agents such as dithiothreitol. 

The following paragraphs review activation schemes for particular chemical groups present on the 
surface, giving brief coupling schemes. Due to the advantages of surface versus ligand activation, only 
general methods involving surface activation will be discussed. 

Immobilization to Surface Alcohols 

A hydroxyl-bearing surface may be activated with numerous reagents to produce more reactive species 
for substitution, the most common example being the cyanogen bromide activation of cellulose and 
agarose derivatives [2-5, 11,17]. Due to problems with high volatility of the cyanogen bromide, sensitivity 
of the activated species to hydrolysis, competing reactions during activation and coupling, and the desire 
to use culture substrates other than polysaccharides, other activation schemes have been developed for 
more defined activation and coupling. These schemes have been used to modify functionalized glasses 
and polymers for cell-mediated adhesion studies. 

Alcohols react with sulfonyl halides [18-25], carbonyldiimidazole [26, 27], succinimidyl chloroformate 
[28], epoxides [29, 30], isocyanates [31-33] and heterocyclic [34, 35] and alkyl halides [21]. Activation 
of surface alcohols, with these agents can be performed in organic solvents such as acetonitrile, methylene 
chloride, acetone, benzene, dioxane, diethyl ether, toluene, or DMF. 

Reactive Esters 

Alcohols can be activated to reactive sulfonic ester leaving groups by reaction with sulfonyl halides [18-25] 
(Fig. 110.2) such as p-toluenesulfonyl chloride (tosyl chloride), trifluoroethanesulfonyl chloride (tresyl 
chloride), methanesulfonyl chloride (mesyl chloride), or fluorobenzenesulfonyl chloride (fosyl chloride). 
The resulting sulfonic esters are readily displaced in mild aqueous conditions with amines or thiols to 
produce amino- or thioether-bound ligands. Aryl alcohols should not be activated in this manner, since 
the sulfonate group may irreversibly transfer to the aromatic nucleus [36]. Sulfonic esters differ in their 
ease of nucleophilic substitution and resistance to hydrolysis, tosyl esters being low in coupling potential 
[23] and fosyl esters being high in hydrolysis resistance [24] . Activation can be performed in many organic 
solvents that are properly dry: Trace water and other species such as ammonia will react. DMSO should 
not be used as a solvent, as it will react with sulfonyl halides. A tert-amino base such as pyridine, 
dimethylaminopyridine, triethylamine, diisopropylethyl amine, or ethylmorpholine can be added in 
equimolar amounts to serve as a nucleophilic catalyst and to combine with the liberated HC1. It has been 
suggested that hydroxyls be converted to alkoxides prior to sulfonic ester activation in the presence of 
ethers as they may be sensitive to the HC1 generated during activation [25]. 



h 
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FIGURE 110.2 Alcohols can be activated with sulfonyl halides, which are readily displaced with amine- (shown) 
or thiol-containing (not shown) ligands. 
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An alcohol-containing surface is typically incubated with the sulfonyl halide for 0.5-6.0 hr. Any 
precipitated salts can be rinsed away with 1 mM HC1. The ligand is coupled to the surface for 12-24 hr 
in borate or carbonate buffer (pH 9-10) at a concentration of 1 mg/ml. Coupling can proceed at more 
mild pH (~7) with more concentrated solutions (>10 mg ligand/ml) or with thiol-bearing ligands. Excess 
sulfonic esters can be displaced with aqueous solutions (10-50 mM, pH = 8-9) of tris(hydroxyme- 
fhyl)aminomefhane, aminoethanol, glycine, or mercaptoethanol, or by hydrolysis. 

Other Acylating Agents 

Surface hydroxyls may be activated to groups other than sulfonic esters. Alcohols react readily with 
carbonyldiimidazole [26, 27] (Fig. 110.3) and succinimidyl chloroformate [28] (Fig. 110.4) to produce 
reactive imidazole-N-carboxylates and succinimidyl carbonates, respectively. These species acylate amines 
to urethane linkages. Activation typically proceeds in organic solvent for 2-6 hr. Dimefhylaminopyridine 
catalyzes formation of the succinimidyl carbonate. Amine-bearing ligands (>10 mg/ml) are coupled to 
the surface in borate or phosphate buffer (pH 8-9) for 12-24 hr, or 4°C, 2-3 days. Thiol-bearing groups 
may also be immobilized onto these activated groups; however the thiocarbamate linkage can be sensitive 
to hydrolysis and may not be generally applicable to preparing well-defined substrates for long-term 
biologic investigations. 

Bifunctional Bridges 

In lieu of alcohol conversion to activated reaction groups, the hydroxyl may be added to homo- or 
heterobifunctional bridges, wherein alcholysis consumes one terminus to produce newly functionalized 
surfaces (Figure 110.5). Epoxide bridges can be added to surface-bound alcohols in aqueous base (to 
reduce hydrolysis or polymerization of the epoxide, 10-100 mM NaOH) or in organic solvent, 12 hr. 
Isocyanate bridges are added to surface alcohols in organic solvents with organotin catalysts (such as 
dibutyltin dilaurate), 12 hr. Halo alkylation of surface-bound alcohols or alkoxides proceeds in organic 
solvents, 2-3 days, or with heat (~80°C), 12 hr. The unconsumed free group (isocyanate, epoxide, or 
alkyl halide) is substituted with amine- or thiol-bearing ligands in buffered aqueous conditions, pH 8-10, 
1-10 mg/ml, 12-24 hr. Hydroxyl-bearing ligands, such as carbohydrates, can be coupled in aqueous 
DMSO, dioxane, or DMF, 12-24 hr, 60-80°C. Hydroxyl coupling may also be performed in aqueous base 
(10-100 mM NaOH) for ligands that are resistant to ester- or amide hydrolysis, such as mono- or 
polysaccharides. 
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FIGURE 110.3 Alcohols react readily with carbonyldiimidazole to produce reactive imidazole-N-carboxylates for 
coupling to amine-containing ligands. 
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FIGURE 110.4 Alcohols react readily with succinimidyl chloroformate to produce a succinimidyl carbonate for 
coupling to amine-containing ligands. 

Heterocyclic aryl halides, such as cyanuric chloride [34, 35], react with free hydroxyl groups in polar 
solvents containing sodium carbonate, 40-50°C, 30 min. Amine-bearing ligands are immobilized to the 
surface in borate buffer, pH 9, 1-10 mg/ml, 5-10°C, 12-24 hr. 

Immobilization to Surface Carboxylic Acids 

Acids may be converted to activated leaving groups by reaction with carbodiimides [36]. The generated 
O-acylureas react with amines to produce amide linkages and can react with alcohols to produce ester 
linkages with acid or base catalysts. An undesirable competing reaction is urea rearrangement to nonre- 
active N-acylurea; this effect may be accelerated in aprotic polar solvents such as DMF but is reduced at 
low temperatures (0— 10°C) or by the addition of agents such as hydroxybenztriazole to convert the 
O-acylurea to benztriazole derivatives. Acids may also be activated with carbonyldiimidizole to produce 
easily amino- and alcoholyzed imidazolide intermediates; however, imidazolides are highly susceptible 
to hydrolysis and necessitate anhydrous conditions for activation and coupling [37]. 

Water-soluble carbodiimides such as ethyl(dimethylaminopropyl)-carbodiimide (EDC) can be used 
in either aqueous or organic media for the immobilization of ligands to produce bioactive substrates 
[20, 38, 39]. To reduce hydrolysis of the O-acylurea, conversion to more resistant succinimidyl esters can 
be performed using EDC and N-hydroxysuccinimide or its water-soluble sulfonate derivative, and base 
catalysts (Fig. 110.6). Alternatively, the acid may be converted directly to a succinimidyl ester via reaction 
with tetramethyl(succinimido)uronium tetrafluoroborate [40, 41]. The activation of surface-bound car- 
boxylic acids with EDC in organic media (ethanol, DMF, dioxane) is complete with 1-2 hr at 0-10°C; 

Hrv/Wr' - NH 2 -L ■ 

C— OH ► I C— O-R' ► I — C— 0~NH— L 

FIGURE 1 10.5 Bifunctional coupling agents may be used, e.g., to couple an amine-containing ligand to a hydroxyl- 
containing surface via a spacer. 
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FIGURE 1 10.6 Carboxylated surfaces can be achieved to succimydil esters to subsequently couple amine-containing 
ligands. 

in aqueous media at pH 4-6, 0-5°C, 0.5-2 hr. Amine-bearing ligands may be coupled onto the surface 
in buffered media, pH 7.5-9, 1 mg/ml, 2-24 hr. Since the reaction of carbodiimides with amines is slow 
compared to reaction with acids, activation and coupling may proceed simultaneously [42] by including 
amine-bearing ligands (1-10 mg/ml) with the EDC and allowing coupling to proceed in buffered media, 
pH 5-7, 0-10°C, 12-24 hr, or 25-30°C, 4-6 hr. Acid dehydration directly to succinimidyl esters using 
tetramethyl(succinimido)uronium tetrafluoroborate is performed in anhydrous conditions with equimo- 
lar tert-amino base, 2-4 hr, followed by coupling in aqueous organic cosolvents (water/DMF or 
water/dioxane), 1-10 mg/ml ligand, pH 8-9, 12-24 hr. 

Immobilization to Surface Amines 

Primary and secondary amine-containing surfaces may be reacted with homo- or heterobifunctional 
bridges (Fig. 110.7). Amines are more nucleophilic than alcohols, they generally do not require the 
addition of catalysts, and their addition is faster. These bridges may contain isocyanates [31, 33], isothio- 
cyanates [43], cyclic anhydrides [44], succinimidyl esters [45-47], or epoxides [48, 49]. 

Isocyanates add to amines with good efficiency but are susceptible to hydrolysis; epoxides and cyclic 
anhydrides are somewhat less reactive yet are still sensitive to hydrolysis. Hydrolysis-resistant diisothio- 
cyanates have been used for many years to label ligands with reporter molecules; however, the thiourea 
linkage may be hydrolytically labile (especially at lower pH) and may be unsuitable for investigations of 
cell-surface interactions. Succinimidyl esters, although not as resistant to hydrolysis as isothiocyanates, 
have very good reactivity to amines and form stable amide linkages. Hydrolysis is all these reagents is 
accelerated at higher pH (>9-10). 

Coupling to surface-bound amines is performed in organic conditions (DMF, DMSO, acetone) for 
1-3 hr. Coupling of amine-bearing ligands onto immobilized bridges is performed in buffered media, 
pH 8-10.5, 2-12, 1-10 mg/ml. Excess reagent can be displaced with buffered solutions of tris(hydroxy- 
methyl)aminomethane, aminoethanol, glycine, or mercaptoethanol, or hydrolysis. 

Bifunctional aldehydes, such as glutaraldehyde and formaldehyde, have been used classically as 
crosslinkers for purposes of immunohistochemistry and ultrastructural investigations. They have been 
used also to couple ligands onto amine-bearing substrates [50-52]. Hydrolysis of aldehydes is usually 
not a concern, since the hydrolysis product, alkyl hydrate, is reversible back to the carbonyl. Amines add 
to aldehydes to produce inline linkages over a wide range of pH (6-10). These Schiff bases are potentially 
hydrolytically labile; reductive animation can be performed with mild reducing agents such as sodium 
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FIGURE 1 10.7 Primary and secondary amine-containing surfaces may be reacted with homo- or heterobifunctional 
bridges. 
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FIGURE 110.8 Thiol-containing surfaces may be coupled to malcimide activated species, either heterobifunctional 
linkers or ligands. 

cyanoborohydride, pH 8-9, without substantial losses in ligand bioactivity [53]. Acetalization of poly- 
hydric alcohols may commence in the presence of Lewis acid catalysts followed by dehydrating the 
hemiacetal linkage, an acetal [54, 55]. The dehydration conditions (air-drying followed by 70-90°C, 2 
hr) may damage many biologic ligands. 

Alkyl halide-bearing surfaces can be coupled to amine-bearing ligands [49, 56, 57]; their reaction is 
slower, but they are resistant to hydrolysis. Ligands can be immobilized in buffered medium (pH 9-10), 
12-24 hr, 1-10 mg/ml, or in organic medium. Heat may be used to increase yields (60-80°C). 

Immobilization to Surface Thiols 

Thiols are more nucleophilic than amines or alcohols and may be selectively coupled in their presence 
at more neutral pH. Thiols can be reacted with reagents that are not reactive toward amines, such as 
homo- or heterobifunctional maleimide bridges [32, 58-60]. Surface-bound thiols react with maleimides 
in acetone, methanol, DMF, etc., for 1-2 hr, to produce a thioether linkage (Fig. 110.8). Thiol-bearing 
ligands can be immobilized to surface maleimides by incubating in mild buffered media, pH 5-7, 
12-24 hr, 1-10 mg/ml. More basic pH should be avoided as hydrolysis of the maleimide to nonreactive 
maleamic acid may occur. 

Photoimmobilization 

Surfaces can be modified using photoactivated heterobifunctional bridges in one of three schemes: ( 1 ) the 
bridge may be immobilized onto a functionalized surface, followed by photocoupling of the ligand onto 
the photoactivated group [58, 61]; (2) the reagent may be photocoupled onto the surface, followed by 
immobilization onto the free terminus [58, 62]; or (3) the ligand may be coupled to the reagent, followed 
by photoimmobilization to the surface [63, 64] . The photoactivatible group may be light-sensitive azides, 
benzophenones, diazinines, or acrylates; polymerization initiator-transfer agent-terminators [65] and 
plasma-deposited free radicals [64] have also been used. Photoactivation produces highly reactive radical 
intermediates that immobilize onto the surface via a nonspecific insertion of the radical into a carbon- 
carbon bond. Photolabile reagents may be coupled to amine- or thiol-bearing ligands using succinimidyl 
esters or maleimides. One utility of photoimmobilization is the ability to produce patterns of ligands on 
the surface by lithography. 



110.3 Surface Preparation 



Polymeric materials can be functionalized using plasma deposition or wet chemical methods to produce 
surfaces containing reactive groups for ligand bioconjugation. Plasma polymerization has been widely 
used to deposit numerous functional groups and alter the surface chemistry of many medically relevant 
polymers [66, 67]. Polyesters such as polyethyleneterephthalate can be partially saponified with aqueous 
base (5% NaOH, 100°C, 30 min) to produce surface-bound carboxylic acids [39], partially aminolyzed 
with diamines such as ethylenediamine (50% aqueous, 12 hr) to produce surface-bound amines [68] or 
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electrophilically substituted with formaldehyde (20% in 1 M acetic acid) to produce surface-bound 
alcohols [19]. 

Polyurethanes can be carboxylated via a bimolecular nucleophilic substitution [69]. Carbamate anions 
are prepared by abstraction of hydrogen from the urethane nitrogen at low temperatures to prevent chain 
cleavage (— 5°C), followed by coupling of cyclic lactones such as (3-propiolactone. Ligands are then 
immobilized onto the grafted carboxylic acids. 

Polytetrafluoroethylene can be functionalized with a number of reactive groups using wet chemical 
or photochemical methods. Fluoropolymers are reduced with concentrated benzion dianion solutions 
to form carbonaceous surface layers which can then be halogenated, hydroxylated, carboxylated, or 
aminated [70]. Photochemical modification of fluoropolymers is possible by incubating the polymer in 
solutions of alkoxides or thiolate anions and exposing to UV light [71]. The patterning of surfaces is 
possible using this technique in conjunction with photolithography. 

Glasses and oxidized polymers can be functionalized with bifunctional silanating reagents to generate 
surfaces containing alkyl halides, epoxides, amines, thiols, or carboxylic acids [19, 49, 59, 72]. The 
substrate must be thoroughly clean and free of any contaminating agents. Glass is soaked in strong acid 
or base for 30-60 min. Polymers are cleaned with plasma etching or with strong oxidizers such as chromic 
acid. Clean substrates are immersed in silane solutions (5-10% in acetone, toluene, or ethanol/water 
95%/5%) for 1 hr and cured 50-100°C, 2-4 hr, or at 25°C, 24 hr, for oxidation-sensitive silanes such as 
mercaptosilanes. Prepared surfaces stored under Ar are stable for several weeks. 

Metals such as gold can be functionalized via the chemisorption of self-assembled monolayers of 
alkanethiols [73-75]. Gold substrates, prepared by evaporating gold on chromium-primed silicon sub- 
strates, are immersed in organic solutions of alkanethiol (1-10 mM) for 5-60 min. The monolayer is 
adsorbed via a gold-sulfur bond; competitive displacement of the alkanethiol may occur [75], and it is 
unclear if these surfaces are applicable for use in reducing media. Similar substrates can be prepared 
from the chemisorption of carboxylic acids onto alumina [76]. Prepared substrates stored under Ar are 
stable for several months. 

Amine-bearing surfaces have been produced by the adsorption of biologically inert proteins such as 
albumin. Cells do not have adhesion receptors for albumin, and for this reason albumin has been used 
to passivate surfaces against cell adhesion. Bioactive ligands can be amino-immobilized onto the albumin- 
coated surfaces [45, 46, 48]. 

Functional polymers and copolymers containing alcohols, amines, alkyl halides, carboxylic acids, or 
other groups, can be synthesized, coated onto a surface, and used as the substrate for immobilization of 
bioactive ligands [77, 78]. Examples include poly(vinyl alcohol) [31, 33], chloromethyl polystyrene [56], 
aminopolystyrene [79, 80], poly(acrylic acid) [20, 38], polyallylamine [80], poly(maleic acid anhydride) 
[12], poly(carbodiimide) [81], and poly(succinimide) [82]. 



110.4 Ligand/Polymer Hybrids 



Hybrid copolymers may be synthesized in which one of the components is the biologically active ligand. 
These copolymers may then be coated onto a substrate or crosslinked into a three-dimensional network. 
Since the ligand is a component of the copolymer, no additional ligand immobilization may be necessary 
to produce bioactive substrates. 

Examples are available for particular cases of hybrid copolymers (54, 55, 82-87], including gamma- 
irradiated crosslinked poly(peptide) [86, 88], dialdehyde crosslinked poly( vinyl alcohol)-glycosaminoglycan 
[54], poly( amino acid-etherurethane) [47, 87], poly( amino acid-lactic acid) [89], poly( amino acid-carbon- 
ate) [90], poly(peptide-styrene) [84], and linear [83] or crosslinked [82] poly(glycoside-acrylamide). 

110.5 Determining Ligand Surface Densities 

The concentration of ligands immobilized upon a surface can be measured using radiolabeling, photo- 
chrome labeling, surface analysis, or gravimetry. Since most materials can support the nonspecific 
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adsorption of bioactive ligands, especially proteins, controls must be utilized to differentiate between 
covalent immobilization and physicochemical adsorption occurring during coupling. For example, 
ligands immobilized onto unactivated versus activated substrates give relative differences between non- 
specific adsorption and specific bioconjugation. 

The surface immobilization of radiolabeled ligands with markers such as 3 H, 35 S, or 125 I, can be followed 
to give information on the kinetics of coupling, the coupling capacity of the substrate, and the surface 
density of ligands. Densities on the order of pmol-fmol/cm 2 are detectable using radiolabeled molecules. 

The coupling capacity and density of immobilized acids, amines, and thiols can be evaluated using 
colorimetric procedures. Substrates can be incubated in solutions of Ellman's reagent [91, 92]; the 
absorbances of the reaction products give the surface density. Antigens which have been immobilized 
can be exposed to photochrome-labeled antibodies and surface concentrations calculated using standard 
enzyme immunosorbent assays [49, 93]. 

Verification of ligand immobilization maybe performed using a number of surface analysis techniques. 
Mass spectroscopy, x-ray photoelectron spectroscopy, and dynamic contact angle analysis can give infor- 
mation on the chemical composition of the substance's outmost layers. Changes in composition are 
indicative of modification. Ellipsometry can be used to gauge the thickness of overlapping surface layers; 
increases imply the presence of additional layers. Highly sensitive gravimetric balances, such as quartz 
crystal microbalances, can detect in situ changes in the mass of immobilized ligand in the nanogram 
range [65]. 

110.6 Applications of Immobilized Ligands 

Extracellular matrix proteins such as fibronectin, laminin, vitronectin and collagen, or adhesion mole- 
cules such as ICAM-1, VCAM-1, PCAM-1, and sialyl Lewis X, interact with cell surface receptors and 
mediate cell adhesion. The tripeptide adhesion sequence Arg-Gly-Asp (RGD) is a ubiquitous signal 
present in many cell adhesion proteins. It interacts with the integrin family of cell surface adhesion 
receptors, and comprises the best studied ligand-receptor pair [94-96]. In lieu of immobilizing complex 
multifunctional proteins for purposes of cell adhesion studies, synthetic RGD sequences have instead 
been immobilized onto many substrates as simplified models to understand various molecular aspects 
of cell adhesion phenomena. The following paragraphs cite examples of RGD-grafted substrates that 
have been used in biomedicine and bioengineering. 

The density of RGD necessary to mediate cell adhesion has been determined in a number of fashions. 
RGD-containing peptides and protein fragments have been physicochemically adsorbed onto tissue 
culture substrates [97, 98] or covalently bound to albumin-coated substrates [45, 46] to titrate the 
dependency of cell adhesion function upon RGD surface densities. To remove potential complications 
due to desorption of ligands or albumin, RGD has been covalently bound onto functionalized substrates. 
Immobilization also restricts the number of conformations the peptide may assume, helping to ensure 
that all the peptide is accessible to the cells. RGD has been immobilized onto silanated glasses by its 
amino [19, 99] and carboxyl [15] termini. The effects of RGD density on cell adhesion, spreading, and 
cytoskeletal organization was examined [99] using this well-defined system. Other peptides have been 
immobilized in identical fashion to determine if they influence cell physiology [100]. 

RGD peptides have been immobilized onto highly cell-resistant materials to ensure that the peptide 
is the only cell adhesion signal responsible for cell adhesion to diminish signals borne of nonspecifically 
adsorbed serum proteins. Hydrogels of polyacrylamide [101], poly(vinyl alcohol) [31] and poly (ethylene 
glycol) [85] and nonhydrogel networks of polyacrylate/poly( ethylene glycol) [102] have been grafted with 
RGD; these background materials were highly resistant to the adhesion of cells even in the presence of 
serum proteins, demonstrating that the RGD sequence was solely responsible for mediating cell adhesion. 

RGD-containing peptides have been immobilized onto medically relevant polymers in an effort to 
enhance their biocompatibilities by containing an adhered layer of viable cells. RGD-grafted surfaces can 
be more efficient in supporting the number and strength of cell adhesion by the peptide facilitating cell 
adhesion additionally to adsorbed adhesion proteins from the biological milieu. RGD has been conjugated 
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onto surfaces by means of photoimmobilization [103] and plasma glow discharge [64]. In an effort to 
promote cell adhesion onto biodegradable implants, RGD peptides have been covalenfly grafted onto 
poly( amino acid-lactic acid) copolymers [89]. In this manner, cells adherent on the degradable material 
can eventually obtain a completely natural environment. Self-assembled monolayers of biologic ligands 
have been immobilized onto gold substrates by adsorbing functionalized thiol-containing bridges fol- 
lowed by covalent grafting of the ligand [93] or by adsorbing alkanethiol-containing ligands [104]. It 
has been suggested [105] that RGD-containing peptides could be immobilized onto gold substrates in 
these manners to engineer highly defined surfaces for cell culture systems. 

These examples only partially illustrate the utility of ligand-grafted substrates for bioengineering and 
biomedicine. These substrates offer simplified models of basement membranes to elucidate mechanisms 
and requirements of cell adhesion. They have applications in biomedicine as biocompatible, cell-adhesive 
biomaterials for tissue engineering or for clinical implantation. 
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A common assumption in biomaterials research is that cellular interactions with natural and artificial 
surfaces are mediated through adsorbed proteins. Such diverse processes as thrombosis and hemostasis, 
hard and soft tissue healing, infection, and inflammation are each affected by protein adsorption to 
surfaces in vivo. Many in vitro diagnostic analyses, chromatographic separation techniques, and genetic 
engineering processes also involve protein adsorption at solid-liquid interfaces. 

The adsorption of fibrinogen, a prevalent blood plasma protein, has been studied extensively because 
of its role in blood coagulation and thrombosis, as has the adsorption of albumin, because it is thought 
to inhibit the adhesion of blood platelets [Young et al., 1982]. The amount of protein adsorbed to a 
substrate is best measured directly using radiolabeled proteins, whereas the thickness of an adsorbed 
protein film can be calculated from ellipsometric measurements. Further, the importance of the state of 
an adsorbed protein in mediating cellular interactions is now becoming evident. Molecularly sensitive 
indirect measurement techniques, e.g., circular dichroism (CD), differential scanning calorimetry (DSC), 
enzyme-linked immunosorbent assay (ELISA), Fourier transform infrared spectroscopy/attenuated total 
reflectance (FTIR/ATR), radio-immunoassay (RIA), or total internal reflection fluorescence (TIRF), can 
be used to characterize the conformation and organization of adsorbed proteins. (The amount of protein 
adsorbed to a substrate is best measured directly using radiolabeled proteins; the thickness of an adsorbed 
protein film can be calculated from ellipsometric measurements.) Highly specific monoclonal (MAb) 
(against specific protein epitopes) [Shiba et al., 1991] and polyclonal (PAb) (against multiple epitopes) 
[Lindon et al., 1986] antibodies provide direct probes of adsorbed protein conformation and organiza- 
tion. Thus, cellular responses can be compared not only with the amounts of proteins adsorbed but also 
the organization of the proteins on the surface of the substrate. 

Whereas previous studies confirmed roles for adsorbed proteins in subsequent cell-surface interactions, 
much current research aims to better understand cell-protein-surface interactions on a molecular level. 
Recently, peptide sequences contained within the cell binding domains of adhesive proteins have been 
identified and characterized, synthesized, and demonstrated to bind cellular receptors known as integrins 
[Ruoslahti, 1991; Yamada, 1991]. Current and potential applications range from selective or enhanced 
in vitro cell culture to selective in vivo cellular responses such as endothelialization in the absence of 
inflammation, infection, or thrombosis [Hubbell et al., 1991]. 
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111.2 Fundamentals of Protein Adsorption 

Detailed and comprehensive reviews of protein adsorption have been published [Andrade, 1985; Andrade 
& Hlady, 1986; Horbett, 1982; Norde & Lyklema, 1991]. A thorough understanding of key principles will 
prove helpful in critically evaluating reports in literature. Particularly important concepts are protein 
structure and heterogeneity, factors that dramatically affect the thermodynamics and kinetics of adsorp- 
tion, reversibility of adsorption, and the dynamics of multi-component adsorption. 

A protein is a complex molecule consisting of amino acid copolymer (polypeptide) chains that interact 
with each other to give the molecule a three-dimensional structure. Importantly, each amino acid in the 
polymer contributes to the chemical and physical properties of the protein. A dramatic example of this 
is the oxygen-carrying protein, hemoglobin, which consists of four polypeptide chains denoted rx 2 (3 2 . A 
single amino acid substitution in the 146 amino acid (3-chain results in the conversion of normal 
hemoglobin (HbA) to sickle-cell hemoglobin (HbS) and underlies the serious consequences of sickle- 
cell disease [Stryer, 1995]. Protein structure and function are relevant to protein adsorption and have 
been described on four different scales or orders [Andrade & Hlady, 1986]. Primary structure refers to 
the sequence and number of amino acids in a copolymer chain. The 20 amino acids that are polymerized 
to make proteins are termed residues. Of these, 8 have non-polar side chains, 7 have neutral polarity 
side chains, and 5 have charged polar side chains [Stryer, 1995] . Secondary structure results from hydrogen 
bonding associated with the amide linkages in the polymer chain backbone to form structures such as 
the CC-helix and (3-pleated sheet. Tertiary structure results from associations within chains, including 
hydrogen bonding, ionic and hydrophobic interactions, salt bridges, and disulfide bonds and dictates the 
three-dimensional structure adopted by protein molecules. Quaternary structure results from associations 
between chains. Many blood proteins contain polar, non-polar, and charged residues. In polar media 
such as buffered saline or blood plasma, hydrophilic residues tend to self-associate (often at the outer, 
water-contacting surface of the protein), as do hydrophobic residues (often "inside" the protein). This 
results in distinct domains (Fig. 111.1) that dictate higher order protein structure. 
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FIGURE 111.1 Schematic view of protein interacting with a well characterized surface. The protein has a number 
of surface domains with hydrophobic, charged, and polar character. The solid surface has a similar domain like 
character (Andrade & Hlady, 1986.) 
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When a single, static protein solution contacts a surface, the rate of adsorption depends upon transport 
of the protein from the bulk to the surface. Andrade and Hlady [1986] identified four primary transport 
mechanisms, namely diffusion, thermal convection, flow convection, and coupled convection-diffusion. 
In isothermal, parallel laminar flow or static systems, protein transport to the interface occurs exclusively 
by diffusion. In turbulent or stirred systems, each of the four transport modes can be significant. 

When adsorption is reaction limited, the net rate of adsorption can sometimes be described by the 
classic Langmuir theory of gas adsorption [Smith, 1981]: 



■ k A *C b *(l-®)-k D *® (111.1) 



where r A is the net adsorption rate, k A is the adsorption rate constant, C b is the bulk concentration of 
the protein in solution, is fractional surface coverage, and k D is the desorption rate constant. At 
equilibrium, the net rate of adsorption, r A is zero, and can be calculated from Eq. (111.1) as the 
Langmuir adsorption isotherm, 

0= K * Cb (111.2) 

l+K*C b 

where K= k A /k D . This model assumes reversible monolayer adsorption, no conformational changes upon 
adsorption, and no interactions between adsorbed molecules. It is most applicable to dilute solutions 
and non-hydrophobic substrates. When adsorption is diffusion limited, the initial rate of adsorption is 
equivalent to the rate of diffusion, described mathematically as [Andrade & Hlady, 1986] 



:r e = 2*C r *. 



— (111.3) 

Kt 



where T> is the diffusivity of the protein, and t is time. 

Protein adsorption to hydrophobic substrates differs from that to hydrophilic materials. The primary 
driving force for adsorption to hydrophilic substrates is often enthalpic, whereas that to hydrophobic 
substrates is entropic [Norde, 1986]. Water near a hydrophobic surface tends to hydrogen bond to 
neighboring water molecules, resulting in a highly ordered water structure [Andrade & Hlady, 1986]. 
Disruption of this structure (dehydration) by adsorption of a protein to the surface increases the entropy 
of the system and is therefore thermodynamically favored. As a result, adsorption to hydrophilic substrates 
is generally reversible, whereas that to hydrophobic substrates is not. Denaturation of the adsorbed protein 
by hydrophobic interactions with the substrate can also contribute to irreversible adsorption [Feng & 
Andrade, 1994]. 

The amount of a specific protein adsorbed to a substrate can be measured directly if the protein is 
radiolabeled. Gamma emitting isotopes are preferred because their signal is directly proportional to the 
amount of protein present. Radioisotopes of iodine ( 125 I, 129 I, and 131 I) are commonly used because iodine 
readily attaches to tyrosine residues [Macfarlane, 1958]. An 125 I monochloride radiolabeling technique 
has been published [Horbett, 1986] but others, such as those using chloramine-T or lactoperoxidase, can 
also be used. 

If neither the 125 I-protein nor the unlabeled protein preferentially adsorbs to the substrate at the expense 
of the other, then the amount of that protein adsorbed to a substrate from multi-component media such 
as plasma can be measured by adding a small amount of the 125 Tprotein to the adsorption medium. Studies 
have shown that 125 I-fibrinogen generally behaves like its unlabeled analog [Horbett, 1981]. The total 
amount of the protein adsorbed (both unlabeled and 12 T-protein) is calculated by dividing the measured 
radioactivity by the specific activity of the protein in the medium. (Specific activity is determined by 
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dividing the gamma activity in a measured aliquot of the adsorption medium by the total amount of 
that protein, both labeled and unlabeled, in the aliquot.) To verify that neither labeled nor unlabeled 
fibrinogen preferentially adsorbs to the substrates, the specific activity of the protein in a small aliquot 
of the plasma dilution from which adsorption was maximum should be increased lOx, and adsorption 
from that dilution again measured. Changes in calculated adsorption values should be attributable only 
to the variability in the data and differences in the signal to noise ratio, not the ratio of labeled to unlabeled 
fibrinogen in the plasma. Similarly, to verify that absorption into the sample materials of free 125 I in the 
buffer is not significant, adsorption from dilute plasma to which only 0.01M unlabeled free iodide is 
added should be measured. Sample calculations for measuring protein adsorption using this technique 
are illustrated in the next section. 

The use of radioisotopes for the measurement of protein adsorption to surfaces offers a significant, 
though underused, advantage compared with other techniques. Because the radioisotopes, e.g., 12 T and 
131 I, emit unique energy spectra, i.e., their peak radiation emissions occur at distinct energies, one can 
simultaneously measure the adsorption of two different proteins from a protein mixture. By labeling one 
protein with 125 I and the other with 131 I, the adsorption behavior of both can be determined in one 
experiment. Modern gamma counters come with preset energy windows specific to common isotopes, 
making such measurements routine. In the absence of such an instrument, one can still perform dual- 
labeling experiments by exploiting the fact that the half-lives of the isotopes differ. For example, the half- 
life of 125 I is sixty days whereas that of 131 I is only eight days. After measuring the 131 I emission immediately 
after the experiment, one can allow the radioactivity associated with it to decay to background levels (ten 
half-lives or 80 days for 131 I) and then measure the signal associated solely with 125 I [Dewanjee, 1992]. 

Indirect methods are also used to study proteins adsorbed to a substrate. ELISA and RIA analytical 
techniques exploit specific antibody-antigen interactions as follows. Antibodies against specific epitopes 
of an adsorbed protein are either conjugated to an enzyme (ELISA), or radiolabeled (RIA). Substrates 
are first incubated with the medium, then with a solution containing the antibody or antibody conjugate. 
In the case of ELISA, the substrates are subsequently incubated with substrate solution. As the substrate 
is converted to product, the color of the solution changes in proportion to the amount of bound antibody 
present and is measured spectrophotometrically. However, extensive calibration is required to quantify 
results. In the case of RIA, the radioactivity originating from the radiolabeled antibody (bound to the 
protein adsorbed to the substrate) is measured and the amount of antibody bound calculated. With both 
methods, the relative amount of the adsorbed protein to which the antibody has bound, rather than the 
total amount of protein adsorbed, is measured. Antibody binding is a function of not only the amount 
of protein adsorbed, but also the particular antibody used, total protein surface loading, and protein 
residence time. Thus, although antibody techniques provide direct probes of adsorbed protein confor- 
mation and organization, such measurements do not necessarily reflect the absolute amounts of protein 
adsorbed. Other indirect methods used to study adsorbed proteins include ellipsometry, electron micros- 
copy, high performance liquid chromatography (HPLC), and staining techniques. 

111.3 Example Calculations and Applications 
of Protein Adsorption 

The following example illustrates how radiolabeled proteins are used to measure the amount of protein 
adsorbed to a substrate. The fibrinogen concentration in 10 ml plasma is determined to be 5.00 mg/ml 
by measuring the light absorbance at 280 nm of a redissolved, thrombin-induced clot [Ratnoff & Menzie, 
1950]. The concentration and specific activity of a 10 ul aliquot of 125 Tfibrinogen are 1.00 mg/ml and 
10 9 cpm/ug, respectively. Fibrinogen adsorption from dilute plasma to a series of polymer samples, each 
having 1.00 cm 2 total surface area (counting both sides of the sample) is to be measured. Based upon 
reports in literature, maximal adsorption of 250 ng/cm 2 is expected. The background signal in the gamma 
radiation counter is 25 cpm. To achieve a maximum signal/noise ratio of 10, the specific activity of 
fibrinogen in the plasma should be, 
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(signal/ noise)* noise 10*25 cpm 10 3 ng , , , 

sp.ac. = - '- = -. - * - = 10 3 cpm/Hg (111.4) 

mass adsorbed 25 ng cm 2 * 1.00 cm 2 \lg 

The volume of 125 I-fibrinogen solution to be added to the plasma to obtain 10 3 cpm/g specific activity 
(neglecting the mass of 12 T-fibrinogen added) is calculated as, 

10 i cpmfllg*W i llg/mg*5.00mg/ml*10ml = 5Q (1115) 

10 9 cpm/mg* 1.00 mg / ml* ml/ 10 3 |jJ 

Addition of 50 JJ.1 125 I-fibrinogen solution should increase the total fibrinogen concentration in the plasma 
by only a small fraction, 

50al*ml/l0 3 ill* 1.00 mg/ ml , 

— ^ 1- ^ -. — ^ = 10 3 (111.6) 

10 m/*5.00 mg/ ml 

To determine the amount of protein adsorbed to the substrate, the radioactivity of samples incubated 
with plasma is measured and compared with the specific activity of the protein in the plasma. In this 
example, if the amount of radioactivity retained by a sample is measured to be 137 cpm, then the mass 
of fibrinogen adsorbed is calculated as, 

137 cpm/ sample— 25 cpm background)*^ ng \lg , 

^ - -/ =112ng/cm 2 (111.7) 

10 cpmj\lg fibrinogen* 1.00 cm I sample 

Adsorption of proteins to polymeric substrates is measured because adsorbed proteins influence 
cellular processes. Adsorbed albumin is proposed to favor biocompatibility, whereas adsorbed fibrinogen 
is proposed to discourage biocompatibility because of its role in mediating initial adhesion of blood 
platelets [Young et al., 1982]. This simplified view inadequately describes biocompatibility in vivo for 
several reasons. First, the relationships between processes involved in thrombosis, hemostasis, inflam- 
mation, and healing (e.g., adhesion of platelets, fibroblasts, white blood cells, endothelial cells) and long 
term biocompatibility remain mostly unknown. For example, Sakariassen and colleagues [1979] proposed 
that exclusively adsorbed von Willebrand factor (vWF) mediates platelet adhesion to vascular subendot- 
helial structures, yet although some people that lack serum vWF in their blood exhibit bleeding disorders, 
others remain asymptomatic. Second, biological processes that do not require fibrinogen mediated cell 
adhesion (e.g., contact activation [Kaplan, 1978], complement activation [Chenowith, 1988]) are also 
related to material biocompatibility. Third, biological factors and serum proteins other than fibrinogen 
and albumin (e.g., vWF, fibronectin, vitronectin, laminin) significantly affect cellular processes. Fourth, 
the reactivity of adsorbed proteins depends upon their organization upon the substrate. 

Studies of fibrinogen adsorption in vitro illustrate the dynamic nature of protein adsorption from 
plasma. Vroman and Adams [1969] reported that oxidized silicon and anodized tantalum incubated 2s 
with plasma bind fibrinogen anti-serum, whereas the same materials incubated 25s with plasma do not. 
Ellipsometric measurements indicated that the observed decrease in antibody binding was not due to 
loss of protein. Brash and ten Hove [1984] and Horbett [1984] reported that maximal equilibrium 
fibrinogen adsorption to different materials occurred from intermediate dilutions of plasma. The adsorp- 
tion maximum is sometimes called a Vroman peak, which describes the shape of the adsorption versus 
log (plasma concentration) curve referred to as an adsorption isotherm. Both the location and the 
magnitude of the peak depend upon the surface chemistry of the substrate. (For this reason, it is wise 
to fully characterize any substrate prior to measuring protein adsorption. Electron spectroscopy for 
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chemical analysis, ESCA [Ratner & McElroy, 1986], and secondary ion mass spectroscopy, SIMS 
[Andrade, 1985], are often appropriate.) 

Wojciechowski and colleagues [1986] reported that at short contact times, adsorption was greatest 
from undiluted plasma. As contact time increased, the plasma concentration at which adsorption was 
greatest decreased. The unusual observed adsorption behavior occurs because fibrinogen adsorption is 
driven initially by mass action, i.e., a gradient between surface and bulk concentration. However, as 
coverage of the surface increases, bulk proteins must compete for surface binding sites. The composition 
of the adsorbed layer continues to change as proteins of higher surface activity displace adsorbed proteins 
of lower surface activity [Horbett, 1984]. Vroman and colleagues [1980] called this process conversion 
of the fibrinogen layer, and proposed that fibrinogen adsorbed at early time is at later time displaced by 
other plasma proteins, possibly high molecular weight kininogen (HMWK). Because Vroman pioneered 
much of this work, these phenomena are collectively referred to as the Vroman effect [Slack & Horbett, 
1995]. This principle is most applicable to hydrophilic substrates and is commonly used in biocompat- 
ibility studies to vary the composition of the protein layer adsorbed from plasma in a controlled manner. 

Slack and Horbett [1989] postulated the existence of two distinct types of adsorbed fibrinogen mol- 
ecules; displaceable and non-displaceable. Protein adsorption from plasma was modeled as competitive 
adsorption from a binary solution of fibrinogen and a hypothetical protein H (representing all other 
plasma components). In this model, protein H adsorbs to unoccupied surface sites in a non-displaceable 
state, and fibrinogen molecules first adsorb to unoccupied surface sites in a displaceable state, then are 
displaced by protein H, or spread to become resistant to displacement. The latter process is referred to 
as fibrinogen transition. Neglecting desorption and mass transfer limitations, rate equations for surface 
coverage by protein H, displaceable fibrinogen, and non-displaceable fibrinogen were solved simulta- 
neously for a surface initially free of adsorbate. The analytical solution is given by Eq. (111.8) and 
Eq. (111.9). 



® 2 (t) = (^k 2 /r^r 2 y 



Hl-^-tl 



(111. 



(111.9) 



where ® v 2 , and 3 are fractional coverages by displaceable fibrinogen, non-displaceable fibrinogen, 
and the hypothetical protein, respectively, kj is the fibrinogen adsorption rate constant, k 2 is the fibrinogen 
transition rate constant, k, is the hypothetical protein adsorption rate constant, k 4 is the fibrinogen 
displacement rate constant, C F is the bulk concentration of fibrinogen, C H is the bulk concentration of 
the hypothetical protein in solution, and (3, r p and r 2 are constants related to rate constants and bulk 
concentrations. This model predicts maximal fibrinogen coverage at intermediate adsorption time 
(Fig. 111.2), consistent with reported experimental results. Surface exclusion and molecular mobility 
arguments have also been proposed to explain the Vroman effect [Willems et al., 1991]. 

Changes in protein conformation upon adsorption have been inferred from a variety of indirect 
measurements. For example, Castillo and colleagues [1984] inferred substrate, adsorption time, and 
residence time dependent conformational changes in human serum albumin (HSA) adsorbed to different 
hydrogels based upon changes in FTIR/ATR and CD spectra. Specific conformational changes, i.e., 
decreased a-helix and increased random coil and (3-pleated sheet content upon adsorption were proposed. 
Similarly, Norde and colleagues [1986] reported lower a-helix content in HSA first adsorbed to, then 
desorbed from different substrates compared with native HSA. Castillo and colleagues [1985] also 
reported that lysozyme became increasingly denatured with increased adsorption time and residence 
time, and that denatured lysozyme adsorbed irreversibly to contact lens materials. De Baillou and 
colleagues [1984] proposed conformational changes in fibrinogen upon adsorption to glass based upon 
DSC measurements. Similarly, Feng and Andrade [1994] proposed that low temperature isotropic (LTI) 
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FIGURE 111.2 Time course of fibrinogen adsorption to a polymeric substrate as predicted by the model proposed 
by Slack and Horbett [1989]. 

carbon significantly denatures adsorbed proteins through hydrophobic interactions. Rainbow and col- 
leagues [1987] proposed conformational changes in albumin upon adsorption to quartz, based upon 
changes in fluorescence lifetimes calculated from TIRF measurements. Finally, investigators noted that 
certain enzymes, following their adsorption to solid surfaces, lose a considerable amount of their enzy- 
matic activity compared with enzymes in solution, suggesting that a surface-induced structural rear- 
rangement had occurred [Sandwick & Schray, 1987; Sandwick & Schray, 1988]. 

Whereas indirect methods provide evidence for changes in protein organization upon adsorption, 
antibodies against specific protein epitopes provide direct evidence, as antibody-binding measurements 
directly reflect the availability of different protein epitopes. Proteins at interfaces can undergo both 
covalent (e.g., conversion of fibrinogen to fibrin) and non-covalent (e.g., denaturation, change in con- 
formation, change in epitope accessibility) organizational changes. Therefore, epitopes inaccessible in 
solution might become available upon adsorption of the protein, and consequently, the biological activity 
of the adsorbed protein may differ from that of the same protein in solution. For example, receptor- 
induced binding site (RIBS) anti-fibrinogens bind to adsorbed but not free fibrinogen molecules [Zama- 
rron et al., 1990]. Because an MAb binds to a single protein epitope, whereas a PAb binds to multiple 
epitopes on the protein, MAbs rather than PAbs should be more sensitive to such changes. Also, epitopes 
available at low surface loadings may become unavailable at higher loadings due to steric hindrance by 
proteins adsorbed to neighboring surface sites. 

Horbett and Lew [1994] used the Vroman effect principle to maximize the amount of fibrinogen 
adsorbed from plasma to different polymers, and then measured the binding of different MAbs directed 
against various epitopes of fibrinogen. Binding was reported to be substrate-dependent and, with some 
MAbs, changed with protein residence time. Thus, different fibrinogen epitopes become more or less 
available as the adsorbed molecule reorganizes upon the surface. Soria and colleagues [1985] demon- 
strated that a MAb directed against fragment D of the fibrinogen molecule, which did not bind to the 
molecule in solution, did bind to it following its adsorption to a surface. Although this method cannot 
distinguish between changes in protein conformation (higher order structure) and changes in surface 
orientation (e.g., rotation, steric effects), several authors reported that with increased adsorption time 
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or residence time, adsorbed proteins became less readily displaced by plasma or surfactant eluting agents 
[Balasubramanian et al., In Press; Bohnert & Horbett, 1986; Chinn et al, 1992; Rapoza & Horbett, 1990; 
Slack & Horbett, 1992]. These results suggest that post-adsorptive transitions in adsorbed proteins are 
primarily structural. 

Protein organization is also a function of surface loading. Chinn and colleagues [1992] and Rapoza 
and Horbett [1990] used the Vroman effect principle to vary both the amounts of fibrinogen and total 
protein adsorbed from plasma to different polymers. Fibrinogen retention by all substrates was greater 
when the protein was adsorbed from more dilute than from less dilute plasma. This suggests that at 
higher total protein loadings, each fibrinogen molecule directly contacts the surface at fewer points 
(because individual molecules compete for binding sites), and a greater fraction of molecules is displace- 
able. Conversely, at lower total protein loadings, individual molecules compete less for binding sites and 
are not hindered from reorganizing on the surface. Because each molecule directly contacts the surface 
at more points, a greater fraction of adsorbed molecules is non-displaceable. Pettit and colleagues [1994] 
reported a negative correlation, independent of substrate, between anti-fibronectin binding (normalized 
to the amount of fibronectin adsorbed), and the amount of fibronectin adsorbed. They proposed that 
the conformation or orientation of the adsorbed fibronectin molecule favors exposure of the cell binding 
domain at lower rather than higher fibronectin surface concentrations. 

Although the implications of protein transitions in long-term in vivo biocompatibility remain largely 
unknown, changes in the states of adsorbed proteins are related to cellular interactions. More platelets 
adhered to glass first contacted 5s with plasma than 3 min with plasma [Zucker & Vroman, 1969] . Platelet 
adhesion in vitro to polymers upon which the Vroman effect principle was used to vary the composition 
of adsorbed protein layer was reported related not to total fibrinogen binding, but to anti-fibrinogen 
binding [Lindon et al., 1986; Shiba et al., 1991], as well as the fraction of adsorbed protein that can be 
eluted by surfactant such as sodium dodecyl sulfate (SDS) [Chinn et al., 1991]. 

111.4 Summary, Conclusions, and Directions 

Clearly, adsorbed proteins affect biocompatibility in ways that are not entirely understood. Fibrinogen 
has been extensively studied because blood platelets involved in thrombosis and hemostasis have a 
receptor for this protein [Phillips et al., 1988]. Adsorbed fibrinogen is often proposed to discourage 
biocompatibility, but this view does not consider that adsorbed proteins exist in different states, depend- 
ing upon adsorption conditions, residence time, and substrate. Evidence suggests that fibrinogen 
adsorbed from blood to substrates upon which the protein readily denatures, e.g., LTI carbon, may in 
fact promote biocompatibility. Because it is the organization and not the amount of an adsorbed protein 
that determines its biological activity, what happens to proteins after they adsorb to the substrate must 
be determined to properly evaluate the biocompatibility of a material. Further, a material might be made 
blood compatible if protein organization can be controlled such that the cell binding epitopes become 
unavailable for cell binding [Horbett et al., 1994]. Much current research aims to understand the rela- 
tionship between the states of adsorbed proteins and cell-protein-surface interactions. 

Fundamental to better understanding of material biocompatibility is understanding the importance 
of protein and surface structure and heterogeneity in determining the organization of proteins at the 
solid-liquid interface, the dynamics of multi-component adsorption from complex media, e.g., the 
Vroman effect, and the significance of post-adsorptive events and subsequent cellular interactions. MAbs 
against specific protein epitopes provide direct evidence of changes in the states of adsorbed proteins, 
and indirect methods provide corroborative evidence. Molecular imaging techniques such as atomic 
(AFM), lateral force (LFM), and scanning tunneling (STM) microscopies have been adapted to aqueous 
systems to better define the states of adsorbed proteins with limited success [Sit et al., 1998], as have 
surface analysis techniques such as surface-matrix assisted laser desorption ionization (MALDI) [King- 
shott etal, 1998]. 



© 2000 by CRC Press LLC 



Identification and characterization of the cell binding domains of adhesive proteins has led to better 
understanding of cell adhesion at the molecular level. Pierschbacher and colleagues [1981] used mono- 
clonal antibodies and proteolytic fragments of fibronectin to identify the location of the cell attachment 
site of the molecule. Subsequently, residue sequences within the cell binding domains of other adhesive 
proteins were identified as summarized by Yamada [1991]. The Arg-Gly-Asp (RGD) amino acid sequence 
first isolated from fibronectin was later found present within vitronectin, osteopontin, collagens, throm- 
bospondin, fibrinogen, and vWF [Ruoslahti & Pierschbacher, 1987]. Different adhesive peptides exhibit 
cell line dependent biological activity in vitro, but relatively few in vivo studies of adhesive peptides have 
been reported. 

Haverstick and colleagues [1985] reported that addition of RGD-containing peptides to protein-free 
platelet suspension inhibited thrombin induced platelet aggregation, as well as platelet adhesion in vitro 
to fibronectin, fibrinogen, and vWF coated polystyrene. These results suggest that binding of the peptide 
to the platelet renders the platelets receptor for the proteins unavailable for further binding. Similarly, 
Hanson and Harker [1988] used MAbs against the platelet glycoprotein Ilb/IIIa (fibrinogen binding) 
complex to prevent thrombus formation upon Dacron vascular grafts placed within a chronic baboon 
AV shunt. Controlled release of either MAbs or adhesive peptides at the site of medical device implant 
might allow localized control of thrombosis. Locally administered adhesive peptides might also be used 
to selectively control cell behavior, e.g., Ruoslahti [1992] used RGD containing peptides to inhibit tumor 
invasion in vitro and dissemination in vivo. A recently developed drug delivery system offers great promise 
with respect to these therapeutic interventions [Markou et al., 1996; Markou et al., 1998]. 

Alternatively, adhesive peptides can be used to promote cell proliferation. Hubbell and colleagues 
[1991] reported that immobilization of different adhesive peptides resulted in selective cell response 
in vitro. Whereas the immobilized RGD and YIGSR peptides both enhanced spreading of human foreskin 
fibroblasts, human vascular smooth muscle cells, and human vascular endothelial cells, immobilization 
of REDV enhanced spreading of only endothelial cells. If this concept can be applied in vivo, then 
endothelialization in the absence of inflammation, infection, or thrombosis might be achieved. 

Defining Terms 

Conformation: Higher order protein structure that describes the spatial relationship between the amino 

acid chains that a protein comprises. 
Epitope: Particular regions of a protein to which an antibody or cell can bind. 
Hemostasis: Mechanism by which damaged blood vessels are repaired without compromising normal 

blood flow. 
Integrin: Cellular transmembrane protein that acts as a receptor for adhesive extracellular matrix 

proteins such as fibronectin. The tripeptide RGD is the sequence recognized by many integrins. 
Organization: The manner in which a protein resides upon a surface, in particular, the existence, 

arrangement, and availability of different protein epitopes. 
Plasma concentration: Not the concentration of total protein in the plasma, but rather the volume 

fraction of plasma in the adsorption medium when protein adsorption from different dilutions of 

plasma is measured. 
Post-adsorptive transitions: Changes in protein conformation and organization that occur when 

adsorbed proteins reside upon a surface. 
Residue: The individual amino acids of a peptide or protein. 

Specific activity: The amount of radioactivity detected per unit mass of a specific protein. 
State: The reactive state of an adsorbed protein as determined by its conformation and organization. 
Thrombosis: Formation of plug comprising blood platelets and fibrin that stops blood flow through 

damaged blood vessels. Embolized thrombus refers to a plug that has detached from the wound 

site and entered the circulation. 
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Vroman effect: Collective term describing ( 1 ) maximal adsorption of a specific protein from multi- 
component medium at early time, (2) maximal equilibrium adsorption from intermediate dilution, 
and (3) decrease with increased adsorption time in the plasma concentration at which adsorption 
is maximum. 

Vroman peak: The adsorption maximum in the adsorption versus log (plasma concentration) curve 
when protein adsorption from different dilutions of plasma is measured. 
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The American Society for Artificial and Internal Organs (ASAIO) (P.O. Box C, Boca Raton, FL 33429- 
0468, Web address http://www.asaio.com) publishes original articles in ASAIO J through Lippincott-Raven 
Publishers (227 East Washington Square, Philadelphia, PA 19106-3780) and meeting transactions in 
Trans. Am. Soc. Artif Intern. Organs. 

The Society for Biomaterials (SFB) (6518 Walker St., Ste. 150, Minneapolis, MN 55426-4215, Web 
address http://www.biomaterials.org) publishes original articles in both /. Biomed. Mater. Res. and 
/. Biomed. Mater. Res. Appl. Biomater. through John Wiley and Sons, Inc. (605 Third Ave., New York, NY 
10158) and meeting transactions in Trans. Soc. Biomaterials. The /. Colloid Interface Sci. often contains 
articles related to protein-surface interactions as well. 
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The American Association for the Advancement of Science (AAAS) (1200 New York Ave. NW, Wash- 
ington, D.C. 20002, Web address http://www.aaas.org) publishes original articles in science, and often 
contains excellent review articles and very current developments in protein research. Nature, published 
through Macmillan Magazines (Porters South, 4 Crinan St., London Nl 9XW), is similar in content but 
provides a decidedly European perspective. 
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include: 1982. SL Cooper, NA Peppas (eds.), Biomaterials: Interfacial Phenomena and Applications, ACS 
Advances in Chemistry Series, vol 199, Washington, D.C, American Chemical Society; 1987. TA Horbett 
and JL Brash (eds.), Proteins at Interfaces: Physicochemical and Biochemical Studies, ACS Symposium 
Series, vol 343, Washington, D.C, American Chemical Society; and 1993. SM Factor (ed.), Cardiovascular 
Biomaterials And Biocompatibility. Cardiovasc. Path. 2(3) Suppl. 
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A significant challenge in tissue engineering is to take a biomaterial and process it into a useful form for a 
specific application. All devices for tissue engineering transplant cells and/or induce the ingrowth of desirable 
cell types from the host organism. The device must provide sufficient mechanical support to maintain a 
space for tissue to form or serve as a barrier to undesirable interactions. Also, the device can be designed 
to provide these functions for a defined period before biodegradation occurs or on a permanent basis. 

Generally speaking, devices can be broken down into two types. Immunoprotective devices contain 
semipermeable membranes that prevent elements of the host immune system (e.g., IgG antibodies and 
lymphocytes) from entering the device. In contrast, open devices have large pores (>10 LLm) and allow 
free transport of molecules and cells between the host tissue and transplanted cells. These latter devices 
are utilized to engineer a tissue that is completely integrated with the host tissue. Both types of devices 
can range in size from microns to centimeters or beyond, although the larger sizes are usually repetitions 
on the structure found at the scale of hundreds of microns. 

A fundamental question in designing a device is whether to use synthetic or natural materials. Synthetic 
materials (e.g., organic polymers) can be easily processed into various structures and can be produced 
cheaply and reproducibly; it also is possible to tightly control various properties such as the mechanical 
strength, hydrophobicity, and degradation rate of synthetic materials. Whereas natural materials (e.g., 
collagen) sometimes exhibit a limited range of physical properties and can be difficult to isolate and 
process, they do have specific biologic activity. In addition, these molecules generally do not elicit 
unfavorable host tissue responses, a condition which is typically taken to indicate that a material is 
biocompatible. Some synthetic polymers, in contrast, can elicit a long-term inflammatory response from 
the host tissue [Bostman, 1991]. 

A significant challenge in fabricating devices is either to develop processing techniques for natural biom- 
aterials that allow reproducible fabrication on a large-scale basis [Cavallaro et al., 1994] or to develop materials 
that combine the advantages of synthetic materials with the biologic activity of natural biomaterials [Barrera 
et al., 1993; Massia & Hubbell, 1991]. Computer-aided-design-computer-aided-manufacturing (CAD-CAM) 
technology may possibly be employed in the future to custom-fit devices with complex structures to patients. 
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112.1 Fundamentals 

The interaction of the host tissue with the device and transplanted cells can be controlled by both the 
geometry of the device and the internal structure. The number of inflammatory cells and cellular enzyme 
activity around implanted polymeric devices has been found to depend on the geometry of the device 
[Matlaga et al., 1976], with device geometries that contain sharp angles provoking the greatest response. 
The surface geometry, or microstructure, of implanted polymer devices also has been found to affect the 
types and activities of acute inflammatory cells recruited to the device as well as the formation of a fibrous 
capsule [Taylor & Gibbons, 1983]. 

The pore structure of a device dictates the interaction of the device and transplanted cells with the 
host tissue. The pore structure is determined by the size, size distribution, and continuity of the individual 
pores within the device. Porous materials are typically defined as microporous (pore diameter d < 2 nm), 
mesoporous (2 nm < d < 50 nm), or macroporous (d > 50 nm) [Schaeffer, 1994]. Only small molecules 
(e.g., gases) are capable of penetrating microporous materials. Mesoporous materials allow transport of 
larger molecules, such as small proteins, but transport of large proteins and cells is prevented. 
Macroporous materials allow free transport of large molecules, and, if the pores are large enough {d > 
10 4 nm), cells are capable of migrating through the pores of the device. The proper design of a device 
can allow desirable signals (e.g., a rise in serum sugar concentration) to be passed to transplanted cells 
while excluding molecular or cellular signals which would promote rejections of transplanted cells (e.g., 
IgG protein). 

Fibrovascular tissue will invade a device if the pores are larger than approximately 10 (J.m, and the rate 
of invasion will increase with the pore size and total porosity of a device [Mikos et al., 1993c; Weslowski 
et al., 1961; White et al., 1981]. The degree of fibrosis and calcification of early fabric leaflet valves has 
been correlated to their porosity [Braunwald et al., 1965], as has the nonthrombogenicity of arterial 
prosthesis [DeBakey et al., 1964] and the rigidity of tooth implants and orthopedic prosthesis [Hamner 
et al, 1972; Hulbert et al, 1972]. 

It is important to realize that many materials do not have a unimodal pore size distribution or a 
continuous pore structure, and the ability of molecules or cells to be transported through such a device 
will often be limited by bottlenecks in the pore structure. In addition, the pore structure of a device may 
change over time in a biologic environment. For example, absorption of water into polymers of the 
lactic/glycolic acid family results in the formation first of micropores, and eventually of macropores as 
the polymer itself degrades [Cohen et al., 1991]. The porosity and pore-size distribution of a device can 
be determined utilizing a variety of techniques [Smith et al., 1994]. 

Specific properties (e.g., mechanical strength, degradability, hydrophobicity, biocompatibility) of a 
device are also often desirable. These properties can be controlled both by the biomaterial itself and by 
the processing technique utilized to fabricate the device. An advantage of fabricating devices from 
synthetic polymers is the variety of processing techniques available for these materials. Fibers, hollow 
fibers, and porous sponges can be readily formed from synthetic polymers. Natural biomaterials must 
be isolated from plant, animal, or human tissue and are typically expensive and suffer from large batch- 
to-batch variations. Although the wide range of processing techniques available for synthetic polymers 
is not available for these materials, cells specifically interact with certain types of natural biomaterials, 
such as extracellular matrix (ECM) molecules [Hynes, 1987]. The known ability of ECM molecules to 
mediate cell function in vitro and in vivo may allow precise control over the biologic response to devices 
fabricated from ECM molecules. 

112.2 Applications 

The applications of tissue engineering are very diverse, encompassing virtually every type of tissue in the 
human body. However, the devices utilized in these areas can be divided into two broad types. The first 
type, immunoprotective devices, utilizes a semipermeable membrane to limit communication between 
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cells in the device and the host. The small pores in these devices {d < 10 nm) allow low-molecular-weight 
proteins and molecules to be transported between the implant and the host tissue, but they prevent large 
proteins (e.g., immunoglobulins) and host cells (e.g., lymphocytes) of the immune system from entering 
the device and mediating rejection of the transplanted cells. In contrast, open structures with large pores 
are typically utilized (d > 10 p.m) if one desires that the new tissue be structurally integrated with the 
host tissue. Applications that utilize both types of devices are described below. 

Immunoprotective Devices 

Devices that protect transplanted cells from the immune system of the host can be broken down into 
two types, microencapsulation and macroencapsulation systems [Emerich et al., 1992]. Individual cells 
or small clusters of cells are surrounded by a semipermeable membrane and delivered as a suspension 
in microencapsulation systems (Fig. 112.1a). Macroencapsulation systems utilize hollow semipermeable 
membranes to deliver multiple cells or cell clumps (Fig. 112. lb). The small size, thin wall, and spherical 
shape of microcapsules all optimize diffusional transport to and from the microencapsulated cells. 
Macroencapsulation devices typically have greater mechanical integrity than microencapsule devices, and 
they can be easily retrieved after implantation. However, the structure of these devices is not optimal for 
diffusional transport. Nonbiodegradable materials are the preferred choice for fabricating both types of 
devices, as the barrier function is typically required over the lifetime of the implant. 

A significant effort has been made to cure diabetes by transplanting microencapsulated pancreatic islet 
cells. Transplantation of nonimmunoprotected islets has led to short-term benefits [Lim & Sun, 1980], 
but the cells were ultimately rejected. To prevent this, islets have been immobilized in alginate (a naturally 
occurring polymer derived from seaweed) microbeads coated with a layer of poly(L-lysine) and a layer 
of polyethyleneimine [Lim & Sun, 1980] . Alginate is ionically crosslinked in the presence of calcium, and 
the permeability of alginate/poly(L-lysine) microbeads is determined by the formation of ionic or hydro- 
gen bonds between the polyanion alginate and the polycation poly(L-lysine). This processing technique 
allows cells to be immobilized without exposure to organic solvents or high temperatures. The outer 
layer of polyethyleneimine was subsequently replaced by a layer of alginate to prevent the formation of 
fibrous capsules around the implanted microcapsules [O'Shea et al., 1984]. Smaller microbeads 
(250-400 Lim) have been generated with an electrostatic pulse generator [Lum et al., 1992] to improve 
the in vivo survival and the response time of encapsulated cells [Chicheportiche & Reach, 1988]. These 
devices have been shown to be effective in a variety of animal models [Lim & Sun, 1980; Lum et al., 
1992], and clinical trials of microencapsulated islets in diabetic patients are in progress [Soon-Shiong 
et al., 1994]. Synthetic analogs to alginate have also been developed [Cohen et al., 1990]. 

The superior mechanical stability of macroencapsulation devices, along with the possibility of retriev- 
ing the entire device, makes these types of devices especially attractive when the transplanted cells have 
limited lifetimes and/or when one needs to ensure that the transplanted cells are not migrating out of 
the device. Macroencapsulation devices have been utilized to transplant a variety of cell types, including 
pancreatic cells [Lacy et al., 1991], NGF-secreting cells [Winn et al., 1994], dopamine-secreting cells 
[Emerich et al., 1992], and Chromaffin cells [Sagen et al., 1993]. The nominal molecular mass cutoff of 
the devices was 50 kD, allowing immunoprotection without interfering with transport of therapeutic 
agents from the encapsulated cells. To prevent cell aggregation and subsequent large-scale cell death due 
to nutrient limitations, macroencapsulated islets have been immobilized in alginate [Lacy et al., 1991]. 

Devices with Open Structures 

Devices with large, interconnected pores (d > 10 lim) are utilized in applications where one wishes the 
transplanted cells to interact directly with host tissue and form a structurally integrated tissue. The open 
structure of these devices provides little barrier to diffusional transport and often promotes the ingrowth 
of blood vessels from the host tissue. Degradable materials are often utilized in these applications, since 
once tissue structures are formed the device is not needed. 
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FIGURE 112.1 Examples of microencapsulated cells and a device used for macroencapsulation of cells, (a) Phase 
contrast photomicrograph of hybridoma cells encapsulated in a calcium crosslinked polyphosphazene gel [Cohen 
et al., 1993]. Original magnification was lOOx. Used with permission of Editions de Sante. (b) A SEM photomicro- 
graph of a poly(acrylonitrile-co-vinyl chloride) hollow fiber formed by phase inversion using a dry-jet wet-spinning 
technique [Schoichet et al., 1994]. Used with permission of John Wiley and Sons, Inc. 

These types of devices typically fall into two categories. The first types is fabrics, either woven or 
nonwoven, of small-diameter (approximately 10-40 u,m) fibers (Fig. 112.2a). High porosity (>95%) and 
large average pore size can be easily obtained with this type of material, and these materials can be readily 
shaped into different geometries. Fibers can be formed from synthetic, crystalline polymers such as 
polyglycolic acid by melt extrusion [Frazza & Schmitt, 1971]. Fibers and fabrics also can be formed from 
natural materials such as type I collagen, a type of ECM molecule, by extrusion of soluble collagen into 
a bath where gelling occurs followed by dehydration of the fiber [Cavallaro et al., 1994]. The tensile 
strength of fibers is dependent on the extent of collagen crosslinking, which can be controlled by the 
processing technique [Wang et al., 1994]. Processed collagen fibers can be subsequently spooled and 
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FIGURE 112.2 Examples of a fiber-based fabric and a porous sponge utilized for tissue engineering, (a) A photo- 
micrograph of type I collagen fibers knitted into a fabric. Fiber diameters can be as small as 25 urn, and devices 
constructed from these fibers can be utilized for a variety of tissue engineering applications [Cavallaro et al., 1994], 
Used with permission of John Wiley and Sons, Inc. (b) A SEM photomicrograph of a formaldehyde-crosslinked 
polyvinyl alcohol sponge. These devices have been utilized for a variety of applications, including hepatocyte trans- 
plantation [Uyama et al., 1993]. 

knitted to form fabrics [Cavallaro et al., 1994]. These devices are often ideal when engineering two- 
dimensional tissues. However, these fabrics typically are incapable of resisting large compressional forces, 
and three-dimensional devices are often crushed in vivo. Three-dimensional fiber-based structures have 
been stabilized by physically bonding adjacent fibers [Mikos et al., 1993a; Mooney et al., 1994a; Vacanti 
etal, 1992]. 

To engineer three-dimensional tissues, porous sponge devices (Fig. 112.2&) are utilized typically in 
place of fiber-based devices. These devices are better capable of resisting larger compressional forces 
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(approximately 10 4 Pa) [Mikos et al., 1993a] than are unbounded fiber-based devices (approximately 
10 2 Pa) [Mooney et al., 1994a] due to the continuous solid phase and can be designed to have complex, 
three-dimensional forms [Mikos et al., 1993fo; White et al., 1972]. Porous sponges can be fabricated from 
synthetic polymers utilizing a variety of techniques, including performing the polymerization of a hydro- 
phobic polymer in an aqueous solution [Chirila et al., 1993], exploiting phase separation behavior of 
dissolved polymers in specific solvents [Lo et al., 1994], and combining solvent casting with particulate 
leaching [Mikos et al., 1993b; Mikos et al., 1994]. Porous sponges can be formed from type I collagen 
and other ECM molecules by chemically crosslinking gels or assembling the collagen in nonnatural 
polymeric structures [Bell et al., 1981; Chvapil, 1979; Stenzel et al, 1974; Yannas et al, 1982]. 

Perhaps the most significant clinical effort using open devices has been expended to engineer skin 
tissue to treat burn victims. Both natural [Bell et al., 1981; Yannas et al., 1982] and synthetic degradable 
materials [Hansbrough et al., 1992] in the form of porous sponges or fiber-based fabrics have been 
utilized to transplant various cellular elements of skin. One device fabricated from ECM molecules has 
also been combined with an outer coat of silicone elastomer to prevent dehydration of the wound site 
[Yannas et al., 1982] . The various approaches have shown efficacy in animal models, and tissue-engineered 
skin has progressed to clinical trials [Burke et al., 1981; Compton et al., 1989; Heimbach et al., 1988; 
Stern et al, 1990]. 

Another area with great clinical potential is the engineering of bone and cartilage tissue. Various 
ceramics and biodegradable synthetic polymers have been utilized to fabricate devices for these purposes. 
Porous calcium phosphate devices loaded with mesenchymal stem cells have been shown to promote 
bone formation when implanted into soft tissue sites of animals [Goshima et al., 1991; Haynesworth 
et al., 1992]. Ceramics also have been coated onto prosthetic devices (e.g., hip replacements) to promote 
bone ingrowth and bonding between the prosthetic device and the host tissue [Furlong & Osborn, 1991]. 
The degradation rate [de Bruijn et al., 1994] and mechanical properties [Yoshinari et al., 1994] of these 
ceramics can be controlled by the deposition technique, which determines the crystallinity and chemical 
structure of the deposited ceramic. The brittleness of ceramic materials limits them in certain applications, 
and to bypass this problem composite ceramic/polymer devices have been developed [Stupp et al., 1993]. 
Fiber-based fabrics of biodegradable polymers have also been utilized to transplant cells derived from 
periosteal tissue and form new bone tissue [Vacanti et al., 1993]. To engineer cartilage tissue with specific 
structures such as an ear or nasal septum, devices have been fabricated from a nonwoven mesh of 
biodegradable synthetic polymers molded to the size and shape of the desired tissue. These devices, after 
seeding with chondrocytes and implantation, have been shown to induce the formation of new cartilage 
tissue with the same structure as the polymer device utilized as the template [Puelacher et al., 1993; 
Vacanti et al., 1991, 1992]. After tissue development is complete, the device itself degrades to leave a 
completely natural tissue. 

Liver tissue [Uyama et al., 1993], ligaments [Cavallaro et al., 1994], and neural tissue [Guenard et al., 
1992; Madison et al., 1985] also have been engineered with open devices. Tubular tissues, including blood 
vessels [Weinberg & Bell, 1986], intestine [Mooney et al., \99Ab; Organ et al., 1993], and urothelial 
structures [Atala et al., 1992] have been engineered utilizing open devices fabricated into a tubular 
structure. 

112.3 Conclusions 

A variety of issues must be addressed to design and fabricate a device for tissue engineering. Do the 
transplanted cells need immunoprotection, or should they structurally integrate with the host tissue? If 
immunoprotection is desired, will a micro- or macroencapsulation device be preferred? If a structurally 
integrated new tissue is desired, will a fiber-based device or a porous sponge be more suitable? The 
specific roles that the device will play in a given application and the material itself will dictate the design 
of the device and the fabrication technique. 
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Defining Terms 

Biocompatible: A material which does not elicit an unfavorable response from the host but instead 

performs with an appropriate host response in a specific application [Williams, 1987]. 
Biodegradation: The breakdown of a material mediated by a biologic system [Williams et al., 1992]. 

Biodegradation can occur by simple hydrolysis or via enzyme- or cell-mediated breakdown. 
Extracellular matrix (ECM) molecules: Various substances present in the extracellular space of tissues 

that serve to mediate cell adhesion and organization. 
Immunoprotective: Serving to protect from interacting with the immune system of the host tissue, 

including cellular elements (e.g., lymphocytes) and proteins (e.g., IgG). 
Open devices: Devices with large (d > 10 (tm) interconnected pores which allow unhindered transport 

of molecules and cells within the device and between the device and the surrounding tissue. 
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113.1 The Problem of the Missing Organ 

Drugs typically replace or correct a missing function at the molecular scale; by contrast, regeneration 
templates replace the missing function at the scale of tissue or organ. An organ may be lost to injury or 
may fail in disease: The usual response of the organism is repair, which amounts to contraction and 
synthesis of scar tissue. Tissues and organs in the adult mammal typically do not regenerate. There are 
exceptions, such as epithelial tissues of the skin, gastrointestinal tract, genitals, and the cornea, all of 
which regenerate spontaneously; the liver also shows ability to synthesize substantial organ mass, though 
without recovery of the original organ shape. There are reports that bone and the elastic ligaments 
regenerate. These exceptions underscore the fact that the loss of an organ by the adult mammal almost 
invariably is an irreversible process, since the resulting scar tissue largely or totally lacks the structure 
and function of the missing organ. The most obvious examples involve losses due to injury such as the 
loss of a large area of skin following a burn accident or the loss of substantial nerve mass following an 
automobile accident. However, irreversible loss of function can also occur following disease, although 
over a lengthy time: Examples are the inability of a heart valve to prevent leakage during diastole as a 
result of valve tissue response to an inflammatory process (rheumatic fever), and the inability of liver 
tissue to synthesize enzymes due to its progressive replacement by fibrotic tissue (cirrhosis). 

Five approaches have been used to solve the problem of the missing organ. In autografting, a mass of 
similar or identical tissue from the patient (autograft) is surgically removed and used to treat the area 
of loss. The approach can be considered to be spectacularly successful until one considers the long-term 
cost incurred by the patient. An example is the use of sheet autograft to treat extensive areas of full- 
thickness skin loss; although the patient incorporates the autograft fully with excellent recovery of 
function, the "donor" site used to harvest the autograft remains scarred. When the autograft is not 
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available, as is common in cases of burns extending over more than 30% of body surface area autograft 
is meshed in order to make it extensible enough to cover the large wound areas. However, the meshed 
autograft provides cover only where the graft tissue provides direct cover; where there is no cover, scar 
forms, and the result is one of low cosmetic value. Similar problems of donor site unavailability and 
scarring must be dealt with in heart bypass surgery, another widespread example of autografting. In 
transplantation, the donor tissue is typically harvested from a cadaver, and the recipient has to cope with 
the problems of rejection and the risk of transmission of viruses from this allograft. Another approach 
has been based on efforts to synthesize tissues in vitro using autologous cells from the patient; this 
approach has yielded so far a cultured epidermis (a tissue which regenerates spontaneously provided 
there is a dermal substrate underneath) about 2-3 weeks after the time when the patient was injured. In 
vitro synthesis of the dermis, a tissue which does not regenerate, has not been accomplished so far. 
Perhaps the most successful approach from the commercial standpoint has been the one in which 
engineered biomaterials are used; these materials are typically required by their designers to remain intact 
themselves without interfering with the patient's physiologic functions during the entire lifetime; over- 
whelmingly, this requirement is observed in its breach. A fifth approach is based on the discovery that 
an analog of the extracellular matrix (ECM) induces partial regeneration of the dermis, rather than of 
scar, in full-thickness skin wounds in adult mammals (human, guinea pig, pig) where it is well known 
that no regeneration occurs spontaneously. This fifth approach of solving the problem of organ loss, in 
situ regeneration, will be described in this chapter. 

Efforts to induce regeneration have been successful with only a handful of ECM analogs. Evidence of 
regeneration is sought after the ECM analog has been implanted in situ, i.e., at the lesion marking the 
site of the missing organ. When morphogenesis is clearly evident, based on tests of recovery both of the 
original tissue structure and function, the matrix which has induced these physiologic or nearly physi- 
ologic tissues is named a regeneration template. In the absence of evidence of such morphogenetic activity 
of the cell-free matrix, the latter is not referred to as a regeneration template. 

113.2 Search Principles for Identification 
of Regeneration Templates 

Several parameters have been incorporated in the search for organ regeneration templates. Briefly, these 
parameters account for the performance of the implant during the early or acute stage following implan- 
tation (physicochemical parameters) as well as for the long-term or chronic stage following implantation 
(biologic parameters). 

Immediately upon making contact with the wound bed, the implant must achieve physicochemical 
nanoadhesion (i.e., adhesion at a scale of 1 nm) between itself and the lesion. Without contact of this 
type it is not possible to establish and maintain transport of molecules and cells between implant and 
host tissue. The presence of adequate adhesion can be studied by measurements of the force necessary 
to peel the implant from the wound bed immediately after grafting. 

Empirical evidence has supported a requirement for an implant which is capable of isomorphous 
tissue replacement, i.e., the synthesis of new tissue at a rate which is of same order as the rate of 
degradation of the matrix. 



-=0(l) (113.1) 



In Eq. (113.1), t b denotes a characteristic time constant for biodegradation of the implant at that tissue site, 
and t h denotes a time constant for healing, the latter occurring by synthesis of new tissue inside the implant. 
A third requirement refers to the critical cell path length \ c , beyond which migration of a cell into the 
implant deprives it of an essential nutrient, assumed to be transported from the host tissue by diffusion 
alone. The emphasis is on the characteristic diffusion path for the nutrient during the early stages of 
wound healing, before significant angiogenesis occurs several days later. Calculation of the critical cell 
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path can be done by use of the cell lifeline number, S, a dimensionless number expressing the relative 
importance of a chemical reaction, which leads to consumption of an essential nutrient by the cell, and 
of diffusion of the nutrient which alone makes the latter accessible to the cell. This number is defined as 



Dc 



(113.2) 



where r is the rate of consumption of the nutrient by the cell in mole/cm 3 /s, / is the diffusion length, D 
is the diffusivity of the nutrient in the medium of the implant, and c is the nutrient concentration at or 
near the surface of the wound bed, in mole/cm 3 . When S = O(l) the value of I is the critical path length, 
l c along which cells can migrate, away from host tissue, without requirement of nutrient in excess of that 
supplied by diffusion. Eq ( 1 13.2) can, therefore, be used to define the maximum implant thickness beyond 
which cells require the presence of capillaries. 

The chemical composition of the implant which has induced regeneration was designed on the basis 
of studies of wound-healing kinetics. In most mammals, full-thickness skin wounds close partly by 
contraction of the wounds edges and partly by synthesis of scar tissue. Clearly, skin regeneration over 
the entire area of skin loss cannot occur unless the wound edges are kept apart and, in addition, the 
healing processes in the wound bed are modified drastically enough to yield a physiologic dermis rather 
than scar. Although several synthetic polymers, such as porous (poly)dimethyl siloxane, delay contraction 
to a small but significant extent, they do not degrade and therefore violate isomorphous tissue replace- 
ment, Eq. (113.1). Synthetic biodegradable polymers, such as (poly)lactic acid, can be modified, e.g., by 
copolymerization with glycolic acid, to yield polymers which satisfy Eq. (113.1); however, evidence is 
lacking that these synthetic polymers delay contraction and prevent synthesis of scar. By contrast, there 
is considerable evidence that a certain analog of the extracellular matrix (ECM analogs) not only delays 
contraction significantly but also leads to synthesis of partly physiologic skin. Systematic use of the delay 
in contraction as an essay has been made to identify the structural features of the dermis regeneration 
template (DRT), as shown schematically in Fig. 113.1. 
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FIGURE 113.1 The kinetics of contraction of full- thickness guinea pig skin wounds can be used to separate collagen- 
gra/r-glycosaminoglycan copolymers into three classes, as shown. The wound half-life, t ul , is the number of days 
necessary to reduce the original wound area to 50%. (Courtesy of Massachusetts Institute of Technology.) 
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Summarized in Fig. 113.1 are three modes of wound-healing behavior, each elicited by an ECM analog 
of different design. Mode O of healing is described by a very short time for onset of contraction, followed 
by contraction and definitive closure of the wound with formation of a thin linear scar. Mode I is 
characterized by a significant delay in onset of contraction, following which contraction proceeds and 
eventually leads to a linear scar. Mode II is characterized by a significant delay in onset of contraction 
(somewhat smaller than in mode I), followed by contraction and then by reversal of contraction, with 
expansion of the original wound perimeter at a rate which exceeds significantly the growth rate of the 
entire animal. Mode II healing leads to synthesis of a partly physiologic dermis and a physiologic 
epidermis within the perimeter of the expanded wound bed. Mode O healing occurs when an ECM 
analog which lacks specificity is used to graft the wound. Mode O is also observed when the wound bed 
remains ungrafted. Mode I healing occurs when an ECM analog of highly specific structure (the DRT) 
is grafted on the wound bed. Mode II healing occurs when the DRT, previously identified as the ECM 
analog which leads to mode I healing, is seeded with autologous epidermal cells before being grafted. 
Although contraction is a convenient screening method for identification of structural features of ECM 
analogs which induce skin regeneration, a different procedure has been used to identify the features of 
an implant that induces regeneration of the peripheral nerve. The structural features of the nerve 
regeneration template (NRT) were identified using an essay focused on the long-term return of function 
of the nerve following treatment with candidate ECM analogs. 

Recent studies have shown that it is possible to achieve regeneration both of a dermis and an epidermis 
in sequence, rather than simultaneously, provided that the animal model used is one in which wound 
closure does not take place almost overwhelmingly by wound contraction. The choice of animal model 
is, therefore, critical in this respect. Mode I behavior, illustrated in Fig. 113.1, is applicable to wound 
healing in rodents, such as guinea pigs and rats. In these animals, about 90% of wound closure is 
accounted for fully by wound contraction, the remainder is accounted for by formation of new tissue. 
Grafting of a full-thickness skin defect in a rodent with the ECM analog which possesses a highly specific 
structure (the dermis regeneration template or DRT; see below) leads to lengthy delay of the onset of 
contraction, eventually followed by contraction and formation of a linear scar-like tissue. Although the 
gross appearance of the tissue formed is that of a modified linear scar (see Fig. 113.1), there is some 
histological evidence that the small mass of connective tissue layer formed underneath the epidermis is 
not scar. In contrast, in animals in which contraction contributes about equally to wound closure as does 
formation of new tissue, such as the swine, approximately one half of the initial wound area is eventually 
closed by formation of partly regenerated dermis; soon after that, the new dermis is covered by a new 
epidermis. In wounds which close in large part by formation of new tissue rather than by contraction, 
as in the swine, grafting of the cell-free DRT leads, therefore, to sequential formation of a dermis and 
an epidermis as well, which is the same end result that is arrived at simultaneously by grafting with the 
keratinocyte-seeded DRT. 

Of several ECM analogs that have been prepared, the most commonly studied is a graft copolymer of 
type I collagen and chondroitin 6-sulfate. The structure of the latter glycosaminoglycan (GAG) is illus- 
trated below in terms of the repeat unit of the disaccharide, an alternating copolymer of D-glucuronic 
acid and of an O-sulfate derivative of N-acetyl D-galactosamine: 
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The principle of isomorphous replacement, Eq. (113.1), cannot be satisfied unless the bio degradation 
time constant of the network, t b , can be adjusted to an optimal level, about equal to the rate of synthesis 
of new tissue at that site. Reduction of the biodegradation rate of collagen can be achieved either by 
grafting GAG chains onto collagen chains or by crosslinking collagen chains to each other. The chemical 
grafting of GAG chains on polypeptide chains proceeds by previously coprecipitating the two polymers 
under conditions of acidic pH, followed by covalent crosslinking of the freeze-dried precipitate. A 
particularly useful procedure for crosslinking collagen chains to GAG, or collagen chains to each other, 
is a self-crosslinking reaction, requiring no use of crosslinking agent. This condensation reaction prin- 
cipally involves carboxylic groups from glutamyl/aspartyl residues on polypeptide chain Pj and e-amino 
groups of lysyl residues on an adjacent chain P 2 to yield covalently bonded collagen chains; as well as 
condensation of amine groups of collagen with carboxylic groups of glucuronic acid residues on GAG 
chains to yield gra/f-copolymers of collagen and GAG: 

P r COOH + P 2 -NH 2 -> P 2 -NHCO-P! + H 2 (113.3a) 

GAG-COOH + P 2 -NH 2 ^P 2 -NHCO-GAG + H 2 (113.3b) 

In each case above the reaction proceeds to the right, with formation of a three-dimensional crosslinked 
network when the moisture content of the protein, or protein-GAG coprecipitate, drops below about 
1 wt%. As illustrated by Eq. (113.3), removal of water, the volatile product of the condensation, is favored 
by conditions which drive the reaction towards the right, with formation of a crosslinked network. Thus, 
the reaction proceeds to the right when both high temperature and vacuum are used. Another crosslinking 
reaction, used extensively in preparing implants that have been employed in clinical studies as well as in 
animal studies, involves use of glutaraldehyde. Dehydration crosslinking, which amounts to self-crosslink- 
ing as described above, obviously does not lend toxicity to these implants. Glutaraldehyde, on the other 
hand, is a toxic substance, and devices treated with it require thorough rinsing before use until free 
glutaraldehyde cannot be detected in the rinse water. Network properties of crosslinked collagen-GAG 
copolymers can be analyzed structurally by studying the swelling behavior of small specimens. The 
method is based on the theory of Flory and Rehner, who showed that the volume fraction of a swollen 
polymer v 2 depends on the average molecular weight between crosslinks, M c , through the following 
relationship: 

ln(l-v 2 ) + v 2 +Xv 2 -(p^/M c )( V2 / 3 -y 2 /2) = (113.4) 

In Eq. (113.4), V l is the molar volume of the solvent, p is the density of the dry polymer, and X is a 
constant characteristic of a specific polymer-solvent pair at a particular temperature. 

Although the chemical identify of collagen-GAG copolymers is a necessary element of their biologic 
activity, it is not a sufficient one. In addition to chemical composition and the crosslink density, biologic 
activity also depends strongly on the pore architecture of these ECM analogs. Pores are incorporated first 
by freezing a very dilute suspension of the collagen-GAG coprecipitate and then by inducing sublimation 
of the ice crystals by exposing to vacuum at low temperatures. The resulting pore structure is, therefore, 
a negative replica of the network of ice crystals (dendrites). It follows that control of the conditions of 
ice crystal nucleation and growth can lead to a large variety of pore structures. In practice, the average 
pore diameter decreases with decreasing temperature of freezing while the orientation of pore channel 
axes also depends on the magnitude of the heat flux vector during freezing. The dependence of pore 
channel orientation on heat transfer parameters is illustrated by considering the dimensionless Mikic 
number Mi, a ratio of the characteristic freezing time of the aqueous medium of the collagen-GAG 
suspension, t, to the characteristic time for entry, t e , of a container filled with the suspension which is 
lowered at constant velocity into a well-stirred cooling bath: 
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tf p h f rV 

Mi = _L = *^Jl — (113.5) 

t 10k. AT 

In Eq. (113.5), p w is the density of the suspension, h f is the heat of fusion of the suspension, r is an 
arbitrary length scale, Vis the velocity with which the container is lowered into the bath, k is the thermal 
conductivity of the jacket, and AT is the difference between freezing temperature and bath temperature. 
The shape of the isotherms near the freezing front is highly dependent on the value of Mi. The dominant 
heat flux vector is normal to these isotherms, i.e., ice dendrites grow along this vector. It has been observed 
that, for Mi < 1 (slow cooling), the isotherms are shallow, flat-shaped parabolae, and the ice dendrites 
exhibit high axial orientation. For Mi > 1 (rapid cooling), the isotherms are steep parabolae, and ice 
dendrites are oriented along the radial direction. 

The structure of the porous matrix is denned by quantities such as the volume fraction, specific surface, 
mean pore size, and orientation of pores in the matrix. Determination of these properties is based on 
principles of stereology, the discipline which relates the quantitative statistical properties of three-dimen- 
sional structures to those of their two-dimensional sections or projections. In reverse, stereologic proce- 
dures allow reconstruction of certain aspects of three-dimensional objects from a quantitative analysis 
of planar images. A plane which goes through the two-phase structure of pores and collage-GAG fibers 
may be sampled by random points, by a regular pattern of points, by a near-total sampling using a very 
dense array of points, or by arranging the sampling points to form a continuous line. The volume fraction 
of pores, V Y , is equal to the fraction of total test points which fall inside pore regions, P p , also equal to 
the total area fraction of pores, A A , and, finally, equal to the line fraction of pores, L L , for a linear point 
array in the limit of infinitely close point spacing 

V V = P P = A A = L L (113.6) 

Whether cells of a particular type should be part of a regeneration template depends on predictions 
derived from models of developmental biology as well as empirical findings obtained with well-defined 
wound-healing models. During morphogenesis of a large variety of organs, an interaction between 
epithelial and mesenchymal cells, mediated by the basal lamina which is interleaved between the two 
types of cells, is both necessary and sufficient for development of local physiologic structures involving 
two types of tissue in juxtaposition. In particular, skin morphogenesis in a full-thickness wound model 
requires the presence of this interaction between the two cell types and the basal lamina over a critical 
period. In skin wound-healing experiments with adult mammals, wound healing proceeds with forma- 
tion of scar, rather than physiologic skin, if the wound bed contains epithelial cells and mesenchymal 
cells (fibroblasts) but no ECM structure which could act temporarily as a basal lamina. If, by contrast, 
an analog of the basal lamina is present, wound healing proceeds with nearly physiologic morphogenesis 
of skin. Furthermore, no epidermis is formed unless epithelial cells become involved in wound healing 
early during wound healing and continue being involved until they have achieved confluence. It is also 
known that no dermis forms if fibroblasts are not available early during wound healing. These observa- 
tions suggest the requirements for a DRT which is an analog of the basal lamina and is designed to 
encourage the migration and interaction of both epithelial cells and fibroblasts within its volume. 

Nerve regeneration following injury essentially amounts to elongation of a single nerve cell across a 
gap resulting from the injury. During nerve development many nerve cells are elongated by processes 
which eventually become axons. Interaction of elongating processes with basal lamina are credited as 
being essential in the formation of nerve during development, and it will be assumed here that such 
interactions are essential in regeneration following adult injury as well. It is also known that Schwann 
cells, which derive from neural crest cells during development, are essential contributors to regeneration 
following injury to the adult peripheral nerve. These considerations suggest a nerve regeneration template 
which is structured as an analog of the basal lamina, interacting with the elongating axons in the presence 
of Schwann cells. 
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113.3 Structural Specificity of Dermis Regeneration 
Template (DRT) 

The major events accompanying skin wound healing can be summarized as contraction and scar synthesis. 
Conventional wisdom prescribes the need for a treatment that accelerates wound healing. A large number 
of devices that claim to speed up various aspects of the healing process have been described in the 
literature. The need to achieve healing within as short a time as possible is certainly well founded, 
especially in the clinical setting where the risk to patient's life as well as the morbidity increase with 
extension of time to heal. However, the discovery of partial skin regeneration by use of the skin regen- 
eration template has introduced the option of a drastically improved healing result for the patient in 
exchange for a slightly extended hospital stay. 

The DRT was optimized in studies with animals in which it was observed that skin regeneration did 
not occur unless the test ECM analog effectively delayed, rather than accelerated, wound contraction. 
The length of delay in onset of contraction eventually was used as a quantitative basis for preliminary 
optimization of the structural features of the DRT. Optimization studies are currently continuing on the 
basis of a new criterion, namely, the fidelity of regeneration achieved. 

Systematic use of the criterion of contraction inhibition has been made to select the biodegradation 
rate and the average pore diameter of DRT. The kinetics of contraction of full-thickness skin wounds in 
the guinea pig have been studied for each of the three modes of healing schematically presented in 
Fig. 113.1. The results, presented in Fig. 113.2, show that mode O healing is characterized by early onset 
of contraction, whereas mode I and mode II healing show a significant delay in onset of contraction. A 
measure of the delay is the wound half-life r 1/2 , the time required for the wound area to decrease to 50% 
of the original value. Use of this index of contraction rate has been made in Figs. 1 13.3 and 1 13.4, which 
present data on the variation of wound half-life with average pore diameter and degradation rate for a 
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FIGURE 113.2 The kinetics of guinea pig skin wound contraction following grafting with three classes of ECM 
analogs. Inactive ECM analogs delay the onset of contraction only marginally over the ungrafted wound, whereas 
active cell-free ECM analogs delay the onset of contraction significantly. When seeded with epithelial cells, not only 
does an active ECM analog delay the onset of contraction significantly, but it also induces formation of a confluent 
epidermis and then arrests and reverses the direction of movement of wound edges, leading to expansion of the 
wound perimeter and to synthesis of partly physiologic skin. (Courtesy of Massachusetts Institute of Technology.) 
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FIGURE 113.3 The half-life wounds, r 1/2 , grafted with ECM analogs varies with the average pore diameter of the 
analog. The range of pore diameters where activity is maximal is shown by broken lines. The half-life of the ungrafted 
wound is shown for comparison. (Courtesy of Massachusetts Institute of Technology.) 
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FIGURE 113.4 Variation of wound half-life with degradation rate R of the ECM analog used as graft for a full- 
thickness guinea pig skin wound. R varies inversely as the biodegradation time constant, tf. The region of maximal 
activity is indicated by the broken line. The half-life of the ungrafted wound is shown for comparison. (Courtesy of 
Massachusetts Institute of Technology.) 

type I collagen-chondroitin 6-sulfate copolymer. In Fig. 113.2, mode I kinetics are observed using a cell- 
free copolymer with average pore diameter and biodegradation rate that correspond to the regions of 
Figs. 113.3 and 1 13.4 in which the values of half-life are maximal; these regions characterize the structure 
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of SRT which possesses maximal activity. Maximum delay in wound half-life up to 27 ± 3 days is seen 
to have occurred when the average pore diameter has ranged from values as low as 20 ± 4 (J.m to an 
upper limit of 125 ± 35 Lim. In addition, significant delay in wound healing has been observed when the 
degradation rate has become less than 1 15 ± 25 enzyme units. The latter index of degradation has been 
based on the in vitro degradation rate of the copolymer in a standardized solution of bacterial collagenase. 

The upper limit in degradation rate, defined in Fig. 113.4, is consistent with the requirement of a lower 
limit in t b , Eq. (113.1), below which the implant is biodegraded too rapidly to function as a scaffold over 
the period necessary for synthesis of new tissue; the latter is characterized by t h . The lower limit in pore 
diameter, defined in Fig. 113.3, suggests a dimension which is on the order of two cell diameters; adequate 
space for migration of mesenchymal cells (fibroblasts) from the wound bed into the DRT is thereby 
guaranteed. At an estimated velocity of 0.2 mm/day for fibroblasts these cells would be expected to 
migrate across a 0.5-mm thickness of the DRT within very few days, provided that adequate supplies of 
critical nutrients could be made available to them from host tissue. Use of Eq. ( 1 13.2) leads to an estimated 
critical cell path length l c of order 100 |J.m. These estimates suggest that fibroblasts can migrate across at 
least one-fifth the thickness of the porous implant without requiring the presence of capillaries. However, 
taking into account the observation that wound exudate (comprising primarily serum, with growth 
factors and nutrients) fills at least one-half the thickness of the implant within no more than a few hours 
following grafting, we conclude that the boundary of "host" tissue has moved clearly inside the implant. 

Epithelial cells, as well as fibroblasts and a basal lamina analog, also are required for morphogenesis. 
They can be supplied in a variety of forms. They are always available as epithelial cell sheets, migrating 
from the wound edges toward the center of the wound bed. However, when the area of skin loss is several 
centimeters, as with a severely burned patient, these cell sheets, migrating with speeds of about 
0.5 mm/day from opposite edges, would not be expected to cover one-half the characteristic wound 
dimension in less than the time constant t h for synthesis of new tissue. In the absence of epithelial cells, 
therefore, at the center of the wound bed, Eq. (113.1) wound be violated. To overcome this limitation, 
which is imposed by the scale of the wound, it has been necessary to resort to a variety of procedures. 
In the first, uncultured autologous epidermal cells, extracted from a skin biopsy by controlled enzymatic 
degradation, have been seeded into ECM analogs by centrifugation into the porous matrix prior to 
grafting of the latter on the wound bed. Tested with animals, the procedure leads to formation of a 
confluent epidermis by about 2 weeks, provided that at least 5 x 10 4 epithelial cells per cm 2 of DRT area 
have been seeded. In another procedure, a very thin epidermal layer has been surgically removed from 
an intact area of the patient and has been grafted on the dermal layer which has been synthesized about 
2 weeks after grafting with the DRT. The latter procedure has been tested clinically with reproducible 
success. A third procedure, studied with animals, has made use of cultured epithelia, prepared by a 2- to 
3-week period of culture of autologous cells in vitro and grafted on the newly synthesized dermal bed. 
Approximately equivalent fidelity of skin regeneration has been obtained by each of these procedures for 
supplying epithelial cells to the DRT in the treatment of skin wounds of very large area. 

113.4 In Situ Synthesis of Skin with DRT 

The skin regeneration template DRT induces regeneration of skin to a high degree of fidelity. Fidelity of 
regeneration has been defined in terms of the degree of recovery of structural and functional features 
which are present in the intact organ. 

The first test of fidelity of skin regeneration following use of the DRT was a study of treatment of full- 
thickness skin loss in guineas pigs. In this study the lesion was produced by surgery on healthy animals. 
The characteristic dimension of the wound was about 3 cm, and the desired period for cover by a confluent 
epidermis was 2 weeks. Covering a wound of such a scale within the prescribed time would have been 
out of reach of epithelial cells migrating from the wound edges. Accordingly, autologous epithelial cells 
were extracted from a skin biopsy and seeded into the DRT under conditions of carefully controlled 
centrifugation. 
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A clear and unmistakable difference between healing in the presence and absence of DRT was provided 
by observing the gross anatomy of the wound (Fig. 113.2). In the absence of the DRT, the wound 
contracted vigorously and closed up with formation of a linear scar by about day 30. In the presence of 
a cell-seeded DRT, the wound perimeter started contracting with a delay of about 10 days, and contraction 
was completely arrested and then reversed between days 30 and 40. The wound then continued to expand 
at a rate that was clearly higher than that expected from the rate of growth of the animal. The long-term 
appearance of the wound treated with the cell-seeded DRT was that of an organ that appeared grossly 
identical in color, texture, and touch to intact skin outside the wound perimeter. However, the newly 
synthesized skin was totally hairless, and the total area of new skin was smaller in area from the original 
wound area by about 30% (Fig. 113.2). 

Morphologic studies of newly synthesized skin in the presence of cell-seeded DRT included comparison 
with intact skin and scar. Optical microscopy and electron microscopy were supplemented by laser light 
scattering, the latter used to provide a quantitative measure of collagen fiber orientation in the dermal 
layer. It was concluded that, in most respects, partly regenerated skin was remarkably similar to intact 
guinea pig skin. The epidermis in regenerated skin was often hyperplastic; however, the maturation 
sequence and relative proportion of all cell layers were normal. Keratohyaline granules of the neoepidermis 
were larger and more irregular in contour than those of the normal granular cell layer. The new skin was 
characterized by melanocytes and Langerhans cells, as well as a well-formed pattern of rete ridges and 
interdigitations with dermal papillae, all of which appear in normal skin. Newly synthesized skin was 
distinctly different morphologically from scar. Scar showed characteristic thinning (atrophy) of the 
epidermis, with absence of rete ridges and of associated dermal papillae. Elastic fibers in regenerated skin 
formed a delicate particulate structure, in contrast with scar, where elastic fibers were thin and frag- 
mented. The dermal layer in regenerated skin comprised collagen fibers which were not oriented in the 
plane of the epidermis, as well as fibroblasts which were not elongated; in scar, collagen fibers were highly 
oriented in the plane and fibroblasts were elongated. Both normal and regenerated skin comprised 
unmyelinated nerve fibers within dermal papillae, closely approximated to the epidermis; scar had few, 
if any, nerves. There were no hair follicles or other skin appendages either in regenerated skin or in scar. 

Laser light-scattering measurements of the orientation of collagen fibers in tissue sections of the dermal 
layer were based on use of the Hermans orientation function 



f = 2(cos 2 cO-l (113.7) 

In Eq. (113.7), a is the angle between an individual fiber and the mean axis of the fibers, and (cos 2 a) is 
the square cosine of a averaged over all the fibers in the sample. For a random arrangement of fibers, 
(cos 2 a) equals 1/2, while for a perfectly aligned arrangement it is equal to 1. Accordingly, S varies from 
(truly random) to 1 (perfect alignment). Measurements obtained by use of this procedure showed that 
S took the values 0.20 ± 0.11, 0.48 ± 0.05 and 0.75 ± 0.10 for normal dermis, regenerated dermis, and 
scar dermis, respectively. These results provided objective evidence that regenerated dermis had a mor- 
phology of collagen fibers that was clearly not scar (n = 7; p < 0.001), and was intermediate between 
that of scar and normal dermis. 

Functional studies of regenerated skin showed that the moisture permeability of intact skin and 
regenerated skin had values of 4.5 ± 0.8 and 4.7 ±1.9 g/cm/h, insignificantly different from each other 
(n = 4; p < 0.8). Mechanical behavior studies showed a positive curvature for the tensile stress-strain 
curve of regenerated skin as well as of normal skin. However, the tensile strength of regenerated skin was 

14 ± 4 MPa, significantly lower than the strength of intact skin, 31 ± 4 MPa (n = 4; p < 0.01). 

The second test of fidelity of skin regeneration was a study of treatment of 106 massively burned 
humans with the cell-free DRT. The characteristic dimension of the wound in this study was of order 

15 cm, and the desired period for cover by a confluent epidermis was 2 weeks. The scale of the wound 
necessitated the introduction of epithelial cells, and this was accomplished by grafting the newly synthesized 
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dermal layer, 2 weeks after grafting with DRT, with a very thin epidermal layer (epidermal autograft), 
which was removed from an intact area of the patient. The results of the histologic study of the patient 
population showed that physiologic dermis, rather than scar, had been synthesized in sites that had been 
treated with DRT and had later been covered with an epidermal graft. 

Progress has been made in clarification of the mechanism by which DRT induces regeneration of the 
dermis. There is considerable evidence, some of which was presented above, supporting the hypothesis 
that inhibition of wound contraction is required for regeneration. The evidence also suggests strongly 
that DRT competitively inhibits formation of specific binding interactions between contractile cells and 
endogenous ECM. 

113.5 Advantages and Disadvantages of Clinical Treatment 
of Skin Loss with DRT 

When skin is the missing organ, the patient faces threats to life posed by severe dehydration and infection. 
These threats can be eliminated permanently only if the area of missing skin is covered with a device 
that controls moisture flux within physiologic limits and presents an effective barrier to airborne bacteria. 
Both of these functions can be returned over a period of about 2-4 weeks by use of temporary dressings. 
The latter include the allograft (skin from a cadaver) and a very large variety of membranes based on 
synthetic or natural polymers. None of these dressings solves the problem of missing skin over the lifetime 
of the patient: The allograft does not support synthesis of physiologic skin and must be removed to avoid 
rejection, and the devices based on the vast majority of engineered membranes do not make effective 
biologic contact with the patient's tissues, and all lead to synthesis of scar. 

Three devices have been tested extensively for their ability to provide long-term physiologic cover to 
patients with massive skin loss: the patient's own skin (autograft) the dermis regeneration template 
(DRT), and cultured epithelia (CEA). All three of these treatments have been studied extensively with 
massively burned patients. Of these, the autograft and DRT are effective even when the loss of skin 
extends through the full thickness of the dermis (e.g., third-degree burns), whereas CEA is effective when 
the loss of skin is through part of its thickness only. 

The basis for the differences among the three treatments lies in the intrinsic response of skin to injury. 
Skin comprises three layers: the epidermis, a 100-u.m thick cellular layer; the dermis, a 1- to 5-mm layer 
of connective tissue with very few cells; and the subcutis, a 2- to 4-mm layer of primarily adipose tissue. 
In the adult mammal, an epidermis lost through injury regenerates spontaneously provided that a dermal 
substrate is present underneath. When the dermis is lost, whether through part thickness or full thickness, 
none of the injured mass regenerates; instead, a nonphysiologic tissue, scar, forms. Scar is epithelialized 
and can, therefore, control moisture flux within physiologic limits as well as provide a barrier to bacterial 
invasion. However, scar does not have the mechanical strength of physiologic skin. Also, scar synthesis 
frequently proceeds well beyond what is necessary to cover the wound, and the result of such proliferation 
is hypertrophic scarring, a cosmetically inferior integument which, when extending over a large area or 
over hands or face, reduces significantly the mobility of the patient's joints as well as the patient's cosmetic 
appearance. Autograft can, if used without meshing, provide an excellent permanent cover; if, however, 
as commonly practiced, autograft is meshed before grafting in order to extend the wound area that 
becomes covered, the result is new integument which comprises part physiologic skin and part scar and 
provides the patient with a solution of much lower quality than unmeshed (sheet) autograft. The clinical 
use of skin regeneration template has led to a new integument comprising a dermal layer, which has been 
synthesized by the patient, and an epidermal layer, which has been harvested as a very thin epidermal 
autograft and has been placed on top of the newly synthesized dermis. 

The chief advantages DRT over the meshed autograft are the shorter time that it takes to heal the 
donor site, from which the epidermal graft was harvested, and a superior cosmetic result at the site of 
the wound. The main disadvantage in the clinical use of DRT is the subjection of the patient to two 
surgical treatments rather than one (first graft DRT, then graft the epidermal layer after about 2 weeks). 
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Although it has been shown that a dermis and an epidermis are regenerated sequentially in DRT-treated 
wounds like that of the swine, in which contraction plays only a modest role in wound closure, the 
process of sequential skin regeneration is slower than that of simultaneous skin regeneration. Accordingly, 
the kinetics of wound closure suggest an advantage in the use of keratinocyte-seeded DRT (a two-stage 
treatment) relative to the unseeded DRT (one-stage treatment). The clinical advantages of CEA are the 
ability to grow a very large area, about 10,000 times as large as the area of the skin biopsy, thereby 
providing cover over the entire injured skin area of the patient. The disadvantages of cultured epithelia 
are the lengthy period required to culture the epidermis from the patient's biopsy and the inability of 
the CEA to form a mechanically competent bond with the wound bed. 

113.6 Modifications of DRT: Use of a Living 
Dermal Equivalent 

The design concept of a skin regeneration template outlined above has been adopted and modified. One 
modification involves the replacement of the collagen-GAG matrix with a biodegradable mesh consisting 
of either (poly)glycolic acid (PGA) or polyglactin-910 (PGL) fibers. The latter is used as a matrix for the 
in vitro culture of human fibroblasts isolated from neonatal skin. Fibroblasts synthesize extracellular 
matrix inside the synthetic polymeric mesh, and this "living dermal equivalent" has been cryopreserved 
for a specified period prior to use. 

The living dermal equivalent has been used to graft full-thickness skin wounds in athymic mice. 
Following grafting of wounds, these PGA/PGL fibroblast grafts were covered with meshed allograft. The 
latter is human cadaver skin that was meshed in this study to expand it and achieve coverage of maximum 
possible wound area. This composite graft became vascularized and that, additionally, epithelial cells 
from the cadaver graft had migrated to the matrix underneath. After a period of about 100 days following 
grafting, the reported result was an epithelialized layer covering a densely cellular substratum that 
resembled dermis. A variant of this design, in which the meshed allograft is not used, has been reported; 
epidermal cells are cultured with the fibroblast mesh before grafting. Studies of these designs are in 
progress. 

113.7 The Bilayered Skin-Equivalent Graft 

In one widely reported development, in vitro cell culture procedures have been used to synthesize a 
bilayered tissue which has been reported to be a useful model of human skin. Briefly, fibroblasts from 
humans or from rats have been placed inside a collagen gel. Under these conditions, fibroblasts exert 
contractile forces, trapping the cells inside the contracted collagen lattice. Human epithelial cells, which 
have been plated onto this contracted dermal equivalent, have been observed to attach to the collagen 
substrate, multiply, and spread to form a continuous sheet. Differentiation of this sheet has led to 
formation of specialized epidermal structures, such as a multilayered cell structure with desmosomes, 
tonofilaments, and keratohyalin granules. Further differentiation events have included the formation of 
a basement membrane (basal lamina) in vitro when a rat epidermis was formed on top of a dermal 
equivalent produced from rat fibroblasts. 

Grafts prepared from the bilayered skin equivalent have been grafted on animals. When grafted on 
animals, these structures have been reported to become well vascularized with a network of capillaries 
within 7 to 9 days. It has been reported that the best grafts have blocked wound contraction, but no 
systematic data have been presented which could be used to compare the in vivo performance of these 
skin equivalents with grafts based on DRT (see above). Gross observations of the area grafted with skin 
equivalents have shown pink hairless areas which were not hypertrophically scarred. A systematic com- 
parison between scar and the tissue synthesized in sites that have been grafted with skin equivalents has 
not yet been made. 



© 2000 by CRC Press LLC 



113.8 Structural Specificity of Nerve Regeneration 
Template (NRT) 

The design principles for regeneration templates presented above have been used to design implants for 
regeneration of peripheral nerves. The medical problem typically involves the loss of innervation in arms 
and legs, leading to loss of motor and sensory function (paralysis). The nerves involved are peripheral, 
and the design problem becomes the regeneration of injured peripheral nerves, with recovery of function. 

A widely used animal model for peripheral nerve injury is a surgically generated gap in the sciatic 
nerve of the rat. Interruption of nerve function in this case is localized primarily in the region of plantar 
muscles of the foot involved. This relatively well defined area of loss of function can then be studied 
neurologically in relative isolation from other neurologic events. Furthermore, the rate of recovery of 
function can be studied by electrophysiologic methods, a procedure which provides continuous data over 
the entire period of healing. In the peripheral nerve the healing period extends to about 10 weeks, clearly 
longer than healing in skin, which occurs largely within a period of only 3 weeks. 

When the sciatic nerve is cut and a gap forms between the two nerve ends, the distal part of the nerve 
is isolated from its cell body in the spinal cord. Communication between the central nervous system and 
the leg is no longer possible. The lack of muscle innervation leads to inactivity, which in turn leads to 
muscle atrophy. At the site of injury there is degeneration of the myelin sheath of axons, dissociation of 
Schwann cells and formation of scar. It has been hypothesized that the formation of scar impedes, more 
than any other single cause, the elongation of axons across the gap. Axonal elongation through a gap of 
substantial length becomes, therefore, a parameter of prime importance in the design of a nerve regen- 
eration template (NRT). 

Intubation of severed nerve ends is a widely used procedure for isolating axons from the tissue 
environment of the peripheral nerve. The lumen of the tube serves to isolate the process of nerve 
regeneration from wound-healing events involving connective tissues outside the nerve; the tube walls, 
for example, prevent proliferation of scar tissue inside the tube and the subsequent obstruction of the 
regenerating nerve. Silicone tubes are the most widely used method of intubation. These tubes are both 
nonbiodegradable and nonpermeable to large molecules. In this isolated environment it is possible to 
study the substrate preferences of elongating axons by incorporating well-defined ECM analogs and 
studying the kinetics of functional recovery continuously with electrophysiologic procedures for about 
40 weeks following implantation. As in studies of dermal regeneration described above, the ECM analogs 
which were used in the study of substrate preferences of axons were graft copolymers of type I collagen 
and chondroitin 6-sulfate. Controls used included the autograft, empty silicone tubes, as well as tubes 
filled with saline. In these studies, it was necessary to work with a gap dimension large enough to preclude 
spontaneous regeneration, i.e., regeneration in the absence of an ECM analog. It was observed that a gap 
length of 10 mm was occasionally innervated spontaneously in the rat, whereas no instances of sponta- 
neous regeneration were observed with a 15-mm gap length. Gap lengths of 10 and 15 mm were used 
in these studies with rats. 

Three structural parameters of ECM analogs were varied systematically. The degradation rate had a 
significant effect on the fidelity of regeneration, and an abbreviated optimization procedure led to an 
ECM analog which degraded much faster than the DRT In combination with Eq. (113.1), this empirical 
finding suggests that a healing nerve wound contains a much smaller concentration of the degrading 
enzyme, collagenase; this suggestion is qualitatively consistent with observations of collagenolytic activity 
in injured nerves. The average diameter also was found to have a significant effect on fidelity of regen- 
eration, and the optimization procedure led to a value of 5 (J.m, significantly smaller than the average 
pore diameter in the DRT. Finally, use of Eq. (113.5) led to procedures of preparing ECM analogs, the 
pore channels of which were either highly aligned along the tube axis, randomly oriented, or radially 
oriented. ECM analogs with axially aligned pore channels were found to be superior to analogs with 
other types of alignment. 
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TABLE 113.1 Design Parameters for Two Regeneration Templates 

Design Parameter of ECM Analog DRT NRT 

Degradation rate, enzyme units <120 >150 

Average pore diameter, |im 20-125 5 

Pore channel orientation random axial 



These results have led to a design for an NRT consisting of a specified degradation rate, average pore 
diameter, and pore channel alignment as shown in Table 113.1 in which the structural parameters of 
NRT are contrasted to those of DRT. 

Studies of Nerve Regeneration Using Degradable Tubes 

Porous collagen tubes without matrix content have been extensively studied as guides for peripheral 
nerve regeneration in rodents and nonhuman primates. The walls of these collagen tubes had an average 
pore diameter which was considered sufficiently large for transport of molecules as large as bovine serum 
albumin (MW = 68 kDa). A 4-mm gap in the sciatic nerve of the rat was the standard injury studied. 
Other injury models that were studied included the 4-mm gap and the 15-mm gap in the adult monkey. 

The use of an empty tube did not allow for any degree of optimization of tube parameters to be 
achieved in this study. Nevertheless, the long-term results showed almost complete recovery of motor 
and sensory responses, at rates that approximated the recovery obtained following use of the nerve 
autograft, currently the best conventional treatment in cases of massive loss of nerve mass. The success 
obtained suggests that collagen tubes can be used instead of autografts, the harvesting of which subject 
the patient to nerve-losing surgery. 

Even more significant improvement in quality of nerve regeneration is obtained when collagen tubes 
are filled with NRT. Although lower than normal, the histomorphometric and electrophysiological prop- 
erties of the regenerate resulting from use of an NRT-fiHed collagen tube have been statistically indistin- 
guishable from those of the autograft control. Probably, however, the most important effect observed 
following implantation of the NRT-filled collagen tube has been the observation that, while the total 
number of myelinated axons in regenerated nerves had reached a plateau by 30 weeks after injury, the 
number of axons with diameter larger than 6 Ltm continues to increase at substantial rates through the 
completion of a recent study (60 weeks). Axons with a diameter larger than 6 Ltm have been uniquely 
associated with the A-fiber peak of the action potential, which determines the maximum conduction 
velocity of the regenerates. Thus, kinetic evidence has been presented which supports the view that the 
nerve trunk maturation process continues beyond 60 weeks after injury, resulting in a nerve trunk which 
increasingly approaches the structure of the normal control. 

113.9 In Situ of Synthesis of Meniscus Using a Meniscus 
Regeneration Template (MRT) 

The meniscus of the knee performs a variety of functions which amount to joint stabilization and lubri- 
cation, as well as shock absorption. Its structure is that of fibrocartilage, consisting primarily of type I 
collagen fibers populated with meniscal fibrochondrocytes. The architecture of collagen fibers is complex. 
In this tissue, which has a shape reminiscent of one-half a doughnut, the collagen fibers are arranged in 
a circumferential pattern which is additionally reinforced by radially placed fibers. The meniscus can be 
torn during use, an event which causes pain and disfunction. Currently, the accepted treatments are partial 
or complete excision of torn tissue. The result of such treatment is often unsatisfactory, since the treated 
meniscus has an altered shape which is incompatible with normal joint motion and stability. The long- 
term consequence of such incompatibility is joint degeneration, eventually leading to osteoarthritis. 
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An effort to induce regeneration of the surgically removed meniscus was based on the use of a type I 
collagen-chondroitin 6-sulfate copolymer. The precise structural parameters of this matrix have not been 
reported, so it is not possible to discuss the results of this study in terms of possible similarities and 
differences with DRT and NRT. The study focused on the canine knee joint, since the latter is particularly 
sensitive to biomechanical alterations; in this model, joint instabilities rapidly lead to osteoarthritic 
changes, which can be detected experimentally. Spontaneous regeneration of the canine meniscus fol- 
lowing excision is partial and leads to a biomechanically inadequate tissue which does not protect the 
joint from osteoarthritic changes. The latter condition provides the essential negative control for the 
study. In this study, knee joints were subjected to 80% removal (resection) of the medial meniscus, and 
the lesion was treated either by an autograft or by an ECM analog or was not treated at all. Evaluation 
of joint function was performed by studying joint stability, gait, and treadmill performance and was 
extended up to 27 months. No evidence was presented to show that the structure of the ECM analog 
was optimized to deliver maximal regeneration. 

The results of this study showed that two-thirds of the joints implanted with the ECM analog, two- 
thirds of the joints which were autografted, and only 25% of the joints which were resected without further 
treatment showed regeneration of meniscal tissue. These results were interpreted to suggest that the ECM 
analog, or meniscus regeneration template (MRT), supported significant meniscal regeneration and pro- 
vided enough biomechanical stability to minimize degenerative osteoarthritis in the canine knee joint. 

Recently, a clinical trial of the feasibility of MRT treatment was conducted on nine patients with either 
an irreparable tear of the meniscal cartilage or major loss of meniscal cartilage, and who remained in 
the study for at least 36 months. Following deletion of irreparably damaged meniscal tissue, MRT was 
implanted at the site of the missing meniscal tissue mass. The results showed that implantation of MRT 
induced regeneration of meniscal cartilage while the template was undergoing degradation. Compared 
to patients who were observed during the same period after meniscectomy (removal of damaged meniscal 
tissue without MRT treatment), patients who were implanted with MRT showed significant reduction 
in pain as well as greatly improved resumption of strenuous activity. 

Defining Terms 

Autograft: The patient's own tissue or organ, harvested from an intact area. 

Cell lifeline number, S: A dimensionless number that expresses the relative magnitudes of chemical 

reaction and diffusion. This number, defined as S in Eq. (113.2), can be used to compute the 

maximum path length l c over which a cell can migrate in a scaffold while depending on diffusion 

alone for transport of critical nutrients that it consumes. When the critical length is exceeded, the 

cell requires transport of nutrients by angiogenesis in order to survive. 
Cultured epithelia: A mature, keratinizing epidermis synthesized in vitro by culturing epithelial cells 

removed from the patient by biopsy. A relatively small skin biopsy ( 1 cm 2 ) can be treated to yield 

an area larger by about 10,000 in 2-3 weeks and can then be grafted on patients. 
Dermal equivalent: A term which has been loosely used in the literature to describe a device that 

replaces, usually temporarily, the functions of the dermis following injury. 
Dermis: A 1-5 mm layer of connective tissue populated with quiescent fibroblasts which lies underneath 

the epidermis. It is separated from the former by a very thin basement membrane. The dermis of 

adult mammals does not regenerate spontaneously following injury. 
Dermis regeneration template (DRT): A graft copolymer of type I collagen and chondroitin 6-sulfate, 

average pore diameter 20-125 p.m, degrading in vivo to an extent of about 50% in 2 weeks, which 

induces partial regeneration of the dermis in wounds from which the dermis has been fully excised. 

When seeded with keratinocytes prior to grafting, this analog of extracellular matrix has induced 

simultaneous synthesis both of a dermis and an epidermis. 
ECM analog: A model of extracellular matrix, consisting of a highly porous graft copolymer of collagen 

and a glycosaminoglycan. 
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Epidermis: The cellular outer layer of skin, about 0.1 mm thick, which protects against moisture loss 
and against infection. An epidermal graft, e.g., cultured epithelium or a thin graft removed surgi- 
cally, requires a dermal substrate for adherence onto the wound bed. The epidermis regenerates 
spontaneously following injury, provided there is a dermal substrate underneath. 

Isomorphous tissue replacement: A term used to describe the synthesis of new, physiologic tissue 
within a regeneration template at a rate of the same order as the degradation rate of the template. 
This relation, described by Eq. (113.1), is the defining equation for a biodegradable scaffold which 
couples with, or interacts in this unique manner with, the inflammatory response of the wound bed. 

Meniscus regeneration template (MRT): A graft copolymer of type I collagen and an unspecified 
glycosaminoglycan, average pore diameter unspecified, which has induced partial regeneration of 
the knee meniscus in dogs following 80% excision of the meniscal tissue. 

Mikic number, Mi: Ratio of the characteristic freezing time of the aqueous medium of the collagen- 
GAG suspension, £,-, to the characteristic time for entry, t e , of a container filled with the suspension 
which is lowered at constant velocity into a well-stirred cooling bath. Mi, defined by Eq. (113.5), 
can be used to design implants which have high alignment of pore channels along a particular 
axis, or no preferred alignment. 

Morphogenesis: The shaping of an organ during embryologic development or during wound healing, 
according to transcription of genetic information and local environmental conditions. 

Nerve regeneration template (NRT): A graft copolymer of type I collagen and chondroitin 6-sulfate, 
average pore diameter 5 mm, degrading in vivo to an extent of about 50% in 6 weeks, which has 
induced partial regeneration of the sciatic nerve of the rat across a 15-mm gap. 

Regeneration: The synthesis of new, physiologic tissue at the site of a tissue (one cell type) or organ 
(more than one cell type) which either has been lost due to injury or has failed due to a chronic 
condition. 

Regeneration template: A biodegradable device which, when attached to a missing organ, induces its 
regeneration. 

Scar: The end result of a repair process in skin and other organs. Scar is morphologically different from 
skin, in addition to being mechanically less extensible and weaker than skin. The skin regeneration 
template induces synthesis of nearly physiologic skin rather than scar. Scar is also formed at the 
site of severe nerve injury. 

Self-crosslinking: A procedure for reducing the biodegradation rate of collagen, in which collagen 
chains are covalently bonded to each other by a condensation reaction which is driven by drastic 
dehydration of the protein. This reaction illustrated by Eqs. (113.3a) and (1 13.3&), is also used to 
graft glycosaminoglycan chains to collagen chains without use of an extraneous crosslinking agent. 

Stereology: The discipline which relates the quantitative statistical properties of three-dimensional 
structures to those of their two-dimensional sections or projections. In reverse, stereologic proce- 
dures allow reconstruction of certain aspects of three-dimensional objects from a quantitative 
analysis of planar images. Its rules are used to determine features of the pore structure of ECM 
analogs. 
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Cells of the vascular system are constantly exposed to mechanical (hemodynamic) forces due to the flow 
of blood. The forces generated in the vasculature include the frictional force or a fluid shear stress caused 
by blood flowing tangentially across the endothelium, a tensile stress caused by circumferential vessel 
wall deformations, and a net normal stress caused by a hydrodynamic pressure differential across the 
vessel wall. We will restrict our discussion to examining fluid shear stress modulation of vascular cell 
function. The endothelium is a biologically active monolayer of cells providing an interface between the 
flowing blood and tissues of the body. It can synthesize and secrete a myriad of vasoconstrictors, 
vasodilators, growth factors, fibrinolytic factors, cytokines, adhesion molecules, matrix proteins, and 
mitogens that modulate many physiologic processes, including wound healing, hemostasis, vascular 
remodeling, vascular tone, and immune and inflammatory responses. In addition to humoral stimuli, it 
is now well accepted that endothelial cell synthesis and secretion of bioactive molecules can be regulated 
by the hemodynamic forces generated by the local blood flow. These forces have been hypothesized to 
regulate neovascularization and the structure of the blood vessel [Hudlicka, 1984]. Clinical findings 
further show that arterial walls undergo an endothelium-dependent adaptive response to changes in 
blood flow, with blood vessels in high flow regions tending to enlarge and vessels in the low flow region 
having reduced lumen diameter, thereby maintaining a nearly constant shear stress at the vessel wall 
[Zarins et al., 1987]. In addition to playing an active role in the normal vascular biology, hemodynamic 
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■ Wall thickening & Atherosclerotic Plaque 

FIGURE 114.1 Atherosclerotic plaques develop in regions of arteries where the flow rate (and resultant wall shear 
stress) is relatively low, which is often downstream from a vessel bifurcation, where there can be flow separation from 
the outer walls and regions of recirculation. These regions of low shear stress are pathologically prone to vessel wall 
thickening and thrombosis. 



forces have also been implicated in the pathogenesis of a variety of vascular diseases. Atherosclerotic 
lesion-prone regions, characterized by the incorporation of Evans blue dye, enhanced accumulation of 
albumin, fibrinogen, and LDL, increased recruitment of monocytes, and increased endothelial turnover 
rates exhibit polygonal endothelial cell morphology typically seen in a low-shear environment, as opposed 
to nonlesion regions that have elongated endothelial cells characteristic of high-shear regions [Nerem, 
1993]. 

In vivo studies of the distribution of atherosclerosis and the degree of intimal thickening have shown 
preferential plaque localization in low-shear regions. Atherosclerotic lesions were not seen in random 
locations but were instead found to be localized to regions of arterial branching and sharp curvature, 
where complex flow patterns develop, as shown in Fig. 114.1 [Asakura & Karino, 1990; Friedman et al., 
1981; Gibson et al, 1993; Glagov et al, 1988; Ku et al., 1985; Levesque et al, 1989; Zarins et al, 1983]. 
Morphologically, intact endothelium over plaque surfaces showed variation in shape and size and loss 
of normal orientation, characteristic of low-shear conditions [Davies et al., 1988]. Flow in the vascular 
system is by and large laminar, but extremely complex time dependent flow patterns can develop in 
localized regions of complex geometry. Zarins and coworkers [1983] have shown that in the human 
carotid bifurcation, regions of moderate-to-high shear stress where flow remains unidirectional and 
axially aligned, were relatively free of intimal thickening, whereas extensive plaque localization was seen 
in the regions where low shear stress, fluid separation from the wall, and complex flow patterns were 
predominant. Asakura and Karino [1990] presented a technique for flow visualization by using trans- 
parent arteries where they directly correlated flow patterns to regions of plaque formation in human 
coronary arteries. They observed that the flow divider at the bifurcation point, a region of high shear 
stress, was relatively devoid of plaque deposition, whereas the outer wall, a region of low shear stress, 
showed extensive plaque formation. Similar patterns were also seen in curved vessels, where the inner 
wall or the hip region of the curve, a region of low shear and flow separation, exhibited plaque formation. 
In addition to the direct shear-stress-mediated effects, the regions of complex flow patterns and recircu- 
lation also tend to increase the residence time of circulating blood cells in susceptible regions, whereas 
the blood cells are rapidly cleared away from regions of high wall shear and unidirectional flow [Glagov 
et al., 1988]. This increased transit time could influence plaque deposition by favoring margination of 
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monocytes and platelets, release of vasoactive agents, or altered permeability at the intercellular junctions 
to extracellular lipid particles and possible concentration of procoagulant materials [Nollert et al„ 1991]. 

The endothelium forming the interface between blood and the surrounding tissues is believed to act 
as a sensor of the local hemodynamic environment in mediating both the normal response and the 
pattern of vascular diseases. In vivo studies have shown changes in the actin microfilament localization 
in a shear-dependent manner. Actin stress fibres have been observed to be aligned with the direction of 
flow in high-shear regions, whereas they were mostly present in dense peripheral bands in the low-shear 
regions [Kim et al., 1989]. Langille and Adamson [1981] showed that the cells in large arteries, away 
from the branches, were aligned parallel to the long axes of the artery in rabbit and mouse. Similar results 
were shown in coronary arteries of patients undergoing cardiac transplantation [Davies et al., 1988]. 
Near the branch points of large arteries, however, cells aligned along the flow streamlines. In smaller 
blood vessels, where secondary flow patterns did not develop, the cell alignment followed the geometry 
of the blood vessel. In fact, endothelial cell morphology and orientation at the branch may be a natural 
marker of the detailed features of blood flow [Nerem et al., 1981]. 

Another cell type that is likely to be affected by hemodynamic forces is the vascular smooth muscle 
cell (SMC) present in the media. In the early stages of lesion development, SMCs proliferate and migrate 
into the intima [Ross, 1993; Schwartz et al., 1993]. Since the endothelium is intact in all but the final 
stages of atherosclerosis, SMCs are unlikely to be directly affected by shear stress. Most of the direct force 
they experience comes from the cyclical stretch forces experienced by the vessel itself due to pressure 
pulses [Grande et al., 1989]. Shear forces acting on the endothelium, however, can release compounds 
such as endothelin, nitric oxide, and platelet- derived growth factors that act as SMC mitogens and can 
modulate SMC tone. Ono and colleagues [1991] showed that homogenate of shear- stressed endothelial 
cells contained increased amounts of collagen and stimulated SMC migration in vitro, compared to 
endothelial cells grown under static conditions. The local shear environment could thus act directly or 
indirectly on the cells of the vascular wall in mediating a physiologic response. 

This chapter presents what is currently known regarding how the mechanical agonist of shear stress 
modulates endothelial cell function in the context of vascular physiology and pathophysiology. In dis- 
cussing this topic we have adopted an outside-in approach. That is, we first discuss how shear stress 
affects endothelial cell-blood cell interactions, then progress to how shear stress affects the endothelial 
cell cytoskeleton, signal transduction, and protein secretion, and finally end with how shear stress 
modulates endothelial cell gene expression. We close with gene therapy and tissue engineering consid- 
erations related to how endothelial cells respond to shear stress. Before proceeding, however, we discuss 
the devices and methodology used for studying the endothelial cell responses to shear stress in vitro. 

114.1 Devices and Methodology Used 
for in Vitro Experiments 

In vivo studies aimed at understanding cellular responses to shear forces have the inherent problem that 
they cannot quantitatively define the exact features of the hemodynamic environment. Moreover, it is 
very difficult to say if the resultant response is due to shear stress or some other feature associated with 
the hemodynamic environment. Cell culture studies and techniques for exposing cells to a controlled 
shear environment in vitro have been increasingly used to elucidate cellular responses to shear stress and 
flow. Mechanical-force-induced changes in cell function have been measured in vitro using mainly two 
cell types, cultured monolayers of bovine aortic endothelial cells (BAECs) and human umbilical vein 
endothelial cells (HUVECs). Shear stress is typically generated in vitro by flowing fluid across endothelial 
cell monolayers under controlled kinematic conditions, usually in the laminar flow regime. Parallel plate 
and cone-and-plate geometries have been the most common. Physiologic levels of venous and arterial 
shear stress range between 1-5 dynes/cm 2 and 6-40 dynes/cm 2 , respectively. 

The use of the parallel plate flow chamber allows one to have a controlled and well-defined flow 
environment based on the chamber geometry (fixed) and the flow rate through the chamber (variable). 
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In addition, individual cells can be visualized in real time using video microscopy. Assuming parallel 
plate geometry and Newtonian fluid behavior, the wall shear stress on the cell monolayer in the flow 
chamber is calculated as 

--"* (114.1) 



M 



where Q is the volumetric flow rate, (J. is the viscosity of the flowing fluid, h is the channel height, b is 
the channel width, and X vv is the wall shear stress. The flow chambers are designed such that the entrance 
length is very small compared to the effective length of the chamber [Frangos et al., 1985]. Therefore, 
entry effects can be neglected, and the flow is fully developed and parabolic over nearly the entire length 
of the flow chamber. Flow chambers usually consist of a machined block, a gasket whose thickness 
determines in part the channel depth, and a glass coverslip to which is attached a monolayer of endothelial 
cells (Fig. 1 14.2a). The individual components are held together either by a vacuum, or by evenly torqued 
screws, thereby ensuring a uniform channel depth. The flow chamber can have a myriad of entry ports, 
bubble ports, and exit ports. For short-term experiments, media are drawn through the chamber over 
the monolayer of cells using a syringe pump. For long-term experiments, the chamber is placed in a flow 
loop. In a flow loop (Fig. 1 14.2&), cells grown to confluence on glass slides can be exposed to a well- 
defined shear stress by recirculation of culture medium driven by gravity [Frangos et al., 1985]. Culture 
medium from the lower reservoir is pumped to the upper reservoir at a constant flow rate such that there 
is an overflow of excess medium back into the lower reservoir. This overflow serves two purposes: (1) It 
maintains a constant hydrostatic head between the two reservoirs, and (2) it prevents entry of air bubbles 
into the primary flow section upstream of the flow chamber that could be detrimental to the cells. The 
pH of the medium is maintained at physiologic levels by gassing with a humidified mixture of 95% air 
and 5% CO,. The rate of flow in the line supplying the chamber is determined solely by gravity and can 
be altered by changing the vertical separation between the two reservoirs. A sample port in the bottom 
reservoir allows periodic sampling of the flowing medium for a time-dependent assay. 

As mentioned above, cone-and-plate geometries can also be utilized. A schematic of a typical cone- 
and-plate viscometer is shown in Fig. 1 14.2c. Shear stress is produced in the fluid contained between the 
rotating cone and the stationary plate. Cells grown on coverslips can be placed in the shear field (up to 
12 at a time). For relatively small cone angles and low rates of rotation, the shear stress throughout the 
system is independent of position. The cone angle compensates for radial effects seen in plate-and-plate 
rheometers. The wall shear stress (x vv ) on the cell monolayer in the cone-and-plate viscometer is calculated 
as 

3 T 

X = (114.2) 

w 2jt£ 3 

where T is the applied torque and _R is the cone radius. The flow becomes turbulent, however, at the 
plate's edge and at high rotational speeds. Modifications from the basic design have allowed use of an 
optical system with the rheometer, enabling direct microscopic examination and real-time analysis of 
the cultured cells during exposure to shear stress [Dewey et al., 1981; Schnittler et al., 1993]. For a more 
complete description of in vitro device design and applications, refer to the text edited by Frangos [Tran- 
son-Tay, 1993]. 

114.2 Shear Stress-Mediated Cell-Endothelium Interactions 

Cell-cell and cell-substrate interactions in the vascular system are of importance in a number of physi- 
ologic and pathologic situations. Lymphocytes, platelets, or tumor cells in circulation may arrest at a 
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FIGURE 114.2 Devices used for in vitro study of shear stress effects on vascular endothelial cells, (a) Parallel plate 
flow chamber, (b) flow loop, (c) cone-and-plate viscometer. 

particular site as a result of interaction with the endothelium or the subendothelial matrix. While the 
margination of leukocyte/lymphocyte to the vessel wall be a normal physiologic response to injury or 
inflammation, adhesion of blood platelets to the subendothelium and subsequent platelet aggression 
could result in a partial or complete occlusion of the blood vessel leading to thrombosis or stroke. In 
addition, the adhesion of tumor cells to the endothelium is often the initial step in the development of 
secondary metastases. The adhesion of leukocytes, platelets, and tumor cells is not only mediated by 
adhesion molecules on the endothelium but also mediated by the hemodynamic force environment 
present in the vasculature. In fact, the specific molecular mechanisms employed for adhesion often vary 
with the local wall shear stress [Alevriadou et al., 1993; Lawrence et al., 1990]. 
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Targeting of circulating leukocytes to particular regions of the body is an aspect of immune system 
function that is currently of great research interest. This targeting process consists of adhesion of a specific 
subpopulation of circulating leukocytes to a specific area of vascular endothelium via cell surface adhesion 
receptors. The large number of receptors involved and the differential regulation of their expression on 
particular cell subpopulations make this process very versatile but also quite complicated. An extremely 
important additional complication arises from the fact that these interactions occur within the flowing 
bloodstream. Study of these various types of adhesive interactions requires accurate recreation of the 
flow conditions experienced by leukocytes and endothelial cells. Lawrence and coworkers examined 
neutrophil adhesion to cytokine-stimulated endothelial cells under well-defined postcapillary venular 
flow conditions in vitro [Lawrence et al., 1987; Lawrence et al„ 1990; Lawrence & Springer, 1991]. They 
also demonstrated that under flow conditions neutrophil adhesion to cytokine-stimulated endothelial 
cells is mediated almost exclusively by CD18-independent mechanisms but that subsequent neutrophil 
migration is CD18-dependent [Lawrence & Springer, 1991; Smith et al., 1991]. The initial flow studies 
were followed by many further studies both in vitro [Abbassi et al., 1991, 1993; Anderson et al., 1991; 
Hakkert et al, 1991; Jones et al., 1993; Kishimoto et al., 1991] and in vivo [Ley et al., 1991; Perry & 
Granger, 1991; von Andrian et al., 1991; Watson et al., 1991] which clearly distinguish separate mecha- 
nisms for initial adhesion/rolling and firm adhesion/leukocyte migration. Research has further shown 
that in a variety of systems, selectin/carbohydrate interactions are primarily responsible for initial adhe- 
sion and rolling, and firm adhesion and leukocyte migration are mediated primarily by integrin/peptide 
interactions. Methodology discussed for studying receptor specificity of adhesion for leukocytes under 
flow can also be utilized for studying red cell-endothelial cell interactions [Barabino et al., 1987; Wick 
etal, 1987, 1993]. 

The interaction of tumor cells with endothelial cells is an important step in tumor metastasis. To 
adhere to the vessel wall, tumor cells that come into contact with the microvasculature must resist the 
tractive force of shear stress that tends to detach them from the vessel wall [Weiss, 1992]. Hence, studies 
of the mechanisms involved in the process of tumor metastasis must take into account the physical forces 
acting on the tumor cells. Bastida and coworkers [1989] have demonstrated that tumor cell adhesion 
depends not only on tumor cell characteristics and endothelial cell adhesion molecule expression but on 
shear stress in the interaction of circulating tumor cells with the endothelium. The influence of shear 
stress on tumor cell adhesion suggests that attachment of tumor cells to vascular structures occurs in 
areas of high shear stress, such as the microvasculature [Bastida et al., 1989] . This is supported by earlier 
pathologic observations that indicate preferential attachment of tumor cells on the lung and liver capillary 
system [Warren, 1973]. Some tumor cell types roll and subsequently adhere on endothelial cells using 
selectin-mediated mechanisms similar to leukocytes, whereas others adhere without rolling using inte- 
grin-mediated receptors on endothelial cells [Bastida et al., 1989; Giavazzi et al., 1993; Kojima et al., 
1992; Menter et al, 1992; Patton et al, 1993; Pili et al, 1993]. It has been postulated that some tumor 
cell types undergo a stabilization process prior to firm adhesion that is mediated by transglutaminase 
crosslinking [Menter et al., 1992; Patton et al., 1993]. Recently, Pili and colleagues [1993] demonstrated 
that tumor cell contact with endothelial cells increases Ca 2+ release from endothelial cell intracellular 
stores, which may have a fundamental role in enhancing cell-cell adhesion. The factors and molecular 
mechanisms underlying tumor cell and endothelial cell interactions remain largely undefined and are 
certain to be further explored in the future. 

The endothelium provides a natural nonthrombogenic barrier between circulating platelets and the 
endothelial basement membrane. However, there are pathologic instances in which the endothelium 
integrity is compromised, exposing a thrombogenic surface. Arterial thrombosis is the leading cause of 
death in the United States. Among the most likely possibilities for the initiation of arterial platelet thrombi 
are: (1) adhesion of blood platelets onto the subendothelium of injured arteries and arterioles or on 
ruptured atherosclerotic plaque surfaces, containing collagen and other matrix proteins, with subsequent 
platelet aggregation, or (2) shear-induced aggregation of platelets in areas of the arterial circulation 
partially constricted by atherosclerosis or vasospasm. Various experimental models have been developed 
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to investigate the molecular mechanisms of platelet attachment to surfaces (adhesion) and platelet 
cohesion to each other (aggregation) under shear stress conditions. Whole-blood perfusion studies using 
annular or parallel-plate perfusion chambers simulate the first proposed mechanism of arterial thrombus 
formation in vivo, that may occur as a result of adhesion on an injured, exposed subendothelial or 
atherosclerotic plaque surface [Hubbel & Mclntire, 1986; Weiss et al., 1978, 1986]. Platelets arriving 
subsequently could, under the right conditions, adhere to each other, forming aggregates large enough 
to partially or completely occlude the blood vessel. In vitro studies have shown an increase in thrombus 
growth with local shear rate, an event that is believed to be the result of enhanced arrival and cohesion 
of platelets near the surface at higher wall shear stresses [Turitto et al., 1987] . Video microscopy provides 
information on the morphologic characteristics of thrombi and enables the reconstruction of three- 
dimensional models of thrombi formed on surfaces coated with biomaterials or endothelial cell basement 
membrane proteins. Macroscopic analysis of thrombi can provide information on platelet mass transport 
and reaction kinetics with the surface, and microscopic analysis allows dynamic real-time study of cell- 
surface and intercellular interactions. Such a technology enables the study of key proteins involved in 
mural thrombus formation, such as vWF [Alevriadou et al., 1993; Folie & Mclntire, 1989]. In addition, 
tests of antithrombotic agents and investigation of the thrombogenicity of various purified components 
of the vessel wall or polymeric biomaterials can be performed. 

114.3 Shear Stress Effects on Cell Morphology 
and Cytoskeletal Rearrangement 

In addition to mediating cell-endothelial cell interactions, shear stress can act directly on the endothe- 
lium. It has been demonstrated for almost a decade that hemodynamic forces can modulate endothelial 
cell morphology and structure [Barbee et al., 1994; Coan et al., 1993; Eskin et al., 1984; Franke et al., 
1984; Girard & Nerem, 1991, 1993; Ives et al, 1986; Langille et al, 1991; Levesque et al, 1989; Wechezak 
et al., 1985]. Conceivably, the cytoskeletal reorganization that occurs in endothelial cells several hours 
after exposure to flow may, in conjunction with shape change, transduce mechanical signals to cytosolic 
and nuclear signals (mechanotransduction), thereby playing a role in gene expression and signal trans- 
duction [Ingber, 1993; Resnick et al., 1993; Watson, 1991]. In fact, investigators have shown specific gene 
expression related to cytoskeletal changes [Botteri et al., 1990; Breathnach et al., 1987; Ferrua et al., 1990; 
Werb et al., 1986]. F-actin has been implicated as the principal transmission element in the cytoskeleton 
and appears to be required for signal transduction [Watson, 1991]. Actin filaments are anchored in the 
plasma membrane at several sites, including focal adhesions on the basal membrane, intercellular adhe- 
sion proteins, integral membrane proteins at the apical surface, and the nuclear membrane [Davies & 
Tripathi, 1993]. Substantiating this, Barbee and coworkers [1994] have recently shown, utilizing atomic 
force microscopy, that F-actin fiber stress bundles are formed in the presence of fluid flow and the fibers 
are coupled to the apical membrane. Moreover, endothelial cell shape change and realignment with flow 
can be inhibited by drugs that interfere with microfilament turnover [Davies & Tripathi, 1993] . Although 
all evidence leads one to believe that the endothelial cell cytoskeleton can respond to flow and that F-actin 
is involved, the exact mechanism involved remains to be elucidated. 

114.4 Shear Stress Effects on Signal Transduction 
and Mass Transfer 

Shear stress and resultant convective mass transfer are known to affect many important cytosolic second 
messengers in endothelial cells. For instance, shear stress is known to cause ATP-mediated increases in 
calcium ions (Ca 2+ ) [Ando et al, 1988; Dull & Davies, 1991; Mo et al, 1991; Nollert & Mclntire, 1992]. 
Changes in flow influence the endothelial boundary layer concentration of ATP by altering the convective 
transport of exogenous ATP, thereby altering ATP's interaction with both the P 2 -purinoreceptor and 
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ecto-ATPase. At low levels of shear stress, degradation of ATP by ecto-ATPase exceeds the rate of diffusion, 
and the steady state concentration of ATP remains low. In contrast, at high levels of shear stress, convection 
enhances the delivery of ATP from upstream to the P 2 -purino receptor, and diffusion exceeds the rate of 
degradation by the ecto-ATPase [Mo et al., 1991]. Whether physiologic levels of shear stress can directly 
increase intracellular calcium remains unclear. Preceding the calcium increases are increases in inositol- 
1,4,5 trisphosphate (IP 3 ) [Bhagyalakshmi et al., 1992; Bhagyalakshmi & Frangos, 1989b; Nollert et al., 
1990; Prasad et al., 1993], which binds to specific sites on the endoplasmic reticulum and causes release 
of Ca 2+ from intracellular stores, and diacylglycerol (DAG) [Bhagyalakshmi & Frangos, 1989a]. Elevated 
levels of both DAG and Ca 2+ can activate several protein kinases, including protein kinase C (PKC). 
Changes in Ca 2+ , IP 3 , and DAG are evidence that fluid shear stress activated the PKC pathway. In fact, 
the PKC pathway has been demonstrated to be activated by shear stress [Bhagyalakshmi & Frangos, 
1989b; Hsieh et al, 1992, 1993; Kuchan & Frangos, 1993; Milner et al., 1992]. In addition to the PKC 
pathway, pathways involving cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophos- 
phate (cGMP) may also be modulated by shear stress, as evidenced by increases in cAMP [Bhagyalakshmi 
& Frangos, 1989b] and cGMP [Kuchan & Frangos, 1993; Ohno et al., 1993] with shear stress. Moreover, 
it has been shown recently that shear stress causes acidification of cytoslic pH [Ziegelstein et al., 1992]. 
Intracellular acidification of vascular endothelial cells releases Ca 2+ into the cytosol [Ziegelstein et al., 
1993] . This Ca 2+ mobilization maybe linked to endothelial synthesis and release of vasodilatory substances 
during the pathological condition of acidosis. 

114.5 Shear Stress Effects on Endothelial Cell 
Metabolite Secretion 

The application of shear stress in vitro is accompanied by alterations in protein synthesis that are 
detectable within several hours after initiation of the mechanical agonist. Shear stress may regulate the 
expression of fibrinolytic proteins by endothelial cells; tPA is an antithrombotic glycoprotein and serine 
protease that is released from endothelial cells. Once released, it rapidly converts plasminogen to plasmin 
which, in turn, dissolves fibrin clots. Diamond and coworkers [1989] have shown that venous levels of 
shear stress do not affect tPA secretion, whereas arterial levels increase the secretion rate of tPA. Arterial 
flows lead to a profibrinolytic state that would be beneficial in maintaining a clot-free artery. In contrast 
to tPA, neither venous nor arterial levels of shear stress cause significant changes in plasminogen activator 
inhibitor-1 (PAI-1) secretion rates [Diamond et al., 1989; Kuchan & Frangos, 1993]. Increased prolifer- 
ation of smooth muscle cells is one of the early events of arteriosclerosis. The secretion rate of endothelin- 1 
(ET-1), a potent smooth muscle cell mitogen, has been shown to increase in response to venous levels 
of shear stress and decrease in response to arterial levels of shear stress [Kuchan & Frangos, 1993; Milner 
et al., 1992; Nollert et al, 1991; Sharefkin et al., 1991; Yoshizumi et al, 1989]. Both the in vitro tPA and 
ET-1 results are consistent with in vivo observations that atherosclerotic plaque development occurs in 
low shear stress regions as opposed to high shear stress regions. The low shear stress regions usually 
occur downstream from vessel branches. In these regions we would expect low secretion of tPA and high 
secretion of ET-1, leading to locally increased smooth muscle cell proliferation (intimal thickening of 
the vessel) and periodic problems with clot formation (thrombosis). Both of these observations are 
observed pathologically and are important processes in atherogenesis [Mclntire, 1994]. 

Important metabolites in the arachidonic acid cascade via the cyclooxygenase pathway; prostacylin 
(PGI 2 ) and prostaglandin (PGF 2 J are also known to increase their secretion rates in response to arterial 
levels of shear stress [Frangos et al., 1985; Grabowski et al., 1985; Nollert et al., 1989]. In addition, 
endothelial cells have been shown to increase their production of fibronectin (FN) in response to shear 
stress [Gupte & Frangos, 1990]. Endothelial cells may respond to shear stress by secreting FN in order 
to increase their attachment to the extracellular matrix, thereby resisting the applied fluid shear stress. 
Endothelial cells have been shown to modulate their receptor expression of intercellular adhesion mol- 
ecule- 1 (ICAM-1), vascular cell adhesion molecule- 1 (VCAM-1), and monocyte chemotactic peptide- 1 
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(MCP-1) in response to shear stress [Sampath et al., 1995; Shyy et al., 1994]. The adaptive expression of 
these adhesion molecules in response to hemodynamic shear stress may aid in modulating specific 
adhesion localities for neutrophils, leukocytes, and monocytes. 

114.6 Shear Stress Effects on Gene Regulation 

In many cases, the alterations in protein synthesis observed with shear stress are preceded by modulation 
of protein gene expression. The molecular mechanisms by which mechanical agonists alter the gene 
expression of proteins are under current investigation. Diamond and colleagues [1990] and Nollert and 
colleagues [1992] have demonstrated that arterial levels of shear stress upregulate the transcription of 
tPA mRNA, concomitant with tPA protein secretion. The gene expressions of ET-1 and PDGF, both of 
which are potent smooth muscle cell mitogens and vasoconstrictors, are also known to be modulated by 
hemodynamic forces. For instance, arterial levels of shear stress cause ET-1 mRNA to be downregulated 
[Malek et al, 1993; Malek & Izumo, 1992; Sharefkin et al, 1991]. ET-1 mRNA downregulation is sensitive 
to the magnitude of the shear stress in a dose-dependent fashion, reaching a saturation at 15 dynes/ cm 2 . 
Conversely, Yoshizumi and coworkers [1989] have shown that venous levels of shear stress cause a 
transient increase in ET-1 mRNA, peaking at 4 hours and returning to basal levels by 12-24 hours. 
However, they had previously reported downregulation of ET-1 mRNA expression [Yanagisawa et al., 
1988]. PDGF is expressed as a dimer composed of PDGF-A and PDGF-B subunits. There are conflicting 
reports as to how arterial shear stresses affect PDGF-B mRNA expression. Malek and colleagues [1993] 
have shown that arterial levels of shear stress applied to BAECs cause a significant decrease in PDGF-B 
mRNA expression over a 9-hr period. In contrast, Mitsumata and coworkers [1993] and Resnick and 
coworkers [1993] have shown increases in BAEC mRNA expression of PDGF-B when arterial shear 
stresses were applied. The discrepancy in the results may be attributed to differences in BAEC cell line 
origins or differences in the passage number of cells used. In support of the latter two investigators, Hsieh 
and colleagues [1992] have reported upregulation of PDGF-B mRNA in HUVECs in the presence of 
arterial shear stress. As with the B chain of PDGF, there are conflicting reports as to the affect of arterial 
shear stress on PDGF-A. Mitsumata and coworkers [1993] have reported no change in PDGF-A mRNA 
expression, whereas Hsieh and coworkers [1991, 1992] have reported a shear-dependent increase in the 
mRNA expression from 0-6 dynes/cm 2 which then plateaus from 6-51 dynes/cm 2 . Endothelial cell 
expression of ET-1 and PDGF may be important in blood vessel remodeling. In blood vessels exposed 
to increased flow, the chronic vasculature response is an increase in vessel diameter [Langille & O'Donnell, 
1986]. Hence, it is tempting to postulate that hemodynamic-force-modulated alterations in ET-1 and 
PDGF mRNA expression may account for much of this adaptive change. 

The gene expression of various adhesion molecules involved in leukocyte recruitment during inflam- 
mation and disease has also been investigated. ICAM-1 mRNA has been shown to be transiently upreg- 
ulated in the presence of venous or arterial shear stresses [Nagel et al., 1994; Sampath et al., 1995]. Its 
time-dependent response peaked at 1-3 hr following exposure to shear, before declining below basal 
levels with prolonged exposure of 6-24 hr. In contrast, VCAM-1 mRNA level was downregulated almost 
immediately upon onset of flow and was found to drop significantly below basal levels within 6 hr of 
initiation of flow at all magnitudes of shear stresses. E-selectin mRNA expression appeared to be generally 
less responsive to shear stress, especially at the lower magnitudes. After an initial downward trend 1 hr 
following exposure to shear stress (2 dynes/cm 2 ), E-selectin mRNA remained at stable levels for up to 
6 hr [Sampath et al., 1995]. Recent evidence shows that the expression of MCP-1, a monocyte-specific 
chemoattractant expressed on endothelial cells, also follows a similar biphasic response with shear stress 
like ICAM- 1 [Shyy et al., 1994] . In addition to adhesion molecules, the gene expression of several growth 
factors has been investigated. The mRNA expression of heparin-binding epidermal growth factor like 
growth factor (HB-EGF), a smooth muscle-cell mitogen, transiently increases in the presence of minimal 
arterial shear stress [Morita et al., 1993]. Transforming growth factor-(3l (TGF-(3l) mRNA has been 
reported to be upregulated in the presence of arterial shear stresses within 2 hr and remain elevated for 
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12 hr [Ohno et al., 1992]. In addition, Malek and coworkers [1993] have shown that basic fibroblast 
growth factor (bFGF) mRNA is upregulated in BAECs in the presence of arterial shear stresses. In 
HUVECs, however, no significant changes in bFGF message were observed [Diamond et al., 1990]. This 
difference is probably due to differences in cell source (human versus bovine). 

Proto-oncogenes code for binding proteins that either enhance or repress transcription and, therefore, 
are ideal candidates to act as gene regulators [Cooper, 1990]. Komuro and colleagues [1990, 1991] were 
the first to demonstrate that mechanical loading causes upregulation of c-fos expression. Recently, Hsieh 
and coworkers [1993] investigated the role of arterial shear stress on the mRNA levels of nuclear proto- 
oncogenes c-fos, c-jun, and c-myc. Gene expression of c-fos was transiently upregulated, peaking at 0.5 hr 
and returning to basal levels within an hour. In contrast, both c-jun and c-myc mRNA were upregulated 
to sustainable levels within an hour. The transcribed protein products of c-fos, c-jun, and c-myc may act 
as nuclear-signaling molecules for mechanically induced gene modulation [Nollert et al., 1992; Ranjan & 
Diamond, 1993]. 

114.7 Mechanisms of Shear Stress-Induced Gene Regulation 

Although no unified scheme to explain mechanical signal transduction and modulation of gene expres- 
sion is yet possible, many studies provide insight in an attempt to elucidate which second messengers 
are involved in gene regulation mediated by hemodynamic forces. There is substantial evidence that the 
PKC pathway may be involved in the gene regulation. A model of the PKC transduction pathway is shown 
in Fig. 1 14.3. Mitsumata and colleagues [ 1993] have shown that PDGF mRNA modulation by shear stress 
could be partially attributed to a PKC pathway. Hsieh and coworkers [1992] have shown that shear- 
induced PDGF gene expression in HUVECs is mainly mediated by PKC activation and requires Ca 2+ and 
the involvement of G proteins. In addition, they demonstrated that cAMP and cGMP dependent protein 
kinases are not involved in PDGF gene expression. Morita and colleagues [1993] have shown that shear- 
induced HB-EGF mRNA expression is mediated through a PKC pathway and that Ca 2+ may be involved 
in the pathway. PKC was also found to be an important mediator in flow-induced c-fos expression, with 
the additional involvement of G proteins, phospholipase C (PLC), and Ca 2+ [Hsieh et al., 1993] . Moreover, 
Levin and coworkers [1988] have shown that tPA gene expression is enhanced by PKC activation and 
Iba and coworkers [1992] have shown that cAMP is not involved in tPA gene expression. As depicted in 
Fig. 114.3, the PKC pathway may be involved in activating DNA-binding proteins via phosphorylation. 

In addition to second messengers, it has been proposed that cytoskeletal reorganization may play a 
role in regulating gene expression [Ingber, 1993; Resnick et al., 1993]. Morita and colleagues [1993] have 
shown that shear stress-induced ET-1 gene expression is mediated by the disruption of the actin cytosk- 
eleton and that microtubule integrity is also involved. Gene expression of other bioactive molecules may 
also be regulated by actin disruption. The actual molecular mechanisms involved in cytoskeletal-mediated 
gene expression remain unclear. However, the cytoskeleton may activate membrane ion channels and 
nuclear pores. 

In addition to second messengers and the cytoskeletal architecture, it has been postulated by Nollert 
and others [1992] that transcriptional factors that bind to the DNA may play an active role mediating 
the signal transduction between the cytosol and nucleus (Fig. 114.4). It is known that nuclear translo- 
cation of transcriptional factors and DNA-binding activity of the factors can be mediated by phospho- 
rylation [Bohmann, 1990; Hunter & Karin, 1992]. Many of the transcription factors are protein products 
of proto-oncogenes. It has previously been stated that shear stress increases expression of c-fos and c-jun. 
The gene products of c-fos and c-jun form protein dimers that bind to transcriptional sites on DNA 
promoters and act as either transcriptional activators or repressors. Two of the transcriptional sites to 
which the fos and jun family dimers bind are the TRE (tumor promoting agent response element) and 
CRE (cAMP response element). These have consensus sequences of TGACTCA and TGACGTCA, respec- 
tively. It is known that the promoter regions of tPA, PAI-1, ET, and TGF-(3l possess sequences of homology 
to the CRE and TRE [Nollert et al., 1992]. In addition to mediating known transcription factor binding 
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FIGURE 114.3 Model for shear stress induced protein kinase C (PKC) activation leading to modulation of gene 
expression. Mechanical agonists enhance membrane phosphoinositide turnover via phospholipase C (PLC), produc- 
ing inositol- 1,4,5 trisphosphate (IP 3 ), and diacylglycerol (DAG). DAG can then activate PKC. IP 3 may also activate 
PKC via Ca 2+ release. PKC phosphorylates DNA-binding proteins, thereby making them active. The nuclear binding 
proteins then alter mRNA transcription. R = receptor, G = G protein, PIP 2 = phosphatidylinositol 4,5-bisphosphate, 
(+) = activation. 

in perhaps novel ways, mechanical perturbations may initiate molecular signaling through specific stress- 
or strain-sensitive promoters that can activate gene expression. In fact, Resnick and colleagues [1993] 
have described a cis-acting shear stress response element (SSRE) in the PDGF-B promoter, and this 
sequence is also found in the promoters of tPA, ICAM-1, TGF-pU, c-fos, and c-jun but not in VCAM-1 
or E-Selectin promoters. A core-binding sequence of GAGACC was identified that binds to transcriptional 
factors found in BAEC nuclear extracts. The identify of transcriptional factors that bind this sequence 
remains unknown. In addition, Malek and coworkers [1993] have shown that shear-stress mediated ET-1 
mRNA expression is not dependent on either the PKC or cAMP pathways, but rather shear stress regulates 
the transcription of the ET-1 gene via an upstream cis element in its promoter. It remains to be seen if 
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FIGURE 1 14.4 Gene expression regulated by transcription/translation factors and mechanically sensitive promoters. 
(a) Phosphorylation of transcription factors allows their translocation to the nucleus and subsequent DNA binding. 
The transcription factors can activate or inhibit transcription. TRE = tumor promoting agent response element, 
TREB = tumor promoting agent response element binding protein, CRE = cAMP response element, CREB = cAMP 
response element binding protein, SSRE = shear sensitive response element, (b) The AU-binding factor binds the 
AUUUA motif located in the 3' untranslated regions of some mature mRNA and could play an important role in 
mediating stability and/or transport of these mRNAs; AUBF = AU binding factor. 

ET-1 is shown to have the same response element as that proposed by Resnick and colleagues. As 
mentioned previously, Mitsumata and coworkers [1993] found that PDGF mRNA modulation was not 
solely dependent on a PKC pathway. The PKC independent mechanism may very well involve the cis- 
acting shear stress response element that Resnick and colleagues described, though other factors are 
probably important in controlling the complex temporal pattern of gene expression seen in response to 
mechanical stimuli. The biphasic response of MCP-1 was shown to be regulated at the transcriptional 
level, possibly involving activation of AP-1, sites and the subsequent binding to the TRE site. The putative 
SSRE binding site was also identified in the 5' flanking region of the cloned MCP-1 gene suggesting an 
interactive mechanism, wherein these cis-acting elements collectively regulate the transcriptional activa- 
tion of MCP-1 gene under shear stress [Shyy et al., 1994]. 
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Another important determinant in the role played by shear stress on gene regulation could be at the 
level of mRNA stability. It is not known whether the shear-induced downregulation that has been reported, 
especially in the case of adhesion molecules, is due to a decrease in the transcriptional rate of the gene or 
due to a decrease in the stability of the transcript. The (3-adrenergic receptors, which are part of a large 
class of G-protein linked cell surface receptors, are also susceptible to desensitization and downregulation 
[Hadcock & Malbon, 1988]. A study of the molecular basis for agonist-induced instability revealed the 
existence of an M r 35,000 protein that specifically binds mRNA transcripts which contain the sequence 
AUUUA in their 3' untranslated region (3'-UTR) [Port et al., 1992]. Agonists that decrease the (3-adrenergic 
RNA transcript levels increased the level of this protein. Further, it has been shown that tandem repeats 
(three or more) of this sequence are needed for efficient binding of the destabilizer protein [Bohjanen et 
al., 1992]. Recent reports, however, suggest that AUUUA motif alone does not destabilize the mRNA 
transcripts and that the presence of additional elements in the AU-rich 3'-UTR may be necessary for the 
observed mRNA instability [Peppel et al, 1991]. Muller and others [1992] also report that binding of 
certain AU-binding factors (AUBF) to AUUUA motif accelerates the nuclear export of these mRNAs 
(Fig. 1 14.4). Further, their study indicates that this AUBF may protect AU-rich mRNAs against degradation 
by endoribonuclease V, by binding AUUUA motifs, thereby stabilizing the otherwise unstable mRNA. 

Analysis of the 3'-UTR of some of the adhesion molecules studied revealed the presence of the AUUUA 
motif in all three cases. Human VCAM-1 [Cybulsky et al., 1991] has 6 dispersed AUUUA repeats in its 
3'-UTR. Human ICAM-1 [Staunton et al, 1988] has 3 AUUUA repeats within 10 bases of each other, 
and 3-selectin [Hession et al., 1990] has 8 AUUUA repeats, which include 1 tandem repeat and 3 others 
that are within 10 bases of each other. The constitutively expressed gene human GAPDH [Tokunaga et 
al., 1987] does not have any AUUUA repeats in the 3' region analyzed. Of the other shear responsive 
genes, human PDGF-B (c-sis) gene has 1 AUUUA repeat [Ratner et al., 1985]; human endothelin gene 
[Bloch et al., 1989] has 4, three within 10 bases of each other; human tissue plasminogen activator (tPA) 
[Reddy et al., 1987] has 2; and human thrombomodulin [Jackman et al., 1987] has 9 AUUUA boxes, 
including 1 tandem repeat. The steady state levels of the tPA and PDGF have been shown to be increased, 
and those of the adhesion molecules and endothelin have been shown to be decreased by arterial shear 
stress [Diamond et al., 1990; Hsieh et al., 1992; Sampath et al., 1995], suggesting a possible mechanism 
of shear-induced destabilization of adhesion molecule mRNA. However, the observed changes can be at 
the transcriptional level, with the AUUUA motifs playing a role only in the transport of these mRNAs 
to the cytoplasm. As shown by Muller and colleagues [1992], transport of low-abundance mRNAs like 
cytokine mRNAs, whose cellular concentrations are one-hundredth or less of that of the high abundance 
mRNAs, are the ones that are prone to modulation by transport-stimulatory proteins such as AUBF. This 
can explain the presence of AUUUA boxes in the 3'-UTR of adhesion molecules and other modulators 
of vascular tone and not in that of the constitutively expressed GADPH, which is an essential enzyme in 
the metabolic pathway and is expressed in high copy numbers. A more detailed study using transcription 
run-on assays is needed to address these specific issues. 

114.8 Gene Therapy and Tissue Engineering 
in Vascular Biology 

Endothelial cells, located adjacent to the flowing blood stream, are ideally located for use as vehicles for 
gene therapy, since natural or recombinant proteins of therapeutic value can be expressed directly into the 
blood to manage cardiovascular diseases or inhibit vascular pathology. One could drive gene expression 
only in regions of vasculature where desired by using novel cis-acting stress- or strain-sensitive promoter 
elements or stress-activated transcription factors. For instance, endothelial cells in regions of low shear stress 
could be modified to express tPA so as to inhibit atherosclerotic plaque formation. An appropriate vector 
driven by the ET-1 promoter (active at low stress but downregulated at high stress) attached to the tPA gene 
would be a first logical construct in this application. Likewise, endothelial cells could be modified to increase 
proliferation rates in order to endothelialize vascular grafts or other vascular prostheses, thereby inhibiting 
thrombosis. In addition, endothelial cells could be modified to decrease expression of smooth muscle cell 
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mitogens (ET-1, PDGF) to prevent intimal thickening and restenosis. The endothelial cells could even be 
modified so as to secrete components that would inactivate toxic substances in the blood stream. Work has 
already begun to develop techniques to express foreign proteins in vivo [Nabel et al., 1989; Wilson et al., 
1989; Zwiebel et al., 1989]. Vical and Gen Vec are currently examining ways to express growth factors in 
coronary arteries to stimulate angiogenesis following balloon angioplasty [Glaser, 1994] . In addition, Tularik 
Inc is modifying endothelial cells to over express low density lipoprotein (LDL) receptors to remove 
cholesterol from the blood stream so as to prevent atherosclerotic plaque formation. The application of 
gene therapy to cardiovascular diseases is in its infancy and will continue to grow as we learn more about 
the mechanisms governing endothelial cell gene expression. Since endothelial cells lie in a mechanically 
active environment, predicting local secretion rates of gene products from transfected endothelial cells will 
require a knowledge of how these mechanical signals modulate gene expression for each target gene and 
promoter [Mclntire, 1994]. 

In addition to gene therapy applications, there are tissue engineering applications that can be realized 
once one gains a fundamental understanding of the function-structure relationship intrinsic to vascular 
biology. This includes understanding how hemodynamic forces modulate endothelial cell function and 
morphology. Of primary concern is the development of an artificial blood vessel for use in the bypass 
and replacement of diseased or occluded arteries [Jones, 1982; Nerem, 1991; Weinberg & Bell, 1986]. 
This is particularly important in the case of small- diameter vascular grafts (such as coronary bypass 
grafts), which are highly prone to reocclusion. The synthetic blood vessels must provide the structural 
support required in its mechanically active environment as well as provide endothelial-like functions, 
such as generating a nonthrombogenic surface. 

114.9 Conclusions 

This chapter has demonstrated the intricate interweaving of fluid mechanics and convective mass transfer 
with cell metabolism and the molecular mechanisms of cell adhesion that occur continuously in the 
vascular system. Our understanding of these mechanisms and how they are modulated by shear stress is 
really in the initial stages — but this knowledge is vital to our understanding of thrombosis, atherosclerosis, 
inflammation, and many other aspects of vascular physiology and pathophysiology. Knowledge of the 
fundamental cellular and molecular mechanisms involved in adhesion and mechanical force modulation 
of metabolism under conditions that mimic those seen in vivo is essential for real progress to be made 
in vascular biology and more generally in tissue engineering. 

Defining Terms 

CRE: cAMP response element; its consensus sequence TGACGTCA is found on genes responsive to 

cAMP agonists such as forskolin. 
Gene regulation: Transcriptional and posttranscriptional control of expression of genes in eukaryotes 

where regulatory proteins bind specific DNA sequences to turn a gene either on (positive control) 

or off (negative control). 
Gene therapy: The modification or replacement of a defective or malfunctioning gene with one that 

functions adequately and properly, for instance, addition of gene regulatory elements such as 

specific stress- or strain-sensitive response elements to specifically drive gene expression only in 

regions of interest in the vasculature so as to control proliferation, fibrinolytic capacity, etc. 
Shear: Shear refers to the relative parallel motion between adjacent fluid (blood) planes during flow. 

The difference in the velocity between adjacent layers of blood at various distances from the vessel 

wall determines the local shear rate, expressed in cm/s/cm, or S" 1 . 
Shear stress: Fluid shear stress, expressed in dynes/cm 2 , is a measure of the force required to produce 

a certain rate of flow of a viscous liquid and is proportional to the product of shear rate and blood 

viscosity. Physiologic levels of venous and arterial shear stresses range between 1-5 dynes/cm 2 and 

6-40 dynes/cm 2 , respectively. 
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SSRE: Shear stress response element; its consensus GAGACC has been recently identified in genes 

responsive to shear stress. 
Transcription factor: A protein that binds to a cis-regulatory element in the promoter region of a DNA 

and thereby directly or indirectly affects the initiation of its transcription to an RNA. 
TRE: Tumor promoting agent response element: its consensus sequence TGACTCA is commonly 

located in genes sensitive to phorbol ester stimulation. 

References 

Abbassi O, Kishimoto TK, Mclntire LV, et al. 1993. E-selectin supports neutrophil rolling in vitro under 

conditions of flow. J Clin Invest 92:2719. 
Abbassi OA, Lane CL, Krater S, et al. 1991. Canine neutrophil margination mediated by lectin adhesion 

molecule-1 in vitro. J Immunol 147:2107. 
Alevriadou BR, Moake JL, Turner NA, et al. 1993. Real-time analysis of shear-dependent thrombus 

formation and its blockade by inhibitors of von Willebrand factor binding to platelets. Blood 

81:1263. 
Anderson DC, Abbassi OA, Mclntire LV, et al. 1991. Diminished LECAM-1 on neonatal neutrophils 

underlies their impaired CD18-independent adhesion to endothelial cells in vitro. J Immunol 

146:3372. 
Ando J, Komatsuda T, Kamiya A. 1988. Cytoplasmic calcium response to fluid shear stress in cultured 

vascular endothelial cells. In Vitro Cell Devel 24:871. 
Asakura T, Karino T 1990. Flow patterns and spatial distribution of atherosclerotic lesions in human 

coronary arteries. Circ Res 66:1045. 
Barabino GA, Mclntire LV, Eskin SG, et al. 1987. Endothelial cell interactions with sickle cell, sickle trait, 

mechanically injured, and normal erythrocytes under controlled flow. Blood 70:152. 
Barbee KA, Davies PF, Lai R. 1994. Shear stress-induced reorganizations of the surface topography of 

living endothelial cells imaged by atomic force microscopy. Circ Res 74:163. 
Bastida E, Almirall L, Bertomeu MC, et al. 1989. Influence of shear stress on tumor-cell adhesion to 

endothelial-cell extracellular matrix and its modulation by fibronectin. Int J Cancer 43:1174. 
Bhagyalakshmi A, Berthiaume F, Reich KM, et al. 1992. Fluid shear stress stimulates membrane phos- 
pholipid metabolism in cultured human endothelial cells. J Vase Res 29:443. 
Bhagyalakshmi A, Frangos JA. 1989a. Mechanism of shear- induced prostacyclin production in endothelial 

cells. Biochem Biophys Res Comm 158:31. 
Bhagyalakshmi A, Frangos JA. 1989fe Membrane phospholipid metabolism in sheared endothelial cells. 

Proc 2nd Int Symp Biofluid Mechanics and Biorheology, Munich, Germany 240. 
Bloch KD, Friedrich SP, Lee ME, et al. 1989. Structural organization and chromosomal assignment of 

the gene encoding endothelin. J Biol Chem 264:10851. 
Bohjanen PR, Petryniak B, June CH, et al. 1992. AU RNA-binding factors differ in their binding speci- 
ficities and affinities. J Biol Chem 267:6302. 
Bohmann D. 1990. Transcription factor phosphorylation: A link between signal transduction and the 

regulation of gene expression. Cancer Cells 2:337. 
Botteri FM, Ballmer- Hofer K, Rajput B, et al. 1990. Disruption of cytoskeletal structures results in the 

induction of the urokinase-type plasminogen activator gene expression. J Biol Chem 265:13327. 
Breathnach R, Matrisian LM, Gesnel MC, et al. 1987. Sequences coding part of oncogene-induced transin 

are highly conserved in a related rat gene. Nucleic Acid Res 15:1139. 
Coan DE, Wechezak AR, Viggers RF, et al. 1993. Effect of shear stress upon localization of the Golgi 

apparatus and microtubule organizing center in isolated cultured endothelial cells. J Cell Sci 

104:1145. 
Cooper GM. 1990. Oncogenes, Boston, Jones and Bartlett. 
Cybulsky MI, Fries JW, Williams AJ, et al. 1991. Gene structure, chromosomal location, and basis for 

alternative mRNA splicing of the human VCAM-1 gene. Proc Natl Acad Sci USA 88:7859. 



■) 2000 by CRC Press LLC 



Davies MJ, Woolf N, Rowles PM, et al. 1988. Morphology of the endothelium over atherosclerotic plaques 

in human coronary arteries. Br Heart J 60:459. 
Davies PF, Tripathi SC. 1993. Mechanical stress mechanisms and the cell: An endothelial paradigm. Circ 

Res 72:239. 
Dewey CF Jr, Bussolari SR, Gimbrone MA Jr, et al. 1981. The dynamic response of vascular endothelial 

cells to fluid shear stress. J Biomech Eng 103:177. 
Diamond SL, Eskin SG, Mclntire LV, 1989. Fluid flow stimulates tissue plasminogen activator secretion 

by cultured human endothelial cells. Science 243:1483. 
Diamond SL, Sharefkin JB, Dieffenbach C, et al. 1990. Tissue plasminogen activator messenger RNA 

levels increase in cultured human endothelial cells exposed to laminar shear stress. J Cell Physiol 

143:364. 
Dull RO, Davies PF. 1991. Flow modulation of agonist (ATP) -response (Ca ++ ) coupling in vascular 

endothelial cells. Am J Physiol 261:H149. 
Eskin SG, Ives CL, Mclntire LV, et al. 1984. Response of cultured endothelial cells to steady flow. Microvasc 

Res 28:87. 
Ferrua B, Manie S, Doglio A, et al. 1990. Stimulation of human interleukin-1 production and specific 

mRNA expression by microtubule-disrupting drugs. Cell Immunol 131:391. 
Folie BJ, Mclntire LV. 1989. Mathematical analysis of mural thrombogenesis. Biophys J 56:1121. 
Frangos JA, Eskin SG, Mclntire LV, et al. 1985. Flow effects on prostacyclin production by cultured human 

endothelial cells. Science 227:1477. 
Franke RP, Grafe M, Schnittler H. 1984. Induction of human vascular endothelial stress fibres by shear 

stress. Nature 307:648. 
Giavazzi R, Foppolo M, Dossi R, et al. 1993. Rolling and adhesion of human tumor cells on vascular 

endothelium under physiological flow conditions. J Clin Invest 92:3038. 
Girard PR, Nerem RM. 1991. Fluid shear stress alters endothelial cell structure through the regulation 

of focal contact-associated proteins. Adv Bioeng 20:425. 
Girard PR, Nerem RM. 1993. Endothelial cell signaling and cytoskeletal changes in response to shear 

stress. Front Med Biol Eng 5:31. 
Glagov S, Zarins CK, Giddens DP, et al. 1988. Hemodynamics and atherosclerosis. Arch Pathol Lab Med 

112:1018. 
Glaser V. 1994. Targeted injectable vectors remain the ultimate goal in gene therapy. Genetic Eng News 

14:8. 
Grabowski EF, Jaffe EA, Weksler BB. 1985. Prostacyclin production by cultured endothelial cell mono- 
layers exposed to step increases in shear stress. J Lab Clin Med 105:36. 
Grande JP, Glagov S, Bates SR, et al. 1989. Effect of normolipemic and hyperlipemic serum on biosynthesis 

response to cyclic stretching of aortic smooth muscle cells. Arteriosclerosis 9:446. 
Gupte A, Frangos JA. 1990. Effects of flow on the synthesis and release of fibronectin by endothelial cells. 

In Vitro Cell Devel Biol 26:57. 
Hadcock JR, Malbon CC. 1988. Down-regulation of beta-adrenergic receptors: Agonist-induced reduc- 
tion in receptor mRNA levels. Biochemistry 95:5021. 
Hakkert BC, Kuijipers TW, Leeuwenberg JFM, et al. 1991. Neutrophil and monocyte adherence to and 

migration across monolayers of cytokine-activated endothelial cells: the contribution of CD 18, 

ELAM-1, and VLA-4. Blood 78:2721. 
Hession C, Osborn L, Goff D, et al. 1990. Endothelial leukocyte adhesion molecule- 1: Direct expression 

cloning and functional interactions. Proc Natl Acad Sci USA 87:1673. 
Hsieh HJ, Li NQ, Frangos JA. 1991. Shear stress increases endothelial platelet- derived growth factor 

mRNA levels. Am J Physiol H642. 
Hsieh H, Li NQ, Frangos JA. 1992. Shear-induced platelet-derived growth factor gene expression in 

human endothelial cells is mediated by protein kinase C. J Cell Physiol 150:552. 
Hsieh H, Li NQ, Frangos JA. 1993. Pulsatile and steady flow induces c-fos expression in human endothelial 

cells. J Cell Physiol 154:143. 



■) 2000 by CRC Press LLC 



Hubbel JA, Mclntire LV. 1986. Technique for visualization and analysis of mural thrombogenesis. Rev 

Sci Instrum 57:892. 
Hudlicka O. 1984. Growth of vessels — historical review. In F Hammerson, Hudlicka (eds), Progress 

in Applied Microcirculation Angiogenesis, vol 4, 1-8, Basel, Karger. 
Hunter T, Karin M. 1992. The regulation of transcription by phosphorylation. Cell 70:375. 
Iba T, Mills I, Sumpio BE. 1992. Intracellular cyclic AMP levels in endothelial cells subjected to cyclic 

strain in vitro. J Surgical Res 52:625. 
Ingber D. 1993. Integrins as mechanochemical transducers. Curr Opin Cell Biol 3:841. 
Ives CL, Eskin SG, Mclntire LV. 1986. Mechanical effects on endothelial cell morphology: In vitro 

assessment. In Vitro Cell Dev Biol 22:500. 
Jackman RW, Beeler DL, Fritze L, et al. 1987. Human thrombomodulin gene is intron depleted: Nucleic 

acid sequences of the cDNA and gene predict protein structure and suggest sites of regulatory 

control. Proc Natl Acad Sci USA 84:6425. 
Jones DA, Abbassi OA, Mclntire LV, et al. 1993. P-selectin mediates neutrophil rolling on histamine- 

stimulated endothelial cells. Biophys J 65:1560. 
Jones PA. 1982. Construction of an artificial blood vessel wall from cultured endothelial and smooth 

muscle cells. J Cell Biol 74:1882. 
Kim DW, Gotlieb Al, Langille BL. 1989. In vivo modulation of endothelial F-actin microfilaments by 

experimental alterations in shear stress. Arteriosclerosis 9:439. 
Kishimoto TK, Warnock RA, Jutila MA, et al. 1991. Antibodies against human neutrophil LECAM-1 and 

endothelial cell ELAM-1 inhibit a common CD18-independent adhesion pathway in vitro. Blood 

78:805. 
Kojima N, Shiota M, Sadahira Y, Handa K, Hakomori S. 1992. Cell adhesion in a dynamic flow system 

as compared to static system. J Biol Chem 267:17264-17270. 
Komuro I, Kaida T, Shibazaki Y, et al. 1990. Stretching cardiac myocytes stimulates protooncogene 

expression. J Biol Chem 265:3595. 
Komuro I, Katoh Y, Kaida T, et al. 1991. Mechanical loading stimulates cell hypertrophy and specific gene 

expression in cultured rat cardiac myocytes. J Biol Chem 266:1265. 
Kuchan MJ, Frangos JA. 1993. Shear stress regulates endothelin-1 release via protein kinase C and cGMP 

in cultured endothelial cells. Am J Physiol 264:H150. 
Langille BL, Adamson SL. 1981. Relationship between blood flow direction and endothelial cell orienta- 
tion at arterial branch sites in rabbit and mice. Circ Res 48:481. 
Langille BL, Graham JJ, Kim D, et al. 1991. Dynamics of shear-induced redistribution of F-actin in 

endothelial cells in vivo. Arterioscler Thromb 11:1814. 
Langille BL, O'Donnell F. 1986. Reductions in arterial diameter produced by chronic diseases in blood 

flow are endothelium-dependent. Science 231:405. 
Lawrence MB, Mclntire LV, Eskin SG. 1987. Effect of flow on polymorphonuclear leukocyte/endothelial 

cell adhesion. Blood 70:1284. 
Lawrence MB, Smith CW, Eskin SG, et al. 1990. Effect of venous shear stress on CD18-mediated neu- 
trophil adhesion to cultured endothelium. Blood 75:227. 
Lawrence MB, Springer TA. 1991. Leukocytes roll on a selectin at physiologic flow rates: distinction from 

and prerequisite for adhesion through integrins. Cell 65:859. 
Levesque MJ, Sprague EA, Schwartz CJ, et al. 1989. The influence of shear stress on cultured vascular 

endothelial cells: The stress response of an anchorage-dependent mammalian cell. Biotech Prog 5:1. 
Levin EG, Santell L. 1988. Stimulation and desensitization of tissue plasminogen activator release from 

human endothelial cells. J Biol Chem 263:9360. 
Ley K, Gaehtgens P, Fennie C, et al. 1991. Lectin-like adhesion molecule 1 mediates leukocyte rolling in 

mesenteric venules in vivo. Blood 77:2553. 
Malek AM, Gibbons GH, Dzau VJ, et al. 1993a. Fluid shear stress differentially modulates expression of 

genes encoding basic fibroblast growth factor and platelet-derived growth factor B chain in vascular 

endothelium. J Clin Invest 92:2013. 



■) 2000 by CRC Press LLC 



Malek AM, Greene AL, Izumo S. 1993b. Regulation of endothelin 1 gene by fluid shear stress is tran- 
scriptionally mediated and independent of protein kinase C and cAMP. Proc Natl Acad Sci 90:5999. 
Malek A, Izumo S. 1992. Physiological fluid shear stress causes downregulation of endothelin-1 mRNA 

in bovine aortic endothelium. Am J Physiol 263:C389. 
Mclntire LV. 1994. Bioengineering and vascular biology. Ann Biomed Eng 22:2. 
Menter DG, Patton JT, Updike TV, et al. 1992. Transglutaminase stabilizes melanoma adhesion under 

laminar flow. Cell Biophys 18:123. 
Milner P, Bodin P, Loesch A, et al. 1992. Increased shear stress leads to differential release of endothelin 

and ATP from isolated endothelial cells from 4- and 12-month-old male rabbit aorta. J Vase Res 

29:420. 
Mitsumata M, Fishel RS, Nerem RM, et al. 1993. Fluid shear stress stimulates platelet-derived growth 

factor expression in endothelial cells. Am J Physiol 263:H3. 
Mo M, Eskin SG, Schilling WP. 1991. Flow- induced changes in Ca 2+ signaling of vascular endothelial 

cells: Effect of shear stress and ATP. Am J Physiol 260:H1698. 
Morita T, Yoshizumi M, Kurihara H, et al. 1993. Shear stress increases heparin-binding epidermal growth 

factor-like growth factor mRNA levels in human vascular endothelial cells. Biochem Biophys Res 

Comm 197:256. 
Muller WEG, Slor H, Pfeifer K, et al. 1992. Association of AUUUA-binding Protein with A + U-rich 

mRNA during nucleo-cytoplasmic transport. J Mol Biol 226:721. 
Nabel EG, Plautz G, Boyce FM, et al. 1989. Recombinant gene expression in vivo within endothelial cells 

of the arterial wall. Science 244:1342. 
Nagel T, Resnick N, Atkinson W, et al. 1994. Shear stress selectivity upregulates intercellular adhesion 

molecule- 1 expression in cultured vascular endothelial cells. J Clin Invest 94:885. 
Nerem RM. 1991. Cellular engineering. Ann Biomed Eng 19:529. 

Nerem RM. 1993. Hemodynamics and the vascular endothelium. J Biomech Eng 115:510. 
Nerem RM, Levesque MJ, Cornhill JF. 1981. Vascular endothelial morphology as an indicator of the 

pattern of blood flow. J Biomech Eng 103:172. 
Nollert MU, Diamond SL, Mclntire LV. 1991. Hydrodynamic shear stress and mass transport modulation 

of endothelial cell metabolism. Biotech Bioeng 38:588. 
Nollert MU, Eskin SG, Mclntire LV. 1990. Shear stress increases inositol trisphosphate levels in human 

endothelial cells. Biochem Biophys Res Comm 170:281. 
Nollert MU, Hall ER, Eskin SG, et al. 1989. The effect of shear stress on the uptake and metabolism of 

arachidonic acid by human endothelial cells. Biochim Biophys Acta 1005:72. 
Nollert MU, Mclntire LV. 1992. Convective mass transfer effects on the intracellular calcium response of 

endothelial cells. J Biomech Eng 114:321. 
Nollert MU, Panaro NJ, Mclntire LV. 1992. Regulation of genetic expression in shear stress stimulated 

endothelial cells. Ann NY Acad Sci 665:94. 
Ohno M, Gibbons GH, Dzau VJ, et al. 1993. Shear stress elevates endothelial cGMP. Role of potassium 

channel and G protein coupling. Circulation 88:193. 
Ohno M, Lopez F, Gibbons GH, et al. 1992. Shear stress induced TGF(3l gene expression and generation 

of active TGFpU is mediated via a K + channel. Circulation 86:1-87. 
Ono O, Ando J, Kamiya A, et al. 1991. Flow effects on cultured vascular endothelial and smooth muscle 

cell functions. Cell Struct Funct 16:365. 
Patton JT, Menter DG, Benson DM, et al. 1993. Computerized analysis of tumor cells flowing in a parallel 

plate chamber to determine their adhesion stabilization lag time. Cell Motility and the Cytoskeleton 

26:88. 
Peppel K, Vinci JM, Baglioni C. 1991. The AU-rich sequences in the 3' untranslated region mediate the 

increased turnover of interferon mRNA induced by glucocorticoids. J Exp Med 173:349. 
Perry MA, Granger DN. 1991. Role of CD11/CD18 in shear rate-dependent leukocyte-endothelial cell 

interactions in cat mesenteric venules. J Clin Invest 87:1798. 



■) 2000 by CRC Press LLC 



Pili R, Corda S, Passaniti A, et al. 1993. Endothelial cell Ca 2+ increases upon tumor cell contact and 

modulates cell-cell adhesion. J Clin Invest 92:3017. 
Port JD, Huang LY, Malbon CC. 1992. (3-adrenergic agonists that down-regulate receptor mRNA upreg- 

ulate a M r 35,000 protein(s) that selectively binds to (3-adrenergic receptor mRNAs. J Biol Chem 

267:24103. 
Prasad AR, Logan SA, Nerem RM, et al. 1993. Flow-related responses of intracellular inositol phosphate 

levels in cultured aortic endothelial cells. Circ Res 72:827. 
Ranjan V, Diamond SL. 1993. Fluid shear stress induces synthesis and nuclear localization of c-fos in 

cultured human endothelial cells. Biochem Biophys Res Comm 196:79. 
Ratner L, Josephs SF, Jarrett R, et al. 1985. Nucleotide sequence of transforming human c-sis-cDNA 

clones with homology to platelet derived growth factor. Nucl Acids Res 13:5007. 
Reddy VB, Garramone AJ, Sasak H, et al. 1987. Expression of human uterine tissue-type plasminogen 

activator using BPV vectors. DNA 6:461. 
Resnick N, Collins T, Atkinson W, et al. 1993. Platelet-derived growth factor B chain promoter contains 

a cis-acting fluid shear-stress-responsive-element. Proc Natl Acad Sci 90:4591. 
Ross R. 1993. The pathogenesis of atherosclerosis: a perspective for the 1990s. Nature 362:801. 
Sampath R, Kukielka GL, Smith CW, et al. 1995. Shear stress mediated changes in the expression of 

leukocyte adhesion receptors on human umbilical vein endothelial cells in vitro. Ann Biomed Eng. 
Schnittler HJ, Franke RP, Akbay U, et al. 1993. Improved in vitro rheological system for studying the 

effect of fluid shear stress on cultured cells. Am J Physiol 265:C289. 
Schwartz CJ, Valente AJ, Spargue EA. 1993. A modern view of atherogenesis. Am J Cardiol 71:9B. 
Sharefkin JB, Diamond SL, Eskin SG, et al. 1991. Fluid flow decreases preproendothelin mRNA levels 

and suppresses endothelin-1 peptide release in cultured human endothelial cells. J Vase Surg 14:1. 
Shyy YJ, Hsieh HJ, Usami S, et al. 1994. Fluid shear stress induces a biphasic response of human monocyte 

chemotactic protein 1 gene expression in vascular endothelium. Proc Natl Acad Sci (USA) 91:4678. 
Smith CW, Kishimoto TK, Abbassi OA, et al. 1991. Chemotactic factors regulate LECAM-1 dependent 

neutrophil adhesion to cytokine-stimulated endothelial cells in vitro. J Clin Invest 87:609. 
Staunton DE, Marlin SD, Stratowa C, et al. 1988. Primary structure of ICAM-1 demonstrates interaction 

between members of the immunoglobulin and integrin supergene families. Cell 52:925. 
Tokunaga K, Nakamura Y, Sakata K, et al. 1987. Enhanced expression of a glycearldehyde-3-phosphate 

dehydrogenase. Cancer Res 47:5616. 
Tran-son-Tay R. 1993. Techniques for studying the effects of physical forces on mammalian cells and 

measuring cell mechanical properties. In JA Frangos (ed), Physical Forces and the Mammalian 

Cell, p 1, New York, Academic Press. 
Turitto VT, Weiss HJ, Baumgartner HR, et al. 1987. Cells and aggregates at surfaces. In EF Leonord, VT 

Turitto, L Vroman (eds), Blood in Contact with Natural and Artificial Surfaces, vol 5 16, pp 453-467, 

New York, Annals of the New York Academy of Sciences. 
Von Andrian UH, Chambers JD, McEvoy LM, et al. 1991. Two-step model of leukocyte-endothelial cell 

interaction in inflammation. Proc Natl Acad Sci 88:7538. 
Warren BA. 1973. Evidence of the blood-borne tumor embolus adherent to vessel wall. J Med 6:150. 
Watson PA. 1991. Function follows form: Generation of intracellular signals by cell deformation. FASEB J 

5:2013. 
Watson SR, Fennie C, Lasky LA. 1991. Neutrophil influx into an inflammatory site inhibited by a soluble 

homing receptor- IgG chimaera. Nature 349:164. 
Wechezak AR, Viggers RF, Sauvage LR. 1985. Fibronectin and F-actin redistribution in cultured endot- 
helial cells exposed to shear stress. Lab Invest 53:639. 
Weinberg CB, Bell E. 1986. A blood vessel model constructed from collagen and cultured vascular cells. 

Science 231:397. 
Weiss HJ, Turitto VT, Baumgartner HR. 1978. Effect of shear rate on platelet interaction with subendo- 

thelium in citrated and native blood. J Lab Clin Med 92:750. 



■) 2000 by CRC Press LLC 



Weiss HJ, Turitto VT, Baumgartner HR. 1986. Platelet adhesion and thrombus formation on subendot- 
helium in platelets deficient in GP Ilb-IIIa, lb, and storage granules. Blood 67:905. 

Weiss L. 1992. Biomechanical interactions of cancer cells with the microvasculature during hematogenous 
metastasis. Cancer Metastasis Rev 11:227. 

Werb Z, Hembry RM, Murphy G, et al. 1986. Commitment to expression of metalloendopeptidases, 
collagenase, and stromelysin: Relationship of inducing events to changes in cytoskeletal architec- 
ture. J Cell Biol 102:697. 

Wick TM, Moake JL, Udden MM, et al. 1987. ULvWF multimers increase adhesion of sickle erythrocytes 
to human endothelial cells under controlled flow. J Clin Invest 80:905. 

Wick TM, Moake JL, Udden MM, et al. 1993. ULvWF multimers preferentially promote young sickle 
and non-sickle erythrocyte adhesion to endothelial cells. Am J Hematol 42:284. 

Wilson JM, Birinyi LK, Salomon RN, et al. 1989. Implantation of vascular grafts lined with genetically 
modified endothelial cells. Science 244:1344. 

Yanagisawa M, Kurihara H, Kimura S, et al. 1988. A novel potent vasoconstrictor peptide produced by 
vascular endothelial cells. Nature 332:411. 

Yoshizumi M, Kurihara H, Sugiyama T, et al. 1989. Hemodynamic shear stress stimulates endothelin 
production by cultured endothelial cells. Biochem Biophys Res Commun 161:859. 

Zarins C, Zatina MA, Giddens DP. 1987. Shear stress regulation of artery lumen diameter in experimental 
atherogenesis. J Vase Surg 5:413. 

Zarins CK, Giddens DP, Bharadvaj BK, et al. 1983. Carotid bifurcation atherosclerosis. Circ Res 53:502. 

Ziegelstein RC, Cheng L, Blank PS, et al. 1993. Modulation of calcium homeostasis in cultured rat aortic 
endothelial cells by intracellular acidification. Am J Physiol 265:H1424. 

Ziegelstein RC, Cheng L, Capogrossi MC. 1992. Flow-dependent cytosolic acidification of vascular endot- 
helial cells. Science 258:656. 

Zwiebel JA, Freeman SM, Kantoff PW, et al. 1989. High-level recombinant gene expression in rabbit 
endothelial cells transduced by retroviral vectors. Science 243:220. 



■) 2000 by CRC Press LLC 



Lightfoot, E. N., Duca, K. A. "The Roles of Mass Transfer in Tissue Function." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



115 



The Roles of Mass 

Transfer in 

Tissue Function 



115.1 



115.2 



Edwin N. Lightfoot 

University of Wisconsin 

Karen A. Duca 

University of Wisconsin 



115.3 



Topology and Transport Characteristics of 

Living Organisms 

Fundamentals: The Basis of a Quantitative 

Description 

Self Similarity and Cross-Species Correlations • Time 

Constants: The Key to Quantitative Modeling • Brownian 

Motion and Concentration Diffusion • More Complex 

Situations • Flow, Chemical Reaction, and Boundary 

Conditions 

Characteristic Behavior: Selected Examples 
The Energy Cost of Immobility 



Mass transfer lies at the heart of physiology and provides major constraints on the metabolic rates and 
anatomy [Pries et al., 1996; Bunk, 1998] of living organisms, from the organization of organ networks 
to intracellular structures. Limitations on mass transport rates are major constraints for nutrient supply, 
waste elimination and information transmission at all of these levels. The primary functional units of 
metabolically active tissue, e.g., the Krogh tissue cylinders of muscle and brain, liver lobules and kidney 
nephrons, have evolved to just eliminate significant mass transfer limitations in the physiological state 
[Lightfoot, 1974]. Turnover rates of highly regulated enzymes are just on the slow side of diffusional 
limitations [Weisz, 1973]. Signal transport rates are frequently mass transport limited [Lauffenburger 
and Linderman, 1993], and very ingenious mechanisms have evolved to speed these processes [Berg and 
von Hippel, 1985; Bray, 1998; Francis and Palsson, 1997; Valee and Sheets, 1996]. In contrast elaborate 
membrane barriers organize and control intracellular reactions in even the simplest organisms. 

Understanding tissue mass transport is important both for the engineer and scientist. Examples of 
engineering interest include the design of extracorporeal devices and biosensors [Fishman et al., 1998]. 
For the scientist it is important to understand the sometimes complex interactions of transport and 
reaction to interpret transport based experiments [Bassingthwaighte, Goresky and Linehan, 1998]. Here 
we shall concentrate on qualitative behavior and orders of magnitude, with particular emphasis on time 
constants. These are the normal starting points of any serious study. A general background in biology 
[Campbell, 1996] and physiology [Johnson, 1998] will be helpful. 
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FIGURE 115.1 Diffusional topology of the mammalian body. Convective transport (bulk flow) dominates in the 
major blood vessels and airways but becomes comparable to diffusion and reaction in the functional units surrounding 
capillaries and sinusoids. At the cellular and subcellular levels concentration diffusion complicated by electrical effects 
and a wide variety of carrier transport processes, interacts in complex ways with enzymatic and genetic reactions. 
Inspired by Dr. Peter Abbrecht, University of Michigan. Reprinted with permission from Lightfoot (1974). Copyright 
1974 by John Wiley and Sons, Inc. 

115.1 Topology and Transport Characteristics of Living 

Organisms [Schmidt-Nielsen 1983, 1984, Calder 1984, 
Berg 1993 Bassingthwaighte et al. 1994] 

If one includes viruses, the size scales of living systems range from nanometers to meters: spanning a 
linear ratio of 10 9 and a mass ratio of 10 27 , which is greater than Avogadro's number! As indicated in 
Fig. 115.1, the higher animals are organized into spatial hierarchies of discrete structures, which span 
this whole size range (Table 115.1). At the largest scales animals may be considered as organ networks 
connected by major blood vessels, with each organ carrying out its own set of specialized tasks. 

Organs are in turn composed essentially of large numbers 
of microcirculatory/uwcriona/ units organized in parallel and 
perfused by capillaries or other microscopic ducts which 
supply oxygen and other nutrients, carry away waste prod- 
ucts, and interchange via a variety of chemical messengers 
[Lauffenburger and Linderman, 1993]. The Krogh tissue cyl- 
inder of Fig. 1 15.2 is representative and corresponds approx- 
imately to the functional units of the brain (see, however, 
Federspiel and Popel, 1986; Popel, 1989; Heliums et al, 1996; 
Vicini et al, 1998; Bassingthwaighte et al, 1998). 

Nutrients and metabolic end products are transported at 
the size scale of the tissue cylinder axially by convection. 



TABLE 115.1 The Spatial Hierarchy 
in Mammals, Characteristic Lengths 



Entity 


Length Scale, m 


Whole body 


lfr'-lO 


Organs 


io^-io- 1 


Microcirculatory units 


10-" 


Cells 


10~ 5 (eukaryiots) 




10~ 6 (prokaryiots) 


Intracellular organelles 


io- 6 


Molecular complexes 


10r» 
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FIGURE 115.2 Krogh tissue cylinder. A circular cylinder is used as an approximation to close-packed hexagonal 
elements. 

Radial transport is primarily by concentration diffusion or Brownian motion [Lightfoot and Lightfoot, 
1997] and, to a lesser extent by Starling flow, slow seepage across the proximal region of the capillary 
driven by the relatively high pressure of entering capillary blood. Flow takes place through clefts between 
the endothelial cells forming the capillary wall and between the parenchymal cells forming the tissue 
cylinder. The clefts are narrow and permselective, rejecting large solutes. Starling flow was once thought 
to reverse direction in the distal (downstream) regions of the tissue cylinder as a result of lower hydro- 
dynamic pressure and increased colloid osmotic pressure caused by the permselectivity of the clefts. 
However, it is now accepted that such reverse flow does not occur to a significant extent and that the 
seepage ends up primarily in the lymph ducts and on to the venous blood. 

The microcirculatory units are in turn comprised of cells which are themselves complex structures. 
Sketched in Fig. 115.3 is a pancreatic beta cell, which produces, stores, and secretes insulin. Its organi- 
zation is similar to that of a chemical plant, with raw materials and energy input at the bottom where 
mitochondria produce the chemical energy for cell metabolism, a synthesis and transport area, plus the 




Synthesis 



FIGURE 115.3 Structure and organization of a representative eukaryotic cell. Schematic cross-section of a pancreatic 
beta cell. Reprinted with permission from Lightfoot (1974). Copyright 1974 by John Wiley and Sons. 
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control region of the cell nucleus, at the center, and packaging, storage and export at the top. All of this 
is accomplished in about 10 micrometers. 

The cell contains smaller structures known collectively as organelles that segregate and coordinate the 
many processes required for its operation and maintenance. Prominent in the diagram are mitochondria, 
which use various carbon sources and oxygen to form high-energy phosphate bonds as in ATP — the 
energy sources needed for cell metabolism. Also shown are the cell nucleus where DNA is stored, 
ribosomes in which RNA is used to produce individual proteins, and the endoplasmic reticulum which 
holds the ribosomes and channels the proteins produced by them to Golgi apparatus (not shown) for 
packaging. Also not shown are microtubules and other filaments comprising the cytoskeleton [Alberts 
et al., 1945; Campenot et al., 1996; Maniotis et al., 1997]. The latter provide for chromosome segregation, 
help to maintain cellular shape, and in some cases provide mobility [ Vallee and Sheetz] . They also act 
as transport networks along with motor proteins, discussed below. Organization and structure of the 
cells is described in standard works [Alberts et al., 1945]. 

At the smallest level are enzyme clusters and substructures of cell membranes used for selective 
transport across the cell boundary [Lauffenberger and Linderman, 1993]. Underlying these structures 
are biochemical reaction networks that are largely shared by all species, and all are composed of the same 
basic elements, primarily proteins, carbohydrates, and lipids. As a result there are a great many interspecies 
similarities [Schmidt-Nielsen, 1983]. 

This elaborate organization just described is constrained to a very large extent by mass transfer 
considerations and in particular by the effect of characteristic time and distance scales of the effectiveness 
of different mass transport mechanisms. At the larger size scales, only flow or convection is fast enough 
to transport oxygen and major metabolites, and convective transport is a major function of the larger 
blood vessels. Diffusive transport begins to take precedence at the level of microcirculatory units. At the 
cellular level and below, diffusion may even be too fast and undirected, and selectively permeable 
membranes, have evolved to maintain spatial segregation against the randomizing forces of diffusion. 

115.2 Fundamentals: The Basis of a Quantitative Description 

Underneath the bewildering complexity of living organisms are some very simple underlying principles 
which make a unified description of mass transport feasible. Of greatest utility are observed similarities 
across the enormous range of system sizes and common magnitudes of key thermodynamic, reaction, 
and transport parameters. Some simplifying features must still be accepted as justified by repeated 
observation, and others can be understood from the first principles of molecular kinetic theory. Here we 
summarize some of the most useful facts and approximations in preparation for the examples of the 
next section. 

Self Similarity and Cross-Species Correlations 

It is a striking characteristic of life forms that each increase in our mathematical sophistication has shown 
new regularity in their anatomic and physiologic characteristics. The ability of simple geometric forms 
to describe morphology has been recognized at least since Leonardo da Vinci, and the definitive work of 
D Arcy Thompson summarizes much of this early work. The next step was the concept of allometry, first 
introduced by J. S. Huxley in 1927 [Schmidt-Nielsen, 1984; Calder, 1984; Lightfoot, 1974]. This is a 
rudimentary form of self similarity usually expressed as 

P = aM b (115.1) 

where P is any property, M is average species mass, and a and b are constants specific to the property. A 
large number of allometric relations are available, and those of most interest to us deal with metabolic 
rate. For example, total basal rates of oxygen consumption for whole animals is given to a good approx- 
imation by: 
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TABLE 115.2 Characteristic Cerebral Tissue Cylinder 

Item Magnitude 

Outer radius 30 |tm 

Capillary radius 3 |lm 

Length 180 (1m 

Blood velocity 400 ttm/s 

Arterial oxygen tension 0.125 atm 

Arterial oxygen concentration (total) 8.6 mM 

Arterial oxygen concentration (dissolved 2 ) 0.12 

Venous oxygen tension 0.053 atm 

Venous oxygen concentration (total) 5.87 mM 

Venous oxygen concentration (dissolved O,) 0.05 

Tissue oxygen diffusivity 1.5 E-5 cm 2 /s (estd.) 

Oxygen respiration rate (zero order) 0.0372 mmols 0,/liter-s 



R nff x3.5M 3/4 (115.2) 

O.tot v ' 

Here Ro, tot is the oxygen consumption rate in ml 2 (STP)/hr, and M is body mass in grams. Under basal 
conditions, fat is the primary fuel and heat generation is about 4.7 kcal per milliliter of oxygen (STP) 
consumed. Small animals have higher specific metabolic rates than large ones, but this is in part because 
a higher proportion of their body mass is made up of highly active body mass, and for the important 
case of brain tissue it is invariant at about: 



R (brain) = 3.72- 10~ 5 mmolsO 2 /cm 3 ,s (115.3) 



for all species. For the liver and kidneys, specific metabolic activity is somewhat lower and falls off slowly 
with an increase in animal size, but this may be due to an increasing proportion of supporting tissue 
such as blood vessels and connective tissue. Accurate data valid under physiologic conditions are difficult 
to find, and to a first approximation specific metabolic activity of parenchymal cells, those actually 
engaged in the primary activity of the organ, may be close to that of brain for both liver and kidneys. 

The sizes of both microcirculatory units and cells in vertebrates are also very insensitive to animal 
size. Capillaries are typically about 3 to 4 (Lm in radius and about 50 to 60 (J.m apart. Typical mammalian 
cells are about 10 to 50 p.m in diameter, and organelles such as mitochondria are about the size of 
prokaryotic cells, about 1 (J.m in diameter. Approximate characteristics of a cerebral tissue cylinder are 
given in Table 115.2. The oxygen-carrying capacity of blood, ionic makeup of body fluids, solubility of 
gases and oxygen diffusivities in body fluids are also largely invariant across species, and some represen- 
tative data are provided in Tables 115.3, 115.4, and 115.5. 

TABLE 115.3 Oxygen Solubilities 



Solvent 




Temperature, °C 


2 Pressure, atm 


Concentration, mM/atm 


Water 




25 

30 
35 

40 




1.26 

1.16 
1.09 
1.03 


Plasma 




37 




1.19 


Red cell interior (dissolved O, only) 


37 




1.18 


Extracellular 


tissue (estd.) 


37 




1.1 


Oxygen gas 




37 




39.3 


Alveolar air 




37 


0.136 




Air (0.21 atm 


of oxygen) 


37 


0.21 
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TABLE 115.4 Effective Oxygen Diffusivities 



Solvent 



Temperature, °C Pressure Diffusivity, cm 2 /s 



Water 


25 


Water 


37 


Blood plasma 


37 


Normal blood 


37 


Red cell interior 


37 


Air 


25 



1 atm 



2.1 E-5 

3.0 E-5 
2.0 E-5 
1.4 E-5 
0.95 E-5 
0.20 



TABLE 115.5 Intracellular Diffusion Coefficients 









Diffusivity 


Intracell 










in Water 


Diffusivity; 


Diffusivity 


Compound 


MW 


Radius, A 


cm 2 /s x 10 7 


cm 2 /s X 10 7 


Ratio 


Sorbitol 


170 


2.5 


94 


50 


1.9 


Methylene blue 


320 


3.7 


40 


15 


2.6 


Sucrose 


324 


4.4 


52 


20 


2.6 


Eosin 


648 


6.0 


40 


8 


5.0 


Dextran 


3600 


12.0 


18 


3.5 


5.0 


Inulin 


5500 


13.0 


15 


3.0 


5.0 


Dextran 


10,000 


23.3 


9.2 


2.5 


3.7 


Dextran 


24,000 


35.5 


6.3 


1.5 


4.2 


Actin 


43,000 


23.2 


5.3 


0.03 


167 


Bovine serum albumin 


68,000 


36.0 


6.9 


0.10 


71 



Allometric correlations are essentially empirical and cannot be predicted from any fundamental phys- 
ical principles. Moreover, most measurements, except solubility and diffusivity data, are of doubtful 
accuracy. 

Recently a more sophisticated form of self similarity, fractal geometry has been found useful for the 
description of a wide variety of biological applications of mass transfer interest [Bassingthwaighte et al., 
1994], applications of nonlinear dynamics are fast increasing [Griffith, 1996]. 

Time Constants: The Key to Quantitative Modeling 

The first step in system modeling is to establish orders of magnitudes of key parameters, and most 
particularly time constants: estimates of the time required for a given transient process to be "effectively 
complete." Time constant or order of magnitude analysis is useful because dynamic response times are 
insensitive to geometric detail and boundary conditions at the order of magnitude level of approximation, 
i.e., within a factor of ten. Time constants are, however, essentially heuristic quantities and can only be 
understood on the basis of experience [Lightfoot and Lightfoot, 1997]. 

Once characteristic system time scales are established, one can restrict attention to processes with 
response times of the same order: Those an order of magnitude faster can be treated as instantaneous 
and those ten times slower as not happening at all. Both fast and slow terms in system description, e.g., 
the diffusion equation of transport phenomena [Bird et al., 1960], can then be eliminated. Such simpli- 
fication often provides valuable insights as well as simplifying integration. Quantitative descriptions are 
particularly valuable at the microcirculatory level, for example in diagnostic procedures, and they have 
been well studied [Bassingthwaighte and Goresky]. Here we shall stay with relatively simple examples to 
illustrate selected characteristics of tissue mass transfer, and we begin with diffusion. We then briefly 
introduce time constants characterizing flow, chemical reaction, and boundary conditions. 



■) 2000 by CRC Press LLC 



Brownian Motion and Concentration Diffusion 

The basis of most species selective transport is the relatively slow observable motion of molecules or 
particles resulting from intermolecular collisions [Lightfoot and Lightfoot, 1997]. Such Brownian motion 
does not take any predictable direction, but the particles under observation do tend to move farther from 
their starting point with increasing time. The extent of diffusional motion can be described as the 
probability of finding any reference particle a distance r from its initial position. For an unbounded 
quiescent fluid this is [Einstein, 1905]: 

p(r) = e~ r2 / 4DpF 7 8(7tD pF t) 32 ; 471 f p(r)r 2 dr = l. (115.4a, 4b) 

This equation defines the Brownian diffusivity, D PF , of a particle or molecule relative to a surrounding 
fluid. Here P is a spherically symmetrical normalized probability density. Here r is distance (in spherical 
coordinates) from the initial position, and t is time. The mean net distance traveled from the initial point 
in unbounded quiescent three-dimensional space is easily determined from the above distribution as: 

r 2 =471 f r 2 • r 2 p(r)dr = 6D PF t, r 2 =4D PF t, x 2 = 2D PF t (1 15.5a, 5b, 5c) 

m I \ J PF ' m PF ' m PF v ' ' ' 

for distances from a point, line, and plane respectively. These results provide useful insight in suggesting 
characteristic diffusion times t as the time required for "most" of a transient diffusion process to be 
complete. Some commonly accepted values are shown in Table 115.6. The numbers in the last column 
are the fractional changes in solute inventory for a sudden change in surface concentration on a particle 
initially at a different uniform concentration. For the hollow cylinder, the outer surface of radius R T is 
assumed impermeable to diffusion, and the inner surface of radius _R C is permeable. The length L is 
assumed large compared to either radius. Fractional completion depends in a complex way on the radius 
ratio, but the diffusion time given is a good approximation. 

For large numbers of particles or molecules nonuniformly distributed in a moving fluid, following 
the Brownian motion of individual molecules becomes too cumbersome to use. We must then average 
behavior over large numbers of molecules to obtain a continuum approximation known as Fick's law 
[Bird et al., 1960], which describes the relative motion of solute and solvent and maybe written in the form 



(vp-v P ) 



V PF J 



Vx p (115.6) 



Here v is observable velocity of species P in a mixture of P and F, x is mole fraction of species P in the 
mixture, and D PF is binary mutual mass diffusivity of solute P relative to solvent fluid P. For situations 
of interest here, the Fick's law diffusivity may be considered equal to the Brownian diffusivity. 



TABLE 115.6 Characteristic Diffusional Response Times 









Fractional 


Shape 


T m 


L 


Completion 


Sphere 


L 2 /6D PF 


Radius 


>0.99 


Cylinder 


L 2 I4D PF 


Radius 


>0.99 


Slab 


L 2 I2D PF 


Half-thickness 


>0.93 


Hollow cylinder 


(L 2 I2D PF ) In (jyjy 


Outer radius, R T 
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Equation 115.6 is valid for binary or dilute solutions of liquids, gases, and homogeneous, and some 
typical magnitudes for biologic situations are shown in Tables 115.4 and 115.5. For dilute solutions, 
hydrodynamic diffusion theory [Bird et al., 1960] provides useful insight: 

7tR p D pF ii F /KT = C (115.7) 

where C is equal to (1/6) if the molecular radius of the solute P is much less than that of the solvent F 
and (1/4) if they are about equal. Here (J. is solvent viscosity, R p and R F are effective spherical solute and 
solvent radii, K is the Boltzmann constant, and T is absolute temperature. Hydrodynamic diffusion theory 
has been extended to solutes in small fluid-filled pores [Deen, 1987] for characterizing transport in 
microporous membranes. 

More Complex Situations 

Most biologic transport processes occur in multicomponent solutions. A generalized diffusion equation 
[Hirschfelder et al., 1954; Lightfoot and Lightfoot, 1997; Taylor and Krishna, 1993] is available, but lack of 
data usually forces neglect of multicomponent effects. For our purposes transport of both molecules and 
particles is adequately described by the simplified equations in Table 115.7, which simply state that relative 
velocity of a particle or molecule P through a fluid F is proportional to the sum of "driving forces" acting on it. 

Flow, Chemical Reaction, and Boundary Conditions 

We now introduce additional time constants in order to characterize the interactions of diffusion with 
flow and chemical reaction and to show that boundary conditions can have an important bearing on 

TABLE 115.7 Particle-Molecular Analogs Dilute Binary Diffusion in a Quiescent Continuum 



Particles 



K-",)=^{ vin " f + ^ v 4 1 -pp/p)N.- f ™ + (p; + ^p/)^ 



dt 



I \^ 2 
6 7C|ip° I R P J + thermal diffusion 



where: 



iH 



/-I \v\x)'U-xt\(h 



Molecules (interdiffusion of species P and F) 

(v p -v F ) = -D pf {vinx p + (l/i?T)[v p (l-p;/p)v p -F em ]J 
Both: 

D PF = 

FF 6n\lR p 

Here: 

K = the Boltzmann constant, or molecular gas constant 

n p = number concentration of particles 

R p = particle or molecular radius 

V = particle volume or partial molal volume of species P 

F em = total electromagnetic force per particle 

F em = molar electromagnetic force on species P 

p = density of fluid phase 

p° = density of particle or reciprocal of partial specific volume of species P in solution 

v' = instantaneous acceleration of particle P 

The subscript °° refers to conditions near the particle but outside its hydrodynamic boundary layer. 
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system behavior. We begin with flow where mean solute residence time, T m , forms a convenient time 
scale. For flow through a cylindrical duct of length L with constant volumetric average velocity <v>, we 
may write: 

T m =L/<v> (115.8) 

More generally, for constant volumetric flow at a rate Q through a system of volume V with a single inlet 
and outlet and no significant diffusion across either, T m is equal to V/Q. For chemical reactions we choose 
as our system, a reactive solid open to diffusion over at least part of its surface where concentration of 
solute i under consideration is maintained at a uniform value c i0 and where the average volumetric rate 
of consumption of that solute is <R ; >. We then define a reaction time constant as: 

T = c. n /<R. > (115.9) 

rxn 1 / l v ' 

Note that both c i0 and <R;> are measurable, at least in principle [Damkdhler, 1937; Weisz, 1973]. Finally 
we consider as an illustrative situation decay of solute concentration q in the inlet stream to a flow system 
according to the expression 

clt, inlet) = cio, inlet) exp(-t/T BC ) (115.10) 

where t is time and T BC is a constant. We now have enough response times for the examples we have 
selected below, and we turn to illustrating their utility. 

115.3 Characteristic Behavior: Selected Examples 

We now consider some representative examples to illustrate the mass transfer behavior of living tissues. 
Alveolar Transients and Pulmonary Mass Transport: At the distal ends of the pulmonary network 
are the alveoli, irregular sacs of air surrounded by blood-filled capillaries, with effective diameters of 
about 75-300 (J.m. The blood residence time is of the order of a second, and it is desired to determine 
whether gas-phase mass transfer resistance is appreciable. To answer, we take the worst possible scenario: 
flat-plate geometry with a half-thickness of 150 (J.m or 0.0105 cm. From Table 1 15.4 we find the oxygen 
diffusivity to be about 0.2 cm 2 /s. It follows that alveolar response time is: 



= (o.015). 



T dif =|0.015|/0.4 = 0.56ms 

This is extremely fast, with respect to both the Is residence time assumed for alveolar blood and the one- 
twelfth of a minute between breaths: gas-phase mass transfer resistance is indeed negligible. This is typical 
for absorption of sparingly soluble gases, and a similar situation occurs in cell culture vessels. 

Mass Transfer Between Blood and Tissue: We next identify blood vessels capable of transferring 
dissolved solute between blood and surrounding tissue. To be effective we assume that mean blood 
residence time should at least equal the radial diffusion time: 

L R 2 D 2 LD DR , 
> = ; el > 1/16 = 0.0625 

<v> 4D PB 16D PB D 2 <v> 

where D is vessel diameter. From the data for a 13-kg dog in Table 115.8, we find the great majority of 
vessels much too short and that only the three smallest classes — arterioles, capillaries, and venules — are 
at all capable of transferring appreciable mass. These, especially the capillaries, are quite effective. They 
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TABLE 1 15.8 Mass Transfer Effectiveness of Blood Vessels ( 13-kg Dog, #39) 



Vessel 


Radius, cm 


Lenght, cm 


<v>, cm/s 


Ll¥/dp F <v> 


Aorta 


0.5 


10 


50 


0.00003 


Larger arteries 


0.15 


20 


13.4 


0.00003 


Secondary arteries 


0.05 


10 


8 


0.005 


Terminal arteries 


0.03 


1.0 


6.0 


0.002 


Arterioles 


0.001 


0.2 


0.032 


6.25 


Capillaries 


0.0004 


0.1 


0.07 


89.4 


Venules 


0.0015 


0.2 


0.07 


12.7 


Terminal veins 


0.075 


1.0 


1.3 


0.0014 


Secondary veins 


0.12 


10 


1.48 


0.0047 


Larger veins 


0.3 


20 


3.6 


0.0006 


Venae cavae 


0.625 


40 


33.4 


0.00003 



have long been classified as the microcirculation on the basis of being invisible to the unaided eye. This 
simple order of magnitude analysis provides a functional definition and a guide to the design of hollow 
fiber cell culture vessels. More refined analyses [Lightfoot, 1974] based on the parameter magnitudes of 
Table 115.2 shows that lateral diffusional resistance of capillaries is in fact rather small, but recent analyses 
[Popel, 1989] suggest a more complex picture. 

Convection and diffusion in parallel [Lightfoot and Lightfoot, 1997] is also of interest, but it is more 
complicated because of convective dispersion [Lightfoot and Lightfoot, 1997]. However, the lung shows 
a sharp transition between convective ducts and those which can be assumed well-mixed by axial diffusion 
[Hobbs and Lightfoot, 1979; Hubal, 1996; Farhi, ]. This has long been known to pulmonary physiologists 
who often model the adult human lung as a plug-flow channel ("dead space") of about 500 ml leading 
to a well-mixed volume of about 6,000 ml. 

Intercapillary Spacing in the Microcirculation: It has been shown [Damkohler, 1937; Weisz, 1973] 
that optimized commercial catalysts normally exhibit ratios of diffusion to reaction times, as defined in 
Table 115.6 in the relatively narrow range: 

i/3<T,.,/t <1 

/ dif / rxn 

Moreover, Weisz has shown that this ratio holds for many biologic systems as well so that enzyme activities 
can often be inferred from their location and function. Here we compare this expectation for cerebral 
tissue cylinders using the data of Tables 115.2 and 115.6. Throughout the tissue cylinder: 



T : 

-■-dif 



9-10~ 6 cnT 



ln(lo) = 0. 



69s 



For venous conditions, which are the most severe, 



0.0372 
0.0584 



:0.64s andT,. f /T =1. 

dir / rxn 



These numbers are reasonable considering the uncertainty in the data and the approximation used for 
diffusion time. The brain, for example, is far from homogeneous. More elaborate calculations suggest 
somewhat more conservative design. However, these figures are correct in suggesting that the brain is 
"designed" for oxygen transport and that the safety factor is small [Neubauer and James, 1998]. Sections 
of the brain become anoxic at about 2/3 of normal blood flow. The body's control mechanisms will shut 
down other vital organs such as the gut, liver, and kidneys when blood cardiac output oxygen supply 
drops to keep the brain as well supplied as possible. All of these comments are for the physiological 
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(normal) state. Lowering brain temperature can greatly decrease oxygen demand and permit survival 
under otherwise fatal conditions. This effect is important in cryosurgery and drowning: whereas about 
6 minutes of anoxia damages the adult brain at normal temperature, about one hour is required at 62°F. 
Intracellular Dispersion: We now ask how long it will take a protein initially concentrated in a very 
small region to disperse through the interior of a cell, and we shall assume the cell to be a sphere with 
a 1 u.m radius. We begin by noting that the cell interior or cytoplasm is a concentrated solution of 
polymeric species and that the diffusivity of large molecules is considerably slowed relative to water. We 
therefore assume a diffusivity of 10~ 8 cm 2 /s as suggested by the last entry in Table 1 15.5. If the protein is 
originally at the center of the sphere, the dispersion time is: 



T, 



R ceU 

6B 



(l.l0- 4 )7(6.10- 8 )~(l/6)s 



If the protein is initially near the cell periphery, diffusion will take about 4 times as long, or about 2/3 s. 
Reliable intracellular diffusion coefficients are important: the corresponding numbers for aqueous solu- 
tion would have been about 2.4 ms and 10 ms, respectively! Cell interiors — the cytoplasm — are crowded. 
Diffusion Controlled Reaction: We now calculate the rate at which a very dilute protein of 2.5 nm 
radius solution in a cell of 1 micron radius is adsorbed on a spherical adsorbent particle of radius 
2.5 nanometers if the adsorption is diffusion controlled. That is, we assume that the free protein concen- 
tration immediately adjacent to the adsorbent surface is always effectively zero because of the speed and 
strength of the adsorption reaction. In diffusion only the region within a few diameters of a sphere's 
surface offers effective resistance to transport [Carslaw and Jaeger, 1959; see also Ozisik, 1993]. We can 
thus assume rapid equilibration over the entire cell volume, except for a thin diffusional boundary layer 
adjacent to the target surface: the ratio of protein plus adsorbent to cell diameter is only about 0.05. The 
rate at which protein is adsorbed on the sphere surface per unit area is then [Lightfoot, 1974]: 



N„ 



D PF C P. 



(r p +rJ 



r 

\ ; 7TT 



where C Poo is the protein concentration far from the sphere, assumed uniform over the bulk of cell volume 
since we are about to see that it adjusts rapidly on the time scale of the adsorption process. The radius 
of the protein is R P , that of the adsorbent R ads , and the dimensionless time X defined by: 



tD„ 



(Rp+rJ 2 

Adsorption rate is within 10% of its asymptotic value when JCT = 100, and: 

ty 10 = (2.5 • 10~ 5 s)(l00/jt) » 0.8ms 

This is much smaller than distribution time, and transients can be neglected: the time scales of the 
transient and distribution are well separated. We now write a mass balance for the rate of change of bulk 
protein concentration: 



-V || dc ^«-V 1| dC ^ = A,N p ;dlnc p /dt = 3 

cell j. cell j. ads P> P/ 

at at 



( r p+rJ/ 



R 3 



B„ 



:1/T 
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T = 

rxn 


3 • 5 • 10" 7 


•10~ 8 







Here V cdl is the volume of the cell, and A ads is the effective area of the adsorbent; T is the characteristic 
reaction time, that required to reduce cellular concentration by a factor e or about 2.7. For our conditions, 
reasonably representative of a prokaryotic cell, 



:67s 



This is clearly a long time compared to the dispersion time of the last example, and the assumed time 
scale separations of all three adsorption and dispersion processes are amply justified. Using time scale 
separations has greatly simplified what was originally a major numerical task [Berg and von Hippie, 1985] . 

If the protein concentration is now interpreted as the probability of finding a protein in any position, 
and the target is a DNA site, T rxn is the time required for gene expression, and it can be seen that it is 
very slow! Comparison of these predictions with experiment showed that both prokaryotic cells respond 
much faster and this led to a major discovery [Berg and von Hippie, 1985]: DNA exhibits a general 
binding affinity for promoter molecules which permits the adsorbed protein to undergo one-dimensional 
diffusion along the DNA chain — thus increasing effective binding site size. Eukaryotes, which are roughly 
ten times larger in diameter, have elaborate internal barriers [Holsstege et al., 1998] and are much more 
complex diffusionally. 

Many other diffusion controlled reactions occur in living systems, for example, protein folding, and 
here brute force molecular dynamic calculations are difficult even with supercomputers. One can greatly 
speed calculation by taking judicious time scale separation [Rojnuckarin et al., 1998]. 

The Energy Cost of Immobility 

Many cell constituents, from small metabolites to organelles, are free to move in the cytoplasm and yet 
must be limited in their spatial distribution. Many such entities are transported by mechanochemical 
enzymes known as protein motors which depend upon consumption of metabolic energy. Here we estimate 
the energy cost of such processes to maintain spatial segregation, without detailed knowledge of the 
mechanisms used. The basis of our analysis [Okamoto and Lightfoot, 1992] is the Maxwell-Stefan 
equation of Table 115.7 and more particularly the fact that migration velocities are related to the motive 
force of Brownian motion and any mechanically transmitted force through the diffusivity The heart of 
our argument is that the mechanical force applied by the protein motors must produce a migration equal 
and opposite to that resulting from the dispersive force of Brownian motion: 

( v p- v f), = D PF V l nn P ( v p" v f) = FD pfA T 

\ >dif > 'mech 

where n p is the number concentration of proteins, and F is the mechanical force required to produce the 
motion. Now the power P p required for transporting a particle back against diffusional motion is the 
product of the mechanical force and mechanical migration velocity: 



P„=F 



K- v *L =KT ( vlnn >) 2 



For a particle of mass m p = (4/3)7tR|p p where p P is particle density, the power requirement per unit mass 
is: 
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TABLE 115.9 Energetics of Forced Diffusion 



Dilute Solution 



Particle 

Small metabolite 
Globular protein 
Typical organelle 



Radius, A Rel. Diff. 



3 

30 
10' 



P/AG 

7 x 10 6 
7x 10 2 
6 x 10- s 



Cytoplasm 

Estimated 

Rel. Diff. P/AG 

1 7 x 10 6 

0.01 7 

10- 1 ' 2 6 X 10- 12 



P p /m p : 



(KTVlnn p ) 2 

8^ e f f PpRp 



Here LL elT is the effective viscosity of the cytoplasm, p is particle density, and R is effective particle radius. 
This very strong effect of particle radius suggests that protein motors will be most effective for larger 
particles, and calculations of Okamoto and Lightfoot [1992] bear out this suggestion. 

If the particle is to be held for the most part within one micrometer of the desired position and the 
free energy transduction of the cell as a whole corresponds to that of brain tissue, Table 115.9 shows the 
cost of mechanical motion is negligible for organelles, prohibitive for small metabolites and problematic 
for proteins. However, for small amounts of critically important metabolites and problematic for proteins. 
However, for small amounts of critically important metabolites, mechanical transport may still take place. 
The quantity, P/AG, is the ratio of power consumption to mean cellular energy transduction, both per 
unit mass. 

It remains to be noted that estimating the cost of organelle transport accurately requires additional 
information, and some is available in [Okamoto and Lightfoot, 1992]. It is impossible for the cell to 
transport the large number of small metabolites, and a number of alternate means of segregation have 
developed. Among them are permselective membranes and compact enzyme clusters, making it difficult 
for intermediate metabolites to escape into the general cytoplasmic pool. These topics must be discussed 
elsewhere. 

Defining Terms 1 

Allometry: A special form of similarity in which the property of interest scales across species with some 

constant power of species mass. 
Alveolus: A pulmonary air sac at the distal end of an airway. 

Arterioles: The smallest subdivision of the arterial tree proximal to the capillaries. 
ATP: Abbreviation of adenosine triphosphate, the source of chemical energy for a wide variety of 

metabolic reactions. 
Capillaries: The smallest class of blood vessel, between an arteriole and venule, whose walls consist of 

a single layer of cells. 
Cell: The smallest unit of living matter capable of independent functioning, composed of protoplasm 

and surrounded by a semipermeable plasma membrane. 
Convection: Mass transport resulting directly from fluid flow. 
Cytoplasm: The protoplasm or substance of a cell surrounding the nucleus, carrying structures within 

which most of the life processes of the cell take place. 
Cytoskeleton: An intracellular network of microtubules. 



'For more complete definitions see the Oxford Dictionary of Biochemistry and Molecular Biology, 1997. 
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Distal: At the downstream end of a flow system. 

Endothelium: A single layer of thin flattened cells lining blood vessels and some body cavities. 
Microcirculation: The three smallest types of blood vessels-arterioles, capillaries, and venules. 
Mitochondrion: Compartmentalized double-membrane self-reproducing organelle responsible for 

generating useable energy by formation of ATP In the average cell there are several hundred 

mitochondria each about 1.5 micrometers in length. 
Microtubule: Long, generally straight elements of the cytoskeleton, formed of the protein tubulin. 
Organelle: A specialized cytoplasmic structure of a cell performing a specific function. 
Parenchyma: The characteristic tissue of an organ or a gland, as distinguished from connective tissue. 
Proximal: At the upstream end of a flow system. 
Venules: The smallest vessels of the venous tree, distal to the capillaries. 
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Life for most eukaryotes, and certainly all mammals, begins as a single totipotent stem cell, the zygote. 
This cell contains the same complement of genes — no more and no less — as does every adult cell that 
will make up the organism once development is complete. Nonetheless, this cell has the unique charac- 
teristic of being able to implement every possible program of gene expression and is thus totipotent. 
How is this possible? It is now known that the selective activation and repression of genes distinguishes 
cells with different developmental potentials. Unraveling this complex series of genetic changes accom- 
panying the progressive restriction of developmental potential during ontogeny is the realm of modern 
developmental biology. In contrast to the zygote, which has unlimited developmental potential, an 
intestinal epithelial cell or a granulocyte, for example, is a highly developed cell type that is said to be 
differentiated. These cells are fixed with respect to their developmental potential and thus no longer 
posses the ability to contribute to other tissue types. Indeed, intestinal epithelial cells and granulocytes 
are incapable of undergoing further division and are said to be terminally differentiated. These mature 
cells have therefore undergone a process whereby they each have acquired a unique and complex repertoire 
of functions. These functions are usually associated with the cellular morphologic features and/or enzy- 
matic profiles required to implement a specific developmental or functional program. We will come back 
to the tissue dynamics of these two cell types in a later section of this chapter. 

Between the extremes of developmental potency represented by the zygote and terminally differentiated 
cells, there is obviously a tremendous number of cell divisions (roughly 2 44) and an accompanying 
restriction of this potential in zygotic progeny. The human body, for example, is composed of greater 
than 10 13 cells — all ultimately derived from one, the zygote. Where during the developmental sequence, 
does restriction occur? Is it gradual or quantal? These questions are fundamental to an understanding 
of developmental biology in general and stem cell biology in particular. 

116.1 Embryonic Stem Cells 

Let us consider first the ultimate human stem cell, the zygote, in more detail. As cellular growth begins, 
the early embryonic cells start to make a series of irreversible decisions to differentiate along various 
developmental pathways. This process is referred to as developmental commitment. Importantly, such 



O 2000 by CRC Press LLC 



decisions do not occur immediately; rather, the zygote divides several times and proceeds to the early 
blastocyst stage of development while maintaining totipotency in all its daughter cells. This is evident 
most commonly in the phenomenon of identical twins, where two distinct yet genetically matched 
embryos arise from the same zygote. The ability of early embryonic cells to maintain totipotency has 
been utilized by developmental biologists as a means to experimentally manipulate these embryonic stem 
cells, or ES cells, as they are commonly known. In 1981, two scientists at Cambridge, Evans and Kaufman, 
were able to isolate ES cells from a blastocyst-stage mouse embryo and demonstrate that such cells could 
be cloned and grown for many generations in vitro [Kaufman et al., 1983]. Remarkably, under the 
appropriate culture conditions, such cells remained completely totipotent. That is to say, upon reimplan- 
tation into another embryo, the stem cells could grow and contribute to the formation of an adult mouse. 
Importantly, the ES cells retained the developmental potential to differentiate into all the possible adult 
phenotypes, thereby proving their totipotency. 

Thus, in culture, the ES cells were said to self-renew without any subsequent loss of developmental 
potential. Upon reintroduction into the appropriate environment, the ES cells were able to differentiate 
into any of the various mature cell types. The basic decision of whether to self- renew or differentiate is 
a common theme found in stem cells of many developmental systems. Generally self-renewal and 
differentiation go hand in hand, i.e., the two events are usually inseparable. The important findings of 
Evans and Kaufman demonstrated that self-renewal and differentiation could be uncoupled and that an 
extended self-renewal phase of growth was attainable for mammalian embryonic cells. 

The advent of ES cell technology has had an enormous impact on the field of mammalian molecular 
genetics. The ability to culture ES cells was quickly combined with molecular techniques which allow for 
the alteration of cellular DNA. For example, if an investigator were interested in the function of a gene, 
he or she might elect to mutate the gene in ES cells so that it was no longer functional. The genetically 
altered ES cells would then be used to generate a line of mice which carry the so-called gene knockout 
[Roller & Smithies, 1992; Robertson, 1991]. By examining the consequences of such a mutation, clues 
to the normal function of a gene may be deduced. Techniques such as these have been widely used over 
the past 10 years and continue to develop as even more powerful means of studying basic cellular function 
become available. 

116.2 Control of Stem Cell Development 

The concepts of self-renewal and differentiation are central to the description of a stem cell; indeed, the 
potential to manifest these two developmental options is the only rigorous criterion used in defining 
what constitutes a true stem cell. Consequently, in studying stem cells, one of the most important 
questions to consider is how the critical choice whether to self-renew or differentiate is made. As seen 
in the case of ES cells, the environment of the stem cell or extrinsic signals determine the outcome of 
the self-renewal decision. In other words, such cells are not intrinsically committed to a particular 
developmental fate; rather, their environment mediates the differentiation decision. Surprisingly, for ES 
cells this decision is dictated by a single essential protein, or growth factor, known as Leukemia inhibitory 
factor (LIF) [Hilton & Gough, 1991; Smith et al., 1992]. In the presence of sufficient concentrations of 
LIF, ES cells will self-renew indefinitely in culture. Although ES cells eventually lose their totipotency in 
vitro, this is thought to be due to technical limitations of ex vivo culture rather than developmental 
decisions by the cells themselves. 

Interestingly, the default decision for ES cells appears to be differentiation, i.e., unless the cells are 
prevented from maturing by the presence of LIF, they will quickly lose their totipotent phenotype. Upon 
beginning the differentiation process, ES cells can be steered along a variety of developmental pathways 
simply by providing the appropriate extrinsic signal, usually in the form of a growth factor. 

Unfortunately, the control of other types of stem cells has proven more difficult to elucidate. In 
particular, the control of blood-forming, or hematopoietic stem cells, has been extensively studied, but 
as yet the developmental control of these cells is poorly understood. 
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116.3 Adult Stem Cells 

As the mammalian embryo develops, various organ systems are formed, and tissue-specific functions are 
elaborated. For the majority of mature tissues, the cells are terminally differentiated and will continue 
to function for extended periods of time. However, some tissues operate in a much more dynamic fashion, 
wherein cells are continuously dying and being replenished. Thee tissues require a population of stem 
cells in order to maintain a steady flow of fresh cells as older cells turn over. Although there are several 
examples of such tissue types, perhaps the best characterized are the hematopoietic system and the 
intestinal epithelia. These two cell types have population parameters that call for their continuous and 
rapid production: Both cell types occur in very large numbers (approximately 10 11 to 10 12 for the human 
hematopoietic system) and have relatively short life spans that can often be measured in days or sometimes 
even hours [Kroller & Pallson, 1993]. These two tissues in adults are therefore distinct (along with skin 
epithelium) in that they require tissue-specific stem cells in order to satisfy the inherent population 
dynamics of the system. 

Stem cells of this nature represent a population arrested at an intermediate level of developmental 
potency that permits them to perform the two classic stem cell functions: They are able to replenish and 
maintain their own numbers through cell divisions that produce daughter cells of equivalent potency, 
that is, self-renew. And, they have the capacity, depending on need, to differentiate and give rise to some, 
if not all, of the various mature cell types of that tissue. Stem cells of the small intestine, to the best of 
our knowledge, give rise to at least four highly specialized lineages found in the epithelium: Paneth, 
goblet, enteroendocrine, and enterocytes; these stem cells are therefore pluripotent (i.e., they have the 
potential to give rise to many different, but not all, lineages) [Potten & Loeffler, 1990]. Similarly, pluri- 
potent stem cells of the hematopoietic system give rise to an even wider variety of mature cells, including 
at least eight types of blood cells: the various lymphocytes, natural killer cells, megakaryocytes, erthroid 
cells, monocytes, and three types of granulocytes [Metcalf & Moore, 1971]. 

Proof of the existence of gut and hematopoietic stem cells came from studies of the effects of ionizing 
radiation on animals. This research, in the 1940s and 1950s, was spurred by concern over military and 
peaceful uses of atomic energy. It became recognized that the organ/tissue systems most susceptible to 
radiation damage were those that normally had a high turnover rate and were replenished by stem cell 
populations, i.e., the gut lining and the hematopoietic system. In particular, the latter was found to be 
the radiation dose-limiting system in the body. It was subsequently discovered that mice could be rescued 
from imminent death from radiation "poisoning" by the transfusion of bone marrow cells following 
exposure [Barnes et al., 1959]. Initially it was not clear whether the survival factor was humoral or cellular, 
but mounting evidence pointed to a cell-mediated effect, and, as the dose of bone marrow cells was 
titrated to determine the number required for survival, it was found that low numbers of cells resulted 
in the development of macroscopic nodules on the spleens of irradiated mice. 1 These nodules were 
composed of cells of some but not all lineages of blood formation — lymphopoiesis was notably missing 
[Till & McCulloch, 1961]. Low-level radiation was then used to induce unique chromosomal aberrations 
in bone marrow cells prior to transplantation into lethally irradiated recipients. In this way, unique 
microscopically identifiable translocations would be passed on to all progeny of an altered cell. It was 
found that the spleen nodules were, in fact, colonies of hematopoietic cells, all possessing an identical 
chromosomal marker [Becker al., 1963]. This observation strongly suggested that the nodule was a clonal 
population, derived from a single stem cell. Since several lineages were represented in the spleen colony, 
the stem cell responsible was pluripotent. Moreover, single spleen colonies could be isolated and injected 
into secondary irradiated recipients and give rise to additional spleen colonies. This suggested that the 
cell giving rise to a spleen colony was capable of some degree of self-renewal as well as multilineage 



'Injected bone-marrow-derived stem cells lodge and develop in several types of hematopoietic tissue, including 
spleen. Apparently, the splenic microenvironment can at least transiently support stem cell growth and development. 
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differentiation [Till et al., 1964]. The cells which give rise to spleen colonies were termed CFU-S for 
colony-forming unit-spleen and have been studied extensively in the characterization of pluripotent 
hematopoietic stem cells. 

More recent studies employing a similar strategy have used retroviruses to mark stem cells. The site 
of viral integration in an infected host cell genome is random and is passed with high fidelity to all 
progeny, thus by molecular means stem cell clones may be identified. Such an approach allows for the 
analysis not only of spleen colonies but of all anatomically dispersed lymphohematopoietic sites, including 
bone marrow, spleen, thymus, lymph nodes, and mature blood cells in the circulation [Dick et al,., 1985; 
Keller et al., 1985; Lemischka et al., 1986]. These analyses unequivocally showed that the same stem cell 
may give rise to all lineages, including lymphocytes. Repetitive blood cell sampling and analysis gave a 
temporal picture of the usage and fate of the stem cell population. In the first few weeks and months 
after transplant of nonlimiting numbers of stem cells, polyclonal hematopoiesis was the rule; however, 
after approximately 4-6 months, the number of stem cell clones was reduced. In fact, in some cases, a 
single stem cell clone was responsible for all hematopoiesis for over a year-about half the mouse's lifetime 
[Jorday & Lemischka, 1990] . These data were interpreted to mean that many stem cell clones were initially 
active in the irradiated recipient mice, but over time a subset of stem cells grew to dominate the 
hematopoietic profile. This implies either that not all the stem cells were equivalent in their developmental 
potential or that the stem cells were not all seeded in equivalent microenvironments and therefore 
manifest differing developmental potentials. Both these possibilities have been supported by a variety of 
subsequent experiments; however, the details of this observation remain cloudy. 

One piece of evidence suggesting intrinsic differences at the stem cell level comes from studies of 
allophenic mice [Mintz, 1971]. These artificially generated strains of laboratory mice are created by 
aggregating the early embryos of two distinguishable mouse strains. As mentioned previously, early 
embryonic cells are totipotent; thus, upon combining such cells from two strains, a chimeric mouse will 
arise in which both cell sources contribute to all tissues, including the stem cell population. Patterns of 
stem cell contribution in allophenic mice show that one strain can cycle more rapidly and thus contribute 
to mature blood cells early in life, whereas the other slow-growing strain will arise to dominate at later 
times. Importantly, upon reimplantation of allophenic bone marrow into a secondary irradiated recipient, 
the two phases of stem cell activity are recapitulated. Thus, the stem cells which had become completely 
quiescent in the primary animal were reactivated initially, only to be followed by a later phase of activity 
from the second strain [Van Zant, 1992]. These data suggest that intrinsic differences at the stem cell 
level rather than the local microenvironment can mediate differences in stem cell activity. 

Unlike most organ systems, mature cells of the lymphohematopoietic system are dispersed either in 
the circulation or in scattered anatomic sites such as the thymus, lymph, nodes, spleen, and, in the case 
of macrophages, in virtually all tissues of the body. The site of production of most of the mature cells, 
the bone marrow, is a complex tissue consisting of stromal elements, stem and progenitor cells, maturing 
cells of multiple lineages, and capillaries and sinusoids of the circulatory system into which mature cells 
are released. Spacial relationships between these different components are not well understood because 
of their apparently diffuse organization and because of the paucity of some of the critical elements, most 
notably the stem cells and early progenitors. 

In contrast, the small intestinal epithelium has a much more straightforward organization that has 
expedited the understanding of some of the critical issues having to do with stem cell differentiation. 
Numerous fingerlike projections of the epithelium, called villi, extend into the intestinal lumen to 
effectively increase the surface area available for absorption. Each villus is covered with approximately 
3500 epithelial cells, of which about 1400 are replaced on a daily basis [Potten & Loeffler, 1990]. 
Surrounding each villus are 6-10 crypts from which new epithelial cells are produced. They subsequently 
migrate to villi, and as senescent cells are shed from the villus tip, a steady progression of epithelial cells 
proceed unidirectionally to replace them. Crypts consist of only about 250 cells, including what is now 
estimated to be 16 stem cells. Since there are about 16 cells in the circumference of the crypt, stem cells 
occupy one circumferential ring of the crypt interior. This ring has been identified as the fourth from 
the bottom, directly above the Paneth cells. In addition, the fifth circumferential ring is occupied by 
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direct progeny of stem cells which retain pluripotency and, in emergencies, may function as stem cells. 
Given the detailed quantitative information available regarding stem cell numbers and epithelial cell 
turnover rates, the number of stem cell doublings occurring during a human life span of 70 years has 
been estimated to be about 5000. Whether this demonstrates that tissue-specific stem cells are immortal 
is the topic of the following section. 

116.4 Aging of Stem Cells 

Given the zygote's enormous developmental potential and that ES cells represent cells of apparently 
equivalent potency that can be propagated as cell lines, it is reasonable to ask whether aging occurs at 
the cellular level. Put another way, can normal cells, other than ES cells, truly self-replicate without 
differentiation or senescence? Or do ES cells represent unique examples of cells capable of apparently 
indefinite self- renewal without differentiation? One of the definitions of hematopoietic stem cells alluded 
to above, is that they self-replicate. Without self-renewal, it might be argued, a stem cell population may 
be exhausted in a time-frame less than a lifetime of normal hematopoiesis or in far less time in the event 
of unusually high hematopoietic demands associated with disease or trauma. If, for example, hemato- 
poietic stem cells can be propagated in vitro without differentiation, it could have tremendous impact 
on a number of clinically important procedures including bone marrow transplantation and gene therapy. 

In classic experiments studying fibroblast growth in vitro, Hayflick [1965] observed that there were a 
finite number of divisions (about 50) that a cell was capable of before reaching senescence. It has been 
thought that totipotent and pluripotent stem cells may be exempt from this constraint. An analysis above 
of intestinal epithelial stem cells suggested that in a lifetime they undergo several thousand replications, 
apparently without any loss in developmental potential. However, several lines of evidence call into 
question the immortality of hematopoietic stem cells and point to at least a limited self- renewal capacity. 
For example, studies in which marrow was serially transplanted from primary recipients to secondary 
hosts, and so on, the number of effective passages is only about four to five [Siminovitch et al., 1964]. 
After the first transplant, normal numbers of relatively late progenitors are produced, but the number 
of repopulating stem cells is either diminished or the cells' developmental potential attenuated, or both, 
resulting in a requirement for successively larger numbers of transplanted cells to have achieve engraftment. 
Another interpretation of these results is that the transplantation procedure itself is responsible for the 
declining repopulating ability of marrow, rather than an intrinsic change in the self-renewal capacity of 
the stem cells. According to this argument, the repetitive dissociation of stem cells from their normal 
microenvironmental niches in the marrow, and the required reestablishment of those contacts during 
seeding and serial engraftment, irreversibly alter their self-renewal capacity [Harrison et al., 1978]. A 
mechanistic possibility for this scenario is that differentiation is favored when stem cells are not in contact 
with their stromal microenvironment. In this context, exposure to growth factors has an overwhelming 
differentiating influence on stem cells in suspension that is normally tempered by stromal associations. 

Recently, an intriguing series of findings has emerged which may at least partially explain cellular 
aging. At the end of chromosomes there is a specialized region of DNA known as a telomere. This segment 
of DNA is comprised of hundreds of short six-nucleotide repeats of the sequence TTAGGG. It has been 
found that the length of telomeres varies over the life of a cell. Younger cells have longer telomeres, and 
as replication occurs the telomeres can be seen to shorten. It is thought that via the normal DNA 
replication process, the last 50-200 nucleotides of the chromosome fail to be synthesized, and thus 
telomeres are subject to loss with every cell division (reviewed in Blackburn [1992] and Greider [1991]). 
Consequently, telomeric shortening may act as a type of molecular clock. Once a cell has undergone a 
certain number of divisions, i.e., aged to a particular extent, it becomes susceptible to chromosome 
destabilization and subsequent cell death. Importantly, the rate at which telomeric sequence is lost may 
not be constant. Rather, some cells have the ability to regenerate their telomeres via an enzymatic activity 
known as telomerase. By controlling the loss of telomeric sequence, certain cell types may be able to 
extend their ability to replicate. Perhaps primitive tissue such as ES cells, when cultured with LIF, are 
able to express high levels of telomerase and thereby maintain their chromosomes indefinitely. Similarly, 
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perhaps early hematopoietic stem cells express telomerase, and as differentiation occurs, the telomerase 
activity is downregulated. Although intriguing, these hypotheses are very preliminary, and much more 
basic research will be required to elucidate the mechanisms of the stem cell replication and aging. 

In contrast to normal mechanisms of preserving replicative ability, a type of aberrant self-renewal is 
observed in the phenomenon of malignant transformation, or cancer. Some hematopoietic cancers are 
thought to originate with a defect at an early stem or progenitor cell level (e.g., chronic myelogenous 
leukemia). In this type of disease, normal differentiation is blocked, and a consequent buildup of 
immature, nonfunctional hematopoietic cells is observed. Malignant or neoplastic growth comes as a 
consequence of genetic damage or alteration. Such events range from single nucleotide changes to gross 
chromosomal deletions and translocations. Mechanistically, there appear to be two general types of 
mutation which cause cancer. One, activation of so-called oncogenes, is a dominant event and only needs 
to occur in one of a cell's two copies of the gene. Second, inactivation of tumor-suppressor genes removes 
the normal cellular control of growth and results in unchecked replication. These two categories of genetic 
alteration are analogous to stepping on a car's accelerator versus releasing the brake; both allow movement 
forward. Importantly, malignancy often comes as the result of a multistep process, whereby a series of 
genetic alterations occur. This process has been shown to involve different combinations of genes for 
different diseases [Vogelstein & Kinzler, 1993]. 

116.5 Other Types of Stem Cells 

Other tissues may also have stem cell populations contributing to the replacement of effete mature cells. 
For example, in the liver only a very small faction (2.5-5 x 10~ 5 ) of hepatocytes is dividing at any given 
time, resulting in a complete turnover time of about 1 year [Sell, 1994]. This compares with the complete 
turnover of intestinal epithelia or granulocytes in a period of a few days. Nonetheless, growing evidence, 
some of which remains controversial, suggests that hepatic stem cells play a role in this tissue turnover. 
A moderate loss of liver tissue due to mild or moderate insult is probably replaced by the division of 
mature hepatocytes. However, a severe loss of hepatic tissue is thought to require the enlistment of the 
putative stem cell population, morphologically identified as oval cells. 

Similarly, Noble's group in London has identified cells in the rat optic nerve that have the requisite 
functions of stem cells [Wren et al., 1992]. These cells, called oligodendrocyte-type 2 astrocyte (0-2A) 
progenitors, are capable of long-term self-renewal in vitro and give rise to oligodendrocytes through 
asymmetric divisions resulting in one new progenitor and one cell committed to differentiation. Con- 
version in vitro of 0-2A progenitors into rapidly dividing and differentiating cells has been shown to be 
regulated extrinsically by platelet-derived growth factor (PDGF) and the basic form of fibroblast growth 
factor (bFGF) [Wolswijk & Noble, 1992]. Since these two growth factors are known to be produced in 
vivo after brain injury, a mechanism is suggested for generation of the large numbers of new oligoden- 
drocytes required subsequent to trauma and demyelination. 

116.6 Summary 

• The zygote is the paradigm of a totipotent stem cell. 

• ES cells derived from early embryos can be propagated indefinitely and maintain totipotency when 
cultured in the presence of LIE Experimental control of differentiation and self-renewal in adult 
stem cells is being extensively investigated. 

• Stem cells are defined by two characteristic traits: (1) Stem cells can self-renew, and (2) they can 
produce large numbers of differentiated progeny. Stem cells possess the intrinsic ability to manifest 
either trait; however, extrinsic factors mediate their developmental fate. 

• Tissue-specific stem cells are pluripotent but not totipotent. 

• Intestinal, epithelial, and hematopoietic tissues are classical self-renewing systems of the adult. In 
addition, recent studies have indicated the presence of stem cells in several other tissues (e.g., liver, 
nervous system). 
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• Although stem cells clearly have an extensive replication potential, it is not clear whether they are 
truly immortal. Stem cells may possess the ability to circumvent normal cellular processes that 
determine aging at the cellular level. 

• Mutations at the DNA level can alter normal cellular control of stem cell replication and differ- 
entiation. This type of even can lead to aberrant development and subsequent malignancy. 

Defining Terms 

Commitment: The biologic process whereby a cell decides which of several possible developmental 

pathways to follow. 
Differentiation: Expression of cell- or tissue-specific genes which results in the functional repertoire 

of a distinct cell type. 
ES cells: Mouse stem cells originating from early embryonic tissue, capable of developing into any of 

the adult cell types. 
Gene knockout: Deletion or alteration of a cellular gene using genetic engineering technology (generally 

performed on ES cells). 
Hematopoietic: Blood forming. 
Ontogeny: The process of development, generally referring to development from the zygote to adult 

stages. 
Pluripotent: Capable of differentiation into multiple cell types. 

Self-renew: Term describing cellular replication wherein no developmental commitment or differen- 
tiation takes place. 
Terminally differentiated: The final stage of development in which all cell-specific features have been 

attained and cell division is no longer possible. 
Totipotent: Capable of differentiation into all possible cell types. 
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The characteristic architecture of a tissue results from an interplay of many cellular processes. In addition 
to the secretion of extracellular matrix, we may distinguish between processes related to the cell cycle — cell 
growth, division, differentiation, and death — and processes related to cell motility — cell translocation, 
directed motile responses, associated movements, and remodeling of the extracellular matrix. These 
processes are controlled and directed by cell-cell interactions, by cell-matrix interactions, and by cellular 
interactions with the fluid phase of the tissue. It is known that all three types of interactions can control 
both the speed and direction of cell translocation. This control results in the directed motile responses, 
which are the main subject of this chapter. Learning how to manipulate these motile responses experi- 
mentally will eventually become an essential aspect of tissue engineering. 

Probably the greatest challenge to tissue engineering lies in understanding the complex dynamic 
systems that arise as a result of feedback loops in these motile interactions. Not only is cell translocation 
controlled by the fluid phase, by the matrix, and by other cells of the tissue, but cell motility can 
profoundly influence the fluid phase, remodel the matrix, and influence the position of other cells by 
active cell-cell responses or by associated movements. It is often forgotten that, especially in "undiffer- 
entiated" types of tissue cells such as fibroblasts, the function of the cell's motile apparatus is not only 
to haul the cell through the extracellular matrix of the tissue spaces but also to remodel this matrix and 
to change the positions of other cells mechanically by exerting tension on cell-matrix and cell-cell 
adhesions. These complex dynamic systems lie at the heart of pattern formation in the tissue, and indeed 
in the developing embryo, and understanding them will require not only a knowledge of the motile 
responses but also an understanding of the mechanism of cell motility itself. 

In the study of cell motility, a great deal is now known about the relative dispositions of specific 
molecules that are thought to contribute to the motile process, and the dynamics of their interactions 
are beginning to be unraveled, yet there appears to have been comparatively little progress toward a 
satisfactory explanation of how a cell moves. There are some molecular biologists who still believe that 
it is just a question of time before the current molecular genetic thrust will alone come up with the 
answers. But there is a rapidly growing climate of opinion, already prevalent among physicists and 
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engineers, that nonlinear dynamic processes such as cell motility have emergent properties that can never 
be completely understood solely from a knowledge of their molecular basis. Cell locomotion, like muscle 
contraction, is essentially a mechanical process, and a satisfactory explanation of how it works will 
inevitably require a study of its mechanical properties. Unlike muscle, the cellular motile apparatus is a 
continuously self-organizing system, and we also need to know the overall dynamics of its capacity for 
reorganization. These outstanding areas of ignorance are essentially problems in engineering. There is a 
nice analogy for this conceptual gap between the molecular and engineering aspects of biologic pattern 
formation in Harrison's new book on the kinetic theory of living pattern [Harrison, 1993]: "...one cannot 
supplant one end of a bridge by building the other. They are planted in different ground, and neither 
will ever occupy the place of the other. But ultimately, one has a bridge when they meet in the middle." 
This chapter is intended to encourage the building of that bridge. 

117.1 Directed Motile Responses in Vivo 
Cellular Interactions with the Fluid Phase 

Cellular responses to the fluid phase of tissue spaces are thought to be mediated largely by specific 
diffusible molecules in the fluid phase. By far the most important directed response is chemotaxis: the 
unidirectional migration of cells in a concentration gradient of some chemoattractant or chemorepellent 
substance. Its study dates back to long before the advent of tissue culture, since it is a widespread response 
among the free-living unicellular organisms. The most widely studied system in vertebrates is the chemo- 
taxis of neutrophil leukocytes in gradients of small peptides. In the case of tissue cells, it has long been 
conjectured, usually on the basis of surprisingly little evidence, that the direction of certain cell migrations 
also might be determined by chemotaxis, and it is recently becoming clear that chemotaxis can in fact 
occur in several vertebrate tissue cells in culture, particularly in gradients of growth factors and related 
substances. Yet whether it does occur naturally, and under what circumstances, is still an area of dispute. 
The concentration gradients themselves are usually speculated to arise by molecular diffusion from 
localized sources, although a nonuniform distribution of sinks could equally explain them, and they are 
only likely to arise naturally in conditions of very low convective flow. Besides controlling the migration 
of isolated cells, there is some evidence that gradients of chemoattractants also may control the direction 
of extension of organized groups of cells such as blood capillary sprouts, and of cellular processes, such 
as nerve axons. Although the mechanisms of these responses may be closely allied to those chemotaxis, 
they should more properly be classed as chemotropism, since the effect is to direct the extension of a 
process rather than the translocation of an isolated cell. The influence on tissue architecture of chemotaxis 
and chemotropism is possibly to determine the relative positions of different cell types and to determine 
the patterns of angiogenesis and innervation — though this is still largely conjectural. Chemotaxis is 
potentially a powerful pattern generator if the responding cells can modify and/or produce the gradient 
field. 

Apart from chemotaxis, there also exists the possibility of mechanically mediated cellular responses 
to flow conditions in the fluid phase. The principal situation in vivo where this is likely to be important 
is in the blood vessels, and the mechanical effects of flow on the endothelial cells that line the vessel walls 
has been investigated. 

Cellular Interactions with the Acellular Solid Phase 

In a tissue, the cellular solid phase is the extracellular matrix, which is usually a fibrillar meshwork, 
though it may take the laminar form of a basement membrane. One directed response to the fibrillar 
extracellular matrix is the guidance of cell migrations, during embryonic development, for example, 
along oriented matrix fibrils. The discovery of this response followed soon after the dawn of tissue culture 
and is generally attributed to Paul Weiss, who named it contact guidance in the 1930s, though Loeb and 
Fleisher had already observed in 1917 that cells tend to follow the "path of least resistance" along oriented 
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matrix fibrils (see [Dunn, 1982]). In culture, contact guidance constrains cell locomotion to be predom- 
inantly bidirectional, along the alignment axis of the matrix, whereas many embryonic migrations are 
predominantly unidirectional. This raises the question of whether contact guidance alone can account 
for these directed migrations in vivo or whether other responses are involved. The influence on tissue 
architecture of contact guidance is less conjectural than that of chemotaxis, since matrix alignment is 
more easily observed than chemical gradients, and the cells themselves are often coaligned with the 
matrix. Directed motion can be inferred since, in culture, this orientation of the cell shape on aligned 
surfaces is strongly correlated with an orientation of locomotion. In conjunction with cellular remodeling 
of the matrix, contact guidance becomes a potentially powerful generator of pattern. Since the cells, by 
exerting tension, can align the matrix, and the matrix alignment can guide the cells, a mutual interaction 
can arise whereby cells are guided into regions of higher cell density. Harris and colleagues [1984] have 
shown that such a feedback loop can result in the spontaneous generation of a regular array of cell clusters 
from a randomly distributed field of cells and matrix. The pattern of feather formation in birds' skin, 
for example, might arise by just such a mechanism. 

Several other types of directed response may be mediated by the properties of the solid phase of the 
cell's environment. One is chemoaffinity, which Sperry [1963] proposed could account for the specific 
connections of the nervous system by guiding nerve fibres along tracks marked out by specific molecules 
adsorbed to the surface of substratum. A similar response has been proposed to account for the directional 
migration of primordial germ cells. 

Cellular Interactions with Other Cells 

Neighboring cells in the tissue environment may be considered as part of the solid phase. Thus it has 
been reported that when cells are used as a substratum for locomotion by other cells, directed responses 
such as contact guidance or chemoaffinity may occur, and these responses may persist even if the cells 
used as a substratum are killed by light fixation. However, the reason that cell-cell interactions are dealt 
with separately here is that cells show a directed response on colliding with other living cells that they 
do not show on colliding with cells that have been lightly fixed. Contact inhibition of locomotion, 
discovered by Abercrombie and Heaysman [1954], is a response whereby a normal tissue cell, on collision 
with another, is halted and eventually redirected in its locomotion. The effect of the response is to prevent 
cells from using other cells as a substratum, and a distinguishing feature of the response to living cells 
is a temporary local paralysis of the cell's active leading edge at the site of contact, which does not generally 
occur with a chemoaffinity type of response. The influence on tissue architecture of contact inhibition is 
probably profound but not easily determined. It is possibly the main response by which cells are kept 
more or less in place within a tissue, rather than milling around, and a failure of contact inhibition is 
thought to be responsible for the infiltration of a normal tissue by invasive malignant cells. Contact 
inhibition can cause cell locomotion to be directed away from centers of population and thus gives rise 
to the radial pattern of cell orientation that is commonly observed in the outgrowths from explant 
cultures. A major question is whether this motile response is related, mechanistically, to the so-called 
contact inhibition of growth, which is also known to fail in malignant cells but which is probably mediated 
by diffusion of some signal rather than by cell-cell contact. 

117.2 Engineering Directed Motile Responses in Vitro 

The investigation of the directed motile responses, using tissue culture, can reveal many aspects of the 
mechanisms that control and direct cell motility in vivo and also give a valuable insight into the mech- 
anism of cell motility. In fact, most of what we know about these responses in vivo is deduced from 
experiments in culture. On the other hand, many of the responses discovered in culture may never occur 
naturally, and yet their study is equally important because it may yield further clues to the mechanism 
of cell motility and result in valuable techniques for tissue engineering. However, responses to properties 
of the culture environment, such as electric or magnetic fields, that are not yet known to have any 
counterparts in vivo will not be dealt with here. 
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A general experimental approach to engineering motile responses is first to try to reproduce in culture 
some response that appears to occur in vivo. The main reason is that the cell behavior can be observed 
in vitro, whereas this is possible in vivo only in very few instances. Another important reason is that once 
achieved in vitro, the response may be "dissected," by progressively simplifying the culture environment, 
until we isolate one or more well-defined properties that can each elicit a response. To be successful, this 
approach not only requires the design of culture environments that each isolate some specific environ- 
mental property and simultaneously allow the resulting cell behavior to be observed but also requires 
methods for adequately quantifying this resulting behavior. 

When designing an artificial environment, it is important to consider its information content. Obvi- 
ously, a uniform field of some scalar quantity is isotropic and cannot, therefore, elicit a directed motile 
response. A nondirected motile response, such as a change in speed caused by a change in some scalar 
property, is generally called a kinesis. Anisotropic uniform fields may be vector fields or may be able to 
distinguish opposite directions in the field and exhibit a unidirectional response, generally known as a 
taxis. Or cell movement perpendicular to the vector may predominate, which is sometimes known as a 
diataxis. In the case of a uniform field of some symmetric second-order tensor, such as strain or surface 
curvature, there is simply no information to distinguish opposite directions in the field, and yet orthog- 
onal axes may be distinguished. This can give rise to a bidirectional response or, in three-dimensional 
fields, also to a response where translocation along one axis is suppressed. There is no generally agreed 
on name to cover all such possible responses, but the term guidance will be used here. Some examples 
of culture environments with specific physiocochemical properties are given below. 

Environments with Specific Properties of the Fluid Phase 
A Linear Concentration Gradient 

The most common method of reproducing the chemotaxis response in vitro is to use a Boyden chamber 
in which a gradient of some specific chemical is formed by diffusion across a membrane filter. The 
resulting directed cell translocation is inferred from the relative number of cells that migrate through 
the pores of the filter from its upper to its lower surface. While this system is very useful for screening 
for potential chemoattractants, its usefulness for investigating the mechanism of the motile response is 
strictly limited, since it does not fulfill our two main criteria for an in vitro system. In the Boyden chamber, 
the properties of the environment are not well defined (since the gradient within the narrow, often 
tortuous pores of the filter is unpredictable), and the cell response cannot be observed directly. The 
Zigmond chamber was introduced to overcome these difficulties, by allowing the cell behavior to be 
observed directly, but the gradient is very unstable and cannot be maintained reliably for longer than an 
hour or two. Zicha and colleagues [1991] have recently developed a direct viewing chemotactic chamber 
with much greater stability and better optical properties. The chamber is constructed from glass and has 
an annular bridge separating concentric wells (Fig. 1 17.1). When covered by a coverslip carrying the cells, 
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FIGURE 117.1 The Dunn Chemotaxis chamber. 



O 2000 by CRC Press LLC 



a gap of precisely 20 |J.m is formed between coverslip and bridge in which the gradient develops. The 
blind central well confers the greater stability, which allows chemotactic gradients to be maintained for 
many hours and thus permits the chemotactic responses of slowly moving tissue cells and malignant 
cells to be studied for the first time. Weber Scientific International, Ltd., manufactures this as the Dunn 
chamber. 

In use, both wells of the chamber are initially filled with control medium. The coverslip carrying the 
cells is then inverted over the wells, firmly seated, and sealed with wax in a slightly offset position (shown 
by the dashed lines) to allow access to the outer well. The outer well is then emptied using a syringe, 
refilled with medium containing the substance under test at known concentration, and the narrow 
opening is sealed with wax. 

Assuming that diffusion is the only mechanism of mass transport in the 20-p.m gap between coverslip 
and bridge, whereas convection currents keep the bulk contents of the two wells stirred, then the 
concentration in the 20-p.m gap as a function of distance r from the center of the inner well is given by 
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where C, and C are the concentrations in the inner and outer wells, respectively, and a and b are the 
inner and outer radii of the bridge. Because the bridge is annular, the gradient is slightly convex, but the 
deviation from linearity is very small. 

Figure 117.2 shows the formation of the gradient during the first hour for a molecule with diffusion 
coefficient D = 13.3 x 10~ 5 mm 2 /s, such as a small globular protein of molecular weight 17 kD and 
chamber dimensions a = 2.8 mm, b = 3.9 mm. The equations describing gradient formation are given 
in Zicha et al. [1991], but it suffices here to show that the gradient is almost linear after 30 minutes. The 
flux from outer to inner well though the gap of height h ( = 20 Lim) is given by 
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FIGURE 117.2 Formation of the gradient in the Dunn chemotaxis chamber (see text). 
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This flux tends to destroy the gradient, and the half-life of the gradient is equal to (In 2)/kD, where k is 
a constant describing the geometric properties of the chamber. For a chamber with volumes v and v, of 
the outer and inner wells, respectively, k is given by 
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Thus, for our small protein, a chamber with v D = 30 (J.1, v, = 14 p.1, and other dimensions as before gives 
a gradient with a half-life of 33.6 hours. This is ample time to study the chemotaxis of slowly moving 
tissue cells, with typical speeds of around 1 (J.m per minute, as well as permitting the study of long-term 
chemotaxis in the more rapidly moving leukocytes with typical speeds around 10 p.m per minute. 

From the point of view of information content, a linear concentration gradient may be viewed as a 
nonuniform scalar field of concentration or as a uniform vector field of concentration gradient. Thus, 
if the cell can distinguish between absolute values of concentration, as well as being able to detect the 
gradient, then the chemotaxis response may be complicated by a superimposed chemokinesis response. 
A stable linear concentration gradient is a great advantage when trying to unravel such complex responses. 
Other sources of complexity are the possibilities that cells can modify the gradient by acting as significant 
sinks, can relay chemotactic signals by releasing pulses of chemoattractant, or can even generate chemo- 
tactic signals in response to nonchemotactic chemical stimulation. These possibilities offer endless oppor- 
tunities for chaotic behavior and pattern generation. 

A Gradient of Shear Flow 

The flow conditions in the fluid phase of the environment can have at least two possible effects on cell 
motility. First, by affecting mass transport, they can change the distribution of molecules, and second, 
they can cause a mechanical shear stress to be exerted on the cells. Figure 1 17.3 shows a culture chamber 
designed by Dunn and Ireland [1985] that allows the behavior of cells to be observed under conditions 
of laminar shear flow. If necessary, the cells may be grown on the special membrane in Petriperm culture 
dishes (Heraeus), which allows gaseous exchange into the medium beneath the glass disk. Laminar shear 
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FIGURE 117.3 A culture chamber for laminar shear flow. 
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flow is probably the simplest and best defined flow regime, and the shear flow produced by an enclosed 
rotating disk has been described in detail [Daily and Nece, I960]. Very low flow rates, caused by rotating 
the disk at around 1 rpm with a separation of about 1 mm between disk and cells, are useful for causing 
a defined distortion to diffusion gradients arising as a result of cell secretion or adsorption of specific 
molecules. Much information about, for example, chemoattractants produced by cells may be obtained 
in this way. Higher shear stresses, around 100 times greater, are known to affect the shape and locomotion 
of cells mechanically, and higher rotational speeds and/or smaller separations will be needed to achieve 
these. 

Environments with Specific Properties of the Solid Phase 
Aligned Fibrillar Matrices 

The bidirectional guidance of cells by oriented fibrillar matrices is easily replicated in culture models. 
Plasma clots and hydrated collagen lattices are the most commonly used substrates, and alignment may 
be achieved by shear flow, mechanical stress, or strong magnetic fields during the gelling process. The 
magnitude and direction of orientation may be monitored, after suitable calibration, by measuring the 
birefringence in a polarizing microscope equipped with a Brace-Kohler compensator. These methods of 
alignment result in environments that are approximately described as uniform tensor fields, since 
mechanical strain is a familiar second-order tensor, and they generally give rise to bidirectional motile 
responses. In three dimensions, the strain tensor that results from applying tension along one axis can be 
described as a prolate ellipsoid, whereas applying compression along one axis results in an oblate ellipsoid. 
Thus we might distinguish between prolate guidance, in which bidirectional locomotion predominates 
along a single axis, and oblate guidance, in which locomotion is relatively suppressed along one of the 
three axes. But unidirectional information can be imposed on an oriented fibrillar matrix. Boocock [1989] 
achieved a unidirectional cellular response, called desmotaxis, by allowing the matrix fibrils to form 
attachments to the underlying solid substratum before aligning them using shear flow. This results in an 
asymmetrical linkage of the fibrils, and similar matrix configurations may well occur in vivo. 

Fibrillar environments in culture are probably good models for the type of cell guidance that occurs 
in vivo, but they are so complex that it is difficult to determine which anisotropic physicochemical 
properties elicit the cellular response. Among the possibilities are morphologic properties (the anisotropic 
shape or texture of the matrix), chemical properties (an oriented pattern of adhesiveness or specific 
chemical affinity) and mechanical properties (an anisotropic viscoelasticity of the matrix). One approach 
to discovering which properties are dominant in determining the response is to try to modify each in 
turn while leaving the others unaffected. This is difficult, though some progress has been made. Another 
approach is to design simpler environments with better defined properties, as described in the sections 
that follow. 

Specific Shapes and Textures 

Ross Harrison, the inventor of tissue culture, placed spiders' webs in some cultures and reported in 1912 
that "The behavior of cells . . . shows not only that the surface of a solid is a necessary condition but also 
that when the latter has a specific linear arrangement, as in the spider web, it has an action in influencing 
the direction of movement." It is hardly surprising to us today that a cell attached to a very fine fiber, in 
a fluid medium, is constrained to move bidirectionally. Nevertheless, such stereotropism or guidance by 
substratum availability is a guidance response and may have relevance to the problem of guidance by 
aligned fibrillar matrices. Moreover, in the hands of Paul Weiss, the guidance of cells on single cylindrical 
glass fibers was shown to have a more subtle aspect and it is now known that fibers up to 200 p.m in 
diameter, which have a circumference approximately 10 times greater than the typical length of a fibroblast, 
will still constrain the locomotion to be parallel with the fiber axis. And so we may distinguish guidance 
by substratum shape, sometimes known as topographic or morphographic guidance, from guidance by 
substratum availability. The surface curvature of a cylinder is a uniform tensor field, and since opposite 
directions within the surface are always equivalent, the cellular response must be bidirectional. 
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FIGURE 117.4 Diagrammatic cross-section of fibroblasts on substrata of various shapes. 

Figure 117.4a is a diagrammatic cross section of a fibroblast attached to a convex cylindrical surface. 
These surfaces are easily made to any required radius of curvature by pulling glass rod in a flame. Dunn 
and Heath [1976] speculated that a cell must have some form of straightedge in order to detect slight 
curvatures of around 100 u.m in radius. Obvious candidates were the actin cables that extend obliquely 
into the cytoplasm from sites of adhesion. These cables or stress fibers are continually formed as the 
fibroblast makes new adhesions to the substratum during locomotion and are known to contact and 
thereby to exert a tractive force on the substratum. The bundles of actin filaments are shown in the 
diagram as sets of parallel straight lines meeting the substratum at a tangent, and it is clear that the cables 
could not be much longer than this without being bent around the cylinder. Dunn and Heath proposed 
as an explanation of cell guidance along cylinders that the cables do not form in a bent condition and 
hence the traction exerted by the cell is reduced in directions of high convex curvature. 

Further evidence for this hypothesis was found by observing cell behavior on substrata with other 
shapes. On concave cylindrical surfaces made by drawing glass tubing in a flame, the cells are not guided 
along the cylindrical axis but tend to become bipolar in shape and oriented perpendicular to the axis 
[Dunn, 1982]. This is to be expected, since, as shown in Fig. 117.4fo, concave surfaces do not restrict 
formation of unbent cables but allow the cells to spread up the walls, which lifts the body of the cell clear 
of the substratum and thus prevents spreading along the cylinder axis. On substrata made with a sharp 
change in inclination like the pitched roof of a house, the hypothesis predicts that locomotion across the 
ridge is inhibited when the angle of inclination is greater than the angle at which the actin cables normally 
meet a plane substratum, as in Fig 117.4c. These substrata are more difficult to make than cylinders and 
require precision optical techniques for grinding and polishing a sharp and accurate ridge angle. Fibro- 
blasts behaved as predicted on these substrata, the limiting angle being about 4 degrees, and high-voltage 
electron microscopy revealed the actin cables terminate precisely at the ridge on substrata with inclina- 
tions greater than this. 

Substrata with fine parallel grooves have long been known to be very effective for eliciting morpho- 
graphic guidance and are interesting because they can be a well-defined mimicry of some of the shape 
properties of an aligned fibrillar matrix while being mechanically rigid. Effective substrata can easily be 
made by simply scratching a glass surface with a very fine abrasive, but such substrata are not well defined 
and their lack of uniformity may give rise to variations in macroperties such as wettability, that cannot 
be ruled out as causing the guidance. Early attempts to make better-defined substrata used ruling engines 
such as those used to make diffraction gratings, but Dunn and Brown [1986] introduced electron beam 
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lithography followed by ion milling to make grooves of rectangular cross section down to about 1 rim 
in width. Clark and colleagues [Clark et al., 1991] have now achieved rectangular grooves with spacings 
as low as 260 nm using the interference of two wavefronts, obtained by splitting an argon laser beam, to 
produce a pattern of parallel fringes on a quartz slide coated with photoresist. Groove depths a small as 
100 nm can elicit a guidance response from certain cell types, and the main reason for pursuing this line 
of inquiry is now to discover the molecular mechanism responsible for this exquisite sensitivity. 

Figure 1 1 7.4c shows a diagrammatic cross section of a fibroblast on a substratum consisting of a parallel 
array of rectangular grooves. One question that has been debated is whether the cells generally sink into 
the grooves, as shown here, or bridge across them. In the latter case, the wall and floor of the grooves 
are not an available substratum, and the cellular response might be a form of guidance by substratum 
availability. Ohara and Buck [1979] have suggested that this might occur, since the focal adhesions of 
fibroblasts are generally elongated in the direction of cell movement, and if they are forced to become 
oriented by being confined to the narrow spaces between grooves, this may force the locomotion into 
the same orientation. On the other hand, if the cells do generally sink into the grooves, then the Dunn 
and Heath hypothesis also could account for guidance 
even by very fine grooves, since individual actin fila- 
ments in the bundles, shown as dashed lines in the inset 
to the figure, would become bent and possibly dis- 
rupted if the cell made any attempt to pull on them 
other than in a direction parallel with the grooves. It 
is still therefore an unresolved issue whether different 
mechanisms operate in the cases of cylinders and 
grooves or whether a common mechanism is responsi- 
ble for all cases of guidance by the shape of the substra- 
tum. Other groove profiles, particularly asymmetrical 
ones such as sawtooth profiles, will be needed for test- 
ing these and other rival hypotheses, and an intriguing 
possibility is that a unidirectional cell response might 
be achieved on microfabricated substrata with two 
orthogonal arrays of parallel sawtooth grooves, as 
shown in Fig. 117.5. 




FIGURE 117.5 A proposed microfabricated sub- 
stratum with two orthogonal arrays of parallel saw 
tooth grooves. 



Specific Patterns of Adhesiveness 

The equivalent in culture of the chemoaffinity response is guidance by differential adhesiveness, in 
which cell locomotion is confined to regions of higher adhesiveness patterned on the substrate. As with 
grooved surfaces, adhesive tracks that guide cells effectively are easily made by a variety of methods, 
including physically streaking nonadhesive viscous materials on an adhesive substratum or scratching 
through a nonadhesive film overlying an adhesive substratum. Again, however, these easily made surfaces 
are not well defined, and in particular, their anisotropic adhesiveness tends to be contaminated by 
anisotropic surface texture and sometimes by anisotropic mechanical properties. Carter [1965] was 
probably the first to describe a method of printing a well-defined pattern of adhesiveness onto a sub- 
stratum; he used the vacuum evaporation of palladium, through a mask, onto a glass substratum made 
nonadhesive by first coating it with cellulose acetate. Clark and colleagues [1992] have now described a 
method for fabricating any required pattern of differential adhesiveness by using photolithography to 
obtain a hydrophobic pattern of methyl groups covalently coupled to a hydrophilic quartz substratum. 
The most recent developments in their laboratories are to use these patterns of hydrophobicity as 
templates for patterning specific proteins onto the substratum, and it seems that soon it will be possible 
to make almost any required pattern in any combination of proteins. 

The explanation of the guidance of cells along tracks of higher adhesiveness appears to be obvious. In 
extreme cases, when the cells cannot adhere at all to the substrate outside the track, then the response 
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is equivalent to guidance by substratum availability, and if the track happens to be sufficiently narrow, 
cell locomotion is restricted to the two directions along the track. But guidance along tracks of higher 
adhesiveness may still be very pronounced even when the cells can also adhere to and move on the regions 
of lower adhesiveness. The explanation in this case is that on encountering boundaries between regions 
of different adhesiveness, cells will cross them far more frequently in the direction from lower to higher 
adhesiveness. It is generally assumed that this results from a tug-of-war competition, since traction can 
be applied more effectively by the parts of the cell overlapping the region of higher adhesiveness. It is 
not known, however, how the traction fails in those parts of the cell which lose the competition, whether 
by breakage or slipping of the adhesions or by a relative failure of the contractile apparatus. Another 
possibility is simply that the cell spreads more easily over the more highly adhesive regions. 

One reason for studying guidance by differential adhesiveness is to discover whether it can account 
for guidance by oriented extracellular matrices. An array of very narrow, parallel stripes of alternately 
high and low adhesiveness mimics the linear arrangement of substratum availability in an aligned matrix. 
Dunn [1982] found that if the repeat spacing is so small that a cell can span several stripes, there is no 
detectable cell orientation or directed locomotion even though an isolated adhesive stripe can strongly 
guide cells. Clark and colleagues [1992] confirmed this observation with one cell type (BHK) but found 
that cells of another type (MDCK) could become aligned even when spanning several stripes but would 
become progressively less elongated as the repeat spacing decreased. It is not yet clear, therefore, whether 
the linear arrangement of substratum availability in an aligned matrix might contribute to the guidance 
response in some cell types. It is clear from work of Clark and colleagues, however, that the adhesive 
stripes become less effective in eliciting guidance as their repeat spacing decreases, whereas the opposite 
is true for grooved surfaces. Thus it seems unlikely that substratum availability is the mechanism of 
guidance by grooved surfaces and, conversely, unlikely that adhesive stripes guide cells by influencing the 
orientation of the focal adhesions as suggested for grooved surfaces by Ohara and Buck [1979]. 

Binary patterns of adhesiveness were not the only ones studied by Carter [1965]. His technique of 
shadowing metallic palladium by vacuum evaporation onto cellulose acetate also could produce a graded 
adhesiveness. By placing a rod of 0.5 mm diameter on the substratum before shadowing, he found that 
the penumbral regions of the rod's shadow acted as steep gradients of adhesiveness that would cause 
cultured cells to move unidirectionally in the direction of increasing adhesiveness. This is therefore a 
taxis as distinct from a guidance response, and he named it haptotaxis. It is still not clear whether 
haptotaxis plays any role in vivo. 

Specific Mechanical Properties 

As yet, there has been no demonstration that anisotropic mechanical properties of the substratum can 
elicit directed motile responses. However, it is known that isotropic mechanical properties, such as the 
viscosity of the substratum, can influence cell locomotion[Harris, 1982], and it appears that changing 
the mechanical properties of aligned matrices can reduce cell guidance [Dunn, 1982; Haston et al„ 1983], 
although it is probable that other properties are altered at the same time. Moreover, the phenomenon of 
desmotaxis suggest that it is the asymmetrical mechanical linkage of the fibrils that biases the locomotion. 
Guidance by anisotropic mechanical properties therefore remains a distinct possibility, but further 
progress is hampered by the difficulty of fabricating well-defined substrata. An ideal substratum would 
be a flat, featureless, and chemically uniform surface with anisotropic viscoelastic properties, and it is 
possible that liquid crystal surfaces will provide the answer. 

Environments with Specific Arrangements of Neighboring Cells 

Although contact inhibition of locomotion is of primary importance in determining patterns that develop 
in populations of cells, it is not easy to control the effects of contact inhibition in culture. If the cells are 
seeded on the substratum at nonuniform density, the response will generally cause cell locomotion to 
be biased in the direction of decreasing cell density. This can give a unidirectional bias if superimposed 
on a guidance response, and it has been conjectured that certain cell migrations in vivo are biased in this 
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way. Cellular contact responses also can lead to the mutual orientation of confluent neighboring cells, 
and this can lead to wide regions of cooriented cells arising spontaneously in uniformly seeded cultures. 
A typical culture arrangement from studying contact inhibition is to seed two dense populations of 
cells, often primary explants of tissue, about 1 mm apart on the substratum [Abercrombie and Heaysman, 
1976]. Homologous contact inhibition causes the cells to migrate radially from these foci, usually as 
confluent sheets, until the two populations collide. With noninvasive cells, their locomotion is much 
reduced after the populations have met, and there is little intermixing of the population boundary. If 
one of the two populations is of an invasive type, however, failure of heterologous contact inhibition will 
cause it to infiltrate the other population, and their invasiveness can be measured by the depth of 
interpenetration. 



Defining Terms 

Associated movements: Occur when cells passively change position as a result of external forces, 

generated by cell motility elsewhere, that are transmitted either through the extracellular matrix 

or through cell-cell contacts. 
Cell motility: A blanket term that covers all aspects of movement actively generated by a cell. It includes 

changes in cell shape, cell contraction, protrusion and retraction of processes, intracellular motility, 

and cell translocation. 
Cell translocation, cell locomotion, or cell migration: All describe active changes in position of a cell 

in relation to its substratum. The translocation of tissue cells always requires a solid or semisolid 

substratum. In seeking a more rigorous definition, positions must first be defined for both the cell 

and its substratum. This is not always easy, since both may change continually in shape. 
Chemoaffinity: The directional translocation of cells or extension of cellular protrusions along narrow 

tracks of specific molecules adsorbed to the substratum. 
Chemokinesis: A kinesis (q.v.) in which the stimulating scalar property is the concentration of some 

chemical. 
Chemotaxis: The directional translocation of cells in a concentration gradient of some chemoattractant 

or chemorepellent substance. 
Chemotropism: The directional extension of a cellular protrusion or multicellular process in a con- 
centration gradient of some chemoattractant or chemorepellent substance. 
Contact guidance: The directional translocation of cells in response to some anisotropic property of 

the substratum. 
Contact inhibition of locomotion: Occurs when a cell collides with another and is halted and/or 

redirected so that it does not use the other cell as a substratum. 
Desmotaxis: Describes a unidirectional bias of cell translocation in a fibrillar matrix that is allowed to 

attach to a solid support and then oriented by shear flow. 
Diataxis: A taxis (q.v.) in which translocation perpendicular to the field vector predominates. This 

leads to a bidirectional bias in two dimensions. 
Directed motile responses: The responses of cells to specific properties of their environment that can 

control the direction of cell translocation or, in the case of nerve growth, for example, can control 

the direction of extension of a cellular protrusion. 
Guidance: Used here to indicate a directed response to some high-order-tensor-like property of the 

environment in which opposite directions are equivalent. Translocation is biased bidirectionally 

in two dimensions. 
Guidance by differential adhesiveness: A form of guidance by substratum availability (q.v.) in which 

cell locomotion is wholly or largely confined to narrow tracks of higher adhesiveness on the 

substratum. The response may be absent or much reduced if the cell spans several parallel tracks. 
Guidance by substratum availability: Occurs when the translocation of a cell is confined to an isolated 

narrow track either because no alternative substratum exists or because the cell is unable to adhere to it. 
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Guidance by substratum shape, topographic guidance, or morphographic guidance: All refer to the 

guidance of cells by the shape or texture of the substratum. It is not known whether all types of 

morphographic guidance are due to a common mechanism. 
Haptotaxis: The tendency of cells to translocate unidirectionally up a steep gradient of increasing 

adhesiveness of the substratum. 
Heterologous contact inhibition: The contact inhibition of locomotion (q.v.) that may occur when a 

cell collides with another of different type. Contact inhibition is called nonreciprocal when the 

responses of the two participating cells are different. A cell type that is invasive with respect to 

another will generally fail to show contact inhibition in heterologous collisions. 
Homologous contact inhibition: The contact inhibition of locomotion (q.v.) that may occur when a cell 

collides with another of the same type. It is not always appreciated that invasive cell types can show 

a high level of homologous contact inhibition. 
Kinesis: The dependence of some parameter of locomotion, usually speed or rate of turning, on some 

scalar property of the environment. In an adaptive kinesis, the response is influenced by the rate 

of change of the scalar property, and if the environment is stable but spatially nonuniform, this 

can lead to behavior indistinguishable from a taxis (q.v.). Current nomenclature is inadequate to 

deal with such situations [Dunn, 1990]. 
Oblate guidance: A form of guidance in three dimensions in which translocation is suppressed along 

a single axis. 
Prolate guidance: A form of guidance in three dimensions in which translocation along a single axis 

predominates. 
Stereotropism: A form of guidance by substratum availability (q.v.) in which the only solid support 

available for locomotion consists of isolated narrow fibers. 
Taxis: A directed response to some vectorlike property of the environment. Translocation is usually 

biased unidirectionally, either along the field vector or opposite to it, except in the case of diataxis 

(q.v.). 
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Tissue development is regulated by a complex set of events in which cells of the developing organ interact 
with each other, with general and specific growth factors, and with the surrounding extracellular matrix 
(ECM) [Long, 1992]. These interactions are important for a variety of reasons such as localizing cells 
within the microenvironment, directing cellular migration, and initiating growth-factor-mediated devel- 
opmental programs. It should be realized, however, that simple interactions such as those between cells 
and growth factor are not the sole means by which developing cells are regulated. Further complexity 
occurs via interactions of cells and growth factors with extracellular matrix or via other interactions 
which generate specific developmental responses. 

Developing tissue cells interact with a wide variety of regulators during their ontogeny. Each of these 
interactions is mediated by defined, specific receptor-ligand interactions necessary to stimulate both the 
cell proliferation and/or motility. For example, both chemical and/or extracellular matrix gradients exist 
which signal the cell to move along "tracks" of molecules into a defined tissue area. As well, high 
concentrations of the attractant, or other signals, next serve to "localize" the cell, thus stopping its 
nonrandom walk. These signals which stop and/or regionalize cells in appropriate microenvironments 
are seemingly complex. For example, in the hematopoietic system, complexes of cytokines and extracel- 
lular matrix molecules serve to localize progenitor cells [Long et al., 1992], and similar mechanisms of 
cell/matrix/cytokine interactions undoubtedly exist in other developing systems. Thus, the regulation of 
cell development, which ultimately leads to tissue formation is a complex process in which a number of 
elements work in cohort to bring about coordinated organogenesis: stromal and parenchymal cells, 
growth factors and extracellular matrix. Each of these is a key component of a localized and highly 
organized microenvironmental regulatory system. 

Cellular interactions can be divided into three classes: cell-cell, cell-extracellular matrix, and cell- 
growth factor, each of which is functionally significant for both mature and developing cells. For example, 
in a number of instances blood cells interact with each other and/or with cells in other tissues. Immu- 
nologic cell-cell interactions occur when lymphocytes interact with antigen-presenting cells, whereas 
neutrophil or lymphocyte egress from the vasculature exemplifies blood cell-endothelial cell recognition. 
Interactions between cells and the extracellular matrix (the complex prontinaceous substance surrounding 
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FIGURE 118.1 Hematopoietic cellular interactions. This figure illustrates the varying complexities of putative 
hematopoietic cell interactions. A conjectural complex is shown in which accessory cell-stromal cell, and stromal 
cell-PG-growth factor complexes localize developmental signals. ECM = extracellular matrix, gag = glycosaminogly- 
can side chain bound to proteoglycan (PG) core protein (indicated by cross-hatched curved molecule); IL-1 = 
interleukin-1; GM-CSF = granulocyte-macrophage colony-stimulating factor. Modified from Long [1992] and 
reprinted with permission. 

cells) also play an important role. During embryogenesis matrix molecules are involved both in cell 
migration and in stimulating cell function. Matrix components are also important in the growth and 
development of precursor cells; they also serve either as cytoadhesion molecules for these cells or to 
compartmentalize growth factors within specific microenvironmental locales. For certain tissues such as 
bone marrow, a large amount of information exists concerning the various components of the nature of 
the microenvironment. For others such as bone, much remains to be learned of the functional microen- 
vironment components. Many experimental designs have examined simple interactions (e.g., cell-cell, 
cell-matrix). However, the situation in vivo is undoubtedly much more complex. For example, growth 
factors are often bound to matrix molecules which, in turn, are expressed on the surface of underlying 
stromal cells. Thus, very complex interactions occur (e.g., accessory cell-stromal cell-growth 
factor-progenitor cell-matrix, see Fig. 118.1), and these can be further complicated by a developmental 
requirement for multiple growth factors. 

The multiplicity of tissue-cell interactions requires highly specialized cell surface structures (i.e., 
receptors) to both mediate cell adhesion and transmit intracellular signals from other cells, growth factors, 
and/or the ECM. Basically, two types of receptor structures exist. Most cell surface receptors are proteins 
which consist of an extracellular ligand-binding domain, a hydrophobic membrane-spanning region, 
and a cytoplasmic region which usually functions in signal transduction. The amino acid sequence of 
these receptors often defines various families of receptors (e.g., immuno-globulin and integrin gene 
superfamilies). However, some receptors are not linked to the cell surface by protein, as certain receptors 
contain phosphotidylinositol-based membrane linkages. This type of receptor is usually associated with 
signal transduction events mediated by phospholipase C activation [Springer, 1990]. Other cell surface 
molecules important in receptor-ligand interactions are surface proteins which function as a coreceptors. 
Coreceptors function with a well-defined receptor, usually to amplify stimulus-response coupling. 

The goal of this chapter is to examine the common features of each component of the microenviron- 
ment (cellular elements, soluble growth factors, and extracellular matrix). As each tissue or organ 
undergoes its own unique and complex developmental program, this review cannot cover these elements 
for all organs and tissue types. Rather, two types of microenvironments (blood and bone) will be 
compared in order to illustrate commonalties and distinctions. 

118.1 Cellular Elements 

Cells develop in a distinct hierarchical fashion. During the ontogeny of any organ, cells migrate to the 
appropriate region for the nascent tissue to form and there undergo a phase of rapid proliferation and 
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differentiation. In tissues which retain their proliferative capacity (bone marrow, liver, skin, the gas- 
trointestinal lining, and bone), the complex hierarchy of proliferation cells is retained throughout life. This 
is best illustrated in the blood-forming (hematopoiesis) system. Blood cells are constantly produced, such 
that approximately 70-times an adult human's body weight of blood cells is produced through the human 
life span. This implies the existence of a very primitive cell type that retains the capacity for self-renewal. 
This cell is called a stem cell, and it is the cell responsible for the engraphment of hematopoiesis in recipients 
of bone marrow transplantation. Besides a high proliferative potential, the stem cell also is characterized by 
its multipotentiality in that it can generate progeny (referred to as progenitor cells) which are committed to 
each of the eight blood cell lineages. As hematopoietic cells proliferate, they progressively lose their prolifer- 
ative capacity and become increasingly restricted in lineage potential. As a result, the more primitive progen- 
itor cells in each lineage produce higher colony numbers, and the earliest cells detectable in vitro produce 
progeny of 2-3 lineages (there is no in vitro assay for transplantable stem cells). Similar stem cell hierarchies 
exist for skin and other regenerating tissues, but fewer data exist concerning their hierarchical nature. 

The regulation of bone cell development is induced during bone morphogenesis by an accumulation 
of extracellular and intracellular signals [Urist et al., 1983a]. Like other systems, extracellular signals are 
known to be transferred from both cytokines and extracellular matrix molecules [Urist et al., 1983a] to 
responding cell surface receptors, resulting in eventual bone formation. The formation of bone occurs 
by two mechanisms. Direct development of bone from mesenchymal cells (referred to as intramembrcmous 
ossification, as observed in skull formation) occurs when mesenchymal cells directly differentiate into 
bone tissue. The second type of bone formation (the endochondrial bone formation of skeletal bone) 
occurs via an intervening cartilage model. Thus, the familiar cell hierarchy exists in the development and 
growth of long bones, beginning with the proliferation of mesenchymal stem cells, their differentiation 
into ostogenic progenitor cells, and then into osteoblasts. The osteoblasts eventually calcify their sur- 
rounding cartilage and/or bone matrix to form bone. Interestingly, the number of osteoporgenitor cells 
in adult bone seems too small to replace all the large mass of bone normally remodeled in the process 
of aging of the skeleton [Urist et al., 1983a]. Observations from this laboratory confirm this concept by 
showing that one (unexpected) source of bone osteoprogenitor cells is the bone marrow [Long et al., 
1990; Long & Mann, 1993] . This reduced progenitor cell number also implies that there is a disassociation 
of bone progenitor cell recruitment from subsequent osteogenic activation and bone deposition and 
further suggests multiple levels of regulation in this process {vide infra). 

As mentioned, cell-cell interactions mediate both cellular development and stimulus-response cou- 
pling. When coupled with other interactions (e.g., cell-ECM), such systems represent a powerful mech- 
anism for directing and/or localizing developmental regulation. Further, combinations of these 
interactions potentially can yield lineage-specific or organ-specific information. Much of out understand- 
ing of cell-cell interactions comes from the immune system and from the study of developing blood cells 
and their interactions with adjacent stromal cells [Dexter, 1982; Gallatin et al., 1986; Springer, 1990]. For 
example, the isolation and cloning of immune cell ligands and receptors resulted in the classification of 
gene families which mediate cell-cell interactions within the immune and hematopoietic systems, and 
similar systems undoubtedly play a role in the development of many tissues. 

There are three families of molecules which mediate cell-cell interactions (Table 114.1). The immu- 
noglobulin superfamily is expressed predominantly on cells mediating immune and inflammatory 
responses (and is discussed only briefly here). The integrin family is a large group of highly versatile 
proteins which is involved in cell-cell and cell-matrix attachment. Finally, the selectin family is comprised 
of molecules which are involved in lymphocyte, platelet, and leukocyte interactions with endothelial cells. 
Interestingly, this class of cell surface molecules utilizes specific glycoconjungates (encoded by gly- 
coslytransferase genes) as their ligands on endothelial cell surfaces. 

Immunoglobulin Gene Superfamily 

These molecules function in both antigen recognition and cell-cell communication. The immunglobulin 
superfamily (Table 118.2) is defined by a 90-100 base pair immunoglobulinlike domain found within a 
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TABLE 118.1 Cell Adhesion Molecule Superfamilies 

Immunoglobulin superfamily of adhesion receptors 

LFA 2 (CD2) T-Cell receptor (CD3) 

LFA 3 (CD58) CD4 (TCR coreceptor) 

ICAM 1 (CD54) CD8 (TCR coreceptor) 

ICAM 2 MHC class I 

VCAM-1 MHC class II 

Integrins 

Pi integrins (VLA proteins) 
P150.95 (CD11 C /CD18) 
VLA 1-3,6 

VLA 4(LPAM 1, C049d/C029) 
Fibronectin receptor (VLA 5, CD-/CD29) 
LPAM2 
(3 2 Integrins 

LFA 1 (CDll a /CD18) 
Mac 1 orMol(CDll b /CD18) 
(3 3 Integrins 

Vitronectin receptor 
Platelet gp-IIb/IIa 
Selectin/LEC-CAMS 

Mell4 (LE-CAM-1, LHR, LAM-1, Leu 8, Ly 22, gp90 MEL) 

ELAM-1 (LE-CAM-2) 

GMP 140 (LE-CAM 3, PADGEM, CD 62) 

Source: Originally adapted from Springer [1990] and 
Brandley and coworkers [1990] and reprinted from Long 
[1992] with permission. 



dimer of two antiparallel (3 strands [Sheetz et al., 1989; Williams & Barclay, 1988]; for a review, see 
Springer [1990]. Two members of this family, the T-cell receptor and immunoglobulin, function in 
antigen recognition. The T-cell receptor recognizes antigenic peptides in the context of two other mol- 
ecules on the surface of antigen-presenting cells: major histocompatibility (MHC) class I and class II 
molecules [Bierer et al., 1989; Sheetz et al., 1989; Springer, 1990]. Whereas the binding of T-cell receptor 
to MHC/antigenic peptide complexes seems sufficient for cell-cell adhesion, cellular activation also 
requires the binding of either of two coreceptors, CD8 or CD4. Neither coreceptor can directly bind the 
MHC complex, but, rather, each seems to interact with the T-cell receptors to synergistically amplify 
intercellular signaling [Shaw et al., 1986; Spits et al., 1986; Springer, 1990]. 

Integrin Gene Superfamily 

Integrin family members are involved in interactions between cells and extracellular matrix proteins 
[Giancotti & Ruoslahti, 1990]. Cell attachment to these molecules occurs rapidly (within minutes) and 
is a result of increased avidity rather than increased expression (see Lawrence and Springer [1991] and 
references therein). The binding sequence within the ligand for most, but not all, integrins is the tripeptide 
sequence arganine-glycine-asparagine (RGD) [Ruoslahti & Pierschbacher, 1987]. Structurally, integrins 
consist of two membrane-spanning alpha and beta chains. The alpha subunits contain three to four 
tandem repeats of a divalent-ion-binding motif and require magnesium or calcium to function. The 
alpha chains are (usually) distinct and bind with common or related (3 subunits to yield functional 
receptors [Giancotti & Ruoslahti, 1990]. The (3 subunits of integrins have functional significance, and 
integrins can be subclassified based on the presence of a given beta chain. Thus, integrins containing (31 
and (33 chains are involved predominantly in cell- extracellular matrix interactions, whereas molecules 
containing the (32 subunits function in leukocyte-leukocyte adhesion (Tables 118.1 and 118.2). The 
cytoplasmic domain of many integrin receptors interacts with the cytoskeleton. For example, several 
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TABLE 118.2 Cell Surface Molecules Mediating Cell-Cell Interactions 





Receptor: Cell 


Ligand, Co-, or 


Ligand Co- or Counter- 




Cell Receptor 


Expression 


Counter-Receptor 


Receptor Cell 


Expression 


References 


Ig superfamily 












MHCI 


Macroph, T cell 


CD8,TCR 


T cells 




Springer 1990 


MHCII 


Macroph, T cell 


CD4, TCR 


T cells 




Springer 1990 


ICAM-1 


Endo, neut. HPC, 


LFA-1 


Mono, T and B cells 


Springer 1990 




B cells, T cells, 












macroph* 










ICAM-2 


Endo 


LFA-1 


Mono, T and B cells 


Springer 1990 


LFA-2 


T cells 


LFA-3 


T cells, eryth 




Springer 1990 


Integrins 












Macl 


Macroph, neut 


Fibrinogen, C3bi 


Endo, pits 




Springer 1990 


LFA-1 


Macroph, neut 


(See above) 


(See above) 






VCAM 


Endo 


VLA4 


Lymphocytes, 
B cells 


monocytes 


Brandley et al., 1990 
Miyamake, 1990 


gp 150,95 


Macro, neut 


(See above) 


(See above) 






FN-R 


Eryth lineage 


Fibronectin 


N.A. 




see Table 118.3 


Ilb/IIIa 


Pits, mk 


Fibrinogen, TSP, VN 
vWF 


Endo 




Springer, 1990 


Selectins 












LEC-CAM-1 


Endo 


Addressins, neg. charged 


Lymphocytes 




Brandley et al., 1990 


(Mel 14) 




oligosaccrides 








ELAM-1 


Endo 


sialyl-Lewis X f 


Endo* 




Lowe et al., 1990 


(LE-CAM-2) 






Neut, tumor cells 




LEC-CAM-3 


Pit gran 


Lewis X 


Endo 




Brandley et al., 1990 


(GMP-140) 


Weible-Palade 
bodies, endo 


(CD15) 


Neut 







*Source: Modified from Long [1992] and reprinted with permission, macroph = macrophage; Mono = monocyte; 
Endo = endothelial cell; Eryth = erythroid cells; Pits = platelets; Neut = neutrophil; Mk = megakarocyte. 
tConstituatively expressed by few cells, upregulated by TNF- and IL-1. 
'Sialylated, fucosylated lactosaminoglycans. Modified from Long [1992] and reprinted with permission. 

integrins are known to localize near focal cell contacts were actin bundles terminate [Giancotti & 
Ruoslahti, 1990; Springer, 1990]. As a result, changes in receptor binding offer an important mechanism 
for linking cell adhesion to cytoskeletal organization. 

Selectins 

The selectin family of cell-adhesion receptors contains a single N-terminus, calcium-dependent, lectin- 
binding domain, an EGF receptor (EFGR) domain, and a region of cysteine-rich tandem repeats (from two 
to seven) which are homologous to complement-binding proteins [Bevilacqua et al., 1989; Springer, 1990; 
Stoolman, 1989]. Selectins (e.g., MEL14, gp90MEL, ELAM-1, and GMP140/PADGEM, Table 118.2) are 
expressed on neutrophils and lymphocytes. They recognize specific glycoconjugate ligands on endothelial 
and other cell surfaces. Early studies demonstrated that fucose or mannose could block lymphocyte attach- 
ment to lymph node endothelial cells [Brandley etal., 1990]. Therefore, the observation that the selectin 
contain a lectin-binding domain [Bevilacqua et al., 1989] led to the identification of the ligands for two 
members of this family; for review see Brandley and coworkers [1990]. Lowe and coworkers first demon- 
strated that alpha (1,3/1,4) fucosyltransferase cDNA converted nonmyeloid COS or CHO cells to selectin 
(sialyl-Lewis X) positive cells which bound to both HL60 cells and neutrophils in an ELAM-1 -dependent 
manner [Lowe et al., 1990]. Conversely, Goelz and coworkers screened an expression library using a mon- 
oclonal antibody which inhibited ELAM-mediated attachment which yielded a novel alpha (1,3) fucosyl- 
transferase whose expression conferred ELAM binding activity on target cells [Goelz et al., 1990]. 

Unlike mature neutrophils and lymphocytes, information on cell-cell interactions among hematopoi- 
etic progenitor cells is less well developed (see Table 118.3). Data concerning the cytoadhesive capacities 
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TABLE 118.3 Hematopoietic Cell-Stromal Cell Interactions (Unknown 
Receptor- Ligand) 



Cell Phenotype 


Stromal Cell 


References 


B cells 


Fibroblasts 


Ryan et al„ 1990; Witte et al., 1987 


Pre-B cell 


Heter stroma 


Palacios et al, 1989 


BFC-E 


Fibroblasts 


Tsai et al., 1986, Tsai et al., 1987 


CFC-S 


Fibroblasts (NIH 3T3) 


Roberts et al., 1987 


CFC-S, Bl-CFC 


Heter stroma* 


Gordon et al., 1985, 1990a, 19906 


CFC-GM 


Heter stroma 


Tsai et al., 1987; Campbell et al., 1985 


CFC-Mk 


Heter stroma 


Tsai et al., 1987; Campbell et al., 1985 



'Methylprednisolone stimulated stromal cells, unstimulated fail to bind — see Gordon 
and coworkers [1985]. Heter = heterologous; BFC-E = burst forming cell-erythroid; 
CFC = colony-forming cell; S = spleen; Bl = blast; GM = granulocyte/macrophage; 
Mk = megakaryocyte. From Long [1992], reprinted with permission. 

of hematopoietic progenitor cells deal with the interaction of these cells with underlying, preestablished 
stromal cell layers [Dexter, 1982]. Gordon and colleagues documented that primitive hematopoietic 
human blast-colony forming cells (Bl-CFC) adhere to performed stromal cell layers [Gordon et al., 1985, 
1990&] and showed that the stromal cell ligand is not one of the known cell adhesion molecules [Gordon 
etal., 1987a]. Other investigators have shown that hematopoietic (CD34-selected) marrow cell popula- 
tions attach to stromal cell layers and that the attached cells are enriched for granulocyte-macrophage 
progenitor cells [Liesveld etal., 1989]. Highly enriched murine spleen colony-forming cells (CFC-S) 
attach to stromal cell layers, proliferate, and differentiate into hematopoietic cells [Spooncer et al., 1985]. 
Interestingly, underlying bone marrow stromal cells can be substituted for by NIH 3T3 cells [Roberts 
etal., 1987], suggesting that these adherent cells supply the necessary attachment ligand for CFU-S 
attachment [Roberts et al, 1987; Yamazaki et al, 1989]. 

118.2 Soluble Growth Factors 

Soluble specific growth factors are an obligate requirement for the proliferation and differentiation of 
developing cells. These growth factors differ in effects from the endocrine hormones such as anabolic 
steroids or growth hormone. Whereas the endocrine hormones affect general cell function and are 
required and/or important to tissue formation, their predominant role is one of homeostasis. Growth 
factors, however, specifically drive the developmental programs of differentiating cells. Whether these 
function in a permissive or an inductive capacity has been the subject of considerable past controversy, 
particularly with respect to blood cell development. However, the large amount of data demonstrating 
linkages between receptor-ligand interaction and gene activation argues persuasively for an induc- 
tive/direct action on gene expression and, hence, cell proliferation and differentiation. 

Again, a large body of knowledge concerning growth factors comes from the field of hematopoiesis 
(blood cell development). Hematopoietic cell proliferation and differentiation is regulated by numerous 
growth factors; for reviews see Metcalf [1989] and Arai and colleagues [1990]. Within the last decade 
approximately 29 stimulatory cytokines (13 interleukins, M-CSF, erythroprotein, G-CFS, GM-CSF, c-kit 
ligand, gamma-interferon, and thrombopoietin) have been molecularly cloned and examined for their 
function in hematopoiesis. Clearly, this literature is beyond the scope of this review. However, the recent 
genetic cloning of eight receptors for these cytokines has led to the observation that a number of these 
receptors have amino acid homologies [Arai et al., 1990], showing that they are members of one or more 
gene families (Table 118.4). Hematopoietic growth factor receptors structurally contain a large extracel- 
lular domain, a transmembrane region, and a sequence-specific cytoplasmic domain [Arai et al., 1990]. 
The extracellular domains of interleukin-1, interleukin-6, and gamma-interferon are homologous with 
the immunoglobulin gene superfamily, and weak but significant amino-acid homologies exist among the 
interleukin-2 (beta chain), IL-6, IL-3, IL-4, erythropoietin, and GM-CSF receptors [Arai et al., 1990]. 
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TABLE 118.4 Hematopoietic Growth Factor Receptor Families 

Receptors with homology to the immunoglobulin gene family 

Interleukin- 1 receptor 

Interleukin-6 receptor 

Gamma-interferon receptor 
Hematopoietic growth factor receptor family 

Interleukin-2 receptor (P-chain) 

Interleukin-3 receptor 

Interleukin-4 receptor 

Interleukin-6 receptor 

Erythropoietin receptor 

G/M-CSF receptor 

Source. Modified from Long [1992]. Reprinted with permission. 

Like other developing tissues, bone responds to bone-specific and other soluble growth factors. TGF-P 
is a member of a family of polypeptide growth regulators which affects cell growth and differentiation 
during developmental processes such as embryogenesis and tissue repair [Sporn & Roberts, 1985]. TGF-P 
strongly inhibits proliferation of normal and tumor-derived epithelial cells and blocks adipogenesis, 
myogenesis, and hematopoiesis [Sporn & Roberts, 1985]. However, in bone, TGF-P is a positive regulator. 
TGF-P is localized in active centers of bone differentiation (cartilage canals and osteocytes) [Massague, 
1987], and TGF-P is found in high quantity in bone, suggesting that bone contains the greatest total 
amount of TGF-P [GehronRobey et al., 1987; Massague, 1987]. During bone formation, TGF-P promotes 
chrondrogenesis [Massague, 1987] — an effect presumably related to its ability to stimulate the deposition 
of extracellular matrix (ECM) components [Ignotz & Massague, 1986]. Besides stimulating cartilage 
formation, TGF-P is synthesized and secreted in bone cell cultures and stimulates the growth of subcon- 
fiuent layers of fetal bovine bone cells, thus showing it to be an autocrine regulator of bone cell devel- 
opment [Sporn & Roberts, 1985]. 

In addition to TGF-P, other growth factors or cytokines are implicated in bone development. Urist 
and coworkers have been able to isolate various regulatory proteins which function in both in vivo and 
in vitro models [Urist etal., 1983&]. Bone morphogenic protein (BMP), originally an extract of dem- 
ineralized human bone matrix, has now been cloned [Wozney et al., 1988] and, when implanted in vivo, 
results in a sequence of events leading to functional bone formation [Muthukumaran & Reddi, 1985; 
Wozney et al., 1988]. The implanting of BMP is followed by mesenchymal cell migration to the area of 
the implant, differentiation into bone progenitor cells, deposition of new bone, and subsequent bone 
remodeling to allow the establishment of bone marrow [Muthukumaran & Reddi, 1985]. A number of 
additional growth factors which regulate bone development exist. In particular, bone-derived growth 
factors (BDGF) stimulate bone cells to proliferate in serum-free media. [Hanamura et al., 1980; Linkhart 
et al., 1986]. However, these factors seem to function at a different level from BMP [Urist et al., 1983a] . 

118.3 Extracellular Matrix 

The extracellular matrix (ECM) varies in its tissue composition throughout the body and consists of 
various molecules such as laminin, collagens, proteoglycans, and other glycoproteins [Wicha et al., 1982]. 
Gospodarowicz and coworkers demonstrated that ECM components greatly affect corneal endothelial 
cell proliferation in vitro [Gospodarowicz et al., 1980; Gospodarowicz & 111, 1980]. Studies by Reh and 
coworkers indicate that the ECM protein laminin is involved in inductive interactions which give rise to 
retinal-pigmented epithelium [Reh & Gretton, 1987]. Likewise, differentiation of mammary epithelial 
cells is profoundly influenced by ECM components; mammary cell growth in vivo and in vitro requires 
type IV collagen [Wicha etal., 1982]. A number of investigations elucidated a role for ECM and its 
components in hematopoietic cell function. These studies have identified the function of both previously 
known and newly identified ECM components in hematopoietic cell cytoadhesion (Table 118.5). 
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TABLE 1 18.5 Protiens and Glycoprotiens Mediating Hematopoietic Cell-Extracellular Matrix Interactions 



Matrix Component 
Fibronectin FnR 



Cell Surface 
Receptor 



Ilb/IIIa 
Thrombospondin TSP-R 



Hyaluronic acid CD44 



Hemonectin 



Unk 



Proteoglycans: 

Heparan sulfate Unk 
Unfract ECM Unk 



Cellular Expression 

Erythroid; BFC-E; B cells; 
Lymphoid cells; HL60 cells 

Platelets and megakaryocytes 
Monocytes and platelets 
Human CFC, 
CFC-GEMM 
T and B cells 

Neutrophils Tumor cells 
CFC-GM, BFC-E Immat. 
neutr. BFU-E 

Bl-CFC 

Bl-CFC, bm Stroma 



References 

Patel, 1984, 1986, 1987, Patel etal., 1985; 

Ryan et al., 1990; Tsai et al„ 1987; 

Van de Water et al., 1988 
Giancotti et al., 1987 

Silverstein and Nachman, 1987; Leung, 1984 
Long and Dixit, 1990 
Long and Dixit, 1990 
Aruffo etal., 1990, Dorshkind, 1989; Horst 

et al, 1990; Miyake et al., 1990 

Campbell etal., 1985, 1987, 1990 



Gordon, 1988; Gordon etal., 1988 
Gordon et al., 1988; Campbell et al., 1985 



R = receptor; BFC-E and erythroid progenitor cell = the burst-forming cell-erythrocyte; HL60 = a promyelocytic 
leukemia cell line; CFC = colony-forming cell; GEMM = granulocyte erythrocyte macrophage megakarocyte; GM = 
granulocyte/macrophage; unk = unknown; Bl = blast cell; bm = bone marrow. Modified and reprinted from Long 
[1992] with permission. 

As mentioned, soluble factors, stromal cells, and extracellular matrix (the natural substrate surround- 
ing cells in vivo) are critical elements of the hematopoietic microenvironment. Work by Wolf and Trenton 
on the hematopoietic microenvironment in vivo provided the first evidence that (still unknown) com- 
ponents of the microenvironment are responsible for the granulocytic predominance of bone marrow 
hematopoiesis and the erythrocytic predominance of spleen [Wolf & Trentin, 1 968 ] . Dexter and coworkers 
observed that the in vitro development of adherent cell populations is essential for the continued 
proliferation and differentiation of blood cells in long-term bone marrow cell cultures [Dexter & Lajtha, 
1974; Dexter etal., 1976]. These stromal cells elaborate specific ECM components such as laminin, 
fibronectin, and various collagens and proteoglycans, and the presence of these ECM proteins coincided 
with the onset of hematopoietic cell proliferation [Zuckerman & Wicha, 1983]. The actual roles for 
extracellular matrix versus stromal cells in supporting cell development remains somewhat obscure, as 
it often is difficult to disassociate stromal cell effects from those of the ECM, since stromal cells are 
universally observed to be enmeshed in the surrounding extracellular matrix. 

Bone extracellular matrix contains both collagenous and noncollagenous proteins. A number of non- 
collagenous matrix proteins, isolated from demineralized bone, are involved in bone formation. 
Osteonectin is a 32 kDa protein which, binding to calcium, hydroxypatite, and collagen, is felt to initiate 
nucleation during the mineral phase of bone deposition [Termine etal., 1981]. In vivo analysis of 
osteonectin message reveals its presence in a variety of developing tissues [Holland et al., 1987; Nomura 
et al., 1988]. However, osteonectin is present in its highest levels in bones of the axial skeleton, skull, and 
the blood platelet (megakaryocyte) [Nomura et al., 1988] . Bone gla protein (BGP, osteocalcin) is a vitamin 
K-dependent, 5700 Da calcium-binding bone protein which is specific for bone and may regulate Ca 2 + 
deposition [Price etal., 1976, 1981; Termine etal., 1981]. Other bone proteins seem to function as 
cytoadhesion molecules [Oldberg etal., 1986; Somerman etal., 1987] or have unresolved functions 
[Reddi, 1981]. Moreover, bone ECM also contains a number of the more common mesenchymal growth 
factors such as PDGF, basic, and acidic fibroblast growth factor [Canalis, 1985; Hauschka et al., 1986; 
Tinkhart et al., 1986; Urist et al., 1983a]. These activities are capable of stimulating the proliferation of 
mesenchymal target cells (BATB/c 3T3 fibroblasts, capillary endothelial cells, and rat fetal osteoblasts). 
As well, bone-specific proliferating activities such as the BMP exist in bone ECM. Although these general 
and specific growth factors undoubtedly play a role in bone formation, little is understood concerning 
the direct inductive/permissive capacity of bone-ECM or bone proteins themselves on human bone cells 



e 2000 by CRC Press LLC 



or their progenitors. Nor is the role of bone matrix in presenting growth factors understood — such 
"matricrine" (factor-ECM) interactions may be of fundamental importance in bone cell development. 

When bone precursor cells are cultured on certain noncollagenous proteins, they show an increase in 
proliferation and bone protein expression (MWL, unpublished observation). Moreover, we have shown, 
using the hematopoietic system as a model, that subpopulations of primitive progenitor cells require 
both a mitogenic cytokine and a specific extracellular matrix (ECM) molecule in order to proliferate 
[Long et al„ 1992]. Indeed, without this obligate matrix-cytokine ("matricrine") signal, the most prim- 
itive of blood precursor cells fail to develop in vitro [Long et al., 1992]. Although poorly understood, a 
similar requirement exists for human bone precursor cells, and complete evaluation of osteogenic devel- 
opment (or that of other tissues) thus requires additional studies of ECM molecules. For example, we 
have demonstrated the importance of three bone ECM proteins in human bone cell growth: osteonectin, 
osteocalcin, and type I collagen [Long et al., 1990; Long et al, 1994]. Additional bone proteins such as 
bone sialoprotien and osteopontin are no doubt important to bone structure and function, but their role 
is unknown [Nomura et al., 1988; Oldberg et al., 1986]. 

The above observations on the general and specific effects of ECM on cell development have identified 
certain matrix components which seem to appear as a recurrent theme in tissue development. These are 
proteoglycans, thrombospondin, fibronectin, and the collagens. 

Proteoglycans and Glycosaminoglycans 

Studies on the role of proteoglycans in blood cell development indicate that both hematopoietic cells 
[Minguell & Tavassoli, 1989] (albeit as demonstrated by cell lines) and marrow stromal cells [Gallagher 
et al., 1983; Kirby & Bentley, 1987; Spooncer et al., 1983; Wight et al., 1986] produce various proteogly- 
cans. Proteoglycans (PG) are polyanionic macromolecules located both on the stromal cell surface and 
within the extracellular matrix. They consist of a core protein containing a number of covalently linked 
glycosaminoglycan (GAG) side chains, as well as one or more O- or N-linked oligosaccharides. The GAGs 
consist of nonbranching chains of repeating N-acetylglucosamine or N-acetylglactosamine disaccharide 
units. With the exception of hyaluronic acid, all glycosaminoglycans are sulfated. Interesting, many 
extracellular matrix molecules (fibronectin, laminin, and collagen) contain glycosaminoglycan-binding 
sites, suggesting that complex interactions occur within the matrix itself. 

Proteoglycans play a role in both cell proliferation and differentiation. Murine stromal cells produce 
hyaluronic acid, heparan sulfate, and chondroitin sulfate [Gallagher et al., 1983], and in vitro studies 
show PG to be differentially between stromal cell surfaces and the media, with heparan sulfate being the 
primary cell-surface molecule and chondroitin sulfate the major molecular species in the aqueous phase 
[Spooncer et al., 1983]. In contrast to murine cultures, the human hematopoietic stromal cells in vitro 
contain small amounts of heparan sulfate and large amounts of dermatin and chondroitin sulfate, which 
seem to be equally distributed between the aqueous phase and extracellular matrix [Wight et al., 1986]. 
The stimulation of proteoglycan/GAG synthesis is associated with an increased hematopoietic cell pro- 
liferation, as demonstrated by an increase in the percentage of cells in S-phase [Spooncer et al., 1983]. 
Given the general diversity of proteoglycans, it is reasonable to expect that they may encode both lineage- 
specific and organ-specific information. For example, organ-specific PGs stimulate differentiation, as 
marrow-derived ECM directly stimulates differentiation of human progranulocytic cells (HL60), whereas 
matrix derived from skin fibroblasts lacks this inductive capacity [Luikart et al., 1987]. Moreover, organ- 
specific effects are seen in studies of human blood precursor cell adhesion to marrow-derived heparan 
sulfate but not to heparan sulfates isolated from bovine kidney [Gordon et al., 1988]. 

Interestingly, cell-surface-associated PGs are involved in the compartmentalization or localization of 
growth factors within the microenvironment. Thus, the proliferation of hematopoietic cells in the pres- 
ence of hematopoietic stroma is associated with a glycosaminoglycan-bound growth factor (GM-CSF) 
[Gordon et al., 1987&], and determination of the precise GAG molecules involved in this process (i.e., 
heparan sulfate) has showed that heparan sulfate side chains bind two blood cell growth factors: GM- 
CSF and interleukin-3 [Roberts et al., 1988]. These data imply that ECM components and growth factors 
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combine to yield lineage-specific information and indicate that, when PG- or GAG-bound, growth factor 
is presented to the progenitor cells in a biologically active form. 

Thrombospondin (TSP) 

Thrombospondin is a large, trimeric disulfide-linked glycoprotein (molecular weight 450,000, subunit 
molecular weight 180,000) having a domainlike structure [Frazier, 1987]. Its protease-resistant domains 
are involved in mediating various TSP functions such as cell binding and binding of other extracellular 
matrix proteins [Frazier, 1987]. Thrombospondin is synthesized and secreted into extracellular matrix 
by most cells; for review see Lawler [1986] and Frazier [1987]. Matrix-bound TSP is necessary for cell 
adhesion [Varian et al., 1986], cell growth [Majack et al., 1986], and carcinoma invasiveness [Riser et al., 
1988] and is differentially expressed during murine embryogenesis [O'Shea & Dixit, 1988]. 

Work from our laboratories shows that thrombospondin functions within the hematopoietic microen- 
vironment as a cytoadhesion protein for a subpopulation of human hematopoietic progenitor cells [Long 
et al., 1992; Long & Dixit, 1990]. Interestingly, immunocytochemical metabolic labeling studies show 
that hematopoietic cells (both normal marrow cells and leukemic cell lines) synthesize TSP, deposit it 
within the ECM, and are attached to it. The attachment of human progenitor cells to thrombospondin 
is not mediated by its integrin-binding RGD sequence because this region of the TSP molecule is cryptic, 
residing within the globular carboxy- terminus of the molecule. Thus, excess concentrations of a tetrapep- 
tide containing the RGD sequence did not inhibit attachment of human progenitor cells [Long & Dixit, 
1990], and similar observations exist in other cell systems [Asch et al., 1987; Roberts et al, 1987; Varian 
et al., 1988]. Other studies from this author's laboratories show that bone marrow ECM also plays a 
major role in hematopoiesis in that complex ECM extracts greatly augment LTBMC cell proliferation 
[Campbell et al., 1985] and that marrow- derived ECM contains specific cytoadhesion molecules [Camp- 
bell et al, 1987, 1990; Long et al., 1990, 1992; Long & Dixit, 1990]. 

Fibronectin 

Fibronectin is a ubiquitous extracellular matrix molecule that is known to be involved in the attachment 
of paryenchymal cells to stromal cells [Bentley & Tralka, 1983; Zuckerman &Wicha, 1983]. As with TSP, 
hematopoietic cells synthesize, deposit, and bind to fibronectin [Zuckerman & Wicha, 1983]. Extensive 
work by Patel and coworkers shows that erythroid progenitor cells attach to fibronectin in a develop- 
mentally regulated manner [Patel et al., 1985; Patel & Lodish, 1984, 1986, 1987]. In addition to cells of 
the erythroid lineage, fibronectin is capable of binding lymphoid precursor cells and other cell phenotypes 
[Bernardi et al., 1987; Giancotti et al., 1986]. Structurally, cells adhere to two distinct regions of the 
fibronectin molecule, one of which contains the RGD sequence; the other is within the carboxy terminal 
region and contains a high-affinity binding site for heparan [Bernardi et al., 1987]. 

Collagen 

The role of various collagens in blood cell development remains uncertain. In vitro marrow cells produce 
types I, III, IV, and V collagen [Bentley, 1982; Bentley & Foidart, 1981; Castro-Malaspina et al, 1980; 
Zuckerman and Wicha, 1983], suggesting a role for these extracellular matrix components in the main- 
tenance of hematopoiesis. Consistent with this, inhibition of collagen synthesis with 6-hydroxyprolene 
blocks or reduces hematopoiesis in vitro [Zuckerman et al., 1985]. Type I collagen is the major protein 
of bone, comprising approximately 90 percent of its protein. 

118.4 Considerations for ex Vivo Tissue Generation 

The microenvironmental complexities discussed above suggest that the ex vivo generation of human 
(replacement) tissue (e.g., marrow, liver) will be a difficult process. However, many of the needed tissues 
(liver, marrow, bone, and kidney) have a degree of regenerative or hyperplastic capacity which allows 
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their in vitro cultivation. Thus, in vivo growth of many of these tissues types is routinely performed, 
albeit at varying degrees of success. The best example of this is bone marrow. If unfractionated human 
bone marrow is established in culture, the stromal and hematopoietic cells attempt to recapitulate in 
vivo hematopoiesis. Both soluble factors and ECM proteins are produced [Dexter & Spooncer, 1987; 
Long & Dixit, 1990; Zuckerman & Wicha, 1983], and relatively long-term hematopoiesis occurs. However, 
if long-term bone marrow cultures are examined closely, they turn out to not faithfully reproduce the 
in vivo microenvironment [Dexter & Spooncer, 1987; Spooncer et al., 1985; Schofield & Dexter, 1985]. 
Over a period of 8-12 weeks (for human cultures) or 3-6 months (for murine cultures), cell proliferation 
ceases, and the stromal/hematopoietic cells die. Moreover, the pluripotentiality of these cultures is rapidly 
lost. In vivo, bone marrow produces a wide variety of cells (erythroid, megakaryocyte/platelet, four types 
of myeloid cells, and B-lymphocytes). Human long-term marrow cultures produce granulocytes and 
megakaryocytes for 1-3 weeks, and erythropoiesis is only seen if exogenous growth factors are added. 
Thereafter, the cultures produce granulocytes and macrophages. These data show that current culture 
conditions are inadequate and further suggest that other factors such as rate of fluid exchange (perfusion) 
or that the three-dimensional structure of these cultures is limiting. 

Recent work by Emerson, Palsson, and colleagues demonstrated the effectiveness of altered medium 
exchange rates in the expansion of blood cells in vitro [Caldwell et al., 1991; Schwartz et al., 1991a, 1991 b]. 
These studies showed that a daily 50% medium exchange affected stromal cell metabolism and stimulated 
a transient increase in growth factor production [Caldwell et al., 1991]. As well, these cultures underwent 
a 10-fold expansion of cell numbers [Schwartz et al., 1991&]. While impressive, thee cultures nonetheless 
decayed after 10-12 weeks. Recently, this group utilized continuous-perfusion bioreactors to achieve a 
longer ex vivo expansion and showed a 10-20-fold expansion of specific progenitor cell types [Roller 
et al., 1993]. These studies demonstrate that bioreactor technology allows significant expansion of cells, 
presumably via better mimicry of in vivo conditions. 

Another aspect of tissue formation is the physical structure of the developing organ. Tissues exist as 
three-dimensional structures. Thus, the usual growth in tissue culture flasks is far removed from the in 
vivo setting. Essentially, cells grown in vitro proliferate at a liquid/substratum interface. As a result, 
primary tissue cells grow until they reach confluence and then cease proliferating, a process known as 
contact inhibition. This, in turn, severely limits the degree of total cellularity of the system. For example, 
long-term marrow cultures (which do not undergo as precise a contact inhibition as do cells from solid 
tissues) reach a density of 1-2 x 10 6 per milliliter. This is three orders of magnitude less than the average 
bone marrow density in vivo. A number of technologies have been applied to this three-dimensional 
growth problem (e.g., hollow fibers). However, the growth of cells on or in a nonphysiologic matrix is 
less than optimal in terms of replacement tissue, since such implants trigger a type of immune reaction 
(foreign body reaction) or are thrombogenic. 

Recently, another type of bioreactor has been used to increase the ex vivo expansion of cells. Rotating 
wall vessels are designed to result in constant, low-shear suspension of tissue cells during their develop- 
ment. These bioreactors thus simulate a microgravity environment. The studies of Goodwin and asso- 
ciates document a remarkable augmentation of mesenchymal cell proliferation in low-shear bioreactors. 
Their data show that mesenchymal cell types show an average three- to six-fold increase in cell density 
in these bioreactors, reaching a cellularity of approximately 10 7 cells/mL [Goodwin et al., 1993a, 1993b] . 
Importantly, this increase in cell density was associated with a 75% reduction in glucose utilization as 
well as an approximate 85% reduction in the enzymatic markers of anabolic cellular metabolism (SGOT 
and LDH) [Goodwin et al., 1993a]. Importantly, further work by Goodwin and colleagues show that the 
growth of mesenchymal cells (kidney and chondrocyte) under low-shear conditions leads to the formation 
of tissuelike cell aggregates which is enhanced by growing these cells on collagen-coated microcarriers 
[Duke et al, 1993; Goodwin et al, 1993a]. 

The physical requirements for optimal bone precursor cell (i.e., osteoprogenitor cells and preosteoblasts) 
proliferation both in vivo and ex vivo are poorly understood. In vivo, bone formation most often occurs 
within an intervening cartilage model (i.e., a three-dimensional framework referred to as endochondral 
ossification). This well- understood bone histogenesis is one of embryonic and postnatal chondrogenesis, 
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which accounts for the shape of bone and the subsequent modification and calcification of bone cell 
ECM by osteoblasts. Recent work in this laboratory has examined the physical requirements for bone 
cell growth. When grown in suspension cultures (liquid/substratum interface), bone precursor cells 
develop distinct, clonal foci. These cells, however, express low amounts of bone-related proteins, and 
they rapidly expand as "sheets" of proliferating cells. As these are nontransformed (i.e., primary) cells, 
they grow until they reach confluence and then undergo contact inhibition and cease proliferating. We 
reasoned that growing these cells in a three-dimensional gel might augment their development. It is 
known from other systems that progenitor cell growth and development in many tissues requires the 
presence of at least one mitogenic growth factor, and that progenitor cell growth in a three-dimensional 
matrix results in the clonal formation of cell colonies by restricting the outgrowth of differentiated 
progeny [Metcalf, 1989]. Thus, bone precursor cells were overlayered with chemically defined, serum- 
free media containing a biopolymer, thus providing the cells with a three-dimensional scaffold in which 
to proliferate. In sharp contrast to bone precursor cell growth at a planar liquid/substratum interface, 
cells grown in a three-dimensional polymer gel show a marked increase in proliferative capacity and an 
increased per-cell production of the bone-specific proteins (MWL, unpublished observations). 

118.5 Conclusions and Perspectives 

As elegantly demonstrated by the composite data above, the molecular basis and function of the various 
components of tissue microenvironments are becoming well understood. However, much remains to be 
learned regarding the role of these and other, as yet unidentified molecules in tissue development. One 
of the intriguing questions to be asked is how each interacting molecule contributes to defining the 
molecular basis of a given microenvironment — for example, the distinct differences in hematopoiesis as 
it exists in the marrow versus the spleen (i.e., a predominance granulopoiesis and erythropoiesis, respec- 
tively) [Wolf & Trentin, 1968]. Another rapidly advancing area is the dissection of the molecular basis 
of cell trafficking into tissues. Again, the immune system offers for the assessment of this process. Thus, 
interaction of the lymphocyte receptors with specific glycoconjugates of vascular addressins [Goldstein 
et al., 1989; Idzerda et al., 1989; Lasky et al., 1989] suggests that similar recognition systems are involved 
in other tissues, particularly the bone marrow. Another interesting observation is that hematopoietic 
progenitor cells synthesize and bind to their own cytoadhesion molecules, independent of the matrix 
molecules contributed by the stromal cells. For example, developing cells in vitro synthesize and attach 
to fibronectin, thrombospondin, and hemonectin, suggesting that these molecules function solely in an 
autochthonous manner to localize or perhaps stimulate development. Such a phenomenon may be a 
generalized process, as we have noted similar patterns of ECM expression/attachment in osteopoietic cell 
cultures. Coupled with data from the bioreactor studies, this suggest that under appropriate biology/phys- 
ical conditions tissue cells may spontaneously reestablish their structure. Finally, the elucidation of the 
various requirements for optimal progenitor cell growth (cell interactions, specific growth factors and 
matrix components, and/or accessory cells which supply them) should allow the improvement of ex vivo 
culture systems to yield an environment in which tissue reconstitution is possible. Such a system would 
have obvious significance in organ-replacement therapy. 
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All tissue is composed of parenchymal (from Greek, that poured in beside) and stromal (Greek, frame- 
work or foundation) cells. Parenchyma are the functional cells of a tissue (e.g., for liver, hepatic paren- 
chymal cells or hepatocytes; for bone marrow, hematopoietic cells), where stroma comprises primarily 
connective tissue elements which, together with their products, form the structural framework of tissue. 
Parenchymal cells can be derivatives of any of the three germ layers, and during development they usually 
grow into areas populated by stromal cells or their progenitors. Under the strictest definition, stromal 
cells are derivatives of mesenchyme and include fibroblasts, osteogenic cells, myofibroblasts, and fat cells 
which appear to arise from a common stem/progenitor cell [Friedenstein et al., 1970; Owen, 1988] 
(Fig. 119.1). Some investigators apply the term stromal cell to all the nonparenchymal cells that contribute 
to the microenvironment of a tissue and include endothelial cells and macrophages (histiocytes) in this 
classification as well [Strobel et al., 1986]. However, the ontogency of both endothelial cells and mac- 
rophages is distinct from that of mesenchymal tissue-derived cells [Wilson, 1983]. In this chapter, the 
more expansive definition of stroma will be used. A partial listing of tissue cells that may influence the 
function of organ parenchyma is in Table 119.1. For the sake of brevity, migrating cells of bone marrow 
origin will not be discussed in the text (e.g., mast cells, B lymphocytes, natural killer cells), although 
these cells can influence parenchyma either directly or via cytokine-mediated modulation of stromal cell 
function. 

Stromal and parenchymal cell components are integrated to form a multifunctional tissue in vivo. 
This chapter will focus on the contribution of stromal cells to the microenvironment and their use in 
culture to support parenchymal function. 

119.1 Tissue Composition and Stromal Cells 

Some similarities in the spatial organization of cells of different tissues are apparent. Epithelial cells are 
a protective and regulatory barrier not just for skin but for all surfaces exposed to blood or to the external 
environment (e.g., respiratory tract, tubular digestive tract). These cells rest atop a selectively permeable 
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Committed Progenitors 



-> Mature Stromal Cells 



Reticular and adventitial fibroblasts 

Adipocytes 

Osteogenic Cells 



Pluripotent Stromal 
Stem Cell 

CFU-F? 



Common Lineage 
Totipotent Stem Cell?* 



fibroblasts 

adipocytes (fibroblast <-» adipocyte?) 

osteocytes 



Pluripotent Hematopoietic 
Stem Cell* 

CFU-S, .? 



H> Committed Progenitors > Precursors 

Myeloid, Lymphoid, Erythroid, 
Monocytic, Megakarocytic CFU 



> Mature Blood Cells 

B lymphocytes 

T lymphocytes 

Natural Killer cells 

Monocytes 

Granulocytes 

Erythrocytes 

Platelets 



FIGURE 119.1 Hypothetical relationship between the ontogeny of stromal and parenchymal (hematopoietic) cells 
of the bone marrow. Note: that stromal cell is denned in this chart as only those cells that are derivatives of 
mesenchyme. Hematopoietic stem and progenitor cells express CD 34. This epitope also is expressed by stromal cell 
precursors (CFU-F) [Simmons & Torok-Storb, 1991] and adherent stromal cells develop from cell populations selected 
for the CD 34 antigen. In addition, fetal marrow elements with CD 34 +, CD 38-, HLA-DR-phenotypes were reported 
to develop not only the hematopoietic microenvironment but also the hematopoietic cells themselves [Huang & 
Terstappen, 1992]. 

* Indicates the possibility that, at least in the fetus, there may be a common stem cell for bone marrow stromal and 
hematopoietic elements. 

1 There are several schools of thought relating to single or multiple stem cell pools for the lymphoid versus the other 
lineages. 

TABLE 119.1 Cells That Contribute to the Tissue Microenvironment 

Stromal cells: derivatives of a common precursor cell 

Mesenchyme 

Fibroblasts 

Myofibroblasts 

Osteogenic/chondrogenic cells 

Adipocytes 
Stromal-associated cells: histogenically distinct from stromal cells, permanent residents of a tissue 

Endothelial cells 

Macrophages 
Transient cells: cells that migrate into a tissue for host defense either prior to or following an inflammatory stimulus 

B lymphocytes/plasma cells 

Cytotoxic T cells and natural killer (NK) cells 

Granulocytes 
Parenchymal cells: cells that occupy most of the tissue volume, express functions that are definitive for the tissue, 
and interact with all other cell types to facilitate the expression of differentiated function 

basement membrane. The composition of the underlying tissue varies, but it contains a connective tissue 
framework for parenchymal cells. If the tissue is an artery or a vein, the underlying connective tissue is 
composed of circularly arranged smooth-muscle cells which in turn are surrounded by an adventitia of 
loose connective tissue. Tissues within organs are generally organized into functional units around 
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capillaries which facilitate metabolite and blood gas transport by virtue of their lack of a smooth muscle 
layer (tunica media) and the thinness of their adventitial covering. Connective tissue cells of the tissue 
underlying capillaries deposit extracellular matrix (ECM) which is a mixture of fibrous proteins (col- 
lagens) embedded in a hydrated gel of glycosaminoglycans (GAGs). The GAGs as distinguished by their 
sugar residues, are divided into five groups: hyaluronic acid, the chondiotin sulfates, dermatan sulfate, 
heparan and heparin sulfate, and keratan sulfate. All the GAGs except hyaluronic acid link with proteins 
to form proteoglycans. These proteoglycans bind to long- chained hyaluronic acid cores that interweave 
the crosslinked collagen fibers that form the framework of the tissue. The basic composition and density 
of deposition of stromal cell-derived ECM varies from tissue to tissue [Lin and Bissel, 1993]. Figure 119.2 
is a scanning electron micrograph (SEM) depicting the intricacies of the ECM deposited by human bone 
marrow-derived stromal cells growing on a three-dimensional culture template. The ECM forms a sieve- 
like arrangement for diffusion in an aqueous environment and a number of bone marrow-derived 
migratory cells are present as well. Such an arrangement dramatically enhances the surface area for cell 
growth. Although these voids are primarily filled by parenchymal cells, normal tissue interstitium also 
contains migratory immunocompetent cells such as B lymphocyte/plasma cells, T cells, and natural killer 
cells as well as the ubiquitous macrophage (histiocyte). The numbers of these cells are enhanced during 
inflammatory episodes when neutrophils or other granulocytes and/or mononuclear leukocytes infiltrate 
the tissue. Cytokines and other proteins released by these leukocytes recruit stromal cells in the tissue 
repair process. This phenomenon can be exploited by tumor cells to enhance their invasiveness (see 
Section 119.2 Tumor Cells). Although tissue function in toto is measured by parenchymal cell output 
(e.g., for liver, protein synthesis, metabolism; for bone marrow, blood cell production), this is profoundly 
influenced by and in some instances orchestrated by the stromal cell microenvironment. 

Stromal cells contribute to parenchymal cell function by synthesizing the unique mix of ECM proteins 
necessary for cell seeding/attachment of specific types of tissue cells, the modulation of gene expression 
in parenchymal cells by events triggered by cytoskeleton-mediated transduction [Ben-Ze'ev et al., 1988], 
the deposition of ECM proteins to sequester growth and regulatory factors for use by developing paren- 
chyma [Roberts et al., 1988], deposition of ECM of the right density to permit diffusion of nutrients, 
metabolites, and oxygen (and egress of C0 2 and waste) to the extent necessary to maintain the functional 
state of the tissue, by forming the appropriate barriers to minimize cell migration or intrusion, and by 
synthesizing and/or presenting cytokins that regulate parenchymal cell function, either constituitively or 
following induction by other humoral agents (see Table 119.2). 

Creating cultures that contain the multiplicity of cell types found in vivo presents a daunting task for 
several reasons: (1) culture media that are rich in nutrients select for the most actively mitotic cells at 
the expense of more mitotically quiescent cells and, (2) cell phenotypic expression and function is related 
to its location in tissue to some extent. It is difficult, especially using a flat (i.e., two-dimensional) culture 
template, to create a microenvironment that is permissive or inductive for the formation of tissue-like 
structures. (3) Localized microenvironmental niches regulate different parenchymal cell functions or, in 
the case of bone marrow, hematopoietic cell differentiation. These milieu are difficult to reproduce in 
culture, since all cells are exposed to essentially the same media components. However, a major goal of 
tissue culture is to permit normal cell-cell associations and the reestablishment of tissue polarity so that 
parenchymal cell function is optimized. The question of three-dimensionality will be addressed later in 
this chapter. A brief survey of the various types of stromal elements follows. 

Fibroblasts 

Fibroblasts are responsible for the synthesis of many GAGs and for the deposition and organization of 
collagens. Although present in most tissues, fibroblasts exhibit specialization with respect to the type of 
ECM that they secrete. For example, liver tissue contains type I and type IV collagen, whereas type II 
collagen is found in cartilaginous tissues. Bone marrow contains types I, III, IV, and V collagen [Zuck- 
erman, 1984]. Heterogeneity of collagen deposition as well as GAG composition exists not only between 
different tissues but in developmentally different stages of the same tissue [Thonar & Kuettner, 1987]. 
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FIGURE 119.2 Scanning electron micrographs of bone marrow cultures on nylon screen templates (a) Photograph 
depicting a portion of a macrophage (M) associated with the numerous, delicate, interweaving strands of ECM (arrows) 
that are deposited between the openings of the nylon screen of a bone marrow stromal cell culture. ( h) Myeloid cells of 
a hematopoietic colony growing in a coculture of human stromal cells and hematopoietic cells. Note the pattern of 
attachment of individual cells to matrix and the large open area between the cells for nutrient access, (c) The photograph 
depicts the intimate association of cells of a myeloid or mixed myeloid/monocytic colony (arrow) with enveloping 
fibroblastic cells (F) in a bone marrow coculture. A filament of the nylon (n) screen is also present in the field, (d) An 
erythroid colony (E) in a human bone marrow coculture on nylon screen. Note that the more mature erythroid cells are 
on the periphery of the colony and that it is in close apposition to a macrophage (M). A nylon filament is also present (n). 
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FIGURE 119.2 (continued) 



In addition to matrix deposition, fibroblasts synthesize the cytokines of the fibroblast growth factor (FGF) 
family, a variety of interleukins, and GM-CSF as well as other regulatory cytokines (Table 119.2). Fibro- 
blast activity can be modulated by circulating or locally diffusable factors including IL-1, TGFp, TNF a , 
and a host of other factors [Yang et al., 1988]. For example, IL-1 activated splenic stroma supported the 
growth of rat natural killer cells in long-term culture [Tjota et al., 1992]. In addition, treatment of bone 
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TABLE 119.2 Stromal Cell Phenotypes and Secretory Profiles 



Phenotypes" 

MHC I (all) 

MHC II (E, mcp) 

CD 10 (Neural endopeptidase) b 

CD 29 (B 2 integrin) (all) 

CD 34 (CFU-F) 

CD 36 (IA 7 (mcp) 

CD 44 (H-CAM) b 

CD 49b (a 2 chain of VLA-2) b 

CD 49d (a 4 chain of VLA-4) b 

CD 49e (a 5 chain of VLA-5) b 

CD 51 (vitronectin receptor) b 

CD 54 (ICAM-1) (all) 

CD 58 (LFA-3) (all) 

CD 61 (GP Ilia) 

VCAM-1 (E) 

a SM actin (F) 

Vimentin (F) 

Decorin (Mes) 

Fibronectin (F, mcp) 

Laminin (E) 

Collagen I (F) 

Collagen III (F) 

Collagen IV (F) 

von Willebrand factor (E) 

Adipsin (A) 



Cytokine/Protein Secretion 

II- 1(3 (mcp, E, F,A) 

M-CSF (m(p, E, F) 

G-CSF (mcp, F) 

GM-CSF (m(p, E) 

LIF (mcp, F) 

GM-colony enhancing factor (mcp) 

TGFp (mcp, F) 

TNFa (mcp, F) 

PAF (E) 

BPA (mcp, E) 

IFNa (mcp, F, E) 

IFNy (F) 

11-6 (mcp) 

c-kit ligand (mcp, F) 

Acidic and basic FGF (F) 

Angiotensinogen (A) 

lipoprotein lipase (A) 

Adipocyte P2 (A) 

MlP-la (mcp) 

Complement proteins C1-C5 (mcp) 

Factor B, properdin (mcp) 

Transcobalamin II (mcp) 

Transferrin (mcp) 

Arachidonic acid metabolites (mcp, E) 

HGF/SF (F) 

flt3-ligand (S) 

Prolactin (F) 

LIF (S) c 

STR-3 (F) 

(IL-3, IL-7, IL-9, IL-11, IL-15, neuroleukin) b 



* Phenotypic expression of stroma in bone marrow cultures. Parentheses indicate 
localization according to cell type (mcp = Macrophage F = Fibroblast, E = endothelial, 
A = adipocyte, Mes = mesenchyme). Many of these phenotypes were identified in the 
original work of Cicuttini et al„ [1992] and Moreau et al., [1993], 

b Indicates the presence of mRNA and /or protein expression in bone marrow cultures 
but not localized to a particular cell type. 

c Found in fallopian tube stroma [Keltz et al., 1997]. 

Abbreviations: BPA — erythroid burst promoting activity, CD — cluster determinant, 
MHC — major histocompatibility complex, M — monocyte, G — granulocyte, CSF — col- 
ony stimulating factor, CFU-F — colony forming unit-fibroblast, M1P — macrophage 
inflammatory protein, IL — interleukin, IFN — interferon, TGF — transforming growth 
factor, TNF — tumor necrosis factor, PAF — platelet activating factor, FGF — fibroblast 
growth factor/scatter factor, LIF-leukemia inhibitory factor, STR-3 — stromelysin 3. 

marrow stromal cell cultures with the steroid methylprednisolone reduced the concentration of hyalu- 
ronic acid and heparan sulfate relative to other proteins [Siczkowski et al., 1993], perhaps making them 
more conducive to hematopoietic support. Horse serum, presumably because of its high content of 
hydrocortisone and other steroids, was an essential component of medium used to maintain hemato- 
poiesis in early long term bone marrow cultures [Greenberger et al., 1979]. In our experience, medium 
supplementation with hydrocortisone or other corticosteroids enhances the ability of liver- derived stro- 
mal cells to support parenchymal hepatocyte function for longer-terms in vitro [Naughton et al., 1994], 
although the precise mechanism(s) has not been defined. 

Fibroblastic cells in tissue appear to be heterogeneous with respect to support function. Early mor- 
phometric studies of bone marrow indicated that stroma supporting different types of hematopoiesis 
exhibited different staining patterns. The development of highly specific monoclonal antibodies in the 
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intervening years made possible a much more detailed analysis of stromal cells of bone marrow and other 
tissues and provided a number of avenues to isolate these cells for study. Whereas bone marrow stromal 
cells can be separated from hematologic cells by virtue of their adherence to plastic, this technique was 
not optimal for all tissues. Monoclonal antibodies can now be used to "dissect" cells from the stromal 
microenvironment using a variety of techniques including flow cytometry, immune panning, immuno- 
magnetic microspheres, and affinity column methodologies. Monoclonal antibodies also make it possible 
to select for stromal cell progenitors like the fibroblast-colony forming unit (CFU-F) which is CD 34 
positive. 

Endothelial Cells 

For nonglandular tissues such as bone marrow, the endothelial cells are vascular lining cells and can be 
distinguished by the expression of or the message for von Willebrand factor and surface major histo- 
compatibility complex-II (MHC-II) antigens. As cells of the vascular tunica intima, they, along with the 
basement membrane, form a selective barrier to regulate the transport of substances to and from the 
blood supply. In addition, their expression of integrins following stimulation with IL-1 or other mediators 
regulates the attachment of immunocompetent cells during acute (neutrophils) or chronic (T lympho- 
cytes, monocytes) inflammation. Endothelial cells also regulate other vascular functions. They synthesize 
the vasodilatory effector nitric oxide after induction with acetylcholine [Furchgott & Zawadzki, 1980] 
and secrete regulatory peptides, the endothelins, which counteract this effect [Yanagisawa et al., 1988]. 
Specialized vascular endothelia are present in bone marrow, where they permit the egress of mature 
leukocytes from the marrow into the sinusoids, and in the liver, where the surfaces of the cells lining 
sinusoids are fenestrated to facilitate transport. In a more general context, endothelial cells also produce 
collagen IV for basement membranes and secrete several cytokines including IL-1, fibronectin, M-CSF, 
and GM-CSF and release platelet-activating factor (Table 119.2). Vascular endothelia are nonthrombo- 
genic and contribute to angiogenesis by mitotically responding to locally secreted factors such as bFGF 
and aFGF. These endothelia possess LDL receptors and degrade this lipoprotein at substantially higher 
rates than other types of cells. These cells also have receptors for Fc, transferrin, mannose, galactose, and 
Apo-E, as well as for scavenger receptors [Van Eyken & Desmet, 1993]. Endothelia tend to assemble into 
tubular structures in culture, but their contribution to parenchymal cell growth in vitro is contingent 
on the generation of the proper tissue polarity. 

In addition to vascular endothelial cells, some organs possess nonvascular endothelial cells such as the 
bile duct lining cells of the liver. These cells possess antigenic profiles and secretory potentials that are 
similar to vascular endothelia. 

Adipocytes, Fat-Storing Cells 

These cells are represented in varying concentrations in different tissues. They are related to fibroblasts 
(Fig. 1 19.2), and transformation of fibroblasts to adipocytes in vitro can be induced by supplementation 
of the medium with hydrocortisone or other steroids [Brockband & van Peer, 1983; Greenberger, 1979] 
which induce the expression of lipoproteinlipase and glycerolphosphate dehydrogenase as well as an 
increase in insulin receptors [Gimble et al., 1989]. Like fibroblasts, marrow adipocytes are heterogeneous 
and display different characteristics related to their distribution in red or yellow marrow [Lichtman, 
1984] . These characteristics include insulin independence but glucocorticoid dependence in vitro, positive 
staining with perfomic acid-Schiff, and higher concentrations of neutral fats and unsaturated fatty acids 
in triglycerides. In bone marrow in vivo, there appears to be an inverse relationship between adipogenesis 
and erythropoiesis: phenylhydrazine-induced anemia causes a rapid conversion of yellow marrow (con- 
taining adipocytes) to red marrow due to compensatory erythroid heperplasia [Maniatis et al., 1971]. 
However, the role of the adipocyte in supporting hematopoiesis in culture is controversial. Although 
Dexter and coworkers [1977] associated declining myelopoiesis in long-term murine bone marrow 
cultures with the gradual disappearance of adipocytes from the stromal layer, these cells may not be 
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necessary to support hematopoiesis in human bone marrow cultures [Touw & Lowenberg, 1983]. In this 
regard, IL- 1 1 suppressed adipogenesis but simulated human CD34+ HL-DR + progenitor cells cocultured 
with human bone marrow stroma and enhanced the numbers of myeloid progenitor cells [Keller et al., 
1993]. Hematopoiesis in vivo requires the proper ECM for cell attachment and differentiation as well as 
regulation by cytokines elaborated by stromal cells [Metcalf, 1993]. Although there is considerable 
redundancy in the synthesis of regulatory factors by different stromal cell populations, no single cell 
population has been identified that can provide an entire hematopoietic microenvironment. Further 
complicating the issue is a recent report indicating that CD 34+ hematopoietic progenitors produce 
soluble factors that control the production of some cytokines by stromal cells [Gupta etal., 1998]. If 
"cross-talk" between parenchyma and stroma is established as an important regulatory mechanism, the 
ratio between the relative numbers of stroma and parenchyma in a tissue culture will assume paramount 
importance. The use of cell lines to provide hematopoietic support will be discussed later in this chapter. 
In the liver, purified fat-storing cells are capable of broad-scale synthetic activity that includes 
collagens I, III, IV, fibronectin, heparan sulfate, chondroitin sulfate, and dermatan sulfate [DeLeeuw et al., 
1984] . Adipocytes also synthesize colony-stimulating factors and other regulatory cytokines (Table 1 19.2). 
In addition to the above characteristics, adipocytes are desmin positive and therefore are phenotypically 
related to myogenic cells or myofibroblasts. The function of adipocytes may vary depending on their 
location. Adipocytes in the bone marrow are in close apposition to the sinusoidal endothelial cells and 
in the liver are found in the space of Disse under the endothelial cells. Hepatic fat-storing cells are finely 
integrated into several contiguous parenchymal cells and contain fat droplets that are qualitatively 
different than those found in hepatic parenchymal cells in that they contain high levels of retinols. Liver 
adipocytes are responsible for the metabolism and storage of vitamin A [DeLeeuw et al., 1984], provide 
some of the raw materials for the synthesis of biologic membranes and also contribute to local energy 
metabolism needs. Adeipocytes of other tissues also act as a type of "progenitor" cell that can be converted 
to different phenotypes (e.g., osteoblasts or chondroblasts) under the appropriate conditions and are 
capable of stimulating osteogenesis via their secretion of cytokines [Benayahu et al., 1993]. 

Macrophages 

Macrophages are derivatives of peripheral blood monocytes and are, therefore, bone-marrow- derived. 
They seed and remain on the surfaces of sinusoidal vessels in organs such as the liver and the spleen or 
migrate into the interstitial spaces of virtually all tissues. Macrophages are quintessential immunocom- 
petent cells and are central components of many defense strategies, including randomized microbial 
phagocytosis and killing; antibody-dependent cellular cytotoxicity (ADCC), where they are directed 
against microbial or other cells that are opsonized with antibody; nonrandomized (specific) phagocytosis 
mediated by the association of immunoglobulins with a multiplicity of F c receptors on their surfaces; 
the presentation of processed antigen to lymphocytes; secretion of and reaction to chemotaxins; and an 
enzymatic profile enabling them to move freely through tissue. The secretory capacity of macrophages 
is prodigious. In addition to plasma components such as complement proteins CI through C5, they 
synthesize the ferric iron- and vitamin B 12 -building proteins, transferrin and transcobalamin II, as well 
as a host of locally acting bioreactive metabolites of arachidonic acid. Macrophage secretory activity 
appears to influence two simultaneous events in vivo, inflammation and tissue repair. One monokine, 
IL-1, enhances the adhesion of neutrophils to vascular endothelial cells and activates B and T lymphocytes 
and other macrophages while stimulating the formation of acute phase proteins inducing collagen, ECM, 
and cytokine synthesis by fibroblasts and other stromal cells. IL-1 as well as a host of other humoral 
regulatory factors, originate in macrophages, including TNFoc, IFNoc, GM-CSF, and MIP-lOC 
(Table 119.2). The ability of macrophages to secrete regulatory cytokines makes them an important 
contributor to the tissue microenvironment. 

Macrophages have been intrinsically associated with erythropoiesis in the bone marrow [Bessis & 
Breton-Gorius, 1959] and [Naughton etal., 1979]. As such, these "nurse cells" destroy defective eryth- 
roblasts and the fetal and regenerating liver, and provide recycled iron stores for hemoglobin synthesis 
by developing red cells. They also synthesize and/or store erythropoietin. 
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119.2 Stromal Cells as "Feeder Layers" for 
Parenchymal Cell Culture 

Irradiated stromal feeder layers were first used by Puck and Marcus [1955] to support the attachment 
and proliferation of HeLa cells in culture. Direct contact with feeder layers of cells also permits the growth 
of glioma cells and epithelial cells derived from breast tissue and colon [Freshney et al., 1982] . Enhanced 
attachment of parenchymal cells and production of factors to regulate growth and differentiation are two 
important benefits of coculturing parenchymal and stromal cells. 

Bone Marrow 

Bone marrow was the first tissue to be systematically investigated in regard to the influence of ECM and 
stromal cells on the production of blood cells. By the mid 1970s, the influence of microenvironmental 
conditions [Trentin, 1970; Wolf & Trentin, 1968] and ECM deposition [McCuskey et al, 1972; Schrock 
et al., 1973] upon hematopoiesis was well established in the hematology literature. Dexter and coworkers 
[1977] were the first to apply the feeder-layer-based coculture technology to hematopoietic cells by 
inoculating mouse bone marrow cells into preestablished, irradiated feeder layers of marrow-derived 
stromal (adherent) cells. These cultures remained hematopoietically active for several months. By com- 
parison, bone marrow cells cultured in the absence of feeder layers or supplementary cytokines terminally 
differentiated within the first two weeks in culture; stromal cells were the only survivors [Brandt et al., 
1990; Chang & Anderson, 1971]. If the cultures are not supplemented with exogenous growth factors, 
then myeloid cells, monocytic cells, and nucleated erythroblasts are present for the first 7-10 days of 
culture. This trilineage pattern is narrowed over successive weeks in culture so that from about 2-4 weeks 
myeloid and monocytic cells are produced, and, if cultured for longer terms, the products of the culture 
are almost entirely monocytic. However, the nature of hematopoietic support can be modulated by 
changing the environmental conditions. In this regard, cocultures established under identical conditions 
as the above produce B lymphocytes if the steroid supplementation of the medium is removed and the 
ambient temperature is increased [Whitlock & Witte, 1982]. Static bone marrow coculture systems are 
generally categorized as declining because of this and the finding that the hematopoietic progenitor cell 
concentrations decrease as a function of time in culture. However, this trend can be offset by coculturing 
bone marrow in bioreactors that provide constant media exchange and optimize oxygen delivery [Palsson 
et al., 1993] or by supplementing the cocultures with cocktails of the various growth factors elaborated 
by stromal cells [Keller et al., 1993]. Connective tissue feeder layers had originally been hypothesized to 
act by conditioning medium with soluble factors that stimulated growth and by providing a substratum 
for the selective attachment of certain types of cells [Puck & Marcus, 1955]. These are functions that 
have since been proven for bone marrow and other tissue culture systems. 

It is very difficult to recreate the marrow microenvironment in vitro. One reason is that the marrow 
stromal cell populations proliferate at different rates; if expanded vigorously using nutrient-rich medium, 
the culture selects for fibroblastic cells at the expense of the more slowly dividing types of stroma. We 
established rat bone marrow cocultures using stromal cells that were passaged for 6 months, and these 
cultures produced lower numbers of progenitor cells (CFU-C) in the adherent zone as compared to 
cocultures established with stroma that was only passed three to four times (Fig. 119.3). Cells released 
from cultures using the older stroma were almost entirely monocytic, even at earlier terms of culture. In 
a related experiment, we suspended nylon screen cocultures of passage 3-4 stroma and bone marrow 
hematopoietic cells in flasks containing confluent, 6-month-old stromal cell monolayers and found that 
these monolayer cells inhibited hematopoiesis in the coculture [Naughton et al., 1989]. 

We use early passage stroma for our cocultures in order to retain a representation of all the stromal 
cell types. If hematopoietic cells from cocultures are removed after about a month in vitro and reinoc- 
ulated onto a new template containing passage 3-4 stromal cells and cultured for an additional month, 
the progenitor cell concentrations of the adherent zones are considerably higher than in cocultures where 
no transfer took place [Naughton et al., 1994] . This experiment indicated that continued access to "fresh" 
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FIGURE 1 19.3 Left. Mean CFU-C progenitor concentration of the adherent zone of a three-dimensional coculture 
of rat bone hematopoietic cells and rat bone marrow stroma. Stock cultures to generate the stromal cells for the 
cocultures were seeded onto the three-dimensional template either after passage (P3) or following expansion in 
monolayer culture for 6 months. Stroma at early passage is substantially more supportive of CFU-C progenitor cells 
than "old" stroma that is primarily fibroblastic in nature. Vertical lines through the means = ±1 standard error of 
the mean (sem). Right. Analysis of the cellular content of the nonadherent zone of rat bone marrow hematopoietic 
cell: stromal cell cocultures on three-dimensional templates by flow cytometry. Cocultures were established either 
with P3 stroma (closed figures) or stroma that was grown in monolayer culture for 6 months (open figures). The 
mean percentages of cells recognized by the phenotyping antibodies MOM/3F12/F2 (myeloid), ED-1 (monocytic), 
and OX-33 (B lymphoid) (Serotec, UK) are depicted. Vertical lines through the means = ±1 sem. Whereas low passage 
stroma generates myeloid, B lymphoid, and monocytic cells in cocultures and releases them into the nonadherent 
zone, 6-month-old stroma supports mainly the production of macrophages. 

stroma enhances hematopoiesis. The maintenance of mixed populations of stromal cells are desirable for 
a number of reasons. The same cytokine may be synthesized by different stromal cells, but its mode of 
action is usually synergistic with other cytokines for the differentiation of a specific blood cell lineage(s). 
Differentiation may not occur in the absence one of these cytokines, although some degree of redundancy 
in cytokine expression is intrinsic to the stromal system [Metcalf, 1993]. The fibroblastic cells that "grow 
out" of later passage human and rodent bone marrow stroma share some phenotypic properties with 
muscle cells (aSM actin+, CD34-, STRO-1-). These cells (endothelial cells and macrophages are absent 
or present in negligible quantities) supported hematopoietic progenitors for up to 7 weeks of culture but 
no longer [Moreau et al., 1993], indicating that other types of cells are necessary for long-term hemato- 
poiesis in vitro. 

Stromal cells derived from spleen or bone marrow were also shown to support the growth and 
maturation of rat natural killer cells in long-term three-dimensional culture [Tjota et al., 1992]. Natural 
killer cells produced by the cultures for more than 2 months in vitro continued to kill YAC-1 (NK 
sensitive) target cells when activated by IL-2. In more recent experiments, fetal thymus-derived stroma 
supported the production of CD45+/CD56+ human natural killer cells when cocultured with CD34+lin- 
progenitors [Vaz et al., 1998]. In related work, stromal cell-derived IL-15 was found to enhance the 
production of CD56+/CD3- natural killer cells from human CD34+ hematopoietic progenitors [Mrozek 
et al., 1996]. In addition, the process of natural killer cell maturation (but not early progenitor cell 
proliferation) appears to require direct contact with stromal cells [Tsuji and Pollack, 1995]. 

Cytoadhesive molecule expression and its modulation is also an important consideration for the 
engineering of bone marrow and other tissues. In the case of bone marrow, this regulates "homing" or 
seeding, restricting cells in specific areas within tissues both to expose them to regulatory factors and to 
the release of cells from the supportive stroma upon maturation. For example, the sequential expression 
of hemonectin, a cytoadhesion molecule of myeloid cells, in yolk sac blood islands, liver, and bone marrow 
during embryonic and fetal life parallels the granulopoiesis occurring in these respective organs [Peters 
et al., 1990]. Hematopoietic stem cells [Simmons et al., 1992] and B-cell progenitors [Ryan et al., 1991] 
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TABLE 119.3 Cells Contributing to the Hepatic Microenvironment 



Cell Type 

Stroma 

Kupffer cells 



Size, (im 



12-16 



Relative Percent 
of Total Cells 



Vascular endothelia 11-12 

Biliary endothelia 10-12 



Fat-storing cells 


14-18 


3 


Fibroblasts 


11-14 


7 


Pit cells 


11-15 


1-2 (variable) 


Parenchymal cells 






Mononuclear (type 1) 


17-22 


35 


Binuclear (type II) 


20-27 


27 


Acidophilic (type III) 


25-32 


5 



Characteristics 



MHC-1+, MHC-II+, Fcr, C3r, mannose and D- 

acetrylglucosamine receptors, acid phosphatase+, density = 

1.076 gm/ml,T.036 gm/ml* 
MHC-1+, MHC II+, vWF+, F c r, TF r , mannose, apo-E, and 

scavenger receptors, density = 1.06-1.08 gm/ml,T.036 gm/ml* 
MHC-I+, Positive for cytokeratins 7, 8, 18, 19, 02 

microglobulin+, positive for VLA-2,3,6 integrins, agglutinate 

with UEA, WGA, SBA, PNA, density = 1.075-1.1 

gm/ml,T.0363gm/ml t 
MHC-I+, desmin+, retinol+, ECM expression, collagen I, IV 

expression., density =1.075-1.1 gm/ml* 
MHC-I+, ECM expression, collagen I, IV expression, density = 

1.025 gm/ml," 1.063 gm/ml' 
MHC-I.+, asialo-GM+, CD8+, CD-5- 

MHC-I-, MHC-II-, blood group antigen-, density = 

1.10-1.14 gm/ml,' 1.067 gm/ml* 
MHC-I±, MHC-II-, blood group antigen-, density = 

1.10-1.14 gm/ml," 1.071 gm/ml* 
MHC-1±, MHC-II-, blood group antigen-, density = 

1.038 gm/ml* 



Abbreviations: ECM = extracellular matrix, MHC - major histocompatibility complex, PNA 
soybean agglutinin, WGA = wheat germ agglutinin, UEA = Ulex europaeus agglutinin. 



peanut agglutinin, SBA = 



bind to VCAM-1 on cultures stromal cells by expression of VLA-4. In addition, erythroid progenitor cells 
bind preferentially to the fibronectin component of the ECM and remain bound throughout their 
differentiation [Tsai et al., 1987]. A basic requirement of stromal feeder layers is the expression of the 
proper cytoadhesion molecule profile to permit attachment of parenchyma; in this instance these are 
hematopoietic stem and progenitor cells. 

Liver 

Liver is an hematopoietic organ during fetal life, and it contains many of the same stromal cell populations 
found in bone marrow (Table 119.3). Research trends in liver cell and bone marrow culture have followed 
a somewhat parallel course. Both tissues are difficult to maintain in vitro; the parenchymal cell numbers 
either declined (hematopoietic progenitors) or lost function and dedifferentiated (hepatocytes) over time 
in culture. When mixed suspensions of hepatic cells are inoculated into liquid medium, approximately 
20% of the total cells attach. The nonadherent population remains viable for about 72 hours. The adherent 
cells, although they proliferate for only 24-48 hours after inoculation, can survive for substantially longer 
periods. However, many of these adherent parenchymal cells undergo drastic phenotypic alterations, 
especially if the medium is conditioned with serum. These changes include a flattening and spreading 
on plastic surfaces as well as a propensity to undergo nuclear division in the absence of concomitant 
cytoplasmic division. The appearance of these bizarre, multinucleated, giant cells herald the loss of liver- 
specific functions such as albumin synthesis and the metabolism of organic chemicals by cytochrome 
P450 enzymes. The percentage of hepaocytes attaching to the flask, the maintenance of rounded paren- 
chymal cell phenotype, and the expression of specialized hepatic function in vitro can be enhanced by 
precoating the flasks with ECM components such as type I collagen [Michalopoulos & Pitot, 1975], 
fibronectin [Deschenes et al., 1980], homogenized liver tissue matrix [Reid et al., 1980], laminin, and 
type IV collagen [Bissell et al., 1987]. Hepatocyte survival and functional expression also improved when 



O 2000 by CRC Press LLC 



hepatocytes were cocultured with liver-derived [Fry & Bridges, 1980] and murine 3T3 [Kuri-Harcuch & 
Mendoza-Figueroa, 1989] fibroblasts as well as a preestablished layer of adherent liver epithelial cells 
[Guguen-Guillouzo et al., 1983]. As with bone marrow culture on feeder layers, cells derived from the 
liver itself usually provide the best support in culture. However, we have found that hepatic parenchyma 
will express a differentiated function in vitro if supported by bone marrow stroma. Conversely, liver- 
derived stromal cells support hematopoiesis in culture but his microenvironment favors erythropoisis 
(unpublished observations). 

In contrast to the role of bone marrow stroma on hematopoiesis, the influence of hepatic stromal cells 
on the function and/or growth of parenchymal hepatocytes has not been exhaustively investigated. 
However, several studies indicate the Kupffer cells, fat-strong cells, and perhaps other stroma influence 
parenchymal cell cytochrome P450 enzyme expression [Peterson & Renton, 1984] and act in tandem 
with parenchymal cells to metabolize lipopolysaccharide [Treon et al., 1979]. In addition, adipocytes and 
hepatic fibroblasts synthesize collagen type I and the proteoglycans heparan sulfate, dermatan sulfate, 
and chondroitin sulfate as well as hyaluronic acid, which is unsulfated in the liver and occurs as a simple 
GAG. Stromal cells as well as parenchyma deposit fibronectin. Liver adipocytes, like their relatives in 
bone marrow, express phenotypes (vimentin+, actin+, tubulin-l-) linking them histogenically to fibro- 
blasts and mogenic cells [DeLeeuw et al, 1984]. Fat-storing cells as well as vascular endothelia apparently 
synthesize the type IV collagen found in the space of Disse, and fibrin originating form parenchymal cell 
fibrinogen synthesis is a significant part of liver matrix, at least in culture. As with the hematopoietic 
cells in bone marrow cultures, hepatic parenchymal cell gene expression is modulated by attachment to 
ECM and influenced by factors released by stromal cells. For example, parenchymal cells attaching to 
laminin-coated surfaces express the differentiation-associated substance (X-fetoprotein, whereas the syn- 
thesis of albumin synthesis is favored when parenchymal cells bind to type IV collagen [Michalopoulous & 
Pitot, 1975]. Hepatocytes bind to fibronectin using the OC 5 (3 integrin heterodimer and AGpllO, a nonin- 
tegrin glycoprotein. Cytoadhesion molecule expression by heapatic cells and cells of other tissues changes 
with and perhaps controls development [Stamatoglou & Hughes, 1994]. Just as differential hemonectin 
expression occurred during the development of the hematopoietic system [Peters et al, 1990], a differ- 
ential expression of liver proteins occurs during ontogeny. Although fibronectin and its receptor are 
strongly expressed in liver throughout life, AGpllO appears later in development and may guide the 
development of parenchyma into a polarized tissue. It will probably be necessary to incorporate the 
various stromal support cells found in liver into hepatocyte cultures. 

Tumor Cells 

Stromal cells contribute to the process of neoplastic invasion of tissue by responding in a paracrine 
fashion to signals released by tumor cells. This includes the tumor-stimulated release of angiogenic factors 
from fibroblasts such as acid and basic FGF and the release of vascular endothelia growth factor/vascular 
proliferation factor (VEGF/VPF) by tumor cells to recruit endothelial cells and stimulate their prolifer- 
ation and development into blood vessels to feed the growing neoplasm (reviewed by Wernert [1997]). 
In addition to cooperating in the establishment of tumors, stromal cells also contribute to their invasive- 
ness. Activation and/or release of stromal cell-derived Tissue Factor (TF) (whose release by damaged 
tissue initiates the extrinsic limb of the protease coagulation cascade) has been associated with the 
progression from early to invasive breast cancer [Vrana et al., 1996]. In this regard, upon direct contact 
with tumor cells, stromal cells produce stromelysin 3 (STR-3), a matrix metalloproteinase that accelerates 
the migration of metastatic cells through tissue by degrading tissue matrix [Mari et al., 1998]. STR-3 is 
overexpressed in a wide variety of tumor stroma in breast, lung, colon, and other cancers. It would be 
appropriate to include coculture studies (i.e., tumor and stroma) in the investigation of the biology of 
cancer in vitro as well as its reponsivity to potential treatments. We have previously demonstrated that 
the presence of stroma alters the hematologic responsiveness to chemotherapy agents such as cyclophos- 
phamide [Naughton et al., 1992]. In related work, primary cultures of murine plasmacytomas requires 
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TABLE 119.4 Some Representative Stromal Cell Lines Used to Support Hematopoietic Cells or Hepatocytes 



Cell line 



Species Phenotype' 



ione marrow 






AC-4 


Mouse 


A 


ALC 


Mouse 


A 


GM 1380 


Human 


F 


GY-30 


Mouse 


A 


K-l 


Mouse 


A 


MBA- 14 


Mouse 


F/M 


MS-1 


Mouse 


F 


U2 


Mouse 


Ft 


3T3 


Mouse 


F 


10 Tl/2 


Mouse 


F 


10T1/2 clone D 


Mouse 


A 


266 AD 


Mouse 


A 


3T3 


Mouse 


F 


Detroit 550 


Human 


F 



Support Capability 

Myeloid, monocytic, B lymphoid 
Myeloid, monocytic, B lymphoid 
Short-term myelo- + monopoiesis 
Myeloid, monocytic, B lymphoid myelopoiesis 

Enhances CFU-C numbers myelopoiesis 

Maintenance of CFU-S 12 , myelopoiesis 

Stimulates CFU-GEMM, -GM 
Stimulates CFU-GM only 
Myeloid, monocytic, B lymphoid 

Enhanced lipid metabolism, extends period of cytochrome P450 activity!" 
Prolonged cytochrome P450 and NADPH-cytochrome C reductase activity* 



Source: Adapted from Anklesaria et al. [1987] and Gimble [1990]. 

'Major phenotype of the cell line (F = fibroblastic, M = monocytic, A = capable of supporting adipocyte phenotypes). 
f A bone marrow-derived cell line that was transformed with the large T oncogene of simian virus 40. 
•Support functions are compared to liquid cultures of rat hepatocytes without stroma. 

Abbreviations: CFU = colony-forming unit; G = granulocyte; M = macrophage; S = spleen; GEMM = granulocyte, eryth- 
rocyte, megakaryoctye, monocyte; NADPH = nicotinamide adenine dinucleotide phosphate. 



a feeder layer. Alteration of ECM composition and other support functions of the stroma by anti- 
inflammatory drugs prevented plasmacytoma growth in vivo and in vitro [De Grassi et al., 1993]. 

119.3 Support of Cultured Cells Using Cell Lines 

The precise roles of the various cellular constituents of the tissue microenvironment have not been fully 
defined. One approach to understanding these mechanisms is to develop stroma cell lines with homo- 
geneous and well-defined characteristics and then ascertain the ability of these cells to support paren- 
chyma in coculture. A brief survey of some of these lines is found in Table 1 19.4. Stromal cell lines derived 
from bone marrow are usually either fibroblastic (F) or a mixture of fibroblastic and adipocytic cells 
which contain a subpopulation of fibroblastic cells that can be induced to undergo lipogenesis with 
dexamethasone or hydrocortisone (A) (reviewed by Gimble, [ 1990] ). In general, fibroblastic lines support 
myelopoiesis and monocytopoiesis and stroma with both fibroblastic and adipocytic cells supports 
myelopoiesis as well as B lymphopoiesis. The MBA- 14 cell line, which consists of stroma bearing fibro- 
blastic as well as monocytic phenotypes, stimulated the formation of CFU-C (bipotential myeloid/mono- 
cytic) progenitors in coculture. Different cell lines that were transformed using the large T oncogene of 
simian virus 40 (U2) were used as feeder layers for bone marrow hematopoietic cells. Cocultures estab- 
lished using bone-marrow-derived stroma exhibited considerably better maintenance of the primitive 
stem cell CFU-S 12 in culture than similarly transfected skin, lung, or kidney tissue cells [Rios & Williams, 
1990]. Although feeder layers share a number of characteristics [cytokine production, ECM deposition), 
it is probably best to derive your feeder cells from the tissue that you wish to coculture rather than use 
xenogeneic feeder cells or cells derived from a completely dissimilar tissue. In this respect, we found that 
total cell output and progenitor cell concentrations were substantially higher in rat bone marrow cocul- 
tures supported with rat bone marrow stroma as compared to those established using immortalized 
human skin fibroblasts or fetal lung cell lines [Naughton & Naughton, 1989]. 
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119.4 Stereotypic (Three-Dimensional) Culture Versus 
Monolayer Culture 

Tissue is a three-dimensional arrangement of various types of cells that are organized into a functional 
unit. These cells also are polarized with respect to their position within tissue and the microenvironment, 
and therefore the metabolic activity and requirements of the tissue are not uniform throughout. Three- 
dimensional culture was first performed successfully by Leighton [1951] using cellulose sponge as a 
template. Collagen gel frameworks [Douglas, 1980] also have been and are currently being employed to 
culture tissues such as skin, breast epithelium, and liver. In addition, tumor tissue cocultured with stroma 
in collagen gels respond to drugs in a manner similar to that observed in vivo [Rheinwald & Beckett, 1981] . 

We developed three-dimensional coculture templates using nylon filtration screens and felts made of 
polyester or bioresorbably polyglycolic acid polymers [Naughton et al., 1987, 1994]. Rodent or human 
bone marrow cocultures retained multilineage hematopoietic expression in these stereotypic cultures, a 
phenomenon that is possible in plastic flask or suspension cultures only if the medium is supplemented 
with cocktails of cytokines [Peters et al., 1993]. Cocultures of rat hepatic parenchymal and stromal cells 
on three-dimensional templates also displayed a number of liver-specific functions for at least 48 days 
in culture, including the active synthesis of albumin, fibrinogen, and other proteins and the expression 
of dioxin-inducible cytochrome P450 enzyme activity for up to 2 months in culture [Naughton et al., 
1994]. Furthermore, hepatic parenchyma in these stereotypic cocultures proliferated in association with 
stromal elements and the ECM they deposited until all "open" areas within the template were utilized. 
Our method is different from others because we use stromal cells derived from the tissue we wish to 
culture to populate the three-dimensional template. These cells secrete tissue-specific ECM and other 
matrix components that are indigenous to the normal microenvironment of the tissue. Parenchymal cells 
associate freely within the template after their inoculation and bind to other cells and/or matrix based 
upon their natural cytoadhesion molecule profiles. We do not add exogenous proteins. 

Three-dimensional scaffolds such as nylon screen or polyester felt provide large surface areas for cell 
attachment and growth. Although mass transfer limitations of diffusion dictate the maximum thickness 
(density) of a tissue culture, suspended three-dimensional cultures have the advantage of being completely 
surrounded by medium. This arrangement effectively doubles the maximum tissue thickness that is 
possible with a plastic flask-based culture. These stereotypic cultures also appear to form tissuelike 
structure in vitro and, when implanted after coculture on bioresorbable polymer templates, in vivo. 

Defining Terms 

When hematopoietic cells are inoculated into semisolid or liquid medium containing the appropriate 
growth factors, some of the cells are clonal and will form colonies after approximately 2 weeks in culture. 
These colonies arise from hematopoietic progenitor cells and are called colony-forming units or CFU. 
These colonies can consist of granulocytic cells (G), monocytic cells (M), a mixture of these two cell 
types (GM), or erythroid cells (E). Less mature progenitor cells have a greater potential for lineage 
expression. For example, a CFU-GEMM contains granulocytic, erythroid, megakaryocytic, and mono- 
cytic cells and is therefore the least mature progenitor of this group. 

The text also mentions CFU-S. Whereas the other assays quantify colony formation in vitro, this is an 
in vivo assay. Briefly, irradiated mice are infused with meatopoietic cells. Some of these cells will colonize 
the spleen and will produce blood cells that "rescue" the animal. As with the in vitro assays, colonies that 
arise later in culture originate from less mature cells. The CFU-S 12 therefore is a more primitive hemato- 
poietic cell than the CFU-S 9 . 



e 2000 by CRC Press LLC 



References 

Anklesaria P, Kase K, Glowacki J, et al. 1987. Engraftment of a clonal bone marrow stromal cell line in vivo 

stimulates hematopoietic recovery from total body irradiation. Proc Natl Acad Sci USA 84:7681. 
Beneyahu D, Zipori D, Wientroub S. 1993. Marrow adipocytes regulate growth and differentiation of 

osteoblasts. Biochem Biophys Res Comm 197:1245. 
Ben-Ze'ev A, Robinson GS, Bucher NLR, et al. 1988. Cell-cell and cell-matrix interactions differentially 

regulate the expression of hepatic and cytoskeletal genes in primary cultures of rat hepatocytes. 

Proc Nat Acad Sci (USA) 85:2161. 
Bessis M, Breton-Gorius J. 1959. Nouvelles observations sur Pilot erythroblastique et la rhopheocytose 

de la ferritin. Rev Hemato 14:165. 
Bissell DM, Arenson DM, Maher JJ, etal. 1987. Support of cultured hepatocytes by a laminin-rich gel: 

Evidence for a functionally significant subendothelial matrix in normal rat liver. J Clin Invest 790:801. 
Brandt J, Srour EF, Van Besien K, et al. 1990. Cytokine-dependent long term culture of highly enriched 

precursors of hematopoietic progenitor cells from human bone marrow. J Clin Invest 86:932. 
Brockbank KGM, van Peer CMJ. 1983. Colony stimulating activity production by hemopoietic organ 

fibroblastoid cells in vitro. Acta Haematol 69:369. 
Chang VT, Anderson RN. 1971. Cultivation of mouse bone marrow cells: I. Growth of granulocytes. 

J Reticuloendoth Soc 9:568. 
Cicuttini FM, Martin M, Ashman L., et al. 1992. Support of human cord blood progenitor cells on human 

stromal cell lines transformed by SV 40 large T antigen under the influence of an inducible (met- 

allothionein) promoter. Blood 80:102. 
Degrassi A, Hilbert DM, Rudikoff S, etal. 1993. In vitro culture of primary plasmacytomas requires 

stromal cell feeder layers. Proc Nat Acad Sci (USA) 90:2060. 
DeLeeuw AM, McCarthy SP, Geerts A, et al. 1984. Purified rat liver fat-storing cells in culture divide and 

contain collagen. Hepatology 4:392. 
Dexter TM, Allen TD, Lajtha LG. 1977. Conditions controlling the proliferation of haematopoietic stem 

cells in vitro. J Cell Physiol 91:335. 
Douglas WHJ, Moorman GW, Teel RW. 1980. Visualization of cellular aggregates cultured on a three- 
dimensional collagen sponge matrix. In Vitro 16:306. 
Freshney RI, Hart E, Russell JM. 1982. Isolation and purification of cell cultures from human tumours. 

In E Reid, GMW Cook, DJ Moore (eds), Cancer Cell Organelles. Methodological Surveys: Bio- 
chemistry, pp 97-110, Chichester, England, Horwood Press. 
Friedenstein AJ, Chailakhyan RK, Gerasimov UV. 1970. Bone marrow osteogenic stem cells: In vitro 

cultivation and transplantation in diffusion chambers. Cell Tiss Kinet 20:263. 
Furchgott RF, Zawadzki JV 1980. The obligatory role of endothelial cells in the relaxation of arterial 

smooth muscle by actylcholine. Nature 286:373. 
Gallagher JT, Spooncer E, Dexter TM. 1982. Role of extracellular matrix in haemopoiesis: I. Synthesis of 

glycosaminoglycans by mouse bone marrow cultures. J Cell Sci 63:155. 
Gimble JM. 1990. The function of adipocytes in the bone marrow stroma. New Biol 2:304. 
Gimble JM, Dorheim MA, Cheng Q, et al. 1989. Response of bone marrow stromal cells to adipogenetic 

antagonists. Mol Cell Biol 9:4587. 
Greenberger JS. 1979. Corticosteroid-dependent differentiation of human marrow preadipocytes in vitro. 

In Vitro 15:823. 
Guguen-Guilluozo C, Clement B, Baffet G, et al. 1983. Maintenance and reversibility of active albumin 

secretion by adult rat hepatocytes co-cultured with another cell type. Exp Cell Res 143:47. 
Gupta P, Blazar BR, Gupta K, et al. 1998. Human CD34+ bone marrow cells regulate stromal production 

of interleukin-6 and granulocyte colony-stimulating factor and increase the colony-stimulating 

activity of stroma. Blood 91:3724. 



e 2000 by CRC Press LLC 



Huang S, Terstappen LWMM. 1992. Formation of haematopoietic microenvironment and haematopoietic 

stem cells from single human bone marrow cells. Nature 360:745. 
Keller D, Ou XX, Rour EF, et al. 1993. Interleukin-II inhibits adipogenesis and stimulates myelopoiesis 

in human long-term marrow cultures. Blood 82:1428. 
Keltz MD, Atton E, Buradagunta S, et al. 1996. Modulation of leukemia inhibitory factor gene expression 

and protein biosynthesis in human fallopian tube. Am J Gyecol Obstet 175:1611. 
Kuri-Harcuch W, Mendoza-Figueroa T. 1 989. Cultivation of adult rat hepatocytes on 3T3 cells: Expression 

of various liver differentiated functions. Differentiation 41:148. 
Leighton J. 1951. A sponge matrix method for tissue culture: Formation of organized aggregates of cells 

in vitro. J Natl Cancer Inst 12:545. 
Lictman MA. 1984. The relationship of stromal cells to hemopoietic cells in marrow. In DG Wright, JS 

Greenberger (eds), Long-Term Bone Marrow Culture, pp 3-30, New York, A.R. Liss. 
Lin CQ, Bissell MJ. 1993. Multi-faceted regulation of cell differentiation by extracellular matrix. FASEB J 

7:737. 
Maniatis A, Tavassoli M, Crosby WH. 1971. Factors affecting the conversion of yellow to red marrow. 

Blood 37:581. 
Mari BP, Anderson IC, Mari SE, et al. 1998. Stromelysin-3 is induced in tumor/stroma cocultures and 

inactivated via a tumor-specific and basic fibroblast growth factor-dependent mechanism. J Biol 

Chem 273:618. 
McCuskey RS, Meineke HA, Townsend SF. 1972. Studies of the hemopoietic microenvironment: I. 

Changes in the microvascular system and stroma during erythropoietic regeneration and suppres- 
sion in the spleens of CFj mice. Blood 5:697. 
Metcalf D. 1993. Hematopoietic growth factors. Redundancy or subtlety? Blood 82:3515. 
Michalopoulos G, Pitot HC. 1975. Primary culture of parenchymal liver cells on collagen membranes. 

Exp Cell Res 94:70. 
Moreau I, Duvert V, Caux C, et al. 1993. Myofibroblastic stromal cells isolated from human bone marrow 

induce the proliferation of both early myeloid and B-lymphoid cells. Blood 82:2396. 
Mrozek E, Anderson P, and Aligiuri MA. 1996. Role of interleukin-15 in the development of human 

CD56+ natural killer cells from CD34+ human progenitor cells. Blood 87:2632. 
Naughton BA, Kolks GA, Arce JM, et al. 1979. The regenerating liver: A site of erythropoiesis in the adult 

long-evans rat. Amer J Anat 156:159. 
Naughton BA, Naughton GK. 1989. Hematopoiesis on nylon mesh templates. Ann NY Acad Sci 554:125. 
Naughton BA, San Roman J, Sibanda B, et al. 1994. Stereotypic culture systems for liver and bone marrow: 

Evidence for the development of functional tissue in vitro and following implantation in vivo. 

Biotech Bioeng 43:810. 
Naughton BA, Sibanda B, San Roman J, et al. 1992. Differential effects of drugs upon hematopoiesis can 

be assessed in long-term bone marrow cultures established on nylon screens. Proc Soc Exp Biol 

Med 199:481. 
Owen ME. 1988. Marrow stromal stem cells. J Cell Sci 10:63. 
Palsson BO, Paek S-H, Schwartz RM, et al. 1993. Expansion of human bone marrow progenitor cells in 

a high cell density continuous perfusion system. Biotechnology 11:368. 
Peters C, O'Shea KS, Campbell AD, et al. 1990. Fetal expression of hemonectin: An extracellular matrix 

hematopoietic cytoadhesion molecule. Blood 75:357. 
Peterson TC, Renton KW. 1984. Depression of cytochrome P-450-dependent drug biotransformation in 

hepatocytes after the activation of the reticuloendothelial system by dextran sulfate. J Pharmacol 

Exp Ther 229:229. 
Puck TT, Marcus PI. 1955. A rapid method for viable cell titration and clone production with HeLa cells 

in tissue culture: the use of X-irradiated cells to supply conditioning factors. Proc Natl Aca Sci 

(USA) 41:432. 
Reid LM, Gaitmaitan Z, Arias I, et al. 1980. Long-term cultures of normal rat hepatocytes on liver 

biomatrix. Ann NY Acad Sci 349:70. 



e 2000 by CRC Press LLC 



Rios M, Williams DA. 1990. Systematic analysis of the ability of stromal cell lines derived from different 

murine adult tissues to support maintenance of hematopoietic stem cells in vitro. J Cell Physiol 145:434. 
Roberts R, Gallagher J, Spooncer E, et al. 1988. Heparan sulfate bound growth factors: A mechanism for 

stromal cell-mediated haemopoiesis. Nature 332:376. 
Ryan DH, Nuccie BL, Abboud CN, et al. 1991. Vascular cell adhesion molecule- 1 and the integrin VLA-4 

mediate adhesion of human B cell precursors to cultured bone marrow adherent cells. J Clin Invest 

88:995. 
Schrock LM, Judd JT, Meineke HA, et al. 1973. Differences in concentration of acid mucopolysaccharides 

between spleens of normal and polycythemic CF1 mice. Proc Soc Exp Biol Med 144:593. 
Siczkowski M, Amos As, Gordon MY. 1993. Hyaluronic acid regulates the function and distribution of 

sulfated glycosaminoglycans in bone marrow stromal cultures. Exp hematol 21:126. 
Simmons PJ, Masinovsky B, Longenecker BM, et al. 1992. Vascular cell adhesion molecule- 1 expressed 

by bone marrow stromal cells mediates the binding of hematopoietic progenitor cells. Blood 80:388. 
Simmons PJ, Torok-Storb B. 1991. CD 34 expression by stromal precursors in normal human adult bone 

marrow. Blood 78:2848. 
Strobel E-S, Gay RE, Greenberg PL. 1986. Characterization of the in vitro stromal microenvironment of 

human bone marrow. Int J Cell Cloning 4:341. 
Stamatoglou SC, Hughes RC. 1994. Cell adhesion molecules in liver function and pattern formation. 

FASEB J 8:420. 
Thonar EJ-MA, Kuttner KE. 1987. Biochemical basis of age-related changes in porteoglycans. In: TN 

Wight, RP Mecham (eds), Biology of Proteoglycans, pp 211-246, New York, Academic Press. 
Tjota A, Rossi Tm, Naughton BA. 1992. Stromal cells derived from spleen or bone marrow support the 

proliferation of rat natural killer cells in long-term culture. Proc Soc Exp Biol Med 200:431. 
Touw I, Lowenberg B. 1983. No simulative effect of adipocytes on hematopoiesis in long-term human 

bone marrow cultures. Blood 61:770. 
Trentin JJ. 1970. Influence of hematopoietic organ stroma (Hematopoietic inductive microenvironment) 

on stem cell differentiation. In AS Gordon (ed), Regulation of Hematopoiesis, New York, Appple- 

ton-Century-Crofts. 
Treon SP, Thomas P, Baron J. 1979. Lippopolysaccharide (LPS) processing by Kupffer cells releases a 

modified LPS with increased hepatocyte binding and decreased tumor necrosis-Ot stimulatory 

capacity. Proc Soc Exp Biol Med 202:153. 
Tsai S, Patel V Beaumont E, et al. 1987. Differential binding to erythroid and myeloid progenitors to 

fibroblasts and fibronectin. Blood 69:1587. 
Tsuji JM and Pollack SB. 1995. Maturation of murine natural killer cells in the absence of exogenous 

cytokines requires contact with bone marrow stroma. Nat Immun 14:44. 
Van Eyken P, Desmet VJ. 1993. Bile duct cells. In AV Le Bouton (ed), Molecular and Cell Biology of the 

Liver, pp 475-524, Boca Raton, Fla, CRC Press. 
Vaz F, Srour EF, Almeida-Porada G, et al. 1998. Human thymic stroma supports natural killer (NK) cell 

development from immature progenitors. Cell Immunol 186:133. 
Vrana JA, Stang MT, Grande JP, et al. 1996. Expression of tissue factor in tumor stroma correlates with 

progression to invasive human breast cancer: paracrine regulation by carcinoma cell-derived mem- 
bers of the transforming growth factor beta family. Cancer Res 56:5063. 
Wernert N. 1997. The multiple roles of tumor stroma. Virchows Arch 430:433. 
Whitlock CA, Witte ON. 1982. Long term culture of B lymphocytes and their precursors from murine 

bone marrow. Proc Nat Aca Sci (USA) 77:4756. 
Wight TN, Kinsella MG, Keating A, et al. 1986. Proteoglycans in human long-term bone marrow cultures: 

Biochemical and ultrastructural analysis. Blood 67:1333. 
Wilson D. 1983. The origin of the endothelium in the developing marginal vein of the chick wing-bud. 

Cell Differ 13:63. 
Wolf NS, Trentin JJ. 1968. Hematopoietic colony studies: V. Effects of hemopoietic organ stroma on 

differentiation of pluripotent stem cells. J Exp Med 127:205. 



e 2000 by CRC Press LLC 



Yanagisawa M, Jurihara HJ, Kimura S, et al. 1988. A novel potent vasoconstrictor peptide produced by 
vascular endothelial cells. Nature 332:411. 

Yang Y-C, Tsai S, Wong GG, et al. 1988. Interleukin-1 regulation of hematopoietic growth factor produc- 
tion by human stromal fibroblasts. J Cell Physiol 134:292. 

Zuckerman KS. 1984. Composition and function of the extracellular matrix in the stroma of long-term 
bone marrow cell cultures. In DG Wright, JS Greenberger (eds), Long-Term Bone Marrow Culture, 
pp 157-170, New York, A.R. Liss. 

Zuckerman KS, Rhodes RK, Goodrum DD, et al. 1985. Inhibition of collagen deposition in the extracel- 
lular matrix prevents the establishment of stroma supportive of hematopoiesis in long term murine 
bone marrow cultures. J Clin Invest 75:970. 

Further Information 

There are a number of different methodologies for isolating cells including gradient density centrifuga- 
tion, sedimentation at unit gravity, lectin agglutination, and reaction with specific antibodies followed 
by immunoselection via panning or immunomagnetic microspheres. These methods are described and 
illustrated well in volumes 1-5 of Cell Separation: Methods and Selected Applications, New York, Academic 
Press, 1987. For additional details concerning the relevance of ECM deposition to the development and 
functional expression of various types of tissue cells, the reader is referred to the serial reviews of this 
subject that appeared in The FASEB Journal from volume 7, number 9 (1993) to volume 8, number 4 
(1994). Information about the various cell types of the liver, their interaction with matrix components, 
and their mechanisms of gene expression can be found in Molecular and Cell Biology of the Liver, Boca 
Raton, Florida, CRC Press (1993). Similarly, for more details concerning bone marrow cells and the 
mechanisms of hematopoiesis, consult The Human Bone Marrow, volume 1, Boca Raton, Florida, CRC 
Press (1992). 
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The human body consumes a staggering 400 billion mature blood cells every day, and this number 
increases dramatically under conditions of stress such as infection or bleeding. A complex scheme of 
multilineage proliferation and differentiation, termed hematopoiesis (Greek for blood forming), has 
evolved to meet this demand. This regulated production of mature blood cells from immature stem cells, 
which occurs mainly in the bone marrow (BM) of adult mammals, has been the focus of considerable 
research effort. Ex vivo models of human hematopoiesis now exist that have significant scientific value 
and promise to have an impact on clinical practice in the near future. This endeavor is spread across 
many fields, including cell biology, molecular biology, bioengineering, and medicine. 

This chapter introduces the reader to the fundamental concepts of hematopoiesis, the clinical appli- 
cations which drive much of the effort to reconstitute hematopoiesis ex vivo, and the progress made to 
date toward achieving this goal. 

120.1 Biology of Hematopoiesis 

The Hematopoietic System: Function and Organization 

There are eight major types of mature blood cells which are found in the circulation (see Fig. 120.1). 
The blood cell population is divided into two major groups; the myeloid and lymphoid. The myeloid 
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MEGAKARYOCYTE 



FIGURE 120.1 The hematopoietic system hierarchy. Dividing pluripotent stem cells may undergo self-renewal to 
form daughter stem cells without loss of potential or may experience a concomitant differentiation to form daughter 
cells with more restricted potential. Continuous proliferation and differentiation along each lineage results in the 
production of many mature cells. This process is under the control of many growth factors (GFs) (See Table 120.1). 
The site of action of some of the better-studied GFs are shown. The mechanisms that determine which lineage a cell 
will develop into are not understood, although many models have been proposed. 

lineage includes erythrocytes (red blood cells), monocytes, the granulocytes (neutrophils, eosinophils, 
and basophils), and platelets (derived from noncirculating megakaryocytes). Thymus-derived (T) lym- 
phocytes and BM-derived (B) lymphocytes constitute the lymphoid lineage. Most mature blood cells 
exhibit a limited lifespan in vivo. Although some lymphocytes are thought to survive for many years, it 
has been shown that erythrocytes and neutrophils have lifespans of 120 days and 8 hours, respectively 
[Cronkite, 1988]. As a result, hematopoiesis is a highly prolific process which occurs throughout our 
lives to fulfill this demand. 

Mature cells are continuously produced from progenitor cells, which in turn are produced from earlier 
cells which originate from stem cells. There are many levels in this hierarchical system, which is usually 
diagrammed as shown in Fig. 120.1. At the left are the very primitive stem cells, the majority of which 
are in a nonproliferative state (G ) [Lajtha, 1979]. These cells are very rare (1 in 100,000 BM cells) but 
collectively have enough proliferative capacity to last several lifetimes [Boggs et al., 1982; Spangrude et al., 
1988]. Through some unknown mechanism, at any given time a small number of these cells are actively 
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proliferating, differentiating, and self-renewing, thereby producing more mature progenitor cells while 
maintaining the size of the stem cell pool. Whereas stem cells (by definition) are not restricted to any 
lineage, their progenitor cell progeny do have a restricted potential and are far greater in number. The 
restricted nature of these progenitors has led to a nomenclature which describes their potential outcome. 
Those that develop into erythrocytes are called colony-forming unit-erythrocyte (CFU-E, the term 
colony- forming unit relates to the biological assay which is used to measure progenitor cells). Similarly, 
progenitors which form granulocytes and macrophages are called CFU-GM. Therefore, as the cells 
differentiate and travel from left to right in Fig. 120.1, they become more numerous, lose self- renewal 
ability, lose proliferative potential, become restricted to a single lineage, and finally become a mature cell 
of a particular type. The biology of stem cells is discussed elsewhere in this Handbook (Chapter 116). 
The need for identification of the many cell types present in the hematopoietic system has led to many 
types of assays. Many of these are biologic assays (such as colony-forming assays), which are performed 
by culturing the cells and examining their progeny, both in number and type [Sutherland et al., 1991a]. 
Another example is the long-term culture-initiating cell (LTC-IC) assay which measures a very early cell 
type through 5-16 weeks of in vitro maintenance [Koller et al., 1998a]. In contrast to these biologic assays, 
which are destructive to the cells being measured, is the real-time technique of flow cytometry. Flow 
cytometry has been used extensively in the study of the hematopoietic system hierarchy. Antibodies to 
antigens on many of the cell types shown in Fig. 120.1 have been developed (see [Brott et al., 1995]). 
Because of the close relation of many of the cell types, often combinations of antigens are required to 
definitively identify a particular cell. Recently, much effort has been focused on the identification of 
primitive stem cells, and this has been accomplished by analyzing increasingly smaller subsets of cells using 
increasingly complex antibody combinations. The first such antigen that was found in CD34, which 
identifies all cells from the stem through progenitor stage (typically about 2% of BM mononuclear cells 
(MNC), see Fig. 120.2) [Civin et al., 1984]. The CD34 antigen is stage-specific but not lineage-specific and 
therefore identifies cells that lead to repopulation of all cell lineages in transplant patients [Berenson et al., 
1991]. However, the CD34 antigen is not restricted to hematopoietic cells because it is also found on 
certain stromal cells in the hematopoietic microenvironment (see Chapters 118 and 119). Although CD34 
captures a small population which contains stem cells, this cell population is itself quite heterogeneous 
and can be fractionated by many other antigens. Over the past several years, many different combinations 
of antibodies have been used to fractionate the CD34 + population. CD34+ fractions which lack CD33, 
HLA-DR, CD38, or CD71 appear to be enriched in stem cells [Civin & Gore, 1993]. Conversely, CD34 + 
populations which coexpress Thy-1 or c-kit appear to contain the primitive cells [Civin & Gore, 1993]. 
These studies have revealed the extreme rarity of stem cells within the heterogeneous BM population (see 
Fig. 120.2). Of the mononuclear cell (MNC) subset (-40% of whole BM), only -2% are CD34+, and of 
those, only ~5% may be CD38 + . Furthermore, this extremely rare population is still heterogeneous with 
respect to stem cell content. Consequently, stem cells as single cells have not yet been identified. 

Molecular Control of Hematopoiesis: The Hematopoietic 
Growth Factors 

A large number of hematopoietic growth factors (GFs) regulate both the production and functional 
activity of hematopoietic cells. The earliest to be discovered were the colony-stimulating factors (CSFs, 
because of their activity in the colony-forming assay), which include interleukin-3 (IL-3), granulocyte- 
macrophage (GM)-CSF, granulocyte (G)-CSF, and monocyte (M)-CSF. These GFs, along with erythro- 
poeitin, have been relatively well characterized because of their obvious effects on mature cell production 
and/or activation. The target cells of some of the better- studied GFs are shown in Fig. 120.1. Subsequent 
intensive research continues to add to the growing list of GFs that affect hematopoietic cell proliferation, 
differentiation, and function (Table 120.1). However, new GFs have been more difficult to find and 
characterize because their effects are more subtle, often providing a synergistic effect which potentiates 
other known GFs. In addition, there appears to be a significant amount of redundancy and pleotropy in 
this GF network, which makes the discovery of new GFs difficult [Metcalf, 1993]. In fact, more recent 
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FIGURE 120.2 Relative frequency of the different cell subsets within the BM population. A BM aspirate typically 
contains 99% mature erythroid cells (mostly from blood contamination), and therefore usually only the nucleated 
cell fraction is studied. Simple density gradient centrifugation techniques, which remove most of the mature eryth- 
rocytes and granulocytes, yield what is known as the mononuclear cell (MNC) fraction (about 40% of the nucleated 
cells). CD34+ cells, about 2% of MNC, can be isolated by a variety of methods to capture the primitive cells, although 
the population is quite heterogeneous. The most primitive cells are found in subsets of CD34 + cells (e.g., CD38 "), 
which identify about 5% of the CD34 + population. These rare subsets can be obtained by flow cytometry but are 
still somewhat heterogeneous with respect to stem cell content. Consequently, although methods are available to 
fractionate BM to a great extent, individual stem cells have not yet been identified. This diagram conveys the 
heterogeneous nature of different BM populations as well as the incredible rarity of the primitive cell subset, which 
is known to contain the stem cells. 

discoveries have focused on potential receptor molecules on the target cell surface, which then have been 
used to isolate the appropriate ligand GF. Examples of such recently discovered GFs which exhibit 
synergistic interactions with other GFs to act on primitive cells include c-kit ligand and flt-3 ligand. 
Another example is thrombopoietin, a stimulator of platelet production, a factor whose activity was 
described over 30 years ago but was cloned only recently [De Sauvage et al., 1994]. These and other GFs 
that act on primitive cells are the subject of intense study because of their potential scientific and 
commercial value. Already, several of the GFs have been developed into enormously successful pharma- 
ceuticals, used in patients who have blood cell production deficiencies due to many different causes (see 
below). 
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TABLE 120.1 Hematopoietic Growth Factors 



Growth Factor Name 


Other Names 


Abbreviations 


Reference 


Interleukin- 1 


Hemopoietin-1 


IL-1 


Dinarello et al., 1981 


Interleukin-2 




IL-2 


Smith, 1988 


Interleukin-3 


Multicolony-stimulating factor 


IL-3, Multi-CSF 


Ihleetal., 1981 


Interleukin-4 


B-cell-stimulatory factor- 1 


IL-4, BSF-1 


Yokotaetal., 1988 


Interleukin-5 




IL-5 


Yokota et al., 1988 


Interleukin-6 


B-cell-stimulatory factor-2 


IL-6, BSF-2 


Kishimoto, 1989 


Interleukin-7 




IL-7 


Tushinski et al, 1991 


Interleukin-8 


Neutrophil activating peptide- 1 


IL-8, NAP-1 


Herbert and Baker, 1993 


Interleukin-9 




IL-9 


Donahue et al., 1990 


Interleukin- 10 


Cytokine synthesis inhibitory factor 


IL-10, CS1F 


Zlotnik and Moore,1991 


Interleukin- 1 1 




IL-11 


Du and Williams, 1994 


Interleukin- 12 


NK cell stimulatory factor 


IL-12, NKSF 


Wolf etal., 1991 


Interleukin- 13 




IL-13 


Mintyetal., 1993 


Interleukin- 14 


High molecular weight B cell growth factor 


IL-14, HMW-BCGF 


Ambrus Jr. et al., 1993 


Interleukin- 15 




IL-1 5 


Grabstein et al., 1994 


Interleukin- 16 


Lymphocyte chemoattractant factor 


IL-16; LCF 


Center et al, 1997 


Interleukein- 1 7 




IL-1 7 


Yao et al., 1995 


Interleukin- 18 


IFN-gamma-inducing factor 


1L-18; IGIF 


Ushio et al., 1996 


Erythropoietin 




Epo 


Krantz, 1991 


Monocyte-CSF 


Colony-stimulating factor- 1 


M-CSF, CSF-1 


Metcalf, 1985 


Granulocyte-CSF 




G-CSF 


Metcalf, 1985 


Granulocyte-macrophage-CF 




GM-CSF 


Metcalf, 1985 


Stem cell factor 


c-kit ligand, Mast cell growth factor 


SCF, KL, MGF 


Zseboetal., 1990 


Interferon-gamma 


Macrophage activating factor 


IFN-y; MAF 


Virelizier et al., 1985 


Macrophage inflammatory 


Stem cell inhibitor 


MIP-1, SCI 


Graham et al., 1990 


protein- 1 








Leukemia inhibitory factor 




LIF 


Metcalf, 1991 


Transforming growth factor- 




TGF-P 


Sporn and Roberts, 1989 


beta 








Tumor necrosis factor-alpha 


Cachetin 


TNF-a 


Pennica et al., 1984 


jlk-2 ligand 


flk-3 ligand 


Fl 


Lyman et al, 1994 


Thrombopoietin 


c-mpl ligand; Megakaryocyte growth and 
development factor 


Tpo, ML; MGDF 


de Sauvage et al., 1994 



The Bone Marrow Microenvironment 

Hematopoiesis occurs in the BM cavity in the presence of many accessory and support cells (or stromal 
cells). In addition, like all other cells in vivo, hematopoietic cells have considerable interaction with the 
extracellular matrix (ECM). These are the chief elements of what is known as the BM microenvironment. 
Further details on the function of stroma and the microenvironment, and their importance in tissue 
engineering, are found in Chapters 118 and 119. 

Bone Marrow Stromal Cells 

Due to the physiology of marrow, hematopoietic cells have a close structural and functional relationship 
with stromal cells. Marrow stroma includes fibroblasts, macrophages, endothelial cells, and adipocytes. 
The ratio of these different cell types varies at different places in the marrow and as the cells are cultured 
in vitro. The term stromal layer therefore refers to an undefined mixture of different adherent cell types 
which grow out from a culture of BM cells. In vitro, stem cells placed on a stromal cell layer will attach 
to and often migrate underneath the stromal layer [Yamakazi et al., 1989]. Under the stromal layer, some 
of the stem cells will proliferate, and the resulting progeny will be packed together, trapped under the 
stroma, forming a characteristic morphologic feature known as a cobblestone area (see below). It is 
widely believed that primitive cells must be in contact with stromal cells to maintain their primitive state. 
However, much of the effect of stromal cells has been attributed to the secretion of GFs. Consequently, 
there have been reports of successful hematopoietic cell growth with the addition of numerous soluble 
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GFs in the absence of stroma [Bodine et al, 1989; Brandt et al, 1990,1992; Haylock et al., 1992; Koller 
et al., 1992a; Verfaillie, 1992]. However, this issue is quite controversial, and stromal cells are still likely 
to be valuable because they synthesize membrane-bound GFs [Toksoz et al., 1992], ECM components 
[Long, 1992], and probably some as yet undiscovered cytokines. In addition, stromal cells can modulate 
the GF environment in a way that would be very difficult to duplicate by simply adding soluble GFs 
[Koller et al., 1995]. This modulation may be responsible for the observations that stroma can be both 
stimulatory and inhibitory [Zipori, 1989]. 

The Extracellular Matrix 

The ECM of BM consists of collagens, laminin, fibronectin [Zuckerman & Wicha, 1983], victronectin 
[Coulombelet al., 1988],hemonectin [Campbell et al., 1987], and thrombospondin [Long&Dixit, 1990]. 
The heterogeneity of this system is further complicated by the presence of various proteoglycans, which 
are themselves complex molecules with numerous glycosaminoglycan chains linked to a protein core 
[Minguell & Tavassoli, 1989; Spooncer et al., 1983; Wight et al., 1986]. These glycosaminoglycans include 
chondroitin, heparan, dermatan, and keratan sulfates and hyaluronic acid. The ECM is secreted by stromal 
cells of the BM (particularly endothelial cells and fibroblasts) and provides support and cohesion for the 
marrow structure. 

There is a growing body of evidence indicating that ECM is important for the regulation of hemato- 
poiesis. Studies have shown that different glycosaminoglycans bind and present different GFs to hemato- 
poietic cells in an active form [Gordon et al., 1987; Roberts et al., 1988]. This demonstrates that ECM 
can sequester and compartmentalize certain GFs in local areas and present them to hematopoietic cells, 
creating a number of different hematopoietically inductive microenvironments. Another important ECM 
function is to provide anchorage for immature hematopoietic cells. Erythroid precursors have receptors 
which allow them to attach to fibronectin. As cells mature through the BFU-E to the reticulocyte stage, 
adherence to fibronectin is gradually lost, and the cells are free to enter the circulation [Patel et al., 1985]. 
It has also been shown that binding to fibronectin renders these erythroid precursors more responsive 
to the effects of Epo [Weinstein et al., 1989]. Another adhesion protein termed hemonectin has been 
shown to selectively bind immature cells of the granulocyte lineage in an analogous fashion [Campbell 
et al., 1987]. This progenitor binding by ECM has led to the general concept for stem cell homing. When 
BM cells are injected into the circulation of an animal, a sufficient number are able to home to the 
marrow and reconstitute hematopoiesis [Tavassoli & Hardy, 1990]. It is therefore likely that homing 
molecules are present on the surface of the primitive cells. Studies suggest that lectins and CD44 on 
progenitor cells may interact with ECM and stromal elements of the marrow to mediate cellular homing 
[Aizawa & Tavassoli, 1987; Kansas et al., 1990; Lewinsohn et al., 1990; Tavassoli and Hardy, 1990]. These 
concepts have been reviewed in detail [Long, 1992]. 

120.2 Applications of Reconstituted Ex Vivo Hematopoiesis 

The hematopoietic system, as described above, has many complex and interacting features. The recon- 
stitution of functional hematopoiesis, which has long been desired, must address these features to achieve 
a truly representative ex vivo system. A functioning ex vivo human hematopoietic system would be a 
valuable analytic model to study the basic biology of hematopoiesis. The clinical applications of functional 
ex vivo models of human hematopoiesis are numerous and are just beginning to be realized. Most of 
these applications revolve around cancer therapies and, more recently, gene therapy. The large-scale 
production of mature cells for transplantation represents an important goal that may be realized in the 
more distant future. 

Bone Marrow Transplantation 

In 1980, when bone marrow transplantation (BMT) was still an experimental procedure, fewer than 
200 BMTs were performed worldwide. Over the past decade, BMT has become an established therapy 
for many diseases. In 1996, over 40,000 BMTs were performed, primarily in the United States and Western 
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Europe, for more than a dozen different clinical indications [Horowitz and Rowling, 1997]. The number 
of BMTs performed annually is increasing at a rate of 20-30% per year and is expected to continue to 
rise in the foreseeable future. 

BMT is required as a treatment in a number of clinical settings because the highly prolific cells of the 
hematopoietic system are sensitive to many of the agents used to treat cancer patients. Chemotherapy and 
radiation therapy usually target rapidly cycling cells, so hematopoietic cells are ablated along with the 
cancer cells. Consequently, patients undergoing these therapies experience neutropenia (low neutrophil 
numbers), thrombocytopenia (low platelet numbers), and anemia (low red blood cell numbers), rendering 
them susceptible to infections and bleeding. A BMT dramatically shortens the period of neutropenia and 
thromobocytopenia, but the patient may require repeated transfusions. The period during which the 
patient is neutropenic represents the greatest risk associated with BMT. In addition, some patients do not 
achieve engraftment (when cell numbers rise to safe levels). As a result, much effort is focused on reducing 
the severity and duration of the blood cell nadir period following chemotherapy and radiation therapy. 

There are several sources of hematopoietic cells for transplantation. BMT may be performed with 
patient marrow (autologous) that has been removed and cryopreserved prior to administration of 
chemotherapy or with donor marrow (allogeneic). The numbers of autologous transplants outnumber 
allogenic transplants by a 2:1 ratio, but there are advantages and disadvantages with both techniques. 

Autologous Bone Marrow Transplantation 

Autologous BMTs have been used in the treatment of a variety of diseases including acute lymphoblastic 
leukemia (ALL), acute myelogenous leukemia (AML), chronic myelogenous leukemia (CML), various 
lymphomas, breast cancer, neuroblastoma, and multiple myeloma. Currently, autologous BM transplan- 
tation is hampered by the long hospital stay that is required until engraftment is achieved and the 
possibility of reintroducing tumor cells along with the cryopreserved marrow. In fact, retroviral marking 
studies have proven that tumor cells reinfused in the transplant can contribute to disease relapse in the 
patient [Rill et al., 1994; Deisseroth et al., 1994]. Long periods of neutropenia, anemia, and thrombocy- 
topenia require parenteral antibiotic administration and repeated blood component transfusions. 

Autologous transplantation could also be used in gene therapy procedures. The basic concept under- 
lying gene therapy of the hematopoietic system is the insertion of a therapeutic gene into the hemato- 
poietic stem cell, so that a stable transfection is obtained. Engineered retroviruses are the gene carriers 
currently used. However, mitosis of primitive cells is required for integration of foreign DNA [Bodine 
et al., 1989]. A culture system which contains dividing stem cells is therefore critical for the enablement 
of retroviral-based gene therapy of the hematopoietic system. This requirement holds true whether or 
not the target stem cell population has been purified prior to the transfection step. To date, only very 
limited success has been achieved with retroviral transfection of human BM cells, whereas murine cells 
are routinely transfected. A comprehensive and accessible accounting of the status of gene therapy has 
been presented elsewhere [Mulligan, 1993]. 

Allogeneic Bone Marrow Transplantation 

In patients with certain hematologic malignancies, or with genetic defects in the hematopoietic popula- 
tion, allogeneic transplants are currently favored when suitable matched donors are available. With these 
leukemias, such as CML, it is likely that the patient's marrow is diseased and would not be suitable for 
autotransplant. A major obstacle in allogeneic transplantation, however, is the high incidence of graft- 
versus-host disease (GVHD), in which the transplanted immune cells attack the host's tissues as foreign. 

Alternative Sources of Hematopoietic Cells for Transplantation 

Although hematopoiesis occurs mainly in the BM of adult mammals, during embryonic development, 
pluripotent stem cells first arise in the yolk sac, are later found in the fetal liver, and at the time of delivery 
are found in high concentrations in umbilical cord blood. In adults, stem cells are found in peripheral 
blood only at very low concentrations, but the concentration increases dramatically after stem cell 
mobilization. Mobilization of stem cells into peripheral blood is a phenomenon that occurs in response 
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to chemotherapy or GF administration. Therefore, hematopoietic stem cells can be collected from cord 
blood [Gluckman et al., 1989; Wagner et al., 1992] or from mobilized peripheral blood [Schneider et al., 
1992] as well as from BM. Disadvantages of cord blood are the limited number of cells that can be 
obtained from the one individual and the question of whether this amount is sufficient to repopulate an 
adult patient. Mobilized peripheral blood results in more rapid patient engraftment than BM, and as a 
consequence, its use is becoming more prevalent, particularly in the autologous setting. 

Tissue Engineering and Improved Transplantation Procedures 

BMT would be greatly facilitated by reliable systems and procedures for ex vivo stem cell maintenance, 
expansion, and manipulation. For example, the harvest procedure, which collects 1-2 liters of marrow, 
is currently a painful and involved operating room procedure. The complications and discomfort of 
marrow donation are not trivial and can affect donors for a month or more [Stroncek et al., 1993]. 
Through cell expansion techniques, a small marrow specimen taken under local anesthesia in an outpa- 
tient setting could be expanded into the large number of cells required for transplant, thereby eliminating 
the large harvest procedure. Engraftment may be accelerated by increasing the numbers of progenitors 
and immature cells available for infusion. In addition, it may be possible to cryopreserve expanded cells 
to be infused at multiple time points, thereby allowing multiple cycles of chemotherapy (schedule 
intensification). Finally, the use of expanded cells may allow increasing doses of chemotherapy (dose 
intensification), facilitating tumor reduction while ameliorating myeloablative side effects. 

The expansion of alternative hematopoietic cell sources would also facilitate transplant procedures. 
For example, multiple rounds of apheresis, each requiring ~4 hours, are required to collect enough 
mobilized peripheral blood cells for transplant. Expansion of a small amount of mobilized peripheral 
blood may reduce the number of aphereses required or eliminate them altogether by allowing the 
collection of enough cells from a volume of blood that has not been apheresed. With cord blood, there 
is a limit on the number of cells that can be collected from a single donor, and it is currently thought 
that this number is inadequate for an adult transplant. Consequently, cord blood transplants to date have 
been performed on children. Expansion of cord blood cells may therefore enable adult transplants from 
the limited number of cord blood cells available for collection. 

Large-Scale Production of Mature Blood Cells 

Beyond the ability to produce stem and progenitor cells for transplantation purposes lies the promise to 
produce large quantities of mature blood cells. Large-scale hematopoietic cultures could potentially 
provide several types of clinically important mature blood cells. These include red blood cells, platelets, 
and granulocytes. About 12 million units of red blood cells are transfused in the United States every year, 
the majority of them during elective surgery and the rest in acute situations. About 4 million units of 
platelets are transfused every year into patients who have difficulty exhibiting normal blood clotting. 
Mature granulocytes, which constitute a relatively low-usage market of only a few thousand units admin- 
istered each year, are involved in combating infections. This need arises in situations when a patient's 
immune system has been compromised and requires assistance in combating opportunistic infections, 
such as during chemotherapy and the healing of burn wounds. 

All in all, the market for these blood cells totals about $1-1.5 billion in the United States annually, 
with a worldwide market that is about three to four times larger. The ability to produce blood cells on 
demand ex vivo would alleviate several problems with the current blood cell supply. The first of these 
problems is the availability and stability of the blood cell supply. The availability of donors has tradi- 
tionally been a problem, and, coupled with the short shelf-life of blood cells, the current supply is unstable 
and cannot meet major changes in demand. The second problem is the usual blood-type compatibility 
problem resulting in shortages of certain types at various times. The third problem is the safety of the 
blood supply. This last issue has received much attention recently due to the contamination of donated 
blood with the human immunodeficiency virus (HIV). However, the three forms of hepatitis currently 
pose an even more serious viral contamination threat to the blood supply. 
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Unlike ex vivo expansion of stem and progenitor cells for transplantation, the large-scale production 
of fully mature blood cells for routine clinical use is less developed and represents a more distant goal. 
The large market would require systems of immense size, unless major improvements in culture produc- 
tivity are attained. For example, the recent discovery of thrombopoietin [De Sauvage et al„ 1994] may 
make the large-scale production of platelets feasible. At present, there are ongoing attempts in several 
laboratories to produce large numbers of neutrophils from CD34-selected cell populations. Thus, large- 
scale production of mature cells may soon become technically feasible, although the economic consid- 
erations are still unknown. 

120.3 The History of Hematopoietic Cell 
Culture Development 

As outlined above, there are many compelling scientific and clinical reasons for undertaking the devel- 
opment of efficient ex vivo hematopoietic systems. Such achievement requires the use of in vivo mimicry, 
sophisticated cell culture technology, and the development of clinically acceptable cell cultivation devices. 
The foundations for these developments lie in the BM cell culture methods which have been developed 
over the past 20 years. The history of BM culture will therefore be described briefly, as it provides the 
backdrop for tissue engineering of the hematopoietic system. More complete reviews have been published 
previously [Dexter et al., 1984; Eaves et al., 1991; Greenberger, 1984]. 

The Murine System 

In the mid 1970s, Dexter and coworkers were successful in developing a culture system in which murine 
hematopoiesis could be maintained for several months [Dexter et al., 1977]. The key feature of this system 
was the establishment of a BM-derived stromal layer during the first three weeks of culture which was 
then recharged with fresh BM cells. One to 2 weeks after the cultures were recharged, active sites of 
hematopoiesis appeared. These sites are often described as cobblestone regions, which are the result of 
primitive cell proliferation (and accumulation) underneath the stromal layer. Traditionally, the cultures 
are fed by replacement of one-half of the medium either once or twice weekly. In these so-called Dexter 
cultures, myelopoiesis proceeds to the exclusion of lymphopoiesis. The selection of a proper lot of serum 
for long-term BM cultures (LTBMCs) was found to be very important. In fact, when using select lots of 
serum, one-step LTBMCs were successfully performed without the recharging step at week 3 [Dexter 
et al., 1984], It is thought that good serum allows rapid development of stroma before the primitive cells 
are depleted, and once the stroma is developed, the culture is maintained from the remaining original 
primitive cells without need for recharging. The importance of stroma has often been demonstrated in 
these Dexter cultures, because the culture outcome was often correlated with stromal development. 

The Human System 

The adaptation of one-step LTBMC for human cells was first reported in 1980 [Gartner & Kaplan, 1980]. 
A mixture of fetal bovine serum and horse serum was found to be required for human LTBMC, and a 
number of other medium additives such as sodium pyruvate, amino acids, vitamins, and antioxidants 
were found to be beneficial [Gartner & Kaplan, 1980; Greenberg et al., 1981; Meagher et al., 1988]. 
Otherwise, the culture protocol has remained essentially the same as that used for murine culture. 
Unfortunately, human LTBMCs have never attained the productivity or longevity which is observed in 
cultures of other species [Dexter et al., 1984; Greenberger et al., 1986]. The exponentially decreasing 
numbers of total and progenitor cells with time in human LTBMC [Eastment & Ruscetti, 1984; Eaves 
et al., 1991] renders the cultures unsuitable for cell expansion and indicates that primitive stem cells are 
lost over time. The discovery of hematopoietic GFs was an important development in human hemato- 
poietic cell culture, because addition of GFs to human LTBMC greatly enhanced cell output. However, 
GFs did not prolong the longevity of the cultures, indicating that primitive cell maintenance was not 
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improved [Coutinho et al., 1990; Lemoli et al., 1992]. Furthermore, although the total number of pro- 
genitors obtained was increased by GF's, it was still less than the number used to initiate the culture. 
Therefore, a net expansion in progenitor cell numbers was not obtained. The increased cell densities that 
were stimulated by GF addition were not well supported by the relatively static culture conditions. 

The disappointing results from human LTBMCs led to the development of other culture strategies. 
The development of an increasing number of recombinant GFs was soon joined by the discovery of the 
CD34 antigen (see above). As protocols for the selection of CD34+ cells became available, it was thought 
that the low cell numbers generated by enrichment could be expanded in GF-supplemented cultures 
without the impediment of numerous mature cells in the system. Because the enrichment procedure 
results in a cell population depleted of stromal cells, CD34+ cell cultures are often called suspension 
cultures, due to the lack of an adherent stromal layer. A number of groups have reported experiments 
in which CD34 + cells were incubated with high doses of up to seven recombinant GFs in suspension 
culture [Brandt et al., 1992; Haylock et al. 1992]. Although 500-1000-fold cell expansion numbers are 
often obtained, the magnitude of CFU-GM expansion is usually less than 10-fold, suggesting that dif- 
ferentiation, accompanied by depletion of primitive cells, is occurring in these systems. In fact, when 
LTC-IC have been measured, the numbers obtained after static culture of enriched cells have always been 
significantly below the input value [Sutherland et al., 1991 fo, 1993; Verfaillie, 1992]. A further consider- 
ation in CD34 + cell culture is the loss of cells during the enrichment procedure. It is not uncommon to 
experience 70-80% loss of progenitors with most CD34+ cell purification protocols [Traycoff et al., 1994], 
and this can be very significant when trying to maximize the final cell number obtained (such as in 
clinical applications, see below). Nevertheless, cultures of purified CD34 + cells, and especially the smaller 
subsets (e.g., CD33 , CD38), have yielded valuable information on the biology of hematopoietic stem 
cells. 

An alternative approach has also been taken to improve human hematopoietic cell culture. Most of 
these advances have come from the realization that traditional culture protocols are highly nonphysiologic 
and that these deficiencies can be corrected by in vivo mimicry. Therefore, these techniques do not involve 
cell purification or high-dose cytokine stimulation. Because these cultures attain fairly high densities, it 
was thought that the tradition of changing one-half of the culture medium either once or twice weekly 
was inadequate. When Dexter-type cultures were performed with more frequent medium exchanges, 
progenitor cell production was supported for at least 20 weeks [Schwartz et al., 1991 b]. This increase in 
culture longevity indicates that primitive cells were maintained for a longer period, and this was accom- 
panied by an increase in progenitor cell yield. Although the precise mechanisms of increased medium 
exchange are unknown, the increased feeding rate enhances stromal cell secretion of GFs [Caldwell et al., 
1991]. 

As previously noted, recombinant GFs can significantly improve culture productivity, but GF-stimu- 
lated cell proliferation exacerbates the problems of nutrient depletion because cell proliferation and the 
consumption of metabolites increases many-fold. Therefore, increased feeding protocols also benefit GF- 
stimulated cultures. Cultures supplemented with IL-3/GM-CSF/Epo and fed with 50% daily medium 
exchanges were found to result in significant cell and progenitor expansion while maintaining culture 
longevity [Schwartz et al., 1991a]. Optimization of these manually fed cultures has been published for 
both BM [Koller et al, 1996] and cord blood [Roller et al., 1998b]. 

Tissue Engineering Challenges 

Although increased manual feeding significantly enhanced the productivity and longevity of hematopoi- 
etic cultures, the labor required to feed each culture is a daunting task. In addition, the cultures are 
subjected to physical disruption and large discontinuous changes in culture conditions and may be 
exposed to contamination at each feeding. Thee complications frustrate the optimization of the culture 
environment for production of hematopoietic cells and limit the clinical usefulness of the cultures. A 
perfusion system, if properly designed and constructed, would eliminate many of the problems currently 
associated with these cultures. 
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The success of this manual frequently fed culture approach led to development of continuously 
perfused bioreactors for human cord blood [Roller et al., 1993a; Koller et al., 1998b], BM [Palsson et al., 
1993], and mobilized peripheral blood [Sandstrom et al, 1995] cell culture. Human BM MNC cultures 
have been performed in spinner flasks in a fed-batch mode as well [Zandstra et al., 1994]. Slow single- 
pass medium perfusion and internal oxygenation have given the best results to date, yielding cell densities 
in excess of 10 7 per ml accompanied by significant progenitor and primitive cell expansion [Koller et al., 
1993b]. These systems have also been amenable to scale-up, first by a factor often, and then by a further 
factor of7. 5 [Koller etal., 1998b]. When an appropriate culture substrate is provided [Koller etal., 1998c], 
perfusion bioreactors support the development and maintenance of accessory cell populations, resulting 
in significant endogenous growth factor production which likely contributes to culture success [Koller 
et al., 1995a; Koller et al., 1997]. Importantly, stromal-containing cultures appear to generate greater 
numbers of primitive cells from a smaller initial cell sample, as measured by in vitro [Koller et al., 1995b; 
Sandstrom et al., 1995] and in vivo assays [Knobel et al., 1994], as compared with CD34-enriched cell 
cultures. 

120.4 Challenges for Scale-Up 

To gauge the scale-up challenges, one first needs to state the requirements for clinically useful systems. 
The need to accommodate stroma that supports active hematopoiesis represents perhaps the most 
important consideration for the selection of a bioreactor system. 

Bioreactors and Stroma 

If stroma is required, the choices are limited to systems that can support the growth of adherent cells. 
This requirement may be eliminated in future systems if the precise GF requirements become known 
and if the microenvironment that the stroma provides is not needed for hematopoietic stem and pro- 
genitor cell expansion. 

Currently, there are at least three culture systems that may be used for adherent cell growth. Fluidized 
bed bioreactors with macroporous bead carriers provide one option. Undoubtedly, significant effort will 
be required to develop the suitable bead chemistry and geometry, since the currently available systems 
are designed for homogeneous cell cultures. Beads for hematopoietic culture probably should allow for 
the formation of functional colonies comprised of a mixed cell population within each bead. Cell sampling 
from fluidized beds would be relatively easy, but final cell harvesting may require stressful procedures, 
such as prolonged treatment with collagenase and/or trypsin. 

Flatbed bioreactors are a second type of system which can support stromal development. Such units 
can be readily designed to carry the required cell number, and, further, they can allow for direct micro- 
scopic observation of the cell culture. Flatbed bioreactors provide perhaps the most straightforward scale- 
up and automation of LTBMC. In fact, such automated systems have been developed and used in human 
clinical trials for the treatment of cancer [Mandalam et al., 1998]. 

Finally, membrane-based systems, such as hollow fiber units, could be used to carry out hematopoietic 
cell cultures of moderate size. Special design of the hollow fiber bed geometry with respect to axial length 
and radial fiberspacing should eliminate all undesirable spatial gradients. Such units have been made 
already and have proved effective for their use for in vivo NMR analysis of metabolic behavior of 
homogenous cell cultures [Mancuso et al., 1990] . However, hematopoietic cell observation and harvesting 
may prove to be troublesome with this approach, as one report has suggested [Sardonini & Wu, 1993]. 

It is possible that the function of accessory cells may be obtained by the use of spheroids without 
classical adherent cell growth. This approach to the growth of liver cells in culture has meet with some 
success (see Chapter 121). In fact, there have been reports of functional heterogeneous cell aggregates 
within BM aspirates [Blazseket al., 1990; Funket al., 1994]. If successfully developed, suspension cultures 
containing these aggregates could be carried out in a variety of devices, including the rotating wall vessels 
that have been developed by NASA [Schwarz et al., 1992]. 
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Alternatives 

The precise arrangement of future large-scale systems will be significantly influenced by continuing 
advances in the understanding of the molecular and microenvironmental regulation of the hematopoietic 
process. Currently, the proximity of a supporting stromal layer is believed to be important. It is thought 
to function through the provision of both soluble and membrane-bound GFs and by providing a suitable 
microenvironment. The characterization of the microenvironment is uncertain at present, but its chem- 
istry and local geometry are both thought to play a role. If the proximity of stroma is found to be 
unimportant, one could possibly design culture systems in which the stroma and BM cells are separated 
[ Verfaillie, 1 992 ] , resulting in the ability to control each function separately. Finally, if the GF requirements 
can be defined and artificially supplied, and the stromal microenvironment is found to be unimportant, 
large-scale hematopoietic suspension cultures would become possible. 

Production of Mature Cells 

Culture systems for generic allogeneic BMT, or for the large-scale production of mature cells, will pose 
more serious scale-up challenges. The number of cells required for these applications, in particular the 
latter, may be significantly higher than that for autologous BMT. Of the three alternatives discussed above, 
the fluidized bed system is the most readily scalable. The flatbed systems can be scaled by a simple stacking 
approach, whereas hollow fiber units are known for their shortcomings with respect to large-scale use. 

120.5 Recapitulation 

Rapid advances in our understanding of hematopoietic cell biology and the molecular control of hemato- 
poietic cell replication, differentiation, and apoptosis are providing some of the basic information that 
is needed to reconstitute human hematopoiesis ex vivo. Compelling clinical applications provide a 
significant impetus for developing systems that produce clinically useful cell populations in clinically 
meaningful numbers. Use of in vivo mimicry and the bioreactor technologies that were developed in the 
1980s are leading to the development of perfusion-based bioreactor systems that will meet some of the 
clinical needs. Further tissue engineering of human hematopoiesis is likely to continue to grow in scope 
and sophistication and lead to definition of basic structure-function relationships and the enablement 
of many needed clinical procedures. 

Defining Terms 

Colony-forming assay: Assay carried out in semisolid medium under GF stimulation. Progenitor cells 

divide, and progeny are held in place so that a microscopically identifiable colony results after 

2 weeks. 
Differentiation: The irreversible progression of a cell or cell population to a more mature state. 
Engraftment: The attainment of a safe number of circulating mature blood cells after a BMT. 
Flow cytometry: Technique for cell analysis using fluorescently conjugated monoclonal antibodies 

which identify certain cell types. More sophisticated instruments are capable of sorting cells into 

different populations as they are analyzed. 
Graft-versus-host disease: The immunologic response of transplanted cells against the tissue of their 

new host. This response is often a severe consequence of allogeneic BMT and can lead to death 

(acute GVHD) or long-term disability (chronic GVHD). 
Hematopoiesis: The regulated production of mature blood cells through a scheme of multilineage 

proliferation and differentiation. 
Lineage: Refers to cells at all stages of differentiation leading to a particular mature cell type, i.e., one 

branch on the lineage diagram shown in Fig. 120.1. 
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Long-term culture-initiating cell: Cell that is measured by a 7-18 week in vitro assay. LTC-IC are 

thought to be very primitive, and the population contains stem cells. However, the population is 

heterogeneous, so not every LTC-IC is a stem cell. 
Miroenvironment: Refers to the environment surrounding a given cell in vivo. 
Mononuclear cell: Refers to the cell population obtained after density centrifugation of whole BM. 

This population excludes cells without a nucleus (erythrocytes) and polymorphonuclear cells 

(granulocytes). 
Progenitor cells: Cells that are intermediate in the development pathway, more mature than stem cells 

but not yet mature cells. This is the cell type measured in the colony-forming assay. 
Self-renewal: Generation of a daughter cell with identical characteristics as the original cell. Most often 

used to refer to stem cell division, which results in the formation of new stem cells. 
Stem cells: Cells with potentially unlimited proliferative and lineage potential. 
Stromal cells: Heterogeneous mixture of support or accessory cells of the BM. Also refers to the 

adherent layer which forms in BM cultures. 
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Liver transplantation has been established as a curative treatment for end-stage adult and pediatric liver 
disease [Starlz et al., 1989], and over recent years, many innovative advances have been made in trans- 
plantation surgery. Unfortunately, a fundamental problem of liver transplantation has been severe donor 
shortage, and no clinical therapeutic bridge exists to abate the progression of liver failure (see Table 121.1). 
As the demand for liver transplantation surgery increases, still fewer than 3500 donors are available 
annually for the approximately 25,000 patients who die from chronic liver disease [National Vital Statistic 
System, 1991, 1992]. Currently, cadaveric and living-related donors are the only available sources. 
Xenograft [Starzl et al., 1993] and split liver transplantation [Merio & Campbell, 1991] are under exper- 
imental and clinical evaluation. The research effort to engineer a functional liver tissue has been vigorous, 
since tissue engineering of the liver offers, in theory, an efficient use of limited organ availability. 

Whereas other experimental hepatic support systems such as extracorporeal bioreactors and hemop- 
erfusion devices [Yarmusch et al., 1992] attempt to temporarily support the metabolic functions of liver, 
transplantation of hepatocyte systems is a possible temporary or permanent alternative therapy to liver 
transplantation for treatment of liver failure. An experimental model system of hepatocellular transplan- 
tation should provide optimal cell survival, proliferation, and maintenance of sufficient functional hepa- 
tocyte mass to replace liver function. Direct hepatocellular injection or infusion into various organs or 
tissues and a complex hepatocyte delivery system utilizing polymer matrices have been two major areas 
of research interest. Just as the liver is one of the most sophisticated organs in the human body, the 
science of hepatocyte or liver tissue construction has proved to be equally complex. 



121.1 Background 



The causes of end-stage liver disease are many, including alcoholic or viral cirrhosis, biliary atresia, inborn 
errors of metabolism, and sclerosing cholangitis. With chronic and progressive liver injury, hepatic 
necrosis occurs followed by fatty infiltration and inflammation. Scar tissue and nodular regeneration 
replace the normal liver architecture and increase the microcirculatory resistance, which results in portal 
hypertension; the liver is further atrophied as important factors in the portal blood which regulate liver 
growth and maintenance are shunted away from the liver. Currently, end-stage liver disease must be 
present to be considered for orthotopic liver transplantation therapy, but a significant difference exists 
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Year 


No. of Patients 


No. 


of Deaths Reported 


1989 


3096 




39 


1990 


4008 




45 


1991 


4866 




67 


1992 


5785 




104 


1993 


7040 




141 



between alcohol- or viral-induced liver injury and TABLE 121.1 UNOS Liver Transplantation Data 
congenital liver diseases such as isolated gene Summary from 1989 to 1993 in the United States: Total 
defects and biliary atresia. With alcohol- or viral- Number of Liver Transplant Candidates and Deaths 

induced chronic hepatic injury, the degree of liver Reported per Year While on Transplant Waiting List 
injury is unknown until metabolic functions are 
severely impaired and signs of progressive irre- 
versible hepatic failure including portal hyperten- 
sion, coagulopathy, progressive jaundice, and 
hepatic encephalopathy have developed. In con- 
genital liver diseases, normal hepatic metabolic 

functions exist until dangerous toxins build up ,, , , . . . . , , , . , . 

° r Source. United Network for Organ Sharing and the 

and destroy the liver parenchyma. Hepatocellular Qrgan Procurement and Transplantation Network. Data 

transplantation could potentially prevent hepatic as of January 12, 1994. 

injury and preserve host hepatic function for con- 
genital inborn errors of liver metabolism. 

As liver transplantation emerged as an important therapeutic modality, research activity intensified to 
improve or understand many areas of liver transplantation such as immunological tolerance, preservation 
techniques, and the mechanism of healing after acute and chronic liver injury. Liver growth and regula- 
tion, in particular, have been better understood: For example, after partial hepatectomy, several mito- 
gens — epidermal growth factor, alpha fibroblastic growth factor, hepatocyte growth factor, and 
transforming growth factor-alpha — are produced early after injury to stimulate liver regeneration. Comi- 
togens — including insulin, glucagon, estrogen, norepinephrine, and vasopressin — also aid with liver 
regeneration [Michalopoulos, 1993]. These stimulation factors help govern the intricate regulatory pro- 
cess of liver growth and regeneration, but much about what controls these factors is still unknown. In 
vitro and in vivo experiments with mitogens such as hepatocyte growth factor, epidermal growth factor, 
and insulin have yielded only moderate improvement in hepatic proliferation. 

The importance of hepatocyte proliferation and hepatic regeneration becomes evident when one 
considers the difficulty in delivering large numbers of hepatocytes in hepatocyte replacement systems. 
The potential advantage of hepatocyte cellular transplantation is to take a small number of hepatocytes 
and proliferate these cells, in vitro or in vivo, to create functional liver equivalents for replacement therapy. 
Without hepatocyte regeneration, delivery of a very large number of hepatocytes is required. Asonuma 
and coworkers [1992] have determined that approximately 12% of the liver by heterotopic liver trans- 
plantation can significantly correct hyperbilirubinemia in the Gunn rat, which is deficient in uridine 
diphosphate glucuronyl tranferase. Although long-term efficacy remains unclear, 10-12% of the liver is 
an approximate critical hepatocellular mass thought necessary to replace the metabolic functions of the 
liver. The inability to mimic normal liver growth and regeneration in in vitro and in vivo systems has 
been one significant obstacle for hepatocyte tissue construction thus far. 

121.2 Hepatocyte Transplantation Systems 

Hepatocyte transplant systems offer the possibilities of creating many functional liver equivalents, storing 
hepatocytes by cryopreservation for later application [Yarmush et al., 1992], and using autologous cells 
for gene therapy [Lauregui & Gann, 1991]. The two systems discussed below differ in the amount of 
hepatocytes delivered, use of implantation devices, and implantation sites and techniques. Yet, in both 
systems, one significant roadblock in proving the efficacy of hepatocyte transplantation has been in the 
lack of a definitive, reproducible isolated liver defect model. Previous studies using syngeneic rat models 
such as the jaundice Gunn rat, the analbuminemic Nagase rat, or acute and chronic liver injury rat models 
have attributed significant correction of their deficit from hepatocyte transplantation. However, either 
inconsistencies or variation of animal strains and lack of consistent reproducible results have made 
accurate scientific interpretations and conclusions difficult. For instance, a study assessing hepatocyte 
delivery with microcarrier beads reported significant decrease in bilirubin in the hyperbilirubinemic 
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Gunn rat and elevation of albumin in the Nagase analbuminemic rat after intraperitoneal implantation 
[Demetriou et al., 1986]; other studies have not demonstrated significant hepatocyte survival with intra- 
peritoneal injection of hepatocytes with or without microcarriers; histology showed predominant cell 
necrosis and granuloma formation after 3 days [Henne-Bruns et al., 1991]. In allogeneic models, the 
possibility of immunological rejection further complicates the evaluation of the hepatocyte transplant 
system. Clearly, determining the efficacy of hepatocyte transplantation in liver metabolic-deficient models 
requires significant chemical results, and histologic correlation without confounding variables. 

Hepatocellular Injection Model 

Hepatocytes require an extracellular matrix for growth and differentiation, and the concept of utilizing 
existing stromal tissue as a vascular extracellular matrix is inviting. Isolated hepatocytes have been injected 
directly into the spleen or liver or in the portal or splenic vein. Several studies have reported significant 
but temporary correction of acute and metabolic liver defects in rat models as a result of intrahepatic, 
intraportal [Matas et al., 1976], or intrasplenic injections [Vroemen et al., 1985]. However, elucidating the 
efficacy of hepatocellular injection transplantation has been difficult for three significant reasons: 
(1) Differentiation of donor transplanted hepatocytes from host liver parenchyma has not been well 
established in an animal liver defect model, (2) how much hepatocyte mass needed to inject for partial or 
total liver function replacement has not been determined [Onodera et al., 1992], and (3) establishing a 
definitive animal liver defect model to prove efficacy of hepatocyte injection has been difficult. 

Transgenic animal strains have been developed and offer a reproducible model in differentiating host 
from transplanted hepatocytes. Using transgenic mouse lines, donor hepatocytes injected into the spleen 
were histologically shown to migrate to the host liver, survive, and maintain function [Ponder et al., 
1991]. Recently, a transgenic liver model in a mouse was used to evaluate the replicative potential of 
adult mouse hepatocytes. Normal adult mouse hepatocytes from two established transgenic lines were 
injected into the spleen of an Alb-uPA transgenic mouse that had an endogenous defect in hepatic growth 
potential and function (see Fig. 121.1). The adult mouse hepatocytes were shown to translocate to the 
liver and undergo up to 12 cell doublings; however, function of the transplanted hepatocytes was not 
fully reported [Rhim et al., 1994]. Rhim's study suggests that a hepatocyte has the potential to replicate 
many-fold so long as the structural and chemical milieu is optimal for survival and regeneration. If a 
small number of transplanted hepatocytes survive and proliferate many- fold in the native liver, sufficient 
hepatocyte mass to replace liver function would be accomplished, ameliorating the need to deliver a large 
quantity of donor hepatocytes. 

Although many questions about hepatocyte injection therapy remain, the transgenic liver model has 
been used to test the safety and efficacy of ex vivo gene therapy for metabolic liver diseases. The Watanabe 
heritable hyperlipidemic rabbit, a strain deficient in low density lipoprotiens (LDL) receptors, has been 
used to evaluate the possible application of a gene therapy by hepatocyte injection. Autologous hepatocytes 
were obtained from a liver segment, genetically modified ex vivo, and infused into the inferior mesenteric 
vein through a catheter placed intraoperatively without postoperative sequelae to the rabbit. Decreased 
levels of LDL have been reported out to 6 months [Wilson et al., 1992]. Larger animal models have shown 
engraftment of the genetically altered hepatocytes for as long as 1.5 years. The first clinical application 
of hepatocellular injection has been performed on a patient diagnosed with homozygous familial hyper- 
cholesterolemia. The patient has had decreased levels of cholesterol and has maintained expression of 
the transfected gene after 18 months [Grossman et al., 1994]. This is an important step forward as we 
await long-term results. 

Hepatocyte Transplantation on Polymer Matrices 

Since the practical application of implanting few hepatocytes to proliferate and replace function is not 
yet possible, hepatocyte tissue construction, using polymer as a scaffold, relies on transplanting a large 
number of hepatocytes to allow survival of enough hepatocyte mass to replace function. The large surface 
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FIGURE 121.1 Control and transgenic liver specimens: a nontransgenic (normal mouse) control with normal liver 
color (top, left); a transgenic (MT-lacZ) liver with normal function which is stained in blue and served as a positive 
control (top, center); a notransgenic control transplanted with transgenic MIT-lacZ hepatocytes (top, right); and three 
livers with a different transgene Alb-uPA transplanted with transgenic (MT-lacZ) hepatocytes. A deficiency in hepatic 
growth potential and function is induced by the Alb-uPA transgene, resulting in a chronic stimulus for liver growth. 
The liver with Alb-uPA transgene has the same color as the nontransgenic control liver but is partially replaced by the 
blue-stained transgencic {MT-lacZ) normal functioning hepatocytes, showing the regenerative response of normal 
hepatocytes in a mouse liver with a chronic stimulus for live growth. (Rhim, Sandgren, and Brinster, University of 
Pennsylvania, reprinted with permission from American Association for the Advancement of Science.) 

area of the polymer accommodates hepatocyte attachment in large numbers so that many cells may 
survive initially by diffusion of oxygen and other vital nutrients (see Fig. 121.2). The polymer scaffold is 
constructed with a high porosity to allow vascular ingrowth, and vascularization of surviving cells then 
can provide permanent nutritional access [Cima et al., 1991]. The cell-polymer system has been used in 
several other tissue-engineering applications such as cartilage, bone, intestine, and urologic tissue con- 
struction [Langer & Vacanti, 1993]. Hepatocytes adhere to the polymer matrix for growth and differen- 
tiation as well as locate into the interstices of the polymer. The polymer-hepatocyte interface can be 
manipulated with surface proteins such as laminin, fibronectin, and growth factors to improve adherence, 
viability, function, or growth [Mooney et al., 1992]. Hepatocyte proliferation by attaching mitogenic 
factors like hepatocyte growth factor, epidermal growth factor, or transforming growth factor-alpha is 
under current investigation. 

Synthetic polymer matrices, both degradable and nondegradable, have been evaluated for hepatocyte- 
polymer construction. As degradable polymers were being studied and evaluated for tissue engineering, 
a nondegradable polymer, polyvinyl (PVA), was used for in vitro and in vivo systems. The polyvinyl 
alcohol sponge offered one significant advantage: a uniform, noncollapsible structure which allowed 
quantification of hepatocyte engraftment [Uyama et al., 1993]. In vitro and in vivo studies have demon- 
strated hepatocyte survival on the polyvinyl alcohol scaffold. However, the polyvinyl alcohol sponge will 
not degrade and could act as a nidus for infection and chronic inflammation. A degradable polymer 
conceptually serves as a better implantable scaffold for hepatocyte-polymer transplantation, since the 
polymer dissolves to leave only tissue. Polyglycolic acid (PGA), polyactic acid (PLA), and copolymer 
hybrids have been employed in several animal models of liver insufficiency (see Fig. 121.3). Histologic 
analyses have shown similar survival of hepatocytes when compared to studies with polyvinyl alcohol. 

The experimental design for the hepatocyte-polymer model has been standardized as follows: (1) an end- 
to-side portacaval shunt is performed to provide hepatotrophic stimulation to the graft, (2) hepatocytes 
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FIGURE 121.2 Hepatocytes are seen adhering to polyglycolic acid polymer in culture (Mooney, unpublished data). 

are isolated and seeded onto degradable polyglycolic acid polymer, (3) the hepatocyte-polymer construct 
is implanted into the abdominal cavity on vascular beds of small intestinal mesentery and omentum, 
(4) pertinent chemical studies are obtained and analyzed at periodic intervals, and (5) histologic analysis 
in the specimens is performed at progressive time points. Early studies have shown that a large percentage 
of hepatocytes perish from hypoxia within 6-24 hours after implantation. To improve hepatocyte survival, 
implantation of hepatocytes onto large vascular surface areas for engraftment and exposing the hepato- 
cytes to hepatotrophic factors were two important maneuvers. The small intestinal mesentery and omen- 
tum have offered large vascular surface areas. Hepatocyte survival has been reported at other sites such 
as the peritoneum [Demetriou et al., 1991], renal capsule [Ricordi et al., 1989], lung [Sandbichler et al., 
1992], and pancreas [Vroemen et al., 1988], but these sites do not provide enough vascular surface area 
to allow survival of a large number of hepatocytes. 

The concept of hepatotrophic factors originated when atrophy and liver insufficiency was observed 
with heterotopic liver transplantation. Later studies have confirmed that important factors regulating 
liver growth and maintenance existed in the portal blood [Jaffe et al., 1991]. Thus, when a portacaval 
shunt to redirect hepatotrophic factors from the host liver to the hepatocyte-polymer construct was 
performed, survival of heterotopically transplanted hepatocytes significantly improved [Uyama et al., 
1993]; consequently, portacaval shunts were instituted in all experimental models. Studies thus far have 
shown survival of functional hepatocytes over 6 months in rat and dog models (unpublished data); 
replacement of liver function has been of shorter duration. 

A study using the Dalmatian dog model of hyperuricosuria typifies the current status of the hepatocyte- 
polymer system. The hepatocyte membrane of the Dalmatian dog has a defect in the uptake of uric acid, 
which results in hyperuricemia and hyperuricosuria [Giesecke & Tiemeyer, 1984]. The study has shown 
a significant but temporary correction of the liver uric acid metabolic defect after implantation of normal 
beagle hepatocytes, which have normal uric acid metabolism, on degradable polyglycolic acid polymer 
[Takeda et al., 1994]. Cyclosporine was administered for immunosuppression. The results of the Dalma- 
tian dog study suggest that (1) successful engraftment of the hepatocyte-polymer construct occurred, 
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FIGURE 121.3 Scanning electron microscopic photographs of polyglyclic acid (top), polyvinyl alcohol (middle), 
and polylactic acid (bottom) demonstrate the high porosity of these polymers (Mooney, reprinted with permission 
from W.B. Saunders Company). 

(2) maintenance of functional hepatocytes under current conditions was temporary, and (3) loss of critical 
hepatocyte mass to replace the liver uric acid metabolism defect occurred at 5-6 weeks after hepatocyte- 
polymer transplantation. The temporary effect could be attributed to suboptimal immunosuppression, 
suboptimal regulation of growth factors and hormones involved with hepatic growth, regeneration, and 
maintenance, or both. 

Coculture of hepatocytes with pancreatic islet cells also has been under investigation to aid in hepa- 
tocyte survival, growth, and maintenance. Trophic factors from islet cells have been shown to improve 
hepatocyte survival [Ricordi et al., 1988], and cotransplantation of hepatocytes with islet cells on a 
polymer matrix has been shown to improve hepatocyte survival as well [Kaufmann et al., 1994] . Coculture 
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with other cell types such as the biliary epithelial cell also may improve hepatocyte survival. Other studies 
with biliary epithelial cells have shown ductular formation in in vitro and in vivo models [Sirica et al., 
1990] , and vestiges of ductular formation in hepatocyte-polymer tissue have been histologically observed 
[Hansen & Vacanti, 1992] . A distinct advantage of the cell-polymer engineered construct is that one can 
manipulate the polymer to direct function. Thus far, diseases involving the biliary system, primarily 
biliary atresia, are not amenable to hepatocyte transplantation. An attempt to develop a biliary drainage 
system with biliary epithelial cells, hepatocytes, and polymer has been initiated. In the future, the potential 
construction of a branching polymer network could serve as the structural cues for the development of 
an interconnecting ductular system. 

Ex vivo gene therapy with the hepatocyte-polymer system is also an exciting potential application as 
demonstrated by the recent clinical trial with hepatocyte injection therapy. Genetically altered hepatocytes 
transplanted on polymer constructs have been studied with encouraging results [Fontaine et al., 1993]. 

121.3 Conclusion 

Studies in hepatocyte transplantation through tissue engineering methods have made important advances 
in recent years. The research in the hepatocellular injection and the research in the hepatocyte-polymer 
construct models have complemented each other in understanding the difficulties as well as the possi- 
bilities of liver replacement therapy. In order to make further advances with hepatocyte replacement 
systems, the process of liver development, growth, and maintenance need to be better understood. 
Currently, the amount of hepatocyte engraftment, proliferation, and the duration of hepatocyte survival 
remain undetermined in both systems. The amount of functional hepatocyte engraftment necessary may 
vary for different hepatic diseases. For instance, isolated gene defects of the liver may require a small 
number of functional transplanted hepatocytes to replace function, whereas end-stage liver disease may 
require a large amount of hepatocyte engraftment. 

Current hepatic replacement models have both advantages and disadvantages. For hepatocellular 
injection, the application of ex vivo gene therapy for an isolated gene defect of liver metabolism is 
promising. However, the small amount of hepatocyte delivery and significant potential complications 
for patients with portal hypertension may preclude application of the hepatocellular injection method 
for end-stage liver disease. A significant amount of intrapulmonary shunting of hepatocytes was observed 
in rats with portal hypertension after intrasplenic injection of hepatocytes, which resulted in increased 
portal pressures, pulmonary hypertension, pulmonary infarction, and reduced pulmonary compliance 
[Gupta et al., 1993]. With the hepatocyte-polymer system, delivery of a large number of hepatocytes is 
possible. In patients with portal hypertension, portal blood-containing hepatotrophic factors are shunted 
away from the liver, obviating the need for a portacaval shunt. Thus, patients with end-stage liver disease 
and portal hypertension may need only transplantation of the hepatocyte-polymer construct. However, 
an end-to-side portacaval shunt operation is needed in congenital liver diseases with normal portal 
pressures to deliver hepatotrophic factors to the heterotopically place hepatocytes. In the future, each 
hepatocyte transplant system could have specific and different clinical applications. More important, 
both offer the hope of increasing therapeutic options for patients requiring liver replacement therapy. 
Approximately 260,000 patients out of 634,000 patients hospitalized for liver diseases have liver diseases 
which could have been considered for hepatic transplantation. The total acute care nonfederal hospital 
cost for liver diseases in 1992, which does not include equally substantial outpatient costs, exceeded 
$9.2 billion [HCIA Inc., 1992]. 

Defining Terms 

Allogeneic: Pertaining to different genetic compositions within the same species. 

Cadaveric: Related to a dead body. In transplantation, cadaveric is related to a person who has been 

declared brain dead; organs should be removed prior to cardiac arrest to prevent injury to the 

organs. 
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Heterotopic: Related to a region or place where an organ or tissue is not present in normal conditions. 
Mitogens: Substances that stimulate mitosis or growth. 

Orthotopic: Related to a region or place where an organ or tissue is present in normal conditions. 
Portacaval shunt: A surgical procedure to partially or completely anastomose the portal vein to the 

inferior vena cava to divert portal blood flow from the liver to the systemic circulation. 
Stroma: The structure or framework of an organ or gland usually composed of connective tissue. 
Transgenic: Referred to introduction of a foreign gene into a recipient which can be used to identify 

genetic elements and examine gene expression. 
Xengraft: A graft transferred from one animal species to another species. 
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Tissue engineering in the nervous system facilitates the controlled application and/or organization of 
neural cells to perform appropriate diagnostic, palliative, or therapeutic tasks. As the word tissue implies, 
tissue engineering in general involves cellular components and their organization. Any cell, given its 
broad genetic program, expresses a particular phenotype in a manner that is dependent on its environ- 
ment. The extracellular environment consists of cells, humoral factors, and the extracellular matrix. 
Research in genetic engineering and the intense focus on growth factors and extracellular matrix biology 
have made it possible to manipulate both the cell's genetic program and its phenotypic expression. 

In the nervous system, degeneration or injury to neurons or glia and/or an aberrant extracellular 
environment can cause a wide variety of ailments. Diseases such as Parkinson's may required the replace- 
ment of diminished levels of a particular neurochemical, e.g., dopamine. Other pathologies such as 
injured nerves or reconnection of served neural pathways may require regeneration of nervous tissue. 

Tissue engineering efforts in the nervous system have currently been addressing the following goals: 

1. Functional replacement of a missing neuroactive component 

2. Rescue or regeneration of damaged neural tissue 

3. Human-machine interfaces: neural coupling elements 

122.1 Delivery of Neuroactive Molecules 
to the Nervous System 

Deficiency of specific neuroactive molecules has been implicated in several neurologic disorders. These 
factors may be neurotransmitters, neurotrophic agents, or enzymes. For example, part of the basal ganglia 
circuitry that plays a role in motor control consists of striatal neurons receiving dopaminergic input from 
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Table 122.1 Engineering Solutions for Parkinson's Disease 



Mode of Delivery 
Infusion 

Polymer slow release 



Cell transplantation 
Fetal 

Substantia nigra 



Rodents 
Rat striatum 



Rat striatum (EVA rods) 
Rat subcutaneous (EVA rods) 
Rat striatum (Si pellet) 
Rat striatum (liposomes) 



Rat striatum 



Rat striatum 



Ventral mesencephalon 
Human DA neurons 
Autologous primary 

Genetically altered skin 

fibroblasts 

Genetically altered skin 

fibroblasts with myoblasts 

Encapsulated xenogeneic tissue 

Bovine adrenal chromaffin cells Striatum (microcapsules) 
PC12 cells Rat striatum (microcapsules) 

Rat striatum (macrocapsule) 



Rat striatum 



Rat striatum 



Monkey Human 

Cerebroventricular 

Systemic 



Putamen 



Striatum 
(macrocapsule) 



Mouse mesencephalon 



Rat parietal cortex 
(macrocapsule) 



Reference 

Hargraves et al., 1987 
De Yebenes et al., 1988 
Hardie et al., 1984 
Winn et al., 1989 
Sabeletal., 1990 
Becker etal., 1990 
During et al., 1992 



Bjorklund et al., 1980 

Freund et al., 1985 
Bolametal., 1987 
Lindvall et al., 1990 

Chen et al., 1991 

Jiaoetal., 1993 



Aebischer et al., 1991 
Winn et al., 1991 
Aebischer et al., 1988 
Aebischer et al., 1994 

Aebischer et al., 1988 



the mesencephalic substantia nigra neurons. It has been shown that a lesioned nigrostriatal dopaminergic 
pathway is responsible for Parkinsons disease [Ehringer & Hornykiewicz, I960]. In chronic cancer 
patients, the delivery of antinociceptive neurotransmitters such as enkephalins, endorphins, catechola- 
mines, neuropeptide Y, neurotensin, and somatostatin to the cerebrospinal fluid may improve the treat- 
ment of severe pain [Akil et al., 1984; foseph et al., in press]. Neurotrophic factors may play a role in the 
treatment of several neurodegenerative disorders. For example, local delivery of nerve growth factor 
(NGF) [Hefti etal, 1984; Williams etal., 1986] and/or brain-derived growth factor (BDNF) may be 
useful in the treatment of Alzheimer's disease [Anderson et al., 1990, Kniisel et al., 1991]. BDNF [Hyman 
et al., 1991; Kniisel et al., 1991] and glial cell line-derived nerve growth factor (GDNF) maybe beneficial 
in Parkinson's' disease [Lin et al., 1993]. Other neurotrophins such as ciliary neurotrophic factor (CNTF) 
[Oppenheim et al, 1991]; Sendtner et al, 1990], BDNF [Yan et al, 1992], neurotrophin-3 (NT-3), neu- 
rotrophin-4 (NT-4/5) [Hughes etal, 19993] and GDNF [Zurn etal, 1994] also could have an impact 
on amyotrophic lateral sclerosis (ALS) or Lou Gehrigs's disease. 

Therefore, augmentation or replacement of any of the above-mentioned factors in the nervous system 
would be a viable therapeutic strategy for the treatment of the pathologies listed above. There are several 
issues that ought to be considered in engineering a system to deliver these factors. The stability of the factors, 
the dosage required, the solubility, the target tissue, and possible side effects all are factors that influence 
the choice of the delivery mode. Pumps, slow-release polymer systems, and cells from various sources that 
secrete the compound of interest are the main modes by which these factors can be delivered. Table 122.1 
lists all the means employed to deliver dopamine to alleviate the symptoms of Parkinson's disease. 

Pumps 

Pumps are used to deliver opiates epidurally to relieve severe pain [Ahlgren etal., 1987]. Pumps also 
have been used to deliver neurotrophic factors such as NGF intraventricularly as a prelude or supplement 
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to transplantation of chromaffin cells for Parkinson's disease [Olson et al., 1991 ] or as a potential therapy 
for Alzheimer's disease [Olson et al., 1992] . Pumps also have been used to experimentally deliver dopam- 
ine or dopamine receptor agonists in Parkinson's disease models [DeYebenes et al., 1988; Hargraves et al., 
1987]. While pumps have been employed successfully in these instances, they may need to be refilled 
every 4 weeks, and this may be a limitation. Other potential drawbacks include susceptibility to "dumping" 
of neuroactive substance due to presence of a large reservoir of neuroactive element, infections, and 
diffusion limitations. Also, some factors such as dopamine and ciliary neurotrophic factor (CNTF) are 
very unstable chemically and have short half-life periods, rendering their delivery by such devices difficult. 
Some of these problems may be eliminated by well designed slow-release polymer systems. 

Slow-Release Polymer Systems 

Slow-release polymer systems essentially trap the molecule of interest in a polymer matrix and release it 
slowly by diffusion over a period of time. Proper design of the shape and composition of the polymer 
matrix may achieve stabilization of the bioactive molecule and facilitate a steady, sustained release over 
a period of time. For instance, it has been shown that the dopamine precursor l- dopa may be effective 
in alleviating some motor symptoms of Parkinson's disease [Birkmayer, 1969], fluctuations in the plasma 
levels of L-dopa due to traditional, periodic oral administration and difficulties in converting L-dopa to 
dopamine may cause the clinical response to fluctuate as well [Albani et al., 1986; Hardie et al., 1984; 
Muenter et al., 1977; Shoulson et al., 1975]. It has been demonstrated that a slow-release ethylene vinyl 
acetate polymer system loaded with L-dopa can sustain elevated plasma levels of L-dopa for at least 
225 days when implanted subcutaneously in rats [Sabel et al., 1990]. Dopamine can only be released 
directly in the CNS, since it does not pass the blood-brain barrier. It has been demonstrated that 
experimental parkinsonism in rats can be alleviated by intrastriatal implantation of dopamine-releasing 
ethylene vinyl acetate rods [Winn et al., 1989]. Silicone elastomer as well as resorbable polyester pellets 
loaded with dopamine also have been implanted intrastriatally and shown to induce a behavioral recovery 
in parkinsonian rats [Becker et al., 1990; McRae et al., 1991]. 

Slow-release systems also may be employed to deliver trophic factors to the brain either to avoid side 
affects that may come about due to systemic administration or to overcome the blood-brain barrier, 
getting the factor delivered to the brain directly [During et al., 1992; Hoffman et al, 1990]. Therefore, 
polymeric systems with the molecule of interest trapped inside may be able to achieve many of the goals 
of an ideal delivery system, including targeted local delivery and zero-order continuos release [Langer, 
1981; Langer et al., 1976]. However, some of the disadvantages of this system are the finite amounts of 
loaded neuroactive molecules and difficulties in shutting off release or adjusting rate of release once the 
polymer is implanted. Also, for the long-term release in humans, the device size may become a limiting 
factor. Some of these limitations may be overcome by the transplantation of cells that release neuroactive 
factors to the target site. 

Cell Transplantation 

Advances in molecular biology and gene transfer techniques have given rise to a rich array of cellular 
sources, which have been engineered to secrete a wide range of neurologic compounds. These include cells 
that release neurotransmitters, neurotrophic factors, and enzymes. Transplantation of these cells leads to 
functional replacement or augmentation of the original source of these compounds in the host. They can 
deliver neuroactive molecules as long as they survive, provided they maintain their phenotype and/or 
transgene expression, in the case of gene therapy. Some of the disadvantages of the slow-release systems, 
such as the presence of a large reservoir for long-term release or reloading of an exhausted reservoir, can 
thus be overcome. Some of the tissues transplanted so far may be classified in the following manner. 

Transplantation of Autologous Primary Cells 

This technique involves the procurement of primary cells from the host, expanding them if necessary to 
generate requisite amounts of tissue, engineering them if so required by using gene transfer techniques, 
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and then transplanting them back into the "donor" at the appropriate site. For instance, autologous 
Schwann cell shave been isolated and transplanted experimentally into the brain and have been shown 
to enhance retinal nerve regeneration, presumably by the release of factors that influence regeneration 
[Morrissey et al., 1991]. Schwann cells also express various neurologically relevant molecules [Bunge & 
Bunge, 1983; Muir et al., 1989]. Autologous Schwann cells also can work as nerve bridges to help 
reconstruction of rat sciatic nerve after axotomy [Guenard et al., 1992]. Primary skin fibroblasts also 
have been genetically engineered to secrete L-dopa and transplanted successfully into the autologous 
host's striatum in an experimental model of Parkinson's disease in rats [Chen et al., 1991]. The same 
group also has reported that nerve growth factor, tyrosine hydroxylase, glutamic acid decarboxylase, and 
choline acetyltransferase genes may be introduced successfully and expressed in primary fibroblasts [Gage 
et al., 1991]. More recently, muscle cells have been engineered to express tyrosine hydroxylase and 
transplanted successfully in a rat model of Parkinson's disease [Jiao et al., 1993]. 

Thus nonneural cells such as a fibroblast or muscle cells may be selected, in part, for the ease with which 
they can be engineered genetically and made neurologically relevant. It is therefore possible to engineer 
cells to suit particular pathologies as a step toward being able to engineer biomimetic tissues and then place 
them in appropriate locations and contexts in vivo. However, it may not always be technically possible to 
procure sufficient amounts of autologous tissue. Other sources of tissues have therefore been explored, and 
these include fetal tissue, used usually in conjunction with immunosuppression. 

Fetal Tissue Transplantation 

One of the important advantages of fetal tissue is its ability to survive and integrate into the host adult 
brain. Transplantation of fetal neural tissue allografts might be useful in treating Parkinson's disease 
[Lindvall et al., 1990]. While promising results have been reported using neural fetal tissue disease 
[Bjorklund, 1991; Bjorklund et al., 1980], availability of donor tissue and potential ethical issues involved 
with the technique may be potential shortcomings. One promising strategy to obtain allogeneic fetal 
tissue in large quantities is to isolate neural stem cells and have them proliferate in vitro. It has been 
demonstrated recently that CNS progenitor cells may be selected using epidermal growth factor (EGF). 
The cells thus selected have been shown to proliferate in vitro under appropriate culture conditions. With 
time and appropriate culture conditions, they differentiate into mature CNS neurons and glia [Reynolds 
et al., 1992; Vescovi et al., 1993]. When optimized, this technique could be useful to select and expand 
fetal neurons of interest in vitro and then transplant them. Transplantation of xenogeneic fetal tissue 
with immunosuppression using cyclosporin A is another alternative to using fetal tissue [Wictorin et al., 
1992], and this approach might yield more abundant amounts of tissue for transplantation and overcome 
some of the possible ethical dilemmas in using human fetal tissue. However, immunosuppression may 
not be sufficient to prevent long-term rejection and may have other undesirable side-effects. 

Transplantation of Encapsulated Xenogeneic Tissue 

Polymeric encapsulation of xenogeneic cells might be a viable strategy to transplant cells across species 
in the absence of systemic immunosuppression [Aebischer et al., 1988; Tresco et al., 1992]. Typically, the 
capsules have pores large enough for nutrients to reach the transplanted tissue and let the neuroactive 
factors out, but the pores are too small to let the molecules and the cells of the immune system reach 
the transplanted tissue (Fig. 122.1). At the same time, this strategy retains all the advantages of using 
cells as controlled, local manufacturers of the neuroactive molecules. The use of encapsulation also 
eliminates the restriction of having to use postmitotic tissue for transplantation to avoid tumor formation. 
The physical restriction of the polymeric capsule prevents escape of the encapsulated tissue. Should the 
capsule break, the transplanted cells are rejected and eliminated by the host immune system. However, 
no integration of the transplanted tissue into the host is possible with this technique. 

Some of the tissue engineering issues involved in optimizing the encapsulation technique are ( 1 ) the 
type and configuration of the encapsulating membrane, (2) the various cells to be used for encapsulation, 
and (3) the matrix in which the cells are immobilized. 
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FIGURE 122.1 (a) Schematic illustration of the concept of immunoisolation involved in the transplantation of 
xenogenic tissue encapsulated in semipermeable polymer membranes, (b) Light micrograph of a longitudinal section 
shown baby hamster kidney cells encapsulated in a polyacryolonitrile-vinylchloride membrane and transplanted in 
an axotomized rat pup after 2 weeks in vivo. W denotes the capsule's polymeric wall, and Y shows the cells. 

Type and Configuration of the Encapsulating Membrane. One important factor determining the size 
of the device is oxygen diffusion and availability to the encapsulated cells. This consideration influences 
the device design and encourages situations where the distances between the oxygen source, usually a 
capillary, and the inner core of transplanted tissue are kept as minimal as possible. The size and config- 
uration of the device also influence the kinetics of release of the neuroactive molecules, the "response 
time" being slower in larger capsules with thicker membrane walls. 
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The capsule membrane may be a water-soluble system stabilized by ionic or hydrogen bonds formed 
between two weak polyelectrolytes — typically an acidic polysaccharide, such as alginic acid or modified 
cellulose, and a cationic polyaminoacid, such as polylysine or polyornithine [Goosen et al., 1985; Lim & 
Sun, 1980; Winn et al., 1991]. Gelation of the charged polyelectrolytes is caused by ionic cross-linking 
in the presence of di- or multivalient counterions. However, the stability and mechanical strength of 
these systems are questionable in the physiologic ionic environments. The major advantage of using these 
systems is that they obviate the need of organic solvent use in the making of the capsules and might be 
less cytotoxic in the manufacturing process. 

The capsule membrane also may be a thermoplastic, yielding a more mechanically and chemically 
stable membrane. This technique involves loading of cells of interest in a preformed hollow fiber and 
then sealing the ends either by heat or with an appropriate glue. The hollow fibers are typically fabricated 
by a dry jet wet-spinning technique involving a phase-inversion process [Aebischer et al., 1991]. The use 
of thermoplastic membranes allows the manipulation of membrane structure, porosity, thickness, and 
permeability by appropriate variation of polymer solution flow rates, viscosity of the polymer solution, 
the nonsolvent used etc. 

Long-term cross-species transplants of dopaminergic xenogeneic tissues and functional efficacy in the 
brain have been reported [Aebischer et al., 1991, 1994] using the preceding system. PC12 cells, a cate- 
cholaminergic cell line derived from a rat pheochromocytoma, ameliorated experimental Parkinson's 
disease when encapsulated within thermoplastic PAN/PVC capsules and implanted in the striatum of 
adult guinea pigs [Aebischer et al., 1991]. 

The Choice of Cells and Tissues for Encapsulation. Three main types of cells can be used for encap- 
sulation. Cells may be postmitotic, cell lines that differentiate under specific conditions, or slow-dividing 
cells. The latter two types of cells lend themselves to genetic manipulation. 

Postmitotic Cells. Primary xenogeneic tissue may be encapsulated and transplanted across species. For 
instance, chromaffin cells release various antinociceptive substances such as enkephalins, catecholamines, 
and somatostatin. Allografts of adrenal chromaffin cells have been shown to alleviate pain when trans- 
planted in the subarachnoid space in rodent models and terminal cancer patients [Sagen et al., 1993]. 
Transplantation of encapsulated xenogeneic chromaffin cells may provide a long-term source of pain- 
reducing neuroactive substances [Sagen et al., 1993]. In our laboratory, clinical trials are currently under- 
way using the transplantation of encapsulated bovine chromaffin tissue into human cerebrospinal fluid 
as a strategy to alleviate chronic pain in terminal cancer patients [Aebischer et al., 1994]. This may 
circumvent the problem of the limited availability of human adrenal tissue in grafting procedures of 
chromaffin tissue. 

Cell Lines that Differentiate under Specific Conditions. Postmitotic cells are attractive for transplantation 
applications because the possibility of tumor formation, loss of phenotypic expression, is potentially 
lower. Also, when encapsulated, there is less debris accumulation inside the capsule due to turnover of 
dividing cells. However, the disadvantage is that the amount of postmitotic tissue is usually too limited 
in quantity for general clinical applications. Therefore, the use of cells that are mitotic and then are 
rendered postmitotic under specific conditions is attractive for transplantation applications. 

Under appropriate culture conditions, primary myoblasts undergo cell division for at least 50 passages. 
Fusion into resting myoblasts can be obtained by controlling the culture conditions. Alternatively, a 
transformed myoblast cell line C 2 C 12 derived from a C 2 H mouse thigh differentiates and forms myotubes 
when cultured under low serum conditions [Yaffe & Saxel, 1977]. Therefore, these cells could be genet- 
ically altered, expanded in vitro and then made postmitotic by varying the culture conditions. They can 
then be transplanted with the attendant advantages of using postmitotic tissue. Since myoblasts can be 
altered genetically, they have the potential to be a rich source for augmentation of tissue function via cell 
transplantation. 

Slow-Dividing Cells. Slow-dividing cells that reach a steady state in a capsule either due to contact 
inhibition or due to the encapsulation matrix are an attractive source of cells for transplantation. 
Potentially, their division can result in a self-renewing supply of cells inside the capsule. Dividing cells 
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are also easier to transfect reliably with retroviral methods and therefore lend themselves to genetic 
manipulation. It is therefore possible to envisage the transplantation of various genetically engineered 
cells for the treatment of several neurologic disorders. This technique allows access to an ever-expanding 
source of xenogeneic tissues that have been engineered to produce the required factor of interest. For 
instance, Horellou and Mallet [1989] have retrovirally transferred the human TH cDNA into mouse 
anterior pituitary AtT-20 cell line, potentially resulting in a plentiful supply of dopamine-secreting cells 
that can then be encapsulated and transplanted into the striatum. 

Another promising area for the use of such techniques is in the treatment of some neurodegenerative 
disorders where a lack of neurotrophic factors is believed to be part of the pathophysiology. Neurotrophic 
factors are soluble proteins that are required for the survival of neurons. These factors often exert a 
trophic effect; i.e., they have the capability of attracting growing axons. The "target" hypothesis describes 
the dependence of connected neurons on a trophic factor that is retrogradely transported along the axons 
after release from the target neurons. In the absence of the trophic factors, the neurons shrink and die, 
presumably to avoid potential misconnections. Experimentally, fibroblast lines have been used in CNS 
transplantation studies because of their convenience for gene transfer techniques [Gage et al., 1991]. The 
transplantation of encapsulated genetically engineered fibroblasts to produce NGF has been shown to 
prevent lesion-induced loss of septal ChaT expression following a fimbria-fornix lesion [Hoffman et al., 
1993]. The fimbria-fornix lesion is characterized by deficits in learning and memory resembling those 
of Alzheimer's disease. 

Matrices for Encapsulation. The physical, chemical, and biologic properties of the matrix in which the 
cells have been immobilized may play an important role in determining the transplanted cell's state and 
function. Broadly, matrices can be classified into the following types: cross-linked polyelectrolytes, col- 
lagen in solution or as porous beads, naturally occurring extracellular matrix derivatives such as Matrigel, 
fibrin clots, and biosynthetic hydrogels with appropriate biologic cues bound to them to elicit a specific 
response from the cells of interest. The matrix has several functions: It can prevent the formation of large 
cell aggregates that lead to the development of central necrosis as a consequence of insufficient oxygen 
and nutrient access; it may allow anchorage-dependent cells to attach and spread on the matrix substrate; 
and it may induce differentiation of a cell line and therefore slow down or stop its division rate. 

Negatively charged polyeletrolytes such as alginate have been used successfully for the immobilization of 
adrenal chromaffin cells [Aebischer et al., 1991]. Positively charged substrates, such as those provided by the 
amine groups of chitosan, allow attachment and spreading of fibroblasts [Zielinski et al., in press] . Biologically 
derived Matrigel induces differentiation of various cell lines such as Chinese hamster ovary (CHO) cells, 
astrocyte lines, or fibroblast lines (unpublished observations). Spongy collagen matrices, as well as fibrin 
matrices, seem to possess similar qualities. Our laboratory is currently evaluating the use of biosynthetic 
hydrogel matrices with biologically relevant peptides covalently bound to the polymer backbone. It is hypoth- 
esized that these matrices may elicit a specific designed response from the encapsulated cells. 

122.2 Tissue Reconstruction: Nerve Regeneration 

Most of the techniques described above were attempts at identifying the missing molecules of various 
neuropathologies, finding an appropriate source for these molecules and, if necessary, designing a cellular 
source via genetic engineering, and ultimately, choosing an optimal mode of delivery of the molecules, 
be it chemical or cellular. This approach, however, falls short of replacing the physical neuroanatomic 
synaptic circuits in the brain, which, in turn, may play an important role in the physiologic feedback 
regulating mechanisms of the system. Attempts to duplicate in vivo predisease neuronal structure have 
been made. For instance, the bridging of the nigrostriatal pathway, which when disrupted may cause 
Parkinson's disease, and the septohippocampal pathway, which may serve as a model for Alzheimer's 
disease, has attracted some attention. Wictorin and colleagues [ 1992] have reported long-distance directed 
axonal growth from human dopaminergic mesencephalic neuroblasts implanted along the nigrostriatal 
pathway in 6-hydroxydopamine-lesioned rats. In this section we shall examine the use of synthetic 
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FIGURE 122.2 Schematic illustration of a nerve guidance channel and some of the possible strategies for influencing 
nerve regeneration. 

guidance channels and extracellular matrix cues to guide axons to their appropriate targets. Thus a 
combination of all these techniques may render the complete physical and synaptic reconstruction of a 
degenerated pathway feasible. 

The promotion of nerve regeneration is an important candidate task for tissue reconstruction in the 
nervous system. Synthetic nerve guidance channels (NGCs) have been used to study the underlying 
mechanisms of mammalian peripheral nerve regeneration after nerve injury and enhance the regeneration 
process. Guidance channels may simplify end-to-end repair and may be useful in repairing long nerve 
gaps. The guidance channel reduces tension at the suture line, protects the regenerating nerve from 
infiltrating scar tissue, and directs the sprouting axons toward their distal targets. The properties of the 
guidance channel can be modified to optimize the regeneration process. Nerve guidance channels also 
may be used to create a controlled environment in the regenerating site. In the peripheral nervous system, 
NGCs can influence the extent of nerve gap that can be bridged and the quality of regeneration. The 
channel properties, the matrix filling the NGC, the cells seeded within the channel lumen, and polymer- 
induced welding of axons all can be strategies used to optimize and enhance nerve regeneration and 
effect nervous tissue reconstruction (see Fig. 122.2 for a schematic). Table 122.2 lists some of the kinds 
of nerve guidance channels used so far. 

The Active Use of Channel Properties 

In the past, biocompatability of a biomaterial was evaluated by the degree of its passivity or lack of 
"reaction" when implanted into the body. However, the recognition that the response of the host tissue 
is related to the mechanical, chemical, and structural properties of the implanted biomaterial has led to 
the design of materials that promote a beneficial response from the host. In the context of a synthetic 
nerve guidance channel, this may translate to manipulation of its microstructural properties, permeabil- 
ity, electrical properties, and the loading of its channel wall with neuroactive components that might 
then be released locally into the regenerating environment. The strategy here is to engineer a tailored 
response from the host and take advantage of the natural repair processes. 

Surface Microgeometry 

The morphology of regenerating peripheral nerves is modulated by the surface microgeometry of poly- 
meric guidance channels [Aebischer et al., 1990]. Channels with smooth inner walls give rise to organized, 
longitudinal fibrin matrices, resulting in discrete free-floating nerve cables with numerous myelinated 
axons. The rough inner surface channels, however, give rise to an unorganized fibrin matrix with nerve 
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TABLE 122.2 Nerve Guidance Channels 



III. 



The Channel Wall 

1 . Passive polymeric channels 
Silicone elastomer 
Polyvinyl chloride 
Polyethylene 

2. Permeable polymer channels 
Acrylonitrile vinychloride copolymer 
Collagen 

Expanded polytetrafluroethylene 

3. Resorbable polymer channels 
Polyglycolic acid 
Poly-L-lactic acid 
Collagen 

4. Electrically shaped polymer channels 
Silicone channels with electrode cuffs 
Polyvinylidenefluoride (piezoelectric) 
Polytetrofluoroethylene (electret) 

5. Polymer channels releasing trophic factors 
Ethylene vinylacetate copolymer 

Intrachannel, luminal matrices 

1. Fibrin matrix 

2. Collagen-glycosaminoglycan template 

3. Matrigel 

Cell seeded lumens for trophic support 

1. Schwann cell-seeded lumens (PNS) 

2. Schwann cell-seeded lumens (CNS) 



Lundborg et al., 1982 
Scaravalli, 1984 
Madison et al., 1988 

Uzman and Villegas, 1983 
Archibald et al., 1991 
Young etal., 1984 

Molander et al, 1989 
Nyilasetal., 1983 
Archibald et al, 1991 

Kerns and Freeman, 1986; Kerns et al., 1991 
Aebischer etal., 1987 
Valentini et al., 1989 

Aebischer et al., 1989 

Williams et al., 1987 
Yannas et al., 1985 
Valentini et al., 1987 

Guenard et al., 1992 

Guenard et al., 1993; Kromer and Cornbrooks, 1985, 1987; 
Smith and Stevenson, 1988 



fascicles scattered in a loose connective tissue filling the entire channel's lumen. Thus the physical textural 
properties and porosity of the channel can influence nervous tissue behavior and may be used to elicit 
a desirable reaction from the host tissue. 

Molecular Weight Cutoff 

The molecular weight cutoff of the NGCs influences peripheral nerve regeneration in rodent models 
[Aebischer etal., 1989]. The molecular weight cutoff may influence nerve regeneration possibly by 
controlling the exchange of molecules between the channel lumen and the external wound-healing 
environment. This may be important because the external environment consists of humoral factors that 
can play a role in augmenting regenerative processes in the absence of a distal stump. 

Electrical Properties 

In vivo regeneration following transection injury in the peripheral nervous system has been reported to 
be enhanced by galvan tropic currents produced in silicone channels fitted with electrode cuffs [Kerns & 
Freeman, 1986; Kerns et al., 1991].Polytetrafluoroethylene (PTFE) "electret" tubes show more myelinated 
axons compared with uncharged tubes in peripheral nerves [Valentini etal., 1989]. Dynamically active 
piezoelectric polymer channels also have been shown to enhance nerve regeneration in the sciatic nerves 
of adult mice and rats [Aebischer et al, 1987; Fine et al., 1991]. 

Release of Bioactive Factors from the Channel Wall 

Polymer guidance channels can be loaded with various factors to study and enhance nerve regeneration. 
Basic fibroblast growth factor released from an ethylene-vinyl acetate copolymer guidance channel facil- 
itates peripheral nerve regeneration across long nerve gaps after a rat sciatic nerve lesion [Aebischer et al., 
1989] . The possible influence of interleukin- 1 (IL- 1 ) on nerve regeneration also was studied by the release 
of IL-1 receptor antagonist (IL-lra) from the wall of an EVA copolymer channel [Guenard et al., 1991]. 
It is conceivable that the release of appropriate neurotrophic factors from the channel wall may enhance 
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specifically subsets of axons, e.g., ciliary neurotrophic factor on motor neurons and nerve growth factor 
on sensory neurons. 

Resorbable Channel Wall 

Bioresorbable nerve guidance channels are attractive because once regeneration is completed, the channel 
disappears without further surgical intervention. Mice sciatic nerves have been bridged with poly-L-lactic 
acid channels [Nyilas et al., 1983] and polyester guidance channels [Henry et al., 1985]. Rabbit tibial 
nerves also have been bridged with guidance channels fabricated from polyglycolic acid [Molander et al., 
1989]. Resorbable guidance channels need to retain their mechanical integrity over 4 to 12 weeks. At the 
same time, their degradation products should not interfere with the regenerative processes of the nerve. 
These issues remain the challenging aspects in the development of bioresorbable nerve guidance channels 
for extensive use in animals and humans. 

Intraluminal Matrices for Optimal Organization 
of Regeneration Micro environment 

The physical support structure of the regenerating environment may play an important role in deter- 
mining the extent of regeneration. An oriented fibrin matrix placed in the lumen of silicone guidance 
channels accelerates the early phases of peripheral nerve regeneration [Williams et al., 1987]. 

Silicone channels filled with a collagen-glycosaminoglycan template bridged a 15-mm nerve gap in 
rats, whereas no regeneration was observed in unfilled tubes [Yannas et al., 1985]. However, even matrices 
known to promote neuritic sprouting in vitro may impede peripheral nerve regeneration in semiperme- 
able guidance channels if the optimal physical conditions are not ensured [Madison et al., 1988; Valentini 
et al., 1987]. Therefore, the structural, chemical, and biologic aspects of the matrix design may all play 
a role in determining the fate of the regenerating nerve. The importance of the effect of the physical 
environment on regeneration, mediated by its influence on fibroblast and Schwann cell behavior, has 
been demonstrated in several studies and has been reviewed by Schwartz [1987] and Fawcett & Keynes 
[1990]. Thus the choice of a hydrogel with physical, chemical, and biologic cues conducive to nerve 
regeneration may enhance nerve regeneration. This strategy is currently being explored [Belamkonda 
et al., in press]. Neurite-promoting oligopeptides from the basement membrane protein laminin (LN) 
were covalently coupled to agarose hydrogels. Agarose gels derivatized with LN oligopeptides specifically 
enhance neurite extension from cells that have receptors to the LN peptides in vitro [Bellamkonda et al., 
in press]. Preliminary results show that the presence of an agarose gel carrying the LN peptide 
CDPGYIGSR inside the lumen of a synthetic guidance channel enhances the regeneration of transected 
peripheral nerves (Fig. 122.3) in rats. Thus it is feasible to tailor the intraluminal matrices with more 
potent neurite-promoting molecules such as the cell adhesion molecules (CAMs) LI, N-CAM, or tenascin 
and "engineer" a desired response from the regenerating neural elements. 

Cell-Seeded Lumens for Trophic Support 

Cells secreting various growth factors may play an important role in organizing the regeneration envi- 
ronment, e.g., Schwann cells in the peripheral and central nervous system. It has been reported that 
regenerating axons do not elongate through acellular nerve grafts if Schwann cell migration was impeded 
[Hall et al., 1986]. Syngeneic Schwann cells derived from adult nerves and seeded in semipermeable 
guidance channels enhance peripheral nerve regeneration [Guenard et al., 1992]. Schwann cells in the 
preceding study orient themselves along the axis of the guidance channel, besides secreting various 
neurotrophic factors. Schwann cells could play a role in organizing the fibirin cable formed during the 
initial phases of nerve regeneration. Schwann cells may be effective in inducing regeneration in the CNS, 
too [Kromer & Cornbrook, 1985, 1987; Smith & Stevenson, 1988]. The use of tailored intraluminal 
matrices and presenting exogeneic Schwann cells to the regeneration environment in a controlled matter 
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FIGURE 122.3 Light micrograph of a cross-sectional cut of a sural nerve regenerating through a polymer guidance 
channel 4 weeks after transection. The nerve guidance channel had been filled with a CDPGYIGSR derivatized agarose 
gel. E is the epineurium; V shows neovascularization; and MA is myelinated axon. 



are strategies aimed at engineering the desired tissue response by creating the optimal substrate, trophic, 
and cellular environments around the regenerating nerves. 

CNS glial cells have a secretory capacity that can modulate neuronal function. Astrocytes release 
proteins that enhance neuronal survival and induce neuronal growth and differentiation. When a silicone 
channel was seeded with astrocytes of different ages, ranging from P9 to P69 (postnatal), it was observed 
that while P9 astrocytes did not interfere with peripheral nerve regeneration, adult astrocytes downreg- 
ulate axonal growth [Kalderon, 1988]. However, the presence of Schwann cells reverses the inhibition of 
PNS regeneration due to adult astrocytes [Guenard et al., 1994]. Thus the cellular environment in the 
site of injury may play an important role in determining the extent of regeneration. Knowledge of these 
factors also may be employed in designing optimal environments for nerve regeneration. 

Polyethyleneglycol-Induced Axon Fusion 

Rapid morphologic fusion of severed myelinated axons may be achieved by the application of polyethylene 
glycol (PEG) to the closely apposed ends of invertebrate-myelinated axons [Krause & Bittner, 1990]. 
Selection of appropriate PEG concentration and molecular mass, tight apposition, and careful alignment 
of the cut ends of the nerve may facilitate the direct fusion of axons. However, this technique is only 
applicable when the two ends of the severed nerve are closely apposed to each other, before the onset of 
wallerian degeneration. 

CNS Nerve Regeneration 

Most of the preceding studies have been conducted in the peripheral nervous system (PNS). In the CNS, 
however, endogenous components express poor support for axonal elongation. Significant regeneration 
may, however, occur with supporting substrates. Entubulation with a semipermeable acrylic copolymer 
tube allows bridging of a transected rabbit optic nerve with a cable containing myelinated axons [Aebi- 
scher et al., 1988]. Cholinergic nerve regeneration into basal lamina tubes containing Schwann cells has 
been reported in a transected septohippocampal model in rats [Kromer & Cornbrooks, 1985]. 
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Thus the appropriate combination of physical guidance, matrices, and growth factors can create the 
right environmental cues and may be effective in inducing regeneration in the CNS. Therefore, both in 
the PNS and CNS, manipulation of the natural regenerative capacities of the host either by guidance 
factor or stimulation by electrical or trophic factors or structural components of the regenerating 
microenvironment can significantly enhance regeneration and help the reconstruction of severed or 
damaged neural tissue. 

122.3 In Vitro Neural Circuits and Biosensors 

The electrochemical and chemoelectrical transduction properties of neuronal cells can form the basis of 
a cell-based biosensing unit. The unique information-processing capabilities of neuronal cells through 
synaptic modulation may form the basis of designing simple neuronal circuits in vitro. Both the preceding 
applications necessitate controlled neuronal cell attachment, tightly coupled to the substrate and a 
sensitive substrate to monitor changes in the cell's electrical activity. 

The use of bioactive material systems tailored to control neuronal cell attachment on the surface and 
still amenable to the incorporation of electrical sensing elements like a field effect transistor (FET) could 
be one feasible design. Therefore, composite material systems, which might incorporate covalently pat- 
terned bioactive peptides on their surface to control cell attachment and neurite extension, may be a step 
toward the fulfillment of the preceding goal. Oligopeptides derived from larger extracellular proteins like 
laminin have been shown to mediate specific cell attachment via cell surface receptors [Graf et al., 1987; 
Iwamoto et al., 1987; Kleinman et al., 1988]. Cell culture on polymeric membranes modified with the 
preceding bioactive peptidic components may give rise to a system where neuronal cell attachment and 
neuritic process outgrowth may be controlled. This control may help in designing microelectronic leads 
to complete the cell-electronic junction. Preliminary recordings from a FET-based neuron-silicon junc- 
tion using leech Retzius cells [Fromherz et al., 1991] have been reported. Though there are many prob- 
lems, such as attaining optimal coupling, this could form the basis of a "neural chip." A neural chip could 
potentially link neurons to external electronics for applications in neuronal cell-based biosensors, neural 
circuits, and limb prosthesis. Polymer surface modification and intelligent use of extracellular matrix 
components through selective binding could help attain this goal. 

Studies in our laboratory have been trying to understand the underlying mechanisms involving protein 
adsorption onto polymeric substrates and their role in influencing and controlling nerve cell attachment 
[Ranieri et al., 1993]. Controlled neuronal cell attachment within a tolerance range of 20 LLm may be 
achieved either nonspecifically monoamine surfaces or specifically via oligopeptides derived from ECM 
proteins like laminin and fibronectin [Fig. 122.4], mediated by integrin cell surface receptors [Ranieri 
et al., in press] . Molecular control of neuronal cell attachment and interfacing neuronal cells with elec- 
trodes may find applications in the design and fabrication of high- sensitivity neuron-based biosensors 
with applications in detection of low level neurotransmitters. 

Studies are also currently in progress involving polymeric hydrogels and controlling neuronal cell 
behavior in a three-dimensional (3D) tissue culture environs as a step toward building 3D neuronal 
tissues [Bellamkonda et al., in press]. The choice of an appropriate hydrogel chemistry and structure, 
combined with the possibility of the gel serving as a carrier for ECM proteins or their peptidic analogues, 
can enable one to enhance regeneration when seeded in a nerve guidance channel. Also, the use of 
appropriate hydrogel chemistries in combination with the chemical modification of the polymer back- 
bone by laser-directed photochemistry may be feasible in controlling the direction and differentiation of 
neuronal cells in three dimensions. Covalent binding of bioactive components like the laminin oligopep- 
tides to the hydrogel backbone gives a specific character to the gel so that it elicits specific responses from 
anchorage-dependent neuronal cells [Bellamkonda et al., in press] (Fig. 122.5). Such a system could be 
useful in reorganizing nerves in 3D either for bridging different regions of the brain with nerve cables 
or for the 3D organization of nerves for optimal coupling with external electronics in the design of 
artificial limb prosthesis. In either case, development of such systems presents an interesting challenge 
for tissue engineering. 
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FIGURE 122.4 (a) Schematic of fluorinated ethylene propylene membrane modified in a "striped" fashion with 
bioactive oligopeptides using carbonyldiimidazole homobifunctional linking agent, (b) Light micrograph of Ngl08- 
15 cells "striping" on FEP membrane surfaces selectively modified with CDPGYIGSR oligopeptide. 

122.4 Conclusion 

Advances in gene transfer techniques and molecular and cell biology offer potent tools in the functional 
replacement of various tissues of the nervous system. Each of these cells' functions can be optimized 
with the design and selection of its optimal extracellular environment. Substrates that support neuronal 
differentiation in two and three dimensions may play an important role in taking advantage of the 
advances in molecular and cell biology. Thus research aimed at tailoring extracellular matrices with the 
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FIGURE 122.5 (a) Schematic of hydrogel derivatized with bioactive peptides with an anchorage-dependent neuron 
suspended in 3D. (b) Light micrograph of a E14 chick superior cervical ganglion suspended in 3D and extending 
neurites in an agarose gel derivatized with the laminin oligopeptide CDPGYIGSR. 



appropriate physical, chemical, and biologic cues may be important in optimizing the function of 
transplanted cells, inducing nerve regeneration, or in the construction of neuronal tissues in two and 
three dimensions in a controlled fashion. Controlled design and fabrication of polymer hydrogels and 
polymer scaffolds on a scale that is relevant for single cells also may be important. This would presumably 
control the degree and the molecular location of permissive, attractive, and repulsive regions of the 
substrate and in turn control cellular and tissue response in vitro and in vivo. 

Biologic molecules like laminin, collagen, fibronectin, and tenascin may provide attractive and per- 
missive pathways for axons to grow. On the other hand, some sulfated proteoglycans have been shown 
to inhibit or repulse neurites [Snow & Letourneau, 1992]. The use of these molecules coupled with a 
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clearer understanding of protein-mediated material-cell interaction may pave the way for neural tissue 
engineering, molecule by molecule, in three dimensions. Thus it is possible to tailor the genetic material 
of a cell to make it neurologically relevant and to control its expression by optimizing its extracellular 
environment. All the preceding factors make tissue engineering in the nervous system an exciting and 
challenging endeavor. 
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Contractions of skeletal muscles generate the stability and power for all movement. Consequently, any 
impairment in skeletal muscle function results in at least some degree of instability or immobility. Muscle 
function can be impaired as a result of injury, disease, or old age. The goal of tissue engineering is to 
restore the structural and functional properties of muscles to permit the greatest recovery of normal 
movement. Impaired movement at all ages, but particularly in the elderly, increases the risk of severe 
injury, reduces participation in the activities of daily living, and impacts on the quality of life. 

Contraction is defined as the activation of muscle fibers with a tendency of the fibers to shorten. 
Contraction occurs when an increase in the cytosolic calcium concentration triggers a series of molecular 
events that includes the binding of calcium to the muscle regulatory proteins, the formation of strong 
interactions between the myosin cross-bridges and the actin filaments, and the generation of the cross- 
bridge driving stroke. In vivo, muscles perform three types of contractions depending on the interaction 
between the magnitude of the force developed by the muscle and the external load placed on the muscle. 
When the force developed by the muscle is greater than the load on the muscle, the fibers shorten during 
the contraction. When the force developed by the muscle is equal to the load, or if the load is immovable, 
the overall length of the muscle remains the same. If the force developed by the muscle is less than the 
load placed on the muscle, the muscle is stretched during the contraction. The types of contractions are 
termed miometric, isometric, and pliometric, respectively. Most normal body movements require varying 
proportions of each type of contraction. 
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FIGURE 123.1 Levels of anatomical organization within a skeletal muscle. Source: Bloom, W. and Fawcett, D.W. 
1968. A Textbook of Histology, 9th ed., Saunders, Philadelphia. With permission. 



123.2 Skeletal Muscle Structure 

Each of the 660 skeletal muscles in the human body is composed of hundreds to hundreds of thousands 
of single muscle fibers (Fig. 123.1). The plasma membrane of a muscle fiber is termed the sarcolemma. 
Contractile, structural, metabolic, regulatory, and cytosolic proteins, as well as many myonuclei and other 
cytosolic organelles are contained within the sarcolemma of each fiber (Fig. 123.2). The contractile 
proteins myosin and actin are incorporated into thick and thin myofilaments, respectively, which are 
arrayed in longitudinally repeated banding patterns termed sarcomeres (Fig. 123.1). Sarcomeres in series 
form myofibrils, and many parallel myofibrils exist within each fiber. The number of myofibrils arranged 
in parallel determines the cross-sectional area (CSA) of single fibers, and consequently, the force gener- 
ating capability of the fiber. During a contraction, the change in the length of a sarcomere occurs as thick 
and thin filaments slide past each other, but the overall length of each actin and myosin filament does 
not change. An additional membrane, referred to as the basement membrane or the basal lamina, 
surrounds the sarcolemma of each fiber (Fig. 123.2). 

In mammals, the number of fibers in a given muscle is determined at birth and changes little through- 
out the life span except in cases of injury or disease. In contrast, the number of myofibrils can change 
dramatically, increasing with normal growth or hypertrophy induced by strength training and decreasing 
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FIGURE 123.2 Drawing of a muscle fiber-satellite cell complex. Note that the satellite cell is located between the 
muscle fiber sarcolemma and the basal lamina. Source: Carlson and Faulkner, 1983. With permission. 

with atrophy associated with immobilization, inactivity, injury, disease, or old age. A single muscle fiber 
is innervated by a single branch of a motor nerve. A motor unit is composed of a single motor nerve, 
its branches, and the muscle fibers innervated by the branches. The motor unit is the smallest group of 
fibers within a muscle that can be activated volitionally. Activation of a motor unit occurs when action 
potentials emanating from the motor cortex depolarize the cell bodies of motor nerves. The depolarization 
generates an action potential in the motor nerve that is transmitted to each muscle fiber in the motor 
unit, and each of the fibers contracts more or less simultaneously. Motor units range from small slow 
units to large fast units dependent on the CSA of the motor nerve. 



123.3 Skeletal Muscle Function 



Skeletal muscles may contract singly or in groups, working synergistically. On either side of limbs, muscles 
contract against one another or antagonistically. The force or power developed during a contraction 
depends on the frequency of stimulation of the motor units, the total number of motor units, and the 
size of the motor units recruited. The frequency of stimulation, particularly for the generation of power, 
is normally on the order of the frequency-power relationship. Consequently, the total number of motor 
units recruited is the major determinant of the force or power developed. 
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Motor units are classified into three general categories based on their functional properties [Burke 
et al., 1973]. Slow (S) units have the smallest single muscle fiber CSAs, the fewest muscle fibers per motor 
unit, and the lowest velocity of shortening. The cell bodies of the S units are the most easily depolarized 
to threshold [Henneman, 1965]. Consequently, S units are the most frequently recruited during tasks 
that require low force or power but highly precise movements. Fast-fatigable (FF) units are composed 
of the largest fibers, have the most fibers per unit, and have the highest velocities of shortening. The FF 
units are the last to be recruited and are recruited for high force and power movements. The fast fatigue- 
resistant (FR) units are intermediate in terms of the CSAs of their fibers, the number of fibers per motor 
unit, the velocity of shortening, and the frequency of recruitment. The force normalized per unit CSA 
is -280 kN/m 2 for each type of fiber, but the maximum normalized power ( W/kg) developed by FF units 
is as much as four-fold greater than that of the S units due to a four-fold higher velocity of shortening 
for FF units. Motor units may also be identified by histochemical techniques as Type I (S), IIA (FR), and 
IIB (FF). Classifications based on histochemical and functional characteristics are usually in good agree- 
ment with one another, but differences do exist, particularly following experimental interventions. 
Consequently, in a given experiment, the validity of this interpretation should be verified. 

123.4 Injury and Repair of Skeletal Muscle 

Injury to skeletal muscles may occur as a result of disease, such as dystrophy; exposure to myotoxic agents, 
such as bupivacaine or lidocaine; sharp or blunt trauma, such as punctures or contusions; ischemia, such 
as that which occurs with transplantation; exposure to excessively hot or cold temperatures; and con- 
tractions of the muscles. Pliometric contractions are much more likely to injure muscle fibers than are 
isometric or miometric contractions [McCully and Faulkner, 1985]. Regardless of the factors responsible, 
the manner in which the injuries are manifested appears to be the same, varying only in severity. In 
addition, the processes of fiber repair and regeneration appear to follow a common pathway regardless 
of the nature of the injurious event [Carlson and Faulkner, 1983]. 

Injury of Skeletal Muscle 

The injury may involve either some or all of the fibers within a muscle [McCully and Faulkner, 1985]. 
In an individual fiber, focal injuries, localized to a few sarcomeres in series or in parallel (Fig. 123.3), as 
well as more widespread injuries, spreading across the entire cross section of the fiber, are observed using 
electron microscopic techniques [Macpherson et al., 1997]. Although the data are highly variable, many 
injuries also give rise to increases in serum levels of muscle enzymes, particularly creatine kinase, leading 
to the conclusion that sarcolemmal integrity is impaired [McNeil and Khakee, 1992; Newham et al., 
1983]. This conclusion is further supported by an influx of circulating proteins, such as serum albumin 
[McNeil and Khakee, 1992], and of calcium [Jones et al., 1984]. An increase in intracellular calcium 
concentration may activate a variety of proteolytic enzymes leading to further degradation of sarcoplasmic 
proteins. 

In cases when the damage involves a large proportion of the sarcomeres within a fiber, the fiber becomes 
necrotic. If blood flow is impaired, fibers remain as a necrotic mass of noncontractile tissue [Carlson 
and Faulkner, 1983]. In contrast, in the presence of an adequate blood supply, the injured fibers are 
infiltrated by monocytes and macrophages [McCully and Faulkner, 1985]. The phagocytic cells remove 
the disrupted myofilaments, other cytosolic structures, and the damaged sarcolemma (Fig. 123.4). The 
most severe injuries result in the complete degeneration of the muscle fiber, leaving only the empty basal 
lamina. The basal lamina appears to be highly resistant to any type of injury and generally remains intact 
[Carlson and Faulkner, 1983]. 

An additional indirect measure of injury is the subjective report by human beings of delayed onset 
muscle soreness, common following intense or novel exercise [Newham et al., 1983]. Because of the focal 
nature of the morphological damage, the variability of serum enzyme levels and the subjectivity of reports 
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FIGURE 123.3 Focal areas of damage to single sarcomeres after a single 40% stretch of a single maximally activated 
rat skeletal muscle fiber (average sarcomere length 2.6 um). Two types of damage are observed in this electron 
micrograph. The arrow indicates Z-line streaming and asterisks show disruption of the thick and thin filaments in 
the region of overlap, the A-band. A third type of damage, the displacement of thick filaments to one Z-line is not 
shown. Note that with the return of the relaxed fiber to an average sarcomere length of 2.6 |lm, the sarcomeres 
indicated by asterisks are at 5.1 um and 3.8 urn while sarcomeres in series are shortened to -1.8 um. Scale bar is 
1.0 lim. Source. Modified from Macpherson et al., 1997. With permission. 

of soreness, the most quantitative and reproducible measure of the totality of a muscle injury is the 
decrease in the ability of the muscle to develop force [McCully and Faulkner, 1985; Newham et al., 1983]. 

Repair of Injured Skeletal Muscles 

Under circumstances when the injury involves only minor disruptions of the thick or thin filaments of 
single sarcomeres, the damaged molecules are likely replaced by newly synthesized molecules available 
in the cytoplasmic pool [Russell et al., 1992]. In addition, contraction-induced disruptions of the sarco- 
lemma are often transient and repaired spontaneously, allowing survival of the fiber [McNeil and Khakee, 
1992]. Following more severe injuries, complete regeneration of the entire muscle fiber will occur. 

Satellite Cell Activation 

A key element in the initiation of muscle fiber regeneration following a wide variety of injuries is the 
activation of satellite cells [Carlson and Faulkner, 1983]. Satellite cells are quiescent myogenic stem cells 
located between the basal lamina and the sarcolemma (Fig. 123.2). Upon activation, satellite cells divide 
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FIGURE 123.4 Schematic representation of the cellular responses during the processes of degeneration and regen- 
eration following transplantation of extensor digitorum longus muscles in rats and cats. The diagram is divided into 
segments that represent the histological appearance of the muscle cross-section at various times after transplantation. 
The times given in days refer to rat muscles and those given in weeks refer to the larger cat muscles. The letters 
indicate groups of muscle fibers with similar histological appearances. A: surviving fibers, B: fibers in a state of 
ischemic necrosis, C: fibers invaded by phagocytic cells, D: myoblasts and early myotubes, E: early myofibers, F: 
immature regenerating fibers, G: mature regenerated fibers, H: normal control muscle fibers [Carlson and Faulkner, 
1983]. Source: Mauro, A. 1979. Muscle Regeneration, p. 493-507, Raven Press, New York, NY. With permission. 

mitotically to give rise to myoblasts. The myoblasts can then fuse with existing muscle fibers, acting as 
a source of new myonuclei. This is the process by which a muscle fiber increases the total number of 
myonuclei in fibers that are increasing in size [Moss and Leblond, 1971] and may be necessary to repair 
local injuries. Alternatively, the myoblasts can fuse with each other to form multinucleated myotubes 
inside the remaining basal lamina of the degenerated fibers [Carlson and Faulkner, 1983]. The myotubes 
then begin to produce muscle specific proteins and ultimately differentiate completely into adult fast or 
slow muscle fibers [Carlson and Faulkner, 1983]. Recent evidence also suggests that in addition to the 
activation of resident satellite cells, regenerating muscle may recruit undifferentiated myogenic precursor 
cells from other sources [Ferrari et al., 1998] 

Following a closed contusion injury, mitotic activation of satellite cells has been observed within the 
first day after the injury and is correlated in time with the appearance of phagocytes and newly formed 
capillaries [Hurme and Kalimo, 1992]. Similarly, DNA synthesis by the satellite cells is observed within 
the first day following crush injuries [Bischoff, 1986] and exercise-induced injuries [Darr and Schultz, 
1987] in rats. These observations are consistent with the hypothesis that the factors that activate satellite 
cells may be endogenous to the injured tissue itself or synthesized and secreted by platelets at the wound 
site, infiltrating neutrophils, and macrophages [reviewed in Husmann et al., 1996]. The primary candi- 
dates for the factors that activate and regulate satellite cell function include the fibroblast growth factors 
(FGFs), platelet-derived growth factor (PDGF), transforming growth factor beta (TGF-(3), and insulin- 
like growth factors I and II (IGF-I, II). 

The effects of these factors on muscle satellite cell proliferation and differentiation have been studied 
extensively in cell culture [reviewed in Florini and Magri, 1989]. FGF is a powerful mitogen for myogenic 
cells from adult rat skeletal muscle, but a potent inhibitor of terminal differentiation, i.e., myoblast fusion 
and expression of the skeletal muscle phenotype. PDGF also shows a strong stimulating effect on pro- 
liferation and inhibitory effect on differentiation of satellite cells [Jin et al., 1990]. Similarly, the presence 
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of TGF-(3 prevents myotube formation as well as muscle-specific protein synthesis by rat embryo myo- 
blasts and by adult rat satellite cells. In contrast to the previously mentioned growth factors, all of which 
inhibit myoblast differention, IGFs stimulate both proliferation and differentiation of myogenic cells 
[Ewton and Florini, 1980]. Despite our extensive knowledge of the actions of many individual growth 
factors in vitro and in vivo, the interactions between different growth factors have been less thoroughly 
investigated [Rosenthal et al., 1991]. A better understanding of the interactions between growth factors 
and the mechanisms that guide satellite cells through the regeneration process is necessary. 

Myogenic Regulatory Factors 

The conversion of pluripotent embryonic stem cells to differentiated muscle cells involves the commit- 
ment of these cells to the myogenic lineage and the subsequent proliferation, differentiation and fusion 
to form multinucleated myotubes and ultimately mature muscle cells. New muscle cell formation from 
muscle satellite cells resembles embryonic development in the sense that in regenerating muscle cells 
embryonic isoforms of the muscle proteins are expressed [Whalen et al., 1985]. The conversion of stem 
cells to mature fibers, in both developing muscle and regenerating muscle is directed by a group of related 
regulatory factors. These so-called muscle regulatory factors (MRFs) are part of a superfamily of basic 
helix-loop-helix DNA binding proteins that interact to regulate the transcription of skeletal muscle genes 
[reviewed in Weintraub et al., 1991]. MyoD was the first MRF identified followed by three other related 
genes including myogenin, MRF4 (also called herculin or Myf-6), and Myf-5 [Weintraub et al., 1991]. 

The observations that each of the MRFs has the ability to independently convert cultured fibroblasts 
into myogenic cells led to the original conclusion that functionally the MRFs were largely redundant. 
Subsequent gene targeting experiments, in which null mutations were introduced in each of the MRF 
genes, support separate and distinct roles in myogenesis for each MRFs [reviewed in Rudnicki and 
Jaenisch, 1995]. For example, mice that lack either Myf-5 or MyoD apparently have normal skeletal 
muscle but deletion of both genes results in the complete absence of skeletal myoblasts [Rudnicki et al., 
1992; 1993; Braun et al., 1992]. While these observations suggest that Myf-5 and MyoD do have over- 
lapping functions, characterization of the temporal-spatial patterns of myogenesis in Myf-5- and MyoD- 
deficient mouse embryos support the hypothesis that during normal development Myf-5 and MyoD 
primarily regulate epaxial (paraspinal and intercostal) and hypaxial (limb and abdominal wall) muscle 
development, respectively [Kablar et al., 1997]. In mice lacking myogenin, the number of myoblasts is 
not different from that of control mice, but skeletal muscles in myogenin-deficient mice display a marked 
reduction in the number of mature muscle fibers [Hasty et al., 1993; Nabeshima et al., 1993]. 

In summary, the MRF family can be divided into two functional groups. MyoD and Myf-5 are referred 
to as primary factors and are required for the determination of skeletal myoblasts whereas myogenin and 
MRF4 are secondary factors that act later and are necessary for differentiation of myoblasts into myotubes. 
How the MRFs control the series of events required for myoblast determination and differentiation are 
important questions for the future. In addition, the roles played by this family of proteins in regeneration, 
adaptation, and changes in skeletal muscle with aging are areas of active investigation [Jacobs-El et al., 
1995; Marsh et al, 1997; Megeney et al, 1996]. 

123.5 Reconstructive Surgery of Whole Skeletal Muscles 

When an injury or impairment is so severe that the total replacement of the muscle is required, a whole 
donor muscle must be transposed or transplanted into the recipient site [Faulkner et al., 1994a]. One of 
the most versatile muscles for transpositions is the latissimus dorsi (LTD) muscle. LTD transfers have 
been used in breast reconstruction [Moelleken et al., 1989], to restore elbow flexion, and to function as 
a heart assist pump [Carpentier and Chachques, 1985]. Thompson [1974] popularized the use of small 
free standard grafts to treat patients with partial facial paralysis. The transplantation of large skeletal 
muscles in dogs with immediate restoration of blood flow through the anastomosis of the artery and 
vein provided an operative technique with numerous applications [Tamai et al., 1970]. Coupled with 
cross-face nerve grafts, large skeletal muscles are transplanted with microneurovascular repair to correct 



e 2000 by CRC Press LLC 



deficits in the face [Harii et al., 1976] and adapted for reconstructive operations to treat impairments in 
function of the limbs, anal and urinary sphincters, and even the heart [Freilinger and Deutinger, 1992]. 
Transposition and transplantation of muscles invariably results in structural and functional deficits 
[Guelinckx et al., 1992]. The deficits are of the greatest magnitude during the first month and then a 
gradual recovery results in the stabilization of structural and functional variables between 90 and 120 days 
[Guelinckx et al., 1992]. In stabilized vascularized grafts ranging from 1 to 3 grams in rats to 90 grams 
in dogs, the major deficits are a -30% decrease in muscle mass and in most grafts a -40% decrease in 
maximum force [Faulkner et al., 1994a]. The decrease in power is more complex since it depends on 
both the average shortening force and the velocity of shortening. As a consequence, the deficit in 
maximum power may be either greater, or less than the deficit in maximum force [Kadhiresan et al., 
1993]. Tenotomy and repair are major factors responsible for the deficits [Guelinckx et al., 1988]. When 
a muscle is transplanted to act synergistically with other muscles, the action of the synergistic muscles 
may contribute to the deficits observed [Miller et al., 1994] . Although the data are limited, skeletal muscle 
grafts appear to respond to training stimuli in a manner not different from that of control muscles 
[Faulkner et al., 1994a]. The training stimuli include traditional methods of endurance and strength 
training [Faulkner et al., 1994b], as well as chronic electrical stimulation [Pette and Vrbova, 1992]. In 
spite of the deficits, transposed and transplanted muscles develop sufficient force and power to function 
effectively to maintain posture and patent sphincters and to move limbs or drive assist devices in parallel 
or in series with the heart [Faulkner et al., 1994a]. 

123.6 Myoblast Transfer and Gene Therapy 

Myoblast transfer and gene therapy are aimed at delivering exogenous genetic constructs to skeletal 
muscle cells. The implications of myoblast and gene therapy hold great promise for skeletal muscle 
research and for those afflicted with inherited myopathies such as Duchenne and Becker muscular 
dystrophy (DMD and BMD). Myoblast transfer is a cell-mediated technique designed to treat inherited 
myopathies by intramuscular injection of myoblasts containing a normal functional genome. The goal 
is to correct for a defective or missing gene in the myopathic tissue through the fusion of normal myoblasts 
with growing or regenerating diseased cells. Gene therapy presents a more complex and flexible approach, 
whereby genetically engineered DNA constructs are delivered to a host cell to specifically direct production 
of a desired protein. By re-engineering the coding sequence of the gene and its regulatory regions, the 
function, the quantity of expression, and the protein itself can be altered. For many years, cells have been 
genetically altered to induce the production of a variety of useful proteins, such as human growth 
hormone and interferon. 

As a focus in skeletal muscle tissue engineering, DMD is an X-linked recessive disorder characterized 
by progressive muscle degeneration resulting in debilitating muscle weakness and death in the second or 
third decade of life as a result of respiratory failure [Emery, 1988]. DMD and the milder BMD are due 
to genetic defects that lead to the absence or marked deficiency in the expression or functional stability 
of the protein dystrophin [Bonilla et al., 1988; Hoffman et al., 1988]. Similarly, a mutation in the dys- 
trophin gene leads to the complete absence of the protein in muscle and brain tissues of the mdx mouse 
[Bulfield et al., 1984; Sicinski et al., 1989]. The homology of the genetic defects in DMD patients and 
mdx mice support the use of mdx mice as a model of dystrophin deficiency to explore the processes of 
the dystrophic disease and test proposed therapies or cures. 

Myoblast Transfer Therapy 

The concept of myoblast transfer is based on the role satellite cells play in muscle fiber growth and repair 
[Carlson and Faulkner, 1983]. As a therapy for DMD, the idea is to obtain satellite cells containing a 
functional dystrophin gene from a healthy compatible donor, have the cells multiply in culture, and then 
inject the "normal" myoblasts into the muscles of the patient. The objective is for the injected myoblasts 
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to fuse with growing or regenerating muscle fibers to form a mosaic fiber in which the cytoplasm will 
contain normal myoblast nuclei capable of producing a functional form of dystrophin. 

Experiments involving mdx mice have had varying success. Several investigators have reported that 
implantation of healthy myoblasts into muscles of mdx mice led to the production of considerable 
quantities of dystrophin [Morgan et al., 1990; Partridge et al., 1989]. Others found that myoblasts injected 
into limb muscles of mdx and control host mice showed a "rapid and massive" die off shortly after 
injection [Fan et al. 1996] with large and permanent decreases in muscle mass and maximum force 
[Wernig et al., 1995]. The successful transfer and fusion of donor myoblasts may be enhanced by X-ray 
irradiation of mdx muscles prior to myoblast injection to prevent the proliferation of myoblasts endog- 
enous to the host and encourage the growth of donor myoblasts [Morgan et al., 1990]. 

In contrast to the studies with mice, delivery of myoblasts to DMD patients has shown very low levels 
of fusion efficiency, transient expression of dystrophin, and immune rejection [Gussoni et al., 1992; 
Karpati et al., 1993; Mendell et al., 1995; Morgan, 1994]. The use of X-ray irradiation in an attempt to 
enhance transfer efficiency is not applicable to DMD boys due to substantial health risks. Furthermore, 
immunosuppression may be necessary to circumvent immune rejection, which carries risks of its own. 
A better understanding of the factors that govern the survival, fusion, and expression of donor myoblasts 
is required before the viability of myoblast transfer as a treatment of DMD can be evaluated. 

Gene Therapy 

The aim of gene therapy for DMD is to transfer a functional dystrophin gene directly into the skeletal 
muscle. The challenge behind gene therapy is not only obtaining a functional genetic construct of the 
dystrophin gene and regulatory region but the effective delivery of the gene to the cell's genetic machinery. 
Methods to transfer genetic material into a muscle cell include direct injection, and the use of retroviral 
and adenoviral vectors. Each of these strategies presents highly technical difficulties that to date remain 
unresolved. 

Transgenic Mice 

To explore the feasibility of gene therapy for DMD, Cox and colleagues [1993] examined the introduction 
of an exogenous dystrophin gene into the germ line of mdx mice to produce transgenic animals. The 
transgenic mdx mice expressed nearly 50 times the level of endogenous dystrophin found in muscles of 
control C57BL/10 mice and displayed the complete absence of any morphological, immunohistological, 
or functional symptoms of the murine muscular dystrophy with no deleterious side effects. Although 
transgenic technology does not provide an appropriate means for treating humans, these results dem- 
onstrated the efficacy of gene therapy to correct pathological genetic defects such as DMD. 

Direct Intramuscular Injection 

The straightforward gene delivery method of direct injection of plasmid DNA into skeletal and heart 
muscles [Lin et al, 1990; Wolff et al., 1990] has been proposed as a treatment for DMD and BMD. The 
idea is that dystrophic cells will incorporate the genetic constructs, whereby the genes will use the cell's 
internal machinery to produce the protein dystrophin. The advantages of direct injection of DNA as a 
gene delivery system are its simplicity, and it presents no chance of viral infection or the potential of 
cancer development that can occur with viral vectors [Morgan, 1 994] . Although this approach is appealing 
in principal, direct intramuscular injection of human dystrophin plasmid DNA into the quadriceps 
muscles of mdx mice led to the expression of human dystrophin in only 1% to 3% of the muscle fibers 
[Acsadi et al., 1991] . In order for this method to be clinically effective, a much larger number of transfected 
myofibers must be achieved. 

Retrovirus-Mediated Gene Transfer 

Retroviruses reverse the normal process by which DNA is transcribed into RNA. A single-stranded viral 
RNA genome enters a host cell and a double helix comprised of two DNA copies of the viral RNA is 
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created by the enzyme reverse transcriptase. Catalyzed by a viral enzyme, the DNA copy then integrates 
into a host cell chromosome where transcription, via the host cell RNA polymerase, produces large 
quantities of viral RNA molecules identical to the infecting genome [Alberts et al., 1989]. Eventually, new 
viruses emerge and bud from the plasma membrane ready to infect other cells. Consequently, retroviral 
vectors used for gene therapy are, by design, rendered replication defective. Once they infect the cell and 
integrate into the genome, they cannot make functional retroviruses to infect other cells. After the infective 
process is completed, cells are permanently altered with the presence of the viral DNA that causes the 
synthesis of proteins not originally endogenous to the host cell. 

A primary obstacle to the efficiency of a retroviral gene delivery system is its dependence on host cell 
division. The requirement that a cell must be mitotically active for the virus to be incorporated into the 
host cell genetic machinery [Morgan, 1994] presents a problem for skeletal muscle tissue engineering 
since skeletal muscle is in a post-mitotic state. Nonetheless, as described above, recovery from injury in 
skeletal muscle involves the proliferation of myogenic precursor cells and either the incorporation of 
these cells into existing muscle fibers or the fusion of these cells with each other to form new fibers. 
Furthermore, since degeneration and regeneration of muscle cells are ongoing in DMD patients, viral 
transfection of myoblasts may be an effective route for gene delivery to skeletal muscle for treatment of 
DMD. 

Another limitation of the retroviral gene delivery system is the small carrying capacity of the retrovi- 
ruses of approximately 7 kilobases. This size limitation precludes the delivery of a full-length dystrophin 
construct of 12-14 kilobases [Dunckley et al., 1993; Morgan, 1994]. A 6.3 kilobase dystrophin construct, 
containing a large in-frame deletion resulting in the absence of -40% of the central domain of the protein, 
has received a great deal of attention since its discovery in a BMD patient expressing a very mild 
phenotype. A single injection of retrovirus containing the Becker dystrophin minigene into the quadriceps 
or tibialis anterior muscle of mdx mice led to the sarcolemmal expression of dystrophin in an average of 
6% of the myo fibers [Dunckley et al., 1993]. Restoration of the 43-kDalton dystrophin-associated glyco- 
protein was also observed and expression of the recombinant dystrophin was maintained for up to nine 
months. Transduction of the minigene was significantly enhanced when muscles were pretreated with 
an intramuscular injection of the myotoxic agent bupivacaine to experimentally induce muscle regeneration. 

Adenovirus-Mediated Gene Transfer 

Adenoviruses have many characteristics that make the adenovirus-mediated gene transfer the most 
promising technology for gene therapy of skeletal muscle. The primary advantages are the stability of 
the viruses allowing them to be prepared in large amounts and the ability of adenoviral vectors to infect 
nondividing or slowly proliferating cells. In addition, adenoviral vectors have the potential to be used 
for systemic delivery of exogenous DNA. Through the use of tissue-specific promoters, specific tissues 
such as skeletal muscle may be targeted for transfection via intravenous injection. 

Initial studies using adenoviral vectors containing the Becker dystrophin minigene driven by the Rous 
Sarcoma Virus promoter demonstrated that after a single intramuscular injection in newborn mdx mice, 
50% of muscle fibers contained dystrophin [Ragot et al., 1993]. The truncated dystrophin was correctly 
localized to the sarcolemmal membrane and appeared to protect myofibers from the degeneration process 
characteristic of mdx muscles [Vincent et al., 1993]. Six months after a single injection, expression of the 
minigene was still observed in the treated muscle. More recently, these same investigators demonstrated 
that the injection of the adenoviral vector containing the dystrophin minigene into limb muscles of 
newborn mdx mice provided protection from the fiber damage and force deficit associated with a protocol 
of pliometric contractions that was administered at four months of age [Deconinck et al., 1996]. 

Despite the promise of adenovirus-mediated gene therapy, a number of limitations to its usefulness 
remain to be resolved. One major drawback of the system is the relatively brief duration of transgene 
expression observed following injection of adenoviral vectors into adult immunocompetent animals 
[Kass-Eisler et al., 1994]. The lack of long-term exogenous gene expression is likely the result of low level 
expression of endogenous viral proteins triggering an inflammatory response that attacks infected cells 
[Yang et al., 1994]. In addition to the difficulties resulting from the potent immune response triggered 
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by adenovirus, direct cytotoxic effects of adenovirus injection on skeletal muscle have been reported 
[Petrof et al., 1995]. Current generation adenoviruses are also limited by their relatively small cloning 
capacity of ~8 kilobases. The development of adenoviral vectors with increased cloning capacity, the 
ability to evade host immune rejection, and no toxic effects are areas of active investigation [Kumar- 
Singh and Chamberlain, 1996; Petrof et al, 1996; Hauser et al, 1997]. 
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This chapter reviews the current state-of-the-art of articular cartilage tissue engineering, and focuses on 
a cell-polymer-bioreactor model system which can be used for controlled studies of chondrogenesis and 
the modulation of engineered cartilage by environmenal factors. 

124.2 Cell-Based Approaches to Cartilage Tissue Engineering 

Articular cartilage is an avascular tissue that contains only one cell type (the chondrocyte), and has a 
very limited capacity for self-repair. The chondrocytes are responsible for the synthesis and maintenance 
of their extracellular matrix (ECM), which is composed of a hydrated collagen network (-60% of the 
tissue dry weight, dw), a highly charged proteoglycan gel (PG, -25% dw), and other proteins and 
glycoproteins (-15% dw). 1 Its high water content (70 to 80% of the tissue wet weight, ww) enables 
cartilage to withstand the compressive, tensile and shear forces associated with joint loading. None of 
the methods conventionally used for cartilage repair (e.g., tissue auto- or allografts) can predictably 
restore a durable articular surface to an osteoarthrotic joint. 2 Cell-based therapies, i.e., implantation of 
cells or engineered cartilage, represent an alternative approach to articular cartilage repair. 

Figure 124.1 shows the cell-polymer-bioreactor system for cartilage tissue engineering. Constructs 
based on chondrogenic cells and polymer scaffolds are cultured in bioreactors to form 3-dimensional 
(3D) cartilaginous tissues. Engineered cartilage can be used either in vivo, to study articular cartilage 
repair or in vitro, for controlled studies of cell and tissue-level responses to molecular, mechanical, and 
genetic manipulations. 

Table 124.1 lists selected, representative examples of recent articular cartilage tissue engineering studies 
in which the stated research goals were either in vivo cartilage repair or in vitro studies of chondrogenesis. 
Several key parameters varied from study to study: (i) cell source and expansion in culture, (ii) scaffold 
material and structure, (iii) in vitro cultivation (conditions and duration), (iv) additional components 
(e.g., perichondrium), (v) experimental animal (species and age), (vi) surgical model (defect location 
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FIGURE 124.1 Cell-based approach to articular cartilage repair. Isolated cells, e.g., chondrocytes or bone marrow 
stromal cells (BMSC) are seeded onto 3-dimensional scaffolds, e.g., polymers formed as fibrous meshes or porous 
sponges, to form cell-polymer constructs. Constructs are first cultured in vitro, e.g., in bioreactors, and then used 
either as implants or for in vitro research. 

and dimensions), (vii) graft type (autograft or allograft), (viii) duration of in vivo implantation, and 
(viii) methods used to assess the resulting tissue (e.g., histological, biochemical, and/or mechanical). 

In Vivo Cartilage Repair 

Brittberg et al. 3 reported a new cell-based procedure for the repair of human knee injuries that could 
potentially eliminate the need for more than half a million arthroplastic procedures and joint replace- 
ments performed annually in the United States. Autologous articular cartilage was harvested from a 
minor load-bearing area on the distal femur, and component cells were isolated, expanded in vitro, and 
reimplanted back into the patient. In particular, after excising the damaged cartilage down to the sub- 
chondral bone, defects (1.65 to 6.5 cm 2 ) were sutured closed with a periosteal flap creating a pocket into 
which the cultured cells were injected. Disabling symptoms (e.g., knee pain, swelling, locking) were 
markedly reduced over a follow-up period of up to 5 years. Femoral transplants were clinically graded 
as good or excellent in 88% of the cases, with 73% of the biopsy specimens resembling hyaline cartilage. 
In contrast, patellar transplants were graded as good or excellent in only 29% of the cases, with only 
14% of the biopsies resembling hyaline cartilage. A therapy based on this technique was approved for 
clinical use by the Food and Drug Administration (FDA) in August, 1997 and is being marketed under 
the tradename Carticel®. 4 However, the original clinical study report has been criticized for not being a 
prospective controlled, randomized study and for its lack of a standard outcome analysis (i.e., for 
subjective evaluation with little quantitative biochemical or mechanical data). 5,6 

The Carticel® technique was also assessed in rabbits and dogs. In one study, cultured autologous 
chondrocytes were transplanted into patellar defects in conjunction with periosteal flaps and in some 
cases carbon fiber pads. 7 The repair tissue was characterized as hyaline-like and reported to develop 
in vivo, as assessed by an increasing degree of cellular columnarization. Histological scores were highest 
in defects treated with transplanted chondrocytes and periosteum, as compared to the combination of 
chondrocytes, carbon fibers and periosteum or periosteum only. The finding that the use of carbon fibers 
did not improve longterm repair was attributed to scaffold-induced diffusional limitations. 
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In other studies, cultured autologous chondrocytes in conjunction with periosteal flaps were trans- 
planted into femoral defects in dogs and compared to periosteal flaps alone and to untreated defects. 8,9 
Three phases of healing were demonstrated: formation of repair tissue (at 1.5 months), remodeling (at 
3 and 6 months), and degradation (at 12 and 18 months). Neither periosteum nor transplanted chon- 
drocytes enhanced healing after 1 year, at which time hyaline-like repair tissue appeared to be displaced 
by fibrocartilage. The differences between results obtained in rabbit and dog studies were attributed to 
several variables including species, subject age, defect location, surgical technique, and postoperative 
animal activity. Low retention of implanted cells in the lesion and failure of immature repair tissue 
subjected to high mechanical forces were listed as possible causes of degradation. 

The group of Caplan et al. pioneered the use of autologous mesenchymal stem cells in the repair of 
osteochondral defects. 10 Osteoprogenitor cells were selected from whole bone marrow based on their 
ability to adhere to Petri dishes and expanded in monolayer culture prior to implantation. Autologous 
precursor cells were delivered in collagen gels into defects that were in some cases pretreated with 
proteolytic enzymes. Other implants were based on either autologous precursor cells derived from 
periosteum or allogeneic chondrocytes derived from articular cartilage. The dimensions of defects were 
up to 3 mm wide, 6 mm long, and 3 mm deep, among the largest reported for repair studies in rabbits. 
Repair was assessed histologically and in some cases mechanically, e.g., relative compliance using a 
microindentation probe. 11 

Autologous precursor cells in collagen gels formed a cartilaginous surface zone while tissue at the base 
of the defect hypertrophied, calcified, and was replaced by host-derived vasculature, marrow, and bone. 10 
It was postulated that the different biological milieus of the synovial fluid at the joint surface and the 
osseus recepticle in the underlying bone played key roles in architecturally appropriate precursor cell 
differentiation. Improved integration of the graft with the surrounding host tissue was reported following 
partial digestion at the defect site (e.g., with trypsin) prior to implantation; this finding was attributed 
to enhanced interdigitation of newly synthesized ECM molecules. However, progressive thinning of the 
cartilaginous surface zone was observed over 6 months, which was more pronounced with periosteally- 
derived than with bone marrow-derived precursor cells. In contrast, allografted chondrocytes in collagen 
gels rapidly formed plugs of hyaline cartilage that filled the entire defect but failed to develop a region 
of subchondral bone at the base or to integrate with the surrounding host tissue even after 6 months. 
Similar results were obtained when chondrocytes were cultured in vitro in collagen gels prior to in vivo 
implantation 12 (as described below). 

Nevo et al. 13,14 have explored the use of both chondrocytes and osteoprogenitor cells derived from 
bone marrow delivered in a hyaluronic acid gel. Autografts or allografts were used to repair tibial defects 
in goats and femoral defects in chicks. The autografts were superior to the allografts, which evoked a 
typical immune response and progressive arthrosis. In both animal models, autografted defects were 
repaired with a well-integrated tissue that resembed hyaline cartilage at the surface and bone at the base. 

Amiel et al. qualitatively (i.e., histologically) and quantitatively (i.e., biochemically and mechanically) 
assessed osteochondral defects repaired with perichondral cells and porous polylactic acid in a rabbit 
allograft model. 1516 After 6 weeks, the alignment of cells in the repair tissue followed the architecture of 
the scaffold. After 1 year, the repair tissue had variable histological appearance (only 50% of the sub- 
chondral bone reformed) and biochemical composition (dry weight fractions of GAG were 55% those 
of the host cartilage), but mechanical properties (i.e., modulus and permeability in radially confined 
compression) were comparable to those measured for the host cartilage. In this case, mechanical testing 
appeared to be less sensitive than histological and biochemical assessments; this finding was attributed 
to difficulties in mechanical measurements and the derivation of intrinsic parameters, due to the geometry 
and the non-homogeneous nature of the repair tissue, respectively. 

Noguchi et al. 17 compared autologous and allogenic repair using chondrocytes cultured in collagen 
gels prior to implantation in inbred rats. After 2 to 4 weeks, the repair tissue consisted of hyaline articular 
cartilage with slight inflammatory cell infiltration in both groups; the immune response was somewhat 
more conspicuous in the allografts. However, inflammation resolved by 8 weeks and auto- and allografted 
repair tissues were almost identical after 6 months. 
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Freed et al. 18 cultured chondrocytes expanded by serial passage (P4) on polyglycolic acid (PGA) 
scaffolds for 3 to 4 weeks in vitro prior to implantation as allografts in rabbits. Compared to native rabbit 
articular cartilage, constructs contained 25% as much total collagen and 86% as much GAG per gram 
dry weight at the time of implantation. After 1 and 6 months, the histological scores of defects repaired 
with cell-PGA constructs did not differ significantly from those following implantation of PGA alone, 
except for qualitatively better surface smoothness, cell columnarization, and spatial uniformity of GAG 
in the defects grafted with cell-PGA constructs. 

Grande et al. 19 cultured osteoprogenitor cells derived from skeletal muscle on PGA scaffolds for 2 to 
3 weeks in vitro prior to implantation as allografts in rabbits. At the time of implantation, cells were 
attached to the scaffold but had not undergone chondrogenesis. After 3 months, defects repaired with 
cell-PGA constructs consisted of a cartilaginous surface region similar in thickness to the host cartilage 
and normal appearing subchondral bone, while implantation of PGA alone resulted in a patchy mixture 
of fibrous and hyaline cartilage. 

Kawamura et al. 12 cultured chondrocytes in collagen gels for 2 weeks in vitro prior to implantation as 
allografts in rabbits. At the time of implantation, constructs appeared stiffer than uncultured cells in 
collagen gels and histologically resembled hyaline cartilage. After 6 months, the repair tissue still consisted 
of a thick plug of hyaline cartilage, i.e., the region at the base was neither vascularized nor replaced by 
bone. It was suggested that a mechanical mismatch between the thin layer of host cartilage and the thick 
plug of cartilaginous repair tissue, and the lack of regeneration of a proper subchondral bony base would 
contribute to long term implant failure. 

In Vitro Chondrogenesis 

Buschmann et al. 20,21 showed that high cell density cultures of calf articular chondrocytes in agarose gels 
formed a mechanically functional cartilaginous matrix. After 7 weeks in vitro, construct compositions 
and mechanical properties (i.e., fractions of DNA and GAG, compressive modulus, streaming potential) 
reached values of about 25% those of native calf cartilage. Constructs responded to mechanical forces in 
a manner similar to that of native cartilage: static compression supressed ECM synthesis by an amount 
that increased with increasing compression amplitude and culture time, while dynamic compression 
stimulated ECM synthesis by an amount that increased with ECM accumulation and culture time. The 
authors postulated that several mechanisms could be involved in mechanotransduction, including cell- 
ECM interactions and/or changes in interstitial fluid flow and streaming potential. 

Hauselmann et al. 22 demonstrated the phenotypic stability of calf articular cartilage after prolonged 
(8 month) cultivation in alginate beads. The ECM formed by adult human chondrocytes cultured in gels 
for 30 days was similar to that of human articular cartilage and could be used to study PG turnover. 23 
The ECM consisted of two compartments: a small amount of cell- associated matrix (corresponding to 
the pericellular and territorial ECM), and a larger amount of matrix further removed from the cells 
(corresponding to interterritorial ECM). Aggregated PGs in the cell-associated ECM turned over relatively 
quickly (t 1/2 of 29 days) as compared to those in the further-removed ECM (t 1/2 >100 days). These findings 
were attributed to the effects of proteolytic enzymes located at the cell membrane. 

Freed et al. 24 characterized cartilaginous constructs based on calf articular or human rib chondrocytes 
and synthetic polymer scaffolds. Fibrous polyglycolic acid (PGA) meshes yielded constructs with higher 
cellularities and GAG production rates as compared to porous polylactic acid (PLLA) sponges; these 
findings were attributed to differences in polymer geometry (mesh vs. sponge, pore size distribution) 
and degradation (faster for PGA than PLLA). PGA scaffolds were subsequently characterized in detail 
and produced on a commercial scale. 25 

Grande et al. 26 showed that the choice of scaffold affected ECM synthesis rates, e.g., PGA and collagen 
scaffolds respectively enhanced GAG and protein synthesis rates of calf chondrocytes cultured in a 
perfused system. Sittinger et al. 27 observed PG and collagen deposition by human articular chondrocytes 
cultured on synthetic polyester meshes coated with poly-L-lysine or type II collagen and embedded in 
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agarose gels. Dunkelman et al. 28 and Grande et al. 26 both reported that cell-polymer constructs were more 
likely to form cartilaginous ECM if cultured in perfused vessels rather than statically. 

Carver and Heath 29,30 demonstrated that physiologicial levels of compression enhanced GAG deposition 
and improved the compressive moduli of engineered cartilage. In particular, constructs based on equine 
chondrocytes and PGA scaffolds were subjected to intermittent hydrostatic pressures of 500 and 1000 psi 
(3.4 and 6.9 MPa) in a semi-continuous perfusion system for 5 weeks. Structural and functional improve- 
ments were observed in constructs based on cells obtained from young but not adult horses. In a recent 
study, fetal human chondrocytes cultured at high density in serum-free medium formed 1.5 to 2.0 mm 
thick hyaline cartilage over 120 days whereas otherwise identical cultures containing serum could not be 
maintained for more than 30 days. 31 

Freed and Vunjak-Novakovic studied in vitro chondrogenesis using a variety of methods to seed and 
cultivate calf chondrocytes on polymer scaffolds as follows. Static seeding of scaffolds two or more mm 
thick resulted in bilaminar constructs with a fibrous upper region and a cartilaginous lower region. 32 In 
contrast, dynamic cell seeding in spinner flasks allowed relatively uniform cell seeding of scaffolds 2-5 mm 
thick at an essentially 100% yield. 33 An increase in the initial density of seeded cells resulted in comparable 
construct cellularities and collagen contents, but markedly higher GAG contents. 34 

Mixing during 3D culture markedly improved construct morphology and composition. 32,35 For exam- 
ple, 10 mm diameter x 5 mm thick scaffolds seeded and cultured in mixed flasks weighed twice as much 
as those seeded and cultured in mixed dishes, and contained about 2.5 times more of each GAG and 
total collagen. 35 However, constructs grown in mixed flasks formed outer capsules that were up to 300 p.m 
thick and contained high concentrations of elongated cells and collagen and little GAG; this finding was 
attributed to the effects of turbulent flow conditions on cells at the construct surface. 35 In contrast, 
constructs cultured in rotating vessels had relatively uniform distributions of cells and ECM and were 
up to 5 mm thick after 6 weeks in culture. 36 With increasing culture time, construct ECM biosynthesis 
rates and the fractional loss of newly synthesized macromolecules into the culture medium both 
decreased 36 but construct GAG fractions and mechanical properties improved. 37 The collagen networks 
of constructs and native cartilage were similar with respect to fibril density and diameter and fractions 
of collagen types II, IX and X. 38 

Vunjak-Novakovic et al. 39 studied the relationships between construct compositions (fractions of water 
and ECM components) and mechanical properties in static and dynamic compression (equilibrium 
modulus, dynamic stiffness, hydraulic permeability, streaming potential) using three different culture 
environments: static flasks, mixed flasks, and rotating vessels. Constructs cultured in static and mixed 
flasks had lower concentrations of ECM and worse mechanical properties as compared to constructs 
cultured in rotating vessels. The structure-function relationships detected for chondrocyte-PGA con- 
structs appeared consistent with those previously reported for chondrocytes cultured in agarose gels, 20 
native calf cartilage, 40 and adult human cartilage. 41 

Martin et al. 42 " 44 demonstrated that bone marrow stromal cells (BMSC) expanded in monolayers, 
seeded onto scaffolds and cultured in mixed Petri dishes formed large, 3D cartilaginous tissues. The 
presence of a 3D scaffold was required, as demonstrated by the small size and noncartilaginous nature 
of control cell pellet cultures. The presence of fibroblast growth factor 2 (FGF) during 2D expansion 
promoted differentiation of BMSC during 3D cultivation. As compared to avian (embryonic chick) 
BMSC, mammalian (calf) BMSC required a more structurally stable 3D scaffold and the presence of 
additional biochemical signals. 

124.3 Cell-Polymer-Bioreactor System 

The literature reviewed in Section 124.2 above demonstrates the feasibility of using cell-based tissue 
engineering approaches to in vivo articular cartilage repair and in vitro studies of chondrogenesis. In the 
next section we will first describe the in vitro cell-polymer-bioreactor system and then use selected 
examples from our own work to illustrate its use in studies of the development and modulation of 
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FIGURE 124.2 Cell seeding of polyglycolic acid (PGA) mesh, (a) PGA ultrastructure (scanning electron micro- 
graph), (b) chondrocytes attach to PGA fibers and maintain their spherical morphology (H&E stained histological 
section), (c) dynamic cell seeding: magnetic stirring generates convective motion of cells into porous scaffolds that 
are fixed to needles embedded in the stopper of the flask, (d) chondrocytes attached to PGA scaffolds with an effective 
yield of 100% over 24 h. 

construct structure and function. The unifying hypothesis of our work is that isolated chondrocytes or 
precursor cells can form functional cartilage tissue in vitro, if cultured on a 3D structural template in an 
environment that provides the necessary biochemical and physical signals. Ideally, engineered cartilage 
should display the key structural features of native cartilage, be able to withstand physiological loads, 
and be able to integrate with adjacent host tissues following in vivo implantation. 

Experimental Methods 

Cells: Cell types studied included articular chondrocytes and their precursors. Chondrocytes were 
obtained from the articular cartilage of 2 to 3 week old bovine calves 20 ' 24 and used immediately after 
isolation. Bone marrow stromal cells (BMSC) were obtained from the marrow of 2 to 3 week old bovine 
calves 43 or 16 day embryonic chicks. 42 

Scaffolds: Our best characterized scaffolds are made of polyglycolic acid (PGA) in form of 97% porous 
nonwoven meshes of 13 (J.m diameter fibers (Fig. 124.2a). 25 Other scaffolds that have been studied include 
polylactic acid (PLLA) 24 and polylactic-co-glycolic acid (PLGA) sponges. 45 

Cell seeding: Polymer scaffolds (5 to 10 mm in diameter, 2 to 5 mm thick) were fixed in place and 
seeded with cells using well-mixed spinner flasks 33 (Fig. 124.2c). 

Bioreactor cultivation: Tissue culture systems under investigation are schematically presented in 
Fig. f 24.3. Static and mixed dishes contained one construct per well in 6 to 7 mf medium; mixed dishes 
were placed on an orbital shaker at 75 rpm. 32 In spinner flasks, scaffolds were fixed to needles (8 to 12 per 
flask, in f 00 to 120 cm 3 medium), seeded with isolated cells under mixed conditions and cultured either 
statically or exposed to unidirectional turbulent flow using a non-suspended magnetic bar at 50-80 rpm. 35 
Rotating vessels included the slow turning lateral vessel (STLV) and the high aspect ratio vessel (HARV), 
each of which was rotated around its central axis such that the constructs (8 to 12 per vessel, in 100 to 
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FIGURE 124.3 Tissue culture bioreactors. (a) static and orbitally mixed Petri dishes, (b) static and magnetically 
stirred flasks, (c) rotating vessels: the slow turning lateral vessel (above) and the high aspect ratio vessel (below), 
(d) perfused chamber in which cultured constructs are monitored using magnetic resonance imaging (MRI, above); 
perfused rotating vessel (below). 

120 cm 3 medium) settled freely in a laminar flow field. 46,47 Perfused vessels included columns used to 
culture constructs and non-invasively monitor the progression of chondrogenesis using magnetic reso- 
nance imaging (MRI), 48 and a rotating perfused vessel developed by NASA for flight studies. 37 In all 
vessels, medium was exchanged batchwise (at a rate of ~3 cm 3 per construct and day), and gas was 
exchanged continuously, by surface aeration or by diffusion through a silicone membrane. 

Analytical techniques: Construct size and distributions of cells and tissue components were assessed 
by image analysis. 33,36 Constructs for histological assessment were fixed in neutral buffered formalin, 
embedded in paraffin, sectioned (5-8 p.m thick), and stained with hematoxylin and eosin (H&E for cells), 
safranin-0 (for GAG), and monoclonal antibodies (for collagen types I, II, IX, and X). 36,38,42 Ultrastruc- 
tural analyses included scanning and transmission electron microscopy (SEM, TEM). 38 Biochemical 
compositions were measured following papain or protease-K digestion of lyophilized constructs. 24,25,36 
Cell number was assessed by measuring the amount of DNA using Hoechst 33258 dye. 49 Sulfated GAG 
content was determined by dimethylmethylene blue dye binding. 50 Total collagen content was determined 
from hydroxyproline content after acid hydrolysis and reaction with p-dimethylaminobenzaldehyde and 
chloramine-T. 51 Type II collagen content was determined by inhibition ELISA. 36,38,52 The presence of other 
collagen types (e.g., I, IX, X) was demonstrated and semiquantitatively measured using SDS-PAGE and 
Western blots. 38 GAG distribution was determined by MRI. 48,53 Synthesis rates of GAG, total protein and 
collagen were measured by incorporation and release of radiolabeled tracers. 36 Cell metabolism was 
assessed based on the ratio of lactate production and glucose consumption, and ammonia production 
rates. 54 Mechanical construct properties (e.g., compressive modulus, dynamic stiffness, hydraulic perme- 
ability, streaming potential) were measured in static and dynamic radially confined compression. 39,55 

Developmental Studies 

As described above (Section 124.2, Table 124.1), articular cartilage repair has been enhanced following 
the transplantation of chondrocytes and osteochondral progenitor cells in conjunction with polymer 
scaffolds. However, identification of specific factors that influence in vivo regeneration and/or repair can 
be difficult due to host systemic responses (e.g., neuroendocrinological, immunological). Controlled 
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FIGURE 124.4 Temporo-spatial pattern of ECM deposition. Calf chondrocytes cultured on PGA scaffolds in rotat- 
ing vessels. Chondrogenesis was initiated peripherally in 12 day constructs (left panels) and proceeded appositionally 
resulting in a continous cartilaginous ECM in 40 day constructs (right panels). Histological cross-sections were stained 
with safranin-O for GAG (top panels) or with a monoclonal antibody to collagen type II (bottom panels). 

in vitro studies carried out in the cell-polymer-bioreactor system can thus provide useful, complementary 
information regarding the process of cartilage matrix regeneration starting from isolated cells (herein 
referred to as developmental studies). 

Chondrogenesis in vivo and in vitro is thought to be initiated by precursor cell aggregation. 56 Carti- 
laginous ECM is deposited, with concurrent increases in the amounts of GAG and collagen types II and 
IX, both in mesenchymal cells obtained from embryonic chick limbs 57 and in human limbs at various 
developmental stages. 58 ECM deposition starts at the center of the developing limb bud and spreads 
peripherally, 57 in conjunction with cellular conditioning of their microenvironment. 59 

In calf chondrocyte-PGA constructs, amounts of GAG and collagen type II also increased 
concomittantly 36 (Fig. 124.4). By culture day 12, cartilaginous tissue had formed at the construct periph- 
ery; by day 40, constructs were continuously cartilaginous matrix over their entire cross sections (6.7 mm 
diameter x 5 mm thick). Chondrogenesis was initiated peripherally and progressed both inward towards 
the construct center and outward from its surface; these findings could be correlated with construct cell 
distributions, as described below. The temporo-spatial patterns of GAG distribution could also be mon- 
itored in living constructs using MRI. 48 Constructs cultured 6 weeks contained an interconnected network 
of collagen fibers that resembled that of calf articular collagen with respect to overall organization and 
fiber diameter (Fig. 124. 5A). Type II collagen represented more than 90% of the total collagen, as quan- 
titated by an ELISA 36,38 and demonstrated qualitatively in Fig. 124.5B. Construct collagen type II was not 
susceptible to extraction with a chaotropic agent (guanidine hydrochloride, GuHCl), suggesting some 
degree of cross-linkage. However, construct collagen had only 30% as many pyridinium crosslinks as 
native calf cartilage. 38 

Construct wet weight fractions of cartilaginous ECM increased, the amount of polymer decreased, and 
construct cellularity plateaued over 6 weeks of culture in rotating vessels 36 (Fig. 124.6). Over the same time 
interval, construct ECM synthesis rates decreased by approximately 40% and the fraction of newly syn- 
thesized macromolecules released into the culture medium decreased from about 25% of total at 4 days 
to less than 4%; the latter can be attributed in part to collagen network development. 36 The mass of the 
PGA scaffold decreased to about 40% of initial over 6 weeks. 25 The time constant for scaffold degradation 
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FIGURE 124.5 Collagen structure and type. Calf chondrocytes cultured on PGA scaffolds for 6 weeks in rotating 
vessels, (a) Scanning electron micrographs showed similar collagen network ultrastructures for constructs and native 
calf articular cartilage, (b) The presence of type II collagen in constructs was demonstrated using pepsin-digested 
constructs in conjunction with SDS-PAGE, and confirmed by Western blot (with a monoclonal antibody to all). 
Lanes on SDS-PAGE: (1) molecular weight standards, (2) collagen type I, (3) GuHCl extract of construct, (4) pepsin 
extract of construct, (5) collagen type II; Lanes on Western blot: (1) collagen type II, (2) GuHCl extract of construct, 
(3) pepsin extract of construct. 

was of the same order of magnitude as the time constant of ECM deposition, a situation associated with 
enhanced tissue regeneration according to the hypothesis of isomorphous tissue replacement. 60 

Modulation of Cartilaginous Structure and Function 

The structure and function of articular cartilage are determined, at least in part, by environmental 
factors. 61 It is likely that the same factors that affect in vivo cartilage development, maintenance and 
remodeling also affect in vitro chondrogenesis. In the selected examples described below, we will describe 
the effects of: (1) cell, scaffold, and biochemical factors and (2) cultivation conditions and time on the 
structure and function of engineered cartilage. In principle, in vitro cultivation of a cell-polymer construct 
prior to in vivo implantation can help localize cell delivery and promote device fixation and survival, 
while maintaining an ability for integration at the graft-host interface. In contrast, cells transplanted in 
the absence of a carrier vehicle tended to leak away from the defect site, 9 while cells implanted immediately 
after loading in/on biomaterials were more vulnerable to mechanical forces and metabolic changes 
experienced during fixation or following implantation. 12 

Cells, the Scaffold and Biochemical Signals 

Chondrogenesis depends on the cells themselves (e.g., type, in vitro expansion, density, spatial arrange- 
ment), the scaffold (e.g., material, structure, degradation rate), and the presence of biochemical factors 
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FIGURE 124.6 Tissue regeneration kinetics. Calf chondrocytes cultured on PGA scaffolds in rotating vessels. ECM 
components (GAG and collagen type II) accumulated, cellularity remained constant, and polymer mass decreased 
with increasing culture time. Data represent the average ± SD of 3 independent measurements. 

(e.g., growth factors, hormones). The choice of cell type, which includes differentiated chondrocytes 
isolated from cartilage (articular or rib) and osteochondral progenitor cells isolated from bone marrow 
(herein referred to as bone marrow stromal cells, BMSC), can affect in vitro culture requirements (e.g., 
medium supplements) and in vivo construct function (e.g., integration potential). Polymer scaffolds vary 
with respect to surface properties (e.g., chemistry and wettability, which affect cell spreading and prolif- 
eration), geometry (e.g., dimensions, porosity, and pore size, which affect the spatial cell arrangement 
and the transmission of biochemical and mechanical signals), and physical properties (e.g., mechanical 
integrity and degradation rate, which determine whether the polymer can provide a structurally stable 
template for tissue regeneration). 

Watt 61 suggested that a cell cultured in vitro will tend to retain its differentiated phenotype under 
conditions that resemble its natural in vivo environment. In the case of chondrocytes, phenotypic stability 
is enhanced by cultivation in alginate or agarose. 22,23,63 Moreover, chondrocytes dedifferentiated by serial 
passage in monolayers redifferentiated (i.e., reacquired a spherical shape, ceased dividing, and resumed 
the synthesis of GAG and collagen type II) when transferred into 3D cultures. 64,65 When cultured on 3D 
fibrous PGA scaffolds, chondrocytes retained their spherical shape and formed cartilaginous tissue. 25,66 

Chondrocytes cultured at high cell densities tended to express their differentiated phenotype. 63,67 It 
was hypothesized that the effective cross-talk between cells depends on the presence of homotypically 
differentiated cells in the immediate cell environment. 67 The term "community effect" was later coined 
by Gurdon 69 who suggested that the ability of a cell to respond to phenotypic induction is enhanced by, 
or even dependent on, other neighboring cells differentiating in the same way at the same time. A 
postulated underlying mechanism involves changes in gene transcription and translation caused by cell- 
cell/ECM interactions. 62,70,71 

In cell-polymer constructs, the density of cells initially seeded at the construct periphery was sufficient 
to initiate chondrogenesis in that region. In particular, corresponding tempero-spatial patterns of carti- 
laginous ECM deposition and cell distribution were observed (compare Figs. 124.4 and 124.7). The 
finding that cell densities were initially higher peripherally than centrally, which implies that cell seeding 
density and proliferation rate were relatively higher peripherally, can be attributed to enhanced rates of 
nutrient and gas transfer at the construct surface. Over the 6 week culture period, self-regulated cell 
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FIGURE 124.7 Temporo-spatial pattern of cell density. Calf chondrocytes cultured on PGA scaffolds in rotating 
vessels. Cell density was initially higher at the construct periphery (white bars) than at its center (shaded bars), while 
cell distribution in 40 day constructs was spatially uniform. Number of cells/mm 2 was assessed by image analysis of 
H&E stained histological sections (average ± SD of 24 independent measurements). 

proliferation and ECM deposition resulted in constructs with physiological cellularities and spatially 
uniform distributions of components 36 (Figs. 124.4 and 124.7). Increases in the size, cellularity, and 
absolute amount of cartilaginous ECM were observed when chondrocyte-PGA constructs were cultured 
in medium supplemented with insulin-like growth factor I (IGF-I, 10-300 ng/mL) and serum (10%). 72 

In the case of osteochondral progenitor cells, Caplan et al. 10 suggested that principles of skeletal tissue 
engineering should be governed by the same motifs as embryonic development. BMSC differentiation 
is thought to be regulated by cell-to-cell contacts in an environment capable of activating the differen- 
tiation program. 73 In vitro (in BMSC aggregates), the induction of chondrogenesis depended on the 
presence of transforming growth factor-(3l (TGF (3) and dexamethasone. 74 In vivo (in rabbits), osteo- 
chondral repair recapitulated embryonic events and depended on the spatial arrangement and density 
of precursor cells and the presence of specific bioactive factors. 10 

In cell-polymer constructs, chondrocytic differentiation of BMSC depended both on scaffold-related 
factors and on exogenous biochemical signals. 4244 Avian BMSC cultured in the absence of polymer 
scaffolds formed small bilaminar tissues in which the lower region contained GAG and upper region 
appeared undifferentiated (Fig. 124. 8A). The same BMSC formed constructs consisting of a single tissue 
phase when cultured on PGA scaffolds in mixed Petri dishes (Fig. 124. 8B), while BMSC expansion in 
the presence of FGF prior to culture on PGA scaffolds resulted in the most cartilaginous ECM 
(Fig. 124. 8C). When FGF-expanded mammalian BMSC were cultured on nonwoven PGA mesh 
(Fig. 124. 2A), constructs first contracted and then collapsed (Fig. 124. 8D), while the same cells cultured 
on a scaffold consisting of a continuous polymer phase (polylactic-co-glycolic acid and polyethylene 
glycol (PLGA/PEG) sponge) 45 maintained their original dimensions (Fig. 124. 8E). When FGF-expanded 
mammalian BMSC were cultured on PLGA/PEG scaffolds in mixed dishes, chondrogenesis was observed 
in media supplemented with transforming growth factor beta 1 (TGF (3), insulin, and dexamethasone 
(Fig. 124. 8F) while osteogenesis was observed in media supplemented with betaglycerophosphate and 
dexamethasone (Fig. 124.8G). In the absence of these supplements, constructs consisted mainly of type I 
collagen and resembled loose connective tissue. 
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FIGURE 124.8 Chondrogenesis starting from bone marrow stromal cells (BMSC). (a-c) avian (chick embryo) 
BMSC cultured in media containing serum (10%) and ascorbic acid (50 Ug/mL): (a) without polymer scaffolds (as 
cell pellets); (b) on PGA scaffolds, or (c) after expansion in the presence of FGF (1 ng/mL). (d-e) FGF-expanded 
mammalian (bovine calf) BMSC cultured on (d) PGA scaffolds (nonwoven meshes) or (e) PLGA/PEG scaffolds 
(sponges consisting of a continuous polymer phase), (f-g) FGF-expanded mammalian BMSC were cultured on 
PLGA/PEG sponges in media containing serum (10%), ascorbic acid (50 (Xg/mL) and either: (f) TGFfi. (10 ng/mL), 
insulin (5 Ug/mL), dexamethasone (100 mM), or (g) bGP (7 mM) and dexamethasone (10 mM). 

Cultivation Conditions and Time 

Tissue culture bioreactors permit the in vitro cultivation of larger, better organized engineered cartilage 
than can be grown in static Petri dishes. 47 Flow and mixing within bioreactors are expected to affect 
tissue formation in at least two ways: by enhancing mass transfer (e.g., of gases and nutrients) and by 
direct physical stimulation of the cells (e.g., by hydrodynamic forces). 

The transport of chemical species lies at the heart of physiology and to a large extent determines tissue 
structure. 75,76 Cells communicate with each other by a combination of diffusion and convective flow, 
which are in turn driven by hydrodynamic, concentration and osmotic gradients. In vivo, mass transfer 
to chondrocytes involves diffusion and convective transport by the fluid flow that accompanies tissue 
loading 77 ; the presence of blood vessels in immature cartilage can further enhance mass transfer. In vitro, 
mixing-induced convection can enhance mass transport at construct surfaces. In contrast, mass transfer 
within constructs, which occurs by diffusion only, can become the limiting factor in the cultivation of a 
large construct with a dense ECM. 78 As compared to constructs grown statically, constructs grown in 
orbitally mixed Petri dishes and in mixed spinner flasks were larger and contained higher amounts of 
tissue components. 32,35 Cell metabolism in constructs cultured in mixed and static flasks were respectively 
found to be aerobic and anaerobic, as assessed by lactate to glucose ratios and ammonia production rates. 54 

The form of a skeletal tissue represents a diagram of underlying forces transmitted across the ECM to 
the individual cells. 61 Mechanotransduction is thought to involve four steps: mechanocoupling, biochem- 
ical coupling, signal transmission, and effector cell response. 79 In vivo, load-bearing and immobilized 
articular cartilage respectively contained high and low GAG fractions. 80,81 In vitro, physiological levels of 
dynamic compression increased the rates of GAG and protein synthesis in cartilage explants, 82,83 while 
static loading supressed GAG synthesis. 82,84 Physiological levels of dynamic compression also increased 
the GAG content and improved the mechanical properties of engineered cartilage. 29,30 The motion of 



e 2000 by CRC Press LLC 



■ lit! II Ml 




Static flask 



Mixed flask 



Rotating 
vessel 



Rovine 

cart il ape 



FIGURE 124.9 Culture conditions affect construct morphology. Representative cross sections of 6 week constructs 
(calf chondrocytes/PGA) cultured in static flasks, mixed flasks, and rotating vessels are compared to fresh calf cartilage 
(low and high power H&E stained histological sections). 

medium in roller bottles stimulated chondrocytes to form cartilaginous nodules, 85 ' 86 and fluid shear 
enhanced PG size and synthesis rate in chondrocyte monolayers. 87 Turbulent mixing in spinner flasks 
induced the formation of a fibrous outer capsule at the construct surface, the thickness of which increased 
with both the mixing intensity and the duration of tissue cultivation. 35 This finding was attributed to 
direct effects of mechanical forces, i.e., cells exposed to external forces tend to flatten and activate stress- 
protection mechanisms in order to remain firmly attached to their substrate 88 and increase their stiffnesses 
by cytoskeletal rearrangements. 89 

The effects of culture conditions on the morphology, composition, and mechanical properties of 
chondrocyte-PGA constructs were studied over 6 weeks using three different bioreactors: static flasks, 
mixed flasks, and rotating vessels. 39 In static cultures, GAG accumulated mostly at the periphery, pre- 
sumably due to diffusional constraints of mass transfer (Fig. 124.9a). In mixed flasks, turbulent shear 
caused the formation of a thick outer capsule with little or no GAG (Fig. 124.9b). Only in rotating vessels, 
were GAG fractions high and spatially uniform (Fig. 124.9c). Construct fractions of GAG (Fig. 124.10a) 
and total collagen (Fig. 124.10b) increased in the following order: static flasks, mixed flasks, rotating 
vessels, native cartilage. 39 Construct equilibrium moduli and hydraulic permeabilities (Figs. 124.10c,d) 
varied in a manner consistent with sample composition. 39 As compared to native calf cartilage, 6-week 
constructs cultured in rotating bioreactors had similar cellularities, 75% as much GAG and 40% as much 
total collagen per unit wet weight, but only 20% the compressive modulus and 5-fold higher hydraulic 
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FIGURE 124.10 Culture conditions affect construct compositions and mechanical properties. Constructs (calf 
chondrocytes/PGA) cultured for 6 weeks bioreactors were compared with respect to: (a,b) wet weight fractions of 
GAG and total collagen, (c) equilibrium modulus, and (d) hydraulic permeability. As compared to static and mixed 
flasks, rotating vessels yielded constructs with the best properties, but these remained inferior to native calf cartilage. 
Data represent the average ± SD of 3-6 independent measurements. 

permeability. The apparent lack of functional organization of ECM in constructs may be explained either 
by the use of immature cartilage (2 to 4 week old calves), or by the absence of specific factors in the 
in vitro culture environment which are normally present in vivo. 

It is possible that the mechanisms by which the dynamic fluctuations in shear and pressure in rotating 
vessels enhanced chondrogenesis resembled those associated with dynamic loading in vivo. However, the 
acting hydrodynamic forces were different in nature and several orders of magnitude lower than those 
resulting from joint loading. 46 ' 90 ' 91 Studies of engineered cartilage subjected to physiological levels 
of dynamic compression 29 ' 30 and shear should thus allow a more direct comparison of in vitro and in vivo 
tissue responses. 

Chondrocytes can be phenotypically stable for prolongued periods of time in 3D cultures (e.g., for 
8 months in alginate beads). 22 The effect of prolongued cultivation (7 months in rotating bioreactors) 
on the structure and function of chondrocyte-PGA constructs is shown in Fig. 124.11. As compared to 
native calf articular cartilage, 7 month constructs had comparable GAG fractions (Fig. 124.11a), 30% as 
much total collagen (Fig. 124.11b), comparable equilibrium moduli (Fig. 124.11c), and comparable 
hydraulic permeability (Fig. 124. lid). At 7 months, constructs were phenotypically stable (75% of the 
total construct collagen was type II) and consisted of metabolically active cells (component cells attached 
and spread in Petri dishes and were enzymatically active). 37 

A successful approach to cartilage tissue engineering must also provide the potential for constructs to 
integrate with the adjacent cartilage and subchondral bone. Most of the in vivo studies described in 
Section 2 addressed this issue. In general, the implantation of chondrocytes and BMSC without a 
cartilaginous ECM resulted in repair tissue that deteriorated with time in vivo, 9 - 11 while implantation of 
cartilaginous constructs integrated relatively poorly with adjacent host tissues. 10,12 In vitro systems can 
be used to study the effects of specific factors on construct integration in the absence of uncontrollable 
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FIGURE124.il Culture time affects construct structure and function. Constructs (calf chondrocytes/PGA) cul- 
tured for 3 days (3 d), 6 weeks (6 w), 3 or 7 months (3 mo, 7 mo) in rotating vessels were compared with respect to: 
(a,b) wet weight fractions of GAG and total collagen, (c) equilibrium modulus, and (d) hydraulic permeability. All 
properties improved with culture time and approached values measured for native calf cartilage (normal ranges 
denoted by dotted lines). Data represent the average ± SD of 3 independent measurements. 

variables intrinsic to in vivo studies. In one such study, constructs were cultured for various times, sutured 
into ring-shaped explants of native cartilage, and cultured for an additional period of time as composites. 92 
The integration process involved cell migration into the construct-explant interface and the formation 
of a new tissue which was initially fibrous but became progressively cartilaginous with increasing culture 
time (Fig. 124.12a). Construct equilibrium modulus, which was negligible at the beginning of cultivation, 
increased to approximately 15% of native calf cartilage after 6 weeks (Fig. 124.11c). The adhesive strength 
at the construct-explant interface was approximately 65% higher for composites made with 6 day con- 
structs, which consisted mainly of cells, as compared to composites made with 5 week constructs, which 
had a well-formed ECM (Fig. 124.12b). 

124.4 Summary and Future Directions 

Tissue engineering offers a cell-based approach to articular cartilage repair. In this chapter, we reviewed 
the state of the art of cartilage tissue engineering and focused on a cell-polymer-bioreactor system which 
can be used for controlled studies of chondrogenesis and the modulation of engineered cartilage by 
environmental factors. 

A procedure in which autologous chondrocytes are obtained from an articular cartilage biopsy, 
expanded in culture, and transplanted in conjunction with a periosteal flap 3 is currently the only FDA- 
approved cell-based treatment for articular cartilage repair. However, in vivo studies in dogs had variable 
results and showed long-term degradation of the repair tissue. 8 ' 9 The clinical study has been viewed with 
some skepticism 5 - 6 and long-term, prospective randomized clinical studies are needed to better evaluate 
the potential of this technique. 

Alternatively, autologous BMSC have been isolated from bone marrow aspirates, expanded, and implan- 
tated in conjunction with various gels to repair osteochondral defects in experimental animals. 1014 Fol- 
lowing implanatation, the cells underwent a site-specific differentiation and formed a cartilaginous zone 
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FIGURE 124.12 Culture time affects construct integration potential, (a) Composites made from 6 day constructs 
(calf chondrocytes/PGA) and cultured for an additional 2 or 4 weeks (upper or lower panel, respectively; safranin-O 
stained histological sections), (b) Adhesive strengths of composites cultured 4 weeks, including: construct- cartilage 
composites made from 6 day or 5 week constructs and cartilage-cartilage composites (adhesion was estimated by 
fixing the outer ring in place and uniaxially loading the newly integrated central portion to failure as shown in the 
inset). 

at the surface of the defect and a vascularized bony tissue at its base. In contrast to grafts based on mature 
chondrocytes, which failed to develop subchondral bone and to fully integrate with the host tissue, grafts 
based on precursor cells suffered from the progressive thinning of the cartilaginous surface zone. 

A 3D scaffold permits the in vitro cultivation of cell-polymer constructs that can be readily manipu- 
lated, shaped, and fixed to the defect site. 18 As compared to such a pre-formed cartilaginous construct, 
cells injected under a periosteal flap or immobilized in a hydrated gel are more vulnerable to environ- 
mental factors and mechanical forces. 12 The selection of an appropriate scaffold depends in part on the 
starting cell type. For example, nonwoven fibrous PGA meshes supported chondrogenesis starting from 
chondrocytes 25 and avian BMSC, 42 while a scaffold with more structural stability appeared to be required 
for chondrogenesis starting from mammalian BMSC (Fig. 124.8). 

Tissue constructs resembling native cartilage were engineered in vitro using isolated cells, 3D polymer 
scaffolds, and bioreactors (Figs. 124.4 through 124.6). Construct structure (histological, biochemical) 
and function (mechanical, metabolic) depended on cultivation conditions and duration. 39 The cell- 
polymer-bioreactor system can thus provide a basis for controlled in vitro studies of the effects of time 
and specific biochemical and physical factors on chondrogenesis (Figs. 124.7 through 124.11). Moreover, 
in vitro studies can be used to assess the potential of a cell-polymer construct to integrate with adjacent 
tissues (Fig. 124.12). Some of the current research needs include: 

1. Development of design criteria. Specific construct design criteria (e.g., required cell metabolic 
activity, ECM composition, and mechanical properties) need to be established based on the results 
of further in vitro and in vivo experimentation and phenomenological modeling of cell and tissue 
responses to environmental signals. 

2. Selection of a cell source. Autologous BMSC and mature chondrocytes are the most likely immediate 
candidates; allogeneic fetal chondrocytes represent an option for the future. Both BMSC and 
chondrocytes maintain their chondrogenic potential when expanded in the presence of FGF and 
then cultured on 3D scaffolds. 42,43 BMSC-based grafts recapitulate embryonic events of endoch- 
ondral bone formation in response to local environmental factors 10 resulting in repair tissue that 
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is relatively well integrated but can undergo ossification, leading to progressive thinning at the 
articular surface. 11 On the other hand, chondrocyte-based grafts do not integrate as well 12 but 
consist of phenotypically stable cartilage with a mechanically functional ECM 37 that may promote 
survival in the presence of mechanical loading. Human fetal articular chondrocytes were recently 
demonstrated to regenerate hyaline cartilage when cultured at high densities in serum-free 
medium. 31 However, this approach has unresolved ethical issues. 

3. Selection of an appropriate scaffold for human chondrocytes and BMSC. Ideally, a tissue engineering 
scaffold should meet all of the following criteria: ( 1 ) reproducible processing into complex, 3D 
shapes, (2) highly porous structure that permits a spatially uniform cell distribution during cell 
seeding and minimizes diffusional constraints during in vitro cultivation, (3) controlled degrada- 
tion at a rate matching that of cellular deposition of ECM (to provide a stable template during 
in vitro chondrogenesis) followed by complete elimination of the foreign material (to maximize 
long-term in vivo biocompatibility). 

4. Development of methods for in vitro seeding and cultivation of human cells. Ideally, a tissue culture 
bioreactor should provide: (1) a means to achieve efficient, spatially uniform cell distribution 
throughout a scaffold (e.g., dynamic seeding), (2) uniform concentrations of biochemical species 
in the bulk phase and their efficient transport to the construct surface (e.g., well-mixed culture 
conditions), (3) steady state conditions (e.g., automated control based on biosensors triggering 
appropriate changes in medium and gas supply rates), to more closely mimic the in vivo cellular 
environment, and (4) applied physical forces (e.g., hydrostatic pressure and shear, dynamic com- 
pression), to more closely mimic the in vivo tissue environment. 

5. Development of methods to promote graft survival, integration, and maturation. Articular cartilage 
repair refers to healing that restores a damaged articular surface without actually replicating the 
complete structure and function of the tissue while articular cartilage regeneration refers to the 
formation of a new tissue that is indistinguishable from normal articular cartilage, including the 
zonal organization, composition, and mechanical properties. 2 Ideally, engineered cartilage should 
meet the criteria for regeneration, since any other repair tissue represents a mechanical disconti- 
nuity likely to cause long-term device failure. 10 Local pre-treatment of the host tissues at the site 
of the defect (e.g., with proteolytic enzymes) may enhance graft integration. 10 Further in vivo 
studies are needed to evaluate whether implanted constructs develop characteristic architectural 
features of articular cartilage in conjunction with physiological loading. 

This chapter describes technologies that can potentially lead to articular cartilage regeneration in vitro 
and in vivo. Representative studies are summarized in which cells (autologous or allogeneic chondrocytes 
or BMSC) were isolated, cultivated, and in some cases used to repair large full-thickness cartilage defects. 
Extension of these results to a human cell source and scale is expected to have a major clinical impact. 
At this time, tissue engineering studies are mainly observational. The increasing use of models to describe 
specific aspects of tissue formation (e.g., patterns of ECM deposition, structure-function correlations) 
is expected to help in the design of hypothesis-driven experiments and interpretation of their results. 
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Defining Terms 

Bioreactors: Tissue culture vessels mixed by magnetic stirring or rotation. 

Bone marrow stromal cell (BMSC): A bone marrow-derived precursor cell with the potential to 

differentiate into various tissues including cartilage. 
Chondrocyte: A cartilage cell. 
Chondrogenesis: The process of cartilage formation. 
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Extracellular matrix (ECM): The biochemical components present in the extracellular space of a tissue, 
e.g., collagen type II and glycosoaminoglycan (GAG) in articular cartilage. 

Polymer scaffold: A synthetic material designed for cell cultivation, characterized by its specific chem- 
ical composition and 3D structure. 

Tissue construct: The tissue engineered in vitro using isolated cells, polymer scaffolds, and bioreactors. 
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Tissue engineering is one of he most intriguing and exciting areas in biotechnology due to its requirements 
for state-of-the-art techniques from both biologic and engineering disciplines. This field is on the 
threshold of the development of an array of products and devices comprised of cell components, biologic 
compounds, and synthetic materials to replace physiologic function of diseased tissues and organs. 

Successful tissue and organ constructs depend on a thorough understanding and creative application 
of molecular, cellular, and organ biology and the principles of chemical, mechanical, and material 
engineering to produce appropriate structure-function relationships to restore, maintain, or improve 
tissue or organ function. This approach depends on the most advanced scientific methodologies, includ- 
ing stem cell culture, gene transfer, growth factors, and biomaterial technologies. 

The kidney was the first solid organ whose function was approximated by a machine and a synthetic 
device. In fact, renal substitution therapy with hemodialysis or chronic ambulatory peritoneal dialysis 
(CAPD) has been the only successful long-term ex vivo organ substitution therapy to date [ 1 ] . The kidney 
was also the first organ to be successfully transplanted from a donor individual to an autologous recipient 
patient. However, the lack of widespread availability of suitable transplantable organs has kept kidney 
transplantation from becoming a practical solution to most cases of chronic renal failure. 

Although long-term chronic renal replacement therapy with either hemodialysis or CAPD has dra- 
matically changed the prognosis of renal failure, it is not complete replacement therapy, since it only 
provides filtration function (usually on an intermittent basis) and does not replace the homeostatic, 
regulatory, metabolic, and endocrine functions of the kidney. Because of the nonphysiologic manner in 
which dialysis performs or does not perform the most critical renal functions, patients with ESRD on 
dialysis continue to have major medical, social, and economic problems [2]. Accordingly, dialysis should 
be considered as renal substitution rather than renal replacement therapy. 

Tissue engineering of a biohybrid kidney comprised of both biologic and synthetic components will 
most likely have substantial benefits for the patient by increasing life expectancy, increasing mobility and 
flexibility, increasing quality of life with large savings in time, less risk of infection, and reduced costs. 
This approach could also be considered a cure rather than a treatment for patients. 
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125.1 Fundamentals of Kidney Function 

The kidneys are solid organs located behind the peritoneum in the posterior abdomen and are critical 
to body homeostasis because of their excretory, regulatory, metabolic, and endocrinologic functions. The 
excretory function is initiated by filtration of blood at the glomerulus, which is an enlargement of the 
proximal end of the tubule incorporating a vascular tuft. The structure of the glomerulus is designed to 
provide efficient ultrafiltration of blood to remove toxic waste from the circulation yet retain important 
circulating components, such as albumin. The regulatory function of the kidney, especially with regard 
to fluid and electrolyte homeostasis, is provided by the tubular segments attached to the glomerulus. 

The ultrafiltrate emanating from the glomerulus courses along the kidney tubule, which resorbs fluid 
and solutes to finely regulate their excretion in various amounts in the final urine. The kidney tubules 
are segmented, with each segment possessing differing transport characteristics for processing the glom- 
erular ultrafiltrate efficiently and effectively to regulate urine formation. The segments of the tubule begin 
with the proximal convoluted and straight tubules, where most salt and water are resorbed. This segment 
leads into the thin and thick segments of Henle's loop, which are critical to the countercurrent system 
for urinary concentration and dilution of water. The distal tubule is next and is important for potassium 
excretion. The final segment is the collecting duct, which provides the final regulation of sodium, 
hydrogen, and water excretion. The functional unit of the kidney is therefore composed of the filtering 
unit (the glomerulus) and the regulatory unit (the tubule). Together they form the basic component of 
the kidney, the nephron. In addition to these excretory and regulatory functions, the kidney is an 
important endocrine organ. Erythropoietin, active forms of vitamin D, renin, angiotensin, prostaglan- 
dins, leukotrienes, and kallikrein-kinins are some of the endocrinologic compounds produced by the 
kidney. 

In order to achieve its homeostatic function for salt and water balance in the individual, the kidney 
has a complex architectural pattern (Fig. 125.1). This complex organization is most evident in the 
exquisitely regulated structure-function interrelationships between the renal tubules and vascular struc- 
tures to coordinate the countercurrent multiplication and exchange processes to control water balance 
with urinary concentration and dilution [3]. This complex structure is the result of a long evolutionary 
process in which animals adapted to many changing environmental conditions. Although well suited for 
maintaining volume homeostasis, the mammalian kidney is an inefficient organ for solute and water 
excretion. In humans, approximately 180 liters of fluid are filtered into the tubules daily, but approxi- 
mately 178 liters must be reabsorbed into the systemic circulation. Because of the importance of main- 
taining volume homeostasis in an individual to prevent volume depletion, shock, and death, multiple 
redundant physiologic systems exist within the body to maintain volume homeostasis. No such redun- 
dancy, however, exists to replace renal excretory function of soluble metabolic toxic by products of 
metabolic activity. Accordingly, chronic renal disease becomes a clinical disorder due to loss of renal 
excretory function and buildup within the body of metabolic toxins which require elimination by the 
kidneys. Because of the efficiency inherent in the kidneys as an excretory organ, life can be sustained 
with only 5-10% of normal renal excretory function. With the recognition that the complexity of renal 
architectural organization is driven by homeostatic rather than excretory function and that renal excretory 
function is the key physiologic process which must be maintained or replaced to treat clinical renal failure, 
the approach to a tissue engineering construct becomes easier to entertain, especially since only a fraction 
of normal renal excretory function is required to maintain life. 

For elimination of solutes and water from the body, the kidney utilizes simple fundamental physical 
principles which govern fluid movement (Fig. 125.2). The kidney's goal in excretory function is to transfer 
solutes and water from the systemic circulation into tubule conduits in order to eliminate toxic byproducts 
from the body in a volume of only several liters. Solute and fluid removal from the systemic circulation 
is the major task of the renal filtering apparatus, the glomerulus. The force responsible for this filtration 
process is the hydraulic pressure generated within the system circulation due to myocardial contraction 
and blood vessel contractile tone. Most of the filtered fluid and solutes must be selectively reabsorbed 
by the renal tubule as the initial filtrate courses along the renal tubules. The reabsorptive process depends 
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upon osmotic forces generated by active solute 
transport by the renal epithelial cell and the colloid 
oncotic pressure within the peritubular capillary 
(Fig. 125.2). The approach of a tissue-engineering 
construct for renal replacement is to mimic these 
natural physical forces to duplicate filtration and 
reabsorption processes to attain adequate excretory 
function lost in renal disorders. 

Glomerular Ultrafiltration 
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The process of urine formation begins within the 
capillary bed of the glomerulus [4]. The glomerular 
capillary wall has evolved into a structure with the 
property to separate as much as one-third of the 
plasma entering the glomerulus into a solution of 
a nearly ideal ultrafitrate. The high rate of ultrafil- 
tration across the glomerular capillary is a result of 
hydraulic pressure generated by the pumping 
action of the heart and the vascular tone of the 
preglomerular and postglomerular vessels as well 
as the high hydraulic permeability of the glomeru- 
lar capillary walls. This hydraulic pressure as well 
as the hydraulic permeability of the glomerular 
capillary bed is at least two times and two orders 
of magnitude higher, respectively, than most other 
capillary networks within the body [5] . Despite this 
high rate of water and solute flux across the glom- 
erular capillary wall, this same structure retards the 
filtration of important circulating macromolecules, 
especially albumin, so that all but the lower-molec- 
ular-weight plasma proteins are restricted in their 
passage across the filtration barrier. 

A variety of experimental studies and model sys- 
tems have been employed to characterize the sieving properties of the glomerulus. Hydrodynamic models 
of solute transport through pores have been successfully used to describe the size selective barrier function 
of this capillary network to macromolecules [6]. This pore model, in its simplest form, assumes the 
capillary wall to contain cylindrical pores of identical size and that macromolecules are spherical particles. 
Based upon the steric hindrances that macromolecules encounter in the passage through small pores, 
whether by diffusion or by convection (bulk flow of fluid), definition of the glomerular capillary barrier 
can be a fluid-filled cylindrical pore (Fig. 125.3). This modeling characterizes the glomerular capillary 
barrier as a membrane with uniform pores of 50A radius [7] . This pore size predicts that molecules with 
radii smaller than 14 A appear in the filtrate in the same concentration as in plasma water. Since there 
is no restriction to filtration, fractional clearance of this size molecule is equal to one. The filtration of 
molecules of increasing size decreases progressively, so that the fractional clearance of macromolecules 
the size of serum albumin (36 A) is low. 

The glomerular barrier, however, does not restrict molecular transfer across the capillary wall only on 
the basis of size (Fig. 125.4). This realization is based upon the observation that the filtration of the 
circulating protein, albumin, is restricted to a much greater extent than would be predicted from size 
alone. The realization that albumin is a polyanion at physiologic pH suggests that molecular charge, in 
addition to molecular size, is another important determination of filtration of macromolecules [8]. The 



FIGURE 125.1 Representation of the complex mor- 
phologic architecture of a section of a mammalian kidney 
in which these components are schematized separately: 
( a) arterial and capillary blood vessels; ( b) venous drain- 
age; (c) two nephrons with their glomeruli and tubule 
segments (Source. From [3] with permission.) 
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FIGURE 125.2 Physical forces which govern fluid transfer within the kidney. 



greater restriction to the filtration of circulating polyanions, including albumin, is due to the electrostatic 
hindrance by fixed negatively charged components of the glomerular capillary barrier. These fixed 
negative charges, as might be expected, simultaneously enhance the filtration of circulating polycations. 
Thus, the formation of glomerular ultrafiltrate, the initial step in urine formation, depends upon the 
pressure and flows within the glomerular capillary bed and the intrinsic permselectivity of the glomerular 
capillary wall. The permselective barrier excludes circulating macromolecules from filtration based upon 
size as well as net molecular charge, so that for any given size, negatively charged macromolecules are 
restricted from filtration to a greater extent than neutral molecules. 
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FIGURE 125.3 Scanning electron micrograph of the glomerular capillary wall demonstrating the fenestrae (pores) 
of the endothelium within the glomerulus (mag x 50,400). Reprinted with permission from Schrier RW, Gottschalk 
CW. 1988. Diseases of the Kidney, p 12, Boston, Little, Brown. 
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FIGURE 125.4 Fractional clearances of negatively charged (sulfate) dextrans, neutral dextrans, and positively 
charged (DEAE) dextrans of varying molecular size. These data demonstrate that the glomerular capillary wall behaves 
as both a size -selective and charge-selective barrier [8]. 



Tubule Reabsorption 

Normal human kidneys form approximately 100 ml of filtrate every minute. Since daily urinary volume 
is roughly 2 L more than 98% of the glomerular ultrafiltrate must be reabsorbed by the renal tubule. 
The bulk of the reabsorption, 50-65%, occurs along the proximal tubule. Similar to glomerular filtration, 
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fluid movement across the renal proximal tubule cell is governed by physical forces. Unlike the fluid 
transfer across the glomerular capillary wall, however, tubular fluid flux is principally driven by osmotic 
and oncotic pressures rather than hydraulic pressure (Fig. 125.2). Renal proximal tubule fluid reabsorp- 
tion is based upon active Na + transport, requiring the energy-dependent Na + K + ATPase located along the 
basolateral membrane of the renal tubule cell to promote a small degree of luminal hypotonicity [9]. 
This small degree of osmotic difference (2-3 mOsm/kgH 2 0) across the renal tubule is sufficient to drive 
isotonic fluid reabsorption due to the very high diffusive water permeability of the renal tubule cell 
membrane. Once across the renal proximal tubule cell, the transported fluid is taken up by the peritubular 
capillary bed due to the favorable oncotic pressure gradient. This high oncotic pressure within the 
peritubular capillary is the result of the high rate of protein-free filtrate formed in the proximate 
glomerular capillary bed [10]. As can be appreciated, the kidney has evolved two separate capillary 
networks to control bulk fluid flow from various fluid compartments of the body. The glomerular 
capillary network has evolved an efficient structure to function as a highly efficient filter to allow water 
and small solutes such as urea and sodium, to cross the glomerular capillary wall while retaining necessary 
macromolecules, such as albumin. This fluid transfer is driven by high hydraulic pressures within the 
glomerular capillary network generated by the high blood flow rates to the kidney and a finely regulated 
vascular system. The permselectivity of the filter is governed by an effective pore size to discriminate 
macromolecular sieving based upon both size and net molecular charge. The high postglomerular 
vascular resistance and the protein-free glomerular filtrate results, respectively, in low hydraulic pressure 
and high oncotic pressure within the peritubular capillary system which follows directly in series from 
the glomerular capillary network. The balance of physical forces within the peritubular capillaries, 
therefore, greatly favors the uptake of fluid back into the systemic circulation. The addition of a renal 
epithelial monolayer with high rates of active Na + transport and high hydraulic permeability assists further 
in the high rate of salt and water reabsorption along the proximal tubule. Thus, an elegant system has 
evolved in the nephron to filter and reabsorb large amounts of fluid in bulk to attain high rates of 
metabolic product excretion while maintaining regulatory salt and water balance. 

Endocrine 

As an endocrine organ, the kidney has been well recognized as critical in the production of erythropoietin, 
a growth factor for red blood cell production, and vitamin D, a compound important in calcium 
metabolism, along with, but not limited to, prostaglandins, kinins, and renin. For the purposes of this 
chapter, this discussion will be limited to erythropoietin production as an example of a potential formu- 
lation of a tissue-engineering construct to replace this lost endocrine function in chronic end-stage renal 
disease. 

More than 40 years ago erythropoietin was shown to be the hormone that regulates erythropoiesis, 
or red blood cell production, in the bone marrow [11]. In adults, erythropoietin is produced primarily 
(greater than 90%) by specialize interstitial cells in the kidney [12]. Although the liver also synthesizes 
erythropoietin, the quantity is not adequate (less than 10%) to maintain adequate red cell production 
in the body [ 13] . The production of erythropoietin by the kidney is regulated by a classic endocrinologic 
feedback loop system. As blood flows through the kidney, the erythropoietin-producing cells are in an 
ideal location to sense oxygen delivery to tissues by red cells in the bloodstream, since they are located 
adjacent to peritubular capillaries in the renal interstitium. 

As demonstrated in Fig. 125.5, erythropoeitin production is inversely related to oxygen delivery to the 
renal interstitial cells. With hypoxemia or decline in red blood cell mass, a decline in oxygen delivery 
occurs to these specialized cells, and increased erythropoietin production develops. Upon return to 
normal oxygen delivery with normoxia and red blood cell mass, the enhanced production of erythro- 
poeitin is suppressed, closing the classic feedback loop. Of importance, the regulation of erythropoietin 
in the kidney cells depends upon transcriptional control, not upon secretory control, as seen with insulin 
[14]. The precise molecular mechanism of the oxygen sensor for tissue oxygen availability has not been 
delineated but appears to depend on a heme protein. 
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FIGURE 125.5 Endocrinologic back loop which regulates erythropoietin production by the kidney. 

Once erythropoietin is released, it circulates in the bloodstream to the bone marrow, where it signals 
the marrow to produce red blood cells. In this regard, all blood cells, both white and red cells, originate 
from a subset of bone marrow cells, called multipotent stem cells. These stem cells develop during 
embryonic development and are maintained through adulthood via self-regulation. Under appropriate 
stimulation, stem cells proliferate and produce committed, more highly differentiated progenitor cells 
which are then destined to a specific differentiated line of blood cells, including neutrophils, lymphocytes, 
platelets, and red cells. Specifically, erythropoietin binds to receptors on the outer membrane of com- 
mitted erythroid progenitor cells, stimulating the terminal steps in erythroid differentiation. Nearly 
200 years ago, it was first recognized that anemia is a complication of chronic renal failure. Ordinarily 
an exponential increase in serum erythropoietin levels are observed when hemoglobin level in patients 
declines below 10 gm/dl. In the clinical state of renal disease, however, the normal increase in erythro- 
poietin in response to anemia is impaired [14]. Although the oxygen delivery (or hemoglobin) to 
erythropoietin feedback loop is still intact, the response is dramatically diminished. In patients with end- 
stage renal disease (ESRD), the hematocrit levels are directly correlated to circulating erythropoietin 
levels. In fact, ERSD patients with bilateral nephrectomies have the lowest hematocrits and lowest rates 
of erythropoiesis, demonstrating that even the small amount of erythropoietin produced by the end- 
stage kidney is important. Thus, the loss of renal function due to chronic disease results in an endocrine 
deficiency of a hormone normally produced by the kidney and results in a clinical problem that com- 
plicates the loss of renal excretory function. 

125.2 Tissue-Engineering Formulation Based 
upon Fundamentals 

In designing an implantable bioartificial kidney for renal replacement function, essential functions of 
kidney tissue must be utilized to direct the design of the tissue-engineering project. The critical elements 
of renal function must be replaced, including the excretory, regulatory (reabsorptive), and endocrinologic 
functions. The functioning excretory unit of the kidney, as detailed previously, is composed of the filtering 
unit, the glomerulus, and the regulatory or reabsorptive unit, the tubule. Therefore, a bioartificial kidney 
requires two main units, the glomerulus and the tubule, to replace renal excretory function. 
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Bioartificial Golmerulus: The Filter 

The potential for a bioartificial glomerulus has been achieved with the use of polysulphone fibers ex vivo 
with maintenance of ultrafiltration in humans for several weeks with a single device [15, 16]. The 
availability of hollow fibres with high hydraulic permeability has been an important advancement in 
biomaterials for replacement function of glomerular ultrafiltration. Conventional hemodialysis for ESRD 
has used membranes in which solute removal is driven by a concentration gradient of the solute across 
the membranes and is, therefore, predominantly a diffusive process. Another type of solute transfer also 
occurs across the dialysis membrane via a process of ultrafiltration of water and solutes across the 
membrane. This convective transport is independent of the concentration gradient and depends pre- 
dominantly on the hydraulic pressure gradient across the membrane. Both diffusive and convective 
processes occur during traditional hemodialysis, but diffusion is the main route of solute movement. 

The development of synthetic membranes with high hydraulic permeability and solute retention 
properties in convenient hollow fiber form has promoted ERSD therapy based upon convective homo- 
filtration rather than diffusive hemodialysis [17, 18]. Removal of uremic toxins, predominantly by the 
convective process, has several distinct advantages, because it imitates the glomerular process of toxin 
removal with increased clearance of higher-molecular-weight solutes and removal of all solutes (up to a 
molecular weight cutoff) at the same rate. The comparison of the differences between diffusive and 
convective transport across a semipermeable membrane is detailed in Fig. 125.6. This figure demonstrates 
the relationship between molecular size and clearance by diffusion and convection. As seen, the clearance 
of a molecule by diffusion is negatively correlated with the size of the molecule. In contrast, clearance 
of a substance by convection is independent of size up to a certain molecular weight. The bulk movement 
of water carries passable solutes along with it in approximately the same concentration as in the fluid. 
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FIGURE 125.6 Relationship between solute molecular size and clearance by diffusion (open circles) and convection 
(closed circles). Left curve shows solute clearance for a 1.0 m 2 dialyzer with no ultrafiltration where solute clearance 
is diffusion. Right curve shows clearance for a 1.6 m 2 ultrafilter where solute clearance is by convection. Smaller 
molecules are better cleared by diffusion; larger molecules by convection. Normal kidneys clear solutes in a pattern 
similar to convective transport. (Figure adapted from [18].) 
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FIGURE 125.7 Conceptual schematization of bioartificial glomerulus. 



Development of an implantable device which mimics glomerular filtration will thus depend upon 
convective transport. This physiologic function has been achieved clinically with the use of polymeric 
hollow fibers ex vivo. Major limitations to the currently available technology for long-term replacement 
of filtration function include bleeding associated with required anticoagulation, diminution of filtration 
rate due to protein deposition in the membrane over time or thrombotic occlusion, and large amounts 
of fluid replacement required to replace the ultrafiltrate formed from the filtering unit. The use of 
endothelial-cell-seeded conduits along filtration surfaces may provide improved long-term hemocom- 
patibility and hemofiltaration in vivo [19, 20. 21], as schematized in Fig. 125.7. 

In this regard, endothelial cell seeding of small-caliber vascular prosthesis has been shown experimen- 
tally to reduce long-term platelet deposition, thrombus formation, and loss of graft patency [21]. Recent 
results in humans have demonstrated success in autologous endothelial cell seeding in small caliber grafts 
after growth of these cells along the graft lumen ex vivo to achieve a confluent monolayer prior to 
implantation. Long-term persistent endothelialization and patency of the implanted graft has been 
reported [19]. A potential rate-limiting step in endothelial-cell-lined hollow fibers of small caliber is 
thrombotic occlusion, which limits the functional patency of this filtration unit. In this regard, gene 
transfer into seeded endothelial cells for constitutive expression of anticoagulant factors can be envisioned 
to minimize clot formation in these small-caliber hollow fibers. Since gene transfer for in vivo protein 
production has been clearly achieved with endothelial cells [22, 23], gene transfer into endothelial cells 
for the production of an anticoagulant protein is clearly conceivable. 

For differentiated endothelial cell morphology and function, an important role for various components 
of the extracellular matrix (ECM) has been demonstrated [24, 25]. The ECM has been clearly shown to 
dictate phenotype and gene expression of endothelial cells, thereby modulating morphogenesis and 
growth. Various components of ECM, including collagen by type I, collagen type IV, laminin, and 
fibronectin, have been shown to affect endothelial cell adherence, growth, and differentiation. Of impor- 
tance, ECM produced by MDCK cells, a permanent renal epithelial cell line, has the ability to induce 
capillary endothelial cells to produce fenestrations [25, 26]. Endothelial cell fenestrations are large 
openings which act as channels or pores for convective transport though the endothelial monolayer and 
are important in the high hydraulic permeability and sieving characteristics of glomerular capillaries. 
Thus, the ECM component on which the endothelial cells attach and grow may be critical in the functional 
characteristics of the lining monolayer. 

Bioartificial Tubule: The Reabsorber 

As detailed above, the efficiency of reabsorption, even though dependent upon natural physical forces 
governing fluid movement across biologic as well as synthetic membranes, requires specialized epithelial 
cells to perform vectorial solute transport. Critical to the advancement of this tubule construct, as well 
as for the tissue-engineering field in general, is the need for the isolation and growth in vitro of specific 
cells, referred to as stem progenitor cells, from adult tissues. 
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Stem or Progenitor Cells 

These cells are those that possess stem-cell-like characteristics with a high capacity for self-renewal under 
denned conditions into specialized cells to develop correct structure and functional components of a 
physiologic organ system [27, 28, 29]. Stem cells have been extensively studied in three adult mammalian 
tissues: the hematopoietic system, the epidermis, and the intestinal epithelium. Recent work has also 
suggested that stem cells may also reside in the adult nervous system [30]. Little insight into possible 
renal tubule stem cells had been developed until recent data demonstrating methodology to isolate and 
grow renal proximal tubule stem or progenitor cells from adult mammalian kidneys [31, 32]. This series 
of studies was promoted by the clinical and experimental observations suggesting that renal proximal 
tubule progenitor cells must exist, because they have the ability to regenerate after severe neophrotoxic 
or ischemic injury to form a fully functional and differentiated epithelium [33, 34]. Whether proximal 
tubule progenitor cells are pluripotent, possessing the ability to differentiate into cells of other segments 
(such as loop of Henle, distal convoluted tubule) as in embryonic kidney development, is presently 
unclear; the clinical state of acute tubular necrosis certainly supports the idea that proximal tubule 
progenitor cells have the ability to replicate and differentiate into proximal tubule cells with functionally 
and morphologically differentiated phenotypes. 

In this regard, recent data have demonstrated, using renal proximal tubule cells in primary culture, 
that the growth factors transforming growth factor-(3l (TGF-(3l) and the epidermal growth factor (EGF), 
along with the retinoid, retinoic acid, promoted tubulogenesis in renal proximal tubule progenitor cells 
in tissue culture [31]. These observations denned a coordinated interplay between growth factors and 
retinoids to induce pattern formation and morphogenesis. This finding is one of the first definitions of 
inductive factors which may be important in the organogenesis of a mammalian organ. In addition, 
using immunofluorescence microscopy, retinoic acid induced laminin A- and Bj-chain production in 
these cells and purified soluble laminin completely substituted for retinoic acid in kidney tubulogenesis. 
These results clearly demonstrate the manner in which retinoic acid, as a morphogen, can promote 
pattern formation and differentiation by regulating the production of an extracellular matrix molecule. 

Further work has demonstrated, in fact, that a population of cells resides in the adult mammalian 
kidney which have retained the capacity to proliferate and morphogenically differentiate into tubule 
structures in vitro [32]. These experiments have identified non-serum-containing growth conditions, 
which select for proximal tubule cells with a high capacity for self-renewal and an ability to differentiate 
phenotypically, collectively and individually, into proximal tubule structures in collagen gels. Regarding 
the high capacity for self-renewal, genetic marking of the cells with a recombinant retrovirus containing 
the lacZ gene and dilution analysis demonstrated that in vitro tubulogenesis often arose from clonal 
expansion of a single genetically tagged progenitor cell. These results suggest that a population of proximal 
tubule cells exist within the adult kidney in a relatively dormant, slowly replicative state, but with a rapid 
potential to proliferate, differentiate, and pattern-form to regenerate the lining proximal tubule epithe- 
lium of the kidney following severe ischemic or toxic injury. 

Bioartificial Tubule Formulation 

The bioartificial renal tubule is now clearly feasible when conceived as a combination of living cells 
supported on polymeric substrata [35]. A bioartificial tubule uses epithelial progenitor cells cultured on 
water and solute-permeable membranes seeded with various biomatrix materials so that expression of 
differentiated vectorial transport and metabolic and endocrine function is attained (Figs. 125.8 and 
125.9). With appropriate membranes and biomatrices, immunoprotection of cultured progenitor cells 
can be achieved concurrent with long-term functional performance as long as conditions support tubule 
cell viability [35, 36]. The technical feasibility of an implantable epithelial cell system derived from cells 
grown as confluent monolayers along the luminal surface of polymeric hollow fibers has been achieved 
[35]. These previously constructed devices, however, have used permanent renal cell lines which do not 
have differentiated transport function. The ability to purify and grow renal proximal tubule progenitor 
cells with the ability to differentiate morphogenically may provide a capability for replacement renal 
tubule function. 
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FIGURE 125.8 Ligh micrograph of an H&E section (lOOx) of hollow fiber lined with collagen type IV and confluent 
monolayer of human renal tubule epithelial cells along the inner component of the fiber. In this fixation process, the 
hollow fiber is clear with the outer contour of the hollow fiber identified by the irregular line (disregard artifact in 
lower left quadrant). 



A bioartificial proximal tubule satisfies a major requirement of reabsorbing a large volume of filtrate 
to maintain salt and water balance within the body. The need for additional tubule equivalents to replace 
another nephronal segment function, such as the loop of Henle, to perform more refined homeostatic 
elements of the kidney, including urine concentration or dilution, may not be necessary. Patients with 
moderate renal insufficiency lose the ability to finely regulate salt and water homeostasis — because they 
are unable to concentrate or dilute, yet are able to maintain reasonable fluid and electrolyte homeostasis 
due to redundant physiologic compensation via other mechanisms. Thus, a bioartificial proximal tubule, 
which reabsorbs iso-osmotically the majority of the filtrate, may be sufficient to replace required tubular 
function to sustain fluid electrolyte balance in a patient with end-stage renal disease. 

Bioartificial Kidney 

The development of a bioartificial filtration device and a bioartificial tubule processing unit would lead 
to the possibility of an implantable bioartificial kidney, consisting of the filtration device followed in 
series by the tubule unit (Fig. 125.10). The filtrate formed by this device will flow directly into the tubule 
unit. The tubule unit should maintain viability, because metabolic substrates and low-molecular-weight 
growth factors are delivered to the tubule cells from the ultrafiltration unit. Furthermore, immunopro- 
tection of the cells grown within the hollow fiber is achievable due to the impenetrance of immunolog- 
ically competent cells through the hollow fiber. Rejection of transplanted cells will, therefore, not occur. 
This arrangement thereby allows the filtrate to enter the internal compartments of the hollow fiber 
network, which are lined with confluent monolayers of renal tubule cells for regulated transport function. 
This device could be used either extracoporeally or implanted within a patient. In this regard, the 
specific implant site for a bioartificial kidney will depend upon the final configuration of both the 
bioartificial filtration and tubule device. As presently conceived, the endothelial-line bioartificial filtration 
hollow fibers can be placed into an arteriovenous circuit using the common iliac artery and vein, similar 
to the surgical connection for a renal transplant. The filtrate is connected in series to a bioartificial 
proximal tubule, which is embedded into the peritoneal membrane, so that reabsorbate will be trans- 
ported into the peritoneal cavity and reabsorbed into the systemic circulation. The processed filtrate 
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FIGURE 125.9 Electorn micrograph of a single hollow fiber lined with extracellular matrix and a confluent mono- 
layer (see Fig. 125.8) of renal tubule cells along the inner component of the fiber. As displayed, the differentiated 
phenotype of renal tubule cells on the hollow fiber prelined with matrix is apparent. The well developed microvilli 
and apical tight junctions can be appreciated (14,000x). 

exiting the tubule unit is then connected via tubing to the proximate ureter for drainage and urine 
excretion via the recipient's own urinary collecting system. 

Although an implantable form of this therapy is in the conceptual phase of development, a functioning 
extracorporeal bioartificial renal tubule has been developed. The technical feasibility of an epithelial cell 
system derived from cells grown as confluent monolayers along the luminal surface of polymeric hollow 
fibers has been recently achieved [38]. Because of its anatomic and physiological similarities with humans 
and the relative simplicity with which it can be bred in large numbers, the pig is currently considered 
the best source of organs for both human xenotransplantation and immuno-isolated cell therapy devices 
[39-41]. Yorkshire breed pig renal tubule progenitor cells have been expanded and cultured on semi- 
permeable hollow fiber membranes seeded with various biomatrix materials, so that expression of 
differentiated vectorial transport, metabolic, and endocrine function has been attained [38, 42, 43]. This 
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FIGURE 125.10 Conceptual schematization of an implantable tissue engineered bioartificial kidney with an endo- 
thelial-cell-lined hemofilter in series with a proximal tubule-cell-lined reabsorber. 

bioartificial renal tubule has been evaluated in uremic and non-uremic large animals [44]. A customized 
extracorporeal circuit using standard arterial venous blood tubing with a dialysis machine has been 
developed which delivers the post-filtered blood through the extracapillary space and the ultrafiltrate 
through the liminal space of the bioartificial tubule. Experiments have confirmed the functional metabolic 
performance and active tubule transport properties of the bioartificial renal tubule using this extracor- 
poreal circuit [44]. Proximal tubule cells derive embryonically from mesodermal progenitors closely 
related to bone marrow precursor cells and have retained many elements of immunologically competent 
cells, including the ability for antigen presentation and production of a variety of immunologically active 
cytokines [45]. Since the kidney participates in this complex and dynamic network of pro- and anti- 
inflammatory cytokines, cell replacement therapy with an extracorporeal bioartificial renal tubule assist 
device may play a critical role in the future treatment of renal failure. 

Bioartificial Endocrine Gland 

The Erythropoietin Generator 

Because specialized cells are programmed to carry out specific biologic tasks, cell therapy may deliver 
several key proteins in a coordinated cascade to promote a biologic or physiologic process. Targeted 
delivery of a specific deficient protein, hormone, or neurotransmitter may be achieved with site-specific 
implantation of cells which can produce this deficient compound after being encapsulated in special 
polymeric membranes. The membranes allow cell nutrients into the encapsulated space to maintain cell 
viability and allow cellular metabolic wastes to exit along with the desired protein, hormone, or neu- 
rotransmitter while shielding the cells from the host's destructive immune response. This strategy is being 
employed, for example, to deliver dopamine produced by bovine adrenal cells to the substantia nigra 
where a deficiency of this neurotransmitter at this site leads to Parkinson's Disease [46]. 

Regulated and homeostatic drug dosing may also be achieved with cell therapy. For hormonal therapy, 
such as insulin for diabetes mellitus, appropriate insulin levels within the body are only crudely attained 
with a once-a-day or twice-a-day dosing. Any hormone-producing cell has a highly evolved biologic 
sensing system to monitor the ambient environment and respond with graded production and release 
of the hormone to regulate the sensed level of the moiety which is regulated. The circulating level of a 
protein or a hormone may be regulated at several levels: at the gene level by transcriptional mechanisms, 
at the protein level by translational processes, or at the secretory level by cellular processes. The complexity 
of regulation increases several-fold, as control progresses from transcriptional to translational to excretory 
processes. Accordingly, a more refined differentiated cell phenotype is required to maintain a regulated 
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secretory process compared to a transcriptional process. The lack of success of encapsulation of insulin- 
producing cells is due to the fact that the cells are unable to maintain a viable, highly differentiated state 
to sense ambient glucose levels and release performed insulin in a regulated, differentiated secretory 
pathway. In contrast, since erythropoietin production is regulated by transcriptional mechanisms, the 
ability to identify and perhaps grow cells from adult mammalian kidneys with the ability to regulate 
erythropoietin production in response to oxygen delivery may allow the design of an implantable cell 
therapy device to sense circulating oxygen levels and regulate erythropoeitin production based upon a 
biologic sensing mechanism. Recently, genetically engineered polymer encapsulated myoblasts have been 
shown to continuously deliver human and mouse erythropoietin in mice [47]. 

125.3 Clinical and Economic Implications 

Although long-term chronic renal replacement therapy with either hemodialysis or CAPD has dramat- 
ically changed the prognosis of renal failure, it is not complete replacement therapy, since it only provides 
filtration function (usually on an intermittent basis) and does not replace the excretory, regulatory, and 
endocrine functions of the kidney. Because of the nonphysiologic manner in which dialysis performs or 
does not perform the most critical renal functions, patients with ESRD on dialysis continue to have major 
medical, social, and economic problems. Renal transplant addresses some of these issues, but immuno- 
logic barriers and organ shortages keep this approach from being ideal for a large number of ESRD 
patients. 

Although dialysis or transplantation therapies can prolong the life of a patient with ESRD disease, it 
is still a serious medical condition, with ESRD patients having only one-fifth the life expectancy of a 
normal age-matched control group. ESRD patients also experience significantly greater morbidity. 
Patients with ESRD have five times the hospitalization rate, nearly twice the disability rate, and five times 
the unemployment rate of age-matched non-ESRD individuals [2]. Accordingly, this new technology 
based upon the proposed bioengineering prototypes would most likely have substantial benefits to the 
patient by increasing life expectancy, increasing mobility and flexibility, increasing quality of life with 
large savings in time, less risk of infection, and reduced costs. 

Besides the personal costs to the patient and family, care of chronic kidney failure is monetarily 
expensive on a per-capita basis in comparison to most forms of medical care [1, 2]. The 1989 estimated 
Medicare payment (federal only) per ESRD patient during the entire year averaged $30,900. The patient 
and private insurance obligations were an addition $6900 per patient. The total cost of a patient per year 
with end-stage renal disease, therefore, is approximately $40,000. In 1988 the expected life span after 
beginning dialysis for an ESRD patient was approximately four years; therefore, the total cost per patient 
of ESRD during his or her lifetime was approximately $160,000 in 1998. 

The total cost in direct medical payments for ESRD, by both public and private payers, increased from 
$6 billion in 1989 to over $14 billion in 1996 [48]. These estimates do not include a number of indirect 
cost items, since they do not include patient travel costs and lost labor production. The number of 
patients receiving chronic dialytic therapy in the U.S. is presently over 250,000 with a current growth 
rate at 8 to 10% per year. It is conceivable that in the not too distant future, tissue engineering technology 
could supplant current treatments for ESRD. Although it is difficult to estimate the value of a technology, 
on a purely economic basis there may be an opportunity for major cost savings with this technology. 

125.4 Summary 

Three technologies will most likely dominate medical therapeutics in the next century. One is "cell 
therapy" — the implantation of living cells to produce a natural substance in short supply from the patient's 
own cells due to injury and destruction from various clinical disorders. Erythropoietin cell therapy is an 
example of this approach to replace a critical hormone deficiency in end-stage renal disease. A second 
therapy is tissue engineering, wherein cells are cultured to replace masses of cells that normally function 
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in a coordinated manner. Growing a functional glomerular filter and tubule reabsorber from a combination 
of cells, biomaterials, and synthetic polymers to replace renal excretory and regulatory functions is an 
example of this formulation. Over the last few years, an extracorporeal bioartificial renal tubule has become 
a reality; demonstrating physiologic and biochemical regulatory function in large animal studies. Finally, 
a third technology that will dominate future therapeutics is gene therapy, in which genes are transferred 
into living cells either to deliver a gene product to a cell in which it is missing or to produce a foreign 
gene product by a cell to promote a new function. The use of genes which encode for anticoagulant proteins 
as a means to deliver in a targeted and local fashion an anticoagulant to maintain hemocompatibility of 
a tissue engineered hemofilter is an example of the application of this third technology. 

The kidney was the first organ whose function was substituted by an artificial device. The kidney was 
also the first organ to be successfully transplanted. The ability to replace renal function with these 
revolutionary technologies in the past was due to the fact that renal excretory function is based upon 
natural physical forces which govern solute and fluid movement from the body compartment to the 
external environment. The need for coordinated mechanical or electrical activities for renal substitution 
was not required. Accordingly, the kidney may well be the first organ to be available as a tissue-engineered 
implantable device as a fully functional replacement part for the human body. 
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XIII. 1 Substitutive Medicine 

Over the past 50 years, humanity has progressively discovered that an engineered device or the 
transplantation of organs, tissues, or cells can substitute for most, and perhaps all, organs and 
body functions. The devices are human-made, whereas the living replacement parts can be 
obtained from the patient, a relative, a human cadaver, or a live animal or can be prospectively developed 
through genetic engineering. The concept that a disease state may be addressed not only by returning 
the malfunctioning organ to health using chemical agents or physical means but also by replacing the 
missing function with a natural or an artificial counterpart has brought about a revolution in therapeutics. 
Currently in the United States alone, 2 to 3 million patients a year are treated with a human-designed 
spare part (assist device, prosthesis, or implant), with the result that over 20 million people enjoy a longer 
or better quality of life thanks to artificial organs. In comparison, a shortage of donor organs limits the 
number of transplantation procedures to about 20,000 a year, and the total population of transplant 
survivors is on the order of 200,000. 

The fundamental tenet of substitutive medicine is that beyond a certain stage of failure, it is more effective 
to remove and replace a malfunctioning organ than to seek in vain to cure it. This ambitious proposition is 
not easy to accept. It goes against the grain of holistic views of the integrity of the person. It seems at odds 
with the main stream of twentieth-century scientific medicine, which strives to elucidate pathophysiologic 
mechanisms at the cellular and molecular level and then to correct them through a specific biochemical key. 
The technology of organ replacement rivals that of space travel in complexity and fanfare and strikes the 
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popular imagination by its daring, its triumphs, and its excesses. Although the artificial organ approach does 
not reach the fundamental objective of medicine, which is to understand and correct the disease process, it 
is considerably more effective than drug therapy or corrective surgery in the treatment of many conditions, 
e.g., cardiac valve disease, heart block, malignant arrhythmia, arterial obstruction, cataract. 

A priori, functional disabilities due to the destruction or wear of body parts can be addressed in two 
ways: the implantation of prosthetic devices or the transplantation of natural organs. For a natural organ 
transplant, we typically borrow a spare part from a living being or from an equally generous donor who 
before death volunteered to help those suffering from terminal organ failure. Transplanted organs benefit 
from refinements acquired over thousands of years of evolution. They are overdesigned, which means 
they will provide sufficient functional support even though the donated part may not be in perfect 
condition at the time of transfer to another person. They have the same shape and the same attachment 
needs as the body part they replace which means that surgical techniques are straightforward. The critical 
problem is the shortage of donors, and therefore only a small minority of patients currently benefit from 
this approach. 

Artificial organs have different limitations. Seen on the scale of human evolution, they are still primitive 
devices, tested for 40 years at most. Yet they have transformed the prognosis of many heretofore fatal 
diseases, which are now allowed to evolve past what used to be their natural termination point. In order 
to design artificial organs, inventive engineers, physiologists, and surgeons think in terms of functional 
results, not anatomical structures. As a result, artificial organs have but a distant similarity to natural 
ones. They are mostly made of synthetic materials (often called biomaterials) which do not exist in nature. 
They use different mechanical, electrical, or chemical processes to achieve the same functional objectives 
as natural organs. They adapt but imperfectly to the changing demands of human activity. They cannot 
easily accommodate body growth and therefore are more beneficial to adults than to children. Most 
critically, artificial organs, as is the case for all machines, have a limited service expectancy because of 
friction, wear, or decay of construction materials in the warm, humid, and corrosive environment of the 
human body. Such considerations limit their use to patients whose life expectancy matches the expected 
service life of the replacement part or to clinical situations where repeated implantations are technically 
feasible. In spite of these obstacles, the astonishing reality is that millions of people are currently alive 
thanks to cardiac pacemakers, cardiac valves, artificial kidneys, or hydrocephalus drainage systems, all of 
which address life-threatening conditions. An even larger number of people enjoy the benefits of hip and 
knee prostheses, vascular grafts, intraocular lenses, and dental implants, which correct dysfunction, pain, 
inconvenience, or merely appearance. In short, the clinical demonstration of the central dogma of 
substitutive medicine over the span of two generations can be viewed demographically as the first step 
in a evolutionary jump which humans cannot yet fully appreciate. 

Hybrid artificial organs, or bioartificial organs, are more recent systems which include living elements 
(organelles, cells, or tissues) as part of a device made of synthetic materials. They integrate the technology 
of natural organ transplantation and the refinements which living structures have gained through millions 
of years of evolution with the purposeful design approach of engineering science and the promises of 
newly developed synthetic materials. Table XIII. 1 provides a current snapshot in the continuing evolution 
of substitutive medicine. 

Depending upon medical needs and anticipated duration of use, artificial organs can be located outside 
of the body yet attached to it (paracorporeal prostheses or assist devices) or implanted inside the body 
in a appropriate location (internal artificial organs or implants). The application of artificial organs may 
be temporary, i.e., a bridge procedure to sustain life or a specific biologic activity while waiting for either 
recovery of natural function (e.g., the heart-lung machine), or permanent organ replacement (e.g., left 
ventricular assist devices). It can be intermittent and repeated at intervals over extended periods of time 
when there is no biologic necessity for continuous replacement of the missing body functions (e.g., 
artificial kidney). It can pretend to be permanent, at least within the limits of a finite life span. 

Up to 1950, organ replacement technology was relatively crude and unimaginative. Wooden legs, 
corrective glasses, and dental prostheses formed the bulk of artificial organs. Blood transfusion was the 
only accepted form of transplantation of living tissue. Suddenly, within a decade, the artificial kidney, 
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TABLE XIII. 1 Evolution of Organ Replacement Technology: A 1995 perspective 



Current Status 

Broadly accepted clinically 



Accepted with reservations 



Limited clinical application 



Experimental stage 



Conceptual stage 



Artificial Organs 

Heart-lung machine 

Large-joint prostheses 

Bone fixation systems 

Cardiac pacemakers 

Implantable defibrillators 

Large vascular grafts 

Prosthetic cardiac valves 

Intra-aortic balloon pump 

Intraocular lenses 

Middle ear ossicle chain 

Hydrocephalus shunts 

Dental implants 

Skin and tissue expanders 

Maintenance hemodialysis 

Chronic ambulatory peritoneal dialysis 

Breast implants 

Sexual prostheses 

Small joint prostheses 

ECMO in children 

ECMO in adults 

Ventricular assist devices 

Cochlear prostheses 

Artificial tendons 

Artificial skin 

Artificial limbs 

Artificial pancreas 

Artificial blood 

Intravenous oxygenation 

Artificial esophagus 

Total artificial heart 

Nerve guidance channels 

Artificial eye 

Neurostimulator 

Blood pressure regulator 

Implantable lung 

Artificial trachea 

Artificial gut 

Artificial fallopian tube 



Transplantation 

Blood transfusion 

Corneal transplants 

Banked bone 

Bone marrow 

Kidney — living related donor 

Kidney — cadaveric donor 

Heart 

Liver 



Whole pancreas 
Single and double lung 
Combined heart-lung 

Cardiomyoplasty 
Pancreatic islets 
Liver lobe or segment 
Small Intestine 



Bioartificial pancreas 
Bioartificial liver 
CNS implants of secreting tissue 
Gene therapy products 



Striated muscle implants 
Smooth muscle implants 
Cardiac muscle implants 
Functional brain implants 
Bioartificial kidney 



the heart-lung machine, the cardiac pacemaker, the arterial graft, the prosthetic cardiac valve, and the 
artificial hip joint provided the first sophisticated examples of engineering in medicine. More recently, 
the membrane lung, the implantable lens, finger and tendon prostheses, total knee replacements, and 
soft-tissue implants for maxillo-facial, ear, or mammary reconstruction have reached the stage of broad 
clinical application. Ventricular assist devices and the total artificial heart have been extensively tested in 
animals and validated for clinical evaluation. Artificial skin is increasingly used in the treatment of ulcers 
and burns. Soft- and hard-tissue substitutes function effectively for several years. Sexual and sensory 
prostheses offer promises for the replacement of complex human functions. Interfacing of devices with 
the peripheral and central nervous systems appears as promising today as cardiovascular devices were 
30 years ago. Perhaps the brightest future belongs to "information prostheses" which bring to the human 
body, signals which the organism can no longer generate by itself (e.g., pacemaker functions), signals 
which need to be modulated differently to correct a disease state (e.g., electronic blood pressure regula- 
tors) or signals which cannot be perceived by the nervous system through its usual channels of informa- 
tion gathering (e.g., artificial eye or artificial ear). 
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XIII.2 Biomaterials 

The materials of the first generation of artificial organs — those which are widely available at the 
moment — are for the most part standard commodity plastics and metals developed for industrial pur- 
poses. Engineers have long recognized the limitations of construction materials in the design and per- 
formance of machines. However, a new awareness arose when they started interacting with surgeons and 
biologic scientists in the emerging field of medical devices. In many cases the intrinsic and well established 
physical properties of synthetic materials such as mechanical strength, hardness, flexibility, or perme- 
ability to fluids and gases were not as immediately limiting as the detrimental effects deriving from the 
material's contact with living tissues. As a result, fewer than 20 chemical compounds among the 
1.5 million candidates have been successfully incorporated into clinical devices. Yet some functional 
implants require material properties which exceed the limits of current polymer, ceramic, or metal alloy 
technology. This is an indirect tribute to the power of evolution, as well as a challenge to scientists to 
emulate natural materials with synthetic compounds, blends, or composites. 

The progressive recognition of the dominant role of phenomena starting at the tissue-material interface 
has led to two generalizations: 

1. All materials in contact with body fluids or living tissue undergo almost instantaneous and then 
continuing surface deposition of body components which alter their original properties. 

2. All body fluids and tissues in contact with foreign material undergo a dynamic sequence of biologic 
reactions which evolve over weeks or months, and these reactions may remain active for as long 
as the contact persists and perhaps even beyond. 

The recognition of biologic interactions between synthetic materials and body tissues has been trans- 
lated into the twin operational concepts of biomaterials and compatibility. 

Biomaterials is a term used to qualify materials which can be placed in intimate contact with living 
structures without harmful effects. Compatibility characterizes a set of material specifications and con- 
straints which address the various aspects of material-tissue interactions. More specifically, hemocom- 
patibility defines the ability of a biomaterial to stay in contact with blood for a clinically relevant period 
of time without causing alterations of the formed elements and plasma constituents of the blood or 
substantially altering the composition of the material itself. The term biocompatibility is often used to 
highlight the absence of untoward interactions with tissues other than blood (e.g., hard or soft tissues). 

It is worth observing that hemocompatibility and biocompatibility are virtues demonstrated not by 
the presence of definable favorable properties but rather by the absence of adverse effects on blood or 
other tissues. Although these terms imply positive characteristics of the material, the presumption of 
compatibility is actually based on the accumulation of negative evidence over longer and longer periods 
of time, using an increasingly complex battery of tests, which must eventually be confirmed under the 
conditions of clinical use. 

The clinical success of materials incorporated into actual devices is altogether remarkable, considering 
how limited our understanding is of the physical and biologic mechanisms underlying tissue-material 
interactions. Indeed the most substantial conclusion one can draw from a review of records of literally 
millions of implants is how few major accidents have been reported and how remarkably uncommon 
and benign have been the side effects of implanting substantial amounts of synthetic substances into the 
human body. Artificial organs are by no means perfect. Their performance must be appreciated within 
the same limits that the inexorable processes of disease and aging impose on natural organs. 

XIII.3 Outlook for Organ Replacement 

Now emerging is a second generation of implantable materials through the confluence of biomaterial 
science and cell biology (Fig. XIII. 1). Cell culture technology, taking advantage of biotechnology products 
and progressing to tridimensional tissue engineering on performed matrices, now provides building 
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FIGURE XIII. 1 Schematic description of the advances in engineering, biologic, and medical technology which led 
to the first generation of artificial organs (read from the top) and the newer developments in body replacement parts 
(read from the bottom). 

blocks which incorporate the peptide or glycoprotein sequences responsible for cell-to-cell interactions. 
This combination leads to a new class of biohybrid devices which includes 

1. Cellular transplants for continuing secretion of bioactive substances (e.g., transplants of insulin- 
producing xenograft tissue protected against immune rejection by permselective envelopes) 

2. Composites of synthetic materials with living cells (often called organoids) to accelerate implant 
integration within the body (e.g. endothelial cell-lined polymer conduits designed for vascular grafts) 

3. Replacement parts in which natural tissue regeneration is activated by the presence of supportive 
cells (e.g., Schwann cell-seeded nerve guidance channels) 

4. Vehicles for gene therapy in which continued gene expression is enhanced by a synthetic polymer 
substrate with appropriate mechanical, chemical, or drug release properties (e.g., epicardial trans- 
plants of genetically modified skeletal or cardiac muscle grown on a distensible polymer matrix) 

In many respects, the new wave of organ replacement exemplifies the synergy of the two original currents 
in substitutive medicine: prostheses and transplants. It expands the feasibility of cell and tissue transplan- 
tation beyond the boundaries of autografts and related donor allografts, opening the way to xenogeneic 
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and engineered replacement parts. It also confronts the "foreign body" limitations of human-made syn- 
thetic implants by adding the molecular and cellular elements that favor permanent biointegration. 

XIII.4 Design Considerations 

Natural organ transplants, if ideally preserved, should be able to fulfill all functions of the original body 
part except for those mediated by the nervous system, since a transplanted organ is by definition a 
denervated structure. In actuality, transplants always present some degree of ischemic damage caused by 
interruption of the blood supply during transfer from donor to recipient. This may be reflected by a 
temporarily impaired function in the postoperative period or by permanent necrosis of the most delicate 
components of the transplant, resulting in some degree of functional limitation. In the long run, trans- 
planted organs may also exhibit functional alterations because of cell or tissue damage associated with 
an underlying systemic disease. They may be damaged by the immunosuppression protocol, which at 
the current stage is needed for all organ replacements except for autografts, identical-twin homografts, 
and some types of fetal tissue transplants. The second-order limitations of transplanted organs are usually 
ignored, and the assumption is made that all original functions are restored in the recipient. 

Artificial organs, however, can only replace those bodily functions which have been incorporated into 
their design because these functions were scientifically described and known to be important. Therefore, 
in the design of an artificial organ, the first task is to establish the specifications for the device, i.e., to 
describe in quantitative terms the function or functions which must be fulfilled by a human-made 
construct and the physical constraints that apply because the device must interface with the human body. 
Each human organ fulfills multiple functions of unequal importance in terms of survival. Consequently, 
it is critical to distinguish the essential functions which must be incorporated into an effective spare part 
from those which can be neglected. 

Defining specifications and constraints is the first step in the conceptualization of an artificial organ. 
Only when this is done can one think realistically about design alternatives, the limitations of available 
materials, and the clinical constraints which will apply, of which the key ones are connections to the 
body and duration of expected service. 

Once all these considerations have been integrated (modeling is often useful at that stage), the next 
step is typically the construction of a prototype. Ideally the device should achieve everything it was 
expected to do, but usually it exhibits some level of performance and durability which falls short of design 
specifications, either because of some misjudgment in terms of required function or because of some 
unanticipated problem arising at the interface between the device and the body. 

The following step of development may be called optimization, if the specifications were well defined 
from the outset, or reevaluation, if they were not. More commonly it is the reconciliation of competition 
and at times contradictory design criteria which leads to a second prototype. 

At this point, new experiments are needed to establish the reliability and effectiveness of the device in 
animal models of the target disease (if such exist) or at least in animals in which the natural organ can 
be removed or bypassed. This is the stage of validation of the device, which is first conducted in acute 
experiments and must later be extended to periods of observation approximating the duration of intended 
use in humans. These criteria, however, cannot always be met for long-term implants, since the life 
expectancy of most animals is shorter than that of humans. By this point, the diverse vantage points of 
the theoretician, the manufacturer, the performance evaluator, and the clinical user have been articulated 
for some specific devices and generalized in terms of quality control for classes of devices. 

The final stage of design, for many artificial organs, is individualization, i.e., the ability to fit the needs 
of diverse individuals. Humans come in a wide range of body sizes. In some cases, the prostheses must 
fit very strict dimensional criteria, which implies that they must be fabricated over an extended range of 
sizes (e.g., cardiac valves). In other cases, there is enough reserve function in the device that one pediatric 
model and one adult size model may suffice (e.g., blood oxygenator for cardiac surgery). 
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XIII.5 Evaluation Process 

The evaluation of an artificial organ typically is done in six phases: 

1. In vitro bench testing 

2. Ex vivo appraisal 

3. In vivo studies with healthy experimental animals 

4. In vivo studies with animal models of disease 

5. Controlled clinical trials 

6. General clinical use 

In Vitro Bench Testing 

In vitro bench testing of a completed prototype has three major purposes: 

1. To observe the mode of operation of the device and assess its performance under tightly controlled 
circumstances 

2. To define performance in quantitative terms over a wide range of environmental or input conditions 

3. To assess the device's reliability and durability in a manner which can be extrapolated to the 
intended clinical use 

For all its value, there are limitations to in vitro testing of devices. Devices are made to work while in 
contact with body fluids or body tissues. This complex environment modifies materials in ways which 
are not always predictable. To duplicate this effect as closely as possible a laboratory bench system can 
be made to match the body's environment in terms of temperature and humidity. Operating pressures 
and external forces can also be imitated but not perfectly reproduced (e.g., the complex pulsatile nature 
of cardiovascular events). Other fluid dynamic conditions such as viscosity, wall shear stress, and com- 
pliance of device-surrounding structures call for sophisticated laboratory systems and can only be approx- 
imated. The chemical environment is the most difficult to reproduce in view of the complexity of body 
fluids and tissue structures. Some in vitro testing systems make use of body fluids such as plasma or 
blood. This in turn brings in additional intricacies because these fluids are not stable outside of the body 
without preservatives and must be kept sterile if the experiment is to last more than a few hours. 

Accelerated testing is a standard component in the evaluation of machines. It is critical for permanent 
implants with moving parts which are subject to the repeated action of external forces. Fatigue testing 
provides important information on progressive wear or catastrophic failure of device components. For 
example, the human heart beats about 40 million times per year. Manufacturers and regulatory agencies 
conduct testing of prosthetic cardiac valves over at least 400 million cycles. With a testing apparatus 
functioning at 1200 cycles per minute, this evaluation can be compressed by a factor of about 15, i.e., to 
about a year. 

Ex Vivo Appraisal 

Because of the difficulty of keeping blood in its physiologic state in a container, the evaluation of some 
blood processing or blood contacting devices is performed by connecting them through the skin to an 
artery or vein or both if the blood must be returned to the cardiovascular system to avoid excessive 
hemorrhage. Such experiments retain the advantage of keeping the device under direct observation while 
allowing longer experiments than are feasible in vitro, particularly if the animal does not require general 
anesthesia. It is also possible in some cases to evaluate several devices in parallel or sequentially under 
quite realistic conditions and therefore to conduct comparative experiments under reasonably standard- 
ized conditions. 
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In Vivo Evaluation with Healthy Experimental Animals 

There comes a stage in the development of most devices where they must be assessed in their target 
location in a living body. The matching of device size and shape with available experimental sites in the 
appropriate animal species is a necessary condition. Such experiments typically last weeks, months, or 
years and provide information about body-device and tissue-material interactions either through non- 
invasive measurement techniques or through device retrieval at the end of the observation period. 
Rodents, felines, and dogs raised for research purposes are usually too small for the evaluation of human- 
sized devices. Farm animals such as sheep, goats, pigs, and calves are commonly used. Here again the 
limited life expectancy of experimental animals prevents studies for periods of service as long as can be 
expected with permanent implants in man. 

In Vivo Evaluation with Animal Models of Disease 

A first approximation of the effectiveness of a device in replacing a physiologic function can be obtained 
after removing the target organ in a normal animal. However, when the organ failure is only the cardinal 
sign of a complex systemic disease, the interactions between the device and the persisting manifestations 
of the disease occur spontaneously in some species and in other cases can be obtained by chemical, 
physical, or surgical intervention. Where such models of disease exist in animals which can be fitted with 
a device, useful information is obtained which helps to refine the final prototypes. 

Controlled Clinical Trials 

Although some devices can be evaluated with little risk in normal volunteers who derive no health benefit 
from the experiment, our culture frowns on this approach and legal considerations discourage it. Once 
reliability and effectiveness have been established through animal experiments and the device appears to 
meet a recognized clinical need, a study protocol is typically submitted to an appropriate ethics committee 
or institutional review board and, upon their approval, a series of clinical trials is undertaken. The first 
step often concentrates on the demonstration of safety of the device with a careful watch for side effects 
or complications. If the device passes this first hurdle, a controlled clinical trial will be carried out with 
patients to evaluate effectiveness as well as safety on a scale which allows statistical comparison with a 
control form of treatment. This protocol may extend from a few months to several years depending upon 
the expected benefits of the device and the natural history of the disease. 

General Clinical Use 

Once a device is deemed successful by a panel of experts, it may be approved by regulatory agencies for 
commercial distribution. Increasingly a third stage of clinical evaluation appears necessary, namely 
postmarket surveillance, i.e., a system of clinical outcomes analysis under conditions of general availability 
of the device to a wide range of doctors and patients. 

Postmarket surveillance is a new concept which is not yet uniformly codified. It may take the form of 
a data collection and analysis network, a patient registry to allow continuing follow-up and statistical 
analysis, a device-tracking system aimed at early identification of unforseen types of failure, or ancillary 
controls such as inspection of facilities and review of patient histories in institutions where devices are 
used. Protocols of surveillance on a large scale are difficult and costly to implement and their cost- 
effectiveness is therefore open to question. They are also impaired by the shortage of broadly available 
and minimally invasive diagnostic methods for assessing the integrity or function of a device prior to 
catastrophic failure. Worthwhile postmarket surveillance requires a constructive collaboration between 
patients, doctors, device manufacturers, government regulatory agencies, and study groups assessing 
health care policy issues in the public and private sectors. 
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Defining Terms 

Artificial organs: Human-made devices designed to replace, duplicate, or augment, functionally or 
cosmetically, a missing, diseased, or otherwise incompetent part of the body, either temporarily 
or permanently, and which require a nonbiologic material interface with living tissue. 

Assist device: An apparatus used to support or partially replace the function of a failing organ. 

Bioartificial organ: Device combining living elements (organelles, cells, or tissues) with synthetic 
materials in a therapeutic system. 

Bicompatibility: The ability of a material to perform with an appropriate host tissue response when 
incorporated for a specific application in a device, prosthesis, or implant. 

Biomaterial: Any material or substance (other than a drug) or combination of materials, synthetic or 
natural in origin, which can be used as a whole or as a part of a system which treats, augments, 
or replaces any tissue, organ, or function of the body. 

Compatibility: A material property which encompasses a set of specifications and constraints relative 
to material-tissue interactions. 

Device: Defined by Congress as "...an instrument, apparatus, implement, machine, contrivance, 
implant, in vitro reagent, or other similar or related article, including any component, part, or 
accessory, which is... intended for use in the diagnosis of disease or other conditions, or in the 
cure, mitigation, treatment, or prevention of disease, in man or other animals,... and which does 
not achieve any of its principal intended purposes through chemical action within or on the body 
of man or other animals and which is not dependent upon being metabolized for the achievement 
of any of its principal intended purposes." 

Hemocompatibility: The ability of a biomaterial to stay in contact with blood for a clinically relevant 
period of time without causing alterations of the blood constituents. 

Hybrid artificial organs: Synonym of bioartificial organs, stressing the combination of cell transplan- 
tation and artificial organ technology. 

Implant: Any biomaterial or device which is actually embedded within the tissues of a living organism. 

Organs: Differentiated structures or parts of a body adapted for the performance of a specific operation 
or function. 

Organoid: An organlike aggregate of living cells and synthetic polymer scaffolds or envelopes, designed 
to provide replacement or support function. 

Organ transplant: An isolated body part obtained from the patient, a living relative, a compatible 
cadaveric donor, or an animal and inserted in a recipient to replace a missing function. 

Prosthesis: An artificial device to replace a missing part of the body. 

Substitutive medicine: A form of medicine which relies on the replacement of failing organs or body 
parts by natural or human-made counterparts. 

Tissue-material interface: The locus of contact and interactions between a biomaterial, implant, or 
device and the tissue or tissues immediately adjacent. 
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Further Information 

The articles and books listed above provide different overviews in the filed of organ replacement. 
Historically most contributions to the filed of artificial organs were described or chronicled in the 
40 annual volumes of the Transactions of the American Society of Artificial Internal Organs (1955 to 
present). The principal journals in the field of artificial organs are Artificial Organs (currently published 
by Blackwell Scientific Publications, Inc.), the ASAIO Journal (published by J. B. Lippincott Company), 
and the International Journal of Artificial Organs (published by Wichtig Editore s.r.l. Milano, Italy). 
Publications of the Japanese Society for Artificial Organs (typically in Japanese with English abstracts, 
Artificial Organs Today, for example) contain substantial information. 
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In 1812, LeGallois [1813] postulated the use of mechanical circulatory support. In 1934, DeBakey 
proposed a continuous flow blood transfusion instrument using a simple roller pump [DeBakey, 1934]. 
In 1961, Dennis et al., performed left heart bypass by inserting cannulae into the left atrium and returning 
blood through the femoral artery [Dennis, 1979]. In 1961, Kolff and Moulopoulos developed the first 
intra-aortic balloon pump [Moulopoulos et al., 1962]. In 1963, Liota performed the first clinical implan- 
tation of a pulsatile left ventricular assist device [Liotzet al., 1963]. In 1969, Dr. Denton Cooley performed 
the first total artificial heart implantation in a human [Cooley et al., 1969]. Since 1969, air driven artificial 
hearts and ventricular assist devices have been utilized in over 200 and 600 patients, respectively. The 
primary use of these devices has been as a bridge to transplant. Recently, electrically powered ventricular 
assist devices have been utilized in humans as a bridge to transplant. These electrically powered systems 
utilize implanted blood pumps and energy converters with percutaneous drive cables and vent tubes. 

Completely implanted heart assist systems requiring no percutaneous leads have been implanted in 
calves with survivals over eight months. The electric motor-driven total artificial hearts being developed 
by several groups in the United States are completely implanted systems employing transcutaneous energy 
transmission and telemetry [Rosenberg, 1991. Appendix C, The Artificial Heart, Prototypes, Policies, and 
Patients}. One total artificial heart design has functioned in a calf for over five months [Snyder, 1993]. 

There has been steady progress in the development of artificial heart and circulatory assist devices. 
Animal survivals have been increasing, and patient morbidity and mortality with the devices has been 
decreasing. It does not appear that new technologies or materials will be required for the first successful 
clinical application of long-term devices. There is no doubt that further advances in energy systems, 
materials, and electronics will provide for smaller and more reliable devices, but for the present, sound 
engineering design using available materials and methods appear to be adequate to provide devices 
satisfactory for initial clinical trials. 
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126.1 Engineering Design 

A definition for design is given by Pahl and Beitz [1977]: "Designing is the intellectual attempt to meet 
certain demands in the best possible way. It is an engineering activity that impinges on nearly every 
sphere of human life, relies on the discoveries and laws of science, and creates the conditions for applying 
these laws to the manufacture of useful products." Designing is a creative activity that calls for sound 
grounding in mathematics, physics, chemistry, mechanics, thermodynamics, hydrodynamics, electrical 
engineering, production engineering, materials technology, and design theory together with practical 
knowledge and experience in specialist fields. Initiative, resolution, economic insight, tenacity, optimism, 
sociability, and teamwork are all qualities that will assist the designer. Engineering design has been broken 
into many steps or phases by various authors. In general, though, each of these definitions includes some 
common tasks. No matter what device is being designed, be it a complicated device such as a totally 
implantable artificial heart or a simpler device, such as a new fastener, sound engineering design principles 
utilizing a methodical approach will help ensure satisfactory outcome of the design process. The engi- 
neering design process can be broken down into at least four separate stages. 

1. Define the problem — clarification of the task. At first it may appear that defining the problem or 
clarifying the task to be accomplished is an easy matter. In general, this is not the case. Great care 
should be taken in defining the problem, being very specific about the requirements and specifi- 
cations of the system to be designed. Very often, a complex system can be reduced to the solution 
of one core problem. An excellent way to begin defining the problem or clarifying the task to be 
accomplished is to begin by writing a careful design specification or design requirement. This is 
a document that lists all the requirements for the device. Further, this design requirement may 
elucidate one or two specific problems which, when solved, will yield a satisfactory design. 

2. Conceptual design — plan treatment. After the problem has been defined, the designer must plan 
the treatment of the problem and begin conceptual design. In this phase possible solutions to the 
problem at hand are examined. Various methods of determining possible solutions include brain- 
storming, focus groups, Delphi method. This is the phase of the design process where a thorough 
review of the literature and examination of similar problems is valuable. In this phase of design 
each of the proposed solutions to the problem should be examined in terms of a hazard analysis 
or failure modes and effects analysis to determine which solution appears the most feasible. 
Economic considerations should also be examined in this phase. 

3. Detailed design — Execute the plan. In this phase of engineering design, a detailed design is formu- 
lated. Perhaps two designs may be evaluated in the initial detailed design phase. As the detailed 
design or designs near completion, they must be examined with reference to the design specifi- 
cations. Here, each of the proposed designs may be evaluated with regard to its ability to perform. 
Such aspects as system performance, reliability, manufacturability, cost, and user acceptance are 
all issues which must be considered before a final design is chosen. 

4. Learn and generalize. Finally, after the design is complete, the designer should be able to learn and 
generalize from the design. This educational process will include manufacturing of prototypes 
and testing. General concepts and principles may be gleaned from the design process that can be 
applied to further designs. 

The remainder of this chapter will deal with the application of this engineering design method to the 
design of artificial heart and circulatory assist devices. 

126.2 Engineering Design of Artificial Heart 
and Circulatory Assist Devices 

Define the Problem — Clarification of the Task 

In the broadest sense, this step can best be accomplished by writing the detailed design requirement or 
specification for the device. In defining the problem, it is often easiest to begin with the most obvious 
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and imperative requirements and proceed to the subtler and less demanding. A general statement of the 
problem for a total artificial heart or assist device is "to develop a device (perhaps totally implantable) 
that when implanted in the human will provide a longer and better quality of life than conventional 
pharmacologic or transplant therapy." The devices considered will be assumed to be permanent implant- 
able devices, not necessarily totally implantable; they may utilize percutaneous wires or tubes. In general, 
it will be assumed that they will be intended for long-term use (one year or longer) but may also be 
utilized for short-term applications. 

Fit of the System 

One must first decide who the device is intended for. Will it be used in men and women, and of what size? 
No matter how good the device is, it must first "fit" the patient. For our example, let us assume the device 
will be used in men and women in the size range of 70-100 kg. The device must then fit in these patients 
and cause minimal or no pathologic conditions. When considering the fit of the implanted device, one must 
consider the volume and mass of the device, as well as any critical dimension such as the length, width, or 
height and the location of any tubes, conduits, or connectors. Careful consideration must be given to the 
physical attributes of the system such as whether the system should be hard, soft, rough, or smooth and the 
actual shape of the system in terms of sharp corners or edges that may damage tissue or organs. The design 
specification must give the maximum length, height, and width of the device, these dimensions being dictated 
by the patient's anatomy. The designer must not be limited by existing anatomy; the opportunity to surgically 
alter the anatomy should be considered. Nontraditional locations for devices should be considered. 

The device should not project heat in such a way that surfaces in contact with tissue or blood are 
subjected to a temperature rise 5°C above core temperature on a chronic basis. The use of heat spreaders 
or fins, along with insulation, may be required. Heat transfer analysis may be helpful. 

The effect of device movement and vibration should be considered in the design specification. The 
acceptable sound levels at various frequencies must be specified. A device should meet existing standards 
for electromagnetic interference and susceptibility. 

The use of any percutaneous tubes will require the choice of an exit site. This site must not be in a 
location of constant or excessive movement or tissue breakdown will be experienced at the interface. 

Pump Performance 

Pump performance must be specified in terms of cardiac output range. In a 70-kg person, a normal 
resting cardiac output is approximately 70ml/min/kg or 5 1/min. Choosing a maximum cardiac output 
of approximately 8 1/min will allow the patient the ability to perform light exercise. The cardiac output 
performance must be obtained at physiologic pressures, that is, the device, be it a heart assist or total 
artificial heart device, must be able to pump a cardiac output ranging up to 8 1/min with physiologic 
inlet and outlet pressures (central venous pressure ~ 5 mmHg mean, left atrial pressure ~ 7 mmHg mean, 
pulmonary artery pressure ~ 15 mmHg mean, aortic pressure ~ 100 mmHg mean). 

Control of the device is critical and must be included in the design specification. For an assist pump, 
it may be as simple as stating that the pump will pump all the blood delivered to it by the native heart 
while operating under physiologic inlet and outlet pressures. Or, the design specification may include 
specific requirements for synchronization of the device with the natural heart. For the total artificial 
heart, the device must always maintain balance between the left and right pumps. It must not let left 
atrial pressure rise above a value that will cause pulmonary congestion (approximately 20 mmHg). The 
device must respond to the patient's cardiac output requirements. The device must either passively or 
though an active controller vary its cardiac output upon patient demand. 

Bicompatibility 

Bicompatibility has already been alluded to in the design requirements by saying that the device must 
not cause excessive damage to the biologic system. Specifically, the device must be minimally thrombo- 
genic and minimally hemolytic. It should have a minimal effect on the immune system. It should not 
promote infection, calcification, or tissue necrosis. Meeting these design requirements will require careful 
design of the pumping chamber and controller and careful selection of materials. 
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Reliability 

The design specification must assign a target reliability for the device. For total artificial hearts and 
circulatory assist devices, the NIH has proposed a reliability of 80% with an 80% confidence for a 2-year 
device life. This is the value that the NIH feels is a reliability goal to be achieved before devices can begin 
initial clinical trials, but the final design reliability may be much more stringent, perhaps 90% reliable 
with 95% confidence for a 5-year life. The design specification must state which components of the 
system could be changed if necessary. The design specification must deal with any service that the device 
may require. For instance, the overall design life of the device may be 5 years, but battery replacement 
at 2-year intervals may be allowed. The reliability issue is very complex and involves moral, ethical, legal, 
and scientific issues. A clear goal must be stated and is necessary before the detailed design can begin. 

Quality of Life 

The design specification must address the quality of life for the patient. The designers must specify what 
is a satisfactory quality of life. Again, this is not an easy task. Quantitative measures of the quality of life 
are difficult to achieve. One person's interpretation of a satisfactory quality of life may not be the same 
as another's. It must always be kept in mind that the quality of life for these patients without treatment 
would generally be considered unsatisfactory by the general public. The prognosis for patients before 
receiving artificial hearts and circulatory assist devices is very poor. The quality of life must thus be 
considered in relation to the patient's quality of life without the device without ignoring the quality of 
life of individuals unaffected by cardiac disease [Rosenberg, 1991]. 

The design specification must state the weight of any external power supplies. The designer must 
consider how much weight an older patient will be able to carry How long should this patient be able 
to be free-roaming and untethered? How often will the energy source require a "recharge?" What sound 
level will be acceptable? These are all issues that must be addressed by the designer. 

All the foregoing must be considered and included in the definition of the problem and clarification 
of the task. Each of these issues should be clearly described in the design specification or requirement. 
In many instances there are no right and wrong answers to these questions. 

Conceptual Design — Plan Treatment 

In the conceptual design phase, the designer must plan the treatment of the problem and consider various 
designs that meet the design specification. In the design specification, it must be stated whether the blood 
pump is to be pulsatile or nonpulsatile. If there is no requirement in the design specification, then in 
the conceptual design phase, the designer may consider various nonpulsatile and pulsatile flow devices. 
Nonpulsatile devices include centrifugal pumps, axial flow pumps, shear flow pumps, and peristaltic 
pumps. Pulsatile pumps have traditionally been sac- or diaphragm-type devices. At the present time there 
is no definitive work describing the absolute requirement for pulsatility in the cardiovascular system; 
thus, both types of devices can be considered for assist and total artificial hearts. 

In the conceptual design phase, the designer should consider other nontraditional solutions to the 
problem such as devices that employ micro-machines or magneto-hydrodynamics. Careful consideration 
must be given to source of energy. Sources that have been considered include electrical energy stored in 
batteries or derived from piezoelectric crystals, fuel cells, and thermal energy created either thermonu- 
clearly or through thermal storage. The performance of each of these energy sources must be considered 
in the conceptual design phase. Public considerations and cost for a thermonuclear source, at the present 
time, have essentially eliminated this as an implantable energy source. Thermal storage has been shown 
to be a feasible energy source, and adequate insulation has been developed and demonstrated to be 
reliable for short periods of time [Uringer, 1989]. 

Steady flow devices that employ seals within the blood stream deserve very careful consideration of the 
seal area. These seals have been prone to failure, causing embolization. Active magnetic levitation has been 
proposed for component suspension and may be considered as a possible design. A device that utilities a 
rotating member such as a rotating electric motor that would drive an impeller or mechanical mechanism 
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will create forces on the tissue when the patient moves due to gyroscopic or coriolis accelerations. These 
forces must be considered. 

Pump performance, in terms of purely hydraulic considerations, can be achieved with any of the 
pumping systems described. Control of these devices may be more difficult. Designs that include 
implanted sensors such as pressure or flow sensors must deal with drift of these signals. Very often, signal- 
to-noise ratios are poor in devices that must operate continuously for several years. Designs that employ 
few or no sensors are preferable. 

Systems such as sac-type blood pumps have been described as having intrinsic automatic control. That 
is, these devices can run in a fill limited mode, and, as more blood is returned to the pump, it will increase 
its stroke volume. Intrinsic control is desirable, but, unfortunately, this generally provides for only a 
limited control range. Nearly all devices currently being developed employ some form of automatic 
electronic control system. These automatic control systems are utilized on both total artificial hearts and 
assist devices. In some designs system parameters such as blood pump fill time, electric current, voltage, 
or speed are utilized to infer the state of the circulatory system. These types of systems appear to 
demonstrate good long-term stability and eliminate the potential for device malfunction associated with 
transducer drift. 

Consideration of pump performance and the interaction with the biologic system is important in the 
conceptual design phase. Two pump designs may be capable of pumping the same in a hydrodynamic 
sense in terms of pressures and flows, but one pump may have much higher shear stresses than the other 
and thus be more hemolytic. One device may have much more mechanical vibration or movement 
compromising surrounding tissue. 

The subject of biocompatibility for circulatory assist and artificial hearts is a very complex one. No 
matter what type device is designed, its interaction with the environment is paramount. Blood-contacting 
materials must not cause thrombosis and should have a minimal effect on formed elements in the blood. 
In terms of tissue biocompatibility, the device should not have sharp corners or areas where pressure 
necrosis can occur. Both smooth and rough exterior surfaces of devices have been investigated. It appears 
that devices in contact with the pleura tend to form a thinner encapsulation when they are rough surfaced. 
Compliance chambers form much thinner capsules when they have a rough surface. In other areas, it is 
not entirely clear if a rough surface is advantageous. Tissue ingrowth into a rough surface makes removal 
of the device difficult. 

The selection of materials in the design of these devices is limited. The designer's job would be made 
much easier if there were several completely biocompatible materials available. If the designer only had 
a perfectly nonthrombogenic material that had outstanding fatigue properties, the design of these devices 
would be greatly simplified! Unfortunately, at the present time the designer is limited to existing materials. 
Traditionally, metals that have been employed in blood pumps include various stainless steels, cobalt, 
cobalt chromium alloys, and titanium. Each of these materials has adequate performance when in contact 
with tissue and blood under certain circumstances. Ceramic materials such as pyrolytic carbon, alumina, 
and zirconia have been used in contact with both tissue and blood with varying degrees of success. The 
range of polymeric materials that have been utilized for these devices is much greater. These materials 
include various polyurenthanes, silicone rubber, Kel-F, Teflon, Delrin, butyl rubber, Hexsyn rubber, 
polysulfone, polycarbonate, and others [Williams, 1981]. The designer must carefully consider all the 
properties of these materials when employing them. The designer must look at the strength, durability, 
hardness, wear resistance, modulus of elasticity, surface energy, surface finish, and biocompatibility before 
choosing any of these materials. 

The interaction between the biologic system and these materials is complex and may be strongly 
influenced by fluid mechanics if in contact with blood. The designer should carefully consider surface 
modification of any of these materials. Surface modification can be performed, including ion implantation 
or grafting of various substances to the surface to promote improved compatibility. Manufacturing and 
fabrication processes have a profound effect on the properties of these materials and must be carefully 
analyzed and controlled. 
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The designer must give a great deal of consideration to the fluid mechanics involved. It is well known 
that excessive shear stresses can promote hemolysis and activation of the clotting system, as well as damage 
to the other formed elements in the blood. Not only the actual design of the blood pump, but the 
operation of the blood pump can affect phenomena such as cavitation, which can be hemolytic and 
destructive to system components. Thrombosis is greatly affected by blood flow. Regions of stagnation, 
recirculation, and low wall shear stress should be avoided. The magnitude of "low" shear stress is reported 
in the literature with wide range [Folie & Mclntire, 1989; Hashimoto et al, 1985; Hubbel & Mclntire, 
1986]. 

Many of the analytical tools available today in terms of computational fluid dynamics and finite element 
analysis are just beginning to be useful in the design of these devices. Most of these systems have complex 
flow (unsteady, turbulent, non-Newtonian with moving boundaries) and geometries and are not easily 
modeled. 

Once a reliability goal has been established, the designer must ensure that this goal is met. The natural 
heart beats approximately 40 million times a year. This means that an artificial heart or assist device with 
a 5-year life may undergo as may as 200,000,000 cycles. The environment in which this device is to 
operate is extremely hostile. Blood and extracellular fluids are quite corrosive and can promote galvanic 
or crevice corrosion in most metallic materials. Devices that employ polymeric materials must deal with 
diffusion of mass across these materials. Water, oxygen, nitrogen, carbon dioxide, and so on may all 
diffuse across these polymeric materials. If temperature fluctuations occur or differences exist, liquid 
water may form due to condensation. With carbon dioxide present, a weak acid can be formed. Many 
polymeric materials are degraded in the biologic environment. A careful review of the literature is 
imperative. The use of any "new" materials must be based upon careful testing. The designer must be 
aware of the difficulty in providing a sealed system. The designer may need to utilize hermetic connectors 
and cables which can tolerate the moist environment. All these affect the reliability of the device. 

External components of the system which may include battery packs or monitoring and perhaps 
control functions have reliability requirements that differ from the implanted components. System 
components which are redundant may not require the same level of reliability as do nonredundant 
implanted components. Externally placed components have the advantage of being amenable to preven- 
tative maintenance or replacement. Systems that utilize transcutaneous energy transmission through 
inductive coupling may have advantages over systems that utilize a percutaneous wire. Although the 
percutaneous wire can function for long periods of time, perhaps up to 1 year, localized infection is 
almost always present. This infection can affect the patient's quality of life and be a source of systemic 
infection. 

In the conceptual design phase, one must carefully weigh quality of life issues when evaluating solutions 
to the problem. Careful consideration needs to be given to the traditional quality of life issues such as 
the patient's emotional, social, and physical well-being, as well as to the details of day-to-day use of the 
device. What are the frequency and sound level requirements for patient prompts or alarms, and are 
certain kinds of alarms useful at all? For visible information for the patient, from what angles can this 
information be viewed, how bright does the display need to be, should it be in different languages, or 
should universal symbols be used? A great deal of thought must be given to all aspects of device use, 
from charging of the batteries to dealing with unexpected events. All these issues must be resolved in the 
conceptual design phase so that the detailed phase will be successful. 

Detailed Design — Execute Plan 

This is the phase of engineering design where the designer and other members of the team must begin 
to do what is generally considered the designer's more traditional role, i.e., calculations and drawings. 
This phase of design may require some initial prototyping and testing before the detailed design can be 
complete. Akutsu and Koyangi [1993] provide several results of various groups' detailed designs for 
artificial hearts and circulatory assist devices. 
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Learn and Generalize 

Substantial research and development of artificial hearts and circulatory assist devices has been ongoing 
for almost 30 years. During this period there have been literally thousands of publications related to this 
research, not only descriptions of device designs, but experimental results, detailed descriptions fluid 
dynamic phenomena, hemolysis, thrombosis, investigation of materials selection and processing, con- 
sideration of control issues, and so on. The artificial organs and biomaterials literature has numerous 
references to materials utilized in these devices. A thorough review of the literature is required to glean 
the general principles that can be applied to the design of these devices. Many of these principles or 
generalities apply only to specific designs. In some design circulatory assist devices, a smooth surface will 
function satisfactorily, whereas a rough surface may cause thrombosis or an uncontrolled growth of 
neointima. Yet, in other design devices, a textured or rough surface will have performance superior to 
that of an extremely smooth surface. Wide ranges of sheer stresses are quoted in the literature that can 
be hemolytic. Although considerable research has been performed examining the fluid mechanics of the 
artificial hearts and circulatory assist devices, there is really no current measure of what is considered a 
"good" blood flow within these devices. We know that the design must avoid regions of stasis, but how 
close to stasis one can come, or for how long, without thrombosis is unknown. 

It is up to the designer to review current literature and determine what fundamental principles are 
applicable to his or her design. Then, when the design is complete, the designer can learn and generalize 
specific principles related to her or his device. Hopefully, general principles that can apply to other devices 
will be elucidated. 

126.3 Conclusions 

The design of artificial hearts and circulatory assist devices is a very complex process involving many 
engineering disciplines along with medicine and other life science areas. Social issues must enter into the 
design process. The design of such devices requires sound engineering design principles and an interdis- 
ciplinary design team dedicated to the development and ultimate clinical application of these devices. 
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The first clinical use of a cardiac valvular prosthesis took place in 1952, when Dr. Charles Hufnagel 
implanted the first artificial caged ball valve for aortic insufficiency. The Plexiglas cage contained a ball 
occluder and was inserted into the descending aorta without the need for cardiopulmonary bypass. It 
did not cure the underlying disease, but it did relieve regurgitation from the lower two-thirds of the body. 
The first implant of a replacement valve in the anatomic position took place in 1960, with the advent 
of cardiopulmonary bypass. Since then, the achievements in valve design and the success of artificial 
heart valves as replacements have been remarkable [Roberts, 1976]. More than 50 different cardiac valves 
have been introduced over the past 35 years. Unfortunately, after many years of experience and success, 
problems associated with heart valve prostheses have not been eliminated. The most serious problems 
and complications [Bodnar, Frater, 1991; Butchart, Bodnar, 1992; Giddens et al. 1993; Roberts, 1976] are: 

■ Thrombosis and thromboembolism 

• Anticoagulant-related hemorrhage 

• Tissue overgrowth 

• Infection 

■ Paravalvular leaks due to healing defects, and 

■ Valve failure due to material fatigue or chemical change. 

New valve designs continue to be developed. Yet to understand the future of valve replacements, it is 
important to understand their history. 
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127.1 A Brief History of Heart Valve Prostheses 

This section on replacement valves highlights a relatively small number of the many various forms which 
have been made. However, those that have been included are either the most commonly used today or 
those which have made notable contributions to the advancement of replacement heart valves [Brewer, 
1969; Yoganathan et al, 1992]. 

Mechanical Valves 

The use of the caged-ball valve in the descending aorta became obsolete with the development in 1960 
of what today is referred to as the Starr-Edwards ball- and- cage valve. Similar in concept to the original 
Hufnagel valve, it was designed to be inserted in place of the excised diseased natural valve. This form 
of intracardiac valve replacement was used in the mitral position and for aortic and multiple replacements. 
Since 1962 the Starr-Edwards valve has undergone many modifications to improve its performance in 
terms of reduced hemolysis and thromboembolic complications. However, the changes have involved 
materials and techniques of construction and have not altered the overall concept of the valve design in 
any way (Fig. 127.1a). 

Other manufacturers have produced variations of the ball and cage valve, notably the Smeloff-Cutter 
valve and the Magovern Prosthesis. In the case of the former, the ball is slightly smaller than the orifice. 
A subcage on the proximal side of the valve retains the ball in the closed position with its equator in the 
plane of the sewing ring. A small clearance around the ball ensures easy passage of the ball into the 
orifice. This clearance also gave rise to a mild regurgitation which was felt, but not proven, to be beneficial 
in preventing thrombus formation. The Magovern valve is a standard ball-and-cage format which incor- 
porates two rows of interlocking mechanical teeth around the orifice ring. These teeth are used for 
inserting the valve and are activated by removing a special valve holder once the valve has been correctly 
located in the prepared tissue annulus. The potential hazard of dislocation from a calcified annulus due 
to imperfect placement was soon observed. This valve is no longer in use. 

Due to the high-profile design characteristics of the ball valves, especially in the mitral position, low- 
profile caged disc valves were developed in the mid-1960s. Examples of the caged disc designs are the 
Kay-Shiley and Beall prostheses, which were introduced in 1965 and 1967, respectively (Fig. 127.1b). 
These valves were used exclusively in the atrioventricular position. However, due to their inferior hemo- 
dynamic characteristics, caged disc valves are rarely used today. 




FIGURE 127.1 (a) Photograph of Starr-Edwards ball and cage valve; (b) photograph of Kay-Shiley disc valve; 
(c) photograph of Bjork-Shiley tilting disc valve; (d) photograph of Medtronic-Hall tilting disc valve; (e) photograph 
of St. lude bileaflet valve; if] photograph of CarboMedics bileaflet valve; (g) photograph of Parallel bileaflet valve. 
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FIGURE 127.1 (continued) 
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Even after 35 years of valve development, the ball-and-cage format remains the valve of choice for 
some surgeons. However, it is no longer the most popular mechanical valve, having been superseded, to 
a large extent, by tilting-disc and bileaflet valve designs. These valve designs overcome two major draw- 
backs of the ball valve, namely, high profile heights and excessive occluder-induced turbulence in the 
flow through and distal to the valve. 

The most significant developments in mechanical valve design occurred in 1969 and 1970 with the 
introduction of the Bjork-Shiley and Lillehei-Kaster tilting-disc valves (Fig. 127. 1 c). Both prostheses involve 
the concept of a free-floating disc which in the open position tilts to an angle depending on the design of 
the disc-retaining struts. In the original Bjork-Shiley valve, the angle of the tilt was 60° for the aortic and 
50° for the mitral model. The Lillehei-Kaster valve has a greater angle of tilt of 80° but in the closed position 
is preinclined to the valve orifice plane by an angle of 18°. In both cases the closed valve configuration 
permits the occluder to fit into the circumference of the inflow ring with virtually no overlap, thus reducing 
mechanical damage to erythrocytes. A small amount of regurgitation backflow induces a "washing out" 
effect of "debris" and platelets and theoretically reduces the incidence of thromboemboli. 

The obvious advantage of the tilting-disc valve is that in the open position it acts like an aerofoil in 
the blood flowing through the valve, and induced flow disturbance is substantially less than that obtained 
with a ball occluder. Although the original Bjork-Shiley valve employed a Delrin occluder, all present- 
day tilting-disc valves use pyrolitic carbon for these components. It should also be noted that the free- 
floating disc can rotate during normal function, thus preventing excessive contact wear from the retaining 
components on one particular part of the disc. Various improvements to this form of mechanical valve 
design have been developed but have tended to concentrate on alterations either to the disc geometry as 
in the Bjork-Shiley convexo-concave design or to the disc-retaining system as with the Medtronic-Hall 
and Omniscience valve designs (Fig. 127. Id). 

The Medtronic-Hall prosthesis was introduced in 1977. It is characterized by a central, disc-control 
strut, with a mitral opening angle of 70° and an aortic opening of 75°. An aperture in the flat, pyrolitic 
carbon-coated disc affixes it to the central guide strut. This strut not only retains the disc but controls 
its opening angle and allows it to move downstream 1.5-2.0 mm; this movement is termed disc translation 
and improves flow velocity between the orifice ring and the rim of the disc. The ring and strut combi- 
nation is machined from a single piece of titanium for durability. All projections into the orifice (pivot 
points, guide struts, and disc stop) are open-ended, streamlined, and in the region of highest velocity to 
prevent the retention of thrombi by valve components. The sewing ring is of knitted Teflon. The housing 
is rotatable within the sewing ring for optimal orientation of the valve within the tissue annulus. 

Perhaps the most interesting development has been that of the bileaflet all-pyrolitic carbon valve 
designed by St. Jude Medical, Inc. and introduced in 1978 (Fig. 127. le). This design incorporates two 
semicircular hinged pyrolitic carbon occluders (leaflets) which in the open position are intended to 
provide minimal disturbance to flow. The leaflets pivot within grooves made in the valve orifice housing. 
In the fully open position the flat leaflets are designed to open to an angle of 85°. The Duromedics valve 
is similar in concept to the St. Jude except that it incorporates curved leaflets. 

The CarboMedics bileaflet prosthesis gained FDA approval for U.S. distribution in 1993 (Fig. 127.1/). 
The CarboMedics valve is also made of Pyrolite, which is intended for durability and thromboresistance. 
The valve has a recessed pivot design and is rotatable within the sewing ring. The two leaflets are 
semicircular, radiopaque, and open to an angle of 78°. A titanium stiffening ring is used to lessen the 
risk of leaflet dislodgment or impingement. 

The most recent bileaflet design is the Parallel valve from Medtronic, Inc. (Fig. 127. lg). The significant 
design aspect of the Parallel valve is the ability of its leaflets to open to a position parallel to flow. This 
is intended to reduce the amount of turbulence that is created in the blood and therefore should improve 
hemodynamics and reduce thromboembolic complications. European clinical implants began in the 
Spring of 1994. 

The most popular valve design in use today is the bileaflet. Approximately 75% of the valves implanted 
today are bileaflet prostheses. 
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Tissue Valves 

Two major disadvantages with the use of mechanical valves is the need for life-long anticoagulation 
therapy and the accompanying problems of bleeding [Butchart & Bodnar, 1992]. Furthermore, the 
hemodynamic function of even the best designed valves differs significantly from that of the natural 
healthy heart valve. An obvious step in the development of heart valve substitutes was the use of naturally 
occurring heart valves. This was the basis of the approach to the use of antibiotic or cryotreated human 
aortic valves (homografts: from another member of the same species) removed from cadavers for implan- 
tation in place of a patient's own diseased valve. 

The first of these homograft procedures was performed by Ross in 1962, and the overall results so far 
have been satisfactory. This is, perhaps, not surprising since the homograft replacement valve is optimum 
both from the point of view of structure and function. In the open position these valves provide 
unobstructed central orifice flow and have the ability to respond to deformations induced by the sur- 
rounding anatomical structure. As a result, such substitutes are less damaging to the blood when com- 
pared with the rigid mechanical valve. The main problem with these cadaveric allografts, as far as may 
be ascertained, is that they are no longer living tissue and therefore lack that unique quality of cellular 
regeneration typical of normal living systems. This makes them more vulnerable to long-term damage. 
Furthermore, they are only available in very limited quantities. 

An alternative approach is to transplant the patient's own pulmonary valve into the aortic position. 
This operation was also the first carried out by Ross in 1967, and his study of 176 patients followed up 
over 13 years showed that such transplants continued to be viable in their new position with no apparent 
degeneration [Wain et al., 1980]. This transplantation technique is, however, limited in that is can only 
be applied to one site. 

The next stage in development of tissue valve substitutes was the use of autologous fascia lata (a wide 
layer of membrane that encases the thigh muscles) either as free or frame-mounted leaflets. The former 
approach for aortic valve replacement was reported by Senning in 1966, and details of a frame-mounted 
technique were published by Ionescu and Ross in 1966 [Ionescu, 1969] The approach combined the more 
natural leaflet format with a readily available living autologous tissue. Although early results seemed 
encouraging, Senning expressed his own doubt on the value of this approach in 1971, and by 1978 fascia 
lata was no longer used in either of the above, or any other, form of valve replacement. The failure of 
this technique was due to the inadequate strength of this tissue when subjected to long-term cyclic 
stressing in the heart. 

In parallel with the work on fascia lata valves, alternative forms of tissue leaflet valves were being 
developed. In these designs, however, more emphasis was placed on optimum performance characteristics 
than on the use of living tissue. In all cases the configuration involved a three-leaflet format which was 
maintained by the use of a suitably designed mounting frame. It was realized that naturally occurring 
animal tissues, if used in an untreated form, would be rejected by the host. Consequently, a method of 
chemical treatment had to be found which prevented this antigenic response but did not degrade the 
mechanical strength of the tissue. 

Formaldehyde has been used by histologists for many years to arrest autolysis and "fix" tissue in the 
state in which it is removed. It had been used to preserve biologic tissues in cardiac surgery but, 
unfortunately, was found to produce shrinkage and increase the stiffness of the resulting material. For 
these reasons, formaldehyde was not considered ideal as a method of tissue treatment. 

Glutaraldehyde is another histologic fixative which has been used especially for preserving fine detail 
for electron microscopy. It is also used as a tanning agent by the leather industry. In addition to arresting 
autolysis, glutaraldehyde produces a more flexible material due to increased collagen crosslinking. Glu- 
taraldehyde has the additional ability of reducing the antigenicity of xenograft tissue to a level at which 
it can be implanted into the heart without significant immunologic reaction. 

In 1969, Kaiser and coworkers described a valve substitute using an explanted glutaral dehyde-treated 
porcine aortic valve which was mounted on to a rigid support frame. Following modification in which 
the rigid frame was replaced by a frame having a rigid base ring with flexible posts, this valve became 
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FIGURE 127.2 (a) Photograph of Hancock porcine valve; (b) photograph of Carpentier-Edwards pericardial valve. 

commercially available as the Hancock Porcine Xenograft in 1970 (Fig. 127.2a). It remains one of the two 
most popular valve substitutes of this type, the other being the Carpentier-Edwards Bioprosthesis intro- 
duced commercially by Edwards Laboratories in 1976. This latter valve uses a totally flexible support frame. 

In 1977 production began of the Hancock Modified Orifice (M.O.) valve, a refinement of the Hancock 
Standard valve. The Hancock M.O. is of a composite nature — the right coronary leaflet containing the 
muscle shelf is replaced by a noncoronary leaflet of the correct size from another porcine valve. This high- 
pressure fixed valve is mounted into a Dacron-covered polypropylene stent. The Hancock II and Carpentier- 
Edwards supra-annular porcine bioprostheses are second-generation bioprosthetic valve designs which were 
introduced in the early 1980s. The porcine tissue is initially fixed at 1.5 mmHg and then at high pressure. 
This fixation method is designed to ensure good tissue geometry. Both valves are treated with antiminer- 
alization treatments. Neither valve has been FDA approved for clinical use in the United States. 

In porcine prostheses, the use of the intact biologically formed valve makes it unnecessary to manu- 
facture individual valve cusps. Although this has the obvious advantage of reduced complexity of con- 
struction, it does require a facility for harvesting an adequate quantity of valves so that an appropriate 
range of valve sizes of suitable quality can be made available. This latter problem did not occur in the 
production of the three-leaflet calf pericardium valve developed by Ionescu and colleagues; the construc- 
tion of this valve involved the molding of fresh tissue to a tricuspid configuration around a support 
frame. As the tissue is held in this position, it is treated with a glutaraldehyde solution. The valve, marketed 
in 1976 as the Ionescu-Shiley Pericardial Xenograft, was discontinued in the mid-1980s due to structural 
failure problems. Early clinical results obtained with tissue valves indicated their superiority to mechanical 
valves with respect to a lower incidence of thromboembolic complications [Bodnar & Yacoub 1991]. For 
this reason the use of tissue valves increased significantly during the late 1970s. 

The Carpentier-Edwards pericardial valve consists of three pieces of pericardium mounted completely 
within the Elgiloy wire stent to reduce potential abrasion between the Dacron-covered frame and the 
leaflets. The pericardium is retained inside the stent by a Mylar button rather than by holding sutures. 
Its clinical implantation began in )uly 1980, and it is currently approved for clinical use in the United 
States (Fig. 127.2&). 

Clinical experiences with different tissue valve designs have increasingly indicated time-dependent 
(5- to 7-year) structural changes such as calcification and leaflet wear, leading to valve failure and 
subsequent replacement [Ferrans et al., 1980; Oyer et al., 1979; Bodnar, Yacoub, 1986]. The problem of 
valve leaflet calcification is more prevalent in children and young adults. Therefore, tissue valves are rarely 
used in children and young adults at the present time. Such problems have not been eliminated by the 
glutaraldehyde tanning methods so far employed, and it is not easy to see how these drawbacks are to 
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be overcome unless either living autologous tissue is used or the original structure of the collagen and 
elastin are chemically enhanced. On the latter point there is, as yet, much room for further work. For 
instance, the fixing of calf pericardium under tension during the molding of the valve cusps will inevitably 
produce "locked-in" stresses during fixation, thus changing the mechanical properties of the tissue. 

127.2 Current Types of Prostheses 

At present, over 175,000 prosthetic valves are implanted each year throughout the world. Currently, the 
four most commonly used basic types of prostheses are: 

1. Caged ball 

2. Tilting disc 

3. Bileaflet 

4. Bioprostheses (tissue valves) 

Valve manufacturers continue to develop new designs of mechanical and tissue valves. The ideal heart 
valve prosthesis does not yet exist and may never be realized. However, the characteristics of the "perfect" 
prostheses should be noted. The ideal heart valve should: 

• Be fully sterile at the time of implantation and be nontoxic 

• Be surgically convenient to insert at or near the normal location of the heart 

• Conform to the heart structure rather than the heart structure conforming to the valve (i.e., the 
size and shape of the prosthesis should not interfere with cardiac function) 

• Show a minimum resistance to flow so as to prevent a significant pressure drop across the valve 

• Have a minimal reverse flow necessary for valve closure, so as to keep the incompetence of the 
valve at a low level 

• Show long resistance to mechanical and structural wear 

• Be long-lasting (25 years) and maintain its normal functional performance (i.e., must not dete- 
riorate over time) 

• Cause minimal trauma to blood elements and the endothelial tissue of the cardiovascular structure 
surrounding the valve 

• Show a low probability for thromboembolic complications without the use of anticoagulants 

• Be sufficiently quiet so as not to disturb the patient 

• Be radiographically visible 

• Have an acceptable cost 

127.3 Tissue Versus Mechanical 

Tissue prostheses gained widespread use during the mid-1970s. The major advantage of tissue valves 
compared to mechanical valves is that tissue valves have a lower incidence of thromboembolic compli- 
cations [Butchart & Bodnar, 1992]. Therefore, most patients receiving tissue valves do not have to take 
anticoagulants long-term. The major disadvantages to tissue valves are large pressure drops compared 
to some mechanical valves (particularly in the smaller valve sizes), jetlike flow through the valve leaflets, 
material fatigue and/or wear of valve leaflets, and calcification of value leaflets, especially in children and 
young adults. Valve deterioration, however, usually takes place slowly with tissue valves, and patients can 
be monitored by echocardiography and other noninvasive techniques. 

The clear advantage of mechanical valves is their long-term durability. Current mechanical valves are 
manufactured from a variety of materials, such as pyrolitic carbon and titanium. Structural failure of 
mechanical valves is rare, but, when it occurs, is usually catastrophic [Giddens et al., 1993]. One major 
disadvantage of the use of mechanical valves is the need for continuous, life-long anticoagulation therapy 
to minimize the risk of thrombosis and thromboembolic complications. Unfortunately, the anticoagulation 
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therapy may lead to bleeding problems; therefore, careful control of anticoagulation medication is 
essential for the patient's well-being and quality of life. Another concern is the hemodynamic performance 
of the prosthesis. The hemodynamic function of even the best designs of mechanical valves differs 
significantly from that of normal heart valves. 

127.4 Engineering Concerns and Hemodynamic Assessment 
of Prosthetic Heart Valves 

In terms of considerations related to heart valve design, the basic engineering concerns are 

■ Hydrodynamics/hemodynamics 

■ Durability (structural mechanics and materials) 

■ Biologic response to the prosthetic implant 

The ideal heart valve design from the hemodynamic point of view should [Giddens et al., 1993] 

■ Produce minimal pressure gradient 

■ Yield relatively small regurgitation 

■ Minimize production of turbulence 

• Not induce regions of high shear stress 

• Contain no stagnation or separation regions in its flow field, especially adjacent to the valve 
superstructure 

No valve as yet, other than normal native valves, satisfies all these criteria. 

Pressure Gradient 

The heart works to maintain adequate blood flow through a prosthetic valve; a well-designed valve will 
not significantly impede that blood flow and will therefore have as small a pressure gradient as possible 
across the valve. 

Because of the larger separation region inherent in flow over bluff bodies, configurations such as the 
caged disc and caged ball have notably large pressure gradients. Porcine bioprostheses have relatively 
acceptable pressure gradients for larger diameter valves because they more closely mimic natural valve 
geometry and motion, but the small sizes (<23 mm) generally have higher pressure gradients than their 
mechanical valve counterparts, as shown in Fig. 127.3 [Yoganathan et al., 1984]. Tilting disc and bileaflet 
valve designs present a relatively streamlined configuration to the flow, and, although separation regions 
may exist in these designs, the pressure gradients are typically smaller than for the bluff shapes. The 
clinical importance of pressure gradients in predicting long-term performance is not clear. The fact that 
these gradients are a manifestation of energy losses resulting from viscous-related phenomena makes it 
intuitive that minimizing pressure gradients across an artificial valve is highly desirable in order to reduce 
the workload of the pump (i.e., left ventricle). 

Effective Orifice Area (EOA) 

The EOA is an index of how well a valve design utilizes its primary or internal stent orifice area. In other 
words, it is related to the degree to which the prosthesis itself obstructs the flow of blood. A larger EOA 
corresponds to a smaller pressure drop and therefore a smaller energy loss. It is desirable to have as large 
an EOA as possible. EOA is calculated from in vitro pressure drop measurements for a particular valve 
using the following formula [Yoganathan et al., 1984]: 

EOA(cm 2 )= rms 
51.6>p 
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FIGURE 127.3 Examples of in vitro pulsatile flow pressure gradients across tilting disc (Bjork-Shiley convexo- 
concave), bileaflet (St. Jude Medical), and porcine aortic valves of three different sizes (27, 25, and 21 mm). 



TABLE 127.1 Effecive Orifice Areas of Different Prosthetic Aortic Valve Designs 



Valve Sewing 


Medronic-Hall 


St. Jude 


Carbomedics 


Hancock I 


Hancock MO 


Carpentier-Edwards 


Starr-Edwards 


Ring Diam, 


Tilting Disc, 


Bileaflet 


Bileaflet 


Porcine, 


Porcine, 


Pericardial, 


1260 Ball, 


mm 


cm 2 


cm 2 


cm 2 


cm 2 


cm 2 


cm 2 


cm 2 


19/20 


1.74 


1.21 


1.12 


1.01 


1.22 


1.56 


1.04 


21 


1.74 


1.81 


1.66 


1.31 


1.43 


1.88 


1.23 


23 


2.26 


2.24 


2.28 


1.73 


1.94 


2.25 


1.45 


25 


3.07 


3.23 


3.14 


1.93 


2.16 


3.25 


1.59 


27 


3.64 


4.05 


3.75 


2.14 


— 


3.70 


1.75 



Q rms is the root mean square systolic/diastolic flow rate (cm 3 /s), Ap is the mean systolic/diastolic pressure 
drop (mmHg). 

Table 127.1 lists EOAs obtained in vitro, for different size mechanical and tissue valve designs in clinical 
use today. These results illustrate the fact that, size for size, the newer mechanical valve designs have 
better pressure gradient characteristics than porcine bioprostheses in current clinical use. 

Regurgitation 

Regurgitation results from reverse flow created during valve closure and from backward leakage once 
closure is effected (see Fig. 127.4). Regurgitation reduces the net flow through the valve. Closing regur- 
gitation is closely related to the valve shape and closing dynamics, and the percentage of stroke volume 
that succumbs to this effect ranges from 2.0-7.5% for mechanical valves. For tissue valves it is typically 
less: 0. 1-1.5%. Leakage depends upon how well the orifices are "sealed" upon closure, and it has a reported 
incidence of 0-10% in mechanical valves and 0.2-3% in bioprosthetic valves. The overall tendency is for 
regurgitation to be less for the trileaflet bioprosthetic heart valves than for mechanical valve designs. 
Figure 127.5 illustrates in vitro regurgitant volumes (closing and leakage) measured on three commonly 
used mechanical valve designs in the aortic and mitral positions. 
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FIGURE 127.4 Flow cycle divided into forward flow, closing volume, and leakage volume. 
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FIGURE 127.5 Examples of in vitro reguritant volumes (closing and leakage) with three mechanical valve designs 
(MH — Medtronic-Hall; SJM — St. Jude Medical; B — Bjork-Shiley mono-strut). 

Regurgitation has implications other than simply for flow delivery. On the negative side, back flow 
through a narrow slit, such as can occur in leakage regurgitation through a bileaflet valve, can create 
relatively high laminar shear stresses, thus increasing the tendency toward blood cell damage [Baldwin, 
1990; Cape et al., 1993]. However, regurgitation can have a beneficial effect in that the back flow over 
surfaces may serve to wash out zones that would otherwise have stagnant flow throughout the cycle. This 
is particularly true for the "hinge" region in some tilting disc and bileaflet designs. 

Flow Patterns and Turbulent Shear Stresses 

Thrombosis and embolism, tissue overgrowth, hemolysis, and damage to endothelium adjacent to the 
valve are directly related to the velocity and turbulence fields created by various valve designs and have 
been addressed in detail during the past decade by investigators studying cardiovascular fluid mechanics 
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[Chandran et al, 1983, 1984; Woo &Yoganathan 1985, 1986; Yoganathan et al., 1986a, 1986b; Yoganathan 
etal., 1988]. 

It has been established that shear stresses on the order of 1500-4000 dynes/cm 2 can cause lethal damage 
to red cells [Nevaril et al., 1969]. However, in the presence of foreign surfaces, red cells can be destroyed 
by shear stresses on the order of 10-100 dynes/cm 2 [Mohandas etal., 1974]. It has also been observed 
that sublethal damage to red cells can occur at turbulent shear stress levels of 500 dynes/cm 2 [Sutera & 
Merjhardi, 1975]. Platelets appear to be more sensitive to shear and can be damaged by shear stress on 
the order of 100-500 dynes/cm 2 [Ramstack, et al., 1979; Wurzinger etal., 1983]. Evidence that platelet 
activation, aggregation, and thrombosis is induced by fluid shear forces has been predominantly generated 
by viscometric studies performed under well-defined fluid mechanical conditions. In viscometers, the 
extent and reversibility of shear-induced platelet aggregation are a function of both the magnitude and 
duration of the applied shear stress. For example, at 150 dynes/cm 2 , platelet aggregation is not observed 
until shear is applied for 300 s. But, as the intensity of the applied shear stress increases, platelet activation 
and aggregation occur more rapidly. For example, at a shear stress of 600 dynes/cm 2 , platelet aggregation 
occurs within 30 s, and at 6500 dynes/cm 2 , platelet activation occurs in fewer than 5 ms. As the magnitude 
of the applied shear increases, formed platelet aggregates tend not to separate when the shear forces are 
discontinued. Furthermore, platelet damage increases linearly with time of exposure to constant shear 
stress, indicating that shear-induced platelet damage is cumulative [Anderson & Heliums, 1978; Brown 
etal., 1977; Colantuoni etal, 1977]. 

Although the exact mechanism of turbulent stress damage to the cell is not precisely known, there is 
no disagreement that cell damage can be created by high turbulent stresses; minimizing these is conducive 
to better valve performance from the standpoints of thrombus formation, thromboembolic complica- 
tions, and hemolysis and from energy loss considerations. 

To illustrate the abnormal flow fields and elevated levels of turbulent shear stresses created by prosthetic 
valves, in vitro measurements conducted on 27-mm aortic valve designs in current clinical use in the 
United States (i.e., FDA approved) are presented below. 

The figures of the in vitro flow field studies are presented as schematic diagrams and represent velocity 
and turbulence profiles obtained at peak systole, at a cardiac output of 6.01/min and a heart rate of 
70 beats/min. All downstream distances are measured from the valve sewing ring. Tables 127.2 and 127.3 
list the maximum and cross sectionally averaged mean turbulent shear stresses measured downstream 
of the valves at different times during systole and diastole. 

Starr-Edwards Caged-Ball Valve (Model 1260) 

The flow emerging from the valve formed a circumferential jet that separated from the ball, hit the wall 
of the flow chamber, and then flowed along the wall. The flow had very high velocities in the annular 
region. The maximum velocity, measured 12 mm downstream of the valve, was 220 cm/s at peak systole. 
The peak systolic velocity, measured 30 mm downstream of the valve, was 180 cm/s, as shown in Fig. 127.6. 
High-velocity gradients were observed at the edges of the jet. The maximum velocity gradient (1700 cm 
s'Vcm) was observed in the annular region adjacent to the surface of the ball during peak systole. A large- 
velocity defect was observed in the central part of the flow chamber as a wake developed distal to the 
ball. A region of low velocity reverse flow was observed at peak systole and during the deceleration phase, 
with a diameter of about 8 mm immediately distal to the apex of the cage. The maximum reverse velocity 
measured was -25 cm/s; it occurred at peak systole 30 mm downstream of the valve (Fig. 127.6). The 
intensity of the reverse flow during the deceleration phase was not as high as that observed at peak systole. 
No reverse flow was observed during the acceleration phase. However, the velocity in the central part of 
the flow channel was low. 

High turbulent shear stresses were observed at the edges of the jet. The maximum turbulent shear 
stress measured was 1850 dynes/cm 2 , which occurred at the location of the highest-velocity gradient 
(Fig. 127.6). The intensity of turbulence during peak systole did not decay very rapidly downstream of 
the valve. Elevated turbulent shear stresses occurred during most of systole. Turbulent shear stresses as 
high as 3500 dynes/cm 2 were estimated in the annular region between the flow channel wall and the ball. 
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TABLE 127.2 Peak and Mean Turbulent Shear Stresses Measured Downstream of Different Aortic Valve Designs 







Acceleration Phase 


Peak 


Systole 


Deceleration Phase 




Peak, 


Mean, 


Peak, 


Mean, 


Peak, 


Mean, 


Valve 


Location, mm 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


Starr-Edwards (1260) 


26, centerline 


750 


450 


1800 


1000 


1400 


700 




30, centerline 


600 


200 


1850 


1100 


1300 


750 


Medtronic-Hall 


7, major orifice 


450 


250 


1200 


450 


600 


300 




7, minor orifice 


950 


400 


1000 


350 


850 


450 




13, centerline 


1200 


600 


1000 


550 


800 


400 




13, major orifice 


400 


320 


1500 


370 


700 


300 




13, minor orifice 


1250 


550 


1450 


700 


700 


450 




16, centerline 


300 


100 


2000 


1000 


850 


450 




15, 90 degree 


300 


170 


1450 


700 


900 


450 




rotated (Fig.) 














St. Jude 


8, centerline 


820 


450 


1150 


500 


600 


320 




8, 6.25mm lateral 


1600 


1050 


2000 


1000 


1000 


650 




to centerline 
















13, centerline 


950 


470 


1500 


750 


1400 


900 




13, 6.25mm lateral 


1400 


800 


2000 


1050 


1000 


700 




to centerline 
















11,90-degree 


950 


550 


1700 


1200 


1000 


700 




rotated (Fig.) 














Carpentier-Edwards 


10, centerline 


900 


400 


2750 


1200 


1750 


1000 


(2625 Porcine) 


15, centerline 


1400 


700 


4500 


2000 


1700 


900 


Hancock MO 


10, centerline 


1000 


400 


2900 


1100 


1150 


550 


(250 Porcine) 


15, centerline 


1750 


950 


2450 


1900 


2100 


1400 


Carpentier Edwards 


17, centerline 


500 


100 


850 


200 


1000 


350 


(Pericardial) 


33, centerline 


850 


200 


1130 


450 


900 


370 



Medtronic-Hall Tilting Disc Valve 

High-velocity jetlike flows were observed from both the major and minor orifice outflow regions. The 
orientation of the jets with respect to the axial direction changed as the valve opened and closed. The 
major orifice jet was larger than the minor orifice jet and had a slightly higher velocity. The peak velocities 
measured 7 mm downstream of the valve were 210 cm/s and 200 cm/s in the major and the minor orifice 
regions, respectively (Fig. 127.7). A region of reverse flow was observed adjacent to the wall in the minor 
orifice region at peak systole, which extended 2 mm from the wall with a maximum reverse velocity of 
-25 cm/s. The size of this region increased during the deceleration phase to 8 mm from the wall. A small 
region of flow separation was observed adjacent to the wall in the major orifice region as illustrated by 
Fig. 127.7. In the minor orifice region, a profound velocity defect was observed 7 and 11 mm distal to 
the minor orifice strut (Fig. 127.7). Furthermore, the region adjacent to the wall immediately downstream 
from the minor orifice was stagnant during the acceleration and deceleration phases and had very low 
velocities (<15 cm/s) during peak systole. 

In the major orifice region, high turbulent shear stresses were confined to narrow regions at the edges 
of the major orifice jet (Fig. 127.7). The peak turbulent shear stresses measured at peak systole were 1200 
and 1500 dynes/cm 2 , 7 and 13 mm downstream of the valve, respectively. During the acceleration and 
deceleration phases the turbulent shear stresses were relatively low. High turbulent shear stresses were 
more dispersed in the minor than those in the major orifice region as shown by Fig. 127.7. The turbulent 
shear stress profiles across the major and minor orifices 15mm downstream of the valve (Fig. 127.7) 
showed a maximum turbulent shear stress of 1450 dynes/cm 2 at the lower edge of the minor orifice jet. 

St. Jude Medical Bileaflet Valve 

The St. Jude Medical valve has two semicircular leaflets which divide the area available for forward flow 
into three regions: two lateral orifices and a central orifice. The major part of the forward flow emerged 
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TABLE 127.3 Peak and Mean Turbulent Shear Stresses Measured Downstream of Different Mitral Valve Designs 







Acceleration Phase 


Peak 


Systole 


Deceleration Phase 




Peak, 


Mean, 


Peak, 


Mean, 


Peak, 


Mean, 


Valve 


Location, mm 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


dynes/cm 2 


BeaU 


11, centerline 


500 


150 


1950 


450 


800 


250 




11, 10 mm above 


360 


150 


1300 


400 


425 


150 




centerline 
















17, centerline 


375 


125 


700 


225 


400 


175 


Bjork-Shiley 


8, 13 mm above 


320 


70 


235 


60 


240 


60 


(cc) 


centerline 
















8, 10 mm below 


40 


18 


30 


17 


25 


19 




centerline 
















12, centerline 


100 


40 


330 


75 


150 


55 




15, 90-degree rotated 


310 


85 


375 


100 


280 


60 




18, centerline 


90 


60 


300 


110 


225 


80 




18, 13 mm above 


155 


80 


350 


240 


210 


140 




centerline 
















18, 10 mm below 


60 


25 


210 


70 


70 


25 




centerline 














Medtronic-Hall 


8, centerline 


300 


170 


1800 


400 


650 


180 




8, 12mm above line 


850 


300 


1150 


450 


500 


200 




center 
















8, 9 mm below 


900 


230 


1200 


350 


225 


130 




centerline 
















14, 90-degree rotated 


620 


230 


1200 


320 


670 


230 




18, centerline 


340 


100 


500 


230 


300 


150 




18, 12 mm above 


600 


230 


2600 


900 


950 


500 




centerline 
















18, 9 mm below 


280 


130 


670 


280 


360 


180 




centerline 














St. Jude 


10, centerline 


170 


95 


250 


95 


175 


85 




10, 10 mm above 


310 


120 


440 


150 


185 


100 




centerline 
















12, 90-degree rotated 


725 


270 


770 


300 


670 


250 




19, centerline 


130 


80 


335 


130 


165 


90 




19, 10 mm above 


115 


50 


575 


170 


250 


110 




centerline 















from the two lateral orifices. The measurements along the centerline plane 8 mm downstream of the 
valve showed at peak systole a maximum velocity of 220 cm/s and 200 cm/s for the lateral and central 
orifice jets, respectively. The velocity of the jets remained about the same as the flow traveled from 8 to 
13 mm downstream (Fig. 127.8). The velocity profiles showed two defects which corresponded to the 
location of the two leaflets. The velocity measurements indicated that the flow was more evenly distributed 
across the flow chamber during the deceleration than during the acceleration phase. Regions of flow 
separation were observed around the jets adjacent to the flow channel wall as the flow separated from 
the orifice ring. The measurements across the central orifice illustrated in Fig. 127.8 show that the 
maximum velocity in the central orifice was 220 cm/s. Small regions of low-velocity reverse flow were 
observed adjacent to the pivot/hinge mechanism of the valve (Fig. 127.8). More flow emerged from the 
central orifice during the deceleration phase than during the acceleration phase. 

High turbulent shear stresses occurred at locations of high-velocity gradients and at locations imme- 
diately distal to the valve leaflets (Fig. 127.8). The flow along the centerline plane became more disturbed 
as the flow traveled from 8 to 1 3 mm downstream of the valve. The peak turbulent shear stresses measured 
along the centerline plane at peak systole were 1150 and 1500 dynes/cm 2 at 8 and 13 mm downstream 
of the valve, respectively. The profiles across the central orifice showed that the flow was very disturbed 
in this region. The maximum turbulent shear stress measured in the central orifice as shown in Fig. 127.8 
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AORTIC: STARR-EDWARDS 



AORTIC: STARR-EDWARDS 





NUMBERS ARE VELOCITIES IN CM/S NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM 1 

(a) " (b) 

FIGURE 127.6 (a) Velocity profile 30 mm downstream on the centerline for the Starr-Edwards ball valve, at peak systole; 
( b) turbulent shear stress profile 30 mm downstream on the centerline for the Starr-Edwards ball valve, at peak systole. 

(1700 dynes/cm 2 ) occurred peak systole. Since these high turbulent shear stresses across the central orifice 
were measured 11 mm downstream, it is probable that even higher turbulent shear stresses occurred 
closer to the valve. 

Carpentier-Edwards Porcine Valve (Model 2625) 

The velocity profiles taken 10 mm downstream of the valve, along the centerline plane, showed that the 
peak velocity of the jetlike flow emerging from the valve was as high as 220 cm/s at peak systole 
(Fig. 127.9). The peak velocities measured during the acceleration and deceleration phases were about 
the same, 175 and 170 cm/s, respectively. However, the flow was much more evenly distributed during 
the acceleration than during the deceleration phase. No regions of flow separation were observed through- 
out the systolic period in this plane of measurement. However, the annular region between the outflow 
surfaces of the leaflets and the flow chamber wall was relatively stagnant throughout systole. The velocity 
of the jet increased to about 370 cm/s at peak systole, as the flow traveled from 10 to 15 mm downstream 
of the valve. This indicated that the flow tended to accelerate toward the center of the flow channel. 

High turbulent shear stresses occurred at the edge of the jet (Fig. 127.9). The maximum turbulent 
shear stress measured 10 mm downstream of the valve along the centerline plane at peak systole was 
2750 dynes/cm 2 . The turbulent shear stresses at the edge of the jet increased as the flow traveled from 
10 to 15 mm downstream of the valve. The maximum and mean turbulent shear stress measured at peak 
systole increased to 4500 and 2000 dynes/cm 2 , respectively (Table 127.2). 

Hancock Modified Orifice Porcine Valve (Model 250) 

In this design, a 25-mm valve was studied, since a 27-mm valve is not manufactured. The velocity 
measurements showed that this valve design also produced a high-velocity jetlike flow field with a 
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(a) NUMBERS ARE VELOCITIES IN CM/S (b) NUMBERS ARE VELOCITIES IN CM/S 

AORTIC: MEDTRONIC-HALL 




(c) NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM 1 

FIGURE 127.7 (a) Velocity profile 15 mm downstream on the centerline across the major and minor orifices of 
the Medtronic-Hall tilting disc valve (major orifice to the right), at peak systole; (b) velocity profiles 13 mm 
downstream in the major and minor orifices of the Medtronic-Hall tilting disc valve, at peak systole; ( c) turbulent 
shear stress profile 15 mm downstream on the centerline across the major and minor orifices of the Medtronic-Hall 
tilting disc valve (major orifice to the right), at peak systole; (d) turbulent shear stress profiles 13 mm downstream 
in the major and minor orifices of the Medtronic-Hall tilting disc valve, at peak systole. 
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AORTIC: MEDTRONIC-HALL 




(d) NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM 2 
FIGURE 127.7 (continued) 

maximum velocity of 330 cm/s, which was measured along the centerline plane, 10 mm downstream. 
The jet, however, started to dissipate very rapidly as it flowed downstream. The maximum velocity 
measured 15 mm downstream of the valve was 180 cm/s, as shown in Fig. 127.10. A velocity defect was 
observed 15 mm downstream in the central part of the flow channel peak systole and during the 
deceleration phase but was not observed along the centerline plane 10 mm downstream of the valve. 
Once again, the annular region between the outflow surfaces of the leaflets and the flow chamber wall 
was relatively stagnant during systole. 

Turbulent shear stress measurements showed that the high turbulent shear stresses measured 10 mm 
downstream of the valve were confined to a narrow region at the edge of the jet, with a peak valve of 
2900 dynes/cm 2 (Table 127.2). This peak turbulent shear stress decreased to 2400 dynes/cm 2 as the flow 
traveled from 10 to 15 mm downstream of the valve (Fig. 127.10). The region of high turbulence, however, 
became more diffuse as a result of energy dissipation. 

Carpentier-Edwards Pericardial Valve (Model 2900) 

The velocity profiles obtained along the centerline plane 17 mm downstream of the valve at peak systole 
showed a maximum velocity of 180 cm/s. The maximum velocities measured during the acceleration 
and deceleration phases were 120 and 80 cm/s, respectively. A region of flow separation which extended 
about 6 mm from the wall was observed at peak systole and during the deceleration phase. This region 
was relatively stagnant during the acceleration phase. The maximum velocity of the jet at peak systole 
did not change as the flow field traveled from 17 to 33 mm downstream of the valve (Fig. 127.11). 
However, the size of the region of flow separation decreased and extended only 1 mm from the wall. 

Turbulent shear stress measurements taken along the centerline plane 17 mm downstream of the valve 
showed that, during the deceleration phase, elevated turbulent shear stresses were spread out over a wide 
region (with a maximum value of 100 dynes/ cm 2 ). At peak systole, the high turbulent shear stresses were 
confined to a narrow region, with a maximum value of 850 dynes/cm 2 (Fig. 127.11). The intensity of 
turbulence at peak systole increased as the flow traveled from 17 to 33 mm downstream of the valve 
(Table 127.7). 
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AORTIC: ST. JUDE 
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(a) NUMBERS ARE VELOCITIES IN CM/S 

AORTIC: ST. JUDE 



(b) NUMBERS ARE VELOCITIES IN CM/S 

AORTIC: ST. JUDE 




(c) 



NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM" (d ) 




NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM 2 

FIGURE 127.8 (a) Velocity profile 13 mm downstream on the centerline for the St. Jude Medical bileaflet valve, at 
peak systole; (b) velocity profile 13 mm downstream across the central orifice for the St. Jude Medical bileaflet valve, 
at peak systole; (c) turbulent shear stress profile 13 mm downstream of the centerline orifice for the St. Jude Medical 
bileaflet valve, at peak systole; (d) turbulent shear stress profile 13 mm downstream across the central orifice for the 
St. Jude Medical bileaflet valve, at peak systole. 
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AORTIC: C-E PORCINE (2625) 



AORTIC: C-E 2625 PORCINE 





NUMBERS ARE VELOCITIES IN CM/S 

(a) 



NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM 2 

(b) 



FIGURE 127.9 (a) Velocity profile 15 mm downstream on the centerline for the Carpentier-Edwards 2625 porcine 
valve, at peak systole; (b) turbulent shear stress profile 15 mm downstream on the centerline for the Carpentier- 
Edwards 2625 porcine valve, at peak systole. 



AORTIC: HANCOCK (MO) PORCINE 



AORTIC: HANCOCK (MO) PORCINE 





(a) 



(b) 



FIGURE 127.10 (a) Velocity profile 15 mm downstream on the center for the Hancock modified orifice porcine 
valve, at peak systole; (b) turbulent shear stress profile 15 mm downstream on the centerline for the Hancock modified 
orifice porcine valve, at peak systole. 
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AORTIC: C-E PERICARDIAL 



AORTIC: C-E PERICARDIAL 





(a) 



NUMBERS ARE TURBULENT SHEAR STRESSES IN DYNES/CM 2 

(b) 



FIGURE 127.11 (a) Velocity profile 17 mm downstream on the centerline for the Carpentier-Edwards pericardial 
valve, at peak systole; (b) turbulent shear stress profile 17 mm downstream on the centerline for the Carpentier- 
Edwards pericardial valve, at peak systole. 

127.5 Implications for Thrombus Deposition 

In the vicinity of mechanical aortic heart valves, where peak shear stresses can easily exceed 1500 dynes/cm 2 
and mean shear stresses are frequently in the range of 200-600 dynes/cm 2 (Table 127.2), platelet activation 
and aggregation can readily occur. Data indicating that shear-induced platelet damage is cumulative are 
particularly relevant to heart valves. During an individual excursion through the replacement valve, the 
combination of shear magnitude and exposure time may not induce platelet aggregation. However, as a 
result of multiple journeys through the artificial valve, shear- induced damage may accumulate to a degree 
sufficient to promote thrombosis and subsequent embolization. 

All the aortic and mitral valve designs (mechanical and tissue) studied created mean turbulent shear 
stress in excess of 200 dynes/cm 2 during the major portion of systole and diastole (Tables 127.2 and 
127.3), which could lead to damage to blood elements. In the case of mechanical prostheses, due to the 
presence of foreign surfaces, the chances for blood cell damage are increased. Furthermore, the regions 
of flow stagnation and/or flow separation that occur adjacent to the superstructures of these valve designs, 
could promote the deposition of damaged blood elements, leading to thrombus formation on the 
prosthesis. 

127.6 Durability 

The performance of prosthetic valves is in several ways related to structural mechanics. The design 
configuration affects the load distribution and dynamics of the valve components, which, in conjunction 
with the material properties, determine durability — notably wear and fatigue life. Valve configuration, 
in concert with the flow engendered by the geometry, also dictates the extent of low-wear (e.g., flow 
separation) and high-shear (e.g., gap leakage) regions. The hinges of bileaflet and tilting disc valves are 
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vulnerable — their design can produce sites of stagnant flow, which may cause localized thrombosis, which 
may in turn restrict occluder motion. In addition, as discussed earlier, the rigid circular orifice ring is 
an unnatural configuration for a heart valve, since the elliptically shaped natural valve annulus changes 
in size and shape during the cardiac cycle. 

The choice of valve materials is closely related to structural factors, since the fatigue and wear perfor- 
mance of a valve depends not only on its configuration and loading but on the material properties as 
well. In addition, the issue of biocompatibility is crucial to prosthetic valve design — and biocompatibility 
depends not only upon the material itself but also on its in vivo environment. In the design of heart 
valves there are engineering design trade-offs: Materials that exhibit good biocompatibility may have 
inferior durability and vice versa. For many patients, the implanted prosthetic valve needs to last well over 
a decade, and in the case of young people the need for valve durability may be even greater. Mechanical 
durability depends on the material properties and the loading cycle, and examples of degradation include 
fatigue cracks, abrasive wear, and biochemical attack on the material [Giddens et al., 1993]. 

Wear 

Abrasive wear of valve parts has been and continues to be a serious issue in the design of mechanical 
prosthetic valves. Various parts of these valves come in contact repeatedly for hundreds of millions of 
cycles over the lifetime of the device. A breakthrough occurred with the introduction of pyrolitic carbon 
(PYC) as a valve material: It has relatively good blood compatibility characteristics and wear performance. 
However, although PYC wear upon PYC and upon metals is relatively low, PYC wear by metals is 
considerably greater [Shim & Schoen, 1974]. One example of this is a PYC disc mounted on a metallic 
orifice/hinge combination. The most durable wear couple is PYC-PYC, therefore PYC-coated components 
are very attractive. The first valve to employ a PYC-PYC couple was the St. Jude Medical valve, which 
has fixed pivots for the leaflets. Tests indicate that it would take 200 years to wear halfway through the 
PYC coating on a leaflet pivot [Gombrich et al., 1979]. By creating designs that allow wear surfaces to 
be distributed rather than focal (i.e., the Omnicarbon valve, which has a PYC-coated disc that is free to 
rotate), it is possible to reduce wear even further. Thus, materials technology continues to progress and 
in fact has reached the point where wear need not negatively impact the performance of prosthetic 
mechanical valves. 

Fatigue 

Metals are prone to fatigue failure. Their polycrystalline nature contains structural characteristics that 
may produce dislocations under mechanical loading. These dislocations can migrate when subjected to 
repeated loading cycles and can accumulate at intercrystalline boundaries, and the end result is tiny 
cracks. These tiny cracks are sites of stress concentration, and the fissures can worsen until fracture occurs. 
The Haynes 25 Stellite Bjork-Shiley valve, which used a chromium-cobalt alloy, experienced the most 
severe fatigue problem for a mechanical valve [Lindblom et al., 1986]. Previous investigations suggested 
that fatigue was not a problem for PYC; however, recent data contradict this, suggesting that cyclic fatigue- 
crack growth occurs in graphite/pyrolitic carbon composite material [Ritchie et al., 1990]. This work 
suggests a fatigue threshold that is as low as 50% of the fracture toughness, and those authors view cyclic 
fatigue as an essential consideration in the design and life prediction of heart valves constructed from 
PYC. The FDA now requires detailed characterization of PYC materials used in different valve designs 
(December 1993 FDA heart valve guidelines). 

Mineralization 

The major cause of both porcine aortic and pericardial bioprosthetic valve failure is calcification, which 
stiffens and frequently causes cuspal tears [Levy et al., 1991]. Calcific deposits occur most commonly at 
the commissures and basal attachments. Calcification is most extensive deep in (intrinsic to) the cusps 
in the spongiosa layer. Ultrastructurally, calcific deposits are associated with cuspal connective tissue cells 
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and collagen. Degenerative cuspal calcific deposits are composed of calcium phosphates that are chem- 
ically and structurally related to physiologic bone mineral (hydroxyapatite). The flexing bladders in 
cardiac assist devices, flexing polymeric heart valves, and vascular grafts have also been found to be 
vulnerable to calcific deposits. In such cases, calcification is usually related to inflammatory cells adjacent 
to the blood-contacting surface, rather than to the implanted material itself. 

The mechanisms of calcification, and the methods of preventing calcification are active areas of current 
research [Webb et al., 1991]. The most common methods of studying calcification involve valve tissue 
implanted either subcutaneously in 3-week old weanling rats or valves implanted as mitral replacements 
in young sheep or calves. Results of both types of studies show that bioprosthetic tissue calcifies in a 
fashion similar to clinical implants but at a greatly accelerated rate. The subcutaneous implantation mode 
is a well-accepted, technically convenient, economical, and quantifiable model for investigating miner- 
alization issues. It is also very useful for determining the potential of new antimineralization treatments. 

Host, implant, and biomechanical factors impact the calcification of tissue valves. Patients who are 
young or have renal failure are vulnerable to valve mineralization, but immunologic factors seem to be 
unimportant. Pretreatment of valve tissue with an aldehyde crosslinking agent has been found to cause 
calcification in rat subcutaneous implants; nonpreserved cusps do not mineralize. Calcification of bio- 
prosthetic valves is greatest at the cuspal commissures and bases, where leaflet flexion is the greatest and 
deformations are maximal. Most data suggest that the basic mechanisms of tissue valve mineralization 
result from aldehyde pretreatment, which changes the tissue microstructure. 

In both clinical and experimental bioprosthetic tissue, the earliest mineral deposits are observed to be 
localized to transplanted connective tissue cells. The collagen fibers are involved later. Mineralization of 
the connective tissue cells of bioprosthetic tissue is thought to result from glutaraldehyde-induced cellular 
"revitalization" and the resulting disruption of cellular calcium regulation. Normal animal cells have a 
low intracellular free calcium concentration (approximately 10~ 7 M), whereas extracellular free calcium 
is much higher (approximately 10~ 3 M), yielding a 10,000-fold gradient across the plasma membrane. In 
healthy cells, cellular calcium is maintained in low concentration by energy-requiring metabolic mech- 
anisms. In addition, organellar and plasma membranes and cell nuclei, the observed sites of early 
nucleation of bioprosthetic tissue mineralization, contain considerable phosphorus, mainly in the form 
of phospholipids. In cells modified by aldehyde crosslinking, passive calcium entry occurs unimpeded, 
but the mechanisms for calcium removal are dysfunctional. This calcium influx reacts with the preexisting 
phosphorus and contributes to the mineralization [Schoen et al., 1986]. A very recent article discusses 
the approaches to preventing heart valve mineralization [Schoen, 1993]. 

127.7 Current Trends in Valve Design 

The long-term clinical durability of bioprosthetic valves is the major impediment to their use. Before 
long-term durability data became available, improvement of hemodynamic performance was the focus 
of development. The clinical introduction of the bovine pericardial valve solved the hemodynamic 
problem, and such valves exhibit hemodynamics equal to or better than some mechanical prostheses. 

The long-term durability of porcine and bovine bioprostheses can be improved through innovative 
stent designs that minimize stress concentrations and through improved processing fixation techniques 
that yield more pliable tissue. The most recent tissue valve design is the stentless bioprosthesis, used for 
aortic valve replacement. The aortic root bioprostheses are similar in concept to homografts. Absence of 
the stent is thought to improve hemodynamics, as there is less obstruction in the orifice. The absence of 
the stent is also thought to improve durability of the tissue, as there is less mechanical wear. Currently, 
three designs of stentless aortic valves (Medtronic's Freestyle, Baxter's Prima, and St. Jude's Toronto Non- 
Stented) are undergoing clinical evaluation in the United States and Europe. New antimineralization 
treatments are also being developed with the goal of increasing the durability of the tissue. 

If the above-mentioned design challenges are met, so that bioprostheses can be produced that are 
durable and thromboresistant, and anticoagulant therapy is not required, there most likely will be another 
swing toward increased bioprosthesis use. 

© 2000 by CRC Press LLC 



127.8 Conclusion 

Direct comparison of the "total" performance of artificial heart valves is difficult, if not impossible. The 
precise definition of criteria used to benchmark valve performance varies from study to study. To study 
long-term performance, large numbers of patients and lengthy observation periods are required. During 
these periods there may be an evolution in valve materials, or design, and in the medical treatment of 
patients with prosthetic heart valves. The age of the patient at implant and the underlying valvular heart 
disease are extremely important factors in valve choice and longevity as well. A valve design suited for 
the aortic position may be inappropriate for the mitral position. Consequently, it is not possible to 
categorize a particular valve as the best. All valves currently in use, mechanical and bioprosthetic, produce 
relatively large turbulent stresses (that can cause lethal and or sublethal damage to red cells and platelets) 
and greater pressure gradients and regurgitant volumes than normal heart valves. 

There are three promising directions for further advances in heart valves (and therefore three challenges 
for engineers designing new heart valves): 

• Improved thromboresistance with new and better artificial materials 

• Improved durability of new tissue valves, through the use of nonstented tissue valves, new anti- 
calcification treatments, and better fixation treatments 

• Improved hemodynamics characteristics, especially reduction or elimination of low shear stress 
regions near valve and vessel surfaces and of high turbulent shear stresses along the edges of jets 
produced by valve outflow and/or leakage of flow. 

Whereas the artificial valves certainly have room for further improvement, the superior prognosis for 
the patient with a replacement heart valve is dramatic and convincing. 
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The use of natural or synthetic replacement parts in vascular repair and vascular reconstruction is 
extensive, with a vascular graft market of approximately $200 million worldwide. A multitude of biologic 
grafts and synthetic prostheses are available, each with distinct qualities and potential applications 
[Rutherford, 1989; Veith et al., 1994]. The ideal vascular graft would (1) be biocompatible, (2) be 
nonthrombogenic, (3) have long-term potency (4) be durable yet compliant, (5) be infection resistant, 
and (6) be technically facile. There is currently no ideal conduit available, but overall, the autogenous 
saphenous vein is preferred for small-vessel reconstruction, and synthetic prostheses are best suited for 
large-vessel replacement. Large diameter (>10 mm) vascular grafts are predominantly used for aortic/iliac 
artery reconstruction with Dacron (80%) and PTFE (20%) being the standard construction materials. 
Synthetic grafts function well in these high-flow, low-resistance circuits with high long-term patencies. 
Small-caliber (<10 mm) vascular grafts are used for a variety of indications including lower-extremity 
bypass procedures, coronary artery bypass grafting (CABG), hemodialysis access, and extra-anatomic 
bypasses. Saphenous vein is the conduit of choice for lower-extremity revascularization and CABG 
procedures and shows superior patency rates compared to synthetic grafts. Internal mammary artery 
(IMA) grafts are used extensively for CABG with better long-term patency than saphenous vein grafts. 
Synthetic prostheses are used for hemodialysis access and extra-anatomic bypass grafting, especially PTFE 
due to its durability and resistance to external pressure. Bovine carotid heterografts are also used for 
hemodialysis access with fair success, but their value is questionable because of aneurysmal changes over 
time due to graft degeneration. 

128.1 History of Vascular Grafts 

Vascular surgery was first defined as a field with the initial publication of Alexis Carrel's work in 1902. 
Goyanes performed the first arterial graft in 1906 by using the popliteal vein in situ to replace an excised 
popliteal artery aneurysm. Lexer, in 1907, performed the first free autogenous vein graft by replacing an 
axillary artery defect with a segment of greater saphenous vein from the same patient. It was not until 
1949, however, that Kunlin performed the first successful femoropopliteal bypass with reversed saphenous 
vein. The Korean War was the true advent of reconstructive surgery for arterial injuries as initiated by 
Shumacker and reported by Hughes and Bowers in 1952. The first report of a synthetic graft also appeared 
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in 1952 when Voorhees described his initial work in dogs using tubes of Vinyon-N cloth to bridge arterial 
defects. In the late 1950s and 1960s DeBakey firmly established the clinical usefulness of Dacron grafts; 
Dacron arterial prostheses became commercially available in 1957. Microporous expanded polytetraflu- 
oroethylene (ePTFE) grafts, introduced in the 1970s, were the next generation of successful synthetic 
grafts but bear no clear superiority to Dacron grafts. 

128.2 Synthetic Grafts 

Synthetic prostheses are the preferred conduit for large-vessel reconstruction and the primary alternative 
to saphenous vein in small-vessel repair. Thankfully, the saphenous vein is of adequate quality and length 
in 70-75% of patients requiring small-vessel grafting, the two major choices available for prosthetic 
reconstruction are Dacron (polyethylene terephthalate) and PTFE (polytetrafluoroethylene). The tensile 
strength of these grafts remains unchanged even after years of implantation, whereas Nylon (Polyamide), 
Ivalon, and Orion lose significant tensile strength over a period of months. 

Dacron grafts are constructed from multifilamentous yarn and fabricated by weaving or knitting 
[Rutherford, 1989; Sawyer, 1987]. Woven fabric is interlaced in an over-and-under pattern resulting in 
a nonporous graft with no stretch. Knitted fabrics have looped threads forming a continuous intercon- 
necting chain with variable stretch and porosity. Knitted velour is a variant in which the loops of yarn 
extend upward at right angles to the fabric surface resulting in a velvety texture. The velour finish may 
be created on the internal, the external, or both surfaces to enhance preclotting and tissue incorporation. 
Porous knitted Dacron prostheses may be made impervious by impregnation or coating with albu- 
min/collagen to obviate preclotting and minimize blood loss while maintaining the superior handling 
characteristics of knitted Dacron grafts. Dacron prostheses are often crimped to impart elasticity, maintain 
shape during bending, and facilitate vascular anastomosis formation. The use of external support for 
vascular grafts is an alternative to crimping which uses externally attached polyprophylene rings to avoid 
kinking with angulation and to create dimensional stability. Woven Dacron grafts possess poor handling 
characteristics and poor compliance and elicit a poor healing response and, therefore, are used predom- 
inantly in the repair of the thoracic aorta, ruptured abdominal aortic aneurysms, and patients with 
coagulation defects. Knitted Dacron prostheses are used in the remainder of cases involving the abdominal 
aorta, iliac, femoral, and popliteal vessels, and in extra-anatomic repair/bypass. Dacron grafts are not 
advocated in small-vessel reconstruction, such as the below-knee popliteal or tibial, because of poor 
patency rates. An active infective process or contaminated field contraindicate the use of Dacron pros- 
theses. Vascular grafts are named after the vessel they replace or the arterial segments they connect. 

Expanded Teflon (ePTFE) is a synthetic polymer of carbon and fluorine produced by mechanical 
stretching [Rutherford, 1989; Sawyer, 1987] . The result is a series of solid nodes of ePTFE with intercon- 
necting small fibrils. The pore size is variable and averages 30 microns. The chief advantages of ePTFE 
in comparison to knitted Dacron include impermeability to blood, resistance to aneurysmal formation, 
and ease of declotting; ePTFE is used clinically mainly for femoropopliteal bypasses, hemodialysis angio- 
access, and extra-anatomic reconstructions. The addition of an external support coil can be used to 
enhance graft stability against external pressure and to prevent kinking at points of angulation. 

The primary area of failure in prosthetic grafting with Dacron and PTFE is the patency of small- caliber 
grafts, which remains poor. The inherent thrombogenicity of these prosthetic grafts in conjunction with 
neointimal hyperplasia, which is limited to the anastomotic area, are the prime factors. The seeding of 
prosthetic grafts with endothelial cells and subsequent proliferation of these cells into a confluent mono- 
layer along the graft has been suggested as an attractive solution to this problem for a number of years. 
The technology and techniques devoted to develop this solution to small-caliber graft patency have been 
impressive, but the results have been variable and certainly not the panacea envisioned by early investi- 
gators. The clinical utility of endothelial cell seeding onto prosthetic grafts remains to be defined. 



© 2000 by CRC Press LLC 



TABLE 128.1 Patency Data for Representative Vascular Grafts 



Graft 


Conduit 


5-Year Patency, percent 


Aortiobifemoral 


Dacron 


90 


Aortiobifemoral 


ePTFE 


90 


Femorofemoral 


Dacron 


80 


Femorofemoral 


ePTFE 


80 


Femoropopliteal 


Saphenous vein 


70 


Femoropopliteal 


ePTFE 


50 


Femoropopliteal 


ePTFE 


40 


Dialysis access 


ePTFE 


60 (3 years) 


Dialysis access 


Bovine heterograft 


25 (3 years) 



128.3 Regional Patency 

Patency data for any vascular graft depend on multiple factors including the conduit and vessel size. 
Large-diameter vessels have a high rate of patency, and therefore, synthetic grafts are the conduit of choice 
and display high short- and long-term patencies. Saphenous vein is the primary graft for small-caliber 
vessel reconstruction with good patency rates, and prosthetic grafts demonstrate acceptable short-term 
patencies but poor long-term results. Extra-anatomic bypasses utilize synthetic conduits with external 
support to reduce kinking. Dacron versus PTFE graft trials have shown no significant difference in patency 
rates [Rutherford, 1989]. Hemodialysis access grafts are constructed primarily of PTFE with a minority 
of bovine carotid heterografts still present. PTFE is preferred due to resistance to infection, decreased 
pseudoaneurysm formation, and improved patency rates. Patency data for representative vascular grafts 
is summarized in Table 128.1 by anatomical location and conduit [Imparato etal., 1972; Rutherford, 
1989; Veith et al, 1994]. 

Vascular grafts have a finite life span which is usually reported in the form of a life-table of graft 
patency. The patency rate falls to about 50% in 4 years after implantation for the most common PTFE 
femoral-popliteal grafts. Graft failures are typically caused by: (1) thrombosis in the early postoperative 
period of 1 month, (2) tissue ingrowth or neointimal hyperplasia at the anastomoses after several years, 
and (3) graft infections. Thrombosis in the early postoperative period is usually caused by a technical 
error at time of implantation and is readily fixed. The longer-term growth of tissue at the anastomoses 
is more insidious. Late failure of these grafts as a result of neointimal hyperplasia is a major problem 
confronting the vascular surgeon. Increasing the longevity of these grafts would greatly benefit the patient 
and have an important economic impact, nationally. 

128.4 Thrombosis 

When an artificial vascular graft is first exposed to blood, a clot will typically form at the surface. The 
clot is formed initially of platelet aggregates, and then the coagulation cascade is activated to lay down 
fibrin and thrombin. The tissue which forms this inner lining of the graft is often called pseudointimal 
hyperplasia. Pseudointimal hyperplasia is devoid of cellular ingrowth and is composed mainly of fibrin, 
platelet debris, and trapped red blood cells [Greisler, 1991]. The blood clots immediately on contract 
and will occlude the vessel if the blood is stagnant or very sluggish. Platelet activity is generally most 
intense during the first 24 hours and subsidies to a very low level after 1 week. 

Various artificial surfaces are more or less thrombogenic. Many approaches have been developed in 
an attempt to make the grafts less thrombogenic [Greisler, 1991]. Typically, the surface characteristics 
have been modified to prevent adherence or activation of platelets. The surfaces have been modified by 
seeding with endothelial cells and other exotic treatments such as photopolymerization, plasma-gas 
coatings, and antisense genetics. Although many of these theories have promise, few have been shown 
to yield long-term improvements over the existing materials of Dacron and PTFE. 
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128.5 Neointimal Hyperplasia 

Tissue ingrowth onto the artificial surface of a PTFE graft occurs at the anastomoses and is called 
neointimal hyperplasia. Initially, platelets and thrombin/fibrin cover the entire graft surface within the 
first 24 hours. After this initial phase, smooth-muscle cells migrate from the native artery into the graft, 
variously referred to as pannus ingrowth, neointimal hyperplasia, and fibromuscular hyperplasia. The 
pannus advances from the edge of the graft at a rate of approximately 0.1 mm per week but does not 
cover the entire surface in humans. Typically, the maximum extent of ingrowth in humans is 1-2 cm. In 
the first few hours, a thrombotic stage of platelet, fibrin, and red blood cell accumulation occurs (stage I). 
Days 3-4 consist of cellular recruitment in which the thrombus is coated with an endothelial layer 
(stage II). Cell proliferation of actin-positive cells which resorb and replace the residual thrombus are 
the major events in stage III, at approximately 7-9 days. This description leads to a conclusion that 
thrombosis and cell proliferation are the two controlling events affecting graft patency. However, this 
process is one of normal healing which creates no significant stenosis in most cases. Recent findings 
indicate that a fourth stage should be added in which the tissue growths in volume primarily by synthesis 
and deposition of large amounts of extracellular substance over the following months, as shown in 
Fig. 128.1. This stage IV leads to a hemodynamically significant stenosis. 

The ingrowth has been characterized by Glagov as having two different forms: intimal fibromuscular 
hypertrophy (IFMH) and intimal hyperplasia (IH) [Glagov etal, 1991]. The first form of IFMH is 
basically a normal healing process which consistently takes place on all grafts and does not become highly 
stenotic. Initial healing is well organized with smooth-muscle cells organized in lamellae and little 
extracellular proteins or mucopolysaccharide material. A second form of abnormal healing creates 
stenotic and occlusive thickening, called intimal hyperplasia (IH). Intimal hyperplasia is characterized by 
a chaotic appearance of randomly oriented cells embedded in a field of extracellular matrix material. 




FIGURE 128.1 High-power sections of the anastomotic neointimal hyperplasia in a PTFE graft. This endothelium- 
lined layer is formed of highly disorganized spindle-shaped cells surrounded by a large amount of extracellular matrix 
material. 
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FIGURE 128.2 Graph of neointimal thickness versus mean shear stress for PTFE tapered grafts in a canine model 
of intimal hyperplasia. 

The histologic appearance is one of hyperplasia or tumor-like behavior. Little is known on the exact 
stimulant which causes healing to proceed by IFMH or IH, although Glagov speculates that hemodynamic 
shear stress may play a role. The events in graft neointimal hyperplasia appear to be similar to that of 
restenotic hyperplasia. 

Although the underlying etiology of neointimal hyperplasia has not been fully elucidated, a large 
number of theories have been proposed to explain this process. The theories generally fall into several 
broad categories: platelet deposition with local release of PDGF, other growth factor stimulation of SMC 
proliferation (e.g., TGF-, FGF), monocyte recruitment, complement activation, leukocyte deposition, 
chronic inflammation, and mechanical stimuli such as stress and shear abnormalities [Greisler, 1991]. 

Many studies have focused on preventing graft intimal thickening by attacking either thrombosis or 
cell proliferation. Cell culture studies clearly demonstrate that certain pharmacologic agents can pro- 
foundly affect the platelet adhesiveness and cell proliferation. However, the effectiveness of these agents 
in vivo is variable, depending on which animal species is used. A large number of human restenosis trials 
using a wide variety of antithrombotic agents, steroids, lipid-lowering therapies, and ACE inhibitors have 
largely been negative. Arteriovenous fistula graft studies in baboons suggest that modifying graft surfaces 
and infusing antithrombotic agents have little effect on the rate of obstruction of the prostheses. 

Compliance mismatch, medial tension, and hemodynamic shear stress are additional putative agents 
which have been hypothesized to be important in arterial adaptation and neointimal hyperplasia. Arteries 
tend to adapt in size to maintain a specific wall shear stress under conditions of increased blood flow. 
Wall shear stress appears to play a major role in the pathogenesis of atherosclerosis and in arterially 
transplanted autogenous veins, and elevations in shear stress inhibits SMC proliferation and neointimal 
thickening in porous PTFE grafts in baboons. 

We have recently characterized the quantitative relationship between the amount of shear stress and 
the associated neointimal thickness, shown graphically in Fig. 128.2. 

This relationship can be written as the mathematical relationship 



Intimal thickness = A X 



1 
wall shear rate 



-+C 



(128.1) 



where C is a constant which would depend on the implantation age of the graft. This relationship appears 
to be dominant over proximal or distal positioning or absolute flow rate. This inverse, nonlinear relationship 
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between neointimal thickening and wall shear stress shown in Fig. 128.2 is strikingly similar to that seen 
for early atherosclerotic intimal thickening in arteries [e.g., Ku & Zhu, 1993]. Thus, neointimal hyper- 
plasia appears to respond to low shear stress in the same consistent, stereotypic way as do other vascular 
cells in vivo. Since wall shear is primarily determined by the graft diameter, selection of an optimal-size 
graft for typical flow rates may increase the long-term patency of prosthetic grafts [Binns et al., 1989]. 
These results apply to in vivo PTFE grafts composed of the pore-size and material configuration used 
most commonly in clinical vascular surgery today. 

Surprisingly, the pseudointima of thrombus material consistently shows the same thickening as the 
anastomotic neointima, even though there are no nucleated cells in the pseudointima [Binns et al., 1989]. 
The advancing front exhibits an unusual shape in which the endothelial cells grow over the existing 
thrombus at the lumen, but the smooth-muscle cells stay at the graft surface underneath the thrombus. 
Thus, the two cell types do not touch each other at the advancing edge of the pannus. The distances 
between the two types of cells suggest that direct cellular communication is not present at the advancing 
edge. 

Although there is clearly a large body of knowledge on intimal hyperplasisa, the more general question 
of what biologic material is actually causing occlusions is not clear. The in vitro cell culture studies are 
elegant in their ability to probe the molecular regulators of synthesis and proliferation. However, the 
response of cells grown as monolayers on plastic may be different from tissue growing in the complex 
milieu in vivo. Small animal studies are useful for initial testing of a wide variety of potential therapeutic 
agents. Unfortunately, many agents which are successful in rats and rabbits turn out to be unsuccessful 
in large animals or humans. Large animals such as dogs, pigs, and baboons appear to develop lesions 
similar to humans and respond to therapies in a similar manner. The major drawback, however, is that 
the lesions typically take approximately one year to develop into advanced stenoses which mimic the 
clinical human problem. An accurate large-animal model of high-grade neointimal hyperplastic lesions 
is needed in order to generate a comprehensive characterization of lesion development in vivo. 

The cellular mechanism by which shear stress influences the development of neointimal hyperplasia 
is not well understood. The pannus ingrowth is likely to be composed predominantly of smooth-muscle 
cells. However, the fluid-wall shear stress interface is primarily experienced by endothelial cell. One 
mechanism for neointimal thickening may be that the endothelial cells sense the local wall shear and 
regulate the amount of underlying smooth muscle cell growth. Clearly, endothelial cells can change their 
physiologic behavior and morphologic appearance in response to wall shear stress [Nerem, 1992]. Shear- 
sensitive endothelial cells mechanoreceptors have been postulated as regulators of the adaptation process 
seen in arteries in response to increased shear stress. Such receptors, however, have not been identified. 
An "endothelial-derived relaxation factor" and an "endothelial-derived constriction factor" appear to play 
an important role in the acute response of the arterial wall to alterations in shear stress. A separate 
humoral factor acting on the media in response to alterations in shear stress has been suggested. The 
fact that most of these mediators are endothelium-derived or endothelium-dependent may explain the 
localization of neointimal hyperplasia in PTFE grafts to the juxta-anastomotic region. In contrast to 
native vessels, the commonly used human prosthetic grafts are rarely, if ever, fully lined with endothelium 
except for a short distance adjacent to the anastomosis. 

The morphology of endothelial cells grown in low shear or static conditions are random in orientation 
without a preferred direction. As the intimal thickening progresses, one would expect the deeper layers 
to be more insulated from the shear effects of hemodynamics and to experience a more static environment. 
PTFE grafts are essentially nondistensible under physiologic pressure conditions. Thus, the perigraft cells 
would be expected to experience the least amount of stress from pressure pulses and circumferential 
stretch as well. 

Recently, there has been greater recognition of the importance of extracellular matrix (ECM) in 
vascular lesions [Bell, 1992]. In general, we have observed that the deepest layers of cells in the perigraft 
region exhibited the most disorganization and extracellular matrix material. The appearance of the 
intimal hyperplasia in this region strongly suggests that growth of neointima in this region from the 
production of extracellular matrix is the dominant mechanism by which intimal hyperplasia becomes 
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occlusive. ECM is dominant in advanced lesions, often to the virtual exclusion of cellular elements. The 
ECM is also directly responsible for a variety of tissue signals which can augment or reduce cellular 
proliferation and protein production. The ECM can act directly or act as a repository for other growth 
factors. Several investigators are attempting to recreate the detailed architecture of ECM in order to 
develop future biologic substrates for arteries. 

Low compliance of PTFE grafts are associated with higher rates of intimal hyperplasia. Rodbard [1970] 
has hypothesized that the normal adaptive response of cells to static conditions is the production of large 
amounts of ECM. It is possible that the lack of stretch in the PTFE graft removes any mechanical stimulus 
to the smooth-muscle cells in this layer and transforms the cells into a synthetic mode with production 
of large amounts of ECM. The highly synthetic mode of these cells and the hypocellularity in this region 
would suggest that the major culprit of late graft occlusion is synthesis instead of proliferation. Studies 
of the response of endothelial cells to very low levels of wall shear stress and the response of underlying 
smooth muscle cells to endothelial cell products should provide important information on the cellular 
and molecular mechanisms producing the transformation of thin, organized healing into occlusive 
growth. 

128.6 Graft Infections 

Vascular graft infections are catastrophic and challenging problems that threaten life and limb. The 
incidence of synthetic graft infection is approximately 2% and has remained stable despite advances in 
aseptic surgical technique, vascular graft production, and immunology. Prosthetic grafts are involved in 
infections four times as often as autogenous vein. The infection rates for PTFE and Dacron prostheses 
are about the same. The mechanism of infectious complications may involve direct inoculation during 
surgery or hematogenous seeding, but most occur due to contamination at the time of surgery. Contact 
of the prosthesis with the skin of the patient is felt to be a key event. The highest incidence anatomically 
occurs in the inguinal areas due to perineal proximity, lymphatic disruption, and poor wound healing. 
Antibiotic prophylaxis, both systemically and locally, decreases the incidence of graft infection. The 
bonding of antibiotics to the prosthesis has been attempted in various forms to prevent prosthetic graft 
infection but to date is not clinically available. Staphylococcus epidermidis is the most common organism 
involved in graft infections, which Staphylococcus aureus also a prevalent organism. Early graft infections 
commonly are superficial infections (inguinal) with virulent organisms such as Staphylococcus aureus or 
Pseudomonas aeruginosa. Late graft infections usually involve contamination of an abdominal graft with 
less virulent species such as Staphylococcus epidermidis. The reported mortality rates vary with the 
anatomic location of the graft and the aggressiveness of the infecting organism. These may range from 
5-10% for femoropopliteal grafts to 50-75% for aortobifemoral grafts. The associated amputation rate 
varies from 25-75%. 

Vascular grafts are important tools in the clinical treatment of end-stage arteriosclerosis. Large- 
diameter grafts made of artificial material have high success and longevity. For small-diameter grafts, 
autogenous venous grafts are superior in longevity and patency than artificial grafts. The main problems 
for artificial vascular grafts are early thrombosis, late neointimal hyperplasia, and iatrogenic infection. 
Some active areas of research include the prevention of flowing blood thrombosis on surfaces, control 
of the pathogenic process causing neointimal hyperplasia occlusions, and facilitation of the immunogenic 
response to bacterial infection of a graft. 

Defining Terms 

Extra-anatomic bypass: Vascular graft used to bypass a blocked artery where the graft is placed in a 
different location than the original artery. One example is an axillo-femoral graft which brings 
blood from the arm to the leg though a tube placed under the skin along the side of the person. 

Hemodialysis access: Arterial-venous connection surgically placed to provide a large amount of blood 
flow for hemodialysis. These are often loops of ePTFE placed in the arm. 



© 2000 by CRC Press LLC 



Neointimal hyperplasia: Fibroblast and smooth-muscle cell growth covering a vascular graft on the 

inside surface. 
Pannus: Neointimal hyperplasia tissue ingrowth at the anastomoses. 
Pseudointimal hyperplasia: Fibrin/thrombin deposition on the inside surface of an arterial vascular 

graft. This accumulation of material is acellular. 
Stenosis: Tissue ingrowth into vessel causing a narrow lumen and reduction of blood flow. 
Vascular graft: Tube replacement of an artery or vein segment. 
Vascular reconstruction: Reconstruction of an artery or vein after trauma, surgery, or blockage of 

blood flow from disease. 
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The natural lung is the organ in which blood exchanges oxygen and carbon dioxide with the body 
environment. In turn, blood brings oxygen to all body tissues, so as to oxidize the nutrients needed to 
sustain life. The end products of the chemical reactions that take place in tissues (globally referred to as 
metabolism) include carbon dioxide, water, and heat, which must all be eliminated. In mammals, oxygen 
is obtained from the air we breathe through diffusion at the level of the pulmonary alveoli and then 
carried to the tissues by the hemoglobin in the red blood cells. The carbon dioxide produced by living 
cells is picked up by the circulating blood and brought to the pulmonary capillaries from where it diffuses 
into the alveoli and is conveyed out by ventilation through the airways. These processes can be slowed 
down to a fraction of resting levels by hypothermia or accelerated up to 20-fold when the demand for 
fuel increases, as for instance with hypothermia, fever, and muscular exercise. 

Only a fraction of the oxygen in the air is actually transferred from the pulmonary alveoli to the 
pulmonary capillary blood, only a fraction of the oxygen carried by arterial blood is actually extracted 
by the tissues, and only a fraction of the oxygen present in tissues is actually replenished in a single blood 
pass. Similarly, only a fraction of the C0 2 present in tissues is conveyed to the circulating blood, only a 
fraction of the mixed venous blood CO, content is actually discharged in the alveoli, and only a fraction 
of the C0 2 in the aveolar gas is eliminated into each breath. Delicately poised physiologic mechanisms, 
further balanced by chemical buffer systems, maintain the gas exchange system in equilibrium. 

The challenge of replacing the function of the natural lungs by an exchange device allowing continuous 
blood flow and continuous blood arterialization was first outlined by physiologists at the end of the 19 th 
century but could not be met reliably until the 1950s. The large transfer areas needed for blood-gas 
exchange in an artificial lung were initially obtained by continuous foaming and defoaming in a circulating 
blood pool or by spreading a thin film of blood in an oxygen atmosphere. Because the direct blood-gas 
interface was found to be damaging to blood as well as difficult to stabilize over extended periods, 
membrane-mediated processes were introduced and are now almost universally preferred. 
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129.1 Gas Exchange Systems 

Artificial lungs are often called blood oxygenators because oxygen transfer has traditionally been seen as 
the most important function being replaced and, in most situations, has proved more limiting than C0 2 
transfer. The change in blood color between inlet and outlet also encourages the term oxygenator, 
considering that changes in blood C0 2 content are not visible to the eye and are more difficult to measure 
than oxygen transfer. 

As is the case for most artificial organs, artificial lungs may be called upon to replace entirely the 
pulmonary gas exchange function (when the natural organ is totally disabled or, while still sound, must 
be taken out of commission for a limited time to allow a surgical intervention) or to assist the deficient 
gas transfer capacity of the natural organ, either temporarily, with the hope that the healing process will 
eventually repair the diseased organ, or permanently, when irreversible lung damage leaves the patient 
permanently disabled. 

Since most artificial lungs cannot be placed in the anatomical location of the natural lung, venous 
blood must be diverted from its normal path through the central veins, right heart, and pulmonary 
vascular bed and rerouted, via catheters and tubes, through an extracorporeal circuit which includes the 
artificial lung before being returned, by means of a pump, to the arterial system. 

The procedure in which the pulmonary circulation is temporarily interrupted for surgical purposes 
and gas exchange is provided by an artificial lung is often referred to as extracorporeal circulation (ECC) 
because, for convenience sake in the operating room, the gas exchange device as well as the pumps which 
circulate the blood are located outside the body. 

The vision of coupling extracorporeal blood pumping with extracorporeal gas exchange at a level of 
performance sufficient to permit unhurried surgical interventions in adult patients originated with 
Gibbon [1939] whose initial laboratory models relied on rotating cylinders to spread blood in a contin- 
uously renewed thin film, in the tradition of 19 th -century physiologists. Gibbon's clinical model [1954], 
built with technical support from IBM, was a stationary film oxygenator, a bulky device in which venous 
blood was evenly smeared over a stack of vertical wire screen meshes in an oxygen atmosphere, flowing 
gently downward to accumulate in a reservoir from where blood could be returned to a systemic artery. 
The main problems with that design, besides its cumbersome dimensions, were to avoid blood streaming 
and to maintain a constant blood-gas exchange area. Other investigators tried to increase the flexibility 
of the system by replacing the stationary film support with rotating screens or rotating discs partly 
immersed in a pool of blood. This allowed some control of gas transfer performance by changing the 
rational speed, but minimizing the blood content of the device dictated a tight fit between the discs and 
the horizontal glass cylinder surrounding them. Foaming and hemolysis were encountered at high disc 
spinning velocity, and these designs were eventually abandoned. 

The very first strategy of physiologists for exchanging oxygen and carbon dioxide in venous blood had 
been to bubble pure oxygen through a stationary blood pool. To turn this batch process into a continuous 
operation for totally body perfusion, blood was collected through cannulae from the central veins by a 
syphon or a pump, driven upward in a vertical chimney, mixed with a stream of oxygen gas bubbles, and 
finally passed through filters and defoaming sponges so as to collect bubble-free arterialized blood, which 
could then be used to perfuse the arterial tree. The efficiency of bubble oxygenators is extremely high 
because the smaller the bubbles, the larger the blood-oxygen exchange area developed by a steady current 
of gas. In the limiting case, it is even possible to saturate venous blood by introducing no more oxygen 
into the blood than is consumed by the tissues. This process, however, doe not remove any carbon dioxide. 
Since the partial pressure of C0 2 in the excess gas vented from the bubble oxygenator cannot exceed the 
C0 2 partial pressure in arterialized blood (and in actuality is much lower), it follows the carbon dioxide 
transfer rate of bubble oxygenators is a direct function of the volume inflow rate of oxygen, which must 
exceed oxygen uptake severalfold to transfer both 2 and C0 2 . Thus, the operating conditions for a bubble 
oxygenator are dictated to a major extent by the requirements for adequate Co 2 removal. 

Three major advances propelled bubble oxygenators ahead of film oxygenators in the pioneer decades 
of cardiac surgery. The first was the identification of silicone-based defoaming compounds which could 
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be smeared on top of the bubble chimney and proved much more effective in coalescing the blood foam 
than previously used chemicals. The second was the quantitative process analysis by Clark [1950] and 
Gollan [1956] which showed that, since small bubbles favor oxygen transfer and large bubbles are need 
for C0 2 removal, an optimum size could be found in between, or alternatively a mix of small and large 
bubbles should be used. The third and practically decisive advance was made simultaneously by Rygg 
[1956] and DeWall [1957], who replaced the assembly of glass, steel, and ceramic parts of early bubble 
oxygenators with inexpensive plastic components and thereby paved the way for the industrial manu- 
facturing of disposable bubble oxygenators. As a result, reusable stationary film and disc oxygenators, 
which require careful cleaning and sterilization, slowly disappeared, and disposable bubble oxygenators 
dominated the field of extracorporeal gas exchange from 1960 to the early 1980s. The oxygenator and 
pumps designed during the pioneer phase of extracorporeal circulation are described in a book by Galletti 
andBrecher [1962]. 

In the 1970s,the bubble oxygenator was integrated with a heat exchanger (usually a stainless steel coil) 
and placed within a clean plastic container that also served as a venous reservoir with a capacity of several 
liters of blood. The level of blood-air interface allowed direct observation of changes of blood volume 
in the extracorporeal circuit. This simple design feature gave the equipment operator (who eventually 
became known as the perfusionist) plenty of time in which to react to a sudden blood loss and make 
compensating changes in operation. 

Yet a number of problems occurred with bubble oxygenators because of their large blood-gas interface. 
If the blood foam did not coalesce completely, a gaseous microemboli could be carried into the arterialized 
bloodstream. Plasma proteins were denatured at the gas interface, leading to blood trauma associated 
with platelet activation and aggregation, complement activation, and hemolysis. These problems could 
be ameliorated by placement of a gas-permeable membrane between the blood and gas phases. The idea 
of using membranes permeable to respiratory gases in order to separate the blood phase from the gas 
phase in an artificial lung, and consequently avoid the risk of foaming or the formulation of thick blood 
rivulets inherent in bubble or film oxygenators, was stimulated after World War II by the growing 
availability of commercially produced thin plastic films for the packaging industry. The two major 
challenges for membrane oxygenators were, and indeed remain, that no synthetic membrane could be 
fabricated as thin as the pulmonary alveolar wall, and manifolding and blood distribution system could 
match the fluid dynamic efficiency of the pulmonary circulation, where a single feed vessel — the pulmo- 
nary artery — branches over a short distance and with minimal resistance to flow into millions of tiny 
gas exchange capillaries the size of an erythrocyte. Membrane oxygenators have progressively captured 
the largest share of the market for clinical gas exchange devices not only because their operation is less 
traumatic for blood but also because the blood content of the gas exchange unit is fixed, thereby limiting 
volume fluctuations to calibrated reservoirs and minimizing the risk of major shifts in intracorporeal 
blood volume during total body perfusion. 

The interposition of a membrane between flowing gas and flowing blood reduces the gas transfer 
efficiency of the system as a consequence of the additional mass transfer resistances associated with the 
membrane itself and the geometry of the blood layer. The permeability of various polymeric materials 
to oxygen and carbon dioxide is summarized in Table 129.1. The very first plastic films used, such as 
thin sheets of polyethylene and polytetrafluoroethylene (PTFE), showed such a low diffusional perme- 
ability that 5-10 m 2 were needed to meet even the minimal oxygen needs of an anesthetized hypothermic 
adult [Clowes et al., 1956]. The advent of silicone elastomer films (either as solid sheets or cast over a 
textile support mesh) in the 1960s established the technical feasibility of membrane oxygenators [Bramson 
etal., 1965; Galletti et al., 1966; Peirce et al., 1967]. Silicone rubber is about 140 times more permeable 
to oxygen and 230 times more permeable to C0 2 than is PTFE for equivalent thickness. Even though 
silicone rubber and related elastomers cannot be cast as thin as PTFE, their permeability is so high that 
they became the standard material for early membrane oxygenator prototypes. 

Extensive research was carried out in the 1960s and 1970s to develop improved membrane oxygenator 
designs with more efficient oxygen transport across the blood boundary layer. These designs are discussed 
later in this chapter. The clinical motivation was to provide continuous full or partial replacement of 
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TABLE 129.1 2 and C0 2 Permeability of Various Materials 







2 Permeability 










cm 3 (STP)-mil 


CO 


, Permeability 


Chemical Composition 


Common Name 


min-m 2 -atm 


o 2 


Permeability 


Air 




5x 10 7 




0.8 


Polydimethylsiloxane 


Silicone rubber 


1100 




5 


Water 




150 




18 


Polystyrene 




55 




5 


Polyisoprene 


Natural rubber 


50 




6 


Polybutadiene 




40 




7 


Regenerated cellulose 


Cellophane 


25 




18 


Polyethylene 




12 




5 


Polytetrafluoroethylene 


Teflon 


8 




3 


Polyamide 


Nylon 


0.1 




4 


Polyvinylidene chloride 


Saran 


0.01 




6 



Permeability is the product of the diffusion coefficient D and the Bunsen solubility 
coefficient a. The gas flux J [cm 3 (STP)/muvm 2 ] across a membrane of thickness h with 
a partial pressure difference Ar is given by / = PAr/h. 

pulmonary gas exchanges for periods of weeks to patients with advanced respiratory failure, with the 
hope that in the interval the natural lung would recover. Since extensive blood trauma limited the use 
of blood oxygenators beyond 12-24 hours, membrane oxygenators became the key to this application. 
In the mid-1970s, an NIH-sponsored clinical study [Zapol, 1975] demonstrated that with the protocols 
used lung function was not regained after 1-3 weeks of extracorporeal circulatory with a membrane 
oxygenator. The major intended application of membrane oxygenators having vanished, and the high 
cost of silicone rubber making these devices much more expensive than bubble oxygenators, research 
and development almost came to a halt, leaving bubble oxygenators in control of the remaining field of 
use, namely cardiopulmonary bypass. 

Two technical advances have over the following two decades reversed this trend: ( 1 ) the discovery that 
hydrophobic microporous membranes, through which gas can freely diffuse, have a high enough surface 
tension to prevent plasma filtration at the moderate pressures prevailing in the blood phase of an artificial 
lung; (2) the large-scale fabrication of defect-free hollow fibers of microporous polypropylene, which 
can be assembled in bundles, potted and manifolded at each extremity to form an artificial capillary bed 
of parallel blood pathways immersed in a cylindrical hard shell through which oxygen circulates. 

Microporous hollow fiber membrane oxygenators now dominate the market. The most common 
embodiment of the hollow fiber membrane oxygenator features gas flow through the lumen of the fibers 
and blood flow in the interstices between fibers. This arrangement not only utilizes the larger outer 
surface area of the capillary tubes as gas transfer interface, instead of the luminal surface, it also promotes 
blood mixing in a manner which enhances oxygen transport, as will become apparent below. 



129.2 Cardiopulmonary Bypass 



Cardiopulmonary bypass (CPB), also called heart-lung bypass, allows the temporary replacement of the 
gas exchange function of the lungs and the blood-pumping function of the heart. As a result blood no 
longer flows through the heart and lungs, which then presents the surgeon with a bloodless operative field. 
The terms pump-oxygenator and heart-lung machine graphically describe the equipment used. Car- 
diopulmonary bypass is the procedure. Open-heart surgery strictly speaking refers to interventions inside 
the heart cavities, which once provided the most frequent demand for cardiopulmonary bypass technol- 
ogy. By extension, the term is also applies to surgical procedures which take place primarily on the 
external aspects of the heart, such as creating new routes for blood to reach the distal coronary arteries 
from the aorta, e.g., coronary artery bypass grafting (CABG) or, popularly, bypass surgery. In these 
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FIGURE 129.1 Scheme of standard operating conditions during cardiopulmonary bypass. 

operations the cardiac cavities are temporarily vented (i.e., open to atmospheric pressure) to avoid the 
build-up of pressure which could damage the cardiac muscle. 

As usually employed for cardiac surgery, the heart-lung machine is part of a total, venoarterial cardio- 
pulmonary bypass circuit, meaning that all the venous blood returning to the right heart cavities is collected 
in the extracorporeal circuit and circulated through the gas exchange device, from where it is pumped 
back into the arterial tree, thereby "bypassing" the heart cavities and the pulmonary circulation. On the 
blood side, this procedure usually involves hemodilution, some degree of hypothermia, nonpulsatile 
arterial perfusion at a flow rate near the resting cardiac output, and continuous recirculation of blood in 
an extracorporeal circuit in series with the systemic circulation of the patient. On the gas side, oxygen or 
an oxygen-enriched gas mixture (with or without a low concentration of C0 2 ) flows from a moderately 
pressurized source in a continuous, nonrecycling manner and is vented to the room atmosphere. 

CPB hinges on twin postulates: that blood circulation can be sustained by mechanical pumps while 
the heart is arrested and that venous blood can be artificially arterialized in an extracorporeal gas exchange 
device while blood flow is excluded from the lungs. Each of these claims was established separately 
through animal experimentation over the course of over 100 years (see Galletti and Brecher [1962]). 
Then surgeons and engineers combined the advances made by physiologists and pharmacologists and 
turned them, in the 1950s, into the basic tool of cardiac surgery. A recent update on the evolution of 
artificial membrane lungs for CPB surgery has been provided by Galletti [1993]. It is estimated that each 
year about 650,000 disposable membrane lungs are used in the operating room worldwide, with each 
gas exchange unit selling for a price in the range of $250-400, i.e., a market close to $2 billion. 

Typical operating conditions for total cardiopulmonary bypass in an adult are summarized in Fig. 129.1 
and Table 129.2. Most notable are the large differences in driving forces for 2 and C0 2 . An approximate 
comparison can be made using the inlet conditions. Thus, under normothermia and with pure oxygen 
fed to the gas phase, initial driving force is approximately 45 - = 45 mmHg for C0 2 and 760 - 47 = 
713 mmHg for 2 yielding a ratio of C0 2 to 2 driving forces of about 0.06. The ratio of C0 2 to O, 
permeability for silicone rubber is roughly 5 (Table 129.2), and the corresponding flux ration is 0.35, less 
than half of the metabolically determined respiratory quotient. Under these conditions, C0 2 is the limiting 



© 2000 by CRC Press LLC 



TABLE 129.2 Typical Operating Parameters for Cardiopulmonary Bypass 
in an Adult 

Oxygen transfer requirement 250 mL/min 

CO, elimination requirement 200 mL/min 

Respiratory gas exchange ratio (respiratory quotient) 0.8 

Blood flow rate 5 L/min 

Gas flow rate 5 -10 L/min 

Gas partial pressures (in mmHg) 

Blood in p0 2 = 40 pC0 2 = 45 

Blood out p0 2 = 100 - 300 pC0 2 = 30-40 

Gas in (humidified) pO, = 250-713 pC0 2 = -20 

Gas out p0 2 = 150 - 675 pC0 2 = 10-30 



gas, and the device should be sized on the basis of C0 2 transport rather than 2 transport requirements. 
This is why silicone rubber membranes have been replaced by microporous polypropylene membrane, 
where the solubility of C0 2 in the membrane material, and consequently its permeability, is no longer 
the limiting factor. Indeed, with some modern membrane oxygenators, gas flow through the device may 
have to be controlled to avoid an excessive loss of CO,. 

129.3 Artificial Lung Versus Natural Lung 

In the natural lungs, the factors underlying exchange across the alveolo-capillary barrier and transport 
by the blood can be grouped into four classes: 

1. The ventilation of the lungs (the volume flow rate of gas) and the composition of the gas mixture 
to which mixed venous (pulmonary artery) blood will be exposed 

2. The permeability of the materials which separate the gas phase from the blood phase in the 
pulmonary alveoli 

3. The pattern of pressure and flow through the airways and through the pulmonary vascular bed 
and the distribution of inspired air and circulating blood among the various zones of the exchange 
system 

4. The gas carrying capacity of the blood as regards oxygen and carbon dioxide (and secondarily 
nitrogen and anesthetic gases) 

In an artificial lung, replacing the gas transfer function of the natural organ implies that blood 
circulation can be sustained by mechanical pumps for extended periods of time to achieve a continuous, 
rather than a batch process, and that venous blood can be arterialized in that device by exposure to a 
gas mixture of appropriate composition. The external gas supply to an artificial lung does not pose 
particular problems, since pressurized gas mixtures are readily available. Similarly the components of 
blood which provide its gas-carrying capacity are well identified and can be adapted to the task at hand. 
In clinical practice, it is important to minimize the amount of donor blood needed to fill the extracor- 
poreal circuit, or priming volume. Therefore a heart-lung machine is generally filled with an electrolyte 
or plasma expander solution (with or without donor blood), resulting in hemodilution upon mixing of 
the contents of the extracorporeal and intracorporeal blood circuits. The critical aspects for the operation 
of an artificial lung are blood distribution to the exchanger, diffusion resistances in the blood mass 
transfer boundary layer, and stability of the gas exchange process. 

Artificial lungs are expected to perform within acceptable limits of safety and effectiveness. The most 
common clinical situation in which an artificial lung is needed is typically of short duration, with resting 
or basal metabolism in anesthetized patients. Table 129.3 compares the structures and operating condi- 
tions of the natural lung and standard hollow fiber artificial membrane lungs with internal blood flow. 

An artificial lung designed to replace the gas exchange function of the natural organ during cardiac 
surgery must meet specifications which are far less demanding than the range of capability of the 
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TABLE 129.3 Comparison of Nature and Artificial Membrane Lung 



Natural Lung 

Performance 

Must meet demand at rest and during exercise, fever, etc. 

Constant temperature process around 37°C 

O, and C0 2 transfer are matched to achieve the respiratory 
quotient imposed by foodstuff metabolism (around 0.8) 
Continuous over a lifetime 

Exchange surface area 

Wide transfer area (~ 70 m 2 ) 

Highly permeable alveolar-capillary membrane 

Short diffusion distances (1 -2 (im) 
Hydrophilic membrane 
No hemocompatibility problems 
Self-cleaning membrane 

Gas side 

To-and-fro ventilation 

Operates with air 

Membrane structure sensitive to high oxygen partial pressure 

Pressure below that in blood phase to avoid capillary collapse 

Operates under water vapor saturation conditions 

Ventilation linked to perfusion by built-in control mechanisms 

Can be used for gaseous anesthesia 

Blood side 

Short capillaries (0.5 - 1 mm) 

Narrow diameter (3-7 (tm) 

Short exposure time (0.7s) 

Low resistance to blood flow 

Sophisticated branching 

Minimal priming volume 

Capillary recruitment capability 

No recirculation 

Limited venous admixture 

No on-site blood mixing 

Operates with normal hemoglobin concentration 

Does not require anticoagulation 



Hollow Fiber Oxygenator 



Must meet demand at rest and under anesthesia 
Can be coupled with heat exchanger to lower body 
temperature. 
O, and C0 2 transfer are largely independent of each other 
and must be controlled by the operator 
Usually limited to few hours 

Limited transfer area ( 1 -3 m 2 ) 

Diffusion barriers in synthetic membrane and blood 

oxygenation boundary layer 
Relatively thick membranes (50 -100 itm) 
Hydrophobic polymers 
Hemocompatibility problems 
Protein build-up on membrane 

Steady cross flow gas supply 

Operates with oxygen-rich mixture 

Membrane insensitive to high oxygen partial pressure 

Pressure below that on blood side to avoid bubble formation 

Can be clogged by water vapor condensation 

Ventilation dissociated from perfusion, with risks of hyper- 

or hypoventilation 
Can be used for gaseous anesthesia 

Long blood path (10 -15 cm) 

Thick blood film (150 -250 urn) 

Long exposure time (5 -15s) 

Moderate to high resistance to blood flow 

Crude manifolding of entry and exit ports 

Moderate to large priming volume 

Fixed geometry of blood path 

Possibility of recirculation 

Risk of uneven perfusion of parallel beds 

Possibility of blood stirring and mixing 

Hemodilution is common 

Requires anticoagulation 



mammalian lung would suggest. Nonetheless, these specifications must embrace a range of performance 
to cover all metabolic situations which a patient undergoing cardiopulmonary bypass might present. 
These conditions range in terms of metabolic rate from the slightly depressed resting metabolism char- 
acteristic of an anesthetized patient, lightly clad in a cool operating room, to moderate (25-28°C) and 
occasionally deep (below 20°C) hypothermia. Hypothermia and high blood flow are occasionally encoun- 
tered in patients with septic shock. In terms of body mass, patients range from 2-5-lb newborn with 
congenital cardiac malformations to the 250-lb obese, diabetic elderly patient suffering from coronary 
artery disease and scheduled for aortocoronary bypass surgery. 

Whereas it is appropriate to match in advance the size and therefore the transfer capability of the gas 
exchange unit in the heart-lung machine to the size of the patient (largely out of concern for the volume 
of fluid needed to fill or "prime" the extracorporeal circuit), each gas exchange unit, once in use, must 
be capable of covering the patient's requirement under any circumstances. This is the responsibility of 
the perfusionist, who controls the system in the light of what is happening to the patient in the operative 
field. In fact, the perfusionist substitutes his or her own judgment for the natural feedback mechanisms 
which normally control ventilation and circulation to the natural lungs. The following analysis indicates 
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how there requirements can be met in the light of the gas transport properties of blood and the 
characteristics of diffusional and convective transport in membrane lung devices. 

129.4 Oxygen Transport 

The starting point for analysis of 2 transport is the conservation of mass relation, which is derived from 
a material balance in a differential element within the flowing blood. By way of example, the relation 
commonly employed for oxygen transport of blood flowing in a tube is 
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where x is the axial coordinate, r is the radial coordinate, V is the velocity in the axial direction, D0 2 is 
the effective diffusion coefficient of 2 in flowing blood, [0 2 ] is the concentration of physically dissolved 
2 , and [Hb0 2 ] is the concentration of 2 bound to hemoglobin. The terms on the left side of Eq. (129.1) 
represent convection in the axial direction of 2 in its two forms, dissolved in aqueous solution and 
bound to hemoglobin. The right-side term represents the diffusion of dissolved 2 in the radial direction. 
By making the usual assumption that the concentration of dissolved 2 is linearly proportional to the 
2 partial pressure p, [0 2 ] = dp, where Ot is the Bunsen solubility coefficient, and by using the definition 
of oxyhemoglobin saturation: [Hb0 2 ] = Cj5 where C T is the oxygen-carrying capacity of hemoglobin 
(per unit volume of blood) and S is its fractional saturation, Eq. (129. 1) can be rewritten in a modified 
form 
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which shows that the convective contribution is proportional to the slope of the saturation curve. 
Equation (129.2) must be solved subject to the appropriate initial and boundary conditions, which for 
a tube are 
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where p i and p are the oxygen partial pressures in the inlet blood and in the gas, respectively, and P„ is 
the membrane permeability for 2 diffusion, defined so as to include the membrane thickness. These 
conditions represent, in order, a uniform inlet concentration, symmetry about the tube axis, and the 
requirement that the 2 diffuison flux at the blood-membrane interface be equal to the 2 diffusion flux 
through the membrane. 

Implicit in the derivation of Eq. (129.2) are two important assumptions. The first is that, on a 
macroscopic scale, blood can be treated as a homogenous continuum, even though as a suspension of 
red blood cells in plasma, blood is microscopically heterogeneous. This simplification is acceptable as 
long as the volume element in which oxygen transport occurs is large compared to the size of a single 
red cell but small compared to the size of the overall diffusion path. This seems reasonable when applied 
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to transport in membrane lungs, where the blood diffusion path thickness is usually equivalent to 20-30 
red cell diameters or more. 

The second important assumption is that the rate of reaction between 2 and hemoglobin is sufficiently 
fast, when compared to the rate of diffusion of 2 within the red cell, that the reaction can be considered 
at equilibrium, with the concentration of hemoglobin-bound 2 in the red blood cells directly related 
to the concentration of dissolved 2 in plasma via the oxyhemoglobin dissociation curve (ODC). Implicit 
in the use of this relationship is the assumption that the 2 diffusion resistance of the red cell membrane 
is insignificant. 

The human ODC for normal physiologic conditions is shifted to the right by increased temperature 
or decreased pH because of decreased hemoglobin affinity for 2 . The ODC is shifted to the right by an 
increase in the concentration of various organic phosphates, especially 2,3-diphosphoglycerate (DPG). 

Under typical venous physiologic conditions (37°C, boundary 2 partial pressures of 40 and 95 mmHg), 
the reaction between hemoglobin and oxygen reaches equilibrium during the time of contact between 
the blood and the gas phase. The ratio of the effective permeability of blood to that of plasma in a model 
system is 0.87 at the hematocrit of 45%; without any facilitation within the red cell, the ratio would be 
0.75. Using the most reliable data for the 2 solubility and diffusion coefficient in plasma leads to an 
estimate of 1.7 x 10~ 5 cm 2 /s for the effective diffusion coefficient of 2 in normal whole blood. 

We can now return to the analysis of convective transport of oxygen in a membrane lung, specifically 
the solution of Eq. (129.2) or its equivalent for other geometries, on the assumption of equilibrium for 
the hemoglobin oxygenation reaction in such devices. The various theoretical analyses that have been 
carried out differ primarily in the means by which the saturation curve is handled. The most common 
approach is to retain its intrinsic nonlinearity and to approximate it by a suitable analytical expression. 
The resulting nonlinear partial differential equation does not permit analytical solution, and it is therefore 
necessary to resort to numerical solution with a finite difference scheme on a digital computer. This 
approach yields numerical values for p and S as a function of x and r. To relate theoretical prediction 
with experimental measurement, one must calculate the velocity- weighted bulk average values of p and 
S [Colton, 1976]. The average 2 partial pressure, p, and oxyhemoglobin saturation, S, that would be 
measured if the blood issuing from the tube were physically mixed are then calculated from 
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Numerical solutions provide accurate predictions. However, they apply only to specific operating 
conditions and cannot be generalized for design purposes. Two methods have been used to simplify the 
ODC and provide approximate yet useful analytical solutions. 

The first is known as the advancing front theory [Marx et al., 1960; Thews, 1957]. The oxyhemoglobin 
saturation curve is approximated as a step function between the saturation values corresponding to the 
2 partial pressure and that of the gas phase or the blood-membrane interface. The blood is treated as 
two regions of uniform oxyhemoglobin saturation separated by a front that moves rapidly inward. Outside 
the front, blood is saturated at a value corresponding to the blood-gas or blood-membrane interface, 
and the rest of the blood is relatively unsaturated. Oxygen diffuses through the saturated blood to the 
interface, where it reacts with unsaturated hemoglobin. The advancing front approximation reasonably 
represents the calculated saturation profiles and has proved useful in developing analytical design expres- 
sions for membrane lungs in terms of saturation changes effected. However, it can be in error by a very 
wide margin for the prediction of 2 partial pressure changes. A modification of advancing front theory, 
involving approximation of the partial pressure changes. A modification of advancing front theory, 
involving approximation of the ODC by several straight line segments, retains the ability to provide an 
analytical solution and gives better accuracy than the conventional step function approximation of the 
ODC. 
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The second type of simplification is to approximate the ODC by a straight line drawn between the 
inlet and boundary 2 partial pressures, which make dS/dp, the slope of the saturation curve constant 
and renders Eq. (129.2) linear. Use can then be made of existing solutions for analogous convective heat 
and mass transfer problems without chemical reaction. 

For typical operating conditions in the clinic, the initial and boundary 2 partial pressures lie on the 
upper portion of the ODC, and the advancing front solution provides an overestimation of the rate of 
2 transport, whereas the constant slope solution provides an underestimate. Conversely, on the lower 
portion of the ODC at very low 2 partial pressures, the advancing front estimate of 2 transport rate 
is too low and constant slope too high. The constant slope approximation is most accurate over the steep 
portion of the saturation cure, where it is nearly linear. Since the 2 transport rate per unit membrane 
surface area is much more sensitive to blood inlet 2 partial pressure than would be expected solely from 
the change in the overall driving force, comparative testing of membrane lungs must be carried out with 
identical inlet blood 2 partial pressure and oxyhemoglobin saturation. 

Theoretical prediction of membrane lung performance is useful for design purposes and for providing 
a guide to the effect of permissive design variables. However, theoretical prediction cannot substitute for 
experimental data under closely controlled conditions where control of pH, temperature, and C0 2 partial 
pressure in fresh blood allow the definition of the appropriate ODC. 

129.5 C0 2 Transport 

The C0 2 dissociation curve for normal human blood is far more linear in its normal operating range 
than the ODC. The fractional volume of C0 2 that is removed in the process of arterialization of venous 
blood is also considerably less than the corresponding fractional loss of oxygen (about 10 percent of 
blood Co 2 content, versus 25 percent for oxygen). As is the case for oxygen, the total C0 2 concentration 
is far larger than that of gas physically dissolved in the aqueous component of blood. Plasma accounts 
for about two-thirds of all the C0 2 carried in blood, whereas typically about 98% of 2 is carried in the 
red cells. 

The main vehicles for CO, transport in blood are bicarbonate, the primary carrier in both plasma and 
red cells, and carbamino hemoglobin, where C0 2 is combined with the amino groups of hemoglobin 
(Fig. 129.2a). Arrows on Fig. 129. 2b indicate the direction and relative rate of each reaction whereby C0 2 
is removed in a membrane lung. Carbonate and hydrogen ions form bicarbonate, which decomposes to 
C0 2 or combines with another hydrogen ion to form carbonic acid; the latter is dehydrated to liberate 
C0 2 . Since the reactions that form C0 2 in plasma are very slow, biocarbonate is the predominant species. 
Biocarbonate can diffuse into the red cell, albeit slowly, in exchange for chloride, leading to the same 
chain of reactions. In the red cell, however, dehydration of carbonic acids is catalyzed by the enzyme 
carbonic anhydrase. This reaction liberates C0 2 , which, in turn, diffuses out of the red cell into the 
plasma and then across the blood-membrane interface. Decomposition of carbamino hemoglobin is a 
significant additional source of C0 2 . Carbamate compounds that arise from combination of C0 2 with 
plasma proteins have a much smaller effect because of the relatively unfavorable equilibria for their 
formation. Finally, various ionic species, such as organic and inorganic phosphates, amino acids, and 
proteins, behave as weak acids at pH 7.4. The buffering power of hemoglobin is particularly strong and 
has a marked effect in influencing the shape of the C0 2 dissociation curve. 

Under clinical conditions of 2 and C0 2 countertransport, two reciprocal phenomena occur which 
affect C0 2 and 2 exchange. A decrease of C0 2 partial pressure causes a shift to the left of the oxyhe- 
moglobin dissociation curve, leading to an increased affinity of hemoglobin when C0 2 is removed (Bohr 
effect). Meanwhile, because oxyhemoglobin is a stronger acid than hemoglobin, uptake of 2 decreases 
the affinity of hemoglobin for C0 2 , thereby releasing additional C0 2 from carbamino hemoglobin 
(Haldane effect). At the same time, increased acidity favors the conversion of more biocarbonate into 
carbonic acid, which then dissociates, releasing C0 2 . The simultaneous occurrence of these two effects 
enhances transport rates of both gases. 
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FIGURE 129.2a Schematic representation of C0 2 transfer from the red blood cell to plasma and into alveolar gas, 
emphasizing the various buffer systems involved. 
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FIGURE 129.2b Blood C0 2 content. C0 2 transport forms in mixed venous and arterial blood, and as components 
of the veno-arterial C0 2 difference. Observe that the red blood cells (RBC) bicarbonate content, although it represents 
a large fraction of C0 2 transport, does not contribute significantly to the arteriovenous difference in C0 2 content. 
Conversely hemoglobin-bound C0 2> though less abundant than bicarbonate in red blood cells, constitutes a larger 
fraction of the veno-arterial difference. 



© 2000 by CRC Press LLC 



If the entire reaction scheme in Fig. 129.2 is assumed to be at equilibrium, the C0 2 dissociation curve 
can be used to relate the total blood concentration of C0 2 (in all forms) to the C0 2 partial pressure. The 
C0 2 dissociation curve can be linearized in the same manner as the constant slope approximation for 
2 transport described above. A relation similar to Eq. (129.2) results, except that p refers to the CO, 
partial pressure, and the term (C T /a) (3S/3p) is replaced by the derivative of the total C0 2 concentration 
with respect to the concentration of dissolved C0 2 . The equation is now linear, and the same simplifi- 
cations hold as for the O, problem with a constant slope approximation. The membrane-limited case 
becomes particularly simple, and the membrane lung can be treated as simple mass or heat exchanger 
with a constant transport coefficient. This approach has been successfully used to correlate experimental 
data. 

129.6 Coupling of 2 and C0 2 Exchange 

In the conceptual representation of Fig. 129.3 [Colton, 1976], the ratio of the C0 2 transport rates is plotted 
against an index of the blood-side mass transfer efficiency divided by the membrane permeability on a 
logarithmic scale. With a constant slope approximation for both 2 and C0 2 transport, a unique curve is 
obtained for a specific membrane lung design if the abscissa is taken to be the ratio of the blood-side log- 
mean average C0 2 mass transfer coefficient divided by the C0 2 membrane permeability, and the curve is 
parameterized by unique values of three dimensionless quantities: (1) the ratio of the membrane perme- 
abilities for 2 and C0 2 , (2) the ratio of the average blood-side transfer coefficients for 2 and C0 2 , and 
(3) the ratio of the log-mean average 2 and C0 2 partial pressure driving force. The asymptotic limits 
plotted are very approximate characteristic values of a capillary or flat plate (sandwich) gas exchange device 
in which the limiting factor is either the blood mass transfer boundary layer or the membrane itself. 

When transport of both 2 and C0 2 is blood phase-limited — that is a relatively low blood-side mass 
transfer coefficient or high membrane permeability — the C0 2 transport rate is higher than the 2 rate, 
and it is necessary to add C0 2 to the inlet gas or to decrease gas flow rate to prevent excessive C0 2 
removal. At the other extreme, if gas transport in membrane limited, the ratio of C0 2 to 2 transport is 
always lower than the physiological value (0.8), because no existing gas-permeable membrane has a 
sufficiently high C0 2 /0 2 permeability ratio to overcome the unfavorable driving force ratio of the 2 
and C0 2 partial pressures (14:1) under the actual operating conditions of a membrane lung. Under either 
limiting condition, it is necessary to design the membrane lung on the basis of the gas that limits transport, 
2 , for blood-limited conditions and C0 2 for membrane-limited conditions. 
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FIGURE 129.3 Relative C0 2 and 2 transport in a membrane lung. 
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A priori, it seems undesirable to operate under blood-limited conditions because full advantage is not 
taken of the membrane permeability. Therefore, an increase in the blood-side mass transfer efficiency is 
valuable, but only to the point where the ratio of C0 2 to 2 transport rates is equal to the respiratory 
quotient. Beyond that point, further improvements in design will not reduce the size of the device required 
unless membrane permeability to C0 2 is increased, thereby moving the operating point on the curve to 
the left and justifying the use of a more efficient exchange device. For example, conventional solid silicone 
rubber membranes cannot take advantage of high-efficiency designs because gas transport is membrane 
limited and has to be designed on the basis of CO, transport rate. To make effective use of the most 
effective devices, it is necessary to employ microporous membranes or ultrathin membranes on 
microporous substrates where the C0 2 permeability is no longer a limiting factor. 

129.7 Shear-Induced Transport Augmentation and Devices 
for Improved Gas Transport 

There is evidence that flow-dependent properties of blood can substantially influence the transport of 
2 and C0 2 . The presence of velocity gradient in a stream can enhance mass transfer through blood 
either by shear-induced collision diffusion wherein interactions between red blood cells produce net 
lateral displacements and associated motions in the surrounding phase or by rotation of individual cells, 
which gives rise to local mechanical stirring of the adjacent fluid. Both mechanisms can lead to transport 
augmentation of species present in the dispersed or continuous phases. Only the first mechanism can 
cause dispersive migration of the particles themselves, and available evidence suggests that it is the 
dominant factor [Cha & Bessinger, 1994]. The shear-induced diffusion coefficient of particles in suspen- 
sion increases linearly with the shear rate and can be orders-of-magnitude larger than the Brownian 
motion diffusivity. The effect of lateral cell movement in oxygen transport in capillary tubes depends on 
both the shear rate and the slope of the ODC, with a maximum at the steepest portion of the curve, i.e., 
below the operating range of a clinical oxygenator. The extent to which such phenomena occur in blood 
under the clinical operating conditions of membrane lungs in cardiopulmonary bypass has not been 
investigated, but the effect on oxygen transport is thought to be significant. 

The earliest oxygenator configurations featuring rubber membranes made the blood flow in simple 
enclosed geometries, such as a flat plate or hollow fiber, that were inherently inefficient from a mass 
transfer standpoint. It was soon recognized that the gas transport rate was limited almost entirely by 
transport within the blood oxygenation boundary layer. There followed extensive efforts in the 1960s 
and early 1970s to investigate new approaches for improving gas transfer in membrane oxygenators, as 
summarized in Table 129.4. These approaches relied on one or more mechanisms: (1) increasing shear 
rate or producing turbulence; (2) keeping the oxygenation boundary layer very thin by using an appro- 
priate contacting geometry or pulsatile blood flow; and (3) making use of secondary flow, which incor- 
porates significant velocity components normal to the membrane surface. All these approaches are 
demonstrably effective under laboratory conditions, but few have achieved successful commercialization 
because of the constraints imposed by the geometry of available membrane materials (flat sheets and 
capillary tubes) and the clinical demand for simple, reliable, and inexpensive devices. 

In Table 129.4, passive designs for inducing secondary flows are those for which no energy source is 
needed except that required for steady blood flow. A common technique has been to place obstacles, for 

TABLE 129.4 Approaches for Improving Mass Transport in Membrane Oxygenators 

Passive designs Active designs 

Obstacles in blood path Oscillating toroidal chamber 

Membrane undulations from external supports Enclosed rotating disc 

Membrane texturing or embossing Pulsed flow vortex shedding 

Helical flow systems Couette flow 

External blood flow over hollow fibers Annulus, inner cylinder rotating axial flow, tangential flow. 
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example screens, in the blood path to induce secondary fluid movements and/or flow separation on a 
small scale and thereby increase blood mixing in flat plate devices. A similar result has been obtained by 
creating undulations in a flat membrane with grooved or multiple point supports or by using textured 
membranes to direct blood flow through the exchanger. 

Active signs are those in which there is energy input to achieve high shear rates or create secondary 
flows. Highly efficient oxygenator prototypes have been described, and at least two, the enclosed rotating 
disc and a pulsed-flow vortex shedding device, have been commercialized, but neither has achieved 
widespread clinical acceptance, in part because of the complexity or cumbersomeness of their operating 
mechanisms in an operating room setting. 

Another widely investigated approach has been to use flow geometries that naturally induce secondary 
flow, for example helical coils, where, superimposed on the primary flow, is a swirling motion in each 
half of the tube. The secondary flow trajectory which results from centrifugal effects takes particles from 
the periphery and carries them into the core, back to the periphery, back to the core in a continuously 
repeated fashion. Such secondary motion, by continuously sweeping oxygenated blood away from the 
membrane and replacing it with venous blood from the central core of the channel, can be extremely 
effective in increasing gas transport rates to the point where the dominant resistance to diffusion lies in 
the membrane. However, the practical difficulty of constructing such complex devices which are also 
disposable has thus far prevented industrial development. 

When the potential market for continuous extracorporeal membrane oxygenation collapsed, so did 
the intensive research and development effort in developing new devices. However, three developments, 
two technical and one marketing-related, over the next decade eventually led to dominance of membrane 
oxygenators for cardiopulmonary bypass. 

The first major technical advance was the fabrication of hydrophobic microporous capillary hollow 
fiber membranes at prices considerably lower than for silicon rubber membranes. The driving force for 
the initial development of these materials was their potential in another technology, membrane plasma- 
pheresis for the separation of plasma from the cellular components of blood. Hollow fibers for the 
application had nominal pore sizes, around 0.5 p.m, which were too large for membrane oxygenation 
because plasma could seep through the fiber wall under pressure leading to a catastrophic decrease in 
membrane permeability (129.1). In the early 1980s, microporous polypropylene hollow fibers with a 
nominal pore size around 0.1 p.m became available and proved satisfactory for membrane oxygenation. 
In addition to reduced trauma and competitive pricing with bubble oxygenators, membrane lungs could 
be employed with a reduced, fixed blood-priming volume, thereby minimizing transfusion and hemodi- 
lution problems. Surprisingly, this advantage was initially viewed as a drawback by perfusionists who 
were used to having a large venous blood reservoir with a visible gas-blood interface. 

The key marketing development of the mid-1980s was to make a membrane oxygenator by attaching 
it, along with a heat exchanger, to a clear plastic venous reservoir with a visible gas-blood interface. The 
astute move was breakthrough which put membrane oxygenators into the operating room. Their clear 
advantage in minimizing blood trauma and postoperative complications was so overwhelming that within 
a year membrane oxygenators captured the dominant market share. Shortly thereafter, bubble oxygen- 
ators were virtually eliminated from the U.S. marketplace. 

The most recent technical development has been the inversion of the usual internal flow arrangement 
so that, now, gas flows through the lumen of the hollow fibers while blood is pumped over the external 
surface of the capillary membrane. This arrangement is most effective when the blood flow is at right 
angles to the hollow fiber. In that configuration the flowing blood successively encounters different fibers, 
and a new oxygenation boundary layer forms on each fiber. Because the boundary layer is thinnest where 
it begins (in this case, at the front of each fiber), and the mass transfer rate is correspondingly highest, 
the transport of oxygen averaged over the periphery of each fiber is much higher than with the conven- 
tional internal (luminal) flow of blood. Thus the transport of oxygen averaged over the periphery of each 
fiber is much higher than with the conventional internal (luminal) flow of blood. The performance of 
various membrane oxygenators is compared in Table 129.5. The data clearly demonstrate the superior 
performance attainable with microporous hollow fibers operated with external crossflow of blood. 
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TABLE 129.5 Comparison of Membrane Oxygenator Performance (Blood flow rate = 
AAMI conditions) 



gas flow rate = 51/min; 



Model 



Membrane Form Blood Path Configuration 



2 , Flux 
cm 3 (STP)/min-m 3 



co 2 /o 2 

Flux Ratio 



Silicone Rubber 

Sci Med SM35 

Microporous polypropylene 

Cobe CMI 

Shiley M-2000 

Bentley Bos CM-40 

Terumo Capiox II 43 

Bard William Harvey 4000 

Johnson & Johnson Maxima 

Sarns 

Bentely Univox 



Sheet Spiral coil embedded failure 

Sheet Flat plate 

Sheet Blood screens 

Hollow fiber Internal flow 

Hollow fiber Internal flow 

Hollow fiber External cross flow 

Hollow fiber External cross flow 

Hollow fiber External cross flow 

Hollow fiber External cross flow 



90 



0.6 



130 


1.4 


120 


1.0 


70 


0.7 


60 


0.9 


140 


0.9 


150 


0.9 


150 


1.0 


160 


1.0 



The state of the art is now fairly advanced for membrane blood oxygenators, but there is still room 
for improvement. Further increases in flux would further reduce cost and minimize priming volume and 
blood consumption. Now that membrane oxygenators are fully entrenched, it is more likely that the 
reduced priming volume and improved control of a closed system can be realized. Lastly, the residual 
gas-blood interface that still exists at the microporous membrane surface could be eliminated by coating 
with a very thin skin in a n asymmetric or composite structure. 

Defining Terms 

Advancing front theory: A type of exchanger theory addressing the limitation of oxygen transport in 
a blood film through a fully saturated boundary layer leading to a front where the hemoglobin in 
flowing blood reacts with the dissolved oxygen. 

Arterialization: A gas exchange process whereby oxygen and carbon dioxide concentrations in venous 
blood are changed to levels characteristic of arterial blood. 

Artificial lung: A device which allows for continuous exchange for oxygen and carbon dioxide between 
circulating blood and a controlled gas atmosphere. 

Blood oxygenator: Synonymous with artificial lung, with the accent placed on oxygen transport, which 
is the most critical aspect of natural lung replacement, since the body oxygen reserves are very 
limited. Depending upon the physical process used for blood-gas transfer, artificial lungs are clas- 
sified as bubble oxygenators, stationary or rotating film oxygenators, and membrane oxygenators. 

Boundary layer: The film of blood adjacent to a permeable membrane, which, by reason of local fluid 
dynamics, is not renewed at the same rate as blood in the core of the flow path, thereby creating 
an additional diffusion barrier between the blood and the gas phase. 

Bubble oxygenator: Blood-gas transfer device in which a large exchange surface is obtained by the 
dispersion of oxygen bubbles in a venous blood stream, followed by coalescence of the foam and 
venting of excess gas (cocurrent blood and gas flow) or by spreading of venous blood over a 
continuously renewed column of foam generated by bubbling oxygen at the bottom of a reservoir 
(countercurrent blood and gas flow). 

Bypass: Derivation or rerouting of blood around an organ or body part, to diminish its blood supply, 
to abolish local circulation for the duration of a surgical intervention, or to increase blood flow 
permanently beyond an obstruction. The qualifier used with the word bypass designates the organ 
so isolated (e.g., left ventricular bypass, coronary artery bypass). 

Cardiopulmonary bypass (CPB): A procedure whereby blood is prevented from circulating through 
the heart cavities and the lungs. Cardiopulmonary bypass (also known as heart-lung bypass) can 
be partial if no obstacle is placed on venous return to the right heart cavities and consequently 
some of the blood continues to flow through the pulmonary circulation. In that case the arterial 
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system is fed in part by left ventricular output and in part by the arterialized blood perfusion from 
the hear-lung machine. The balance between the internal and extracorporeal blood circuits depends 
on the setting of the pumps and the relative resistance to flow in the two venous drainage pathways. 
During total cardiopulmonary bypass, the cardiac muscle must receive arterial blood from the 
extracorporeal circuit (intermittently or continuously) to prevent hypoxic myocardial damage. 
Since coronary venous blood drains into the cardiac cavities, this blood must be drained to the 
outside to prevent an intracavitary pressure increase which could be damaging to the heart. 

Catheter: A long hollow cylinder designed to be introduced in a body canal to infuse or withdraw 
materials into or out of the body. 

Coronary artery bypass graft (CABG): The construction of new blood conduits between the aorta (or 
other major arteries) and segments of coronary arteries beyond lesions which partially or totally 
obstruct the lumen of those vessels, for the purpose of providing an increased blood supply to 
regions of the myocardium made ischemic by those lesions. 

Extracorporeal circulation: Artificial maintenance of blood circulation by means of pumps located 
outside of the body, with blood fed through catheters advanced in an appropriate blood vessel and 
returning the blood to another blood vessel. 

Film oxygenator: Blood-gas transfer device in which a large exchange surface is obtained by spreading 
venous blood in a thin film over a stationary or moving physical support in an oxygen-rich 
atmosphere. 

Heart-lung bypass: Synonymous with cardiopulmonary bypass. 

Heart-lung machine: A mechanical system capable of pumping venous blood around the heart and 
the lungs and arterializing it in an appropriate gas exchange unit. 

Hemodilution: Temporary reduction in blood erythrocyte concentration (and consequently hemoglo- 
bin content, hematocrit, oxygen-carrying capacity, and viscosity) resulting from mixing with the 
erythrocyte-free or erythrocyte-poor content of the liquid used to prime an extracorporeal circuit. 

Hemolysis: The destruction of red blood cells with liberation of hemoglobin in surrounding plasma, 
caused by mechanical damage of the erythrocyte membrane, osmotic imbalance between intra- 
corpuscular and extracorpuscular ion concentration, or uncontrolled freezing-thawing cycles. 

Hollow fiber: A capillary tube of polymeric material produced by spinning a melted or dissolved 
polymer through an annular orifice. 

Membrane: A solid or liquid phase which acts as a barrier to prevent coalescence of neighboring 
compartments while allowing restricted or regulated passage of one or more molecular species. 

Membrane lung or Membrane oxygenator: Blood-gas transfer device in which the blood compartment 
is shielded from the gas phase by a porous or solid, hydrophobic polymer membrane permeable 
to gases but not to liquids (in particular, blood plasma). 

Metabolism: The sum of the chemical reactions occurring within a living body including build up 
(anabolism) and break down (catabolism) of chemical substances. 

Open heart surgery: Interventions taking place inside the cardiac cavities, such as for the replacement 
or reconstruction of cardiac valves, or the closure of abnormal communications between cardiac 
chambers, and which for reasons of convenience and safety, require the interruption of blood flow 
through the heart. By extension this stem is often used for cardiac interventions under total 
cardiopulmonary bypass, which address structures on the outside surface of the heart (such as 
coronary arteries) when drainage of the cardiac cavities through a vent is needed to avoid accu- 
mulation of coronary venous blood. 

Oxygenation boundary layer: Stationary or slowly moving blood layer adjacent to a gas-permeable 
membrane, which progressively develops along the blood path and, once enriched with oxygen 
diffusion through the membrane, effectively becomes a barrier to oxygen transport perpendicular 
to the direction of flow. 

Perfusion: A technique for keeping an organ or body part alive, though severed from its normal blood 
circulation, by introducing blood under pressure into the appropriate feeder artery. 
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Perfusionist: The operator of the heart-lung machine during cardiac surgery or respiratory assist 
procedures. 

Priming volume: The volume of liquid (blood, plasma, synthetic plasma expanders, or electrolyte 
solutions) needed to fill all components of an extracoporeal circuit (oxygenator, heat exchanger, 
blood pumps, filter, tubing, and catheters) so as to avoid exsanguination once the intracorporeal 
and extracorporeal circulation systems are joined. 

Pump-oxygenator: Equipment used to circulate blood through an extracorporeal circuit by means of 
mechanical blood pumps and to arterialize mixed venous blood by means of a gas exchange device. 
In most embodiments, the pump-oxygenator also serves to control blood temperature by means 
of a heat exchanger, typically incorporated in the gas exchange device. Synonymous with heart- 
lung machine. 

Respiratory quotient: The ratio of carbon dioxide produced by tissues or eliminated by the lungs to 
oxygen consumed by tissues or taken in through the lungs. 

Secondary flow: Any type of fluid motion, steady or periodic, in which the fluid is moving in a direction 
different from that of the primary flow. Secondary flow systems may be continuous in distribution, 
occupying the entire flow path, or comprise local elements that produce periodic remixing of the fluid. 

Total body perfusion: Maintenance of blood circulation through the arterial and venous system by 
means of a positive displacement pump introducing blood into an artery under pressure and 
collecting it from a vein for continuous recirculation. 
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130.1 Structure of Function of the Kidney 



The key separation functions of the kidney are: 

1. To eliminate the water-soluble nitrogenous end-products of protein metabolism 

2. To maintain electrolyte balance in body fluids and get rid of the excess electrolytes 

3. To contribute to obligatory water loss and discharge excess water in the urine 

4. To maintain acid-base balance in body fluids and tissues 

To fulfill these functions, the kidney processes blood — or more accurately, plasma water — which in turn 
exchanges water and solutes with the extravascular water compartments: extracellular, intracellular, and 
transcellular. The solute concentrations in body fluids vary from site to site, yet all compartments are 
maintained remarkably constant in volume and composition despite internal and external stresses. The 
global outcome of normal renal function is a net removal of water, electrolyte, and soluble waste products 
from the blood stream. The kidney provides the major regulatory mechanisms for the control of volume, 
osmolality, and electrolyte and nonelectrolyte composition as well as pH of the body fluids and tissues. 
Renal function is provided by paired, fist-sized organs, the kidneys, located behind the peritoneum 
against the posterior abdominal wall on both sides of the aorta. Each kidney is made up of over a million 
parallel mass transfer units which receive their common blood supply from the renal arteries, return the 
processed blood to the systemic circulation through the renal veins, and collect the waste fluids and 
solutes through the calyx of each kidney into the ureter and from there into the urinary bladder. These 
functional units are called nephrons and can be viewed as a sequential arrangement of mass transfer 
devices (glomerulus, proximal tubule, and distal tubule) for two fluid streams: blood and urine. 
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Kidney function is served by two major mechanisms: ultrafiltration, which results in the separation of 
large amounts of extracellular fluid through plasma filtration in the glomeruli, and a combination of 
passive and active tubular transport of electrolytes and other solutes, together with the water in which 
they are dissolved, in the complex system provided in the rest of the nephron. 

Glomerular Filtration 

The volume of blood flowing into the natural kidneys far exceeds the amount needed to meet their 
requirements for oxygen and nutrients: The primary role of the kidneys is chemical processing. As blood 
flows through the glomerular capillaries, about one-fifth of the plasma water is forced through permeable 
membranes to enter the proximal portion of the renal tubule and form the primary urine, which 
henceforth becomes the second fluid phase of the renal mass exchanger. The concentrated blood remain- 
ing in the vascular system is collected in the efferent arterioles and goes on to perfuse the tubules via the 
peritubular capillaries of the "vasa recta" system, where it recovers some of the lost water and eventually 
coalesces with the other blood drainage channels to form the renal vein. The plasma water removed from 
the blood in the glomerulus is termed the glomerular filtrate, and the process of removal is called 
glomerular filtration. Glomerular filtrate normally contains no blood cells and very little protein. Glom- 
erular filtration is a passive process driven by the differences in hydrostatic and oncotic pressures across 
the glomerular membrane. Solutes which are sufficiently small and not bound to large molecules pass 
quite freely through the glomerular membrane. All major ions, glucose, amino acids, urea, and creatinine 
appear in the glomerular filtrate at nearly the same concentrations as prevail in plasma. 

The normal glomerular filtration rate (GFR) averages 120 ml/min. This value masks wide physiological 
fluctuations (e.g., up to 30% decrease during the night, and a marked increase in the postprandial period). 
Although the kidneys produce about 170 liters of glomerular filtrate per day, only 1-2 liters of urine is 
formed. A minimum volume of about 400 ml/day is needed to excrete the metabolic wastes produced 
under normal conditions (often called obligatory water loss). 

Tubular Function 

In the tubule, both solute and water transport take place. Some materials are transported from the lumen 
across the tubular epithelium into the interstitial fluid surrounding the tubule and thence into the blood 
of the peritubular capillaries. This process is called reabsorption and results in the return of initially 
filtered solutes to the blood stream. Other substances are transported from the peritubular blood to the 
interstitial fluid across the tubular epithelium and into the lumen. This process is called secretion and 
leads to elimination of those substances to a greater extent than would be possible solely through 
glomerular filtration. The return of filtered molecules from the kidney tubule to the blood is accompanied 
by the passive reabsorption of water through osmotic mechanisms. 

The proximal tubule reabsorbs about two-thirds of the water and salt in the original glomerular filtrate. 
The epithelial cells extrude Na+ (and with it C1-) from the glomerular filtrate into the interstitial fluid. 
Water follows passively and in proportionate amounts because of the osmotic pressure gradient (the 
proximal tubular membrane is freely permeable to water). In the loop of Henle, the glomerular filtrate, 
now reduced to one-third of its original volume, and still isoosmotic with blood, is processed to remove 
another 20% of its water content. The active element is the ascending limb of the loop of Henle, where 
cells pump out Na+, K+, and CI- from the filtrate and move the Na+ and CI- to the interstitial fluid. 
Because the ascending limb is not permeable to water, tubular fluid becomes increasingly more dilute 
along the ascending loop. The blood vessels around the loop do not carry back all the extruded salt to 
the general circulation, and therefore the Na+ concentration builds up down the descending limb of the 
loop of Henle, reaching a concentration 4-5 times higher than isoosmolar. As a result, the Na+ concen- 
tration in tubular fluid increases as its volume is decreased by passive transport into interstitial fluid and 
from there into the blood. 

Countercurrent multiplication refers to the fact that the more Na+ the ascending limb extrudes the 
higher the concentration in the interstitial fluid, the more water removed from the descending limb by 
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osmosis, and the higher the Na+ concentration presented to the ascending limb around the bend of the 
loop. Overall, the countercurrent multiplier traps salt in the medullary part of the nephron because it 
recirculates its locally. Countercurrent exchange refers to the interaction of the descending and ascending 
branches of the circulatory loops (vasa recta) with the loop of Henle which flows in the opposite direction. 
Substances which pass from the tubule into the blood accumulate in high concentrations in the medullary 
tissue fluid. Na+ and urea diffuse into the blood as it descends along the loop but then diffuse out of 
the ascending vessels and back into the descending vessels where the concentration is lower. Solutes are 
therefore recirculated and trapped (short-circuited) in the medulla, but water diffuses out of the descend- 
ing vessel and into the ascending vessel to be transported out. 

The distal tubule of the kidney, located in the cortex, is the site of fine adjustments of renal excretion. 
Here again the primary motor is the Na+/K+ pump in the baso-lateral membrane, which creates a Na+ 
concentration gradient. The walls of the collecting duct, which traverses through a progressively hyper- 
tonic renal medulla tissue, are permeable to water but not to Na+ and C1-. As a result, water is drawn 
out and transported by capillaries to the general circulation. The osmotic gradient created by the coun- 
tercurrent multiplier system provides the force for water reabsorption from the collecting duct. However, 
the permeability of the cell membrane in the collecting duct is modulated by the concentration of 
antidiuretic hormone (ADH). A decrease in ADH impairs the reabsorption of water and leads to the 
elimination of a larger volume of more dilute urine. 

The terms reabsorption and secretion denote direction of transport rather than a difference in physi- 
ologic mechanism. In fact, a number of factors may impact on the net transport of any one particular 
solute. For example, endogenous creatinine (an end product of protein metabolism) is removed from 
plasma water through glomerular filtration in direct proportion to its concentration in plasma. Since it 
is neither synthesized nor destroyed anywhere in the kidney, and it is neither reabsorbed nor secreted in 
the tubule, its eventual elimination in the urine directly reflects glomerular filtration. Therefore, creatinine 
clearance can be used to measure glomerular filtration rate. However, glucose, which initially passes in 
the glomerular filtrate at the same concentration as in plasma, is completely reabsorbed from the tubular 
urine into peritubular capillaries as long as its plasma concentration does not exceed a threshold value 
somewhat above the level prevailing in normal subjects. As a result, there should be no glucose in the 
urine. When the threshold is exceeded, the amount of glucose excreted in the urine increases propor- 
tionately, producing glycosuria. 

Several weak organic acids such as uric acid and oxalic acid, and some related but not naturally 
occurring substances such as p-aminohippuric acid (PAH), barbiturates, penicillates, and some x-ray 
contrast media, have the special property of being secreted in the proximal tubule. For example, PAH 
concentration in glomerular filtrate is the same as in plasma water. So avid is the tubular transport system 
for PAH that tubular cells remove essentially all the PAH from the blood perfusing them. Therefore, the 
removal of PAH is almost complete, and the rate of appearance of PAH in the urine mirrors the rate of 
presentation of PAH to the renal glomeruli, that is to say, renal plasma flow. Therefore, PAH clearance 
can be used, in association with the hematocrit, to estimate the rate of renal blood flow. 

Urea appears in the glomerular filtrate at the same concentration as in plasma. However, one-third of 
urea diffuses back into the blood in the proximal tubule. In the distal nephron, urea (as an electrically 
neutral molecule without specific transport system) follows the fate of water {solvent drag). If large 
amounts of water are reabsorbed in the distal tubule and the collecting duct, then an additional third of 
the urea can be reabsorbed. However, if water diuresis is large, then correspondingly more urea is excreted. 

130.2 Kidney Disease 

The origin of kidney disease may be infectious, genetic, traumatic, vascular, immunologic, metabolic, or 
degenerative [Brenner & Rector, 1986]. The response of the kidneys to a pathologic agent may be rapid 
or slow, reversible or permanent, local or extensive. Under most circumstances, an abnormal body fluid 
composition is more likely to arise from the unavailability or excess of a raw material than from some 
intrinsic disturbance of renal function. This is why many clinical problems are corrected by fluid or 
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electrolyte therapy and secondarily by dietary measures and pharmacologic agents which act on the 
kidney itself. Only as a treatment of last resort, where kidney disease progresses to renal failure, do 
clinicians use extracorporeal body fluid processing techniques that come under the generic concept of 
dialysis. These invasive procedures are intended to reestablish the body's fluid and electrolyte homeostasis 
and to eliminate toxic waste products. Processing can address the blood (e.g., hemodialysis) or a proxy 
fluid introduced in body cavities (e.g., peritoneal dialysis). 

Even in healthy subjects, the GFR falls steadily from age 40 onward. Beyond age 80, it is only half of 
its adult value of 120 ml/min. However this physiologic deterioration is not extensive enough to cause 
symptoms. Since nature has provided kidneys with an abundance of overcapacity, patients do not become 
identifiably sick until close to 90% of original function has been lost. When kidneys keep deteriorating 
and functional loss exceeds 95%, survival is no longer possible without some form of replacement therapy. 

Supplementation (as distinct from replacement) of renal function by artificial means is occasionally 
used in case of poisoning. Toxic substances are often excreted into the urine of glomerular filtration and 
active tubular secretion, but the body load at times exceeds the kidneys' clearing capacity. There are no 
methods known to accelerate the active transport of poisons into urine. Similarly, enhancement of passive 
glomerular filtration is not a practical means to facilitate elimination of toxic chemicals. Processing of 
blood in an extracorporeal circuit may be life-saving when the amount of poison in the blood is large 
compared to the total body burden and binding of the compound to plasma proteins is not extensive. 
In such cases (e.g., methanol, ethylene glycol, or salicylates poisoning) extracorporeal processing of blood 
for removing the toxic element from the body is indicated. If the poison is distributed in the entire 
extracellular space or tightly bound to plasma proteins, dialytic removal is unlikely to affect the clinical 
outcome because it can only eliminate a small fraction of the toxic solute. 

Unfortunately in some situations either the glomerular or the tubular function of the kidneys, or both, 
fails and cannot be salvaged by drug and diet therapy. Failure can be temporary, self-limiting and 
potentially reversible, in which case only temporary substitution for renal function will be needed. Failure 
can also be the expression of progressive, intractable structural damage, in which case permanent replace- 
ment of renal function will eventually be needed for survival. However, the urgency of external inter- 
vention in end-stage renal disease (ESRD) is never as acute as is the case for the replacement of cardiac 
or respiratory function: The signs of renal dysfunction (water retention, electrolyte shifts, accumulation 
of metabolic end products normally eliminated by the kidneys) develop over days, weeks, or even months 
and are not immediately life threatening. Even in the end stage, renal failure can be addressed by 
intermittent rather than continuous treatment. 

130.3 Renal Failure 

There are two types of renal failure: acute (days or weeks) and chronic (months or years). Acute renal 
failure is typically associated with ischemia (reduction in blood flow), acute glomerulonephritis, tubular 
necrosis, or poisoning with "nephrotoxins" (e.g., heavy metals, some aminoglycosides, and excessive loads 
of free hemoglobin). Chronic renal failure is usually caused by chronic glomerulonephritis (of infectious 
or immune origin), pyelonephritis (ascending infection of the urinary tract), hypertension (leading to 
nephrosclerosis), or vascular disease (most commonly secondary to diabetes). 

Renal insufficiency elicits the clinical picture of uremia. Although the word uremia means that there 
is too much urea in the blood, urea level in itself is not the cause of the problem. Uremia, often expressed 
in the United States as blood urea nitrogen concentration or BUN (which is actually half the urea 
concentration), serves as an indicator of the severity of renal disease. Urea is a metabolic end product in 
the catabolism of proteins that is hardly toxic even in high concentration. However, it mirrors the impaired 
renal elimination and the resulting accumulation in body fluids of other toxic substances, some of which 
have been identified (e.g., phenols, guanidine, diverse polypeptides); others remain unknown and are 
therefore referred to as uremic toxins or, for reasons to be discussed later, middle molecules. The attenuation 
of uremic symptoms by protein restriction in the diet and by various dialytic procedures underscores 
the combined roles of retention, removal, and metabolism in the constellation of signs of uremia. Toxicity 
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may result from the synergism of an entire spectrum of accumulated molecules [Vanholder & Ringoir, 
1992]. It may also reflect the imbalance that results from a specific removal through mechanisms which 
eliminate physiologic compounds together with potential toxins. 

Not until the GFR (as estimated by its proxy, creatinine clearance) falls much below a third of normal 
do the first signs of renal insufficiency become manifest. At that point the plasma or extracellular 
concentration of substances eliminated exclusively through the glomeruli, such as creatinine or urea, 
increase measurably, and the possibility of progressive renal failure must be considered. In such cases, 
over a period of months to years, the kidneys lose their ability to excrete waste materials, to achieve 
osmoregulation, and to maintain water and electrolyte balance. The signs of ESRD become recognizable 
as creatinine clearance approaches 15 ml/min, eventually leading to uremic coma as water and solute 
retention depress the cognitive functions of the central nervous system. Empirically, it appears that the 
lowest level of creatinine clearance that is compatible with life is on the order of 8 ml/min, or 1 1.5 liters 
per day, or 80 liters per week. (These numbers have a bearing on the definition of adequate dialysis in 
ERSD patients, because they represent the time-averaged clearance which must be achieved by much 
more effective but intermittent blood processing). Human life cannot be sustained for more than 
7-10 days in the total absence of kidney function. Clinical experience also shows that even a minimum 
of residual renal clearance (K R ) below the level necessary for survival can be an important factor of well- 
being in dialyzed patients, perhaps because the natural kidney, however sick, remains capable of elimi- 
nating middle molecular weight substances, whereas the artificial kidney is mostly effective in eliminating 
water and small molecules. 

The incidence of ESRD ( incidence is defined as the number of new patients entering treatment during 
a given year) has increased dramatically in the past 25 years in the United States and elsewhere. Whereas 
in the 1960s it was estimated at 700-1000 new cases a year in the United States (nearly three-quarters of 
them between the ages of 25 and 54), the number of new patients reached 16,000 per year at the end of 
the 1970s (still with the majority of cases under age 54) and 40,000 at the end of the 1980s, with the 
largest contingent between 65 and 74 years old. Serious kidney disease now strikes between in 1 in 5000 
and 1 in 10,000 people per year in our progressively aging population. The fastest rate of growth is in 
the age group over 75, and the incidence of ESRD shows no signs of abating. 

The prevalence of ESRD {prevalence is defined as the total number of patients present in the population 
at a specific time) has grown apace: In the United States, about 1000 people were kept alive by dialysis 
in 1969; 58,253 in 1979; 163,017 in 1989; close to 200,000 now. This is the result of a combination of 
factors which include longer survival of patients on hemodialysis and absolute growth of an elderly 
population suffering from an increasing incidence of diseases leading to ESRD such as diabetes. World- 
wide, over 500,000 people are being kept alive by various modalities of "artificial kidney" treatment: 
about a third in the United States, a third in Europe, and a third in Japan and Pacific Rim countries. 
Another 500,000 or so have benefitted from dialytic treatment in the past but have since died or received 
transplants [Lysaght & Baurmeister, 1993]. Close to 85% of current patients are treated by maintenance 
hemodialysis, and 15% are on peritoneal dialysis. These numbers do not include about 100,000 people 
with a functional renal transplant, most of whom required hemodialysis support while waiting for a 
donor organ, and who may need it again, if only for a limited period, in case of graft rejection. 

The mortality of ESRD patients in the United States has inched upward from 12%— 16% per year in 
the 1970s and 1980s and has risen abruptly in recent years to levels in the order of 20%-25%. This has 
led to extensive controversy as to the origin of this deterioration, which has not been observed to the 
same extent in other regions with a similarly large population of ESRD patients, such as Western Europe 
and Japan, and may reflect for the United States insufficient dialysis as well as the burden of an increasingly 
older population. 

130.4 Treatment of Renal Failure 

Profound uremia, whether caused by an acute episode of renal failure or by the chronic progressive 
deterioration of renal function, used to be a fatal condition until the middle of the twentieth century. 
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The concept of clearing the blood of toxic substances while removing excess water by a membrane 
exchange process was first suggested by the experiments of Abel, Rowntree, and Turner at the Johns 
Hopkins Medical School. Back in 1913, these investigators demonstrated the feasibility of blood dialysis 
to balance plasma solute concentrations with those imposed by an appropriately formulated washing 
solution. However, their observation was not followed by clinical application, perhaps because experi- 
ments were limited by the difficulty of fabricating suitable exchange membranes, and blood anticoagu- 
lation was then extremely precarious. Collodion, a nitrocellulose film precipitated from an alcohol, ether, 
or acetone solution was the sole synthetic permeable membrane material available until the advent of 
cellophane in the 1930s. The unreliability of anticoagulants before the discovery of heparin also made 
continuous blood processing a hazardous process even in laboratory animals. 

In 1944, Kolff in the Netherlands developed an artificial kidney of sufficient yet marginal capacity to 
treat acute renal failure in man. This device consisted of a long segment of cellophane sausage tubing 
coiled around a drum rotating in the thermostabilized bath filled with a hypertonic, buffered electrolyte 
solution, called the dialysate. Blood was allowed to flow from a vein into the coiled cellophane tube. 
Water and solute exchange occurred through the membrane with a warm dialysate pool, which had to 
be renewed every few hours because of the risk of bacterial growth. The cleared blood was returned to 
the circulatory system by means of a pump. After World War II, a somewhat similar system was developed 
independently by Alwall in Sweden. Because of the technical difficulty of providing repeated access to 
the patient's circulation, and the overall cumbersomeness of the extracorporeal clearing process, hemo- 
dialysis was limited to patients suffering from acute, and hopefully reversible, renal failure, with the hope 
that their kidneys would eventually recover. To simplify the equipment, Inouye and Engelberg [1953] 
devised a coiled cellophane tube arrangement that was stationary and disposable, and shortly thereafter 
Kolff and Watschinger (by then at the Cleveland Clinic) reported a variant of this design, the Twin Coil, 
that became the standard for clinical practice for a number of years. 

Repeated treatment, as needed for chronic renal failure, was not possible until late 1959, when Scribner 
and Quinton introduced techniques for chronic access to the blood stream which, combined with 
improvements in the design and use of hemodialysis equipment, allowed the advent of chronic intermit- 
tent hemodialysis for long-term maintenance of ESRD patients. This was also the time when Kiil first 
reported results with a flat plate dialyzer design in which blood was made to flow between two sheets of 
cellophane supported by solid mats with grooves for the circulation of dialysate. This design — which had 
been pioneered by Skeggs and Leonard, McNeill, and Bluemle and Leonard — not only needed less blood 
volume to operate then the coiled tube devices, it also had the advantage of requiring a relatively low 
head of pressure to circulate the blood and the dialysate. This meant that the two fluids could circulate 
without high pressure differences across the membrane. Therefore, in contrast to coil dialyzers, where a 
long blood path necessitated a high blood pressure at the entrance of the exchanger, flat plate dialyzers 
could transfer metabolites through the membrane by diffusion alone, without coupling it with the 
obligatory water flux deriving from high transmembrane pressure. When ultrafiltration was needed, it 
could be achieved by circulating the dialysate at subatmospheric pressures. 

Device development was also encouraged by the growing number of home dialysis patients. By 1965, 
the first home dialysate preparation and control units were produced industrially. Home dialysis programs 
based on the twin coil or flat plate dialyzers were soon underway. At that time the cost of home treatment 
was substantially lower than hospital care, and in the United States, Social Security was not yet under- 
writing the cost of treatment of ESRD. 

In 1965 also, Bluemle and coworkers analyzed means to pack the maximum membrane area in the 
minimum volume, so as to reduce the bulkiness of the exchange device and diminish the blood loss 
associated with large dialyzers and long tubing. They concluded that a tightly packed bundle of parallel 
capillaries would best fit this design goal. Indeed by 1967, Lipps and colleagues reported the initial clinical 
experience with hollow fiber dialyzers, which have since become the mainstay of hemodialysis technology. 

In parallel developments. Henderson and coworkers [1967] proposed an alternative solution to the 
problem of limited mass transfer achievable by diffusion alone with hemodialysis equipment. They 
projected that a purely convective transport (ultrafiltration) through membranes more permeable to 
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water than the original cellulose would increase the effective clearance of metabolites larger than urea. 
The lost extracellular volume was to be replaced by infusing large volumes of fresh saline into the blood 
at the inlet or the outlet of the dialyzer to replace the lost water and electrolytes. The process was called 
hemodiafiltration or, sometimes, diafiltration. (The procedure in which solutes and water are removed 
by convective transport alone, using large pore membranes and without substantial replacement of the 
fluid, is now known as hemofiltration and is used primarily in patients presenting with massive fluid 
retention.) 

As is intuitively apparent, the effectiveness of hemodialysis with a given devices is related to the duration 
of the procedure. In the pioneer years, dubbed "the age of innocence" by Colton [1987], patients were 
treated for as many as 30 hours a week. Economics and patient convenience promoted the development 
of more efficient transfer devices. Nowadays, intermittent maintenance dialysis can be offered with 
10 hours (or even less) of treatment dividend in 3 sessions per week. Conversely, nephrologists have 
developed (mostly for use in the intensive care unit) the procedure known as continuous arterio-venous 
hemodialysis (with its variant continuous arterio-venous hemofiltration) in which blood pressure from 
an artery (aided or not by a pump) drives blood through the exchange device and back into a vein. 
Continuous operation compensates for the relatively low blood flow and achieves stable solute concen- 
trations, as opposed to the seesaw pattern that prevails with periodic treatment. 

The concept of using a biologic membrane and its blood capillary network to exchange water and 
solutes with a washing solution underlies the procedure known as peritoneal dialysis, which relies on the 
transfer capacity of the membranelike tissue lining the abdominal cavity and the organs it contains. In 
1976, Popovich and Moncrief described continuous ambulatory peritoneal dialysis (CAPD), a procedure 
in which lavage of the peritoneal cavity is conducted as a continuous form of mass transfer through 
introduction, equilibration, and drainage of dialyzate on a repetitive basis 4-6 times a day. In CAPD, a 
sterile solution containing electrolytes and dextrose is fed by gravity into the peritoneal cavity through 
a permanently installed transcutaneous catheter. After equilibriation with capillary blood over several 
hours, this dialyzate is drained by gravity into the original container and the process is repeated with a 
fresh solution. During the dwell periods, toxins and other solutes are exchanged by diffusional processes. 
Water transfer is induced by the osmotic pressure difference due to the high dextrose concentration in 
the treatment fluid. This procedure is analyzed in detail in Chapter 131. 

Plasmapheresis, i.e., the extraction of plasma from blood by separative procedures (see Chapter 132), 
has been used in the treatment of renal disease [Samtleben & Gurland, 1989]. However, the cost of 
providing fresh plasma to replace the discarded material renders plasmapheresis impractical for frequent, 
repeated procedures, and plasmapheresis is used mainly for other clinical indications. 

Most hemodialysis is performed in free-standing treatment centers, although it may also be provided 
in a hospital or performed by the patient at home. The hemodialysis circuit consists of two fluid pathways. 
The blood circuitry is entirely disposable, though many centers reuse some or all circuit components in 
order to reduce costs. It comprises a 16-gauge needle for access to the circulation (usually through an 
arteriovenous fistula created in the patient's forearm), lengths of plasticized polyvinyl chloride tubing 
(including a special segment adapted to fit into a peristaltic blood pump), the hemodialyzer itself, a 
bubble trap and an open mesh screen filter, various ports for sampling or pressure measurements at the 
blood outlet, and a return cannula. Components of the blood side circuit are supplied in sterile and 
nonpyrogenic conditions. The dialysate side is essentially a machine capable of (1) proportioning out 
glucose and electrolyte concentrates with water to provide a dialysate of appropriate composition; 
(2) sucking dialysate past a restrictor valve and through the hemodialyzer at subatmospheric pressure; 
and (3) monitoring temperature, pressures, and flow rates. During treatment the patient's blood is 
anticoagulated with heparin. Typical blood flow rates are 200-350 ml/min; dialysate flow rates are usually 
set at 500 ml/min. Simple techniques have been developed to prime the blood side with sterile saline 
prior to use and to return to the patient nearly all the blood contained in the extracorporeal circuit after 
treatment. Whereas most mass transport occurs by diffusion, circuits are operated with a pressure on 
the blood side, which may be 100-500 mmHg higher than on the dialysate side. This provides an 
opportunity to remove 2-4 liters of fluid along with solutes. Higher rates of fluid removal are technically 
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possible but physiologically unacceptable. Hemodialyzers must be designed with high enough hydraulic 
permeabilities to provide adequate fluid removal at low transmembrane pressure but not so high that 
excessive water removal will occur in the upper pressure range. 

Although other geometries are still employed, the current preferred format is a "hollow fiber" hemo- 
dialyzer about 25 cm in length and 5 cm in diameter, resembling the design of a shell and tube heat 
exchanger. Blood enters at the inlet manifold, is distributed to a parallel bundle of capillary tubes (potted 
together with polyurethane), and exits at a collection manifold. Dialysate flows countercurrent in an 
external chamber. The shell is typically made of an acrylate or polycarbonate resin. Devices typically 
contain 6000-10,000 capillaries, each with an inner diameter of 200-250 microns and a dry wall thickness 
as low as 10 microns. The total membrane surface area in commercial dialyzers varies from 0.5 to 1.5 m 2 , 
and units can be mass-produced at a relatively low cost (selling price around $10— $15, not including 
tubing and other disposable accessories). Several reference texts (see For Further Information) provide 
concise and comprehensive coverage of all aspects of hemodialysis. 

130.5 Renal Transplantation 

The uremic syndrome resembles complex forms of systemic poisoning and is characterized by multiple 
symptoms and side effects. Survival requires that the toxins be removed, and the resulting quality of life 
depends on the quantity of toxins which are actually eliminated. Ideally, one would like to clean blood 
and body fluids to the same extent as is achieved by normal renal function. This is only possible at the 
present time with an organ transplant. 

The feasibility of renal transplantation as a therapeutic modality for ESRD was first demonstrated in 
1954 by Murray and coworkers in Boston, and Hamburger and coworkers in Paris, in homozygous twins. 
Soon the discovery of the first immunosuppressive drugs led to the extension of transplantation practice 
to kidneys of live, related donors. Kidney donation is thought to be innocuous since removal of one 
kidney does not lead to renal failure. The remaining kidney is capable of hypertrophy, meaning that the 
glomeruli produce more filtrate, and the tubules become capable of increased reabsorption and secretion. 
A recent Canadian study indicates that the risk of ESRD is not higher among living kidney donors than 
in the general population, meaning that a single kidney has enough functional capacity for a lifetime. 
Nonetheless, cadaver donors now constitute the main organ source for the close to 10,000 renal trans- 
plants performed in the United States every year. Even though under ideal circumstances each cadaver 
donor allows two kidney transplants, the scarcity of donors is the major limitation to this form of 
treatment of ESRD. Most patients aspire to renal transplant because of the better quality of life it provides 
and the freedom from the time constraints of repeated procedures. However, the incidence of ESRD is 
such that only one patient in five can be kept alive by transplantation. Dialysis treatment remains a 
clinical necessity while waiting for a transplant, as a safety net in case of organ rejection, and for the 
many patients for whom transplantation is either contraindicated or simply not available. 

130.6 Mass Transfer in Dialysis 

In artificial kidneys, the removal of water and solutes from the blood stream is achieved by 

1. Solute diffusion in response to concentration gradients 

2. Water ultrafiltration and solute convection in response to hydrostatic and osmotic pressure gra- 
dients 

3. Water migration in response to osmotic gradients 

In most cases, these processes occur simultaneously and in the same exchange device, rather than 
sequentially as they do in the natural kidney with the cascade of glomerular filtration, tubular reabsorp- 
tion, and final adjustments in the collecting tubule. 
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Mechanistically, the removal of water and solutes from blood is achieved by passive transport across 
thin, leaky, synthetic polymer sheets or tubes similar to those used in the chemical process call dialysis. 
Functionally, an artificial kidney (also called hemodialyzer, or dialyzer or filter for short) is a device in 
which water and solutes are transported from one moving fluid stream to another. One fluid stream is 
blood; the other is dialysate: a human-made solution of electrolytes, buffers, and nutrients. The solute 
concentration as well as the hydrostatic and osmotic pressures of the dialysate are adjusted to achieve 
transport in the desired direction (e.g., to remove urea and potassium ions while adding glucose or 
bicarbonate to the bloodstream). 

Efficiency of mass transfer is governed by two and only two independent parameters. One, which 
derives from mass conservation requirements, is the ratio of the flow rates of blood and dialysate. The 
other is the rate constant for solute transport between the two fluid streams. This rate constant depends 
upon the overall surface area of membrane available for exchange, its leakiness or permeability, and such 
design characteristics as fluid channel geometry, local flow velocities, and boundary layer control, all of 
which affect the thickness of stationary fluid films, or diffusion barriers, on either side of the membrane. 

130.7 Clearance 

The overall mass transfer efficiency of a hemodialyzer is defined by the fractional depletion of a given 
solute in the blood as it passes through the unit. Complete removal of a solute from blood during a 
single pass defines the dialyzer clearance for that solute as equal to dialyzer blood flow. In other terms, 
dialyzer blood flow asymptotically limits the clearance of any substance in any device, however efficient. 
Under conditions of steady-state dialysis, the mass conservation requirement is expressed as 



N-- 



■Q B (C B -C B0 ) = Q D (C D0 -C D ) (130.1) 



where N is the overall solute transfer rate between blood and dialysate, Q B and Q D are blood flow and 
dialysate flow rates respectively, and C Bi , C Bo , C Di , and C Do , are the solution concentrations C in blood, 
B, or dialysate, D, at the inlet, ;', or the outlet, o of the machine. 

Equation (130.1) about mass conservation leads to the first and oldest criterion for dialyzer effective- 
ness, namely clearance K, modeled after the concept of renal clearance. Dialyzer clearance is defined as 
the mass transfer rate N divided by the concentration gradient prevailing at the inlet of the artificial 
kidney. 

N 
K= (130.2) 

Since mass transfer rate also means the amount of solute removed from the blood per unit of time, 
which in turn is equal to the amount of solute accepted in the dialysate per unit of time, there are two 
expressions for dialysance 

Q B (c B -C Bo ) 

K=^^ ^ (130.3) 
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which afford two methods for measuring it. Any discrepancy must remain within the error of measure- 
ments, which under the conditions of clinical hemodialysis easily approaches ± 10%. As in the natural 
kidney, the clearance of any solute is defined by the flow rate of blood which is completely freed of that 
solute while passing through the exchange device. The dimensions of clearance are those of flow (a virtual 
flow, one may say), which can vary only between zero and blood flow (or dialysate flow, whichever is 
smaller), much in the way the renal clearance of a substance can only vary between zero and effective 
renal plasma flow. 

Since dialyzer clearance is a function of blood flow, a natural way to express the efficiency of a particular 
exchange device consists of "normalizing" clearance with respect to blood flow as a dimensionless ratio 
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K/Q B can vary only between zero and one and represents the highest attainable solute depletion in the 
blood which is actually achieved in a particular device for a particular solute under a particular set of 
circumstances. 

Another generalization of the dialysance concept may be useful in the case where the direction of 
blood flow relative to the direction of dialysate flow is either parallel, random, or undetermined, as occurs 
with the majority of clinical hemodialyzers. Under such circumstances, the best performance which can 
be achieved is expressed by the equality of solute concentration in outgoing blood and outgoing dialysate 
(C Bo = C Do = C e or equilibrium concentration). This limit defines, after algebraic rearrangement of 
Eqs. (130.3) and (130.4), the maximal achievable clearance at any combination of blood and dialysate 
flow rates without reference to solute concentrations. 

K = Qb>< Qd (130.7) 

^ Q B + Qr, 

Since blood and dialysate flows can usually be measured with a reasonable degree of accuracy, the 
concept of K m . ix provides a practical point of reference against which the effectiveness of an actual dialyzer 
can be estimated. 

130.8 Filtration 

So far we implicitly assumed that differences in concentration across the membrane provide the sole 
driving force for solute transfer. In clinical hemodialysis, however, the blood phase is usually subject to 
a higher hydrostatic pressure than the dialysate phase. As a result, water is removed from the plasma by 
ultrafiltration, dragging with it some of the solutes into the dialysate. Ultrafiltration capability is a 
necessary consequence of the transmural pressure required to keep the blood path open with flat sheet 
or wide tubular membranes. It is also clinically useful to remove the water accumulated in the patient's 
body in the interval of dialysis. Ultrafiltration can be enhanced by increasing the resistance to blood flow 
at the dialyzer outlet, and thereby raising blood compartment pressure, by subjecting the dialysate to a 
negative pressure or by utilizing membranes more permeable to water than the common cellophanes. 

Whenever water is removed from the plasma by ultrafiltration, solutes are simultaneously removed in 
a concentration equal to or lower than that present in the plasma. For small, rapidly diffusible molecules 
such as urea, glucose, and the common electrolytes, the rate of solute removal almost keeps pace with 



© 2000 by CRC Press LLC 



that of water, and ultrafiltrate concentration is the same as that in plasma. With compounds characterized 
by a larger molecular size, the rate of solute removal lags behind that of water. Indeed with some of the 
largest molecules of biological interest, ultrafiltration leads to an actual increase in plasma concentration 
during passage through the artificial kidney. 

Defining ultrafiltration as the difference between blood flow entering the dialyzer and blood flow 
leaving the dialyzer 

one can rewrite the mass conservation requirement as 

Q Bi C Bi I Amount of solute in the incoming blood 
= Q Bo C Bo (amount of solute in the outgoing blood) 
+ -K"J C Bi — C Di 1 1 amount cleared in the dialyzer I 

The clearance equations can then be rewritten as 
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The clearance is now defined as the amount of solute removed from the blood phase per unit of time, 
regardless of the nature of the driving force, divided by the concentration difference between incoming 
blood and incoming dialysate. 



When C Di = 



k b = Qb-Qb, 
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When C Di = and C Bo = C Bj 
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The practical value of these equations is somewhat limited, since their application requires a high 
degree of accuracy in the measurement of flows and solute concentrations. The special case where there 
is no solute in the incoming dialysate {C Di = 0) is important for in vitro testing of artificial kidneys. 
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130.9 Permeability 

The definition of clearance is purely operational. Based upon considerations of conservation of mass, it 
is focused primarily on the blood stream from which a solute must be removed, thus, in final analysis, 
on the patient herself or himself. Clearance describes the artificial kidney as part of the circulatory system 
and of the fluid compartments which must be cleared of a given solute. To relate the performance of a 
hemodialyzer to its design characteristics, clearance is of limited value. 

To introduce into the picture the surface area of membrane and the continuously variable (but 
predictable) concentration difference between blood and dialysate within the artificial kidney, one must 
define the rate constant of solute transfer, or permeability Py_. 

N 

P T = (130.13) 

1 Ax AC 

where Nis the overall solute transport rate between blood dialysate, A is the membrane area, and AC is 
the average solute concentration difference between the two moving fluids. 

Permeability is defined by Eq. (130.13) as the amount of solute transferred per unit area and per unit 
of time, under the influence of a unit of concentration driving force. The proper average concentration, 
AC, driving force is the logarithmic mean of the concentration differences prevailing at the inlet and at 
the outlet 

AC= AC -- AC ° (130.14) 

AC 

o 

The boundary conditions on the concentration driving force (AC, and AC ) are uniquely determined 
by the geometry of the dialyzer. The three most common cases to consider are: (1) cocurrent flow of 
blood and dialysate; (2) laminar blood flow, with completely mixed dialysate flow; and (3) countercurrent 
blood and dialysate flow. The boundary conditions on concentration driving force follow. 

Cocurrent flow is 



Mixed dialysate flow is 



Countercurrent flow is 



C B~ C D t and C Bo~ C Do 



Cbi Cd<, an d C DBo C Do 



C„-C n and C„ -C n . 

Bi Do Bo Di 



Thus permeability can be expressed as in the following equations. Cocurrent flow is 

N ln ( C »- C j 

P L =—, S'fi" y (130.15) 
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Mixed dialysate flow is 



N 



ln(C Bi -C Do ) 



C Bo C Do 



a(c b -c Do )-(c Bo -c Do ) 



(130.16) 



Countercurrent flow is 



In 



N 



(c B ,-c Do ) 



c„-c 
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c a 



(130.17) 



By simultaneous solution of Eqs. (130.3), (130.4), and (130.12), and use of the formal definition of 
the logarithmic mean concentration driving force (130.14), the clearance ratio (K/Q B ) can be expressed 
as a function of two dimensionless parameters (Z and _R), neither of which involves solute concentration 
terms [Leonard & Bluemle, 1959; Michaels, 1966]. Cocurrent flow is 



K _ 1 
Q B ~ \ + Z 



l-expl-tffl+Z 



(130.18) 



Mixed dialysate flow is 



K _ l-exp(-i?) 
Q B 1 + zfl-expf-i?) 



(130.19) 



Countercurrent flow is 



K 



1 — exp 



il-Z] 



Q B Z-exp ^(l-ZJ 



(130.20) 



where Z = Q B /Q D and R = P L A/Q B 

Michaels has expressed graphically Eqs. (130.8-130.20) as plots of clearance ratio (K/Q B ) versus flow 
ratio (Q B /Q D ) with various solute transport ratios (Pj., A/Q B ) as parameters. These plots give an appre- 
ciation of the relative importance of the variables affecting dialyzer efficiency and permit one to recognize 
readily the factors which limit mass transfer under a particular set of conditions. 

For the computation of actual permeability coefficients, from pooled data obtained at varying solute 
concentrations, Eqs. (130.15-130.17) can be rearranged, using definitions of N, C Bo 
Eqs. (130.2-130.4). Cocurrent flow is 



and C Do from 
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(130.21) 
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Mixed dialysate flow is 

P=^ln 1 7 K ' Qd , (130.22) 

A 1- K /q b -k/q d 

Countercurrent flow is 

p z= QbQd ln l^ZQD (13023) 

4q d -q b ) !-^/Qb 

As remarked by Leonard and Bluemle [1959], when, and only when, Q D is much greater than Q B , the 
above equations (130.18-130.23) reduce to Renkin's [1956] formula 

R=^Mn -. — (130.24) 

A 1-KQ, 






■■ 1 — exp 



'p^ 



(130.25) 



Historically, Eq. (130.25) played an important role in pointing out to designers that the clearance ratio 
(K/Q B ) can be improved equally well by an increase in exchange area (A) or in permeability (P). However, 
caution is required in applying the equation to some of the efficient modern dialyzers. First, the assump- 
tion that dialysate flow is "infinitely" large with respect to blood flow is seldom verified. Furthermore, 
the functions relating permeability to dialysance, Eqs. (130.21-130.24), or to the individual solute con- 
centrations and flows, Eqs. (130.15-130.17), have an exponential form. When the overall permeability 
approaches that of the membrane alone, when the outgoing solute concentrations approach the equilib- 
rium conditions, or when clearance approaches blood flow, one deals with the steep part of that expo- 
nential function. Any slight error in the experimental measurements will lead to a disproportionately 
larger error in calculated permeability. 

130.10 Overall Transport 

In a dialyzer, separation occurs because small molecules diffuse more rapidly than larger ones and because 
the degree to which membranes restrict solute transport usually increases with permeant size (permse- 
lectivity). Fick's equation states that solute will move from a region of greater concentration to a region 
of lower concentration in a rate proportional to the difference in concentration on opposite sides of the 
membrane 

<fr = -D— (130.26) 

dx 

where (]) = unit solute flux in g/cm 2 -s; D = diffusion coefficient cm 2 /s; c = concentration in g/cm 3 ; and 
x = distance in cm. The minus sign accounts for the convention that flux is considered positive in the 
direction of decreasing concentration. Diffusion coefficient decreases roughly in proportion to the square 
root of molecular weight. 
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Ignoring boundary layer effects for the moment, and assuming that diffusion within the membrane 
is analogous to that in free solution, Eq. (130.26) can be integrated across a homogeneous membrane of 
thickness d to yield 

SD Ac , 

<j) = m (130.27) 

d 

where S represents the dimensionless solute partition coefficient, i.e., the ratio of solute concentration 
in external solution to that at the membrane surface, and D m represents solute diffusion within the 
membrane and is assumed to be independent of solute concentration in the membrane. If two or more 
solutes are dialysing at the same time, the degree of separation or enrichment will be proportional to 
the ratio of their permeabilities. The closer the permeability of a membrane is to that of an equivalent 
thickness of free solution, the more rapid will be the resultant dialytic transport. Equation (130.27) is 
often further simplified to this expression for flux per unit of membrane area 

ty = P J ,AC (130.28) 

where thickness is incorporated into an overall membrane mass transfer coefficient with units of cm/s, 
and AC is the logarithmic mean concentration. 

Chemical engineers provided a firm foundation for describing the overall performance of hemodia- 
lyzers recognizing the importance of understanding and describing mass transfer in each of the three 
phases of a hemodialyzer (blood, membrane, dialysate), the individual mass transfer resistances of which 
sum to the overall mass transfer resistance of the device [Colton, 1987]. Solutions adjacent to the 
membranes are rarely well mixed, and the resistance to transport resides not just in the membrane but 
also in the fluid regions termed boundary layers, on both the dialysate and blood side. Moreover, some 
dialyzers are designed to direct flow parallel to the surface of the membrane rather than expose it to a 
well-mixed bath. Boundary layer effects typically account for 25-75% of the overall resistance to solute 
transfer [Lysaght & Baurmeister, 1993]. In many exchanger designs, boundary layer effects can be 
minimized by rapid convective flow targeted to the surface of the membrane where fluid pathways are 
thin, flow near the membrane is laminar, and boundary layer resistance decreases with increasing wall 
shear rates. When geometry permits higher Reynolds numbers, flow becomes turbulent, and fluid resis- 
tance varies with net tangential velocity. Geometric obstacles (e.g., properly spaced obstacles) or fluid 
mechanical modulation (e.g., superimposed pulsation) are often-used tactics to minimize boundary layer 
effects, but all result in higher energy utilization. Quantitatively, the membrane resistance becomes part 
of an overall mass transfer parameter P E which for conceptual purposes can be broken down into three 
independent and reciprocally additive components for the triple laminate: blood boundary layer (sub- 
script B), membrane (subscript M), and dialysate boundary layer (subscript D), such that 

— = — + — +— (130.29) 

P P P P 

I B M D 

or reciprocally 

R Z = R B + R M + R D (130.30) 

where P E is the device-averaged mass transfer coefficient (or permeability) in cm/s and R^ is the device- 
averaged resistance in s/cm. D B can be estimated for many relevant conditions of geometry and flow 
using mass transport analysis based upon wall Sherwood numbers [Colton et al., 1971]. P M is best 
obtained by measurements employing special test fixtures in which boundary layer resistances are negligible 
or known [Klein et al., 1977]. P D is more problematic and is usually obtained by extrapolations based 
upon Wilson plots [Leonard &Bluemle, I960]. Boundary layer theory, as well as technique for correlation, 
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estimation, and prediction of the constituent mass transfer coefficients, is reviewed in detail by Colton 
and coworkers [1971] and Klein and coworkers [1977]. Overall solute transport is obtained from local 
flux by mass balance and integration; for the most common case of countercurrent flow 
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(130.31) 



where C m and C Di represent inlet concentrations in the blood and dialysate streams in g/cm 3 , A represents 
membrane surface area in cm 2 , Q B and Q D are blood and dialysate flow rates in cm 3 /min, and <I> and Pt_ 
are as defined in Eqs. (130.28) and (130.29). Derivations of this relationship and similar expressions for 
cocurrent or crossflow geometries can be found in reviews by Colton and Lowrie [1981] and Gotch and 
colleagues [1972]. 

As pointed out by Lysaght and Baurmeister [1993], hemodialysis is a highly constrained process. 
Molecular diffusion is slow, and the driving forces are set by the body itself, decreasing in the course of 
purification and not amenable to extrinsic augmentation. The permeant toxic species are not to be 
recovered, and their concentrations are necessarily more dilute in the dialysate than in the incoming 
blood. The flow and gentle nature of dialysis has a special appeal for biologic applications, particularly 
when partial purification of the feed stream, rather than recovery of a product, is intended. 

130.11 Membranes 

Hemodialysis membranes vary in chemical composition, transport properties, and, as we will see later, 
biocompatibility. Hemodialysis membranes are fabricated from these classes of materials: regenerated 
cellulose, modified cellulose, and synthetics [Lysaght & Baurmeister, 1993]. Regenerated cellulose is most 
commonly prepared by the cuproamonium process and are macroscopically homogenous. These 
extremely hydrophilic structures sorb water, bind it tightly, and form a true hydrogel. Solute diffusion 
occurs through highly water-swollen amorphous regions in which the cellulose polymer chains are in 
constant random motion and would actually dissolve if they were not tied down by the presence of 
crystalline regions. Their principles advantage is low unit cost, complemented by the strength of the 
highly crystalline cellulose, which allows polymer films to be made very thin. These membranes provided 
effective small-solute transport in relatively small exchange devices. The drawbacks of regenerated cel- 
lulose are their limited capacity to transport middle molecules and the presence of labile nucleophilic 
groups which trigger complement activation and transient leukopenia during the first hour of exposure 
to blood. The advantages appear to outweigh the disadvantages, since over 70% of all hemodialyzers are 
still prepared from cellulosics, the most common of which is supplied by Akso Faser AG under the trade 
name Cuprophan. 

A variety of other hydrophilic polymers account for 20% of total hemodialyzer production, including 
derivatized cellulose, such as cellulose acetate, diacetate, triacetate, and synthetic materials such as poly- 
carbonate (PC), ethylenevinylalcohol (EVAL), and polyacrylonitrile-sodium methallyl sulfonate copoly- 
mer (PAN-SO3), which can all be fabricated into homogeneous films. 

At the opposite end of the spectrum are membranes prepared from synthetic engineered thermoplas- 
tics, such as polysulfones, polyamides, and polyacylonitrile-polyvinylchloride copolymers. These hydro- 
phobic materials, which account for about 10% of the hemodialyzer market, form asymmetric and 
anisotropic membranes with solid structures and open void spaces (unlike the highly mobile polymeric 
structure of regenerated cellulose). These membranes are characterized by a skin on one surface, typically 
a fraction of a micron thick, which contains very fine pores and constitutes the discriminating barrier 
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determining the hydraulic permeability and solute retention properties of the membrane. The bulk of 
the membrane is composed of a spongy region, with interstices that cover a wide size range and with a 
structure ranging from open to closed cell foam. The primary purpose of the spongy region is to provide 
mechanical strength; the diffusive permeability of the membrane is usually determined by the properties 
of this matrix. As the convective and diffusive transport properties of these membranes are, to a large 
extent, associated independently with the properties of the skin and spongy matrix, respectively, it is 
possible to vary independently the convective and diffusive transport properties with these asymmetric 
structures. There is often a second skin on the other surface, usually much more open than the primary 
barrier. These materials are usually less activating to the complement cascade than are cellulosic mem- 
branes. The materials are also less restrictive to the transport of middle and large molecules. Drawbacks 
are increased cost and such high hydraulic permeability as to require special control mechanisms to avoid 
excess fluid loss and to raise concerns over the biologic quality of dialysate fluid because of the possibility 
of back filtration carrying pyrogenic substances to the blood stream. 

The discovery of asymmetry membrane structures launched the modern era of membrane technology 
by motivating research on new membrane separation processes. Asymmetric membranes proved useful 
in ultrafiltration, and a variety of hydrophobic materials have been used including polysulfone (PS), 
polyacrylonitrile (PAN), its copolymer with polyvinylchloride (PVC), polyamide (PA), and polymethyl 
methacrylate (PMMA). PMMA does not form an obvious skin surface and should perhaps be placed in 
a class of its own. 

130.12 Hemofiltration 

Although low rates of ultrafiltration have been used routinely for water removal since the beginning of 
hemodialysis, the availability of membranes with very high hydraulic permeabilities led to radically new 
approaches to renal substitutive therapy. Such membranes allowed uniformly high clearance rates of 
solutes up to moderate molecular weights (several thousands) by the use of predominantly convective 
transport, thereby mimicking the separation capabilities of the natural kidney glomeruli. Progress in the 
development of this pressure-driven technique, which has come to be known as hemofiltration, has been 
reviewed by Henderson [1982], Lysaght [1986], and Ofsthum et al. ] 1986]. 

In ultrafiltration, the solute flux J s (the rate of solute transport per unit membrane surface area) is 
equal to the product of the ultrafiltrate flux J F (the ultrafiltrate flow rate per unit membrane surface area) 
and the solute concentration in the filtrate, c p In turn c F is related to the retentate concentration C R in 
the bulk solution above the membrane by the observed rejection coefficient R: 

J s = J p c f = J p (i-r)c r (130.32) 

Thus, knowledge of the ultrafiltrate flux and observed rejection coefficient permits prediction of the 
rate of solute removal. 

With increasing transmembrane pressure difference, the ultrafiltrate flux increases and then levels off 
to a pressure-independent value. This behavior arises from the phenomenon of concentration polarization 
[Colton 1987]. Macromolecules (e.g., proteins) that are too large to pass through the membrane build 
up in concentration in a region near the membrane surface. At steady state, the rate at which these 
rejected macromolecules are convected by the flow of fluid towards the membrane surface must be 
balanced by the rate of convective diffusion away from the surface. Estimation of the ultrafiltrate flux 
reduces largely to the problem of estimating the rate of back transport of macromolecules away from 
the membrane surface 

f p = kln°^ (130.33) 
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where k is the mass transfer coefficient for back transport of the rejected species, and c and c ph are the 
plasma concentrations of rejected species at the membrane surface and in the bulk plasma, respectively. 
Attainment of an asymptotic, pressure-independent flux is consistent with the concentration at the wall 
c pw reaching a constant value. As with diffusive membrane permeability, solute rejection coefficients must 
be measured experimentally, since the available theoretical models and details of membrane structure 
are inadequate for prediction. 

In hemofiltration the magnitude of the maximum clearance is determined by the blood and ultrafiltrate 
flow rates and whether the substitution fluid is added before or after filtration. Solutes with molecular 
weights up to several thousand are cleared at essentially the same rate in hemofiltration, whereas there 
is a monotonic decrease with increasing molecular weight in hemodialysis. If a comparison is made with 
devices of equal membrane surface area, it is generally found that hemodialysis provides superior clear- 
ance for low-molecular- weight solutes such as urea. The superiority of hemofiltration becomes apparent 
at molecular weights of several hundred. 

Hemodialysis and hemofiltration represent two extremes with membranes having relatively low and 
relatively high hydraulic permeabilities, respectively. As a variety of new membranes became available 
with hydraulic permeabilities greater than that of regenerated cellulose, various groups began to examine 
new treatment modalities in which hemodialysis was combined with controlled rates of ultrafiltration 
which were higher than those employed in conventional hemodialysis but smaller than those used in 
hemofiltration [Funck-Brenato et al., 1972; Lowrie et al., 1978; Ota et al., 1975]. The advantage of such 
an approach is that it retains the high clearance capabilities of hemodialysis for low-molecular-weight 
solutes while adding enhanced clearance rates for the high-molecular-weight solutes characteristic of 
hemofiltration. A variety of systems is now commercially available and in clinical use, mainly in Europe 
and Japan. The proliferation of mixed-mode therapies has led to a panopoly of acronyms: hemodialysis 
(HD), hemofiltration (HF), high-flux dialysis (HFD), hemodiafiltration (HDF), biofiltration (BF), con- 
tinuous arteriovenous hemofiltration (CAVH), continuous arteriovenous hemodialysis (CAVHD), slow 
continuous ultrafiltration (SCUF), simultaneous dialysis and ultrafiltration (SDUF), and so on. 

Rigorous description of simultaneous diffusion and convection in artificial kidneys has not yet been 
carried out. Available analyses span a wide range of complexity and involve, to varying degrees, simplifying 
assumptions. Their predictions have not been systematically compared with experimental data. In view 
of the growing interest in various "high-flux" membranes and their application for enhanced solute 
removal rates and/or shortened treatment times, further refinement may be helpful. 

130.13 Pharmacokinetics 

Whereas the above analysis is founded on understanding the solute-removal capabilities of hemodialyzers, 
clinical application must also consider the limitations imposed by the transport of solute between body 
fluid compartments. The earliest physiologic models were produced by chemical engineers [Bell et al., 
1965; Dedrick & Bischoff, 1968] using techniques which had been developed to describe the flow of 
material in complex chemical processes and were applied to the distribution of drugs and metabolites 
in biologic systems. This approach has progressively found its way into the management of uremia by 
hemodialysis [e.g., Farrell, 1983; Gotch & Sargent, 1983; Lowrie et al., 1976; Sargent et al., 1978]. 

Pharmacokinetics summarizes the relationships between solute generation, solute removal, and con- 
centration in the patient's blood stream. It is most readily applied to urea as a surrogate for other uremic 
toxins in the quantitation of therapy and in attempts to define its adequacy. In the simplest case, the 
patient is assumed to have no residual renal function and to produce no urea during the relatively short 
periods of dialysis. Urea is generated in the body from the breakdown of dietary protein, which empirically 
has been found to approximate 

G=0.11 7-0.12 (130.24) 
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where G is the urea generation rate and I the protein intake (both in mg/min). If reliable measurements 
of / are not available, one assumes an intake of 1 gram of protein per kg of body weight per day. 

Urea accumulates in a single pool equivalent to the patient's total body water and is removed uniformly 
from that pool during hemodialysis. Mass balance yields the following differential equation: 

dlcV) 

-^ — '- = G-Kc (130.35) 

dt 

where c is the blood urea concentration (equal to total body water urea concentration) in mg/ml; Vis 
the urea distribution volume in the patient in ml; G is the urea generation rate in mg/min; t is the time 
from onset of hemodialysis in minutes; and K is the urea clearance in ml/min. V can be measured by 
tritiated water dilution studies but is usually 58% of body weight. Generation is calculated from actual 
measurement or estimate of the patient's protein intake (each gram of protein consumed produces about 
250 mg of urea). Therefore, a 70-kg patient, consuming a typical 1.0 g of protein per kilogram of body 
weight per day, would produce 28 g of urea distributed over a fluid volume of 40.6 L. In the absence of 
any clearance, urea concentration would increase by 70 mg/100 ml every 24 hours. The reduction of urea 
concentration during hemodialysis is readily obtained from Eq. (130.32) by neglecting intradialytic 
generation and changes in volume: 
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(130.36) 



where c' and c' represent the urea concentrations in the blood at the beginning and during the course 
of treatment. A 3-1/2-hour treatment of a 70-kg patient (V = 40.6 L) with a urea clearance of 200 ml/min 
would lead to a 64% reduction in urea concentration or a value of 0.36 for the c'lc' ratio. (This parameter 
almost always falls between 0.30 and 0.45.) 

The increase in urea concentration between hemodialysis treatments is obtained from Eq. (130.33), 
again assuming a constant V; 

c' = c f + — t (130.37) 

V 

where d is the urea concentration in the patient's blood at the end of the hemodialysis and c' the 
concentration at time t during the intradialytic interval. Urea concentration typically increases by about 
50-100 mg/100 ml/24 hours. Even a small residual renal clearance will prove numerically significant. 
Therefore in oliguric patients who still exhibit a minimum of kidney function, one should use the slightly 
more complex equations given by Sargent and Gotch [1989] or Farrell [1988]. 

The exponential decay constant in Eq. (130.33), Kt/v, expresses the net normalized quantity of hemo- 
dialysis therapy received by a uremic patient. It is calculated simply by multiplying the urea clearance of 
the dialyzer (in ml/min) by the duration of hemodialysis (in min) and dividing by the distribution volume 
(in ml) which in the absence of a better estimate is taken as 0.58 x body weight. Gotch and Sargent 
[1983] first recognized that this parameter provides an index of the adequacy of hemodialysis. Based 
upon a retrospective analysis of various therapy formats, they suggested a value of 1.0 or greater as 
representing an adequate amount of hemodialysis for most patients. Although not immune to criticism, 
this approach has found widespread clinical acceptance and represents the current prescriptive norm in 
hemodialysis therapy. 
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TABLE 130.1 Uremic Syndrome under Dialysis 



Adverse effects of uremia can be attributed to: 

1. Retention of solutes normally degraded or excreted by the kidneys. 

2. Overhydration associated with inadequate balance between fluid intake and water removal. 

3. Absence of factors normally synthesized by the kidneys. 

4. Pathophysiologic response to the decline in renal function on the part of other organ systems. 

5. Pathologic response of the organism to repeated exposure to damaging procedures and foreign materials. 



130.14 Adequacy of Dialysis 



As outlined in Table 130.1, the uremic syndrome under dialysis is more complex than observed in ESRD 
before the institution of treatment. The pathology observed not only is related to insufficient removal 
of toxic solutes but also comprises some unavoidable adverse effects of extracorporeal blood processing, 
including the interactions of blood with foreign materials [Colton et al., 1994]. The attenuation of uremic 
syndrome symptoms by protein restriction in the patient's diet and by various dialytic procedures 
underscores the combined roles of retention, removal, and metabolism in the constellation of signs of 
the disease. Toxicity may result from the synergism of the entire spectrum of accumulated molecules, 
which is surprisingly large (see Table 130.2 and Vanholder and Ringoir [1992]. The uremic syndrome 
resembles complex forms of systemic poisoning and is characterized by multiple symptoms and side 
effects. Survival requires that the toxins be removed, and survival quality depends on the quantity of 
toxins that are actually eliminated. Ideally, one would like to clean blood and body fluids to the same 
extent as is achieved by normal renal function. This is possible with an organ transplant that works 
without interruption but is only asymptotically approached with intermittent dialysis. 

There is a compelling need for objective definition of the adequacy of ESRD treatment: How much 
removal in how much time is necessary for each individual? The answer is indirect and approximate. 
Some define adequacy of dialysis by clinical assessment of patient well-being. More sophisticated proce- 
dures, such as electromyography, electroencephalography, and neuropsychologic tests, may refine the 
clinician's perception of inadequate dialysis. Yet inadequate therapy can remain unrecognized when 
therapeutic decisions are based exclusively on clinical parameters. The inverse is also true, and follow- 
up of dialysis adequacy should never be restricted to static markers of toxicity or dynamic biochemical 
parameters such as clearance, kinetic modeling, and the like. 

Most patients undergoing dialysis do not work or function as healthy people do, and often their 
physical activity and employment status does not go beyond the level of taking care of themselves. In 
many centers, the best patients in a hemodialysis program are selectively removed for transplantation. 
Hospitalization rate is an approximate index of dialysis inadequacy. About 25 percent of all hospitaliza- 
tions are due to vascular access problems. Comparison among centers may be difficult, however, because 
of differences in local conditions for hospital admission. Vanholder and Ringoir [1992] have attempted 
to relate the adequacy of dialysis to the relevant solute concentrations in blood and distinguish among 
solute-related factors, patient-related factors, and dialysis-related factors (Table 130.3). Their analysis 
constitutes a useful point of departure for adjusting the quantity of dialysis to the specific needs of an 
individual patient, which is a complex problem, since it requires not only an appreciation of what the 
removal process can do, but also of the generation rate of metabolic end products (related to nutrition, 
physical activity, fever, etc.) and the dietary load of water and electrolytes. Dialysis patients are partially 
rehabilitated, but their condition rarely compares to that of recipients of a successful renal transplant. 



130.15 Outlook 



The treatment of chronic renal failure by artificial kidney dialysis represents one of the most common, 
and certainly the most expensive, component of substitutive medicine. From an industrial viewpoint, 
500,000 patients each "consuming" perhaps 100 hemodialysis filters per year (allowing for some reuse 
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TABLE 130.2 Uremic Solutes with Potential Toxicity 



Urea 
Guanidines 

Methylguanidine 

Guanidine 

p-guanidinipropionic acid 

Guanidinosuccinic acid 

Gamma-guanidinobutyric acid 

Taurocyanine 

Creatinine 

Creatine 

Arginic acid 

Homoarginine 

N-a-acetylarginine 
Phenols 

O-cresol 

P-cresol 

Benzylalcohol 

Phenol 

Tyrosine 
Phenolic acids 

P-hydroxyphenylacetic acid 

p-(m hydroxyphenyl)- 
hydracrylic acid 
Hippurates 

p-(OH)hippuric acid 

o-(OH)hippuric acid 

Hippuric acid 
Benzoates 
Polypeptides 
P 2 -microglobulin 
Indoles 

Indol-3-acetic acid 

Indoxyl sulfate 

5-hydroxyindol acetic acid 

Indo-3-acrylic acid 

5-hydroxytryptophol 

N-acetyltryptophan 

Tryptophan 



Middle molecules 

Ammonia 

Alkaloids 

Trace metals (e.g., bromine) 

Uric acid 

Cyclic AMP 

Amino acids 

Myoinositol 

Mannitol 

Oxalate 

Glucuronate 

Glycols 

Lysozyme 

Hormones 

Parathormone 

Natriuretic factor 

Glucagon 

Growth hormone 

Gastrin 

Prolactin 

Catecholamines 
Xanthine 
Hypoxanthine 
Furanpropionic acid 
Amines 

Putrescine 

Spermine 

Spermidine 

Dimethylamine 

Polyamines 
Endorphins 
Pseudouridine 
Potassium 
Phosphorus 
Calcium 
Sodium 
Water 
Cyanides 



from the 150 units per year that would be needed for 3 times per week treatment) means a production 
of 50 million filters. With each unit selling for an approximate price of 15 dollars, the world market is 
on the order of $750 million. From a public health viewpoint, if one is to take the U.S. figure of $30,000 
for the world average annual cost of a single dialysis patient, the aggregate economic impact of the medical 
application of hemodialysis approaches $15 billion a year (of which less than 10 percent is spent on the 
purchase of technology; health care personnel costs are the most expensive component of the treatment). 
Yet "maintenance dialysis on the whole is non-physiological and can be justified only because of the 
finiteness of its alternative" [Burton, 1976]. Dialytic removal remains nonspecific, with toxic as well as 
useful compounds eliminated indiscriminately. A better definition of disturbed metabolic pathways will 
be necessary to formulate treatment hypotheses and design adapted equipment. Sensors for on-line 
monitoring of appropriate markers may also help to evaluate the modeling of clearance processes. The 
confusing interference of interactions between the patient and the foreign materials in the dialysis circuit 
may be reduced as more compatible materials become available. A better clinical condition of the ESRD 
patient remains the ultimate goal of dialysis therapy because at the moment it seems unlikely that either 
preventative measures or organ transplantation will reduce the number of patients whose lives depend 
on the artificial kidney. 
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TABLE 130.3 Factors Influencing Solute Concentration in Dialyzed Patients 

Solute-related factors Patient-related factors 

Compartmental distribution Body weight 

Intracellular concentration Distribution volume 

Resistance of cell membrane Intake and generation of solutes metabolic precursors 

Protein binding Residual renal function 

Electrostatic charge Quality of vascular access 

Steric configuration Absorption from the intestine 

Molecular weight Hematocrit 

Dialysis-related factors Blood viscosity 

Dialysis duration Absorption of solutes on the membrane, on other parts of the circuit 

Interdialytic intervals Ultrafiltration rate 

Blood flow Intradialytic changes in efficiacy 

Mean blood flow Changes with indirect effect on solute-related factors 
Blood flow pattern Blood pH 

Concentration gradients Heparinization 

Dialysate flow Free fatty acid concentration 

Dialyzer surface 

Dialyzer volume 

Dialyzer membrane resistance 

Dialyzer pore size 



Defining Terms 

Arteriovenous fistula: A permanent communication between an artery and an adjacent vein, created 
surgically, leading to the formation of a dilated vein segment which can be punctured transcuta- 
neously with large bore needles so as to allow connecting the circulatory system with an extracor- 
poreal blood processing unit. 

Artificial kidney: A blood purification device based on the removal of toxic substances through semi- 
permeable membranes washed out by an acceptor solution which can safely be discarded. 

Blood urea nitrogen (BUN): The concentration of urea in blood, expressed as the nitrogen content of 
the urea (BUN is actually 0.47 times, or approximately half, the urea concentration). 

Boundary layer: The region of fluid adjacent to a permeable membrane, across which virtually all 
(99%) of the concentration change within the fluid occurs. 

Catheter: A tube used to infuse a fluid in or out of the vascular system or a body cavity. 

Clearance: A measure of the rate of mass removal expressed as the volume of blood which per unit of 
time is totally cleared of a substance through processing in a natural or artificial kidney. Clearance 
has the dimensions of a flow rate and can be defined only in relation to a specific solute. Clearance 
can also be viewed as the minimal volume flow rate of blood which would have to be presented 
to a processing device to provide the amount actually recovered in the urine or the dialysate if 
extraction of that material from blood were complete. Clearance is measured as the mass transfer 
rate of a substance divided by the blood concentration of that substance. 

Continuous ambulatory peritoneal dialysis: A modality of peritoneal dialysis in which uninter- 
rupted — although not evenly effective — treatment is provided by 4-6 daily cycles of filling and 
emptying the peritoneal cavity with a prepared dialysate solution. Solute removal relies on diffusive 
equalization with molecular species present in capillary blood. Water removal relies on the use of 
hyperosmotic dialysate. 

Continuous arteriovenous hemodialysis: A dialytic procedure in which blood, propelled either by 
arterial pressure or by a pump, flows continuously at a low flow rate through a dialyzer, from where 
it returns to a vein, providing for uninterrupted solute and fluid removal and nearly constant 
equilibration of body fluids with the dialysate solution. 
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Dialysate: A buffered electrolyte solution, usually containing glucose at or above physiologic concen- 
tration, circulated through the water compartment of a hemodialyzer to control diffusional trans- 
port of small molecules across the membranes and achieve the blood concentrations desired. 

Dialysis: A membrane separation process in which one or more dissolved molecular species diffuse 
across a selective barrier in response to a difference in concentration. 

Dwell time: The duration of exposure of a solution used to draw waste products and excessive water 
out of the blood during peritoneal dialysis. 

ESRD: End-stage renal disease. 

Glomerular filtration rate: The volume of plasma water, or primary urine, filtered in the glomerulus 
per unit of time. Measured, for instance, by creatinine clearance, it expresses the level of remaining 
renal function in end-stage renal disease. 

Hemodiafiltration: Removal of water and solutes by a combination of diffusive and convective trans- 
port (paired filtration-dialysis) across a dialysis membrane to achieve effective transport of small 
and middle molecules. To compensate for the water loss, a large volume of saline or balanced 
electrolyte solution must be infused in the blood circuit to prevent hemoconcentration. 

Hemodialysis: A modality of extracorporeal blood purification in which blood is continuously circu- 
lated in contact with a permeable membrane, while a large volume of balanced electrolyte solution 
circulates on the other side of the membrane. Diffusion of dissolved substances from one stream 
to the other removes molecules that are in excess in the blood and replaces those for which there 
is a deficiency. Increased removal can be achieved by increasing the duration of the procedure, the 
overall membrane area, or the membrane permeability. 

Hemofiltration: Removal of water and solutes by convective transport, controlled by a large hydrostatic 
pressure difference between blood and a liquid compartment across a large-pore, high-water-flux 
membrane. 

Membrane: A thin film of natural or synthetic polymer which allows the passage of dissolved molecules 
and solvents in response to a concentration or pressure difference (diffusion or filtration) across 
the polymer. 

Middle molecules: Molecules of intermediate molecular weight (roughly of 1000 to 30,000 daltons) 
which are presumed to be responsible for the toxic manifestations of end-stage renal disease and 
therefore should be eliminated by substitutive therapy. 

Peritoneal dialysis: A process in which metabolic waste products, toxic substances, and excess body 
water are removed through a membranelike tissue that lines the internal abdominal wall and the 
organs in the abdominal cavity. 

Permeability: The ability of a membrane to allow the passage of certain molecules while maintaining 
a physical separation between two adjacent phases. 

Permselectivity: The property of a membrane whereby a differential rate of molecular transport 
between two phases is achieved based on characteristics such as molecular weight, molecular size, 
degree of hydration, affinity for membrane material, and electric charge. The most common feature 
leading to permselectivity is membrane pore size. 

Residual renal clearance: The small level of renal function (measured as creatinine clearance by the 
diseased kidneys) remaining in some patients in end-stage renal disease, particularly in the early 
years of dialytic treatment. 

Ultrafiltration: The process whereby plasma water flows through a membrane in response to a hydro- 
static pressure gradient, dragging with it solute molecules at concentrations equal or lower to that 
prevailing in plasma. 

Uremia: A condition in which the urea concentration in blood is chronically elevated, reflecting an 
inability to remove from the body the end products of protein metabolism. 

Uremic toxins: Partly unidentified and presumably toxic substances appearing in the blood of patients 
in end-stage renal failure, which can be eliminated to a variable extent by chemical processing of 
body fluids. 
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Irreversible end-stage kidney disease occurs with an annual frequency of about 1 in 5000 to 10,000 in 
general population, and this rate is increasing. Until the 1960s, such disease was universally fatal. In the 
last four decades various interventions have been developed and implemented for preserving life after 
loss of all or most of a patient's own kidney function. Continuous ambulatory peritoneal dialysis (CAPD) , 
the newest and most rapidly growing of renal replacement therapies, is one such process in which 
metabolic waste products, electrolytes, and water are removed through the peritoneum, an intricate 
membranelike tissue that lines the abdominal cavity and covers the liver, intestine, and other internal 
organs. This review begins with a brief summary of the development of CAPD and its role in the treatment 
of contemporary renal failure. The therapy format and its capacity for solute removal are then described 
in detail. Bioengineering studies of peritoneal transport, in which the peritoneum is described in terms 
analogous to the mass transfer properties of a planar membrane separating well-mixed pools of blood and 
dialysate, are then reviewed. The transport properties of the equivalent peritoneal membrane are summa- 
rized and compared to those of hemodialysis membranes. Models to describe and predict fluid and solute 
removal rates are examined. Finally, current developments and emerging trends are summarized. 

The early history of peritoneal dialysis, as reviewed by Boen [1985], is contemporaneous with that of 
hemodialysis (HD). Small-animal experiments were reported in the 1920s and 1930s in the United States 
and Germany. Earliest clinical trials in acute reversible cases of kidney failure began in the late 1930s in 
the form of a continuous "lavage" in which dialysate was continuously infused and withdrawn from dual 
trochar access sites. Acute treatments were continued through the 1940s, and about 100 case reports 
appeared in the literature by 1950; sequential inflow, dwell, and withdrawal was increasingly favored over 
continuous flow. Chronic therapy was introduced in the early 1960s, followed shortly by indwelling 
peritoneal catheters. From 1960 onward, peritoneal dialysis clearly lagged behind HD, as the latter became 
more streamlined, efficient, and cost-effective. Although endorsed by a small group of enthusiasts and 
proponents, peritoneal dialysis had evolved into a specialized or niche therapy. This changed dramatically 
in 1976 when Popovich, a biomedical engineer, and Moncrief, a clinical nephrologist, announced the 
development of a new form of peritoneal dialysis in which ambulatory patients were continuously treated 
by two liters of dialysis dwelling in the peritoneal cavity and exchanged four times daily [Popovich et al., 
1976] . Two years later Baxter began to offer CAPD fluid in flexible plastic containers, along with necessary 
ancillary equipment and supplies. The rapid subsequent growth of the process is tabulated in Fig. 131.1. 
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FIGURE 131.1 The growth of peritoneal dialysis. Line and points refer to the total estimated worldwide peritoneal 
dialysis population; the numbers adjacent to the points are PD patients as a percent of total dialysis population. At 
the end of 1993, 14,000 of the 90,000 peritoneal dialysis patients utilized some version of APD; the remainder were 
treated with CAPD. Data compiled taken from various patient registries and industrial sources. 

At this writing, approximately 90,000 patients are treated by CAPD (versus 490,000 by HD and 130,000 
with kidney transplants). A more recent development is the introduction of automated peritoneal dialysis 
(APD), in which all fluid exchanges are performed by a simple pump console, usually while the patient 
sleeps. About one peritoneal dialysis patient in six now receives some form of APD; this approach is 
discussed more fully in a later section on emerging developments. 

Both CAPD and HD have advantages and disadvantages, and neither therapy is likely to prove better 
for all the patients all the time. The principal attraction of CAPD is that it frees the patient from the 
pervasive life-style invasions associated with thrice weekly in-center HD. CAPD is particularly popular 
with patients living in rural areas distant from a hemodialysis treatment center. The continuous nature 
of CAPD eliminates fluctuations in the concentrations of uremic metabolites and avoids the sawtooth 
pattern of hemodialysis peak toxin concentrations. Fluid and dietary constraints are less restrictive for 
patients on CAPD than those on HD. A major complication of CAPD is peritonitis. Th rate of peritonitis 
was initially around 2 episodes per patient year; this has fallen to fewer than 0.5 episodes per patient year 
due to advances in administration set design and use. The morbidity of peritonitis has also decreased 
with increased experience in its treatment. In most cases, detected early and treated promptly, peritonitis 
can be managed without requiring hospitalization. Peritonitis caused by certain organisms including 
Staphylococcus aureus, Pseudomonas, and fungi remains a clinical problem. Other drawbacks of CAPD 
include the daily transperitoneal administration of 100-150 g of glucose providing -600 calories, and 
the tedium of the exchanges. APD and new solution formulations are being developed to address both 
issues. Little doubt now exists that risk-adjusted survival and morbidity for patients treated by CAPD is 
equivalent to that for patients treated with HD. On balance, the therapy seems well suited to many 
patients, and it continues to grow more rapidly than alternative treatment modalities. 

131.1 Therapy Format 

The process of CAPD is technically simple. Approximately 2 L of a sterile, nonpyrogenic, and hypertonic 
solution of glucose and electrolyte are instilled via gravity flow into the peritoneal cavity through an indwelling 
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catheter 4 times per day. A single exchange is illustrated 
in Fig. 131.2. Intraperitoneal fluid partially equili- 
brates with solutes in the plasma, and plasma water is 
ultrafiltered due to osmotic gradients. After 4-5 hours, 
except at night where the exchange is lengthened to 
9-1 1 hours to accommodate sleep, the peritoneal fluid 
is drained and the process repeated. Patients perform 
the exchanges themselves in 20-30 minutes, at home 
or in the work environment after a training cycle which 
lasts only 1-2 weeks. In APD, 10-15 L are automatically 
exchanged overnight; 2 L remain in the peritoneal cav- 
ity during the day for a "long dwell" exchange. 

As will be discussed in more detail below, the 
drained fluid contains solute at concentrations around 
90-100% of plasma for urea, 65-70% for creatinine, 
and 15-25% for inulin and p" 2 microglobulin. Net fluid 
removal ranges up to 1000 ml per exchange. CAPD 
generally removes the same quantity of toxins and fluid 
as HD ( a little thought will show that this is a require- 
ment of steady state, provided that generation is unal- 
tered between the two treatment formats); however, 
CAPD requires a higher plasma concentration as the 
driving force for this removal. Steady-state concentra- 
tions during CAPD are typically close to the peak, i.e., 
pretreatment, concentrations of small solutes during 
HD but much lower than the corresponding peaks for 
larger species. 

Access to the peritoneum is usually via a double-cuff Tenchkhoff catheter, essentially a 50-100 cm length 
of silicone tubing with side holes at the internal end, a Dacron mesh flange at the skin line, and connector 
fittings at the end of the exposed end. Several variations have evolved, but little hard evidence supports 
the selection of one design format over another [Dratwa etal., 1986]. Most are implanted in a routine 
surgical procedure requiring about 1 hour and are allowed to heal for 1-2 weeks prior to routine clinical 
use. Sterile and nonpyrogenic fluid is supplied in 2-L containers fabricated from dioctyl phthalate plasti- 
cized polyvinyl chloride. The formulation is essentially potassium-free lactated Ringers to which has been 
added from 15-42.5 g/L of glucose (dextrose monohydrate). The solution is buffered to a pH of 5.1-5.5, 
since the glucose would caramelize during autoclaving at higher pH levels. Several different exchange 
protocols are in use. In the original design, the patient simply rolls up the empty bag after instillation and 
then drains into the same bag following exchange. The bag filled with drain fluid is disconnected and a 
fresh bag is reconnected. Patients are trained to use aseptic technique to perform the connect and discon- 
nect. Many ingenious aids were developed to assist in minimizing breaches of sterility including enclosed 
ultraviolet-sterilized chambers and heat splicers. More recent approaches, known as the "O" set and "Y" 
set or more generically as "flush before fill" disconnect, invoke more complex tubing sets to allow the 
administration set to be flushed (often with antiseptic) prior to installation of dialysate and generally permit 
the patient to disconnect the empty bag during the dwell phase. Initial reports of the success of the protocols 
in reducing peritonitis were regarded with skepticism, but definite improvement over earlier systems has 
now been documented in a well-designed and carefully controlled clinical trials [Churchill et al., 1989]. 



FIGURE 131.2 Illustration of the three steps 
involved in a single CAPD exchange: fluid infusion, 
dwell, and drain. Some administration sets require 
the bag to stay connected during dwell (it is rolled 
and fits in a girdle around the waist); others allow it 
to be disconnected. Drain and infusion take about 
10 minutes each; three daytime dwells are 4-6 hours 
each; the overnight dwell lasts 8-10 hours. 



131.2 Fluid and Solute Removal 



The rate at which solutes are removed during peritoneal dialysis depends primarily upon the rate of 
equilibration between blood and instilled peritoneal fluid. This is usually quantified as the ratio of 
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FIGURE 131.3 Ratio of plasma to dialysate concentration for urea (60 daltons), creatinine (113 daltons), uric acid 
(158 daltons), and (5 2 microglobulin (-12,000 daltons). Data were obtained by withdrawing and analyzing a sample 
of dialysate at each time point and comparing it to plasma concentration. Each point is the average of two determi- 
nations on five patients. Error bars are standard error of the mean [Lysaght, 1989]. 

dialysate to plasma concentration as a function of dwell time, often in graphs called simply "D over P" 
(dialysate over plasma) curves. A typical plot of dialysate-to-plasma ratio for solutes of various molecular 
weight is given in Fig. 131.3. Smaller species equilibrate more rapidly than do larger ones, because 
diffusion coefficient varies in inverse proportion to the square root of a solute's molecular weight. 
Dialysate equilibrium rates vary considerably from patient to patient; error bars on the plot represent 
standard error of the mean for duplicate determinations with five patients. 

The rate of mass removal during dialysis, ()), is simply the volume of fluid, V D , removed from the 
peritoneal cavity at the end of a dwell period lasting time t, multiplied by the concentration C D of the 
solute in the removed fluid 

$ = V D C D (131.1) 

The whole blood clearance, CI, is the rate of mass removal divided by the solute concentration in blood C B 



a-- 



<? 



v D c D 



t C D t C 



(131.2) 



In Eqs. (131.1) and (131.2), time conventionally is reported in minutes, volume in milliliters, and 
concentration in any consistent units. Equations (131.1) and (131.2) are based on mass balances; they 
are thus general and unaffected by the complexity of underlying phenomena such as bidirectional selective 
connective transport and lymphatic uptake. Equation (131.2) requires that solute concentration in the 
denominator be reported as whole blood concentration, rather than as plasma concentration, which is 
often reported clinically. With many small solutes (urea, creatinine, and uric acid), only small error is 
introduced by considering blood and plasma concentration as interchangeable. With larger solutes, 
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FIGURE 131.4 Volume of fluid in the peritoneal cavity versus time during an exchange with -2.5% glucose dialysis 
fluid. Solid line is actual volume. Dotted line represents estimate of the volume in the absence of lymphatic flow. 
Results represent an average of duplicate determinations on five patients. Volume was estimated by dilution of 
radiolabeled tracers (too large to diffuse across the peritoneal membrane) added to dialysate prior to installation; 
lymphatic flow was calculated from a mass balance on net recovered marker. Each point is the average of two 
determinations on five patients. Error bars are standard error of the mean [Lysaght, 1989]. 

especially those excluded from the red blood cell, care must be taken to correct for differences in plasma 
and blood concentrations. 

Since urea is nearly completely equilibrated during CAPD, i.e., c D /c B = -1.0, urea clearance is commonly 
equated with total drainage volume. Four 2-L exchanges and 2 L of ultrafiltration would thus result in 
a continuous urea clearance of 10 L/day or -7 ml/min. The situation is more complex with APD, which 
involves several (4-6) short exchanges at partial equilibrium and one very long exchange. In any case, 
no meaningful direct or a priori comparison of clearance with hemodialysis is possible because one 
therapy is intermittent and the other continuous. 

The volume of fluid in the peritoneal cavity increases during an exchange but at a decreasing rate. The 
driving force for fluid transfer from the blood to the peritoneal cavity is the osmotic pressure of the 
glucose in the infused dialysate. Typical CAPD solutions contain -1.5%, -2.5%, or -4.25% by weight of 
glucose monohydrate, leading to an initial maximum osmotic force (across an ideally semipermeable 
membrane) of approximately 1000-5000 mmHg. In the first few minutes of an exchange, the rate of 
ultrafiltration may be as high as 10-30 ml/min. The driving force rapidly dissipates as glucose diffuses 
from the peritoneal cavity into the bloodstream. After the first hour, rates of 1.0-2.0 ml/min are common. 
Throughout the exchange, the peritoneal lymphatics are draining fluid from the peritoneal cavity at a 
rate of 0.5-2.0 ml/min. Fluid balance is thus the difference between removal by a time-dependent rate 
of ultrafiltration and return via a more constant lymphatic drainage. Net fluid removal is very easily 
determined in the clinical setting simply by comparing the weight of fluid drained to that instilled. 
Instantaneous rates of ultrafiltration may be estimated in study protocols by a series of tedious mass 
balances around high-molecular-weight radiolabled markers added to the dialysate fluid. The results of 
a typical study are plotted in Figs. 131.4 and 131.5 showing both the instantaneous rate of ultrafiltration 
and the net intraperitoneal volume as a function of time. On average, these patients removed 500 ml of 
fluid in a single 6-hour exchange or roughly 2 L/day, which permits far more liberal fluid intake than 
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FIGURE 131.5 Comparisons of rates of ultrafiltration of fluid into the peritoneal cavity (open circles) and lymphatic 
drainage of fluid from the peritoneal cavity hack to the patient (dotted line). Same study and methods as in Fig. 131.4. 

is possible with patients on HD. But here again patient variation is high. Commercial CAPD fluid is 
available in a variety of solute concentrations; physicians base their prescription for a particular patient 
on his or her fluid intake and residual urine volume. 



131.3 The Peritoneal Membrane: Physiology 
and Transport Properties 

In contrast to synthetic membranes employed during HD, the peritoneum is not a simple selective barrier 
between two phases. As implied by its Latin root (peritonere = to stretch tightly around), the primary 
physiologic function of the peritoneum is to line the walls of the abdominal cavity and encapsulate its 
internal organs (stomach, liver, spleen, pancreas, and parts of the intestine). Most CAPD literature, 
including this review, uses the terms peritoneum and peritoneal membrane interchangeably and conve- 
niently extends both expressions to include underlying and connective tissue. Overall adult peritoneal 
surface is approximately 1.75 ± 0.5 m 2 , which generally is considered equal on an individual basis to skin 
surface area. The peritoneum is not physically homogenous. The visceral portion (-80%) covering the 
internal organs differs somewhat from the parietal portion overlaying the abdominal walls, which in turn 
is different from the folded or pleated mesentery connecting the two. 

The physiology of the peritoneum, its normal ultastructure, and variations induced by CAPD have 
been increasingly elucidated over the past decade. Morphologically, the peritoneum is a smooth, tough, 
somewhat translucent sheath. Its thickness ranges from under 200 to over 1000 microns. The topmost 
layer, which presents to the dialysate during CAPD, is formed from a single layer of mesothelial cells, 
densely covered by microvilli (hairlike projections), although the latter tend to disappear gradually during 
the first few weeks of CAPD. Immediately underneath is the interstitium, a thick sheath of dense 
mucopolysaccharide hydrogel interlaced with collagenous fibers, microfibrillar structures, fibroblasts, 
adipocytes, and granular material. Most important for CAPD, the interstitium is perfused with a network 
of capillaries through which blood flows from the mesenteric arteries and the vasculature of the abdominal 
wall to the portal and systemic venous circulations. Blood-flow rate has been estimated to be in the range 
of 30-60 ml/min, but this is not well established. The interstitial layer is a hydrogel; its water content, 
and thus its transport properties, will vary in response to the osmolarity of the peritoneal dialysate. 

Peritoneal mass transfer characteristics are most commonly obtained by back-calculating basic mem- 
brane properties from results in standard or modified peritoneal dialysis. Three membrane parameters 
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will be described: Lp, the hydraulic permeability; _R, the rejection coefficient; and K A, the mass transfer 
coefficient ( = area A x diffusive permeability K ). The formal definitions of these parameters are given 
in Eqs. (131.3) through (131.5), with R and K A defined for the limiting conditions of pure convection 
and pure diffusion. 
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Filtration rate 
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Area • pressure driving force AlAP- "LoilA 
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where J F = filtration rate, A = area, O = Staverman reflection coefficient, n = osmotic pressure, and other 
terms are as defined previously. 

At the onset of a CAPD exchange using 4.25% dextrose, the ultrafiltration rate is 10-30 ml/min. 
Relative to a perfectly semipermeable membrane, the glucose osmotic pressure of the solution is 
4400 mmHg. Overall membrane hydraulic permeability is the quotient of these terms and is thus of the 
order of 0.2 ml/hr-mmHg, in the units commonly employed for HD membranes. This estimate needs 
to be corrected for the osmotic back-pressure, which is primarily due to urea in the blood (cone -1.3 g/L) 
as well as the fact that the membrane is only partially semipermeable. The best results are not obtained 
from a single point measurement but either from curve fitting to the ultrafiltration profile during the 
entire course of dialysis or from data taken at different osmotic gradients. A review [Lysaght & Farrell, 
1989] of reports from several different investigators suggests an average value of ~ 2 ml/hr-mmHg or, 
roughly, 2 gal/ft 2 /day (GSFD) at 100 PSI. This is higher than desalination membranes, just slightly lower 
than conventional regenerated cellulose hemodialysis membranes, and much lower than anisotropic 
ultrafiltration membranes. 

Rejection coefficients, R, numerically equal to unity minus sieving coefficient are obtained either from 
kinetic modeling as described below or experimentally by infusing a hypertonic solution into the peri- 
toneum with a permeant concentration equal to that in the plasma. After a suitable period of ultrafil- 
tration, the ration of solute to water flux is calculated from the dilution of the recovered solution. Both 
methods are approximate and results from different investigators may vary substantially. Reported values 
are observed average rejection coefficients. These are often described as the Staverman rejection coeffi- 
cient, O, which is somewhat overreaching, since filtration velocity is not recorded and differences between 
bulk and wall concentrations are not known. Representative values, from a review of the literature 
[Lysaght & Farrell, 1989], are summarized in Table 131.1. Thus the membrane appears quite tight, 
possibly rejecting about 10-20% of urea and other small molecules, about 50% of intermediate-molec- 
ular-weight species, and over 99% of plasma proteins. 

The diffusive permeability of the membrane is obtained by back calculation from measurements of 
blood and dialysate concentration versus time during an exchange, as will be further elaborated below. 
Values are given as the product of membrane permeability and estimated peritoneal area (K A), and the 
results of various investigators have been reasonably consistent. Critical values from a review of the 
literature are summarized in Table 131.1. A K A value of about 20 ml/min for urea is around one order 
of magnitude less than comparable values for contemporary hollow-fiber hemodialyzers. If the area of 
the peritoneum is taken as 1.75 m 2 , then urea transfers through the peritoneum analogously to urea 
diffusing through a stagnant film of water roughly a centimeter thick. Alternatively, given a peritoneal 
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TABLE 131.1 Equivalent Transport Coefficients for the Peritoneal Membrane 



Permeant Species 


MW 


Rejection 


Coefficient, dimensionless 


f^A cmVmin 


Urea, 60 






0.26 ± 0.08 


21 ±4 


Creatinine, 113 






0.35 ± 0.07 


10 ±2 


Uric acid, 158 






0.37 


10 


B-12, 1355 








5 


Inulin, 5200 






0.5 ± 0.2 


4± 1.5 


P 2 microglobulin, 


12,000 






0.8 ± 0.4 


Albumin, 69,000 






0.99 





Note: SD not given if n < 3. Equivalent ultrafiltration coefficient is -2.0 ml/min- 
m 2 -mmHg. Data taken from a review by Lysaght and Farrell [1989], 

thickness range of 200-2000 microns, the diffusion of urea inside the membrane is about 20% of what 
would be found in a film of stagnant water of the same thickness. 

It should once more be noted that the physiologic peritoneum is a complex and heterogeneous barrier, 
and its transport properties would be expected to vary over different regions of its terrain. For example, 
studies in animal models have suggested that transport during peritoneal dialysis is little affected when 
large segments of the visceral membrane are surgically excised. It is also repeated for emphasis that the 
terms Lp, R, and KJs. do not describe this membrane itself but rather a hypothetical barrier that is 
functionally and operationally equivalent and thus capable of producing the same mass transfer charac- 
teristics in response to the same driving forces. 

131.4 Transport Modeling 

Several investigators have developed mathematic models to describe, correlate, and predict relationships 
among the time-course of solute removal, fluid transfer, treatment variables, and physiologic properties 
[Lysaght & Farrell, 1989; Vonesh et al., 1991; Waniewski et al., 1991]. Virtually all kinetic studies start 
with the model illustrated in Fig. 131.6. The patient is considered to be a well-mixed compartment with 
a distribution volume V B set equal to some fraction of total body weight. (For example, urea distributes 
over total body water, which is ~ 0.58 times body weight.) Dialysate occupies a second, much smaller 
compartment, V D = 2-3 L, which is also considered well-mixed but which changes in size during the 



Lymph Flow 
"2 1 
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Peritoneal Cavity 
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FIGURE 131.6 Single pool model for peritoneal dialysis. Solute diffuses across a planar selective membrane from 
a large well-mixed plasma space at constant volume and concentration to a smaller well-mixed space in which 
concentration and volume both increase with time. Fluid and solute are selectively ultrafiltered across the peritoneal 
membrane from plasma to dialysate; they are also nonselectively transported by the lymphatics from the dialysate to 
the body compartment. 
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course of exchange. These two compartments are separated by a planar membrane capable of supporting 
bidirectional transport and characterized by the terms Lp, R, and K A previously defined by 
Eqs. (131.3)— (131.5). Fluid drains from the peritoneum to the blood at a rate of Q L . From this point 
forward, the complexity and appropriate utility of the models depend upon the investigators' choices of 
simplifying assumptions. The simplest model, proposed by Henderson and Nolph [1969], considers 
ultrafiltration rate and lymphatic flow to be negligible and treats all parameters except dialysate concen- 
tration as constant with time. The basic differential equation describing this model is 



dt 



V D d ^ L = K A(c B -C D ) 



(131.6) 



Equation 131.6 may be readily solved, either to obtain K A from a knowledge of concentration versus 
time data Eq. (131.7), or to predict dialysate concentration from a knowledge of mass transfer coefficient, 
blood concentration, and initial dialysate concentration Eq. (131.8) where: 
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(131.7) 
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In these equations, the superscript t represents the value at time t, and the superscript designates the 
value at t = 0. This model provides a very easy way of measuring K A if it is applied during the isovolemic 
interval that often occurs ~ 30-90 min after the beginning of an exchange. 

Several years later, investigators at the University of New South Wales [Garred et al., 1983] proposed 
a slightly more complex model that included ultrafiltration, subject to the assumptions that: (1) blood 
concentration was constant, (2) the membrane was nonselective (R = O), and (3) lymphatic involvement 
could be ignored. The appropriate differential equation is now: 



&A-i4c-c)+c.<?* 



dt 



dt 



(131.9) 



This equation can be solved in two ways. Over either relatively short time intervals or small differences 
in dialysate volume, an average volume V D is obtained as the mean of initial and final volumes. In that 
case K A is given by 



V 
K A=-^\n 



Vd(Cb-Cd) 



(131.10) 



where variables overlined with a solid diachrin are treated as constant during the integration of 
Eq. (131.9). The similarity of Eqs (131.9) and (131.10) to Eqs. (131.7) and (131.8) should be noted. 
Where a series of data points for blood and dialysate concentrations are available at various times during 
the treatment, Eq. (131.10) may be rewritten as 



c ■ 
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Data in the form of this equation may be readily regressed to obtain K A from a knowledge of V D , C B , 
and C D at various times in an exchange. The values for peritoneal volume V D may be obtained experi- 
mentally from tracer dilution studies, calculated from an algorithm, in which case it varies with time, or 
simply averaged between initial and final values, in which case it is assumed constant. Equations (131.11) 
and (131.12) are recommended for routine modeling of patient kinetics. 

Several investigators, reviewed by Lysaght and Farrell [1989] , have produced far more elaborate models 
which incorporate lymphatic drainage, deviations from ideal semipermeability of the peritoneal mem- 
brane, time-dependent ultrafiltration rates, and coupling between bidirectional diffusive and connective 
transport. Although potent in the hands of their developers, none of the numerical models has been 
widely adopted, and the current trend is toward simpler approaches. In comparative studies [Lysaght, 
1989; Waniewski et al., 1991], only small differences were found between the numeric values of transport 
parameters calculated from simple analytic models [Eqs. (131.6)— (131.12)] and those we obtained by far 
more complex numerical methods. In peritoneal dialysis, solute is being exchanged through an inefficient 
membrane between a large body compartment through an inefficient membrane and a second compart- 
ment only 5% as large, and treatment times have been chosen so that the smaller compartment will reach 
saturation. These physical circumstances, and the very forgiving nature of exponential asymptotes, 
perhaps explain why simple analytic solutions perform nearly as well as their more complex numeric 
counterparts. 

131.5 Emerging Developments 

Modified therapy formats and new formulations for exchange solutions constantly are being proposed 
and evaluated. APD is the most successful of the new formats; at the end of 1993, about one in six 
peritoneal dialysis patients received some variant of automated overnight treatment. APD is carried out 
by a small console (Fig. 131.7) which automatically instills and drains dialysate at 1.5-3-hour intervals 
while the patient sleeps, typically over 8-10 hours each night. The peritoneum is left full during the day. 
Since the short exchanges do not permit complete equilibration even for urea, the process is somewhat 
wasteful of dialysate. However, reference to Fig. 131.3 will readily demonstrate that small-solute removal 
is most efficient in the early portion of an exchange; for example, two 2-hour exchanges will provide 
75% more urea clearance than one 4-hour exchange. As currently prescribed, APD requires 84-105 L 
per week of dialysate (versus 56 for CAPD) and increases total small-solute clearance per 24 hours by 
up to 50% over that achieved by CAPD. The number of patients on APD is increasing by half every year, 
a phenomenon driven by two main factors. The first relates to quality of life; APD is far-and-away the 
least invasive of the maintenance dialysis protocols. The patient performs one connection at night and 
one disconnection in the morning and is thereby freed from the tedium and inconvenience of daily 
exchanges or the need to spend a significant portion of 3 days per week at an HD treatment facility. In 
addition, small-solute clearance is higher than in other continuous peritoneal therapies, which helps 
address increasing concern about the adequacy of the standard four 2-L CAPD exchanges per day, 
especially with large muscular patients and those with no residual renal function. A group of patients 
who may benefit from APD are those who have rapid transport of glucose across the peritoneal mem- 
brane; because of the consequent loss of the osmotic gradient, they have difficulty achieving adequate 
ultrafiltration. The short dwell times of ADP circumvent this problem. The counterbalancing disadvan- 
tage of APD is increased expense associated with the larger fluid consumption and the fluid cyclers. 

Virtually all solution development comprises attempts to replace glucose with an alternative osmotic 
agent, preferably one which diffuses more slowly and thus provides a more stable osmotic gradient and 
one which obviates the obligatory load of about 600 calories of sugar. However glucose is cheap and safe, 
and it will be difficult to find a satisfactory alternative. A competing osmotic agent must be safe to 
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FIGURE 131.7 Contemporary equipment module for APD (Home Choice, Renal division, Baxter Healthcare) which 
automatically controls and monitors the delivery of 10-15 L of dialysate from 5-L bags via a multipronged disposable 
administration set. The console incorporates a diaphragm pump used to emulate gravity, and a derivative of the ideal 
gas law measures fluid volume, eliminating the need for scales. Setup and operation are designed to be straightforward 
and convenient. 

administer in amounts of tens of grams per day over years to patients who have little or no ability to 
clear accumulated material via the kidney — but an osmotic agent which is readily metabolizable provides 
no caloric "advantage" over glucose. A glucose polymer, termed polyglucose, has been recently introduced 
in England [Mistry & Gokal, 1993]. This disperse oligodextrin has a weight-averaged MW of 18,700 
daltons and number-averaged molecular weight of 7300 daltons. At a concentration of 7.5% (i.e., 30 g 
per 2-L exchange), it provides more stable ultrafiltration during long dwell exchanges; however, admin- 
istration is limited to one exchange per day because of the accumulation of maltose and higher MW 
polysaccharides; an alternative approach, recently introduced in Europe and in clinical trials in the United 
States, is a solution in which glucose is replaced with 1.1% amino acids, enriched for essential amino 
acids [Jones et al., 1992] . This solution also improves nitrogen balance, a significant feature, since dialysate 
patients are frequently malnourished. Concern about excessive nitrogen intake, however, limits its use 
to one or two exchanges per day, and the amino acid solution is necessarily more expensive than glucose. 

Defining Terms 

Automated peritoneal dialysis (APD): A recent variant of CAPD in which fluid exchanges are per- 
formed by simple pumps, usually at night while the patient sleeps. 

Clearance: The rate of mass removal divided by solute concentration in the body. Clearance represents 
the virtual volume of blood or plasma cleared of a particular solute per unit time. 

Continuous ambulatory peritoneal dialysis (CAPD): A continuous process for the treatment of 
chronic renal failure in which metabolic waste products and excess body water are removed through 
the peritoneum with four exchanges of up to 3 L every 24 hours. 

Diffusion: The molecular movement of matter from a region of greater concentration to lesser con- 
centration at a rate proportional to the difference in concentration. 

Hemodialysis (HD): Intermittent extracorporeal therapy for chronic renal failure. See Chapter 130. 

Mass transfer coefficient: The proportionality constant between the rate of solute transport per unit 
area and the driving force. 

Membrane: A thin barrier capable of providing directional selective transport between two phases. 
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Peritoneal cavity: A topologically closed space in the abdomen which is surrounded by the peritoneum. 
Peritoneum: An intricate, vascularized, membranelike tissue that lines the internal abdominal walls 

and covers the liver, intestine, and other internal organs. Used interchangeably with the expression 

peritoneal membrane. 
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Further Information 

The literature on continuous peritoneal dialysis is abundant. Among several reference texts the most 
venerable and popular is Peritoneal Dialysis edited by K. Nolph and published by Kluwer; this is regularly 
updated. Also recommended is Continuous Ambulatory Peritoneal Dialysis edited by R Gokal and pub- 
lished by Churchill Livingston. The journal Peritoneal Dialysis International (published by MultiMed; 
Toronto) is published quarterly and is devoted exclusively to CAPD. The continuing education depart- 
ment of the University of Missouri-Columbia organizes a large annual conference on peritoneal dialysis 
with plenary lecture and submitted papers. The International Society of Peritoneal Dialysis holds its 
conference biannually and usually publishes proceedings. Peritoneal dialysis is also discussed in the 
meeting and journals of the other major artificial organ societies (American Society of Artificial Internal 
Organs; European Dialysis and Transplant Association; Japanese Society of Artificial Organs) and the 
American and International Societies of Nephrology Blood Purification (published by Karger; Basel) 
attracts many outstanding papers dealing with engineering and transport issues in peritoneal dialysis. 
For the insatiable, Medline now contains over 10,000 citations to CAPD and peritoneal dialysis. 
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Apheresis is the process in which a specific component of blood (either plasma, a plasma component, 
white cells, platelets, or red cells) is separated and removed with the remainder of the blood returned to 
the patient (often in combination with some type of replacement fluid). Donor apheresis is used for the 
collection of specific blood cells or plasma components from blood donors, resulting in a much more 
effective use of limited blood-based resources. Donor apheresis developed during World War II as a 
means for increasing the supply of critically needed plasma, and clinical trials in 1944 demonstrated that 
it was possible to safely collect donations of a unit of plasma (approximately 300 ml) on a weekly basis 
if the cellular components of blood were returned to the donor. Therapeutic apheresis is used for the 
treatment of a variety of diseases and disorders characterized by the presence of abnormal proteins or 
blood cells in the circulation which are believed to be involved in the progression of that particular 
condition. Therapeutic apheresis thus has its roots in the ancient practice of bloodletting, which was 
used extensively well into the 19th century to remove "bad humors" from the patient's body, thereby 
restoring the proper balance between the "blood, yellow bile, black bile, and phlegm." 

The term plasmapheresis was first used by Abel, Rowntree, and Turner in 1914 in their discussion of 
a treatment for toxemia involving the repeated removal of a large quantity of plasma, with the cellular 
components of blood returned to the patient along with a replacement fluid [Kambic & Nose, 1993]. 
The first successful therapeutic applications of plasmapheresis were reported in the late 1950s in the 
management of macroglobulinemia (a disorder characterized by a large increase in blood viscosity due 
to the accumulation of high-molecular-weight globulins in the blood) and in the treatment of multiple 
myeloma (a malignant tumor of the bone marrow characterized by the production of excessive amounts 
of immunoglobulins). 

By 1990, there were well over 50 diseases treated by therapeutic apheresis [Sawada et al., 1990] with 
varying degrees of success. Plasmapheresis is used in the treatment of: (1) protein-related diseases 
involving excessive levels of specific proteins (e.g., macroglobulins in Waldenstrom's syndrome and 
lipoproteins in familial hypercholesterolemia) or excessive amounts of protein-bound substances (e.g., 
toxins in hepatic failure and thyroid hormone in thyrotoxicosis), (2) antibody-related or autoimmune 
diseases (e.g., glomerulonephritis and myasthenia gravis), and (3) immune-complex-related diseases (e.g., 
rheumatoid arthritis and systemic lupus erythematosus). 
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Cytapheresis involves the selective removal of one (or more) of the cellular components of blood, and 
it has been used in the treatment of certain leukemias (for the removal of leukocytes) and in the treatment 
of polycythemia. Table 132.1 provides a more complete listing of some of the diseases and blood com- 
ponents that are removed during therapeutic apheresis. This list is not intended to be exhaustive, and 
there is still considerable debate over the actual clinical benefit of apheresis for a number of these diseases. 

The required separation of blood into its basic components (red cells, white cells, platelets, and plasma) 
can be accomplished using centrifugation or membrane filtration; the more specific removal of one (or 
more) components from the separated plasma generally involves a second membrane filtration or use of 
an appropriate sorbent. The discussion that follows focuses primarily on the technical aspects of the 
different separation processes currently in use. Additional information on the clinical aspects of thera- 
peutic apheresis is available in the references listed at the end of this chapter. 

132.1 Plasmapheresis 

The therapeutic application of plasmapheresis can take one of two forms: plasma exchange or plasma 
perfusion. In plasma exchange therapy, a relatively large volume of plasma, containing the toxic or 
immunogenic species, is separated from the cellular components of blood and replaced with an equivalent 
volume of a replacement fluid (either fresh frozen plasma obtained from donated blood or an appropriate 
plasma substitute). In plasma perfusion, the separated plasma is treated by an adsorptive column or 
second membrane filtration to remove a specific component (or components) from the plasma. This 
treated plasma is then returned to the patient along with the blood cells, thereby eliminating the need 
for exogenous replacement fluids. The different techniques that can be used for plasma perfusion are 
discussed subsequently. 

The reduction in the concentration (C,) of any plasma component during the course of a plasma- 
pheresis treatment can be described using a single compartment pharmacokinetic model as 

If 

V— i = -a i QC + G I . (132.1) 

v dt F 

where V is the volume of the patient's plasma (which is assumed to remain constant over the course of 
therapy through the use of a replacement fluid or through the return of the bulk of the plasma after a 
plasma perfusion), Q is the volumetric rate of plasma collection, G, is the rate of component generation, 
and a i is a measure of the effectiveness of the removal process. In membrane plasmapheresis, 0C;, is equal 
to the observed membrane sieving coefficient, which is defined as the ratio of the solute concentration 
in the filtrate collected through the membrane to that in the plasma entering the device; (X; is thus equal 
to 1 for a small protein that can pass unhindered through the membrane but can be less than 1 for large 
proteins and immune complexes. In plasma perfusion systems, a; is equal to the fraction of the particular 
component removed from the collected plasma by the secondary (selective) processing step. The gener- 
ation rate is typically negligible over the relatively short periods (fewer than 3 hours) involved in the 
actual plasmapheresis; thus the component concentration at the end of a single treatment is given as 
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where V exc = Q f is the actual volume of plasma removed (or exchanged) during the process. Plasma 
exchange thus reduces the concentration of a given component by 63% after an exchange of one plasma 
volume (for a i = 1) and by 86% after two plasma volumes. This simple single compartment model has 
been verified for a large number of plasma components, although immunoglobulin G actually appears 
to have about 50% extravascular distribution with the re-equilibration between these compartments 
occurring within 24-28 hr following the plasmapheresis. 
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TABLE 132.1 Disease States Treated by Therapeutic Apheresis 



Disease 

Hematologic 

Hemophilia 

Idiopathic Thrombocytopenia Purpura 

Thrombotic Thrombocytopenia Purpura 

AIDS, HIV 

Autoimmune Hemolytic Anemia 

Rh Incompatibility 

Cryoglobulinemia 

Hyperviscosity Syndrome 

Waldenstrom's Syndrome 

Paraproteinemia 

Sickle Cell Anemia 

Thrombocythemia 
Collagen/rheumatologic 

Systemic Lupus Erythematosus 

Progressive Systemic Sclerosis 

CREST Syndrome 

Sjorgen Syndrome 

Rheumatoid Arthritis 

Periarteritis Nodosa 

Raynaud's Disease 

Scleroderma 

Mixed Connective Tissue Diseases 
Neurologic 

Myasthenia Gravis 

Multiple Sclerosis 

Guillain-Barre Syndrome 

Polyneuropathy 

Polyradiculoneuropathy 

Lambert-Eaton Syndrome 
Hepatic 

Chronic Active Hepatitis 

Hepatic Failure 

Primary Biliary Cirrhosis 
Renal 

Goodpasture's Syndrome 

Glomerulonephritis 

Lupus Nephritis 

Transplant Rejection 
Malignant diseases 

Cancer 

Multiple Myeloma 

Leukemia 
Miscellaneous 

Addison's Disease 

Autoimmune Thyroiditis 

Chronic Ulcerative Colitis 

Diabetes Mellitus 

Hashimoto's Disease 

Insulin Autoimmune Syndrome 

Pemphigus 

Ulcerative Colitis 

Asthma 

Hypercholesterolemia 

Hyperlipidemia 

Thyrotoxicosis 



Components Removed 

AntiFactor VIII Ab 

Antiplatelet Ab, immune complexes 

Antiplatelet Ab, immune complexes 

Antilymphocyte Ab, immune complexes 

Anti-red cell Ab, red cells 

Anti-Rh Ab 

Cryoglobulins 

Macroglobulins, immunglobulin M 

Immunoglobulin M 

Paraproteins 

Red blood cells 

Platelets 

Anti-DNA Ab, immune complexes 

Antinonhistone nuclear Ab 

Anticentromere Ab 

Antimitochondrial Ab 

Rheumatoid factor, cryoglobulins, immunoglobulins 

Cryoglobulins, immune complexes 

Cryoglobulins, macroglobulins 

Immune complexes 

Immunoglobulins 

Antiacetylcholine receptor Ab, cryoglobulins 

Antimyelin Ab 

Antimyelin Ab 

Cryoglobulins, macroglobulins 

Antibodies 

Antibodies 

Antimitochondrial Ab 
Protein-bound toxins 
Protein-bound toxins, antimitochondrial Ab 

Antiglomerular basement membrane Ab 

Immune complexes 

Immune complexes 

Immune complexes, anti-HLA Ab 

Tumor-specific Ab, immune complexes 

Immunoglobulins 

Leukocytes 

Antiadrenal Ab 

Antimicrosomal Ab 

Anticolonic epithelial cell Ab 

Antiinsulin receptor Ab 

Antithyroglobulin Ab 
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There is still considerable variability in the frequency and intensity of the plasmapheresis used in 
different therapeutic applications, and this is due in large part to uncertainties regarding the metabolism, 
pharmacokinetics, and pathogenicity of the different components that are removed during therapeutic 
apheresis. The typical plasma exchange therapy currently involves the removal of 2-3 L of plasma 
(approximately one plasma volume) at a frequency of 2-4 times per week, with the therapy continued 
for several weeks. There have also been a number of studies of the long-term treatment of several diseases 
via plasmapheresis, with the therapy performed on a periodic basis (ranging from once per week to once 
every few months) over as much as 5 years (e.g., for the removal of cholesterol and lipoproteins in the 
treatment of severe cases of hypercholesterolemia). 



Centrifugal Devices 

Initially, all plasmapheresis was performed using batch centrifuges. This involved the manual removal of 
approximately one unit (500 ml) of blood at a time, with the blood separated in a centrifuge so that the 
target components could be removed. The remaining blood was then returned to the patient before 
drawing another unit and repeating the entire process. This was enormously time-consuming and labor- 
intensive, requiring as much as 5 hours for the collection of only a single liter of plasma. Batch centrif- 
ugation is still the dominant method for off-line blood fractionation in most blood-banking applications, 
but almost all therapeutic plasmapheresis is performed using online (continuous) devices. 

The first continuous flow centrifuge was developed in the late 1960s by IBM in conjunction with the 
National Cancer Institute, and this basic design was subsequently commercialized by the American 
Instrument Co. (now a division of Travenol) as the Aminco Celltrifuge [Nose et al., 1983]. A schematic 
diagram showing the general configuration of this, and most other continuous flow centrifuges is shown 
in Fig. 132.1. The blood is input at the bottom of the rotating device and passes through a chamber in 
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FIGURE 132.1 Schematic diagram of a generic continuous flow centrifuge for fractionation of blood into red cells, 
white cells/platelets, and plasma. 
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which the actual separation into plasma, white cells/platelets, and red cells occurs. Three separate exit 
ports are located at different radial positions to remove the separated components continuously from 
the top of the chamber using individual roller pumps. The position of the buffy coat layer (which consists 
of the white cells and platelets) is controlled by adjusting the centrifugal speed and the relative plasma 
and red cell flow rates to obtain the desired separation. Probably the most difficult engineering problem 
in the development of the continuous flow centrifuge was the design of the rotating seals through which 
the whole blood and separated components must pass without damage. The seal design in the original 
NCI/IBM device used saline lubrication to prevent intrusion of the cells between the contacting surfaces. 
In order to obtain effective cell separation in the continuous flow centrifuge, the residence time in the 
separation chamber must be sufficiently large to allow the red cells to migrate to the outer region of the 
device. The degree of separation can thus be characterized by the packing factor 

C V t 

P= ^- (132.3) 

h 

where G is the g-force associated with the centrifugation, V sed is the sedimentation velocity at 1 g, f is 
the residence time in the separation chamber, and h is the width of the separation chamber (i.e., the 
distance over which the sedimentation occurs). The packing factor thus provides a measure of the radial 
migration compared to the width of the centrifuge chamber, with adequate cell separation obtained when 
P > 1. The residence time in the separation chamber is inversely related to the blood flow rate (Q B ) as 

A I 

t = — (132.4) 

Q B 

where L and A are the length and cross-sectional area of the chamber, respectively. 

Rotational speeds in most continuous flow centrifuges are maintained around 1500 rpm (about 100 
g) to obtain a relatively clean separation between the red cells and buffy coat, to avoid the formation of 
a very highly packed (and therefore highly viscous) region of cells at the outer edge of the chamber, and 
to minimize excessive heating around the rotating seals [Rock, 1983] . The width of the separation chamber 
must be large enough to permit effective removal of the different blood components from the top of the 
device, while minimizing the overall extracorporeal blood volume. For example, a packing factor of 10 
requires a residence time of about 20 s for a device operated at 100 g with a 0.1 -cm-wide separation 
chamber. The blood flow rate for this device would need to be maintained below about 120 ml/min for 
a chamber volume of 40 ml. Most of the currently available devices operate at Q B ~ 50 ml/min and can 
thus collect a liter of plasma in about 30-40 minutes. 

More recent models for the continuous flow centrifuge have modified the actual geometry of the 
separation chamber to enhance the cell separation and reduce the overall cost of the device [Sawada 
et al., 1990]. Examples include tapering the centrifuge bowl to improve flow patterns and optimizing the 
geometry of the collection region to obtain purer products. The IBM 2997 (commercialized by Cobe 
Laboratories) uses a disposable semirigid plastic rectangular channel for the separation chamber, which 
eliminates some of the difficulties involved in both sterilizing and setting up the device. Fenwal Labora- 
tories developed the CS-3000 Cell Separator (subsequently sold by Baxter Healthcare), which uses a 
continuous J-shaped mulitchannel tubing connected directly to the rotating element. This eliminates the 
need for a rotating seal, thereby minimizing the possibility of leaks. The tubing in this device actually 
rotates around the centrifuge bowl during operation, using a "jump rope" principle to prevent twisting 
of the flow lines during centrifugation. 

In addition to the continuous flow centrifuge, Haemonetics has developed a series of intermittent flow 
centrifuges [Rock, 1983]. Blood flows into the bottom of a separation chamber similar to that shown in 
Fig. 132.1, but the red cells simply accumulate in this chamber while the plasma is drawn to the center 
of the rotating bowl and removed through an outlet port at the top of the device. When the process is 

© 2000 by CRC Press LLC 

I-&- -4- -4^1 




8594/S13/chl32/frame Page 6 Friday, March 03, 2000 03:08 PM 



complete, the pump action reverses, and the red cells are forced out of the bowl and reinfused into the 
patient (along with any replacement fluid). The entire process is then automatically repeated to obtain 
the desired level of plasma (or white cell) removal. This device was originally developed for the collection 
of leukocytes and platelets, but it is now used extensively for large-scale plasmapheresis as well. Maximum 
blood flow rates are typically about 70-80 ml/min, and the bowl rotates at about 5000 rpm. The device 
as a whole is more easily transported than most of the continuous flow centrifuges, but it requires almost 
50% more time for the collection of an equivalent plasma volume due to the intermittent nature of the 
process. In addition, the total extracorporeal volume is quite high (about 500 ml compared to only 250 
ml for most of the newer continuous flow devices) due to the larger chamber required for the red cell 
accumulation [Sawada et al., 1990]. 

All these centrifugal devices have the ability to carry out effective plasma exchange, although the rate 
of plasma collection tends to be somewhat slower than that for the membrane devices discussed in the 
next section. These devices can also be used for the collection of specific cell fractions, providing a degree 
of flexibility that is absent in the membrane systems. The primary disadvantage of the centrifugal units 
is the presence of a significant number of platelets in the collected plasma (typically about 10 5 platelets 
per p.1). Not only can this lead to considerable platelet depletion during repeated applications of plasma 
exchange, it can also interfere with many of the secondary processing steps employed in plasma perfusion. 



Membrane Plasmapheresis 

The general concept of blood filtration using porous membranes is quite old, and membranes with pores 
suitably sized to retain the cellular components of blood and pass the plasma proteins have been available 
since the late 1940s. Early attempts at this type of blood filtration were largely unsuccessful due to severe 
problems with membrane plugging (often referred to as fouling) and red cell lysis. Blatt and coworkers 
at Amicon recognized that these problems could be overcome using a cross-flow configuration [Solomon 
et al., 1978] in which the blood flow was parallel to the membrane and thus perpendicular to the plasma 
(filtrate) flow as shown schematically in Fig. 132.2. This geometry minimizes the accumulation of retained 
cells at the membrane surface, leading to much higher filtration rates and much less cell damage than 
could be obtained in conventional dead-end filtration devices [Solomon et al., 1978]. This led to the 
development of a large number of membrane devices using either flat sheet or hollow-fiber membranes 
made from a variety of polymers including polypropylene (Travenol Laboratories, Gambro, Fresenius), 
cellulose diacetate (Asahi Medical), polyvinyl alcohol (Kuraray), polymethylmethacrylate (Toray), and 
polyvinyl chloride (Cobe Laboratories). 
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FIGURE 132.2 Schematic diagram of a parallel plate device for cross-flow membrane plasmapheresis. 
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FIGURE 132.3 Experimental data for the nitrate flux as a function of the applied transmembrane pressure drop 
in a parallel plate membrane plasmaphersis device. Red cell hemolysis, defined as a filtrate hemoglobin concentration 
exceeding 20 mg/dl, occurs to the right of the dashed line. Data have been adapted from Zydney and Colton [1987]. 

These membrane devices all produce essentially cell-free plasma with minimal protein retention using 
membranes with pore sizes of 0.2-0.6 p.m. In addition, these devices must be operated under conditions 
that cause minimal red cell lysis, while maintaining a sufficiently high plasma filtrate flux to reduce the 
cost of the microporous membranes. Typical experimental data for a parallel plate membrane device are 
shown in Fig. 132.3 [Zydney & Colton, 1987]. The results are plotted as a function of the mean trans- 
membrane pressure drop (AP TM ) at several values of the wall shear rate (y w ), where y w is directly propor- 
tional to the inlet blood flow rate (Q B ). The filtrate flux initially increases with increasing AP TM reaching 
a maximum pressure independent value which increases with increasing shear rate. The flux under all 
conditions is substantially smaller than that obtained when filtering pure (cell-free) saline under identical 
conditions (dashed line in Fig. 132.3). No measurable hemolysis was observed at low AP TM even when 
the flux was in the pressure-independent regime. Hemolysis does become significant at higher pressures, 
with the extent of hemolysis decreasing with increasing y w and with decreasing membrane pore size 
[Solomon et al., 1978]. The dashed diagonal line indicates the pressure at which the filtrate hemoglobin 
concentration exceeds 20 mg/dl. 

The pressure-independent flux at high AP TM generally has been attributed to the formation of a 
concentration polarization boundary layer consisting of a high concentration of the formed elements in 
blood, mainly the red blood cells, which are retained by the microporous membrane (Fig. 132.2). This 
dynamic layer of cells provides an additional hydraulic resistance to flow, causing the flux to be substan- 
tially smaller than that obtained during filtration of a cell-free solution. At steady-state, this boundary 
layer is in dynamic equilibrium, with the rate of convection of formed elements toward the membrane 
balanced by the rate of mass transport back into the bulk suspension. There has been some debate in 
the literature over the actual mechanism of cell transport in these devices, and the different models that 
have been developed for the plasma flux during membrane plasmapheresis are discussed elsewhere 
[Zydney & Colton, 1986]. 

Zydney and Colton [ 1982] proposed that cell transport occurs by a shear- induced diffusion mechanism 
in which cell-cell interactions and collisions give rise to random cell motion during the shear flow of a 
concentrated suspension. This random motion can be characterized by a shear-included diffusion coef- 
ficient, which was evaluated from independent experimental measurements as 
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where y is the local shear rate (velocity gradient), a is the cell radius (approximately 4.2 (J.m for the red 
blood cells), and C is the local red cell concentration. The function /(C), which reflects the detailed 
concentration dependence of the shear-induced diffusion coefficient, is approximately equal to 0.03 for 
red cell suspensions over a broad range of cell concentrations. 

The local filtrate flux can be evaluated using a stagnant film analysis in which the steady-state mass 
balance is integrated over the thickness of the concentration boundary layer yielding [Zydney & Colton, 
1986] 



*)■ 
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(132.6) 



where z is the axial distance measured from the device inlet, and C w and C b are the concentrations of 
formed elements at the membrane surface and in the bulk suspension, respectively. The bulk mass transfer 
coefficient (k m ) can be evaluated using the Leveque approximation for laminar flow in either a parallel 
plate or hollow fiber device 
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(132.7) 



where the second expression has been developed using the shear-induced diffusion coefficient given by 
Eq (132.5) with /(C) = 0.03. The wall shear rate is directly proportional to the blood flow rate with 
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(132.8) 



for a hollow- fiber device with N fibers of inner radius R and 
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(132.9) 



for a parallel plate device with channel height h and total membrane width w. 

At high pressures, C w approaches its maximum value which is determined by the maximum packing 
density of the cells (about 95% under conditions typical of clinical plasmapheresis). The plasma flux 
under these conditions becomes independent of the transmembrane pressure drop, with this pressure- 
independent flux varying linearly with the wall shear rate and decreasing with increasing bulk cell 
concentration as described by Eqs. (132.6) and (132.7). A much more detailed numerical model for the 
flux [Zydney & Colton, 1987], which accounts for the concentration and shear rate dependence of both 
the blood viscosity and shear- induced cell diffusion coefficient as well as the compressibility of the blood 
cell layer that accumulates at the membrane, has confirmed the general behavior predicted by Eqs. (132.6) 
and (132.7). 

The volumetric filtrate (plasma) flow rate (Q ) in a hollow- fiber membrane filter can be evaluated by 
integrating Eqs. (132.6)-(132.8) along the length of the device accounting for the decrease in the blood 
flow rate (and thus y w ) due to the plasma removal [Zydney 8c Colton, 1986]. The resulting expression 
for the fractional plasma yield is 



Op 



= 1 — exp 



-0.62 



2/3 



In 



9w 

c, 



(132.10) 



1I-&- 



© 2000 by CRC Press LLC 



4&- 



-&-I 



8594/S13/chl32/frame Page 9 Friday, March 03, 2000 03:08 PM 



An analogous expression can be evaluated for a parallel plate device with the channel height h replacing 
R and the coefficient 0.62 becoming 0.84. Even though the development leading to Eq. (132.10) neglects 
the detailed variations in the bulk cell concentration and velocity profiles along the fiber length, the final 
expression has been shown to be in good agreement with experimental data for the plasma flow rate in 
actual clinical devices [Zydney & Colton, 1986]. Equation (132.10) predicts that the volumetric plasma 
flow rate is independent of the number of hollow fibers (or the membrane width for a parallel plate 
membrane device), a result which is consistent with a number of independent experimental investigations. 

According to Eq. (132.10), the plasma flow rate increases significantly with decreasing fiber radius. 
There are, however, a number of constraints on the smallest fiber radius that can actually be employed 
in these hollow-fiber membrane devices. For example, blood clotting and fiber blockage can become 
unacceptable in very narrow bore fibers. The blood flow in such narrow fibers also causes a very high 
bulk shear stress, which potentially can lead to unacceptable levels of blood cell damage (particularly for 
white cells and platelets). Finally, the hollow- fiber device must be operated under conditions which avoid 
hemolysis. 

Zydney and Colton [1982] developed a model for red cell lysis during membrane plasmapheresis in 
which the red cells are assumed to rupture following their deformation into the porous structure of the 
membranes. A given red cell is assumed to lyse if it remains in a pore for a sufficient time for the strain 
induced in the red cell membrane to exceed the critical strain for cell lysis. Since the red cell can be 
dislodged from the pore by collisions with other cells moving in the vicinity of the membrane or by the 
fluid shear stress, the residence time in the pore will be inversely related to the wall shear rate. 

The tension (o) in the red cell membrane caused by the deformation in the pore is evaluated using 
Laplace's law [Zydney & Colton, 1987] 

a= ™ p (132.11) 

2 

where _R is the pore radius. Hemolysis is assumed to occur at a critical value of the strain in the red cell 
membrane (S); thus the time required for lysis is given implicitly by 



S = a g(t) = V 0.0010 + 0.0012 l-exp(-8t) 



+ 4.5xl0~ 6 f (132.12) 



The function g(t) represents the temporal dependence of the lytic phenomenon and has been evaluated 
from independent experimental measurements [Zydney & Colton, 1982]. Cell lysis occurs when S > 0.03 
in Eq. (132.12) where O is given in dyne/cm and t is in sec. This simple model has been shown to be in 
good agreement with experimental data for red cell lysis during cross-flow membrane plasmapheresis 
[Zydney & Colton, 1987]. 

This physical model for red cell lysis implies that hemolysis can be avoided by operating at sufficiently 
high shear rates to reduce the residence time in the membrane pores. However, operation at high shear 
rates also causes the inlet transmembrane pressure drop to increase due to the large axial pressure drop 
associated with the blood flow along the length of the device [Zydney 6k Colton, 1982]: 



(o) = AP tm (l) + 2hy w \ (132.13) 



K 



where AP TM (0) and AP TM (L) are the inlet and exit transmembrane pressure drops, respectively, and p. is 
the average blood viscosity. AP TM (L) is typically maintained at a small positive value (about 20 mmHg) 
to ensure that there is a positive transmembrane pressure drop along the entire length of the device. 
Since the increase in AP TM {0) with increasing y w has a greater effect on hemolysis than the reduction in 
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FIGURE 132.4 Schematic representation of the safe operating regime for a clinical membrane plasmapheresis 
device. 

the residence time for the red cells in the membrane pores, there is also an upper bound on the shear 
rate for the safe operation of any given clinical device. 

The predicted safe operating regime for a clinical membrane plasmapheresis device can be determined 
using Eq. (132.11), with the maximum transmembrane pressure drop occurring at the device inlet, 
Eq. (132.13). The results are shown schematically in Fig. 132.4. Hemolysis occurs at very low shear rates 
due to the long residence time in the membrane pores, whereas lysis at high shear rates is due to the 
large value of the inlet transmembrane pressure drop associated with the axial flow. Note that there is a 
critical fiber length [at fixed values of the fiber radius and AP TM (L)] above which there is no longer any 
safe operating condition. 

To avoid some of the constraints associated with the design of both parallel plate and hollow-fiber 
membrane devices. Hemasciences developed a rotating membrane filter for use in both donor and 
therapeutic plasmapheresis. A nylon membrane is placed on an inner cylinder and rotated at about 
3600 rpm inside a concentric outer cylindrical chamber using a magnetic coupling device. The rotating 
membrane causes a very high shear rate (on the order of 10,000 S" 1 ) in the narrow gap between the 
cylinders. However, these high shear rates do not result in a large axial pressure drop, as found in the 
parallel plate and hollow-fiber devices, due to the decoupling of the axial blood flow and the shear rate 
in this system (the shear is now due almost entirely to the membrane rotation). The fluid flow in this 
rotating cylinder system also leads to the development of fluid instabilities known as Taylor vortices, and 
these vortices dramatically increase the rate of cell mass transport away from the membrane and back 
into the bulk suspension. This leads to a dramatic increase in the plasma filtrate flux and a dramatic 
reduction in the required membrane area. The Autopheresis-C (the rotating filter currently sold by Baxter 
Healthcare) uses only 70 cm 2 of membrane, which is more than an order of magnitude less than that 
required in competitive hollow-fiber and parallel plate devices. The mathematical analysis of the plasma 
filtrate flux and the corresponding design equations for the rotating cylinder plasma filter are provided 
by Zeman and Zydney [1996]. 

132.2 Plasma Perfusion 

In repeated applications of plasma exchange, it is necessary to use replacement fluids that contain proteins 
to avoid the risks associated with protein depletion. One approach to minimizing the cost of these protein- 
containing replacement fluids (either albumin solutions, fresh frozen plasma, or plasma protein fraction) 
is to use a saline or dextran solution during the initial stages of the process and to then switch to a 
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protein-containing replacement fluid toward the end of the treatment. Alternatively, a number of tech- 
niques have been developed to selectively remove specific toxic or immunogenic components from the 
plasma, with this treated plasma returned to the patient along with the cellular components of blood. 
This effectively eliminates the need for any expensive protein-containing replacement fluids. 

Plasma perfusion (also known as online plasma treatment) is typically performed using either mem- 
brane or sorbent-based systems. Membrane filtration separates proteins on the basis of size and is thus 
used to selectively remove the larger molecular weight proteins from albumin and the small plasma 
solutes (salts, sugars, amino acids, and so on). A variety of membranes have been employed for this type 
of plasma fractionation including cellulose acetate (Terumo), cellulose diacetate (Asahi Medical and 
Teijin), ethylene vinyl alcohol (Kuraray), and polymethylmetharcrylate (Toray). These membranes are 
generally hydrophilic to minimize the extent of irreversible protein adsorption, with pore sizes ranging 
from 100-600 A depending on the specific objectives of the membrane fractionation. 

The selectivity that can be obtained with this type of plasma filtration can be examined using available 
theoretical expressions for the actual sieving coefficient (SJ for a spherical solute in a uniform cylindrical 
pore: 

S a = (l-l) 2-(l-A,) exp(-0.711 A, 2 ) (132.14) 

where X is the ratio of the solute to pore radius. Equation (132.14) is actually an approximate expression 
which has been shown to be in good agreement with more rigorous theoretical analyses. This expression 
for the actual sieving coefficient, where S a is defined as the ratio of the protein concentration in the 
filtrate to that at the upstream surface of the membrane, is valid at high values of the plasma filtrate flux, 
since it implicitly assumes that the diffusive contribution to protein transport is negligible. To avoid 
excessive albumin loss, it is desirable to have S a > 0.8, which can be achieved using a membrane with an 
effective pore size greater than about 160 A (albumin has a molecular weight of 69,000 and a Stokes- 
Einstein radius of 36 A). This membrane would be able to retain about 80% of the immunglobulin M 
(which has a molecular weight of about 900,000 and a Stokes-Einstein radius of 98 A), but it would 
retain less than 40% of the immunglobulin G (with MW = 155,000 and a radius of 55 A). 

The protein retention obtained during an actual plasma filtration is substantially more complex than 
indicated by the above discussion. The polymeric membranes used in these devices actually have a broad 
distribution of irregularly shaped (noncylindrical) pores. Likewise, the proteins can have very different 
(nonspherical) conformations, and their transport characteristics also can be affected by electrostatic, 
hydrophobic, and van der Waals interactions between the proteins and the polymeric membrane, in addition 
to the steric interactions that are accounted for in the development leading to Eq. (132.14). Protein-protein 
interactions can also significantly alter the observed protein retention. Finally, the partially retained proteins 
will tend to accumulate at the upstream surface of the membrane during filtration (analogous to the 
concentration polarization effects described previously in the context of blood cell filtration). 

This type of secondary plasma filtration, which is generally referred to in the literature as cascade 
filtration, is primarily effective at removing large immune complexes (molecular weight of approximately 
700,000) and immunglobulin M (MW of 900,000) from smaller proteins such as albumin. Several studies 
have, however, found a higher degree of albumin-immunoglobulin G separation than would be expected 
based on purely steric considerations [Eq. (132.14)]. This enhanced selectivity is probably due to some 
type of long-range (e.g., electrostatic) interaction between the proteins and the membrane. 

A number of different techniques have been developed to enhance the selectivity of these plasma 
filtration devices. For example, Malchesky and coworkers at the Cleveland Clinic [Malchesky et al., 1980] 
developed the process of cryofiltration in which the temperature of the plasma is lowered to about 10°C 
prior to filtration. A number of diseases are known to be associated with the presence of large amounts 
of cryo- (cold-) precipitable substances in the plasma, including a number of autoimmune diseases such 
as systemic lupus erythematosus and rheumatoid arthritis. Lowering the plasma temperature causes the 
aggregation and/or gelation of these cryoproteins, making it much easier for these components to be 
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removed by the membrane filtration. About 10 g of cryogel can be removed in a single cryofiltration, 
along with significant amounts of the larger-molecular-weight immune complexes and IgM. The actual 
extent of protein removal during cyrofiltration depends on the specific composition of the plasma and 
thus on the nature as well as the severity of the particular disease state [Sawada et al., 1990]. There is 
thus considerable uncertainty over the actual components that are removed during cryofiltration under 
different clinical and/or experimental conditions. The cryogel layer that accumulates on the surface of 
the membrane also affects the retention of other plasma proteins, which potentially could lead to 
unacceptable losses even of small proteins such as albumin. 

It is also possible to alter the selectivity of the secondary membrane filtration by heating the plasma up 
to or even above physiologic temperatures. This type of thermofiltration has been shown to increase the 
retention of low- (LDL) and very low ( VLDL) density lipoproteins, and this technique has been used for 
the online removal of these plasma proteins in the treatment of hypercholesterolemia. LDL removal can 
also be enhanced by addition of a heparin/acetate buffer to the plasma, which causes precipitation of LDL 
and fibrinogen with the heparin [Sawada et al., 1990]. These protein precipitates can then be removed 
relatively easily from the plasma by membrane filtration. The excess heparin is subsequently removed from 
the solution by adsorption, with the acetate and excess fluid removed using bicarbonate dialysis. 

An attractive alternative to secondary membrane filtration for the selective removal of plasma com- 
ponents is the use of sorbent columns such as: (1) activated charcoal or anion exchange resins for the 
removal of exogenous toxins, bile acids, and bilirubin; (2) dextran sulfate cellulose for the selective 
removal of cholesterol, LDL, and VLDL; (3) immobilized protein A for the removal of immunoglobulins 
(particularly IgG) and immune complexes; and (4) specific immobilized ligands like DNA (for the 
removal of anti-DNA Ab), tryptophan (for the removal of antiacetylcholine receptor antibodies), and 
insulin (for the removal of anti-insulin antibodies). These sorbents provide a much more selective 
separation than is possible with any of the membrane processes; thus they have the potential to signifi- 
cantly reduce the side effects associated with the depletion of needed plasma components. The sorbent 
columns generally are used in combination with membrane plasmaphersis, since the platelets that are 
present in the plasma collected from available centrifugal devices can clog the columns and interfere with 
the subsequent protein separation. 

The development of effective sorbent technology for online plasma treatment has been hindered by 
the uncertainties regarding the actual nature of the plasma components that must be removed for the 
clinical efficacy of therapeutic apheresis in the treatment of different disease states. In addition, the use 
of biologic materials in these sorbent systems (e.g., protein A or immobilized DNA) presents particular 
challenges, since these materials may be strongly immunogenic if they desorb from the column and enter 
the circulation. 

132.3 Cytapheresis 

Cytapheresis is used to selectively remove one (or more) of the cellular components of blood, with the 
other components (including the plasma) returned to the patient. For example, leukocyte (white cell) 
removal has been used in the treatment of leukemia, autoimmune diseases with a suspected cellular 
immune mechanism (e.g., rheumatoid arthritis and myasthenia gravis), and renal allograft rejection. 
Erythrocyte (red cell) removal has been used to treat sickle cell anemia, severe autoimmune hemolytic 
anemia, and severe parasitemia. Plateletapheresis has been used to treat patients with thrombocythemia. 
Most cytapheresis is performed using either continuous or intermittent flow centrifuges, with appro- 
priate software and/or hardware modifications used to enhance the collection of the specific cell fraction. 
It is also possible to remove leukocytes from whole blood by depth filtration, which takes advantage of 
the strong adherence of leukocytes to a variety of polymeric materials (e.g., acrylic, cellulose acetate, 
polyester, or nylon fibers). Leukocyte adhesion to these fibers is strongly related to the configuration and 
the diameter of the fibers, with the most effective cell removal obtained with ultrafine fibers less than 
3 p.m in diameter. Available leukocyte filters (Sepacel, Cellsora, and Cytofrac from Asahi Medical Co.) 
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have packing densities of about 0.1-0.15 g fiber/cm 3 and operate at blood flow rates of 20-50 ml/min, 
making it possible to process about 2 L of blood in 1.5 hr. 

Leukocyte filtration is used most extensively in blood-banking applications to remove leukocytes from 
the blood prior to transfusion, thereby reducing the likelihood of antigenic reactions induced by donor 
leukocytes and minimizing the possible transmission of white-cell associated viral diseases such as 
cytomegalovirus. The absorbed leukocytes can also be eluted from these filters by appropriate choice of 
buffer solution pH, making it possible to use this technique for the collection of leukocytes from donated 
blood for use in the subsequent treatment of leukopenic recipients. Depth filtration has also been 
considered for online leukocyte removal from the extracorporeal circuit of patients undergoing cardio- 
pulmonary bypass as a means to reduce the likelihood of postoperative myocardial or pulmonary rep- 
erfusion injury which can be caused by activated leukocytes. 

A new therapeutic technique that involves online cytapheresis is the use of extracorporeal photochemo- 
therapy, which is also known in the literature as photopheresis. Photopheresis can be used to treat a variety 
of disorders caused by aberrant T- lymphocytes [Edelson, 1989], and it has become an established therapy 
for the treatment of advanced cutaneous T-cell lymphoma in the U.S. and several European countries. 
In this case, the therapy involves the use of photoactivated 8-methoxypsoralen, which blocks DNA 
replication causing the eventual destruction of the immunoactive T-cells. The psoralen compound is 
taken orally prior to the phototherapy. Blood is drawn from a vein and separated by centrifugation. The 
white cells and plasma are collected, diluted with a saline solution, and then pumped through a thin 
plastic chamber in which the cells are irradiated with a high-intensity UV light that activates the psoralen. 
The treated white cells are then recombined with the red cells and returned to the patient. Since the 
photoactivated psoralen has a half-life of only several microseconds, all its activity is lost prior to 
reinfusion of the cells, thereby minimizing possible side effects on other organs. The removal of the red 
cells (which have a very high adsorptivity to UV light) makes it possible to use a much lower energy UV 
light, thereby minimizing the possible damage to normal white cells and platelets. 

Photopheresis has also been used in the treatment of scleroderma, systemic lupus erythematosus, and 
pemphigus vulgaris. The exact mechanism for the suppression effect induced by the photo-therapy in 
these diseases is uncertain, although the T-cell destruction seems to be highly specific for the immuno- 
active T-cells [Edelson, 1989]. The response is much more involved than simple direct photoinactivation 
of the white cells; instead, the photo-treated cells appear to undergo a delayed form of cell death which 
elicits an immunologic response possibly involving the production of anti-idiotypic antibodies or the 
generation of clone-specific suppressor T-cells. This allows for an effective "vaccination" against a par- 
ticular T-cell activity without the need for isolating or even identifying the particular cells that are 
responsible for that activity [Edelson, 1989]. 

Phototherapy has also been used for virus inactivation, particularly in blood-banking applications 
prior to transfusion. This can be done using high-intensity UV light alone or in combination with specific 
photoactive chemicals to enhance the virus inactivation. For example, hematoporphyrin derivatives have 
been shown to selectively destroy hepatitis and herpes viruses in contaminated blood. This technique 
shows a high degree of specificity toward this type of enveloped virus, which is apparently due to the 
affinity of the photoactive molecules for the lipids and glycolipids that form an integral part of the viral 
envelope. 

Another interesting therapeutic application involving cytapheresis is the ex vivo activation of immu- 
nologically active white cells (lymphokine-activated killer cells, tumor-infiltrating lymphocytes, or acti- 
vated killer macrophages) for the treatment of cancer. The detailed protocols for this therapy are still 
being developed, and there is considerable disagreement regarding its actual clinical efficacy. A pool of 
activated cells is generated in vivo by several days of treatment with interleukin-2. These cells are then 
collected from the blood by centrifugal cytapheresis and further purified using density gradient centrif- 
ugation. The activated cells are cultured for several days in a growth media containing additional 
interleukin-2. These ex-vivo activated cells are then returned to the patient, where they have been shown 
to lyse existing tumor cells and cause regression of several different metastatic cancers. 
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132.4 Summary 

Apheresis is unique in terms of the range of diseases and metabolic disorders which have been successfully 
treated by this therapeutic modality. This broad range of application is possible because apheresis directly 
alters the body's immunologic system though the removal or alteration of specific immunologically active 
cells and/or proteins. 

Although there are a number of adverse reactions that can develop during apheresis (e.g., fluid imbalance, 
pyrogenic reactions, depletion of important coagulation factors, and thrombocytopenia), the therapy is 
generally well tolerated even by patients with severely compromised immune systems. This has, in at least 
some instances, led to the somewhat indiscriminate use of therapeutic apheresis for the treatment of diseases 
in which there was little physiologic rationale for the application of this therapy. This was particularly true 
in the 1980s, where dramatic advances in the available technology for both membrane and centrifugal blood 
fractionation allowed for the relatively easy use of apheresis in the clinical milieu. In some ways, apheresis 
in the 1980s was a medical treatment that was still looking for a disease. Although apheresis is still evolving 
as a therapeutic modality it is now a fairly well-established procedure for the treatment of a significant 
number of diseases (most of which are relatively rare) in which the removal of specific plasma proteins or 
cellular components can have a beneficial effect on the progression of that particular disease. Furthermore, 
continued advances in the equipment and procedures used for blood fractionation and component removal 
have, as discussed in this chapter, provided a safe and effective technology for the delivery of this therapy. 

The recent advances in sorbent-based systems for the removal of specific immunologically active 
proteins and in the development of treatment for the activation or inactivation of specific cellular 
components of the immune system has provided exciting new opportunities for the alteration and even 
control of the body's immunologic response. This includes: (1) the direct removal of specific antibodies 
or immune complexes (using membrane plasmapheresis with appropriate immunosorbent columns), 
(2) the inactivation or removal of specific lymphocytes (using centrifugal cytapheresis in combination 
with appropriate extracorporeal phototherapy or chemotherapy), and/or (3) the activation of a disease- 
specific immunologic response (using cytapheresis and ex vivo cell culture with appropriate lymphokines 
and cell stimuli). New advances in our understanding of the immune system and in our ability to 
selectively manipulate and control the immunologic response should thus have a major impact on 
therapeutic apheresis and the future development of this important medical technology. 

Defining Terms 

Autoimmune diseases: A group of diseases in which pathological antibodies are produced that attack 
the body's own tissue. Examples include glomerulonephritis (characterized by inflammation of 
the capillary loops in the glomeruli of the kidney) and myasthenia gravis (characterized by an 
inflammation of the nerve/muscle junctions). 

Cascade filtration: The combination of plasmapheresis with a second online membrane filtration of 
the collected plasma to selectively remove specific toxic or immunogenic components from blood 
based primarily on their size. 

Cytapheresis: A type of therapeutic apheresis involving the specific removal of red blood cells, white 
cells (also referred to as leukapheresis), or platelets (also referred to as plateletapheresis) . 

Donor apheresis: The collection of a specific component of blood (either plasma or one of the cellular 
fractions), with the return of the remaining blood components to the donor. Donor apheresis is 
used to significantly increase the amount of plasma (or a particular cell type) that can be donated 
for subsequent use in blood banking and/or plasma fractionation. 

Immune complexes: Antigen-antibody complexes that can be deposited in tissue. In rheumatoid arthri- 
tis this deposition occurs primarily in the joints, leading to severe inflammation and tissue damage. 

Photopheresis: The extracorporeal treatment of diseases characterized by aberrant T-cell populations 
using visible or ultraviolet light therapy, possibly in combination with specific photoactive chemicals. 

Plasma exchange: The therapeutic process in which a large volume of plasma (typically 3 L) is removed 
and replaced by an equivalent volume of a replacement fluid (typically fresh frozen plasma, a 
plasma substitute, or an albumin-containing saline solution). 
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Plasma perfusion: The therapeutic process in which a patient's plasma is first isolated from the cellular 
elements in the blood and then subsequently treated to remove specific plasma components. This 
secondary treatment usually involves a sorbent column designed to selectively remove a specific 
plasma component or a membrane filtration designed to remove a broad class of plasma proteins. 

Plasmapheresis: The process in which plasma is separated from the cellular components of blood using 
either centrifugal or membrane-based devices. Plasmapheresis can be employed in donor applica- 
tions for the collection of source plasma for subsequent processing into serum fractions or in 
therapeutic applications for the treatment of a variety of disorders involving the presence of 
abnormal circulating components in the plasma. 

Therapeutic apheresis: A process involving the separation and removal of a specific component of the 
blood (either plasma, a plasma component, or one of the cellular fractions) for the treatment of 
a metabolic disorder or disease state. 
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Further Information 

Several of the books listed above provide very effective overviews of both the technical and clinical aspects 
of therapeutic apheresis. In addition, the Office of Technology Assessment has published Health Tech- 
nology Case Study 23: The Safety, Efficacy, and Cost Effectiveness of Therapeutic Apheresis, which has an 
excellent discussion of the early clinical development of apheresis. Several journals also provide more 
detailed discussions of current work in apheresis, including Artificial Organs and the Journal of Clinical 
Apheresis. The abstracts and proceedings from the meetings of the International Congress of the World 
Apheresis Association and the Japanese Society for Apheresis also provide useful sources for current 
research on both the technology and clinical applications of therapeutic apheresis. 
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133.1 Morphology of the Liver 

The liver is a complex organ that operates both in series and in parallel with the gastrointestinal tract. 
After entering the portal system, the products of digestion come in contact with the liver parenchymal 
cells, or hepatocytes, which remove most of the carbohydrates, amino acids, and fats from the feeder 
circulation, therefore preventing excessive increases throughout the body after a meal. In the liver, these 
products are then stored, modified, and slowly released to the better advantage of the whole organism. 

The liver can be considered a complex large-scale biochemical reactor, since it occupies a central 
position in the metabolism, i.e., the sum of the physical and chemical processes by which living matter 
is produced, maintained, and destroyed, and whereby energy is made available for the functioning of 
liver cells as well as tissues from all other organs. 

The adult human liver (weighing 1500 g) receives its extensive blood supply (on the order of 1 L/min 
or 20% of cardiac output) from two sources: the portal vein (over two-thirds) and the hepatic artery 
(about one-third). Blood from the liver drains through the hepatic veins into the inferior vena cava. 
Macroscopically, the liver is divided into 4 or 5 lobes with individual blood supply and bile drainage 
channels. Some of these lobes can be surgically separated, although not without difficulty. 

Microscopically, human hepatocytes (250-500 x 10 9 in each liver) are arranged in plates (Fig. 133.1) 
that are radially distributed around the central (drainage) vein [Jones & Spring-Mills, 1977] and form 
somewhat hexagonal structures, or liver lobules, which are much more clearly demarcated in porcine 
livers. Present in the periphery of these lobules are the so-called portal triads, in the ratio of three triads 
for each central vein. In each portal triad, there are tributaries of the portal vein, branches of the hepatic 
artery, and collector ducts for the bile (Fig. 133.1). Blood enters the liver lobule at the periphery from 
terminal branches of the portal vein and the hepatic arteries and is distributed into capillaries which 
separate the hepatocyte plates. These capillaries, called sinusoids, characteristically have walls lined by 
layers of endothelial cells that are not continuous but are perforated by small holes (fenestrae). Other 
cells are present in the sinusoid wall, e.g., phagocytic Kuppfer cells, fat-storing Ito cells, and probably a 
few yet undefined mesenchymal cells. It is important to emphasize that blood-borne products (with the 
exception of blood cells) have free access to the perisinusoidal space, called the space of Disse, which can 
be visualized by electron microscopy as a gap separating the sinusoidal wall from the hepatocyte plasma 
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FIGURE 133.2 Hepatocyte relationships with the space of Disse and the sinusoid wall. 

membrane (Fig. 133.2). In this space, modern immunomicroscopic studies have identified three types 
of collagens: Type IV (the most abundant), Type I, and Type III. Fibronectin and glycosaminoglycans 
are also found there, but laminin is only present in the early stages of liver development not in adult 
mammalian livers [Martinez-Hernandez, 1984]. 
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The hepatocytes themselves are large (each side about 25 microns), multifaceted, polarized cells with 
an apical surface which constitutes the wall of the bile canaliculus (the channel for bile excretion) and 
basolateral surfaces which lie in close proximity to the blood supply. Hepatocytes constitute 80-90% of 
the liver cell mass. Kuppfer cells (about 2%) belong to the reticuloendothelial system, a widespread class 
of cells which specialize in the removal of particulate bodies, old blood cells, and infectious agents from 
the blood stream. 

The cytoplasm of hepatocytes contains an abundance of smooth and rough endoplasmic reticulum, 
ribosomes, lysosomes, and mitochondria. These organelles are involved in complex biochemical pro- 
cesses: fat and lipid metabolism, synthesis of lipoproteins and cholesterol, protein metabolism, and 
synthesis of complex proteins, e.g., serum albumin, transferrin, and clotting factors from amino acid 
building blocks. The major aspects of detoxification take place in the cisternae of the smooth endoplasmic 
reticulum, which are the site of complex oxidoreductase enzymes known collectively as the cytochrome 
P-450 system. In terms of excretion, hepatocytes produce bile, which contains bile salts and conjugated 
products. Hepatocytes also store large pools of essential nutrients such as folic acid, retinol, and cobalamin. 

133.2 Liver Functions 

The liver fulfills multiple and finely tuned functions that are critical for the homeostasis of the human 
body. Although individual pathways for synthesis and breakdown of carbohydrates, lipids, amino acids, 
proteins, and nucleic acids can be identified in other mammalian cells, only the liver performs all these 
biochemical transformations simultaneously and is able to combine them to accomplish its vital biologic 
task. The liver is also the principal site of biotransformation, activation or inactivation of drugs and 
synthetic chemicals. Therefore, this organ displays a unique biologic complexity. When it fails, functional 
replacement presents one of he most difficult challenges in substitutive medicine. 

Under normal physiologic requirements, the liver modifies the composition and concentration of the 
incoming nutrients for its own usage and for the benefit of other tissues. Among the major liver functions, 
the detoxification of foreign toxic substances (xenobiotics), the regulation of essential nutrients, and the 
secretion of transport proteins and critical plasma components of the blood coagulation system are 
probably the main elements to evaluate in a successful organ replacement [Jauregui, 1991]. The liver also 
synthesizes several other critical proteins, excretes bile, and stores excess products for later usage, functions 
that can temporarily be dispensed with but must eventually be provided. 

The principal functions of the liver are listed in Table 133.1. The challenge of liver support in case of 
organ failure is apparent from the complexity of functions served by liver cells and from our still imperfect 
ability to rank these functions in terms of urgency of replacement. 

TABLE 133.1 Liver Functions 

Carbohydrate metabolism: Glyconeogenesis and glycogenolysis 
Fat and lipid metabolism: Synthesis of lipoproteins and cholesterol 
Synthesis of plasma proteins, for example: 

Albumin 

Globulins 

Fibrinogen 

Coagulation factors 

Transferrin 

a- fetoprotein 
Conjugation of bile acids; conversion of heme to bilirubin and biliverdin 
Detoxification: Transformation of metabolites, toxins, and hormones into water-soluble compounds (e.g., 

cytochrome P-450 P-450 oxidation, glucuronyl transferase conjugation) 
Biotransformation and detoxification of drugs 
Metabolism and storage of vitamins 
Storage of essential nutrients 
Regeneration 
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133.3 Hepatic Failure 

More than any other organ, the liver has the property of regeneration after tissue damage. Removal or 
destruction of a large mass of hepatic parenchyma stimulates controlled growth to replace the missing 
tissue. This can be induced experimentally, e.g., two thirds of a rat liver can be excised with no ill effects 
and will be replaced within 6 to 8 days. The same phenomenon can be observed in humans and is a 
factor in the attempted healing process characteristic of the condition called liver cirrhosis. Recent attempts 
at liver transplantation using a liver lobe from a living donor rely on the same expectation of recovery 
of lost liver mass. Liver regeneration is illustrated by the myth of Prometheus, a giant who survived in 
spite of continuous partial hepatectomy through the good auspices of a vulture (a surgical procedure 
inflicted on him as punishment for having stolen the secret fire from the gods and passing it on to 
humanity). 

Hepatic failure maybe acute or chronic according to the time span it takes for the condition to develop. 
Mechanisms and toxic by-products perpetuating these two conditions are not necessarily the same. Acute 
fulminant hepatic failure (FHF) is the result of massive necrosis of hepatocytes induced over a period of 
days or weeks by toxic substances or viral infection. It is characterized by jaundice and mental confusion 
which progresses rapidly to stupor or coma. The latter condition, hepatic encephalopathy (HE), is currently 
thought to be associated with diminished hepatic catabolism. Metabolites have been identified which 
impair synaptic contacts and inhibit neuromuscular and mental functions (Table 133.2). Although brain 
impairment is the rule in this condition, there is no anatomic damage to any of the brain structures, and 
therefore, the whole process is potentially reversible. The mortality rate of FHF is high (70-90%), and 
death is quite rapid (a week or two) . Liver transplantation is currently the only effective form of treatment 
for FHF. Transplantation procedures carried out in life-threatening circumstances are much more risky 
than interventions in relatively better compensated patients. The earlier the transplantation procedure 
takes place, the greater is the chance for patient survival. However, 10-30% of FHF patients will regenerate 
their liver under proper medical management without any surgical intervention. Hence, liver transplan- 
tation presents the dilemma of choosing between an early intervention, which might be unnecessary in 
some cases, or proceeding to a late procedure with a statistically higher mortality [Jauregui et al., 1994]. 

Chronic hepatic failure, the more common and progressive form of the disease, is often associated with 
morphologic liver changes known as cirrhosis in which fibrotic tissue gradually replaces liver tissue as 
the result of long-standing toxic exposure (e.g., alcoholism) or secondary to viral hepatitis. More than 
30,000 people died of liver failure in the United States in 1990. 

In chronic hepatic failure, damaged hepatocytes are unable to detoxify toxic nitrogenous products that 
are absorbed by intestinal capillaries and carried to the liver by the portal system. Ammonia probably 
plays the major role in the deterioration of the patient's mental status, leading eventually to "hepatic 
coma." An imbalance of conventional amino acids (some abnormally high, some low) may also be 
involved in the pathogenesis of the central nervous system manifestation of hepatic failure, the most 
dramatic of which is cerebral edema. Impaired blood coagulation (due to decreased serum albumin and 
clotting factors), hemorrhage in the gastro-intestinal system (increased resistance to blood flow through 
the liver leads to portal hypertension, ascites formation, and bleeding from esophageal varices), and 



TABLE 133.2 Metabolic Products with Potential Effects in Acute Liver Failure 

Substance Mode of action 

Ammonia Neurotoxic interaction with other neurotransmitters 

Contributes to brain edema 
Benziodiazepinelike substances Neural inhibition 
GABA Neural inhibition 

Mercaptans Inhibition of Na-K ATPase 

Octopamine Acts as a false neurotransmitter 
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hepatic encephalopathy with glial cell damage in the brain are the standard landmarks of chronic hepatic 
failure. In fact, HE in chronic liver failure is often precipitated by episodes of bleeding and infection, and 
progression to deep coma is an ominous sign of impending death. 

Intensive management of chronic liver failure includes fluid and hemodynamic support, correction of 
electrolyte and acid-base abnormalities, respiratory assistance, and treatment of cerebral edema if present. 
Aggressive therapy can diminish the depth of the coma and improve the clinical signs, but the outcome 
remains grim. Eventually, 60-90% of the patients require transplantation. About 2500 liver transplants 
are performed every year in the United States, with a survival rate ranging from 68-92%. The most 
serious limitation to liver transplantation (besides associated interrelated diseases) remains donor scarcity. 
Even if segmented transplants and transplants from living related donors become acceptable practices, 
it is unlikely that the supply of organs will ever meet the demand. Further, the problem of keeping a 
patient alive with terminal hepatic failure, either chronic or acute, while waiting for an adequately matched 
transplant is much more difficult than the parallel problem in end-stage renal disease, where dialysis is 
a standardized and effective support modality. 

An appreciation of the modalities of presentation of the two types of hepatic coma encountered in 
liver failure is needed for a definition of the requirements for the proper use of liver assist devices. In the 
case of FHF, the hepatologist wants an extracorporeal device that will circulate a large volume of blood 
through a detoxifying system [Jauregui & Muller, 1992] allowing either the regeneration of the patient's 
damaged liver (and the avoidance of a costly and risky liver transplantation procedure) or the metabolic 
support needed for keeping the patient alive while identifying a cadaveric donor organ. In the first option, 
the extracorporeal liver assist device functions as an organ substitute for the time it takes the liver to 
regenerate and recover its function; in the second, it serves as temporary bridge to transplantation. 

In the case of chronic liver failure today, spontaneous recovery appears impossible. The damaged liver 
needs to be replaced by a donor organ, although not with the urgency of FHF. The extracorporeal liver 
assist device (LAD) is used as a bridge while waiting for the availability of a transplant. It follows that 
the two different types of liver failure may require different bioengineering designs. 

133.4 Liver Support Systems 

The concept of artificial liver support is predicated on the therapeutic benefit of removing toxic substances 
accumulating in the circulation of liver failure patients. These metabolites reflect the lack of detoxification 
by damaged hepatocytes, the lack of clearance of bacterial products from the gut by impaired Kupffer 
cells, and possibly the release of necrotic products from damaged cells which inhibit liver regeneration. 
Systemic endotoxemia as well as massive liver injury give rise to an inflammatory reaction with activation 
of monocytes and macrophages and release of cytokines which may be causally involved in the patho- 
genesis of multiorgan failure commonly encountered in liver failure. 

Technologies for temporary liver support focus on the detoxifying function, since this appears to be 
the most urgent problem in liver failure. The procedures and devices which have been considered for 
this purpose include the following. 

Hemodialysis 

Hemodialysis with conventional cellulosic membranes (cut-off point around 2000 daltons) or more 
permeable polysulfone or polyacrylonitrile [de Groot et al., 1984] (cut-off 1500-5000 daltons) helps to 
restore electrolyte and acid-base balance and may decrease the blood ammonia levels but cannot remove 
large molecules and plasma protein-bound toxins. Improvement of the patient's clinical condition (e.g., 
amelioration of consciousness and cerebral edema) is temporary. The treatment appears to have no 
lasting value and no demonstrated effect on patient survival. In addition, hemodialysis may produce a 
respiratory distress syndrome caused by a complement-mediated poly-morphonuclear cell aggregation 
in the pulmonary circulatory bed. Because some of the clinical benefit seems related to the removal of 
toxic molecules, more aggressive approaches focused on detoxification have been attempted. 
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Hemofiltration 

Hemofiltration with high cut-off point membranes (around 50,000 daltons with some polyacrylonitrile- 
polyvinyl chloride copolymers, modified celluloses, or polysulfones) clears natural or abnormal com- 
pounds within limits imposed by convective transport across the exchange membrane. These procedures 
again have a temporary favorable effect on hepatic encephalopathy (perhaps because of the correction 
of toxic levels of certain amino acids) with reversal of coma, but they do not clearly improve survival rates. 

Hemoperfusion 

Hemoperfusion, i.e., extracorporeal circulation of blood over nonspecific sorbents (e.g., activated char- 
coal) [Chang, 1975] or more complex biochemical reactors which allow the chemical processing of specific 
biologic products, such as ammonia, have not yet met clinical success in spite of encouraging experimental 
results, except in the case of hepatic necrosis induced by poisonous mushrooms such as Amanita phal- 
loides. Anion exchange resins and affinity columns similar to those used in separative chromatography 
may help in removing protein-bound substances (e.g., bilirubin) which would not pass through dialysis 
or hemofiltration membranes, but nonspecific sorbents may also deplete the plasma of biologically 
important substances. Further, these techniques are complicated by problems of hemocompatibility, 
related in part to the entrainment of dust ("fines") associated with the sorbent material itself and in part 
to platelet activation in patients with an already compromised coagulation status. To minimize this 
problem direct blood or plasma contact with the sorbent material can be avoided by polymer coating of 
the sorbent particles using either albumin, cellulose nitrate, or similar thin films, but hemocompatibility 
remains a concern. Here again, there is anecdotal evidence of clinical improvement of hepatic failure 
with hemoperfusion, with some reports claiming a higher survival rate in hepatic encephalopathy, but 
these reports have not been supported by well-controlled studies. As is the case for hemodialysis and 
hemofiltration, the possible beneficial effect of hemoperfusion should be evaluated in the context of the 
clinical variability in the course of FHF. 

Lipophilic Membrane Systems 

Because lipophilic toxins dominate in fulminant hepatic failure, it is conceibable to eliminate such 
compounds with a hydrophobic (e.g., polysulfone) membrane featuring large voids filled with a nontoxic 
oil [Brunner & Tegtimeier, 1984]. After diffusion, the toxins can be made water-soluble through reaction 
with a NaOH-based acceptor solution, thereby preventing their return to the blood stream. A standard, 
high-flux dialyzer in series with the lipophilic membrane device allows the removal of hydrophilic solutes. 
Such a system has proved effective in removing toxins such as phenol and p-cresol as well as fatty acids 
without inducing detrimental side effects of its own. 

Immobilized Enzyme Reactors 

To address the problem of specificity in detoxification, enzymes such as urease, tyrosinase, L-asparaginase, 
glutaminase, and UDP-glucuronyl transferase have been attached to hollow fibers or circulated in the 
closed dialysate compartment of an artificial kidney or still incorporated into microcapsules or "artificial 
cells" exposed to blood. There is considerable in vitro evidence for the effectiveness of this approach, 
and some indication of therapeutic value from in vivo animal experiments [Brunner et al., 1979]. 
However, no clinical report has documented the superiority of enzyme reactors over the various modal- 
itites of dialysis. Again there are clinical observations of clearing of the mental state of patients in hepatic 
coma, but no statistically demonstrated effect on survival. It is unclear whether the lack of success is due 
to the inability of specific enzymes to remove all offending toxins or is evidence of the need for more 
than detoxification for effective treatment. 

Parabiotic Dialysis 

Also referred to as cross-dialysis, parabolic dialysis is a variant of hemodialysis in which the dialysate 
compartment of a solute exchange device is perfused continuously with blood from a living donor. 
Because of membrane separation of the two blood streams, the procedure can be carried out even if the 
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two subjects belong to different blood groups or different animal species. However, the risk of the 
procedure to a human donor (control of blood volume, transfer of toxic substances, mixing of blood 
streams in case of dialyzer leak) and the difficulty of introducing a live animal donor into the hospital 
environment have relegated this approach to the class of therapeutic curiosities. 

Exchange Transfusion 

Exchange transfusion, i.e., the quasi-total replacement of the blood volume of a patient in liver failure 
by alternating transfusion and bleeding, is occasionally used in severe hyperbilirubinemia of the newborn, 
which used to carry an ominous prognosis because of its association with cerebral edema. The rationale 
is that exchange transfusion will reduce the level of toxins and replenish the deficient factors in the blood 
stream while the underlying condition is corrected by natural processes or drugs [Trey et al„ 1966]. With 
the advent of blood component therapy, specific plasma components can also be administered to treat 
identified deficiencies. Mortality rates of patients treated with exchange blood transfusions have been 
reported as greater than those observed with conventional therapies. 

Plasmapheresis 

Plasmapheresis, i.e., the combination of withdrawal of blood, centrifugation, or membrane processing 
to separate and discard the patient's plasma, and return of autologous cells diluted with donor plasma, 
was practiced initially as a batch process. Techniques now exist for a continuous exchange process, in 
which plasma and cells are separated by physical means outside of the body (membrane separation or 
centrifugation), and the patient's plasma replaced by banked plasma (up to 5000 ml per day) [Lepore 
et al., 1972]. There is evidence from controlled clinical trials for the effectiveness of this form of therapy, 
but the mortality rate remains high in patients with hepatic failure, whether from insufficient treatment 
or the risks of the procedure. It appears, however, that plasma exchange can be beneficial in the preop- 
erative period prior to liver transplantation so as to correct severe coagulopathy. Plasmapheresis is used 
in conjunction with the placement of a hepatocyte-seeded extracorporeal hollow-fiber device to treat 
acute and chronic liver failure [Rozga et al., 1993]. 

Combined Therapy 

Endotoxins and cytokines can be removed by hemoperfusion over activated charcoal and absorbent 
resins, but it may be more effective to process plasma than whole blood. This has led to the concept of 
combining plasmapheresis with continuous plasma treatment for removal of substances such as tumor 
necrosis factor (TNF), interleukin-6 (IL-6), and bile acids by a resin column, and then ultrafiltration or 
dialysis for fluid removal, since patients with liver failure often develop secondary renal failure. 

133.5 Global Replacement of Liver Function 

Because of the complexity and interplay of the various functions of the liver, more success can be expected 
from global approaches, which allow many or all hepatic functions to be resumed. These include the 
following. 

Cross-Criculation 

Cross-circulation of the patient in hepatic coma with a compatible, healthy donor is one approach. This 
procedure is more than a prolonged exchange transfusion since it allows the donor's liver to substitute 
for the patient's failing organ and to process chemicals from the patient's blood stream as long as the 
procedure lasts [Burnell, 1973]. It had been attempted in isolated cases, but reports of effectiveness are 
entirely anecdotal and the procedure has not been accepted clinically because of ambiguous results and 
the perceived risk for the donor. 

Hemoperfusion over Liver Tissue Slices 

The incorporation of active hepatocytes in a hemoperfusion circuit was suggested by the laboratory 
practice of biochemists who, since Warburg, have investigated metabolic pathways in tissue slices. For 
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liver replacement, this technology has been pioneered primarily in Japan as a substitute for organ 
transplantation, which is culturally frowned upon in that country in spite of a major incidence of severe 
liver disease. The procedure may improve biochemical markets of liver failure but has no demonstrated 
clinical value [Koshino et al., 1975]. 

Ex Vivo Perfusion 

Ex vivo perfusion uses an isolated animal liver (pig or baboon) connected to the patient's cardiovascular 
system [Saunders et al., 1968]. This is a cleaner and more acceptable form of treatment than cross- 
circulation of hemoperfusion over tissue pieces. Nevertheless, it is limited by the need for thorough 
washing of the animal's blood from the excised liver, the requirement for a virus-free donor organ source, 
the limited survival capacity of the excised, perfused organ, which must be replaced at intervals of 
approximately 24 hours, and the cost of the procedure. Success has recently been reported in isolated 
clinical trials [Chari, 1994]. 

Heterotopic Hepatocyte Transplantation 

This procedure may someday offer an alternative to whole organ transplantation, especially in cases of 
chronic liver failure, if a sufficient number of cells can be grafted. Freshly isolated hepatocytes have 
damaged cell surfaces and must be cultured to regain their integrity and display the surface receptors 
needed for attachment or binding of xenobiotics or endogenous toxic products [McMillan et al., 1988]. 
At the clinical level, this procedure could rely in part on banking frozen hepatocytes from livers which 
are not usable for whole organ transplants but constitute a reliable source of cells [Bumgardener et al., 
1988]. The procedure does not require removal of the recipients's liver and provides the followng 
advantages: (1) minimal surgery, (2) repeatability as needed, and (3) interim strategy to whole organ 
transplant. There is no agreement as to the best anatomic site for hepatocyte transplantation, the type 
of matrix needed for cell attachment and differentation, and the number of cells needed. It has been 
reported that hepatocyte culture supernatants were as effective as transplanted hepatocytes in treating 
rats with chemically induced liver failure [LaPlante-O'Neill et al., 1982]. The tridemensional reconstruc- 
tion of high-density, functional liver tissue will be the greatest challenge in view of the cell mass required. 
Structural organization and differentiated functions may be achieved by using an asialoglycoprotein 
model polymer as the synthetic substrate for a primary culture of hepatocytes to develop functional 
modules for implantation in humans [Akaike et al., 1989]. 

Whole Organ in Situ Transplantation 

This is currently the procedure of choice, most particularly in children. Although progress in clinical and 
surgical skills certainly accounts in large part for the growing success rate of this procedure over the past 
20 years, it is worth noting that the introduction of extracorporeal ciruclation techniques in the surgical 
protocol to support the donor organ while waiting for completion of the anastomosis has paralleled the 
steepest increase in success rate of liver transplantation in the past few years. Whereas most liver trans- 
plants rely on the availability of cadaver organs, the recent advent of segmented transplants allows 
consideration of living related donors and possibly the sharing of a donor organ among two or more 
recipients. 

133.6 Hybrid Replacement Procedures 

The complexity of hepatic functions, coupled with shortage of human donor organs, has encouraged the 
development of procedures and devices which rely on xenogeneic living elements attached to synthetic 
structures and separated from the most host by a permselective membrane to replace temporarily, and 
perhaps someday permanently, the failing organ. 

The incorporation of liver microsomes in microcapsules, hydrogels, or polymer sheets, with or without 
additonal enzymes or pharmacologic agents, goes one step beyond the immobilized enzyme reactor 
inasmuch as it calls on cell components endowed with a variety of enzymatic properties to process blood 
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or other body fluids. The feasibility of this technique has been demonstrated in vitro [Denti et al., 1976] 
but has not been investigated extensively in animal experiments. 

Cellular hybrid devices — the incorporation of functional cells in a device immersed in body fluids or 
connected to the vascular system (extracorporeally or somewhere inside the body of the recipient) — are 
a promising application of the concept of bioartificial organs. However, the problems faced by the 
"hepatocyte bioreactor" are formidable: 

1. The mass of functional cells required is much larger than in the case of secretory or endocrine 
organs, since a normal liver weighs about 1.5 kg and since as much as 10-30% of that mass (i.e., 
several billion cells) may be required for life-sustaining replacement of function. Taking into account 
the need for supporting structures, the sheer size of the device will be an obstacle to implantation. 

2. The liver features a double feeder circulation (portal and hepatic) and a complex secretory- 
excretory system which utilizes both blood and bile to dispose of its waste products. How to 
duplicate such a complex manifolding in an artificial organ and whether it is worth the resulting 
complexity are questions not yet resolved. 

3. With membrane separation of recipient and donor cells, the size of some natural macromolecules 
to be exchanged (e.g., low-density lipoproteins) precludes the use of standard diffusion membranes. 
Allowing relatively free solute exchange between the compartments in a device without endangering 
the immune sequestration of the donor tissue remains a challenge for membrane technology. 
However, the low immunogenicity of hepatocytes (which lack type I HLA antigens) allows the 
consideration of relatively open membranes for the construction of extracorporeal reactors. 

At the moment, most of the extracorporeal liver assist devices (ELAD) utilize xenogeneic mammalian 
hepatocytes seeded on solid, isotropic hollow fibers. Membrane selectivity limits the rate of diffusion of 
liposoluble toxins which are bound to plasma proteins for transport. Hence manipulation of membrane 
transport properties and concentrations of acceptor protiens can affect the clearance of polar materials. 

Also, most devices focus on replacing the detoxification function of the natural liver and avoid the 
more complex "cascade" dialysis circuitry which could either (1) allow the macro-molecules synthesized 
by hepatocytes to return to the blood stream (or another body fluid compartment) though a high cut- 
off point membrane or (2) provide an excretory path to clear toxic products manufactured by the 
hepatoctyes, on the model of the bile excretory system. Nonetheless, such circuitry might be valuable to 
prolong the life of the seeded cells, since the combination of bile salts and bile acids has a damaging 
detergent effect on the lipid components of the hepatocyte membrane. 

The development of bioreactors including cells capable of performing liver functions, and therefore 
capable of providing temporary liver support, finds applications in the treatment of acute, reversible liver 
failure or as a bridge for liver transplantation. Designs for a bioartificial liver can be classified according 
to (1) the type of cells selected to replace the hepatic functions or (2) the geometry and chemical nature 
of the polymer structure used organize the hepatocytes. 

Source of Functional Cells 

Two main methods of hepatocyte isolation (mechanical and enzymatic) can be used, separately or in 
combination. Mechanical methods (tissue dissociation) have largely failed in terms of long-term cell 
viability, although this is not always recognized by investigators developing liver assist systems. Collage- 
nase perfusion [Seglen, 1976] is today the method of choice, yet there is evidence that hepatocytes lose 
some of their oligosaccharide-lectin binding capacity in the process and do not recover their glycocalyx 
until after a day or two in culture. Collagenase is thought to loosen cell junctions and secondarily to 
digest the connective tissue around the hepatocytes. Chemical methods using citrate, EDTA, and similar 
substances weaken cell junctions by depleting calicum ions without altering the cell membranes and 
presumably result in better preservation of natural enzymatic functions. Although the yield of chemical 
methods is lower than that of enzymatic dissociation methods, it may be of interest once a reliable 
technology for separating viable from nonviable cells on a large scale is perfected. 
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( What type of cells should be cultured? J 

Adult mammalian hepatocytes 

Neonatal mammalian hepatocytes 

Fetal mammalian hepatocytes 

Transfected mammalian hepatocytes 
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Hepatoma (transformed) cells 

Proliferation Differentiation 

FIGURE 133.3 Cellular choices based on proliferation and differentiation. 

A priori, an effective bioartificial liver would require either all the multifunctional characteristics of 
normal hepatocytes in vivo or only the specific functional activity that happens to be missing in the 
patient. Unfortunately, in most cases, clinical signs do not allow us to distinguish between these two 
extremes, justifying a preference for highly diffrentiated cells. 

Differentiation and proliferation are usually at opposite ends of a biologic continuum in most cell 
types (Fig 133.3) [Jauregui, 1991]. Hence, there is a difficulty of obtaining large numbers of multifunc- 
tional hepatocytes. Several options are available: 

1. The simplest approach is to use adult mammalian liver cells isolated in sufficient number from 
large animals. Porcine hepatocytes are preferred for clinical applications because of the availability 
of virus-free donors. Large-scale isolation of porcine hepatocytes is becoming a routine procedure, 
and Demetriou has been able to treat several patients with such a system. The expectation is that 
hepatocytes in suspension or attached to synthetic microcarriers will remain metabolically active 
even though separated from neighboring cells and normal supportive structures. This approach, 
which appeared almost beyond practicality a few years ago, seems now less formidable because of 
expanded knowledge of the molecular factors which favor both hepatocyte attachment to poly- 
meric substrates and functional differentiation. The potential contribution of Kupffer cells to a 
bioartificial liver has not yet been extensively investigated. 

2. The use of liver tumor cells — preferably nonmalignant hepatoma — has been pioneered by Wolfe 
and Munkelt [1975] because such cells can proliferate indefinitely and therefore require a minimal 
seeding dose. They are also less anchor- dependent than normal cells. The drawbacks are the loss 
of specialized hepatic functions often encountered in tumor cell lines, and the theroretical risk of 
escape of tumor cells in the recipient. 

3. A modified, functionally differentiated, human hepatoblastoma cell line capale of growing to high 
densities, yet strongly contact-inhibited by containment with membranes, has been patented and 
used in animals and man by Sussman and colleagues [1992]. Evidence for metabolic effectiveness 
has been reported, and clinical trials are now in progress. The uncertainty associated with the use 
of a human tumor cell line remains a source of concern. 

4. Potentially replicating hepatocytes, such as those obtained from embryonic liver, neonatal animals, 
or recently hepatectomized adults, have been proposed as a means to obtain a stem cell population 
with enhanced proliferation capacity. Neonatal hepatocyte-based bioartificial devices have been 
built on that principle [Hager et al., 1983] and have shown to produce albumin, to metabolize 
urea, to deaminate cytidine, and to detoxify drugs such as a diazepam. The reliance on "juvenile" 
cells has become less important with the identification of molecular mechanisms for growth control 
mitogens such as EGF, TGFa, hepatopoieten B, HSS (hepatic stimulatory substance) and HGF 
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TABLE 133.3 Present Choice of the Cellular Component for Extracorporeal Liver Assist Devices 

Sussman & Kelly (1993) Neuzil et al. Jauregui & Muller (1992) 

Hepatoblastoma- Porcine hepatocytes separated via portal vein Porcine hepatocytes separated via hepatic 

derived cell line perfusion vein perfusion 

Cells divide indefinitely Hepatocyte division has not been tested (5 to 8) Limited number of hepatocyte doublings 

Cells are cultured in the Hepatocytes are seeded on microcarriers and Hepatocytes may be seeded or cultured in 

device introduced in the device immediately before the device through a proprietary 

clinical use technology 

(hepatoctye growth factor), co-mitogenic factors such as norepinephrine and growth inhibitors 
such as TGF(3, and interleukin-l(3. 
5. Transfected or transgeic hepatocytes may provide the ultimate solution to the cell supply problem. 
However, they are usually selected for the monoclonal expression of a single function and therefore 
may not be suitable except when a single cause of hepatic failure has been identified. Alternatively, 
a combination of different transfected hepatocytes may be considered. 

Table 133.3 illustrates the choices made by three different groups of investigators for clinical liver assist 
devices. 

Supporting Structures 

Microencapsulation in a nutrient liquid or a polymer gel surrounded by a conformal membrane provides 
a suspension of metabolically active units which can either be placed in a container for extracorporeal 
hemoperfusion, introduced into the peritoneal cavity for implantation, or even infused into the portal 
vein for settling in the patient's liver. The blood or tissue reaction to multiple implants and the long- 
term in vivo stability of hydrogels based on polyelectrolytes remain unresolved problems. 

A flat sheet of membrane or a spongy matrix coated with attachment factors can be used to anchor a 
suspension of functional cells. The limitation of this approach is the need for vascularization of the 
implant, since the cells are metabolically active and therefore are quite avid of oxygen and nutrients. 

Microcarrier-attached hepatocytes are attractive because the technology of suspension cultures is ame- 
nable to the proliferation of a large number of cells. The bulk of the carrier beads probably limits this 
approach to the extracorporeal circuits similar to those used for hemoperfusion [Rozga et al, 1993]. 

Hollow-fiber ELADs are constructed by filling the interstices within a bundle of parallel hollow fibers 
with liver cells, using the lumen of the tubes to provide metabolic support and an excretion channel, 
typically by circulating blood from an extracorporeal circuit. Alternatively, a hybrid organ can be built 
by filling the lumen of the hollow fibers with functional cells and implanting the bundle in the body 
cavity to allow exchanges with the surrounding fluid. One could combine such a device with an oxygen- 
ation system, e.g., by filling the peritoneal cavity with a high-oxygen-capacity fluid such as a fluorocarbon 
and a gas exchange system for that fluid analogous to the intravenous oxygenator (IVOX). 

133.7 Outlook 

Medical trials with extracorporeal hollow-fiber systems are already in progress, although there are some 
unanswered questions in the proper design of a hepatocyte-seeded ELAD. A review of Table 133.4 will 
indicate that there is no consensus for implementing cellular choices in the construction of such systems. 
Some researchers believe that the culture of hepatocytes on a synthetic matrix prior to chemical 
application is a complicated proposition that impairs the practical application of this technology. An 
alternative could be the isolation, freezing, and shipping of porcine hepatocytes to the medical centers 
treating patients with acute and chronic liver failure (usually transplantation centers). This approach 
may offer a direct and economically sound solution. Unfortunately, many FHF patients require emergency 
liver support and are managed in secondary care medical institutions which have neither a transplantation 
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TABLE 133.4 Consideration in the Choice of the Cellular Component of Extracorporeal Liver Assist Devices 
Advantages Disadvantages 

Hepatoblastoma cell lines are easy to culture and are free Hepatoblastoma cell lines are tumorigenic 

of other cell contaminants 

Tumor cell lines are not anchorage dependent Tumor cell lines may not respond to physiologic regulation 

Porcine or primary hepatocytes respond to physiologic Porcine hepatocytes have limited proliferation ability 

stimulation 

Porcine hepatocytes express P450 (detoxification activity) Porcine hepatocytes show limited life span 

program nor a tissue culture facility for the seeding of hollow-fiber devices. Part of the argument for the 
porcine hepatocyte isolation, shipment, seeding, and immediate use in a bioartificial liver system relies on 
the earlier concept that primary mammalian hepatocytes do not grow in vitro or are very difficult to 
maintain in hollow-fiber devices. In fact, rodent hepatocytes have shown excellent detoxification activities 
when cultured on perfused hollow fiber cultures. Rabbit hepatocytes also survive in hollow-fiber bioreac- 
tors which have proved successful in treating one of the most representative animal modes of human FHF. 

The technology for manufacturing ELADs may also need further attention. For instance, the efficiency 
of hollow fibers in maintaining hybridoma cultures and producing specific proteins is known to be 
inferior to that of cellular bioreactors based on microcarrier technology. The surface of hollow fibers has 
been optimized for blood compatibility but not for hepatocyte attachment, and therefore new materials 
or structures may have to be developed. 

At the conceptual level, primary hepatocytes may need to be cultured either in combination with other 
cells that will provide parabiotic support or on substrates that imitate the composition of the extracellular 
matrix found in situ in the space of Disse. Our own experience [Naik et al., 1990] and that of others 
[Singhvi et al., 1994] raise some questions about the role of substrates in maintaining long-term hepa- 
tocyte viability in vitro. In fact, most of them operate in a rather indiscriminate fashion by providing an 
anchor for hepatocytes. Collagen, types I, III, IV, and fibonectin contribute to cytoplasmic spreading 
which in the long term does not favor maintenance of the pheontypic expression of hepatocytes. Polymer 
compositions expressing surface sugar residues responsible for hepatocyte attachment through the asailo- 
glycoprotein receptor (a plasma membrane complex present mainly in the bile canalicular area which 
internalizes plasma asiaglycoprotein) appear able to maintain hepatocytes in culture with extended 
functional activities [Akaike et al., 1989]. Other investigators have shown the value of extracellular 
glycoproteins and glycosaminoglycans rich in laminin (e.g., Matrigel) [Caron & Bissell, 1989]. Hepato- 
cytes immobilized on these substraes do not spread but maintain their tridimensional morphology, as 
well as their functional activity. Such observations suggest that long-term expression of hepaotcyte- 
specific functions depend on maintaining the in situ cell shape and their spacial interrelations [Koide 
et al., 1990]. Future ELAD designs should provide not only ideal polymer substrates for cell attachment 
but also a special configuration that will maintain the tridimensional structure of hepatic tissue. 

Experience with artificial orans shows that the development of these devices tends to underestimate 
the effort and the technical advances needed to pass from a "proof of principle" prototype for animal 
evaluation to fabrictaion of a clinically acceptable system for human use. Full-scale design, cell procure- 
ment, cell survival, and device storage are all major bottlenecks on the way to a clinical product. One of 
the gray areas remains in the molecular weight cut-off of the hollow fiber wall. Which substances are 
responsible for the development of FHF and must therefore be cleared? Are they protein-bound, middle- 
molecular-, or low-molecular-size compounds? In the absence of clear answers to these questions, the 
designer of separation membranes is in an ambigous position; some investigators use low molecular cut- 
off membranes to guarantee immunoseparation of the xenograft, whereas others prefer high-molecular 
cut-off to enhance functionality. The use of microporous membranes in human subjects has not led, as 
of yet, to any hypersensitivity reactions in spite of potential passage of immunglobulins. 

A reliable way to restore consciousness in animal models of FHF is to cross-circulate the blood with a 
normal animal. To the extent that ELADs may function in patients as in these animal models, they may prove 
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successful only in relieving the symptoms of the disorder without increasing the survival rate. For instance, 
when 147 patients with advanced stages of HE were treated with charcoal hemoperfusion over a 10-year 
period, all showed symptomatic improvement, but the survival rate of 32% was the same as in five control 
groups. Therefore, enthusiasm generated by preliminary human trials with hollow-fiber devices should be 
tempered with a cautious approach. Without reliable control studies, we will remain in the position defined 
by Benhamou and coworkers [1972], "The best future one can wish for a sufferer from severe acute hepatic 
failure is to undergo a new treatment and have his case published — Be published or perish!" 

Defining Terms 

Bioartificial liver: A liver assist or liver replacement device incorporating living cells in physical or 

chemical processes normally performed by liver tissue. 
Catabolism: The aspect of metabolism in which substances in living tissues are transformed into waste 

products or solutes of simpler chemical composition. (The opposite process is called anabolism.) 
Cirrhosis: A degenerative process in the liver marked by excess formation of connective tissue destruc- 
tion of functional cells, and, often, contraction of the organ. 
Conjugated: The joining of two compounds to produce another compound, such as the combination 

of a toxic product with some substance in the body to form a detoxified product, which is then 

eliminated. 
ELAD: Extracorporeal liver assist device. 
Homeostasis: A tendency of stability in the normal body states (internal environment) of the organism. 

This is achieved by a system of control mechanisms activated by negative feedback, e.g., a high 

level of carbon dioxide in extracellular fluid triggers increased pulmonary ventilation, which in 

turn causes a decrease in carbon dioxide concentration. 
Mesenchymal cells: The meshwork of embryonic connective tissue in the mesoderm from which are 

formed the connective tissues of the body and the blood vessels and lymphatic vessels. 
Metabolism: The sum of all the physical and chemical processes by which living organized substance 

is produced and maintained (anabolism) and the transformation by which energy is made available 

for the uses of the organism (catabolism). 
Parenchymal: The essential elements of an organ. 
Phagoctyic: Pertaining to or produced by any cells that ingest microrganisms of other cells and foreign 

particles. 
Portal triad: These are microscopic areas of collagen type Till fibroblasts as well as other connective 

tissue elements. These triads have a branch of the portal vein, a branch of the hepatic artery, and 

intermediate caliber bile ducts. 
Xenobiotic: A chemical foreign to the biologic system. 
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Further Information 

Two books of particular interest in hepatic encephalopathy are Hepatic Encephalopathy: Pathophysiology 
and Treatment edited by Roger F. Butterworth and Gilles Pomier Layrargues, published by The Humana 
Press (1989), and Hepatology: A Textbook of Liver Disease edited by David Zakim and Thomas D. Boyer, 
published by W. B. Saunders Company (1990). 

Articles pertaining to extracorporeal liver assist devices appear periodically in the following journals: 
ASAIO Journal, the official journal for the American Society for Artificial Internal Organs (for subscrip- 
tion information, write to J.B. Lippincott, P.O. Box 1600, Hagerstown, MD 21741-9932); Artificial Organs, 
official journal of the International Society for Artificial Organs (for subscription information, write to 
Blackwell Scientific Publications, Inc., 328 Main Street, Cambridge, MA 02142); and Cell Transplantation, 
official journal of the Cell Transplantation Society (for subscription information, write to Elsevier Science, 
P.O. Box 64245, Baltimore, MD, 21264-4245). 
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134.1 Structure and Function of the Pancreas 

The pancreas is a slender, soft, lobulated gland (ca. 75 g in the adult human), located transversally in the 
upper abdomen in the space framed by the three portions of the duodenum and the spleen. Most of the 
pancreas is an exocrine gland which secretes proteolytic and lipolytic enzymes, conveying more than 1 L 
per day of digestive juices to the gastrointestinal tract. This liquid originates in cell clusters called acini 
and is collected by a system of microscopic ducts, which coalesce in a channel (the canal of Wirsung) 
which courses horizontally through the length of the organ and opens into the duodenum next to, or 
together with, the main hepatic duct (choledocous). Interspersed throughout the exocrine tissue are 
about 1 million separate, highly vascularized and individually innervated cell clusters known as islets of 
Langerhans, which together constitute the endocrine pancreas (1-2% of the total pancreatic mass). Blood 
is supplied to the islets by the pancreatic artery and drained into the portal vein. Therefore the entire 
output of pancreatic hormones is first delivered to the liver. 

Human islets average 150 ilm in diameter, each cluster including several endocrine cell types. Alpha 
cells (about 1 5% of the islet mass) are typically located at the periphery of the cluster. They are the source 
of glucagon, a fuel-mobilizing (catabolic) hormone which induces the liver to release into the circulation 
energy-rich solutes such as glucose and aceto-acetic and (3-hydroxybutyric acids. Beta cells, which occupy 
the central portion of the islet, comprise about 80% of its mass. They secrete insulin, a fuel-storing 
(anabolic) hormone that promotes sequestration of carbohydrate, protein, and fat in storage depots in 
the liver, muscle, and adipose tissue. Delta cells, interposed between alpha and beta cells, produce 
somatostatin, one role of which seems to be to slow down the secretion of insulin and digestive juices, 
thereby prolonging the absorption of food. The PP cells secrete a "pancreatic polypeptide" of yet unknown 
significance. Other cells have the potential to produce gastrin, a hormone that stimulates the production 
of gastric juice. 
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134.2 Endocrine Pancreas and Insulin Secretion 

The term artificial pancreas is used exclusively for systems aimed at replacing the endocrine function of 
that organ. Although the total loss of exocrine function (following removal of the gland for polycystic 
disease, tumor, or trauma) can be quite debilitating, no device has yet been designed to replace the 
digestive component of the pancreas. Since insulin deficiency is the life-threatening consequence of the 
loss of endocrine function, the artificial pancreas focuses almost exclusively on insulin supply systems, 
even though some of the approaches used, such as islet transplantation, of necessity include an undefined 
element of delivery of other hormones. 

Insulin is the most critical hormone produced by the pancreas because, in contradistinction to glu- 
cagon, it acts alone in producing its effects, and survival is not possible in its absence. Endogenous insulin 
secretion is greatest immediately after eating and is lowest in the interval between meals. Coordination 
of insulin secretion with fluctuating demands for energy production results from beta-cell stimulation 
by metabolites, other hormones, and neural signals. The beta cells monitor circulating solutes (in humans, 
primarily blood glucose) and release insulin in proportion to needs, with the result that in normal 
individuals, blood glucose levels fluctuate within quite narrow limits. The response time of pancreatic 
islets to an increase in blood glucose is remarkably fast: 10 to 20 sec for the initial burst of insulin release 
(primarily from intracellular stores), which is then followed, in a diphasic manner, by a gradual increase 
in secretion of newly synthesized hormone up to the level appropriate to the intensity and duration of 
the stimulus. Under fasting condition the pancreas secretes about 40(lg (1 unit) of insulin per hour to 
achieve a concentration of insulin of 2-4 (J.g/ml (50 to lOOpU/ml) in portal blood and of 0.5 (J.g/ml 
(12pU/ml) in the peripheral circulation. 

134.3 Diabetes 

Insulin deficiency leads to a disease called diabetes mellitus, which is the most common endocrine disease 
in advanced societies, affecting as many as 3-5% of the population in the United States, with an even 
higher incidence in specific ethnic groups. 

Diabetes is a chronic systemic disease resulting from a disruption of fuel metabolism either because 
the body does not produce enough insulin or because the available insulin is not effective. In either case, 
the result is an accumulation of glucose in the blood or hyperglycemia, and, once the renal tubular 
threshold for glucose is exceeded, a spillover of glucose into the urine or "glycosuria." Hyperglycemia is 
thought to be the main determinant of microvascular alterations which affect several organs (renal 
glomerulus, retina, myocardium). It is considered as an important factor in large-blood-vessel pathology 
(aorta and peripheral arteries, such as carotid and lower limb vessels) often observed in diabetes. Neu- 
ropathies of the autonomic, peripheral, and central nervous systems are also common in diabetics, 
although the pathogenic mechanism is not known. 

When blood glucose levels are abnormally high , an increasing fraction of hemoglobin in the red blood 
cells tends to conjugate with glucose forming a compound identifiable by chromatography and called HbA Ic . 
This reaction serves as a tool in diabetes management and control. The fraction of hemoglobin that is 
normally glycosylated (relative to total Hb) is between 6% and 9%. If elevated, HbA IC reflects the time- 
averaged level of hyperglycemia over the lifetime of the red blood cells (3-4 months). It can be interpreted 
as an index of the severity of diabetes or the quality of control by dietary measures and insulin therapy. 

There are two main forms of diabetes: Type I, juvenile onset diabetes (typically diagnosed before 
age 30), or insulin-dependent diabetes; and type II, maturity onset diabetes (typically observed after 
age 40), or non-insulin-dependent diabetes. 

Type I appears abruptly (in a matter of days or weeks) in children and young people, although there 
is evidence that the islet destruction pattern may start much earlier but remain undiagnosed until close 
to 90% of the islets have been rendered ineffective. Type I represents only 10% of all cases of diabetes, 
yet it still affects close to 1 million patients in the United States alone. Endogenous insulin is almost 
totally absent, and an exogenous supply is therefore required immediately to avoid life-threatening metabolic 
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TABLE 134.1 Pathogenesis of Diabetes 



Type I Diabetes 

Autoimmune/genetic/ 

environmental 

factors 



Destruction of beta cells 

I 

Absolute insulin deficiency 

I 

Inability to maintain 

normoglycemia 

I 
Secondary lesions 



Type II Diabetes 

Unknown 

pathologic/genetic 

factors 



Insulin resistance 



Inability to maintain 
normoglycemia 

I 
Secondary lesions 



accidents (ketoacidosis) as well as the more insidious 
degenerative processes which affect the cardiovascular 
system, the kidneys, the peripheral nervous system 
and the retina. For some unknown reason, 15% of 
juvenile diabetics never develop complications. 
Because the cellular and molecular mechanisms of 
hyperglycemic damage appear experi-mentally to be 
reversible by tight control of glucose levels, and the 
progress of vascular complications is said to be pro- 
portional to the square of the blood glucose concen- 
tration, early and rigorous control of insulin 
administration is believed to be essential to minimize 
the long term consequences of the disease. 

In contrast, type II diabetes, which affects close to 
90% of all patients, develops slowly and can often be 
controlled by diet alone or by a combination of weight 
loss and/or oral hypoglycemic drugs. Endogenous 

insulin is often present in normal and sometimes exaggerated amounts, reflecting an inability of cells to 
make use of available insulin rather than a true hormonal deficiency. Only 20-30% of type II diabetics 
benefit from insulin therapy. Keto-acidosis is rare, and the major problems are those associated with 
vascular wall lesions and arterial obstruction. Adult onset diabetics develop the same vascular, renal, 
ocular, and neural complications as type I diabetics, suggesting that even if the origin of the disease is 
not identical, its later evolution is fundamentally the same as in juvenile diabetics (Table 134.1). 

Diabetes is the leading cause of blindness in the United States. It is also responsible for the largest 
fraction of the population of patients with end-stage renal disease who are being keep alive by maintenance 
dialysis. It is one of the most frequent causes of myocardial infraction and stroke, and the most common 
factor in arterial occlusion and gangrene of the lower extremities leading to limb amputation. With the 
diabetic population increasing by about 5% per year, diabetes has become one of the major contributors 
to health expenditures in advanced societies. 

It is currently thought that destruction of the pancreatic islets in juvenile diabetes is the result of an 
autoimmune process, occurring in genetically predisposed individuals, perhaps in relation to an inter- 
current infectious disease. This has led to clinical attempts to salvage the remaining intact islets at the 
earliest recognizable stage of the disease using standard immunosuppressive drugs such as cyclosporin A. 
The effectiveness of this approach has not been demonstrated. 

Diabetologists have also recognized that, at the onset of juvenile diabetes, hyperglycemia exerts a 
deleterious effect on the function of the still surviving islets, and that in the short term, intensive 
exogenous insulin therapy may actually lead to an apparent cure of the disease. However, this phenom- 
enon, occasionally referred to as the honeymoon period of insulin treatment is transitory. After a period 
of weeks to months, diabetes reappears, and an exogenous source of insulin administration becomes 
necessary for survival. The need for insulin to protect borderline functioning islets against hyperglycemic 
damage has also been recognized in the early stage of islet transplantation. 



134.4 Insulin 



Insulin is the mainstay for the treatment of diabetes type I and, to a lesser extent, type II. Since it is the 
key substance produced by beta cells, it is important to define its chemical nature and its mode of action. 
Insulin is a 51 amino acid peptide with a molecular weight of about 5800 daltons, made up of two 
chains (A and B) connected by disulfide bridges. Insulin is formed in the beta cell as a cleavage product 
of a larger peptide, proinsulin, and is stored there as crystals in tiny intracellular vesicles until released 
into the blood. The other cleavage product, the "connecting" or C-peptide, though biologically inactive, 



•) 2000 by CRC Press LLC 



passes into the blood stream together with insulin, and therefore its concentration reflects, in a patient 
receiving exogenous insulin, the amount of insulin secreted by the pancreas itself, if any. 

Active extracts of islet tissue, obtained from animals following ligation of the pancreation duct to avoid 
autolysis, were first prepared by Banting and Best in 1922. Insulin was crystallized by Abel in 1926. In 
1960, Sanger established the sequence of amino acids that make up the molecule shortly thereafter the 
hormone was synthesized. 

The extract composition of insulin varies slightly among animal species. The presence of disulfide bonds 
between the A and the B chain is critical. Loss of the C-terminal alanine of the B chain by carboxypeptidase 
hydrolysis results in no loss of biologic activity. The octapeptide residue that remains after splitting off 
the last eight amino acids of the B chain has a biologic activity amounting to about 15% of the original 
insulin molecule. About one-half of the insulin disappears in its passage through the liver. Only a small 
fraction normally goes to peripheral tissues. Plasma half-life is on the order of 40 min, most of the 
degradation occurring in the liver and kidneys. No biologically active insulin is eliminated in the urine. 

Since insulin is a polypeptide, it cannot be taken orally because it would be digested and inactivated 
in the gastrointestinal tract. In an emergency, insulin can be administered intravenously, but standard 
treatment relies on subcutaneous administration. This port of entry differs from physiologic secretion 
of insulin in two major ways: The adsorption is slow and does not mimic the rapid rise and decline of 
secretion in response to food ingestion, and the insulin diffuses into the systemic veins rather than into 
the portal circulation. 

134.5 Insulin Therapy 

Preparations of insulin are classified according to their species of origin and their duration of action. 
Human insulin (synthesized by recombinant DNA techniques) and porcine insulin (which is obtained 
by chemical extraction from slaughterhouse-retrieved organs and differs from human insulin by only 
one amino-acid at the carboxyl-terminus of the B chain) are in principle preferable to bovine insulin 
(which differs from the human form by three amino acid residues). In practice, all three are equipotent, 
and all three can, in a minority of patients, stimulate an immune response and cause hypersensitivity 
reactions. Because of insulin's relatively short duration of action, formulations delaying the absorption 
of subcutaneously injected hormone and hence prolonging its effectiveness have greatly facilitated the 
treatment of diabetes. The mode of administration of insulin influences its plasma concentration and 
bioavailability. The usual treatment schemes rely on the use of one or more types of insulin (Table 134.2). 

Pharmacologists classify the available insulin formulations according to their latency and duration of 
action as fast acting, intermediate-acting, and slow acting using a terminology of regular, semilente, lente, 
and ultralente (Table 134.3). Crystalline insulin is prepared by precipitating the polypeptide in the pres- 
ence of zinc in a suitable buffer solution. Insulin complexed with a strongly basic protein, protamine, 
and stabilized with zinc is relatively insoluble at physiological pH and is therefore released only slowly 
from the site of injection. However, one must keep in mind that onset and duration of action may be 
quite variable from one patient to another. 

Whereas insulin was hailed as a life-saving drug in the years following its discovery (which it was, as 
evidenced by the much reduced incidence of hyperglycemic coma and the much longer life expectancy 
of juvenile diabetics, as compared to the period before 1930), it has not turned out to be quite the 
universal panacea it was expected to be. Some of the problems that have surfaced relate to the limitations 
associated with the mode of administration of the drug. Other problems are thought to derive from our 
inability to mimic the finely tuned feedback system which normally maintains the blood glucose levels 
within very narrow limits (Table 134.4). 

TABLE 134.2 Types of Insulin 



Fast acting Multiple injections (basal and preprandial) 

Intermediate Two injections (morning and evening) 

Slow acting One injection (to prevent hyperglycemia at night) 
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TABLE 134.3 Properties of Insulin Preparations 





Appearance 


Added 
Protein 


Zinc Content 
mg/100 U 


Time of Action (h) 


Type 


Onset 


Peak 


Duration 


Rapid 














Crystalline (regular) 


Clear 


None 


0.01-0.04 


0.3-0.7 


2-4 


5-8 


Insulin-zinc suspension (semilente) 


Cloudy 


None 


0.2-0.25 


0.5-1.0 


2-8 


12-16 


Intermediate 














NPH (Isophane insulin suspension) 


Cloudy 


Protamine 


0.016-0.04 


1-2 


6-12 


18-24 


Lente (insulin zinc suspension) 


Cloudy 


None 


0.2-0.25 


1-2 


6-12 


18-24 


Slow 














Ultalente (extended insulin zinc suspension) 


Cloudy 


None 


0.2-0.25 


4-6 


16-18 


20-36 


PZI (Protamine zinc insulin suspension) 


Cloudy 


Protamine 


0.2-0.25 


4-6 


14-20 


24-36 



Source. Modified from Table 71-1 in The Pharmacological Basis of Therapeutics, 3rd ed., Goodman and Gillman 



TABLE 134.4 Problems with Insulin Treatment 

Problem 

Poor compliance with multiple injections 



Inadequate pharmacokinetics of insulin preparations 

Hyperinsulinemia 
Lack of feedback control 



Answers 

Better syringes and needles 

Subcutaneous ports 

Intravenous ports 

Insulin pens 

Enhance immediate effect 

Prolong duration of effect 

Release insulin in portal circulation 

Servo-controlled administration 

Natural control by transplant 

Bioartificial pancreas 



134.6 Therapeutic Options in Diabetes 



Experimental studies of blood glucose regulation and empirical observations in diabetic patients have 
revealed three major characteristics of blood glucose regulation by the pancreas: 

1. The natural system operates as a closed-loop regulatory mechanism within narrow limits. 

2. Portal administration is more effective than systemic administration because insulin reaches the 
liver first. 

3. Pulsatile administration of insulin is more effective than continuous administration. 

There are several stages in the evolution of diabetes where a medical intervention might be helpful 
(Table 134.5). Many interventions have been attempted with various degrees of success. Since diabetes 

TABLE 134.5 Potential Approaches to Diabetes Treatment 

Prevent destruction of beta cells 
Prevent beta cell exhaustion 
Increase insulin output 
Amplify glucose signal 
Overcome insulin resistance 
Replace beta cells by: 

Insulin administration 

Electromechanical delivery system 

Whole organ transplantation 

Islet transplantation 

Biohybrid device 
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TABLE 134.6 Biologic and Engineered Insulin Delivery Systems 



Insulin Administration Systems 
Standard Insulin 

Insulin release systems 



Insulin delivery pumps 
Open loop 



Closed loop 

Insulin Synthesis Systems 
Pancreas transplantation 

Islet transplantation 

Encapsulated islets or beta cell transplants 
Genetically engineered cell transplants 



Routes of administration (subcutaneous, intravenous, intraperitoneal, nasal 

spray) 
Injection systems (syringes, pens) 
Passive release from depot forms 
Bioresponsive insulin depot 
Implanted, permeable reservoirs 
Programmed release systems 

Osmotic pumps 

Piston and syringe pumps 

Peristaltic (roller) pumps 

Bellow frame pumps 

Pressurized reservoir pumps 

Glucose sensors 

Electromechanical, servo-controlled delivery systems 



Simultaneous with kidney transplant 
After kidney transplant 
Before kidney transplant 
Autologous tissue 
Allogeneic tissue 
Xenogeneic tissue 
Microencapsulated cells 
Macroencapsulated cells 
Unprotected gene therapy 
Protected gene therapy 



expresses a disturbance of biologic feedback mechanism between blood glucose levels and insulin secre- 
tion, there might be possibilities to influence the biologic regulation process through its natural sensing 
or amplifying mechanisms. However, no practical solution has yet emerged from this approach. As a 
result the accent has been placed primarily on pharmacologic forms of treatment (diverse insulin for- 
mulations and routes of administration), engineered delivery systems (extracorporeal or implanted 
pumps), and substitution by natural insulin production sources (organ, islet, or individual beta cell 
transplantation, with or without genetic manipulation and with or without membrane immunoprotec- 
tion). An overview of the recently investigated approaches is given in Table 134.6. 

134.7 Insulin Administration Systems 
Syringes and Pens 

Insulin traditionally has been administered subcutaneously by means of a syringe and needle, from a 
vial containing insulin at a concentration of 40 units per ml. 

When insulin was first proposed to treat diabetic patients in 1922, glass syringes were used. The burden 
linked to repeated sterilization disappeared when disposable syringes became available in the early 1970s. 
Yet patients still had to perform the boring and, for some, difficult task of refilling the syringe from a 
vial. From a pragmatic viewpoint the development of pens in which insulin is stored in a cartridge 
represented a major advance in diabetes management. In France, for instance, a recent survey indicates 
that more than half of the patients use a pen for daily therapy. A needle is screwed on the cartridge and 
should ideally be replaced before each injection. Cartridges containing either 1.5 or 3 ml of lOOU/ml 
regular or intermediate insulin are available. Unfortunately, long-acting insulin cannot be used in pens 
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because it crystallizes at high concentrations. Attempts are being made toward developing soluble, slow- 
actin analogs of insulin to overcome this problem. 

A common insulin regimen consists of injecting regular insulin before each meal with a pen and long- 
acting insulin at bedtime with a syringe. Typically half of the daily dosage is provided in the form of 
regular insulin. Since patients often need 50 units of insulin per day, a pen cartridge of 150 units of 
regular insulin must be replaced every 6 days or so. The setting of the appropriate dose is easy. However, 
most pens cannot deliver more than 36 units of insulin at a time, which in some patients may place a 
limit on their use. In those countries where insulin comes in vials of 40 U/ml for use with syringes, 
patients must be aware that insulin from cartridges is 2.5 times more concentrated than standard vial 
insulin. The major advantage of insulin pens is that they do not require refilling and therefore can be 
used discretely. Pens are particularly well accepted by teenagers. 

Reservoirs, Depots, and Sprays 

Attempts have been made to develop distensible, implantable reservoirs or bags made of silicone elas- 
tomers, fitted with a small delivery catheter, and refillable transcutaneously. In the preferred embodiment, 
insulin drains at a constant rate into the peritoneal cavity. Assuming uptake by the capillary beds in the 
serosal membranes which line the gastrointestinal tract, insulin reaches first the portal valve and hence 
the liver which enhances its effectiveness. This is an attractive concept but not without serious handicaps. 
The implanted reservoir may elicit an untoward tissue reaction or become a site of infection. The delivery 
catheter may be plugged by insulin crystals if a high insulin concentration is used, or the catheter may 
be obstructed by biologic deposits at the tip. The system only provides a baseline insulin delivery and 
needs to be supplemented by preprandial injections. Its overall capacity constitutes a potential risk should 
the reservoir accidentally rupture and flood the organism with an overdose of insulin. 

Insulin depots, in the form of bioerodible polymer structures in which insulin — amorphous or in 
crystalline form — is entrapped and slowly released as hydrolytic decay of the carrier of the polymer 
progressively liberates it, present some of the same problems as reservoirs. In addition, the initial burst 
which often precedes zero-order release can prove unpredictable and dangerous. No reliable long-term 
delivery system has yet been developed on that principle. 

New routes of introduction of insulin are being investigated, in particular, transmucosal adsorption. 
Nasal sprays of specially formulated insulin may become a practical modality of insulin therapy if means 
are found to control reliably the dose administered. 

Insulin Pumps 

Externally carried portable pumps, based on the motorized syringe or miniature roller pump designs, were 
evaluated in the 1970s with the anticipation that they could be both preprogrammed for baseline delivery 
and overridden for bolus injections at the time of meals without the need for repeated needle punctures. 
In actual practice, this system did not find wide acceptance on the part of patients, because it was just 
too cumbersome and socially unacceptable. 

The first implantable insulin pump was evaluated clinically in 1980. Unlike heparin delivery or cancer 
chemotherapy, insulin administration requires programmable pumps with adjustable flow rates. Implant- 
able pumps provide better comfort for the patient than portable pumps, since the former are relatively 
unobtrusive and involve no danger of infection at the skin catheter junction. They also permit intrap- 
eritoneal insulin delivery, which is more efficient than subcutaneous administration. However, these 
pumps operate without feedback control, since implantable glucose sensors are not yet available, and the 
patient must program the pump according to his/her needs. 

The vapor pressure driven Infusaid pump relies on a remarkably astute mechanism for an implantable 
device. A rigid box is separated in two compartments by a metal bellow with a flat diaphragm and an 
accordion-pleated seal (bellow frame). One compartment is accessible from the outside through a sub- 
cutaneously buried filling port and contains a concentrated insulin solution. The other compartment 
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(the bellows itself) is filled with a liquid freon which vaporizes at 37°C with a constant pressure of 0.6 bar. 
With no other source of energy but body heat (which is continuously replenished by metabolism and 
blood circulation), the freon slowly evaporates, and the pressure developed displaces the diaphragm and 
forces insulin at a constant flow rate through a narrow delivery catheter. The freon energy is restored 
each time the insulin reservoir is refilled, since the pressure needed to move the bellow liquefies the freon 
vapor within a smaller compartment. 

The housing is a disc-shaped titanium box. The insulin reservoir contains 15 to 25 ml of insulin 
stabilized with polygenol at a concentration of either 100 or 400 U/ml, providing an autonomy of 1 to 
3 months. The self-sealing septum on the filling port can be punctured through the skin up to 500 times. 
The pump is implanted under general or local anesthesia between skin and muscle in the lateral abdominal 
area. The tip of the catheter can be located subcutaneously or intraperitoneally. 

The original device provided a constant flow of insulin which still needed to be supplemented by the 
patient at meal time. The most recent model (Infusaid 1000) weighs 272 g, contains 22 ml of insulin, and 
has a diameter of 9 cm and a height of 2.2 cm. It is designed for 100 U/ml insulin and is equipped with a 
side port which allows flushing of the catheter when needed. To dissipate the pressure generated by freon 
evaporation before it reaches the catheter, the insulin must cross a 0.22p.m bacteriologic filter and pass through 
a steel capillary 3 cm long and 50(Im in diameter. Basal flow rate can be adjusted from 0.001-0.5 ml/h, and 
a bolus can be superposed by releasing a precise amount of insulin stored in a pressurized accumulator 
through the opening of a valve leading to the catheter. These control features require an additional source 
of energy in the form of lithium thionile batteries with a service life of about 3 years. Control is achieved 
with a handheld electronic module or programs connected to the pump by telemetry. 

The current Minimed pump (INIP 2001) has a dry weight of 162 g and a diameter of 8.1 cm and 
contains 14 ml of insulin at a concentration of 400 U/ml, allowing up to 3 months' autonomy between 
refills. This pump relies on a reservoir from which insulin can be delivered in a pulsatile form by a piston 
mechanism under the control of solenoid- driven valves. The basal rate can be adjusted 0.13-30 U/h 
(0.0003 to 0.07 ml/h), and boluses from 1 to 32 U over intervals 1-60 min can be programmed. The 
reservoir is permanently under negative pressure to facilitate filling. 

The Siemens Infusor (1D3 model) replaced an earlier type based on a peristaltic miniature roller pump 
(discontinued in 1987). It is somewhat similar in design and dimensions to the minimed pumps, but it 
has now been withdrawn from the market. 

The Medtronic Synchromed pump, which was originally designed for drug therapy and relied on a 
roller pump controlled by an external programmer, was evaluated for insulin therapy but is seldom used 
for that purpose because it lacks the required flexibility. 

Between 1989 and 1992, 292 insulin pumps were implanted in 259 patients in France, where most of 
the clinical experience has been collected (205 Minimed, 47 Infusaid, and 7 Siemens). The treatment had 
to be permanently discontinued in 14 patients (10 because of poor tissue tolerance, 3 due to catheter 
obstruction, and 1 due to pump failure). The pump was replaced in 33 patients, with 28 cases of 
component failure; battery, microelectronic control, flow rate decline due to insulin precipitation in 
Infusaid pumps, insulin reflux in Minimed pumps, and perceived risk of leak from septum. The overall 
frequency of technical problems was 0.10 per patient-year. There were 46 surgical interventions without 
pump replacement because of catheter obstructions, and 24 related to poor tissue tolerance at the site 
of implantation. Glucose regulation was satisfactory in all cases, and the mean blood glucose concentra- 
tion dropped appreciably after 6 and 18 months of treatment. A major gain with the intraperitoneal 
delivery was the reduction in the incidence of severe hypoglycemia associated with improved metabolic 
control. For the sake of comparison, other methods used for intensive insulin therapy showed a threefold 
increase in the frequency of serious hypoglycemic episodes. 

Servo-Controlled Insulin Delivery Systems 

Sensor-actuator couples which allow insulin delivery in the amount needed to maintain a nearly constant 
blood glucose level are sometimes designated as the artificial beta cell because of the analogy between 
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TABLE 134.7 Component Mechanisms of Insulin Delivery Systems 

Natural (3 cell Artificial P cell 

Sensor of glucose level Intracellular glycolysis Polarographic or optoelectronic sensor 

Energy source Mitochondria Implantable battery 

Logic Nucleus Minicomputer chip 

Insulin reserve Insulin granules Insulin solution reservoir 

Delivery system Cell membrane Insulin pump 

Set point Normally 80-120 mg% Adaptable 

Fail-safe Glucagon and hunger Glucose infusion 

the components of the electromechanical system and the biologic mechanisms involved in sensing, 
controlling, and delivering insulin from the pancreas (Table 134.7). 

From a technology viewpoint, the key aspects are the glucose sensor, the control systems applicable 
to a biologic situation where both hyperglycemia and hypoglycemia must be prevented, and the practi- 
cality of the overall system. 

Glucose Sensors 

A glucose sensor provides a continuous reading of glucose concentration. The device consists of a 
detection part, which determines for the specificity of the glucose measurement, and a transducing 
element, which transforms the chemical or physical signals associated with glucose recognition into an 
electric signal. The most advanced technology is based on enzymatic and amperometric detection of 
glucose. Glucose is recognized by a specific enzyme, usually glucose oxidase, layered on the anode of an 
electrode, generating hydrogen peroxide which is then oxidized and detected by the current generated 
in the presence of a fixed potential between the anode and the cathode. Alternatively, a chemical mediator 
may serve as a shuttle between the enzyme and the electrode to avoid the need for a potential difference 
at which other substances, such as ascorbate or acetaminophen, may be oxidized and generate an 
interfering current. Other approaches such as the direct electrocatalytic oxidation of glucose on the 
surface of the electrode or the detection of glucose by a combination with a lectin have not reached the 
stage of clinical evaluation. 

Ideally a blood glucose sensor would be permanently installed in the bloodstream. Because of the 
difficulty of building a hemocompatible device and the inconvenience of a permanent blood access, 
several investigators have turned to the subcutaneous site for glucose sensing. Indeed, glucose concen- 
tration in the interstitial fluid is very close to that of blood and reflects quite directly changes associated 
with meals or physical activity as observed in diabetic patients. Two types of subcutaneous glucose sensors 
have been developed. Those with electrodes at the tip of a needle may be shielded from their environment 
by the inflammatory reaction in tissues contacting the electrode membrane and occasionally cause pain 
or discomfort. Nonetheless, reliable measurements have been obtained for up to 10 days, whereupon the 
sensor needed to be changed. Sensing can also be based on microdialysis through an implanted, permeable 
hollow fiber, providing fluid which is then circulated to a glucose electrode. This system has not yet been 
miniaturized and industrialized. 

In either case, the concept is to develop an external monitor such as a wrist watch, which could receive 
its analyte from the implanted catheter and/or sensor and trigger a warning when glucose concentration 
is abnormally high or low. Such a monitor could eventually be incorporated in a closed-loop insulin 
delivery system, but this objective is not yet realizable. 

Noninvasive technology has been suggested to monitor blood glucose, e.g., by near-infrared spectro- 
scopy or optical rotation applied to transparent fluid media of the eye. No reliable technology has yet 
emerged. 

The Artificial Beta Cell 

The standard equipment for feedback-controlled insulin administration is the artificial beta cell, also 
referred to in the literature as the extracorporeal artificial pancreas. It consists of a system for continuous 
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blood sampling, a blood glucose sensor, and minicomputer which, through preestablished algorithms, 
drives an insulin infusion pump according to the needs of the patient. Accessories provide a minute-by- 
minute recording of blood glucose levels, insulin delivery, and glucose delivery. 

In the first clinically oriented artificial beta cell (Miles Laboratories' Bioslator), the glucose sensor was 
located in a flow-through chamber where blood from a patient's vein was sucked continuously, at the 
rate of 2 ml/h, through a double-lumen catheter. The double lumen allowed the infusion of a minute 
dose of heparin at the tip of the catheter to prevent thrombosis in the glucose analyzer circuit. Glucose 
diffused through a semipermeable membrane covering the electrode and was oxidized by the enzyme. 
The hydrogen peroxide generated by this reaction crossed a second membrane, the role of which was to 
screen off substances such as urate and ascorbate. The resulting electric current was fed to a computer 
which controlled the flow rate of the insulin pump when the readings were in the hyperglycemic range 
and administered a glucose solution in case of hypoglycemia. 

Three forms of oversight of blood glucose levels and insulin needs can be achieved with the artificial 
beta cell: 

Blood glucose monitoring is the technique employed to investigate the time course of blood glucose 
levels in a particular physiologic situation without any feedback control from the artificial beta 
cell, that is, without insulin administration to correct the fluctuations of blood glucose levels. 

Blood glucose control is the standard feedback mode of application of the artificial beta cell, which 
entails an arbitrary choice of the level of blood glucose desired and provides a recording of the 
rate of administration and the cumulative dose of intravenous insulin needed to achieve a constant 
blood glucose level. 

Blood glucose clamp involves intravenous insulin administration at a constant rate to obtain a stable, 
high blood insulin level. The desired value of blood glucose level is arbitrarily selected (typically 
in the normal range), and the feedback-control capability of the artificial beta cell is used to 
measure the amount of exogenous glucose needed to "clamp" blood glucose at the desired level 
in the presence of a slight excess of insulin. With this technique, a decrease in the rate of glucose 
administration signals a decrease in insulin biologic activity. 

The artificial beta cell is primarily a clinical research tool, applicable also for therapy in high-risk 
conditions such as pregnancy in poor controlled diabetic mothers or cardiac surgery in brittle diabetics 
[Galletti et al., 1985] , where hypothermia and a massive outpouring of adrenergic agents alter the response 
to insulin [Kuntschen et al., 1986]. The primary drawbacks of current servo-controlled insulin delivery 
systems are the instability of the glucose sensor (in terms of both risk of thrombosis and the need for 
intermittent recalibration), the complexity of the system (up to four pumps can be needed for continuous 
blood sampling, heparinization, insulin administration, and glucose administration), and the risks 
involved. The dose of intravenous insulin required for rapid correction of hyperglycemia may cause an 
overshoot, which in turn calls for rapid administration of glucose. Such fluctuations may also influence 
potassium uptake by cells and in extreme conditions lead to changes in potassium levels which must be 
recognized and corrected [Kuntschen et al., 1983]. Finally, the general cumbersomeness of the extracor- 
poreal system, to which the patient must be tethered by sampling and infusion lines, limits its applicability. 
Modeling of blood glucose control by insulin has allowed the development of algorithms that are 
remarkably efficient in providing metabolic feedback in most clinical situations. The physiologic limits 
of closed-loop systems have been lucidly analyzed by Sorensen and coworkers [1982] andKraegen [1983]. 

134.8 Insulin Production Systems 

The exacting requirement placed on insulin dosage and timing of administration in diabetic patients, as 
well as the many years of safe and reliable treatments expected from the insulin delivery technology, have 
pointed to the advantages of implantable systems in which insulin would be synthesized as needed and 
made available to the organism on demand. 
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As already outlined in Table 134.6 four avenues have been considered and undergone chemical eval- 
uation: whole organ transplantation, human islet and xenogeneic islet transplantation, immunisolation 
of normal or tumoral insulin-secreting tissue, and transplantation of cells genetically engineered to replace 
the functions of the beta cells. 

Pancreas Transplantation 

Human allograft transplantation, first attempted over 20 years ago, has been slow in reaching clinical 
acceptance because of the difficulty of identifying healthy cadaver organs (the pancreas is also a digestive 
gland which undergoes autolysis soon after death) and the need to deal with the gland exocrine secretion, 
which serves no useful purpose in diabetes but nonetheless must be disposed of. After many ingenious 
attempts to plug the secretory channels with room-temperature vulcanizing biopolymers, a preferred 
surgical technique has been developed whereby the pancreas is implanted in the iliac fossa, its arterial 
supply and venous drainage vessels anastomosed to the iliac artery and vein, and the Wirsung canal 
implanted in the urinary bladder. Surprisingly, the bladder mucosa is not substantially damaged by 
pancreatic enzymes, and the exocrine secretion dries up after a while. 

The success rate of pancreatic tranplantation is now approaching that of heart and liver transplantation. 
Since it is mostly offered to immunosuppressed uremic diabetic patients who previously received or 
concurrently receive a kidney transplant, preventative therapy against organ rejection does not constitute 
an additional risk. The main limitation remains the supply of donor organs, which in the United States 
will probably not exceed 2000 per year, that is to say, very far from meeting the incidence of new cases 
of juvenile diabetes [Robertson & Sutherland, 1992]. Therefore, human organ transplantation is not a 
solution to the public health problems of diabetes and ensuing complications. 

Human Islet Transplantation 

Interest in the transplantation of islets of Langerhans, isolated from exocrine, vascular, and connective 
tissue by enzymatic digestion and cell separation techniques, has received a boost following the clinical 
demonstration that it can lead to insulin independence in type I diabetic patients [Scharp et al., 1990]. 
However, this "proof of principle" has not been followed by widespread application. Not only is the 
overall supply of cadaveric organs grossly insufficient in relating to the need (by as much as two orders 
of magnitude), but the islet separation techniques are complex and incompletely standardized. Therefore, 
it is difficult to justify at this point cutting into the limited supply of human pancreata for whole organ 
transplantation, the more so that whole organ replacement has become increasingly successful. Autolo- 
gous islet transplantation can be successfully performed in relatively uncommon cases of pancreatic and 
polyscystic disease, where the risk of spilling pancreatic juice in the peritoneal cavity necessitates the 
removal of the entire organ. Allogeneic islet transplantation under the cover of pharmacologic immun- 
osuppression is useful inasmuch as it provides a benchmark against which to evaluate islet transplantation 
from animal sources and the benefits of membrane immunoisolation. 

Xenogeneic Islet Transplantation 

The term bioartificial pancreas refers to any insulin-production-glycemia-regulation system combining 
living pancreatic beta cells or equivalents with a synthetic polymer membrane of gel to protect the 
transplant against immune rejection. A wide variety of device designs, cell sources and processing 
techniques, implant location, and biomaterial formulation and characterization have been investigated 
(Table 134.8). Common to all is the belief that if transplantation of insulin-producing tissue is to serve 
the largest number of insulin-dependent diabetic patients possible, xenogeneic tissue sources will have 
to be identified. In that context, protection by a semipermeable membrane may be the most effective 
way to dispense with drug immunosuppression therapy [Lysaght et al., 1994]. However, a number of 
issues must be addressed and resolved before the bioartificial pancreas becomes a clinically acceptable 
treatment modality [Colton, 1994]. 
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TABLE 134.8 Membrane-Encapsulated Cell Transplants 



Technology Microencapsulation 

Macroencapsulation 
Coextrusion 
Location Vascular release Intravascular 

Perivascular 
Portal release Intraperitoneal 

Intrahepatic 
Intrasplenic 
Systemic release Subcutaneous 

Intramuscular 
Cell source Human islets or beta cells 



, fetal or adult 
Animal islets or beta cells 

Functionally active tumor tissue 

Immortalized cell lines 

Genetically engineered cell lines 
Cell processing Isolation and purification 

Culture and banking 

Cryopreservation 

Implant manufacturing 
Membrane characterization Envelope mechanical stability 

Envelope chemical stability 

Diffusion and filtration kinetics 

Immunoportection 

Bioacceptance 



Tissue Procurement 

In the evolution of type I diabetes, the oral glucose tolerance test does not become abnormal until about 
70% of the islets are destroyed. Therefore, roughly 250,000 human islets, or about 3500 islets per kilogram 
of body weight, should suffice to normalize blood glucose levels. In reality, clinical observations of 
transplanted patients indicate a need for a considerably larger number (3500-6000 islets/kg, and perhaps 
more). This suggests that many processed islets are either nonviable or not functional because they have 
been damaged during the isolation procedure or by the hyperglycemic environment of the transplanted 
patient [Eizirik et al, 1992]. 

To procure such large numbers of islets an animal source must be identified. Porcine islets are favored 
because of the size of the animal, the availability of virus-free herds, and the low antigenicity of porcine 
insulin. Pig islet separation procedures are not yet fully standardized, and there are still considerable 
variations in terms of yield, viability, and insulin secretory function. Quality control and sterility control 
present serious challenges to industrialization. 

Alternative tissue sources are therefore being investigated. Insulinomas provided the first long-term 
demonstration of the concept of encapsulated endocrine tissue transplant [Altman et al., 1984]. Genet- 
ically engineered cell lines which can sense glucose concentration and regulate it with-in the physiologic 
range have been reported, but so far none has matched the secretory ability of normal pancreatic tissue 
or isolated islets. This approach remains nonetheless attractive for large-scale device production, even 
though no timetable can be formulated for successful development. 

Device Design 

Immunoisolation of allogeneic or xenogeneic islets can be achieved by two main classes of technology: 
microencapsulation and macroencapsulation. Microencapsulation refers to the formation of a spherical 
gel around each islet, cell cluster, or tissue fragment. Calcium alginate, usually surrounded with a 
polyanion such as poly-L-lysine to prevent biodegradation, and at times overcoated with an alginate layer 
to improve the biocompatibility, has been the most common approach, although other polymers may 
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be substituted. Suspension of microcapsules are typically introduced in the peritoneal cavity to deliver 
insulin to the portal circulation. 

Technical advances have permitted the fabrication of increasingly smaller microcapsules (order of 
150-200 (im) which do not clump, degrade, or elicit too violent a tissue reaction. Control of diabetes 
has been achieved in mice, rats, dogs, and, on pilot basis, in humans [Soon-Siong et al., 1994] . An obstacle 
to human application has been the difficulty, if not the impossibility, of retrieving the very large number 
of miniature implants, many of which adhere to tissue. There is also a concern for the antigenic burden 
that may arise upon polymer degradation and liberation of islet tissue, whether living or necrotic. 

Macroencapsulation refers to the reliance on larger, prefabricated envelopes in which a slurry of islets 
or cell clusters is slowly introduced and sealed prior to implantation. The device configuration can be 
tubular or planar. The implantation site can be perivascular, intravascular, intraperitoneal, subcutaneous, 
or intravascular. 

Intravascular Devices 

In the most successful intravascular device (which might be more accurately designated as perivascular), 
a semipermeable membrane is formed into a wide-bore tube connected between artery and vein or 
between the severed ends of an artery as if it were a vascular graft. The islets are contained in a gel matrix 
within a compartment surrounding the tube through which blood circulates. In other embodiments, the 
islets are contained in semipermeable parallel hollow fibers or coiled capillaries attached to the external 
wall of a macroporous vascular prosthesis. In all cases, a rising glucose concentration in the blood stream 
leads to glucose diffusion into the islet compartment, which stimulates insulin production, raises its 
concentration in the gel matrix, and promotes diffusion into the blood stream. 

In an earlier type of perivascular device, blood is forced to circulate through the lumina of a tight 
bundle of hollow fibers, and the islet suspension is placed in the extracapillary space between fibers and 
circumscribed by a rigid shell, as was the case in the design which established the effectiveness of 
immunoprotected islets [Chick et al., 1987]. A truly intravascular device design has been proposed in 
which the islets are contained within the lumina of a bundle of hollow fibers which are plugged at both 
ends and placed surgically in the blood stream of a large vein or artery, in a mode reminiscent of the 
intravenous oxygenator (IVOX). Hemocompatibility has been the major challenge with that approach. 

There have also been attempts to develop extracorporeal devices with a semipermeable tubular mem- 
brane connected transcutaneously, through flexible catheters, to an artery or vein. The islets are seeded 
in the narrow "extravascular" space separating the membrane from the device external wall and can 
therefore be inspected and, in case of need, replaced. This concept has evolved progressively to wider- 
bore (3-6 mm ID), spirally coiled tubes in a disc-shaped plastic housing. Such devices have remained 
patent for several months in dogs in the absence of anticoagulation and have shown the ability to correct 
hyperglycemia in spontaneous or experimentally induced diabetes. 

In order to accelerate glucose and insulin transport across synthetic membranes and shorten the reactive 
time lag of immuno-protective islets, Reach and Jaffrin [1984] have proposed to take advantage of a 
Starling-type ultrafiltration cycle made possible, in blood-perfused devices, by the arteriovenous pressure 
difference between device inlet and outlet. An outward-directed ultrafiltration flux in the first half of the 
blood conduit is balanced by a reverse readsorption flux in the second half, since the islet compartment 
is fluid-filled, rigid, and incompressible. An acceleration of the response time to fluctuations in blood 
glucose levels has been demonstrated both by modeling and experimentally. This system may also enhance 
the transport of oxygen and nutrients and improve the metabolic support of transplanted tissue. 

The primary obstacle to clinical acceptance of the intravascular bioartificial pancreas is the risk of 
thrombosis (including obstruction and embolism) of a device expected to function for several years. The 
smaller the diameter of the blood channel, and the greater the surface of polymeric material exposed to 
blood, the more likely are thrombotic complications. These devices also share with vascular grafts the 
risk of a small but definite incidence of infection of the implant and the potential sacrifice of a major 
blood vessel. Therefore, their clinical application is likely to remain quite limited. 
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Intratissular Devices 

Tubular membranes with diameters on the order of 0.5-2 mm have also been evaluated as islet containers 
for subcutaneous, intramuscular, or intracavitary implantation: The islets are contained inside the mem- 
brane envelope at a low-volume fraction in a gel matrix. A problem is that small- diameter tubes require 
too much length to be practical. Large-diameter tubes are mechanically fragile and often display a core 
of necrotic tissue because diffusion distances are too long for adequate oxygen and nutrient transport to 
the islets. Both systems are subject to often unpredictable foreign body reactions and the development 
of scar tissue which further impairs metabolic support of the transplanted tissue. However materials that 
elicit a minimal tissue reaction have been identified [Galletti, 1992], and further device design and 
evaluation is proceeding at a brisk pace [Scharp et al., 1994]. 

Some microporous materials display the property of inducing neovascularization at the tissue-material 
interface and in some cases within the voids of a macroporous polymer. This phenomenon is thought 
to enhance mass transport for nutrient and secretory products by bringing capillaries in closer proximity 
to immuno-seperated cells. Pore size, geometry, and interconnections are critical factors in vascularization 
[Colton, 1994]. Some encapsulated cell types also stimulate vacularization beyond the level observed 
with empty devices. Chambers made by laminating a 5-p.m porosity expanded polytetrafluorethylene 
(Gore PTFE) on a 9.45-p.m Biopore (Millipore) membrane have shown the most favorable results 
[Brauker et al., 1992]. The laminated, vascualrized membrane structure has been fabricated in a sand- 
wichlike structure that can be accessed through a port to inject an islet suspension once the empty device 
has been fully integrated in the soft tissues of the host. This organoid can nonetheless be separated from 
adjacent tissues, exteriorized and retrieved. 

Polymers for Immunoisolation 

The polymeric materials used in bioartificial endocrine devices serve two major purposes: ( 1 ) As a scaffold 
and an extracellular matrix, they favor the attachment and differentiation of functional cells or cell 
clusters, and keep them separate from one another, which in some cases has proven critical; (2) as 
permselective envelopes they provide immunoisolation of the transplant from the host, while inducing 
a surrounding tissue reaction which will maximize the diffusional exchange of solutes between the 
transplant and its environment. A number of materials can be used for these purposes (Table 134.9). 

The matrix materials typically are gels made of natural or synthetic polyelectrolytes, with quite specific 
requirements in terms of viscosity, porosity, and electric charge. In some cases specific attachment or 
growth factors may be added. 

For immunoisolation, the most commonly used envelopes are prepared from polyacrylonitrile-poly- 
vinyl chloride copolymers. These membranes display the anistropic structure typical of most ultrafiltra- 
tion membranes, in which a thin retentive skin is supported on a spongy matrix [Colton, 1994]. The 



TABLE 134.9 Polymer Technology for Cell Transplantation 



Component 



Function 



Polymer 



Scaffold 



Synthetic or semisynthetic extracellular matrix 



Physical separation of cells or cell aggregates 



Collagen 

Gelatin 

Alginate 

Agarose 

Chitosan 

Hyaluronan 

Envelope 

Microencapsulation Stabilization of cell suspension and immunoseperation Alginate 

Poly-L-lysine 

Macroencapsulation Physical immunoseperation Polyacrylonitrile 

Polyvinyl copolymers 
Polysulfones 
Modified celluloses 



Transport-promoting tissue-material interface 



PTFE-Biopore laminate 
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shape and dimensions of the interconnecting voids and the microarchitecture of the inner and outer 
surfaces of the membrane are often critical characteristics for specific cell types and implant locations. 

Protection from Immune Rejection 

The central concept of immunoisolation is the placement of a semipermeable barrier between the host 
and transplanted tissue. It has been tacitly assumed that membranes with a nominal 50,000-100,000- 
daltons cutoff would provide adequate protection, because they would prevent the passage of cells from 
the host immune system and impair the diffusion of proteins involved in the humoral component of 
immune rejection. 

This belief has been largely supported by empirical observations of membrane-encapsulated graft 
survival, with occasional failures rationalized as membrane defects. 

However, the cut-off point of synthetic semipermeable membranes is not as sharp as one may believe, 
and there is often a small number of pores which theoretically at least allow the transport of much larger 
molecules than suggested by the nominal cut-off definition. Therefore, complex issues of rate of transport, 
threshold concentration of critical proteins, adsorption and denaturation in contact with polymeric 
materials, and interactions between proteins involved in immune reactions in an environment where 
their relative concentrations may be far from normal may all impact in immunoprotection. The possibility 
of antigen release from living or necrotic cells in the sequestered environment must also be considered. 
The cellular and luminal mechanisms of immunoseperation by semipermeable membranes and the 
duration of the protection they afford against both immune rejection of the graft and sensitizations of 
the host call for considerably more study. 

134.9 Outlook 

Replacement of the endocrine functions of the pancreas presents a special challenge in substitutive 
medicine. The major disease under consideration, diabetes, is quite common, but a reasonably effective 
therapy already exists with standard insulin administration. The disease is not immediately life-threat- 
ening, and therefore optimization of treatment is predicated on the potential for reducing the long-term 
complications of diabetes. Therefore, complete clinical validation will require decreases of observations, 
not merely short-term demonstration of effectiveness in controlling blood glucose levels. 

Standard insulin treatment is also relatively inexpensive. Competitive therapeutic technologies will 
therefore be subject to a demanding cost-benefit analysis before they are widely recognized. Finally, the 
patient self-image will be a major factor in the acceptance of new diagnostic or treatment modalities. 
Already some demonstrably useful devices, such as subcutaneous glucose sensors, portable insulin pumps, 
and the extracorporeal artificial beta cell, have failed in the marketplace for reasons of excessive com- 
plexity, incompatibility with all activities of daily life, physicians' skepticism, or cost. Newer technologies 
involving the implantation of animal tissue or genetically engineered cells will bring about a new set of 
concerns, whether justified or imaginary. There is perhaps no application of the artificial organ concept 
where human factors are so closely intertwined with the potential of science and technology as is already 
the case with the artificial pancreas. 

Defining Terms 

Anabolism: The aspect of metabolism in which relatively simple building blocks are transformed into 
more complex substances for the purpose of storage or enhanced physiologic action. 

Artificial beta cell: A system for the control of blood glucose levels based on servo-controlled admin- 
istration of exogenous insulin based on continuous glucose level monitoring. 

Artificial pancreas: A device or system designed to replace the natural organ. By convention, this term 
designates substitutes for the endocrine function of the pancreas and specifically glucose homeo- 
stasis though the secretion of insulin. 

Autolysis: Destruction of the components of a tissue, following cell death, mediated by enzymes 
normally present in that tissue. 
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Bioartificial pancreas: A device or implant containing insulin-producing, glycemia-regulating cells in 
combination with polymeric structures for mechanical support and/or immune protection. 

Catabolism: The aspect of metabolism in which substances in living tissues are transformed into waste 
products or solutes of simpler chemical composition. 

Endogenous: Originating in body tissues. 

Exogenous: Introduced in the body from external sources. 

Extracorporeal artificial pancreas: An apparatus including a glucose sensor, a minicomputer with 
appropriate algorithms, an insulin infusion pump, and a glucose infusion pump, the output of 
which are controlled so as to maintain a constant blood glucose level. (Synonymous with artificial 
beta cell.) 

Hyperglycemia: Abnormally high blood glucose level (in humans, above 140 mg/100 ml). 

Hypoglycemia: Abnormally low blood glucose level (in humans, below 50 mg/100 ml). 

Immunoisolation: Separation of transplanted tissue from its host by a membrane or film which 
prevents immune rejection of the transplant by forming a barrier against the passage of immuno- 
logically active solutes and cells. 

Insulinoma: A generally benign tumor of the pancreas, originating in the beta cells and functionally 
characterized by a secretion of insulin and the occurrence of hypoglycemic coma. Insulinoma cells 
are thought to have lost the feedback function and blood-glucose-regulating capacity of normal 
cells, or to regulate blood glucose around an abnormally low set point. 

Ketoacidosis: A form of metabolic acidosis encountered in diabetes mellitus, in which fat is used as a fuel 
instead of glucose (because of lack of insulin), leading to high concentration of metabolites such as 
acetoacetic acid, (3 hydroxybutyzic acid, and occasionally acetone in the blood and intestinal fluids. 

Organoid: A device — typically an implant — in which cell attachment and growth in the scaffold pro- 
vided by a synthetic polymer sponge or mesh leads to a structure resembling that of a natural 
organ including, in many cases, revascularization. 
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Further Information 

A useful earlier review of insulin delivery systems is to be found in: Artificial Systems for Insulin Delivery, 
edited by P. Brunetti, K.G.M.M. Alberti, A.M. Albisser, K.D. Hepp, and M. Massi Benedetti, Serono 
Symposia Publications from Raven Press, New York, 1983. An upcoming review, focused primarily on 
islet transplantation will be found in Pancreatic Islet Transplantation, vol. Ill: Immunoisolation of Pancre- 
atic Islets, edited by R.P. Lanza and W.L. Chick, R.G. Landes Company, Austin, Texas, 1994. A volume 
entitled Implantation Biology: The Host Response and Biomedical Devices, edited by R.S. Greco, from CRC 
Press, 1994, provides a background in the multiple aspects of tissue response to biomaterials and implants. 
Individual contributions to the science and technology of pancreas replacement are likely to be found 
in the following journals: Artificial Organs, the journal of Cell Transplantation, and Transplantation. 
Reports on clinically promising devices are often published in Diabetes and Lancet. 
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Brown University 135.8 Summary 



In adult mammals, including humans, the peripheral nervous system (PNS) is capable of regeneration 
following injury. The PNS consists of neural structures located outside the central nervous system (CNS), 
which is comprised of the brain and spinal cord. Unfortunately, CNS injuries rarely show a return of 
function, although recent studies suggest a limited capacity for recovery under optimal conditions. Neural 
regeneration is complicated by the fact that neurons, unlike other cell types, are not capable of prolifer- 
ating. In successful regeneration, sprouting axons from the proximal nerve stump traverse the injury site, 
enter the distal nerve stump, and make new connections with target organs. Current surgical techniques 
allow surgeons to realign nerve ends precisely when the lesion does not require excision of a large nerve 
segment. Nerve realignment increases the probability that extending axons will encounter an appropriate 
distal neural pathway, yet the incidence of recovery in the PNS is highly variable, and the return of function 
is never complete. Surgical advances in the area of nerve repair seem to have reached an impasse, and 
biologic rather than technical factors now limit the quality of regeneration and functional recovery. The 
use of synthetic nerve guidance channels facilitates the study of nerve regeneration in experimental studies 
and shows promise in improving the repair of injured human nerves. Advances in the synthesis of 
biocompatible polymers have provided scientists with a variety of new biomaterials which may serve as 
nerve guidance channels, although the material of choice for clinical application has not yet been identified. 
The purpose of this chapter is to review the biologic aspects of PNS regeneration (CNS regeneration 
will be discussed in a subsequent chapter) and the influence of nerve guidance channels on the regen- 
eration process. The biologic mechanisms and the guidance channel characteristics regulating regener- 
ation will be emphasized, since the rational design of guidance systems hinges on the integration of 
engineering (polymer chemistry, materials science, and so on) and biologic (cellular and molecular 
events) principles. 
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FIGURE 135.1 Spinal cord in cross-section. Relationship between motor and sensory cell bodies and their axons. 
Motor neurons (black) are located in the anterior horn of the gray matter within the spinal cord. Sensory neurons 
(white) are located in dorsal root ganglia just outside the spinal cord. Axons exit via dorsal (sensory) and ventral 
(motor) spinal roots and converge to form peripheral nerves, which connect to target structures (sensory endings 
and muscles). Note that the distance between axons and their corresponding cell bodies can be quite long. 

135.1 Peripheral Nervous System 

Peripheral nerve trunks are responsible for the innervation of the skin and skeletal muscles and contain 
electrically conductive fibers termed axons, whose cell bodies reside in or near the spinal cord (Fig. 135.1). 
Nerve cells are unique in that their cellular processes may extend for up to one meter or longer (e.g., for 
axons innervating the skin of the foot). Three types of neuronal fibers can be found in peripheral nerves; 
(1) motor fibers, whose axons originate in the anterior horn of the spinal cord and terminate in the 
neuromuscular ending of skeletal muscle; (2) sensory fibers, which are the peripheral projections of 
dorsal root ganglion neurons and terminate at the periphery either freely or in a variety of specialized 
receptors; and (3) sympathetic fibers, which are the postganglionic processes of neurons innervating 
blood vessels or glandular structures of the skin. These three fiber types usually travel within the same 
nerve trunk, and a single nerve can contain thousands of axons. All axons are wrapped by support cells 
named Schwann cells. Larger axons are surrounded by a multilamellar sheath of myelin, a phospholipid- 
containing substance which serves as an insulator and enhances nerve conduction. An individual Schwann 
cell may ensheath several unmyelinated axons but only one myelinated axon, within its cytoplasm. 
Schwann cells are delineated by a fine basal lamina and are in turn surrounded by a complex structure 
made of collagen fibrils interspersed with fibroblasts and small capillaries, forming a tissue termed the 
endoneurium. A layer of flattened cells with associated basement membrane and collagen constitute the 
perineurium, which envelops all endoneurial constituents. The presence of tight junctions between the 
perineurial cells creates a diffusion barrier for the endoeurium and thus functions as blood-nerve barrier. 
The perineurium and its endoneurial contents constitute a fascicle, which is the basic structural unit of 
a peripheral nerve. Most peripheral nerves contain several fascicles, each containing numerous myelinated 
and unmyelinated axons. Since the fascicular tracts branch frequently and follow a tortuous pathway, the 
cross-sectional fascicular pattern changes significantly along a nerve trunk. Outside the perineurium is 
the epineurium, a protective, structural connective tissue sheath made of several layers of flattened 
fibroblastic cells interspersed with collagen, blood vessels, and adipocytes (fat cells). Peripheral nerves 
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are well vascularized, and their blood supply is derived either from capillaries located within the epineu- 
rium and endoneurium (i.e., vasa nervosum) or from peripheral vessels which penetrate into the nerve 
from surrounding arteries and veins. 

135.2 PNS Response to Injury 

Nerves subjected to mechanical, thermal, chemical, or ischemic insults exhibit a typical combination of 
degenerative and regenerative responses. The most severe from of injury results from complete transaction 
of the nerve, which interrupts communication between the nerve cell body and its target, disrupts the 
interrelations between neurons and their support cells, destroys the local blood-nerve barrier, and triggers 
a variety of cellular and humoral events. Injuries close to the nerve cell body are more detrimental than 
injuries occurring more peripherally. 

The cell bodies and axons of motor and sensory nerves react in a characteristic fashion to transection 
injury. The central cell body (which is located within or just outside the spinal cord) and its nucleus 
swell. Neurotransmitter (e.g., the chemicals which control neuronal signaling) production is diminished 
drastically, and carbohydrate metabolism is shifted to the pentose-phosphate cycle. These changes indicate 
a metabolic shift toward production of the substrates necessary for reconstituting the cell membrane and 
other structural components. For example, the synthesis of the protein tubulin, which is the monomeric 
elements of the microtubule, the structure responsible for axoplasmic transport, is increased dramatically. 
Following injury, tubulin and other substrates produced in the cell soma are transported, via slow and 
fast axoplasmic transport, to the distal nerve fiber. 

Immediately after injury, the tip of the proximal stump (i.e., the nerve segment closest to the spinal 
cord) swells to two or three times its original diameter, and the severed axons retract. After several days, 
the proximal axons begin to sprout vigorously and growth cones emerge. Growth cones are axoplasmic 
extrusions of the cut axons which flatten and spread when they encounter a solid, adhesive surface. They 
elicit numerous extensions (fillipodia) which extend outward in all directions until the first sprout reaches 
an appropriate target (i.e., Schwann cell basal lamina). The lead sprout is usually the only one to survive, 
and the others quickly die back. 

In the distal nerve stump, the process of Wallerian degeneration begins within 1 to 2 hours after 
transection. The isolated axons and their myelin sheaths are degraded and phagocytized by Schwann 
cells and macrophages, leading to a complete dissociation of neurotubules and neurofilaments. This 
degradation process is accompanied by a proliferation of Schwann cells which retain the structure of the 
endoneurial tube by organizing themselves into ordered columns termed bands of Bunger. While the 
Schwann cells multiply, the other components of the distal nerve stump atrophy or are degraded, resulting 
in a reduction in the diameter of the overall structure. Severe retrograde degeneration can lead to cell 
atrophy and, eventually, cell death. Peripheral nerve transection also leads to marked atrophy of the 
corresponding target muscle fibers. 

135.3 PNS Regeneration 

In successful regeneration, axons sprouting from the proximal stump bridge the injury gap and encounter 
an appropriate Schwann cell column (band of Bunger). In humans, axons elongate at an average rate of 
1 mm per day. The bridging axons are immediately engulfed in Schwann cell cytoplasm, and some of 
them become myelinated. Induction of myelogenesis by the Schwann cell is thought to depend on axonal 
contact. Some of the axons reaching the distal stump traverse the length of the nerve to form functional 
synapses, but changes in the pattern of muscular and sensory reinnervation invariably occur. The mis- 
routing of growing fibers occurs primarily at the site of injury, since once they reach the distal stump 
their paths do not deviate further. The original Schwann cell basal lamina in the distal stump persists 
and may aid in the migration of axons and proliferating Schwann cells. Axons which fail to reach an 
appropriate end organ or fail to make a functional synapse eventually undergo Wallerian degeneration. 
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Abnormal regeneration may lead to the formation of a neuroma, a dense, irregular tangle of axons which 
can cause painful sensations. Some nerve injuries result from clean transections of the nerve which leave 
the fascicular pattern intact. More often, a segment of nerve is destroyed (e.g., during high-energy trauma 
or avulsion injuries in which tissue is torn), thus creating a nerve deficit. The resulting nerve gap is the 
separation between the proximal and distal stumps of a damaged nerve that is due to elastic retraction 
of the nerve stumps and tissue loss. The longer the nerve gap, the less likely regeneration will occur. 

135.4 Surgical Repair of Transected Peripheral Nerves 

In the absence of surgical reconnection, recovery of function following a transection or gap injury to a 
nerve is negligible, since ( 1 ) the cell bodies die due to severe retrograde effects, (2) the separation between 
nerve ends precludes sprouting axons from finding the distal stump, (3) connective tissue ingrowth at 
the injury site acts as physical barrier to neurite elongation, and (4) the proximal and distal fascicular 
patterns differ so much that extending axons cannot find an appropriate distal pathway. 

Efforts to repair damaged peripheral nerves by surgical techniques date back many years although 
histologic evidence of regeneration was not reported by Ramon y Cajal until the first part of this century. 
Early attempts to reconnect severed peripheral nerves utilized a crude assortment of materials, and 
anatomic repair rarely led to an appreciable return of function. In the 1950s, the concept of end-to-end 
repair was refined by surgeons who directly reattached individual nerve fascicles or groups of fascicles. 
Further refinements occurred during the 1960s with the introduction of the surgical microscope and the 
availability of finer suture materials and instrumentation. Microsurgical nerve repair led to a significant 
improvement in the return of motor and sensory function in patients to the point that success rates of 
up to 70% can currently be achieved. In microsurgical nerve repair the ends of severed nerves are apposed 
and realigned by placing several strands of very fine suture material through the epineurial connective 
tissue without entering the underlying nerve fascicles. Nerve-grafting procedures are performed when 
nerve retraction or tissue loss prevents direct end-to-end repair. Nerve grafts are also employed when 
nerve stump reapproximation would create tension along the suture line, a situation which is known to 
hinder the regeneration process. In grafting surgery, an autologous nerve graft from the patient, such as 
the sural nerve (whose removal results in little functional deficit), is interposed between the ends of the 
damaged nerve. 

135.5 Repair with Nerve Guidance Channels 

In repair procedures using nerve guidance channels, the mobilized ends of a severed nerve are introduced 
in the lumen of a tube and anchored in place with sutures (Fig 135.2). Tubulation repair provides: (1) a 
direct, unbroken path between nerve stumps; (2) prevention of scar tissue invasion into the regenerating 
environment; (3) directional guidance for elongation neurites and migrating cells; (4) proximal-distal 
stump communication without suture-line tension in cases of extensive nerve deficit; (5) minimal number 
of epineurial stay sutures, which are known to stimulate connective tissue proliferation; and (6) preser- 
vation, within the guidance channel lumen, of endogenous trophic or growth factors released by the 
traumatized nerve ends. Guidance channels are also useful from an experimental perspective: (1) the gap 
distance between the nerve stumps can be precisely controlled; (2) the fluid and tissue entering the 
channel can be evaluated; (3) the properties of the channel can be varied; and (4) the channel can be 
filled with various drugs, gels, and the like. Nerves with various dimensions from several mammalian 
species, including mice, rats, rabbits, hamsters, and nonhuman primates have been tested. The regener- 
ated tissue in the guidance channel is evaluated morphologically to quantify the outcome of regeneration. 
Parameters analyzed include the cross-sectional area of the regenerated nerve cable, the number of 
myelinated and unmyelinated axons, and the relative percentages of cellular constituents (i.e., epineurium, 
endoneurium, blood vessels, and so on). Electrophysiologic and functional evaluation can also be per- 
formed in studies conducted for long periods (e.g., several weeks or more). 
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FIGURE 135.2 Tube to repair nerve. Surgical placement of nerve guidance channel. 

The frequent occurrence of nerve injuries during the world wars of this century stimulated surgeons 
to seek simpler, more effective means of repairing damaged nerves. A variety of biologic and synthetic 
materials shaped into cylinders were investigated (Table 135.1). Bone, collagen membranes, arteries, and 
veins (either fresh or freeze-dried to reduce antigenicity) were used from the late 1800s through the 1950s 
to repair nerves. These materials did not enhance the rate of nerve regeneration when compared to 
regular suturing techniques, so clinical applications were infrequent. Magnesium, rubber, and gelatin 
tubes were evaluated during World War I, and cylinders of parchment paper and tantalum were used 
during World War II. The poor results achieved with these materials can be attributed to poor biocom- 
patibility, since the channels elicited an intense tissue response which limited the ability of growing axons 
to reach the distal nerve stump. Following World War II, polymeric materials with more stable mechanical 
and chemical properties became available. Millipore (cellulose acetate) and Silastic (silicone elastomer) 
tubing received the greatest attention. Millipore, a filter material with a maximum pore size of 0.45u.m, 
showed early favorable results. In human trials, however, Millipore induced calcification and eventually 
fragmented several months after implantation so that its use was discontinued. Silastic tubing, a biologically 
inert polymer with rubberlike properties, was first tested in the 1960s. Thin-walled Silastic channels were 
reported to support regeneration over large gaps in several mammalian species. Thick-walled tubing was 
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TABLE 135.1 Materials Used for Nerve Guidance Channels 

Synthetic materials 
Nonresorbable 
Nonporous 

Ethylene- Vinyl Acetate Copolymer (EVA) 
Polytetrafiuoroethylene (PTFE) 
Polyethylene (PE) 
Silicone elastomers (SE) 
Polyvinyl chloride (PVC) 
Polyvinylidene fluoride (PVDF) 
Microporous 

Cortex, expanded polytetrafiuoroethylene (ePTFE) 
Millipore (cellulose filter) 
Semipermeable 

Polyacrylonitrile (PAN) 

Polyacrylonitrile/Polyvinyl chloride (PAN/PVC) 
Polysulfone (PS) 
Bioresorbable 

Polyglycolide (PGA) 
Polylactide (PLLA) 
PGA/PLLA blends 
Biologic materials 
Artery 
Collagen 

Hyaluronic acid derivatives 
Mesothelial tubes 
Vein 
Metals 

Stainless steel 
Tantalum 



associated with nerve necrosis and neuroma production. The material showed no long-term degradation 
nor did it elicit a sustained inflammatory reaction. As a result, thin-walled Silastic tubing has been used, 
on a very limited basis, in the clinical repair of severed nerves. 

135.6 Recent Studies with Nerve Guidance Channels 

The availability of a variety of new biomaterials has led to a resurgence of tubulation studies designed 
to elucidate the mechanisms of nerve regeneration. The spatial-temporal progress of nerve regeneration 
across a 10-mm rat sciatic nerve gap repaired with a silicone elastomer tube has been analyzed in detail 
(Fig. 135.3). During the first hours following repair, the tube fills with a clear, protein-containing fluid 
exuded by the cut blood vessels in the nerve ends. The fluid contains the clot-forming protein, fibrin, as 
well as factors known to support nerve survival and outgrowth. By the end of the first week, the lumen 
is filled with a longitudinally oriented fibrin matrix which coalesces and undergoes syneresis to form a 
continuous bridge between the nerve ends. The fibrin matrix is soon invaded by cellular elements 
migrating from the proximal and distal nerve stumps, including fibroblasts (which first organize along 
the periphery of the fibrin matrix), Schwann cells, macrophages, and endothelial cells (which form 
capillaries). At 2 weeks, axons advancing from the proximal stump are engulfed in the cytoplasm of 
Schwann cells. After 4 weeks some axons have reached the distal nerve stump, and many have become 
myelinated. The number of axons reaching the distal stump is related to the distance the regenerating 
nerve has to traverse and the length of original nerve resected. Silicone guidance channels do not support 
regeneration if the nerve gap is greater than 10 mm and if the distal nerve stump is left out of the guidance 
channel. The morphology and structure of the regenerated nerve is far from normal. The size and number 
of axons and the thickness of myelin sheaths are less than normal. Electroymyographic evaluation of 
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FIGURE 135.3 Regeneration process. Nerve regeneration through a guidance channel. 

nerves regenerated through silicone tubes reveals that axons can make functional synapses with distal 
targets, although nerve conduction velocities and signal amplitudes are slower than normal, even after 
many months. Attempts to improve the success rate and quality of nerve regeneration have led to the use 
of other tubular biomaterials as guidance channels (Table 135.1). biodurable materials such as acrylic 
copolymers, polyethylene, and porous stainless steel and bioresorbable materials including polyglycolides, 
polylactides, and other polyesters have investigated. There is some concern that biodurable materials may 
cause long-term complications via compression injury to nerves or soft tissues. Biodegradable materials 
offer the advantage of disappearing once regeneration is complete, but success thus far has been limited 
by swelling, increased scar tissue induction, and difficulty in controlling degradation rates. In all cases, 
these materials have displayed variable degrees of success in bridging transected nerves, and the newly 
formed nerves are morphologically quite different from normal peripheral nerves. The general spatial- 
temporal progress of nerve regeneration, however, resembles that described for the silicone channel model. 
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135.7 Enhancing Regeneration by Optimizing Nerve 
Guidance Channel Properties 

Manipulating the physical, chemical, and electrical properties of guidance channels allows control over 
the regenerating environment and optimization of the regeneration process. The following features of 
synthetic nerve guidance channels have been studied: (1) transmual permeability, (2) surface texture or 
microgeometry, (3) electric charges, (4) release of soluble factors, (5) inclusion of insoluble factors, and 
(6) seeding with neuronal support cells. 

Transmural Permeability 

The synthetic nerve guidance channel controls the regeneration process by influencing the cellular and 
metabolic aspects of the regenerating environment. Since transected nerves lose the integrity of their 
blood-nerve barrier (which controls oxygen and carbon dioxide tensions, pH, and the concentrations of 
nutrients and essential proteins), the guidance channel's transmural mass transfer characteristics mod- 
ulate solute exchange between the regenerating tissue and the surrounding fluids Nerves regenerated 
through permselective tubes display more normal morphologic characteristics than nerves regenerated 
in impermeable silicone elastomer (SE) and polyethylene (PE) or freely permeable expanded polytet- 
rafluoroethylene (ePTFE) tubes. Nerves found in semipermeable tubes feature more myelinated axons 
and less connective tissue. Nerve cables regenerated in semipermeable or impermeable tubes are both 
round-shaped and free from attachment to the inner wall of the guidance channel. Nerves regenerated 
in highly porous, open structures do not form a distinct cable but contain connective tissue and dispersed 
neural elements. The range of permeselectivity is very important, and optimal regeneration is observed 
with a molecular weight (MW) cut-off of 50,000-100,000 daltons (D). Permeselective PAN/PVC channels 
with an MW cut-off of 50,000 D support regeneration even in the absence of a distal nerve stump. 

These observations suggest that controlled solute exchange between the internal regenerative and 
external wound-healing environments is essential in controlling regeneration. The availability of oxygen 
and other nutrients may minimize connective tissue formation in permeable PAN/PVC and PS tubes. 
Decreased oxygen levels and waste buildup may increase connective tissue formation in SE and PE tubes. 
Regeneration may also be modulated by excitatory and inhibitory factors released by the wound-healing 
environment. Semipermeable channels may sequester locally generated growth factors while preventing 
the inward flux of molecules inhibitory to regeneration. 

Surface Texture or Microgeometry 

The microgeometry of the luminal surface of the guidance channel plays an important role in regulating 
tissue outgrowth. Expanded microfibrilar polytetrafluorethylene (ePTFE) tubes exhibiting different inter- 
nodal distances (1, 5, and 10 (im) were compared to smooth-walled impermeable PTFE tubes. Larger 
internodal distances result in greater surface irregularity and increased transmural porosity. Rough-walled 
tubes contained isolated fascicles of nerves disperses within a loose connective tissue stroma. The greater 
the surface roughness, the greater the spread of fascicles. In contrast, smooth-walled, impermeable PTFE 
tubes contained a discrete nerve cable delineated by an epineurium and located within the center of the 
guidance channel. Similar results were observed with semipermeable PAN/PVC tubes with the same 
chemistry and MW cut-off but with either smooth or rough surfaces. Nerves regenerated in tubes 
containing alternating sections of smooth and rough inner walls showed similar morphologies with an 
immediate change from single-cable to numerous fascicle morphology at the interface of the smooth 
and rough segments. 

These studies suggest that the microgeometry of the guidance channel lumen also modulates the nerve 
regeneration process. Wall structure changes may alter the protein and cellular constituents of the 
regenerating tissue bridge. For example, the orientation of the fibrin matrix is altered in the presence of 
a rough inner wall. Instead of forming a single, longitudinally oriented bridge connecting the nerve ends, 
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the fibrin molecules remain dispersed throughout the lumen. As a result, cells migrating in from the 
nerve stumps loosely fill the entire lumen rather than form a dense central structure. 

Electric Charge Characteristics 

Applied electric fields and direct dc stimulation are known to influence nerve regeneration in vitro and 
in vivo. Certain dielectric polymers may be used to study the effect of electric activity on nerve regen- 
eration in vivo and in vitro. These materials are advantageous in that they provide electric charges without 
the need for an external power supply, can be localized anatomically, are biocompatible, and can be 
formed into a variety of shapes including tubes and sheets. 

Electrets are a broad class of dielectric polymers which can be fabricated to display surface charges 
because of their unique molecular structure. True electrets, such as polytetrafluorethylene (PTFE), can 
be modified to exhibit a static surface charge due to the presence of stable, monopolar charges located 
in the bulk of the polymer. The sign of the charge depends on the poling conditions. Positive, negative, 
or combined charge patterns can be achieved. The magnitude of surface charge density is related to the 
number and stability of the monopolar charges. Charge stability is related to the temperature at which 
poling occurs. Crystalline piezoelectric materials such as polyvinylidene fluoride (PVDF) display transient 
surface charges related to dynamic spatial reorientation of molecular dipoles located in the polymer bulk. 
The amplitude of charge generation depends on the degree of physical deformation (i.e., dipole displace- 
ment) of the polymer structure. The sign of the charge is dependent on the direction of deformation, 
and the materials show no net charge at rest. 

Negatively and positively poled PVDF and PTFE tubes have been implanted as nerve guidance channels. 
Poled PVDF and PTFE channels contain significantly more myelinated axons than unpoled, but otherwise 
identical, channels. In general, positively poled channels contained larger neural cables with more myeli- 
nated axons than negatively poled tubes. It is not clear how static or transient charge generation affects 
the regeneration process. The enhancement of regeneration may be due to electrical influences on protein 
synthesis, membrane receptor mobility, growth core mobility, cell migration, and other factors. 

Release of Soluble Factors 

The release of soluble agents, including growth factors and other bioactive substances from synthetic 
guidance channels, may improve the degree and specificity of neural outgrowth. Using single or multiple 
injections of growth factors has disadvantages including early burst release, poor control over local drug 
levels, and degradation in biologic environments. Guidance channels can be prefilled with drugs or 
growth factors, but the aforementioned limitations persist. Advantages of using a local, controlled delivery 
system are that the rate and amount of factor release can be controlled and that the delivery can be 
maintained for long periods (several weeks). Channels composed of an ethylene-vinyl acetate (EVA) 
copolymer have been fabricated and designed to release incorporated macromolecules in a predictable 
manner. The amount of drug loaded, its molecular weight, and geometry of the drug-releasing structure 
affect the drug release kinetics. It is also possible to restrict drug release to the luminal side of the guidance 
channel by coating its outer wall with a film of pure polymer. 

Growth or neurontrophic factors that ensure the survival and general growth of neurons are produced 
by support cells (e.g., Schwann cells) or by target organs (e.g., muscle fibers). Some factors support 
neuronal survival, other support nerve outgrowth, and some do both. Numerous growth factors have 
been identified, purified, and synthesized through recombinant technologies (Table 135.2). In vivo, 
growth factors are found in solution in the serum or extracellular fluid or bound to extracellular matrix 
(ECM) molecules. Nerve guidance channels fabricated from EVA and designed to slowly release basic 
fibroblast growth factor (bFGF) or nerve growth factor (NGF) support regeneration over a 15-mm gap 
in a rat model. Control EVA tubes supported regeneration over a maximum gap of only 10 mm with no 
regeneration in 15-mm gaps. The concurrent release of growth factors which preferentially control the 
survival and outgrowth of motor and sensory neurons may further enhance regeneration, since the 
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TABLE 135.2 Growth Factors Involved in Peripheral Nerve Regeneration 

Growth Factor Possible function 

NGF — nerve growth factor Neuronal survival, axon-Schwann cell interaction 

BDNF — brain-derived neurotophic factor Neuronal survival 

CNTF — ciliary neuronotrophic factor Neuronal survival 

NT-3 — neuronotrophin 3 Neuronal survival 

NT-4 — neuronotrophin 4 Neuronal survival 

IGF-1 — insulinlike growth factor- 1 Axonal growth, Schwann cell migration 

IGF-2 — Insulinlike growth factor-2 Motoneurite sprouting, muscle reinnervation 

PDGF — platelet-derived growth factor Cell proliferation, neuronal survival 

aFGF — acidic fibroblast growth factor Neurite regeneration, cell proliferation 

bFGF — basic fibroblast growth factor Neurite regeneration, neovascularization 



TABLE 135.3 Neuronal Attachment and Neurite-Promoting Factors 

Factor Minimal Peptide Sequence 

Collagen RGD 

Fibronectin RGD 

Laminim RGD, SIKVAV, YIGSR 

Neural cell adhesion molecule (N-CAM) Unknown 

N-caherin Unknown 

majority of peripheral nerves contain both populations. For example, NGF and b-FGF control sensory 
neuronal survival and outgrowth, whereas brain-derived growth factor (BDGF) and ciliary neurontrophic 
factor (CNTF) control motor neuronal survival and outgrowth. Growth factors released by guidance 
channels may also allow regeneration over large nerve deficits, and important consideration in nerve 
injuries with severe tissue loss. The local release of other pharmacologic agents (e.g., anti-inflammatory 
drugs) may also be useful in enhancing nerve growth. 

Inclusion of Insoluble Factors 

Several neural molecules found on cell membranes and in the extracellular matrix are potent modulators 
of neural attachment and outgrowth (Table 135.3). Proteins responsible for eliciting and stimulating 
axon elongation are termed neurite promoting factors. The glycoprotein laminin, an ECM component 
present in the balsa lamina of Schwann cells, has been reported to promote nerve elongation in vitro 
and in vivo. Other ECM products, including the glycoprotein fibronectin and the proteoglycan heparan 
sulphate, also have been reported to promote nerve extension in vitro. Some subtypes of the ubiquitous 
protein collagen also support neural attachment. Filling guidance channels with gels of laminin and 
collagen has been shown to improve and accelerate nerve repair. The addition of longitudinally oriented 
fibrin gels to SE tubes has also been shown to accelerate regeneration. Collagen, laminin, and glycosami- 
noglycans (another ECM component) introduced in the guidance channel lumen support some degree 
of regeneration over a 15-20 mm gap in adult rats. The concentration of ECM gel is important, as thicker 
gels impede regeneration in semipermeable tubes. 

The activity of these large, insoluble ECM molecules (up to 10 6 D MW) can be mimicked by short 
sequences only 3-10 amino acids long (e.g., RGD, YIGSR) (Table 135.3). The availability of small, soluble, 
bioactive agents allows more precise control over the chemistry, conformation, and binding of neuron- 
specific substances. Additionally, their stability and linear structure facilitate their use instead of labile 
(and more expensive ) proteins which require three-dimensional structure for activity. Two- and three- 
dimensional substrates containing peptide mimics have been shown to promote neural attachment and 
regeneration in vitro and in vivo. 



•) 2000 by CRC Press LLC 



Seeding Neuronal Support Cells 

Adding neural support cells to the lumen of guidance channels is another strategy being used to improve 
regeneration or to make regeneration possible over otherwise irreparable gaps. For example, Schwann 
cells cultured in the lumen of semipermeable guidance channels have been shown to improve nerve repair 
in adult rodents. Cells harvested from inbred rats were first cultured in PAN/PVC tubes using various 
ECM gels as stabilizers. The Schwann cells and gel formed a cable at the center of the tube after several 
days in culture. Once implanted, the cells were in direct contact with the nerve stumps. A dose-dependent 
relationship between the density of seeded cells and the extent of regeneration was noted. Another 
approach toward nerve repair involves the use of Schwann cells, fibroblasts, and the like which are 
genetically engineered to secrete growth factors. The use of support cells which release neuronotrophic 
and neurite-promoting molecules may enable regeneration over large gaps. There is increasing evidence 
that PNS elements, especially Schwann cells, are capable of supporting CNS regeneration as well. For 
example, Schwann-cell-seeded semipermeable channels support regeneration at the level of the dorsal 
and ventral spinal cord roots and the optic nerve, which are CNS structures. 

135.8 Summary 

The permeability, textural, and electrical properties of nerve guidance channels can be optimized to 
impact favorably on regeneration. The release of growth factors, addition of growth substrates, and 
inclusion of neural support cells and genetically engineered cells also enhance regeneration through 
guidance channels. Current limitations in PNS repair, especially the problem of repairing long gaps, and 
in CNS repair, where brain and spinal cord trauma rarely result in appreciable functional return, may 
benefit from advances in engineering and biology. The ideal guidance system has not been identified but 
will most likely be a composite device that contains novel synthetic or bioderived materials and that 
incorporates genetically engineered cells and new products from biotechnology. 
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As the ability to reconstruct parts of the body has increased, so has the potential for complications 
associated with the replacement devices used to do so. Some of the most significant complications 
associated with replacement devices, such as vascular prostheses, cardiac valves, and artificial joints, are 
caused by infections at the implantation site. It is well known that the presence of a foreign material 
impairs host defenses and that prolonged infections cannot be cured unless the foreign material is 
removed from the site of infection. As the trachea, larynx, and esophagus are located at sites facing the 
"external environment," these prostheses are exposed to a high risk of infections and severe complications. 
The development of an artificial trachea, esophagus, and larynx is way behind that of artificial vascular 
grafts even though they are all tubular organs. In this chapter, we review conventional prostheses and 
their problems and limitations and discuss the current state of developments in this field. 

136.1 Tracheal Replacement Devices 

End-to-end anastomosis has been one of the standard operations for tracheal reconstruction. However, 
in patients in whom a large length of trachea has to be resected, this procedure is often difficult (the 
resectable limit is now considered to be approximately 6 cm). In such patients, alternative methods are 
required to reconstruct the airway. Such reconstructive methods can be classified into the following three 
categories: (1) reconstruction with autologous tissue, (2) reconstruction with nonautologous trachea 
(that is, transplantation), and (3) reconstruction with artificial material. The only one of these operative 
techniques which achieves good clinical results in the long term is the first (reconstruction with autolo- 
gous tissue), particularly cervical tracheal reconstruction using an autologous skip flap conduit. 

Designs of Artificial Tracheae 

The artificial tracheae developed previously were designed according to one of two concepts. One is that the 
implanted prosthesis alone replaces the resected area of trachea, and the inner surface of the reconstructed 
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FIGURE 136.1 Neville artificial trachea constructed with a nonporous silicone tube with Dacron suture rings. 

trachea is not endothelialized. The other is that the implanted prosthesis is incorporated by the host 
tissue and its inner surface is endothelialized. These two types of prosthesis are called nonporous and 
mesh types, respectively, which reflect the materials from which they are made. 

Nonporous Tube — Type Artificial Tracheae 

The study of nonporous tube-type trachea has a long history, and many materials have been tried 
repeatedly for artificial tracheae. However, the results have been unsatisfactory, and now only one 
prosthesis of this type remains on the market, the Neville artificial trachea, which comprises a silicone 
tube with two suture rings attached to each of its ends (Fig 136.1). Neville used this trachea in 62 patients, 
from 1970 to 1988, some of whom survived for a long time [1]. However, late complications, such as 
migration of the artificial trachea and granular tissue formation at the anastomosis, were inevitable in 
many cases and occurred within several months and even as late as 2 years after implantation. Therefore, 
the Neville prosthesis is not used for patients with benign tracheal disease, because clinically, even if a 
large portion of the trachea up to the bifurcation has to be resected, the airway can be reconstructed easily 
by tracheotomy using a skin flap with sternal resection. For the alleviation of tracheal stenosis, silicone 
T-tubes (Fig. 136.2) are widely indicated. The major advantage of such nonporous tubular prostheses is 
that airway patency can be ensured. Therefore, for patients for whom end-to-end anastomosis is impos- 
sible, nonporous prostheses may be used at last resort to avoid threatened suffocation (asphyxiation). 
Tracheal replacement using the Neville artificial trachea requires the following operative procedure: 

1. Right-sided posterolateral skin incision and 4th or 5th intercoastal thoracotomy. Up to the stage of 
tracheal resection, the operative procedure is similar to that for standard tracheal resection. 
However, because the tension at anastomosis is not as high as that with an end-to-end anastomosis, 
neither hilar nor laryngeal release is often necessary. 

2. Reconstruction with an artificial trachea. After resection of the tracheal lesion, the oral intubation 
tube is drawn back, and the trachea is reintubated via the operative field (Fig. 136.3). In patients 
who require resection that reaches to near the bifurcation, the second intubation tube should be 
placed in the left main bronchus. An artificial trachea with a diameter similar to that of the tracheal 
end is used. Any small differences in their diameters can be compensated for by suturing. Anas- 
tomosis is carried out using 4-0 absorbable sutures, 2 mm apart, with interrupted suturing. The 
tracheal sutures are attached to the tracheal cartilage. On completion of the anastomosis on one 
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FIGURE 136.2 Silicone T-Tube. 
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FIGURE 136.3 Operation proceeding of tracheal reconstruction using an artificial trachea. 

side, the oral intubation tube is readvanced, and the other side is anastomosed under oral venti- 
lation. After anastomosis, an air leakage test at pressure of 30 cmH 2 is carried out. In order to 
avoid rupture of the great vessels near the implanted artificial trachea, they should not be allowed 
to touch each other. In cases at risk of this occurring, wrapping of the artificial trachea with the 
grater omentum is also recommended. 
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3. Postoperative care. Frequent postoperative boncofiberscopic checks should be performed to ensure 
sputum does not come into contact with the anastomosis sites. The recurrent laryngeal nerves on 
both sides are often injured during the operation, so the movement of the vocal cord should be 
checked at extubation. 

The major reason for the poor results with the Neville tracheal prosthesis during long-term follow- 
up was considered to be the structure of the prosthesis, that is, the nonelasticity of the tube and suture 
rings. A variety of improvements to this prosthesis have been tried, especially at the areas of interface 
with the host tissue by suture reinforcement with mesh skirts, increasing the flexibility of the tube, and 
application of hydroxyapatite to the suture rings. However, the problems at the anastomosis have not 
been conquered yet. 

In attempt to overcome the problems described above, a new mesh-type artificial trachea has been 
designed that is intended to be incorporated into the host tissue so that eventually there is no foreign- 
body-external-environment interface. 

Mesh-Type Artificial Tracheae 

Porous artificial tracheae are called mesh-type because the prosthetic trunk is made of mesh. In the 1950s, 
several trials of tracheal reconstruction using metallic meshes made of tantalum and stainless steel were 
conducted. In the 1960s, heavy Marlex mesh was used clinically for tracheal reconstruction, and good 
short-term results were reported. However, long-term observations showed that this mesh caused rupture 
of the adjacent graft vessels, which was fatal, so it fell gradually out of use for tracheal reconstruction. 
When used clinically, because the heavy mesh was rough and not air-tight, other tissues, such as 
autografted pericardium, fascia, or dura mater, were applied to seal it until the surrounding tissue grew 
into it and made it air-tight. The pore size of materials conventionally used for artificial vessels, such as 
expanded PTFE (polytetrafluoroethylene, pore size of 15 ~ 30(0.m), is so small that the host tissue cannot 
penetrate the mesh, which is rejected eventually. The optimal pore size for tracheal replacement mesh is 
200 ~ 300 u.m. Fine Marlex mesh is made of polypropylene with a pore size 200 ~ 300 |J.m (Fig. 136.4). 
It is now widely used clinically for abdominal wall reconstruction and reinforcement after inguinal 




FIGURE 136.4 Marlex Mesh, fine (left) and heavy (right) (division of the scale; mm). 
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FIGURE 136.5 New artificial trachea made from collagen-conjugated fine Marlex mesh. 



herniation. Collagen-grafted fine Marlex mesh is air-tight, and clinically, good tissue regeneration is 
achieved when it is used to patch-graft of the trachea. The grafted collagen has excellent biocompatibility 
and promotes connective tissue infiltration into the mesh. However, the fine mesh alone is too soft to 
keep the tube open, so a tracheal prosthesis was made of collagen-grafted fine Marlex mesh reinforced 
with a continuous polypropylene spiral (Fig. 136.5). In dogs, complete surgical resection of a 4-cm length 
of trachea, which was replaced with a 5-cm long segment of this type of artificial trachea, was performed, 
and the prostheses were incorporated completely by the host trachea and confluent formation of respi- 
ratory epithelium on each prosthetic lumen was observed (Fig. 136.6) [2]. These results indicate that this 
artificial trachea is highly biocompatible and promising for clinical application. 




FIGURE 136.6 Macroscopic inner view of the reconstructed trachea 6 months after operation. Inner surface is 
covered with smooth and lustrous soft regenerated tissue. 
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136.2 Laryngeal Replacement Devices 

Total laryngectomy is one of the standard operations for laryngeal carcinomas. As radiation therapy and 
surgery have progressed, the prognosis associated with laryngeal carcinoma has improved. The curability 
of total laryngeal carcinoma is now almost 70%, and therefore many patients survive for a long time 
after surgery. Individuals who have undergone laryngectomy are called laryngectomees or laryngetomized 
patients, and for them, laryngeal reconstruction is of the utmost importance. However, because the larynx 
is situated just beneath the oral cavity, where the danger of infection is high, successful reconstruction 
with foreign materials is very difficult. As yet, no total replacement device for the larynx has been 
developed, and laryngeal transplantation, although apparently feasible, is still at the animal experimental 
stage. 

The larynx has three major functions: (1) phonation, (2) respiration, and (3) protection of the lower 
airway during swallowing. Of these, phonation is considered to be the most important. The conventional 
so-called artificial larynx can only substitute phonation. A variety of methods have been developed to 
recover phonation after total larygectomy, which is called vocal rehabilitation. Methods for vocal reha- 
bilitation are classified as (1) esophageal speech, (2) artificial larynx, and (3) surgical laryngoplasty. Two- 
thirds of laryngectomees learn esophageal speech as their means of communication. For the rest, an 
artificial larynx and surgical laryngoplasty are indicated. The typical devices are the pneumatic and 
electrical larynx which are driven by the expiratory force and electric energy, respectively. Tracheo- 
esophageal (T-E) fistula with voice prosthesis is the most popular method in surgical laryngoplasty. 
Fig. 136.7 illustrates the mechanical structures of typical artificial larynxes. 

Pneumatic Larynxes 

The first pneumatic mechanical device was developed by Tapia in 1883. Several variations of pneumatic 
larynxes are now used. In Fig. 136.8, the pneumatic device uses expired air from the tracheo stoma to 
vibrate a rubber band or reed to produce a low frequency sound, which is transmitted to the mouth via 
a tube. Pneumatic transoral larynxes produce excellent natural speech, which is better than that with 
other artificial larynxes, but their disadvantages are that they are conspicuous and that regular cleaning 
and mopping of saliva leakage is necessary. 

Electric Artificial Larynxes 

The transcervical electrolarynx is an electric, handheld vibrator that is placed on the neck to produce 
sound (Fig. 136.9). The frequency used is 100 ~ 200 Hz. The vibrations of the electrolarynx are conducted 
to the neck tissue and create a low-frequency sound in the hypopharynx. This is the most popular artificial 
larynx. The transoral artificial laryngeal device is a handheld electric device that produces a low-pitched 
sound which is transmitted to the back of the mouth by a connecting tube placed in the patient's mouth. 
As microelectronic science progresses, great hopes of applying microelectric techniques to the artificial 
larynx are now entertained, and some devices have been designed, although implantable laryngeal 
prostheses have not achieved widespread use. 

Voice Prostheses 

As well as the artificial larynxes described above, tracheo-esophageal (T-E) fistula prostheses are now 
widely used for vocal rehabilitation, and excellent speech and voice results have been achieved. In 1980, 
Singer and Blom developed and introduced the first simple method, which is called Blom and Singer's 
voice prosthesis. The principle of the tracheo-esophageal fistula technique is to shunt expired pulmonary 
air through a voice prosthesis device into the esophagus to excite the mucosal tissue to vibrate. A fistula 
is made by puncturing the posterior wall of the trachea 5 mm below the upper margin of the tracheal 
stoma, and when a patient speaks, he/she manually occludes the stoma to control the expiratory flow 
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FIGURE 136.7 (a) Sagital views of the laryngectomee (left) and esophageal speech (right). Air flow from the 
esophagus makes the sound, (b) Pneumatic larynx (reed type) (left); electric artificial larynx (transcervical type), 
(center); voice prosthesis (T-E shut) of Blom & Singer method, right. 

through the fistula to the oral cavity. The voice prosthesis has a one-way value to prevent saliva leakage 
(Fig. 136.10). 

136.3 Artificial Esophagi (Esophageal Prostheses) 

In patients with esophageal cancer, the esophagus is resected and reconstructed using a piece of pedic- 
ulated alimentary tract, such as the gastric conduit, colon, or ileum. However, in some cases, it is 
impossible to use autologous alimentary tract, for example, in patients who have undergone gastrectomy. 
In such cases, an artificial esophagus is indicated. However, with the exception of extracorporeal-type 
esophagi and intraesophageal stent tubes, the esophageal replacement devices developed have remained 
far from useful in clinical reconstructive practice. The main reason is anastomosis dehiscence, which 
often leads to fatal infections, as well as prosthetic dislodgment, migration, and narrowing at a late stage. 
The conventional artificial esophagi now used in the clinic can be broadly classified into two types: 
extracorporeal and intraesophageal stents. Extracorporeal artificial esophagi are used as bypasses from 
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FIGURE 136.8 Pneumatic larynxes: Myna (left) (Nagashima Medical Instrument Tokyo, Japan) and Okumura 
Artificial larynx (Okumura, Osaka, Japan) (right). Arrow marks indicate the portions of reed and rubber band, 
respectively. 




FIGURE 136.9 Electric artificial larynx (transcervical type): Servox (Dr Kuhn & Co. GmBH, Koln, Germany). 
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FIGURE 136.10 Voice prosthesis (Bivona, Indiana, USA). This valved tube is inserted into a surgically placed 
tracheo-esophageal fistula for voice restoration following laryngectomy. 





(a) \ / ' (b) 

FIGURE 136.11 Extracorporeal artificial esophagus (Tokyo University type). 




cervical esophageal to gastric fistulae (Fig. 136.11). They are used during the first stage of two-stage 
esophageal reconstruction. They are made of latex rubber or silicone, but since the development of IVH 
method, their use is now extremely rare. 

Intraesophageal stent types of artificial esophagus are made from a rubber or plastic tube, which is 
inserted in the stenotic part of the esophagus (Fig. 136.12). They are used only for unresectable esophageal 
carcinoma, i.e., they are not indicated for resectable cases. Therefore, conventional artificial esophagi are 
only palliative devices for use as stop-gap measures. 
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FIGURE 136.12 Intraesophageal type of artificial esophagus. (Sumitomo Bakelite Co., Ltd., Tokyo, Japan). 



Development of New Artificial Esophagi 

In contrast to the palliative artificial esophagi, the ideal artificial esophagus would replace the resected 
part of the esophagus by itself. The artificial esophagi of this type are classified according to the materials 
from which they are made, namely natural substances, artificial materials, and their composites (hybrids). 

Artificial Esophagi Made of Natural Substances 

The first report of an artificial esophagus made of a natural substance was the skin conduit developed 
by Bircher in 1907. Subsequently, esophageal reconstruction using a variety of natural substances — muscle 
fasciae, isolated jejunum, autologous aorta, aortic homograft, autologous esophagus, connective tissue 
conduit, trachea, and freeze-dried dura mater homograft — has been reported. However, non of these has 
overcome the problems of stenosis, which necessitates continuous bougieing, and other complications 
at the anastomosis sites. 

Artificial Esophagi Made of Artificial Materials 

The first trial of an esophagus made of an artificial material was carried out by Neuhof, who, in 1922, 
used a rubber tube to reconstruct the cervical esophagus. Subsequently, several artificial materials have 
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been tried repeatedly, such as polyethylene, Dacron, stainless steel mesh, tantalum mesh, dimethylopol- 
ysiloxane, polyvinylformal sponge, Teflon, nylon, silicone rubber, collagen-silicone, Dacron-silastic, acryl- 
resin, and expanded PTFE tubes, but all the trials ended in failure. These materials were foreign bodies, 
after all, and the host tissues continuously rejected them, which caused anastomotic dehiscence followed 
by infections. Even rare cases, whose artificial esophagi escaped early rejection, did not avoid the late 
complications, such as prosthetic migration and esophageal stenosis. 

Hybrid Artificial Esophagi 

Cultured cells are now widely used for hybrid artificial organs, especially for metabolic organs, such as 
the liver and pancreas. A hybrid artificial esophagus comprising a latissimus dorsi muscle tube, the inner 
surface of which is epithelialized by human cultured esophageal epithelial cells, is being studied by the 
Keio University group [3]. Human esophageal epithelial cells cultured on collagen gel for 10 days were 
transplanted onto the surfaces of the lattisimus dorsi muscles of athymic mice, and new epithelial cells 
were observed, which indicates it may be possible to develop an artificial esophagus incorporating 
cultured epithelial cells. 

New Concepts in Artificial Esophageal Design 

The previous studies on artificial esophagi are a history of how to prevent the host tissue from recognizing 
the artificial esophagus as a foreign body, and all the trials, without exception, resulted in failure. 
Currently, an artificial esophagus made according to a completely new design concept is undergoing 
trials. The outer collagen sponge layer of the prosthesis is intended to be replaced with host tissue over 
a period of time, and its inner tube acts as a palliative (temporary) stent until the outer layer has been 
replaced by host tissue. This type of artificial esophagus comprises an inner silicone tube and an outer 
tube of collagen sponge (Fig. 136.13), which is nonantigenic and has excellent biocompatibility, as well 
as promoting tissue regeneration in the form of an extracellular matrix. In dogs implanted with a 5-cm 
length of this artificial esophagus, esophageal regeneration was accomplished in 3 weeks, and all the early 
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FIGURE 136.13 An artificial esophagus made of collagen sponge which is intended to be replaced with host tissue. 
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FIGURE 136.14 Pathologic finding of a regenerated esophageal tissue in a dog using the artificial esophagus 
(Fig. 136.13). Continuous epithelial layer and the muscle and esophageal gland were regenerated at the interposed 
area. 

complications at the anastomosis were overcome. However, in dogs from which the inner stents were 
removed as soon as mucosal regeneration was accomplished, stenosis developed rapidly, and the regen- 
erated esophagus began to constrict. Such stenosis and constriction could not be overcome, even when 
autologous buccal mucosal cells were seeded onto the collagen sponge. Accordingly, on the basis of the 
hypothesis that stenosis and constriction depend on the maturity of the regenerated submucosal tissue, 
stent removal was postponed for at least 1 week after mucosal epithelialization of the artificial esophagus 
was complete. Late stenotic complications did not occur, and regeneration of muscle tissue and esophageal 
glands was observed (Fig. 136.14) [4]. The regenerated esophagus showed adequate physiologic function 
and pathologically satisfactory results. The limit to the length of the esophagus that can be resected 
successfully was reported to be 9 cm, and the longest successful artificial esophagus developed hitherto 
was 5 cm. In order to achieve more widespread use, artificial esophagi which can be used for longer 
reconstructions are needed. Although still at the stage of animal experiments, the long-term observations 
indicate that typical difficulties, such as anastomotic leakage, ablation, and dislocation of the prosthesis, 
have been overcome, which suggests this type of prosthesis will have a promising future in clinical practice. 

Defining Terms 

Collagen: A main supportive protein of connective tissue, bone, skin, and cartilage. One-third of protein 

of vertebrate animal consists of collagen. 
End-to-end anastomosis: An operative union after resecting the lesion, each end to be joined in a plane 

vertical to the ultimate flow through the structures. 
Expanded PTFE (polytetrafluorethylene): PTFE is polymer made from tetrafluoroethylene (CF 2 = 

CF 2 ), with the structure of -CF 2 -CF 2 -. It provides excellent stability chemically and thermally. 

Teflon is the trade mark of PTFE of Du Pont Co. Expanded PTFE has a microporous structure 

which has elasticity and antithrombogenisity in the body and is medically applied for vascular 

grafts or surgical seats. 
Extracellular matrix (ECM): The substances which are produced by connective tissue cells. The two 

major components of the extracellular matrix are collagen and proteoglycans. There are a number 

of other macromolecules which provide important functions such as tissue growth, regeneration 

or aging. 
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Heavy Marlex mesh and fine Marlex mesh: Surgical mesh is used for reconstruction of the defect that 
results from massive resection. Most popular surgical meshes are Marlex meshes. Heavy mesh is 
made of high-density polyethylene and has been used for reconstruction of chest wall. Fine mesh 
is made of polypropylene and is now widely used for abdominal wall reconstruction or reinforce- 
ment of inguinal herniation operation. 

Hilar release and laryngeal release: The pulmonis hilus is the depression of the mediastinal surface of 
the lung where the blood vessels and the bronchus enter. The hilus is fixed by the pulmonary 
ligament to downward. In order to reduce the tension at the tracheal anastomoses, the pulmonary 
ligament is released surgically. This method is called hilar release. At the resection and reconstruct 
of upper trachea, the larynx can be also released from its upper muscular attachment. This method 
is called laryngeal release and up to 5 cm of tracheal mobilization may be achieved. 

Hydroxyapatite: An inorganic compound, Ca 10 (PO 4 ) 6 (OH) 2 , found in the matrix of bone and the teeth 
which gives rigidity to these structures. The biocompatibility of hydroxyapatite has attracted special 
interest. 

IVH (intravenous hyperalimentation): An alimentation method for patients who cannot eat. A cath- 
eter is placed in the great vessel, through which high concentration alimentation is given contin- 
uously. 

Silicone T-tube: A self-retaining tube in the shape of a T which is made of silicone. Tracheal T-tube is 
used popularly for tracheal stenotic disease and serves both as a tracheal stent and tracheotomy 
tube. One side branchi of T-tube projects from the tracheotomy orifice. 
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Further Information 

Proceedings of the American Society of Artificial Internal Organs Conference are published annually by 
the American Society of Artificial Internal Organs (ASAIO). These proceedings include the latest devel- 
opments in the field of reconstructive devices each year. The monthly journal Journal of Thoracic and 
Cardiovascular Surgery reports advances in tracheal and esophageal prosthetic instruments. An additional 
reference is H.F. Mahieu, "Voice and speech rehabilitation following larygectomy, Groningen," the Neth- 
erlands, Rijksuniversiteit Groningen, 1988. 
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Conclusions 



Blood is a key component of surgery and treatment of injury, and its availability is critical for survival in the 
presence of severe blood losses. Its present use under conditions of optimal medical care delivery is one unit 
(0.5 L) per 20 person-year. On a world-wide basis, however, its availability and use is limited to one unit per 
100 person-year according to statistics from the World Health Organization. The gap between current optimal 
need and actual availability is further aggravated by the fact that the blood supply in many locations is not 
safe. Even in conditions of optimal testing and safeguards, blood per se has a number of inherent risks [Dodd, 
1992], ranging from about 3% (3 adverse outcomes per 100 units transfused) for minor reactions, to a 
probability of 0.00 1 % of undergoing a fatal hemolytic reaction. Superimposed on these risks is the possibility 
of transmission of infectious diseases such as hepatitis B (0.002%) and hepatitis non-A non-B (0.03%). The 
current risk of becoming infected with the human immunodeficiency virus (HIV) is about 1 chance in 
400,000 under optimal screening conditions. These are risks related to the transfusion of one unit of blood, 
and become magnified in surgical interventions requiring multiple transfusions. 
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The HIV epidemic caused many fatalities due to blood transfusion before stringent testing was intro- 
duced in 1984. The danger of contamination of the blood supply led the U.S. military to develop blood 
substitutes starting in 1985 with the objective of finding a product for battlefield conditions that was free 
of contamination, could be used immediately, and did not need special storage. The end of the Cold 
War, changes in the nature of military engagements, the slow progress in the development of a blood 
substitute, and the fact that blood testing caused the blood supply to be again safe and abundant in both 
the U.S. and Europe lowered the interest of the U.S. military in artificial blood, and the development 
passed almost exclusively into the hands of private industry. 

While about a dozen products have been tested in humans, at present only three are in clinical trials 
for blood replacement. These products have side effects whose relevance is difficult to assess in the absence 
of massive clinical trials. A side effect common to hemoglobin-based products is hypertension, which 
was proposed to be of additional therapeutic value in the treatment of hypotensive conditions associated 
with severe blood losses and hemorrhagic shock. However, extensive clinical trials led by Baxter Health- 
care, with their product HemAssist™ showed that their product caused twice the mortality found with 
the use of conventional therapies for volume resuscitation. This unfortunate outcome was fully predicted 
by results in academic research showing that hypertension per se causes a severe impairment of microvas- 
cular function. 

The key transport parameters that determine the exchange of oxygen in the microcirculation were, 
until recently, incompletely understood. Most experimental studies underlying the development of arti- 
ficial blood emphasized systemic, whole organ methods in order to assess efficacy and biocompatibility, 
without quantitative analysis of transport phenomena inherent to blood function, or data from the 
microcirculation. Advances in instrumentation and in vivo methods (Fig. 137.1) during the past few years 
have provided new understanding of how oxygen is distributed to the tissues by the microscopic blood 
vessels, and this information has proven to be crucial in identifying the transport characteristics needed 
for artificial blood to work effectively, and to counteract some of its inherent problems. 




FIGURE 137.1 Experimental hamster skin fold model for the analysis of the microvasculature in the awake animal, 
without anesthesia and after the effects of surgery have subsided. This technology allows to quantify blood flow, 
blood and tissue p0 2 , leukocyte activation, blood vessel permeability to macromolecules, and blood vessel tone 
throughout the arterioles, capillaries, and venules. The tissue under study consists of skeletal muscle and subcutaneous 
connective tissue. Arterial and venous catheters are used to monitor systemic data. 
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137.1 Oxygen Carrying Plasma Expanders and the Distribution 
of Transport Properties in the Circulation 

To the present it has not been possible to obtain an artificial fluid with the same properties as blood, 
primarily due to its cellular nature. Consequently, artificial blood or blood substitutes are plasma expand- 
ing fluids that carry more oxygen than that dissolved in water or plasma. Their physical properties are 
significantly different from those of natural blood, and therefore it is necessary to determine the positive 
or negative effects on circulatory function and tissue oxygenation that may arise from these differences. 

Under normal conditions, the transport properties of blood, such as hydraulic pressure, partial pressure 
of oxygen, oxygen content and viscosity, are distributed (Fig. 137.2). The distribution of transport 
properties of blood is matched to anatomical features of the macro- and microcirculation and set within 
narrow limits throughout the life of the organism. Hydraulic blood pressure, which changes continuously 
in the circulation due to viscous losses, is an example of this situation. Each branching order of the 
vasculature is adapted to recognize a narrow range of hydraulic blood pressures and react if pressure 
changes and the normal set point is not met. The distribution of blood pressure influences the cellular 
composition of blood vessels, and is regulated by the so-called myogenic response [Johnson, 1986], which 
causes vessels to constrict, thus increasing viscous losses, if pressure increases and vice versa. 

The same type of matching is present for the distribution of oxygen tension and blood oxygen content 
in the vasculature. The shape of the oxygen dissociation curve for hemoglobin and the distribution of 
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FIGURE 137.2 Distributed nature of transport properties in the hamster skinfold microcirculation. Arterial blood 
vessels are classified according to their hierarchical position starting with the A small arteries (100 (im and greater 
diameter), and ending with A 4 terminal arterioles. Blood pressure, blood viscosity, and blood p0 2 fall continuously 
from the systemic value. Furthermore, the characteristics of the oxygen saturation of hemoglobin curve correspond 
to specific microvascular locations. This distribution is maintained through the life of an organism and is autoreg- 
ulated by central and peripheral controls, including adrenergic nerve endings which have the highest density in the 
A, arterioles. Introduction of a molecular plasma expander fundamentally changes the distribution of viscosity, blood 
pressure, shear stress, and the availability of oxygen to the vessel wall. Autoregulatory processes sense these changes 
and cause the microcirculation to react. Data on viscosity is derived from [Lipowsky, 1987], data on blood pressure 
is from the hamster cheek pouch from [Joyner et al., 1972], other data is from our studies. 
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enervation of arterioles is such that the knee of the oxygen dissociation curve for hemoglobin is located 
in arterioles that present the highest density of adrenergic enervation [Saltzman et al., 1992]. 

Blood viscosity and blood flow velocity in the circulation are also distributed in such a fashion that 
together with the anatomical features of the local blood vessels, shear stress is caused to be virtually 
uniform throughout the circulation. This is also the consequence of the continuous variation of hema- 
tocrit from the systemic value in the larger blood vessels, to about half this value in the capillaries, due 
to the Faraheus-Lindquist effect (a property due to the cellular composition of blood) and the very strong 
dependance of blood viscosity on hematocrit. In larger arterioles, blood viscosity is about 3.5 to 
4.0 centipoise (cP), while in the smaller arterioles viscosity falls to about that of plasma (1.1 to 1.2 cP). 
This has large effects on shear stress and shear stress dependant release of endothelial dependant relaxing 
factors (EDRF, NO, and prostaglandins), since the circulation is designed to maintain shear stress con- 
stant. In other words, shear stress is also present in a prescribed way in the microcirculation when the 
system is perfused by normal blood. 

A molecular oxygen carrying plasma expander is not subjected to the Fahraeus-Lindquist effect (which 
determines the progressive decrease of hematocrit in the microcirculation), causing the whole microcir- 
culation to be exposed to a uniform viscosity (see the mathematical model) and significantly changing 
shear stress distribution. 

The significance of the mechanical traction at the interface between flowing blood and tissue is that 
shear stress modulates the production of vasoactive materials such as nitric oxide (NO) and prostacyclin 
[Frangos et al., 1985; Kuchan et al., 1994] and therefore is an active control of blood flow and the oxygen 
supply. Furthermore, NO is also a major regulator of mitochondrial metabolism [Shen et al., 1995], 
acting as a brake for tissue oxygen consumption. These considerations show that the transport properties 
of blood are intertwined with the physical and biological regulation of oxygen delivery and the level of 
tissue oxygen consumption. 

137.2 The Distribution of Oxygen in the Circulation 

Oxygenation of tissue as a whole has been extrapolated from the so-called Krogh model, which is focused 
on how gases are exchanged between blood flowing in a cylindrical conduit, the single capillary, and a 
surrounding tissue cylinder. It is generally assumed that most of the oxygen is exchanged at this level, 
implying the existence of large blood/tissue oxygen gradients in a substantial portion of tissue capillaries. 
However, capillary/tissue 2 gradients are maximal in the lung (50 mmHg/Lim) and minimal in the 
tissues (0.5 mmHg/Lim). Most tissue capillaries appear to be in near oxygen concentration equilibrium 
with the tissue; thus, large oxygen gradients are not present, suggesting that capillaries may not be the 
primary mechanism for tissue oxygenation. 

The technology of phosphorescence quenching for measuring oxygen partial pressure optically in the 
microvessels and the surrounding tissue [Torres & Intaglietta, 1994; Wilson, 1993), when used in con- 
junction with the awake hamster skinfold model, allows for in vivo analysis of the assumptions in the 
Krogh model. Current findings indicate that in the hamster skin fold connective tissue and skeletal muscle 
at rest, most of the oxygen is delivered by the arterioles and that little oxygen is contributed by the 
capillaries. In this tissue average capillary p0 2 is about 25 mmHg and venular p0 2 is higher. This indicates 
that at least half of the oxygen in blood exits the blood vessels prior to arrival in the tissue. In summary, 
these results and previous findings from other laboratories [Intaglietta et al., 1996] show that: 

1. Capillary blood p0 2 is only slightly higher (about 5 mmHg) than tissue p0 2 . 

2. Arterio/venous capillary p0 2 differences are very small because tissue p0 2 is essentially uniform, 
and capillaries are close to p0 2 equilibrium with the tissue. 

3. The only tissue domain where p0 2 exhibits large gradients is the immediate vicinity of the 
microvessels (vessels with diameter 80 p.m and smaller), a tissue compartment whose main con- 
stituent is the microvascular wall. 

4. A major portion of blood oxygen exits the circulation via the arterioles. 
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137.3 Oxygen Gradients in the Arteriolar Wall 

The phosphorescence optical technology allows us to make an accurate mass balance between the decrease 
of oxygen content in the arterioles and the diffusion flux of oxygen out of the microvessels determined 
by the oxygen gradients in the surrounding tissue. These measurements made simultaneously inside and 
outside of the vessels show that oxygen exiting from arterioles is driven by steep oxygen gradients at the 
arteriolar wall, which is consistent with the hypothesis that the arteriolar wall is a high metabolism tissue 
and therefore a large oxygen sink [Tsai et al., 1998]. These steep gradients are present in arterioles, but 
not in capillaries and venules. The large oxygen consumption is due to biological activity in the endot- 
helium and smooth muscle [Kjellstrom et al., 1987]. These findings lead to the following conceptualiza- 
tion for the design of artificial blood: 

1. Endothelium and smooth muscle serve as a metabolic barrier to the passage of oxygen from blood 
to tissue, which in part protects the tissue from the high oxygen content (p0 2 ) of blood. 

2. One of the goals of basal tissue perfusion is to supply oxygen to the endothelium and smooth 
muscle. 

3. Oxygenation of working tissue (exercising skeletal muscle) results from three events: 

• Lowering of the vessel wall metabolic barrier. 

• Increased perfusion with oxygenated blood. 

• Deployment of a biochemical process that protects the tissue from high p0 2 levels. 

4. Under basal conditions, tissue capillaries only partially serve to supply oxygen to the tissue. They 
may be a structure to expose the endothelium to blood in order to fulfill the large oxygen demand 
of these cells and provide for the extraction of C0 2 . 

5. The physical and biological properties of blood affect the oxygen consumption of the vessel walls. 

137.4 Mathematical Modeling of Blood/Molecular 
Oxygen Carrying Plasma Expander 

The previous discussion shows that tissue oxygenation is the result of the interplay of quantifiable physical 
events, and therefore it may be subjected to analytical modeling to identify key parameters and set the 
stage for rational design. The critical issue is to understand the changes due to the presence of molecular 
oxygen carriers. These changes are: (a) change in availability of oxygen to the vessel wall due to the 
molecular nature of the carrier; (b) distortion of the pattern of intraluminal oxygen distribution in the 
microcirculation; (c) lowered viscosity and decreased generation of EDRF due to lowered shear stress; 
(d) increased colloid osmotic pressure; and (e) increased vessel wall p0 2 gradient. The mathematical 
model is based on oxygen mass balance in subsequent vessel segments, starting with a major arterial 
vessel and terminating in the capillaries, with the objective of calculating the oxygen tension of blood in 
the capillaries, which should correspond to tissue p0 2 . 

The result of this analysis is the following equation, which expresses how the total loss of oxygen from 
the arterial and arteriolar network K n is related to anatomical and transport features of the blood vessels 
and blood [Intaglietta, 1997]. Total oxygen exit is the summation of losses from n individual vascular 
segments ;'. This equation gives the functional relationship between transport parameters that determine 
capillary blood p0 2 for given changes in the physical properties of blood: 
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where (J. is blood viscosity, F(Htc,C) is the concentration of hemoglobin (red blood cells + molecular), 
m t is the maximum amount of oxygen that can be dissolved in blood under normal conditions, D is the 
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diffusion constant for oxygen in tissue, a is the solubility of oxygen in tissue, g is the oxygen consumption 
by the vessel wall, L t is the length of each vessel segment, d t is the diameter of each segment, and n i is 
the slope of the oxygen dissociation curve for hemoglobin. 

The summation shows two distinct groups of terms. One group is common to all vessel segments and 
includes blood viscosity, hematocrit or blood oxygen carrying capacity, and vessel wall metabolism. The 
second group is a summation where each term is specific to each vascular segment. This expression shows 
that, in principle, better capillary oxygenation results from lowering viscosity. However, increased cap- 
illary oxygenation is a signal that triggers auto regulatory mechanisms which strive to maintain capillary 
oxygen constant through vasoconstriction. Thus, lowering blood viscosity should be expected to lead to 
vasoconstriction. 

The factor g represents vessel wall metabolism which is directly affected by the composition of blood 
and flow velocity. Furthermore g has a basal value representative of the baseline activity necessary for 
the living processes of the tissue in the microvessel wall and alteration of this activity, such as by an 
inflammatory process or increased tone, leads to an increase in tissue metabolism and therefore lowering 
of tissue oxygenation (since the increase in K n lowers capillary p0 2 ). 

137.5 Blood Substitutes and Hemodilution 

In general, it is possible to survive very low hematocrits, corresponding to losses of the red blood cell 
mass of the order of 70%; however, our ability to compensate for comparatively smaller losses of blood 
volume is limited. A 30% deficit in blood volume can lead to irreversible shock if not rapidly corrected. 
Maintenance of normovolemia is the objective of most forms of blood substitution or replacement, 
leading to the dilution of the original blood constituents. This hemodilution produces systemic and 
microvascular phenomena that underlie all forms of blood replacement and provides a physiological 
reference for comparison for blood substitutes. The fluids available to accomplish volume restitution can 
be broadly classified as crystalloid solutions, colloidal solutions, and oxygen carrying solutions. All of 
these materials significantly change the transport properties of blood, and therefore it is important to 
determine how these changes affect tissue oxygenation, a phenomenon that takes place in the microvas- 
culature. Therefore, hemodilution must be analyzed in terms of systemic effects and how these, coupled 
with the altered composition of blood, influence the transport properties of the microcirculation. 

137.6 Hematocrit and Blood Viscosity 

Blood viscosity is primarily determined by the hematocrit in the larger vessels while it is a weaker function 
of the systemic hematocrit in the microcirculation. At a given shear rate, blood viscosity is approximately 
proportional to the hematocrit squared according to the relationship: 

U = fl +bH 2 

" s s 

while microvascular blood viscosity can be empirically described by a relation of the form: 

U=fl +b H 

r mm 

where p. is the blood viscosity in centipoise and a, and the b t are parameters that are shear rate and vessel 
size dependent [Dintenfass, 1979; Quemada, 1978]. In the microcirculation, blood viscosity is relatively 
insensitive to shear rate. These relationships show that when hematocrit is reduced, systemic viscous 
pressure losses decrease much more rapidly than those in the microvasculature, while in the microcir- 
culation the A- V pressure drop is not very much affected. The net result is that if arterial pressure remains 
constant, hemodilution produces a significant pressure re-distribution in the circulation [Mirhashemi 
etal., 1987]. 
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Hemodilution increases central venous pressure, which improves cardiac performance and increases 
cardiac output [Richardson and Guyton, 1959]. This effect causes increased blood flow velocity and 
therefore the maintenance of oxygen delivery capacity, since a lesser number of red blood cells arrive at 
a greater frequency. As a consequence of hemodilution, both systemic and capillary oxygen carrying 
capacities improve up to hematocrit 33% and are at the normal value at arterial hematocrits of the order 
of 27%. The maximum improvement in oxygen carrying capacity is about 10%. 

This behavior of the heart and the circulation as determined by the viscosity of the circulating blood 
has been subjected to exhaustive experimental and clinical verification. The consequence of the adapt- 
ability of the circulation to changes in blood viscosity is that, in general, oxygen delivery capacity is not 
compromised up to red blood cell losses of about 50%. This fact determines the transfusion trigger, 
currently set at about 7 g Hb/dl. It should be noted that up to the transfusion trigger blood losses can 
be corrected with the use of a molecular plasma expander. Furthermore, if artificial blood is based on a 
molecular oxygen carrying material, its introduction in the circulation once the transfusion trigger is 
passed should be analyzed also as a phenomenon of extreme hemodilution. Conversely, introduction of 
an oxygen carrying plasma expander prior to reaching the transfusion trigger should show no improve- 
ment on oxygenation, since this point is the limit for compensatory adjustments in the circulation. 

137.7 Regulation of Capillary Perfusion During Extreme 
Hemodilution in Hamster Skin Fold Microcirculation 
by High Viscosity Plasma Expander 

Although capillaries do not appear to be the determinant structure for the supply of oxygen, our studies 
[Kerger et al., 1996] have shown that maintenance of functional capillary density (FCD, defined as the 
number of capillaries that possess red blood cell transit in a mass of tissue) in shock is a critical parameter 
in determining the outcome in terms of survival vs. non-survival, independent of tissue p0 2 , suggesting 
that extraction of products of metabolism maybe a more critical function of capillaries than oxygenation 
(Fig. 137.3). 

When hemodilution is carried to extreme conditions, defined as the replacement of more than 80% 
of the red blood cell mass, blood viscosity falls to near plasma levels. A concomitant effect is the reduction 
of FCD to near pathological levels. The decrease in FCD observed in blood substitutions with plasma 
expanders is in part due to the lowered viscosity of the resulting blood in circulation. This has been 
demonstrated with conventional plasma expanders such as dextran 70 kDa, where reductions of systemic 
hematocrit were to 75% of control and final plasma viscosities of 1.38 cP lowered FCD to 53% of control, 
while hemodilution with a combination of dextrans of different molecular weights yielding a final plasma 
viscosity of 2.19 cP maintained FCD to near normal levels. In these experiments, vessel wall shear stress 
was not changed after low viscosity hemodilution while it increased by a factor of 1.3 in arterioles and 
2.0 in venules relative to baseline with high viscosity [Tsai et al., 1998] exchanges. Therefore increased 
shear stress dependant release of endothelium derived relaxing factors are a possible mechanism that 
reverses the constrictor effects and decreased FCD due to low blood viscosity. 

137.8 Crystalloid and Colloidal Solutions 
as Volume Expanders 

Crystalloids are among the most widely used fluids for volume replacement. Ringer's lactate, for example, 
is administered in volumes that are as much as three times the blood loss, since the dilution of plasma 
proteins lowers the plasma oncotic pressure causing an imbalance of the fluid exchange favoring microvas- 
cular fluid extravasation and edema. The advantage of crystalloid solutions is that large volumes can be 
given over a short period of time with low danger of increasing pulmonary wedge pressure. Excess volumes 
are rapidly cleared from the circulation by diuresis, which in many instances is a beneficial side effect in 
the treatment of trauma. Blood volume replacement with Ringer's lactate lowers blood viscosity. 



© 2000 by CRC Press LLC 



Survivors 



Non-Survivors 



100 r 




Bsln 60 120 240 B30 B24 



Timepoint 



FIGURE 137.3 Changes of functional capillary density (FCD) in the hamster skin fold model during shock and 
resuscitation. FCD is the number of capillaries with red blood cell transit in a microscopic field of view. Hamsters 
were subjected to 4 h hemorrhagic shock to 40 mmHg and resuscitated with their own blood. FCD was the principal 
microvascular parameter that was the predictor of outcome, while microvascular and tissue p0 2 was the same for 
survivors and non-survivors. This result indicates that resuscitation solutions should ensure the restoration of FCD 
as well as re-establish tissue oxygenation [Kerger et al., 1996]. Bsln: Baseline. Timepoint: Time after induction of 
hemorrhage. B30: 30 min. after resuscitation. B24: 24 h after resuscitation. *: Significantly different from control, p 
> 0.05. +: Significantly different between survivors and non-survivors, p > 0.05. 



The rapid clearance of crystalloid solutions from the intravascular compartment is at the expense of 
the expansion of the tissue compartment and edema. Peripheral edema has been speculated to impair 
oxygen delivery, wound healing, and the resistance to infection. Microvascular flow may be impaired by 
capillary compression from edema. However, given the wide clinical experience with the use of crystal- 
loids, it appears that these are not major effects. A more important effect is the development of pulmonary 
edema, or adult respiratory distress syndrome (ARDS); however, the relationship between the use of large 
volumes of salt solutions and ARDS is not firm. 

Larger colloidal molecular weight materials such as albumin have the advantage of longer retention 
time, they do not extravasate and therefore do not cause edema. Albumin is used as a plasma expander 
for emergency volume restitution, but due to cost considerations the synthetic colloids dextran and 
hydroxyethyl starch are used more frequently. These materials are free of viral contaminations, but may 
cause anaphylactic reactions and have a tendency to alter platelet function thus interfering with hemo- 
stasis. It is not well established whether these materials increase bleeding per se or the effect is due to 
improved perfusion. 

The beneficial effect of this form of volume replacement resides in the high oncotic pressure which 
they generate, maintaining intravascular volume in the amount of about 20 ml of fluid per gram of 
circulating colloid. In this context, dextran 70 kDA and hydroxyethyl starch have the highest retention 
capacity, 22 and 17 ml/g, respectively, while albumin is 15 ml/g. Furthermore, this form of volume 
replacement lowers blood viscosity with all the associated positive transport effects described for hemodi- 
lution [Messmer et al., 1972]. 
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137.9 Artificial Oxygen Carriers 

The amount of oxygen carried in simple aqueous solutions is inconsequential in the overall oxygen 
requirements of the tissue, and therefore when intrinsic oxygen carrying capacity has to be restored, it 
is necessary to include an oxygen carrier. This can be presently accomplished through the use of modified 
hemoglobins which bind oxygen chemically and reversibly, and fluorocarbons which have a high capacity 
for dissolving oxygen. These compounds present important differences which will ultimately determine 
in what form of blood substitution and replacement they will be used. 

Hemoglobin separated from the red cell membrane, i.e., stroma-free hemoglobin, carries oxygen with 
a high affinity. These materials have intrinsically high oxygen carrying capacity; however, they are based 
on a biologically active molecule, which in most instances requires specialized storage procedures. By 
contrast, fluorocarbons carry a limited amount of oxygen under normal atmospheric conditions, but are 
biologically inert, can be stored at room temperature, and are excreted as gas through the lungs. 

137.10 Fluorocarbons 

Fluorocarbons (perfluorochemicals, PFCs) are compounds that have a very high solubility of oxygen and 
their oxygen transport capacity is directly proportional to their concentration and p0 2 . They are formu- 
lated in stable water soluble emulsions that are metabolically inert. These PFC emulsions consist in 
droplets of fluorochemicals in the range of 0.1 to 0.2 p.m diameter coated by a thin film of egg yolk 
phospholipids. These products are heat sterilized and can be stored under ambient conditions ready for 
use. The technology for large scale manufacture is well established and the necessary materials for 
production are well characterized and commercially available. Their half life in the circulation is in the 
range of 6 to 9 h. 

Pure PFCs have oxygen solubilities of the order of 50 ml O 2 /100 ml at 37°C and 760 mmHg oxygen 
pressure. The solubility varies linearly as a function of gas pressure. These materials carry about 30% 
more oxygen than blood when diluted to 60% (the recommenced clinical form of administration) and 
with 100% inspired oxygen 100%. They have been given clinically at a maximal dosage of 2 g PFC/kg 
(vs. 15 g hemoglobin/kg for normal blood in humans) [Lamy et al., 1998] . At this dosage, in an isovolemic 
exchange they increase intrinsic oxygen carrying capacity of blood by about 20%, provided that 100% 
oxygen is inspired. Since normal tissues and the microcirculation operate in the range of 20 to 50 mmHg, 
the effect of this fundamentally altered blood p0 2 and oxygen content on the normal distribution of 
oxygen tension in the circulation is not known. However, since tissues regulate their blood oxygen supply 
and p0 2 , it is unlikely that microvascular blood p0 2 would be much greater than 50 mmHg without 
eliciting autoregulatory adjustments; therefore, in the absence of hypertensive effects it may be assumed 
that microvascular blood p0 2 in the presence of PFCs does not exceed 50 mmHg and therefore in terms 
of tissue oxygen delivery the contribution is small relative to hemoglobin since at p0 2 50 mmHg they 
carry about 2 ml 2 /dl, while hemoglobin carries 15 ml 2 /dl. 

The viscosity of PFCs is only slightly higher than water; therefore, when used in the amounts described 
they behave as low viscosity plasma expanders, and therefore should exhibit some of the beneficial effects 
to be derived from moderate hemodilution. 

137.11 Hemoglobin for Oxygen Carrying Plasma Expanders 

The hemoglobin molecule in solution presents unique characteristics as an oxygen carrying plasma 
expander, since it has a molecular weight similar to that of albumin and therefore comparable intravas- 
cular retention. Its ability to chemically bind oxygen determines that large amounts of oxygen can be 
associated chemically at relatively low p0 2 , and its molecular weight is such that it does not extravasate. 
Consequently, it should provide an extended period of colloid osmotic effects that prevent the passage 
of water into the interstitium and therefore edema. However, when hemoglobin is diluted from the 
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concentration present in red blood cells, the tetrameric molecule tends to spontaneously dissociate into 
smaller molecular weight dimers and monomers which rapidly extravasates from the vascular compart- 
ment. Until recently it was assumed that when hemoglobin is free in the circulation it has an oxygen 
affinity that is too high for tissue oxygenation. As a consequence, strategies for the modification of 
hemoglobin by chemical means at present are aimed at prolonging intravascular retention and reducing 
oxygen affinity [Winslow, 1997]. 

Hemoglobin can be prevented from dissociating by chemically crosslinking the tetramers such as the 
widely used agent glutaraldehyde. Direct crosslinking with this agent produces a spectrum of different 
high molecular weight hemoglobin derivatives since glutaraldehyde is non-specific and reacts with any 
amino group. A more homogenous product can be obtained by reacting human hemoglobin first with 
the 2,3-DPG analog PLP, which reduces oxygen affinity and then polymerizing the compound with 
glutaraldehyde. This product is called PLP-polyHB and has near normal oxygen carrying capacity and a 
plasma half-time retention (in rats) of about 20 h. 

Diaspirins are another group of crosslinking agents. The reaction of native hemoglobin with bis(3,5- 
dibromosalicy)fumarate (DBBF) leads to the linking of the two Ot chains of the hemoglobin molecule, 
producing a compound called OlOCHb which has an oxygen affinity (P 50 ) of 30 mm Hg and an intravascular 
retention time of the order of 12 h in rats. Phosphorylated compounds added to purified hemoglobin 
also change the P 50 value of the resulting compound and have been studied extensively. There is an 
increasing variety of hemoglobin compounds that is being developed; however, at this time OlOCHb is the 
most widely studied compound since it was produced in quantity by the U.S. Army Letterman Institute, 
San Francisco, and a similar compound was developed and used in extensive clinical trials by Baxter 
Healthcare Inc. 

Hemoglobin can be produced as a recombinant protein leading to the construction of either the 
naturally occurring molecule or variants that have different crosslinking and oxygen affinity. It is presently 
possible to produce both alpha and beta chains of human hemoglobin in bacteria, yeast, and transgenic 
animals. While it is in principle possible to produce large amounts of a specific product by these means, 
there remain problems of purification, elimination of other bacterial products, and endotoxins that may 
be expressed in parallel. 

Human hemoglobin genes have been induced into transgenic mammals, leading to the potential 
production of large amounts of human hemoglobin. However, the production is not 100% pure and 
some of the original animal material is also present, causing an important purification problem. This 
problem may be compounded by potential immunogenicity of animal hemoglobin and the potential 
transmission of animal diseases to humans. 

A unique form of hemoglobin modification is the process of pegylation, consisting of attaching 
polyethylene glycol polymeric strands to the surface of the hemoglobin molecule. The resulting product 
termed PEG-hemoglobin interacts with the surrounding water, producing an aqueous barrier that sur- 
rounds the molecule. This phenomenon causes several beneficial effects, namely (1) renders the molecule 
invisible to the immune system; (2) draws water into the circulation increasing blood volume; 
(3) increases molecular size of the molecule, thus further limiting its capacity to extravasate; (4) decreases 
the diffusive oxygen delivery from Hb0 2 by increasing path length; and (5) increases plasma viscosity. 
This type of molecule is presently manufactured by Sangart, Inc., La Jolla, and Apex Bioscience Inc., 
North Carolina, who uses human hemoglobin and Enzon Inc., New Jersey, who uses bovine hemoglobin. 

137.12 Hemoglobin-Based Artificial Blood 

Many hemoglobin-based artificial blood substitutes are being developed which may be classified relative 
to the technology used and whether the hemoglobin is either of human, animal, or bacterial origin. It 
is now established that hemoglobin containing red blood cell substitutes deliver oxygen to the issue and 
maintain tissue function. However, problems of toxicity and efficacy are not yet resolved, and are shared 
by many of the hemoglobin solutions presently available. Renal toxicity is one of the primary problems 
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because hemoglobin toxicity to the kidney is a classic model for kidney failure. The mechanism of toxicity 
includes tubular obstruction, renal arterial vasospasm, and direct toxicity to tubular cells. It is not clear 
whether hemoglobin must be filtered by the kidney to be toxic, and the tolerance of the kidney to different 
types of hemoglobins and concentrations is not established [Paller, 1988]. 

There are many reports of anaphylaxis/anaphylactoid reactions to hemoglobin infusion. This may 
result from heme release from hemoglobin, the presence of endotoxins bound to hemoglobin or pre- 
existing in the circulation, or from oxygen-derived free radicals released after exposure to hemoglobin. 
Vasoconstriction of coronary, cerebral, and renal vessels has also been attributed to the presence of free 
hemoglobin in the circulation, a phenomenon that may be due to the intrinsic capacity of hemoglobin 
to scavenge nitric oxide or metabolic autoregulation. 

Toxicity of hemoglobin may be due to special characteristics of the manufacturing procedure, the 
given molecule, or problems due to purification. It may also be inherent to the presence of large quantities 
of a molecular species for which the circulation has a very limited tolerance when present in solution. 
This problem may be circumvented by separating hemoglobin from the circulation by encapsulating this 
material in a synthetic membrane, in the same way that the red cell membrane encapsulates hemoglobin. 

Encapsulation of hemoglobin has been accomplished by introducing the molecule into a liposome. 
To the present this technology presents several problems, namely (1) poor efficiency of hemoglobin 
incorporation into the liposome; (2) physical instability of liposomes in storage; and (3) chemical 
instability of the lipid-hemoglobin interface leading to lipid peroxidation and hemoglobin denaturation. 
These factors translate in decreased oxygen carrying capacity, increased viscosity of the circulating 
blood/liposome mixture, and reticuloendothelial blockade. This situation notwithstanding, lipid encap- 
sulation may ultimately provide the artificial blood of choice once a quality hemoglobin becomes con- 
sistently available and liposome production can be scaled up to quantities commensurate with demand. 

137.13 Results in the Microcirculation with Blood 
Substitution with oca-Hemoglobin 

The effectiveness of aa-Hb (cell-free o-raffinose cross-linked) as a blood carrying plasma expander was 
tested in the microcirculation by implementing isovolemic hemodilution up to an exchange of 75% of 
the original cell mass. Exchanges with dextran 70 kDa molecular weight were used as control. Experiments 
were carried out in the microcirculation of the hamster window preparation, which was evaluated in 
terms of hemodynamic parameters and oxygen distribution at three successive levels of exchange. The 
oxygen delivery capacity of the microcirculation was determined in terms of the rate of arrival of 
hemoglobin (in red cell cells + molecular) to the capillaries. It was found that using molecular solutions 
of about 10 g/dl concentration, the oxygen delivery capacity of the microcirculation was identical to that 
of dextran 70 for substitutions up to 60% of the red blood cell mass. This was a consequence of the lack 
of increased cardiac output with the Hb solutions, which should be expected when viscosity is lowered, 
as is the case with colloidal solutions (starch and dextran). 

This hemoglobin caused hypertension, and at the highest level of exchange exhibited abnormally low 
tissue p0 2 , which fell from a normal value of 22.4 mmHg to about 5 mmHg. This lowered tissue p0 2 
was accompanied by a significant decrease in functional capillary density, and increased vessel wall 
gradients. Consequently, this hemoglobin does not appear to function as a tissue oxygenator in the 
microcirculation, although the total amount of hemoglobin and its capacity to transport oxygen is similar 
to that of natural blood. The principal reasons for this outcome are hypertension elicited by lowered 
blood viscosity with consequent decrease of production of NO, increased scavenging of NO, and increased 
availability of oxygen to the vessel wall, which promotes autoregulatory responses aimed at limiting over 
oxygenation of the vessel wall. The linkage between blood supply and oxygen demand is due to the 
existence of chemical vasodilator signals transmitted from the tissue cells to the resistance vessels, which 
relax (increase their diameter) causing blood flow to increase when tissue oxygen tension falls below 
critical levels. Conversely, increasing blood oxygen tension over normal levels elicits vasoconstriction. 



© 2000 by CRC Press LLC 



The most critical side effect in terms of microvascular function is the finding that molecular hemo- 
globin solutions significantly lower functional capillary density. 

137.14 Hemoglobin and Nitric Oxide (NO) Binding 

The principal adverse effect of hemoglobin in solution is to produce vasoconstriction. This is based on 
two fundamental observations: (1) Hb has a very high affinity to bind NO at heme sites, and (2) Hb has 
been shown to produce constriction of isolated aortic rings. However, there is no direct proof that these 
mechanisms exists because NO has a very short half-life in the circulation (1.8 ms) [Liu et al., 1998], 
making its detection difficult. Further complicating the story is the observation that NO can also bind 
to hemoglobin at its sulfhydryl residues [Stamler et al., 1997], and that this mechanism might be oxygen- 
linked. It should also be noted that NO binding as a cause for vasoconstriction has been primarily deduced 
from studies that have shown constriction of isolated aortic rings when exposed to hemoglobin solutions. 
However, in vivo the situation is more complicated, particularly because most of the important diameter 
reductions occur in specific microvessels. Alternative explanations are that this effect can be produced 
directly on the blood side of the endothelium, where the presence of molecular hemoglobin in the red 
cell free plasma layer could significantly distort the diffusion field from the endothelial cell, diverting 
NO from smooth muscle into blood. Another effect could arise if the protein leaves the vascular space 
and enters the interstitium, there to interfere with the diffusion of NO from endothelium to its target, 
vascular smooth muscle. Thus, Rohlfs et al. [1997], found that hemoglobins with differing effects on 
mean arterial blood pressure all had similar reaction kinetics with NO. These findings are consistent with 
the autoregulation hypothesis, and suggest that there is more to the story than NO scavenging. 

A different explanation is that altered blood properties in extreme hemodilution affect the production 
of NO and prostacyclin (and other cytokines) in endothelial cells, which is dependent on the maintenance 
of some level of shear stress. Such shear stress, in a flowing system such as blood, is dependent on a 
number of factors, including the viscosity of the solution. Thus, it would be expected that hemoglobin 
solutions with low viscosity, such as OMX-Hb, would decrease NO synthesis and induce vasospasm. In 
fact, vasoconstriction is probably a natural reaction of the organism to reduced blood viscosity. 

The difficulties associated with the measurement of NO directly in biological systems has led to the 
formulation of theoretical models. This type of analysis shows that NO distribution between blood and 
smooth muscle is the consequence of its high diffusivity, the fact that its half-life is limited by its reaction 
with oxygen and hemoglobin, its diffusion into red cells, and the geometry of vessels in the microcircu- 
lation. A tentative conclusion, in part supported by direct studies in the microcirculation, indicates that 
effects should be maximal in arterioles of the order of 30 to 100 p.m in diameter. 

Thus far, the NO physiology has not been completely translated into the design of blood substitutes 
[Winslow, 1998]. The group at Somatogen have designed a series of hemoglobin mutants with a range 
of NO affinity and shown that there is a correlation between NO binding and the blood pressure response 
in rats [Dohertyet al., 1998]. However, Rohlfs et al. [1998], have shown that a different series of chemically 
modified hemoglobins with different vasoactivities have essentially the same NO reactivity. 

137.15 Rational Design of an Oxygen Carrying Molecular 
Plasma Expander 

The products presently in clinical trials reflect the physiological know-how of the of 1970s, when their 
effects could not be analyzed at the level of microscopic blood vessels, blood flow regulation by NO was 
unknown, the theory of microvascular autoregulation had just been formulated, and the endothelium 
was viewed as a passive cellular lining which prevented the extravasation of plasma proteins. At that time, 
systemic experimental findings and the clinical experience with low viscosity plasma expanders used in 
hemodilution established that lowering blood viscosity was a safe and efficacious method for restoring 
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blood volume, particularly if oncotic pressure was maintained within the physiological range through 
the use of colloids. These well-established tenets were incorporated into the design of artificial blood, or 
blood substitute, which was conceived as a fluid of low viscosity (i.e., lower than blood and closer to 
plasma in order to obtain the beneficial effects of hemodilution), moderate oncotic pressure (i.e., of the 
order of 25 mmHg), a right-shifted oxygen dissociation curve, and a concentration of hemoglobin, or 
equivalent oxygen carrying capacity, of 10 g Hb/dl. 

This scenario was firmly established because there were no practical methods for assessing tissue pO, 
clinically, other than direct measurement of venous blood p0 2 , while in experimental conditions tissue 
p0 2 could only be determined by extremely laborious and difficult to implement microelectrode methods. 
Thus, the distribution of oxygen in the tissue, even at the experimental level, was not known and related 
fundamental mechanisms directly involved with survival at the level of the microcirculation were equally 
unknown or misunderstood. 

High resolution methods for measuring tissue p0 2 became available at the beginning of the 1990s, 
and when these were applied to the analysis of the effects of hemoglobin-based blood substitutes, they 
reversed several of the established principles. It was found that oxygen is delivered by the arterioles and 
not the capillaries, and that this rate of delivery was modulated by the consumption of oxygen in the 
arteriolar wall. Vasoconstrictor effects were found to increase the oxygen consumption of the arterioles 
to the detriment of tissue oxygenation. This was particularly true for hemoglobin solutions formulated 
with small molecules, such as the Ota-crosslinked hemoglobin, which also presented another of the 
presumed required properties, namely low viscosity. 

To the present, the vasoconstrictor effect has been attributed to NO scavenging by hemoglobin; 
however, other mechanisms may be involved and superimposed on the scavenging effect. The molecular 
nature of the presently developed hemoglobin solutions determines that the oxygen source in blood is 
very close to the endothelium, since the barrier determined by the plasma layer is no longer present. This 
configuration increases the oxygen availability to the microvascular wall, potentially eliciting autoregu- 
latory vasoconstriction aimed at regulating oxygen delivery. It is now apparent that right-shifted materials 
further increase oxygen availability, thus enhancing the vasoconstrictor stimulus. 

Concerning NO scavenging, it may well be that this shall remain an intrinsic property of the hemo- 
globin molecule dictated by the physical similarity of NO and 2 . In this context, two solutions to the 
problem may be implemented directly at the level of the oxygen carrier: One consists of producing an 
elevated (relative to normal) plasma viscosity, in such a fashion that shear stress dependant NO produc- 
tion remains constant or is elevated. Second, physically separating the hemoglobin molecule from the 
endothelial barrier as may be obtained with a pegylated material may retard hemoglobin-NO interaction. 

There is increasing evidence that low viscosity is of no benefit once the transfusion trigger is passed, 
particularly if combined with vasoconstrictor effects that impede the increase of cardiac output. In fact, 
conditions of extreme hemodilution, as those that obtain purely on the basis of viscosity considerations 
when the red blood cell mass is reduced, can only survive when plasma viscosity is increased, so that 
blood viscosity is near normal, which has the effect of maintaining functional capillary density at basal 
levels. This beneficial effect is further augmented by added production of shear stress dependant NO, 
which induces vasodilation and adequate blood flow. The physiological consequences of introducing 
molecular solutions of hemoglobin in the circulation can be seen in full perspective when the products 
are ranked according to size (Fig. 137.4), where it becomes apparent that side effects such as hypertension 
and retention time are a direct result of molecular dimensions, which in turn determine viscosity and 
diffusivity. 

In these conditions, tissue oxygenation is enhanced by the use of a left-shifted hemoglobin which 
effectively "hides" oxygen from the arterioles and facilitates its unloading in the regions of the microcir- 
culation where oxygen consumption by the vessel wall is smaller. 

Finally, a critical issue is how much hemoglobin is necessary to obtain the desired effects. Experimental 
evidence presently indicates that when the "counter-intuitive" formulation of product is implemented, 
the actual amount of hemoglobin needed may be as low as 3 g Hb/dl (Winslow et al., 1998). This 
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FIGURE 137.4 The physiological reaction to the presence of even small amounts of circulating hemoglobin can be 
predicted by grouping the products according to molecular dimension. Systemic blood pressure rise is maximal for 
the smallest molecules and virtually absent for the largest molecules. The effect on blood viscosity correlates with 
both the rise of blood pressure and the maintenance of functional capillary density, where small molecules, which 
lower blood viscosity, also significantly lower FCD, while the larger molecules, causing blood viscosity to remain 
similar to that of whole blood, have no effect on FCD. Intravascular retention time is also directly proportional to 
molecular size, varying from about 12 h for the smaller molecules to more than 48 h for PEG-hemoglobin. Products 
are identified according to their manufacturer, namely: Baxter Healthcare, Round Lake, IL; Somatogen, Inc., Boulder, 
CO; Northfield Laboratories Inc., Chicago, IL; Hemosol Ltd., Toronto, Canada; Biopure Pharmaceuticals Corp., 
Boston, MA; Apex Bioscience Inc., Research Triangle Park, NC; Enzon Inc., Piscataway, NJ; SANGART Inc., La Jolla, 
CA. (Data from Vandegriff et al., 1997.) 

consideration impacts on the cost of the product and the potential for production, since most hemoglobin 
modifications have at most a 50% yield. This is critical for products based on human blood, where the 
principal source is outdated material, which would impose an inherent limitation on the total production 
capacity if the product is formulated with 7 to 10 g Hb/dl concentration. 

137.16 Conclusions 

Lethality consequent to blood losses results from hypovolemia and not anemia, and a number of plasma 
expanders exist that restitute volume and ensure survival even though they carry oxygen only to the 
extent of its plasma solubility. This is a consequence of the "derived" increase of oxygen carrying capacity 
resulting from lowered viscosity and increased cardiac output. This approach serves as hemoglobin 
concentration reaches the transfusion trigger, beyond which restoration of oxygen carrying capacity is 
needed. 

The use of oxygen carrying plasma expanders made with molecular solutions of hemoglobin, when 
the red blood cell mass is reduced below the transfusion trigger, causes extreme hemodilution associated 
with a significant reduction of blood viscosity and NO production, and reflex vasoconstriction, leading 
to decreased functional capillary density. This combination of events is difficult to survive because 
decreased NO availability increases the intrinsic oxygen consumption of the tissue. An oxygen carrying 
volume replacement with a capacity commensurate to blood is presently the perceived sine qua necessity 
for extreme blood losses. However, new findings indicate that this may not necessarily be the case. 

Recent developments in the understanding of the physiology of extreme hemodilution, and the physical 
events associated with the substitution of red blood cells with molecular hemoglobin solutions have 



© 2000 by CRC Press LLC 



determined a shift in paradigm, indicating that a viable "artificial blood" will be obtained from a 
counterintuitive formulation of the product. In this formulation, viscosity is near normal for blood, the 
dissociation curve is left-shifted, oncotic pressure is high, and the concentration of hemoglobin is in the 
range of 3 to 5 g Hb/dl [Winslow and Intaglietta, 1998]. 
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Defining Terms 

Hemodilution: The replacement of natural blood with a compatible fluid that reduces the concentra- 
tion of red blood cells 
Stroma-free hemoglobin: Hemoglobin derived from red blood cells where all materials related to cell 

membrane and other components within the red blood cells have been removed. 
Liposome: Microscopic phospholipid vesicle used to encapsulate materials for slow release. The use of 

liposomes to encapsulate hemoglobin is a departure from the conventional use of liposomes, 

exploiting encapsulation, and requires their modification to ensure sustained entrapment. 
Oncotic: Refers to colligative properties due to the presence of macromolecules, for instance oncotic 

pressure, which is differentiated from osmotic pressure, which is that due to the presence of all 

molecular species in solution. 
Oxygen carrying capacity: The total amount of oxygen that may be transported by blood or the fluid 

in the circulation. Differentiated from oxygen delivery capacity which involves considerations of 

flow rate. 
Plasma viscosity: Viscosity of blood devoid of cellular elements, a parameter that is critically impacted 

by the introduction of molecular hemoglobin solutions. 
Shear stress: Force per unit area parallel to the vessel wall or traction, experienced by the endothelium 

due to blood flow and blood viscosity. 
Transfusion trigger: Concentration of blood hemoglobin beyond which blood is required to restore 

circulatory function. 
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138.1 The Vital Functions of Skin 

Skin is a vital organ, in the sense that loss of a substantial fraction of its mass immediately threatens the 
life of the individual. Such loss can result suddenly, either from fire or from a mechanical accident. Loss 
of skin can also occur in a chronic manner, as in skin ulcers. Irrespective of the time scale over which 
skin loss is incurred, the resulting deficit is considered life threatening primarily for two reasons: Skin is 
a barrier to loss of water and electrolytes from the body, and it is a barrier to infection from airborne 
organisms. A substantial deficit in the integrity of skin leaves the individual unprotected either from 
shock, the result of excessive loss of water and electrolytes, or from sepsis, the result of a massive systemic 
infection. It has been reported that burns alone account for 2,150,000 procedures every year in the United 
States. Of these, 150,000 refer to individuals who are hospitalized, and as many as 10,000 die. 

Four types of tissue can be distinguished clearly in normal skin. The epidermis, outside, is a 0.1-mm- 
thick sheet, comprising about 10 layers of keratinocytes at levels of maturation which increase from the 
inside out. The dermis, inside, is a 2-5-mm-thick layer of vascularized and innervated connective tissue 
with very few cells, mostly quiescent fibroblasts. The dermis is a massive tissue, accounting for 15-20% 
of total body weight. Interleaved between the epidermis and the dermis is the basement membrane, an 
approximately 20-nm-thick multilayered membrane (Fig. 138.1). A fourth layer, the subcutis, underneath 
the dermis and 0.4-4-mm in thickness, comprises primarily fat tissue. In addition to these basic structural 
elements, skin contains several appendages (adnexa), including hair follicles, sweat glands, and sebaceous 
glands. The latter are mostly embedded in the dermis, although they are ensheathed in layers of epidermal 
tissue. 
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FIGURE 138.1 Schematic view of skin which highlights the epidermis, the basement membrane interleaved between 
the epidermis and the dermis, and the dermis underneath. Only a small fraction of the thickness of the dermis is 
shown. (Redrawn with permission from J. Darnell, J. Lodish, and D. Baltimore, Molecular Cell Biology, Scientific 
American Books, New York, Chapter 5, Fig. 552, 1986.) 

The functions of skin are quite diverse, although it can be argued that the single function of skin is 
to provide a viable interface with the individual's environment. In addition to the specific vital functions 
mentioned above (protection from water and electrolyte loss, and from infection), skin also protects 
from heat and cold, mechanical friction, chemicals, and from UV radiation. Skin is responsible for a 
substantial part of the thermoregulatory and communication needs of the body, including the transduc- 
tion of signals from the environment such as touch, pressure, and temperature. Further, skin transmits 
important emotional signals to the environment, such as paleness or blushing of the face and the emission 
of scents (pheromones). Far from being a passive membrane that keeps the internal organs in shape, 
skin is a complex organ. 

138.2 Current Treatment of Massive Skin Loss 

The treatment of skin loss has traditionally focused on the design of a temporary wound closure. Attempts 
to cover wounds and servere burns have been reported from historical sources at least as far back as 
1500 B.C., and a very large number of temporary wound dressings have been designed. These include 
membranes or sheets fabricated from natural and synthetic polymers, skin grafts from human cadavers 
(homografts, or allografts), and skin grafts from animals (hetero grafts, or xenografts). Although a satis- 
factory temporary dressing helps to stem the tide, it does not provide a permanent cover. Polymeric 
membranes which lack specific biologic activity, such as synthetic polymeric hydrogels, have to be 
removed after several days due to incidence of infection and lack of formation of physiologic structures. 
Patients with cadaver allografts and xenografts are frequently immunosuppressed to avoid rejection; 
however, this is a stop-gap operation which is eventually terminated by removal of the graft after several 
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FIGURE 138.2 Comparision between treatment with the meshed autograft (R) and treatment with the artificial 
skin (L). Autograft is usually meshed before grafting; scar forms in areas coinciding with the open slits of the autograft. 
The artificial skin treatment consists of grafting the excised wound bed with a skin regeneration template, followed 
by grafting on about day 14 with a very thin epidermal autograft. (Photo courtesy of J.E Burke.) 

days. In all cases where temporary dressings have been used, the routine result has been an open wound. 
Temporary dressings are useful in delaying the time at which a permanent graft, such as an autograft, is 
necessary and are therefore invaluable aids in the management of the massively injured patient. 

The use of an autograft has clearly shown the advantages of a permanent wound cover. This treatment 
addresses not only the urgent needs but also the long-term needs of the patient with massive skin loss. 
The result of treatment of a third-degree burn with a split-thickness autograft is an almost fully functional 
skin which has become incorporated into the patient's body and will remain functional over a lifetime. 
Autografts usually lack hair follicles and certain adnexa as well. However, the major price paid is the 
removal of the split thickness graft from an intact area of the patient's body: The remaining dermis 
eventually becomes epithelialized but not without synthesis of scar over the entire area of trauma (donor 
site). To alleviate the problem associated with the limited availability of autograft, surgeons have resorted 
to meshing, a procedure in which the sheet autograft is passed through an apparatus which cuts slits 
into the sheet autograft, allowing the expansion of the graft by several times and thereby extending greatly 
the area of use. An inevitable long-term result of use of these meshed autografts is scar synthesis in areas 
coinciding with the open slits of the meshed graft and a resulting pattern of scar which greatly reduces 
the value of the resulting new organ (Fig. 138.2). An important aspect of the use of the autograft is the 
requirement for early excision of dead tissue and the provision, thereby, of a viable wound bed to "take" 
the autograft. 

The term "artificial skin" has been used to describe a cell-free membrane comprising a highly porous 
graft copolymer of type I collagen and chondroitin 6-sulfate which degrades at a specific rate in the 
wound and regenerates the dermis in dermis-free wounds in animal models and patients (see below: 
dermis regeneration template, DRT). "Skin equivalent" (SE) refers to a collagen lattice which has been 
prepared by contraction of a collagen gel by heterologous fibroblasts ("dermal equivalent" or DE) and 
has subsequently been overlayed with a keratinocyte culture to induce formation of a mature, cornified 
epidermis in vitro prior to grafting of skin wounds. Cultured epithelial autografts (CEA) consist of a 
mature, cornified epidermis which has been produced by culturing keratinocytes in vitro, prior to grafting 
on skin wounds. The major goal of these treatments has been to replace definitively the use of the autograft 
in the treatment of patients with massive skin loss. 
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138.3 Two Conceptual Stages in the Treatment of Massive 
Skin Loss by Use of the Artificial Skin 

Loss of the epidermis alone can result from a relatively mild burn, such as an early exposure to the sun. 
Controlled loss of epidermis in a laboratory experiment with an animal can result from the repeated use 
of adhesive tape to peel off the keratinocyte layers. In either case, the long-term outcome is an apparently 
faithful regeneration of the epidermis by migration of epithelial cells from the wound edge, and from 
roots of hair follicles, over the underlying basement membrane and dermis. It has been shown that the 
epidermis can regenerate spontaneously provided there is a dermal substrate over which epithelial 
migration and eventual anchoring to the underlying connective tissue can occur. 

Loss of the dermis has quite a different outcome. Once lost, the dermis does not regenerate spontane- 
ously. Instead, the wound closes by contraction of the wound edges toward the center of the skin deficit, 
and by synthesis of scar. Scar is a distinctly different type of connective tissue than is dermis. The depth 
of skin loss is, therefore, a critical parameter in the design of a treatment for a patient who has a skin 
deficit. In the treatment of burns, physicians distinguish among a first-degree burn (loss of epidermis 
alone), a second-degree burn (loss of epidermis and a fraction of the thickness of the dermis), and a third- 
degree burn (loss of the epidermis and the entire dermis down to muscle tissue). A similar classification, 
based on depth of loss, is frequently applied to mechanical wounds, such as abrasion. The area of skin 
which has been destroyed needs also to be specified in order to assess the clinical status of the patient. 

A massively injured patient, such as a patient with about 30% body surface area or more destroyed 
from fire through the full thickness of skin, presents an urgent problem to the clinician, since the open 
wound is an ongoing threat to survival. A large number of temporary wound coverings have been used 
to help the patient survive through this period while waiting for availability of autografts which provide 
permanent cover. If autograft is unavailable over a prolonged period while the patient has survived the 
severe trauma, contraction and scar synthesis occur over extensive areas. In the long run, the patient 
therefore has to cope with deep, disfiguring scars or with crippling contractures. Thus, even though the 
patient has been able to survive the massive trauma and has walked out of the clinic, the permanent loss 
of skin which has been sustained prevents, in many cases, resumption of an active, normal life. 

138.4 Design Principles for a Permanent Skin Replacement 

The analysis of the plight of the patient who has suffered extensive skin loss, presented above, leads 
logically to a wound cover which treats the problem in two stages. Stage 1 is the early phase of the clinical 
experience, one in which protection against severe fluid loss and against massive infection are defined 
as the major design objectives. Stage 2 is the ensuing phase, one in which the patient needs protection 
principally against disfiguring scars and crippling contractures. Even though the conceptual part of the 
design is separated in two stages for purposes of clarity, the actual treatment is to be delivered continu- 
ously, as will be become clear below. The sequential utilization of features inherent in stages 1 and 2 in 
a single device can be ensured by designing the graft as a bilayer membrane (Fig. 138.3). In this approach, 
the top layer incorporates the features of a stage 1 device, while the bottom layer delivers the performance 
expected from a stage 2 device. The top layer is subject to disposal after a period of about 10-15 days, 
during which time the bottom layer has already induced substantial synthesis of new dermis. Following 
removal of the top layer, the epidermal cover is provided either by covering with a thin epidermal graft 
or by modifying the device (cell seeding) so that an epidermis forms spontaneously by about 2 weeks 
after grafting. 

Stage 1 Design Parameters 

The overriding design requirement at this stage is based on the observation that air pockets ("dead space") 
at the graft- wound bed interface readily become sites of bacterial proliferation. Such sites can be prevented 
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FIGURE 138.3 Schematic of the bilayer membrane which has become known as the artificial skin. The top layer 
is a silicone film which controls moisture flux through the wound bed to nearly physiologic levels, controls infection 
of the wound bed by airborne bacteria, and is strong enough to be sutured on the wound bed. The bottom layer is 
the skin regeneration template, which consists of a graft copolymer of type I collagen and chondroitin 6-sulfate, with 
critically controlled porosity and degradation rate. About 14 days after grafting, the silicone layer is removed and 
replaced with a thin epidermal autograft. The bottom layer induces synthesis of a nearly physiologic dermis and 
eventually is removed completely by biodegradation. (From Yannas IV, Burke JF, OrgillDP, et al. 1982. Science 215:74.) 

from forming if the graft surface wets, in the physicochemical sense, the surface of the wound bed on 
contact and thereby displaces the air from the graft-tissue interface (Fig. 138.4). It follows that the 
physicochemical properties of the graft must be designed to ensure that this leading requirement is met, 
not only when the graft is placed on the wound bed but for several days thereafter, until the function of 
the graft has moved clearly into its stage 2, in which case the graft-wound bed interface has been 
synthesized de novo and the threat of dead space has been thereby eliminated indefinitely. 

First, the flexural rigidity of the graft, i.e., the product of Young's modulus and moment of inertia of 
a model elastic beam, must be sufficiently low to provide for a flexible graft which drapes intimately over 
a geometrically nonuniform wound bed surface and thus ensures that the two surfaces will be closely 
apposed. In practice, these requirements can be met simply by adjusting both the stiffness in tension and 
the thickness of the graft to appropriately low values. Second, the graft will wet the wound bed if the 
surface energy of the graft-wound bed interface is lower than that of the air- wound bed surface, so that 
the graft can adequately displace air pockets from the air-wound bed surface. Although the measurement 
of a credible value of the surface energy is not a simple matter when the graft is based on certain natural 
polymers in the form of a hydrated gel, the requirement of adequate adhesion can be met empirically 
by chemical modification of the surface or by proper use of structural features such as porosity. 

Third, the moisture flux through the graft must be maintained within bounds which are set by the 
following considerations. The upper bound to the moisture flux must be kept below the level where 
excessive dehydration of the graft occurs, thereby leading to alteration of the surface energy of the graft- 
wound bed interface and loss of the adhesive bond between graft and wound bed. Further, when the 
moisture flux exceeds the desired level, the graft is desiccated, and shrinkage stesses develop which pull 
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FIGURE138.4 Certain physicochemical and mechanical requirements in the design of an effective closure for a 
wound bed with full-thickness skin loss, (a) The graft (cross-hatched) does not displace air pockets (arrows) efficiently 
from the graft- wound bed interface, (b) Flexural rigidity of the graft is excessive. The graft does not deform sufficiently, 
under its own weight and the action of surface forces, to make good contact with depressions on the surface of the 
wound bed; as a result, air pockets form (arrows), (c) Shear stresses x (arrows) cause buckling of the graft, rupture 
of the graft- wound bed bond and formation of an air pocket, (d) Peeling force P lifts the graft away from the wound 
bed. (e) Excessively high moisture flux rate / through the graft causes dehydration and development of shrinkage 
stresses at the edges (arrows), which cause lift-off away from the wound bed. (J) Very low moisture flux / causes fluid 
accumulation (edema) at the graft-wound bed interface and peeling off (arrows). (From Yannas IV, Burke JF. 1980. 
J Biomed Mater Res 14:65.) 

the graft away from the wound bed. An estimate of the maximum normal stress a m can be obtained by 
modeling the desiccating graft in one dimension as a shrinking elastic beam bonded to a rigid surface 



(v 2 -v)e (138.1) 



o m = 0.45a(V 2 

In Eq. (138.1), a is the coefficient of expansion of a graft which swells in water, V, and V 2 are initial and 
final values of the volume fraction of moisture in the graft, and E is Young's modulus of the graft averaged 
over the range V 1 to V 2 , the latter range being presumed to be narrow. If, by contrast, the moisture flux 
through the graft is lower than the desired low bound, water accumulates between the graft and the 
wound bed, and edema results with accompanying loss of the adhesive bond between the two surfaces. 

Stage 2 Design Parameters 

The leading design objectives in this stage are two: synthesis of new, physiologic skin and the eventual 
disposal of the graft. 

The lifetime of the graft, expressed as the time constant of biodegradation t b , was modeled in relation 
to the time constant for normal healing of a skin incision t h . The latter is about 25 days. In preliminary 
studies with animals, it was observed that when matrices were synthesized to degrade at a very rapid 
rate, amounting to t b <§ t h , the initially insoluble matrix was reduced early to a liquidlike state, which 
was incompatible with an effective wound closure. At the other extreme, matrices were synthesized which 
degraded with exceptional difficulty within 3-4 weeks, compatible with t b > t h . In these preliminary 
studies it was observed that a highly intractable matrix, corresponding to the latter condition, led to 
formation of a dense fibrotic tissue underneath the graft which eventually led to loss of the bond between 
graft and wound bed. Accordingly, it was hypothesized that a rule of isomorphous matrix replacement, 
equivalent to assuming a graft degradation rate of order of magnitude similar to the synthesis rate for 
new tissue, and represented by the relation 
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would be optimal. Control of t b is possible by adjustment of the crosslink density of the matrix. 
Equation (138.2) is the defining equation for a biodegradable scaffold which is coupled with, and there- 
fore interacts with, the inflammatory process in a wound. 

Migration of cells into the matrix is necessary for synthesis of new tissue. Such migration can proceed 
very slowly, defeating Eq. (138.2), when fibroblasts and other cells recruited below the wound surface 
are required to wait until degradation of a potentially solid-like matrix has progressed sufficiently. An 
easier pathway to migrating cells can be provided by modifying a solid-like matrix into one which has 
an abundance of pore channels, where the average pore is at least as large as one cell diameter (about 
10 urn) for ready access. Although this rationale is supported by experiment, results with animal studies 
have shown that not only is there a lower limit to the average pore diameter, but there is also an upper 
limit (see below). 

Migration of cells into the porous graft can proceed only if nutrients are available to these cells. Two 
general mechanisms are available for transport of nutrients to the migrating cells, namely, diffusion from 
the wound bed and transport along capillaries which may have sprouted within the matrix (angiogenesis). 
Since capillaries would not be expected to form for at least a few days, it is necessary to consider whether 
a purely diffusional mode of transport of nutrients from the wound bed surface into the graft could 
immediately supply the metabolic needs of the invading cells adequately. The cell has been modled as a 
reactor which consumes a critical nutrient with a rate r, in units of mole/cmVs; the nutrient is transported 
from the wound bed to the cell by diffusion over a distance /, the nutrient concentration at or near the 
surface of the wound bed is c , in units of mole/cm 3 , and the diffusivity of the nutrient is D, in cm 2 /s. 
The appropriate conditions were expressed in terms of a dimensionless number S, the cell lifeline number, 
which expresses the relative importance of reaction rate for consumption of the nutrient by the cell to 
rate of transport of the nutrient by diffusion alone: 



Dc 



(138.3) 



Eq. (138.3) suggests that when S = 1, the critical value of the path length, l c , corresponds to the maximum 
distance along which cells can migrate inside the graft without requiring angiogenesis (vascularization) 
for nutrient transport. The value of l c defines the maximum thickness of graft that can be populated with 
cells within a few hours after grafting, before angiogenesis has had time to occur. 

These conceptual objectives have been partially met by designing the graft as an analog of extracellular 
matrix (ECM) which possesses morphogenetic activity since it leads to partial regeneration of dermis. 
The discovery of the specific ECM analog that possesses this activity has been based on the empirical 
observation that, whereas the vast majority of ECM analogs apparently do not inhibit wound contraction 
almost at all, one of the analogs does. The activity of this analog, for which the term regeneration template 
has been coined, is conveniently detected as a significant delay in the onset of wound contraction. When 
seeded with (uncultured) autologous keratinocytes, an active regeneration template is capable of inducing 
simultaneous synthesis both of a dermis and an epidermis in the guinea pig and in the swine (Yorkshire 
pig). The regeneration is almost complete; however, hair follicles and other skin adnexa are not formed. 
The resulting integument performs the two vital functions of skin, i.e., control of infection and moisture 
loss, while also providing physiologic mechanical protection to the internal organs and, additionally, 
providing a cosmetic effect almost identical to that of intact skin. 

The morphogenetic specificity of the dermis regeneration template depends sensitively on retention 
of certain structural characteristics. The overall structure is that of an insoluble, three-dimensional 
covalently crosslinked network. The primary structure can be described as that of a gra/f-copolymer of 
type I collagen and a glycosaminoglycan (GAG) in the approximate ratio 98/2. The GAG can be either 
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chondroitin 6-sulfate or dermatan sulfate; other GAGs appear capable of contributing approximately 
equal increments to morphogenetic specificity. The collagen fibers lack banding almost completely 
although the integrity of the triple helical structure is retained through the network. The resistance of 
the network to collagenase degradation is such that approximately two-thirds of the mass of the network 
becomes solubilized in vivo within about 2 weeks. The structure of the network is highly porous. The 
pore volume fraction exceeds 95% while the average pore diameter is maintained in the range 20-125 urn. 
The regeneration template loses its activity rapidly when these structural features are flawed deliberately 
in control studies. 

The dermis regeneration template, a porous matrix unseeded with cells, induces synthesis of a new 
dermis and solves this old surgical problem. Simultaneous synthesis of a new, confluent epidermis occurs 
by migration of epithelial cell sheets from the wound edges, over the newly synthesized dermal bed. With 
wounds of relatively small characteristic dimension, e.g., 1 cm, epithelial cells migrating at speeds of 
about 0.5 mm/day from each wound edge can provide a confluent epidermis within 10 days. In such 
cases, the unseeded template fulfills all the design specifications set above. However, the wounds incurred 
by a massively burned patient are typically of characteristic dimension of several centimeters, often more 
than 20-30 cm. These wounds are large enough to preclude formation of a new epidermis by cell 
migration alone within a clinically acceptable timeframe, say 2 weeks. Wounds of that magnitude can be 
treated by seeding the porous collagen-GAG template, before grafting, with at lest 5 x 10 4 keratinocytes 
per cm 2 wound area. These uncultured, autologous cells are extracted by applying a cell separation 
procedure, based on controlled trypsinization, to a small epidermal biopsy. 

Details of the synthesis of the dermis regeneration template, as well as of other templates which 
regenerate peripheral nerves and the knee meniscus, are presented elsewhere in this Handbook (see 
Chapter 13). The dermis regeneration template described in this section was first reported as a synthetic 
skin and as an artificial skin. 

138.5 Clinical Studies of a Permanent Skin Replacement 
(Artificial Skin) 

Clinical Studies 

The skin regeneration template has been tested clinically on two occasions. In the first, conducted in the 
period 1979-1980, one clinical center was involved, and 10 severely burned patients were studied. In the 
second, conducted during 1986-1987, 11 clinical centers were involved, and 106 severely burned patients 
were treated in a prospective, randomized manner. In each case the results have been published in some 
detail. The second study led to a surgical report, a histologic report, and an immunologic report. There 
is now adequate information available to discuss the advantages and disadvantages of this prototype 
artificial skin in the treatment of the severely burned patient. 

The artificial skin used in clinical studies so far consists of the bilayer device illustrated in Fig. 138.3. 
The outer layer is a silicone film, about 100 um in thickness, which fulfills the requirements of stage 1 
of the design (see above), and the inner layer is the skin regeneration template. In these clinical studies 
this device has not been seeded with keratinocytes. Closure of the relatively large wounds by formation 
of an epidermis has been achieved instead by use of a 100-um-thin layer of the patient's epidermis 
(autoepidermal graft). The latter has been excised from an intact area of the patient's skin; the donor 
site can, however, be harvested repeatedly, since the excised epidermis regenerates spontaneously in a 
few days over the relatively intact dermal bed. Briefly, the entire procedure consists of preparation of the 
wound bed prior to grafting by excision of thermally injured tissue {eschar), followed by immediate 
grafting of the unseeded template on the freshly excised wound and ending, 3 weeks later, by replacing 
the outer, silicone layer of the device with a thin epidermal graft. The results of studies with a guinea pig 
model and a swine model have shown that seeding of the dermis regeneration template with fresh, 
uncultured autologous keratinocytes prior to grafting leads to simultaneous synthesis of an epidermis as 
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well as a dermis in about 2 weeks. However, definitive clinical studies of the keratinocyte-seeded template 
have yet to be performed. 

The discussion below focuses on the advantages and disadvantages of the (unseeded) artificial skin, 
as these emerge from clinical observations during the treatment as well as from a limited number of 
follow-up observations extending over several years after the treatment. The controls used in the clinical 
studies included meshed autograft, allograft, and xenografts. Comparative analysis of the clinical data 
will focus on each of the two stages of treatment for the massively burned patient, i.e., the early (acute) 
stage and the long term stage, the conceptual basis for which has been discussed above. 

Clinical Parameters Used in the Evaluation 

The clinical parameters during the early stage of treatment (about 3 weeks) include the take of the graft, 
expressed as a percentage of graft area which formed an adhesive bond of sufficient strength with the 
wound bed and became vascularized. In the case of the artificial skin treatment, two different measures 
of take are reported, namely, that of the bilayer membrane on the freshly excised wound bed and the 
take of the epidermal graft applied on the neodermal bed about 3 weeks later. Another parameter is the 
thickness of dermis that has been excised from the donor site in order to obtain the autograft that is 
used to close the wound definitively. An additional parameter which characterizes the cost of the donor 
site to the patient is the time to heal the donor site. The surgeon's overall qualitative evaluation of the 
treatment (relative to controls) during the early stage is also reported. 

The long-term evaluation has extended at least 1 year in approximately one-quarter of the patients. 
The first long-term parameter is based on the patients' reports of the relative incidence of nonphysiologic 
sensations, including itching, dryness, scaliness, lack of elasticity (lack of deformability), sweating, sen- 
sation, and erythema. The second parameter is based on the physicians' report of the relative presence 
of hypertrophic scarring in the grafted area. A third parameter is the patient's evaluation of the physiologic 
feel and appearance of the donor sites. Finally, there is an overall evaluation and preference of the patients 
for a given grafted site as well as the physicians' evaluation of the same grafted site. 

Short-Term Clinical Evaluation of Artificial Skin 

The median percentage take of the artificial skin was 80%, compared with the median take of 95% for 
all controls. Use of the Wilcoxin Rank Sum Test for the bimodally distributed data led to the conclusion 
that the take of the artificial skin was lower than that of all controls with a p value of <0.0001. The 
reported difference reflected primarily the significantly lower take of the artificial skin relative to the 
meshed autograft. There was no significant difference in take when the artificial skin was compared with 
allograft (Wilcoxin Rank Sum p > 0.10). The take of the epidermal autograft was 86%. 

Mean donor site thickness was 0.325 + 0.045mm for control sites and only 0.15 + 0.0625mm for 
epidermal grafts which were harvested for placement over the newly synthesized dermis; the difference 
was found to be significant by t test with a p value of < 0.00 1 . The thinner donor sites used in the artificial 
skin procedures healed, as expected, significantly faster, requiring 10.6 + 5.8 d compared to 14.3 + 6.9 d 
for control sites, with a p value of < 0.001 by t test. It is worth noting that donor sites used in the artificial 
skin procedure were frequently reharvested sites from previous autografting; reharvested donor sites 
healed more slowly than primary sites. 

The subjective evaluation of the operating surgeons at the conclusion of the acute stage of treatment 
was a response to the question, "Was artificial dermis (artificial skin) advantageous in the management 
of this particular patient?" Sixty-three percent of the comments were affirmative, whereas in 36% of the 
responses, the acute (early) results were believed to be no better than by use of routine methods. The 
physicians who responded positively to the use of artificial skin commented on the ability to use thin 
donor sites that healed quickly, relative to the thicker donor sites which were harvested in preparation 
for an autograft. Positive comments also cited the handling characteristics of the artificial skin relative 
to the allograft as well as the ability to close the wound without fear of rejection while awaiting healing 
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of the donor site. Negative comments included a less-than-adequate drainage of serum and blood through 
the unperforated silicone sheet, the seemingly poor resistance of the artificial skin to infection, and the 
need for a second operation. 

Long-Term Clinical Evaluation of Artificial Skin 

One year after treatment, the allografted and xenografted sites had been long ago covered definitively 
with autograft; therefore, the experimental sites included, in the long term, either autografts (referred to 
occasionally as controls below) or the test sites, comprising the new integument induced as a result of 
treatment with the artificial skin (partially regenerated dermis closed with an epidermal graft). 

The patients reported that itching was significantly less (Wilcoxin Rank Sum test p < 0.02) in the 
artificial skin site than in control sites. Dryness, scaliness, elasticity (deformability), sweating, sensation, 
and erythema were similar at both control and artificial skin sites. Hypertrophic scarring was reported 
to be less in artificial skin in 42% of sites and was reported to be equivalent on test and control sites 57% 
of the time. No patient reported that the artificial skin sites had more hypertrophic scar than the 
autograft ed sites. Even though donor sites that were used during treatment with the artificial skin were 
harvested repeatedly (recropping), 72% of patients reported that these artificial skin donor sites felt "more 
normal," 17% felt that there was no difference, and 11% felt that the control donor site was "more normal." 

The results of the histologic study on this patient population showed that, in sites where the artificial 
skin was used, an intact dermis was synthesized as well as definitive closure of a complete epidermal layer 
with a minimum of scarring had occurred. The results of the immunologic study led to the conclusion 
that, in patients who had been treated with the artificial skin, there was increased antibody activity to 
bovine skin collagen, bovine skin collagen with chondroitin sulfate, and human collagen; however, it was 
concluded that these increased levels of antibodies were not immunologically significant. 

The overall evaluation by the patients showed that 26% preferred the new integument generated by 
use of the artificial skin whereas 64% found that the sites were equivalent and 10% showed preference 
for the autografted site. Physician's overall evaluation showed that 39% preferred the artificial skin site, 
45% found the sites to be equivalent, and 16% preferred the autografted site. 

Clinical Results 

The take of the artificial skin was comparable to all other grafts and was inferior only to the meshed autograft. 
The latter showed superior take in part because meshing reduces drastically the flexural rigidity of the graft 
(see above) leading thereby to greater conformity with the wound bed (see Fig. 138.4). The interstices in 
the meshed autograft also provided an outlet for drainage of serum and blood from the wound, thereby 
allowing removal of these fluids. By contrast, the continuity of the silicone sheet in the artificial skin 
accounted for the increased flexural rigidity of the graft and prevented drainage of wound fluids with a 
resulting increased incidence of fluid accumulation underneath the graft. Fluid accumulation was probably 
the cause of the reduced take of the artificial skin, since immediate formation of a physiochemical bond 
between the graft and the wound bed was thereby prevented (see Fig. 138.4). The development of infection 
underneath the artificial skin, noted by physicians in certain cases, can also be explained as originating in 
the layer of wound fluid which presumptively collected underneath the artificial skin. This analysis suggests 
that meshing of the silicone layer of the artificial skin, without affecting the continuity of the collagen-GAG 
layer, could lead to improved take and probably to reduced incidence of infection. 

The healing time for donor sites associated with use of the artificial skin was shorter by about 4 days 
than for donor sites that were used to harvest autograft. An even shorter healing time for donor sites for 
artificial skin can be realized by reducing the thickness of the epidermal graft which is required to close 
the dermal bed. The average epidermal graft thickness reported in this study, 0.15mm, was significantly 
higher than thicknesses in the range 0.05-0. 07mm, corresponding to an epidermal graft with adequate 
continuity but negligible amount of attached dermis. Increasing familiarity of surgeons with the proce- 
dure for harvesting these thin epidermal grafts is expected to lead to harvesting of thinner grafts in future 
studies. The importance of harvesting a thin graft cannot be overestimated, since the healing time of the 
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donor site decreases rapidly with decreasing thickness of harvested graft. It has been reported that the 
mean healing time for donor sites for the artificial skin reported in this study, about 1 1 days, is reduced 
to 4 days provided that a pure epidermal graft, free of dermis, can be harvested. 

Not only does the time to heal increase, but the incidence of hypertrophic scarring at a donor site also 
increases with the thickness of the harvested graft. This observation explains the higher incidence of 
hypertrophic scarring in donor sites associated with harvesting of autografts, since the latter were thicker 
by about 0.175mm than the epidermal grafts used with the artificial skin. An additional advantage 
associated with use of a thin epidermal graft is the opportunity to reharvest (recropping) within a few 
days; this reflects the ability of epithelial tissues to regenerate spontaneously provided there is an under- 
lying dermal bed. When frequent recropping of donor graft is possible, the surface area of a patient that 
can be grafted within a clinically acceptable period increases rapidly. In the clinical study described here, 
a patient with deep burns over as much as 85% body surface area was covered with artificial skin grafts 
for 75 days while the few donor sites remaining were being harvested several times each. 

In the long term rarely did a patient or a physician in this clinical study prefer the new skin provided 
by the autograft to that provided by the artificial skin treatment. This result is clearly related to the use 
of meshed autografts, a standard procedure in the treatment of massively burned patients. Meshing 
increases the wound area which can by autografted by between 1.5 and 6 times, thereby alleviating a 
serious resource problem. However, meshing destroys the dermis as well as the epidermis; although the 
epidermis regenerates spontaneously and fills in the defects, the dermis does not. The long-term result 
is a skin site with the meshed pattern permanently embossed on it. The artificial skin is a device that, in 
principle, is available in unlimited quantity; accordingly, it does not suffer from this problem (Fig. 138.2). 
The result is a smooth skin surface which was clearly preferred on average by patients and physicians alike. 

It has been established that the artificial skin regenerates the dermis and, therefore, its use leads to 
complete inhibition of scar formation in full-thickness skin wounds. The regeneration is partial because 
skin adenexa (hair follicles, sweat glands) are not recovered. The results of studies of the mechanism by 
which the artificial skin regenerates the dermis in full-thickness skin wounds in animal models have been 
described elsewhere (see Chapter 113, Regeneration Templates). 

Summary 

The artificial skin leads to a new skin which appears closer to the patient's intact skin than does the 
meshed autograft. Take of the artificial skin is as good as all comparative materials except for the unmeshed 
autograft, which is superior in this respect. Donor sites associated with the artificial skin treatment heal 
faster, can be recropped much more frequently, and eventually heal to produce sites that look closer to 
the patient's intact skin than do donor sites harvested for the purpose of autografting. In comparison to 
the allograft, the artificial skin is easier to use, has the same take, does not get rejected, and is free of the 
risk of viral infection associated with use of allograft. 

138.6 Alternative Approaches: Cultured Epithelial 
Autografts (CEA) and Skin Equivalents (SE) 

The use of cultured epithelial autografts has been studied clinically. In this approach autologous epidermal 
cells are removed by biopsy and are then cultured in vitro for about 3 weeks until a mature keratinizing 
epidermis has formed; the epidermis is then grafted onto the patient. The epithelial cells spread and 
cover the dermal substrate, eventually covering the entire wound. Early reports on two pediatric patients 
were very encouraging, describing the life-saving coverage of one-half of body surface with cultured 
epithelial autografts and the technique was eventually used in a very large number of clinical centers in 
several countries. Later studies showed that the "take" of CEA was very good on partial thickness wounds 
but was questionable in full thickness wounds. In particular, blisters formed within 2 weeks in areas 
grafted by CEA, a problem which has recurred persistently in clinical studies. The mechanical fragility 
of the resulting integument resulting from use of CEA has been traced to lack of three structural features 
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which are required for formation of a physiological dermal-epidermal junction at the grafted site, namely, 
the 7-S domain of type IV collagen, anchoring fibrils, and rete ridges. Early studies of the connective 
tissue underlying the CEA grafts have shown lack of a convincing dermal architecture as well as lack of 
elastin fibers. 

In another development, a skin equivalent has been prepared by populating a collagen lattice with 
heterlologous fibroblasts, observing the contraction of the lattice by the cells and finally seeding the 
surface of the lattice with a suspension of epidermal cells from an autologous source or from cell banks. 
The latter attach, proliferate, and differentiate to form a multilayered epidermis in 7 to 10 days of exposure 
to the atmosphere and the resulting skin equivalent is then grafted on wounds. Clinical studies of the 
SE have been limited. In an early study (1988), the SE was used to cover partially full-thickness burn 
wounds covering over 15% of body surface area on eight patients. In every patient grafted with SE, an 
extensive lysis of the SE grafts was observed at the first dressing (48 h). In one patient only, a significant 
percentage of "take" (40%) was observed 14 days after grafting. It was concluded that the SE was not 
completely appropriate to serve routinely as a substitute for the autograft. In a later study (1995), the 
wounds treated were acute, mostly the result of excision of skin cancers. Twelve patients had clinical 
"takes" at the time of grafting and there was no evidence of rejection or toxicity following grafting with 
the SE. The wounds grafted with SE contracted by 10 to 15%, an extent larger than that observed following 
grafting with full-thickness skin. Biopsies of the grafted sites showed formation of scar tissue. The authors 
hypothesized that the SE was eventually replaced by host tissue. Recent studies of the SE have focused 
on patients with venous ulcers; these studies are in progress. 

An attempt has been made to correct the erratic cover provided by split-thickness autografts; the latter 
are normally applied in a meshed form (meshed autograft) and, consequently, fail to cover the entire 
wound bed with a dermal layer. The attempted improvement consisted in grafting underneath the meshed 
autograft a living dermal tissue replacement, consisting of a synthetic polymeric mesh (polyglactin-910) 
which had been cultured in vitro over a period of 2 to 3 weeks with fibroblasts isolated from neonatal 
foreskin. Seventeen patients with full-thickness burn wounds were included in a preliminary clinical trial. 
Epithelialization of the interstices of the meshed autograft led to complete wound closure in 14 days in 
sites where the dermal living replacement had been grafted underneath the meshed autograft (experimental 
sites) and in those where it was omitted (control sites). Take of the meshed autograft was slightly reduced 
when the living dermal tissue replacement was underneath. Basement membrane structures developed 
both in control and experimental sites. Elastic fibers (elastin) were not observed in neodermal tissue either 
in control or experimental sites at periods up to one year after grafting. A subsequent clinical study explored 
the use of this device for the temporary closure of excised burn wounds. This 66-patient multicenter trial 
showed that the biosynthetic skin replacement was equivalent or superior to cadaver skin graft (frozen 
human cadaver allograft) with respect to its ability to prepare wounds for eventual closing with autograft. 

Allograft is a human cadaver skin, which is frequently stored in frozen state in a skin bank. It is a 
temporary cover. If left on the wound longer than about 2 weeks, allograft is rejected by the severely 
burned patient, even though the latter is in an immunocompromised condition. When rejection is allowed 
to occur, the wound bed is temporarily ungraftable and is subject to infection. In a modification of this 
basic use of the allograft, the latter has been used as a dermal equivalent prior to grafting with cultured 
epithelia. Since the allograft is rejected if allowed to remain on the wound long enough for the epithelia 
to spread across the wound bed, the allograft has been treated in a variety of media in an effort to 
eliminate its immunogenicity. 

Defining Terms 

Adnexa: Accessory parts or appendages of an organ. Adnexa of skin include hair follicles and sweat 
glands. 

Allograft: Human cadaver skin, usually maintained frozen in a skin bank and used to provide a tempo- 
rary cover for deep wounds. About 2 weeks after grafting, the allograft is removed and replaced 
with autograft which has become available by that time. Previously referred to as homograft. 
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Artificial skin: A bilayer membrane consisting of an upper layer of silicone and a lower layer of dermis 
regeneration template. The template is a cell-free, highly porous analog of extracellular matrix. 

Autograft: The patient's own skin, harvested from an intact area in the form of a membrane and used 
to graft an area of severe skin loss. 

Basement membrane: An approximately 20-nm-fhick multilayered membrane interleaved between the 
epidermis and the dermis. 

Cell lifeline number: A dimensionless number which compares the relative magnitudes of chemical 
reaction and diffusion. This number, defined as S in Eq. (134.3) above, can be used to compute 
the maximum path length, l c , over which a cell can migrate in a scaffold while depending on 
diffusion alone for transport of critical nutrients that it consumes. When the critical length is 
exceeded, the cell requires transport of nutrients by angiogenesis in order to survive. 

Cultured epithelial autografts: A mature, keratinizing epidermis synthesized in vitro by culturing epi- 
thelial cells removed from the patient by biopsy. A relatively small skin biopsy ( 1 cm 2 ) can be treated 
to yield an area larger by about 10,000 in 2-3 weeks and is then grafted on patients with burns. 

Dermal equivalent: A term which has been loosely used to describe a device that replaces, usually 
temporarily, the functions of the dermis following injury. 

Dermis: A 2-5mm-thick layer of connective tissue populated with quiescent fibroblasts which lies 
underneath the epidermis. It is separated from the former by a very thin basement membrane. 
The dermis of adult mammals does not regenerate spontaneously following injury. 

Dermis regeneration template: A graft copolymer of type I collagen and chondroitin 6-sulfate, average 
pore diameter 20-125 um, degrading in vivo to an extent of about 50% in 2 weeks, which induces 
partial regeneration of the dermis in wounds from which the dermis has been fully excised. When 
seeded with keratinocytes prior to grafting, this analog of extracellular matrix has induced simul- 
taneous synthesis both of a dermis and an epidermis. 

Donor site: The skin site from which an autograft has been removed with a dermatome. 

Epidermis: The cellular outer layer of skin, about 0.1 -mm thick, which protects against moisture loss 
and against infection. An epidermal graft, e.g., cultured epithelium or a thin graft removed surgi- 
cally, requires a dermal substrate for adherence onto the wound bed. The epidermis regenerates 
spontaneously following injury, provided there is a dermal substrate underneath. 

Eschar: Dead tissue, typically the result of a thermal injury, which covers the underlying, potentially 
viable tissue. 

Extracellular matrix: A largely insoluble, nondiffusible macromolecular network, consisting mostly of 
glycoproteins and proteoglycans. 

Isomorphous matrix replacement: A term used to describe the synthesis of new, physiologic tissue 
within a skin regeneration template at a rate which is of the same order as the degradation of the 
template. This relation, Eq. (138.2) above, is the defining equation for a biodegradable scaffold 
which biologically interacts with the inflammatory process of the wound bed. 

Living dermal replacement: A synthetic biodegradable polymeric mesh, previously cultured with fibro- 
blasts, which is placed underneath a conventional meshed autograft. 

Meshed autograft: A sheet autograft which has been meshed and then expanded by a factor of 1.5-6 
to produce grafts with a characteristic pattern. 

Regeneration template: A biodegradable scaffold which, when attached to a missing organ, induces its 
regeneration. 

Scar: The result of a repair process in skin and other organs. Scar is morphologically different from 
skin, in addition to being mechanically less extensible and weaker than skin. The skin regeneration 
template induces synthesis of nearly physiologic skin rather than scar. 

Sheet autograft: A layer of the patient's skin, comprising the epidermis and about one-third of the 
dermal thickness, which has not been meshed prior to grafting areas of severe skin loss. 

Skin: A vital organ which indispensably protects the organism from infection and dehydration while 
also providing other functions essential to physiologic life, such as assisting in thermoregulation 
and providing a tactile sensor for the organism. 
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Skin equivalent: A collagen gel which has been contracted by fibroblasts cultured therein. Following 
culturing with keratinocyte, until a cornified epidermis is formed over the contracted collagen 
lattice, it is grafted on skin wounds. 

Split-thickness autograft: An autograft which is about one-half or one-third as thick as the full thick- 
ness of skin. 

Subcutis: A layer of fat tissue underneath the dermis. 

Synthetic skin: A term used to describe the artificial skin in the early literature. 

Take: The adhesion of a graft on the woundbed. Without adequate take, there is no physicochemical 
or biologic interaction between graft and wound bed. 

Xenograft: Skin graft obtained from a different species: e.g., pig skin grafted on human. Synthetic 
polymeric membranes are often referred to as xenografts. Previously referred to as heterogmft. 
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Engineering advances have resulted in enormous strides in the field of rehabilitation. Individuals 
with reduced or no vision can be given "sight"; those with severe or complete hearing loss can 
"hear" by being provided with a sense of their surroundings; those unable to talk can be aided to 
"speak" again; and those without full control of a limb (or with the limb missing) can, by artificial means, 
"walk" or regain other movement functions. But the present level of available functional restoration for 
seeing, hearing, speaking, and moving still pales in comparison to the capabilities of individuals without 
disability. As is readily apparent from the content of many of the chapters in this Handbook, the human 
sensory and motor (movement) systems are marvelously engineered, both within a given system and 
integrated across systems. The rehabilitation engineer thus faces a daunting task in trying to design 
augmentative or replacement systems when one or more of these systems is impaired. 

Rehabilitation engineering had its origins in the need to provide assistance to individuals who were 
injured in World War II. Rehabilitation engineering can be defined in a number of ways. Perhaps the most 
encompassing (and the one adopted here) is that proposed by Reswick [1982] — Rehabilitation engineer- 
ing is the application of science and technology to ameliorate the handicaps of individuals with disabilities. 
With this definition, any device, technique, or concept used in rehabilitation that has a technological 
basis falls under the purview of rehabilitation engineering. This contrasts with the much narrower view 
that is held by some that rehabilitation engineering is only the design and production phase of a broader 
field called Assistive Technology. Lest one consider this distinction trivial, consider that the U.S. Congress 
has mandated that rehabilitation engineering and technology services be provided by all states; and an 
argument has ensued among various groups of practitioners about who can legally provide such services 
because of the various interpretations of what rehabilitation engineering is. 

There is a core body of knowledge that defines each of the traditional engineering disciplines. Bio- 
medical engineering is less precisely defined; but, in general, a biomedical engineer must be proficient 
in a traditional engineering discipline and have a working knowledge of things biological or medical. 
The rehabilitation engineer is a biomedical engineer who must not only be technically proficient as an 
engineer and know biology and medicine, but must also integrate artistic, social, financial, psychological, 
and physiological considerations to develop or analyze a device, technique, or concept that meets the 
needs of the population the engineer is serving. In general, rehabilitation engineers deal with muscu- 
loskeletal or sensory disabilities. They often have a strong background in biomechanics. Most work in a 
multidisciplinary team setting. 

Rehabilitation engineering deals with many aspects of rehabilitation including applied, scientific, 
clinical, technical, and theoretical. Various topics include, but are not limited to, assistive devices and 
other aids for those with disability, sensory augmentation and substitution systems, functional electrical 
stimulation (for motor control and sensory-neural prostheses), orthotics and prosthetics, myoelectric 
devices and techniques, transducers (including electrodes), signal processing, hardware, software, robot- 
ics, systems approaches, technology assessment, postural stability, wheelchair seating systems, gait anal- 
ysis, biomechanics, biomaterials, control systems (both biological and external), ergonomics, human 
performance, and functional assessment [Robinson, 1993]. 

In this section of the Handbook, we focus only on applications of rehabilitation engineering. The 
concepts of rehabilitation engineering, rehabilitation science, and rehabilitation technology are outlined 
in Chapter 139. Chapter 141 discusses the importance of personal mobility and various wheeled modes 
of transportation (wheelchairs, scooters, cars, vans, and public conveyances). Chapter 142 looks at other 
non-wheeled ways to enhance mobility and physical performance. Chapter 143 covers techniques avail- 
able to augment sensory impairments or to provide a substitute way to input sensory information. 
Conversely, Chapter 144 looks at the output side. 

For the purposes of this Handbook, many topics that partially fall under the rubric of rehabilitation 
engineering are covered elsewhere. These include chapters on Electrical Stimulation (Durand), Hard 
Tissue Replacement — Long Bone Repair and Joints (Goel), Biomechanics (Schneck), Musculoskeletal 
Soft Tissue Mechanics (Lieber), Analysis of Gait (Davis), Sports Biomechanics/Kinesiology (Johnson), 
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Biodynamics (Diggs), Cochlear Mechanics (Steele), Measurement of Neuromuscular Performance Capa- 
bilities (Smith), Human Factors Applications in Rehabilitation Engineering (Strauss and Gunderson), 
Electrical Stimulators (Peckham), Prostheses and Artificial Organs (Galletti), Nerve Guidance Channels 
(Valentini), and Tracheal, Laryngeal, and Esophageal Replacement Devices (Shimizu). 

Rehabilitation engineering can be described as an engineering systems discipline. Imagine being the 
design engineer on a project that has an unknown, highly non-linear plant, with coefficients whose 
variations in time appear to follow no known or solvable model, where time (yours and your client's) 
and funding are severely limited, where no known solution has been developed (or if it has, will need 
modification for nearly every client so no economy of scale exists) . Further, there will be severe impedance 
mismatches between available appliances and your client's needs. Or, the low residual channel capacity 
of one of your client's senses will require enormous signal compression to get a signal with any appreciable 
information content through it. Welcome to the world of the rehabilitation engineer!! 
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Rehabilitation engineering requires a multidisciplinary effort. To put rehabilitation engineering into its 
proper context, we need to review some of the other disciplines with which rehabilitation engineers must 
be familiar. Robinson [93] has reviewed or put forth the following working definitions and discussions: 
Rehabilitation is the (reintegration of an individual with a disability into society. This can be done either 
by enhancing existing capabilities or by providing alternative means to perform various functions or to 
substitute for specific sensations. 

Rehabilitation engineering is the "application of science and technology to ameliorate the handicaps of 
individuals with disabilities" [Reswick, 82] . In actual practice, many individuals who say that they practice 
"rehabilitation engineering" are not engineers by training. While this leads to controversies from prac- 
titioners with traditional engineering degrees, it also has the de facto benefit of greatly widening the scope 
of what is encompassed by the term "rehabilitation engineering." 

Rehabilitation medicine is a clinical practice that focuses on the physical aspects of functional recovery, 
but that also considers medical, neurological and psychological factors. Physical therapy, occupational 
therapy, and rehabilitation counseling ait professions in their own right. On the sensory-motor side, other 
medical and therapeutical specialties practice rehabilitation in vision, audition, and speech. 

Rehabilitation technology (or assistive technology) narrowly defined is the selection, design, or manu- 
facture of augmentative or assistive devices that are appropriate for the individual with a disability. Such 
devices are selected based on the specific disability, the function to be augmented or restored, the user's 
wishes, the clinician's preferences, cost, and the environment in which the device will be used. 

Rehabilitation science is the development of a body of knowledge, gleaned from rigorous basic and 
clinical research, that describes how a disability alters specific physiological functions or anatomical 
structures, and that details the underlying principles by which residual function or capacity can be 
measured and used to restore function of individuals with disabilities. 
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139.2 Rehabilitation Concepts 

Effective rehabilitation engineers must be well versed in all of the areas described above since they 
generally work in a team setting, in collaboration with physical and occupational therapists, orthopedic 
surgeons, physical medicine specialists and/or neurologists. Some rehabilitation engineers are interested 
in certain activities that we do in the course of a normal day that could be summarized as activities of 
daily living (ADL). These include eating, toileting, combing hair, brushing teeth, reading, etc. Other 
engineers focus on mobility and the limitations to mobility. Mobility can be personal (e.g., within a home 
or office) or public (automobile, public transportation, accessibility questions in buildings). Mobility 
also includes the ability to move functionally through the environment. Thus, the question of mobility 
is not limited to that of getting from place to place, but also includes such questions as whether one can 
reach an object in a particular setting or whether a paralyzed urinary bladder can be made functional 
again. Barriers that limit mobility are also studied. For instance, an ill-fitted wheelchair cushion or support 
system will most assuredly limit mobility by reducing the time that an individual can spend in a wheelchair 
before he or she must vacate it to avoid serious and difficult-to-heal pressure sores. Other groups of 
rehabilitation engineers deal with sensory disabilities, such as sight or hearing, or with communications 
disorders, both in the production side (e.g., the non-vocal) or in the comprehension side. For any given 
client, a rehabilitation engineer might have all of these concerns to consider (i.e., ADLs, mobility, sensory 
and communication dysfunctions). 

A key concept in physical or sensory rehabilitation is that of residual function or residual capacity. Such 
a concept implies that the function or sense can be quantified, that the performance range of that function 
or sense is known in a non-impaired population, and that the use of residual capacity by a disabled 
individual should be encouraged. These measures of human performance can be made subjectively by 
clinicians or objectively by some rather clever computerized test devices. 

A rehabilitation engineer asks three key questions: Can a diminished function or sense be successfully 
augmented? Is there a substitute way to return the function or to restore a sense? And is the solution 
appropriate and cost-effective? These questions give rise to two important rehabilitation concepts: orthot- 
ics and prosthetics. An orthosis is an appliance that aids an existing function. A prosthesis provides a 
substitute. 

An artificial limb is a prosthesis, as is a wheelchair. An ankle brace is an orthosis. So are eyeglasses. In 
fact, eyeglasses might well be the consumate rehabilitation device. They are inexpensive, have little social 
stigma, and are almost completely unobtrusive to the user. They have let many millions of individuals 
with correctable vision problems lead productive lives. But in essence, a pair of eyeglasses is an optical 
device, governed by traditional equations of physical optics. Eyeglasses can be made out of simple glass 
(from a raw material as abundant as the sands of the earth!) or complex plastics such as those that are 
ultraviolet sensitive. They can be ground by hand or by sophisticated computer-controlled optical grind- 
ers. Thus, crude technology can restore functional vision. Increasing the technical content of the eyeglasses 
(either by material or manufacturing method) in most cases will not increase the amount of function 
restored, but it might make the glasses cheaper, lighter and more prone to be used. 

139.3 Engineering Concepts in Sensory Rehabilitation 

Of the five traditional senses, vision and hearing most define the interactions that permit us to be human. 
These two senses are the main input channel through which data with high information content can flow. 
We read; we listen to speech or music; we view art. A loss of one or the other of these senses (or both) can 
have a devastating impact on the individual affected. Rehabilitation engineers attempt to restore the func- 
tions of these senses either through augmentation or via sensory substitution systems. Eyeglasses and hearing 
aids are examples of augmentative devices that can be used if some residual capacity remains. A major area 
of rehabilitation engineering research deals with sensory substitution systems [Kaczmarek, this Handbook]. 
The visual system has the capability to detect a single photon of light, yet also has a dynamic range 
that can respond to intensities many orders of magnitude greater. It can work with high contrast items 
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and with those of almost no contrast, and across the visible spectrum of colors. Millions of parallel data 
channels form the optic nerve that comes from an eye; each channel transmits an asynchronous and 
quasi-random (in time) stream of binary pulses. While the temporal coding on any one of these channels 
is not fast (on the order of 200 bits per sec or less), the capacity of the human brain to parallel process 
the entire image is faster than any supercomputer yet built. 

If sight is lost, how can it be replaced? A simple pair of eyeglasses will not work, since either the sensor 
(the retina), the communication channel (the optic nerve and all of its relays to the brain), or one or 
more essential central processors (the occipital part of the cerebral cortex for initial processing; the parietal 
and other cortical areas for information extraction) has been damaged. For replacement within the 
system, one must determine where the visual system has failed and whether a stage of the system can be 
artificially bypassed. If one uses another sensory modality (e.g., touch or hearing) as an alternate input 
channel, one must determine whether there is sufficient bandwidth in that channel and whether the 
higher-order processing hierarchy is plastic enough to process information coming via a different route. 

While the above discussion might seem just philosophical, it is more than that. We normally read 
printed text with our eyes. We recognize words from their (visual) letter combinations. We comprehend 
what we read via a mysterious processing in the parietal and temporal parts of the cerebral cortex. Could 
we perhaps read and comprehend this text or other forms of writing through our fingertips with an 
appropriate interface? The answer surprisingly is yes! And, the adaptation actually goes back to one of 
the earliest applications of coding theory — that of the development of Braille. Braille condenses all text 
characters to a raised matrix of 2 by 3 dots (2 6 combinations), with certain combinations reserved as 
indicators for the next character (such as a number indicator) or for special contractions. Trained readers 
of Braille can read over 250 words per minute of grade 2 Braille (as fast as most sighted readers can read 
printed text!). Thus, the Braille code is in essence a rehabilitation engineering concept where an alternate 
sensory channel is used as a substitute and where a recoding scheme has been employed. 

Rehabilitation engineers and their colleagues have designed other ways to read text. To replace the 
retina as a sensor element, a modern high resolution, high sensitivity, fast imaging sensor (CCD, etc.) is 
employed to capture a visual image of the text. One method, used by various page scanning devices, 
converts the scanned image to text by using optical character recognition schemes, and then outputs the 
text as speech via text-to-speech algorithms. This machine essentially recites the text, much as an sighted 
helper might do when reading aloud to the blind individual. The user of the device is thus freed of the 
absolute need for a helper. Such independence is often the goal of rehabilitation. 

Perhaps the most interesting method presents an image of the scanned data directly to the visual cortex 
or retina via an array of implantable electrodes that are used to electrically activate nearby cortical or 
retinal structures. The visual cortex and retina are laid out in a topographic fashion such that there is 
an orderly mapping of the signal from different parts of the visual field to the retina, and from the retina 
to corresponding parts of the occipital cortex. The goal of stimulation is to mimic the neural activity 
that would have been evoked had the signal come through normal channels. And, such stimulation does 
produce the sensation of light. Since the "image" stays within the visual system, the rehabilitation solution 
is said to be modality-specific. However, substantial problems dealing with biocompatibility and image 
processing and reduction remain in the design of the electrode arrays and processors that serve to interface 
the electronics and neurological tissue. 

Deafness is another manifestation of a loss of a communication channel, this time for the sense of 
hearing. Totally deaf individuals use vision as a substitute input channel when communicating via sign 
language (also a substitute code), and can sign at information rates that match or exceed that of verbal 
communication. Hearing aids are now commercially available that can adaptively filter out background 
noise (a predictable signal) while amplifying speech (unpredictable) using autoregressive, moving average 
(ARMA) signal processing. With the recent advent of powerful digital signal processing chips, true digital 
hearing aids are now available. Previous analog aids, or digitally programable analog aids, provided a set 
of tunable filters and amplifiers to cover the low, mid and high frequency ranges of the hearing spectrum. 
But the digital aids can be specifically and easily tailored (i.e., programmed) to compensate for the specific 
losses of each individual client across the frequency continuum of hearing, and still provide automatic 
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gain control and one or more user-selectable settings that have been adjusted to perform optimally in 
differing noise environments. 

An exciting development is occurring outside the field of rehabilitation that will have a profound 
impact on the ability of the deaf to comprehend speech. Electronics companies are now beginning to 
market universal translation aids for travellers, where a phrase spoken in one language is captured, parsed, 
translated, and restated (either spoken or displayed) in another language. The deaf would simply require 
that the visual display be in the language that they use for writing. 

Deafness is often brought on (or occurs congenitally) by damage to the cochlea. The cochlea normally 
transduces variations in sound pressure intensity at a given frequency into patterns of neural discharge. 
This neural code is then carried by the auditory (eighth cranial) nerve to the brainstem where it is 
preprocessed and relayed to auditory cortex for initial processing and on to the parietal and other cortical 
areas for information extraction. Similar to the case for the visual system, the cochlea, auditory nerve, 
auditory cortex and all relays in between maintain a topological map, this time based on tone frequency 
(tonotopic). If deafness is solely due to cochlear damage (as is often the case) and if the auditory nerve 
is still intact, a cochlear implant can often be substituted for the regular transducer array (the cochlea) 
while still sending the signal through the normal auditory channel (to maintain modality-specificity). 

At first glance, the design of a cochlear prosthesis to restore hearing appears daunting. The hearing 
range of a healthy young individual is 20 to 16,000 Hz. The transducing structure, the cochlea, has 3500 
inner and 12000 outer hair cells, each best activated by a specific frequency that causes a localized 
mechanical resonance in the basilar membrane of the cochlea. Deflection of a hair cell causes the cell to 
fire an all-or-none (i.e., pulsatile) neuronal discharge, whose rate of repetition depends to a first approx- 
imation on the amplitude of the stimulus. The outputs of these hair cells have an orderly convergence 
on the 30,000 to 40,000 fibers that make up the auditory portion of the eighth cranial nerve. These 
afferent fibers in turn go to brainstem neurons that process and relay the signals on to higher brain 
centers [Klinke, 1983]. For many causes of deafness, the hair cells are destroyed, but the eighth nerve 
remains intact. Thus, if one could elicit activity in a specific output fiber by means other than the hair 
cell motion, perhaps some sense of hearing could be restored. The geometry of the cochlea helps in this 
regard as different portions of the nerve are closer to different parts of the cochlea. 

Electrical stimulation is now used in the cochlear implant to bypass hair cell transduction mechanisms 
[Loeb, 1985; Clark et al., 1990] . These sophisticated devices have required that complex signal processing, 
electronic and packaging problems be solved. One current cochlear implant has 22 stimulus sites along 
the scala tympani of the cochlea. Those sites provide excitation to the peripheral processes of the cells 
of the eighth cranial nerve, which are splayed out along the length of the scala. The electrode assembly 
itself has 22 ring electrodes spaced along its length and some additional guard rings between the active 
electrodes and the receiver to aid in securing the very flexible electrode assembly after it is snaked into 
the cochlea's very small (a few mm) round window (a surgeon related to me that positioning the electrode 
was akin to pushing a piece of cooked spaghetti through a small hole at the end of a long tunnel). The 
electrode is attached to a receiver that is inlaid into a slot milled out of the temporal bone. The receiver 
contains circuitry that can select any electrode ring to be a source and any other electrode to be a sink 
for the stimulating current, and that can rapidly sequence between various pairs of electrodes. The receiver 
is powered and controlled by a radiofrequency link with an external transmitter, whose alignment is 
maintained by means of a permanent magnet imbedded in the receiver. 

A digital signal processor stores information about a specific user and his or her optimal electrode 
locations for specific frequency bands. The object is to determine what pair of electrodes best produces 
the subjective perception of a certain pitch in the implanted individual, and then to associate a particular 
filter with that pair via the controller. An enormous amount of compression occurs in taking the frequency 
range necessary for speech comprehension and reducing it to a few discrete channels. At present, the 
optimum compression algorithm is unknown, and much fundamental research is being carried out in 
speech processing, compression and recognition. But, what is amazing is that a number of totally deaf 
individuals can relearn to comprehend speech exceptionally well without speech-reading through the 
use of these implants. Other individuals find that the implant aids in speech-reading. For some only an 
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awareness of environmental sounds is apparent; and for another group, the implant appears to have had 
little effect. But, if you could (as I have been able to) finally converse in unaided speech with an individual 
who had been rendered totally blind and deaf by a traumatic brain injury, you begin to appreciate the 
power of rehabilitation engineering. 

139.4 Engineering Concepts in Motor Rehabilitation 

Limitations in mobility can severely restrict the quality of life of an individual so affected. A wheelchair 
is a prime example of a prosthesis that can restore personal mobility to those who cannot walk. Given 
the proper environment (fairly level floors, roads, etc.), modern wheelchairs can be highly efficient. In 
fact, the fastest times in one of man's greatest tests of endurance, the Boston Marathon, are achieved by 
the wheelchair racers. Although they do gain the advantage of being able to roll, they still must climb 
the same hills, and do so with only one-fifth of the muscle power available to an able-bodied marathoner. 

While a wheelchair user could certainly go down a set of steps (not recommended), climbing steps in 
a normal manual or electric wheelchair is a virtual impossibility. Ramps or lifts are engineered to provide 
accessibility in these cases, or special climbing wheelchairs can be purchased. Wheelchairs also do not 
work well on surfaces with high rolling resistance or viscous coefficients (e.g., mud, rough terrain, etc.), 
so alternate mobility aids must be found if access to these areas is to be provided to the physically disabled. 
Hand-controlled cars, vans, tractors and even airplanes are now driven by wheelchair users. The design 
of appropriate control modifications falls to the rehabilitation engineer. 

Loss of a limb can greatly impair functional activity. The engineering aspects of artificial limb design 
increase in complexity as the amount of residual limb decreases, especially if one or more joints are lost. 
As an example, a person with a mid-calf amputation could use a simple wooden stump to extend the 
leg, and could ambulate reasonably well. But such a leg is not cosmetically appealing and completely 
ignores any substitution for ankle function. 

Immediately following World War II, the United States government began the first concerted effort to 
foster better engineering design for artificial limbs. Dynamically lockable knee joints were designed for 
artificial limbs for above-knee amputees. In the ensuing years, energy-storing artificial ankles have been 
designed, some with prosthetic feet so realistic that beach thongs could be worn with them! Artificial 
hands, wrists and elbows were designed for upper limb amputees. Careful design of the actuating cable 
system also provided for a sense of hand grip force, so that the user had some feedback and did not need 
to rely on vision alone for guidance. 

Perhaps the most transparent (to the user) artificial arms are the ones that use electrical activity 
generated by the muscles remaining in the stump to control the actions of the elbow, wrist and hand 
[Stein et al., 1988]. This electrical activity is known as myoelectricity, and is produced as the muscle 
contraction spreads through the muscle. Note that these muscles, if intact, would have controlled at least 
one of these joints (e.g., the biceps and triceps for the elbow). Thus, a high level of modality-specificity 
is maintained since the functional element is substituted only at the last stage. All of the batteries, sensor 
electrodes, amplifiers, motor actuators and controllers (generally analog) reside entirely within these 
myoelectric arms. An individual trained in the use of a myoelectric arm can perform some impressive 
tasks with this arm. Current engineering research efforts involve the control of simultaneous multi-joint 
movements (rather than the single joint movement now available) and the provision for sensory feedback 
from the end effector of the artificial arm to the skin of the stump via electrical means. 

139.5 Engineering Concepts in Communications Disorders 

Speech is a uniquely human means of interpersonal communication. Problems that affect speech can 
occur at the initial transducer (the larynx) or at other areas of the vocal tract. They can be of neurological 
(due to cortical, brainstem or peripheral nerve damage), structural, and/or cognitive origin. A person 
might only be able to make a halting attempt at talking, or might not have sufficient control of other 
motor skills to type or write. 
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If only the larynx is involved, an externally applied artificial larynx can be used to generate a resonant 
column of air that can be modulated by other elements in the vocal tract. If other motor skills are intact, 
typing can be used to generate text, which in turn can be spoken via text-to-speech devices described 
above. And the rate of typing (either whole words or via coding) might be fast enough so that reasonable 
speech rates could be achieved. 

The rehabilitation engineer often becomes involved in the design or specification of augmentative 
communication aids for individuals who do not have good muscle control, either for speech or for limb 
movement. A whole industry has developed around the design of symbol or letter boards, where the user 
can point out (often painstakingly) letters, words or concepts. Some of these boards now have speech 
output. Linguistics and information theory have been combined in the invention of acceleration tech- 
niques intended to speed up the communication process. These include alternative language represen- 
tation systems based on semantic (iconic), alphanumeric, or other codes; and prediction systems, which 
provide choices based on previously selected letters or words. A general review of these aids can be found 
in Chapter 144, while Goodenough-Trepagnier [1994] edited a good publication dealing with human 
factors and cognative requirements. 

Some individuals can produce speech, but it is dysarthric and very hard to understand. Yet the utterance 
does contain information. Can this limited information be used to figure out what the individual wanted 
to say, and then voice it by artificial means? Research labs are now employing neural network theory to 
determine which pauses in an utterance are due to content (i.e., between a word or sentence) and those 
due to unwanted halts in speech production. 

139.6 Appropriate Technology 

Rehabilitation engineering lies at the interface of a wide variety of technical, biological and other concerns. 
A user might (and often does) put aside a technically sophisticated rehabilitation device in favor of a 
simpler device that is cheaper, and easier to use and maintain. The cosmetic appearance of the device 
(or cosmesis) sometimes becomes the overriding factor in acceptance or rejection of a device. A key 
design factor often lies in the use of the appropriate technology to accomplish the task adequately given 
the extent of the resources available to solve the problem and the residual capacity of the client. Adequacy 
can be verified by determining that increasing the technical content of the solution results in dispropor- 
tionately diminishing gains or escalating costs. Thus, a rehabilitation engineer must be able to distinguish 
applications where high technology is required from those where such technology results in an incre- 
mental gain in cost, durability, acceptance and other factors. Further, appropriateness will greatly depend 
on location. What is appropriate to a client near a major medical center in a highly developed country 
might not be appropriate to one in a rural setting or in a developing country. 

This is not to say that rehabilitation engineers should shun advances in technology. In fact, a fair 
proportion of rehabilitation engineers work in a research setting where state-of-the-art technology is 
being applied to the needs of the disabled. However, it is often difficult to transfer complex technology 
from a laboratory to disabled consumers not directly associated with that laboratory. Such devices are 
often designed for use only in a structured environment, are difficult to repair properly in the field, and 
often require a high level of user interaction or sophistication. 

Technology transfer in the rehabilitation arena is difficult, due to the limited and fragmented market. 
Advances in rehabilitation engineering are often piggybacked onto advances in commercial electronics. 
For instance, the exciting developments in text-to-speech and speech-to-text devices mentioned above 
are being driven by the commercial marketplace, and not by the rehabilitation arena. But such develop- 
ments will be welcomed by rehabilitation engineers no less. 

139.7 The Future of Engineering in Rehabilitation 

The traditional engineering disciplines permeate many aspects of rehabilitation. Signal processing, control 
and information theory, materials design, computers are all in widespread use from an electrical engineering 
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perspective. Neural networks, microfabrication, fuzzy logic, virtual reality, image processing and other 
emerging electrical and computer engineering tools are increasingly being applied. Mechanical engineer- 
ing principles are used in biomechanical studies, gait and motion analysis, prosthetic fitting, seat cushion 
and back support design, and the design of artificial joints. Materials and metalurgical engineers provide 
input on newer biocompatable materials. Chemical engineers are developing implantable sensors. Indus- 
trial engineers are increasingly studying rehabilitative ergonomics. 

The challenge to rehabilitation engineers is to find advances in any field, engineering or otherwise, 
that will aid their clients who have a disability. 

Defining Terms 

[n.b., The first five terms below have been proposed by the National Center for Medical Rehabilitation 
and Research (NCMRR) of the US National Institutes of Health (NIH)]. 

Disability: Inability or limitation in performing tasks, activities, and roles to levels expected within 

physical and social contexts 
Functional Limitation: Restriction or lack of ability to perform an action in the manner or within the 

range consistent with the purpose of an organ or organ system 
Impairment: Loss or abnormality of cognitive, emotional, physiological or anatomical structure or 

function, including all losses or abnormalities, not just those attributed to the initial pathophysiology 
Pathophysiology: Interruption or interference with normal physiological and developmental processes 

or structures 
Societal Limitation: Restriction, attributable to social policy or barriers (structural or attitudinal), 

which limits fulfillment of roles, or denies access to services or opportunities that are associated 

with full participation in society 

Activities of daily living (ADL): Personal activities that are done by almost everyone in the course of 
a normal day including eating, toileting, combing hair, brushing teeth, reading, etc. ADLs are 
distinguished from hobbies and from work-related activities (e.g., typing). 

Appropriate technology: the technology that will accomplish a task adequately given the resources 
available. Adequacy can be verified by determining that increasing the technological content of the 
solution results in diminishing gains or increasing costs. 

Modality-specific: A task that is specific to a single sense or movement pattern. 

Orthosis: A modality-specific appliance that aids the performance of a function or movement by 
augmenting or assisting the residual capabilities of that function or movement. An orthopaedic 
brace is an orthosis. 

Prosthesis: an appliance that substitutes for the loss of a particular function, generally by involving a 
different modality as an input and/or output channel. An artificial limb, a sensory substitution 
system, or an augmentative communication aid are prosthetic devices. 

Residual function or residual capacity: Residual function is a measure of the ability to to carry out 
one or more general tasks using the methods normally used. Residual capacity is a measure of the 
ability to to carry out these tasks using any means of performance. These residual measures are 
generally more subjective than other more quantifiable measures such as residual strength. 
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An orthopedic prosthesis is an internal or external device that replaces lost parts or functions of the 
neuroskeletomotor system. In contrast, a orthopedic orthosis is a device that augments a function of the 
skeletomotor system by controlling motion or altering the shape of body tissue. For example, an artificial 
leg or hand is a prosthesis, whereas a calliper (or brace) is an orthosis. This chapter addresses only 
orthoses and external orthopedic prostheses; internal orthopedic prostheses, such as artificial joints, are 
a subject on their own. 

When a human limb is lost through disease or trauma, the integrity of the body is compromised in 
so many ways that an engineer may well feel daunted by the design requirements for a prosthetic 
replacement. Consider the losses from a lower limb amputation. Gone is the structural support for the 
upper body in standing, along with the complex joint articulations and muscular motor system involved 
in walking. Lost also is the multimode sensory feedback, from inter alia pressure sensors on the sole of 
the foot, length and force sensors in the muscles, and position sensors in the joints, which closed the 
control loop around the skeletomotor system. The body also has lost a significant percentage of its weight 
and is now asymmetrical and unbalanced. 

We must first ask if it is desirable to attempt to replace all these losses with like-for-like components. 
If so, we need to strive to make a bionic limb of similar weight embodying anthropomorphic articulations 
with equally powerful motors and distributed sensors connected back into the wearer's residual neuro- 
muscular system. Or, is it better to accept the losses and redefine the optimal functioning of the new 
unit of person-plus-technology? In many cases, it may be concluded that a wheelchair is the optimal 
solution for lower limb loss. Even if engineering could provide the bionic solution, which it certainly 
cannot at present despite huge inroads made into aspects of these demands, there remain additional 
problems inherent to prosthetic replacements to consider. Of these, the unnatural mechanical interface 
between the external environment and the human body is one of the most difficult. Notable, in place of 
weight bearing through the structures of the foot that are well adapted for this purpose, load must now 
be transferred to the skeletal structures via intimate contact between the surface of residual limb and 
prosthesis; the exact distribution of load becomes critical. To circumvent these problems, an alternative 
direct transcutaneous fixation to the bone has been attempted in limited experimental trials, but this 
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brings its own problems of materials biocompatability and prevention of infection ingress around the 
opening through the skin. Orthotic devices are classified by acronyms that describe the joint which they 
cross. Thus an AFO is an ankle-foot orthosis, a CO is a cervical orthosis (neck brace or collar), and a 
TLSO is a thoracolumbosacral orthosis (spinal brace or jacket). The main categories are braces for the 
cervix (neck), upper limb, trunk, lower limb, and foot. Orthoses are generally simpler devices than 
prostheses, but because orthoses are constrained by the existing body shape and function, they can present 
an equally demanding design challenge. Certainly the interaction with body function is more critical, 
and successful application demands an in-depth appreciation of both residual function and the probable 
reaction to external interference. External orthotics are often classified as structural or functional, the 
former implying a static nature to hold an unstable joint and the latter a flexible or articulated system 
to promote the correct alignment of the joints during dynamic functioning. An alternative orthotic 
approach utilizes functional electrical stimulation (FES) of the patient's own muscles to generate appro- 
priate forces for joint motion; this is dealt with in Chapter 142. 

140.1 Fundamentals 

Designers of orthotic and prosthetic devices are aware of the three cardinal considerations — function, 
structure, and cosmesis. 

For requirements of function, we must be very clear about the objectives of treatment. This requires 
first an understanding of the clinical condition. Functional prescription is now a preferred route for the 
medical practitioner to specify the requirements, leaving the implementation of this instruction to the 
prosthetist, orthotist, or rehabilitation technologist. The benefits of this distinction between client spec- 
ification and final hardware will be obvious to design engineers. Indeed, the influence of design procedures 
on the supply process is a contribution from engineering that is being appreciated more and more. 

The second requirement for function is the knowledge of the biomechanics that underlies both the 
dysfunction in the patient and the function of proposed device to be coupled to the patient. Kinematics, 
dynamics, energy considerations, and control all enter into this understanding of function. Structure is 
the means of carrying the function, and finally both need to be embodied into a design that is cosmetically 
acceptable. Some of the fundamental issues in these concepts are discussed here. 

To function well, the device needs an effective coupling to the human body. To this end, there is often 
some part that is molded to the contours of the wearer. Achieving a satisfactory mechanical interface of 
a molded component depends primarily on the shape. The internal dimensions of such components are 
not made an exact match to the external dimensions of the limb segment, but by a process of rectification, 
the shape is adjusted to relieve areas of skin with low load tolerance. The Shapes are also evolved to 
achieve appropriate load distribution for stability of coupling between prosthetic socket and limb or, in 
orthotic design, a system of usually three forces that generates a moment to stabilize a collapsing joint 
(Fig. 140.1). Alignment is a second factor influencing the interface loading. For lower limb prostheses 
particularly, the alignment of the molded socket to the remainder of the structural components also will 
be critical in determining the moments and forces transmitted to the interface when the foot is flat on 
the ground. The same is true for lower limb orthoses, where the net action of the ground reaction forces 
and consequent moments around the natural joints are highly dependent on the alignment taken up by 
the combination of orthosis and shoe. Adjustability may be important, particularly for children or 
progressive medical conditions. Functional components that enable desirable motions are largely straight- 
forward engineering mechanisms such as hinges or dampers, although the specific design requirements 
for their dynamic performance may be quite complex because of the biomechanics of the body. An 
example of the design of knee joints is expanded below. These motions may be driven from external 
power sources but more often are passive or body-powered mechanisms. In orthoses where relatively 
small angular motions are needed, these may be provided by material flexibility rather than mechanisms. 

The structural requirements for lower-limb prosthetics have been laid down at a consensus meeting 
(1978) bases on biomechanical measurement of forces in a gait laboratory, referred to as the Philadelphia 
standards and soon to be incorporated into an ISO standard (ISO 13404,5; ISO 10328). Not only are the 
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FIGURE 140.1 Three-force system required in an orthosis to control a valgus hindfoot due to weakness in the 
hindfoot supinators. 

load level and life critical, but so is the mode of failure. Sudden failure of an ankle bolt resulting in 
disengagement of an artificial foot is not only potentially life-threatening to an elderly amputee who falls 
and breaks a hip but also can be quite traumatic to unsuspecting witnesses of the apparent event of 
autoamputation. Design and choice of materials should ensure a controlled slow yielding, not brittle 
fracture. A further consideration is the ability of the complete structure to absorb shock loading, either 
the repeated small shocks of walking at the heel strike or rather more major shocks during sports activities 
or falls. This minimizes the shock transmitted through the skin to the skeleton, known to cause both 
skin lesions and joint degeneration. Finally, the consideration of hygiene must not be overlooked; the 
user must be able to clean the orthosis or prosthesis adequately without compromising its structure or 
function. 

Added to the two elements of structure and function, the third element of cosmesis completes the 
trilogy. Appearance can be of great psychological importance to the user, and technology has its contri- 
bution here, too. As examples, special effects familiar in science fiction films also can be harnessed to 
provide realistic cosmetic covers for hand or foot prostheses. Borrowing from advanced manufacturing 
technology, optical shape scanning linked to three-dimensional (3D) computer-aided design, and CNC 
machining can be pressed into service to generate customized shapes to match a contralateral remaining 
limb. Up-to-date materials and component design each contribute to minimize the "orthopedic appli- 
ance" image of the devices (Fig. 140.2). In providing cosmesis, the views of the user must remain 
paramount. The wearer will often choose an attractive functional design in preference to a life like design 
that is not felt to be part of his or her body. 

Upper limb prostheses are often seen as a more interesting engineering challenge than lower limb, 
offering the possibilities for active motor/control systems and complex articulations. However, the market 
is an order of magnitude smaller and cost/benefit less easy to prove — after all, it is possible to function 
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FIGURE 140.2 The ARGO reciprocating-gait orthosis, normally worn under the clothing, with structural compo- 
nents produced from 3D CAD. (Courtesy of Hugh Steeper, Ltd., U.K.) 

fairly well with one arm, but try walking with one leg. At the simplest end, an arm for a below-elbow 
amputee might comprise a socket with a terminal device offering a pincer grip (hand or hook) that can 
be operated through a Bowden cable by shrugging the shoulders. Such body-powered prostheses may 
appear crude, but they are often favored by the wearer because of a sense of position and force feedback 
from the cable, and they do not need a power supply. Another, more elegant method of harnessing body 
power is to take a muscle made redundant by an amputation and tether its tendon through an artificially 
fashioned loop of skin: the cable can then be hooked through the loop [Childress, 1989]. 

Externally powered devices have been attempted using various power sources with degrees of success. 
Pneumatic power in the form of a gas cylinder is cheap and light, but recharging is a problem that 
exercised the ingenuity of early suppliers: where supplies were not readily available, even schemes to 
involve the local fire services with recharging were costed. Also, contemplate the prospect of bringing a 
loaded table fork toward your face carried on the end of a position-controlled arm powered with spongy, 
low-pressure pneumatic actuators, and you will appreciate another aspect of difficulties with this source. 
Nevertheless, gas-powered grip on a hand can be a good solution. Early skirmishes with stiffer hydraulic 
servos were largely unsuccessful because of power supply and actuator weight and oil leakage. Electric 
actuation, heavy and slow at first, has gradually improved to establish its premier position. Input control 
to these powered devices can be from surface electromyography or by mechanical movement of, for 
example, the shoulder or an ectromelic limb. Feedback can be presented as skin pressure, movement of 
a sensor over the skin, or electric stimulation. Control strategies range from position control around a 
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FIGURE 140.2 (continued) 

single joint or group of related joints through combined position and force control for hand grip to 
computer-assisted coordination of entire activities such as feeding. 

The physical designs in prosthetic and orthotic devices has changed substantially over the past decade. 
One could propose that this is solely the introduction of new materials. The sockets of artificial limbs 
have always been fashioned to suit the individual patient, historically by carving wood, shaping leather, 
or beating sheet metal. Following the introduction of thermosetting fiber-reinforced plastics hand-shaped 
over a plaster cast of the limb residuum, substitution of thermoforming plastics that could be automat- 
ically vacuum-formed made a leap forward to give light, rapidly made, and cosmetically improved 
solutions. Polypropylene is the favored material in this application. The same materials permitted the 
new concept of custom-molded orthoses. Carbon fiber composites substituted for metal have certainly 
improved the performance of structural components such as limb shanks. But some of the progress owes 
much to innovative thinking. The flex foot is a fine example, where a traditional anthropomorphic design 
with imitation ankle joint and metatarsal break is completely abandoned and a functional design adopted 
to optimize energy storage and return. This is based on two leaf springs made from Kevlar, joined together 
at the ankle with one splaying down toward the toes to form the forefoot spring and the other rearward 
to form the heel spring (Fig. 140.3). Apart from the gains for the disabled athletes for whom the foot 
was designed-and these are so remarkable that there is little point in competing now without this 
foot-clients across all age groups have benefited from the adaptability to rough ground and shock- 
absorption capability. 
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140.2 Applications 



Computer-Aided Engineering in Customized Component Design 

Computer-aided engineering has found a fertile ground for exploitation in the process of design of 
customized components to match to body shape. A good example is in sockets for artificial limbs. What 
prosthetists particularly seek is the ability to produce a well-fitting socket during the course of a single 
patient consultation. Traditional craft methods of casting the residual limb in plaster of paris, pouring 
a positive mold, manual rectification, and then socket fabrication over the rectified cast takes too long. 

By using advanced technology, residual limb shapes can be captured in a computer, rectified by 
computer algorithms, and CNC machined to produce the rectified cast in under an hour so that with 
the addition of vacuum-formed machinery to pull a socket rapidly over the cast, the socket can be ready 
for trial fitting in one session. There are added advantages too, in that the shape is now stored in digital 
form in the computer and can be reproduced or adjusted whenever and wherever desired. Although such 
systems are still in an early stage of introduction, many practicing prosthetists in the United States have 
now had hands-on experience of this technology, and a major evaluation by the Veterans Administration 
has been undertaken [Houston et al., 1992]. 

Initially, much of the engineering development work went into the hardware components, a difficult 
brief in view of the low cost target for a custom product. Requirements are considerably different from 
those of standard engineering, e.g., relaxation in the accuracies required (millimeters, not microns);a 
need to measure limb or trunk parts that are encumbered by the attached body, which may resist being 
orientated conveniently in a machine and which will certainly distort with the lightest pressure; and a 
need to reproduce fairly bulky items with strength to be used as a sacrificial mold. Instrumentation for 
body shape scanning has been developed using methods of silhouettes, Moire fringes, contact probes 
measuring contours of plaster casts, and light triangulation. Almost universally the molds are turned by 
"milling on a spit" [Duncan & Mair, 1983], using an adapted lathe with a milling head to spiral down 
a large cylindrical plug of material such as plaster of paris mix. Rehabilitation engineers watch with great 
interest, some with envy, the developments in rapid prototyping manufacture, which is so successful in 
reducing the cycle time for one-off developments else- 
where in industry, but alas the costs of techniques such 
as stereolithography are as yet beyond economic feasi- 
bility for our area. 

Much emphasis also has been placed on the graphics 
and algorithms needed to achieve rectification. Opin- 
ions vary as to what extent the computer should simply 
provide a more elegant tool for the prosthetist to exer- 
cise his or her traditional skills using 3D modeling and 
on-screen sculpting as a direct replacement for manual 
plaster rectification or to what extent the computer 
system should take over the bulk of the process by an 
expert systems approach. Systems currently available 
tend to do a little of each. A series of rectification maps 
can be held as templates, each storing the appropriate 
relief or buildup to be applied over a particular ana- 
tomic area of the limb. Thus the map might provide for 
a ridge to be added down the front of the shin of a lower 
limb model so that the eventual socket will not press 
against the vulnerable bony prominence of the tibia 
(Fig. 140.4). Positioning of the discrete regions to match 
individual anatomy might typically be anchored to one 
or more anatomic features indicated by the prosthetist. 




FIGURE 140.3 The Flex Foot. 
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FIGURE 140.4 A rectification require defined over the tibia of lower limb stump using the Shapemaker application 
for computer-aided socket design. 




FIGURE 140.5 Adjusting a socket contour with reference to 3D graphics and cross-sectional profiles in the UCL 
CASD system. (Reproduced from Reynolds and Lord [1992] with permission.) 

The prosthetist is also able to free-form sculpt a particular region by pulling the surface interactively 
with reference to graphic representation (Fig. 140.5); this is particularly useful where the patient has 
some unusual feature not provided for in the templates. 

As part of this general development, finite-element analysis has been employed to model the soft tissue 
distortion occurring during limb loading and to look at the influence of severity of rectification in the 
resultant distribution of interface stress [e.g., Reynolds & Lord, 1992] (Fig. 140.6). In engineering terms, 
this modeling is somewhat unusual and decidedly nonlinear. For a start, the tissues are highly deformable 
but nearly incompressible, which raises problems of a suitable Poisson ratio to apply in the modeling. 
Values of n = 0.3 to n = 0.49 have been proposed, based on experimental matching of stress-strain curves 
from indentation of limb tissue in vivo. In reality, though, compression (defined as a loss of volume) 
may be noted in a limb segment under localized external pressure due to loss of mass as first the blood 
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FIGURE 140.6 Finite-element analysis employed to determine the sensitivity of interface pressure to socket shape 
rectification: {a) limb and socket; (6) elements in layers representing idealized geometry of bone, soft tissue, and 
socket liner; (c) rectification map of radial differences between the external free shape of the limb and the internal 
dimensions of socket; and (d) FE predictions of direct pressure. (Courtesy of Zhang Ming, King's College London.) 

is rapidly evacuated and then interstitial fluids are more slowly squeezed out. Also, it is difficult to define 
the boundaries of the limb segment at the proximal end, still attached to the body, where soft tissues can 
easily bulge up and out. This makes accurate experimental determination of the stress-strain curves for 
the tissue matrix difficult. A nonlinear model with interface elements allowing slip to occur between skin 
and socket at the limit of static friction may need to be considered, since the frictional conditions at the 
interface will determine the balance between shear and direct stresses in supporting body weight against 
the sloping sidewalls. Although excessive shear at the skin surface is considered particularly damaging, 
higher pressures would be required in its complete absence. 

In a similar vein, computer-aided design (CAD) techniques are also finding application in the design 
of bespoke orthopedic footwear, using CAD techniques from the volume fashion trade modified to suit 
the one-off nature of bespoke work. This again requires the generation of a customized mold, or shoe 
last, for each foot, in addition to the design of patterns for the shoe uppers [Lord et al., 1991]. The 
philosophy of design of shoe lasts is quite different from that of sockets, because last shapes have 
considerable and fundamental differences from foot shapes. In this instance, a library of reference last 
shapes is held, and a suitable one is selected both to match the client's foot shape and to fulfill the 
shoemaking needs for the particular style and type of shoe. The schematic of the process followed in 
development of the Shoemaster system is shown in Fig. 140.7. 

Design of shoe inserts is another related application, with systems to capture, manipulate, and repro- 
duce underfoot contours now in commercial use. An example is the Ampfit system, where the foot is 
placed on a platform to which preshaped arch supports or other wedges or domes may first be attached. 
A matrix of round-ended cylinders is then forced up by gas pressure through both platform and supports, 
supporting the foot over most of the area with an even load distribution. The shape is captured from 
the cylinder locations and fed into a computer, where rectification can be made similar to that described 
for prosthetic sockets. A benchtop CNC machine then routs the shoe inserts from specially provided 
blanks while the client waits. 
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FIGURE 140.6 (continued) 
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FIGURE 140.7 Schematic of operation of the Shoemaster shoe design system based on selection of a basis last from 
a database of model lasts. A database of styles is also employed to generate the upper patterns. 

Examples of Innovative Component Design 
An Intelligent Prosthetic Knee 

The control of an artificial lower limb turns out to be most problematic during the swing phase, during 
which the foot is lifted off the ground to be guided into contact ahead of the walker. A prosthetic lower 
limb needs to be significantly lighter than its normal counterpart because the muscular power is not 
present to control it. Two technological advances have helped. First, carbon fiber construction has reduced 
the mass of the lower limb, and second, pneumatic or hydraulically controlled damping mechanisms for 
the knee joint have enabled adjustment of the swing phase to suit an individual's pattern of walking. 
Swing-phase control of the knee should operate in three areas: 

1. Resistance to flexion at late stance during toe-off controls any tendency to excessive heel rise at 
early swing. 

2. Assistance to extension after midswing ensures that the limb is fully extended and ready for heel 
strike. 

3. Resistance before a terminal impact at the end of the extension swing dampens out the inertial 
forces to allow a smooth transition from flexed to extended knee position. 

In conventional limbs, parameters of these controls are determined by fixed components (springs, 
bleed valves) that are set to optimum for an individual's normal gait at one particular speed, e.g., the 
pneumatic controller in Fig. 140.8. If the amputee subsequently walks more slowly, the limb will tend to 
lead, while if the amputee walks more quickly, the limb will tend to fall behind; the usual compensatory 
actions are, respectively, an unnatural tilting of the pelvis to delay heel contact or abnormal kicking 
through of the leg. 

In a recent advance, intelligence is built into the swing-phase controller to adjust automatically for 
cadence variations (Fig. 140.9). A 4-bit microprocessor is used to adjust a needle valve, via a linear stepper 
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PNEUMATIC CYLINDER ACTION 

FIGURE 140.8 Pneumatic cylinder action in a swing phase controller. (Reprinted with permission from S. Zahedi, 
The Results of the Field Trial of the Endolite Intelligent Prosthesis, internal publication, Chas. A. Blatchford & Sons, U.K.) 

motor, according to duration of the preceding swing phase [Zahedi, in press]. The unit is programmed by 
the prosthetist to provide optimal damping for the particular amputee's swing phase at slow, normal, and 
fast walking paces. Thereafter, the appropriate damping is automatically selected for any intermediate speed. 

A Hierarchically Controlled Prosthetic Hand 

Control of the intact hand is hierarchical. It starts with the owner's intention, and an action plan is 
formulated based on knowledge of the environment and the object to be manipulated. For gross move- 
ments, the numerous articulations rely on "preprogrammed" coordination from the central nervous 
system. Fine control leans heavily on local feedback from force and position sensors in the joints and 
tactile information about loading and slip at the skin. In contrast, conventional prostheses depend on 
the conscious command of all levels of control and so can be slow and tiring to use. 

Current technology is able to provide both the computing power and transducers required to recreate 
some of a normal hand's sophisticated proprioceptive control. A concept of extended physiologic prop- 
rioception (EPP) was introduced for control of gross arm movement [Simpson & Kenworthy, 1973] 
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FIGURE 140.9 The Endolite intelligent prosthesis in use, minus its cosmetic covers. 

whereby the central nervous system is retrained through residual proprioception to coordinate gross 
actions applying to the geometry of the new extended limb. This idea can be applied to initiate gross 
hand movements while delegating fine control to an intelligent controller. 

Developments by Chappell and Kyberd [1991] and others provide a fine example of the possibilities. 
A suitable mechanical configuration is shown in Fig. 140.10. Four 12-V dc electric motors with gearboxes 
control, respectively, thumb adduction, thumb flexion, forefinger flexion, and flexion of digits 3,4, and 
5. Digits 3, 4, and 5 are linked together by a double-swingletree mechanism that allows all three to be 
driven together. When one digit touches an object the other two can continue to close until they also 
touch or reach their limit of travel. The movement of the digits allows one of several basic postures: 

• Three-point chuck: Precision grip with digits 1, 2, and 3 (thumb set to oppose the midline between 
digits 2 and 3); digits 4 and 5 give additional support. 

• Two-point grip: Precision grip with digits 1 and 2 (thumb set to oppose forefinger); digits 3, 4, 
and 5 fully flexed and not used or fully extended. 

• Fist: As two-point grip but with thumb fully extended to allow large objects to be grasped. 

• Small Fist: As fist but with thumb flexed and abducted to oppose side of digit 2. 

• Side, or key: Digits 2 to 5 half fully flexed with thumb opposing side of second digit. 

• Flat hand: Digits 2 to 5 fully extended with thumb abducted and flexed, parked beside digit 2. 

The controller coordinates the transition between these positions and ensures that trajectories do not 
tangle. Feedback to the controller is provided by several devices. Potentiometers detect the angles of 
flexion of the digits; touch sensors detect pressure on the palmer surfaces of the digits; and a combined 
contact force (Hall effect) and slip sensor (from acoustic frequency output of force sensor) is mounted 
at the fingertips. The latter detects movement of an object and so controls grip strength appropriate to 
the task-whether holding a hammer or an egg [Kyberd & Chappell, 1993]. 
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The whole hand may be operated by electromyo- 
graphic signals from two antagonistic muscles in the 
supporting forearm stump, picked up at the skin sur- 
face. In response to tension in one muscle, the hand 
opens progressively and then closes to grip with an 
automatic reflex. The second muscle controls the 
mode of operation as the hand moves between the 
states of touch, hold, squeeze, and release. 

A Self-Aligning Orthotic Knee Joint 

Knee orthosis are often supplied to resist knee flexion 
during standing and gait at an otherwise collapsing 
joint. The rigid locking mechanisms on these devices 
are manually released to allow knee flexion during 
sitting. Fitting is complicated by the difficulty of 
attaching the orthosis with its joint accurately aligned 
to that of the knee. The simple diagram in Fig. 140.1 1 
shows how misplacement of a simple hinged orthosis 
with a notional fixed knee axis would cause the cuffs 
on the thigh and calf to press into the soft tissues of 
the limb (known as pistoning). 

The human knee does not have a fixed axis, though, 
but is better represented as a polycentric joint. In a 
sagittal (side) view, it is easy to conceptualize the ori- 
gin of these kinematics from the anatomy of the cru- 
ciate ligaments running crisscross across the joint, 
which together with the base of the femur and the 
head of the tibia form a classic four-bar linkage. The 




FIGURE 140.10 The Southampton hand prosthesis 
with four degrees of freedom in a power grip. An 
optical/acoustic sensor is mounted on the thumb. 
(Reprinted from Kyberd and Chappell [1993], Fig. 1.) 
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FIGURE 140.11 The problem caused by misplacement of a single-axis orthotic joint (a) is overcome by an orthosis 
(h) with a self-aligning axis. (The Laser system, courtesy of Hugh Steeper, Ltd., U.K.) 
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polycentric nature of the motion can therefore be mimicked by a similar geometry of linkage on the 
orthosis. 

The problem of alignment still remains, however, and precision location of attachment points is not 
possible when gripping through soft tissues. In one attempt to overcome this specific problem, the knee 
mechanism has been designed with not one but two axes (Fig. 140.11). The center of rotation is then 
free to self-align. This complexity of the joint while still maintaining the ability to fixate the knee and 
meeting low weight requirements is only achieved by meticulous design in composite materials. 

140.3 Summary 

The field of prosthetics and orthotics is one where at present traditional craft methods sit alongside the 
application of high technology. Gradually, advanced technology is creeping into most areas, bringing 
vast improvements in hardware performance specifications and aesthetics. This is, however, an area where 
the clinical skills of the prosthetist and orthotist will always be required in specification and fitting and 
where many of the products have customized components. The successful applications of technology are 
those which assist the professional to exercise his or her judgment, providing him or her with good tools 
and means to realize a functional specification. 

Since the demand for these devices is thankfully low, their design and manufacture are small scale in 
terms of volume. This taxes the skills of most engineers, both to design the product at reasonable up- 
front costs and to manufacture it economically in low volume. For bespoke components, we are moving 
from a base of craft manufacture through an era when modularization was exploited to allow small- 
batch production toward the use of CAD. In the latter, the engineering design effort is then embodied 
in the CAD system, leaving the prosthetist or orthotist to incorporate clinical design for each individual 
component. 

Specific examples of current applications have been described. These can only represent a small part 
of the design effort that is put into prosthetics and orthotics on a continuing basis, making advances in 
materials and electronics in particular available. 

We are also aware that in the space available, it has not been possible to include a discussion of the 
very innovative work that is being done in intermediate technology for the third world, for which the 
International Society of Prosthetics and Orthotics (address below) currently has a special working group. 

Defining Terms 

Biocompatability: Compatibility with living tissue, e.g., in consideration of toxicity, degradability, and 

mechanical interfacing. 
CNC machining: Use of a computer numerically controlled machine. 
Cosmesis: Aesthetics of appearance. 

Ectromelia: Congenital gross shortening of the long bones of a limb. 
Functional prescription: A doctor's prescription for supply of a device written in terms of its function 

as opposed to embodiment. 
Neuroskeletomotor system: The skeletal frame of the body with the muscles, peripheral nerves, and 

central nervous system of the spine and brain, which together participate in movement and 

stabilization of the body. 
Rectified, rectification: Adjustment of a model of body shape to achieve a desirable load distribution 

in a custom-molded prosthesis or orthosis. 
Soft tissues: Skin, fat, connective tissues, and muscles which, along with the hard tissues of bone, teeth, 

etc. and the fluids, make up the human body. 
Transcutaneous: Passing through the skin. 
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Centuries ago, people with disabilities who survived for an extended period of time were transported on 
hammocks slung between poles which were carried by others. This was the preferred means of transpor- 
tation of the upper class and thus carried no stigma. Later the wheelbarrow was developed and soon 
became a common mode of transportation for people with disabilities. Because wheelbarrows were used 
to transport materials, during this period in history, people with disabilities were looked upon as outcasts 
from society. During the renaissance, the French court popularized the first wheelchairs. Wheelchairs 
were overstuffed arm chairs with wheels placed upon them. This enabled movement, with assistance, 
indoors. Later the wooden wheelchair with wicker matting was developed. This type of chair remained 
the standard until the 1930s. Franklin D. Roosevelt was not satisfied with the wooden wheelchair and 
had many common metal kitchen chairs modified with wheels. In the 1930s a young mining engineer, 
named Everest experienced an accident that left him mobility impaired. He worked with a fellow engineer 
Jennings to develop steel wheelchairs. Within a few years, they formed a company Everest & Jennings to 
manufacture wheelchairs. Following World War II, medical advances saved the lives of many veterans 
with spinal cord injuries or lower limb amputations, who would have otherwise died. Veterans medical 
centers issued these veterans steel framed wheelchairs with 18 inch seat widths. These wheelchairs were 
designed to provide the veteran some mobility within the hospital and home, and not to optimize 
ergonomic variables. Just as among the ambulatory population, mobility among people with disabilities 
varies. Mobility is more of a functional limitation than a disability related condition. Powered mobility 
can have tremendous positive psycho-social effects on an individual. Power wheelchairs provide greater 
independence to thousands of people with severe mobility impairments. 
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Power wheelchairs began in the 1940's as standard cross-brace folding manual wheelchairs adapted with 
automobile starter motors and an automobile battery. The cross-braced wheelchair remained the standard 
for a number of years. When the rigid power wheelchair frame was developed, space became available 
under the seat for electronic controls, respirators, communication systems, and reclining devices. By the 
mid- 1 970s, wheelchairs had evolved to the point where people had acquired a significant level of mobility. 

A personal automobile has a profound affect on a persons' mobility and ability to participate in society. 
A wheelchair is suitable for short distances, and for many situations where an unimpaired person would 
walk. Modifications to vehicles may be as simple as a lever attached to the brake and accelerator pedals 
or as complex as a complete joystick controlled fly-by-wire system. Modifications to other components 
of the vehicle may be required to provide wheelchair access. An automobile may not be appropriate for 
some people who travel distances too long to be convenient with a wheelchair, but not long enough to 
warrant an adapted automobile. Micro-cars, enlarged wheelchairs which travel at bicycle speeds, are 
convenient for many people who wish only to travel to the local grocery store or post office. Micro-cars 
are also useful for people who like to travel along bicycle paths or drive short distances off-road. 

141.2 Categories of Wheelchairs 

There are two basic classes of wheelchairs: manually powered, and externally powered. For practical 
purposes, externally powered wheelchairs are electrically powered wheelchairs. There are approximately 
200,000 wheelchairs sold annually within the US of which about 20,000 are powered wheelchairs. Most 
wheelchairs are purchased by third-party-payers (e.g., insurance companies, government agencies). This 
requires the market to be responsive to wheelchairs user's needs, prescriber expertise and experience, 
third-party-payer purchase criteria, and competition from other manufacturers. Despite the complicated 
interaction between these components, and the regulation of products by several government agencies, 
a variety of wheelchairs and options are available. 

Depot wheelchairs are intended for institutional use where several people may use the same wheelchair. 
Generally, these wheelchairs are inappropriate for active people who use wheelchairs for personal mobility. 
Depot wheelchairs are designed to be inexpensive, accommodate large variations in body size, to be low 
maintenance, and often to be attendant propelled. They are heavy and their performance is limited. A 
typical depot wheelchair will have swing away footrests, removable armrests, a single cross-brace frame 
and solid tires. 

People who have impairment of an arm and one or both lower extremities may benefit from a one- 
arm drive wheelchair that uses a linkage connecting the rear wheels. This allows the user to push upon 
the pushrim of one wheel and to propel both wheels. To effectively turn the wheelchair, the user must 
have the ability to disengage the drive mechanism. 

Some people have weakness of the upper and lower extremities and can gain maximal benefit from 
wheelchair propulsion by combining the use of their arms and legs or by using only their legs. The design 
and selection of a foot-drive wheelchair depends greatly upon how the user can take greatest advantage 
of their motor abilities. 

Indoor spaces are more limited and one is often required to get close to furnishings and fixtures to 
use them properly. Indoor wheelchairs often use rear castors because of the manueverability of these 
designs. However, rear castor designs make the wheelchair less stable in lateral directions. Indoor wheel- 
chairs typically have short wheelbases. 

All wheelchairs are not propelled by the person sitting in the wheelchair. In many hospitals and long- 
term care facilities, wheelchairs are propelled by attendants. Attendant propelled wheelchair designs must 
consider the rider and the attendant as users. The rider must be transported safely and comfortably. The 
attendant must be able to operate and easily maneuver safely and with minimum physical strain. 

Active users often prefer highly maneuverable and responsive wheelchairs which fit their physical and 
psychosocial character. The ultralight wheelchair evolved from the desire of wheelchair users to develop 
functional ergonomic designs for wheelchairs. Ultralight wheelchairs are made of such materials as 
aluminum, alloy steel, titanium, or composites. The design of ultralight wheelchairs allows a number of 

© 1997 by CRC Press LLC 



features to be customized by the user or be specified for manufacture. The most common features of all 
are the light weight, the high quality of materials used in their construction, and their functional design. 
Many people can benefit from ultralight wheelchair designs. 

The desire to achieve better performance has led wheelchair users, inventors, and manufacturers to 
constantly develop specialized wheelchairs for sports. There is no real typical sports wheelchair as the design 
depends heavily on the sport. Basketball and tennis wheelchairs are often thought to typify sports wheelchair 
design. However, racing, field events or shooting wheelchairs have little in common with the former. 

Some wheelchairs are made to change configuration from reclining to sitting, and from sitting to 
standing. Most stand-up wheelchairs cannot be driven in the stand-up setting in order to insure safe and 
stable operation. Standing gives the user the ability to reach cabinet and counter spaces otherwise 
inaccessible. Standing has the additional advantage of providing therapeutic benefits, i.e., hemodynamic 
improvements, and amelioration of osteoporosis. 

Stairs and other obstacles persist despite the progress made in universal design. Stair-climbing wheel- 
chairs are electrically powered wheelchairs designed to ascend and descend stairs safely under the occu- 
pant's control. Stair-climbing wheelchairs are quite complicated, and often reconfigure themselves while 
climbing stairs. The additional power required to climb stairs often reduces the range of the wheelchair 
when compared to standard power wheelchairs. 

141.3 Wheelchair Structure and Component Design 

Several factors must be considered when designing a wheelchair frame: what are the intended uses, what 
are the abilities of the user, what are the resources available, and what are the existing products available. 
These factors determine if and how the frame will be designed and built. Successful designs of wheelchairs 
can only be accomplished with continuous input from and interaction with wheelchair users. The dura- 
bility, aesthetics, function, ride comfort, and cost of the frame are dependent on the materials for con- 
struction, the frame geometry, and fabrication methods. One of the issues that makes wheelchair design 
more complicated is the fact that many users are dependent upon wheeled mobility everyday, nearly all day. 

Materials 

Most wheelchairs are made of either aluminum or steel. Some chairs are made of titanium or advanced 
composite materials, primarily carbon fiber, and in the future composite frames will probably begin to 
become more available. All of these materials have their strengths and weaknesses. 

Common aluminum wheelchairs are Tungsten Inert Gas (TIG) welded (i.e., electrically welded together 
in a cloud of inert gas). They are sometimes bolted together using lugs. Most aluminum wheelchair frames 
are constructed of round drawn 6061 aluminum tubing. This is one of the least expensive and most versatile 
of the heat treatable aluminum alloys. It has most of the desirable qualities of aluminum, has good 
mechanical properties and high corrosion resistance. It can be fabricated using most standard techniques. 

Most steel wheelchairs are made of mild steel (1040 or 1060) or chromium-molybdenum alloy (4130 or 
4140) seamless tubing commonly called chro-moly Mild steel is very inexpensive and easy to work with. 
It is wildly available, and performs well for many applications. However, it has a low strength to weight 
ratio compared to other materials. Chro-moly is widely used because of its weldability, ease of fabrication, 
mild hardenability, and high fatigueability Commonly wheelchairs are made of tubing 0.028-0.035 inches 
in wall thickness but diameters vary depending on the expected loads from between 0.25 to 1.25 in. 

More and more of the high-end wheelchairs are made of titanium. Titanium is a lightweight, strong, 
nonferrous metal. Titanium wheelchair frames are TIG welded. Titanium is the most exotic of the metals 
used in production wheelchairs and the most expensive. Titanium requires special tooling and skill to 
be machined and welded. It has very good mechanical properties and high corrosion resistance. It is 
resilient to wear and abrasion. Titanium is used because of its availability, appearance, corrosion resis- 
tance, very good strength and light weight. A drawback of titanium, besides cost, is that titanium once 
worn or if flawed may break rapidly (i.e., it has a tendency towards brittle fractures). 
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Advanced composites have been in use in aerospace and industrial applications for a number of years. 
These materials include Kevlar, carbon fiber, and polyester limestone composite. These materials are now 
making the transition to wheelchair design. Kevlar is an organic fiber which is yellow in color and soft 
to the touch. It is extremely strong and tough. It is one of the lightest structural fabrics on today's market. 
Kevlar is highly resistant to impact, but its compression strength is poor. Carbon fibers are made by 
changing the molecular structure of Rayon fibers by extreme stretching and heating. Carbon fiber is very 
stiff (high modulus of elasticity), very strong (high tensile strength) and has very low density (weight for 
a given volume). Composites come as cloth or yarn. Composite cloth is woven into bidirectional or 
unidirectional cloth. Unidirectional weaves can add strength along a particular direction. Composites 
must be bound together by resin or epoxy. Generally, polyester resins or various specialty epoxies (e.g., 
Safe-T-Poxy) are used. To achieve greatest strength a minimum amount of epoxy must be used while 
wetting all of the fibers. This is often achieved through a process called bagging. To increase the strength 
and stiffness of structural components a foam (e.g., styrofoam, urethane or PVC) core is used. The 
strengthening occurs because of the separation of the cloth layers (it now becomes more like a tube than 
a flat sheet). Polyesther limestone composites have been used widely in industrial high voltage electrical 
component enclosures. A blend of polyester and limestone are used to form a mixture which can be 
molded under pressure and heated to form a stiff and durable finished product. Polyester limestone 
composites have high impact strength, and hold tolerances well, but have substantially lower strength to 
weight ratios than other composites. Their primary advantage is cost, polyester limestone composites are 
very inexpensive and readily available. Composites can be molded into elaborate shapes which opens a 
multitude of possibilities for wheelchair design. 

Frame Design 

Presently all common wheelchair frames center around tubular construction. The tubing can either be 
welded together, or bolted together using lugs. There are two basic common frame styles: box frame and 
the cantilever frame. The box frame is named such because of its rectangular shape, and that tubes outline 
the edges of the "box". Box frames can be very strong and very durable. A cantilever frame is named so 
because the front and rear wheels, when viewing the chair from the side, appear to be connected by only 
one tube; this is similar to having the front wheels attached to a cantilever beam fixed at the rear wheels. 
Both frame types require cross bracing to provide adequate strength and stiffness. 

The box frame provides great strength and rigidity. If designed and constructed properly the frame 
only deflects minimally during normal loading, and most of the suspension is provided by the seat 
cushion, the wheels and the wheel mounting hardware. Many manufacturers do not triangulate their 
box frame designs to allow some flexibility. The cantilever frame is based upon a few basic principles: 
(1) the frame can act as a suspension, (2) there are fewer tubes and they are closer to the body which 
may make the chair less conspicuous, and (3) there are fewer parts and fewer welds which makes the 
frame easier to construct. 

Wheels and Casters 

Casters can be as small as 2 in. in diameter or as large as 12 in. in diameter for wheelchairs designed for 
daily use. Casters are either pneumatic, semi-pneumatic, or solid (polyurethane). Pneumatic casters offer 
a smoother ride at the cost of increased maintenance, whereas polyurethane casters are very durable. 
Semi-pneumatic tires offer a compromise. Most active users prefer 5 in. polyurethane casters or 8 inch 
pneumatic casters for daily use. An 8-in. caster offers a better ride comfort at the expense of foot clearance. 
Caster foot clearance is maximized with a 2-in. "Roller Blade" caster often used for court sports (e.g., 
basketball, tennis, and racquetball). Rear wheels come in three common sizes 22, 24, and 26 in. They 
come in two styles: spoked and MAG. MAG wheels are typically made of plastic and are die cast. MAG 
wheels require minimal maintenance, and wear well. However, spoked wheels are substantially lighter, 
more responsive, and are generally preferred by active manual wheelchair users. Rear tires can be two 
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types; pneumatic or puncture proof. Pneumatic tires can either use a separate tube and tire or a combined 
tube and tire (sew-up). Commonly, a belted rubber tire with a Butyl tube (65 psi) is used. However, 
those desiring higher performance prefer sew-up tires or Kevlar belted tires with high pressure tubes 
(180 psi) . Puncture proof tires are heavier, provide less suspension, and are less lively than pneumatic tires. 
The chair must be designed to optimize the interaction of the wheels with the ground. Four critical 
performance factors need to be considered: (1) Castor Flutter, (2) Castor Float, (3) Tracking, and 
(4) Alignment. Castor flutter is the shimmy (rapid vibration of the front wheels) that may occur on some 
surfaces above certain speeds. When one of the castors does not touch the floor when on level ground, 
the wheelchair has castor float. Castor float decreases the stability and performance of the wheelchair. A 
manual wheelchair uses rear wheel steering via differential propulsion torque. Tracking is the tendency of 
the wheelchair/rider to maintain its course once control has been relinquished. Tracking is important, as 
the rider propels the handrims periodically (about every second) and if the chair does not track well it 
will drift from its course between pushes and force the rider to correct heading. This will waste valuable 
energy and reduce control over the chair. Alignment generally refers to the orientation of the rear wheels 
with respect to one another. Typically, it is desirable to have the rear wheels parallel to one another without 
any difference between the distance across the two rear wheels at the front and back. Misalignment on the 
order of 1/8 inch can cause a noticeable increase in the effort required to propel the wheelchair. 

141.4 Ergonomics of Wheelchair Propulsion 

The most important area of wheelchair design and prescription is determining the proper interaction 
between the wheelchair and the user. This can lead to reducing the risk of developing repetitive strain 
injury while maximizing mobility. Cardiovascular fitness can be improved through exercise which 
requires a properly fitted wheelchair. 

Kinematics 

Kinematic data by itself does not provide sufficient information for the clinician to implement appropriate 
rehabilitation intervention strategies or for the engineer to incorporate this information into wheelchair 
design changes. Kinematic data are commonly collected at 60 Hz, which is the maximum frequency of 
many videotape-based systems. Kinematic data analysis shows that experienced wheelchair users contact 
the pushrim behind top-dead-center and push to nearly 90° in front of top-dead-center. This is signifi- 
cantly longer than non-wheelchair users. Lengthening the stroke permits lowering the propulsion 
moment and may place less stress on the users' joints. 

An important aspect of the evaluation, and possible retraining of wheelchair users is to determine the 
optimal stroke kinetics and kinematics [2]. However, there is typically some degree of variation from 
one stroke to another. Wheelchair propulsion kinematic data are typically cyclic (i.e., a person repeats 
or nearly repeats his/her arm motions over several strokes). Each marker of the kinematic model (e.g., 
shoulder, elbow, wrist, knuckle) of each subject generates an x and y set of data which is periodic. The 
frequencies of the x and y data are dependent upon the anthropometry of the individual, the construction 
of the wheelchair, and the speed of propulsion. The periodic nature of the kinematic data for wheelchair 
propulsion can be exploited to develop a characteristic stroke from a set of kinematic data (with the rear 
hub of the wheelchair chosen as the origin) including several strokes. 

Kinetics 

The SMART" heel was developed to measure the pushrim forces required for evaluating net joint forces 
and moments to allow the clinician and researcher to study the level of stress experienced by the joint 
structures during wheelchair propulsion. The SMART vheel uses a standard wheelchair wheel fitted with 
three beams 120° apart and each has two full strain gage bridges. The strain gage bridges are each 
interfaced through an instrumentation amplifier to a micro-controller which transmits the data through 
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a mercury slip-ring to the serial port of a computer. Kinetics of wheelchair propulsion are affected by 
speed of propulsion, injury level, user experience, and wheelchair type and fit. Van Der Woude et al. have 
reported on an ergometer which detected torque by way of a force transducer located in the wheel center 
and attached to what is referred to as the "wheel/hand rim construction" [7]. The ergometer was adjusted 
for each subject's anthropometric measurements. Data were sampled at 100 Hz for 7.5 second periods 
with a digital filter cut-off frequency of 10 Hz. Mean and peak torque increased with mean velocity, a 
maximum mean peak torque of 31 N-m occurred at 1.27 m/s. Torque curves of inexperienced subjects 
showed an initial negative deflection and a dip in the rising portion of the curve. Torque curves were 
reported to be in agreement with results of previous investigations [8,9,10]. 

Brauer and Hertig [12] measured the static torque produced on push-rims which were rigidly restrained 
by springs and mounted independent of the tires and rims of the wheelchair. The spring system was 
adjustable for the subject's strength. The wheels were locked in a fixed position. Torque was measured 
using slide-wire resistors coupled to the differential movements between the push-rim and wheels and 
recorded using a strip chart recorder. Subjects were asked to grasp the push-rim at six different test positions 
(-10, 0, 10, 20, 30, and 40 degrees relative to vertical) and to use maximal effort to turn both wheels 
forward. Male subjects (combined ambulatory and wheelchair users) produced torques of 27.9 to 46.6 N-m 
and female subjects produced torques of 17.1 to 32.1 N-m [12]. Grip location, handedness, grip strength, 
and how well the test wheelchair fit the anthropometric measurements of the individual affected the torque 
generated. Problems encountered were slipperiness of the push-rims due to a polished finish and limited 
contact due to the small diameter of the push-rim tubing (12.7 mm or 1/2 in). The use of one wheelchair 
for all subjects presented the problem of variations due to inappropriate fit for some individuals. 

Brubaker, Ross, and McLaurin [13] examined the effect of horizontal and vertical seat position (relative 
to the wheel position) on the generation of static push-rim force. Force was measured using a test platform 
with a movable seat and strain gauged beams to which the push-rims were mounted. Pushing and pulling 
forces were recorded using a strip chart recorder. Static force was measured for four grip positions (-30, 
0, 30, and 60 degrees) with various seat positions. Push-rim force ranged from approximately 500 to 
750 N and varied considerably with seat position and rim position [13]. 

Net Joint Forces and Moments 

Net joint forces and moments acting at the wrist, elbow and shoulder during wheelchair propulsion 
provide scientists and clinicians with information related to the level of stress borne by the joint structure. 
Joint moments and forces are calculated using limb segment and joint models, anthropometric data, 
kinetic data, and kinematic data. Joint moments data shows that forces at each joint vary among subjects 
in terms of peak forces, where they occur during the propulsion phase, and how quickly they develop. 
Peak net joint moments occur at different joint angles for different subjects, and conditions (e.g., speed, 
resistance). Convention for joint angles is that 180 degrees at the elbow represents full extension; while 
at the wrist, this is the hand in the neutral position (flexion less than 180 degrees and extension greater 
than 180 degrees). Joint angles at the shoulder are determined between the arm and the trunk, with zero 
measured at the point where the trunk and arm are aligned. Wheelchair users show maximum net 
shoulder moment between 20° and 40° of extension. Some wheelchair users also show a rapid rise in the 
elbow extensor moment at the beginning of the stroke with the elbow at about 120°. This moment value 
begins to decrease between 150° and 170°. At the wrist, the peak moments occur between 190° and 220°. 
Net joint moments and force models need to account for hand center of pressure, inaccuracies in 
anthropometric data, and joint models related to clinical variables. 

141.5 Power Wheelchair Electrical Systems 

Some people are impaired to an extent that they would have no mobility without a power wheelchair. 
However, some people may have limited upper body mobility and may have the ability to propel a manual 
wheelchair for short distances. These people may liken using a power wheelchair to admitting defeat. 
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However, a power wheelchair may provide greater mobility. In such cases it may be best to suggest a 
power wheelchair for longer excursions, and a manual wheelchair for in home and recreational use. 

User Interface 

Power wheelchairs are often used in conjunction with a number of other adaptive devices. For people 
with severe mobility impairments, power wheelchairs may be used with communication devices, com- 
puter access devices, respirators, and reclining seating systems. The integration of the users' multiple 
needs must also be considered when designing or prescribing a power wheelchair. 

The joystick is the most common control interface between the user and the wheelchair. Joysticks 
produce voltage signals proportional to displacement, force, or switch closures. Displacement joysticks 
are most popular. Displacement joysticks may use either potentiometers, variable inductors (coils) or 
optical sensors to convert displacement to voltage. Inductive joysticks are most common as they wear 
well and they can be made quite sensitive. Joysticks can be modified to be used for chin, foot, elbow, 
tongue or shoulder control. Typically, short throw joysticks are used for these applications. Force sensing 
joysticks use three basic transducers: simple springs and dampeners on a displacement joystick, cantilever 
beams with strain gages, and fluid with pressure sensors. Force sensing joysticks which rely on passive 
dampeners or fluid pressure generally require the user to have a range of motion within normal values 
for displacement joysticks users. Beam-based force sensing joysticks require negligible motion, and hence 
may be used for people with limited motion abilities. 

People who exhibit intention or spastic tremor or with multiple sclerosis may require special control 
considerations. Signal processing techniques are often required to grant the user greater control over the 
wheelchair. Typically, signal averaging or a low pass filter with a cut-off frequency of below 5 Hertz is 
used. The signal processing is typically incorporated into the controller. 

Some people lack the fine motor control to effectively use a joystick. An alternative for these people 
is to use a switch control or a head position control. A switch control simply uses either a set of switches 
or a single switch and a coded input, i.e., Morse code or some other simple switch code. The input of 
the user is latched by the controller and the wheelchair performs the task commanded by the user. The 
user may latch the chair to repeatedly perform a task a specified number of times, e.g., continue straight 
until commanded to do otherwise. Switch control is quite functional, but it is generally slower than 
joystick control. Switch inputs can be generated in many ways. Typically, low pressure switches are used. 
The input can come from a sip-and-puff mechanism which works off of a pressure transducer. A switch 
contact is detected when the pressure exceeds or drops below a threshold. The pressure sensor may be 
configured to react to pressure generated by the user blowing into or sipping from an input or by the 
user simply interrupting the flow in or out of a tube. Sip-and-puff may also be used as a combination 
of proportional and switch control. For example, the user can put the control in the "read speed" mode 
and then the proportional voltage output from the pressure transducer will be latched as the user-desired 
speed. 

Simple switches of various sizes can be used to control the chair using many parts of the body. Switches 
may be mounted on the armrests or a lap tray for hand or arm activation, on the footrest(s) for foot 
activation, or on a head rest for head activation. The motion of the head can also be used for proportional 
control by using ultrasonic sensors. Ultrasonic sensors can be mounted in an array about the headrest. 
The signal produced by the ultrasonic sensors is related to the position of the head. Hence, motion of 
the head can be used to create a proportional control signal. Ultrasonic head control and switch control 
can be combined to give some users greater mastery over their power wheelchair. Switches can be used 
to select the controller mode, whereas the ultrasonic sensors give a proportional input signal. 

A critical consideration when selecting or designing a user interface is that the ability of the user to 
accurately control the interface is heavily dependent upon the stability of the user within the wheelchair. 
Often custom seating and postural support systems are required for a user interface to be truly effective. 
The placement of the user interface is also critical to its efficacy as a functional control device. 
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Integrated Controls 

People with severe physical impairments may only be able to effectively manipulate a single input device. 
Integrated controls are used to facilitate using a single input device (e.g., joystick, head switches, voice 
recognition system) to control multiple actuators (e.g., power wheelchair, environmental control unit, 
manipulator). This provides the user with greater control over the environment. The M3S-Multiple 
Master Multiple Slave bus is designed to provide simple, reliable access to a variety of assistive devices. 
Assistive devices include input devices, actuators, and end-effectors. M3S is based on the Computer Area 
Network (CAN) standard. 

A wide range of organizations provide assistive devices which offer the opportunity of functioning in a 
more independent manner. However, many of these devices and systems are developed without coordination 
resulting in incompatible products. Clinicians and users often desire to combine products from various 
sources to achieve maximal independence. The result is to have several devices with their own input devices 
and overlapping functions. Integrated controls provide access to various end-effectors with a single input 
device. The M3S provides an electronic communication protocol so that the system operates properly. 

M3S is an interface specification with a basic hardware architecture, a bus communication protocol, 
and a configuration method. The M3S standard incorporates CAN plus two additional lines for greater 
security (i.e., 7 wire bus, 2 power lines, 2 CAN lines, safety lines, 1 shield, and 1 harness line). The system 
can be configured to each individual's needs. An M3S system consists of a microcontroller in each device 
and a control and configuration module (CCM). The CCM insures proper signal processing, system 
configuration, and safety monitoring. The CCM is linked to a display (e.g., visual, auditory, tactile) which 
allows the user to select and operate each end-effector. Any M3S compatible device can communicate 
with another M3S compatible device. M3S is an International Organization of Standards (ISO) open 
system communication implementation. 

Power System 

To implement a motor controller, a servo-amplifier is required to convert signal level power (volts at 
milliamps) to motor power (volts at amps). Typically, a design requirement for series, shunt, and brushless 
motor drives is to control torque and speed, and hence, power. Voltage control can often be used to 
control speed for both shunt and series motors. Series motors require feedback to achieve accurate control. 

Either a linear servo-amplifier or a chopper can be used. Linear servo amplifiers are not generally used 
with power wheelchairs primarily because of their lower efficiency than chopper circuits. A motor can 
be thought of as a filter to a chopper circuit, in this case, the switching unit can be used as part of a 
speed and current control loop. The torque ripple and noise associated with phase control drives can be 
avoided by the use of high switching frequencies. The response of the speed control loop is likewise 
improved with increasing switching frequency. 

Motor torque is proportional to the armature current in shunt motors and to the square of the current 
in series motors. The conduction loss of the motor and servo-amplifier are both proportional to the current 
squared. Optimal efficiency is achieved by minimizing the form factor, (I rms /I mean )- This can be done by 
increasing the switching frequency to reduce the amplitude of the ripple. Benefits of increased efficiency 
are increased brush life, gear life, and lower probability of field permanent magnet demagnetization. 

Switching or chopper drives are classified as either unidirectional or bi-directional. They are further 
divided by whether they use dynamic braking. Typically, power wheelchairs use bi-directional drives 
without dynamic braking. However, scooters may use unidirectional drives. The average voltage delivered 
to the motor from a switching drive is controlled by varying the duty cycle of the input waveform. There 
are two common methods of achieving this goal: (1) fixed pulse width, variable repetition rate, and 
(2) pulse-width modulation (PWM). Power wheelchair servo-amplifiers typically employ PWM. 

Pulse width modulation at a fixed frequency has no minimum on-time restriction. Therefore, current 
peaking and torque ripple can be minimized. For analysis, a d.c. motor can be modeled as a RL circuit, 
resistor and inductor, or in series with a voltage source. If the motor current is assumed continuous, 
then the minimum and maximum motor current can be represented by Eq. (141.1). 
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Two basic design principles are used when designing switching servo-amplifiers: (1) I max should be 
limited to five times the rated current of the motor to ensure that demagnetization does not occur, (2) the 
ripple, (I max -I m i n )/I avg , should be minimized to improve the form factor, and reduce the conduction loss 
in the switching devices. To achieve low ripple, either the inductance has to be large or the switching 
frequency has to be high. Permalloy powder cores can be used to reduce core loss at frequencies above 
a few kilohertz. However, this comes at the cost of the electrical time constant of the motor, degrading 
the motor response time. Hence, raising the switching frequency is most desirable. A power MOSFET 
has the ability to switch rapidly without the use of load-shaping components. 

There are several motor types that may be suitable for use with power wheelchairs. Most current 
designs use permanent magnet direct current motors. These motors provide high torque, high starting 
torque and are simplest to control. Permanent magnet direct current motors can either be controlled in 
what are commonly called current mode or voltage mode. These modes developed out of designs based 
upon controlling torque and speed, respectively. 

Alternating current motors can be designed to be highly efficient and can be controlled with modern 
power circuitry. Because of the development and wide spread dissemination of switching direct current 
converters, it is quite feasible to use alternating current motors with a battery supply. To date, alternating 
current motors have only been used in research on power wheelchairs. The output of the motor is 
controlled by varying the phase or the frequency. 

The battery energy storage system is recognized as one of the most significant limiting factors in 
powered wheelchair performance. Battery life and capacity are important. If battery life can be improved, 
the powered wheelchair user will have longer, reliable performance from his/her battery. An increase in 
battery capacity will allow powered wheelchair users to travel greater distances with batteries that weigh 
and measure the same as existing wheelchair batteries. Most importantly, increases in battery capacity 
will enable the use of smaller and lighter batteries. Because batteries account for such a large proportion 
of both the weight and volume of current powered wheelchair systems, wheelchair manufacturers must 
base much of their design around the battery package. 

Power wheelchairs typically incorporate 24 volt d.c. energy systems. The energy for the wheelchair is 
provided by two, deep cycle lead-acid batteries connected in series. Either wet cell or gel cell batteries 
are used. Wet cell batteries also cost about one-half as much as gel cell batteries. Gel cells may be required 
for transport by commercial air carriers. 

Battery technology for wheelchair users remains unchanged despite the call for improvements by power 
wheelchair users. This may be in part due to the relatively low number of units purchased, about 500,000 
per annum, when compared to automotive applications with about 6.6 million per annum by a single 
manufacturer. Wheelchair batteries are typically rated at 12 Volts and 30-90 ampere-hours capacity at 
room temperature. A power wheelchair draws about 10 amps during use. The range of the power 
wheelchair is directly proportional to the ampere-hour rating for the operating temperature. 

Batteries are grouped by size. Group size is indicated by a standard number. The group size defines 
the dimensions of the battery as shown in Table 141.1. The ampere-hour rating defines the battery's 
capacity. 

It is important that the appropriate charger be used with each battery set. Many battery chargers 
automatically reduce the amount of current delivered to the battery as the battery reaches full charge. 
This helps to prevent damage to the battery from boiling. The rate at which wet and gel cell batteries 
charge is significantly different. Some chargers are capable of operating with both types of batteries. 
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Table 141.1 Standard Power Wheelchair 
Battery Group Sizes 



Group number 


Length 


Width 


Height 


Ul 


7-3/4 


5-3/16 


7-5/16 


22NF 


9-7/16 


5-1/2 


8-15/16 


24 


10-1/4 


6-13/16 


8-7/8 


27 


12-1/16 


6-13/16 


8-7/8 



Note: Units are in inches. 

Many require setting the charger for the appropriate battery type. Most wheelchair batteries connected 
in series are charged simultaneously with a 24 Volt battery charger. 

Electromagnetic Compatibility 

Powered wheelchairs have been reported to exhibit unintended movement. Wheelchair manufacturers 
and the US Food and Drug Administration Center for Devices and Radiological Health (FDA-CDRH) 
have examined the susceptibility of powered wheelchairs and scooters to interference from radio and 
microwave transmissions. These devices are tested at frequencies ranging from 26 MHz to 1 GHz, which 
is common for transmissions (e.g., radio, TV, microwave, telephones, mobile radios). Power wheelchairs 
incorporate complex electronics and microcontrollers which are sensitive to electromagnetic (EM) radi- 
ation, electrostatic discharge (ESD), and other energy sources. 

Electric powered wheelchairs may be susceptible to electromagnetic interference (EMI) present in the 
ambient environment. Some level of EMI immunity is necessary to ensure the safety of power wheelchair 
users. Electromagnetic compatibility (EMC) is the term used to describe how devices and systems behave 
in an electromagnetic environment. Because of the complexity of power wheelchairs and scooters, and 
the interaction with an electromagnetic environment, susceptibility to interference cannot be calculated 
or estimated reliably. A significant number of people attach accessories (e.g., car stereos, computers, 
communication systems) to their power wheelchairs which also share the batteries. This may increase 
the susceptibility of other system components to EMI. A number of companies make electric powered 
devices designed to operate on power wheelchairs to provide postural support, pressure relief, environ- 
mental control, and motor vehicle operation. These devices may alter the EMI compatibility of power 
wheelchairs as provided by the original equipment manufacturer (OEM). Wheelchairs and accessories 
can be made to function properly within EM environments through testing. 

Field strengths have been measured at 20 V/m from a 15 Watt hand held cellular telephone, and 8 V/m 
from a 1 Watt hand held cellular telephone [2]. The FDA-CDRH has tested power wheelchairs and 
scooters in a Gigahertz Transverse Electromagnetic (GTEM) cell, and in an anechoic chamber with 
exposure strengths from 3 V/m to 40 V/m. The US FDA requires that a warning sticker be placed on 
each power wheelchair or scooter indicating the risk due to EMI. 

Two tests are commonly performed on chairs: brake release and variation in wheel speed. The device(s) 
used for measuring wheel speed and brake release must not significantly alter the field. The brakes shall 
not release or the wheels are not to move with a wheel torque equivalent to a 1:6 slope with a 100 kilogram 
rider when the wheelchair is exposed to EM radiation. Non-electrical contact methods (e.g., audio sensing, 
optical sensing) of measuring brake release or wheelchair movement are preferable. Nominal wheel speed 
may drift over the length of the test. This drift is primarily due to drop in battery charge over the test 
interval. Wheel speed must be recorded without EM interference between test intervals. The percentage 
change in wheel speed during exposure to EM interference shall be referenced to the nominal wheel speed 
for that test interval. The variation in absolute forward speed, (v emR + v emL )/2, is to be within 30% of the 
nominal forward speed, (v nomR + v nomL )/2. The differential speed between the two wheels should be within 
30% of each other, 2* (v emR - v emL )/(v nomR + v nomL ).The test frequency must be held long enough to accom- 
modate the slowest time constant (time required to reach 63% of maximum or minimum) of parameters 
related to wheelchair driving behavior. Currently two seconds is used by the FDA-CDRH test laboratory. 
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141.6 Personal Transportation 

Special adaptive equipment requirements increase with the degree of impairment and desired degree of 
independence in areas such as personal care, mobility, leisure, personal transportation, and employment. 
People are concerned that they receive the proper equipment for them to safely operate their vehicle. 
Access and egress equipment have the greatest maintenance requirements. Other devices such as hand 
controls, steering equipment, securement mechanisms, and interior controls require less maintenance. 
Most users of adaptive driving equipment are satisfied with the performance of such equipment. Most 
frequent equipment problems are minor and are repaired by consumers themselves. 

Physical functional abilities such as range of motion, manual muscle strength, sensation, grip strength, 
pinch strength, fine motor dexterity, and hand-eye coordination all may be related to driving potential. 
Driving characteristics must also be evaluated when determining an individual's potential to safely operate 
a motor vehicle. Throttle force, brake force, steering force, brake reaction time, and steering reaction 
time are all factors which influence an individual's driving potential. 

Vehicle Selection 

While it is often a difficult task to find an automobile which meets the specific needs of a particular 
wheelchair user, currently, no automobile meets the needs of all wheelchair users. Automotive consumers 
with disabilities are also concerned about ease of entry, stowage space for the wheelchair, and seat 
positioning. Reduced size, increasingly sloping windshields, lower roofs, and higher sills of new cars make 
selecting a new vehicle difficult for wheelchair users. The ability to load the wheelchair into a vehicle is 
essential. Some individuals with sufficient strength and a suitable vehicle are able to stow their wheelchairs 
inside the vehicle without the use of assistive devices. Many people must rely on an external loading 
device. The selection of the appropriate vehicle should be based upon the client's physical abilities and 
social needs. 

An approach some people have used to overcome the problems associated with a smaller car is to use 
a car-top wheelchair carrying device. These devices lift the wheelchair to the top of the car, fold, and 
stow it. They have been designed to work with four door sedans, light trucks, and compact automobiles. 

There are several critical dimensions to an automobile when determining wheelchair accessibility. The 
wheelbase of the automobile is often used by auto manufacturers to determine vehicle size (e.g., full- 
size, mid-size, compact). Typical ranges for passenger vehicles are presented in Table 141.2. 

Lift Mechanisms 

Many wheelchair users who cannot transfer into a passenger vehicle seat or prefer a larger vehicle, drive 
vans equipped with wheelchair lifts. Platform lifts may use a lifting track, a parallelogram lifting linkage, 
or a rotary lift. Lift devices are either electromechanically or electrohydraulically powered. The platform 
often folds into the side doorway of the van. Crane lifts, also called swing-out lifts, have a platform which 
elevates and folds or rotates into the van. Lifts may either be semi-automatic or automatic. In many cases 

TABLE 141.2 Typical Ranges of Accessibility 
Dimensions for Sedans 



Wheelbase 


93-108 


Door height 


33-47 


Door width 


41-47 


Headroom 


36-39 


Max. Space behind seat 


9-19 


Min. Space behind seat 


4-9 


Seat-to-ground distance 


18-22 


Width of door opening 


38-51 



Note: Units in inches. 
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semi-automatic lifts require the user to initiate various stages (e.g., unlocking door, door opening, 
lowering lift) of the lifting process. Automatic lifts are designed to perform all lift functions. They usually 
have an outside key operated control box, or an interior radio controlled control box. 

Some lifts use electrohydraulic actuators to lift and fold, with valves and gravity used to lower the lift. 
Crane lifts may swing out from a post at the front or rear of the side door. Interlocking mechanisms are 
available with some lifts to prevent the lift from being operated while the door is closed. The Society of 
Automotive Engineers (SAE) has developed guidelines for the testing of wheelchair lift devices for entry 
and exit from a personal vehicle. The standards are intended to set an acceptable level of reliability and 
performance for van lifts. 

Wheelchair Restraint Mechanisms 

Securement systems are used to temporarily attach wheelchairs to vehicles during transport. Many 
wheelchair users can operate a motor vehicle from their wheelchair, but are unable to transfer into a 
vehicle seat. Auto safety standards have reduced the number of US automobile accident fatalities despite 
an increase in the number of vehicles. The crash pulse determines the severity of the collision of the test 
sled, and hence, simulates real-world conditions. Securement systems are tested with a surrogate wheel- 
chair at 30 miles per hour (48 +2/-0 kilometers per hour) with a 20g deceleration. Wheelchairs must be 
safely restrained when experiencing an impact of this magnitude and no part of the wheelchair shall 
protrude into the occupant space where it might cause injury. 

Proper use of lap and shoulder belts is critical to protecting passengers in automobiles seats. A similar 
level of crash protection is required for individuals who remain in their wheelchairs during transportation. 
Wheelchairs are flexible, higher than a standard automobile seat, and not fixed to the vehicle. The 
passenger is restrained using a harness of at least one belt to provide pelvic restraint and two shoulder 
or torso belts that restrain both shoulders. A head support may also be used to prevent rearward motion 
of the head during impact or rebound. A three point restraint is the combination of a lap belt and a 
shoulder belt (e.g., pelvic torso restraint, lap-sash restraint, lap-shoulder restraint). 

The relationship between injury criteria and the mechanics of restraint systems are important to insure 
the safety of wheelchair users in motor vehicles. Hip and head deflection are often used criteria for 
determining potential injury. The automotive industry has invested considerable effort for research and 
development to protect vehicle passengers. Research is not nearly extensive for the passenger who remains 
seated in a wheelchair while traveling. Many wheelchair and occupant restraint systems copy the designs 
used for standard automobile seats. However, this type of design may not be appropriate. 

Crash tests have shown that for 10 or 20 g impacts of 100 millisecond duration, people may sustain 
injuries despite being restrained. When shoulder belts mounted 60 inches above the floor were used to 
restrain a 50th percentile male dummy, it was found that the torso was well controlled, and head and 
chest excursions were limited. When shoulder belts were anchored 36 inches above the floor they were 
ineffective in controlling torso movement. Kinematic results and head injury criteria (HIC) can be used 
to estimate the extent of injury sustained by a human passenger. A HIC of 1000 or greater indicates a 
serious or fatal head injury. Generally, a HIC value approaching or exceeding 1000 is indicative of head 
impact with some portion of the vehicle interior. The open space typically surrounding a wheelchair user 
in a public bus precludes impact with the buses' interior. High HIC values may occur when the torso is 
effectively restrained and there is a high degree of neck flexion. If the chin strikes the chest then there 
may be an impact great enough to cause head injury. 

Hand Controls 

Hand-controls are available for automatic and manual transmission vehicles. However, hand-controls 
for manual transmission automobiles must be custom made. There are also portable hand-controls, and 
long-term hand controls. Portable hand-controls are designed to easily attach to most common auto- 
mobiles with a minimal number of tools. Hand-controls are available for either left or right hand control. 
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Many hand controls are attached to the steering column. Hand-controls either clamp to the steering 
column or are attached to a bracket which is bolted to the steering column or dash, typically where the 
steering column bolts to the dash. Installation of the hand-controls should not interfere with driver safety 
features (e.g., air bags, collapsible steering columns). The push rods of the hand-control either clamp 
directly to the pedals or the levers connected to them. 

Most systems activate the brakes by having the driver push forward on a lever with a hand grip. This 
allows the driver to push against the back of the seat, creating substantial force, and braces the driver in 
the event of a collision. The throttle, or gas pedal, is operated in a number of ways. Some systems use a 
twist knob or motorcycle type throttle. Other systems actuate the throttle by pulling on the brake throttle 
lever. Another method is to rotate the throttle-brake lever downwards (i.e., pull the lever towards the 
thigh at a right angle to the brake application to operate the throttle). It is common to have the same 
vehicle driven by multiple people which may require the vehicle to be safely operated with hand-controls, 
and the OEM foot controls. Care must be taken to insure that the lever and brackets of the hand-controls 
do not restrict the driving motions of foot control drivers. 

Many people have the motor control necessary to operate a motor vehicle, but they do not have the 
strength required to operate manual hand-controls. Automatic or Fly-By-Wire hand-controls use external 
actuators (e.g., air motors, servo mechanisms, hydraulic motors) to reduce the force required to operate 
various vehicle primary controls. Power steering, power brakes, six-way power seats, and power adjustable 
steering columns can be purchased as factory options on many vehicles. 

Six-way power seats are used to provide greater postural support and positioning than standard 
automotive seats. They can be controlled by a few switches to move fore-aft, incline-recline, and superior- 
inferior. This allows the user to position the seat for easy entry and exit, and for optimal driving comfort. 
Power adjustable steering columns also make vehicles more accessible. By using a few buttons, the steering 
column can be tilted upwards or downwards allowing positioning for entry/exist into the vehicle, and 
for optimal driving control. 

Custom devices are available for people who require more than the OEM options for power assistance. 
Microprocessor and electronic technology have dramatically changed how motor vehicles are designed. 
Many functions of an automobile are controlled electronically or with electromechanical-electro hydraulic 
controls. This change in vehicle design has made a wide variety of options available for people who 
require advanced vehicle controls. Many automobiles use electronic fuel injection. Electronic fuel injec- 
tion systems convert the position of the accelerator pedal to a serial digital signal which is used by a 
microcontroller to inject the optimal fuel-air mixture into the automobile at the proper time during the 
piston stroke. The electronic signal for the accelerator position can be provided by another control device 
(e.g., joystick, slide-bar). 
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Rehabilitation of humans with disabilities requires effective usage of assistive systems for restoration of 
motor functions. Important features of an effective assistive system are: (1) reliability; (2) minimum 
increase of energy rate and cost with respect to able-bodied subjects performing the same task; 
(3) minimum disruption of normal activities when employing the assistive system; (4) cosmetics; and 
(5) practicality. The system should be easy to don and doff and available for daily use at home. These 
requirements and available technology have led to the development of externally powered orthoses and 
prostheses that interface directly or indirectly with the human neuromuscular system. These devices 
include battery-powered actuators or muscle stimulators, microprocessor-based controller and reliable 
biologic-like sensors. 

Hand, arm, and lower extremities assistive systems for amputees and humans with paralysis are 
elaborated in this chapter. Two approaches for the restoration of movements in humans with paralysis 
are described: functional activation of paralyzed muscles called Functional Electrical Stimulation (FES) 
or Functional Neuromuscular Stimulation (FNS), and the combined usage of an FES and a mechanical 
orthosis. The latter is called Hybrid Assistive System (HAS). The section dealing with prosthesis relates 
to externally controlled and powered artificial limbs, with specific emphasis on so-called myoelectric 
controlled devices. 

Available assistive systems meet many of the requirements listed above, but still have some drawbacks. 
The small number of externally powered system users on a daily basis is a significant indicator that these 
systems are not perfected. Recent neurophysiological findings on muscle properties and strengthening 
techniques, in addition to improved percutaneous or implantable stimulators, may increase the applica- 
bility of many assistive systems, specifically in cases of paralyzed limbs. Major limitations in daily appli- 
cation relate to insufficiently adaptive and robust control methods. The complexity of central nervous 
system (CNS) control and the interface between voluntary control and external artificial control are still 
challenging, unanswered questions. Hierarchical control methods, combining symbolic models at the 
highest level and analytic models at lower levels, give hope for further progress (see Chapter 165 — Control 
of Movements). 
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Sensory feedback is essential for effective control of FES systems. In addition to artificial sensors, hopes 
are directed towards the use of natural sensors and the development of an "intelligent" movement 
controller. 

142.1 FES Systems 

Functional Electrical Stimulation (FES) can help in regaining functional movements in numerous par- 
alyzed humans. FES activates innervated but paralyzed muscles, by using an electronic stimulator to 
deliver trains of pulses to neuromuscular structures. The basic phenomenon of the stimulation is a 
contraction of muscle due to the controlled delivery of electric charge to neuromuscular structures. 

FES systems can restore (1) goal-oriented (hand and arm) movements, and (2) cyclic (walking and 
standing) movements. 

Restoration of Hand Functions 

The objective of an upper extremity assistive system is directed toward establishing independence to the 
user. Efforts in developing upper extremity FES systems were targeted toward individuals with diminished, 
but preserved, shoulder and elbow functions, with lack of wrist control and grasping ability [1]. 

There have been several designs of FES systems. These systems can be divided among the source of 
control signals to trigger or regulate the stimulation pattern: shoulder control [2], voice control [3,4], 
respiratory control [5], joystick control [6], and position transducers [7, 8] or trigger [9, 6]. The division 
can be made based upon the method to which patterned electrical stimulation is delivered: one to three 
channel surface electrode systems [6-10], multichannel surface stimulation system [4], multichannel 
percutaneous systems with intramuscular electrodes [2,3,5], and fully implanted systems with epimysial 
electrodes [1]. Only a small number of FES grasping systems has been used outside the laboratory. 

The first grasping system used to provide prehension and release [11] used a splint with a spring for 
closure and electrical stimulation of the thumb extensor for release. This attempt was unsuccessful mostly 
because of the state-of-the-art technology used, but also because of muscle fatigue and erratic contractile 
response. Rudelet al. [12], following the work of Vodovnik [8], suggested the use of a simple two-channel 
stimulation system and a position transducer (sliding potentiometer). The shift of the potentiometer 
forward from its neutral position causes opening by stimulating the dorsal side of the forearm; a shift 
backward causes closing of the hand by stimulating the volar side of the forearm. 

The follow-up of initial FES system use was systematically continued in Japan. Japanese groups 
succeeded in developing the FES clinic in Sendai, Japan, where many subjects are implanted with up to 
30 percutaneous intramuscular electrodes that are used for therapy, but not to assist grasping. The 
Japanese research approach in functional grasping relates to subjects lacking not only hand functions 
but also the elbow control (e.g., C4 complete CNS injuries), and the system uses either a voice or suck/puff 
control and preprogrammed EMG-based stimulation patterns. This preprogrammed EMG-based stim- 
ulation is developed by detailed studies of muscle activities with intramuscular electrodes in able-bodied 
subjects while reaching and grasping [3]. 

The approach taken at Ben Gurion University, Israel [4] used a voice controlled multichannel surface 
electrode system. As many as 12 bipolar stimulation channels and a splint are used to control elbow, 
wrist, and hand functions. There is very little practical experience with the system, and the system has 
to be tuned for the needs of every single user. Surface stimulation most probably does not allow control 
of small hand and forearm muscles necessary to provide dexterity while grasping. Daily mounting and 
fitting of the system is problematic. The work of Nathan resulted with a commercial device called 
Handmaster NMS1 [9,13]. This device is approved as a therapeutic device claiming that it is improving 
grasping functionality in humans after stroke. A detailed analysis of effects of FES systems after stroke 
is presented by Hines and colleagues [14], 

The group at the Institut for Biokibernetik, Karlsruhe, Germany suggested the use of EMG recordings 
from the muscle, which is stimulated [15]. The aim of this device is to enhance the grasping by using 
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weak muscles. Hence, in principle it could be possible to use retained recordings from the volar side of 
the forearm to trigger on and off the stimulation of the same muscles. In this case, it is essential to 
eliminate the stimulation artifact and the evoked potential caused by the stimulus in order to eliminate 
positive feedback effects which will generate a tetanic contraction that cannot be turned off using the 
method presented. This approach is further developed in Denmark [16]. 

The Case Western Reserve University (CWRU) fully implantable system has a switch to turn the system 
on and off and select the grasp and a joystick to proportionally control aperture of the hand for palmar 
and lateral grasp [17-19]. These two grasps are synthesized using a preprogrammed synergy. The joystick 
mounted on the contralateral shoulder voluntarily controls the preprogrammed sequence of stimulation. 
The palmar grasp starts from the extended fingers and thumb (one end position of the joystick), followed 
by the movement of the thumb to opposition and flexing of fingers (other end position of the joystick). 
The lateral grasp starts from the full extension of fingers and the thumb, followed by the flexion of fingers 
and adduction of the thumb. The system is applicable if the following muscles can be stimulated: extensor 
pollicis longus, flexor pollicis longus, adductor pollicis, opponens, flexor digitorum profundus and 
superficialis, and extensor digitorum communis. It is possible to surgically change the grasp permanently 
[1] (pining some joints, tendon transfer, etc.). An important feature of the grasping system is related to 
daily fitting of the joystick (zeroing the neutral position), and going to hold mode from the movement 
mode. The hold mode is the regime where the muscle nerves are stimulated at the level that the same 
force is maintained, and the user selects the level. At this time, the CWRU system uses the joystick with 
two degrees of freedom or velocity sensor software to switch from active control to hold mode. When 
the system is in the hold mode, joystick movements do not effect the grasp. The initial CWRU system 
[20] suggested the use of myoelectric signals obtained from a site with some regaining voluntary activity. 
The CWRU implantable system is the only system used for assistance in daily living function [1]. The 
functional evaluation of the system [21] showed that there is substantial improvement in simple grasping 
tasks, which is important to a person with tetraplegia, and an increasing number are in use around North 
America and Australia. 

Prochazka [7] suggested the use of wrist position to control the stimulation of muscles to enhance the 
tenodesis grasping, and designed a device called the Bionic Glove (Fig. 142.1). A sensor is used to detect 
wrist movement, and trigger opening and closing of the hand. A microcomputer is built into the battery 
operated stimulation unit, which detects movements and controls three channels to stimulate thumb 
extension and flexion, and finger flexors. Clinical evaluation of the Bionic Glove [22] showed that the 
stimulation is beneficial for persons with tetraplegia both therapeutically and as an orthosis, but that the 
overall acceptance rate is still low. 

Based on four-channel surface electrical stimulation for walking triggered from the lesion EMG 
recordings [23,24], a grasping system with up to three channels of surface stimulation provides similar 
function to the Prochazka's system [25]. The use of myoelectric signals from above lesion sites (forearm 
wrist extensors) to trigger the stimulation of thumb and finger flexors was presented. A threshold 
discrimination of the EMG is done, which activates the proper stimulus pattern to be applied to the 
surface electrodes. A new laboratory version of the myoelectric control of a grasping system has been 
developed and tried at the ETH Zurich [26]. An interesting approach of using myoelectric control to 
drive the CWRU system was evaluated by Scott and colleagues [27]. Using muscles that are not affected 
and easily controllable, a bilateral control of the CWRU system might be possible. 

The idea of controlling the whole arm and assisting manipulation in humans lacking shoulder and 
elbow control has been getting more attention recently from the research team at the Case Western 
Reserve University in Cleveland, Ohio [28-31]. The system is designed to combine a fully implantable 
grasping system with some additional channels to control elbow extension, flexion, and shoulder move- 
ments. A different approach is taken by using surface stimulation to control elbow movements [32,33]. 
The clinical study at the Rehabilitation Institute in Belgrade with the so-called Belgrade grasping/reaching 
system showed that most persons with tetraplegia responding to stimulation of the biceps and triceps 
brachii muscles improve their function within six weeks to the stage at which they become reluctant to 
use FES assistance for manipulation [32,33]. 
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FIGURE 142.1 Bionic glove for restoration of grasping in persons with tetraplegia used for multicentre trials [7,22]. 
The upper panel shows electrodes, stimulator, and the glove, while the bottom panel shows the glove mounted and 
ready for use. 

The development of implantable cuff electrodes to be used for sensing contact, slippage, and pressure 
[34-38] opens a new prospective in controlling grasping devices, and the Center for Sensory Motor 
Interactions in Aalborg, Denmark is pursuing a series of experiments combining their sensing technique 
with the fully implantable CWRU system. 

Restoration of Standing and Walking 

The application of FES to the restoration of gait was first investigated systematically in Ljubljana, Slovenia 
[39-44]. Currently, FES for gait rehabilitation is used in a clinical setting in several rehabilitation centers 
[45-57] and there is a growing trend for the design of devices for home use. 

Current surface FES systems use various numbers of stimulation channels. The simplest one, from a 
technical point, is a single-channel stimulation system. This system is only suitable for stroke patients 
and a limited group of incomplete spinal cord injury patients. These individuals can perform limited 
ambulation with assistance of the upper extremities without an FES system, although this ambulation 
may be both modified and/or impaired. The FES in these humans is used to activate a single muscle 
group. The first demonstrated application of this technique was in stroke patients [40], even though the 
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original patent came from the Liberson patent in 1961. The stimulation is applied to ankle dorsiflexors 
so the "foot-drop" can be eliminated. A commercial system has been designed by Stein and colleagues 
[58], which integrates a single-channel stimulator and a tilt sensor; thus, eliminating a foot switch which 
was proved for easily generating false triggering and malfunctioning. Single and dual channel correcting 
foot-drop is now a regular clinical treatment in some rehabilitation institutions [59]. 

A multichannel system with a minimum of four channels of FES is required for ambulation of a patient 
with a complete motor lesion of lower extremities and preserved balance and upper body motor control 
[49]. Appropriate bilateral stimulation of the quadriceps muscles locks the knees during standing. Stim- 
ulating the common peroneal nerve on the ipsilateral side, while switching off the quadriceps stimulation 
on that side, produces a flexion of the leg. This flexion, combined with adequate movement of the upper 
body and use of the upper extremities for support, allows ground clearance and is considered the swing 
phase of the gait cycle. Hand or foot switches can provide the flexion-extension alternation needed for 
a slow forward or backward progression. Sufficient arm strength must be available to provide balance in 
parallel bars (clinical application), and with a rolling walker or crutches (daily use of FES). These systems 
evolved into a commercial product called Parastep-IR (Sigmedics, Chicago, IL) which was approved for 
home usage in 1994 by the Food and Drug Administration. 

Multichannel percutaneous systems for gait restoration were suggested [46,50,61,62] . The main advan- 
tage of these systems is the plausibility to activate many different muscle groups. A very similar prepro- 
grammed stimulation pattern to the one in a human with no motor disorders is delivered to ankle, knee, 
and hip joints, as well as to paraspinal muscles. The experience of the Cleveland research team suggested 
that 48 channels are required for a complete SCI walking system to achieve a reasonable walking pattern, 
but they recently changed their stimulation strategy and included some external bracing. Fine-wire 
intramuscular electrodes are cathodes positioned close to the motor point within selected muscles. Knee 
extensors (rectus femoris, vastus medialis, vastus lateralis, vastus intermedius), hip flexors (sartorius, 
tensor fasciae latae, gracilis, iliopsoas), hip extensors (semimembranosus, gluteus maximus), hip abduc- 
tors (gluteus medius), ankle dorsiflexors (tibialis anterior, peroneus longus), ankle plantar flexors (gas- 
trocnemius lateralis and medialis, plantaris and soleus), and paraspinal muscles are selected for activation. 
A surface electrode is used as a common anode. Interleaved pulses are delivered with a multichannel, 
battery-operated, portable stimulator. The hand controller allows the selection of gait activity. These 
systems were limited to the clinical environment. The application was investigated in complete spinal 
cord lesions and in stroke patients [61,62]. The same strategy and selection criteria for implantation was 
used for both stroke and SCI patients. Recent developments use the CWRU system with eight channels 
per leg to be activated and improved control [61-63]. 

A multichannel totally implanted FES system [55] was proposed and tested in a few subjects. This 
system uses a 16-channel implantable stimulator attached to the epineurium electrodes. Femoral and 
gluteal nerves were stimulated for hip and knee extension. The so-called "round-about" stimulation was 
applied in which four electrodes were located around the nerve and stimulated intermittently. This 
stimulation method reduces muscle fatigue. 

The development of the stimulation technology is providing new hope for patients and researchers 
alike. Two new techniques are especially important: ( 1 ) application of remotely controlled wireless micro- 
stimulators [64-66], and (2) so-called "stimulator for all seasons" [67]. There are several attempts to 
design effective wireless stimulators that are believed to be capable of selectively stimulating fascicles 
[68,69]. The technology used for cochlear implants is finding its way into applications for standing and 
walking restoration [70]. 

However, there are some unanswered questions which limit the effectiveness of FES systems that deal 
with muscle fatigue, reduced joint torques generated through FES in comparison to CNS activated torques 
in healthy subjects, modified reflex activities, spasticity, joint contractures, osteoporosis, and stress frac- 
tures. From the engineering point of view, the further development of FES systems has to address the 
following issues: the interface between an FES system and neuromuscular structures in the organism, 
biocompatibility of the FES system, and overall practicality. The least resolved questions in FES systems 
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deal with control. Open-loop control, often called reference trajectory control, is not efficient because 
of the disturbances and model errors. Closed-loop control requires that we know in advance the desired 
trajectory and use multiple sensors within the system in which the actuators have a delay in the same 
time range as duration of some locomotion states. In principle, closed-loop systems offer substantial 
increases in input-output linearity and repeatability, along with a substantial decrease in the sensitivity 
of the system to parameter variations (internal disturbances) and load changes (external disturbances). 
Digital closed-loop methods using proportional (P) and proportional plus integral (PI) controllers were 
studied for both recruitment and fatigue of muscles [71]. 

A non-conventional method based on symbolic representation of motion in animals and humans, called 
artificial reflex control, was proposed [72-76]. Different solutions are discussed in great detail in Chapter 165 
of this book. Many recent studies for improving control [77-82] aim to prolong standing with FES systems 
without arm supports. This is a very difficult task, considering that the human body acts as an inverted 
chain, and that FES systems generate non-physiological, fast fatiguing muscle contractions. 

The research results indicate that the usual classification of CNS lesions is applicable as a guideline, 
but is not convenient as a selection criterion for FES candidates. FES users and an appropriate FES system 
must be selected through a functional diagnosis. The term "functional diagnosis" means that the func- 
tional status after the injury determines the type of treatment. General statements about FES systems 
indicate that it is suitable for subjects with preserved peripheral neuromuscular structures, moderate 
spasticity, without joint contractures and limited osteoporosis. Subjects should be able to control their 
balance and upper body posture using the upper extremities for assistance (parallel bars, walker, crutches, 
etc.). Subjects with pathologies that affect the heart, lungs, or circulation should be treated with extreme 
care, and often they should not be included in an FES walking program. A satisfactory mental and 
emotional condition is extremely important because an FES treatment requires a certain degree of 
intelligence and understanding of the technical side of the system. High motivation and good cooperation 
with medical staff is a significant aspect for the efficacy of FES. Surface stimulation may be applied only 
in subjects with limited sensation because it activates pain receptors. Subjects suitable for FES can include 
those with head and spinal cord injuries, cerebral paralysis, multiple sclerosis, and different types of 
myelitis and others. 

The final current drawback to FES ambulation is the excessive energy cost while walking. FES walking 
should be described as a sequence of static states; long stance phase (several seconds) followed by brisk 
flexion movement and short step, followed by the same movement at the contralateral side. Dynamic 
walking is a necessary development which will introduce better use of inertial and gravity forces, and 
reduce energy rate and cost, increase the walking distance, and increase the speed of progression in 
parallel with decreased fatigue of muscles because the muscles will not be stimulated for long periods. 

Table 142.1 summarizes differences in walking performances between able-bodied persons and persons 
with paralysis (energy cost, energy rate, amount of the use of upper extremities, cardiovascular stress 
measured through pulse rate, and blood pressure). 

Hybrid Assistive Systems (HAS) 

A specific approach to integrating two assistive systems (FES and an external mechanical orthosis) has 
been proposed (Fig. 142.2). These systems are called hybrid assistive systems (HAS) or hybrid orthotic 
systems (HOS) [83,84]. A few possibilities for HAS design have been suggested, which combine relatively 
simple rigid mechanical structures for passive stabilization of lower limbs during stance phase and FES 
systems. These systems combine the use of a reciprocating gait orthosis with multichannel stimulation 
[53,84], the use of an ankle-foot orthosis or an extended ankle-foot orthosis with a knee cage [85], or 
the use of a self- fitting modular orthosis [86]. A few more sophisticated laboratory systems were dem- 
onstrated [85,87-89]. Each trend in the design of HAS implies different applications as well as specific 
hardware and control problems. On the basis of accumulated experience, the following features can serve 
as criteria for a closer description of various HAS designs [88]: (1) partial mechanical support, (2) parallel 
operation of the biological and mechanical system, and (3) sequential operation of the biological and 
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TABLE 142.1 Best Performance with the Usage of an Assistive System Compared to Performance 
of Abie-Bodied Subjects 





Oxygen Rate 


Oxygen Cost 


Speed vMAX 


Heart Rate 


Blood Pressure 


Use of Upper 




ml/kg/min 


ml/kg/m 


m/min 


b/min 


mmHg 


Extremities [%] 


Paralyzed 


13.6 


0.6 


60 


118 


119/85 


28 


subjects 








(84 rest) 






Able-bodied 


12.1 


0.15 


85 


95 


120/80 





subjects 








(81 rest) 






Control method 


Voluntary, 


Voluntary, 


Automatic, 


Voluntary, hand- 


Voluntary, hand- 


Voluntary, hand- 




hand-switch 


sensory 


preprogrammed 


switch or sensory 


switch or sensory 


switch or sensory 






driven 




driven 


driven 


driven 


Assistive system 


FES+SFMO 


FES+SFMO 


FES implant 


FES+RGO 


FES+RGO 


FES+RGO 


used 


FES+RGO 






FES+SFMO 







The data is collected from literature [53, 83, 86, 88; 115-119]. Abbreviations used in the table are: RGO — reciprocating gait 
orthosis; SFMO — self-fitting modular orthosis; FES — functional electrical stimulation. Data for able-bodied subjects are from 
Waters etal. [1989]. 




FIGURE 142.2 An incomplete tetraplegic subject walking with a hybrid assistive system. The particular system 
incorporates a six-channel surface stimulation system and a powered self-fitting modular orthosis [88]. 



the mechanical system. The partial mechanical support refers to the use of braces to assist FES only at 
specific events within a walking cycle [85]. The advanced version of powered orthoses to be used with 
FES is being developed by Durfee and colleagues [89]. Control of joints in mechanical orthosis is again 
becoming a target for research and development mainly because of new technological tools [90,91]. 
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142.2 Active Prostheses 

The role of active prosthesis is to extend the function provided by a "non-externally" powered and 
controlled artificial organ; hence, to improve the overall performance of motor function, ultimately 
providing a better quality of life. 

Active Above-Knee Prostheses 

Effective restoration of walking and standing of handicapped humans is an important element to improve 
the quality of life. Artificial legs of different kinds have been in use for a long time, but in many cases 
they are inadequate for the needs of amputees, specifically for high above-knee amputations (e.g., hip 
disarticulation), bilateral amputees, and patients who have demanding biomechanical requirements (e.g., 
subjects involved in sports). 

Modern technology has led to improved designs of below-knee prostheses (BKP) in recent years. 
Below-knee amputees perform many normal locomotor activities and participate in many sports requir- 
ing running, jumping, and other jerky movements [92]. The biggest progress was made using readily 
available and easy-to-work-with plastic and graphite alloys for building the artificial skeletal portion of 
the shank and foot [93]. Below-knee prostheses are light, easy to assemble, and reliable. They provide 
good support and excellent energy absorption, which prevents impact, jerks, and allows the push-off 
phase in the gait cycle. Existing below-knee prostheses, even those without ankle joints, duplicate the 
behavior of the normal foot-ankle complex during swing and stance phases of the step cycle. 

The same technology has been introduced into the design of above-knee prostheses (AKP). However, 
commercially available AKPs suffer from several drawbacks. The requirements for an AKP were stated by 
Wagner and Catranis [94]. The prosthesis must support the body weight of the amputee like the normal 
limb during the stance phase of level walking, on slopes and on soft or rough terrain. This implies that 
the prosthesis provides "stability" during weight bearing activity, i.e., it prevents sudden or uncontrolled 
flexion of the knee. The second requirement is that the body is supported such that undesirable 
socket/stump interface pressures and gait abnormalities due to painful socket/stump contact are prevented. 
The analysis of biomechanical factors that influence the shaping, fitting, and alignment of the socket is a 
problem in itself. If the fitting has been accomplished, allowing the amputee to manipulate and control 
the prosthesis in an active and comfortable manner, the socket and stump can be treated as one single 
body. The third requirement, which is somewhat controversial, is that the prosthesis should duplicate as 
nearly as possible the kinematics and dynamics of normal gait. The amputee should walk with a normal- 
looking gait over a useful range of speeds associated with typical activities for normal persons of similar 
age. The latter requirement has received attention in recent years and fully integrated systems, so called 
"self-contained" active AKPs are being incorporated into modern rehabilitation. The self-contained prin- 
ciple implies that the artificial leg contains the energy source, actuator, controller, and sensors (Fig. 142.3). 

Devices such as the polycentric knee mechanism [95] , the polycentric knee mechanism with a hydraulic 
valve, or the AKP with friction type brake [96] satisfy some of the above performance requirements. A 
logically controlled AKP with a hydraulic valve represents a further bridge between purely passive and 
fully controllable assistive devices [97,98] . Studies of the knee joint performance in the stance phase have 
been done [99-101]. From those studies it became clear that for several gait modes (ramp, stairs), active 
knee control in the swing phase is desirable. To meet these requirements, self-contained, multi-mode 
AKPs have been introduced [102-105]. The final prototype version of an active self-contained AKP 
allowing controlled flexion and extension throughout the gait cycle was developed by James [97] and is 
a product of Otto Bock Company, Germany. This leg uses an efficient knee controller with a custom- 
built hydraulic valve and a single chip microcontroller with a rule-based control scheme (see Chapter 165: 
Control of Movements). 

The advantages of the externally powered and controlled leg (see, e.g., [104]) are an increased speed 
of locomotion, better gait symmetry, and lower energy cost and rate. These accomplishments are due to 
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the fact that the powered leg allows controlled knee 
bounce after heel contact, limited push-off at the 
end of the stance phase, and effective flexion and 
controlled knee extension during the swing phase 
of gait. The external control allows the amputee to 
walk almost without any circumduction, which 
helps gait symmetry considerably. The knee joint of 
a standard endoskeletal prosthesis is fitted with an 
actuator having two independent braking units 
(friction drum type brake with an azbestum lining 
to allow control of the knee joint stiffness), and the 
extension-flexion driver with a ball-screw mecha- 
nism. The rechargeable battery power supply is 
designed for up to three hours of continuous level 
walking. The Motorola 68HCll-based microcon- 
troller has been fitted into the interior of the pros- 
thesis fulfilling self-containing principles. A 
hierarchical controller allows for intention recogni- 
tion (volitional actions of the subject), adaptation 
to environmental changes, cyclical triggering 
throughout locomotion, and minimal jerk actuator 
control. 

Myoelectric Hand and 
Arm Prostheses 

The popularity of this control strategy comes from FIGURE 142.3 A self-contained, battery powered, 
the fact that it is "very elegant" because the control microcomputer controlled above-knee prosthesis using 
signals going to the prosthesis may come from the artificial reflex control [104]. 
myoelectric activity of the muscle which formerly 

controlled the lost limb [23,24,106,107]. Therefore, in some sense, the brain's own signals are used to 
control the motion of the prosthesis [ 109] . Although the validity of myoelectric control-based techniques 
seems reasonable, in practice the performances are somewhat disappointing. There are a number of 
reasons for this. In the cases of high level amputation, the electrical activity of the remaining muscles 
have only minor correlation with the normal arm movements and this makes coordinated control of 
several functions very difficult. A more important problem is that any EMG controlled prosthesis is 
inherently open-loop (preprogrammed) [110]. The absence of position proprioception and other related 
problems of reliability in executing movements [111] has contributed to the poor success rate of myo- 
electrically controlled prostheses. 

Presently two systems based on myoelectric control are used with success (Fig. 142.4). The Otto Bock 
myoelectric hand [120] uses the technique of detection of the fixed threshold of the integrated rectified 
surface electromyography. Extension/flexion of the targeted muscle group serves as the on/off control, 
which causes opening and closing of the hand, respectively. Two sets of electrodes are used to detect this 
differential signal. The Utah arm [107,108] utilizes the myoelectric signals from two antagonists, as the 
proportional elbow control (Fig. 142.5). These signals are full wave rectified and differentiated to give 
the command signal. The Utah arm has a wrist rotator and it can be combined with different grasping 
systems. Most of the artificial hands used in everyday activities do not include the five finger dexterity, 
except in the devices that are mechanically preprogrammed. A new two-degree of freedom hand prosthesis 
with hiearachical grip was designed by Kyberd [ 1 1 2 ] . A new appealing version of a controller was designed 
by Kurtz [113]. 
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FIGURE 142.4 The Waseda University prototype of a myoelectric artificial hand. 




s 






FIGURE 142.5 The Utah arm [ 107] . 

An interesting approach to overcoming the problems encountered with EMG control has been pro- 
posed by Simpson [110]. The control strategy is termed "extended physiological proprioception (EPP)", 
and uses the positions of intact joints as the controlling input signals to a prosthesis. This strategy is 
based on recognition of the fact that intact joints possess inherent position feedback, and that most goal 
oriented tasks are highly synergistic [32,33]. By establishing a one-to-one relationship between the 
position of the intact joint and the position in the space of the terminal device (prosthesis), the natural 
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feedback of the intact joint can be "extended" to the prosthesis, and thereby provide it with proprioceptive 
information. Based on this idea, an externally controlled elbow-wrist-hand prosthesis was evaluated 
[110]. The shoulder joint position is measured with a potentiometer, mounted between the upper arm 
and trunk. An important feature of the system is that disabled humans can learn how to control the 
system within minutes using trial-and-error procedure. Bertos [114] presented a very elegant solution 
for position control based on EPP. 

Defining Terms 

Functional electrical stimulation or Functional neuromuscular stimulation: Patterned electrical 
stimulation of neuromuscular structures dedicated to restore motor functions. 

Externally controlled assistive system: Assistive system for restoration of motor functions with auto- 
matic control. 

Externally powered assistive system: Assistive system for restoration of motor functions which uses 
external power to either stimulate muscles or drive actuators. 

Hybrid assistive systems: Combination of a functional electrical stimulation and a mechanical orthosis. 

Myoelectric (EMG) control: Use of voluntary generated myoelectric activity as control signals for an 
externally controlled and powered assistive system. 

Artificial reflex control: A sensory-driven control algorithm based on knowledge representation (pro- 
duction rule based system). 

Reciprocating gait orthosis: A walking and standing assistive system with a reciprocating mechanism 
for hip joints which extends the contralateral hip when the ipsilateral hip is flexed. 

Self-fitting modular orthosis: A modular, self- fitting, mechanical orthosis with a soft interface between 
the human body and the orthosis. 
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This chapter will consider methods and devices used to present visual, auditory, and tactual (touch) 
information to persons with sensory deficits. Sensory augmentation systems such as eyeglasses and hearing 
aids enhance the existing capabilities of a functional human sensory system. Sensory substitution is the 
use of one human sense to receive information normally received by another sense. Braille and speech 
synthesizers are examples of systems that substitute touch and hearing, respectively, for information that 
is normally visual (printed or displayed text). 

The following three sections will provide theory and examples for aiding the visual, auditory, and 
tactual systems. Because capitalizing on an existing sensory capability is usually superior to substitution, 
each section will consider first augmentation and then substitution, as shown below: 

Human Sensory Systems 



Visual 



Auditory 



Tactual 



Visual augmentation 
Tactual vision substitution 
Auditory vision substitution 



Auditory augmentation 
Visual auditory substitution 
Tactual auditory substitution 



Tactual augmentation 
Tactual substitution 



143.1 Visual System 



With a large number of receptive channels, the human visual system processes information in a parallel 
fashion. A single glimpse acquires a wealth of information; the field of view for two eyes is 180 degrees 
horizontally and 120 degrees vertically [Mehr & Shindell, 1990]. The spatial resolution in the central 
(foveal) part of the visual field is approximately 0.5 to 1.0 minute of arc [Shlaer, 1937] , although Vernier 
acuity, the specialized task of detecting a misalignment of two lines placed end to end, is much finer, 
approximately 2 seconds of arc [Stigmar, 1970]. Low-contrast presentations substantially reduce visual 
acuity. 
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The former resolution figure is the basis for the standard method of testing visual acuity, the Snellen 
chart. Letters are considered to be "readable" if they subtend approximately 5 minutes of arc and have 
details one-fifth this size. Snellen's 1862 method of reporting visual performance is still used today. The 
ratio 20/40, for instance, indicates that a test was conducted at 20 ft and that the letters that were 
recognizable at that distance would subtend 5 minutes of arc of 40 ft (the distance at which a normally 
sighted, or "20/20," subject could read them). Although the standard testing distance is 20 ft, 10 ft and 
even 5 ft may be used, under certain conditions, for more severe visual impairments [Fonda, 1981]. 

Of the approximately 6 to 1 1.4 million people in the United States who have visual impairments, 90% 
have some useful vision [NIDRR, 1993]. In the United States, severe visual impairment is defined to be 
20/70 vision in the better eye with best refractive correction (see below). Legal blindness means that the 
best corrected acuity is 20/200 or that the field of view is very narrow (<20 degrees). People over 65 years 
of age account for 46% of the legally blind and 68% of the severely visually impaired. For those with 
some useful vision, a number of useful techniques and devices for visual augmentation can allow 
performance of many everyday activities. 

Visual Augmentation 

People with certain eye disorders see better with higher- or lower-than-normal light levels; an illuminance 
from 100 to 4000 lux may promote comfortable reading [Fonda, 1981]. Ideal illumination is diffuse and 
directed from the side at a 45-degree angle to prevent glare. The surrounding room is preferably 20% to 
50% darker than the object of interest. 

Refractive errors cause difficulties in focusing on an object at a given distance from the eye [Mount- 
castle, 1980]. Myopia (near-sightedness), hyperopia (far-sightedness), astigmatism (focus depth that 
varies with radial orientation), and presbyopia (loss of ability to adjust focus, manifested as far-sighted- 
ness) are the most common vision defects. These normally can be corrected with appropriate eyeglasses 
or contact lenses and are rarely the cause of a disability. 

Magnification is the most useful form of image processing for vision defects that do not respond to 
refractive correction. The simplest form of image magnification is getting closer; halving the distance to 
an object doubles its size. Magnifications up to 20 times are possible with minimal loss of field of view. 
At very close range, eyeglasses or a loupe may be required to maintain focus [Fonda, 1981]. Hand or 
stand magnifiers held 18 to 40 cm (not critical) from the eye create a virtual image that increases rapidly 
in size as the object-to-lens distance approaches the focal length of the lens. Lenses are rated in diopters 
(D = 1//J where /is the focal length of the lens in centimeters). The useful range is approximately 4 to 
20 D; more powerful lenses are generally held close to the eye as a loupe, as just mentioned, to enhance 
field of view. For distance viewing, magnification of 2 to 10 times can be achieved with hand-held 
telescopes at the expense of a reduced field of view. 

Closed- circuit television (CCTV) systems magnify print and small objects up to 60 times, with higher 
effective magnifications possible by close viewing. Users with vision as poor as 1/400 (20/8000) may be 
able to read ordinary print with CCTV [Fonda, 1981]. Some recent units are portable and contain 
black/white image reversal and contrast enhancement features. 

Electrical (or, more recently, magnetic) stimulation of the visual cortex produces perceived spots of 
light called phosphenes. Some attempts, summarized in Webster et al. [1985], have been made to map 
these sensations and display identifiable patterns, but the phosphenes often do not correspond spatially 
with the specific location on the visual cortex. Although the risk and cost of this technique do not yet 
justify the minimal "vision" obtained, future use cannot be ruled out. 

Tactual Vision Substitution 

With sufficient training, people without useful vision can acquire sufficient information via the tactile 
sense for many activities of daily living, such as walking independently and reading. The traditional long 
cane, for example, allows navigation by transmitting surface profile, roughness, and elasticity to the hand. 
Interestingly, these features are perceived to originate at the tip of the cane, not the hand where they are 
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transduced; this is a simple example of distal attribution [Loomis, 1992]. Simple electronic aids such as 
the hand-held Mowat sonar sensor provide a tactile indication of range to the nearest object. 

Braille reading material substitutes raised-dot patterns on 2.3-mm centers for visual letters, enabling 
reading rates up to 30 to 40 words per minute (wpm). Contracted Braille uses symbols for common 
words and affixes, enabling reading at up to 200 wpm (125 wpm is more typical). 

More sophisticated instrumentation also capitalizes on the spatial capabilities of the tactile sense. The 
Optacon (optical-to- tactile converter) by TeleSensory, Inc. (Mountain View, Calif.) converts the outline 
of printed letters recorded by a small, hand-held camera to enlarged vibrotactile letter outlines on the 
users fingerpad. The camera's field of view is divided into 100 or 144 pixels (depending on the model), 
and the reflected light intensity at each pixel determines whether a corresponding vibrating pin on the 
fingertip is active or not. Ordinary printed text can be read at 28 (typical) or 90 (exceptional) wpm. 

Spatial orientation and recognition of objects beyond the reach of a hand or long cane are the objective 
of experimental systems that convert an image from a television-type camera to a matrix of electrotactile 
or vibrotactile stimulators on the abdomen, forehead, or fingertip. With training, the user can interpret 
the patterns of tingling or buzzing pints to identify simple, high-contrast objects in front of the camera, 
as well as experience visual phenomena such as looming, perspective, parallax, and distal attribution 
[Bach-y-Rita, 1972; Collins, 1985]. 

Access to graphic or spatial information that cannot be converted into text is virtually impossible for 
blind computer users. Several prototype devices have been built to display computer graphics to the 
fingers via vibrating or stationary pins. A fingertip-scanned display tablet with embedded electrodes, 
under development in our laboratory [Kaczmarek et al., 1997], eliminates all moving parts; ongoing tests 
will determine if the spatial performance and reliability are adequate. 

Auditory Vision Substitution 

Electronic speech synthesizers allow access to electronic forms of text storage and manipulation. Until 
the arrival of graphic user interfaces such as those in the Apple Macintosh® and Microsoft Windows® 
computer operating systems, information displayed on computer screens was largely text-based. A num- 
ber of products appeared that converted the screen information to speech at rates of up to 500 wpm, 
thereby giving blind computer users rapid access to the information revolution. Fortunately, much of 
the displayed information in graphic operating systems is not essentially pictorial; the dozen or so 
common graphic features (e.g., icons, scroll bars, buttons) can be converted to a limited set of words, 
which can then be spoken. Because of the way information is stored in these systems, however, the screen- 
to-text conversion process is much more complex, and the use of essentially spatial control features such 
as the mouse await true spatial display methods [Boyd et al., 1990]. 

Automated optical character recognition (OCR) combined with speech synthesis grants access to the 
most common printed materials (letters, office memorandums, bills), which are seldom available in 
Braille or narrated-tape format. First popularized in the Kurzweil reading machine, this marriage of 
technologies is combined with a complex set of lexical, phonetic, and syntactic rules to produce under- 
standable speech from a wide variety of, but not all, print styles. 

Mobility of blind individuals is complicated, especially in unfamiliar territory, by hazards that cannot be 
easily sensed with a long cane, such as overhanging tree limbs. A few devices have appeared that convert 
the output of sonar-like ultrasonic ranging sensors to discriminable audio displays. For example, the 
Wormald Sonicguide uses interaural intensity differences to indicate the azimuth of an object and frequency 
to indicate distance [Cook, 1982]; subtle information such as texture can sometimes also be discriminated. 

143.2 Auditory System 

The human auditory system processes information primarily serially; having at best two receptive chan- 
nels, spatial information must be built up by integration over time. This later capability, however, is 
profound. Out of a full orchestra, a seasoned conductor can pinpoint an errant violinist by sound alone. 
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Human hearing is sensitive to sound frequencies from approximately 16 to 20,000 Hz and is most 
sensitive at 1000 Hz. At this frequency, a threshold root-mean-square pressure of 20 Pa (200 p.bar) can 
be perceived by normally hearing young adults under laboratory conditions. Sound pressure level (SPL) 
is measured in decibels relative to this threshold. Some approximate benchmarks for sound intensity are 
a whisper at 1 m (30 dB), normal conversion at 1 m (60 dB), and a subway train at 6 m (90 dB). Sounds 
increasing from 100 to 140 dB become uncomfortable and painful, and short exposures to a 160-dB level 
can cause permanent hearing impairment, while continuous exposure to sound levels over 90 dB can 
cause slow, cumulative damage [Sataloff et al., 1980]. 

Because hearing sensitivity falls off drastically at lower frequencies, clinical audiometric testing uses 
somewhat different scales. With the DIN/ ANSI reference threshold of 6.5 dB SPL at 1 kHz, the threshold 
rises to 24.5 dB at 250 Hz and 45.5 dB at 125 Hz [Sataloff et al, 1980]. Hearing loss is then specified in 
decibels relative to the reference threshold, rather than the SPL directly, so that a normal audiogram 
would have a flat threshold curve at approximately dB. 

Auditory Augmentation 

Loss of speech comprehension, and hence interpersonal communication, bears the greatest effect on daily 
life and is the main reason people seek medical attention for hearing impairment. Functional impairment 
begins with 21- to 35-dB loss in average sensitivity, causing difficulty in understanding faint speech 
[Smeltzer, 1993]. Losses of 36 to 50 dB and 51 to 70 dB cause problems with normal and loud speech. 
Losses greater than 90 dB are termed profound or extreme and cannot be remedied with any kind of 
hearing aid; these individuals require auditory substitution rather than augmentation. 

Hearing loss can be caused by conduction defects in the middle ear (tympanic membrane and ossicles) 
or by sensorineural defects in the inner ear (cochlear transduction mechanisms and auditory nerve). 
Conduction problems often can be corrected medically or surgically. If not, hearing aids are often of 
benefit because the hearing threshold is elevated uniformly over all frequencies, causing little distortion 
of the signal. Sensorineural impairments differentially affect different frequencies and also cause other 
forms of distortion that cannot be helped by amplification or filtering. The dynamic range is also reduced, 
because while loud sounds (> 100 dB) are often still perceived as loud, slightly softer sounds are lost. 
Looked at from this perspective, it is easy to understand why the amplification and automatic gain control 
of conventional hearing aids do not succeed in presenting the 30-dB or so dynamic range of speech to 
persons with 70+ dB of sensorineural impairment. 

Most hearing aids perform three basic functions. (1) Amplification compensates for the reduced 
sensitivity of the damaged ear. (2) Frequency-domain filtering compensates for hearing loss that is not 
spectrally uniform. For example, most sensorineural loss disproportionately affects frequencies over 
1 kHz or so, so high-frequency preemphasis may be indicated. (3) Automatic gain control (ACG) com- 
presses the amplitude range of desired sounds to the dynamic range of the damaged ear. Typical AGC 
systems respond to loud transients in 2 to 5 ms (attack time) and reduce their effect in 100 to 300 ms 
(recovery time). 

Sophisticated multiband AGC systems have attempted to normalize the ear's amplitude/frequency 
response, with the goal of preserving intact the usual intensity relationships among speech elements. 
However, recent research has shown that only certain speech features are important for intelligibility 
[Moore, 1990]. The fundamental frequency (due to vocal cord vibration), the first and second formants 
(the spectral peaks of speech that characterize different vowels), and place of articulation are crucial to 
speech recognition. In contrast, the overall speech envelope (the contour connecting the individual peaks 
in the pressure wave) is not very important; articulation information is carried in second-formant and 
high-frequency spectral information and is not well-represented in the envelope [Van Tasell, 1993]. 
Therefore, the primary design goal for hearing aids should be to preserve and make audible the individual 
spectral components of speech (formants and high-frequency consonant information). 

The cochlear implant could properly be termed an auditory augmentation device because it utilizes 
the higher neural centers normally used for audition. Simply stated, the implant replaces the function 
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of the (damaged) inner ear by electrically stimulating the auditory nerve in response to sound collected 
by an external microphone. Although the auditory percepts produced are extremely distorted and noise- 
like due to the inadequate coding strategy, many users gain sufficient information to improve their 
lipreading and speech-production skills. The introductory chapter in this section provides a detailed 
discussion of this technology. 

Visual Auditory Substitution 

Lipreading is the most natural form of auditory substitution, requiring no instrumentation and no 
training on the part of the speaker. However, only about one-third to one-half of the 36 or so phonemes 
(primary sounds of human speech) can be reliably discriminated by this method. The result is that 30% 
to 50% of the words used in conversational English look just like, or very similar to, other words 
(homophenes) [Becker, 1972]. Therefore, word pairs such as buried/married must be discriminated by 
grammar, syntax, and context. 

Lipreading does not provide information on voice fundamental frequency or formants. With an 
appropriate hearing aid, any residual hearing (less than 90-dB loss) often can supply some of this missing 
information, improving lipreading accuracy. For the profoundly deaf, technological devices are available 
to supply some or all of the information. For example, the Upton eyeglasses, an example of a cued-speech 
device, provide discrete visual signals for certain speech sounds such as fricatives (letters like / of s, 
containing primarily high-frequency information) that cannot be readily identified by sight. 

Fingerspelling, a transliteration of English alphabet into hand symbols, can convey everyday words at 
up to 2 syllables per second, limited by the rate of manual symbol production [Reed et al., 1990] .American 
Sign Language uses a variety of upper body movements to convey words and concepts rather than just 
individual letters, at the same effective rate as ordinary speech, 4 to 5 syllables per second. 

Closed captioning encodes the full text of spoken words on television shows and transmits the data 
in a nonvisible part of the video signal (the vertical blanking interval). Since July of 1993, all new television 
sets sold in the United States with screens larger than 33 cm diagonal have been required to have built- 
in decoders that can optionally display the encoded text on the screen. Over 1000 hours per week of 
programming is closed captioned [National Captioning Institute, 1994]. 

Automatic speech-recognition technology may soon be capable of translating ordinary spoken dis- 
course accurately into visually displayed text, at least in quiet environments; this may eventually be a 
major boon for the profoundly hearing impaired. Presently, such systems must be carefully trained on 
individual speakers and/or must have a limited vocabulary [Ramesh et al., 1992]. Because there is much 
commercial interest in speech command of computers and vehicle subsystems, this field is advancing 
rapidly. 

Tactual Auditory Substitution 

Tadoma is a method of communication used by a few people in the deaf-blind community and is of 
theoretical importance for the development of tactual auditory substitution devices. While sign language 
requires training by both sender and receiver, in Tadoma, the sender speaks normally. The trained receiver 
places his or her hands on the face and neck of the sender to monitor lip and jaw movements, airflow 
at the lips, and vibration of the neck [Reed et al., 1992]. Experienced users achieve 80% keyword 
recognition of everyday speech at a rate of 3 syllables per second. Using no instrumentation, this the 
highest speech communication rate recorded for any tactual-only communication system. 

Alternatively, tactile vocoders perform a frequency analysis of incoming sounds, similarly to the ear's 
cochlea [Bekesy, 1955], and adjust the stimulation intensity of typically 8 to 32 tactile stimulators 
(vibrotactile or electrotactile) to present a linear spectral display to the user's abdominal or forehead 
skin. Several investigators [Blarney & Clark, 1985; Boothroyd & Hnath-Chisolm, 1988; Brooks 8c Frost, 
1986; Saunders et al., 1981] have developed laboratory and commercial vocoders. Although vocoder users 
cannot recognize speech as well as Tadoma users, research has shown that vocoders can provide enough 
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"auditory" feedback to improve the speech clarity of deaf children and to improve auditory discrimination 
and comprehension in some older patients [Szeto & Riso, 1990] and aid in discrimination of phonemes 
by lipreading [Hughes, 1989; Rakowski et al., 1989]. An excellent review of earlier vocoders appears in 
Reed et al. [1982]. The most useful information provided by vocoders appears to be the second-formant 
frequency (important for distinguishing vowels) and position of the high-frequency plosive and fricative 
sounds that often delineate syllables [Bernstein et al., 1991]. 

143.3 Tactual System 

Humans receive and combine two types of perceptual information when touching and manipulating 
objects. Kinesthetic information describes the relative positions and movements of body parts as well as 
muscular effort. Muscle and skin receptors are primarily responsible for kinesthesis; joint receptors serve 
primarily as protective limit switches [Rabischong, 1981]. Tactile information describes spatial pressure 
patterns on the skin given a fixed body position. Everyday touch perception combines tactile and kines- 
thetic information; this combination is called tactual or haptic perception. Loomis and Lederman [1986] 
provide an excellent review of these perceptual mechanisms. 

Geldard [1960] and Sherrick [1973] lamented that as a communication channel, the tactile sense is 
often considered inferior to sight and hearing. However, the tactile system possess some of the same 
spatial and temporal attributes as both of the "primary" senses [Bach-y-Rita, 1972]. With over 10,000 
parallel channels (receptors) [Collins & Saunders, 1970], the tactile system is capable of processing a 
great deal of information if it is properly presented. 

The human kinesthetic and tactile senses are very robust and, in the case of tactile, very redundant. 
This is fortunate, considering their necessity for the simplest of tasks. Control of movement depends on 
kinesthetic information; tremors and involuntary movements can result from disruption of this feedback 
control system. Surgically repaired fingers may not have tactile sensation for a long period or at all, 
depending on the severity of nerve injuries; it is known that insensate digits are rarely used by patients 
[Tubiana, 1988]. Insensate fingers and toes (due to advanced Hansen's disease or diabetes) are often 
injured inadvertently, sometimes requiring amputation. Anyone who has had a finger numbed by cold 
realizes that it can be next to useless, even if the range of motion is normal. 

The normal sensitivity to touch varies markedly over the body surface. The threshold forces in dynes 
for men (women) are lips, 9 (5); fingertips, 62 (25); belly 62 (7); and sole of foot, 343 (79) [Weinstein, 
1968]. The fingertip threshold corresponds to 10-p.m indentation. Sensitivity to vibration is much higher 
and is frequency-and area-dependent [Verillo, 1985]. A 5-cm 2 patch of skin on the palm vibrating at 
250 Hz can be felt at 0.16-p.m amplitude; smaller areas and lower frequencies require more displacement. 
The minimal separation for two nonvibrating points to be distinguished is 2 to 3 mm on the fingertips, 
17 mm on the forehead, and 30 to 50 mm on many other locations. However, size and localization 
judgments are considerably better than these standard figures might suggest [Vierck & Jones, 1969]. 

Tactual Augmentation 

Although we do not often think about it, kinesthetic information is reflected to the user in many types 
of human-controlled tools and machines, and lack of this feedback can make control difficult. For 
example, an automobile with power steering always includes some degree of "road feel" to allow the 
driver to respond reflexively to minor bumps and irregularities without relying on vision. Remote-control 
robots (telerobots) used underwater or in chemical- or radiation-contaminated environments are slow 
and cumbersome to operate, partly because most do not provide force feedback to the operator; such 
feedback enhances task performance [Hannaford & Wood, 1992]. 

Tactile display of spatial patterns on the skin uses three main types of transducers [Kaczmarek & Bach- 
y-Rita, 1995; Kaczmarek et al., 1991]. Static tactile displays use solenoids, shape-memory alloy actuators, 
and scanned air or water jets to indent the skin. Vibrotactile displays encode stimulation intensity as the 
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amplitude of a vibrating skin displacement (10 to 500 Hz); both solenoids and piezoelectric transducers 
have been used. Electrotactile stimulation uses 1- to 100-mm 2 -area surface electrodes and careful wave- 
form control to electrically stimulate the afferent nerves responsible for touch, producing a vibrating or 
tingling sensation. 

Tactile rehabilitation has received minimal attention in the literature or medical community. One 
research device sensed pressure information normally received by the fingertips and displayed it on the 
forehead using electrotactile stimulation [Collins & Madey, 1974]. Subjects were able to estimate surface 
roughness and hardness and detect edges and corners with only one sensor per fingertip. Phillips [1988] 
reviews prototype tactile feedback systems that use the intact tactile sense to convey hand and foot 
pressure and elbow angle to users of powered prosthetic limbs, often with the result of more precise 
control of these devices. 

Slightly more attention has been given to tactile augmentation in special environments. Astronauts, 
for example, wear pressurized gloves that greatly diminish tactile sensation, complicating extravehicular 
repair and maintenance tasks. Efforts to improve the situation range from mobile tactile pins in the 
fingertips to electrotactile stimulation on the abdomen of the information gathered from fingertip sensors 
[Bach-y-Rita et al, 1987]. 

Tactual Substitution 

Because of a paucity of adequate tactual display technology, spatial pressure information from a robot 
or remote manipulator is usually displayed to the operator visually. A three-dimensional bar graph, for 
example, could show the two-dimensional pressure pattern on the gripper. While easy to implement, 
this method suffers from two disadvantages: ( 1 ) the visual channel is required to process more informa- 
tion (it is often already heavily burdened), and (2) reaction time is lengthened, because the normal human 
tactual reflex systems are inhibited. An advantage of visual display is that accurate measurements of force 
and pressure may be displayed numerically or graphically. 

Auditory display of tactual information is largely limited to warning systems, such as excessive force 
on a machine. Sometimes such feedback is even inadvertent. The engine of a bulldozer will audibly slow 
down when a heavy load is lifted; by the auditory and vibratory feedback, the operator can literally "feel" 
the strain. 

The ubiquity of such tactual feedback systems suggests that the human-machine interface on many 
devices could benefit from intentionally placed tactual feedback systems. Of much current interest is the 
virtual environment, a means by which someone can interact with a mathematic model of a place that 
may or may not physically exist. The user normally controls the environment by hand, head, and body 
movements; these are sensed by the system, which correspondingly adjusts the information presented 
on a wide-angle visual display and sometimes also on a spatially localized sound display. The user often 
describes the experience as "being there," a phenomenon known as telepresence [Loomis, 1992]. One can 
only imagine how much the experience could be enhanced by adding kinesthetic and tactile feedback 
[Shimoga, 1993], quite literally putting the user in touch with the virtual world. 

Defining Terms 

Distal attribution: The phenomenon whereby events are normally perceived as occurring external to 
our sense organs — but also see Loomis' [1992] engaging article on this topic. The environment or 
transduction mechanism need not be artificial; for example, we visually perceive objects as distant 
from our eyes. 

Electrotactile: Stimulation that evokes tactile (touch) sensations within the skin at the location of the 
electrode by passing a pulsatile, localized electric current through the skin. Information is delivered 
by varying the amplitude, frequency, etc. of the stimulation waveform. Also called electrocutaneous 
stimulation. 
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Illuminance: The density of light falling on a surface, measured in lux. One lux is equivalent to 0.0929 
foot-candles, an earlier measure. Illuminance is inversely proportional to the square of the distance 
form a point light source. A 100-W incandescent lamp provides approximately 1280 lux at a 
distance of 1 ft (30.5 cm). Brightness is a different measure, depending also on the reflectance of 
the surrounding area. 

Kinesthetic perception: Information about the relative positions of and forces on body parts, possibly 
including efference copy (internal knowledge of muscular effort). 

Sensory augmentation: The use of devices that assist a functional human sense; eyeglasses are one 
example. 

Sensory substitution: The use of one human sense to receive information normally received by another 
sense. For example, Braille substitutes touch for vision. 

Sound pressure level (SPL): The root-mean-square pressure difference from atmospheric pressure 
(=100 kPa) that characterizes the intensity of sound. The conversion SPL = 20 log (P/P ) expresses 
SPL in decibels, where P is the threshold pressure of approximately 20 Pa at 1 kHz. 

Static tactile: Stimulation that is a slow local mechanical deformation of the skin. It varies the defor- 
mation amplitude directly rather than the amplitude of vibration. This is "normal touch" for 
grasping objects, ect. 

Tactile perception: Information about spatial pressure patterns on the skin with a fixed kinesthetic 
position. 

Tactual (haptic) perception: The seamless, usually unconscious combination of tactile and kinesthetic 
information; this is "normal touch." 

Vibrotactile: Stimulation that evokes tactile sensations using mechanical vibration of the skin, typically 
at frequencies of 10 to 500 Hz. Information is delivered by varying the amplitude, frequency, etc. 
of the vibration. 

Virtual environment: A real-time interactive computer model that attempts to display visual, auditory, 
and tactual information to a human user as if he or she were present at the simulated location. 
The user controls the environment with head, hand, and body motions. A airplane cockpit simu- 
lator is one example. 
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Further Information 

Presence: Teleoperators and Virtual Environments is a bimonthly journal focusing on advanced human- 
machine interface issues. In an effort to develop tactile displays without moving parts, our laboratory 
has demonstrated simple pattern recognition on the fingertip [Kaczmarek et al., 1997] and tongue [Bach- 
y-Rita et al., 1998] using electrotactile stimulation. 

The Trace Research and Development Center, Madison, Wise, publishes a comprehensive resource 
book on commercially available assistive devices, organizations, etc. for communication, control, and 
computer access for individuals with physical and sensory impairments. 

Electronic Devices for Rehabilitation, edited by J. G. Webster (Wiley, 1985), summarizes the technologic 
principles of electronic assistive devices for people with physical and sensory impairments. 
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The inability to express oneself through either speech or writing is perhaps the most limiting of physical 
disabilities. Meaningful participation in life requires the communication of basic information, desires, 
needs, feelings, and aspirations. The lack of full interpersonal communication substantially reduces an 
individual's potential for education, employment, and independence. 

Today, through multidisciplinary contributions, individuals who cannot speak or write effectively have 
access to a wide variety of techniques, therapies and systems designed to ameliorate challenges to verbal 
communication. The field of augmentative and alternative communication (AAC) consists of many 
different professions, including speech-language pathology, regular and special education, occupational 
and physical therapy, engineering, linguistics, technology, and others. Experienced professionals now can 
become certified as Assistive Technology practitioners and/or suppliers (Minkel, 1996). Individuals who 
rely on AAC, as well as their families and friends, also contribute to the field (Slesaransky-Poe, 1998). 

Engineering plays a significant role in the development of the field of AAC and related assistive 
technology. Engineering contributions range from relatively independent work on product definition 
and design to the collaborative development and evaluation of tools to support the contributions of other 
professions, such as classical and computational linguistics and speech-language pathology. 

Augmentative communication can be classified in a variety of ways ranging from unaided communi- 
cation techniques, such as gestures, signs, and eye pointing, to highly sophisticated electronic devices 
employing the latest technology. This chapter focuses on the review of technology-aided techniques and 
related issues. 

Technology-based AAC systems have taken two forms: hardware designed specifically for this appli- 
cation and software that runs on mass market computer hardware. Three basic components comprise 
AAC systems. These are the language representation method (including acceleration techniques), the 
user interface, and the outputs. Generally, a multidisciplinary team evaluates the current and projected 
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skills and needs of the individual, determines the most effective language representation method(s) and 
physical access technique(s), and then selects a system with characteristics that are a good match. 

144.2 Language Representation Methods 
and Acceleration Techniques 

The ultimate goal of AAC intervention is functional, interactive communication. The four purposes that 
communication fulfills are (1) communication of needs/wants, (2) information transfer, (3) social close- 
ness, and (4) social etiquette (Light, 1988). From the perspective of the person who uses AAC, commu- 
nicative competence involves the ability to transmit messages efficiently and effectively in all four of the 
interaction categories, based on individual interests, circumstances, and abilities (Buekelman & Miranda, 
1998). To achieve communication competence, the person using an AAC system must have access to 
language representation methods capable of handling the various vocabulary and message construction 
demands of the environment. Professionals rely on the theoretical models of language development and 
linguistics to evaluate the effectiveness of an AAC language representation method. 

Language is defined as an abstract system with rules governing the sequencing of basic units (sounds, 
morphemes, words, and sentences), and rules governing meaning and use (McCormick & Schiefelbusch, 
1990). An individual knows a language when he or she understands and follows its basic units and rules. 
Knowledge of language requires both linguistic competence (understanding the rules), and linguistic 
performance (using these rules). Most language models identify the basic rules as phonology, semantics, 
morphology, syntax and pragmatics. The basic rules of language apply to AAC. For example, AAC research 
on vocabulary use has documented the phenomenon of core and fringe vocabulary (Yorkston et al., 1988) 
and the reliance on a relatively limited core vocabulary to express a majority of communication utterances 
(Vanderheiden & Kelso, 1987). Research on conversations has documented topic and small talk patterns 
(Stuart et al., 1993; King et al., 1995). An awareness of how a given AAC system handles these basic rules 
enables one to be critical of AAC language representation methods. 

In addition to the language representation method, the acceleration technique(s) available in an AAC 
system contribute(s) to communication competence. Communication by users of AAC systems is far 
slower than that of the general population. Yet the speed of communication is a significant factor 
influencing the perceptions of the user's communication partner, and potential for personal achievement 
for the person relying on AAC. The development and application of techniques to accelerate communi- 
cation rates is critical. Further, these techniques are most effective when developers pay attention to 
human factors design principles (Goodenough-Trepagnier, 1994). 

Alphabet-based language representation methods involve the use of traditional orthography and accel- 
eration techniques that require spelling and reading skills. AAC systems using orthography require the 
user to spell each word using a standard keyboard or alphabet overlay on a static or dynamic display. A 
standard or customized alphabet overlay provides use for all the rules and elements of a natural language; 
however, spelling letter-by-letter is a slow and inefficient AAC strategy without acceleration techniques. 

Abbreviation systems represent language elements using a number of keystrokes typically smaller than 
that required by spelling. For example, words or sentences are abbreviated using principled approaches 
based on vowel elimination or the first letters of salient words. Abbreviation systems can be fast, but 
require not only spelling and reading skills, but memory of abbreviation codes. Typically, people with 
spelling and reading skills have large vocabulary needs and increased demands on production of text. 
Demasco (1994) offers additional background and proposes some interesting work in this area. 

Word prediction is another acceleration technique available from many sources. Based on previously 
selected letters and words, the system presents the user with best guess choices for completing the spelling 
of a word. The user then chooses one of the predictions or continues spelling, resulting in yet another 
set of predictions. Prediction systems have demonstrated a reduction in the number of keystrokes, but 
recent research (Koester and Levine, 1994) reports that the actual communication rate does not represent 
a statistical improvement over spelling. The reason for word predication's failure to improve rate is that 
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increased time is needed to read and select the word table choices. The cost of discontinuity and increased 
cognitive load in the task seems to match the benefits of reduced keystrokes. 

Picture Symbol-based language representation methods involve the use of graphic or line drawn symbols 
to represent single word vocabulary or messages (phrases, sentences, and paragraphs). A variety of AAC 
symbol sets are available on devices or software depict the linguistics elements available through the 
system. Picture Communication Symbols (PCS), DynaSyms, and Blissymbols are popular symbol sets 
used in either dedicated or computer-aided systems. One taxonomy differentiates symbols according to 
several subordinate levels including static/dynamic, iconic/opaque, and set/system (Fuller, et.al., 1998) 

Universal considerations regarding the selection of a symbol set include research on symbol charac- 
teristics such as size, transparency, complexity and iconicity (Fuller et al., 1997, Romski & Sevcik, 1988). 
Understanding symbol characteristics is necessary to make clinical decisions about vocabulary organiza- 
tion and system selection. The choice of picture communication language representation methods is 
facilitated when teams use a goal-driven graphic symbol selection process (Schlosser et al., 1996). For 
example, identification of vocabulary and language outcomes assists the selection of an AAC graphic 
symbol system. 

Vocabulary size and representation of linguistic rules (grammar) are concerns for users relying on 
graphic symbol sets. Users of static display systems have a limited number of symbols available on any 
one overlay; however, they have ready access to that vocabulary. Dynamic display users have an almost 
unlimited number of symbols available as vocabulary for message generation; however, they must navigate 
through pages or displays to locate a word. Frequently, morphology and syntax are not graphically 
represented. Since research has strongly supported the need for users to construct spontaneous, novel 
utterances (Beukelman et al., 1984), neither method is efficient for interactive communication. 

Semantic Compaction or Minspeak is perhaps the most commonly used AAC language representation 
method (Baker, 1994). With this method, language is represented by a relatively small set of multi- 
meaning icons. The specific meaning of each icon is a function of the context in which it is used. Semantic 
compaction makes use of a meaningful relationship between the icon and the information it represents; 
it does not require spelling and reading skills, and yet is powerful even for people with these skills. The 
performance of Minspeak stems from its ability to handle both vocabulary and linguistic structures as 
found in the Minspeak Application Programs (MAPS) of Words Strategy and Unity. Both MAPS support 
the concept of a core and fringe vocabulary. They provide the architecture for handling rules of grammar 
and morphology. The number of required keystrokes is reduced relative to spelling. 

Predictive selection is an acceleration technique used with Minspeak. When this feature is enabled, 
only those choices that complete a meaningful sequence can be selected. With scanning, for example, 
the selection process can be significantly faster because the number of possible choices is automatically 
reduced. 

144.3 User Interface 

Most AAC systems employ a user interface based on the selection of items that will produce the desired 
output (Vanderheiden and Lloyd, 1986). Items being selected may be individual letters, as used in spelling, 
or whole words or phrases expressing thoughts, or symbols that represent vocabulary. The numerous 
techniques for making selections are based on either direct selection or scanning. 

Direct selection refers to techniques by which a single action from a set of choices indicates the desired 
item. A common example of this method is the use of a computer keyboard. Each key is directly selected 
by finger. Expanded keyboards accommodate more gross motor actions, such as using the fist or foot. 
In some cases, pointing can be enhanced through the use of technology. Sticks are held in the mouth or 
attached to the head using a headband or helmet. Light pointers are used to direct a light beam at a 
target. Indirect pointing systems might include the common computer mouse, trackball, or joystick. 
Alternatives to these for people with disabilities are based on the movement of the head or other body 
part. Figure 144.1 depicts direct selection in that the desired location is pointed to directly. 
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FIGURE 144.1 Direct selection. 



Scanning refers to techniques in which the individual is presented 
with a time sequence of choices and indicates when the desired choice 
appears. A simple linear scanning system might be a clock face type 
display with a rotating pointer to indicate a letter, word, or picture. 
Additional dimensions of scanning can be added to reduce the selection 
time when the number of possible choices is larger. A common tech- 
nique involves the arrangement of choices in a matrix of rows and 
columns. The selection process has two steps. First, rows are scanned 

to select the row containing the desired element. The selected row is then scanned to select the desired 
element. This method is called row-column scanning. See Fig. 144.2. Either by convention or by the 
grouping of the elements, the order might be reversed. For example, in the U.K., column-row scanning 
is preferred over row-column scanning. Other scanning techniques also exist and additional dimensions 
can be employed. 

Both direct selection and scanning are used to select elements that might not of themselves define an 
output. In these cases the output may be defined by a code of selected elements. A common example is 
Morse code by which dots and dashes are directly selected but must be combined to define letters and 
numbers. Another example, more common to AAC, has an output defined by a sequence of two or more 
pictures or icons. 

Scanning, and to some degree direct selection, can be faster when the choices are arranged such that 
those most frequently used are easiest to access. For example, in a row-column scanning spelling system 
that scans top to bottom and left to right, the most frequently used letters are usually grouped toward 
the upper left corner. 

Generally the selection technique of choice should be that which results in the fastest communication 
possible. Consideration of factors such as cognitive load (Cress and French, 1994), environmental changes, 
fatigue, aesthetics, and stability of physical skill often influence the choice of the best selection technique. 
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FIGURE 144.2 Row-column scanning. 
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144.4 Outputs 

AAC system outputs in common use are speech, displays, printers, beeps, data, infrared, and other control 
formats. Outputs are used to facilitate the interactive nature of "real time" communication. Users may 
rely on auditory and visual feedback to enhance vocabulary selection and the construction of messages. 
The speech and display outputs may be directed toward the communication partner to support the 
exchange of information. Auditory feedback such as beeps and key clicks, while useful to the individual 
using the system, also provide the communication partner with the pragmatic information that a user 
is in the process of generating a message. Finally, outputs may be used to control other items or devices 
such as printers. 

AAC speech output normally consists of two types: synthetic and digitized. 

Synthetic speech usually is generated from text input following a set of rules. Synthetic speech is usually 
associated with AAC systems that are able to generate text. These systems have unlimited vocabulary and 
are capable of speaking any word or expression that can be spelled. Commonly used synthetic speech 
systems like DECtalk™ offer a variety of male, female, and child voices. Intelligibility has improved over 
recent years to the point that it is no longer a significant issue. With some systems it is actually possible 
to sing songs, again a feature that enhances social interaction. Most synthetic speech systems are limited 
to a single language, although bilingual systems are available. Further limitations relate to the expression 
of accent and emotion. Research and development in artificial speech technology is attacking these 
limitations. 

Digitized speech is essentially speech that has been recorded into digital memory. Relatively simple 
AAC systems typically use digitized speech. The vocabulary is entered by speaking into the system through 
a microphone. People who use these systems can say only what someone else said in programming them. 
They are independent of language and can replicate the song, accent, and emotion of the original speaker. 

RS-232c serial data output is used to achieve a variety of functional outcomes. Serial output permits 
the AAC system to replace the keyboard and mouse for computer access, a procedure known as emulation. 
The advantage of emulation is that the language representation method and physical access method used 
for speaking is used for writing and other computer tasks (Buning and Hill, 1998). Another application 
for the serial output is environmental control. Users are able to operate electrical and electronic items 
in their daily-living surroundings. Particular sequences of characters, symbols or icons are used to 
represent commands such as answering the telephone, turning off the stereo, and even setting a thermo- 
stat. In addition, the serial output also may be used to monitor the language activity for purposes of 
clinical intervention, progress reporting, and research. 

Infrared communication now is available in many AAC devices. Infrared output supports the same 
functions as the RS-232c serial data output, but without requiring direct wiring or linking between 
operating devices. Infrared interfaces providing computer access improve the independence of AAC users 
because no physical connection needs to be manipulated to activate the system. Infrared interfaces can 
perform as environmental control units by learning the codes of entertainment and other electronic 
systems. 

144.5 Outcomes, Intervention and Training 

The choice of an AAC system should rely primarily on the best interests of the person who will be using 
the system. Since outcomes and personal achievement will be related directly to the ability to commu- 
nicate, the choice of a system will have lifelong implications. The process of choosing the most appropriate 
system is not trivial and can be accomplished best with a multidisciplinary team focusing on outcomes. 
Team members should realize that the interests of the individual served professionally are not necessarily 
aligned with those of the providers and/or payers of AAC services. The temptation to select a system that 
is easy to apply or inexpensive to purchase frequently exists, especially with untrained teams. Teams that 
identify outcomes and have the goal of achieving interactive communication make informed decisions. 
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FIGURE 144.3 

Wooster, OH. 



Individual using an electronic communication system. Photo courtesy of Prentke Romich Company, 



Following the selection (and funding) of a system, the next step is the actual intervention program. 
For successful outcomes to be achieved, intervention must go beyond the technical use of the AAC system 
and include objectives for language development and communication pragmatics. Early in the history 
of AAC, Rodgers postulates (1984) that AAC systems are more like tools than appliances. To be effective, 
there is a need for much more than simply "plugging them in and turning them on." The individual who 
relies on AAC must develop the skills to become a communication craftsperson. In general, as well as 
specific to AAC, the fastest, most efficient use of a system occurs when the individual operates from 
knowledge that is in the head, rather than knowledge that must be gathered from the world (Norman, 
1980). The implication for AAC is that an intervention program must include a significant training 
component to assure that the needed knowledge is put into the head (Romich, 1994). Further, a drill 
component of training develops automaticity (Treviranus, 1994). 

Perhaps the single factor limiting the widespread use of assistive technology in general and AAC in 
particular is the lack of awareness of its availability and the impact it can have on the lives of the people 
who could benefit (Romich, 1993). This situation exists within not only the general public but also in 
many of the professions providing services to this population. Training opportunities continue to be limited 
for a majority of professionals working with persons who could benefit from AAC technology. University 
programs have lagged behind in their integration of this information into professional curricula. 

Intervention and training should extend beyond the initial application of the system. People who rely 
on AAC frequently live unstable lives. Consequently they have a need for on-going services. For people 
with congenital (from birth) conditions, developmental delays may have occurred that result in educa- 
tional and training needs past the normal age of public school services. For people with progressive 
neurological disorders, such as Lou Gehrig's disease, the physical skill level will change and additional 
accommodations will need to be evaluated and implemented. 

144.6 Future 

Technological development in and of itself will not solve the problems of people with disabilities. 
Technology advancements, however, will continue to provide more powerful tools allowing the explora- 
tion of new and innovative approaches to support users and professionals. As access to lower cost, more 
powerful computer systems increases, the availability of alternative sources of information, technical 
support and training is possible either through software or the internet. Users and professionals will have 
increased opportunities to learn and exchange ideas and outcomes. 
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A related development is the collection and analysis of data describing the actual long term use of 
AAC devices. Currently, clinicians and researchers have not utilized this information in the clinical 
intervention process, progress reporting, or research. Tools for monitoring, editing, and analyzing lan- 
guage activity are just now becoming available (Hill and Romich, 1998). 
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Further Information 

There are a number of organizations and publications that relate to AAC. 

AAC (Augmentative and Alternative Communication) is the quarterly refereed journal of ISAAC. It is 
published by Decker Periodicals Inc., PO Box 620, L.C.D. 1, Hamilton, Ontario, L8N 3K7 CANADA, Tel. 
905-522-7017, Fax. 905-522-7839. http://www.isaac-online.org 

ACOLUG (Augmentative Communication On-Line User Group) is a listserve with primary partici- 
pants being people who rely on AAC. Topics include a wide range of issues of importance to this 
population, their relatives, and friends, and those who provide AAC services, http://nimbus.ocis. tem- 
ple. edu/~kcohen/listserv/homeacolug.html 

American Speech-Language-Hearing Association (ASHA) is the professional organization of speech- 
language pathologists. ASHA has a Special Interest Division on augmentative communication. ASHA, 
10801 Rockville Pike, Rockville, MD, 20852, Tel. 301-897-5700, Fax. 301-571-0457. http://www.asha.org. 

Augmentative Communication News is published bi-monthly by Augmentative Communication, Inc., 
1 Surf Way, Suite #215, Monterey, CA, 93940, Tel. 408-649-3050, Fax. 408-646-5428. 

CAMA (Communication Aid Manufacturers Association) is an organization of manufacturers of AAC 
systems marketed in North America. CAMA, 518-526 Davis St., Suite 211-212, Evanston, IL, 60201, Tel. 
800-441-2262, Fax. 708-869-5689. http://www.aacproducts.org. 

Communicating Together is published quarterly by Sharing to Learn as a means of sharing the life 
experiences and communication systems of augmentative communicators with other augmentative com- 
municators, their families, their communities, and those who work with them. Sharing to Learn, PO Box 
986, Thornhill, Ontario, L3T 4A5, CANADA, Tel. 905-771-1491, Fax. 905-771-7153. http://www.isaac- 
online.org. 

Communication Outlook is an international quarterly addressed to the community of individuals 
interested in the application of technology to the needs of persons who experience communication 
handicaps. It is published by the Artificial Language Laboratory, Michigan State University, 405 Computer 
Center, East Lansing, MI, 48824-1042, Tel. 517-353-0870, Fax. 517-353-4766. http://www.msu.edu/~art- 
lang/CommOut.html. 

Trace Research & Development Center, University of Wisconsin — Madison, http://www.trace.wisc.edu. 

ISAAC is the International Society for Augmentative and Alternative Communication). USSAAC is 
the United States chapter. Both can be contacted at PO Box 1762 Station R, Toronto, Ontario, M4G 4A3, 
CANADA, Tel. 905-737-9308, Fax 905-737-0624. http://www.isaac-online.org. 
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munication and Computer Applications. RESNA, 1700 North Moore Street, Suite 1540, Arlington, VA, 
22209-1903, Tel. 703-524-6686. 



© 2000 by CRC Press LLC 



Kondraske, G. V. "Measurement Tools and Processes in Rehabilitation Engineering." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



145 



Measurement Tools 

and Processes in 

Rehabilitation 

Engineering 



145.1 Fundamental Principles 

Structure, Function, Performance, and Behavior ■ 
Subjective and Objective Measurement Methods • 
Measurements and Assessments 

145.2 Measurement Objectives and Approaches 
Characterizing the Human System and Its Subsystems • 
Characterizing Tasks • Characterizing Assistive Devices 
Characterizing Overall Systems in High-Level-Task 
Situations 

145.3 Decision-Making Processes 

145.4 Current Limitations 

George V. Kondraske Quality of Measurements • Standards • Rehabilitation 

University of Texas at Arlington Service Delivery and Rehabilitation Engineering 



In every engineering discipline, measurement facilitates the use of structured procedures and decision- 
making processes. In rehabilitation engineering, the presence of "a human," the only or major component 
of the system of interest, has presented a number of unique challenges with regard to measurement. This 
is especially true with regard to the routine processes of rehabilitation that either do or could incorporate 
and rely on measurements. This, in part, is due to the complexity of the human system's architecture, 
the variety of ways in which it can be adversely affected by disease or injury, and the versatility in the 
way it can be used to accomplish various tasks of interest to an individual. 

Measurement supports a wide variety of assistive device design and prescription activities undertaken 
within rehabilitation engineering (e.g., Leslie and Smith [1990], Webster et al. [1985], and other chapters 
within this section). In addition, rehabilitation engineers contribute to the specification and design of 
measurement instruments that are used primarily by other service providers (such as physical and 
occupational therapists). As measurements of human structure, performance, and behavior become more 
rigorous and instruments used have taken advantage of advanced technology, there is also a growing role 
for rehabilitation engineers to assist these other medical professionals with the proper application of 
measurement instruments (e.g., for determining areas that are most deficient in an individual's perfor- 
mance profile, objectively documenting progress during rehabilitation, etc.). This is in keeping with the 
team approach to rehabilitation that has become popular in clinical settings. In short, the role of 
measurement in rehabilitation engineering is dynamic and growing. 
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In this chapter, a top-down overview of measurement tools and processes in rehabilitation engineering 
is presented. Many of the measurement concepts, processes, and devices of relevance are common to 
applications outside the rehabilitation engineering context. However, the nature of the human population 
with which rehabilitation engineers must deal is arguably different in that each individual must be 
assumed to be unique with respect to at least a subset of his or her performance capacities and/or 
structural parameters; i.e., population reference data cannot be assumed to be generally applicable. While 
there are some exceptions, population labels frequently used such as "head-injured" or "spinal cord- 
injured" represent only a gross classification that should not be taken to imply homogeneity with regard 
to parameters such as range of motion, strength, movement speed, information processing speed, and 
other performance capacities. This is merely a direct realization that many different ways exist in which 
the human system can be adversely affected by disease or injury and recognition of the continuum that 
exists with regard to the degree of any given effect. The result is that in rehabilitation engineering, 
compared with standard human factors design tasks aimed at the average healthy population, many 
measurement values must be acquired directly for the specific client. 

Measurement in the present context encompasses actions that focus on (1) the human (e.g., structural 
aspects and performance capacities of subsystems at different hierarchical levels ranging from specific 
neuromuscular subsystems to the total person and his or her activities in daily living, including work), 
(2) assistive devices (e.g., structural aspects and demands placed on the human), (3) tasks (e.g., distances 
between critical points, masses of objects involved, etc.), and (4) overall systems (e.g., performance 
achieved by a human-assistive device-task combination, patterns of electrical signals representing the 
timing of muscle activity while performing a complex maneuver, behavior of an individual before and 
after being fitted with a new prosthetic devices, etc). Clearly, an exhaustive treatment is beyond the scope 
of this chapter. Measurements are embedded in every specialized subarea of rehabilitation engineering. 
However, there are also special roles served by measurement in a broader and more generic sense, as well 
as principles that are common across the many special applications. Emphasis here is placed on these. 

There is no lack of other literature regarding the types of measurement outlined to be of interest here 
and their use. However, it is diffusely distributed, and gaps exist with regard to how such tools can be 
integrated to accomplish goals beyond simply the acquisition of numeric data for a given parameter. 
With rapidly changing developments over the last decade, there is currently no comprehensive source 
that describes the majority of instruments available, their latest implementations, procedures for their 
use, evaluation of effectiveness, etc. While topics other than measurement are discussed, Leslie and Smith 
[1990] produced what is perhaps the single most directly applicable source with respect to rehabilitation 
engineering specifically, although it too is not comprehensive with regard to measurement, nor does it 
attempt to be. 

145.1 Fundamental Principles 

Naturally, the fundamental principles of human physiology manifest themselves in the respective sensory, 
neuromuscular, information-processing, and life-sustaining systems and impact approaches to measure- 
ment. In addition, psychological considerations are vital. Familiarization with this material is essential 
to measurement in rehabilitation; however, treatment here is far beyond the scope of this chapter. The 
numerous reference works available may be most readily found by consulting relevant chapters in this 
Handbook and the works that they reference. In this section, key principles that are more specific to 
measurement and of general applicability are presented. 

Structure, Function, Performance, and Behavior 

It is necessary to distinguish between structure, function, performance, and behavior and measurements 
thereof for both human and artificial systems. In addition, hierarchical systems concepts are necessary 
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both to help organize the complexity of the systems involved and to help understand the various needs 
that exist. 

Structural measures include dimensions, masses (of objects, limb segments), moments of inertia, 
circumferences, contours, compliances, and any other aspects of the physical system. These may be 
considered hierarchically as being pertinent to the total human (e.g., height, weight, etc.), specific body 
segments (e.g., forearm, thigh, etc.), or components of basic systems such as tendons, ligaments and 
muscles. 

Function is the purpose of the system of interest (e.g., to move a limb segment, to communicate, to 
feed and care for oneself). Within the human, there are many single-purpose systems (e.g., those that 
function to move specific limb segments, process specific types of information, etc.). As one proceeds to 
higher levels, such as the total human, systems that are increasingly more multifunctional emerge. These 
can be recognized as higher-level configurations of more basic systems that operate to feed oneself, to 
conduct personal hygiene, to carry out task of a job, etc. This multilevel view of just functions begins to 
help place into perspective the scope over which measurement can be applied. 

In rehabilitation in general, a good deal of what constitutes measurement involves the application of 
structured subjective observation techniques (see also the next subsection) in the form of a wide range 
of rating scales [e.g., Fuhrer, 1987; Granger & Greshorn, 1984; Potvin et al„ 1985]. These are often termed 
functional assessment scales and are typically aimed at obtaining a global index of an individual's ability 
to function independently in the world. The global index is typically based on a number of items within 
a given scale, each of which addresses selected, relatively high-level functions (e.g., personal hygiene, 
mobility, etc.). The focus of measurement for a given item is often in estimate of the level of independence 
or dependence that the subject exhibits or needs to carry out the respective function. In addition, 
inventories of functions that an individual is able or not able to carry out (with and without assistance) 
are often included. The large number of such scales that have been proposed and debated is a consequence 
of the many possible functions and combinations thereof that exist on which to base a given scale. 
Functional assessment scales are relatively quick and inexpensive to administer and have a demonstrated 
role in rehabilitation. However, the nature and levels of measurements obtained are not sufficient for 
many rehabilitation engineering purposes. This latter class of applications generally begins with a function 
at the level and of the type used as a constituent component of functional assessment scales, considers 
the level of performance at which that function is executed more quantitatively, and incorporates one or 
more lower levels in the hierarchy (i.e., the human subsystems involved in achieving the specific functions 
of daily life that are of interest and their capacities for performance). 

Where functions can be described and inventoried, performance measures directly characterize how 
well a physical system of interest executes its intended function, performance is multidimensional (e.g., 
strength, range, speed, accuracy, steadiness, endurance, etc.). Of special interest are the concepts of 
performance capacity and performance capacity measurement. Performance capacity represents the limits 
of a given system's ability to operate in its corresponding multidimensional performance space. In this 
chapter, a resource-based model for both human and artificial system performance and measurement of 
their performance capacities is adopted [e.g., Kondraske, 1990, 1995]. Thus the maximum knee flexor 
strength available (i.e., the resource availability) under a stated set of conditions represents one unique 
performance capacity of the knee flexor system. In rehabilitation, the terms impairment, disability, and 
handicap [World Health Organization, 1980] have been prominently applied and are relevant to the 
concept of performance. While these terms place an emphasis on what is missing or what a person cannot 
do and imply not only a measurement but also the incorporation of an assessment or judgment based 
on one or more observations the resource-based performance perspective focuses on "what is present" 
or "what is right" (i.e., performance resource availability). From this perspective, an impairment can be 
determined to exist if a given performance capacity is found to be less than a specified level (e.g., less 
than 5th percentile value of a health reference population). A disability exists when performance resource 
insufficiency exists in a specified task. 
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While performance relates more to what a system can do (i.e., a challenge or maximal stress is implied), 
behavior measurements are used to characterize what a system does naturally. Thus a given variable such 
as movement speed can relate to both performance and behavior depending on whether the system (e.g., 
human subsystem) was maximally challenged to respond "as fast as possible" (performance) or simply 
observed in the course of operation (behavior). It is also possible to observe a system that it is behaving 
at one or more of its performance capacities (e.g., at the maximum speed possible, etc.) (see Table 145.1). 

Subjective and Objective Measurement Methods 

Subjective measurements are made by humans without the aid of instruments and objective measure- 
ments result from the use of instruments. However, it should be noted that the mere presence of an 
instrument does not guarantee complete objectivity. For example, the use of a ruler requires a human 
judgment in reading the scale and thus contains a subjective element. A length-measurement system with 
an integral data-acquisition system would be more objective. However, it is likely that that even this 
system would involve human intervention in its use, e.g., the alignment of the device and making the 
decision as to exactly what is to be measured with it by selection of reference points. Measures with more 
objectivity (less subjectivity) are preferred to minimize questions of bias. However, measurements that 
are intrinsically more objective are frequently more costly and time-consuming to obtain. Well-reasoned 
tradeoffs must be made to take advantage of the ability of a human (typically a skilled professional) to 
quickly "measure" many different items subjectively (and often without recording the results but using 
them internally to arrive at some decision). 

It is important to observe that identification of the variable of interest is not influenced by whether it 
is measured subjectively or objectively. This concept extends to the choice of instrument used for objective 
measurements. This is an especially important concept in dealing with human performance and behavior, 
since variables of interest can be much more abstract than simple lengths and widths (e.g., coordination, 
postural stability, etc.) In fact, many measurement variables in rehabilitation historically have tended to 
be treated as if they were inextricably coupled with the measurement method, confounding debate 
regarding what should be measured with what should be used to measure it in a given context. 

Measurements and Assessments 

The basic representation of a measurement itself in terms of the actual units of measure is often referred 
to as the raw form. For measures of performance, the term raw score is frequently applied. Generally, 
some form of assessment (i.e., judgment or interpretation) is typically required. Assessments may be 
applied to (or, viewed from a different perspective, may require) either a single measure of groups of 
them. Subjective assessments are frequently made that are based on the practitioner's familiarity with 
values for a given parameter in a particular context. However, due to the large number of parameters 
and the amount of experience that would be required to gain a sufficient level of familiarity, a more 
formal and objective realization of the process that takes place in subjective assessments is often employed. 
This process combines the measured value with objectively determined reference values to obtain new 
metrics, or scores, that facilitate one or more steps in the assessment process. 

For aspects of performance, percent normal scores are computed by expressing subject Y's availability 
of performance resource k[R Ak (Y)] as a fraction of the mean availability of that resource in a specified 
reference population [R Ak {pop)]. Ideally, the reference population is selected to match the characteristics 
of the individual as closely as possible (e.g., age range, gender, handedness, etc.). 
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Percent normal = , \ x 100 (145.1) 
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Aside from the benefit of placing all measurements on a common scale, a percent normal representation 
of a performance capacity score can be loosely interpreted as a probability. Consider grip strength as the 
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performance resource. Assume that there is a uniform distribution of demands of demands placed on 
grip strength across a representative sample of tasks of daily living, with requirements ranging from zero 
to the value representing mean grip strength availability in the reference population. Further assuming 
that grip strength was the only performance resource that was in question for subject Y (i.e., all other 
were available in nonlimiting amounts), the percent normal score would represent the probability that 
a task involving grip strength, randomly selected from those which average individuals in the reference 
population could execute (i.e., those for which available grip strength would be adequate), could be 
successfully executed by subject Y. While the assumptions stated here are unlikely to be perfectly true, 
this type of interpretation helps place measurements that are most commonly made in the laboratory 
into daily-life contexts. 

In contrast to percent normal metrics, z-scores take into account variability within the selected reference 
population. Subject Y's performance is expressed in terms of the difference between it and the reference 
population mean, normalized by a value corresponding to one standard deviation unit (a) of the reference 
population distribution: 

R A [y)-R a (pop) 

z=^^ WJV (145.3) 



It is important to note that valid z-scores assume that the parameter in question exhibit a normal 
distribution in the reference population. Moreover, z-scores are useful in assessing measures of structure, 
performance, and behavior. With regard to performance (and assuming that measures are based on a 
resource construct, i.e., a larger numeric value represents better performance), a z-score of zero is 
produced when the subject's performance equals that of the mean performance in the reference popu- 
lation. Positive z-scores reflect performance that is better than the population mean. In a normal distri- 
bution, 68.3% of the samples fall between z-scores of -1.0 and +1.0, wile 95.4% of these samples fall 
between z-scores of -2.0 and +2.0. Due to variability of a given performance capacity within a healthy 
population (e.g., some individuals are stronger, faster, more mobile that others), a subject with a raw 
performance capacity score that produces a percent normal score of 70% could easily produce a z-score 
of -1.0. Whereas this percent normal score might raise concern regarding the variable of interest, the 
z-score of -1.0 indicates that a good fraction of healthy individuals exhibit lower level of performance 
capacity. 

Both percent normal and z-scores require reference population data to compute. The best reference 
(i.e., most sensitive) is data for that specific individual (e.g., preinjury or predisease onset). In most cases, 
these data do not exist. However, practices such as preemployment screenings and regular checkups are 
beginning to provide individualized reference data in some rehabilitation contexts. 

In yet another alternative, it is frequently desirable to use values representing demands imposed by 
tasks [i? D) .(task A)] as the reference for assessment of performance capacity measures. Demands on 
performance resources can be envisioned to vary over the time course of a task. In practice, an estimate 
of the worst-case value (i.e., highest demand) would be used in assessments that incorporate task demands 
as reference values. In one form, such assessments can produce binary results. For example, availability 
can be equal to or exceed demand (resource sufficiency), or it can be less than demand (resource insuf- 
ficiency). These rule-based assessments are useful in identifying limiting factors, i.e., those performance 
resources that inhibit a specified type of task from being performed successfully or that prevent achieve- 
ment of a higher level of performance in a given type of task. 



If R A (subject Yj > R D (task A], then 

R A (subject Y I is sufficient, else (145.3) 

R A (subject Y I is insufficient 
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These rule-based assessments represent the basic process often applied (sometimes subliminally) by 
experienced clinicians in making routine decisions, as evidenced by statements such as "not enough 
strength," "not enough stability," etc. It is natural to extend and build on these strategies for use with 
objective measures. Extreme care must be employed. It is often possible, for example, for an individual 
to substitute another performance resource that is not insufficient for one that is. Clinicians take into 
account many such factors, and objective components should be combined with subjective assessments 
that provide the required breadth that enhances validity of objective components of a given assessment. 
Using the same numeric values employed in rule-based binary assessments, a preference capacity stress 
metric can be computed: 



, , R D (task a) 

Performance capacity stress % = ? r X 100 

V ' R At (subject YJ 



(145.4) 



Binary assessments also can be made using this metric and a threshold of 100%. However, the stress 
value provides additional information regarding how far (or close) a given performance capacity value 
is from the sufficiency threshold. 

145.2 Measurement Objectives and Approaches 
Characterizing the Human System and Its Subsystems 

Figure 145.1 illustrates various points at which measurements are made over the course of a disease or 
injury, as well as some of the purposes for which they are made. The majority of measurements made 
in rehabilitation are aimed at characterizing the human system. 

Measurements of human structure [e.g., Pheasant, 1986] play a critical role in the design and pre- 
scription of components such as seating, wheelchairs, workstations, artificial limbs, etc. Just like clothing, 
these items must "fit" the specific individual. Basic tools such as measuring tapes and rulers are becoming 
supplemented with three-dimensional digitizers and devices found in computer-aided manufacturing. 
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FIGURE 145.1 Measurements of structure, performance, and behavior serve many different purposes at different 
points over the course of a disease or injury that results in the need for rehabilitation services. 
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Measurements of structure (e.g., limb segment lengths, moments of inertia, etc.) are also used with 
computer models [Vasta & Knodraske, 1995] in the process of analyzing tasks to determine demands in 
terms of performance capacity variables associated with basic systems such s flexors and extensors. 

After nearly 50 years, during which a plethora of mostly disease- and injury-specific functional 
assessment scales were developed, the functional independence measure (FIM) [Hamilton et al., 1987; 
Keith et al., 1987] is of particular note. It is the partial result of a task-force effort to produce a systematic 
methodology (Uniform Data System for Medical Rehabilitation) with the specific intent of achieving 
standardization throughout the clinical service-delivery system. This broad methodology uses subjective 
judgements exclusively, based on rigorous written guidelines, to categorize demographic, diagnostic, 
functional, and cost information for patients within rehabilitation settings. Its simplicity to use once 
learned and its relatively low cost of implementation have helped in gaining a rather widespread utilization 
for tracking progress of individuals from admission to discharge in rehabilitation programs and evaluating 
effectiveness of specific therapies within and across institutions. 

In contrast, many objective measurement tools of varying degrees of technological sophistication exist 
[Jones, 1995; Kondraske, 1995, Smith & Leslie, 1990; Potvin et al, 1985; Smith, 1995] (also see Further 
Information below. A good fraction of these have been designed to accomplish the same purposes as 
corresponding subjective methods, but with increased resolution, sensitivity, and repeatability. The intent 
is not always to replace subjective methods completely but to make available alternatives with the 
advantages noted for situations that demand superior performance in the aspects noted. There are certain 
measurement needs, however, that cannot be accomplished via subjective means (e.g., measurement of 
a human's visual information-processing speed, which involves the measurement of times of less than 
1 second with millisecond resolution). These needs draw on the latest technology in a wide variety of 
ways, as demonstrated in the cited material. 

With regard to instrumented measurements that pertain to a specific individual, performance capacity 
measures at both complex body system and basic system levels (Fig. 145.1) constitute a major area of 
activity. A prime example is methodology associated with the implementation of industrial lifting stan- 
dards [NIOSH, 1981]. Performance-capacity measures reflect the limits of availability of one or more 
selected resources and require test strategies in which the subject is commanded to perform at or near 
a maximum level under controlled conditions. Performance tests typically last only a short time (seconds 
or minutes). To improve estimates of capacities, multiple trials are usually included in a given "test" from 
which a final measure is computed according to some established reduction criterion (e.g., average across 
five trials, best of three trials). This strategy also tends to improve test-retest repeatability. Performance 
capacities associated with basic and intermediate-level systems are important because they are "targets 
of therapy" [Tourtellotte, 1993], i.e., the entities that patients and service providers want to increase to 
enhance the chance that enough will be available to accomplish the tasks of daily life. Thus measurements 
of baseline levels and changes during the course of a rehabilitation program provide important docu- 
mentation (for medical, legal, insurance, and other purposes) as well as feedback to both the rehabilitation 
team and the patient. 

Parameters of human behavior are also frequently acquired, often to help understand an individual's 
response to a therapy or new circumstance (e.g., obtaining a new wheelchair or prosthetic device). 
Behavioral parameters reflect what the subject does normally and are typically recorded over longer time 
periods (e.g., hours or days) compared with that required for a performance capacity measurement under 
conditions that are more representative of the subject's natural habitat (i.e., less laboratory- like). The 
general approach involves identifying the behavior (i.e., an event such as "head flexion," "keystrokes," 
"steps," "repositionings," etc.) and at least one parametric attribute of it. Frequency, with units of "events 
per unit time," and time spent in a given behavioral or activity state [e.g., Gonapthy 8c Kondraske, 1990] 
are the most commonly employed behavioral metrics. States may be detected with electromyographic 
means or electronic sensors that respond to force, motion, position, or orientation. Behavioral measures 
can be used as feedback to a subject as a means to encourage desired behaviors or discourage undesirable 
behaviors. 
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Characterizing Tasks 

Task characterization or task analysis, like the organization of human system parameters, is facilitated 
with a hierarchical perspective. A highly objective, algorithmic approach could be delineated for task 
analysis in any given situation [Imrhan, 1995; Maxwell, 1995]. The basic objective is to obtain both 
descriptive and quantitative information for making decisions about the interface of a system (typically 
a human) to a given task. Specifically, function, procedures, and goals are of special interest. Function 
represents the purpose of a task (e.g., to flex the elbow, to lift an object, to communicate). In contrast, 
task goals relate to performance, or how well the function is to be excuted, and are quantifiable (e.g., the 
mass of an object to be lifted, the distance over which the lift must occur, the speed at which the lift 
must be performed, etc.). In situations with human and artificial systems, the term overall task goals is 
used to distinguish between goals of the combined human-artificial system and goals associated with the 
task of operating the artificial system. Procedures represent the process by which goals are achieved. 
Characterization of procedures can include descriptive and quantitative components (e.g., location of a 
person's hands at beginning and end points of a task, path in three-dimensional space between beginning 
and end points). Partial or completely unspecified procedures allow for variations in style. Goals and 
procedures are used to obtain numeric estimates of task demands in terms of the performance resources 
associated with the systems anticipated to be used to execute the task. Task demands are time dependent. 
Worst-case demands, which may occur only at specific instants in time, are of primary interest in task 
analysis. 

Estimates of task demand can be obtained (1) direct measurement (i.e., of goals and procedures), 
(2) the use of physics-based models to map direct measurements into parameters that relate more readily 
to measurable performance capacities of human subsystems, or (3) inference. Examples of direct mea- 
surement include key dimensions and mass of objects, three-dimensional spatial locations between 
"beginning" and "end points" of objects in tasks involving the movement of objects, etc. Instrumentation 
supporting task analysis is available (e.g., load cells, video and other systems for measuring human 
position and orientation in real-time during dynamic activities), but it is not often integrated into systems 
for task analysis per se. Direct measurements of forces based on masses of objects and gravity often must 
be translated (to torques about a given body joint): this requires the use of static and dynamic models 
and analysis [e.g., Vista & Kondraske, 1995; Winter, 1990]. 

An example of an inferential task-analysis approach that is relatively new is nonlinear causal resource 
analysis (NCRA) [Kondraske, 1999; Kondraske et al., 1997; Kondraske, 1988]. This method was motivated 
by human performance analysis situations where direct analysis is not possible (e.g., determination of 
the amount of visual information-processing speed required to drive safely on a highway). Quantitative 
task demands, in terms of performance variables that characterize the involved subsystems, are inferred 
from a population data set that includes measures of subsystem performance, resource availabilities (e.g., 
speed, accuracy, etc.), and overall performance on the task in question. This method is based on the 
simple observation that the individual with the least amount of the given resource (i.e., the lowest 
performance capacity) who is still able to accomplish a given goal (i.e., achieve a given level of performance 
in the specified high-level task) provides the key clue. That amount of availability is used to infer the 
amount of demand imposed by the task. 

The ultimate goal to which task characterization contributes is to identify limiting factors or unsafe 
conditions when a specific subject undertakes the task in question; this goal must not be lost while 
carrying out the basic objectives of task analysis. While rigorous algorithmic approaches are useful to 
make evident the true detail of the process, they are generally not performed in this manner in practice 
at present. Rather, the skill and experience of individuals performing the analysis are used to simplify 
the process, resulting in a judicious mixture of subjectives estimates and objective measurements. For 
example, some limiting factors (e.g., grip strength) may be immediately identified without measurement 
of the human or the task requirements because the margin between availability and demand is so great 
that quick subjective "measurements" followed by an equally quick "assessment" can be used to arrive at 
the proper conclusion (e.g., "grip strength is a limiting factor in this task"). 
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Characterizing Assistive Devices 

Assistive devices can be viewed as artificial systems that either completely or partially bridge a gap between 
a given human (with his or her unique profile of performance capacities, i.e., available performance 
resources) and a particular task or class of tasks (e.g., communication, mobility, etc.). It is thus possible 
to consider the aspects of the device that constitute the user-device interface and those aspects which 
constitute, more generally, the device-task interface. In general, measurements supporting assessment of 
the user-device interface can be viewed to consist of (1) those which characterize the human and (2) those 
which characterize tasks (i.e., "operating" the assistive device). Each of these was described earlier. 
Measurements that characterize the device-task interface are often carried out in the context of the 
complete system, i.e., the human-assistive device-task combination (see next subsection). 

Characterizing Overall Systems in High-Level-Task Situations 

This situation generally applies to a human-artificial system-task combination. Examples include an 
individual using a communication aid to communicate, an individual using a wheelchair to achieve 
mobility, etc. Here, concern is aimed at documenting how well the task (e.g., communication, mobility, 
etc.) is achieved by the composite or overall system. Specific aspects or dimensions of performance 
associated with the relevant function should first be identified. Examples include speech, accuracy, 
stability, efficiency, etc. The total system is then maximally challenged (tempered by safety considerations) 
to operate along one or more of these dimensions of performance (usually not more than two dimensions 
are maximally challenged at the same time). For example, a subject with a communication device may 
be challenged to generate a single selected symbol "as fast as possible" (stressing speed without concern 
for accuracy). Speed is measured (e.g., with units of symbols per second) over the course of short trial 
(so as not to be influenced by fatigue). Then the "total system" may be challenged to generate a subset 
of specific symbols (chosen at random from the set of those available with a given device) one at a time, 
"as accurately as possible" (stressing accuracy while minimizing stress on speed capacities). Accuracy is 
then measured after a representative number of such trials are administered (in terms of "percent correct," 
for example). To further delineate the speed-accuracy performance envelope, "the system" may be chal- 
lenge to select symbols at a fixed rate while accuracy is measured. Additional dimensions can be evaluated 
similarly. For example, endurance (measure in units of time) can be determined by selecting an operating 
point (e.g., by reference to the speed-accuracy performance envelope) and challenging the total system 
"to communicate" for "as long as possible" under the selected speed- accuracy condition. 

In general, it is more useful if these types of characterizations consider all relevant dimensions with 
some level of measurements (i.e., subjective or objective) than it would be to apply a high resolution, 
objective measurement in a process that considers only one aspect of performance. 

145.3 Decision-Making Processes 

Measurements that characterize the human, task, assistive device, or combination thereof are themselves 
only means to an end; the end is typically a decision. As noted previously, decisions are often the result 
of assessment processes involving one or more measurements. Although not exhaustive, many of the 
different types of assessments encountered are related to the following questions: ( 1) Is a particular aspect 
of performance normal (or impaired)? (2) Is a particular aspect of performance improving, stable, or 
getting worse? How should therapy be modified? (3) Can a given subject utilize (and benefit from) a 
particular assistive device? (4) Does a subject possess the required capacity to accomplish a given higher 
level task (e.g., driving, a particular job after a work-related injury, etc.)? 

In Fig. 145.2, several of the basic concepts associated with measurement are used to illustrate how they 
enter into and facilitate systematic decision-making processes. The upper section shows raw score values 
as well as statistics for a healthy normal reference population in tabular form (left). It is difficult to reach 
any decision by simple inspection of just the raw performance capacity values. Tabular data are used to 
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obtain percent normal (middle) and z-score (right) assessments. Both provide a more directly interpret- 
able result regarding subject A's impairments. By examining the "right shoulder flexion extreme of 
motion" item in the figure, it can be seen that a raw score value corresponding to 51.2% normal yields 
a very large-magnitude, negative z-score (-10.4). This z-score indicates that virtually no one in the 
reference population would have a score this low. In contrast, consider similar scores for the "grip strength" 
item (56.2% normal, z-score = -1.99). On the basis of percent normal scores, it would appear that both 
of these resources are similarly affected, whereas the z-score basis provides a considerably different 
perspective due to the fact that grip strength is much more variable in healthy populations than the 
extreme angle obtained by a given limb segment about a joint, relatively speaking. As noted, z-scores 
account for this variability. 

The lower section of Fig. 145.2 considers a situation in which the issue is a specific individual (subject A) 
considered in a specific task. Tabular data now include raw score values (which are the same as in upper 
section of the figure) and quantitative demands (typically worst case) imposed on the respective perfor- 
mance resources by task X. The lower-middle plot illustrates the process of individually assessing suffi- 
ciency of each performance resource in this task context using a rule-based assessment that incorporates 
the idea of a threshold (i.e., availability must exceed demand for sufficiency). The lower-right plot 
illustrates an analogous assessment process that is executed after computation of a stress metric for each 
of the performance capacities. Here, any demand that corresponds to more than a 100% stress level is 
obviously problematic. In addition to binary conclusions regarding whether a given capacity is or is not 
a limiting factor, it is possible to observe that of the two limiting resources (e.g., grip strength and right 
shoulder flexion extreme of motion), the former is more substantial. This might suggest, for example, 
that the task be modified so as to decrease the grip-strength demand (i.e., gains in performance capacity 
required would be substantial to achieve sufficiency) and the use of focused exercise therapy to increase 
shoulder flexion mobility (i.e., gains in mobility required are relatively small). 

145.4 Current Limitations 
Quality of Measurements 

Key issues are measurement validity, reliability (or repeatability), accuracy, and discriminating power. At 
issue in terms of current limitations is not necessarily the quality of measurements but limitations with 
regard to methods employed to determine the quality of measurements and their interpretability. 

A complete treatment of these complex topics is beyond the present scope. However, it can be said 
that standards are such [Potvin et al., 1985] that most published works regarding measurement instru- 
ments do address quality of measurements to some extent. Validity (i.e., how well does the measurement 
reflect the intended quantity) and reliability are most often addressed. However, one could easily be left 
with the impression that these are binary conditions (i.e., measurement is or is not reliable or valid), 
when in fact a continuum is required to represent these constructs. Of all attributes that relate to 
measurement quality, reliability is most commonly expressed in quantitative terms. This is perhaps 
because statistical methods have been defined and promulgated for the computation of so-called reliability 
coefficients [Winer, 1971]. Reliability coefficients range from 0.0 to 1.0, and the implication is that 1.0 
indicates a perfectly reliable or repeatable measurement process. Current methods are adequate, at best, 
for making inferences regarding the relative quality of two or more methods of quantifying "the same 
thing." Even these comparisons require great care. For example, measurement instruments that have 
greater intrinsic resolving power have a great opportunity to yield smaller-reliability coefficients simply 
because they are capable of measuring the true variability (on repeated measurement) of the parameter 
in question within the system under test. While there has been widespread determination or reliability 
coefficients, there has been little or no effort directed toward determination of what value of a reliability 
coefficient is "good enough" for a particular application. In fact, reliability coefficients are relatively 
abstract to most practitioners. 
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Methods for determining the quality of a measurement process (including the instrument, procedures, 
examiner, and actual noise present in the variable of interest) that allow a practitioner to easily reach 
decisions regarding the use of a particular measurement instrument in a specific application and limi- 
tations thereof are currently lacking. At the use of different measurements increases and the number of 
options available for obtaining a given measurement grows, this topic will undoubtedly receive additional 
attention. Caution in interpreting literature, common sense, and the use of simple concepts such as "I 
need to measure range of motion to within 2 degrees in my application" are recommended in the 
meantime [Mayer et al., 1997]. 

Standards 

Measurements, and concepts with which they are associated, can contribute to a shift from experience- 
based knowledge acquisition to rule-based, engineering-like methods. This requires (1) a widely accepted 
conceptual framework (i.e. known to assistive device manufacturers, rehabilitation engineers, and other 
professionals within the rehabilitation community), (2) a more complete set of measurement tools that 
are at least standardized with regard to the definition of the quantity measured, (3) special analysis and 
assessment software (that removes the resistance to the application of more rigorous methods by enhanc- 
ing the quality of decisions as well as the speed with which they can be reached), and (4) properly trained 
practitioners. Each is a necessary but not sufficient component. Thus balanced progress is required in 
each of these areas. 

Rehabilitation Service Delivery and Rehabilitation Engineering 

In a broad sense, it has been argued that all engineers can be considered rehabilitation engineers who 
merely work at different levels along a comprehensive spectrum of human performance, which itself can 
represent a common denominator among all humans. Thus an automobile is a mobility aid, a telephone 
is a communication aid, and so on. Just as in other engineering disciplines, measurement must be 
recognized not only as an important end in itself (in appropriate instances) but also as an integral 
component or means within the overall scope of rehabilitation and rehabilitation engineering processes. 
The service-delivery infrastructure must provide for such means. At present, one should anticipate and 
be prepared to overcome potential limitations associated with factors such as third-party reimbursement 
for measurement procedures, recognition of equipment and maintenance costs associated with obtaining 
engineering-quality measurements, and education of administrative staff and practitioners with regard 
to the value and proper use of measurements. 

Defining Terms 

Behavior: A general term that relates to what a human or artificial system does while carrying out its 
function(s) under given conditions. Often, behavior is characterized by measurement of selected 
parameters or identification of unique system states over time. 

Function: The purpose of a system. Some systems map to a single primary function (e.g., process visual 
information). Others (e.g., the human arm) map to multiple functions, although at any given time 
multifunction systems are likely to be executing a single function (e.g., polishing a car). Functions 
can be described and inventoried, whereas level of performance of a given function can be measured. 

Functional assessment: The process of determining, from a relatively global perspective, an individual's 
ability to carry out tasks in daily life. Also, the result of such a process. Functional assessments 
typically cover a range of selected activity areas and include (at a minimum) a relatively gross 
indication (e.g., can or can't do; with or without assistance) of status in each area. 

Goal: A desired endpoint (i.e., result) typically characterized by multiple parameters, at least one of 
which is specified. Examples include specified task goals (e.g., move an object of specified mass 
from point A to point B in 3 seconds) or estimated task performance (maximum mass, range, 
speed of movement obtainable given a specified elemental performance resource availability pro- 
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file), depending on whether a reverse or forward analysis problem is undertaken. Whereas function 
describes the general process of task, the goal directly relates to performance and is quantitative. 

Limiting resource: A performance resource at any hierarchical level (e.g., vertical lift strength, knee 
flexor speed) that is available in an amount that is less than the worst-case demand imposed by a 
task. Thus a given resource can be "limiting" only when considered in the context of a specific task. 

Overall task goals: Goals associated with a task to be executed by a human-artificial system combination 
(to be distinguished from goals associated with the task of operating the artificial system). 

Performance: Unique qualities of a human or artificial system (e.g., strength, speed, accuracy, endur- 
ance) that pertain to how well that system executes its function. 

Performance capacity: A quantity in finite availability that is possessed by a system or subsystem, 
drawn on during tasks, and limits some aspect (e.g., speed, force, production, etc.) of a system's 
ability to execute tasks, or, the limit of that aspect itself. 

Performance capacity measurement: A general class of measurements, performed at different hierar- 
chical levels, intended to quantify one or more performance capacities. 

Procedure: A set of constraints placed on a system in which flexibility exists regarding how a goal (or 
set of goals) associated with a given function can be achieved. Procedure specification requires 
specification of initial intermediate, and/or final states or conditions dictating how the goal is to be 
accomplished. Such specification can be thought of in terms of removing some degrees of freedom. 

Structure: Physical manifestation and attributes of a human or artificial system and the object of one 
type of measurements at multiple hierarchical levels. 

Style: Allowance for variation within a procedure, resulting in the intentional incomplete specification 
of a procedure or resulting from either international or unintentional incomplete specification of 
procedure. 

Task: That which results from (1) the combination of specified functions, goals, and procedures or 
(2) the specification of function and goals and the observation of procedures utilized to achieve 
the goals. 
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Further Information 

The section of this Handbook entitled "Human Performance Engineering" contains chapters that address 
human performance modeling and measurement in considerably more detail. 

Manufacturers of instruments used to characterize different aspects of human performance often provide 
technical literature and bibliographies with conceptual backgrounds, technical specifications and application 
examples. A partial list of such sources is included below. (No endorsement of products is implied.) 

Baltimore Therapeutic Equipment Co. 
7455-L New Ridge Road 
Hanover, MD 21076-3105 
http://www.bteco.com/ 

Chattanooga Group 
4717 Adams Road 
Hixson, TN 37343 
http://www.chattanoogagroup.com/ 

Henley Healthcare 
120 Industrial Blvd. 
Sugarland, TX 77478 
http://www.henleyhealth.com/ 
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Human Performance Measurement, Inc. 

P.O. Box 1996 

Arlington, TX 76004-1996 

http://www.flash.net/~hpm/ 

Lafayette Instrument 

3700 Sagamore Parkway North 

Lafayette, IN 47904-5729 

http://www.lafayetteinstrument.com 

The National Institute on Disability and Rehabilitation Research (NIDRR), part of the Department 
of Education, funds a set of Rehabilitation Engineering Research Centers (RERCs) and Research and 
Training Centers (RTCs). Each has a particular technical focus; most include measurements and mea- 
surements issues. Contact NIDRR for a current listing of these centers. 

Measurement devices, issues, and application examples specific to rehabilitation are included in the 
following journals: 

IEEE Transactions on Rehabilitation Engineering 

IEEE Service Center 

445 Hoes Lane 

P.O. Box 1331 

Piscataway, N.J. 08855-1331 

http://www.ieee.org/index.html 

Journal of Rehabilitation Research and Development 
Scientific and Technical Publications Section 
Rehabilitation Research and Development Service 
103 South Gay St., 5th floor 
Baltimore, MD 21202-4051 
http://www.vard.org/jour/jourindx.htm 

Archives of Physical Medicine and Rehabilitation 

Suite 1310 

78 East Adams Street 

Chicago, IL 60603-6103 

American Journal of Occupational Therapy 

The American Occupational Therapy Association, Inc. 

4720 Montgomery Ln., 

Bethesda, MD 20814-3425 

http://www.aota.org/ 

Physical Therapy 

American Physical Therapy Association 

1111 North Fairfax St. 

Alexandria, VA 22314 

http://www.apta.org/ 

Journal of Occupational Rehabilitation 

Subscription Department 

Plenum Publishing Corporation 

233 Spring St. 

New York, NY 10013 

http : //www. plenum.com/ 
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Hobson, D., Trefler, E. "Rehabilitation Engineering Technologies: Principles of Application." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 
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Rehabilitation engineering is the branch of biomedical engineering that is concerned with the application 
of science and technology to improve the quality of life of individuals with disabilities. Areas addressed 
within rehabilitation engineering include wheelchairs and seating systems, access to computers, sensory 
aids, prosthetics and orthotics, alternative and augmentative communication, home and work-site mod- 
ifications, and universal design. Because many products of rehabilitation engineering require careful 
selection to match individual needs and often require custom fitting, rehabilitation engineers have 
necessarily become involved in service delivery and application as well as research, design, and develop- 
ment. Therefore, as we expand on later, it is not only engineers that practice within the field of rehabil- 
itation engineering. 

As suggested above, and as in many other disciplines, there are really two career tracks in the field of 
rehabilitation engineering. There are those who acquire qualifications and experience to advance the state 
of knowledge through conducting research, education, and product development, and there are others 
who are engaged in the application of technology as members of service delivery teams. At one time it 
was possible for a person to work in both arenas. However, with the explosion of technology and the 
growth of the field over the past decade, one must now specialize not only within research or service 
delivery but often within a specific area of technology. 

One can further differentiate between rehabilitation and assistive technology. Rehabilitation technology 
is a term most often used to refer to technologies associated with the acute-care rehabilitation process. 
Therapy evaluation and treatment tools, clinical dysfunction measurement and recording instrumentation, 
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and prosthetic and orthotic appliances are such examples. Assistive technologies are those devices and 
services that are used in the daily lives of people in the community to enhance their ability to function 
independently, examples being specialized seating, wheelchairs, environmental control devices, worksta- 
tion access technologies and services are now communication aids. Recognition and support of assistive 
technology devices and services are now embedded in all the major disability legislation that has been 
enacted over the last decade. 

The primary focus of this chapter is on the role of the rehabilitation engineering practitioner as he or 
she carries out the responsibilities demanded by the application of assistive technology. 

Before launching into the primary focus of this chapter, let us first set a conceptual framework for the 
raison d'etre for assistive technology and the role of the assistive technology professional. 



146.1 The Conceptual Frameworks 



The application of assistive technology can be conceptualized as minimizing the functional gap between 
the person and his or her environment. This reality is what technology does for all of us to varying 
degrees. For example, if you live in a suburban area that has been designed for access only by car and 
your car breaks down, you are handicapped. If your house has been designed to be cooled by air 
conditioning in the "dog days" of summer and you lose a compressor, your comfort is immediately 
compromised by your incompatibility with your environment. Similarly, if you live in a home that has 
only access by steps and you have an impairment requiring the use of a wheelchair, you are handicapped 
because you no longer have abilities that are compatible with your built environment. Because our 
environments, homes, workplaces, schools, and communities have been designed to be compatible with 
the abilities of the norm, young children, persons with disabilities, and many elderly people experience 
the consequences of their mismatch as a matter of course. The long-term Utopian solution would be to 
design environments and their contents so that they can be used by all people of all ages, which is the 
essence of the universal design concept. However, given that today we do not have very many products 
and environments that have been universally designed, rehabilitation engineers attempt to minimize the 
effects of the mismatch by designing, developing, and providing technologies that will allow persons with 
disabilities to pursue their life goals in a manner similar to any other person. Of course, the rehabilitation 
engineer cannot accomplish this working in isolation but rather must function as a part of a consumer- 
responsive team that can best deal with the multiplicity of factors that usually impact on the successful 
application of assistive technology. 

Let us now move to another conceptual framework, one that conceptualizes how people actually 
interact with technology. 

The following conceptualization has been adapted from the model proposed by Roger Smith [Smith, 
1992]. In Fig. 146.1, Smith suggests that there are three cyclic elements that come into play when humans 
interact with technology: the human and his or her innate sensory, cognitive, and functional abilities; 
the human factor's characteristics of the interface between the human and the technology; and the 
technical characteristics of the technology itself in terms of its output as a result of a specific input by 
the user. People with disabilities may have varying degrees of dysfunction in their sensory, cognitive, and 
functional abilities. The interface will have to be selected or adapted to these varying abilities in order 
to allow the person to effectively interact with the technology. The technology itself will need to possess 
specific electronic or mechanical capabilities in order to yield the desired outcome. The essence of assistive 
technology applications is to integrate all three of these element into a functional outcome that meets 
the specific needs of a user. This is usually done by selecting commercially available devices and tech- 
nologies at a cost that can be met by either the individual or his or her third-party payment source. 
When technologies are not available, then they must be modified from existing devices or designed and 
fabricated as unique custom solutions. It is particularly in these latter activities that a rehabilitation 
engineer can make his or her unique contribution to team process. 

In 1995, Cook and Hussey [Cook and Hussey, 1995], published an excellent text, Assistive Technolo- 
gies — Principles and Practice. As well as comprehensively addressing many of the assistive technologies 
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FIGURE 146.1 Conceptual framework of technology and disability. (Modified from Smith [1992].) 

briefly covered in this chapter, they also present a conceptual framework which builds on the one 
developed by Smith above. They introduce the additional concepts of activity and context. That is, 
understanding of a person's activity desires and the context (social, setting, physical) in which they are 
to be carried out are essential components to successful assistive technology intervention. 

It should be realized that there are several levels of assistive technology. The first level might be termed 
fundamental technology in contrast to advanced technology. Fundamental technologies, such as walkers, 
crutches, many wheelchairs, activities of daily living (ADL) equipment, etc., usually do not require the 
involvement of the rehabilitation engineer in their application. Others on the team can better assess the 
need, confirm the interface compatibility, and verify that the outcome is appropriate. The rehabilitation 
engineer is most often involved in the application of advanced technologies, such as powered wheelchairs, 
computerized workstation designs, etc., that require an understanding of the underlying technological 
principles in order to achieve the best match with the abilities and needs of the user, especially if custom 
modifications or integration of devices are required to the original equipment. The rehabilitation engineer 
is usually the key person if a unique solution is necessary. 

Let's now discuss a few fundamental concepts related to the process by which assistive technology is 
typically provided in various service delivery programs. 

146.2 The Provision Process 
The Shifting Paradigm 

In the traditional rehabilitation model of service delivery, a multidisciplinary team of professionals is 
already in place. Physicians, therapists, counselors, and social worker meet with the client and, based on 
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the findings of a comprehensive evaluation, plan a course of action. In the field of assistive technology, 
the rules of the team are being charted anew. First, the decision making often takes place in a nonmedical 
environment and often without a physician as part of the team. Second, the final decision is rapidly 
moving into the hands of the consumer, not the professionals. The third major change is the addition 
of a rehabilitation engineer to the team. Traditional team members have experience working in groups 
and delegating coordination and decision making to colleagues depending on the particular situation. 
They are trained to be team players and are comfortable working in groups. Most engineers who enter 
the field of rehabilitation engineering come with a traditional engineering background. Although well 
versed in design and engineering principles, they often do not receive formal training in group dynamics 
and need to learn these skills if they are to function effectively. As well, engineers are trained to solve 
problems with technical solutions. The psychosocial aspects of assisting people with disabilities to make 
informed choices must be learned most often outside the traditional education stream. Therefore, for 
the engineer to be a contributing member of the team, not only must he or she bring engineering expertise, 
but it must be integrated in such a manner that it supports the overall objectives of the technology 
delivery process, which is to respond to the needs and desires of the consumer. 

People with disabilities want to have control over the process and be informed enough to make good 
decisions. This is quite different from the traditional medical or rehabilitation model, in which well- 
meaning professionals often tell the individual what is best for him or her. Within this new paradigm, 
the role is to inform, advise, and educate, not to decide. The professional provides information as to the 
technical options, prices, etc. and then assists the person who will use the technology to acquire it and 
learn how to use it. 

The Evaluation 

An evaluation is meant to guide decision-making for the person with a disability toward appropriate and 
cost-effective technology. Often, more than one functional need exists for which assistive technology could 
be prescribed. Costly, frustrating, and time-consuming mistakes often can be avoided if a thorough evaluation 
based on a person's total functional needs is performed before any technology is recommended. Following 
the evaluation, a long-range plan for acquisition and training in the chosen technology can be started. 

For example, suppose a person needs a seating system, both a powered and manual wheelchair, an 
augmentative communication device, a computer workstation, and an environmental control unit (ECU). 
Where does one begin? Once a person's goals and priorities have been established, the process can begin. 
First, a decision would likely be made about the seating system that will provide appropriate support in 
the selected manual chair. However, the specifications of seating system should be such that the components 
also can be interfaced into the powered chair. The controls for the computer and augmentative commu- 
nication device must be located so that they do not interfere with the display of the communication device 
and must in some way be compatible wit the controls for the ECU. Only if all functional needs are addressed 
can the technology be acquired in a logical sequence and in such a manner that all components will be 
compatible. The more severely disabled the individual, the more technology he or she will need, and the 
more essential is the process of setting priorities and ensuring compatibility of technical components. 

In summary, as suggested by the conceptual model, the process begins by gaining an understanding 
of the person's sensory, cognitive, and functional abilities, combined with clarification of his or her desires 
and needs. These are then filtered through the technology options, both in terms of how the interface 
will integrate with the abilities of the user and how the technology itself will be integrated to meet the 
defined needs. This information and the associated pros and cons are conveyed to the user, or in some 
cases their caregiver, who then has the means to participate in the ultimate selection decisions. 

Service Delivery Models 

People with disabilities can access technology through a variety of different service delivery models. A 
team of professionals might be available in a university setting where faculty not only teach but also 
deliver technical services to the community. More traditionally, the team of rehabilitation professionals, 
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including a rehabilitation engineer, might be available at a hospital or rehabilitation facility. More recently, 
technology professionals might be in private practice either individually, as a team, or part of the 
university, hospital, or rehabilitation facility structure. Another option is the growing number of reha- 
bilitation technology suppliers (RTSs) who offer commercial technology services within the community. 
They work in conjunction with an evaluation specialist and advise consumers as to the technical options 
available to meet their needs. They then sell and service the technology and train the consumer in its 
use. Local chapters of national disability organizations such as United Cerebral Palsy and Easter Seals 
also may have assistive technology services. In recent years, a growing number of centers for independent 
living (CILs) have been developed in each state with federal support. Some of these centers have opted 
to provide assistive technology services, in addition to their information and referral services, which are 
common to all CILs. And finally, there are volunteers, either in engineering schools or community colleges 
(student supervised projects) or in industry (high-technology industries often have staff interested in 
doing community service), such as the Telephone Pioneers. Each model has its pros and cons for the 
consumer, and only after thoroughly researching the options will the person needing the service make 
the best choice as to where to go with his or her need in the community. A word of caution. Only if 
there is timely provision and follow-up available is a service delivery system considered appropriate, even 
if the cost of the service is less. 

A more extensive description of service delivery options may be reviewed in a report that resulted 
from a RESNA-organized conference on service delivery [ANSI/RESNA, 1987]. 

146.3 Education and Quality Assurance 

Professionals on the assistive technology team have a primary degree and credential in their individual 
professions. For example, the occupational or physical therapist will have a degree and most often state 
licensure in occupational or physical therapy. The engineer will have recognized degrees in mechanical, 
electrical, biomedical, or some other school of engineering. However, in order to practice effectively in 
the field of assistive technology, almost all will need advanced training. A number of occupational therapy 
curriculums provide training in assistive technology, but not all. The same is true of several of the others. 
Consumers and payers of assistive technology need to know that professionals practicing in the field of 
assistive technology have a certain level of competency. For this reason, all professionals, including reha- 
bilitation engineers, are pursuing the ATP (assistive technology practitioner) credential through RESNA. 

RESNA 

RESNA, an interdisciplinary association of persons dedicated to the advancement of assistive technology 
for people with disabilities, has a credentialing program that credentials individuals on the assistive 
technology team. As part of the process, the minimum skills and knowledge base for practitioners is 
tested. Ties with professional organizations are being sought so that preservice programs will include at 
least some of the knowledge and skills base necessary. Continuing education efforts by RESNA and others 
also will assist in building the level of expertise of practitioners and consumers. At this time RESNA has 
a voluntary credentialling process to determine if a person meets a predetermined minimal standard of 
practice in the field of Assistive Technology. Persons who meet the prerequisite requirements, pass a 
written exam, and agree to abide by the RESNA Standards of Practice can declare themselves as RESNA 
certified. They can add the ATP if they are practitioners or ATS if they are suppliers of assistive technology. 
Payment for technology and the services required for its application is complex and changing rapidly 
as health care reform evolves. It is beyond the scope of this discussion to detail the very convoluted and 
individual process required to ensure that people with disabilities receive what they need. However, there 
are some basic concepts to be kept in mind. Professionals need to be competent. The documentation of 
need and the justification of selection must be comprehensive. Time for a person to do this must be 
allocated if there is to be success. Persistence, creativity, education of the payers, and documentation of 
need and outcomes are the key issues. 
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146.4 Specific Impairments and Related Technologies 

Current information related to specific technologies is best found in brochures, trade magazines {Report 
Rehab), exhibit halls of technology- related conferences, and databases such as ABLEDATA. Many suppliers 
and manufacturers are now maintaining Websites, which provides a quick means to locate information 
on current products. What follows is only a brief introduction to specific disabilities areas to which 
assistive technology applications are commonly used. 

Mobility 

Mobility technologies include wheelchairs, walkers, canes, orthotic devices, FES (functional electrical 
stimulation), laser canes, and any other assistive device that would assist a person with a mobility 
impairment, be it motor or sensory, to move about in his or her environment. There are very few people 
who have a working knowledge of all the possible commercial options. Therefore, people usually acquire 
expertise in certain areas, such as wheelchairs. There are hundreds of varieties of wheelchairs, each offering 
a different array of characteristics that need to be understood as part of the selection process. Fortunately, 
there are now several published ways that the practitioner and the consumer can obtain useful informa- 
tion. A classification system has been developed that sets a conceptual framework for understanding the 
different types of wheelchairs that are produced commercially [Hobson, 1990] . Paraplegic News and Sports 
and Spokes annually publish the specifications on most of the manual and powered wheelchairs commonly 
found in the North American marketplace. These reviews are based on standardized testing that is carried 
out by manufacturers following the ANSI/RESNA wheelchair standards [ANSI/RESNA, 1990]. Since the 
testing and measurements of wheelchairs are now done and reported in a standard way, it is possible to 
make accurate comparisons between products, a tremendous recent advancement for the wheelchair 
specialist and the users they serve [Axelson et al., 1994]. 

Possibly the most significant advancement in wheelchairs is the development and application of industry, 
on an international scale, for testing the safety and durability of their products. These standards also mandate 
what and how the test information should be made available in the manufacturer's presale literature. The 
Rehabilitation Engineering Research Center and University of Pittsburgh [RERC, 1999] maintains a large 
Website, where among its many resources is a listing of wheelchair research publications and a general 
reference site, termed Wheelchairnet [Wheelchairnet, 1999]. The RERC site also tracks the current activities 
occurring in many of the wheelchair standards working groups. Finally, Cooper [1995, 1998] has published 
two excellent reference texts on rehabilitation engineering with emphasis on wheeled mobility. 

Sitting 

Many people cannot use the wheelchairs as they come from the manufacturer. Specialized seating is 
required to help persons to remain in a comfortable and functional seated posture for activities that 
enable them to access work and attend educational and recreational activities. Orthotic supports, seating 
systems in wheelchairs, chairs that promote dynamic posture in the workplace, and chairs for the elderly 
that fit properly, are safe, and encourage movement all fit into the broad category of sitting technology. 

Sensation 

People with no sensation are prone to skin injury. Special seating technology can assist in the preventions 
of tissue breakdown. Specially designed cushions and backs for wheelchairs and mattresses that have 
pressure-distributing characteristics fall into this category. Technology also has been developed to measure 
the interface pressure. These tools are now used routinely to measure and record an individual's pressure 
profile, making cushion selection and problem solving more of a science than an art. 

Again, a classification system of specialized seating has been developed that provides a conceptual 
framework for understanding the features of the various technologies and their potential applications. 
The same reference also discusses the selection process, evaluation tools, biomechanics of supported 
sitting, and materials properties of weight-relieving materials [Hobson, 1990]. 
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Access (Person-Machine Interface) 

In order to use assistive technology, people with disabilities need to be able to operate the technology. 
With limitations in motor and/or sensory systems, often a specially designed or configured interface 
system must be assembled. It could be as simple as several switches or a miniaturized keyboard or as 
complex as an integrated control system that allows a person to drive a wheelchair and operate a computer 
and a communication device using only one switch. 

Communication 

Because of motor or sensory limitations, some individuals cannot communicate with spoken or written 
word. There are communication systems that enable people to communicate using synthesized voice or 
printed output. Systems for people who are deaf allow them to communicate over the phone or through 
computer interfaces. Laptop computers with appropriate software can enable persons to communicate 
faster and with less effort than previously possible. Some basic guidelines for selecting an augmentative 
communication system, including strategies for securing funding, have been proposed in an overview 
chapter by James Jones and Winifred Jones [Jones & Jones, 1990]. 

Transportation 

Modified vans and cars enable persons with disabilities to independently drive a vehicle. Wheelchair tie- 
downs and occupant restraints in personal vehicles and in public transportation vehicles are allowing 
people to be safely transported to their chosen destination. Fortunately, voluntary performance standards 
for restraint and tie-down technologies have been developed by a task group within the Society for 
Automotive Engineers (SAE). Standards for car hand controls, van body modifications, and wheelchair 
lifts are also available from SAE. These standards provide the rehabilitation engineer with a set of tools 
that can be used to confirm safety compliance of modified transportation equipment. Currently in process 
and still requiring several more years of work are transport wheelchair and vehicle power control standards. 

Activities of Daily Living (ADL) 

ADL technology enables a person to live independently as much as possible. Such devices as environ- 
mental control units, bathroom aids, dressing assists, automatic door openers, and alarms are all con- 
sidered aids to daily living. Many are inexpensive and can be purchased through careful selection in stores 
or through catalogues. Others are quite expensive and must be ordered through vendors who specialize 
in technology for independent living. 

Ron Mace, now deceased and creator of the Center for Universal Design at the North Carolina State 
University, is widely acknowledged as the father of the Universal Design concept. The concept of universal 
design simply means that if our everyday built environments and their contained products could be 
designed to meet the needs of a wider range of people, both young and old, then the needs of more 
persons with disabilities would be met without the need for special adaptions [Center for Universal 
Design, 1999]. Others like Paul Grayson have also published extensively regarding the need to re-think 
how we design our living environments [Grayson, 1991] . Vanderheiden and Denno have prepared human 
factors guidelines that provide design information to allow improved access by the elderly and persons 
with disabilities [Denno et al, 1992; Vanderheiden & Vanderheiden, 1991; Trace Center, 1999]. 

School and Work 

Technology that supports people in the workplace or in an educational environment can include such 
applications as computer workstations, modified restrooms, and transportation to and from work or 
school. Students need the ability to take notes and do assignments, and people working have a myriad 
of special tasks that may need to be analyzed and modified to enable the employee with the disability to 
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be independent and productive. Weisman has presented an extensive overview of rehabilitation engineer- 
ing in the workplace, which includes a review of different types of workplaces, the process of accommo- 
dation, and many case examples [Weisman, 1990]. 

Recreation 

A component of living that is often overlooked by the professional community is the desire and, in fact, 
need of people with disabilities to participate in recreational activities. Many of the adaptive recreational 
technologies have been developed by persons with disabilities themselves in their effort to participate 
and be competitive in sports. Competitive wheelchair racing, archery, skiing, bicycles, and technology 
that enables people to bowl, play pool, and fly their own airplanes are just a few areas in which equipment 
has been adapted for specific recreational purposes. 

Community and Workplace Access 

There is probably no other single legislation that is having a more profound impact on the lives of people 
with disabilities then the Americans with Disabilities Act (ADA), signed into law by President Bush in 
August of 1990. This civil rights legislation mandates that all people with disabilities have access to public 
facilities and that reasonable accommodations must be made by employers to allow persons with dis- 
abilities to access employment opportunities. The impact of this legislation is now sweeping America 
and leading to monumental changes in the way people view the rights of persons with disabilities. 

146.5 Future Developments 

The field of rehabilitation engineering, both in research and in service delivery, is at an important 
crossroad in its young history. Shifting paradigms of services, reduction in research funding, consumer- 
ism, credentialing, health care reform and limited formal educational options all make speculating on 
what the future may bring rather hazy. Given all this, it is reasonable to say that one group of rehabilitation 
engineers will continue to advance the state of the art through research and development, while another 
group will be on the front lines as members of clinical teams working to ensure that individuals with 
disabilities receive devices and services that are most appropriate for their particular needs. 

The demarcation between researchers and service providers will become clearer, since the latter will 
become credentialed. RESNA and its professional specialty group (PSG) on rehabilitation engineering 
are working out the final credentialing steps for the Rehabilitation Engineer RE and the Rehabilitation 
Engineering Technologist RET. Both must also be an ATP. They will be recognized as valued members 
of the clinical team by all members of the rehabilitation community, including third-party payers, who 
will reimburse them for the rehabilitation engineering services that they provide. They will spend as 
much or more time working in the community as they will in clinical settings. They will work closely 
with consumer-managed organizations who will be the gatekeepers of increasing amounts of government- 
mandated service dollars. 

If these predictions come to pass, the need for rehabilitation engineering will continue to grow. As 
medicine and medical technology continue to improve, more people will survive traumatic injury, disease, 
and premature birth, and many will acquire functional impairments that impede their involvement in 
personal, community, educational, vocational, and recreational activities. People continue to live longer 
lives, thereby increasing the likelihood of acquiring one or more disabling conditions during their lifetime. 
This presents an immense challenge for the field of rehabilitation engineering. As opportunities grow, 
more engineers will be attracted to the field. More and more rehabilitation engineering education 
programs will develop that will support the training of qualified engineers, engineers who are looking 
for exciting challenges and opportunities to help people live more satisfying and productive lives. 
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Leslie A. Geddes Early Investigations 

Purdue Univeisity Clinical Electromyography 



Early Investigations 

The study of bioelectricity started with the Galvani-Volta controversy over the presence of electricity in 
frog muscle [see Geddes and Hoff, 1971]. Galvani likened the sciatic nerve-gastrocnemius muscle to a 
charged Leyden jar (capacitor) in which the nerve was the inner conductor and the surface of the muscle 
was the outer conductor. Therefore, Galvani thought that by joining the two with an arc of dissimilar 
metals, the biologic capacitor was discharged and the muscle twitched. Volta proved conclusively that it 
was the dissimilar metals in contact with tissue fluid that was the stimulus. 

Interestingly, it was found that when the sciatic nerve of a nerve-muscle preparation was laid on the 
cut end of another frog muscle and the nerve was touched to the intact surface, the muscle of the nerve- 
muscle preparation twitched. Here was evidence of stimulation without metal conductors; this experi- 
ment was performed by Matteucci [1842]. 

With the first galvanometers, it was shown that current would be indicated when one electrode was 
placed on the cut end of a frog muscle and the other on the intact surface. This phenomenon became 
known as the injury current or frog current, the cut surface being negative to the intact surface. 

Whereas the foregoing experiments showed that skeletal muscle possessed electricity, little was known 
about its relation to contraction. Matteucci [1842] conducted an ingenious experiment in which he placed 
the nerve of a second nerve-muscle preparation on the intact muscle of a first such preparation and 
stimulated the nerve of the first using an inductorium. He discovered that both muscles contracted. Here 
is the first evidence of the electric activity of contracting skeletal muscle. 

Matteucci and DuBois-Reymond both found that the injury current disappeared when a muscle was 
contracted tetanically. This observation led directly to the concept of a resting membrane potential and 
its disappearance with activity [see Hoff and Geddes, 1957]. 

That human muscle, as well as frog muscle, produced electric activity was demonstrated by Du Bois- 
Reymond [1858] in the manner shown in Fig. HP4.1. With electrodes in saline cups connected to a 
galvanometer, Du Bois-Reymond stated that as soon as the fingers were placed in the cups, the galva- 
nometer needle deflected, and it required some time for a position of equilibrium to be attained. Du 
Bois-Reymond [1858] stated: 

As soon as this state [equilibrium] is attained, the whole of the muscles of one of the arms must be 
so braced that an equilibrium may be established between the flexors and the extensors of all the 
articulations of the limb, pretty much as in a gymnastic school is usually done when one wants to let 
a person feel the development of one's muscles. 

As soon as this is done, the [galvanometer] needle is thrown into movement, its deflection being 
uniformly in such a sense as to indicate in the braced arm "an inverse current," according to Nobili's 
nomenclature; that is to say, a current passing from the hand to the shoulder. The braced arm then 
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FIGURE HP4.1 The first evidence contracting skeletal muscle in man produces an electrical signal. (From Du Bois- 
Reymond [1858].) 



acts the part of the copper in the compound arc of zinc and copper mentioned above. [Du Bois- 
Reymond was referring to the polarity of a voltaic cell in which zinc and copper are the positive and 
negative electrodes, respectively.] 

The rheotome and slow-speed galvanometer were used to reconstruct the form of the muscle action 
potential. However, it was desired to know the time course of the electric change associated with a single 
muscle contraction (twitch), as well as its relationship to the electrical event (action potential). A second 
item of interest was the so-called latent period, that time between the stimulus and the onset of muscle 
contraction, which Helmholtz [1853] reported to be 10 ms for frog muscle. 

Waller [1887] set himself the task of measuring the latent period and the relationship between the 
action potential and the force developed by frog gastrocnemius muscle in response to a single stimulus. 
He found that the onset of the twitch was later than the onset of the action potential, as shown in 
Fig. HP4.2. However, the true form of the muscle action potential and its relationship to the onset of 
the twitch had to await the development of the micropipet electrode, the vacuum-tube amplifier, and 
the cathode-ray oscilloscope. In 1957, Hodgkin and Horowicz [1957] recorded the twitch and action 
potential of a single muscle fiber of the frog. Figure HP4.3 is a copy of their record. Note that the onset 
of the action potential precedes the onset of muscle contraction by about 4 ms. We know that it is the 
action potential that triggers the release of mechanical energy. 
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FIGURE HP4.2 Relationship between the twitch (recorded with a myograph, m) and the action potential (recorded 
with a capillary electrometer, e) of a frog gastrocnemius muscle. The time marks (f) are 1/20 s. (From Waller [1887].) 




30 msec 



FIGURE HP4.3 The relationship between the muscle action potential and the force of concentration in a single 
skeletal muscle fiber in the frog. (From Hodgkin and Horowicz [1957].) 

Clinical Electromyography 

It was well known that when a nerve that innervates a skeletal muscle is cut, the muscle is paralyzed 
immediately; however, days to weeks later (depending on the species), on careful visual examination, the 
individual muscle fibers are seen to be contracting and relaxing randomly, i.e., fibrillating. The first to bring 
the facts together regarding normal muscle action potentials and denervation-fibrillation potentials were 
Denny-Brown and Pennybacker in the United Kingdom [1939]; the date and locale are highly significant. 
They distinguished between involuntary twitching of innervated muscle and fibrillation of denervated 
muscle by recording both the electric and mechanical activity of muscles. Two instrumental advances made 
their study possible: (1) the use of a hypodermic needle electrode inserted into the muscle and (2) the use 
of a rapidly responding, mirror-type photographic recorder, the Matthews [1928] oscillograph. 

The cathode-ray tube was not generally available in the United Kingdom when Matthews [1928] 
constructed his moving-tongue mirror oscillograph. The device consisted of a piece of soft iron mounted 
on a steel leaf spring, as shown in Fig. HP4.4. A strong electromagnet attracted the soft iron, which bent 
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FIGURE HP4.4 The Matthews moving-tongue oscillograph and amplifier used with it. The electromagnetic coil (A) 
provided an attractive force on the tongue (soft iron and steel spring); the signal current applied to the small coils 
aided or opposed this force and caused the tongue to bend more or less and hence move the mirror which reflected 
a beam of light on to a recording surface. (From BHC Matthews, 1928, J. Physiol (Lond) 65:225, with permission.) 

the steel (leaf-spring) support. Two coils mounted on the pole faces were connected to the output tubes 
of a five-stage, single-sided, resistance-capacitance coupled amplifier. The amplified potentials altered 
the current in the electromagnet coils, causing more or less bending of the leaf spring, thereby tilting the 
mirror mounted on the leaf spring and permitting photographic recording of action potentials. 

With the Matthews oscillograph, Denny-Brown and Pennybacker [ 1939] laid the foundation for clinical 
electromyography when they reported as follows: 

Denervated muscle fibers contract periodically, and the confused medley of small twitches constitutes 
true fibrillation. The movement is so slight that it can seldom be seen in the clinic. The twitchings 
appear to be due to heightened excitability of the sarcolemma or rapidly conducting portion of the 
muscle fibre to traces of free acetylcholine in the tissues. 

Reinnervated muscle is free from such involuntary excitation, except for the curious "contracture" 
of facial muscle, which consists of periodic, intense repetitive discharges which suggest a central 
mechanism. 
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Earlier it was stated that 1939 was significant; this was the year when World War II broke out in Europe. 
Soon motor nerve injuries due to shrapnel wounds began to appear in large numbers, and the need for 
electromyography to identify denervation fibrillation potentials and their gradual disappearance with 
reinnervation was urgent. The first electromyograph in North America was developed by Herbert Jasper 
at McGill University (Montreal Neurological Institute). Starting in 1942, design concepts were developed, 
and by 1944 prototypes had been built and used clinically. In his report to the Committee on Army 
Medical Research, Capt. Jasper [1945] stated: 

The present equipment has been developed over a period of about 18 months experimentation with 
different designs of electromyograph for use on hospital wards. Numerous modifications of design 
have been incorporated in the present model in order to provide simplicity of operation, portability, 
freedom from electrical interference, and perfection of both the audible and visible analysis of the 
electrical activity of normal and diseased muscles. A portable clinical electromyograph has been 
developed which has proven to be practical in operation on hospital wards to aid in the diagnosis and 
prognosis of muscles paralyzed by traumatic injuries of their nerve supply. Four complete units have 
been constructed for use in special centers for the treatment of nerve injuries. 

The Royal Canadian Army Medical Corps (RCAMC) electromyograph had many unique design 
features that were incorporated in all later commercially available electromyographs. It consisted of three 
units, a small battery-operated three-stage differential amplifier (Fig. HP4.5) and an oscilloscope 
(Fig. HP4.6), both placed on a loudspeaker cabinet on rubber-wheel casters (Fig. HP4.7). Thus simulta- 
neous visual display and aural monitoring of normal motor units and fibrillation potentials was possible. 

The preamplifier (Fig. HP4.5) was very carefully constructed, the input tubes being supported by 
rubber-mounted antimicrophonic sockets. The grid resistors (R9) and plate resistors (R8) were wire 
wound and carefully matched. A high common-mode rejection ratio was obtained by matching the input 
tubes and adjustment of the potentiometer (R7) in the screen-grid supply. A common-mode rejection 
ratio in excess of 10,000 was easily achieved. The overall frequency response extended from 3 to 10,000 Hz. 

The cathode-ray oscilloscope (Fig. HP4.6) was of unusual design for that time because the sweep 
velocity was independent of the number of sweeps per second, a feature to appear much later in 
oscilloscopes. A linear sweep (time base) was obtained by charging a capacitor (0. 1 (J.F) through a pentode 
(6U7G) which acted as a constant-current device. The sweep was initiated at a rate of about 7 times per 
second by the multivibrator (6N7), which also provided an output to enable stimulating a nerve, the 
stimulus occurring at the beginning of each sweep, thereby permitting nerve conduction-time measure- 
ments. The oscilloscope also contained the audio amplifier (6L6). The cathode-ray tube had a short- 
persistence, blue-white phosphor that produced brilliant blue-white images of remarkable clarity. A 
camera was used to obtain photographic records of the waveforms, which were optimized by listening 
to them via the loudspeaker (as advocated by Adrian) as the needle electrode was being inserted and 
adjusted. Fig. HP4.8 illustrates typical action potentials. 

At the end of the war (1945), oscilloscopes became available, and Fig. HP4.7 shows the RCAMC 
electromyograph with a Cossor oscilloscope (right) and the high-gain differential amplifier (left), both 
on the loudspeaker cabinet, which was on casters. The recessed opening at the top of the loudspeaker 
cabinet face provided access to the on-off and volume controls. 

In addition to the creation of a high-performance EMG unit, Jasper introduced the monopolar needle 
electrode system used in all subsequent EMGs. The needle electrode was insulated with varnish down to 
its tip and was paired with a skin-surface electrode of silver. The patient was grounded by another electrode 
taped to the same member that was being examined. Figure HP4.8 illustrates application of the electrodes 
and typical motor-unit and fibrillation potentials. The report by Jasper and Ballem [1949] summarized 
the experience with the RCAMC electromyograph and laid the foundation for diagnostic EMG. 

World War II ended in 1945, after which electromyographs became available commercially. Their 
features were essentially the same as those embodied in the RCAMC electromyograph. From the begin- 
ning, these units were completely power-line-operated, the author's master's thesis describing the first 
of these units. 
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FIGURE HP4.5 The three-stage resistance-capacity coupled differential amplifier used is the RCAMC electromyo- 
graphy. (From Jasper et al. [1945].) 
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FIGURE HP4.6 The oscilloscope, loudspeaker amplifier, and stimulator unit of the RCAMC electromyograph. 
(From Jasper et al. [1945].) 
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FIGURE HP4.7 A later version of the RCAMC elec- 
tromyograph showing the high-gain preamplifier (left) 
and oscilloscope (right) on top of the loud-speaker cabinet. 
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FIGURE HP4.8 Electrode arrangement and action potentials with the RCAMC electromyograph. (Redrawn from 
Jasper and Ballem [1949].) 
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THE ULTIMATE GOAL OF HUMAN performance engineering is enhancement of the performance 
and safety of humans in the execution of tasks. The field (in a more formalized sense) was fueled 
initially by military applications but has become an important component in industrial settings 
as well. In a biomedical engineering context, the scope of definition applied to the term human not only 
encompasses individuals with capabilities that differ from those of a typical healthy individual in many 
possible different ways (e.g., individuals who are disabled, injured, unusually endowed, etc.) but also 
includes those who are "healthy" (e.g., health care professionals). Consequently , one finds a wide range 
of problems in which human performance engineering and associated methods are employed. Some 
examples include 

• Evaluation of an individual's performance capacities for determining the efficacy of new thera- 
peutic interventions or so-called level of disability for worker's compensation and other medical- 
legal purposes. 

• Design of assistive devices and/or work sites in such a way that a person with some deficiency in 
his or her "performance resource profile" will be able to accomplish specified goals. 

■ Design of operator interfaces for medical instruments that promote efficient, safe, and error-free 
use. 

In basic terms, each of these situations involves one or more of the following: (1) a human, (2) a task 
or tasks, and (3) the interface of a human and task(s). Human performance engineering emphasizes 
concepts, methods, and tools that strive toward treatment of each of these areas with the engineering 
rigor that is routinely applied to artificial systems (e.g., mechanical, electronic, etc.). Importance is thus 
placed on models (a combination of cause-and-effect and statistical), measurements (of varying degrees 
of sophistication that are selected to fit needs of a particular circumstance), and various types of analyses. 
Whereas many specialty areas within biomedical engineering begin with an emphasis on a specific 
subsystem and then proceed to deal with it at lower levels of detail (sometimes even at the molecular 
level) to determine how it functions and often why it malfunctions, human performance engineering 
emphasizes subsystems and their performance capacities (i.e., how well a system functions), the integra- 
tion of these into a whole and their interactions, and their operation in the execution of tasks that are 
of ultimate concern to humans. These include tasks of daily living, work, and recreation. In recent years, 
there has been an increased concern within medical communities on issues such as quality of life, 
treatment outcome measures, and treatment cost-effectiveness. By linking human subsystems into the 
"whole" and discovering objective quantitative relationships between the human and tasks, human 
performance engineering can play an important role in addressing these and other related concerns. 

Human performance engineering combines knowledge, concepts, and methods from across many 
disciplines (e.g., biomechanics, neuroscience, psychology, physiology, and many others) which, in their 
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overlapping aspect, all deal with similar problems. Among current difficulties is that these wide-ranging 
efforts are not linked by a conceptual framework that is commonly employed across contributing disci- 
plines. In fact, few candidate frameworks exist even within the relevant disciplines. One attempt to provide 
some unification and commonality is presented in Chapter 147 as basis for readers to integrate material 
in subsequent chapters and from other sources. In a further attempt to enhance continuity across this 
section, chapter authors have been requested to consider this perspective and to incorporate basic 
concepts and terms where applicable. 

Chapters 148 through 150 look"toward the human" and focus on measurement of human performance 
capacities and related issues. Owing to a combination of the complexity of the human system (even when 
viewed as a collection of rather high-level subsystems) and limited space available, treatment is not 
comprehensive. For example, measurement of sensory performance capacities (e.g., tactile, visual, audi- 
tory) is not included in this edition. Both systems and tasks can be viewed at various hierarchical levels. 
Chapters 148 and 149 focus on a rather "low" systems level and discuss basic functional units such as 
actuator, processor, and memory systems. Chapter 150 moves to a more intermediate level, where speech, 
postural control gait, and hand-eye coordination systems could be considered. Measurement of structural 
parameters, which play important roles in many analyses, also is not allocated the separate chapter it 
deserves (as a minimum) due to space limitations. Chapter 151 and 152 then shift focus to consider the 
analysis of different types of tasks in a similar, representative fashion. 

Chapters 153 through 155 are included to provide insight into a representative selection of application 
types. Space constraints, the complexity of human performance, and the great variety of tasks that can 
be considered limit the level of detail with which such material can reasonably be presented. Work in all 
application areas will begin to benefit from emerging computer-based tools, which is the theme of chapter 
156. The section concludes with a look to the future (Chapter 157) that summarizes selected current 
limitations, identifies some specific research and development needs, and speculates regarding the nature 
of some anticipated developments. 

Many have contributed their talents to this exciting field in terms of both research and applications, 
yet much remains to be done. I am indebted to the authors not only for their contributions and 
cooperation during the preparation of this section but also for their willingness to accept the burdens 
of communicating complex subject matter reasonably, selectively, and as accurately as possible within 
the imposed constraints. 
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Humans are complex systems. Our natural interest in ourselves and things that we do has given rise to 
the study of this complex system at every conceivable level ranging from genetic, through cellular and 
organ systems, to interactions of the total human with the environment in the conduct of purposeful 
activities. At each level, there are corresponding practitioners who attempt to discover and rectify or 
prevent problems at the respective level. Some practitioners are concerned with specific individuals, while 
others (e.g., biomedical scientists and product designers) address populations as a whole. Problems dealt 
with span medical and nonmedical contexts, often with interaction between the two. Models play a key 
role not only in providing conceptual understanding of key issues at each level but also in describing 
relationships between various levels and providing frameworks that allow practitioners to obtain reason- 
ably predictable results in a systematic and efficient fashion. In this chapter, a working model for human 
system-task interfaces is presented. Any such model must, of course, not only consider the interface per 
se but also representations of the human system and tasks. The model presented here, the elemental 
resource model (ERM), represents the most recent effort in a relatively small family of models that attempt 
to address similar needs. 

147.1 Background 

The interface between a human and a task of daily living (e.g., work, recreation, or other) represents a 
level that is quite high in the hierarchy noted above. One way in which to summarize previous efforts 
directed at this level, across various application contexts, is to recognize two different lines along which 
study has evolved: (1) bottom-up and (2) top-down. Taken together, these relative terms imply a focus of 
interest at a particular level of convergence. Here, this special level is termed the human-task interface 
level. It is emphasized that bottom-up and top-down terms are used here to characterize the general 
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course of development and not specific approaches applied at a particular instant in time. A broad view 
is necessary to grapple with the many previous efforts that either are, or could be, construed to be 
pertinent. 

The biomedical community has approached the human-task interface largely along the bottom-up 
path. This is not surprising given the historical evolution of interest first in anatomy (human structure) 
and then physiology (function). The introduction of chemistry (giving rise to biochemistry) and the 
refinement of the microscope provided motivations to include even lower hierarchical levels of inquiry 
and of a substantially different character. Models in this broad bottom-up category begin with anatomic 
components and include muscles, nerves, tendons (or subcomponents thereof), or subsets of organs (e.g., 
heart, lungs, vasculature, etc.). They often focus on relationships between components and exhibit a 
scope that stays within the confines of the human system. Many cause-and-effect models have been 
developed at these lower levels for specific purposes (e.g., to understand lines of action of muscle forces 
and their changes during motion about a given joint). 

As a natural consequence of linkages that occur between hierarchical levels and our tendency to utilize 
that which exists, consideration of an issue at any selected level (in this case, the human-task interface 
level) brings into consideration all lower levels and all models that have been put forth with the stated 
purpose of understanding problems or behaviors at the original level of focus. The amount of detail that 
is appropriate or required at these original, lower levels results in great complexity when applied to the 
total human at the human-task interface level. In addition, many lower-level modeling efforts (even those 
which are quantitative) are aimed primarily at obtaining a basic scientific understanding of human 
physiology or specific pathologies (i.e., pertaining to populations of humans). In such circumstances, 
highly specialized, invasive, and cumbersome laboratory procedures for obtaining the necessary data to 
populate models are justified. However, it is difficult and sometimes impossible to obtain data describing 
a specific individual to utilize in analyses when such models are extended to the human-task interface 
level. Another result of drawing lower-level models (and their approaches) into the human-task interface 
context is that the results have a specific and singular character (e.g., biomechanical versus neuromuscular 
control versus psychologic, etc.) [e.g., Card et al., 1986; Delp et al., 1990; Gottlieb et al., 1989; Hemami, 
1988; Schoner & Kelso, 1988]. Models that incorporate most or all of the multiple aspects of the human 
system or frameworks for integrating multiple lower-level modeling approaches have been lacking. Lower- 
level models that serve meaningful purposes at the original level of focus have provided and will continue 
to provide insights into specific issues related to human performance at multiple levels of considerations. 
However, their direct extension to serve general needs at the human-task interface level has inherent 
problems; a different approach is suggested. 

A top-down progression can be observed over the major history in human factors/ergonomic 
[Gilbreth & Gilbreth, 1917; Taylor, 1911] and vocational assessment [e.g., Botterbusch, 1987] fields 
(although the former has more recently emphasized a "human-centered" concept with regard to design 
applications). In contrast to the bottom-up path, in which anatomic components form the initial basis 
of modeling efforts, the focus along the top-down path begins with consideration of the task or job that 
is to be performed by the total human. The great variety in the full breadth of activities in which humans 
can be engaged gives rise to one aspect of complexity at this level that pertains to taxonomies for job 
and task classification [e.g., Fleishman & Quaintance, 1984; Meister, 1989; U.S. Department of Labor, 
1992]. Another enigmatic aspect that quickly adds complexity with respect to modeling concerns the 
appropriate level to be used to dissect the items at the highest level (e.g., jobs) into lower-level components 
(e.g., tasks and subtasks). In fact, the choice of level is complicated by the fact that no clear definition 
has evolved for a set of levels from which to choose. 

After progressing down through various levels at which all model elements represent tasks and are 
completely outside the confines of the human body, a level is eventually reached where one encounters 
the human. Attempts to go further have been motivated, for example, by desires to predict performance 
of a human in a given task (e.g., lifting an object, assembling a product, etc.) from a set of measures that 
characterizes the human. At the human-task interface level, difficulty is encountered with regard to the 
strategy for approaching a system as complex, multifaceted, and multipurpose as a human [Fleishman 
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& Quaintance, 1984; Wickens, 1984]. In essence, the full scope of options that have emerged from the 
bottom-up development path is now encountered from the opposite direction. Options range from 
relatively gross analyses (e.g., estimates of the "fraction" of a task that is physical or mental) to those 
which are much more detailed and quantitative. The daunting prospect of considering a "comprehensive 
quantitative model" has led to approaches and models, argued to be "more practical," in which sets of 
parameters are often selected in a somewhat mysterious fashion based on experience (including previous 
research) and intuition. The selected parameters are then used to develop predictive models, most of 
which have been based primarily on statistical methods (i.e., regression models) [Fleishman, 1967; 
Fleishman & Quaintance, 1984]. Although the basic modeling tools depend only on correlation, it is 
usually possible to envision a causal link between the independent variables selected (e.g., visual acuity) 
and the dependent variable to be predicted (e.g., piloting an aircraft). Models (one per task) are then 
tested in a given population and graded with regard to their prediction ability, the best of which have 
performed marginally [Kondraske and Beehler, 1994]. Another characteristic associated with many of 
the statistically based modeling efforts from the noted communities is the almost exclusive use of healthy, 
"normal" subjects for model development (i.e., humans with impairments were excluded). Homogeneity 
is a requirement of such statistical models, leading to the need for one model per task per population 
(at best). Also, working with a mindset that considers only normal subjects can be observed to skew 
estimates regarding which of the many parameters that one might choose for incorporation in a model 
are "most important." The relatively few exceptions that employ cause-and-effect models (e.g., based on 
physical laws) at some level of fidelity [e.g., Chaffin & Andersonn, 1984] often adopt methods that have 
emerged from the bottom-up path and are, as noted above, limited in character at the "total human" 
level (e.g., "biomechanical" in the example cited). 

The issue is not that no useful models have emerged from previous efforts but rather that no clear 
comprehensive strategy has emerged for modeling at the human-task interface level. A National Research 
Council panel on human performance modeling [Baron et al., 1990] considered the fundamental issues 
discussed here and also underscored needs for models at the human-task interface level. While it was 
concluded that an all-inclusive model might be desirable (i.e., high fidelity, in the sense that biomechan- 
ical, information processing, sensory and perceptual aspects, etc. are represented), such a model was 
characterized as being highly unlikely to be achieved and perhaps ultimately not useful because it would 
be overly complex for many applications. The basic recommendation made by this panel was to pursue 
development of more limited scope submodels. The implication is that two or more submodels could 
be integrated to achieve a broader range of fidelity, with the combination selected to meet the needs of 
particular situations. The desire to "divide efforts" due to inherent complexity of the problem also surfaces 
within the histories of the bottom-up and top-down development paths discussed above. While a rea- 
sonable concept in theory, one component in the division of effort that has consistently been underrep- 
resented is the part that ties together the so-called submodels. Without a conceptual framework for 
integration of relatively independent modeling efforts and a set of common modeling constructs, pros- 
pects for long-term progress are difficult to envision. This, along with the recognition that enough work 
had been undertaken in the submodel areas so that key issues and common denominators could be 
identified, motivated development of the ERM. 

The broad objectives of the ERM [Kondraske, 1994] are most like those of Fleishman and colleagues 
[Fleishman, 1966, 1972,1982; Fleishman & Quaintance, 1984], whose efforts in human performance are 
generally well known in many disciplines. These are the only two efforts known that (1) focus on the 
total human in a task situation (i.e., directly address the human-task interface level); (2) consider tasks 
in general, and not only a specific task such as gait, lifting, reading, etc.; (3) incorporate all aspects of 
the total human system (e.g., sensory, biomechanical, information processing, etc.); and (4) aim at 
quantitative models. There are also some similarities with regard to the incorporation of the ideas of 
"abilities" (of humans) and "requirements" (of tasks). The work of Fleishman and colleagues has thus 
been influential in shaping the ERM either directly or indirectly through its influence of others. However, 
there are several substantive conceptual differences that have resulted in considerably different endpoints. 
Fleishman's work emerged from "the task" perspective and is rooted in psychology, whereas the ERM 
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emerges from the perspective of "human system architecture" and is rooted in engineering methodology 
with regard to quantitative aspects of system performance but also incorporates psychology and physi- 
ology. Both approaches address humans and tasks, and both efforts contain aspects identifiable with 
psychology and engineering, as they ultimately must. These different perspectives, however, may explain 
in part some of the major differences. Aspects unique to the ERM include (1) the use of a resource 
construct for modeling and measurement of all aspects of a system's performance, (2) the use of cause- 
and-effect resource economic principles (i.e., the idea of threshold "costs" for achieving a given level of 
performance in any given high-level task), (3) the concept of monadology (i.e., the use of a finite set of 
"elements" to explain a complex phenomenon), and (4) a consistent strategy for identifying performance 
elements at different hierarchical levels. 

The ERM attempts to provide a quantitative and relatively straightforward framework for character- 
izing the human system, tasks, and the interface of the human to tasks. It depends in large part on, and 
evolves directly from, a separate body of material referred to collectively as general systems performance 
theory (GSPT). GSPT was developed first and independently, i.e., removed from the human system 
context. It incorporates resource constructs exclusively for modeling of the abstract idea of system 
performance, including specific rules for measuring performance resource availability and resource eco- 
nomic principles to provide a cause-and-effect analysis of the interface of any system (e.g., humans) to 
tasks. The concept of a performance model is emphasized and distinguished from other model types. 

147.2 Basic Principles 

The history of the ERM and the context in which it was developed are described elsewhere [Kondraske, 
1987a, 1990b, 2000]. It is important to note that the ERM is derived from the combination of GSPT with 
the philosophy of monadology and their application to the human system. As such, these two constituents 
are briefly reviewed before presenting and discussing the actual ERM. 

General Systems Performance Theory 

The concept of performance now pervades all aspects of life, especially decision-making processes that 
involve both human and artificial systems. Yet it has not been well understood theoretically, and systematic 
techniques for modeling and its measurement have been lacking. While a considerable body of material 
applicable to general systems theory exists, the concept of performance has not been incorporated in it, 
nor has performance been addressed in a general fashion elsewhere. Most of the knowledge that exists 
regarding performance and its quantitative treatment has evolved within individual application contexts, 
where generalizations can easily be elusive. 

Performance is multifaceted, pertaining to how well a given system executes an intended function and 
the various factors that contribute to this. It differs from behavior of a system in that "the best of 
something" is implied. The broad objectives of GSPT are 

1. To provide a common conceptual basis for defining and measuring all aspects of the performance 
of any system. 

2. To provide a common conceptual basis for the analysis of any task in a manner that facilitates 
system-task interface assessments and decision making. 

3. To identify cause-and-effect principles, or laws, that explain what occurs when any given system 
is used to accomplish any given task. 

While GSPT was motivated by needs in situations where the human is "the system" of interest and it was 
first presented in this context [Kondraske, 1987a], application of it has been extended to the context of 
artificial systems. These experiences range from computer vision and sensor fusion [Yen & Kondraske, 
1992] to robotics [Kondraske & Khoury, 1992; Kondraske & Standridge, 1988]. 

A succint statement of GSPT designed to emphasize key constructs is presented below in a steplike 
format. The order of steps is intended to suggest how one might approach any system or system-task 
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interface situation to apply GSPT. While somewhat terse and "to-the-point," it is nonetheless an essential 
prerequisite for a reasonably complete understanding the ERM. 

1. Within a domain of interest, select any level of abstraction and identify the system(s) of interest 
(i.e., the physical structure) and its function (i.e., purpose). 

2. Consider "the system" and "the task" separately. 

3. Use a resource construct to model the system's performance. First, consider the unique intangible 
qualities that characterize how well a system executes its function. Each of these is considered to 
represent a unique performance resource associated with a specific dimension of performance (e.g., 
speed, accuracy, stability, smoothness, "friendliness," etc.) of that system. Each performance 
resource is recognized as a desirable item (e.g., endurance versus fatigue, accuracy versus error, 
etc.) "possessed" by the system in a certain quantitative amount. Thus one can consider quantifying 
the amount of given quality available. As illustrated, an important consequence of using the 
resource construct at this stage is that confusion associated with duality of terms is eliminated. 

4. Looking toward the system, identify all I dimensions of performance associated with it. In situa- 
tions where the system does not yet exist (i.e., design contexts), it is helpful to note that dimensions 
of performance of the system are the same as those of the task. 

5a. Keeping the resource construct in mind, define a parameterized metric for each dimension of 
performance (e.g., speed, accuracy, etc.). If the resource construct is followed, values will be 
produced with these metrics that are always nonnegative. Furthermore, a larger numeric value 
will consistently represent more of a given resource and therefore more performance capacity. 

5b. Measure system performance with the system removed from the specific intended task. This is a 
reinforcement of Step 2. The general strategy is to maximally stress the system (within limits of 
comfort and/or safety, when appropriate) to define its performance envelope, or more specifically, 
the envelope that defines performance resource availability, R A Jt). Note that R As (t) is a continuous 
surface in the system's nonnegative, multidimensional performance space. Also note that unless 
all dimensions of performance and parameterized metrics associated with each are defined using 
the resource construct, a performance envelope cannot be guaranteed. Addressing the issue of 
measurement more specifically , consider resource availability values R A .\ Q . t (t) for i = 1 to /, 
associated with each of the J dimensions of performance. Here, each Q ; k represents a unique 
condition, in terms of a set of values _R, along other identified dimensions of performance, under 
which a specific resource availability {R Al ) is measured; i.e., Q ik = {R lk , R 2 »:>•••> R p k\ f° r au P ^ l 
(1 > p> I). The subscript k is used to distinguish several possible conditions under which a given 
resource availability (R Al , for example) can be measured. These values are measured using a set 
of "test tasks," each of which is designed to maximally stress the system (within limits of comfort 
and/or safety, when appropriate): (a) along each dimension of performance individually (where 
Q i i c = Q i0 = {0,0, ...,0}) or (b) along selected subsets of dimensions of performance simultaneously 
(i.e., Q, t = Q, „, where each possible Q, n has one or more nonzero elements). The points obtained 
[R A .lg..(f)] provide the basis to estimate the performance envelope R As (t). Note that if only on- 
axis points are obtained (e.g., maximally stress one specific performance resource availability with 
minimal or no stress on other performance resources, or the Q i0 condition), a rectangular or 
idealized performance envelope is obtained. A more accurate representation, which would be 
contained within the idealized envelope, can be obtained at the expense of making additional 
measurements or the use of known mathematic functions based on previous studies that define 
the envelope shape in two or more dimensions. 

5c. Define estimates of single-number system figures-of-merit, or composite performance capacities, as 
the mathematical product of all or any selected subset oi R A .\ Q (t). If more accuracy is desired and 
a sufficient number of data points is available from the measurement process described in Step 
5b, composite performance capacities can be determined by integration over R As (t) to determine 
the volume enclosed by the envelope. The composite performance capacity is a measure of perfor- 
mance at a higher level of abstraction than any individual dimension of performance at the "system" 
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level, representing the capacity of the system to perform tasks that place demands on those perfor- 
mance resources availabilities included in the calculation. Different composite performance capacities 
can be computed for a given system by selecting different combinations of dimensions of perfor- 
mance. Note that the definition of a composite performance capacity used here preserves dimen- 
sionality; e.g., if speed and accuracy dimensions are included in the calculation, the result has 
units of speed x accuracy. (This step is used only when needed, e.g., when two general-purpose 
systems of the same type are to be compared. However, if decision making that involves the 
interface of a specific system to a specific task is the issue at hand, a composite performance 
capacity is generally of any use only to rule out candidates). 

6. Assess the "need for detail." This amounts to a determination of the number of hierarchical levels 
included in the analysis. If the need is to determine if the currently identified "system" can work 
in the given task or how well it can execute its function, go to Step 7 now. If the need is to determine 
the contribution of one or more constituent subsystem or why a desired level of performance is not 
achieved at the system level, repeat Steps 1 to 5 for all / functional units (subsystems), or a selected 
subset thereof based on need, that form the system that was originally identified in Step 1; i.e., go 
to the next-lowest hierarchical level. 

7. At the "system" level, look toward the task(s) of interest. Measure, estimate, or calculate demands 
on system performance resources (e.g., the speed, accuracy, etc. required), _R D l Q - .it), where the 
notation here is analogous to that employed in Step 5b. This represents the quantitative definition 
and communication of goals, or the set of values P HLT representing level of performance P desired 
in a specific high-level task (HLT). Use a worst-case or other less-conservative strategy (with due 
consideration of the impact of this choice) to summarize variations over time. This will result in 
a set of M points {R Dm for m = 1 to M) that lie in the multidimensional space defined by the set 
of / dimensions of performance. Typically, M> I. 

8. Use resource economic principles (i.e., require R A > R D for "success") at the system level and at all 
system-task interfaces at the subsystem level (if included) to evaluate success/failure at each 
interface. More specifically, for a given system-task interface, all task demand points (i.e., the set 
of R Dm associated with a given task or subtask) must lie within the performance resource envelope 
R As (t) of the corresponding system. This is the key law that governs system-task interfaces. If a 
two-level model is used (i.e., "system" and "subsystem" levels are incorporated), map system-level 
demands to demands on constituent subsystems. That is, functional relationships between P HLT 
and demands imposed on constituent subsystems [i.e., R D .(t, P HLT )] must be determined. The 
nature of these mappings depends on the type of systems in question (e.g., mechanical, information 
processing, etc.). The basic process includes application of Step 7 to the subtasks associated with 
each subsystem. If resource utilization flexibility (i.e., redundancy in subsystems of similar types) 
exists, select the "best" or optimal subsystem "configuration (handled in GSPT with the concept 
of performance resource substitution) and procedure (i.e., use of performance resources over time) 
as that which allows accomplishment of goals with minimization of stress on available performance 
resources across all subsystems and over the time of task execution. Thus redundancy is addressed 
in terms of a constrained performance resource optimization problem. Stress on individual per- 
formance resources is defined as < R D ..(t, P HLT )I R A {t) < 1. It is also useful to define and take 
note of reserve capacity, i.e., the margin between available and utilized performance resources. 

The preceding statement is intended to reflect the true complexity that exists in systems, tasks, and 
their interfaces when viewed primarily from the perspective of performance. This provides a basis for 
the judicious decision making required to realize "the best" practical implementation in a given situation 
where many engineering tradeoffs must be considered. While a two-level approach is described above, 
it should be apparent that it can be applied with any number of hierarchical levels by repeating the steps 
outlined in an iterative fashion starting at a different level each time. A striking feature of GSPT is the 
threshold effect associated with the resource economic principle. This nonlinearity has important impli- 
cations in quantitative human performance modeling, as well as interesting ramifications in practical 
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applications such as rehabilitation, sports, and education. Note also that no distinction is made as to 
whether a given performance resource is derived from a human or artificial system; both types of systems, 
or subcomponents thereof, can be incorporated into models and analyses. 

Monadology 

Monadology dates back to 384 B.C. [Neel, 1977] but was formalized and its importance emphasized by 
Gottfried Wilhelm Leibniz, inventor of calculus, in his text Monadologia in 1714. This text presents what 
is commonly called Leibniz's Monadology and has been translated and incorporated into contemporary 
philosophy texts (e.g., Leibniz and Montgomery, 1992). It is essentially the idea of "basic elements" vis 
a vis chemistry, alphabets, genetic building blocks, etc. The concept is thus already well accepted as being 
vital to the systematic description of human systems from certain perspectives (i.e., chemical, genetic). 
Success associated with previous applications of monadology, whether intentional or unwitting (i.e., 
discovered to be at play after a given taxonomy has emerged), compels its serious a priori consideration 
for other problems. 

Insight into how monadology is applied to human performance modeling is perhaps more readily 
obtained with reference to a widely known example in which monadology is evident, such as chemistry. 
Prior to modern chemistry (i.e., prior to the introduction of the periodic table), alchemy existed. The 
world was viewed as being composed of an infinite variety of unique substances. The periodic table 
captured the notion that this infinite variety of substances could all be defined in terms of a finite set of 
basic elements. Substances have since been analyzed using the "language" of chemistry and organized 
into categories of various complexity, i.e., elements, simple compounds, complex compounds, etc. Despite 
the fact that this transition occurred approximately 200 years ago, compounds remain that have yet to 
be analyzed. Furthermore, the initial periodic table was incorrect and has undergone revision up to 
relatively recent times. Analogously, in the alchemy of human performance, the world is viewed as being 
composed of an infinite variety of unique tasks. A "chemistry" can be envisioned that first starts with 
the identification of the "basic elements" or, more specifically, basic elements of performance. Simple and 
complex tasks are thus analogous to simple and complex compounds, respectively. The analogy carries 
over to quantitative aspects of GSPT as well. Consider typical equations of chemical reactions with 
resources on the left and products (i.e., tasks) on the right. Simple compounds (tasks) are realized by 
drawing on basic elements in the proper combination and amounts. The amount of "product" (level of 
performance in a high-level task) obtained depends on availability of the limiting resource. 

Another informative aspect of this analogy is the issue of how to deal with the treatment of hierarchical 
level. Clearly, the chemical elements are made up of smaller particles (e.g., protons, neutrons, and 
electrons). Physicists have identified even smaller, more elusive entities, such as bosons, quarks, etc. Do 
we need to consider items at this lowest level of abstraction each time a simple compound such as 
hydrochloric acid is made? Likewise, the term basic in basic elements of performance is clearly relative 
and requires the choice of a particular hierarchical level of abstractions for the identification of systems 
or basic functional units, a level that is considered to be both natural and useful for the purpose at hand. 
Just as it is possible but not always necessary or practical to map chemical elements down to the atomic 
particle level, it is possible to consider mapping a basic element of performance (see below) such as elbow 
flexor torque production capacity down to the level of muscle fibers, biochemical reactions at neuromus- 
cular junctions, etc. 

The Elemental Resource Model 

The resource and resource economic constructs used in GSPT specifically to address performance have 
employed and have become well established in some segments of the human performance field, specif- 
ically with regard to attention and information processing [Navon & Gopher, 1979; Wickens, 1984]. 
However, in these cases, the term resource is used mostly conceptually (in contrast to quantitatively), 
somewhat softly defined, and applied to refer in various instances to systems (e.g., different processing 
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FIGURE 147.1 The elemental resource model contains multiple hierarchical levels. Performance resources (i.e., the 
basic elements) at the "basic element level" are finite in number, as dictated by the finite set of human subsystems 
and the finite set of their respective dimensions of performance. At higher levels, new "systems" can be readily created 
by configuration of systems at the basic element level. Consequently, there are an infinite number of performance 
resources (i.e., higher-level elements) at these levels. However, rules of general systems performance theory (refer to 
text) are applied at any level in the same way, resulting in the identification of the system, its function, dimensions 
of performance, performance resource availabilities (system attributes), and performance resource demands (task 
attributes). 

centers), broad functions (e.g., memory versus processing), and sometimes to infer a particular aspect 
of performance(e.g., attentional resources). In the ERM, through the application of GSPT, these con- 
structs are incorporated universally (i.e., applied to all human subsystems) and specifically to model 
"performance" at both conceptual and quantitative levels. In addition to the concept of monadology, the 
insights of others [Shoner & Kelso, 1988, Turvey et al., 1978] were valuable in reinforcing the basic 
"systems architecture" employed in the ERM and in refining description of more subtle, but important 
aspects. 

As illustrated in Fig. 147.1, the ERM contains multiple hierarchical levels. Specifically, three levels are 
defined: (1) the basic element level, (2) the generic intermediate level, and (3) the high level. GSPT is to 
define performance measures at any hierarchical level. This implies that to measure performance, one 
must isolate the desired system and then stress it maximally along one dimension of performance (or 
more, if interaction effects are desired) to determine performance resource availability. For example, 
consider the human "posture stabilizing" system (at the generic intermediate level), which is stressed 
maximally along a stability dimension. As further illustrated below, the basic element level represents 
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measurable stepping stones in the human system hierarchy between lower-level systems (i.e., ligaments, 
tendons, nerves, etc.) and higher-level tasks. 

A summary representation emphasizing the basic element level of the ERM is depicted in Fig. 147.2. 
While this figure is intended to be more or less self-explanatory, a brief walk-through is warranted. 

Looking Toward the Human. The entire human (lower portion of Fig. 147.2) is modeled as a pool of 
elemental performance resources that are grouped into one of four different domains: (1) life-sustaining, 
(2) environmental interface (containing purely sensory and sensorimotor components), (3) central 
processing, and (4) information. Within each of the first three domains, physical subsystems referred to 
as functional units are identified (see labels along horizontal aspect of grids) through application of fairly 
rigorous criteria [Kondraske, 1990fo]. GSPT is applied to each functional unit, yielding first a set of 
dimensions of performance (defined using a resource construct) for each unit. A single basic element of 
performance (BEP) is defined by specifying two items: (1) the basic functional unit and (2) one of its 
dimensions of performance. Within a domain, not every dimension of performance indicated in 
Fig. 147.2 is applicable to every functional unit in that domain. However, there is an increasing degree 
of "likeness" among functional units (i.e., fewer fundamentally different types) in this regard as one 
moves from life-sustaining to environmental-interface to central-processing domains. The fourth 
domain, the information domain, is substantially different than the other three. Whereas the first three 
represent physical systems and their intangible performance resources, the information domain simply 
represents information. Thus, while memory functional units are located within the central-processing 
domain, the contents of memory (e.g., motor programs and associated reference information) are parti- 
tioned into the information domain. As illustrated, information is grouped, but within each group there 
are many specific skills. The set of available performance resources [R A .(t)L] consist of both BEPs (z = 
dimension of performance, j = functional unit) and information sets (e.g., type i within group j). Although 
intrinsically different, both fit the resource construct. This approach permits even the most abstract items 
such as motivation and friendliness to be considered with the same basic framework as strength and speed. 
Note that resource availability in GSPT and thus in the ERM is potentially a function of time, allowing 
quantitative modeling of dynamic processes such as child development, aging, disuse atrophy, and 
rehabilitation. The notation further implies that availability of a given resource must be evaluated at a 
specific operating point, denoted as Q. At least conceptually, many parameters can be used to characterize 
this Q point. In general, the goal of measurement when looking "toward the human" is to isolate 
functional units and maximally stress (safely) individual performance resources to determine availability. 
Such measures reflect performance capacities. The simplest Q point is one in which there is stress along 
only one dimension of performance (i.e., that corresponding to the resource being stressed). Higher- 
fidelity representation is possible at the expense of additional measurements. The degree to which 
isolation can be achieved is of practical concern in humans. Nonetheless, it is felt that reasonable isolation 
can be achieved in most situations [Kondraske, 1990a, 1990/;]. Moreover, this and similar issues of 
practical concern should be addressed as separate problems; i.e., they should not be permitted to obfuscate 
or thwart efforts to explain phenomenon at the human-task interface. 

Looking Toward the Task. The midportion of Fig. 147.2 suggests the representation of any given task 
in terms of the unique set of demands [R D . (t)] imposed on the pool of BEPs and information resources; 
i.e., this is the elemental level of task representation. Shading implies that demands can be represented 
quantitatively in terms of amount. The upper portion of the figure defines hierarchical mapping options, 
where mapping is the process of translating what happens in tasks typically executed by humans to the 
elemental level. Two such additional levels (for a total of three, including the elemental level) are included 
as part of the ERM: (1) generic intermediate-level tasks and (2) higher-level complex tasks. At all three 
levels of tasks (Figs. 147.1 and 147.2) are processes that occur over time and can be characterized by 
specific goals (e.g., in terms of speed, force, etc.) and related to systems at the same level that possess 
performance resources. Using established analytical techniques, even the most complex task can be 
divided into discrete task segments. Then mapping analyses, which take into account task procedures 
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(e.g., a squatting subject with two hands on the side of a box), are applied to each task segment to 
determine _R D . .(f). Once this is found, the worst case or a selected percentile point in the resource demand 
distribution (over a time period corresponding to a selected task segment) can be used to obtain a single 
numeric value representation of demand for a given resource and the conditions under which the given 
demand occurs; i.e., the Q' point (e.g., at what speed and position angle does the worst-case demand on 
elbow flexor torque occur?). This reduction process requires parameterization algorithms that are similar 
to those used to process time-series data collected during tests designed to measure performance resource 
availability. 

The Human-Task Interface. Using GSPT, success in achieving the goals of a given task segment is 
governed by resource economic principles requiring that R A r (t)\ Q > R Dj (t) for all i and; (i.e., R A > R D 
AND R AU > R DU AND R All > R DU ). In other words, all task demands, when translated to the individual 
subsystems involved, must fit within the envelopes that define performance resource availability. Adequacy 
associated with any one resource is a necessary but not sufficient condition for success. Concepts and 
observations in human performance referred to as compensation or redundancy are explained in terms 
of resource utilization flexibility, which includes the possibility of substituting one performance resource 
(of the same dimensionality) for another (i.e., resource substitution). It has been hypothesized [Kondraske, 
\990b] that an optimal performance resource utilization is achieved through learning. Furthermore, the 
optimization rule suggested by GSPT is that the human system is driven to accomplish task goals and 
use procedures that minimize performance resource stress (i.e., the fraction of available performance 
resource utilized) over the duration of a given task segment and across all BEPs involved. Minimizing 
stress is equivalent to maximizing the margin between available and utilized performance resources. Thus 
optimization is highly dependent on the resource availability profile. It would be predicted, for instance, 
that two individuals with different resource availability profiles would not optimally accomplish the same 
task goals by using identical procedures. 

147.3 Application Issues 

Implications of the ERM and what it demonstrates regarding intrinsic demands imposed by nature and 
methods for creatively navigating these demands over both the short and long terms are considered. The 
ERM offers a number of flexibilities with regard to how it can be applied (e.g., "in whole" or "in part," 
"conceptually" or "rigorously," with "low tech" or sophisticated "high tech" tools, and to define perfor- 
mance measures or to develop predictive models). While immediate application is possible at a conceptual 
level of application, it also provides the motivation and potential to consider coordinated, collaborative 
development of sophisticated tools that allow rigorous and efficient solutions to complex problems by 
practitioners without extraordinary training. 

The ERM provides a basis for obtaining insight into the nature of routine tasks that clinicians and 
other practitioners are expected to perform; there are both "troublesome" and "promising" insights in 
this regard. Perhaps the most obvious troublesome aspect is that the ERM makes it painfully evident 
that many BEPs are typically called into play in tasks of daily living, work, and/or recreation and that 
resource insufficiency associated with any one of this subset of BEPs can be the factor that limits 
performance in the higher-level task. The further implication is that for rigorous application, one must 
know (via measurement and analyses) the availability and demand associated with each and every one 
of these unique resources. An additional complexity with which practitioners must cope is the high degree 
of specificity and complexity of resources in the information domain (i.e., the "software"). There is no 
simple, rapid way to probe this domain to determine if the information required for a given task is 
correct; it requires methods analogous to those used to debug software source code. 

Aspects that hold promise are associated with (1) the nature of hierarchical systems, (2) the threshold 
mathematics of resource economics, and (3) the fact that when n resources combine to address a single 
task, the mathematics of logical combination is employed to arrive at an overall assessment. That is, the 



© 2000 by CRC Press LLC 



individual "R A > R D V questions result in a set of "OK" or "not OK" results that are combined with logical 
AND operations to obtain the final "OK" or "not OK" assessment (note that the OR operator is used 
when resource substitution is possible). 

Conceptual, Low-Tech Practical Application 

The ERM description alone can be used simply to provide a common conceptual basis for discussing 
the wide range of concepts, measurements, methods, and processes of relevance in human performance 
or a particular application area [Frisch, 1993; Mayer & Gatchel, 1988; Syndulko et al., 1988]. It also can 
be used at this level as a basis for structured assessment [Kondraske, 1988fo, 2000] of individuals in 
situations, including therapy prescription, assistive device prescription, independent living decision mak- 
ing (e.g., self-feeding, driving, etc.), age or gender discrimination issues in work or recreational tasks, 
etc. At points in such processes, it is often more important to consider the full scope of different 
performance resources involved in a task using even a crude level of quantification than it is to consider 
just a select few in great depth. A "checklist" approach is recommended. The professional uses only his 
or her judgment and experience to consider both the specific individual and the specific task of interest 
to make quantitative but relatively gross assessments of resource adequacy using a triage-like categori- 
zation process (e.g., with "definitely limiting," "definitely not limiting," or "not sure" categories). This is 
feasible because of the threshold nature of the system-task interface; in cases where R A and R D are widely 
separated instrumented, high- resolution measurements are not required to determine if a given perfor- 
mance resource is limiting. Any resource(s) so identified as "definitely limiting" becomes an immediate 
focus of interest. If none is categorized as such, concern moves to those in the "not sure" category, in 
which case more sensitive measurements may be required. Purely subjective methods of measuring 
resource availability and/or demands can be augmented with selected, more objective and higher-reso- 
lution measurements in "hybrid applications." 

Conceptual, Theoretical Application 

The ERM can be used to reconsider previous work in human performance. For example, it can be 
employed to reason why Fleishman [Fleishman & Quaintance, 1984] (as well as others) achieved prom- 
ising, but limited, success with statistically based predictions of performance in higher-level tasks using 
regression models with independent variables, which can now be viewed as representing lower-level 
performance resources (in most cases). Specifically, regression models rely heavily on an assumption that 
there exists some correlation between dependent and each independent variable, the latter of which 
typically represent scores from maximal performance tasks and therefore reflect resource availability 
(using GSPT and ERM logic). Brief reflection results in the realization that correlation is not to be 
anticipated between the level of performance attainable in a "higher-level task" and availability of one 
of the many performance resources essential to the task (e.g., if 4 cups of flour are needed for a given 
cake, having 40 cups available will not alone result in a larger cake of equal quality — availability of another 
ingredient may in fact be limiting). Rather, correlation is expected between high-level task performance 
and the amount of resource utilization. Unfortunately, as noted above, the independent variables used 
typically reflect resource availability. The incomplete labeling of performance variables in such studies 
reflects the general failure to distinguish between utilization and availability. Why not, then, just use 
measures of resource utilization in such statistical models? Resource availability measures are simple to 
obtain in the laboratory without requiring that the individual execute the high-level task of interest. 
Resource utilization measures can only be obtained experimentally by requiring the subject to execute 
the task in question, which is counterproductive with respect to the goal of using a set of laboratory 
measurements to extrapolate to performance in one or more higher-lever task situations. Furthermore, 
regression models based on linear combination of resource availability measures do not reflect the 
nonlinear threshold effect accounted for with resource economic, GSPT-based performance models. One 
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potential alternative based on GSPT and termed nonlinear causal resource analysis has been proposed 
[Kondraske, 1988a; Vasta & Kondraske, 1994]. 

Application "in Part" 

In this approach, whole domains (i.e., many BEPs) are assessed simultaneously, resulting in an estimate 
or well-founded assumption that states that "all performance resources in domain X are nonlimiting in 
task(s) if' Such assumptions are often well justified. For example, it would be reasonable to assume that 
a young male with a sports-related knee injury would have only a reduction in performance resource 
availability in the environmental interface domain. More specifically, it is reasonable to assume that the 
scope of interest can be confined to a smaller subset of functional units, as in gait or speech. These can 
then be addressed with rigorous application. Examples of this level and manner of applying the ERM 
have been or are being developed for head/neck control in the context of assistive communication device 
prescription [Carr, 1989], workplace design [Kondraske, 1988c], evaluating work sites and individuals 
with disabilities for employment [Parnianpour & Marras, 1993], gait [Carollo & Kondraske, 1987], 
measurement of upper extremity motor control [Behbehani et al., 1988] , speech production performance 
[Jafari, 1989; Jafari et al., 1989], and to illustrate changes in performance capacity associated with aging 
[Kondraske, 1989]. Additionally, in some applications only the generic intermediate level need be con- 
sidered. For example, one may only need to know how well an individual can walk, lift, etc. While it is 
sometimes painfully clear just how complex the execution of even a relatively simple task is, it also can 
be recognized that relatively simple, justifiable, and efficient strategies can be developed to maintain a 
reasonable degree of utility in a given context. 

Rigorous, High-Tech Application 

While this path may offer the greatest potential for impact, it also presents the greatest challenge. The 
ultimate goal would be to capture the analytic and modeling capability (as implied by the preceding 
discussions) for a "total human" (single subject or populations) and "any task" in a desktop computer 
system (used along with synergistic "peripherals" that adopt the same framework, such as measurement 
tools [e.g., Kondraske, 1990a]). This suggests a long-term, collaborative effort. However, intermediate 
tools that provide significant utility are feasible and needed (e.g., [Allard, Stokes, and Blanchi, 1994; Vasta 
and Kondraske, 1995]). A promising example based directly on GSPT and the ERM is Nonlinear Causal 
Resource Analysis, or NCRA [Vasta and Kondraske, 1994; Kondraske et al, 1997]. This inferential method 
has recently been used [Kondraske et al, 1997] to develop models that relate performance resource 
demands on various human subsystems to level of performance attained in higher level mobility tasks 
(e.g., gait and stair climbing). In turn, these models have been used to predict level of performance in 
these higher level tasks with success that far exceeds that which has been obtained with regression models. 
Furthermore, the NCRA method inherently provides not only a prediction of high level task performance, 
but also identifies which performance resources are most likely to be preventing better performance (i.e., 
which ones are the "limiting performance resources"). While tools that are useful to practitioners are 
desirable, they are almost essential for the efficient conduct of in-depth experimental work with the ERM 
that must accompany the evolution of such tools. 

The issue of biologic variability and its influence on numeric analyses can be raised in the context of 
rigorous numeric application. In this regard, the methods underlying GSPT and the ERM (or any similar 
cause-and-effect model) are noted to be analogous with those used to design artificial systems. In recent 
years, conceptual approaches and mathematic tools widely known as Taguchi methods [e.g., Bendell et al., 
1988] have been shown to be effective for understanding and managing a very similar type of variability 
that surfaces in the manufacture of artificial systems (e.g., variability associated with performance of 
components of larger systems and the effect on aspects of performance of the final "product"). Such tools 
may prove useful in working through engineering problems such as those associated with variability. 
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147.4 Conclusion 

The ERM is a step toward the goal of achieving an application-independent approach to modeling any 
human-task interface. It provides a systematic and generalizable (across all subsystem types) means of 
identifying performance measures that characterize human subsystems, as well as a consistent basis for 
performance measurement definition (and task analysis). It also has served to stimulate focus on a 
standardized, distinct set of variables that facilitates clear communication of an individual's status among 
professionals. 

After the initial presentation, refinements in both GSPT and the ERM were made. However, the basic 
approaches, terminology, and constructs used in each have remained quite stable. More recent work has 
focused on development of various components required for application of the ERM in nontrivial 
situations. This entails using GSPT and basic ERM concepts to guide a full "fleshing out" of the details 
of measurement parameterizations and models for different types of human subsystems, definition of 
standard conventions and notations, and development of computer-based tools. In addition, experimen- 
tal studies designed to evaluate key constructs of the ERM and to demonstrate the various ways in which 
it can be applied are being conducted. A good portion of the developmental work is aimed at building 
a capability to conduct more complex, nontrivial experimental studies. Collaborations with other research 
groups also have emerged and are being supported to the extent possible. Experiences with it in various 
contexts and it various levels of application have been productive and encouraging. 

The ERM is one, relatively young attempt at organizing and dealing with the complexity of some 
major aspects of human performance. There is no known alternative that, in a specific sense, attempts 
to accomplish the same goals as the working model presented here. Is it good enough? For what purposes? 
Is a completely different approach or merely refinement required? The process of revision is central to 
the natural course of the history of ideas. Needs for generalizations in human performance persist. 

Defining Terms 

Basic element of performance (BEP): A modeling item at the basic element level in the ERM defined 
by identification of a specific system at this level and one of its dimensions of performance (e.g., 
functional unit = visual information processor, dimension of performance = speed, BEP = visual 
information processor speed). 

Behavior: A general term that relates to what a human or artificial system does while carrying out its 
function(s) under given conditions. Often, behavior is characterized by measurement of selected 
parameters or identification of unique system states over time. 

Composite performance capacity: A performance capacity at a higher level of abstraction, formed by 
combining two or more lower-level performance capacities (e.g., via integration to determine the 
area or volume within a performance envelope). 

Dimension of performance: A unique quality that characterizes how well a system executes its function 
(e.g., speed, accuracy, torque production); one of axes or the label associated with one of the axes 
in a multidimensional performance space. 

Function: The purpose of a system. Some systems map to a single primary function (e.g., process visual 
information). Others (e.g., the human arm) map to multiple functions, although at any given time 
multifunction systems are likely to be executing a single function (e.g., polishing a car). Functions 
can be described and inventoried, whereas level of performance of a given function can be mea- 
sured. 

Generic intermediate level: One of three major hierarchical levels for systems and tasks identified in 
the elemental resource model. The generic intermediate level represents new systems (e.g., postural 
maintenance system, object gripper, object lifter, etc.) formed by the combination of functional 
units at the basic element level (e.g., flexors, extensors, processors, etc.). The term generic is used 
to imply the high frequency of use of systems at this level in tasks of daily life (i.e., items at the 
"high level" in the ERM). 
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Goal: A desired endpoint (i.e., result) typically characterized by multiple parameters, at least one of 
which is specified. Examples include specific task goals (e.g., move an object of specified mass from 
point A to point B in 3 seconds) or estimated task performance (maximum mass, range, speed of 
movement obtainable given a specified elemental performance resource availability profile), 
depending on whether a reverse or forward analysis problem is undertaken. Whereas function 
describes the general process of a task, the goal directly relates to performance and is quantitative. 

Limiting resource: A performance resource at any hierarchical level (e.g., vertical lift strength, knee 
flexor speed) that is available in an amount that is less than the worst-case demand imposed by a 
task. Thus a given resource can only be "limiting" when considered in the context of a specific task. 

Performance: Unique qualities of a human or artifical system (e.g., strength, speed, accuracy, endur- 
ance) that pertain to how well that system executes its function. 

Performance capacity: A quantity in finite availability that is possessed by a system or subsystem, 
drawn on during the execution of tasks, and limits some aspect (e.g., speed, force, production, 
etc.) of a system's ability to execute tasks; or, the limit of that aspect itself. 

Performance envelope: The surface in a multidimensional performance space, formed with a selected 
subset of a system's dimensions of performance, that defines the limits of a system's performance. 
Tasks represented by points that fall within this envelope can be performed by the system in question. 

Performance resource: A unique quality of a system's performance modeled and quantified using a 
resource construct. 

Performance resource substitution: The term used in GSPT to describe the manner in which intelligent 
systems, such as humans, utilize redundancy or adapt to unusual circumstances (e.g., injuries) to 
obtain optimal procedures for executing a task. 

Procedure: A set of constraints placed on a system in which flexibility exists regarding how a goal (or set 
of goals) associated with a given function can be achieved. Procedure specification requires specifi- 
cation of initial, intermediate, and/or final states or conditions dictating how the goal is to be 
accomplished. Such specification can be thought of in terms of removing some degrees of freedom. 

Resource construct: The collective set of attributes that define and uniquely characterize a resource. 
Usually, the term is applied only to tangible items. A resource is desirable and measurable in terms 
of amount (from zero to some finite positive value) in such a manner that a larger numeric value 
indicates a greater amount of the resource. 

Resource economic principle: The principle, observable in many contexts, that states that the amount 
of a given resource that is available (e.g., money) must exceed the demand placed on it (e.g., cost 
of an item) if a specified task (e.g., purchase of the item) is to be executed. 

Resource utilization flexibility: A term used in GSPT to describe situations in which there are more 
than one possible source of a given performance resource type, i.e., redundant supplies exist. 

Structure: Physical manifestation and attributes of a human or artificial system; the object of one type 
of measurements at multiple hierarchical levels. 

Style: Allowance for variation within a procedure, resulting in the intentional incomplete specification 
of a procedure or resulting from either intentional or unintentional incomplete specification of 
procedure. 

System: A physical structure, at any hierarchical level of abstraction, that executes one or more functions. 

Task: That which results from (1) the combination of specified functions, goals, and procedures or 
(2) the specification of function and goals and the observation of procedures utilized to achieve 
the goals. 
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General discussions of major issues associated with human performance modeling can be found in the 
following texts: 
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Wickens CD. 1984. Engineering Psychology and Human Performance. Columbus, Ohio, Charles E 
Merrill. 

More detailed information regarding general systems performance theory, the elemental resource model, 
and their application is available from the Human Performance Institute, P.O. Box 19180, University of 
Texas at Arlington, Arlington, TX 76019-0180 and at the HPI web site: http://www-ee.uta.edu/hpi. 
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Movements allow us to interact with our environment, express ourselves, and communicate with each 
other. Life is movement. Movement is constantly occurring at many hierarchical levels including cellular 
and subcellular levels. By using the adjective "human" to clarify the term "movement", we are not only 
denning the species of interest, but also limiting the study to observable performance and its more overt 
causes. Study of human performance is of interest to a broad range of professionals including rehabili- 
tation engineers, orthopaedic surgeons, therapists, biomechanists, kinesiologists, psychologists, and so 
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on. Because of the complexity of human performance and the variety of investigators, the study of human 
performance is conducted from several theoretical perspectives including, (1) anatomical, (2) purpose 
or character of the movement (such as locomotion), (3) physiological, (4) biomechanical, 
(5) psychological, (6) socio-cultural, and (7) integrative. The Elemental Resource Model (ERM), pre- 
sented at the beginning of this section, is an integrative model which incorporates aspects of the other 
models into a singular system accounting for the human, the task, and the human-task interface [Kon- 
draske, 1999]. 

The purposes of this chapter are to: (1) provide reasons for measuring four selected variables of human 
performance: extremes/range of motion, strength, speed of movement, and endurance; (2) briefly define 
and discuss these variables; (3) overview selected instruments and methods used to measure these vari- 
ables; and (4) discuss interpretation of performance for a given neuromuscular subsystem. 

148.1 Neuromuscular Functional Units 

While the theoretical perspectives listed previously may be useful within specific contexts or within 
specific disciplines, the broader appreciation of human performance and its control can be gained from 
the perspective of an integrative model such as the ERM [Kondraske, 1999]. This model organizes 
performance resources into four different domains. Basic movements, such as elbow flexion, are executed 
by neuromuscular functional units in the environmental interface domain. Intermediate and complex 
tasks, such as walking and playing the piano, utilize multiple basic functional units. A person performing 
a movement operates the involved functional units along different dimensions of performance according 
to the demands of the task. Dimensions of performance are factors such as joint motion, strength, speed 
of movement, and endurance. Lifting a heavy box off the floor requires, among other things, a specific 
amount of strength associated with neuromuscular functional units of the back, legs, and arms according 
to the weight and size of the box. Reaching for a light weight box from the top shelf of a closet requires 
that the shoulder achieve certain extremes of motion according to the height of the shelf. 

Whereas four dimensions of performance are considered individually in this chapter, they are highly 
interdependent. For an example, strength availability during a movement is partly dependent on joint 
angle. Despite interdependence, considering the variables as different dimensions is essential to studying 
human performance. The components limiting a person's ability to complete a task can only be identified 
and subsequently enhanced by determining, for example, that the reason a person cannot reach the box 
off the top shelf is not because of insufficient range of motion of the shoulder, but because of insufficient 
strength of the shoulder musculature required to lift the arm through the range of motion. Isolating the 
subsystems involved in a task and maximally stressing them along one or more "isolated" dimensions of 
performance is a key concept in the ERM. "Maximally stressing" the subsystems means that the maximum 
amount of the resource available is being determined. This differs from determining the amount of the 
resource which happened to be used while performing a particular task. Often the distinction between 
obtaining maximal performance vs. submaximal performance is in the instructions given to the subject. 
For example, in measuring speed, we say "move as fast as you can." 

Purposes of Measuring Selected Neuromuscular Performance Capacities 

Range of motion, strength, speed of movement, and/or endurance can be measured for one or more of 
the following purposes: 

1. To determine the amount of the resource available and to compare it to the normal value for that 
individual. "Normal" is frequently determined by comparisons with the opposite extremity or 
with normative data when available. This information may be used to develop goals and a program 
to change the performance. 

2. To assist in determining the possible effects of insufficient or unbalanced amounts of the variable 
on a person's performance of activities of daily living, work, sport, and leisure pursuits. In this 
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case, the amount of the variable is compared to the demands of the task, rather than to norms or 
to the opposite extremity. 

3. To assist in diagnosis of medical conditions and the nature of movement dysfunctions. 

4. To reassess status in order to determine the effectiveness of a program designed to change the 
amount of the variable. 

5. To motivate persons to comply with treatment or training regimes. 

6. To document status and the results of treatment or training and to communicate with other 
involved persons. 

7. To assist in economically designed furniture, equipment, techniques, and environments. 

8. To provide information to combine with other measures of human performance to predict func- 
tional capabilities. 

148.2 Range of Motion and Extremes of Motion 

Range of motion (ROM) is the amount of movement that occurs at a joint. Range of motion is typically 
measured by noting the extremes of motion (EOM). The designated reference or zero position must be 
specified for measurements of the two extremes of motion. For example, to measure elbow (radiohumeral 
joint) flexion and extension, the preferred starting position is with the subject supine with the arm parallel 
to the lateral midline of the body with the palm facing upward [Norkin and White, 1995; Palmer and 
Epler, 1998]. Measurements are taken with the elbow in the fully flexed position and with the elbow in 
the fully extended position. 

Movement Terminology 

Joint movements are described using a coordinate system with the human body in an anatomical position. 
The anatomical position of the body is an erect position, face forward, arms at sides, palms facing forward, 
and fingers and thumbs in extension. The central coordinate system consists of three cardinal planes and 
axes with its origin located between the cornua of the sacrum [Panjabi, White, and Brand, 1974]. 
Figure 148.1 demonstrates the planes and axes of the central coordinate system. The same coordinate 
system can parallel the master system at any joint in the body by relocating the origin to any defined point. 

The sagittal plane is the y, z plane; the frontal (or coronal) plane is the y, x plane; the horizontal (or 
transverse) plane is the x, z plane. Movements are described in relation to the origin of the coordinate 
system. The arrows indicate the positive direction of each axis. An anterior translation is +z; a posterior 
translation is -z. Clockwise rotations are +9, and counterclockwise rotations are -9. 

Joints are described as having degrees of freedom (dof ) of movement. Dof is the number of indepen- 
dent coordinates in a system that are necessary to accurately specify the position of an object in space. 
If a motion occurs in one plane and around one axis, the joint is defined as having one dof. Joints with 
movements in two planes occurring around two different axes, have two dof, and so on. 

Angular movements refer to motions that cause an increase or decrease in the angle between the 
articulating bones. Angular movements are flexion, extension, abduction, adduction, and lateral flexion 
(See Table 148.1). Rotational movements generally occur around a longitudinal (or vertical) axis except 
for movements of the clavicle and scapula. The rotational movements occurring around the longitudinal 
axis (internal rotation, external rotation, opposition, horizontal abduction, and horizontal adduction) 
are described in Table 148.1. Rotation of the scapula is described in terms of the direction of the inferior 
angle. Movement of the inferior angle of the scapula toward the midline is a medial (or downward) 
rotation, and movement of the inferior angle away from the midline is lateral (or upward) rotation. In 
the extremities the anterior surface of the extremity is used as the reference area. Because the head, neck, 
trunk, and pelvis rotate about a midsagittal, longitudinal axis, rotation of these parts is designated as 
right or left. As can be determined from Fig. 148.1, axial rotation of the trunk toward the left can be 
described mathematically as +9y. 
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A communications problem often exists in describing 
motion using the terms defined in Table 148.1. A body 
segment can be in a position such as flexion, but can be 
moving toward extension. This confusion is partially 
remedied by using the form of the word with the suffix, 
-ion, to indicate a static position and using the suffix, -ing, 
to denote a movement. Thus, an elbow can be in a posi- 
tion of 90° flexion and also extending. 

Factors Influencing ROM/EOM 
and ROM/EOM Measurements 

The ROM and EOM available at a joint is determined by 
morphology and the soft tissues surrounding and cross- 
ing a joint, including the joint capsule, ligaments, ten- 
dons, and muscles. Other factors such as age, gender, 
swelling, muscle mass development, body fat, passive 
insufficiency (change in the ROM/EOM available at one 
joint in a two-joint muscle complex caused by the posi- 
tion of the other joint), and time of day (diurnal effect) 
also affect the amount of motion available. Some persons, 
because of posture, genetics, body type, or movement 
habits, normally demonstrate hypermobile or hypomo- 
bile joints. Dominance has not been found to significantly 
affect available ROM. See discussion in Miller [ 1985] . The 
shapes of joint surfaces, which are designed to allow 
movement in particular directions, can become altered 
by disease, trauma, and posture, thereby, increasing or 
decreasing the ROM/EOM. Additionally, the soft tissues 
crossing a joint can become tight (contracted) or over- 
stretched altering the ROM/EOM. 

The type of movement, active or passive, also affects 




FIGURE 148.1 Planes and axes are illustrated 
in anatomical position. The central coordinate 
system with its origin between the cornua of the 
sacrum is shown. Source: White III AA, Panjabi 
MM. 1990. Clinical Biomechanics of the Spine, 
2nd ed., p 87. Philadelphia, JB Lippincott Com- 
pany. With permission. 



ROM/EOM. When measuring active ROM (AROM), the 

person voluntarily contracts muscles and moves the body part through the available motion. When 
measuring passive ROM (PROM), the examiner moves the body part through the ROM. PROM is usually 
slightly greater than AROM due to the extensibility of the tissues crossing and comprising the joint. 
AROM can be decreased because of restricted joint mobility, muscle weakness, pain, unwillingness to 
move, or inability to follow instructions. PROM is assessed to determine the integrity of the joint and 
the extent of structural limitation. 

Instrumented Systems Used to Measure ROM/EOM 

The most common instrument used to measure joint ROM/EOM is a goniometer. The universal goni- 
ometer, shown in Fig. 148.2a, is most widely used clinically. A variety of universal goniometers have been 
developed for specific applications. Two other types of goniometers are also shown in Fig. 148.2. 

Table 148.2 lists and compares several goniometric instruments used to measure ROM/EOM. Choice 
of the instrument used to measure ROM/EOM depends upon the degree of accuracy required, time 
available to the examiner, the measurement environment, the body segment being measured, and the 
equipment available. 

Non-goniometric methods of joint measurement are available. Tape measures, radiographs, photo- 
graphy, cinematography, videotape, and various optoelectric movement monitoring systems can also be 
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TABLE 148.1 Movement Terms, Planes, Axes, and Descriptions of Movements 



Movement Term 


Plane 


Axis 


Flexion 


Sagittal 


Frontal 


Extension 


Sagittal 


Frontal 


Abduction 


Frontal 


Sagittal 


Adduction 


Frontal 


Sagittal 


Lateral flexion 


Frontal 


Sagittal 


Internal (medial) 


Horizontal 


Longitudinal 


rotation 






External (lateral) 


Horizontal 


Longitudinal 


rotation 






Opposition 


Multiple 


Multiple 


Horizontal 


Horizontal 


Longitudinal 


abduction 






Horizontal 


Horizontal 


Longitudinal 


adduction 






Tilt 


Depends 


Depends on 




on joint 


joint 



Gliding 

Elevation 

Depression 



Depends 
on joint 
Frontal 

Frontal 



Depends on 
joint 



Description of Movement 

Bending of a part such that the anterior surfaces approximate each other. 
However, flexion of the knee, ankle, foot, and toes refers to movement in 
the posterior direction. 

Opposite of flexion; involves straightening a body part. 

Movement away from the midline of the body or body part; abduction of 
the wrist is sometimes called radial deviation. 

Movement towards the midline of the body or body part; adduction of the 
wrist is sometimes called ulnar deviation. 

Term used to denote lateral movements of the head, neck, and trunk. 

Turning movement of the anterior surface of a part towards the midline of 
the body; internal rotation of the forearm is referred to as pronation. 

Turning movement of the anterior surface of a part away from the midline 
of the body; external rotation of the forearm is referred to as supination. 

Movement of the tips of the thumb and little finger toward each other. 

Movement of the arm in a posterior direction away from the midline of the 
body with the shoulder joint in 90° of either flexion or abduction. 

Movement of the arm in an anterior direction toward the midline of the 
body with the shoulder joint in 90° of either flexion or abduction. 

Term used to describe certain movements of the scapula and pelvis. In the 
scapula, an anterior tilt occurs when the coracoid process moves in an 
anterior and downward direction while the inferior angle moves in a 
posterior and upward direction. A posterior tilt of the scapula is the 
opposite of an anterior tilt. In the pelvis, an anterior tilt is rotation of the 
anterior superior spines (ASISs) of the pelvis in an anterior and downward 
direction; a posterior tilt is movement of the ASISs in a posterior and 
upward direction. A lateral tilt of the pelvis occurs when the pelvis is not 
level from side to side, but one ASIS is higher than the other one. 

Movements that occur when one articulating surface slides on the opposite 
surface. 

A gliding movement of the scapula in an upward direction as in shrugging 
the shoulders. 

Movement of the scapula downward in a direction reverse of elevation. 



used to measure or calculate the motion available at various joints. These methods are beyond the scope 
of this chapter. 

Key Concepts in Goniometric Measurement 

Numerous textbooks [Clarkson and Gilewich, 1989; Norkin and White, 1995; Palmer and Epler, 1998] 
are available that describe precise procedures for goniometric measurements of each joint. Unfortunately, 
there is a lack of standardization among these references. 

In general, the anatomical position of zero degrees {preferred starting position) is the desired starting 
position for all ROM/EOM measurements except rotation at the hip, shoulder, and forearm. The arms 
of the goniometer are usually aligned parallel and lateral to the long axis of the moving and the fixed 
body segments in line with the appropriate landmarks. In the past, some authors contended that place- 
ment of the axis of the goniometer should be congruent with the joint axis for accurate measurement 
[West, 1945; Wiechec and Krusen, 1939]. However, the axis of rotation for joints changes as the body 
segment moves through its ROM; therefore, a goniometer cannot be placed in a position in line with 
the joint axis during movement. Robson [1966] described how variations in the placement of the 
goniometer's axis could affect the accuracy of ROM measurements. Miller [ 1985] suggested that the axis 
problem could be handled by ignoring the goniometer's axis and concentrating on the accurate alignment 
of the arms of the goniometer with the specified landmarks. Potentially some accuracy may be sacrificed, 
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FIGURE 148.2 Three types of goniometric instruments used to measure range and extremes of motion are shown: 
(a) typical 180- and 360-degree universal goniometers of various sizes; (b) a fluid goniometer, which is activated by 
the effects of gravity; (c) an APM I digital electronic device that works similarly to a pendulum goniometer. 



but the technique is simplified and theoretically more reproducible. The subject's movement is observed 
for unwanted motions that could result in inaccurate measurement. For example, a subject might attempt 
to increase forearm supination by laterally flexing the trunk. 
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TABLE 148.2 Comparison of Various Goniometers Commonly Used to Measure joint Range and Extremes of Motion 



Type of Goniometer 

Universal goniometer 

A protractor-like device with one 
arm considered movable and the 
other arm stationary; protractor 
can have a 180° or 360° scale and 
is usually numbered in both 
directions; available in a range of 
sizes and styles to accommodate 
different joints (See Fig. 148.2a). 

Fluid (or bubble) goniometer 

A device with a fluid-filled channel 
with a 360°-scale that relies on 
the effects of gravity (See 
Fig. 148.2b); dial turns allowing 
the goniometer to be "zeroed;" 
some models are strapped on and 
others must be held against the 
body part. 



Pendulum goniometer 

A scaled, inclinometer-like device 
with a needle or pointer (usually 
weighted); some models are 
strapped on and others must be 
held against the body part (not 
shown). 
"Myrin" OB goniometer 

A fluid-filled, rotatable container 
consisting of compass needle that 
responds to the earth's magnetic 
field (to measure horizontal 
motion), a gravity- activated 
inclination needle {to measure 
frontal and sagittal motion), and 
a scale (not shown). 
Ar thro dial protractor 

A large, flat, clear plastic protractor 
without arms that has a level on 
the straight edge (not shown). 
APMI 

Computerized goniometer with 
digital sensing and electronics; 
can perform either continuous 
monitoring or calculate 
individual ROM/EOM from a 
compound motion function (See 
Fig. 148.2c). 
Electrogoniometer 

Arms of a goniometer are attached 
to a potentiometer and are 
strapped to the proximal and 
distal body parts; movement 
from the device causes resistance 
in the potentiometer which 
measures the ROM (not shown). 



Advantages/Uses 

Inexpensive; portable; familiar devices; 
size of the joint being measured 
determines size of the goniometer used; 
clear plastic goniometers have a line 
through the center of the arms to make 
alignment easier and more accurate; 
finger goniometers can be placed over 
the dorsal aspect of the joint being 
measured. 

Quick and easy to use because it is not 
usually aligned with bony landmarks; 
does not have to conform to body 
segments; useful for measuring neck and 
spinal movements; using a pair of fluid 
goniometers permits distinguishing 
regional spinal motion. 



Inexpensive; same advantages as for the 
fluid goniometer described above. 



Can be strapped on the body part allowing 
the hands free to stabilize and move the 
body part; not necessary to align the 
goniometer with the joint axis; permits 
measurements in all three planes. 



Does not need to conform to body 
segments; most useful for measuring 
joint rotation and axioskeletal motion. 

Easy to use; provides rapid digital read- 
out; measures angles in any plane of 
motion; one hand is free to stabilize and 
move body segments; particularly easy 
for measuring regional spinal 
movements. 



More useful for dynamic ROM, especially 
for determining kinematic variables 
during activities such as gait; provides 
immediate data; some 
electrogoniometers permit measurement 
in one, two, or three dimensions. 



Disadvantages/Limitations 



Several goniometers of different sizes may be 
required, especially if digits are measured; full- 
circle models may be difficult to align when the 
subject is recumbent and axis alignment is 
inhibited by the protractor bumping the surface; 
the increments on the protractors may vary 
from 1°, 2°, or 5°; placement of the arms is a 
potential source of error. 



More expensive than universal goniometers; 
using a pair of goniometers is awkward; useless 
for motions in the horizontal plane; error can 
be induced by slipping, skin movement, 
variations in amount of soft tissue owing to 
muscle contraction, swelling, or fat, and the 
examiner's hand pressure changing body 
segment contour; reliability may be sacrificed 
from lack of orientation to landmarks and 
difficulty with consistent realignment [Miller, 
1985]. 

Some models cannot be "zeroed"; useless for 
motions in the horizontal plane; same soft tissue 
error concerns as described above for the fluid 
goniometer. 



Expensive and bulky compared with universal 
goniometer; not useful for measuring small 
joints of hand and foot; susceptible to magnetic 
fields [Clarkson and Gilewich, 1989]; subject to 
same soft tissue error concerns described above 
under fluid goniometer. 



Not useful for measuring smaller joints, especially 
those with lesser ROMs; usually scaled in large 
increments only. 

Expensive compared to most other instruments 
described; device must be rotated perpendicular 
to the direction of segment motion only; unit 
must stabilize prior to measurement; excessive 
delays in recording must be avoided; subject to 
the same soft tissue error concerns as described 
above under fluid goniometer. 

Aligning and attaching the device is time- 
consuming and not amenable to all body 
segments; device and equipment needed to use 
it are moderately expensive; essentially 
laboratory equipment; less accurate for 
measurement of absolute limb position; device 
itself is cumbersome and may alter the 
movement being studied. 



© 2000 by CRC Press LLC 



Numerical Notation Systems 

Three primary systems exist for expressing joint motion in terms of degrees. These are the 0-180 System, 
the 180—0 System, and the 360 System. The 0-180 System is the most widely accepted system in medical 
applications and may be the easiest system to interpret. In the 0-180 System, the starting position for all 
movements is considered to be 0°, and movements proceed toward 180°. As the joint motion increases, 
the numbers on the goniometric scale increase. In the 180-0 System, movements toward flexion approach 
0° and movements toward extension approach 180°. Different rules are used for the other planes of 
motion. The 360 System is similar to the 180-0 System. In the 360 System, movements are frequently 
performed from a starting position of 180°. Movements of extension or adduction which go beyond the 
neutral position approach 360°. Joint motion can be reported in tables, charts, graphs, or pictures. In 
the 0-180 System, the starting and ending ranges are recorded separately, as 0°-130°. If a joint cannot 
be started in the 0° position, the actual starting position is recorded, as 10°— 130°. 

148.3 Strength 

Muscle strength implies the force or torque production capacity of muscles. However, to measure strength, 
the term must be operationally defined. One definition modified from Clarkson and Gilewich [1989] 
states that muscular strength is the maximal amount of torque or force that a muscle or muscle groups 
can voluntarily exert in one maximal effort, when type of muscle contraction, movement velocity, and 
joint angle(s) are specified. 

Strength Testing and Muscle Terminology 

Physiologically, skeletal muscle strength is the ability of muscle fibers to generate maximal tension for a 
brief time interval. A muscle's ability to generate maximal tension and to sustain tension for differing 
time intervals is dependent on the muscle's cross-sectional area (the larger the cross-sectional area, the 
greater the strength), geometry (including the muscle fiber arrangement, length, moment arm, and angle 
of pennation), and physiology. Characteristics of muscle fibers have been classified based on twitch 
tension and fatigability. Different fiber types have different metabolic traits. Different types of muscle 
fibers are differentially stressed depending on the intensity and duration of the contraction. Ideally, 
strength tests should measure the ability of the muscle to develop tension rapidly and to sustain the 
tension for brief time intervals. In order to truly measure muscle tension, a measurement device must 
be directly attached to the muscle or tendon. Whereas this direct procedure has been performed [Komi, 
1990], it is hardly useful as a routine clinical measure. Indirect measures are used to estimate the strength 
of muscle groups performing a given function, such as elbow flexion. 

Muscles work together in groups and may be classified according to the major role of the group in 
producing movement. The prime mover, or agonist, is a muscle or muscle group that makes the major 
contribution to movement at a joint. The antagonist is a muscle or muscle group that has an opposite 
action to the prime mover(s). The antagonist relaxes as the agonist moves the body part through the 
ROM. Synergists are accessory muscles that contract and work with the agonist to produce the desired 
movement. Synergists may work by stabilizing proximal joints, preventing unwanted movement, and 
joining with the prime mover to produce a movement that one muscle group acting alone could not 
produce. 

A number of terms and concepts are important toward understanding the nature and scope of strength 
capacity testing. Several of these terms are defined below; however, there are no universally accepted 
definitions for these terms. 

Dynamic contraction — the output of muscles moving body segments [Kroemer, 1991]. 
Isometric — tension develops in a muscle, but the muscle length does not change and no movement 
occurs. 
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Static — same as isometric. 

Isotonic — a muscle develops constant tension against a load or resistance. Kroemer [1991] suggests 

the term, isoforce, more aptly describes this condition. 
Concentric — a contraction in which a muscle develops internal force that exceeds the external force of 

resistance, the muscle shortens, and movement is produced [O'Connel and Gowitzke, 1972]. 
Eccentric — a contraction in which a muscle lengthens while continuing to maintain tension [O'Connel & 

Gowitzke, 1972]. 
Isokinetic — a condition where the angular velocity is held constant. Kroemer [1991] prefers the term, 

isovelocity, to describe this type of muscle exertion. 
Isoinertial — a static or dynamic muscle contraction where the external load is held constant [Kroemer, 

1983]. 

Factors Influencing Muscle Strength and Strength Measurement 

In addition to the anatomical and physiological factors affecting strength, other factors must be considered 
when strength testing. The ability of a muscle to develop tension depends on the type of muscle con- 
traction. Per unit of muscle, the greatest tension can be generated eccentrically, less can be developed 
isometrically, and the least can be generated concentrically. These differences in tension generating 
capacity are so great that the type of contraction being strength-tested requires specification. 

Additionally, strength is partially determined by the ability of the nervous system to cause more motor 
units to fire synchronously. As one trains, practices an activity, or learns test expectations, strength can 
increase. Therefore, strength is affected by previous training and testing. This is an important consider- 
ation in standardizing testing and in retesting. 

A muscle's attachments define the angle of pull of the tendon on the bone and thereby the mechanical 
leverage at the joint center. Each muscle has a moment arm length, which is the length of a line normal 
to the muscle passing through the joint center. This moment arm length changes with the joint angle 
which changes the muscle's tension output. Optimal tension is developed when a muscle is pulling at a 
90° angle to the bony segment. 

Changes in muscle length alter the force-generating capacity of muscle. This is called the length-tension 
relationship. Active tension decreases when a muscle is either lengthened or shortened relative to its resting 
length. However, after applying a precontraction stretch, or slightly lengthening a muscle and the series 
elastic component (connective tissue) prior to a contraction causes a greater amount of total tension to be 
developed [Soderberg, 1992] . Of course, excessive lengthening would reduce the tension-generating capacity. 

A number of muscles cross over more than one joint. The length of these muscles may be inadequate 
to permit complete ROM of all joints involved. When a multijoint muscle simultaneously shortens at all 
joints it crosses, further effective tension development is prevented. This phenomena is called active 
insufficiency. For example, when the hamstrings are tested as knee flexors with the hip extended, less 
tension can be developed than when the hamstrings are tested with the hip flexed. Therefore, when testing 
the strength of multijoint muscles, the position of all involved joints must be considered. 

The load-velocity relationship is also important in testing muscle strength. A load-velocity curve can 
be generated by plotting the velocity of motion of the muscle lever arm against the external load. With 
concentric muscle contractions, the least tension is developed at the highest velocity of movement and 
vice versa. When the external load equals the maximal force that the muscle can exert, the velocity of 
shortening reaches zero and the muscle contracts isometrically. When the load is increased further, the 
muscle lengthens eccentrically. During eccentric contractions, the highest tension can be achieved at the 
highest velocity of movement [Komi, 1973]. 

The force generated by a muscle is proportional to the contraction time. The longer the contraction 
time, the greater the force development up to the point of maximum tension. Slower contraction leads 
to greater force production because more time is allowed for the tension produced in contractile elements 
to be transferred through the noncontractile components to the tendon. This is the force-time relationship. 
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Tension in the tendon will reach the maximum tension developed by the contractile tissues only if the 
active contraction process is of adequate (even up to 300 msec) duration [Sukop and Nelson, 1974]. 

Subject effort or motivation, gender, age, fatigue, time of day, temperature, occupation, and dominance 
can also affect force or torque production capacity. Important additional considerations may be changes 
in muscle function as a result of pain, overstretching, immobilization, trauma, paralytic disorders, 
neurologic conditions, and muscle transfers. 

Grading Systems and Parameters Measured 

Clinically, the two most frequently used methods of strength testing are actually non-instrumented tests: 
the manual muscle test (MMT) and the functional muscle test (See Amundsen [1990] for more infor- 
mation on functional muscle tests). In each of these cases interval scaled grading criteria are operationally 
defined. However, a distinct advantage of using instruments to measure strength is that quantifiable units 
can be obtained, usually force or torque. Torque = force x the distance between the point of force 
application and the axis of rotation: 

T=Fx. (148.1) 

An important issue in strength testing is deciding whether to measure force (a linear quantity) or 
torque (a rotational quantity). If the point of application of a force is closer to the axis of rotation, the 
muscle being assessed has a mechanical advantage as compared to when the point of contact is more 
distal. Therefore, when forces are measured, unless the measurement devices are applied at the same 
anatomical position for each test, force measurements can differ substantially even though actual muscle 
tension remains the same. If the strength testing device has an axis of rotation that can be aligned with 
the anatomical axis of rotation, then torque can be measured directly. When this is not the case, the 
moment arm can be measured and torque calculated. Force is more typically measured in whole-body 
exertions, such as lifting. Another issue is whether to measure and record peak or averaged values. 
However, if strength is defined as maximum torque production capacity, peak values are implied. 

In addition to single numerical values, some strength measurement systems display and print force 
or torque (versus time) curves, angle-torque curves, and graphs. Computerized systems frequently com- 
pare the "involved" with the "uninvolved" extremity calculating "percent deficits." As strength is consid- 
ered proportional to body weight (perhaps erroneously, see Delitto [1990]) force and torque 
measurements are frequently reported as a peak torque to body weight ratio. This is seemingly to facilitate 
use of normative data where present. 

Methods and Instruments Used to Measure Muscle Strength 

There are two broad categories of testing force or torque production capacity: one category consists of 
measuring the capacity of defined, local muscle groups (e.g., elbow flexors); the second category of tests 
consists of measuring several muscle groups on a whole-body basis performing a higher level task (e.g., 
lifting). The purpose of the test, required level of sensitivity, and expense are primary factors in selecting 
the method of strength testing. No single method has emerged as being clearly superior or more widely 
applicable. Like screwdrivers, different types and different sizes are needed depending upon job demands. 
Many of the instrumented strength testing techniques, which are becoming more standardized clini- 
cally and which are almost exclusively used in engineering applications, are based on the concepts and 
methods of MMT. Although not used for performance capacity tests, because of the ease, practicality, 
and speed of manual testing, it is still considered a useful tool, especially diagnostically to localize lesions. 
Several MMT grading systems prevail. These differ in the actual test positions and premises upon which 
muscle grading is based. For example, the approach promoted by Kendall, McCreary, and Provance [1993] 
tests a specific muscle (e.g., brachioradialis) rather than a motion. The Daniels and Worthingham 
[Hislop and Montgomery, 1995] method tests motions (e.g., elbow flexion) which involve all the agonists 
and synergists used to perform the movement. The latter is considered more functional and less time 
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consuming, but less specific. The reader is advised to consult these references directly for more informa- 
tion about MMT methods. Further discussion of non-instrumented tests is beyond the scope of this 
chapter. 

An argument can be made for using isometric testing because the force or torque reflects actual muscle 
tension as the position of the body part is held constant and the muscle mechanics do not change. 
Additionally, good stabilization is easier to achieve, and muscle actions can be better isolated. However, 
some clinicians prefer dynamic tests, perceiving them as more reflective of function. An unfortunate fact 
is that neither static nor dynamic strength measurements alone can reveal whether strength is adequate 
for functional activities. However, strength measurements can be used with models and engineering 
analyses for such assessments. 

Selected instrumented methods of measuring force or torque production capacity are listed and 
compared in Table 148.3. Table 148.3 is by no means comprehensive. More indepth review and compar- 
isons of various methods can be found in Amundsen [1990] and Mayhew and Rothstein [1985]. 
Figure 148.3 illustrates three common instruments used to measure strength. 

Key Concepts in Measuring Strength 

Because of the number of factors influencing strength and strength testing (discussed in a previous 
section), one can become discouraged rather than challenged when faced with the need to measure 
strength. Optimally, strength testing would be based on the "worst case" functional performance demands 
required by an individual in his or her daily life. "Worst case" testing requires knowing the performance 
demands of tasks including the positions required, types of muscle contractions, and so on. 

In the absence of such data, current strategy is to choose the instruments and techniques that maximally 
stress the system under a set of representative conditions that either (a) seem logical based on knowledge 
of the task, or (b) have been reported as appropriate and reliable for the population of interest. An 
attempt is made to standardize the testing in terms of contraction-type, test administration instructions, 
feedback, warm-up, number of trials, time of day, examiner, duration of contraction (usually 4 to 
6 seconds), method and location of application of force, testing order, environmental distractions, subject 
posture and position of testing, degree of stabilization, and rest intervals between exertions (usually 
30 seconds to 2 minutes) [Chaffin, 1975; Smidt and Rogers, 1982]. In addition, the subject must be 
observed for muscle group substitutions and "trick" movements. 

148.4 Speed of Movement 

Speed of movement refers to the rate of movement of the body or body segments. The maximum 
movement speed that can be achieved represents another unique performance capacity of an identified 
system that is responsible for producing motion. Everyday living, work, and sport tasks are commonly 
described in terms of the speed requirements (e.g., repetitions per minute or per hour). For physical 
tasks, such descriptions translate to translational motion speeds (e.g., as in lifting) as well as rotational 
motion speeds (i.e., movement about a dof of the joint systems involved). Thus, there is important 
motivation to characterize this capacity. 

Speed of Movement Terminology 

Speed of movement must be differentiated from speed of contraction. Speed of contraction refers to how 
fast a muscle generates tension. Two body parts may be moving through an arc with the same speed of 
movement; however, if one part has a greater mass, its muscles must develop more tension per unit of 
time to move the heavier body part at the same speed as the lighter body part. 

Speed, velocity, and acceleration also can be distinguished. The terms velocity and speed are often 
used interchangeably; however, the two quantities are frequently not identical. Velocity means the rate 
of motion in a particular direction. Acceleration results from a change in velocity over time. General 
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TABLE 148.3 Comparison of Various Instrumented Methods Used to Measure Muscle Strength 



Instrument/Method 

Repetition maximum 

Amount of weight a subject can lift a given 
number of times and no more; one 
determines either a one repetition 
maximum (1-RM) or a ten repetition 
maximum (10-RM). A 1-RM is the 
maximum amount of weight a subject can 
lift once; a 10-RM is the amount of weight 
a subject can lift 10 times; a particular 
protocol to determine RMs is defined 
[DeLorme and Watkins, 1948]; measures 
dynamic strength in terms of weight 
(pounds or kilograms) lifted. 
Hand-held dynamometer 

Device held in the examiner's hand used to 
test strength; devices use either hydraulics, 
strain gauges (load-cells), or spring systems 
(See Fig. 148.3a); used with a "break test" 
(the examiner exerts a force against the body 
segment to be tested until the part gives way) 
or a "make test" (the examiner applies a 
constant force while the subject exerts a 
maximum force against it); "make tests" are 
frequently preferred for use with hand-held 
dynamometers [Bohannon, 1990; Smidt, 
1984]; measures force; unclear whether test 
measures isometric or eccentric force (this 
may depend on whether a "make test" or a 
"break test" is used). 

Cable tensiometer 

One end of a cable is attached to an 
immovable object and the other end is 
attached to a limb segment; the tensiometer 
is placed between the sites of fixation; as the 
cable is pulled, it presses on the tensiometer's 
riser which is connected to a gauge (See 
discussion in Mayhew and Rothstein 
[1985]); measures isometric force. 

Strain gauge 

Electroconductive material applied to metal 
rings or rods; a load applied to the ring or 
bar deforms the metal and a gauge; 
deformation of the gauge changes the 
electrical resistance of the gauge causing a 
voltage variation; this change can be 
converted and displayed using a strip chart 
recorder or digital display; measures 
isometric force. 

Isokinetic dynamometer 

Constant velocity loading device; several 
models marketed by a number of different 
companies; most consist of a movable lever 
arm controlled by an electronic servomotor 
that can be preset for selected angular 
velocities usually between 0° and 500° 
per second; when the subject attempts to 



Advantages/Uses 

Requires minimal equipment 
(weights); inexpensive and 
easy to administer; frequently 
used informally to assess 
progress in strength training. 



Similar to manual muscle 
testing (MMT) in test 
positions and sites for load 
application; increased 
objectivity over MMT; 
portable; easy to administer; 
relatively inexpensive; 
commercially available from 
several suppliers; adaptable for 
a variety of test sites; provide 
immediate output; spring and 
hydraulic systems are non- 
electrical; load-cell based 
systems provide more precise 
digital measurements. 



Mostly used in research 
settings; evidence presented 
on reliability when used with 
healthy subjects [Clarke, 1954; 
Clarke, Bailey and Shay, 1952]; 
relatively inexpensive. 



Mostly used in research 
settings; increased sensitivity 
for testing strong and weak 
muscles. 



Disadvantages/Limitations 

Uses serial testing of adding weights 
which may invalidate subsequent 
testing; no control for speed of 
contraction or positioning; minimal 
information available on the reliability 
and validity of this method. 



Permits dynamic testing of 
most major body segments; 
especially useful for stronger 
movements; most devices 
provide good stabilization; 
measures reciprocal muscle 
contractions; widespread 



Stabilization of the device and body 
segment can be difficult; results can be 
affected by the examiner's strength; 
limited usefulness with large muscle 
groups; spring-based systems fatigue 
over time becoming inaccurate; range 
and sensitivity of the systems vary; 
shape of the unit grasped by the 
examiner and shape of the end-piece 
vary in comfort, and therefore the force 
a subject or examiner is willing to exert; 
more valuable for testing subjects with 
weakness than for less involved or 
healthy subjects due to range limits 
within the device (See discussion in 
Bohannon [1990]). 

Requires special equipment for testing; 
testing is time-consuming and some 
tests require two examiners; unfamiliar 
to most clinicians; not readily available; 
less sensitive at low force levels. 



Strain gauges require frequent calibration 
and are sensitive to temperature 
variations; to be accurate the body part 
must pull or push against the gauge in 
the same line that the calibration 
weights were applied; unfamiliar to 
most clinicians; not commercially 
available; difficult to interface the device 
comfortably with the subject. 

Devices are large and expensive; need 
calibration with external weights or are 
"self-calibrating;" signal damping and 
"windowing" may affect data obtained; 
angle-specific measurements may not 
be accurate if a damp is used because 
torque readings do not relate to the 
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TABLE 148.3 (continued) Comparison of Various Instrumented Methods Used to Measure Muscle Strength 



Instrument/Method 

accelerate beyond the pre-set machine 
speed, the machine resists the movement; a 
load cell measures the torque needed to 
prevent body part acceleration beyond the 
selected speed; computers provide digital 
displays and printouts (See typical device in 
Fig. 148.3b); measures isokinetic-concentric 
(and in some cases, isokinetic-eccentric) and 
isometric strength; provides torque (or 
occasionally force) data; debate exists about 
whether data are ratio-scaled or not; 
accounting for the weight of the segment 
permits ratio-scaling [Winter, Wells, and 
Orr, 1981]. 
Hand dynamometer 

Instruments to measure gripping or pinching 
strength specifically for the hand; usually use 
a spring scale or strain-gauge system (See 
typical grip strength testing device in 
Fig. 148.3c); measures isometric force. 



Advantages/Uses 

clinical acceptance; also 
records angular data, work, 
power, and endurance-related 
measures; provides a number 
of different reporting options; 
also used as exercise devices. 



Disadvantages/Limitations 

goniometric measurements; joints must 
be aligned with the mechanical axis of 
the machine; inferences about muscle 
function in daily activities from 
isokinetic test results have not been 
validated; data obtained between 
different brands are not 
interchangeable; adequate stabilization 
may be difficult to achieve for some 
movements; may not be usable with 
especially tall or short persons. 



Readily available from several Only useful for the hand; different brands 



suppliers; easy to use; 
relatively inexpensive; 
widespread use; some 
normative data available. 



not interchangeable; normative data 
only useful when reported for the same 
instrument and when measurements are 
taken with the same body position and 
instrument setting; must be recalibrated 
frequently. 



velocity and acceleration measurements are beyond the intent of this chapter. Reaction speed and response 
speed are other related variables also not considered. 

Factors Influencing Speed of Movement and Speed 
of Movement Measurements 

Muscles with larger moment arms, longer muscle fibers, and less pennation tend to be capable of 
generating greater speed. Many of the same factors influencing strength, discussed previously, such as 
muscle length, fatigue, and temperature affect the muscle's contractile rate. The load-velocity relationship 
is especially important when testing speed of movement. In addition to these and other physiological 
factors, speed can be reduced by factors such as friction, air resistance, gravity, unnecessary movements, 
and inertia [Jensen and Fisher, 1979]. 

Parameters Measured 

Speed of movement can be measured as a linear quantity or as an angular quantity. Typically, if the whole 
body is moving linearly in space as in walking or running, a point such as the center of gravity is picked, 
and translational motion is measured. Also, when an identified point on a body segment (e.g., the tip of 
the index finger) is moved in space, translational movement is observed, and motion is measured in 
translational terms. If the speed of a rotational motion system (e.g., elbow flexors) is being measured, 
then the angular quantity is determined. As the focus here is on measuring isolated neuromuscular 
performance capacities, the angular metric is emphasized. Angular speed of a body segment is obtained 
by: angular speed = change in angular position/change in time: 



At' 



(148.2) 



Thus, speed of movement may be expressed in revolutions, degrees, or radians per unit of time, such as 
degrees per second (deg/s). 
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(a) 




(b) 

FIGURE 148.3 Three types of instrumented strength testing devices are shown: (a) a representative example of 
a typical hand-held dynamometer; (b) an example of an isokinetic strength testing device; (c) a hand dynamom- 
eter used to measure grip strength 

Another type of speed measure applies to well-defined (over fixed angle or distance) cyclic motions. 
Here repetitions per unit time or cycles per unit time measures are sometimes used. However, in almost 
every one of these situations, speed can be expressed in degrees per second or meters per second. The 
latter units are preferred because they allow easier comparison of speeds across a variety of tasks. The 
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only occasion when this is difficult is when translation motion is not in a simple straight line, such as 
when a person is performing a complex assembly task with multiple subtasks. 

The issue of whether to express speed as maximum, averaged, or instantaneous values must also be 
decided based on which measure is a more useful indicator of the performance being measured. In 
addition to numerical reporting of speed data, time-history graphs of speed may be helpful in comparing 
some types of performance. 

Instruments Used to Measure Speed of Movement 

When movement time is greater than a few seconds and the distance is known, speed can be measured 
with a stop watch or with switch plates, such as the time elapsed in moving between two points or over 
a specified angle. With rapid angular joint movements, switch plates or electrogoniometers with electronic 
timing devices are required. Speeds can also be computed from the distance or angle and time data 
available from cinematography, optoelectric movement monitoring systems, and videotape systems. Some 
dynamic strength testing devices involve presetting a load and measuring the speed of movement. 

In addition, accelerometers can be used to measure acceleration directly, and speed can be derived 
through integration. However, piezoelectric models have no steady-state response and may not be useful 
for slower movements. Single accelerometers are used to measure linear motion. Simple rotatory motions 
require two accelerometers. Triaxial accelerometers are commercially available that contain three pre- 
mounted accelerometers perpendicular to each other. Multiple accelerometer outputs require appropriate 
processing to resolve the vector component corresponding to the desired speed. Accelerometers are most 
appropriately used to measure acceleration when they are mounted on rigid materials. Accelerometers 
have the advantage of continuously and directly measuring acceleration in an immediately usable form. 
They can also be very accurate if well-mounted. Because they require soft tissue fixation and cabling or 
telemetry, they may alter performance and further error may be induced by relative motion of the device 
and tissues. The systems are moderately expensive (See discussion of accelerometers in Robertson and 
Sprigings [1987]). 

Key Concepts in Speed of Movement Measurement 

As discussed, maximum speed is determined when there is little stress on torque production resources. 
As resistance increases, speed will decrease. Therefore, the load must be considered and specified when 
testing speed. Because speed of movement data are calculated from displacement and temporal data, a 
key issue is minimizing error which might result from collecting this information. Error can result from 
inaccurate identification of anatomical landmarks, improper calibration, perspective error, instrument 
synchronization error, resolution, digitization error, or vibration. The sampling rate of some of the 
measurement systems may become an issue when faster movements are being analyzed. In addition, the 
dynamic characteristics of signal conditioning systems should be reported. 

148.5 Endurance 

Endurance is the ability of a system to sustain an activity for a prolonged time (static endurance) or to 
perform repeatedly (dynamic endurance). Endurance can apply to the body as a whole, a particular body 
system, or to specific neuromuscular functional units. High levels of endurance imply that a given level 
of performance can be continued for a long time period. 

Endurance Terminology 

General endurance of the body as a whole is traditionally considered cardiovascular endurance or aerobic 
capacity. Cardiovascular endurance is most frequently viewed in terms of V0 2m . This chapter considers 
only endurance of neuromuscular systems. Although many central and peripheral anatomic sites and 
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physiologic processes contribute to a loss of endurance, endurance of neuromuscular functional units is 
also referred to as muscular endurance. 

Absolute muscle endurance is defined as the amount of time that a neuromuscular system can continue 
to accomplish a specified task against a constant resistance (load and rate) without relating the resistance 
to the muscle's strength. Absolute muscle endurance and strength are highly correlated. Conversely, 
strength and relative muscle endurance are inversely related. That is, when resistance is adjusted to the 
person's strength, a weaker person tends to demonstrate more endurance than a stronger person. Fur- 
thermore, the same relationships between absolute and relative endurance and strength are correlated 
by type of contraction; in other words, there is a strong positive correlation between isotonic strength 
and absolute isotonic endurance and vice versa for strength and relative isotonic endurance. The same 
types of relationships exist for isometric strength and isometric endurance [Jensen and Fisher, 1979]. 

Factors Influencing Neuromuscular Endurance 
and Measurement of Endurance 

Specific muscle fiber types, namely fast-twitch fatigue-resistant fibers (FR), generate intermediate levels 
of tension and are resistant to short-term fatigue (a duration of about 2 minutes or intermittent stimu- 
lation). Slow-twitch fibers (S) generate low levels of tension slowly, and are highly resistant to fatigue. 
Muscle contractions longer than 10 seconds, but less than 2 minutes, will reflect local muscle endurance 
[Astrand and Rodahl, 1986]. For durations longer than 2 minutes, the S fibers will be most stressed. A 
submaximal isometric contraction to the point of voluntary fatigue will primarily stress the FR and S 
fibers [Thorstensson and Karlsson, 1976]. Repetitive, submaximal, dynamic contractions continued for 
about 2 to 6 minutes will measure the capacity of FR and S fibers. Strength testing requires short duration 
and maximal contractions; therefore, to differentiate strength and endurance testing, the duration and 
intensity of the contractions must be considered. 

Because strength affects endurance, all of the factors discussed previously as influencing strength, also 
influence endurance. In addition to muscle physiology and muscle strength, endurance is dependent 
upon the extensiveness of the muscle's capillary beds, the involved neuromuscular mechanisms, contrac- 
tion force, load, and the rate at which the activity is performed. 

Endurance time, or the time for muscles to reach fatigue, is a function of the contraction force or load 
[von Rohmert, 1960] . As the load (or torque required) increases, endurance time decreases. Also, as speed 
increases, particularly with activities involving concentric muscle contractions, endurance decreases. 

Parameters Measured 

Endurance is how long an activity can be performed at the required load and rate level. Thus, the basic 
unit of measure is time. Time is the only measure of how long it takes to complete a task. If we focus 
on a given variable (e.g., strength, speed, or endurance), it is necessary to either control or measure the 
others. When the focus is endurance, the other factors of force or torque, speed, and joint angle, can be 
described as conditions under which endurance is measured. Because of the interactions of endurance 
and load, or endurance and time, for examples, a number of endurance-related measures have evolved. 
These endurance-related measures have clouded endurance testing. 

One endurance-related measure uses either the number of repetitions that can be performed at 20, 
25, or 50 percent of maximum peak torque or force. The units used to reflect endurance in this case are 
number of repetitions at a specified torque or force level. One difficulty with this definition has been 
described previously, that is, the issue of relative versus absolute muscle endurance. Rothstein and Rose 
[1982] demonstrated that elderly subjects with selected muscle fiber type atrophy were able to maintain 
50 percent of their peak torque longer than young subjects. However, if a high force level is required to 
perform the task, then the younger subject would have more endurance in that particular activity 
[Rothstein, 1982]. Another difficulty is that the "repetition method" can be used only for dynamic 
activities. If isometric activities are involved, then the time an activity can be sustained at a specified 
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force or torque level is measured. Why have different units of endurance? Time could be used in both 
cases. Furthermore, the issue of absolute versus relative muscle endurance becomes irrelevant if the 
demands of the task are measured. 

Yet another method used to reflect endurance is to calculate an endurance-related work ratio. Many 
isokinetic testing devices, such as the one shown in Fig. 148.3b, will calculate work (integrate force or 
torque over displacement). In this case, the total amount of work performed in the first five repetitions 
is compared with the total amount of work performed in the last five repetitions of a series of repetitions 
(usually 25 or more). Work degradation reflects endurance and is reported as a percentage. An additional 
limitation of using these endurance ratios is that work cannot be determined in isometric test protocols. 
Mechanically there is no movement, and no work is being performed. 

Overall, the greatest limitation with most endurance-related approaches is that the measures obtained 
cannot be used to perform task-related assessments. In a workplace assessment, for example, one can 
determine how long a specific task (defined by the conditions of load, range, and speed) needs to be 
performed. Endurance-related metrics can be used to reflect changes over time in a subject's available 
endurance capacity; however, endurance-related metrics cannot be compared to the demands of the task. 
Task demands are measured in time or repetitions (e.g., 10) with a given rate (e.g., 1/0.5 h) from which 
total time (e.g., 5 h) can be calculated. A true endurance measure (versus an endurance-related measure) 
can serve both purposes. Time reflects changes in endurance as the result of disease, disuse, training, or 
rehabilitation and also can be linked to task demands. 

Methods and Instruments Used to Measure Neuromuscular Endurance 

Selection of the method or instrument used to measure endurance depends on the purpose of the mea- 
surement and whether endurance or endurance-related measures will be obtained. As in strength testing, 
endurance tests can involve simple, low level tasks or whole-body, higher level activities. The simplest 
method of measuring endurance is to define a task in terms of performance criteria and then time the 
performance with a stop watch. A subject is given a load and a posture and asked to hold it "as long as 
possible" or to move from one point to another point at a specific rate of movement for "as long as possible." 

An example of a static endurance test is the Sorensen test used to measure endurance of the trunk 
extensors [Biering-Sorensen, 1984]. This test measures how long a person can sustain his or her torso 
in a suspended prone posture. The individual is not asked to perform a maximal voluntary contraction, 
but an indirect calculation of load is possible [Smidt and Blanpied, 1987]. 

An example of a dynamic endurance test is either a standardized or non-standardized, dynamic isoin- 
ertial (see previous description in the section on strength testing) repetition test. In other words, the subject 
is asked to lift a known load with a specified body part or parts until defined conditions can no longer be 
met. Conditions such as acceleration, distance, method of performance, or speed may or may not be 
controlled. The more standardized of these tests, particularly those which involve lifting capacity, are 
reported and projections about performance capacity over time are estimated [Snook, 1978]. Ergometers 
and some of the isokinetic dynamometers discussed previously measure work, and several can calculate 
endurance-related ratios. These devices could be adapted to measure endurance in time units. 

Key Concepts in Measuring Muscle Endurance 

Of the four variables of human performance discussed in this chapter, endurance testing is the least 
developed and standardized. Except for test duration and rest intervals, attention to the same guidelines 
as described for strength testing is currently recommended. 

148.6 Reliability, Validity, and Limitations in Testing 

Space does not permit a complete review of these important topics. However, a few key comments are 
in order. First, it is important to note that reliability and validity are not inherent qualities of instruments, 
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but exist in the measurements obtained only within the context in which they are tested. Second, reliability 
and validity are not either present or absent but are present or absent along a continuum. Third, 
traditional quantitative measures of reliability might indicate how much reliability a given measurement 
method demonstrates, but not how much reliability is actually needed. Fourth, technology has advanced 
to the extent that it is generally possible to measure physical variables such as time, force, torque, angles, 
and speed accurately, repeatably, and with high resolution. Lastly, clinical generalizability of human 
performance capacity measures ultimately results from looking at the body of literature on reliability as 
a whole and not from single studies. 

For these types of variables, results of reliability studies basically report that: ( 1 ) if the instrumentation 
is good, and (2) if established, optimal procedures are carefully followed, then results of repeat testing 
will usually be in the range of about 5 to 20% of each other. This range of repeatability depends on: 
(1) the particular variable being measured (i.e., repeated endurance measures will differ more than 
repeated measures of hinge joint EOM), and (2) the magnitude of the given performance capacity (i.e., 
errors are often in fixed amounts such as 3° for ROM; thus, 3° out of 180° is smaller percentage-wise 
than 3° out of 20°). One can usually determine an applicable working value (e.g., 5 or 20%) by careful 
review of the relevant reliability studies. Much of the difference obtained in test-retest results is because 
of limitations in how well one can reasonably control procedures and the actual variability of the 
parameter being measured, even in the most ideal test subjects. Measurements should be used with these 
thoughts in mind. If a specific application requires extreme repeatability, then a reliability study should 
be conducted under conditions that most closely match those in which the need arises. Reliability 
discussions specific to some of the focal measures of this chapter are presented in Amundsen [1990], 
Hellebrandt, Duvall, and Moore [1949], Mayhew and Rothstein [1985], and Miller [1985]. 

Measurements can be reliable but useless without validity. Most validity studies have compared the 
results of one instrument to another instrument or to known quantities. This is the classical type of validity 
testing which is an effort to determine whether the measurement reflects the variable being measured. In 
the absence of a "gold standard" this type of testing is of limited value. In addition to traditional studies 
of the validity of measurements, the issue of the validity of the inferences based on the measurements is 
becoming increasingly important [Rothstein and Echternach, 1993]. That is, can the measurements be 
used to make inferences about human performance in real life situations? Unfortunately, measurements 
which have not demonstrated more than content validity are frequently used as though they are predictive. 
The validity of the inferences made from human performance data needs to be rigorously addressed. 

Specific measurement limitations were briefly addressed in Tables 148.2 and 148.3 and in the written 
descriptions of various measurement techniques and instruments. Other limitations have more to do with 
interpreting the data. A general limitation is that performance variables are not fixed human attributes. 
Another limitation is that population data are limited and available normative data are, unfortunately, 
frequently extrapolated to women, older persons, and so on [Chaffin and Andersson, 1991]. Some nor- 
mative data suggest the amount of resources such as strength, ROM, speed of movement, and endurance 
required for given activities; other data suggest the amount available. As previously mentioned, these are 
two different issues. Performance measurements may yield information about the current status of per- 
formance, but testing rarely indicates the cause or the nature of dysfunction. More definitive, diagnostic 
studies are used to answer these questions. Whereas, considerable information exists with regard to 
measuring performance capacities of human systems, much less energy has been directed to understanding 
requirements of tasks. The link between functional performance in tasks and laboratory-acquired mea- 
surements is a critical question and a major limitation in interpreting test data. The ERM addresses several 
of these limitations by using a multidimensional, individualized, cause-and-effect model. 

148.7 Performance Capacity Space Representations 

In both the study and practice, performance of neuromuscular systems has been characterized along one 
or two dimensions of performance at a time. However, human subsystems function within a multi- 
dimensional performance space. ROM/EOM, strength, movement speed, and endurance capacities are 
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not only interdependent, but may also vary uniquely within individuals. Multiple measurements are 
necessary to characterize a person's performance capacity space, and performance capacity is dependent 
on the task to be performed. Therefore, both the individual and the task must be considered when 
selecting measurement tools and procedures [Chaffin and Andersson, 1991]. 
In many of the disciplines in which human per- 



formance is of interest, traditional thinking has 
often focused on single number measures of ROM, 
strength, speed, etc. More recent systems engineer- 
ing approaches [Kondraske, 1999] emphasize con- 
sideration of the performance envelope of a given 
system and suggest ways to integrate single mea- 
surement points that define the limits of perfor- 
mance of a given system [Vasta and Kondraske, 
1997]. Figure 148.4 illustrates a three-dimensional 
performance envelope derived from torque, angle, 
and velocity data for the knee extensor system. The 
additional dimension of endurance can be repre- 
sented by displaying this envelope after performing 
an activity for different lengths of time. A higher 
level, composite performance capacity, as is some- 
times needed, could be derived by computing the 
volume enclosed by this envelope. Such represen- 
tations also facilitate assessment of the given system 
in a specific task; that is, a task is defined as a point 
in this space that will either fall inside or outside 
the envelope. 



Torque 




Angle 



Velocity 

FIGURE 148.4 An example of a torque-angle-velocity 
performance envelope for the knee extensor system. 
Source: Vasta PJ, Kondraske GV. 1994. A multi-dimen- 
sional performance space model for the human knee 
extensor (Technical Report 94-001R). p 11. University of 
Texas at Arlington, Human Performance Institute, 
Arlington, Texas. With permission. 



148.8 Conclusions 



In conclusion, human movement is so essential that it demands interest and awe from the most casual 
observer to the most sophisticated scientists. The complexity of performance is truly inspiring. We are 
challenged to understand it! We want to reduce it to comprehensible units and then enhance it, reproduce 
it, restore it, and predict it. To do so, we must be able to define and quantify the variables. Hence, an 
array of instruments and methods have emerged to measure various aspects of human performance. To 
date, measurement of neuromuscular performance capacities along the dimensions of ROM/EOM, 
strength, speed of movement, and endurance represents a giant stride but only the "tip of the iceberg." 
Progress in developing reliable, accurate, and valid instruments and in understanding the factors influ- 
encing the measurements cannot be permitted to discourage us from the larger issues of applying the 
measurements toward a purpose. Yet, single measurements will not suffice; multiple measurements of 
different aspects of performance will be necessary to fully characterize human movement. 

Defining Terms 

Endurance: The amount of time a body or body segments can sustain a specified static or repetitive 
activity. 

Extremes of motion (EOM): The end ranges of motion at a joint measured in degrees. 

Muscle strength: The maximal amount of torque or force production capacity that a muscle or muscle 
groups can voluntarily exert in one maximal effort, when type of muscle contraction, movement 
velocity, and joint angle(s) are specified. 

Neuromuscular functional units: Systems (that is, the combination of nerves, muscles, tendons, liga- 
ments, and so on) responsible for producing basic movements. 
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Range of motion (ROM): The amount of movement that occurs at a joint, typically measured in 
degrees. ROM is usually measured by noting the extremes of motion, or as the difference between 
the extreme motion and the reference position. 

Speed of movement: The rate of movement of the body or body segments. 
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The human nervous system is capable of simultaneous and integrated control of 100-150 mechanical 
degrees of freedom of movement in the body via tensions generated by about 700 muscles. In its widest 
context, movement is carried out by a sensory-motor system comprising multiple sensors (visual, audi- 
tory, proprioceptive), multiple actuators (muscles and skeletal system), and an intermediary processor 
which can be summarized as a multiple-input multiple-output non-linear dynamic time-varying control 
system. This grand control system comprises a large number of interconnected processors and sub- 
controllers at various sites in the central nervous system of which the more important are the cerebral 
cortex, thalamus, basal ganglia, cerebellum, and spinal cord. It is capable of responding with remarkable 
accuracy, speed (when necessary), appropriateness, versatility, and adaptability to a wide spectrum of 
continuous and discrete stimuli and conditions. Certainly, by contrast, it is orders of magnitude more 
complex and sophisticated than the most advanced robotic systems currently available — although the 
latter also have what are often highly desirable attributes such as precision and repeatability and a much 
greater immunity from factors such as fatigue, distraction, and lack of motivation! 

This chapter addresses the control function. First, it introduces several important concepts relating to 
sensory-motor control, accuracy of movement, and performance resources/capacities. Second, it provides 
an overview of apparatuses and methods for the measurement and analysis of complex sensory-motor 
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performance. The overview focuses on measurement of sensory-motor control performance capacities 
of the upper-limbs and by means of tracking tasks. 

149.2 Basic Principles 

Sensory-motor Control and Accuracy of Movement 

From the perspective of Kondraske's [Kondraske, 1995a; Kondraske, 1995b] elemental resource model of 
human performance, sensory-motor control is the function of the overall sensory-motor control system. 
This system can be considered as a hierarchy of multiple interconnected sensory-motor controllers cited 
in the central processing and skills ("software") domains (cf. environmental interface domain, comprising 
sensors and actuators, and life sustaining domains) of the elemental resource model. These controllers 
range from low-level elemental level controllers for control of movement around single joints, through 
intermediate-level controllers needed to generate integrated movements of an entire limb and involving 
multiple joints and degrees of freedom, and high-level controllers and processors to enable coordinated 
synergistic multi-limb movements and the carrying out of central executive functions concerned with 
allocation and switching of resources for execution of multiple tasks simultaneously. 

Each of these controllers is considered to possess limited performance resources (PRs) — or perfor- 
mance capacities — necessary to carry out their control functions. PRs are characterized by dimensions 
of performance, which for controllers are accuracy of movement (including steadiness and stability) and 
speed of movement. Accuracy is the most important of these and can be divided into four major classes: 

1. Spatial accuracy — Required by tasks which are self-paced and for which time taken is of secondary 
or minimal importance and includes tracing (e.g., map-tracking), walking, reaching, and, in fact, 
most activities of daily living. Limitation in speed PRs should have no influence on this class of 
accuracy. 

2. Spatial accuracy with time constraints — Identical to "spatial accuracy" except that, in addition to 
accuracy, speed of execution of task is also of importance. Because maximal performance capacities 
for accuracy and speed of movement cannot, in general, be realized simultaneously, the carrying 
out of such tasks must necessarily involve speed-accuracy tradeoffs [Fitts, 1954; Fitts and Posner, 
1967]. The extent to which accuracy is sacrificed for increased speed of execution, or vice versa, 
is dependent on the perceived relative importance of accuracy and speed. 

3. Temporal accuracy — Required by tasks which place minimal demands on positional accuracy and 
includes single and multi-finger tapping and foot tapping. 

4. Spatiotemporal accuracy — Required by tasks which place considerable demand on attainment of 
simultaneous spatial and temporal accuracy. This includes paced positional tasks such as tracking, 
driving a vehicle, ball games and sports, and video games. It should be stressed, however, that most 
of the above self-paced tasks also involve a considerable interrelationship between space and time. 

Tracking tasks have become well established as being able to provide one of the most accurate and 
flexible means for laboratory-based measurement of spatiotemporal accuracy and, thus, of the perfor- 
mance capacity of sensory-motor control or sensory-motor coordination. In addition, they provide an 
unsurpassed framework for studies of the underlying control mechanisms of motor function (e.g., Cooper 
et al, 1989; Lynn et al., 1979; Neilson et al., 1993; Neilson et al, 1995; Neilson et al., 1998) — the potential 
for which was recognized as early as 1943 as seen in writings by Craik [1966]. They have achieved this 
status through their (a) ability to maximally stress the accuracy dimension of performance and, hence, 
the corresponding control PR, (b) the continuous nature and wide range of type and characteristics of 
input target signals they permit, (c) facility for a wide range of 1-D and 2-D sensors for measuring a 
subject's motor output, and (d) measure of continuous performance (cf. reaction time tests). 

From this perspective, it will be of little surprise to find that tracking tasks are the primary thrust of 
this chapter. 
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The Influence of Lower-Level Performance Resources on Higher-Level 
Control Performance Resources 

By their very nature, tasks which enable one to measure spatiotemporal accuracy are complex or higher- 
level sensory-motor tasks. These place demands on a large number of lower-level PRs such as visual 
acuity, dynamic visual perception, range of movement, strength, simple reaction times, acceleration/deac- 
celeration, static steadiness, dynamic steadiness, prediction, memory, open-loop movements, concentra- 
tion span, attention switching i.e., central executive function or supervisory attentional system (multitask 
abilities), utilization of preview, and learning. 

It is, therefore, important to ask: If there are so many PRs involved in tracking, can tracking perfor- 
mance provide an accurate estimate of sensory-motor control performance capacities? Or, if differences 
are seen in tracking performance between subjects, do these necessarily indicate comparable differences 
in control performance capacities? Yes, they can, but only if the control resource is the only resource 
being maximally stressed during the tracking task. Confirmation that the other PRs are not also being 
maximally stressed for a particular subject can be ascertained by two means. First, by independently 
measuring the capacity of the other PRs and confirming that these are considerably greater than that 
determined as necessary for the tracking task in question. For example, if the speed range for a certain 
reference group on a non-target speed test is 650-1250 mm/s and the highest speed of a tracking target 
signal is 240 mm/s, then one can be reasonably confident that intra-group differences in performance 
on the tracking task are unrelated to intra-group differences in speed. Second, where this process is less 
straightforward or not possible, it may be possible to alter the demands imposed by the task on the PR 
in question. For example, one could see whether visual acuity was being maximally stressed in a tracking 
task (and hence, be a significant limiting factor to the performance obtained) by increasing or decreasing 
the eye-screen distance. Similarly, one could look at strength in this context by altering the friction, 
damping, or inertia of the sensor, or at range of movement by altering the gain of the sensor. 

The conclusion that a task and a PR are unrelated for a particular group does not, of course, mean 
that this can necessarily be extrapolated to some other group. For example, strength may be completely 
uncorrelated with tracking performance in normal males yet be the primary factor responsible for poor 
tracking performance by the paretic arm of subjects who have suffered a stroke. 

The foregoing discussion is based on the concept of a assumption that if a task requires less than the 
absolute maximum available of a particular PR then performance on that task will be independent of 
that PR. Not surprisingly, the situation is unlikely to be this simplistic or clear-cut! If, for example, a 
tracking task places moderate sub-maximal demands on several PRs, these PRs will be stressed to varying 
levels such that the subject may tend to optimize the utilization of those resources [Kondraske, 1995a; 
Kondraske, 1995b] so as to achieve an acceptable balance between accuracy, speed, stress/effort, and 
fatigue (physical and cognitive). Thus, although strength available is much greater than strength needed 
(i.e., Resource available 9> Resource demand) for both males and females, could the differential in strength 
be responsible for males performing better on tracking tasks than females? [Jones et al., 1986] 

149.3 Measurement of Sensory-Motor Control Performance 
Techniques: An Overview 

Tracking tasks are the primary methodological approach outlined in this chapter for measurement of 
sensory-motor control performance. There are, however, a large number of other approaches, each with 
their own set of apparatuses and methods, which can provide similar or different data on control 
performance. It is possible to give only a cursory mention of these other techniques in this chapter [see 
also Further Information] . 

Hand and foot test boards comprising multiple touch-plate sensors provide measures of accuracy and 
speed of lateral reaching-tapping abilities [Kondraske et al., 1984, 1988]. 
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Measurement of steadiness and tremor in an upper-limb, lower-limb, or segment of either, particularly 
when sustended, can be made using variable size holes, accelerometers, or force transducers [Potvin et al., 
1975]. A dual-axis capacitive transducer developed by Kondraske et al. [1984] provides an improved 
means of quantifying steadiness and tremor due to it requiring no mechanical connection to the subject 
(i.e., no added inertia) and by providing an output of limb position as opposed to less informative 
measures of acceleration or force. Interestingly, tests of steadiness can be appropriately considered as a 
category of tracking tasks in which the target is static. The same is also true for measurement of postural 
stability using force balance platforms, whether for standing [Kondraske et al., 1984; Milkowski et al., 
1993] or sitting [Riedel et al., 1992] [see also Further Information]. 

Tracking Tasks: An Overview 

A tracking task is a laboratory-based test apparatus characterized by a continuous input signal — the 
target — which a subject must attempt to match as closely as possible by his/her output response by 
controlling the position of (or force applied to) some sensor. It provides unequalled opportunities for 
wide-ranging experimental control over sensors, displays, target signals, dimensionality (degrees of 
freedom), control modes, controlled system dynamics, and sensor-display compatibility, as well as the 
application of a vast armamentarium of linear and non-linear techniques for response signal analysis 
and systems identification. Because of this, the tracking task has proven to be the most powerful and 
versatile tool for assessing, studying, and modelling higher-level functioning of the human 'black-box' 
sensory-motor system. 

There are three basic categories of tracking tasks differing primarily in their visual display and in the 
corresponding control system (Fig. 149.1). The pursuit task displays both the present input and output 
signals, whereas the compensatory task displays only the difference or error signal between these. The 
preview task [Poulton, 1964; Welford, 1968; Jones and Donaldson, 1986] (Fig. 149.2) is similar to the 
pursuit task except that the subject can see in advance where the input signal is going to be and plan 
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FIGURE 149.1 Modes of tracking (i) Compensatory: subject aims to keep resultant error signal X (= input sig- 
nal — output signal = S;(t) - s D (t)) on stationary vertical line, (ii) Pursuit: subject aims to keep his output signal X, 
s (t), on the target input signal O, S;(t). (iii) Preview: subject aims to keep his output signal X, s (t), on the descending 
target input signal, [S;(t + T ) - S;(t - T h )] (where t = present time, T p = preview time, T h = history or postview time). 
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FIGURE 149.2 Visual display for random tracking 
with a preview of 8.0 s and postview of 1.1 s. 



accordingly to minimize the resultant error signal. 
Tracing tasks [Driscoll, 1975; Stern et al„ 1983; 
Hocherman and Aharon-Peretz, 1994] are effec- 
tively self-paced 2-D preview tracking tasks. 

The input-output nature of tracking tasks has 
made them most suitable for analysis using engi- 
neering control theories. This has led to the com- 
mon view of pursuit tracking as a task involving 
continuous negative feedback [Notterman et al., 
1982] but there is evidence that tracking viewed as 
a series of discrete events would be more appropri- 
ate [Bosser, 1984; Neilson et al, 1988]. The inclu- 
sion of preview of the input signal greatly 
complicates characterization of the human control- 
ler and Sheridan [1966] has suggested three models 
of preview control which employ the notions of 
constrained preview and nonuniform importance 

of input. Lynn et al. [1979] and Neilson et al. [1992] have also demonstrated how, by treating the 
neurologically impaired subject as a black-box, control analysis can lead to further information on 
underlying neurological control mechanisms. 

Despite the wide-spread utilization and acceptance of tracking tasks as a powerful and versatile means 
for quantifying and studying sensory-motor control capacities, there is little available on the market in 
this area. The most obvious exception to this is the photoelectric pursuit rotor which is ubiquitous in 
the motor behavior laboratories of university psychology departments and has been available since the 
1950s [Welford, 1968; Schmidt, 1982; Siegel, 1985]. It is a paced 2-D task with a target with the periodic 
on each revolution. Although inexpensive, the pursuit rotor is a crude tracking task allowing limited 
control over target signals and possessing a very gross performance analysis in terms of time on target. 
Thus nearly all of the many and varied tracking tasks which have been used in countless experimental 
studies around the world have been developed by the users themselves for their specific objectives. 

An improvement in this situation has occurred recently with the arrival on the market of a number 
of tracking devices from Human Performance Measurement, Inc. (Arlington, TX). These devices are a 
natural extension of those developed by Kondraske et al. [1984]. Off-the-shelf availability of computer- 
based tracking tests, including sensors for both upper- and lower-limbs, opens up the possibility of a 
much broader and widespread use of tracking tasks. In particular, one can look forward to a much greater 
utilization of tracking tasks outside of traditional research areas and in more routine assessment appli- 
cations in clinical, rehabilitative, vocational, sports, and other environments. 



Tracking Tasks: Options and Considerations 

Whatever the reasons for needing quantification of sensory-motor control capacity via a tracking task, 
there are a number of options available and factors to be considered in choosing or designing a tracking 
task. 

Sensors 

Sensors for measuring a subject's motor output in 1-D tracking tasks can be categorized under 
(a) movements involving a single degree of freedom such as flexion-extension rotation around a single 
joint including elbow [Deuschl et al., 1996; Lynn et al., 1977; O'Dwyer et al., 1996; Soliveri et al., 1997], 
wrist [Johnson et al., 1996], or a finger, or pronation-supination of the wrist, and (b) movements 
involving two or more degrees of freedom of a body part (e.g., hand) — i.e., coordinated movement at 
multiple joints — which are either 1-D, such as some form of linear transducer [Patrick and Mutlusoy, 
1982; Baroni et al., 1984; van den Berg et al., 1987] or 2-D, such as steering wheel [Buck, 1982; Ferslew 
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et al., 1982; Jones and Donaldson, 1986; Jones et al., 1993], stirring wheel [De Souza et al., 1980], position 
stick (i.e., 1-D joystick) [Neilson and Neilson, 1980; O'Dwyer and Neilson, 1998; Potvin et al, 1977], 
joystick [Kondraske et al., 1984; Miall et al., 1985; Jones etal., 1993], finger-controlled rotating knob 
[Neilson etal., 1993], and light-pen [Neilson and Neilson, 1980]. Force sticks, utilizing strain-gauge 
transducers mounted on a cantilever, are also commonly used as sensors [Garvey, 1960; Potvin etal., 
1977; Miller and Freund, 1980; Barr et al, 1988; Stelmach and Worrington, 1988]. Isometric integrated 
EMG (i.e., full-wave rectification and low-pass filtering of the raw EMG) can also be used to control the 
tracking response cursor, as was done by Neilson et al. [1990] to help show that impairment of sensory- 
motor learning is the primary cause of functional disability in cerebral palsy. 

Sensors for 2-D tasks must, of course, be capable of moving with and recording two degrees of freedom. 
Joysticks are commonly used for this and range in size from small, for finger movement [Anderson, 1986] 
and wrist/forearm movement [Bloxham etal., 1984; Frith etal., 1986; Neilson etal., 1998], up to large 
floor-mounted joysticks for arm movements primarily involving shoulder and elbow function [Anderson, 
1986; Behbehani etal, 1988; Dalrymple-Alford etal, 1994; Jones etal, 1993; Kondraske etal., 1984; 
Watson and Jones, 1998; Watson et al., 1997]. Other 2-D task sensors include a hand-held stylus for the 
photoelectric pursuit rotor [Schmidt, 1982; Siegel, 1985], plexiglass tracing [Driscoll, 1975], and tasks 
utilizing sonic digitizers [Hocherman and Aharon-Peretz, 1994; Stern et al., 1984; Viviani and Mounoud, 
1990]. Abend et al. [1982] and Flash and Hogan [1985] used a two-joint mechanical arm to restrict hand 
movements to the horizontal plane in the investigation of CNS control of two-joint (shoulder and elbow) 
movements in trajectory formation. Stern et al. [1983] simply used the subject's finger as the sensor for 
a tracing task on a vertical plexiglass 'screen'; a video camera behind the screen recorded finger move- 
ments. Novel 'whole-body' 2-D tracking is also possible by having subjects alter their posture while 
standing on a dual-axis force platform [Kondraske et al., 1984]. 

1-D sensors can also be used in 2-D tasks by way of bimanual tracking. For example, O'Dwyer and 
Neilson [1995] used two 1-D joysticks to investigate dynamic synergies between the right and left arms. 

Displays 

Early tracking devices used mechanical-based displays such as a rotating smoked drum [Vince, 1948], 
the ubiquitous pursuit rotor, and a paper-strip preview task [Poulton, 1964; Welford, 1968]. An oscillo- 
scope was, and still is to a much less degree, used in a large number of tracking tasks, initially driven by 
analog circuitry [Anderson, 1986; Flowers, 1976] but later by D/A outputs on digital computers [Kon- 
draske etal, 1984; Miall etal, 1985; Sheridan, M.R. etal, 1987; Cooper etal, 1989]. Standard raster- 
based television screens have been used by some workers [Potvin et al., 1977; Beppu et al., 1984]. Non- 
raster vector graphics displays, such as Digital Equipment's VT1 1 dynamic graphics unit, proved valuable 
during the PDP-era as a means for generating more complex dynamic stimuli such as squares [Neilson 
and Neilson, 1980; Frith et al, 1986] and preview [Jones and Donaldson, 1986] (Fig. 149.2). More 
recently, raster-based color graphics boards have allowed impressive static displays and simple dynamic 
tracking displays to be generated on PCs. However, such boards are not, in general, immediately amenable 
for the generation of flawless dynamic displays involving more complex stimuli, such as required for 
preview tracking. Jones et al. [1993] have overcome this drawback by the use of specially-written high- 
speed assembly-language routines for driving their display. These generate a display of the target and the 
subject's response marker by considering the video memory (configured in EGA mode) as four overlap- 
ping planes, each switchable (via a mask), and each capable of displaying the background color and a 
single color from a palette. Two planes are used to display the target, with the remaining two being used 
to display the subject's pointer. The current target is displayed on one target plane, while the next view 
of the target, in its new position, is being drawn on the other undisplayed target plane. The role of the 
two planes is reversed when the computer receives a vertical synchronization interrupt from the graphics 
controller indicating the completion of a raster. Through a combination of a high update-rate of 60.34 Hz 
(i.e., the vertical interrupt frequency), assembly language, and dual display buffers, it has been possible 
to obtain an extremely smooth dynamic color display. Their system for tracking and other quantitative 
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sensory-motor assessments is further enhanced through its facility to generate dynamic color graphics 
on two high-resolution monitors simultaneously: one for the tracking display, and one for use by the 
assessor for task control and analysis. The monitors are driven by a ZX1000 graphic controller (Artist 
Graphics Inc), at 800 x 600, and a standard VGA controller, respectively. 

In contrast to the above CRT-based displays, Warabi et al. [1986] used a laser-beam spot to indicate 
a subject's hand position together with a row of LEDs for displaying a step target. Similarly, Gibson et al. 
[1987] used a galvonometer-controlled laser spot to display smooth and step stimuli on a curved screen 
together with a white-light spot controlled by subject. Leist et al. [1987], Viviani and Mounoud [1990], 
and Klockgether [1994] also used galvonometer-controlled spots but via back-projection onto a curved 
screen, transparent digitizing table, and plexiglass surface respectively. Van den Berg et al. [1987] used 
two rows of 240 LEDs each to display target and response. 2-D arrays of LEDs have also been used to 
indicate step targets in 2-D tracking tasks [Abend et al., 1982; Flash and Hogan, 1985]. 

Target Signals 

Tracking targets cover a spectrum from smoothly-changing (low-bandwidth) targets, such as sinusoidal 
and random, through constant velocity ramp targets, to abrupt changing step targets. 

Sinusoidal Targets 

The periodicity, constancy of task complexity (over cycles), and spectral purity of sine targets make them 
valuable for measurement of within-run changes in performance (e.g., learning, lapses in concentration) 
[Jones and Donaldson, 1981], the study of ability to make use of the periodicity to improve tracking 
performance [Jones and Donaldson, 1989], and the study of the human frequency response [Leist et al., 
1987]. Several other workers have also used sine targets in their tracking tasks [Ferslew et al., 1982; 
Johnson et al, 1996; Miller and Freund, 1980; Notterman et al., 1982; Potvin et al., 1977; Soliveri et al, 
1997]. 

Bloxham et al. [1984] and Frith et al. [1986] extended the use of sinewaves into a 2-D domain by 
having subjects track a moving circle on the screen. 

Random Targets 

These are commonly generated via a sum of sines approach in which a number of harmonically or non- 
harmonically related sinusoids of random phase are superimposed [Baddeley et al., 1986; Barr et al., 
1988; Cassell, 1973; Cooper et al, 1989; Frith et al, 1986; Hufschmidt and Lucking, 1995; Jones et al., 
1993; Miall et al, 1985; Neilson and Neilson, 1980; van den Berg et al, 1987; Watson and Jones, 1998; 
Watson et al., 1997]. If harmonically related, this can effectively give aflat spectrum target out to whatever 
bandwidth is required. Thus, in Jones et al.'s [Jones et al., 1993; Watson and Jones, 1998; Watson et al., 
1997] system the random signal generation program asks the user for the required signal bandwidth and 
then calculates the number of equal amplitude harmonics that must be summed together to give this 
bandwidth, each harmonic being assigned a randomly selected phase from a uniform phase distribution. 
Each target comprises 4096 (2 12 ) or more samples, a duration of at least 68 s (4096 samples/60.34 Hz), 
and a fundamental frequency of 0.0147 Hz (i.e., period of 68 s). By this means it is possible to have 
several different pseudo-random target signals that are non-periodic up to 68 s duration, have flat spectra 
within a user specified bandwidth, no components above this bandwidth, and whose spectra can be 
accurately computed by FFT from any 68 s block of target (or response). Another common approach to 
the generation of random targets is to digitally filter a sequence of pseudo-random numbers [Lynn et al., 
1977; Potvin et al, 1977; Kondraske et al, 1984; van den Berg et al, 1987; Neilson et al., 1993] although 
this method gives less control over the spectral characteristics of the target. B'sser [1984] summed a 
number of these filtered sequences in such a way as to generate a target having an approximate 1/f 
spectrum. Another smooth pursuit target was generated by linking together short segments of sinewaves 
with randomly selected frequencies up to some maximum [Gibson et al., 1987] and was thus effectively 
a hybrid sinusoidal-random target. 
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Ramp Targets 

These have been used in conjunction with sensory gaps of target or response to study predictive tracking 
and ability to execute smooth constant velocity movements in the absence of immediate visual cues in 
normal subjects [Flowers, 1978b] and subjects with cerebellar disorders [Beppuet al., 1987], stroke [Jones 
etal., 1989], and Parkinson's disease [Cooke et al., 1978; Flowers, 1978a]. 

Step Targets 

These have been used in many applications and studies to measure and investigate subjects' abilities to 
predict, program, and execute ballistic (open-loop) movements. To enable this, spatial and temporal 
unpredictability have been incorporated into step tasks in various ways: 

■ Temporal predictability — The time of onset of steps has ranged from (a) explicitly predictable, 
with preview of the stimulus [Day et al., 1984; Jones et al., 1993], (b) implicitly predictable, with 
fixed interval between steps [Abend et al., 1982; Cooke et al., 1978; Deuschl et al., 1996; Flowers, 
1978; Johnson et al., 1996; Potvin et al., 1977], to (c) unpredictable, with intervals between steps 
varied randomly over spans lying somewhere between 1.5 s and 7.0 s [Anderson, 1986; Angel et al., 
1970; Baroni etal, 1984; Flowers, 1976; Gibson etal, 1987; Jones and Donaldson, 1986; Jones 
etal, 1993; Kondraske etal., 1984; Neilson etal, 1995; O'Dwyer and Neilson, 1998; Sheridan 
et al, 1987; Warabi et al, 1986; Watson et al., 1997]. 

■ Amplitude predictability — The amplitude of steps has ranged from (a) explicitly predictable, where 
the endpoint of the step is shown explicitly before it occurs [Abend et al., 1982; Baroni et al., 1984; 
Deuschl etal, 1996; Jones etal, 1993; Sheridan etal., 1987; Watson etal, 1997], (b) implicitly 
predictable, where all steps have the same amplitude [Anderson, 1986; Angel et al., 1970; Cooke 
etal, 1978; Day etal, 1984; Johnson etal., 1996; Kondraske etal, 1984; O'Dwyer and Neilson, 
1998; Potvin et al., 1977] or return-to-centre steps in variable-amplitude step tasks [Flowers, 1976; 
Jones and Donaldson, 1986; Jones etal., 1993], to (c) unpredictable, with between 2 and 
8 randomly distributed amplitudes [Flowers, 1976; Jones and Donaldson, 1986; Warabi et al., 1986; 
Gibson et al, 1987; Sheridan, M.R. et al, 1987; Jones et al, 1993]. 

■ Direction predictability — Previous step tasks have had steps whose direction of steps has ranged 
from (a) all steps explicitly predictable, alternating between right and left [Baroni etal., 1984; 
Cooke et al., 1978; Deuschl et al, 1996; Flowers, 1976; Johnson et al., 1996; Kondraske et al., 1984; 
O'Dwyer and Neilson, 1998; Potvin et al., 1977], or all in one direction (i.e., a series of discontin- 
uous steps) [Sheridan etal., 1987], or between corners of an invisible square [Anderson, 1986], 
or having preview [Abend etal., 1982; Jones etal., 1993], (b) most steps predictable but with 
occasional "surprises" for studying anticipation [Flowers, 1978a] , (c) a combination of unpredict- 
able (outward) and predictable (back- to-center) steps [Angel etal., 1970; Jones and Donaldson, 
1986; Jones et al., 1993; Watson et al., 1997], and (d) all steps unpredictable, with multiple end- 
points [Warabi et al., 1986; Gibson et al., 1987] or resetting between single steps [Day et al., 1984]. 

The three elements of unpredictability can be combined in various ways to generate tasks ranging 
from completely predictable to completely unpredictable (Table 149.1). Several groups have implemented 
several variations of unpredictability both within and between step tracking tasks to investigate the 
possible loss of ability to use predictability to improve performance in, for example, Parkinson's disease 
[Flowers, 1978; Sheridan et al., 1987; Watson et al., 1997]. In addition to unpredictability, other charac- 
teristics can be built into step tasks including explicit target zones [Sheridan et al., 1987] and visual gaps 
in target [Flowers, 1976; Warabi et al., 1986]. 

An example of a 1-D step tracking task possessing full spatial and temporal unpredictability is that of 
Jones and colleagues [Jones and Donaldson, 1986; Jones etal., 1993] (Fig. 149.3a). The task comprises 
32 abrupt steps alternating between displacement from and return to center screen. In the non-preview 
form, spatial unpredictability is present in the outward steps through four randomly distributed ampli- 
tude/direction movements (large and small steps requiring 90 and 22.5 deg. on a steering wheel respec- 
tively, and both to right and left of center) with temporal unpredictability achieved via four randomly 
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TABLE 149.1 Unpredictability in Step Tracking Tasks 



Temporal 
Full Partial None 



Spatial — Amplitude 
Full Partial None 



Spatial — Direction Overall 

Full Partial None Full 



Angel etal. [1970]* 


1-D 


Flowers [1976]* 


1-D 


Potvinetal. [1977] 


1-D 


Cooke etal. [1978] 


1-D 


Flowers [1978a] 


1-D 


Baronietal. [1984] 


1-D 


Day etal. [1984] 


1-D 


Kondraske et al. [1984] 


1-D 


Warabietal. [1986] 


1-D 


Jones and Donaldson 


1-D 


[1986]* 




Gibson etal. [1987] 


1-D 


Sheridan et al. [1987]* 


1-D 


Jones etal. [1993]* 


1-D 


Deuschl, et al. [1996] 


1-D 


Johnson, et al. [1996] 


1-D 


O'Dwyer and Neilson 


1-D 


[1998] 




Abend etal. [1982] 


2-D 


Anderson [1986] 


2-D 


Watson etal. [1997] 


2-D 



* Several authors have several variations of unpredictability within one task or between multiple tasks. 





t 





□ 


□ 


□ 


□ 


□ 

+ 


□ 


□ 


□ 


D 



(a) 



(b) 



FIGURE 149.3 Visual displays for (a) 1-D step tracking task and (b) 2-D step tracking task (bottom-right square 
is current target). 

distributed durations between steps (2.8, 3.4, 4.0, 4.6 s). This task has been used, together with preview 
random tracking, to demonstrate deficits in sensory-motor control in the asymptomatic arm of subjects 
who have had a unilateral stroke [Jones et al., 1989]. 

Watson and Jones [1997] also provide an example of a 2-D step tracking task with spatial and temporal 
unpredictability. In this task the subject must move a cross from within a central starting square to within 
one of eight 10 mm x 10 mm target squares that appear on the screen with temporal and spatial 
unpredictability (Fig. 149.3b). The centers of the eight surrounding targets are positioned at the vertices 
and midway along the perimeter of an imaginary 100 mm x 100 mm square centered on the central 
square. To initiate the task, the subject places the cross within the perimeter of the central target. After 
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1 partition — 3 sectors 




FIGURE 149.4 Section of input waveform in combination tracking in which the target alternates between preview- 
random and nonpreview-step. 

a 2-5 s delay, one of the surrounding blue targets turns green and the subject moves the cross to within 
the green target square as quickly and as accurately as possible. After a further delay, the central target 
turns green indicating onset of the spatially predictive "back-to-center" target. The task, which comprises 
ten outward and ten return targets, was used to show that Parkinsonian subjects perform worse than 
matched controls on all measures of step tracking but are not impaired in their ability to benefit from 
spatial predictability to improve performance. 

Step tasks with explicit target zones, in 1-D [Sheridan et al., 1987] or 2-D [Watson et al., 1997], provide 
the possibility of altering task difficulty by varying the size of the target. On the basis that subjects need 
only aim to get their marker somewhere within target zone (cf. close to center) then, according to Fitt's 
[1954] ratio rule, the difficulty of the primary movement is proportional to log 2 (2A/W), where A is the 
amplitude of the movement and W is the width of the target. 

Combination Targets 

Jones et al. [1986, 1989, 1993] have combined two quite different modes of tracking within a single task. 
Combination tracking involves alternating between preview random and non-preview step tracking over 
lis cycles (Fig. 149.4). Thus, while tracking the random target, the preview signal is abruptly and 
unpredictably replaced by a stationary vertical line at some distance from the random signal, and vice 
versa. Although the steps occur with a fixed foreperiod (as with the step tasks listed above with implicit 
temporal predictability) of 7.3 s, subjects are not informed of this and, irrespective, Weber's law [Fitts 
and Posner, 1967] indicates the accuracy of prediction of steps with such a long pre-stimulus warning is 
very low. Combination tracking allows the study of ability to change motor set [Robertson and Flowers, 
1990] between quite different modes of tracking and is analogous to having to quickly and appropriately 
respond to an unexpected obstacle, such as a child running on to the road, while driving a vehicle. 

Dimensionality 

The number of dimensions of a tracking task usually refers to the number of cartesian coordinates over 
which the target moves, rather than those of the response marker or sensor handle, or the number of 
degrees of freedom of the target or of the upper-limb. Some examples: (1) Most tasks with a 2-D joystick 
sensor or light-pen are 2-D but, if the target moves in the vertical direction only [Neilson and Neilson, 
1980; Kondraske etal, 1984; Miall et al, 1985; Jones et al, 1993], the task is only considered 1-D, 
irrespective of whether the response marker is confined to vertical movements on the screen or not; (2) If 
the target trajectory is a circle [Bloxham et al., 1984], the task is 2-D despite the target having only one 
degree of freedom (i.e., radius r is constant); (3) A pursuit rotor is a 2-D task as it has a target which 
moves in two dimensions (whether cartesian or polar) as well as doing so with two degrees of freedom. 
Watson and Jones [1998] compared random 2-D with 1-D performance but in doing so scaled their 
2-D target down so as to have an average displacement and velocity equal to that of its 1-D horizontal 
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and vertical components. By this means they were able to unequivocally demonstrate that there is poorer 
performance on 2-D tasks and that is due to both the increased dimensionality and increased posi- 
tion/speed demands of an unsealed 2-D task. 

Dual-axis tracking is a variant of 2-D tracking in which the 2-D task comprises two simultaneous 
orthogonal 1-D tasks in which one or more of the target, input device, control dynamics, and on-line 
feedback are different between the two axes. It has been used to investigate mechanisms and characteristics 
of 2-D tracking, such as whether it is a single task or two separate orthogonal tasks [Fracker and Wickens, 
1989; Navon et al., 1984]. 

Tracking Mode 

The two primary modes of tracking — compensatory and pursuit — have been introduced above. The 
majority of tasks are of the pursuit type which is appropriate in that has a more direct parallel with real- 
world sensory-motor tasks than the more artificial compensatory task (e.g., [Vince, 1948; Garvey, 1960; 
Potvin et al., 1977; Miller and Freund, 1980; B'sser, 1984; Barr et al., 1988]) in which the subject is only 
shown the instantaneous value of the error signal. The compensatory mode may be preferentially chosen 
for control-theory modelling due to its simpler set of defining equations [Potvin et al., 1977] . The preview 
task [Welford, 1968; Jones and Donaldson, 1986] is an important variation of pursuit tracking in which 
a still greater correspondence with everyday tasks is achieved. 

Controlled System Dynamics 

It has been well established that subjects can deal satisfactorily with a variety of tracking systems incor- 
porating different control characteristics [Neilson et al., 1995; Poulton, 1974]. That notwithstanding, the 
majority of tracking tasks have a zero-order controlled system in which the position of the response 
marker is proportional to the position of the sensor and the mechanical characteristics of friction, inertial 
mass, and velocity damping are simply those of the input device. Van den Berg et al. [1987] eliminated 
even these by feeding back a force signal from a strain gauge on the tracking handle to the power amplifier 
of a torque motor connected to their sensor. Conversely, Neilson et al. [1993] artificially introduced 
mechanical characteristics into the movement of their response marker by having a linear 2nd-order 
filter as the controlled system; by an appropriate transfer function (H(z) = 0.4060/(1 - 1.0616z _1 + 
0.4610z~ 2 )), they were able to introduce inertial lag and underdamping (resonant peak at 2.0 Hz). Miall 
et al. [1985] introduced an analog delay of 500 ms between their joystick and display so that they could 
study the effect of delayed visual feedback on performance. Soliveri et al. [1997] used both lst-order 
(velocity) and zero-order (position) linear control dynamics to investigate differences in learning between 
parkinsonian and control subjects and between on- and off-medication. Navon etal. [1984] used a 
combination of velocity and acceleration control dynamics in their study on dual-axis tracking. Nonlinear 
transfer functions, such as 2nd-order Volterra (fading memory) nonlinearities, have also been used in 
tracking controlled systems, primarily as a means for investigating adaptive inverse modelling mecha- 
nisms in the brain relating to voluntary movement [Davidson et al., 1999; Sriharan, 1997]. 

Controlled system dynamics can also be changed during a task. In "critical tracking", a novel variation 
of pursuit tracking conceived by Jex [1966], the delay of the controlled system increases during the task. 
There is no external target but instead the subject's own instability acts as an input to an increasingly 
unstable controlled system, Y(s) = K8/(s - 8), in which the level of instability, represented by the root 8 
(= 1/T), is steadily increased during the task until a preset error is exceeded. The task has been described 
as analogous to driving a truck with no brakes down a hill on a winding road [Potvin et al., 1977]. The 
task has been applied clinically by Potvin et al. [1977] and Kondraske et al. [1984] and shown to be a 
reliable measure of small changes in neurological function [Potvin etal., 1977]. Alternatively, "gain- 
change step tracking" [Neilson et al., 1995], in which the gain of the control- display relation is increased 
or decreased without warning, has been used to investigate adaptive mechanisms in the brain [O'Dwyer 
and Neilson, 1998]. 

Having a torque motor as part of the sensor opens up several new possibilities. It can be operated as 
a"torque servo", in which applied torque is independent of position [Kondraske et al., 1984], or a"position 
servo", in which applied torque is proportional to position error (together with velocity damping if 
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desired) [Thomas et al., 1976]. By adding external force perturbations, it is possible to measure and study 
neuromuscular reflexes and limb transfer function (i.e., stiffness, viscosity, and inertia), such as by 
applying constant velocity movements [Kondraske et al., 1984] or pulsatile [van den Berg et al., 1987], 
sinusoidal [Gottlieb et al., 1984], or random [Kearney and Hunter, 1983; van den Berg et al., 1987] force 
perturbations. Alternatively, the torque motor can be used to alter controlled system characteristics in 
tracking tasks for studies and/or improvement of voluntary movement. For example, van den Berg et al. 
[1987] cancelled unwanted controller characteristics. Chelette et al. [1995] used "force reflection" to 
improve tracking performance in both normal subjects and those with spasticity, and Johnson et al. 
[1996] used anti- viscous loading to investigate the cause of poor tracking in patients with Parkinson's 
disease. 

Sensor-Display Compatibility 

It is generally accepted that the level of compatibility between sensor and display in continuous tracking 
tasks influences the accuracy of performance [Neilson and Neilson, 1980]. The perfectly compatible 
sensor is the display marker itself [Poulton, 1974] where the subject holds and moves the response marker 
directly such as with a light-pen in tracking [Neilson and Neilson, 1980], rotary pursuit [Schmidt, 1982; 
Welford, 1968], handle on a two-joint mechanical arm [Abend et al., 1982], or in self-paced 2-D tracing 
tasks [Driscoll, 1975; Stern et al., 1984; Hocherman and Aharon-Peretz, 1994]. Similarly, van den Berg 
et al. [1987] achieve a high sensor-display compatibility by having the LED arrays for target and response 
displayed directly above a horizontally-moving handle. However, the majority of tracking tasks have 
sensors which are quite separate from the response marker displayed on an oscilloscope or computer 
screen. Sensor- display compatibility can be maximized in this case by having the sensor physically close 
to the display, moving in the same direction as the marker, and with a minimum of controlled system 
dynamics (e.g., zero-order). In the case of a joystick in a 2-D task, for example, direct compatibility {Left- 
Right — > Left-Right) is easier than inverse compatibility (Left-Right — > Right-Left), which is easier than 
non-compatibility (Left-Right — > Up-Down). In contrast, fore-aft movements on a joystick appear to 
possess bidirectional compatibility in that Fore-aft — > Up-Down seems as inherently natural as Fore-aft — ¥ 
Down-Up (i.e., no obvious inverse). 

Sensor-display compatibility may not, however, be overly critical to performance. For example, Neilson 
and Neilson [1980] found no decrement in performance on random tracking of overall error scores, 
such as mean absolute error, between a light pen and a 1-D joystick; nevertheless, the latter did result in 
a decrease in gain, an increase in phase lag, and an increase in the non-coherent response component. 
Conversely, normal subjects find incompatible 2-D tracking very difficult to perform, taking up to 4 hours 
of practice to reach a level of performance equal to that seen on prepractice 2-D compatible tracking 
[Neilson et al, 1998]. 

Response Sampling Rates 

Although some workers have manually analyzed tracking data from multichannel analog chart recordings 
[Flowers, 1976; Beppu et al., 1984] or analog processed results [Potvin et al., 1977], the majority have 
used computers, sometimes via a magnetic tape intermediary [Day et al., 1984; Miall et al., 1985], to 
digitize data for automated analyses. Sampling rates used have varied from 10 Hz [Neilson and Neilson, 
1980], through 20 Hz [Cooper et al, 1989; Neilson et al, 1993; Neilson et al., 1998], 28.6 Hz [Jones and 
Donaldson, 1986], 30.2 Hz [Watson and Jones, 1998; Watson et al, 1997], 40 Hz [Frith et al, 1986], 
60 Hz [Viviani and Mounoud, 1990], 60.3 Hz (= screen's vertical interrupt rate) [Jones et al, 1993], 
66.7 Hz [O'Dwyer and Neilson, 1998], 100 Hz (all 2-D tasks) [Abend et al, 1982; Stern et al., 1984; 
Hocherman and Aharon-Peretz, 1994], to as high as 250 Hz [Day et al., 1984]. 

For the most part, a relatively low sampling rate is quite satisfactory for analysis of tracking performance 
as long as the Nyquist criterion is met and there is appropriate analog or digital low-pass filtering to 
prevent aliasing. Spectral analysis indicates that the fastest of voluntary arm movements have no power 
above about 8.7 Hz [Jones and Donaldson, 1986]. This is very similar to the maximal voluntary oscilla- 
tions of the elbow of 4-6 Hz [Neilson, 1972; Leist et al., 1987] and to maximum finger tapping rates of 
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6-7 Hz [Muir et al., 1995]. The sampling rate can be reduced still further if the primary interest is only 
in coherent performance, whose bandwidth is only of the order of 2 Hz for both kinesthetic stimuli 
[Neilson, 1972] and visual stimuli [Leist et al., 1987; Neilson et al, 1993]; that is, performance above 
2 Hz must be open-loop and, hence, learned and preprogrammed [Neilson, 1972]. Thus, from an 
information theory point of view, there is no need to sample tracking performance beyond, say, 20 Hz. 
However, a higher rate may well be justified on the grounds of needing better temporal resolution than 
50 ms for transient or cross-correlation analysis, unless one is prepared to regenerate the signal between 
samples by some form of non-linear interpolation (e.g., sine, spline, polynomial). 

Other Measures 

Several researchers have further extended the information which can be derived from upper-limb tracking 
performance by comparison with other simultaneously recorded biosignals. The most common of these 
is the EMG, particularly integrated EMG due to its close parallel to force of contraction [Neilson, 1972] 
and where the tracking movement is constrained to be around a single joint. The EMG has been used 
together with step tracking for fractionating reaction times into premotor and motor components [Anson, 
1987; Sheridan et al., 1987] and confirmation of open-loop primary movements [Sittig et al., 1985; 
Sheridan et al., 1987]. In smooth tracking, correlation/cross-spectral analysis between the EMG and limb 
position has been used to study limb dynamics [Neilson, 1972; Barr et al., 1988]. 

In contrast, Cooper et al. [1989] measured the EEG at four sites during 2-D random tracking to show 
that slow changes in the EEG (equivalent to the Bereitschaftspotential preceding self-paced voluntary 
movement), particularly at the vertex, are correlated with the absolute velocity of the target. 

Simultaneous measurement of hand and eye movements has been carried out by Warabi et al. [1986] 
and Leist et al. [1987] using EOG to measure horizontal eye movements and by Gibson et al. [1987] who 
used an infra-red limbus reflection technique. Interestingly, Leist et al. [1987] found that ocular pursuit 
and self-paced oscillations were limited to about 1 Hz and 2.2 Hz, respectively, whereas the equivalent 
values for arm movements are 2 Hz and 4-6 Hz, respectively. 

Standard Assessment Procedures 

Having designed and constructed a tracking task or set of tracking tasks with the characteristics necessary 
to allow measurement of the sensory-motor control performance capacities under investigation, it is 
essential that this process be complemented by a well formulated set of standard assessment procedures. 
These must include (a) standard physical setup, in which positioning of subject, sensor, and screen are 
tightly specified and controlled, as well as factors such as screen brightness, room lighting, etc. and 
(b) standardized instructions. The latter are particularly important in tasks where speed-accuracy tradeoff 
[Fitts, 1954; Welford, 1968; Agarwal and Logsdon, 1990] is possible. This applies particularly to step 
tracking in which leaving the tracking strategy completely up to subjects introduces the possibility of 
misinterpretation of differences in performance on certain measures, such as reaction time, risetime, and 
mean absolute error. For example, subjects need to know if it is more important to have the initial 
movement end up close to the target (i.e., emphasis on accuracy of primary movement) or to get within 
the vicinity of the target as soon as possible (i.e., emphasis on speed of primary movement); the latter 
results in greater under/overshooting but also tends to result in lower mean errors). The most common 
approach taken is to stress the importance of both speed and accuracy with an instruction to subjects of 
the form: "Follow the target as fast and as accurately as possible." 

Test and Experimental Protocols 

The design of appropriate test and experimental protocols is also a crucial component of the tracking 
task design process [Roscoe, 1975; Pitrella and Kruger, 1983]. When comparisons are made between 
different subjects, tasks, and/or conditions, careful consideration needs to be given to the paramount 
factors of matching and balancing to minimize the possibility of significant differences being due to some 
bias or confounding variable other than that under investigation. Matching can be achieved between 
experimental and control subjects in an inter-subject design by having average or one-to-one equivalence 
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on age, gender, education, etc., or through an intra-subject design in which the subject acts as his/her 
own control in, say, a study of dominant versus non-dominant arm performance. Balancing is primarily 
needed to offset order effects due to learning which pervade much of sensory-motor performance [Wel- 
ford, 1968; Poulton, 1974; Schmidt, 1982; Frith etal, 1986; Jones et al, 1990]. A study by Jones and 
Donaldson [1989] provides a good example of the application of these principles. Their study, aimed at 
investigating the effect of Parkinson's disease on predictive motor planning, involved 16 Parkinsonian 
subjects and 16 age and sex matched control subjects. These were then divided into 8 subgroups in a 
3-way randomized cross-over design so as to eliminate between- and within-session order effects in 
determining the effect of target type, target preview, and medication on tracking performance. 

149.4 Analysis of Sensory-Motor Control Performance 
Accuracy 

Analyses of raw tracking data can provide performance information which is objective and quantitative 
and which can be divided into two broad classes: 

Measures of Global Accuracy of Performance 

Measures of global (or overall or integrated) sensory-motor control capacities have proven invaluable for: 

• Vocational screening of minimum levels of sensory-motor skills: Tracking tasks, in fact, have their 
origins in this area during World War II when they were used to help screen and train aircraft 
pilots [Poulton, 1974; Welford, 1968]. 

• Clinical screening for sensory-motor deficits (arising from one or more lesions in one or more 
sites in the sensory-motor system): An excellent example of the application of this in clinical 
practice is the provision of objective measures in off-road driving assessment programs [Jones 
et al, 1983; Croft and Jones, 1987]. 

• Clinical and rehabilitation research by measurement of longitudinal changes in sensory-motor 
function: There are many examples of subjects being assessed repeatedly on tracking tasks for 
periods up to 12 or more months. This has been done to quantify recovery following head injury 
[Jones and Donaldson, 1981] and stroke [Lynn etal., 1977; De Souza etal., 1980; Jones and 
Donaldson, 1981; Jones et al., 1990] as well as for studies of learning in tracking performance 
[Poulton, 1974; Jones and Donaldson, 1981; Schmidt, 1982; Frith et al, 1986; Jones et al, 1990]. 
They can also be used to quantify changes due to medication, such as in Parkinson's disease [Baroni 
et al, 1984; Johnson et al., 1996; Jones et al, 1996; Soliveri et al., 1997]. 

Measures of Characteristics of Performance 

Measures of global accuracy of tracking performance can detect and quantify the presence of abnormal 
sensory-motor control performance capacities with considerable sensitivity [Potvin etal., 1977; Jones 
etal., 1989]. Conversely, they are unable to give any indication of which of the many subsystems or 
performance resources in the overall sensory-motor system are, or may be, responsible for the abnormal 
performance. Nor can they provide any particular insight into the underlying neuromuscular control 
mechanisms of normal or abnormal performance. 

Four approaches can be taken to provide information necessary to help identify the sensory-motor 
subsystems and their properties responsible for the characteristics of observed normal and abnormal 
performance: 

• Batteries of neurologic sensory-motor tests — These tests can be used to, at least ideally, isolate 
and quantify the various sensory, motor, cognitive, and integrative functions and subsystems 
involved in sensory-motor control performance as measured globally by, for example, tracking 
tasks. 
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• Functional decomposition — Fractionation of the various performance resources contributing to 
tracking performance. 

• Traditional signal processing approaches — Time domain (ballistic and nonballistic) and frequency 
domain techniques. 

• Graphical analysis — This has primarily been developed for measurement and investigation of 
changes in performance and underlying PRs over time. 

Measures of Global Accuracy of Performance 

The most commonly used measure of global or overall accuracy is the mean absolute error (MAE) [Jones 
and Donaldson, 1986] which indicates the average distance the subject was away from the target irre- 
spective of side; it is also variously called average absolute error [Poulton, 1974], modulus mean error 
[Poulton, 1974], mean rectified error [Neilson and Neilson, 1980], or simply tracking error [Kondraske 
et al., 1984; Behbehani et al., 1988]. In contrast, the mean error, or constant position error, is of little 
value as it simply indicates only the extent to which the response is more on one side of the target than 
the other [Poulton, 1974]. Measures of overall performance which give greater weighting to larger errors 
include mean square error [Neilson et al., 1993], root mean square error [McRuer and Krendel, 1959; 
Navon et al., 1984; O'Dwyer and Neilson, 1995; Poulton, 1974], variance of error [Neilson and Neilson, 
1980], and standard deviation of error [Poulton, 1974]. Relative or normalized error score equivalents 
of these can be calculated by expressing the raw error scores as a percentage of the respective scores 
obtained had subject simply held the response marker stationary at the mean target position [Poulton, 
1974; Neilson and Neilson, 1980; Day et al., 1984]; i.e., no response = 100%. Alternatively, the relative 
root mean square error, defined as the square root of the ratio of the mean square value of the error 
signal to the mean square value of the target signal expressed as a percentage, allows tracking errors to 
be compared across tests using different target signals [Neilson et al., 1998]. 

Overall coherence is an important alternative to the above measures when it is wished to assess the 
similarity between target and response waveforms but there is a substantial delay between them. It 
provides an estimate of the proportion of the response that is correlated with the target over all frequencies 
[O'Dwyer et al, 1996; O'Dwyer and Neilson, 1995]. 

An issue met in viewing error scores from the perspective of Kondraske's [Kondraske, 1995a; Kon- 
draske, 1995b] elemental resource model is the unifying requirement of its associated general systems 
performance theory {GSPT) that all dimensions of performance must be in a form for which a higher 
numerical value indicates a superior performance. Thus scores which state that a smaller score indicates 
a superior performance, including reaction times, movement times, and all error scores, need to be 
transformed into performance scores [Kondraske, 1988]. For example: 

• central response speed = l/(reaction time) 

• information processing speed = l/(8-choice reaction time) 

• movement speed = l/(movement time) 

• tracking accuracy = l/(tracking error) 

As transformation via inversion is non-linear, the distributions of raw error scores and derived per- 
formance will be quite different. This has no effect on ordinal analyses, such as non-parametric statistics, 
but will have some effect on linear analyses, such as parametric statistics, linear regression/correlation, 
etc., and may include improvements due to a possible greater normality of the distributions of derived 
performances. An alternative transformation which would retain a linear relationship with the error 
scores is: Tracking accuracy = 100 - Relative tracking error. However, while this gives a dimension of 
performance with the desired 'bigger is better' characteristic, it also raises the possibility of negative 
values, implying an accuracy worse than zero! — the author can attest to the fact that some subjects do 
indeed end up with error scores worse than the "hands off" score. Irrespective of GSPT, there is no doubt 
that it is beneficial to deal conceptually and analytically with multiple performance measures when all 
measures are consistently defined in terms of "bigger is better". 



© 2000 by CRC Press LLC 



Time on target is a much cruder measure of tracking performance than all of the above but it has been 
used reasonably widely due to it being the result obtained from the pursuit rotor. The crudeness generally 
reflects (a) a lack of spatiotemporal sampling during a task (i.e., simple integration of time on target 
only) preventing the possibility of further analysis of any form, and (b) a task's performance ceiling due 
to the target having a finite zone within which greater accuracy, relative to center of zone, is unrewarded. 
This latter factor can, however, be used to advantage for the case where the investigator wishes to have 
control over the difficulty of a task, to gain, for example, similar levels of task difficulty across subjects 
irrespective of individual ability. This attribute has been used very effectively with 2-D random tracking 
tasks to minimize the confounding effects of major differences in task load between experimental and 
control subjects in dual task studies of impairment of central executive function in subjects with Alz- 
heimer's disease [Baddeley et al., 1986] and Parkinson's disease [Dalrymple-Alford et al., 1994]. 

Measures of Characteristics of Performance 
Batteries of Neurologic Sensory-Motor Tests 

Potvin and associates [Potvin and Tourtellotte, 1975; Potvin et al., 1985], now led by Kondraske et al. 
[Kondraske et al., 1984, 1988], have developed what is by far the most comprehensive battery of tests 
available for quantitative evaluation of neurologic function covering a number of sensory, motor, cog- 
nitive, and sensory-motor functions or performance resources. Similarly, Jones et al. [1989, 1993] have 
developed a battery of component function tests, most of which have been specifically designed to isolate 
and quantify the various performance resources involved in their tracking tasks. There is, therefore, a 
close resemblance between the component and tracking tests so as to maximize the validity of compar- 
isons made between them. 

Functional Decomposition of Tracking Performance 

There are three main approaches whereby tracking performance can be fractionated or decomposed into 
its functional components: sensory, perceptual, cognitive, motor planning, and motor execution. 

The first involves breaking the ballistic response in step tracking into reaction time, movement time, 
overshoot, and settling time [Behbehani et al., 1988; Evarts et al., 1981; Flowers, 1976; Jones and Donald- 
son, 1986; Kondraske et al., 1988] (see "Time domain (ballistic) analysis" below). This allows indirect 
deductions about cognitive, motor planning, and motor execution functions, although the distinction 
between cognitive and motor elements often remains imprecise. 

The second involves calculation of differentials in tracking performance from inter-trial alterations in 
target and/or controlled system dynamics. This has been successfully used to study predictive motor 
planning [Bloxham et al., 1984; Day et al., 1984; Flowers, 1976; Flowers, 1978; Jones and Donaldson, 
1989], acquisition/modification of motor sets [Frith et al., 1986], and reliance on visual feedback [Cooke 
et al., 1978; Flowers, 1976; Frith et al., 1986] in Parkinson's disease. 

The third approach allows a more direct identification of the contribution of certain elemental 
resources to tracking performance during a specific tracking run. For example, by introducing the concept 
of a visuoperceptual buffer- zone, it is possible to estimate the contribution of visuoperceptual function 
to tracking performance [Jones et al., 1996]. This technique has been used to demonstrate that impaired 
visuoperceptual function in Parkinsonian subjects plays only a minor role in their poor tracking perfor- 
mance [Jones et al., 1996] but, conversely, that impaired tracking performance in stutterers is predomi- 
nantly due to reduced dynamic visuospatial perception [A. J. White, R. D. Jones, K. H. C. Lawson, and 
T J. Anderson, unpublished observations]. Furthermore, the visuoperceptual function can itself be 
fractionated into visual acuity, static perception, and dynamic perception [Jones and Donaldson, 1995]. 

In contrast to fractionation of performance on a high level task (e.g., tracking, driving), Kondraske 
et al. have developed techniques for the reverse process. They have shown how their hierarchical elemental 
resource model [Kondraske, 1995a; Kondraske, 1995b] can be used to predict performance on high level 
tasks from performance on a number of lower level tasks [Kondraske, 1987; Kondraske, 1995b; Kondraske 
et al., 1997; Vasta and Kondraske, 1994]. This approach has considerable potential in application areas 



© 2000 by CRC Press LLC 



such as rehabilitation. For example, it could be used in driving assessment programs to predict on-road 
driving ability by off-road measurement of performance on several key lower level tasks pertinent to 
driving, such as reaction time, visuospatial, cognitive, and tracking. 

Time Domain (Non-Ballistic) Analysis of Tracking Performance 

There are several run-averaged biases which can indicate the general form of errors being made, partic- 
ularly when the tracking performance is subnormal. Positive side of target (%) and direction of target (%) 
biases reflect a greater proportion of errors occurring to the right of the target or while the target is 
moving to the right respectively [Jones and Donaldson, 1986] which, if substantial, may indicate the 
presence of some visuoperceptual deficit. Similarly, the side of screen bias (assuming mean target position 
is mid-screen) [Jones and Donaldson, 1986] is identical to the mean error or constant position error. 

Perhaps the single most important measure of performance, other than mean absolute error, for non- 
transient targets is that of the average time delay, or lag, of a subject's response with respect to the target 
signal. The lag is most commonly defined as being the shift, X, corresponding to the peak of the cross- 
correlation function, calculated directly in the time domain or indirectly via the inverse of the cross- 
spectrum in the frequency domain. Although simulation studies indicate that these techniques are at 
least as accurate and as robust to noise/remnants as the alternatives listed below [Watson, 1994], one 
needs to be aware of a bias leading to underestimation of the magnitude of the lag (or lead) due to 
distortion of the standard cross-correlation function, but specifically of the peak towards zero shift. The 
distortion arises due to the varying overlap of two truncated signals (i.e., the target and the response) 
resulting in the multiplication of the cross-correlation function by a triangle (maximum at X = and 
zero at X = NT S , assuming signals of equal length NT S ). This effect is minimal as long as both signals 
have a mean value of zero (i.e., zero d.c). Temporal resolution is another factor deserving consideration. 
If desired, greater resolution than that of the sampling period can be obtained by interpolation of the 
points around the peak of the cross-correlation function by some form of curve fitting (e.g., parabola 
[Jones etal, 1993]). 

An alternative estimate of the lag, which has proven accurate on simulated responses, can be gained 
from the least squares time delay estimation by finding the time shift between the response and target at 
which the mean square error is minimized [Fertner and Sjolund, 1986; Jones etal., 1993]. Another 
approach, phase shift time delay estimation, calculates lag from the gradient of the straight line providing 
a best least-squares to the phase points in the cross-spectrum [Watson et al., 1997]. This technique has, 
however, proven more sensitive to non-correlated remnants in the response than the other procedures 
[Watson, 1994]. 

Several measures used to help characterize within-run variability in performance include variance of 
error [Neilson and Neilson, 1980], standard deviation of error [Poulton, 1974], and inconsistency [Jones 
and Donaldson, 1986]. 

Time Domain (Ballistic) Analysis of Tracking Performance 

Whether or not any of the above nonballistic analyses, evaluation of step tracking performance usually 
involves separate ballistic or transient analysis of each of the step responses. This generally takes the form 
of breaking up each response into three phases (Fig. 149.5): (1) reaction time phase, or the time between 
onset of step stimulus and initiation of movement defined by exit from a visible or invisible reaction 
zone, (2) primary movement phase, or the open-loop ballistic movement made by most normal subjects 
aiming to get within the vicinity of the target as quickly as possible, the end of which is defined as the 
first stationary point, and (3) secondary correction phase, comprising one or more adjustments and the 
remaining time needed to enter and stay within target zone. The step measures from individual steps 
can then be grouped into various step categories to allow evaluation of the effect of step size, spatial 
predictability, arm dominance, etc., on transient performance. 

Accuracy of the primary aimed movement can also be characterized in terms of a constant error and 
a variable error (standard deviation of error), which are considered to be indices of accuracy of central 
motor programming and motor execution respectively [Guiard et al., 1983]. 
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FIGURE 149.5 Transient response analysis. 
Tolerance zones: 

RZ = reaction zone 

TZ = target zone 
Performance parameters: 

RT = reaction time 

PMT = primary movement time 

SCT = secondary correction time 

TET = target entry time 

PV = peak velocity 

PME = primary movement error 

MAE = mean absolute error over a fixed interval following stimulus 

Phase-plane (velocity vs. position) plots provide an alternative means for displaying and examining 
the qualitative characteristics of step tracking responses. In particular, they have proven valuable for rapid 
detection of gross abnormalities [Potvin et al., 1985]. Behbehani et al. [1988] have introduced a novel 
quantitative element to phase-plane analysis by deriving an index of coordination: I c = V^/A, where V m 
is the maximum velocity during an outward and return step and A is the area within the resultant loop 
on the phase-plane plot. 

Frequency Domain Analysis of Tracking Performance 

Cross-correlation and spectral analysis have proven invaluable tools for quantifying the frequency depen- 
dent characteristics of the human subject. The cross-spectral density function, or cross-spectrum S (f), 
can be obtained from the random target x(t) and random response y(t) by taking the Fourier transform 
of the cross-correlation function r (x), i.e., S (f) = F{r (%)}, or in the frequency domain via S (f) = 
X(f)Y(f)*, or by a nonparametric system identification approach (e.g., "spa.m" in MATLAB). The 
cross-spectrum provides estimates of the relative amplitude (i.e., gain) and phase-lag at each frequency. 
Gain, phase, and remnant frequency response curves provide objective measures of pursuit tracking 
behaviour, irrespective of linearity, and are considered a most appropriate "quasi-linear" tool for obtaining 
a quantitative assessment of pursuit tracking behaviour [Neilson and Neilson, 1980]. From the cross- 
spectrum one can also derive the coherence function which gives the proportion of the response signal 
linearly related to the target at each frequency: ( xy 2 = *S ly (f)* 2 /S x (f)S y (f). Lynn et al. [1977] emphasize, 
however, that one must be cognizant of the difficulty representing tracking performance by a quasi-linear 
time-invariant transfer function, especially if the run is of short duration or if the target waveform is of 
limited bandwidth, as the results can be so statistically unreliable as to make description by a 2nd- or 
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3rd-order transfer function quite unrealistic. Van den Berg et al. [1987] chose four parameters to char- 
acterize tracking performance: low-frequency performance via the mean gain of transfer function at the 
3 lowest of 8 frequencies in target signal, high-frequency performance via the frequency at which the 
gain has dropped to less than 0.4, mean delay via shift of peak of cross-correlation function, and remnant 
via power in frequencies introduced by subject relative to total power. Spectral and coherence analysis 
have been used to demonstrate that the human bandwidth is about 2 Hz for both kinaesthetic tracking 
[Neilson, 1972] and visual tracking [Neilson et al., 1993], a much greater relative amplitude of second 
harmonic in the response of cerebellar subjects in sine tracking [Miller and Freund, 1980], a near constant 
lag except at low frequencies in normal subjects [Cassell, 1973], adaptation to time-varying signals 
[Bosser, 1984], 2-D asymmetry in postural steadiness [Milkowski et al., 1993], and that normal subjects 
can form non-dynamic and dynamic inter-limb synergies in a bimanual tracking task [O'Dwyer and 
Neilson, 1995]. 

Graphical Analysis of Tracking Performance 

Most of the of the above analyses give quantitative estimates of some aspect of performance which is 
effectively assumed to be constant over time, other than for random fluctuations. This is frequently not 
the case, especially for more complex sensory-motor tasks such as tracking. Changes in performance 
over time can be divided into two major classes: class I: those for which the underlying PRs remain 
unchanged (these are due to factors such as practice, fatigue, lapses in concentration, lack of practice, 
and changes in task complexity) and class II: those for which one or more underlying PRs have changed 
(these are due to abrupt or gradual alterations at one or more sites in the sensory-motor system and 
include normal changes, such as due to age, and abnormal changes, due to trauma or pathology). 

Studies of class I factors using tracking tasks are complicated most by the intra-run difficulty of a task 
not being constant. Changes in tracking accuracy during a run can be viewed via graphs of target, response, 
and errors [Jones et al., 1993]. The latter is particularly informative for sinusoidal targets for which the 
mean absolute errors can be calculated over consecutive epochs, corresponding to sine-wave cycles, and 
plotted both in a histogram form and as a smoothed version of this [Jones and Donaldson, 1986]. As 
complexity of task is constant over epochs (cf. random pursuit task), the error graph gives an accurate 
measure of a subject's time-dependent spatiotemporal accuracy that is not confounded by changes in 
task difficulty and, therefore, gives a true indication of changes in performance due to factors such as 
learning, fatigue, and lapses in concentration. Attempts by the author to derive an instantaneous or short 
epoch (up to several seconds) function or index of task difficulty which would allow equivalent graphs 
to be generated for random targets were unsuccessful. 

Neilson et al. [1998] devised an alternative procedure for intra-run analysis, termed micromovement 
analysis. This involves segmentation of the X and Y deflections of the response cursor on the basis of 
discontinuities, flat regions, and changes in direction of the response. They used this to identify changes 
in visual-motor coupling during the first 4 min of tracking on a 2-D compatible task following 4 h of 
practice on a 2-D incompatible task. They propose that these changes are evidence of rapid switching 
between different sensory-motor models in the brain. 

By comparison, as long as the task remains unchanged over successive runs, studies of class II factors 
using tracking tasks are complicated most by inter-run learning. Although most learning occurs over the 
first one or two runs or sessions, tracking performance can continue to improve over extended periods 
as evidenced by, for example, significant improvements still being made by normal subjects after nine 
weekly sessions [Jones and Donaldson, 1981]. Consequently, a major difficulty met in the interpretation 
of serial measures of performance following acute brain damage is differentiation of neurologic recovery 
from normal learning. Furthermore, it is not simply a matter of subtracting off the degree of improved 
performance due to learning seen in normal control subjects. Jones et al. [1990] have developed graphical 
analysis techniques which provide for the removal of the learning factor, as much as is possible, and 
which can be applied to generating recovery curves for individual subjects following acute brain damage 
such as stroke. They demonstrated that, for tracking, percentage improvement in performance (PIP) 
graphs give more reliable evidence of neurologic recovery than absolute improvement in performance 
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(PIA) graphs due to the former's greater independence from what are often considerably different absolute 
levels of performance. 

Statistical Analysis 

Parametric statistics (t-test, ANOVA) are by far the most commonly used in studies of sensory-motor/psy- 
chomotor performance due, in large part, to their availability and ability to draw out interactions between 
dependent variables. However, there is also a strong case for the use of non-parametric statistics. For 
example, the Wilcoxon matched-pairs statistic may be preferable for both between-group and within-subject 
comparisons due to its greater robustness over its parametric paired t-test equivalent, with only minimal 
loss of power. This is important due to many sensory-motor measures having very non-Gaussian skewed 
distributions as well as considerably different variances between normal and patient groups. 

Defining Terms 

Basic element of performance (BEP): Defined by a functional unit and a dimension of performance, 

e.g., right elbow flexor + speed. 

Dimension of performance: A basic measure of performance such as speed, range of movement, 
strength, spatial perception, spatiotemporal accuracy. 

Functional unit: A subsystem such as right elbow flexor, left eye, motor memory. 

Performance capacity: The maximal level of performance possible on a particular dimension of per- 
formance. 

Performance resource (PR): One of a pool of elemental resources, from which the entire human is 
modelled [Kondraske, 1995b], and which is available for performing tasks. These resources can be 
subdivided into life sustaining, environmental interface, central processing, and skills domains, 
and have a parallel with basic elements of performances. 

Sensor: [in context of tracking tasks] A device for measuring/transducing a subject's motor output. 

Sensory-motor control: The primary (but not only) performance resource responsible for accuracy of 
movement. 

Spatiotemporal accuracy: The class of accuracy most required by tasks which place considerable 
demand on attainment of simultaneous spatial and temporal accuracy; this refers particularly to 
paced tasks such as tracking, driving, ball games, and video games. 

Tracking task: A laboratory apparatus and associated procedures which have proven one of the most 
versatile means for assessing and studying the human "black-box" sensory-motor system by pro- 
viding a continuous record of a subject's response, via some sensor, to any one of a large number 
of continuous and well- controlled stimulus or target signals. 
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The human brain has been the subject of much scientific research. While a tremendous amount of 
information exists and considerable progress has been made in unlocking its many mysteries, many gaps 
in understanding exist. However, it is not essential to understand in full detail how a given function of 
the brain is meditated in order to accept that the function exists or to understand how to maximally 
isolate it, stress it, and characterize at least selected attributes of its performance quantitatively. In this 
chapter, a systems view of major functional aspects of the brain is used as a basis of discussing methods 
employed to measure what can the termed central processing performance capacities. Central processing 
capacities are distinguished from the information that is processed, in humans, the latter can be viewed 
to represent the contents of memory (e.g. facts, "programs," etc.). Clearly, both the information itself 
and the characteristics of the systems that process it (i.e., the various capacities discussed below) combine 
to realize what are commonly observed as skills, whether perceptual, motor, cognitive, or other. 

Investigations of how humans process information have been performed within various fields including 
psychology, cognitive science, and information theory with the primary motivation being to better 
understand how human information processing works and the factors that influence it. On the basis that 
it provided a rigorous definition for the measurement of information, it can be argued that Shannons 
information theory [Shannon, 1948] has been and continues to be one of the more important develop- 
ments to influence both the science and engineering associated with human information processing. 
Several early attempts to apply it to human information processing [Fitts, 1954; Hick 1952; Hyman, 
1953] have stood the test of time and have provided the basis for subsequent efforts of both researchers 
and practitioners. These works are central to the material presented here. In addition, the work of Wiener 
[1955] is also noteworthy in that it began the process of viewing human and artificial information 
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processing from a common perspective. Analogies between humans and computers have proven to be 
very useful up to certain limits. 

While there is considerable overlap and frequent interchange, the roles of science and engineering are 
different. In the present context, the emphasis is on aspects of the latter. It has been necessary to engineer 
useful measurement tools and processes without complete science to serve a wide variety of purposes 
that have demanded attention in both clinical and nonclinical contexts. Whether purposeful or accidental, 
methods of systems engineering have been incorporated and have proven useful in dealing with the 
complexity of human brain structure and function. While scientific controversies continue to exist, much 
research has contributed to the now common view of separate (in both function and location) processing 
subsystems that make up the whole. Various versions of a general distributed, multiprocessor model of 
human brain function have been popularly sketched [Gazzaniga, 1985; Minsky, 1986; Ornstein, 1986]. 
For example, it is widely known that the occipital lobe of the brain is responsible for visual information 
processing, while other areas have been found to correspond to other functions. It is not possible to do 
justice here to the tremendous scope of work that has been put forth and therefore to the brain itself. 
Nonetheless, this compartmentalistic or systems approach, which stresses major functional systems that 
must exist based on overwhelming empirical evidence, has proven to be useful for explaining many 
normal and pathologic behavioral observations. This approach is essential to the development of mean- 
ingful and practical performance measurement strategies such as those described. 

150.1 Basic Principles 

Many of the past efforts in which performance-related measurements have played a significant role have 
been directed toward basic research. Furthermore, much of this research has been aimed at uncovering 
the general operational frameworks of normal human inforamtion processing and not the measurement 
of performance capacities and their use, either alone or in combination with other capacity metrics, to 
characterize humans of various types (e.g., normal, aged, handicapped, etc.). However, representative 
models and theories provide direction for, and are themselves shaped by, subsequent measurement efforts. 
While there are many principles and basic observations that have some relevance, the scope of material 
presented herein is limited to topics that more specifically support the understanding of human infor- 
mation-processing performance capacity measurement. 

Functional Model of Central Information Processing 

A simplified, although quite robust, model that is useful within the context of human information 
processing is illustrated in Fig. 150.1. With this figure, attention is called to systems, their functions, and 
major interconnectivities. At a functional level that is relatively high within the hierarchy of the human 
central nervous system, the central processing system can be considered to be composed of two types of 
subsystems: (1) information processors and (2) memories. As can be seen from the diagram, information 
from the environment is provided to the information-processing subsystems through human sensor 
subsystems. These not only include the obvious sensors (e.g., the eyes) that receive input from external 
sources but also those specifically designed to provide information regarding the internal environment, 
including proprioception and state of being. The capacity to process information input from multiple 
sources at a conscious level is finite. Constant overload is prevented by limiting the amount of information 
simultaneously received; specific sensory inforamtion of high priority is controlled by what may be termed 
an attention processor. Information associated with a given sensor system is processed by a corresponding 
processor. The model further suggests that there are memory subsystems for each sensory modality in 
bidirectional communications with the associated processor. Thus modality-specific (e.g., visual, audi- 
tory, etc.) information may be referenced and processed with new information. As information is received 
and processed at modality-specific levels, it is combined at a higher level by integrative processors to 
generate situation-specific responses. These may be in the form of musculoskeletal, cognitive, or atten- 
tion-modifying events, as well as any combination of these. 
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FIGURE 150.1 A functional systems-level block diagram of human information processing that facilitates descrip- 
tion of the measurement of subsystem performance capacities. 

General measurement issues of fidelity, validity, and reliability are natural concerns in measurement 
of information-processing performance capacities. Special issues emerge that increase the overall com- 
plexity of the problem when attempting to measure attributes that reflect just the information-processing 
subsystems. Unlike a computer, where one may remove a memory module, place it in a special test unit, 
and determine its capacity, the accessibility of the subsystems within the human brain is such that it is 
impossible to perfectly isolate and directly measure any of the individual components. When measuring 
characteristics of human information-processing subsystems, it is necessary to address measurement 
goals that are similar to those applied to analogous artificial systems within the constraint that human 
sensors and motoric subsystems also must be utilized in the measurement process. 

Performance Capacities 

In considering the performance of human information-processing systems, the resource-based perspec- 
tive represented by the elemental resource model [Kondraske, 2000] is adopted here. This model for 
human performance encompasses all types of human subsystems and is the result of the application of 
a general theoretical framework for system performance to the human system and its subsystems. A 
central idea incorporated in this framework, universal to all types of systems, is that of performance 
capacity. This implies a finite availability of some quantity which thereby limits performance. A general 
two-part approach is used to identify unique performance capacities (e.g., visual information-processor 
speed): (1) identify the system (e.g., visual information processor) and (2) identify the dimension of 
performance (e.g., speed). In this framework, system performance capacities are characterized by avail- 
ability of performance resources along each of the identified dimensions. These performance resources 
are to be distinguished from less rigorously defined general processing resources described by others 
[e.g., Kahneman, 1973, Wickens, 1984]. However, many of the important basic constructs associated with 
the idea of "a resource" are employed in a similar fashion in each of these contexts. For processors, key 
dimensions of performance are speed and accuracy. Processor speed and processor accuracy capacities are 
thus identified. For memory systems, key dimensions of performance are storage capacity, speed (e.g., 
retrieval), and accuracy (e.g., retrieval). Other important attributes of performance capacities are devel- 
oped in other sections of this chapter. 

Many aspects of information-processing performance have been investigated, resulting in discoveries 
that have provided insight into the capacities of subsystems as well as refinement to both system structure 
and function definitions [Lachman et al, 1979]. One of the oldest studies in which the basic concept of 
an information-processing performance capacity was recognized addressed "speed of mental processes" 
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[Donders, 1868]. The basic idea of capacity has been a central topic of interest in human information- 
processing research [Moray, 1967; Posner & Boies, 1971; Schneider & Shiffrin 1977; Shiffrin & Schneider, 

1977]. 

Stimulus-Response Scenario 

The stimulus-response scenario, perhaps most often recognized in association with behavioral psychology, 
has emerged as a fundamental paradigm in psychology experiments [Neel, 1977]. Aside from general 
utility in research, it is also an essential component of strategies for measurement of human information- 
processing performance capacities. A typical example is the well-known reaction-time test, in which the 
maximum speed (and sometimes accuracy) at which information can be processed is of interest. Here, 
a subject is presented a stimulus specific to some sensory modality (e.g., visual, auditory, tactile) and is 
instructed to respond in a prescribed manner (e.g., lift a hand from a switch "as fast as possible" when 
the identified stimulus occurs). This general approach, which has become so popular and useful in 
psychology, also can be recognized as one that has been commonly employed in engineering to charac- 
terize artificial systems (e.g., amplifiers, motors, etc.). A specific known signal (stimulus) is applied to 
the system's input, and the corresponding output (response) is observed. Specified, measurable attributes 
of the output, in combination with the known characteristics of the input, are used to infer various 
characteristics of the system under test. When the focus of interest is the performance limits of processing 
systems, these characteristics include processing speed, processing accuracy, memory storage capacity, etc. 
In performance capacity tests, an important related component is the prestimulus set, or simply the 
way in which the system is "programmed to respond to the stimulus." This usually accomplished by one 
or more components of the instructions given to the subject under test (e.g., respond "as quickly as 
possible," etc.) just prior to the execution of an actual test. 

Measurement of Information (Stimulus Characterization) 

Within a given sensory modality, it is easy to understand that different stimuli place different demands 
or "loads" on information-processing systems. Thus, in order to properly interpret results of performance 
tests, it is necessary to describe the stimulus. While this remains a topic of ongoing research with inherent 
controversies, some useful working constructs are available. At issue is not simply a qualitative description, 
but the measurement of stimulus content (or complexity). Shannon's information theory [1948], which 
teaches how to measure the amount of information associated with a generalized information source, 
has been the primary tool used in these efforts. Thus a stimulus can be characterized in terms of the 
amount of information present in it. Simple stimuli (e.g., a light that is "on" or "off") possess less 
information than complex stimuli (e.g., a computer screen with menus, buttons, etc.). The best successes 
in attempts to quantitatively characterize stimuli have been achieved for simple, discrete stimuli (Hick, 
1952; Hyman, 1953]. From Shannon, the amount of information associated with a given symbol z'selected 
from a source with n such symbols is given by 



= log 2 (Vp,-) 



(150.1) 



where p i is the probability of occurrence of symbol i (within a finite symbol set), and the result has the 
units of "bits." Thus high-probability stimuli contain less information than low-probability stimuli. It is 
a relatively straightforward matter to control the probabilities associated with symbols that serve as stimuli 
in test situations. The application of basic information theory to the characterization of stimuli that are 
more complex (i.e., multiple components, continuous) is challenging both theoretically and practically 
(e.g., large symbol sets, different sets with different probability distributions that must be controlled, etc.) 
If stimuli are not or cannot be characterized robustly in terms of a measure with units of "bits," 
operationally defined units are frequently used (e.g., "items," "chunks," "stimuli," etc.). In addition, as 
many stimulus attributes as possible are identified and quantified, and others are simply "described." 
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This at least maximizes the opportunity for obtaining repeatable measurements. However, the additional 
implication is that the number of "bits per item" or "bits per chunk" is waiting to be delineated and that 
perhaps a conversion could be substituted at a later time. While this state leaves much to be desired from 
a rigorous measurement perspective, it is nonetheless quite common in the evolution of measurement 
for many physical quantities and allows useful work to be conducted. 

Speed-Accuracy Tradeoff 

Fundamental to all human information-processing 
systems and tasks in the so-called speed-accuracy 
tradeoff. This basic limitation can be observed in 
relatively high-level everyday tasks such as reading, 
writing, typing, listening to a lecture, etc. Psychol- 
ogists [see Wicklegren, 1977] have studied this 
tradeoff in many different context. Fitts [1966] 
demonstrated a relationship between measures 
reflecting actual performance (reaction time and 
errors) and incentive-based task goals (i.e., were 
subjects attempting to achieve high speed or high 
accuracy). As shown in Fig. 150.2, relationships that 
have been found suggest an upper limit to the com- 
bination of speed and accuracy available for infor- 
mation-processing tasks. In this figure, original 
reaction-time measures have been transformed by 
simple inversion to obtain true speed measures that 
also conform to the resource construct used in 
modeling performance capacities, as described ear- 
lier. The result then defines a two-dimensional (e.g., speed-accuracy) performance envelope for infor- 
mation-processing systems. The area within this envelope, given that each dimension represents a 
resource-based performance capacity, represents a higher-level or composite performance capacity metric 
(with units of speed x accuracy), analogous in some respects to a gain-bandwidth product or work 
(force x displacement) metric. 
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FIGURE 150.2 A typical information-processing per- 
formance envelope in two-dimensional speed-accuracy 
space. Data used were derived from Wicklegren [1977]. 



Divided Attention and Time Sharing 

An approach sometimes used in a variety of measurement contexts incorporates a dual-task or divided 
attention scenario that is designed to require use of attention resources in two different simultaneously 
executed tasks [e.g., Wickens, 1984]. For example, visual tracking (primary task) accuracy and speed of 
response to an embedded visual stimulus (secondary task) can be measured. Details of the potential time- 
sharing possibilities at play are quite complex [Schneider & Shiffrin, 1977; Shiffrin & Schneider, 1977]. 

In comparison with single-task performance test situations, in which the attention processor may not 
be working at capacity, the additional demand is designed to change (compared with a single-task 
baseline) and sometimes maximizes the stress on attentional resources. Performance on both primary 
and secondary tasks, compared with levels attained when each task is independently performed, can 
provide an indirect measure of capacity associated with attention [Parasuraman & Davies, 1984]. This 
approach has been useful in determining relative differences in demand imposed by two different primary 
tasks by comparison of results from respective tests in which a fixed secondary task is used with different 
primary tasks. Of more direct relevance to the present context, an appropriate secondary task can be 
used to control in part the conditions under which a given performance capacity (defined in a standard 
way and measured in association with the primary task) is measured; for example, visual information- 
processing speed can be measured with no additional attentional load or with several different additional 
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attentional load levels. While it may be possible to rank secondary tasks in terms of the additional load 
presented, there are no known methods to quantify attentional load in absolute terms. 

150.2 General Measurement Paradigms 

Despite the complexity of human information processing, a fairly small number of different measurement 
paradigms have emerged for quantification of the many unique human information-processing perfor- 
mance capacities. This is perhaps in part due to the limited number of system types (e.g., processors and 
memories). A good portion of the observed complexity can therefore be attributed to the number of 
different processors and memories, as well as the vastness and diversity of actual information that human's 
typically possess (i.e., facts, knowledge, skills, etc.). While most of the paradigms described below have 
been used for decades, it is helpful to recognize that they all conform to, or can be made to conform to, 
the more recently proposed [Kondraske, 1987, 1995] generalized strategy for measuring any aspect of 
performance for any human subsystem: (1) maximally isolate first the system of interest, (2) maximally 
isolate the dimension(s) of performance of interest, and (3) maximally stress (tempered by safety con- 
siderations when appropriate) the system along the dimension(s) (Table 150.1). 

Information-Processing Speed 

The paradigm for measuring information-processing speed is commonly referred to as a reaction-time 
test because the elapsed time (i.e., the processing time) between the onset of a given stimulus and the 
occurrence of a prescribed response is the basic measurable quantity. To obtain a processing-speed 
measure, the stimulus content in "bits," "chunks," or simply "stimuli" is divided by the processing time 
to yield measures with units of bits per second, chunks per second, or stimuli per second. Choice of 
stimulus modality isolates a specific sensory processor, whereas choice of a responder (e.g., index finger, 
upper extremity motion caused primarily by shoulder flexors, etc.) isolates a motoric processor associated 
with generation of the response. Much research has addressed the allocation of portions of the total 
processing time to various subsystem components [e.g., Sternberg, 1966]. The system of primary interest 
as presented here is the sensory processor. Processing times involved in the motoric processor involved 
in the response are substantially less in normal systems. However, it is recommended that given processor 
speed capacity be identified not only by the sensor subsystem stress but also by the responder (e.g., visual- 
shoulder flexor information processor speed). This identifies not only the test scenario employed but 
also the complete information path. 

Information-Processing Accuracy and Speed-Accuracy Combinations 

In contrast to the processor speed capacity, which describes limits on information rate, accuracy relates 
to the ability to resolve content. In general, paradigms for tests that include accuracy measures typically 
involve a finite set of symbols with a corresponding set of responses. Various stimulus presentation- 
response scenarios can be used. For example, a stimulus can be randomly selected from the set with the 
subject required to identify the stimulus presented. Alternately, a subset of stimuli can be presented and 
the subject asked to select the response corresponding to the symbol not in the subset (size of subsets 
should be small if it is desired to minimize stress on memory capacities). A key element is that, within 
a poststimulus response window, the subject is forced to select one of the available responses. This allows 
accuracy of the response to be measured in any of a number of ways ranging from relatively simple to 
complex [Green & Swets, 1966; Wicklegren, 1977]. The response window is either relatively open-ended, 
as short as possible, and variable (based on the speed of the subject's response) or fixed at a given length 
of time. When only accuracy is stressed, subjects are instructed to perform as accurately as possible, and 
the stress on speed is minimized (e.g., "take your time"). In a second type of speed-accuracy test, subjects 
are instructed to perform as fast and as accurately as possible, maximally stressing both dimensions, 
while both accuracy and speed of responses are measured. In a variation of this general paradigm, a fixed 
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TABLE 150.1 The Combination of the Human Sensor Subsystem (Determined by Stimulus Source) 
and Responder Isolates Unique Subsystems. Depending on objectives of the test task (as communicated 
to the subject under test) and metrics obtained, many different unique capacities can be measured. 



Capacity 
Types 



Stimulus Type 



Responder* 



Measurements 



Attention 


Visual 


Upper extremity 




Auditory 


Vocal system 


Processor 


Visual 


Upper extremity 


speed 


Visual 


Lower extremity 




Visual 


— 




Auditory 


Upper extremity 




Auditory 


Lower extremity 




Auditory 


Vocal system 




Vibrotactile 


Vocal system 


Processor 


Visual 


Upper extremity 


accuracy 


Visual 


Lower extremity 




Visual 


— 




Auditory 


Upper extremity 




Auditory 


Lowe extremity 




Auditory 


Vocal system 




Vibrotactile 


Vocal system 


Processor 


Visual 


Upper extremity 


speed- 


Visual 


Lower extremeity 


accuracy 


Visual 


— 




Auditory 


Upper extremity 




Auditory 


Lower extremity 




Auditory 


Vocal system 




Vibrotactile 


Vocal system 


Memory 


Visual 


(Same options 


storage 


Visual 


as abovef 


capacity 


Visual 

Auditory 

Auditory 

Auditory 

Vibrotactile 





Length of time that task can be performed to specification and 
accuracy (if appropriate for task) when subject is instructed to 
perform for "as long as possible." 

Inverse of time required to react to stimulus (i.e. stimuli/second) 
when subject is instructed to respond "as quickly as possible." 



Subject is asked to perform task (e.g., typically recognition of 
symbols from set with similar information content across 
symbols) "as accurately as possible" and without stress on speed 
of measures of accuracy. The percentage correct (out of a 
predetermined set size) is often used, although different 
accuracy measures have been proposed [Green & Swets, 1966; 
Wicklegren, 1977], 

Basically a combination of tests that stress speed and accuracy 
capacities individually; both speed and accuracy measures are 
obtained as defined above to get off-axis data points in two- 
dimensional speed accuracy space. Both speed and accuracy can 
be maximally stressed (e.g., by instructing subject to perform 
"as fast and accurately as possible"), or accuracy can be 
measured at different speeds by varying time available for 
responses and stressing accuracy within this constraint. 

Maximum amount of information of type defined by stimulus 
which subject is able to recall. Stimuli usually consist of sets of 
symbols of varying complexity in different sets (e.g., spatially 
distributed lights, alphanumeric characters, words, motions, 
etc.) but similar amount of information per symbol in a given 
set. Units of "bits" are ideal but not often possible if number 
of bits per symbol is unknown. In such cases, units are often 
reported as "symbols," "chunks," or "items." 



*General terms are used in table to illustrate combination options. Very specific definition that controls the motoric 
functional units involved as precisely as possible should be used for any given capacity test. For example, ( 1 ) upper 
extremity (shoulder flexor vs. elbow flexor vs. digit 2 flexor), (2) vocal system (lingua-dental "ta" vs. labial "pa" response), 
etc. 

f Unless motor memory capacities are being tested, typically chosen to minimize stress on motoric processing and, 
compared with processor performance measures, not to isolate a unique memory system. 

response window size is selected to challenge response speed. This provides a known point along the 
speed dimension of performance. In all three cases, measured response accuracy provides the second 
coordinate of a point in the two-dimensional speed- accuracy performance space. The combination of 
different paradigms can be used to determine the speed-accuracy performance envelope (see above). 

Memory Capacity 

As used here, memory capacity refers to the amount of information that can be stored and/or recalled. 
It is well accepted that separate memory exists for different sensory modalities. Also, short-, medium-, 
and long-term memory systems have been identified [Crook et al., 1986]. Thus separate capacities can 
be identified for each. Higher-level capacities also can be identified with processes that utilize memory, 
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such as scanning [Sternberg, 1966]. A comprehensive assessment of memory would require tasks that 
challenge the various memory systems to determine resources available, singly and along different com- 
binations of their various modalities, most directly relevant to the individual's development stage (infant, 
youth, adolescent, young adult, middle-aged adult, or older adult) and reflecting the diversity and depth 
of life experiences of that individual . No such comprehensive batteries are available. As noted by Syndulko 
et al. [1988], "Most typically, a variety of individual tests are utilized that evaluate selected aspects of 
memory systems under artificial conditions that relate best to college students." This general circumstance 
is in part due to the lack of generalized performance measurement strategies for memory. Despite such 
diversity, perhaps the most common variation of the many memory capacity tests involves providing a 
stimuli of the appropriate modality and requiring complete, accurate recall (i.e., a response) after some 
specified period of delay. The response window is selected to be relatively long so that processor speed 
is only minimally stressed. Typically, a test begins with a stimulus that has a low information content 
(i.e., one that should be within capacity limits of the lowest one expects to encounter). If the response 
is correct, the information content of the stimuli is increased, and another trial is administered. It is 
typically assumed that the amount of information added after each successful trial is fairly constant. 
Examples include adding another light to a spatially distributed light pattern, adding another randomly 
selected digit (0 to 9) to a sequence of such digits, etc. By continuing this process of progressively 
increasing the amount of stimulus information until an inaccurate response is obtained, the isolated 
memory capacity is assumed to be maximally stress. The result is simply the amount of information 
stored and recalled, in units of bits, chunks, or items. 

Attention 

Tests for basic attention capacity can be considered to be somewhat analogous to endurance capacity 
tests for neuromuscular systems. Simply put, the length of time over which a specified information- 
processing task can be executed provides a measure of attention capacity (or attention span). The test 
paradigm typically involves presentation at random time intervals of a randomly selected stimulus from 
a finite predefined symbol set for a relatively short, fixed time (e.g., 1 s). Within a predefined response 
window (with a maximum duration that is selected so that processing speed resources are minimally 
stressed), the subject must generate a response corresponding to the stimulus that occurred. Attention 
is maximally stressed by continuing this process until the subject either (1) makes no response within 
the allocated response window or (2) produces an incorrect response (i.e., a response associated with a 
stimulus that was not presented). These criteria thus essentially define the point at which the specified 
task (i.e., recognize stimuli and respond corretly within a generous allotted time) can no longer be 
completed. Once again, choice of stimulus modality isolates a specific sensory processor (e.g., visual, 
auditory, tactile, etc.). Clearly, stimulus complexity is also important. Stress on higher-level cognitive 
resources can be minimized by choice of simple stimuli (e.g. lights, tones, etc.), although motoric 
processors associated with response generation are also involved. However, these are minimally stressed, 
and as long as basic functionality is present, the choice of responder should have little influence on results 
compared with, for example, the influence occurring during processor speed capacity measurements. 

150.3 Measurement Instruments and Procedures 

Given the preceding functional systems model for human information processing (see Fig. 150.1) and 
review of measurement paradigms for information-processing subsystems, a general architecture for 
instruments capable of measuring information-processing performance capacities can be defined 
(Fig. 150.3). It is interesting to observe that this architecture parallels the human inforamtion-processing 
system (i.e., compare Fig. 150.1 and Fig. 150.3). 

Most such instruments are computer-based, allowing for generation of stimuli with different contents, 
presentation of stimuli at precisely timed intervals, measurement of processing times (from which 
processing speeds are derived) with high accuracy (typically, to within 1 ms), and accurate recording of 
responses and determination of their correctness. 
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protocols, review 

of results, etc.) 



FIGURE 150.3 The general architecture for instruments used to measure human information-processing perfor- 
mance capacities. 

A typical desktop computer system fits the general architecture presented in Fig. 150.3, and as such, 
it has been widely exploited "as is" physically with appropriate software to conduct information-process- 
ing tests. This is adequate for tests that specifically focus on higher-level, complex cognitive processing 
tasks due to the longer processing times involved. That is, screen refresh times and keyboard response 
times are smaller fractions of total times measured and contribute less error. However, for measurement 
of more basic performance capacities and to allow testing of subjects who do not possess normal physical 
performance capacities, custom test setups are essential. The general design goal for these stimuli gen- 
erators and response sensors is to minimize the stress on human sensory, motor, and aspects of central 
processing resources that are not the target of the test. For example, the size of standard keyboard switches 
imposes demands on positioning accuracy when, for example, visual information-processing accuracy 
is being tested. According to Fitts' law [Fitts, 1954], the task component involved with hitting the desired 
response key (i.e., the component that occurs after the visual information processing component) requires 
longer processing times (associated with the motoric response task) for smaller targets. Thus, to minimize 
such influences, a relatively large target is suggested. Along the same lines of reasoning, these additional 
design guidelines are important: (1) key stimulus attributes that are not related to information content 
should be well above normal minimum human sensing thresholds (e.g., relatively bright lights that do 
not stress visual sensitivity, large characters or symbols that do not stress visual acuity), and (2) response 
speeds of stimulus generators and response sensors should be much faster than the fastest human response 
speed that is anticipated to be measured. 

Even the many custom devices reported in the literature (as well as commercially available versions 
of some) have been devised primarily to support research into fundamental aspects of inforamtion 
processing and are used in studies that typically involve only healthy young college students. While such 
devices measure the stated or implied capacities well in this population, the often unstated assumption 
regarding other involved performance capacities (i.e., that they are available in "normal" amounts) is 
often severely challenged when the subject under test is a member of a population with impairments 
(e.g., head-injured, multiple sclerosis, Parkinson disease, etc.). 

A representative instrument [Kondraske, 1990; Kondraske et al., 1984] is illustrated in Fig. 150.4. This 
device incorporates 8 LEDs and 15 high-speed touch sensors ("home", A1-A8, B1-B6) and a dedicated 
internal microprocessor to measure an array of different performance capacities associated with infor- 
mation processing. Measures at the most basic hierarchical level (visual inforamtion-processing speed, 
visual-spatial memory capacity, visual attention span, etc.) and intermediate level (coordination tasks 
involving different sets of more basic functional units) are included. 

For example, visual-upper extremity information-processing speed is measured in a paradigm whereby 
the test subject first places his or her hand on the "home" plate. A random time after an audible warning 
cue, one out of eight LEDs (e.g., LED 4) stimulates the subject to respond "as quickly as possible" by 
lifting his or her hand off the home plate and touching the sensor (e.g., A4) associated with the lighted 
LED. Each LED represents a binary (ON of OFF) information source. Since an equiprobable selection 
scheme is employed, the log 2 (number of choices) represents the number of bits of information to be 
processed when determining which LED is lighted. In this example, the number of choices is eight; prior 
to the test, the subject was informed that any one of eight LEDs will be selected. Modes reflecting different 
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FIGURE 150.4 Major components of a specific microprocessor-based instrument for measurement of a selected 
subset of information-processing capacities incorporating high-intensity LEDs for visual stimuli and large-contact- 
area, fast-touch sensors to acquire responses (made by the upper extremity) from the subject under test. (Radius of 
semicircle is 15 cm.) 

information loads (1, 2, 4, or 8 choices) are available. Processing speed (in stimuli per second) is obtained 
as a result from each trial by inverting the total processing time, measured as the time from presentation 
of the stimulus until the subject's hand is lifted off the home plate. By combining time measures from 
two or more tests with different information loads, a more direct measure of true processing speed in 
bits per second is obtained. For example, results from an 8-choice (3-bit load) test and a 1-choice test 
(0-bit load) are combined as follows: (3 bits - bits)/(f 3 - f ), where t n is the reaction time measured for 
an n-bit information load. This method removes transmission and other time delays not associated with 
the primary visual information-processing task. Multiple trials are performed (typically, 3 x number of 
choices, but no fewer than 5 trials for any mode) to improve test-retest reliability in the final measures, 
which are computed by averaging over a fraction (best 80%) of the trials. While even more trials might 
further improve repeatability of final capacity measures in some subjects, the overall test duration must 
be considered in light of the desire (from a test design perspective) to minimize stress on attention 
resources, the availability of which could vary widely over potential test subjects. Careful tradeoffs are 
necessary. 

Visual-spatial memory capacity is measured using the same device. A random sequence of LEDs, 
beginning with a sequence length of "one," is presented to the subject. The subject responds by contracting 
(with the preferred hand) the subset of touch sensors (from Al through A8) corresponding to the LEDs 
incorporated in a given stimulus presentation in the same order as the LEDs sequence was presented. A 
correct response causes a new trial to be initiated automatically with the sequence length increased by 
"one." The maximum length of this sequence (in items) that the subject can repeat without error is used 
as the measure of visual-spatial memory capacity. The best of two trials is used as a final score. The bright 
LEDs, relatively large contact area of the touch sensors (5 x 5cm), and a relatively long poststimulus 
response window (up to 10 s) ensure that other performance resources are minimally stressed. 

A measure of visual attention span is obtained with a task in which random LEDs are lighted for 1 s 
at a random intervals ranging from 4 to 15 s. After each LED is lighted, a response window is entered 
(3 s maximum) during which the subject is required to indicate that a lighted LED was observed by 
contacting the corresponding touch sensor. The task continues until either the wrong touch sensor is 
contacted or no response is received within the defined response window. The time (in seconds) from 
the beginning of the test until an "end of test" criterion is met is used as the measure of visual attention 
performance capacity. The simple stimuli ensure that higher-level cognitive resources are minimally 
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stressed. This allows a wide range of populations to be tested, including young children. Some normal 
subjects can achieve very long attention spans (e.g., hours) with this specific scenario, while some head- 
injured subjects (for example) produce times less than 60 s. 

One example of a more comprehensive, computer-based memory performance test battery is that 
developed over the past 15 years by Crook and colleagues [1986]. This battery is directed at older adults, 
evaluates memory in the visual and auditory modalities at each of the three temporal process levels, and 
utilizes challenges that relate well to the daily activities of the target group. The challenges (i.e., stimulus 
sets) are computer generated and controlled, administered by means of a color graphics terminal, and 
in some cases, played for a videodisk player to provide a realistic a stimulus representation as possible 
outside of using actors in the everyday environment of the subject. The battery takes 60 to 90 minutes 
to administer and provides a profile of scores across the various measurement domains. Extensive age 
and gender norms are available to scale the scores in terms of percentiles. Despite its scope, it does not 
provide nor was it intended to provide, information about modalities other than visual and auditory 
(e.g., tactile, kinesthetic, or olfactory). It also does not address long-term memory storage beyond about 
45 minutes (consider the test administration complexities in doing so). Nonetheless, it represents one of 
the most sophisticated memory performance measurement and assessment systems currently available. 

150.4 Present Limitations 

General methods and associated tools do no exist to quantify the information content of any arbitrary 
stimulus within a given modality (e.g., visual, auditory, etc.). Success has been achieved only for relatively 
simple stimuli (e.g., lights which are on or off, at fixed positions spatially, etc.) It can be said, however, 
that serious attempts to quantify complex stimuli even in one modality appear to be lacking. To the 
practitioner, this prevents the standardization that could occur across many information-processing 
performance capacity metrics. For example, information-processing speed capacities cannot yet be 
expressed universally in terms of bits per seconds. Perhaps more significant, this limitation not only 
impacts looking toward the human and the measurement of intrinsic performance capacities but also 
the analysis of information-processing tasks in quantitative terms. This, in turn, has limited the way in 
which information-processing performance capacity measures can be utilized. Whereas one can measure 
grip strength available in a subject (a neuromuscular capacity) and also the worst-case grip force required 
to support turning a door knob, it is not yet possible to quantify how much visual information one must 
process to reach out and find the door knob. 

As was also noted, one cannot ideally isolate the systems under test. The required involvement of 
human sensory and motoric systems also requires the use of ancillary performance tests that establish 
at least some minimal levels of performance availabilities in these systems (e.g., visual acuity, etc.) in 
order to lend validity to measures of information-processing performance capacities. New imaging 
techniques and advanced electroencephalographic and magnetoencephalographic techniques such as 
brain mapping may offer the basis for new approaches that circumvent in part this limitation. While the 
cost and practicality of using these techniques routinely are likely to be prohibitive for the foreseeable 
future, the use of such techniques in research settings to optimize and establish validity of simpler methods 
can be anticipated. 

Moray [1967] was one of many to use the analogy to data and program storage in a digital computer 
to facilitate understanding of a "limited capacity concept" with regard to human information processing. 
In the material presented, emphasis has been placed on the components analogous to those within the 
realm of computer hardware (i.e., processors and memories). These have been dealt with at their most 
basic operational levels for testing purposes, attempting to intentionally limit stresses on the test subject's 
acquired "skill" (i.e., stored information itself). It has been implied and is argued that knowledge of these 
hardware system capacities is in itself quite useful (e.g., it is helpful to know whether your desktop 
computer's basic information-processing rate is 100 or 400 MHz). However, many tasks that humans 
perform daily that involve information processing require not only processing and storage capacities but 
also the use of unique information acquired in the past (i.e., training or "programs"). These software 
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components, which like processor and memory performance capacities also represent a necessary but 
not sufficient resource for completion of the specific task at hand, present even greater challenges to 
evaluation. If the analogy to computer programs is used (or even musical scores, plans, scripts, etc.), it 
can be observed that even the smallest "programming error" can lead to a failure of the total system in 
accomplishing its intended purpose with the intended degree of performance. Checking the integrity of 
human "programs" can be envisioned to be relatively easy if the desired functionality is in fact found to 
be present. However, if functionality is not present and diagnostic data points to a "software problem," 
a more difficult challenge is at hand. Methods that can be employed to reveal the source of the error in 
the basic information itself (i.e., the contents of memory) can be expected to be just as tedious as checking 
the integrity of code written for digital computers. 

Defining Terms 

Attention: A performance capacity representing the length of time that an information-processing 
system can carry out a prescribed task when maximally stressed to do so. This is analogous to 
endurance in neuromuscular subsystems. 

Dual-task scenario: A test paradigm characteristic by a primary and secondary task and measurement 
of subject performance in both. 

Memory: One of two major subsystem types of the human information-processing system that per- 
forms the function of storage of information for possible later retrieval and use. 

Performance capacity: A quantity in finite availability that is possessed by a system or subsystem, 
drawn on during the execution of tasks, and limits some aspect (e.g., speed, accuracy, etc.) of a 
system's ability to execute tasks; or, the limit of that aspect itself. 

Prestimulus set: The manner in which a human system is programmed to respond to an ensuing 
stimulus. 

Processor: One of two major subsystem types within the human information-processing system that 
performs the general function of processing information. Multiple processors at different hierar- 
chical levels can be identified (e.g., those specific to sensory modalities, integrative processors, etc.). 

Processor accuracy: The aspect, quality, or dimension of performance of a processor that characterizes 
the ability to correctly process information. 

Processor speed: The aspect, quality, or dimension of performance of an information processor that 
characterizes the rate at which information is processed. 

Speed-accuracy tradeoff: A fundamental limit of human information-processing systems at any level 
of abstraction that is most likely due to a more basic limit in channel capacity; i.e., channel capacity 
can be used to achieve more accuracy at the expense of speed or to achieve more speed at the 
expense of accuracy. 
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Further Information 

A particularly informative yet concise historical review of the major perspectives in psychology leading 
up to much of the more recent work involving quantitative measurements of information processing and 
determination of information processing capacities is provided in Theories of Psychology: A Handbook, 
by A. Neel (Schenkman Publishing, Cambridge, Mass., 1977). Human information processing has long 
been and continues to be a major topic of interest of the U.S. military. Considerable in-depth information 
and experimental results are available in the form of technical reports, many of which are unclassified. 
CSERIAC Gateway, published by the Crew System Ergonomics Information Analysis Center (CSERIAC), 
is distributed free of charge and provides timely information on scientific, technical, and engineering 
knowledge. For subscription information, contact AL/CFH/CSERIAC, Bldg. 248, 2255 H St., Wright- 
Patterson Air Force Base, Ohio, 45433. Also, see the CSERIAC website: http://cseriac. flight, wpafb.af.ml/. 
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Also, the North Atlantic Treaty Organization AGAARD (Advisory Group for Aerospace Research and 
Development) Aerospace Medical Panel Working Group 12 has worked on standards for a human 
performance test battery for military environments (much of which addresses information processing) 
and has defined data exchange formats. Documents can be obtained through the National Technical 
Information Service (NTIS), 5285 Port Royal Road, Springfield, Va 22161. 

Information regarding the design, construction, and evaluation of instrumentation (including quan- 
titative attributes of stimuli, etc.) is often embedded in journal articles with titles that emphasize the 
major question of the study (i.e., not the instrumentation). Most appear in many journals associated 
with the fields of psychology, neurology, and neuroscience. 
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As advanced information technologies have been increasingly integrated into communications, manu- 
facturing, transportation, power, and medical systems, among others, the predominant requirements of 
many human-performed tasks have correspondingly shifted from manual to mental (i.e., information- 
processing) activities. Examples of tasks reflecting this shift include supervisory control of complex 
systems and operating advanced technology equipment. As a result, data, models, and metrics for 
describing and understanding mental tasks and the mental capabilities of persons performing them have 
become very important to engineering decisions making. Task analysis is a generic term that represents 
processes and methods for applying task data to improve engineering decisions. This chapter presents a 
process and associated methods for analyzing the mental components of tasks. It further describes the 
role of mental task analysis along with human performance and machine models (as applicable) in applied 
decision making and presents a quantitative, performance-based framework for performing a unified 
human-machine-task analysis. This chapter is not intended as a review. Selected techniques are presented 
to familiarize the reader with different major approaches that are currently applied to solving real-world 
engineering problems. 
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151.1 Fundamentals 

The goals of task analysis are to ( 1 ) decompose high-level tasks into their constituent, mutually exclusive, 
lower-level tasks; (2) describe intertask relationships and dependencies; and (3) define each task's goals, 
procedures, and performance and skill requirements. A major thesis of this chapter is that using task 
analysis for applied decisions will be improved to the degree that (1) the performance requirements are 
quantified and (2) they are quantified in a form that is consistent with quantitative models of human 
performance. A task is a goal-directed, procedural activity that is defined with regard to the resources 
(i.e., capabilities) that performing agents (i.e., humans or machines) must possess to accomplish the goal. 
High-level tasks refer to complex, composite activities that entail performing many simpler but interrelated 
tasks in meaningful ways. Mental components refer to the perceptual and cognitive information-processing 
activities and resources that are necessary or sufficient for accomplishing tasks. 

Figure 151.1 depicts the mental task- analysis process in four major steps. Before any data are collected, 
required data should be defined on the basis of the characteristics of the system and the applied decision. 
Selection and use of data-collection methods, task models, and task metrics are driven by the system and 
applied decision. 
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FIGURE 151.1 Mental task analysis is a four-step process that is used in combination with models of human and 
machine performance, as applicable, to make applied decisions. 
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Figure 151.1 illustrates how task analysis is used with human and machine models as a basis for solving 
applied problems. Task data by themselves are of limited use in solving applied problems because the 
task data address only one set of the total system variables that need to be considered. To be applied, 
task data must be analyzed with human performance and machine performance data (i.e., capacities), 
as applicable to the applied problem (i.e., not all problems will involve machines.). This combined human- 
machine-task system analysis requires that the analysis of each component is accomplished with consistent 
constructs. Further, the quality and precision of applied decisions are enhanced to the degree that these 
constructs provide for quantitative treatment. A framework for human-machine-task decision making 
(described below) combines the use of task decomposition and resource requirement modeling tech- 
niques with human performance and machine performance models. 

151.2 Mental Task Analysis: Process and Methods 
Step 1: Data Requirements Definition 

In Step 1, the task data required for the analysis are identified and defined. There are many different 
dimensions and properties of tasks that can be analyzed. The functional scope of the human-machine- 
task system being analyzed, the point in the system design process at which the analysis is conducted, 
and the applied purpose of the analysis all contribute to determining what data are needed. In Fig. 151.1, 
the functional scope of the real-world, high-level mental task being performed by a human-machine 
system is represented generically. Identifying the task is necessary to bound the analysis. Task analysis 
can be applied to existing systems that perform well-defined tasks or to proposed systems for which tasks 
are in the process of being defined. At different points in the analysis, different decisions are applicable. 
When task analysis is used for system design decisions, different types of task data can be collected as 
the fidelity of system prototypes progresses and operational system become available. These differences 
are apparent by examining three points in the system engineering process at which task analysis is of 
particular utility. 

Predesign. After the functional and performance requirements for a system have been established, and 
analysis of major functions, together with data about technology availability and human performance, 
can be used to (1) define and allocate tasks, (2) develop procedures to accomplish the tasks, and 
(3) identify the needed human-machine interfaces. Note that the description of a task depends on the 
technology assumed. For example, a writing task would be accomplished differently with paper and 
pencil, a manual typewriter, or a computer with word processing software. 

Design Phase. During the design phase, task scenarios, prototypes, and simulations can be used to detail 
and refine the task procedures, task requirements, allocation decisions, and human-machine interface 
design. 

Postdevelopment. Analyzing well-defined tasks performed by an operational system provides a detailed 
description of task procedures and resource requirements that can be used to improve task performance 
and the system design. 

The applied purpose of the task analysis has the greatest impact on what type of data are needed. 
Applications vary greatly and include design and nondesign decisions that may or may not involve 
machines. Applications include, but are not limited to, those described below. 

Human-Machine Task Allocation. Allocation of tasks represents a major systems engineering decision. 
Task analysis is used to identify task load and performance requirements and contributes to assessing 
workload. Workload assessment provides a basis for assigning task responsibilities to humans and 
machines such that performance requirements can be satisfied and workload levels are not excessively 
low or high. These assessments augment qualitative allocation strategies (e.g., Fitts' lists) in which tasks 
are assigned on judgments of the relative performance capacities of humans and machines on generic 
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tasks. Allocating tasks requires that the resource demands associated with each lower-level task are assessed 
and that the human and machine resources available can be matched against these at comparable levels 
of analysis. 

Task-Technology Tradeoffs. Task analyses may be performed for systems that have yet to be developed 
but for which functional and performance requirements have been defined. In these cases, the analysis 
will be based on the denned requirements combined with data on human performance capacities and 
available machine technology. These analyses can be used to assess the workload, technology, and per- 
formance tradeoffs expected with different human-machine system configurations. 

Human-Machine Interface Design. Mental task analysis provides not only a basis for allocating task 
responsibilities to humans and machines but also can be used to define, for example, the spatial aspects 
of display and control requirements for the interface between humans and machines. 

Job Design. The roles humans satisfy in complex systems and organizations (e.g., manager, operator, 
maintainer) are generally designed to include many mental tasks. By identifying and describing the mental 
skills and resources required for task performance, task analysis provides a basis for optimizing job 
performance in new and existing systems. 

Personnel Selection and Job Fitness. Mental task analysis yields a specification of information-processing 
resources (e.g., speed, accuracy) required for task performance. This specification can be used in com- 
bination with measurements of the available resources individuals possess. Assessing performance 
resource sufficiency provides a basis for selecting personnel and determining if injured, rehabilitated 
personnel are sufficiently recovered to safely return to work. 

Training Systems Development. Task analysis identifies and characterizes the skills, knowledge, and 
mental capabilities that are necessary or sufficient for task performance. More specifically, it can be used 
to characterize differences in these abilities and in task performance between experts and novices. These 
differences serve as a basis for designing effective personnel training systems. 
These applications reduce to answering the following types of questions: 

• What is the best way to structure the task procedures? 

• Does a specific individual have the needed mental resources to perform the task? 

• What knowledge or skills does an individual need to acquire to perform the task with a given level 
of technology? 

■ Can the task be modified to accommodate the special needs of individuals? 

If machines are involved: 

• What human and machine combinations are most capable of performing the task? 

• How should human and machine components interact in performing the task? 

To answer these applied questions, the following data must be defined and described: 

• The demands imposed on mental capacities by tasks (e.g., knowledge, required mental actions, 
speed and accuracy in performance actions). 

• The mental capacities of humans in the system (e.g. knowledge, learning, processing speed, 
processing accuracy). 

If machines are involved: 

• The information-processing capacities of machine components (e.g., processing speed, algorithms, 
and data). 

■ The human-machine interaction demands imposed by the task. 

• The human-machine interaction capacities of the humans in the system. 

• The human-machine interaction capacities of applicable machines. 
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TABLE 151.1 Task Data- Acquisition Techniques 



Technique 

Knowledge 
acquisition 

Protocol analysis 



Instrumented 

human-Interface 
Observation 



Data Collected 

Knowledge required to perform 
tasks 

Procedures 
Strategies 
Knowledge 
Overt human action 
(e.g., keys pressed) 
Overt machine behaviors 
Overt human behaviors 
Situation state and state changes 



Requirements 

Task performers 

Interaction instruments, prototypes and 

simulations (dependent on techniques used) 
Task performers 
Task scenarios 

Task performers 

Applicable hardware and software 

Task performers 

Prototypes or developed systems 

Monitoring Instruments 



Application Phase 



All 



Design 
Postdevelopment 

Design 

Postdevelopment 
Design 
Postdevelopment 



Step 2: Data Collection 

Once the required task data are denned, they need to be collected. Table 151.1 lists four generic categories 
of techniques for acquiring data about mental tasks. The categories are not mutually exclusive. For 
example, all techniques use some form of observation, and instrumented human interface techniques 
and protocol analysis are used as knowledge-acquisition techniques. Each generic category is described 
in more detail below. 

Knowledge-Acquisition Techniques. Knowledge acquisition refers to the process of eliciting from persons 
the knowledge they possess. This process was originally applied to elicit knowledge from experts in the 
development of knowledge-based systems. However, knowledge-acquisition techniques have been used 
more generally as a means of analyzing the cognitive components of tasks [Lehto et al., 1992]. Knowledge- 
acquisition methods include those listed below: 

• Various interviewing techniques 

• Automatic inductive methods that infer and specify knowledge 

• Protocol analysis [Ericsson & Simon, 1984] 

■ Psychological scaling [Cooke & McDonald, 1988] 

■ Repertory grids from personal construct psychology theory [Boose, 1985] 

■ Observational techniques [Boy & Nuss, 1988] 

Protocol Analysis. Protocol analysis refers to various techniques for collecting data in the form of verbal 
reports. In this methodology, persons generate thinking-aloud protocols of the unobservable (i.e., mental) 
activities they perform while accomplishing a task [Ericsson & Simon, 1984] . The verbal reports represent 
behavioral descriptions from which task strategies, requirements, procedures, and problem areas can be 
derived. Protocols provide data on the sequence in which tasks are performed and thus can be used to 
identify the strategies used by humans in accomplishing tasks that can be accomplished in multiple ways. 
There are several issues that need to be considered about the validity of using verbal reports as data and 
inferring thought processes from them. Ericsson and Simon [1984] address these concerns by demon- 
strating how verbal protocols are distinct from retrospective responses and classic introspection by trained 
observers and detailing the methods for collecting and analyzing protocols. 

Instrumented Human-Interface Techniques. These techniques are used to passively and unobtrusively 
collect objective data on task performance and the procedures by instrumenting human-machine inter- 
faces. These technique are used to record the timing, sequences, and frequencies of explicit actions. For 
a computer system, examples are keystrokes, button pushes, and mouse movements. In an aircraft, 
examples are steering and throttle commands, as well as any other actions controlling any onboard 
systems. By themselves, the data associated with actions can be used to infer workload and to study the 
frequency of actions, their temporal relationships, and human error. Although these data are objective, 
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they are restricted to overt responses and, by themselves, do not provide a basis for analyzing mental 
components. Modeling the mental task components involves correlating these action data with data about 
the situations and system behavior that elicited the actions and the changes in the situations and system 
state caused by the actions, i.e., knowledge of context is vital. 

Observation Techniques. Observation is a generic term referring to many techniques that can use a wide 
range of instruments to monitor situations and the behavior of systems and task performers. Two types 
of observation are distinguished. In direct observation, a human observer is with the task performer as 
he or she performs the task. In indirect observation, a human observer monitors the task performer 
remotely with audio and visual monitoring systems. Indirect observation can be done as the task is being 
performed or after the fact by using a recording. Observation techniques may be unobtrusive or invasive, 
record subjective or objective data, and may be used in controlled testing or real-world environments. 

Steps 3 and 4: Data Modeling and Metric Computation 

Steps 3 and 4 are the actual analysis steps. In Step 3, the task data are modeled in accordance with a 
selected modeling method. This task model is employed in Step 4 to compute various task metrics that 
are useful in solving the applied problem. Table 151.2 lists seven major approaches for analyzing the 
mental components of tasks. Each has been shown to be useful in applied decision making, and each is 
described below. For an extended review of techniques, see Linton et al. [1989]. 

Task-Timeline Analysis. Timeline analysis organizes and relates low-level tasks on the basis of their time 
of onset, duration, and concurrences in the context of performing a high-level task. Any task taxonomy 
or decomposition model can be used to define the lower-level tasks that will be mapped to the high-level 
task timeline. Generally, the task-timeline mapping is constructed by decomposing a high-level task in 
terms of specific scenarios because different scenarios can have very different task and time requirements. 

This technique is useful for analyzing workload, allocating tasks, and determining human and system 
performance requirements. When the resource requirements for each lower-level task are mapped to the 
timeline, the result is a dynamic task-demand profile. Wickens [1984] discusses two limitations of timeline 
analysis. First, the technique often does not account for time-sharing capabilities in which the perfor- 
mance of multiple tasks can be accomplished efficiently when resource demands are low. Second, the 
technique is most effective when analyzing forced-pace activities. The more freedom the human has to 
schedule activities, the harder it is to construct a useful timeline. 



TABLE 151.2 Analytic Techniques for Modeling Tasks 



Technique 



Modeled Dimensions/Characteristics 



Analysis 



Task time-line analysis 

GOMS, NGOMSL, and 
keystroke-level analysis 



Task- action grammar 



Fleishman and 

Quaintance's cognitive 

task taxonomy 
Rasmussen's cognitive task 

analysis 
Knowledge-representation 

techniques 
Elemental resource model 



Temporal onset, duration, and concurrences 

Task and system knowledge required by task performers 

Overt low-level actions 

Procedures 

Temporal relationships 

Task and system knowledge required by task performers 

Observable low-level actions 

Action sequences 

Identifies twenty-three cognitive abilities underlying task 

performance. Tasks are modeled in terms of required 

abilities. 
Generic activities, knowledge states, and dependencies in 

cognitive task performance 
Task and System knowledge required by task performers 

Mental resources needed to perform tasks 
Resources are defined in terms of function and 
performance requirements 



Taskload estimates 
Task complexity 
Task complexity 
Knowledge requirements 
Performance estimates from 
production rule simulation 
Task complexity 
Task consistency 
Knowledge requirements 
Cognitive ability requirements 
Performance requirements 

Knowledge requirements 
Task complexity 
Knowledge requirements 

Functional requirements 
Performance requirements 
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GOMS and NGOMSL Analysis. GOMS, an acronym for goals, operators, methods, and selection rules 
[Card et al., 1983, 1986], is an analytic technique for modeling (1) the knowledge about a task and a 
machine that a task performer must possess and (2) the operations that a task agent must execute in 
order to accomplish the task using the machine. It is based on work by Newell and Simon [1972] in 
human problem solving. The model was extended by Kieras [1988] into NGOMSL (for natural GOMS 
knguage), which affords a more detailed analysis and specification of tasks. In both GOMS and NGOMSL 
goals represent what a human is trying to accomplish. Operators are elementary perceptual, cognitive, 
or motor acts that may be observable or unobservable. Methods are sequences of operations that accom- 
plish a goal. Selection rules are criteria used to select one method to apply when many methods are 
available. 

The NGOMSL analysis uses this model to predict quantitative measures of the complexity of the 
knowledge required to perform a task with a system. These measures include learning time, amount of 
transfer, and task execution time. To accomplish this, methods (i.e., procedural knowledge) are repre- 
sented in the form of production rules. These are IF-THEN rules that describe knowledge in approxi- 
mately equal-sized units. The number of rules needed to represent a task provides a measure of the 
complexity of that task. Time predictions are obtained by modeling the production rules in a computer 
program that stimulates the tasks being analyzed. 

The keystroke-level task model is an abbreviated application of the GOMS analysis [Card et al., 1983] 
that models only the overt actions (i.e., the observable operators and methods) taken by the task 
performer. In this way it does not require the inferences about mental processes required by the full 
GOMS approach. The model was developed with regard to a computer system and defines six operators: 
a keystroke or mouse button push, pointing using a mouse, moving the mouse, moving hands between 
the mouse and keyboard, mental preparation, and system response. Tasks are modeled by identifying the 
sequence of operators needed to perform the task. 

Task-Action Grammar Analysis. Task-action grammar (TAG) [Payne & Green, 1986; Schiele & Green, 
1990] is a formalism for modeling the knowledge required to perform simple tasks with a given system. 
Task knowledge is represented as sequences of simple acts. Similarities in the syntactic representation of 
simple tasks are used to derive higher-level schemas that apply across tasks possessing a family resem- 
blance. The ability to derive schemas is used to assess the consistency of the tasks performed with a given 
user interface and has implications for ease of learning and overall usability. In similarity with NGOMSL, 
the number of grammatical rules and schemas provides a quantitative measure of task and interface 
complexity. 

Rasmussen's Cognitive Task Analysis. Rasmussen [1986] and Rasmussen ad Goodstein [1988] define a 
framework for cognitive task analysis that was derived from an analysis of human-machine interaction 
in process-control tasks. One element of this framework is a schematic structure for describing cognitive 
tasks and information requirements. The schematic structure includes a sequence of (1) information- 
processing activities, (2) states of knowledge resulting from performing the information-processing 
activities, and (3) conditions (related to the experience of the human performing the task) that allow the 
processing sequence to be shortened. This structure constitutes a generic processing task model into 
which specific task data can be mapped. The information processing activities are 

• Detecting a need for action 

• Observing relevant data 

• Identifying the present system state 

• Interpreting the present system state 

• Evaluating possible consequences of this state with reference to a goal 

• Defining a desired target state 

• Formulating procedures to achieve the desired state 

• Executing the actions required by the procedures 
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Fleishman and Quaintance's Cognitive Task Taxonty. Task taxonomies classify tasks on the basis of various 
characteristics and properties. As such, they can serve a useful purpose in modeling tasks. Tasks of the 
same taxonomic classification will have similar characteristics and similar requirements. This inference 
provides a basis for efficiently creating models for analyzed tasks. Fleishman and Quaintance [1984] 
provide an extensive review of taxonomies of human performance and identify four classifications: 
(1) behavior descriptions, (2) behavior requirements, (3) ability requirements, and (4) task characteris- 
tics. They further detail a taxonomy of human abilities that identifies 23 cognitive factors underlying 
task performance. These factors were derived from an analysis of empirical performance data collected 
from a large number of diversified tasks and individuals. The factors constitute a structure for modeling 
cognitive tasks. That is, these factors can be mapped onto specific tasks and thus specify the cognitive 
abilities needed to perform the task. 

Knowledge-Representation Techniques. Knowledge representation is a generic term that refers to several 
formalisms for modeling knowledge about tasks, systems, and the physical environment. Major repre- 
sentation schemes include production rules, frames, scripts, and cases. Production rules represent knowl- 
edge as a set of condition-consequence (i.e., IF-THEN) associations. Frames, scripts,, and cases are 
different forms of schemas that represent knowledge as typical stereotypical chunks. Rules and scripts 
generally represent procedural knowledge, while frames and cases represent declarative knowledge. By 
formally representing data acquired from knowledge-acquisition techniques, a model of the procedural 
and declarative knowledge required to perform a task is created. 

Elemental Resource Analysis of Tasks. The elemental resource model (ERM) [Kondraske, 1995] defines 
human performance in terms of the performance and skill resources (capacities) that a human or system 
possesses and which can be brought to bear in performing tasks. Elemental resources are defined at a low- 
level such that many of them may be required to perform a high-level task. In addition, intermediate- 
level resources are defined to provide task-analysis targets at a less granular (i.e., higher hierarchical) 
level. In this model, a resource is defined as a paired construct consisting of a functional capability (e.g., 
visual word recognition) combined with a performance capability (e.g., recognition speed). Thus, even 
though the functional capability is the same, visual-word-recognition speed and visual-word-recognition 
accuracy are defined as two distinct resources because they model two distinct dimensions of perfor- 
mance. This model can be applied in task analysis by decomposing the mental components of a task in 
terms of the resources that a human or system performing them is required to process. 

151.3 Models of Human Mental Processing and Performance 

The analysis of task data by themselves is of limited use in solving applied problems because task data 
address only one set of the system variable that need to be considered. For example, a task-timeline 
analysis provides a model from which a measure of task demand can be computed. However, this analysis 
does not indicate whether the taskload is acceptable or how it could be optimally distributed among 
humans and machines. Making these judgments requires consideration of human and machine capabil- 
ities. Similarly, using the NGOMSL and TAG models, a measure of task complexity can be computed, 
indicating that one task or procedure is more complex than another. However, these judgments are based 
solely on the number of task steps and the amount of information needed to perform each step and do 
not consider how difficult each step is for a human or machine to perform. Using these task models 
without incorporating human and machine models forces a general assumption that task difficulty is a 
linear function of complexity (modeled as more task steps or information requirements). Even with this 
assumption, the task model by itself provides no indication of the acceptability of the task complexity 
or expected human or machine performance. Both these judgments require considerations of human 
and machine processing and performance models. 

Because of these issues, task analysis is used in combination with human and machine models to make 
applied decisions. Using task, human, and machine models in concert requires that the mental compo- 
nents being modeled are described in consistent and compatible terms. Further, the quality and precision 
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TABLE 151.3 Models of Human Mental Processes and Performance 



Model 

Multiple resource model 
of attention and 
W/INDEX 



Model human processor 
(HMP) 



Keystroke-level 
performance model 



Skills, rules, and 
knowledge model 



Technique for human 
error-rate prediction 
(THERP) 

Elemental resource model 
of human and system 
performance (ERM) 



Description 

Models attention as a collection of separable 
processing resources. W/INDEX is a computer 
model of workload that adopts the multiple 
resource construct and accounts for resource 
conflict from concurrent processing. 
Models human information processing in terms 
of processing subsystems, memories, and 
performance parameters. 



Models the performance (i.e., time) estimated 
for executing each of six defined actions, from 
which overall estimated task completion times 
can be computed. 

Models human information processing in terms 
of three levels of behavioral control: skill- 
based, rule-based, and knowledge-based. 



Structure methodology for modeling human 

error and task completion in terms of 

probabilities. 
Models human and system performance in 

terms of basic elements of performance and 

elemental resources 



Modeled Components 

Identifies separable processing resources 
Time-sharing ability 
Automatic processing 



Performance is modeled by assigning time 
values to the model's parameters 

Perceptual, cognitive, motor processing 
subsystems 

Long-term and working memories 

Performance is modeled by assigning time 
values to each of six modeled actions. 



Models the processes and requirements for 
each level of behavior. 

Can be used with quantitative models of 
human performance to estimate task time 
and errors. 

Predicts human error and task completion 
probabilities using a human performance 
database and expert judgments 

Performance is modeled in terms of 
elemental units that can be combined to 
provide estimates of high-level task 
performance and completability. 



of the decision will be enhanced to the degree that the task model describes requirements quantitatively, 
while the human and machine models describe their capabilities quantitatively. 

Five models of human mental processes and performance are listed in Table 151.3 and discussed below. 
These were selected because they ( 1 ) describe concepts that are generally applicable to a wide range of 
information-processing tasks and (2) are engineering- oriented models for use on applied problems. 
Connectionist (e.g., neural network) models [Rumelhart & McClelland, 1986; Schneider & Detweiller, 
1987] and the optimal control model [Barron & Kleinman, 1969], which represent major but different 
approaches to modeling human processing, are intentionally not included due to space constraints. 

Multiple-Resource Model of Attention and W/INDEX 

In psychology, resource models of attention attribute the capability to perform the mental components 
of tasks to the capacity of applicable and available processing resources, time-sharing skills, and automatic 
processes. Attention is modeled as either a single processing resource or multiple, separable processing 
resources. In these models, the processing resource or resources afford a capacity to perform information- 
processing tasks (e.g., perception, cognition, and response selection) but have limited availability. The 
definition of resources here is different from that used in the ERM, which formally includes a specific 
performance dimension as part of the definition of a performance resource, which is distinguished from 
the system itself. 

Norman and Bobrow [1975] adopted a single-resource model to examine the theoretical relationships 
between the application of the resource and performance on a single and multiple tasks. For a given task, 
performance may be limited by insufficient processing resources (resource-limited) or by insufficient 
data (data-limited). In multitask situations, a human performing the tasks is assumed to be capable of 
controlling the allocation of a portion of the processing resource to each task. Different allocation 
strategies result in different relative performance on each task. The relationship between allocation 
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strategies and relative performance is plotted on a graph called the performance operating characteristic. 
Automated processing (e.g., skill-based) is assumed not to require processing resources. 

The multiple-resource model of attention [Navon & Gopher, 1979; Wickens, 1980, 1984] extends and 
specifies the resource concept by identifying several separate processing resources that are exclusive to a 
subset of activities. Results of numerous dual-task studies were used to distinguish separable resources. 
The model identifies different resource structures for verbal and auditory modalities, spatial and verbal 
codes, manual and vocal responses, and between (1) selection and execution of responses and 
(2) perceptual and central processing stages. 

W/INDEX, for workload index [North & Riley, 1989], is a computer model for predicting operator 
workload that uses the multiple-resource construct and accounts for time-sharing skills. A task-timeline 
combined with a design model that differentiates different interface channels (e.g., visual or auditory, 
manual or verbal) is used to compile an interface activity matrix. This matrix specifies, by subjective 
ratings, the amount of attention the task demands for each channel. A conflict matrix is generated with 
respect to each design channel that specifies a penalty resulting when the operator must attend simulta- 
neously to multiple channels. The W/INDEX algorithm is then applied to these data, resulting in a 
workload profile. 

The Model Human Processor (MHP) 

MHP [Card et al., 1983, 1986] defines three interacting processing subsystems: perceptual, cognitive, and 
motor. The perceptual and cognitive subsystems include memories. Within each subsystem, processing 
parameters (e.g., perceptual processor, long-term memory, visual-image memory, and eye movements) 
and metrics (e.g., capacity, speed, and decay rate) are defined. Using data from numerous empirical studies, 
the MHP defines typical and range values for 19 parameters. To apply the model, processing parameters 
are associated with an analysis of the steps involved in accomplishing a task with a given system. Values 
for each parameter are assigned and summed, providing time estimates for task completion. 

Skills, Rules, and Knowledge Model 

Rasmussen [1983, 1986] developed a three-level model for describing qualitatively different modes of 
human information processing. Skill-based processing describes sensorimotor performance that is accom- 
plished without conscious control. The behavior exhibited is smooth, automated, and integrated and 
only moderately based on feedback from the environment. Skill-based processes are generally simple 
automated behaviors, but they can be combined by higher-level conscious processes into long sequences 
to fit complex situations. Rule-based processing describes performance that is goal-directed and con- 
sciously controlled in a feed-forward manner from stored rules that have been developed from experience. 
Rule-based processing does not use feedback control. Knowledge-based processing is the highest level and 
applies to unfamiliar situations for which rules are available. Performance is goal-controlled and con- 
scious. The goal and plans need to be explicitly developed, and reasoning is accomplished with knowledge 
(i.e., a mental model) of the system and environment. 

This three-level model can be directly related to Rasmussen's framework for cognitive task analysis. 
Knowledge-based processing requires performing all the activities in the framework. Rule-based process- 
ing provides a means to bypass activities by applying rules that have been stored from familiarity. Skill- 
based processing provide a means to go directly from detection to execution. Thus the processing required 
is a function of the experience of the task performer and the nature of the task. 

Technique for Human Error-Rate Prediction (THERP) 

THERP [Swain & Guttmann, 1980] is a widely applied human reliability method [Meister, 1984] used 
to predict human error rates (i.e., probabilities) and the consequences of human errors. The method 
relies on conducting a task analysis. Estimates of the likelihood of human errors and the likelihood that 
errors will be undetected are assigned to tasks from available human performance databases and expert 
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judgments. The consequences of uncorrected errors are estimated from models of the system. An event 
tree is used to track and assign conditional probabilities of error throughout a sequence of activities. 

Elemental Resource Model (ERM) of Human Performance 

ERM [Kondraske, 1995] was discussed briefly in the preceding section as a task-analysis technique. That 
discussion addressed how the ERM uses the same quantitative modeling construct (i.e., elemental per- 
formance resources) for task requirements and human capabilities. Major classes of resources include 
motor, environmental (i.e., sensing), central processing (i.e., perception and cognition), and skills (i.e., 
knowledge). Central processing and information (skill) resources are of interest in this chapter. When 
applied to humans or to machines (i.e., agents that perform tasks), the ERM describes the perceptual, 
cognitive, and knowledge resources available to the agent. If the agent has a deficiency in any of these 
elemental resources, a task requiring that resource cannot be completed by the agent. In this case, agents 
with sufficient resources need to be selected, agents with insufficient resources need to be trained, or the 
task goals or procedures need to be modified to accommodate the resources available to the agents 
targeted to perform the task. 

151.4 Models of Machine Processing Capabilities 

Table 151.4 describes models of machine and performance. These models vary greatly in scope and detail. 
They are presented to illustrate that quantitative analysis of performance resources using the ERM extends 
to machine capabilities. These models are not elaborated further. 

151.5 A Human-Machine-Task Analytic Framework 

Making applied decisions involves performing a combined human-machine-task analysis of available and 
required mental capacities. This analysis (1) requires that the mental components of tasks, the mental 
capabilities of humans, and the information processing capabilities of machines be modeled and analyzed 
in compatible and consistent terms and (2) is greatly enhanced if the terms are quantitative. From the 



TABLE 151.4 Models of Machines Processes and Capabilities 



Machine Model 
Process models 



Principles of operation 
models 



Human interaction 
models 



Rasmussen's abstraction 
hierarchy 

ERM 



Description 

This refers to a category of models that describe 
the flow of information and the process 
performed on information through a system. 
It is useful for determining the consequences 
of failures in THERP. 

In this category machines are modeled in terms 
of their operating principles. These are also 
useful for determining the consequences of 
failures needed to apply THERP. 

In this category the characteristics of the machine 
side of the human-machine interface are modeled. 



Developed for modeling process systems. Provides 
a task-independent description of systems at 
five levels. 

Models machines in terms of the performance 
resources they possess. Resource definition is 
consistent across task, human, and machine 
domains. 



Components/Parameters 

Flow 

Transformations 
Dependencies 
Normal performance 

Physical properties 

Mathematic formulas 

Mathematic relationships 

Logical relationships 

Normal range of performance 

Physical properties 

Information content 

Information display format 

Performance metrics 

Hierarchy levels that include 
process, operation, and 
interaction descriptions 

Elemental information- 
processing performance 
resources available to the 
machine. 
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techniques and models described above for task analysis and human and machine models, a methodology 
for a unified, quantitative human-machine-task analysis of mental components is described. 

The approach first quantifies task requirements by combining techniques that produce detailed task- 
decomposition analyses of goals, actions, and required procedures (e.g., GOMS, NGOMSL, TAG, key- 
stroke-level task model, or Rasmussen's cognitive framework) with timeline analysis, where applicable, 
and the elementary resource analysis of tasks. The task-decomposition models are used to specify tasks 
at a level at which the elemental mental resources required to perform the tasks can be specified. The 
timeline provides a basis for specifying performance requirements. Other performance dimensions also 
should be used depending on the task. 

The next step employs quantitative models of human performance and technology to reach an applied 
decision that best satisfies the required resources if sufficient resources are available. If sufficient resources 
are not available, additional performance and skill resources need to be obtained to complete the task, 
or the task requirements need to be reduced. Quantitative models of human performance include MHP, 
THERP, and ERM. Using the MHP, the human processing parameters needed for each task operation 
are identified. The time values assigned to each parameter are used to compute task time estimates. This 
approach is limited to estimating performance in terms of time. THERP maps operations to the tasks 
included in human reliability databases to estimate human error probabilities. The ERM provides a 
framework for specifying performance and functional capacities at the resource level. It is the only model 
that (1) incorporates all required dimensions of performance and skills and (2) uses consistent modeling 
constructs across tasks, humans, and machines. 

Using the approach outlined above establishes a basis for making applied decisions objectively. The 
task-analysis portion of this approach is illustrated by the example below. 

151.6 Brief Example: Analysis of Supervisory Control Task 

Supervisory control refers to the role a human plays in operating a semiautomatic process or system. 
Examples include control of large systems such as a nuclear power plant and specific instrumentation 
such as a robotic or assistive device. Performing supervisory control is high-level task that predominantly 
consists of mental components. This task is used to generically illustrate the use of analytic techniques 
to model a task. 

High-Level Goals 

Two major modes of operation are distinguished [Wickens, 1984]. In one mode (normal), the system is 
behaving as expected. The primary role of the human controller is to monitor system activity and modify 
performance parameters in accordance with planned output goals. In the second mode (fault), the system 
is faulty. The goal of the human controller, in this case, is to detect and diagnose the fault and intervene 
to bring the process back into normal performance. 

Task Decomposition and Procedures 

Sheridan [1987] identifies 10 functions performed by a supervisory controller within these modes of 
operation. Rasmussen's cognitive task framework, which was developed with regard to process-control 
tasks, includes 8 functions applicable to the fault mode of operation. Figure 151.2 depicts these 8 generic 
functions as a top-level breakdown of the supervisory control task in fault mode. For this example, only 
selected tasks are decomposed to lower levels. At each level, the decomposition is not intended to be 
complete. Selected tasks at each level are intended to be illustrative and representative of increasingly 
specific and simple tasks. The arrow within each level provide procedural information. The goals of the 
lower-level tasks are defined by their titles (e.g., acquire data). Only the situation interpretation task is 
decomposed. In reference to Rasmussen's human information-processing model, the task is assumed to 
be knowledge-based. The lower-level tasks represent a hypothesis-testing strategy for knowledge-based 



» 2000 by CRC Press LLC 



High-L«vel Task: Supervisory Control of a Complex System in Fault Mode 



~ Goal: Bring System Back to Normal Operating Mode 




Knowledge: 

of System Parameters 

Knowledge: 

of System Operation 

Memory: Capacity 
Memory: Persistance 
Memory: Access Speed 
Memory: Access Accuracy 



Knowledge: of Interface 
Layout and Structure 
Knowledges/ Words 
Knowledge: of Letters 
Knowledge: of Symbols 

Memory: Capacity 
Memory: Persistance 
Memory: Access Speed 
Memory: Access Accuracy 
Memory: Encoding Speed 
Memory: Encoding Accuracy 

Visual Perception: Speed 

Visual Perception: Accuracy 

For Applicable Level: 

Word Recognition 

Letter Recognition 

Symbol Recognition 



Knowledge: 

of System Parameters 

Knowledge: 

of System Operation 

Memory: Capacity 
Memory: Persistance 
Memory: Access Speed 
Memory: Access Accuracy 

Reasoning: 
Inferencing Speed 
Reasoning: 
Inferencing Accuracy 



Required Resources 



FIGURE 151.2 Selected task decomposition and resource requirements for a high-level supervisory control task. 

diagnosis described by Rasmussen and Goodstein [1988]. However, other models of lower-level tasks 
could be used. Note that although a Rasmussen model is illustrated, the decomposition could be accom- 
plished with other applicable models (e.g., GOMS). 

Required Resources 

Of the task-modeling techniques presented above, only Fleishman and Quaintance's cognitive abilities 
taxonomy and the ERM provide for explicit analysis of required resources. The approach defined by the 
ERM is illustrated in this example because it defines similar analyses for human and machine performance. 
As such, it is the only technique that models all three components of a human-machine-task system in 
the same way. For three of the lowest-level tasks, a set of elemental central processing resources is identified. 
Humans or machines responsible for performing each task are required to possess these resources to 
accomplish the task successfully. Four types of elemental mental resources are identified: knowledge, 
memory, perception, and reasoning. Resources are distinguished by their functional type and the perfor- 
mance dimension they specify. In the example, the performance dimensions are generically identified. In 
application, values should be assigned to these dimensions to the degree they are available or obtainable. 
Given time and cost constraints for the analysis, the ability to quantify all resources may not be practical. 
Task priorities can be used to selectively quantify resources, or the resource model can be used less 
rigorously as a guide to decision making if specific data are not available [Maxwell, 1995]. Once values 
are assigned, these required resources can be analyzed in concert with sets of available resources possessed 
by humans and machines. Available resources will have values that indicate a performance capability. 
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Although the MHP is a human processing and performance model, it can be adapted for use in a way 
analogous to the way the ERM is used in task analysis. The MHP would be applied as usual by identifying 
the processing parameters needed to accomplish each low- level task. However, instead of using the human 
performance values associated with these parameters, time values required by the task would be derived 
and assigned. These required times could then be compared with the estimated human performance 
times available for these low-level tasks to determine the sufficiency of human processing to meet the 
task's requirements. 

Performance Assessment 

After the task is specified, human and machine performance models can be applied to estimate task 
performance. The MHP and keystroke-level performance model can provide task performance estimates 
in terms of task completion time. THERP can be used to estimate human error probabilities for each 
task and task sequence. The ERM approach can be used to estimate performance along any required 
dimension and to compare required with available resources along any required dimension as long as 
the human performance data are available. The results of the ERM assessment would identify stress levels 
on capacities (e.g., resources stressed too long or beyond maximum capacity). These results indicate 
limiting factors to successful task performance. Limiting factors can be identified at elemental or inter- 
mediate performance resource levels. As such, the ERM represents a more comprehensive and internally 
consistent model than the others. It is more comprehensive in that it can be used to model any perfor- 
mance dimension. It is more consistent in that (1) all resources are defined in similar terms, (2) all 
resources are defined at the same level of description, and (3) resource requirements for tasks and 
resources available to humans and machines are all defined identically. 

Defining Terms 

Elemental resource: These consist of performance and skill resources. Performance resources are each 
defined as a couplet consisting of an elementary functional capability associated with a low-level 
undimensional quantitative capacity, either available or required, to perform the function. Skill 
resources are defined in terms of the knowledge, either available or required, to perform an 
elemental function. 

Knowledge acquisition: Any of several techniques for eliciting knowledge about a task or system from 
a human. 

Mental components: The perceptual, cognitive, and knowledge-processing components of a task. 

Mental tasks: Tasks that consist predominantly of mental components and, in ERM terms, require 
mental performance and skill resources to perform. 

Protocol analysis: Protocols are verbal reports generated contemporaneously by a person accomplish- 
ing a task that describe the unobservable (mental) processes being performed. 

Task: A goal-directed, procedural activity that is defined with regard to the resources (i.e., capabilities) 
that performing agents (i.e., humans or machines) must possess to accomplish the goal. 

Task analysis: The process of (1) decomposing high-level tasks into their constituent, mutually exclu- 
sive, lower-level tasks; (2) describing intertask relationships and dependencies; and (3) defining 
each task's goals, procedures, and performance and skill requirements. 

Task taxonomy: A classification of tasks in accordance with a defined method, strategy, or set of criteria. 

Task timeline: A time-based profile of a high-level task that maps its lower-level tasks by time-of-onset, 
duration, and concurrences onto a timeline. 
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relationship to human performance and workload analysis in the context of system design. Behavioral 
Analysis and Measurement Methods, by David Meister, provides a review and practical guide to applying 
task-analytic methods. The Handbook of Perception and Human Performance, Vols. I and II, edited by 
K.R. Boff, L. Kaufman, and J. P. Thomas, provides broad range of human factors and ergonomic data 
related to perceptual and cognitive task performance. The Handbook of Human Factors, edited by 
G. Salvendy, reviews task analytic methods, workload and resource models of mental processing, and 
human performance models in detail. The Handbook of Industrial Engineering, edited by G. Salvendy, 
reviews task analytic techniques and human performance models in the context of industrial design 
applications. The journal Human Factors, published by the Human Factors and Ergonomics Society, is 
a source for new task analytic techniques and application studies. Cognitive Psychology and Information 
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information-processing approach as applied to research in modern experimental and cognitive psychol- 
ogy- 
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A task can be viewed as a sequence of actions performed to accomplish one or more desired objectives. 
Task analysis and decomposition involve breaking down a task into identifiable elements or steps and 
analyzing them to determine the resources (human, equipment, and environmental) necessary for the 
accomplishment of the task. As indicated in earlier chapters, all human tasks require the interaction of 
mental and physical resources, but it is often convenient, for analytical purposes, to make a distinction 
between tasks that require predominantly physical resources of the person performing the tasks and tasks 
that require predominantly mental resources. These different types of tasks are often analyzed separately. 
Usually, in physical tasks, the mental requirements are described but are not analyzed as meticulously as 
the physical requirements. For example, a heavy-lifting task in industry requires musculoskeletal strength 
and endurance, decision making, and other resources, but the successful completion of such a task is 
limited by musculoskeletal strength. Decision-making and cognitive resources may be lightly tapped, and 
so the heavy-lifting task analysis will tend to focus only on those factors which modify the expression of 
musculoskeletal strength while keeping the other kinds of requirements at a descriptive level. The descrip- 
tions and analyses of manual materials handling tasks in Chaffin and Andersson [1991] and Ayoub and 
Mital [ 1991 ] exemplify this kind of focus. The degree with which a resource may be stressed depends on 
a number factors, among which are the resource availabilities (capacities) of the person performing the 
task compared with the task demands. 
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This chapter deals with the analysis and decomposition of only the physical aspects of tasks. It is 
assumed that nonphysical resources are either available in nonlimiting quantities or are not crucial to 
task accomplishment. While the compartmentalization of task characteristics tends to blur the natural 
connections among the various human performance resources, it is still the most pragmatic method 
available for task analysis. 

Physical task analysis first achieved scientific respectability in the early part of this century from the 
work of the industrial engineer Frederick Winslow Taylor [Taylor, 1911] and, shortly afterward, by Frank 
and Lillian Gilbreth [Gilbreth & Gilbreth, 1917]. Taylor and the Gilbreths showed how a task can be 
broken down into a number of identifiable, discrete steps that can be characterized by type of physical 
motions, energy expenditure, and time required to accomplish the task. By this method, they argued that 
many task steps, normally taken for granted, may contribute little or nothing to the accomplishment of 
the task and can therefore be eliminated. As a result of these kinds of analyses, Taylor and the Gilbreths 
were able to enhance productivity of individual workers on a scale that was considered unrealistic at that 
time. The basic approach to the highest level of physical task analysis today has evolved from the motion 
and time studies of Taylor and the Gilbreths (known as taylorism), both in the industrial and nonindustrial 
environments. The exact methods have become more sophisticated and refined, incorporating new 
technologies and knowledge accumulated about human-task interaction. 

From its roots in time and motion studies, task analysis has become a very complex exercise. No longer 
is a single task considered in isolation, as shoveling was by Taylor and bricklaying by the Gilbreths. Today, 
different ideas in management and advanced data analytical methods have established the need for 
cohesion between physical tasks and (1) the more global processes, such as jobs and occupations encom- 
passing them, and (2) the more detailed processes contained within them (the different types of subtasks 
and basic elements of performance). Campion and Medsker [1992] give examples of the first, and 
Kondraske [1995] gives examples of the second. This chapter describes physical task analysis, showing 
the approach for proceeding from higher-level tasks to lower-level ones. Figure 152.1 shows a summary 
of the overall approach. 
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FIGURE 152.1 Diagram showing the concept of a hierarchical task analysis for physical tasks. 
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152.1 Fundamental Principles 

The fundamental principle of physical task analysis is the establishment of the work relationships between 
the person(s) performing the task and the elements of the task under specified environmental and social 
conditions. The main driving forces are performance enhancement, protection from injuries and illnesses, 
and decision making. The order of importance depends on the particular context in which the task 
analysis is performed. Performance enhancement is a concern in all disciplines that deal with human 
performance, though the approaches and objectives may differ. In physical rehabilitation, for example, 
performance enhancement focuses on improving the performance of basic physical functions of the 
human body, such as handgrip, elbow flexion, etc., that have deteriorated from injury or illness. In 
athletics, it focuses on improving performance to the highest possible levels, such as the highest jump 
or the fastest 100-m run. In exercise and fitness, it aims at improving body functions to enhance health 
and quality of life. Finally, in the discipline of ergonomics, it aims at enhancing task performance as well 
as the health and safety of people at work. 

Speed-Accuracy Principle 

Performance enhancement and protection from injuries and illnesses are not necessarily mutually exclu- 
sive. In performance enhancement, the aim is to complete a task as accurately and quickly as possible. 
Both accuracy and speed are important in the work environment, especially in manufacturing, and are 
influenced by other factors. When lack of accuracy or precision leads to dire consequences, as in air traffic 
control jobs, then a decision to sacrifice speed is usually made. The basic concept of Fitt's law [1954] is 
relevant to these situations. It is difficult to find examples of cases where gross inaccuracy is tolerable, 
but in many cases more speed is required and less accuracy is tolerable, and a different point on the 
speed-accuracy continuum is selected. In the manufacturing environment, speed translates into produc- 
tivity and accuracy into quality, and both determine profitability of the production enterprise. Speed and 
accuracy are rarely attainable at the highest levels simultaneously. In practice, a certain amount of one 
may be sacrificed for the other. Thus industrial engineers who set these tolerable limits talk of "optimum" 
speed instead of "fastest" speed and "optimum" quality instead of "perfect" quality. These facts must be 
borne in mind when analyzing physical tasks. A task analysis indicating that a person can increase his 
or her speed of performance should not necessarily imply that the task is being done inefficiently, because 
quality of performance may deteriorate with increased speed. This kind of flawed reasoning can occur 
when a variable is dealt with in isolation, without regard to its interaction with other variables. 

Stress-Strain Concept 

In task analysis, it is common practice to distinguish between stress and strain. However, these terms are 
sometimes used interchangeably and confusedly. In this chapter, stress refers to a condition that may lead 
to an adverse effect on the body, whereas strain refers to the effect of stress on the body. For example, 
working at a computer job in dim lighting often leads to headaches. The dim lighting is considered the 
stress, and the headache is the strain. The term stressor is also widely used as a synonym for stress. Strain 
has often been wrongly called stress. These terms must be clearly defined to determine which factors are 
causative ones (stresses) and which are consequences (strains). Stress is determined by task demands, 
while strain is determined by the amount of physical resources expended beyond some tolerable level, 
defined by the person's resource capacities. The stress-strain principle is pivotal to task analysis when 
one is concerned with errors, cumulative traumas, and injuries in the workplace and is applicable to task 
situations where task demands are likely to exceed human resource capacities. 

Ayoub et al. [1987] developed a quantitative stress-strain index, called the job severity index (JSI), from 
empirical task analysis and epidemiologic data for manual materials handling tasks. This index computes 
the ratio of "job task physical demands to person physical capacities" from several interacting variables. 
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The application of the JSI is also detailed in Ayoub and Mital [1991]. Also, Kondraske [1995] defines a 
quantitative measure of stress that can be applied to individual performance resources. In this measure, 
defined as the "ratio of resource utilization to resource availability," an adverse effect may be noted when 
the stress level exceeds a threshold, which may be different for different types of performance resources 
(e.g., strength, range of motion, etc.). 

Cumulative Strains 

The elimination of motions and other physical actions that do not contribute directly to task performance 
(tayloristic practice) has led to a concentration of work at specific localities in the body and to consequent 
overwork at those localities, but this concentration often leads to rapid consumption of the limited 
resources of the working systems. For example, VDT data entry is a highly repetitive task with very little 
variation; work is concentrated heavily on the hands (rapid finger motions for operating the keyboard) 
and in the neck and trunk (prolonged static muscular contraction), and this often leads to cumulative 
trauma disorders of the hands, wrists, neck, and shoulders. By comparison, traditional typists performed 
a variety of tasks in addition to using the keyboard — changing paper on the typewriter, filing documents, 
using the phone, etc. — and this helped to stem the depletion of physiologic resources caused by rapid 
finger work or sustained static muscular contractions in the neck and shoulder. Also, it is now recognized 
that some functions of the body that have not been considered "productive" under taylorism are highly 
stressed and may even be the limiting factors in task performance. In data processing, the muscles of the 
neck and back maintain prolonged static contractions that often result in muscular strains in the neck 
and back and limit work time. Traditional task analysis does not always consider this static muscular 
work as productive, even though it is absolutely necessary for steadying the arms (for keyboard and 
mouse use) and the head (for viewing the screen and documents). Likewise, work pauses, which have 
been considered wasteful, are now valued for their recuperative effects on the physical (and mental) 
working systems of the body. The increasing awareness of the causes of work- related cumulative trauma 
disorders will continue to influence the interpretation of data derived from task analyses. 

152.2 Early Task- Analysis Methods 

The history of physical task analysis does not show a trend toward the development of a single generalized 
model. The direction of physical task analysis has been influenced by taylorism and by specific efforts in 
the military that were later adapted to the industrial and other nonmilitary environments. A list of 
influential methods, to which readers are referred, is given below: 

1. Time studies of Fredrick Winslow Taylor [Taylor, 1911] 

2. Time an motion studies of Frank and Lillian Gilbreth [Gilbreth & Gilbreth, 1917] 

3. Job analysis developed by the U.S. Department of Labor in the 1930s [Drury et al., 1987] 

4. The U.S. Air Force "method for man-machine task analysis" developed by R. Miller [1953] 

5. Singleton's methods [Singleton, 1974] 

6. Hierarchical task analysis (HTA) developed by Annette and Duncan [1974] 

7. Checklist of worker activities developed by E.J. McCormick and others for the U.S. Office of Naval 
Research [Drury et al, 1987] 

8. Position analysis questionnaire [McCormick et al., 1969] 

9. The AET method (Arbeitswissenchaftliches Erhenbungsverfahen zur Tatigkeitsanalyse, or ergonomic 
job analysis) developed by Rohmert and Landau [Drury et al., 1987] 

152.3 Methods of Physical Task Analysis 

Methods of physical task analysis vary widely. Drury and colleagues [1987] point out that "the variation 
of task analysis format is the result of different requirements placed on the task analysis." While this is 
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a pragmatic method for short-term results, a more general approach for long-term solutions is needed. 
Such an approach should aim to proceed from higher- to lower-level tasks, as discussed in other chapters 
[Kondraske, 1995]. The following general sequence of analysis is recommended: 

1. Description of the system within which the tasks reside 

2. Identification of jobs (if relevant) within the system 

3. Identification of the tasks 

4. Identification of subtasks that can stand independently enough to be analyzed as separate entities 

5. Determination of the specific starting point of each task 

6. Determination of the specific stopping point of each task 

7. Characterization of the task as discrete, continuous, or branching [Drury et al., 1987] 

8. Determination of task resources (human, equipment, and environmental) required to perform 
the task (this step is the essence of task description) 

9. Determination of other support systems (e.g., in work situations, managerial and supervisory help 
may be needed) 

10. Determination of possible areas of person-task conflicts 

11. Determination of possible consequences of conflict, e.g. errors, slowing down of work rate, dete- 
rioration of quality of end product, decline in comfort, harm to health, and decline in safety 

General Methods for Background Information 

Many different techniques for gathering information in physical task analysis have evolved over the years, 
especially in the industrial and biomedical engineering environments. The quality and quantity of 
information that the techniques provide are limited by available equipment, and it seems that the quality 
and quantity of available equipment lag far behind the state of the art in technology. General data- 
gathering techniques, which are somewhat self-descriptive, include 

• Direct observation (predominantly visual) 

• Indirect observation (e.g., replay of videotape) 

• Document review 

• Questionnaire survey 

• Personal interview 

Specific Techniques 

Specialized data-gathering techniques [Chaffin & Andersson, 1991; Niebel, 1993; Winter 1990] that have 
evolved into powerful tools in the industrial environment and which can be used for task analysis in 
other situations include 

• Methods analysis — determines the overall relationships among various operations, the work- force, 
and equipment 

• Operations analysis — determines the details of specific operations 

• Worker-equipment relationship study — determines the interaction between people and their 
equipment (cases of person-machine mismatches are determined here) 

• Motion study — details the motions made by the various segments of the body while performing 
the task 

■ Times study — details the time taken to perform each motion element 

• Micromotion study — detailed description of motion using mainly videotapes 

• Kinematic analysis — measures or estimates various motion parameters for specified body seg- 
ments, e.g. displacement, velocity, acceleration. (It involves micromotion studies. The data are 
used for subsequent biomechanical analyses, e.g., to determine mechanical strain in the muscu- 
loskeletal system [Chaffin & Andersson, 1991] 
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• Kinetic analysis — measures or estimates forces produced by body segments and other biomechan- 
ical parameters (e.g., center of mass, moment of inertia, etc.) and physical parameters of objects 
(e.g., mass, dimension, etc.) (the data are used for subsequent biomechanical analyses) 

Decomposition of Physical Tasks 

Physical tasks can be viewed as exchanges of energy within a system consisting of the human, the task, 
the equipment, and the environment. In task analysis and decomposition, we look at the human on one 
side and the other system components on the other side and then determine the nature of the exchange. 
The energy is transmitted by muscular action. Tasks are hierarchical, however, and for systematic decom- 
position, it is useful to identify the various levels and proceed in a top-down sequence (see Fig. 152.1). 
The items in the various levels below are self-explanatory, but there is considerable content within each. 
Detailed descriptions of each is beyond the scope of this chapter, and readers should consult the references 
[Chaffin & Andersson, 1994; Corlett etal., 1979; Meister, 1985; Niebel, 1993; Singleton, 1974; Winter, 
1990] for standard measurement procedures and analyses. 

Level I. Identifying general types of tasks according to levels of energy requirements: 

■ Tasks requiring great muscular forces 

■ Tasks requiring medium to low levels of muscular forces 

Tasks requiring very low levels of muscular forces are not predominantly physical. Their successful 
accomplishment is strongly dependent on cognitive resources. The application of strong and weak 
muscular forces is not necessarily mutually exclusive. Such forces my be exerted simultaneously by 
different muscular systems or may even alternate within the same system. For example, a typing task 
may require strong back extensor forces while sitting bent forward (if the workstation is poorly designed) 
and weak finger flexion forces for activating the keyboard keys, and a weak index finger flexion may be 
required to activate a trigger on a hand drill while strong handgrip forces stabilize the drill. 

Level II. Identifying the crucial subsystems involved in the generation and transference of forces of 
varying intensities involved in tasks: 

• Musculoskeletal — for great forces 

■ Cardiopulmonary — for medium to low forces 

■ General body posture 

■ Body motion 

• Local segmental motion 

■ Local segmental configuration 

■ Anthropometry 

■ Workplace and equipment dimensions 

The last three subsystems modify directly the expression of muscular force. 

Level III. Identifying environmental requirements and constraints. Environmental conditions influence 
performance and functional capacities in humans. They should therefore be measured or estimated in 
task analyses. Table 152.1 describes common environmental constraints. 

Level IV. Identifying body actions in relation to tasks. These actions can be decomposed into two major 
types: 

1. Those which involve body motions. They are used for either changing body positions or moving 
objects. 

2. Those which do not involve body motions. They are used for primarily for balancing and stabilizing 
the body for performing some task. Body posture is especially important here, especially when 
great muscular forces must be exerted. 
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TABLE 152.1 Environmental Factors in Task Analysis 



Environmental factor 

Illumination 

Heat 

Cold 

Vibration 

Noise 

Body supporting surface 

Space 



Constraint 

Insufficient intensity 
Excessive heat and humidity 
Excessive Cold 
Excessive amplitude 
Excessive intensity 
Slipperiness 
Insufficient workspace 



Examples of effects 

Errors 

Lowering of physiologic endurance 

Numbness of hands — inability to grasp handtool properly 

Loss of grip and control of vibrating tool 

Inability to coordinate job with coworkers 

Difficulty in balancing body 

Inability to position body properly 



TABLE 152.2 Representative Actions with Motions 



Class of Action Subclass Purpose of Action 

Whole body Positioning body 

Transporting object 

Gross segmental Arm movement Applying forces to object 



Leg movement Applying forces 

Trunk movement Generating momentum 

Fine segmental Finger movement Applying forces 



Examples 

Walking, crawling, running, jumping, climbing 
Lifting, lowering, carrying horizontally, pushing, 

pulling, rotating 
Reaching for object, twisting, turning, lifting, 

lowering, transporting horizontally, positioning, 

pressing down, pushing, pulling, rotating 
Rotating (e.g., foot pedal), pushing 
Sitting and bending forward, backward, and sideways 

to retrieve items on a table 
Turning, pushing, pulling, lifting 



TABLE 152.3 Representative Actions without Motions 



Class of 
Action 

Whole body 
Segmental 



Subclass 

Arm 

Leg 



Purpose of Action 



Examples 



Balancing body Standing while preparing meal in kitchen 

Supporting or stabilizing object Bracing ladder while coworker climbs on it 

Holding food item with one hand while cutting it with other 
hand 
Supporting body Forcing foot on ground while sitting in constrained posture 



These actions are not mutually exclusive during task performance. One part of the body may move, while 
another part, involved in the task, may be still. For example, the fingers move rapidly when using a 
keyboard, but the rest of the hand is kept motionless while the fingers are moving. Tables 152.2 through 
152.5 depict practical decompositions of these actions as well as those requiring muscular forces. 

Level V. Identifying anthropometric variables. Human anthropometry interacts with workplace and 
equipment dimensions. Tasks are performed better when the interaction defines a close "match" between 
human body size and task-related physical dimensions. Therefore, physical task analysis may be inade- 
quate without measurements of the dimensions of people performing the task(s) and the corresponding 
dimensions of other aspects of the system of which the task is a part. This kind of matching is one of 
the basic thrusts in the discipline of ergonomics. Grandjean [1988] and Konz [1990] discuss these issues 
in many types of human-task environments, and Pheasant [1986] gives numerous body dimensions 
related to task design. Some important examples of human-system dimensions that must be considered 
during task analysis are shown in Table 152.6. 



152.4 Factors Influencing the Conduct of Task Analysis 

While task analysis traditionally focuses on what a person does and how he or she does it, other factors 
must be considered to complete the picture. In general, the characteristics of the task, the equipment 
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TABLE 152.4 Representative Actions Requiring Forces, for Different Types of Forces 



Class of 
Action 



Subclass 



Sub- 
subclass 



Whole body Whole body Dynamic 



Static 



Segmental 



Static 



Dynamic 



Body-Object 
Contact 

One hand, two 
hands, or other 
contact 



One hand, two 
hands, or other 
contact 

One hand, two 
hands, one foot 
or two feet 



One hand, two 
hands, one foot, 
or two feet 



Purpose of Action 

Generate great forces that must 
be transmitted over small 
localized area or large area. The 
effort produces motion of whole 
body. 

Generate great forces that must 
be transmitted over small 
localized area or large area. The 
effort does not produce motion. 

Generate forces (over a wide 
range of magnitude) that must 
be transmitted over a relatively 
small localized area. The effort 
does not produce motion of the 
body segment. 

Generate great forces that must 
be transmitted over small 
localized area or large area. The 
effort produces motion of the 
body segment. 



Examples 
Lift, push, pull 



Lift, push, pull 



Gripping with whole 
hand, pinching, 
pressing down, 
twisting, turning, 
pushing, pulling, or 
combinations 

Pushing, pulling, 
pressing down 



TABLE 152.5 Posture While Performing Tasks 



Class 



Subclass 



Whole body Standing, sitting, kneeing, 
lying down, other 

Segmental Arms, legs, trunk, hands, 

other 



Purpose 

Optimize body leverage; position 
body for making proper contact 
with object; other 

Optimize segment leverage; 
position segment for making 
proper contact with object 



Examples 

Squatting at beginning of a heavy lift 
to prevent excessive strain in the 
lower back 

Placing elbow at about right angles 
while using wrench to prevent 
excessive shoulder and muscle efforts 



TABLE 152.6 Human Anthropometry and System Dimensions 



Body Dimensions 



System Dimensions 



Height, arm length, shoulder width 

Leg length, knee height 

Hand length, grip circumference 
Crotch height, arm length 
Face height, hand width 



Workspace — height of shelf, depth of space under work table, circumference of 

escape hatch 
Equipment size — distance from seat reference point to foot pedal of machine, height 

from floor to dashboard of vehicle 
Handtool size — trigger length, handle circumference 
Apparel size — leg length of pants, arm length of shirt 
Personal protective equipment — height of respirator, width of glove 



used, and environmental conditions must all be accounted for. Human functional performance cannot 
be viewed in isolation because the person, the task, the equipment, and the environment form a system 
in which a change in one factor is likely to affect the way the person reacts to the others. For example, 
in dim lighting, a VDT operator may read from a document with marked flexion of the neck but with 
only slight flexion (which is optimal) when reading from the screen. Brighter ambient lighting may 
prevent sharp flexion when reading from documents but is likely to decrease contrast of characters on 
the screen. Screen character brightness must therefore be changed to maintain adequate reading perfor- 
mance to prevent sharp flexion of the neck again. 
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Knowing the Objectives of Task Analysis 

Motion, speed, force, environment, and object (equipment) characteristics are common factors for con- 
sideration in task analysis. However, the reasons for performing a task are an equally important factor. 
Tasks that may involve the same equipment and which may seem identical may differ in their execution 
because of their objectives. Thus a person opening a jar in the battlefield in the midst of enemy fire to 
retrieve medicines will not perform that task in exactly the same way as a person at home opening a similar 
jar to retrieve sugar for sweetening coffee. In the former case, speed is more important, and both the type 
of grasp used and the manual forces exerted are likely to be markedly different from the latter case. Napier 
[1980] recognized this principle, in relation to gripping tasks, when he wrote that the "nature of the 
intended activity influences the pattern of grip used on an object." These kinds of considerations in task 
analysis can influence the design of objects and other aids and the nature of education and training programs. 

Levels of Effort 

The level of required effort when performing similar tasks in different environments is not necessarily 
the same, and the interpretation of task-analysis data is influenced by these differences. Tasks in the 
workplace are designed so that human physical effort is minimized, whereas tasks in the competitive 
athletics environment are designed to tap the limits of human performance resources. There are other 
differences. An athlete paces himself or herself according to his or her own progress. He or she has greater 
freedom to stop performing a task should physical efforts become painful or uncomfortable. In the 
workplace, however, there is often little freedom to stop. Many workers often push themselves to the 
limits of their endurance in order to maintain (flawed) performance goals set by their employers. This 
is why cumulative strains are more prevalent in the workplace than in many other environments. These 
differences in performance levels may influence the way task analyses are performed among the different 
disciplines and the application of the data derived therefrom. 

Criticality of Tasks 

An inventory of task elements is necessary for describing the requirements of the task. It can tell us about 
the person, equipment, and environmental requirements for performing a job, but also we must be able 
to identify specific elements or factors that can prevent the successful completion of the task or that an 
lead to accidents and injuries. These critical elements are usually measured in detail or estimated and used 
for developing quantitative models for predicting success or failure in performance. One such widely 
researched element is the compression force on the L5/S1 disk in the human spine while performing heavy- 
lifting tasks [Chaffin & Andersson, 1984] . This force is being used by the National Institute of Occupational 
Safety and Health as a criterion for determining how safe a lifting activity is. If analysis of task-analysis 
data indicates that the safety limit (3400 N of compression) is exceeded then work redesign, based on task- 
analysis information, must be implemented [National Technical Information Service, 1991; NIOSH, 1981]. 

152.5 Measurement of Task Variables 

During task analysis, variables must be measured for subsequent data summary and analysis. It is 
important to know not only what the inventory of task-related variables is but also to what degree the 
variables are related to or affect overall performance. The measurement of task-related variables allows 
for quantification of the overall system. For example, the NIOSH lifting equation [National Technical 
Information Service, 1991] shows how task variables such as the horizontal distance of the load from 
the body, the initial height of the load, the vertical height of lift, the frequency of lifting, the angular 
displacement of the load from the saggital plane during lifting, and the type of hand-handle coupling 
affect the maximal weight of the load that can be lifted safely by most people in a manual materials- 
handling task. The number of task-related variables measured should be adequate for describing the task 
and for representing it quantitatively. However, there are often constraints. These include 



© 2000 by CRC Press LLC 



1. The number of variables that can be identified 

2. The number of variables that can be measured at an acceptable level of reliability and accuracy 

3. The availability of measurement instruments 

Merely summarizing individual measurements seldom yields the desired information. Task perfor- 
mance is essentially a multivariable operation, and variables often must be combined by some quantitative 
method that can yield models representative of the performance of the task. Sometimes a variable cannot 
be measure directly but can be estimated from other measured variables. The estimated variable may 
then be used in a subsequent modeling process. A good example of this is the intraabdominal pressure 
achieved during heavy lifting. Though it can be predicted by a cumbersome process of swallowing a 
pressure-sensitive pill, it also can be estimated from the weights of the upper body and load lifted and 
other variables related to lifting posture. Its estimate can then be used to estimate the compressive force 
in the L5/S1 disk and the tension in the erector spinae muscles during lifting. Imrhan and Ayoub [1988] 
also show how estimated velocity and acceleration of elbow flexion and shoulder extension can be used, 
with other variables, to predict linear pulling strength. 

Task-Related Variables 

Variables that are usually measured during physical task analysis include 

1. Those related to the physical characteristics of task objects or equipment: 

• Weight of load lifted, pushed, carried, etc. 

• Dimensions of load 

• Location of center of mass of load 

2. Those related to the nature of the task: 

• The frequency of performance of a cycle of the task 

• The range of heights over which a load must be lifted 

• The speed of performance 

• The level of accuracy of performance 

3. Those related to the capacities of various physical resources of the person 

• Muscular strength 

• Joint range of motion 

• Joint motion (velocity and acceleration) 

• Maximal aerobic power 

• Anthropometry 

4. Those related to the environment: 

• Temperature 

• Illumination 

• Vibration 

5. Those related to workplace design: 

• Amount of space available for the task 

• Geometric and spatial relationships among equipment 

• Furniture dimensions 

6. Those related to anthropometry: 

• Length, breadth, depth, or circumference of a body segment 

• Mass and mass distribution of a body segment 

• Range of motion of a skeletal joint 
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Instruments for Gathering Task- Analysis Data 

A great number of instruments are available for measuring variables derived from task analyses. The 
choice of instruments depends on the type of variables to be measured and the particular circumstances. 
In general, there should be instruments for recording the sequence of actions during task, e.g., videotape 
with playback feature, and for measuring kinematic, kinetic, and anthropometric variables [Chaffin & 
Andersson, 1994; Winter, 1990]. The main kinematic variables include displacement (of a body part), 
velocity, and acceleration. Kinetic variables include force and torque. Anthropometric variables include 
body segment length, depth, width, girth, segment center of mass, segment radius of gyration, segment 
moment of inertia, joint axis of rotation, and joint angle. Some variables are measure directly, e.g., 
acceleration (with accelerometers), force applied at a point of contact between the body and an object 
(with load cells), and body lengths (with anthropometers); some may be measured indirectly, e.g., joint 
angle (from a videotape image) and intraabdominal pressure (using swallowed pressure pill); and others 
may be estimated by mathematic computations from other measured variables, e.g., compressive force 
on the lumbosacral (L5/S1) disk in the lower back. Posture targeting, a method for recording and 
analyzing stressful postures in work sampling [Corlett et al., 1979], is becoming popular. 

The measurements of performance capacities associated with human functions should conform to 
certain criteria. Details can be found in Brand and Crownshield [1981] and Chaffin [1982]. A general set 
of psychometric criteria is also discussed by Sanders and McCormick [1993]. It includes measurement 
accuracy, reliability, validity, sensitivity, and freedom from contamination. Meeting these criteria depends 
not only on the instruments used but also on the methods of analysis and the expertise of the analyst. It 
is almost impossible to satisfy these criteria perfectly, especially when the task is being performed in its 
natural environment (as opposed to a laboratory simulation). However, the analyst must always be aware 
of them and must be pragmatic in measuring task variables. Meister [1985] lists the following practical 
requirements for measurements: (1) objective, (2) quantitative, (3) unobtrusive, (4) easy to collect, 
(5) requiring no special data-collection techniques or instrumentation, and (6) of relatively low cost in 
terms of money and effort by the experimenter. However, there are not necessarily mutually exclusive. 

152.6 Uses and Applications of Task Analysis 

The uses of task-analysis information depend on the objectives of performing the task, which, in turn, 
depend on the environment in which the task is performed. Table 152.7 gives the different situations. 

152.7 Future Developments 

To date, there is no reliable quantitative model that can combine mental and physical resources. A general- 
purpose task-analysis method that uses active links between the various compartments of human func- 
tions would be an ideal method, but our lack of knowledge about the way in which many of these 
"compartments" communicate precludes the development of such a method. The present approach to 
compartmentalization seems to be the most pragmatic approach. It offers the analyst access to paths 
from a gross task or job to the myriad of basic task elements of performance. Unfortunately, actual 
systems and subsystems used in practice seem to be loosely defined and inconsistent, often with confusing 
metrics and terminologies for important variables. Moreover, available task-analysis models deal only 
with specific classes of application [Drury, 1987]. A general-purpose model is badly needed for handling 
different mental and physical resources at the same time, dealing with wide ranges in a variable, and 
bridging the gaps across disciplines. Such a model should help us to perform task analyses and manipulate 
the resulting data from environments ranging from the work-place, the home (activities of daily living), 
athletics, exercise and fitness, and rehabilitation. Such a model also can help to eliminate the present 
trend of performing task analyses that are either too specific to apply to different situations or not specific 
enough to answer general questions. Fleishman [1982] deals with the issue. At least one recent effort is 
active [Kondraske, 1995]. 
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TABLE 152.7 Uses and Applications of Task Analysis 



Uses and Applications 

Modeling human performance and determining decision- 
making strategies 
Predicting human performance 



Redesigning of the existing tasks or designing of new tasks 

Determining whether to use task performance aids 

Personnel selection or placement 

Determining whether to use aids that enhance health or 
safety 

Determining educational and training procedures 

Allocating humans to machines and machines to humans 
Determining emergency procedures 



Examples of Relevant Situations 

A model showing the sequence of the various steps required to 

perform a task and the type of equipment needed at each step 
Using a quantitative model to predict whether an elderly person 

has enough arm strength to lift a pot full of water from a 

cupboard onto a stove 
Eliminating an unnecessary step in the packaging of production 

items into cartons; designing a different package for a new 

item based on task analytic data gathered for a related item 
The analysis may show that most elderly persons do not possess 

enough strength to open many food jars and may, therefore, 

need a mechanical torqueing aid. 
Matching personnel physical characteristics with task 

requirements can reveal which persons may be able to perform 

a task and, hence, be assigned to it. 
Task analysis may show that too much dust always gets into the 

atmosphere when opening packages of a powered material and 

that workers should wear a respirator. 
Identification of difficult task steps or the need for using 

mechanical aids, together with knowledge of workers' skills, 

may indicate the level of education and training needed. 
Matching people skills and capacities with the resource 

demands of machines 
Task-analysis information may indicate which steps are likely 

to result in dangerous situations, and therefore, require 

contingency plans. 



There is also a need for better quantification of performance. We need to know not only what kind 
of shifting in resources a person resorts to when one route to successful task accomplishment is blocked 
(e.g., insufficient strength for pressing down in a certain sitting posture) but also the quantity and 
direction of that shift. For example, we need to know not merely that a change in posture can increase 
manual strength for torqueing but also what the various postures can yield and what quantity of specific 
mental resources were involved in the change. The models today that answer the quantitative questions 
are mainly statistical regression ones, and they are limited by the number of variables and the number 
of levels of each variable that they can deal with. 

Defining Terms 

Anthropometry: The science that deals with the measure of body size, mass, shape, and inertial 

properties. 
Cumulative trauma: The accumulation of repeated insults to body structures over a period of time 

(usually months or years) often leading to "cumulative trauma disorders." 
Endurance: The maximum time for which a person can perform a task at a certain level under specified 

conditions without adverse effects on the body. 
Fitts 1 law: The equation, derived by P. Fitts, showing the quantitative relationship between the time for 

a human (body segment) to move from one specific point to another (the target) as a function of 

distance of movement and width of the target. 
Hierarchical task analysis: The analysis of a task by breaking it down to its basic components, starting 

from an overall or gross description (e.g., lifting a load manually) and moving down in a series of 

steps in sequence. 
Job analysis: Any of a number of techniques for determining the characteristics of a job and the 

interactions among workers, equipment, and methods of performing the job. 
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Job Severity Index: An index indicating the injury potential of a lifting or lowering job. It is computed 
as the ratio of the physical demands of the job to the physical capacities of the worker. 

Local segment: A specific body segment, usually the one in contact with an object required for per- 
forming a task or the one most active in the task. 

Maximal aerobic power: The maximal rate at which a person's body can consume oxygen wile breath- 
ing air at sea level. 

Motion study: The analysis of a task or job by studying the motions of humans and equipment related 
to its component activities. 

Person-task conflict: The situation in which task demands are beyond a person's capacities and the 
task is unlikely to be performed according to specifications by that person. 

Position analysis questionnaire: A checklist for job analysis, developed by the Office of Naval Research 
in 1969, requiring the analyst to rate or assess a job from a list of 187 job elements. 

Safety limit: The maximum stress (e.g., load to be lifted) that most workers can sustain, under specified 
job conditions without adverse effects (injuries or cumulative strains) on their bodies. 

Time study: The study of a task by timing its component activities. 

Work pause: A short stoppage from work. 
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Further Information 

For the design of equipment in human work environments where task analysis is employed, see Human 
Factors Design Handbook, by W. E. Woodson (McGraw-Hill, New York, 1981). For human anthropometric 
data that are useful to task analysis, see NASA Reference Publication 1024: Anthropometric Source Book, 
vol 1: Anthropometry for Designers (Webb Associates, Yellow Springs, Ohio, 1978). 
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Purpose of This Chapter 

This chapter provides the reader with a structure for addressing human-computer interface (HCI) design 
issues and making design decisions on the basis of quantitative human performance assessments. HCI 
design issues are structured in accordance with a dual-component, dual-task HCI architecture and are 
analyzed using a performance-based usability model. This framework is general and not bound to specific 
interaction paradigms or limited to assumptions about specific technologies. As such, it is anticipated to 
serve the needs for making HCI design decisions for any biomedical application. It is beyond the scope of 
this chapter to identify, categorize, describe, and analyze all HCI design issues. Further, it is not the intent 
of this chapter to review and compare the specific characteristics of different user-interface systems — there 
are several commonly used — or to review in detail the voluminous research that has been generated in 
recent years on HCI design. References for this information are provided in the Further Information section. 

HCI Design Challenges 

Since the introduction of personal computers approximately 20 years ago, the manner in which humans 
and computers interact has dramatically change from computer-centered to user-centered. A major driver 
of change in HCI design has been a growing understanding of user needs and expectations and of the 
ways in which humans can use computers in accomplishing applied objectives. This increased user 
awareness in concert with emerging technology has dramatically expanded the uses and users of com- 
puters. This expansion is not simply limited to enabling users to do traditional tasks more effectively and 
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efficiently. New technology can enable work to be done in ways that were previously impossible and with 
contructs that previously did not exist. For example, electronic documents can be merged, displayed, 
and stored in ways that do not have hardcopy analogues. New approaches to computer-based instructional 
media enable users to discover and experience concepts in ways that traditional multimedia education 
does not. The use of computer programs as aids for medical, legal, and business decision making has 
expanded individual's access to expertise. 

That computers can be used to accomplish many different tasks and that both experienced and novice 
users need to use them present a complex challenge for HCI designers. The multitude of user needs 
creates a diverse set of functional and performance requirements that must be accommodated. Accom- 
modating these diverse requirements is complicated by the advantages of designing user interfaces across 
applications and platforms using a standard set of generic interface devices and techniques. This approach 
reduces the need for task-specific devices and software and exploits user experience to reduce training 
requirements and improve interaction performance. However, it can cause the use of standard devices 
and techniques to be generalized and applied suboptimally. Another design goal, aimed at accommodating 
special user groups and individual user differences, pushes designs toward alternative devices and tech- 
niques that optimally meet specific requirements. 

The emergence of new technologies combined with an awareness among software and hardware 
developers that users demand intuitive, direct, and easy-to-use interfaces promises to foster continued 
dramatic change in the near future. Reductions in size and integration of computers in specific instru- 
ments and personal devices have increased their portability and applied uses. The integration of digital 
information technology across information-gathering, communication, and information-delivery devices 
promises to dramatically change the model of a computer as a single device into a seamless assemblage 
of devices for assisting in all information-processing tasks. Not surprisingly, the design of human- 
computer interfaces has become prominent area of applied research in human factors and ergonomics. 

153.1 Fundamentals 
User Tasks 

When humans and computers interact, two tasks are performed: an interaction task and an application 
task. The interaction task consists of the activities that the user performs to use the computer. These 
activities include moving a mouse device, typing keyboard inputs, viewing displays, searching for and 
managing files, managing displays, setting preferences, and executing programs. The application task 
refers to the user's functional objectives. These objectives vary widely and include writing a report, 
performing a statistical analysis, finding information in a database, controlling a process, making a 
medical diagnosis, and monitoring/testing the health of a system. The application task is a higher-level 
task than the interaction task, and consequently, the design issues associated with each task are different. 
The HCI design process should include a task analysis. Methods for analyzing tasks are presented by 
Maxwell [1995] and Imrhan [1995] in this Handbook. 

A Dual-Component, Dual-Task-Level HCI Architecture 

HCIs have many components. For the purposes of this chapter, two major types of components are of 
interest: mental and physical. These components are distinguishable in terms of the performance and 
skill resources [Kondraske, 1995] that a user needs to perform interaction and application tasks with the 
computer. Using the mental-interface component requires perceptual-cognitive performance and infor- 
mation (skill) resources. Using the physical-interface component requires sensory-motor performance 
and skill resources. The relationship of these HCI components with the user are illustrated in Fig. 153.1. 
For each HCI component, a design specification associated with each user task is defined. The appli- 
cation task-level design specifies the structure and level of objects, actions, and information that the user 
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FIGURE 153.1 The mental and physical HCI components fit different user performance resources. 
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needs, and the procedures, both mental and physical, that a user must take to accomplish applied 
objectives. The interaction-task-level design specifies the representation and behavior of HCI elements 
associated with the mental and physical HCI components. This framework, illustrated in Fig. 153.2, 
reflects the need to enable two user tasks requiring both mental and physical performance and skill 
resources and is used to structure the analysis of HCI design issues. 

Successful HCI design decisions must be made on the 
basis of (1) knowledge about the user's performance and 
skill resources, (2) the performance and skill resource 
requirements of the application task, and (3) the per- 
formance capabilities of available HCI technologies. The 
design of the physical-interface component requires 
knowledge of the user's sensory-motor performance and 
skill resources and of the resources required to use var- 
ious physical HCI devices with regard to the application 
tasks to be performed. Design of the mental-interface 
component requires knowledge of the user's perceptual- 
cognitive performance and skill resources and of the 
resources required to process information in different 
organizations and representations with regard to the 
application tasks to be performed. 
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FIGURE 153.2 A dual-component, dual-task- 
level HCI architecture for design. 
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153.2 A Quantitative Performance-Based Model of Usability 



Achieving a highly usable interface is the HCI designer's primary goal. All HCI design issues are addressed 
on the basis of their impact on usability. The concept of usability is fundamental to the design of any human- 
system interface and is a major factor affecting the operational performance, availability, maintainability, 
and reliability of the overall system. Usability is a multidimensional quality that affords the user practical 
and convenient interaction with the computer for achieving applied objectives. The International Standards 
Organization's (ISO) [1993] usability definition characterizes this quality in terms of effectiveness, efficiency, 
and user satisfaction. An effort to define quantitative usability metrics for these characteristics has resulted 
in the metrics for usability standards in computing (MUSiC) methodology [Bevan & Macleod, 1994]. This 
methodology uses these metrics for specifying usability goals and intermittently testing to verify meeting 
the goals as part of the design process. Generally, usability metrics focus on usability testing and are not 
directly useful early in the design process for defining and quantifying a highly usable interface. The objective 
here is to define a quantitative performance-based approach for guiding HCI design. 

Toward this objective, usability is defined in terms of the elemental resource model (ERM) of human 
performance [Kondraske, 1995]. This model defines performance and skill resources. Each performance 
resource is defined in terms of a quantitative unidimensional capability to perform an elemental function. 
Skill resources are defined in terms of knowledge and experience. Adopting this model allows usability 
tradeoffs in design decisions to be stated in terms of the relationships between (1) available resources 
and the sources that possess them and (2) required resources and the sinks that expend them. Sources 
of resources include the user, the HCI, and non-HCI computer components. Sinks are the interaction 
and application tasks. The basic model is depicted in Fig. 153.3. The HCI designer has partial or full 
control over the following: 

• HCI available resources 

• Interaction-task required resources 

• Application-task required resources 

■ Non-HCI computer components available resources 



^ 



Why Use the ERM? 

There are several methods for quantifying various aspects of HCI design. The GOMS (for goals, operators, 
methods, and selection rules) and keystroke-level models [Card et al., 1983, 1986], when used in com- 
bination with human performance models such as the model human processor, provide task performance 
estimates and knowledge requirements. The NGOMSL (for natural GOMS language) [Kieras, 1998] and 
TAG (task action grammer) [Payne & Green, 1986; Schiele & Green, 1990] models provide quantitative 
metrics for interaction task complexity and consistency. However, the ERM is the only model that 
(1) incorporates all required dimensions of performance and skills, (2) provides a performance-modeling 
capability at elementary and intermediate levels, and (3) uses consistent modeling constructs across tasks, 
humans, and computers. Also, the ERM can be used in a rigorous way when performance data are 
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FIGURE 153.3 HCI design goal is to minimize interaction and application task demands. 
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obtainable or less rigorously as a set of principles to guide designing for enhanced usability. Finally, the 
ERM provides a framework for specifying performance and functional capacities that can be used in 
concert with other models and techniques at various design levels. This is illustrated in the example at 
the end of this chapter. 

Minimal and Optimal Usability 

In terms of the ERM, and HCI design is minimally usable when all the user's performance and skill 
resources needed to accomplish the interaction and application tasks are exhausted in achieving the 
desired level of application task performance. This means that the user can perform the task using the 
computer only with maximal effort and expenditure of resources. Note that this definition of usability 
does not address the design of the HCI specifically. Rather, it addresses the relationship between the 
performance that can be obtained with a given HCI design and the effort a user needs to expend to 
achieve it. This means that different users will have different points at which an HCI for a given task will 
be minimally usable. The concept of optimal usability is more complex. An optimized design will allow 
users maximal use of their performance and skill resources, allocated at their discretion to the application 
task, to maximize their performance on the application task. Thus an optimal design minimizes the 
resources required to perform the interaction task and enhances the utility of the user's resources in 
performing the application task. This means that the HCI will support the user in performing the portion 
of the application task allocated to the user. This is accomplished through application-level design issues 
associated with the organization and level of objects, actions, and information. 

Performance-Based Usability Design Decisions 

The design strategy for achieving usability follows from an analysis of available and required performance 
and skill resources and application of HCI design principles that act to reduce resource task demand on 
users (Fig. 153.4). For example: 

• Usability will increase as the resources required of the user to perform the interaction and appli- 
cation tasks decrease. 

• Usability will increase as the HCI available resources increase (provided the design principles for 
reducing resource demands are applied). 

ERM-Based Approach to Usability Design Decisions 
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HCI Design Goal: 

Use HCI performance resources to minimize performance and skill resource task demands on the user. 

HCI Design Approach: 

• Apply design principles for reducing the user's performance and skill resource requirements 

• Naturalness and Familiarity of Interaction 

• Simplicity of Interaction 

• Robustness of Design 

• Consistency of Design 

• Accommodate Different User Capabilities 

• Apply ERM quantitative analysis techniques 

FIGURE 153.4 A quantitative human-performance-based approach to HCI design. 
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A detailed discussion regarding performance resources is presented by Kondraske [1995]. Different 
resource sources and tasks impose different design objectives. One performance-based design objective 
is to apply HCI resources to minimize the number of performance resources the user needs to perform 
the interaction task. This approach permits the user to allocate more resources to performing the 
application task. Achieving this objective includes ( 1 ) incorporating appropriate input and display devices 
to meet the user's sensory-motor resources and (2) use of organizations, representations, and levels of 
object, actions, and information that fit within the limits of the user's knowledge and perceptual-cognitive 
resources. 

A second performance-based design objective is to apply HCI resources to optimize the resources the 
user needs to perform the application task. Many of the objects, actions, and information with which 
the user interacts are application-task-specific. Organization and representation decisions should be based 
on these task-specific requirements. For example, if the application task involves drawing, the user needs 
general software and hardware drawing tools and devices. This consideration can be used to illustrate 
the relationship between application and interaction task-level design issues. A software drawing package 
can provide different levels of drawing elements (e.g., having a tool to directly draw M-sided polygons 
versus a line-drawing tool from which a polygon can be drawn segment by segment). Assume that the 
HCI provides a palette of drawing tools and that it includes primitive and complex tools. The user can 
draw an ra-sided polygon using the line tool or using the polygon tool. Both tools are included in the 
palette. The issue here is how the decision to develop the polygon tool and include it in the product is 
made. The decision must be based on the user's application task need to draw polygons. This need is 
established by an analysis of the application task. The decision to include the polygon tool is an HCI 
decision at the application-task level. Once this applied need is established, the HCI design needs of how 
to represent, where to locate, and how to operate the tool are addressed. These design needs are HCI 
design issues at the interaction-task level. 

The decisions at these two task levels can be further used to illustrate interactions in assessing perfor- 
mance-based usability. A decision to include the polygon tool made on the basis of the application-task 
requirements will increase usability at the application-task level in that the user will need to use fewer 
sensory-motor resources to create a polygon. However, the inclusion of the polygon tool complicates the 
interaction-task level in that more items are added to the tool palette and that procedures need to be 
defined for drawing it. As will be discussed, these complications are addressed by applying HCI design 
principles that reduce performance and skill resource demands. 

Because the ERM defines performance resources at an elemental level, performance-based design can 
be conducted at the elemental level or at higher intermediate levels. Intermediate levels reduce the 
granularity of the analysis and provide for use of composite performance resources. For example, ana- 
lyzing a menu-selection task performed with a mouse device at an elemental level would involve perfor- 
mance resources for speed, accuracy, and steadiness of positioning movements. An intermediate-level 
analysis could use pointing as a composite performance resource. At his level the selection task would 
involve two intermediate-level performance resources: pointing and clicking. 

Design Principles for Reducing Task Resource Demands 

Optimizing usability is accomplished through the application of resource-demand-reducing HCI design 
principles. As stated earlier, the ERM can be applied rigorously or as a set of guiding principles. Rigorous 
application requires an analysis of available and required performance and skill resources and perfor- 
mance data associated with each resource. These data are usually not obtainable for all resources. Less 
rigorous application uses the principles of resource economics, resource stress, and analyses of the 
performance and skill resources involved in tasks to guide the design toward lower resource demands on 
the user. Five major design principles that act to reduce task demands on performance and skill resources 
are described. Each principle can be applied to all HCI architectural design categories. 
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Naturalness and Familiarity of Interaction. Usability will be improved to the degree that the interaction 
is natural and familiar. Naturalness and familiarity reduce training requirements and increase available 
resources (e.g., available knowledge, sensory-motor performance). For physical components at a task 
level, this means that the user's sensory and effector modalities are properly matched to the required 
interactions. For example, keyboards used for typing, keypads for data entry, and mouse, pen, or touch- 
screen devices for pointing. For physical components at a feature level, this means that (1) user anthro- 
pometrics are employed in defining workstation and component configurations and dimensions and 

(2) that the physical characteristics of controls and displays (e.g., operating force, sensitivity, brightness, 
intensity, and size of displayed information) are congruous with the users and physical capabilities. For 
mental components at a task-level, natural interaction means that the HCI models the user's tasks in 
accordance with the user's knowledge and experience. For mental components at a feature level, natural 
interaction means that the user's perceptual and cognitive capabilities and limitations are employed in 
defining HCI features (e.g., icons, names, screen layouts, and the behavior of window controls such as 
scroll bars). Direct manipulation interaction techniques (e.g., drag and drop) are a form of natural 
interaction in that they create a software mechanism with a physical analogy. 

Simplicity of Interaction. Simplicity applies across a wide variety of design issues and must be incorpo- 
rated in every architectural category. Applying this principle reduces training requirements, memory 
demands, perceptual demands, and the cognitive demands for inferencing. For the physical interface 
component, applying this principle reduces the number of physical steps needed to select and invoke 
actions and reduces movement precision requirements. 

Consistency of Design. Consistency reduces training requirements and memory requirements and pro- 
motes skill development. Consistency applies across a wide variety of design issues and must be incor- 
porated in every architectural category. Examples include object, action, and information organizations, 
task flows, representations for objects and actions, and procedures. TAG [Payne & Green, 1986; Schiele 
& Green, 1990] provides a formal technique for analyzing consistency. 

Robustness of Design. Robustness refers to the ability of the design to tolerate user error and to span the 
functional requirements of the user. Error tolerance reduces resources needed to recover from errors and 
reduces initial accuracy requirements, allowing the user to increase operating speed. Users may commit 
errors because of a lack of knowledge. In this case, the user may be exploring the system to learn it. These 
are not errors. But this type of behavior should be expected, and the design should allow the user to do 
this exploration and to recover from anything he or she may do. Users may commit errors because of a 
misunderstanding or negative transfer from other systems. Thus a user may infer that a menu item will 
elicit a given behavior because it does so on another system that he or she has used. Or the user may feel 
that he or she understands what a given menu item or icon means and be wrong. Users may commit 
errors through unintentional commands. For example, a user may inadvertently click a wrong button, 
hit a wrong key, or select a wrong item from a menu. Ensuring adequate functional coverage reduces the 
functional demands on the user. 

User Capabilities Accommodation. In resource terms, applying this design principle increases the fit 
between an individual user's available mental and physical resources and those required by the interaction 
and application tasks. There are two major dimensions on which users vary: experience and capability. 
Along the experience dimension users vary between experts and novices on either the interaction or 
application task. Thus a user could be very experienced with the applied task but not with doing that 
task on a computer. Similarly, a user may be very used to using computers but not skilled in the applied 
task. Novice users will (1) need more guidance and feedback, (2) require more structure to the task, 

(3) rely more on memory-aiding design features (e.g., menus), and (4) benefit more from on-line help 
facilities than expert users. Expert users (1) will want more flexibility in task control, (2) will want 
shortcut methods to do high-level actions, (3) will rely more on recall rather than memory aids, and 
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(4) will benefit more from the use of preferences and tailored configurations than novice users [Brown, 
1988;Galitz, 1993]. 

Along the capacity dimension users vary between healthy and impaired. In the ERM, impairments are 
not defined differently from any other performance or skill differences among users. All differences are 
defined in terms of the amount of specific performance resources available to an individual. Impaired 
users will vary greatly in capacity and experience. Only through an analysis of resources demanded by 
the task and resources available to the user can an adequate design for an impaired user be accomplished 
successfully. 

153.3 Selected HCI Design Issues, Goals, 
and Resource Requirements 

Selected design issues in each architectural category are identified. Design goals for enhancing usability 
related to each issue are described. These goals are derived by applying the resource-demand-reducing 
design principles described earlier. The human performance resources associated with each issue are also 
identified. Thus for each issue the reader is presented with the areas of a human's performance resource 
profile that need to be assessed to achieve a design that enhances usability. For illustrative purposes, and 
example design process is presented using issues related to the design of a menu system. 



Mental Interface: Application-Task-Level Specification 

Design issues in this category are concerned with the organization and level of objects, actions, and 
information that users need to perform the interaction and applied tasks. Design decisions must consider 
the perceptual-cognitive performance and skill resources possessed by users. Design goals are to minimize 
the resources required to accomplish the applied task. Table 153.1 lists selected design issues of this type, 
along with the design goals and performance and skill resources involved in making the design decisions 
for each issue. 
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TABLE 153.1 Selected Mental-Interface/ Application-Task-Level Design Issues 



Design Issue 



Selected Design Goals 
(Based on Design Principles) 



Selected Performance and 
Skill Resources Required of the User 



Application-task-level 
interaction structure 



Design of dialogs and 
query languages 



Use of an application- 
task-level metaphor 

Menu structure design 



Match interaction structure to user's cognitive 

model of task 
Minimize the number of steps needed to 

complete tasks 
Consistent task syntax 
Consistent task semantics 
Structure syntax in accordance with natural 

language 
Define semantics in accordance with task usage 
Minimize number of dialogs and queries 

required 
Allow for flexible usage 
Consistency of usage 
Leverage user's prior experience 
Increase intuitiveness of application-task-level 

interaction 
Optimize menu navigation 
Structure menus in accordance with user's 

knowledge of task 
Structure menus in accordance with user's 

assoiations among menu set. 



User's knowledge of the application task structure 



User's knowledge of the application task structure 
User's knowledge of application task terminology 
Dialog or query entry speed 
Dialog or query entry accuracy 



User's knowledge of the application task structure 



User's knowledge of the application task structure 
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TABLE 153.2 Selected Mental-Interface/Interaction-Task-Level Design Issues 



Selected Design Goals 
Design Issue (Based on Design Principles) 

Interaction-level Select based on strength of association to 
icon, symbol, represented objects and commands 

and label Establish consistent usage, i.e., identical icons, 

selection symbols, and labels should be used for the 

same objects and actions. 

Color selection Consistency with cultural usage 

Screen and Consistency of within-screen activity flow with 

window layout other learned activities (e.g., left to right flow 

of activity for English speaking users) 

Number of Optimize 

controls and Use of integrated displays where applicable 

displays 



Selected Performance and 
Skill Resources Required for the User 

User's knowledge of computer objects and actions 
Perceptual accuracy 
Perceptual speed 



User's existing color associations to application task 

and HCI objects and actions 
Memory speed 
Memory accuracy 
Perceptual speed 
Perceptual accuracy 
Memory speed 
Memory accuracy 
Perceptual speed 
Perceptual accuracy 



Mental Interface: Interaction-Task-Level Specification 

Design issues in this category are concerned with the representation and behavior of HCI elements. 
Design decisions must consider the perceptual-cognitive performance and skill resources possessed by 
users. Design goals are to minimize the resources required to accomplish the interaction task. Table 153.2 
lists selected design issues of this type, along with the design goals and performance and skill resources 
involved in making design decisions for each issue. 

Physical Interface: Application-Task-Level Specification 

Design issues in this category are concerned with (1) the performance and structural characteristics of 
major physical elements of the workstation and (2) the physical actions required to perform tasks. Design 
decisions must consider the sensory-motor performance and skill resources possessed by users. Design 
goals are to minimize the resources required to accomplish the applied task. Table 153.3 lists selected 
design issues of this type, along with the design goals and performance and skill resources involved in 
making design decisions for each issue. 

Physical Interface: Interaction-Task-Level Specification 

Design issues at the feature level of the physical HCI are concerned with the physical performance 
parameters associated with using controls and displays. Design decisions must consider the sensory- 
motor performance and skill resources possessed by users. Design goals are to minimize the resources 
required to accomplish the interaction task. Table 153.4 lists selected design issues of this type, along 
with the design goals and performance and skill resources involved in making design decisions for each 
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Brief Example Using Menu System Design 

This example identifies the design issues associated with each HCI architectural category and traces the 
decision process for the design of a menu system. The issues and process are illustrated in Fig. 153.5. 
The design process begins with the results of an analysis of the application task, the performance and 
skill resources of the user, and available HCI technology. The analysis of the user will account for user 
knowledge of the application task, familiarity with computer interfaces, and user performance capabil- 
ities. The application-task analysis will provide information about the number of tasks, sequences of 
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TABLE 153.3 Selected Physical-Interface/ Application-Task-Level Design Issues 



Design Issue 

Input device selection 



Display selection 



Action procedure 
specification 



Use of multiple input and 
control methods (e.g., 
keyboard and mouse) 



Selected Design Goals 
(Based on Design Principles) 

Account for user familiarity 

Account for naturalness for task (e.g., keyboard for 

text entry, number pad for data entry, pen, mouse, 

or touch screen for pointing) 

The greater the amount of information that has to 

be displayed simultaneously, the greater the size 

and resolution requirement. 
Greater size increases the number of required head 

and eye movements. 
Optimize size, resolution, and color needs. 
Minimize the required number of keystrokes, mouse 

movements, hand movements, head and eye 

movements 
Use of direct manipulation techniques (e.g., drag 

and drop interaction, scroll bars, mouse controls 

for moving and resizing windows) 
Provide the user with more than one action 

procedure for doing tasks 



Selected Performance and 
Skill Resources Required for the User 

For manual and ocular controls: 

Movement speed 

Movement accuracy 

Movement steadiness 

Movement extension 
Visual acuity 
Color Sensitivity 
Visual span angle 



For manual and ocular controls: 
Movement speed 
Movement accuracy 
Movement steadiness 
Movement extension 

For manual and ocular controls: 
Movement speed 
Movement accuracy 
Movement steadiness 
Movement extension 



TABLE 153.4 Selected Physical-Interface/Interaction-Task-Level Design Issues 



Design Issue 

Keyboard and button devices: 

Size of keys 



Force required to press keys 



Use of audio/visual feedback 

when key is pressed 
Visual display devices: 
Font 
Brightness 



Selected Design Goals 
(Based on Design Principles) 

Greater size will reduce the required accuracy and 

steadiness of movements 
Smaller size will reduce the required movement 

length and duration 
Optimize size across these considerations 
Greater force requires more strength 
Too little force will not provide adequate feedback 

to the user. 
Optimize force across these considerations 
User requires feedback to know that the key press 

has been acknowledged. 

Low confusion rate 
Optimal viewing 



Selected Performance and 
Skill Resources Required for the User 

For wrist, hand, and fingers: 

Movement accuracy 

Movement steadiness 

Movement speed 

Movement extension 
Finger strength 
Proprioceptive feedback sensitivity 



Audio/visual acuity 
Audio/visual sensitivity 



Visual acuity 
Visual sensitivity 



required actions to perform tasks, the semantic associations among tasks and actions, and information 
to be manipulated. The HCI technology assessment will provide the designer with the set of resources 
available for application. Data from these analyses are used to make initial mental-interface component 
and physical-interface component decisions at an application-task level. 

The decision to use a menu system to access actions and control the flow of the application task is 
based on the usability criteria discussed above with regard to performance and skill resources. Specifically, 
it is based on an assessment of the resources required to perform tasks using menu selection compared 
with other operating mechanisms. Alternative mechanisms would be command-line input from the user 
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Application-Task Analysis 
Action Sequence 




Time 



• number of tasks 

• procedures 

• task objects, actions, and information 

• performance and skill requirements 



User Analysis 
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Physical 

• input devices 

• display devices 
Mental 

• organization 

• representation 



[ Mental-Application f 



Define overall HCI structure 
for Application task 

• Organization of objects, 
actions, and information 

• Task flow 



Define menu structure 

• number of menus 

• number of items in menus 

• order of items 
» use of cascading menus 



Mental-Weraction 



Select HCI mechanism for accessing 
objects, actions, and information 



• use menu: 



Define menu-specific 
interaction paramenters 

• names of menus and menu item 

• icons for menus and menu item; 

• color selection 

• location on screen/window 

• user feedback cues 




Physical-Application 



Select workstation elements 

Input Devices 

• keyboard 

• pointing device 
Displays 

• screen size 

• resolution 

• color capabilities 



Define sensory-motor interaction 
procedures for menu selection 



• multimodal 
(keyboard and pointing device) 

• point and click 

• keyboard equivalents 
and accelerators 



T Physical-Interaction 



Define menu-specific 
interaction paramenters 

• size of active selection area for menu 

• size of active selection area for menu 
items 

• fonts, font size and style 

• icon pixel matrix sizes 

• user feedback cues 



IT- 



FIGURE 153.5 HCI design decisions for design of a menu system. 

and visually presented buttons for each object and action. Menus and visual buttons both reduce the 
demands on the user's memory capacities because available actions are listed. In the case of menus, many 
actions are hidden but can be made explicit. If all actions were constantly visible, the user would have 
an easier access to individual actions (i.e., eliminating the step of calling up the menu) but at the cost 
of greater display size and clutter, requiring greater use of visual resources. If data specifically addressing 
each tradeoff in resources are not available, performance testing can be conducted to verify decisions. 
Before a final decision is made, however, the initial decisions made regarding the physical interface at 
the application task level need to be considered. 
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As illustrated, the task-analysis data are used to select input and display devices. Selection of input 
and display devices is based on matching the congruity of the information to be input or displayed with 
the capabilities of the device. Both menu and button selection can be accomplished with mouse and 
keyboard devices. Command-line entry cannot be accomplished with a mouse. The point here is that 
there are additional tradeoffs in resource capabilities between the mental and physical components of 
the interface. Finally, another consideration is created by the user analysis. User familiarity with another 
pointing device such as a pen may push the user to a pen-based mode of operation. This is especially 
true if the HCI design is attempting to account for gestural inputs that may be beyond the capability of 
the mouse. Additionally, design for users with disabilities may make selection of alternative input and 
display devices necessary. For example, head and ocular controls can be used in place of hand-manipulated 
pointing and selection devices. For the purposes of the example, it is assumed that menus have been 
determined to be optimal for use of resources on the mental interface, that mouse and keyboard input 
devices have been determined to be optimal, and that a 14-in. color monitor has been determined to be 
minimal for the physical interface. It is further determined that a number keypad is not needed for the 
application tasks of interest. 

The next mental-interface design issue is defining the menu system structure. This issue is associated 
with the application-task level. Structural decisions will be based on the task structure in an attempt to 
reflect the task knowledge of the user. Specific techniques such as pathfinder [Roske-Hofstrand & Paap, 
1986; Schvaneveldt et al., 1985] can be employed to extract user associations and structural knowledge 
about the concepts that will be included in the menus. Specific design decisions are included in Fig. 153.5. 

For the physical interface, the next design issue remains at the application-task level and involves 
specifying the overt physical actions that the user must perform to select menus and menu items. This 
specification of procedures is dependent on the input and display devices selected and the structure of 
the menu system to be accessed. 

The goal is to define procedures that reduce the user's sensory-motor resource requirements. At the ( # ) 

application level, this reduces to making the procedures simple, consistent, natural, and fault-tolerant. 
This translates to minimizing the number of steps to select a given item, minimizing the number of types 
of steps needed to be learned, providing adequate feedback to the user at each stage, and providing 
multiple access procedures that allow skilled users to access actions more directly. 

The next design issues for the mental-interface component are associated with the interaction-task 
level and involve specification of the representations for menus and menu items. These include menu 
and item names or graphic representations. It further involves the specification of the menu box and 
other menu indications (e.g., an ellipsis for indicating that a dialog box will be opened or an arrow 
indicating a cascading menu item). The specification of the selection procedures also may impose design 
considerations at this point. For example, keyboard operation will involve use of a selection cursor that 
cues the user to the active selection position. The perceptual-cognitive aspects of the design of this cursor 
will be considered here. 

For the physical interface, the next design issues are associated with the interaction-task level. The 
specified procedures will be used to determine menu-specific, low-level operating parameters for the 
display and input devices. Menu-specific issues related to menu selection with a multibutton mouse 
device include the following: 

■ Size of the active screen selection area of each menu button. Greater size will translate to a reduced 
demand on pointing (positioning) accuracy resources. 

■ Size of the active screen selection area for each menu item. Greater size will require less precise 
pointing. 

• Number of items in a selected menu. More items require more access time (or greater processing 
speed to keep access time fixed), more precise pointing, and create a greater probability of selection 
errors. 

• Use of cascading menus. Selection requires more precise pointing. 
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Issues related to selection with a keyboard include the following: 

• Key binding to selection action. Traditional, familiar, and consistent binding will reduce memory 
requirements and probability of user error. 

• Distinguishability of the displayed selection cursor. Greater distinguishability requires less visual 
search and detection time. 

• Size, sensitivity, required operating force of keyboard device. 

• Response lag between keystroke and selection-cursor movement. Greater lag will create greater 
pointing error and increased pointing time. 

Defining Terms 

Application task: Refers to the applied objectives the user is employing the computer to accomplish 
(e.g., writing a report, performing a statistical analysis, finding information in a database, control- 
ling a process, making a medical diagnosis, and monitoring/testing the health of a system). 

Direct manipulation: A term coined by Schneiderman [1982, 1983]. Refers to an HCI or style of 
interaction with the following characteristics: 

• Objects are continuously displayed. 

• Physical action (e.g., point and click, drag and drop interaction) rather than command line 
interaction is employed. 

• Impact of actions are rapid and directly observable. 

Interaction task: Refers to the activities that the user performs to use the computer (e.g., moving a 
mouse device, typing keyboard inputs, viewing displays, searching for and managing files, man- 
(v) aging displays, setting preferences, and executing programs). 

Usability: Usability is a multidimensional quality that affords the user practical and convenient inter- 
action with the computer for achieving applied objectives. The concept of usability is fundamental 
to the design of any human-system interface and is a major factor affecting the operational 
performance, availability, maintainability, and reliability of the overall system. 

User-centered: User-centered design refers to a philosophy of human-machine system design that places 
the focus of the design process on the needs of the human who uses the system to accomplish a 
task. The following items characterize a user-centered design approach: 

• The system is viewed as a tool that the user employs to accomplish objectives. 

• The user is always performing a higher-level task than the system. 

• Takes are allocated away from the user rather than to the user. This is not simply a matter of 
perspective but rather is fundamental to a human-centered approach. That is, the user is assumed 
to be performing a high-level task. Lower-level tasks that are better performed by the computer 
are allocated from the user to the computer. This approach is opposed to one that considers the 
computer as the high-level task performer and allocates to the user lower-level tasks that the 
machine cannot do well. 

• Tasks allocated to the user are meaningful. 

• The user's responsibilities provide job satisfaction and make use of the user's talents and skills. 

• Users are involved in all phases of the design process. 

• The user's skills and performance capabilities are explicitly considered in and accounted for in 
design decisions. 
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A clinical trial is a research study involving human subjects and an intervention (i.e., device, drug, surgical 
procedure, or other procedure) that is ultimately intended to either enhance the professional capabilities 
of physicians (i.e., improve the service delivered), improve the quality of life of patients, or contribute 
to the field of knowledge in those sciences which are traditionally in the medical field setting — e.g., 
physiology, anatomy, pharmacology, epidemiology, neurology, cognitive psychology, etc. [Levin, 1986]. 
Clinical trials research is in the business of evaluating therapeutic interventions intended to benefit 
humans. Its value is directly related to the relevance of the questions "Do our treatments work?" "How 
well do our treatments work?" and "Are our treatments safe?" For example, drug A is designed and 
anticipated to relieve sinus congestion. Is there a drug interaction when drug A is taken with drug B 
and/or moderate levels of alcohol consumption such that while congestion is relieved (i.e., drug A is 
effective), human information processing capacities are reduced (i.e., is drug A safe?). And what is the 
time course of effects with regard to positive and negative (or adverse) effects? Thus it is clear that not 
only steady-state issues buy also dynamic questions are on interest in clinical trials research. 

While clinical trials research incorporates many different components, the focus of this chapter is 
limited to study questions associated with human performance capacity variables and their measurement 
as they contribute to the determination of therapy effectiveness and safety. Such variables have been 
incorporated into trials since the use of controlled studies in the medical field began. However, the 
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methodology employed to address human performance variables has been slowly but steadily shifting 
from mostly subjective to more objective instrumented methods [e.g., Tourtellotte et al., 1965] as both 
the understanding of the phenomena at play and the demand for improved quality of studies have 
increased. This chapter begins by briefly examining a classification of typical clinical trials study models, 
presents a summary of methods employed and key methodologic issues in both the design and conduct 
of studies (with special emphasis on issues related to the selection of measures and interpretation of 
results), and ends with a walk- through of a typical example that demonstrates the methods described. 
Brief discussion of the benefits that can be attributed to the use of objective, instrumented measures of 
human performance capacities as well as their current limitations in clinical trials research is also 
presented. While it is emphasized that most methods and issues addressed are applicable to any inter- 
vention, the use of human performance variables in pharmaceutical clinical trials has been most prevalent, 
and special attention has been given here to this application. 

154.1 Basic Principles: Types of Studies 

Depending on the set of primary and secondary questions to be addressed, considerable variety can exist 
with respect to the structure of a given trial and the analysis that is performed. Within the focus and 
scope of this chapter, clinical trials are classified into two categories for discussion: (1) safety-oriented 
and (2) efficacy-oriented. According to the Food and Drug Administration (FDA) [1977], four phases 
of research studies are required before a drug can be marketed in the United States. Phase I is known as 
clinical pharmacology and is intended to include the initial introduction of a drug into humans. These 
studies are safety-oriented; one issue often addressed is determination of the maximum tolerable dose. 
Phases II and III are known as clinical investigation and clinical trials, respectively, consisting of controlled 
and uncontrolled clinical trials research. Phase IV is the postmarketing of clinical trials to supplement 
premarketing data. Both safety and efficacy are addressed in phase II to IV investigations. As specified 
earlier, not all interventions studied are drugs. However, a similar phased approach is also characteristic 
of the investigation of therapeutic devices and treatments. 

Safety-oriented trials usually involve vital sign measures (e.g., heart rate, blood pressure, respiration 
rate), clinical laboratory tests (e.g., blood chemistries, urinalysis, ECG), and adverse reaction tests (both 
mental and physical performance capacities) to evaluate the risk of the intervention. In this chapter we 
focus on the adverse reaction components, most of which historically have been addressed with subjective 
reporting methods. In the case of drug interventions, different doses (e.g., small to large) are administered 
within the trial, and the dose-related effects are examined. Thus the rate at which a drug is metabolized 
{pharmacokinetics), as evidenced by changes in drug concentrations in blood, cerebral spinal fluid, urine, 
etc., is usually addressed in safety-oriented drug studies, as well as the maximum dosage which subjects 
can tolerate [Baker et al,. 1985; Fleiss, 1986; Jennison & Turnbull, 1993; Tudiver et al, 1992]. 

Efficacy-oriented trials are usually conducted after initial safety-oriented trials have established that 
the intervention has met safety criteria, but they will always have safety questions and elements as well. 
In this type of study, the goal is to objectively determine the therapeutic effect. Whatever the type of 
intervention, these studies are designed around the bottom-line question "Is the intervention effective?" 
In many cases (e.g., drugs for neurologic disorders, exercise programs for musculoskeletal injuries, etc.), 
effectiveness implies improvement or retarding the rate of deterioration in one or more aspects of mental 
and/or physical performance. With regard to disease contexts, it is necessary to distinguish performance 
changes that merely reflect treatment of systems as opposed to those which reflect the slowing or reversal 
of the basic disease process. Studies typically include pre- and postintervention measurement points and, 
whenever feasible, a control group that is administered a placebo intervention [Fleiss, 1986; Tang & 
Geller, 1993; Weissman, 1991]. The situation is complicated because there are many performance capac- 
ities that could be affected. In response to the intervention, some capacities may improve, some may 
remain unchanged, and some may be adversely affected. Thus efficacy-oriented studies typically include 
a number of secondary questions that address specificity of effects. When the intervention is a drug, both 
the pharmacokinetics and pharmacodynamics are often addressed. Pharmacodynamic studies attempt to 
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relate physiologic and/or metabolic changes in the concentration of the agent over time to corresponding 
changes in the therapeutic effect. Thus repeated measurement of selected performance capacities are 
required over relatively short periods of time as part of these protocols. 

Clinical trials involving human performance metrics generally use the randomized clinical trial (RCT) 
design. The RCT is a way to compare the efficacy and safety of two or more therapies or regimens. It 
was originally designed to test new drugs [Hill, 1963]; however, over the last 25 years, it has been applied 
to the study of vaccinations, surgical interventions, and even social innovations such as multiphasic 
screening [Levin, 1986]. Levin [1986] describes four key elements of RCTs: (1) the trials are "controlled," 
i.e., part of the group of subjects receive a therapy that is tested while the other subjects receive either 
no therapy or another therapy; (2) the significance of its results must be established through statistical 
analysis; (3) a double-blind experimental design should be used whenever possible; and (4) the therapies 
being compared should be allocated among the subjects randomly. Levin[1986] further noted that "the 
RCT is the gold standard for evaluating therapeutic efficacy." 

154.2 Methods 

Studies are defined and guided by protocols, a detailed statement of all procedures and methods to be 
employed. Figure 154.1 summarizes the major steps in clinical trials in which human performance 
variables represent those of primary interest, although the general process is similar for most trial 
involving human subjects. 

Selecting Study Variables 

In many clinical trials, performance measurements must be focused on the specific disease or set of 
symptoms against which a medication or intervention is directed. The critical element is selection of an 
outcome assessment that will indicate directly whether the intervention can eliminate key signs or 





Conduct Statistical 

Analysis and 
Significance Tests 




FIGURE 154.1 Summary of major steps in the conduct of a clinical trial involving human performance capacities. 
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symptoms of the disease, improve function affected by the disease, or even simply delay further disease 
progression and loss of function. Economic and statistical constraints of such trials often indicate that a 
minimal set of measures must be used to provide valid, reliable, and sensitive indicators of changes in 
underlying disease activity related to the treatments. In many instances, there is considerable controversy 
about what constitutes a valid and reliable assessment of the underlying disease process, or there may be 
de facto standards adopted by clinicians, pharmaceutical companies, and the FDA as to the choice of 
outcome assessments. In many diseases, these standards may be physician rating scales of disease severity. 
Thus objective studies may be required in the case of specific diseases and disorders to delineate the most 
sensitive set of performance measures for clinical trial evaluations [Mohr & Prouwers, 1991]. Further, it 
often must be demonstrated to clinicians, drug companies, and the FDA that the objective performance 
measures selected as outcome assessments actually do provide equal or more sensitive indicators of disease 
progression (and improvement) than do existing rating scale measures traditionally utilized in clinical 
trials. Experience with human performance testing in multiple sclerosis clinical trials, discussed below, 
illustrates these issues. 

When objective measurement of performance capacities has been incorporated into many clinical 
trials, concepts and tools from human performance engineering can facilitate the selection of variables 
and shed some light on issues noted above. In either safety- or efficacy-oriented studies, study variable 
selection can be characterized as a two-step process: (1) identification of the factors in question (e.g., 
Table 154.1) and (2) selection of the relevant performance capacities to be measured and associated 
measurement instruments. This link between these two steps often represents a challenge to researchers 
for a number of reasons. First, duality in terminology must be overcome. Concerns about an intervention 
are typically initially identified with "negative" terms such as "dizziness" and not in terms of performance 
capacities such as "postural stability." Human performance models based on systems engineering concepts 
[e.g., Kondraske,1995] can be used to facilitate the translation of both formal and lay terms used to 
identify adverse effects to relevant performance capacities to be measured, as shown in Table 154.1. 

In Kondraske's [1995] model, performance capacities are modeled as resources that a given system 
possesses and draws up to perform tasks. It also provides a basis for delineation of hierarchical human 
systems and their performance capacities as well as a basis to quantitatively explain the interaction of 
available performance capacities with demands of higher-level tasks (e.g., such as those encountered in 
daily living). The hierarchical aspect is particularly important in selecting performance capacities to be 
measured. It is generally good practice to include a combination of carefully selected lower-level and 
higher-level capacity measures. This combination allows careful tradeoffs between information content 
(i.e., specificity) and simplicity of the protocol (e.g. number of variables, time for test administration, 
etc.). Higher-level capacities (characterizing systems responsible for gait, postural stability, complex 
mental task, etc.) are dependent on the performance capacities of multiple lower-level subsystems. Thus 
a few higher-level performance capacities can reflect multiple lower-level capacities (e.g., knee flexor 
strength, visual information processing speed, etc.). However, lower-level capacity measures are typically 
less variable than higher-level capacities within individuals (i.e., on retest) and within populations. 
Therefore, small changes in performance can be more readily discriminated statistically if they exist and 
are attributable primarily to localized lower-level capacities (e.g., see data in Potvin et al. [1985]). Also, 
more specific information can provide valuable insights into physiologic effects if they are present. By 
combing both types, broad coverage can be achieved (so that aspects of human performance in which 
the expectation of an effect is less remain included), as well as a degree of specificity, while keeping the 
total number of study variables to a reasonable size. In addition, if an effect is found in both a higher- 
level capacity and a related lower-level capacity, an internal cross-validation of finding is obtained. Many 
early studies incorporated variables representing looks across different hierarchical levels but did not 
distinguish them by level during analysis and interpretation. While such logic begins to remove the 
guesswork, much work remains to refine variable-selection methodology. 

To address the broad, bottom-line questions regarding safety and efficacy, the formation of composite 
scores (i.e., some combination of scores from two or more performance capacity challenges or "tests") is 
necessary. A resource model and the concept of hierarchical human systems also can be used to develop 
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TABLE 154.1 Representative Examples of Linking Factors in Question to Specific Measurable 
Performance Capacities 



Factor in Question* 

General central nervous system effects (of drug 



Alcohol interaction (of drug) 



Dizziness 
Drowsiness 



Slowness of movement, psychomotor retardation 



Mood 

Speed 

Coordination 

Tremors, abnormal movements 



Joint stiffness or pain 
Weakness, strength 

Sensation 



Selected Relevant Performance Capacities* 

Selected activity of daily living execution speed 

Postural stability 

Upper extremity neuromotor channel capacity 

Manual manipulation speed 

Visual-upper extremity information processing speed 

Visual attention span 

Visual-spatial memory capacity 

Visual-numerical memory capacity 

Selected activity of daily living execution speed 

Postural stability 

Visual information processing speed 

Visual-spatial memory capacity 

Postural stability 

Selected activity of daily living execution speed 

Visual attention span 

Visual-upper extremity information procession speed 

Bond-Lader visual analog scale 

Visual-upper extremity information processing speed 

Upper extremity random reach movement speed 

Index finger (proximal intercarpal phalangeal joint), flexion-extension 

speed 
Emotional stability (e.g., with Hamilton anxiety scale, Bond-Lader 

analog scale and other similar tools) 
Reaction Time 
Finger tapping 
Limb steadiness 
Postural stability 

Vocal amplitude and pitch steadiness 
Extremes and range of motion 
Isomeric strength (selected joins) 
Postural stability (one leg, eyes open) 
Isometric grip strength 
Isometric strength (representative set of upper and lower extremity, 

proximal and distal muscle groups) 
Vibration sensitivity 
Thermal sensitivity 



Terms used include those often used by pharmaceutical companies to communicate adverse effects to general public. 
Tists are illustrative and not exhaustive. More or fewer items can be included depending on time available or willingness 
to expend for data collection, with commensurate tradeoffs between specificity and protocol simplicity. 



composite variables for practical use in clinical trials. Component capacities can be selected that are 
theoretically or empirically most sensitive to the disease or condition under study. The composite variable 
is then created by calculating a more traditional weighted arithmetic sum [e.g., Potvin et al., 1985] or 
product [e.g., Kondraske, 1989] of the component scores. In both cases, the definition of component 
scores is important (i.e., whether a smaller or larger number represents better performance). Treatments 
are not approved if only a small subset of individuals is helped. Composite measures provide the only 
objective means of integrating multidimensional information about an intervention's effects. The primary 
advantages of the composite variable is the creation of a single, global, succinct measure of the disease, 
condition, or intervention, which is often essential as the primary outcome assessment for efficacy. The 
major disadvantage is loss of detailed information about the unique profile of performance changes that 
each subject of a group as a whole may show. Such tradeoffs are to be expected. Thus both composite 
and component measures play important but different roles in clinical trials. 
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It is imperative that the selection of the study measures also consider a test's objectivity (nonbiased), 
reliability (consistency), and validity (measures what is intends to measure) to add to the quality of the 
measurements. Many complex issues, which are beyond the present scope, are associated with measuring 
and interpreting the quality of a measurement. (See Baumgartner and Jackson [1993], Hastad and Lacy 
[1994], and Safrit [1990] for more information.) 

Formation of the Subject Pool 

Clinical research investigations usually involve a sample group of subjects from a defined population. 
Selection of the study population so that generalizations from that sample accurately reflect the defined 
population is a dilemma that must be adequately addressed. If probability samplingis chosen, each subject 
in the defined population theoretically has an equal chance of being included in the sample. The advantage 
of this kind of sampling is that differences between treatment groups can be detected and the probability 
that these differences actually exist may be estimated. If nonprob ability sampling is chosen, there is no 
way to ensure that each subject had an equal chance of being included in the sample. Conclusions of 
nonprobability sampling therefore have less merit (not as generalizable) than those based on probability 
sampling. 

Studies with healthy subjects are believed to be necessary before exposing sick persons to some 
interventions because persons with disease or injuries commonly have impaired function of various 
organs that metabolize drugs and may take medications that can alter the absorption, metabolism, and/or 
excretion rates of the intervention. Gender issues also should be a concern in clinical trials because of 
new FDA regulations [Cotton, 1993; Merkatz et al, 1993; Stone, 1993]. 

All subjects selected as study candidates should be informed of the procedures that will be utilized in 
the clinical trials investigation by signing an informed consent document, as required by the Department 
of Health and Human Services Code of Federal Regulations [1985] and other federal regulations applicable 
to research involving human subjects. Using all or a subset of study measurement variables (in addition 
to medical history and examinations as necessary), a screening procedure is recommended to determine 
if each subject meets the minimum performance criteria established for subject inclusion. When patients 
are part of the sample group, this performance screening also can be used to establish that the sample 
includes the desired balance of subjects with different amounts of "room for improvement" on relevant 
variables. An added benefit of this screening process is that subjects and test administrators obtain 
experience with test protocols, equipment, an procedures that will add to the validity of the study. 

Unless available from a previous similar study, it is typical for pilot data to be collected to estimate 
the expected size of outcome effects. With this information, the power of the statistical analysis (i.e., the 
likelihood that a significant difference will be detected) can be estimated so that sample size can be 
determined [Cohchran & Cox, 1957; Fleiss, 1986; Kepple, 1982]. Another concern is that some of the 
original subject pool will not complete the study or will have incomplete data. High attrition rates may 
damage the credibility of the study, and every effort should be made to not lose subjects. DeAngelis 
[1990] estimates that attrition rates higher than 50% make the interpretation of clinical trial research 
very difficult. Some researchers believe that no attempt should be made to replace these subjects because 
even random selection of new subjects will not ensure bias caused by differential participation of all the 
subjects. 

Data Collection 

Investigators should seek to minimize sources of variability by careful control of the test conditions and 
procedures. Proper control can be attained by (1) using rooms that are reasonably soundproof or sound- 
deadened, well lighted, and of a comfortable temperature; (2) selecting chairs and other accessories 
carefully (e.g., no wheels for tests in which the subject is seated); (3) testing subjects one at a time without 
other subjects in the test room; (4) using standardized written instructions for each test to eliminate 
variability in what is stated; (5) allowing for familiarization with the test instruments and procedures; 
(6) not commenting about subject performance to avoid biased raising or lowering of expectations; 
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(7) arranging a test order to offset fatigue (mental and physical) or boredom and to include rest periods; 
and (8) training test administrators and evaluating their training using healthy subjects, especially in 
multicenter studies, so that consistent results can be obtained. 

Data Integrity Screening 

Despite all good efforts and features incorporated to ensure high-quality data, opportunities exist for 
error. It is therefore beneficial to subject data obtained during formal test sessions to quality screenings. 
This is a step toward forming the official study data set (i.e., the set that will be subjected to statistical 
and other analyses). Several independent analyses used to screen data, which are typically computer- 
automated, are described below. 

Screening of Baseline Measures Against Inclusionary Criteria. If human performance study variables 
with specific performance criteria are used as part of the subject selection process as recommended above, 
each subject's baseline score can be compared with the score obtained during inclusionary testing. For 
most variables, baseline scores should not differ from inclusionary testing by more than 20% [Potvin 
et al., 1985]. Greater deviations point to examiner training, test procedure, or subject compliance prob- 
lems. 

Screening Against Established Norms. For variables that are not included in inclusionary screenings or 
in studies where performance variables are not used as part of the subjects inclusionary criteria, baseline 
data can be screened against established reference data (e.g., human performance means and standard 
deviations). Data are considered acceptable if they fall within an established range (e.g., two standard 
deviation units of the reference population mean). From the perspective of risk associated with not 
identifying a data point that could be a potential problem, this is a fairly liberal standard that can still 
identify problems in data collection and management. Standards that lessen risk can, or course, be 
employed at the discretion of the investigator, but at the expense of the possible identification of a larger 
number of data points that require follow-up. 

Screening Against Anticipated Effects. It is more difficult to screen nonbaseline data for quality because 
of the possible influences of the intervention. However, criteria can be established based on (1) absolute 
level of variables (both maximums and minimums) and (2) rates and direction of change from one 
measurement period to the next. Even criteria that allow a rather wide range of data (or changes across 
repeated measures) can be useful in detecting gross anomalies and is recommended. 

If anomalies are discovered using the screening methods described above, several outcomes are possible: 
(1) The anomaly can be traced and rectified (e.g., it may be attributable to a human or computer error 
with backup available); (2) the anomaly may be explainable as a procedural error, but it may not be 
possible to rectify; and (3) the anomaly may be unexplainable. Data anomalies that are explainable with 
supporting documentation could justify classification of the given data items as "missing data" or replace- 
ment of all data for the corresponding subject in the data set (i.e., the subject could be dropped from 
the study and replaced with another). There is no justification for eliminating or replacing data anomalies 
for which a documentable explanation does not exist. 

Analysis and Interpretation of Results 

Traditional inferential statistical analysis should be performed according to the statistical model and 
significance levels agreed on by the clinical research team when the study is designed. However, data also 
should be analyzed and interpreted from a clinical perspective as well. 

In experimental research, tests of statistical significance are the most commonly used tools for assessing 
possible associations between independent and dependent variables as well as differences among treat- 
ment groups. The purpose of significance (i.e., statistical) tests is to evaluate the research hypothesis at 
a specific level of probability, or p value. For example, if a p value of 0.05 was chosen, the researcher is 
asking if the levels of treatment (for example) differ significantly so that these differences are not 
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attributable to a chance occurrence more than 5 times out of 100. By convention, a p value of 0.01 (1%) 
or 0.05 (5%) is usually selected as the cutoff for statistical significance. Significance tests cannot accept 
a research hypothesis; all that significance tests can do is reject or fail to reject the research hypothesis 
[Thomas & Nelson, 1990]. Ultimately, significance tests can determine if treatment groups are different, 
but not why they are different. Good experimental design, appropriate theorizing, and sound reasoning 
are used to explain why treatment groups differ. 

Exclusive reliance on tests of statistical significance in the present context can mislead the clinical 
researcher, and interpretation of clinical significance is required. The key issue here is the size of the 
observed effect; p values give little information on the actual magnitude of a finding (i.e., decrement or 
improvement in performance, etc.). Also, statistical findings are partially a function of sample size. Thus 
even an effect that is small in size can be detected with tests of statistical significance in a study with a 
large sample. For example, a mean difference in grip strength of 2 kg may be found in response to drug 
therapy. This difference is a quite small fraction of the variability in grip strength observed across normal 
individuals, however. Thus, although statistically significant, such a change would perhaps have a minimal 
impact on an individual's ability to function in daily activities which make demands on grip strength 
performance resources. Thus clinical significance addresses cause-and-effect relationships between study 
variables and performance in activities of daily life or other broad considerations such as the basic disease 
process. The danger of not defining clinical significance appropriately can be either that a good inter- 
vention is not used in practice (e.g., it may be rejected for a safety finding that is statistically significant 
but small) or that a poor intervention is allowed into practice (e.g., it may result in statistically significant 
improvements that are small). 

Ideally, if empirical data existed that established what amount of a given performance capacity such 
as visual information processing speed (VIPS) was necessary to perform a task (e.g., driving safely on 
the highway), then it would be possible to interpret statistically significant findings (e.g., a decrement in 
VIPS of a known amount) to determine if the change would be a magnitude that would limit the 
individual from successfully accomplishing the given task. Unfortunately, while general cause-effect 
relationships between laboratory-measured capacities and performance in high-level tasks are evident, 
quantitative models do not yet exist, and completely objective interpretations of clinical significance are 
not yet possible. As such, the process by which clinical significance is determined is less well developed 
and structured. More recent concepts introduced by Kondraske [1988a, 1988fo, 1989] based on the use 
of resource economic principles (i.e., the idea of a threshold "cost" associated with lower-level variables 
typically employed in clinical trials for achieving a given level of human performance in any given high- 
level task) may be helpful in defining objective criteria for clinical significance. This approach directly 
addresses cause-and-effect relationships between performance capacities and high-level tasks with an 
approach much like that which an engineer would employ to design a system capable of performing a 
specified task. 

Despite known limitations, interpretation of clinical significance is always incorporated into clinical 
trials in some fashion. Any change in human performance for a given variable should be first documented 
to be statistically significant before it is considered to be a candidate for clinical significance (i.e., a 
statistically significant difference is a necessary but not a sufficient condition for clinical significance). 
Objective determination of clinical significance can be based in part on previous methods introduced 
by Potvin et al. [1985] in clinical trials involving neuromotor and central processing performance tests. 
They advocate the use of an objective criteria whereby a decrease or increase in a human performance 
capacity (with healthy test subjects, for example) should be greater than 20% to be classified as "clinically 
significant." Kondraske [1989] uses a similar approach that uses z-scores (i.e., number of standard 
deviation units form a population mean) as the basis for determining criteria. This accounts for popu- 
lation variability which is different for different performance capacity variables. A recent approach is to 
assess effect size, a statistical metric that is independent of sample size and which takes into account data 
variability. This method provides an objective basis for comparison of the magnitude of treatment effects 
among studies [Cohen, 1988; Ottenbacher & Barrett, 1991]. 
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154.3 Representative Application Examples 
Safety-Oriented Example: Drug-Alcohol Interaction 

In this section, selected elements of an actual clinical trial are presented to further illustrate methods and 
issues noted above. To maintain confidentialities, the drug under test is simple denoted as drug A. 

Identify Factors in Question. Upper respiratory infections are among the most frequent infections 
encountered in clinical practice and affect all segments of the general population. The pharmacologic 
agent of choice for upper respiratory infections are the antihistamines (reference drug), which possess a 
wide margin of safety and almost no lethality when taken alone in an overdose attempt. Although quite 
effective, antihistamines can produce several troublesome side effects (i.e., general CNS impairment, 
sedation, and drowsiness) and have been incriminated in automobile accidents as well as public trans- 
portation disasters. New drugs (e.g., drug A) are being developed to have similar benefits but fewer side 
effects. From prior animal studies, drug A has been shown to have minimal effects on muscle relaxation 
and muscular coordination as well as less sedative and alcohol-potentiating effects than those associated 
with classical antihistamines. Based on drug As history and concerns, the following factors in question 
were identified in this clinical trial: general CNS impairment, alcohol interaction, dizziness, drowsiness, 
mood, and slowness of movement/speed. The purpose of this investigation was to examine the effect of 
drug A on human performance capacity relative to a reference drug and placebo as well as the drug- 
alcohol potentiating interaction effects after multiple dose treatment. 

Select Performance Capacity Variables and Instruments. With the factors in question identified, the 
following performance capacity variables and their testing instruments were selected for the clinical trial: 
visual information processing speed (VIPS), finger tapping speed (TS), visual arm lateral reach coordi- 
nation (VALRC), visual spatial memory capacity (VSMC), postural performance (PP), digit symbol 
substitution task (DSST), and Bond-Lader Visual Analog Scale (BLS). These variables were collectively 
called the human performance capacity test battery (HPCTB) (Table 154.2). Due to space limitations, only 
one performance capacity variable (VMRS — visual information processing speed, i.e., VIPS) is discussed 
in this section. 

Experimental Design. The first primary objective of this investigation was to examine after 8 days of 
oral dosing the relative effects of drug A, reference drug, and placebo on the human performance capacity 
of healthy male and female adult volunteers (see Table 154.2). This effect was determined by examining 
the greatest decrease in human performance capacity from day 1 of testing (i.e., baseline at -0.5 hour 
drug ingestion time) to day 8 of testing at drug ingestion times of 0.0, 1.0, 2.0, 4.0, and 6.0 hours. The 

TABLE 154.2 Experimental Schedule for Human Performance Capacity Test Battery (HPCTB) 









Drug Ingestion 


Times (hours) 






Test Day 


-0.5 


0.0 


1.0 


2.0 


4.0 


6.0 


1* 
8+ 
9 U 


HPCTB 


HPCTB 
HPCTB 


HPCTB 
HPCTB 


HPCTB 
HPCTB 


HPCTB 
HPCTB 


HPCTB 
HPCTB 



Note: The human performance capacity test battery (HPCTB) was administered in the following 
order: VIPS, TS, VALRC, VSMC, PP, DSST, and BLS. 

*On day 1 of testing only, the -0.5 HPCTB was performed and utilized as the baseline value for test 
days 8 and 9. 

f The peak effect was determined by comparing baseline values (day 1) against the greatest detriment 
in performance for the eighth and ninth testing days at drug ingestion times of 0.0, 1.0, 2.0, 4.0, and 
6.0 hours. 

'On day 9 of testing, an alcohol drink was served immediately after drug administration and was 
ingested over a 15-minute period. 



) 2000 by CRC Press LLC 



second primary objective was to examine after 9 days of oral dosing the relative effects of drug A, reference 
drug, and placebo in combination with a single dose of alcohol served immediately after drug adminis- 
tration (male alcohol dose 0.85 g/kg body weight, female alcohol dose 0.75 g/kg body weight) on the 
human performance capacity of healthy male and female adult volunteers (see Table 154.2). This effect 
was determined by comparing the greatest decrease in performance on the ninth day of study drug 
treatment (at drug ingestion times of 0.0, 1.0, 2.0, 4.0, and 6.0 hours) to baseline on day one of testing 
(-0.5 hour drug ingestion time) for the HPCTB (see Table 154.2). Independent variables were treatment 
group (1 = drug A, 2 = reference drug, and 3 = placebo) and gender (male/female). The dependent 
variable was maximum decrease in performance (peak effect) for each human performance capacity 
study variable and was determined by comparing the baseline value (day 1) against the greatest decrease 
in performance at drug ingestion times of 0.0, 1.0, 2.0, 4.0, and 6.0 hours for day 8 and day 9 of testing. 
Inferential statistical analysis of the data was performed using a 3 (treatment group: reference drug, drug 
A, and placebo) by 5 (hour; 0, 1, 2, 4, and 6) mixed factorial ANOVA with repeated measures on hour 
for each dependent variable. Statistical tests of significance of all ANOVAs were conducted at the 0.05 level. 

Data Screening. For each dependent variable, baseline treatment data were compared against the criteria 
established for subject inclusion using each subject's performance score from the HPCTB to identify data 
anomalies. Then the expected increases/decreases in human performance capacity from baseline treat- 
ment data were compared against the drug therapy-influenced data to determine if these changes were 
within "reasonable" limits. Potential anomalies were detected in less than 5% of the data, with most 
occurring within records for only a few subjects. These cases were investigated, and in consultation with 
the principal investigator, decisions were made and documented to arrive at the official data set. 

Descriptive Statistical Analysis. Figure 154.2 illustrates the visual information processing speed (VIPS) 
for treatment groups reference drug, drug A, and placebo. On day 1 of testing (baseline), all treatment 
groups had similar VIPSs, with means between 5.7 and 5.8 stimuli per second. On day 8 of testing and 
at drug ingestion time of 2 hours, VIPS decreased to 4.9 stimuli per second for drug A, to 5.1 stimuli 
per second for reference drug and to 5.65 stimuli per second for the placebo. On day 8, the greatest group 
impairment of VIPS occurred at drug ingestion time of 4 hours. Drug A appeared to have the greatest 
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FIGURE 154.2 One performance capacity variable (visual information processing speed) for three treatment groups 
(drug A — the drug under test — reference drug, and placebo) taken from any actual efficacy-oriented study. Note 
changes from baseline to drug stabilization (day 8) and interaction with alcohol (day 9). See text for further 
explanation. 



© 2000 by CRC Press LLC 



impairment of VIPS (4.6 stimuli per second), followed by the reference drug group (4.9 stimuli per 
second). This decrease in performance was not present with the placebo group (5.61 stimuli per second). 
By the sixth hour after drug ingestion on day 8, VIPS improved toward baseline for treatment groups 
reference drug (5.1 stimuli per second) and drug A (5.3 stimuli per second), while the placebo group 
remained unchanged (5.58 stimuli per second). One day 9 of testing, all treatment groups received their 
assigned drug and alcohol. By 1 hour after drug ingestion, all treatment groups' VIPSs were impaired 
from baseline, with drug A showing the greatest impairment (4.3 stimuli per second), followed by 
reference drug (4.6 stimuli per second) and placebo (4.9 stimuli per second). All treatment groups' VIPSs 
improved toward baseline values by the second hour after drug ingestion (5.0 to 5.1 stimuli per second) 
and the fourth hour after drug ingestion (5.4 to 5.5 stimuli per second), respectively. By the sixth hour 
after drug ingestion, reference drug and drug A plateaued between 5.55 and 5.6 stimuli per second, 
respectively, while the placebo group improved slightly above the baseline value (5.8 stimuli per second). 
The basic pattern observed (i.e., decrease from baseline, return to baseline, impairment of all groups, 
including placebo, with alcohol) lend validity to the overall study. 

Statistical Analysis and Significance Test Results. On day 8 of testing, significant differences in VIPS for 
main effects treatment group and hour were observed. There was a significant interaction, however, 
between treatment group and hour of drug ingestion. Post hoc analysis demonstrated that treatment 
groups drug A and reference drug were significantly slower in VIPS than the placebo group at drug 
ingestion times of 2, 4, and 6 hours, but there was no difference in VIPS between treatment groups drug 
A and reference drug. (Simple main effects and simple, simple main effects showed that the placebo 
group had faster VIPSs than both reference drug and drug A treatment groups at drug ingestion times 
of 2, 4, and 6 hours.) 

On day 9 of testing when the drug-alcohol potentiating interaction effect was of primary interest, 
significant differences in VIPS occurred for main effects treatment group and hour, but there were no 
interactions between treatment group and hour of drug ingestion. Post hoc analysis demonstrated that 
treatment groups drug A and reference group were significantly slower in VIPS than the placebo group 
at 1 hour after drug ingestion only, but there was no statistically significant difference in VIPS between 
treatment groups drug A and reference group at any hour after drug-alcohol ingestion. Also, the drug- 
alcohol interaction was significant 1 hour after drug-alcohol ingestion for all treatment groups. 

Conclusions from Statistically Significant Findings. After 8 days of oral dosing, while a significant 
difference was found for both drug treatment groups compared with the placebo group, there was no 
significant difference in VIPS between drug A and reference drug treatment groups. The effect was 
apparent at approximately the same drug ingestion time of 4 hours. It was concluded that drug A had 
the same effect on safety as the reference drug in terms of CNS impairment as measured by VIPS. 

After 9 days of oral dosing in combination with alcohol consumption, drug A and reference drug 
caused a greater decrease in VIPS compared with the placebo group. This effect was most apparent at 
1 hour after drug ingestion. The placebo group also decreased its VIPS at 1 hour after drug ingestion at 
a statistically significant level. It was concluded after 9 days of oral dosing that drug A in combination 
with alcohol produced the same effect as the reference drug in combination with alcohol in terms of 
CNS impairment as measured by VIPS. 

Conclusions from Clinically Significant Findings. Since there was no statistically significant difference 
between drug A and the reference drug on either day 8 or day 9, the only findings requiring an interpre- 
tation of clinical significance are the statistically significant findings for both drug A and reference drug 
relative to the placebo group. On both day 8 and day 9, these findings are clinically significant using 
either the Potvin et al. [1985] percentage change or Kondraske [1989] z-score approach. Using the 
resource economic model for human performance [Kondraske, 1988a, 1988fo, 1989], it can be argued 
that, for example, during critical events during tasks such as driving, all of an individual's available VIPS 
resource is drawn on. Clearly, the substantial decrease in VIPS observed can compromise safety during 
such events. 
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Efficacy-Oriented Study: Experiences in Neurology 

Performance testing in clinical trials in neurology has grown tremendously within the last decade [Mohr & 
Brouwers, 1991], but within specific disease areas there remains reluctance to accept the replacement of 
traditional rating-scale evaluations of disease presence and progression by objective performance testing 
[Syndulko et al„ 1993]. The evolution of clinical evaluations in multiple sclerosis clinical trials illustrates 
this point. 

Evaluation of disability change or deterioration in multiple sclerosis (MS) historically has been limited 
to physician rating scales that attempt to globally summarize some or most of the salient clinical features 
of the disease. General consensus in the use of rating scales for MS has been achieved in the specification 
of a minimal record of disability (MRD) for MS that incorporates several physician- and paramedical- 
administered rating scales for evaluating MS disease status and its effects on the patient's life [International 
Federation of Multiple Sclerosis Societies, 1984]. However, there has been general dissatisfaction with 
the use of rating scales in MS clinical trials because of issues of lack or sensitivity to disease change, lack 
of interrater reliability, and simply the inherent reliance on subjective ratings of signs, symptoms, and 
unmeasured performance changes [Paty et al., 1992]. 

As an alternative to MS rating scales, comprehensive, quantitative evaluation of human performance 
was originally proposed and developed over 25 years ago for use in MS clinical trials [Tourtellotte et al., 
1965]. The first application of performance testing in a major MS clinical trial was the multicenter 
cooperative ACTH study [Rose et al., 1968]. The study proved that human performance testing could be 
incorporated into a multicenter MS clinical trial and that examiners at multiple centers could be trained 
to administer the tests in a standardized, repeatable fashion. The results generally supported fairly 
comparable levels of sensitivity among the outcome measures, including performance testing [Dixon & 
Kuzma, 1974]. Subsequent analyses showed that a priori composites that provided succinct summary 
measures of disease change in key functional areas could be formed from the data and that these composite 
measures were sensitive to treatment effects in relapsing/remitting MS patients [Henderson et al., 1978]. 
Despite the favorable results of the ACTH study, performance testing did not achieve general acceptance 
in MS clinical studies. Comprehensive performance testing as conducted in the ACTH trial was considered 
time-consuming, the instrumentation was not generally available, and despite the use of composites, the 
number of outcome measures remained too large. In a recent double-blind, placebo-controlled collabo- 
rative study in 12 medical centers, the study design included both rating-scale and performance measures 
as outcome assessments in the largest sample of chronic progressive MS patients studied to date [The 
Multiple Sclerosis Study Group, 1990]. Analyses of the change in performance composite scores from 
baseline over the 2-year course of the clinical trial showed that the drug-treated patients worsened signif- 
icantly less than the placebo-treated patients. In contrast, the EDSS and other clinical rating scores did 
not show significant treatment effects for the subset of MS patients at the same center [Syndulko et al., 
1993]. This indicated that the performance composites were more sensitive than the clinical disability 
measures to disease progression and treatment effects. A more recent comparative analysis of the full 
data set also supports the greater sensitivity of composites based on performance testing compared with 
rating scales to both MS disease progression and to a treatment effect [Syndulko et al., 1994a; Syndulko 
et al., 1944fo]. Although the biomedical community remains divided regarding the best outcome assess- 
ment in MS clinical trials, the accumulating evidence favors performance testing over clinical measures. 

Another type of efficacy-oriented study of particular note in the context of this chapter involves the 
combination of pharmacodynamic models with instrumented, objective, and sensitive performance 
capacity measurements to optimize therapeutic effectiveness by fine-tuning dose prescription for individual 
subjects. For example, the results from a Parkinson's disease study by Hacisalihzade et al. [1989] suggest 
that performance capacity measurements, which would play a key role in a strategy to determine phar- 
macodynamics of a drug for the individual (i.e., not a population), could become a component in the 
management of some patients receiving long-term drug therapy. 
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154.4 Future Needs and Anticipated Developments 

The field of clinical trials in which human performance variables are of interest has expanded at such a 
rapid rate in recent years that it has been difficult for researchers to keep up with all the new technologies, 
instrumentations, and methodologies. The type of evaluation employed has steadily shifted from mostly 
subjective to more objective, which can be argued to provide improved measurement quality. Increased 
standardization of human performance measurements has led to greater cost-effectiveness due to com- 
puter-based test batteries [Kennedy et al., 1993; Woollacott, 1983]. Thus, even if the quality of measure- 
ment were the same with methodologies that used less sophisticated methods, the initial investment 
would frequently be saved with newer data management techniques. This sentiment has not yet been 
accepted by all segments of the relevant communities. Also, the speed at which data can now be collected 
(which is especially important when changes over short time periods are of interest, as in pharmacody- 
namic studies) has led to greater cost-effectiveness. Furthermore, since instrumented devices are becom- 
ing commercially available and more widely used, it is not necessary nor desirable for researchers whose 
primary interest is the intervention of disease process to "reinvent" measurements (possibly with subtle 
but significant changes that ultimately inhibit standardization) for use in studies. Also, new measurements 
that may be different — but not necessarily better — must undergo long and expensive studies prior to 
their use in clinical trials. Although certain human performance measures are becoming standard in 
selected situations, more experience is needed before a substantial degree of standardization in human 
performance measurement is achieved in such situations. 

The perfect research strategy in clinical trials involving human performance capacities to determine 
therapy effectiveness and safety may not be possible due to financial and/or ethical considerations. Thus 
the challenge of clinical trials research in the future is to maintain scientific integrity while conforming 
to legal mandates and staying within economic feasibility; i.e., an optimization is almost always necessary. 
Multidisciplinary research efforts can be useful in these circumstances. 

Defining Terms 

Clinical significance: An additional level of interpretation of a statistically significant finding that 
addresses the size of the effect found. 

Clinical trial: A research study involving human subjects and an intervention (i.e., device, drug, surgical 
procedure, or other procedure) that is ultimately intended to either enhance the professional 
capabilities of physicians, improve the quality of life of patients, or contribute to the field of 
knowledge in those sciences which are traditionally in the medical field setting. 

Composite score: A score derived by combining two or more measured performance capacities. 

Efficacy-oriented clinical trial: A study that is conducted to objectively determine an intervention's 
effectiveness along with any pharmacokinetic or pharmacodynamic interactions. 

Nonprobability sampling: A method of sampling whereby no subject in a defined population has an 
equal chance of being included in the sample. 

Performance capacities: Dimensional capabilities or resources (e.g., speed, accuracy, strength, etc.) 
that a given human subsystem (e.g., visual, central processing, gait production, posture mainte- 
nance, etc.) possesses to perform a given task. 

Pharmacodynamics: The physiologic and/or metabolic changes in the concentration of a drug over 
time to corresponding measurable performance capacity changes in the therapeutic effect. 

Pharmacokinetics: The rate at which a drug is metabolized, as evidenced by changes in drug concen- 
trations in blood, cerebral spinal fluid, urine, etc. 

Probability sampling: A method of sampling whereby each subject in a defined population theoretically 
has an equal chance of being included in the sample. 

Protocol: A detailed statement of all procedures and methods to be employed in a research investigation. 
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Randomized clinical trial (RCT): A study that uses controlled trials whereby part of the group of 

subjects receives a therapy that is tested, while the other subjects receive either no therapy or 

another therapy. 
Resource economics: A cause-effect type of relationship between available performance resources (i.e., 

capacities) and demands of higher-level tasks in which the relevant subsystems are used. 
Safety-oriented clinical trial: A study that usually involves vital sign measures (e.g., heart rate, blood 

pressure, respiration rate), clinical laboratory tests (e.g., blood chemistries, urinalysis, ECG), and 

adverse reaction tests (both mental and physical performance capacities) to evaluate the risk of an 

intervention. 
Sample: A proportion of the defined population that is studied. When used as a verb (sampling), it 

refers to the process of subject selection. 
Significance (statistical) test: A tool used for assessing possible associations between independent and 

dependent variables as well as differences among treatment groups. 
Statistical significance: An evaluation of the research hypothesis at a specific level of probability so 

that any differences observed are not attributable to a chance occurrence. 
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The original monograph, Experimental and Quasi-Experimental Designs for Research, by D.T. Campbell 
and J.C. Stanley (Boston: Houghton Mifflin, 1964), is a classic experimental design text. The book 
Statistics in Medicine, by T. Colton (Boston: Little, Brown, 1982), examines statistical principles in medical 
research and identifies common perils when drawing conclusions from medical research. The book 
Clinical Epidemiology: The Essentials, R.H. Fletcher, S.W. Fletcher, and E.H. Wagner (2 nd ed., Baltimore: 
Williams & Wilkins, 1982), is a readable text with concise descriptions of various types of studies. Clinical 
Trials: Design, Conduct, and Analysis, By C.L. Meinert (New York: Oxford University Press, 1986), is a 
comprehensive text covering most details and provides many helpful suggestions about clinical trials 
research. R.J. Porter and B.S. Schoenberg, Controlled Clinical Trials in Neurological Disease (Boston: 
Kluwer Academic Publishers, 1990), is also a comprehensive text covering clinical trials with a special 
emphasis on neurologic diseases. 
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As early as 1700, Bernardino Ramazzini, one of the founders of occupational medicine, had associated 
certain physical activities with musculoskeletal disorders (MSD). He postulated that certain violent and 
irregular motions and unnatural postures of the body impair the internal structure [Snook et al., 1988]. 
Presently, much effort is directed toward a better understanding of work-related musculoskeletal disorders 
involving the back, cervical spine, and upper extremities. The World Health Organization (WHO) has 
defined occupational diseases as those work-related diseases where the relationship to specific causative 
factors at work has been fully established [WHO, 1985]. Other work-related diseases may have a weaker 
or unclear association to working conditions. They may be aggravated, accelerated, or exacerbated by 
workplace factors and lead to impairment of workers' performance. Hence obtaining the occupational 
history is crucial to proper diagnosis and appropriate treatment of work-related disorders. The occupa- 
tional physician must consider the conditions of both the workplace and the worker in evaluation of 
injured workers. Biomechanical and ergonomic evaluators have developed a series of techniques for 
quantification of the task demands and evaluation of the stresses in the workplace. Functional capacity 
evaluation also has been advanced to quantify the maximum performance capability of workers. The 
motto of ergonomics is to avoid the mismatch between the task demand and functional capacity of 
individuals. A multidisciplinary group of physicians and engineers constitutes the rehabilitation team 
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that will work together to implement the prevention measures. Through proper workplace design, 
workplace stressors could be minimized. It is expected that one-third of the compensatable low back 
pain in industry could be prevented by proper ergonomic workplace or task design. In addition to 
reducing the probability of both the initial and recurring episodes, proper ergonomic design allows earlier 
return to work of injured workers by keeping the task demands at a lower level. Unfortunately, ergono- 
mists are often asked to redesign the task or the workplace after a high incidence of injuries has already 
been experienced. The next preventive measure that has been suggested is preplacement of workers based 
on the medical history, strength, and physical examinations [Snook et al., 1988]. Training and education 
have been the third prevention strategy in the reduction of musculoskeletal disorders. Some components 
of these educational packages such as "back schools" and the teaching of "proper body mechanics" have 
been used in the rehabilitation phase of injured workers as well. 

Title I of the Americans with Disability Act [ADA, 1990] prohibits discrimination with regard to any 
aspect of the employment process. Thus the development of preplacement tests has been impeded by 
the possibility of discrimination against individuals based on gender, age, or medical condition. The ADA 
requires physical tests to simulate the "essential functions" of the task. In addition, one must be aware 
of "reasonable accommodations," such as lifting aids, that may make an otherwise infeasible task possible 
for a disabled applicant to perform. Healthcare providers who perform physical examinations and provide 
recommendations for job applicants must consider the rights of disabled applicants. It is extremely crucial 
to quantify the specific physical requirements of the job to be performed and to examine an applicant's 
capabilities to perform those specific tasks, taking into account any reasonable accommodations that 
may be provided. Hence task analysis and functional capacity assessment are truly intertwined. 

Work-related disorders of the upper extremities, unlike low back disorders, can better be related to 
specific anatomic sites such as a tendon or compressed nerve. Examples of the growing number of 
cumulative trauma disorders of the upper extremities and the neck are carpal tunnel syndrome (CTS), 
DeQuervain's disease, trigger finger, lateral epicondylitis (tennis elbow), rotator cuff tendinitis, thoracic 
outlet syndrome, and tension neck syndrome. The prevalence of these disorders is higher among some 
specific jobs, such as meat cutters, welders, sewer workers, grinders, meat packers, and keyboard opera- 
tors. Some of the common risk factors leading to pain, impairment, and physical damage in the neck 
and upper extremities are forceful motion, repetitive motion, vibration, prolonged awkward posture, and 
mechanical stress [Kroemer et al., 1994]. 

This chapter is intended to illustrate the application of some principles and practices of human 
performance engineering, especially quantification of human performance in the field of occupational 
medicine. I have selected the problem of low back pain to illustrate a series of concepts that are essential 
to evaluation of both the worker and the workplace, while realizing the importance of the disorders of 
the neck and upper extremities. By inference and generalization, most of these concepts can be extended 
to these situations. 

155.1 Principles 

Assessment of function across various dimensions of performance (i.e., strength, speed, endurance, and 
coordination) has provided the basis for a rational approach to clinical assessment, rehabilitation strat- 
egies, and determination of return-to-work potential for injured employees [Kondraske, 1990] . To under- 
stand the complex problem of trunk performance evaluation of low back pain (LBP) patients, the 
terminology of muscle exertion must first be defined. However, it should be noted that a number of 
excellent reviews of trunk muscle function have been performed [Andersson, 1991; Beimborn & Mor- 
rissey, 1988; Newton & Waddell, 1993; Pope, 1992]. I do not intend to reproduce this extensive literature 
here, since my motive is to provide a critical analysis that will lead the reader toward an understanding 
of the future of functional assessment techniques. A more extensive clinical application is provided 
elsewhere [Szpalski and Parnianpour, 1996; Parnianpour & Shirazi-adl, 1999]. 
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Impairment, Disability, and Handicap 

The tremendous human suffering and economic costs of disability present a formidable medical, social, 
and political challenge in the midst of growing healthcare costs and scarcity of resources. The WHO 
[1980] distinguished among impairment, disability, and handicap. Impairment is any loss or abnormality 
of psychological, physiologic, or anatomic structure or function — impairment reflects disturbances at 
the organ level. Disability is any restriction or lack of ability (resulting from impairment) to perform an 
activity in the manner or within the range considered normal for a human being — disability reflects 
disturbances at the level of person. Handicap is a disadvantage for a given individual, resulting from an 
impairment or a disability, that limits or prevents the fulfillment of a role that is normal (depending on 
age, sex, and social and cultural factors) for that individual. Since disability is the objectification of an 
impairment, handicap represents the socialization of an impairment or disability. Despite the immense 
improvement presented by the International Classification of Impairments, Disabilities, and Handicaps 
(ICIDH), the classification is limited from an industrial medicine or rehabilitation perspective. The 
hierarchical organization lacks the specificity required for evaluating the functional state of an individual 
with respect to task demands. 

Kondraske [1990] has suggested an alternative approach using the principles of resource economics. 
The resource economics paradigm is reflective of the principal goal of ergonomics: fitting the demands 
of the task to the functional capability of the worker. The Elemental Resource Model (ERM) is based on 
the application of general performance theory that presents a unified theory for measurement, analysis, 
and modeling of human performance across all different aspects of performance, across all human 
subsystems, and at any hierarchical level. This approach uses the same bases to describe both the 
fundamental dimensions of performance capacity and task demand (available and utilized resources) of 
each functional unit involved in performance of the high-level tasks. The elegance of the ERM is due to 
its hierarchical organization, allowing causal models to be generated based on assessment of the task 
demands and performance capabilities across the same dimensions of performance [Kondraske, 1990]. 

Muscle Action and Performance Quantification 

The details of the complex processes of muscle contraction in terms of the bioelectrical, biochemical, 
and biophysical interactions are under intense research. Muscle tensions is a function of muscle length 
and its rate of change and can be scaled by the level of neural excitation. These relationships are called 
the length-tension and velocity-tension relationships. From a physiologic point of view, the measured force 
or torque applied at the interface is a function of (1) the individual's motivation (magnitude of the neural 
drive for excitation and activation processes), (2) environmental conditions (muscle length, rate of change 
of muscle length, nature of the external load, metabolic conditions, pH level, temperature, and so forth), 
(3) prior history of activation (fatigue), (4) instructions and descriptions of the tasks given to the subject, 
(5) the control strategies and motor programs employed to satisfy the demands of the task, and (6) the 
biophysical state of the muscles and fitness (fiber composition, physiologic cross-sectional area of the 
muscle, cardiovascular capability). It cannot be overemphasized that these processes are complex and 
interrelated [Kroemer et al., 1994]. Other factors that may affect the performance of patients are misun- 
derstanding of the degree of effort needed in maximal testing, test anxiety, depression, nociception, fear 
of pain and reinjury, as well as unconscious and conscious symptom magnification. 

The following sections review some methods to quantify performance and lifting capability of isolated 
trunk muscles during a multilink coordinated manual materials handling task. Relevant factors that 
influence the static and dynamic strength and endurance measures of trunk muscles will be addressed, 
and the clinical applications of these assessment techniques will be illustrated. 

The central nervous system (CNS) appropriately excites the muscle, and the generated tension is 
transferred to the skeletal system by the tendon to cause motion, stabilize the joint, and/or resist the 
effect of external forces on the body. Hence the functional evaluation of muscles cannot be performed 
without the characterization of the interfaced mechanical environment. 
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The four fundamental types of muscle exertion or action are isometric, isokinetic, isotonic, and 
isoinertial. In isometric exertion, the muscle length is kept constant, and there is no movement. Although 
mechanical work is not achieved, physiologic work, i.e., static work, is performed, and energy is con- 
sumed. When the internal force exerted by the muscle is greater than the external force offered by the 
resistance, then concentric, i.e., shortening, muscle action occurs, whereas if the muscle is already 
activated and the external force exceeds the internal force of the muscle, then eccentric, i.e., lengthening, 
muscle action occurs. When the muscle moves, either concentrically or eccentrically, dynamic work is 
performed. If the rate of shortening or lengthening of the muscle is constant, the exertion is called 
isokinetic. When the muscle acts on a constant inertial mass, the exertion is called isoinertial. Isotonic 
action occurs when the muscle tension is constant throughout the range of motion. 

These definitions are very clear when dealing with isolated muscles during physiologic investigations. 
However, terminology employed in the literature of strength evaluation is imprecise. The terms are 
intended to refer to the state of muscles, but they actually refer to the state of the mechanical interface, 
i.e., the dynamometer. Isotonic exertion, as defined, is not as realizable physiologically because muscular 
tensions change as its lever arm changes despite the constancy of external loads. Special designs may vary 
the resistance level in order to account for changes in mechanical efficiency of the muscles. In addition, 
the rate of muscle length change may not remain constant even when the joint angular velocity is regulated 
by the dynamometer during isokinetic exertions. During isoinertial action, the net external resistance is 
not only a function of the mass (inertia) but also a function of the acceleration. The acceleration, however, 
is a function of the input energy to the mass. Hence, to fully characterize the net external resistance, we 
need to have both the acceleration and the inertial parameters (mass and moment of inertia) of the load 
and body parts. Future research should better quantify the inertial effects of the dynamometers, partic- 
ularly during nonisometric and nonisokinetic exertions. 

For any joint or joint complex, muscle performance can be quantified in terms of the basic dimensions 
of performance: strength, speed, endurance, steadiness, and coordination. Muscle strength is the capacity 
to produce torque or work by voluntary activation of the muscles, whereas muscle endurance is the ability 
to maintain a predetermined level of motor output — e.g., torque, velocity, range of motion, work, or 
energy — over a period of time. Fatigue is considered to be a process under which the capability of muscles 
diminish. However, neuromuscular adjustments take place to meet the task demands (i.e., increase in neural 
excitation) until there is final performance breakdown — endurance time. Coordination, in this context, is 
the temporal and spatial organizations of movement and the recruitment patterns of the muscle synergies. 

Despite the proliferation of various technologies for measurement, basic questions such as "What 
needs to be measured and how can it best be measured?" are still being investigated. However, there is 
a consensus on the need to measure objectively the performance capability along the following dimen- 
sions: range of motion, strength, endurance, coordination, speed, acceleration, etc. Strength is one of the 
most fundamental dimensions of human performance and has been the focus of many investigations. 
Despite the general consensus about the abstract definition of strength, there is no direct method for 
measurement of muscle tension in vivo. Strength has often been measured at the interface of a joint (or 
joints) with the mechanical environment. A dynamometer, which is an external apparatus onto which 
the body exerts force, is used to measure strength indirectly. 

Different modes of strength testing have evolved based on different levels of technologic sophistication. 
The practical implication of contextual dependencies on the provided mechanical environment of the 
strength measures must be considered during selection of the appropriate mode of measurement. In this 
regard, equipment that can measure strength in different modes is more efficient in terms of both initial 
capital investment, required floor space in the clinics or laboratories, and the amount of time it takes to 
get the person in and out of the dynamometer. 

155.2 Low Back Pain and Trunk Performance 

The problem of LBP is selected to present important models that could be used by the entire multidis- 
ciplinary rehabilitation team for the measurement, modeling, and analysis of human performance 
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[Kondraske, 1990]. The inability to relate LBP to anatomic findings and the difficulties in quantifying 
pain have directed much effort toward quantification of spinal performance. The problem is made even 
more complex by the increasing demand of the healthcare system to quantify the level of impairment of 
patients reporting back pain without objective findings. 

There are three basic impairment evaluation systems, each having their merits and shortcomings: 
(1) anatomic, based on physical examination findings, (2) diagnostic, based on pathology, and 
(3) functional, based on performance or work capacity [Luck & Florence, 1988] . The earlier systems were 
anatomic, based on amputation and ankylosis. Although this approach may be more applicable to the 
hand, it is very inappropriate for the spine. The diagnostic-based systems suffer from lack of correspon- 
dence between the degree of impairment for a given diagnosis and the resulting disability and even more 
from the lack of a clear diagnosis. A large percentage of symptom-free individuals have anatomic findings 
detectable by the imaging technologies, while some LBP patients have no structural anomalies. 

The function-based systems are more desirable from an occupational medicine perspective for the 
following reasons: They allow the rehabilitation team to rationally evaluate the prospect for return to 
light-duty work and the type of "reasonable accommodations" needed (such as assistive devices) that 
could reduce the task demand below the functional capability of the individual. By focusing on remaining 
ability and transferable skills rather than the disability or structural impairment of the injured worker, 
the set of feasible jobs can be identified. These points are extremely important, given the natural history 
of work disability after a single low back pain episode causing loss of work time: 40% to 50% of workers 
return to work by 2 weeks, 60% to 80% return by 4 weeks, and 85% to 90% return by 12 weeks. The 
small portion of disabled workers who become chronic are responsible for the majority of the economic 
cost of LBP. It is therefore the primary goal of the rehabilitation team to prevent the LBP, which is self- 
correcting in most cases, no matter what kind of therapy is used, from becoming a chronic disabling 
predicament. Injured workers should neither be returned to work too early nor too late, since both could 
complicate the prognosis. The results of functional capacity evaluation and task demand quantification 
should guide the timing for returning to work. It is clear that psychosocioeconomic factors become 
increasingly more important than physical factors as the disability progresses into "chronicity syndrome" 
and play a major role in defining the evolution of a low back disability claim. Future research should 
further establish the reliability and reproducibility of performance assessment tools to expedite their 
widespread use [Luck & Florence, 1988; Newton & Waddell, 1993]. 

Maximal and Submaximal Protocols 

Biomechanical strength models of the trunk are usually based on static maximal strength measurement. 
In real-life work situations, individuals rarely exert lengthy or maximum static effort. In most clinical 
situations, submaximal protocols are recommended, especially in patients with pain or with cardiovas- 
cular problems. Also, submaximal testing is less susceptible to fatigue and injury. The activities of daily 
living also have a great deal of submaximal efforts at the self-selected pace. Hence it has been argued that 
testing at the preferred rate may be complementary to the maximal effort protocols. The preferred motion 
can be solicited by instructing the subject to perform repetitive movement at a pace and through the 
range of motion which he or she feels is the most comfortable. It has been shown that LBP patients and 
normal individuals have different resisted preferred flexion/extension motion characteristics. Having the 
subject perform against resistance is based on the hypothesis that, at higher resistance levels, the sepa- 
ration between the performance levels of patients and normal subjects becomes more evident. It has 
been shown, for example, that functional impairment of trunk extensors in LBP patients with respect to 
the normal population is larger at higher velocities during isokinetic trunk extension. However, the 
proponents of unconstrained testing have argued that separation of these groups can be performed based 
on the position, velocity, and acceleration profiles of the trunk during self-selected flexion/extension 
tasks. They have noted that pain and fear of reinjury may become the limiting factors. The sudden surge 
in acquiring performance measures of LBP patients during the initial rehabilitation process also under- 
scores the validity of this concept. 
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Static and Dynamic Strength Measurements of Isolated Trunk Muscles 

Weakness of the trunk extensor and abdominal muscles in patients with LBP was demonstrated using the 
cable tensiometer to measure isometric strength. The disadvantage of the cable tensiometer (which records 
applied force) is that it neglects to measure the lever arm distance from the center of trunk motion. It is 
also recommended that cable tensiometer be used to determine peak isometric torques rather than the 
stable average torque exerted over a 3 -second period. Dynamometers used for testing dynamic muscle 
performances contain either hydraulic or servo motor systems to provide constant velocity, e.g., isokinetic 
devices, or constant resistance, e.g., isoinertial devices. The isokinetic devices can be further categorized 
into passive and active types. The robotics-based dynamometers can actively apply force on the body and 
hence allow eccentric muscle performance assessments, while only concentric exertions can be measured 
by the passive devices. Eccentric muscle action can stimulate the lowering phase of a manual materials 
handling task. Based on sports medicine literature, eccentric action has been implicated for its significant 
role in the muscle injury mechanism. Using isokinetic dynamometers, the isometric and isokinetic 
strengths of trunk extensor and abdominal muscles were shown to be weaker in LBP patients compared 
with healthy individuals. Dedicated trunk testing systems have become the cornerstone of objective func- 
tional evaluation and have been incorporated in the rehabilitation programs in many centers. 

Two issues of importance for future research are the role of pelvic restraints and the significance of 
using newly developed triaxial dynamometers as opposed to more traditional uniaxial dynamometers. 
Studies on healthy volunteers have shown that trunk motions occur in more than one plane — lateral 
bending accompanies the primary motion of axial rotation. Numerous attempts have been made to 
measure the segmental range of motion three-dimensionally in the lumbar spine with the purpose of 
quantifying abnormal coupling and diagnosing instabilities. 

The effect of posture on the maximum strength capability can be described based on the length-tension 
relationship of muscle action. Marras and Mirka [1989] studied the effect of trunk postural asymmetry, 
flexion angle, and trunk velocity (eccentric, isometric, and concentric) on maximal trunk torque pro- 
duction. It was shown that trunk torque decreased by about 8.5% of the maximum for every 15 degrees 
of asymmetric trunk angle. At higher trunk flexion angles, extensor strength increased. Complex, signif- 
icant interaction effects of velocity, asymmetry, and sagittal posture were detected. The ranges of velocity 
studies were more limited (±30 degrees per second) than those used customarily in spinal evaluation. 
Tan et al. [1993] tested 31 healthy males for the effects of standing trunk-flexion positions (0, 15, and 
35 degrees) on triaxial torques and did electromyograms (EMGs) of 10 trunk muscles during isometric 
trunk extension at 30%, 50%, 70%, and 100% of maximum voluntary exertions (MVE). Trunk muscle 
strength was significantly increased at a more flexed position. However, the accessory torques in the 
transverse and coronal planes were not affected by trunk postures. The recorded lateral bending and 
rotation accessory torques were less than 5% and 16% of the primary extension torque, respectively. The 
rectus abdominis muscles were inactive during all the tests. The EMGs of the erector spinae varied linearly 
with higher values of MVE, while the latissimus dorsi had a nonlinear behavior. The obliques were 
coactivated only during 100% MVE. The neuromuscular efficiency ratio (NMER) was constructed as the 
ratio of the extension torque over the processed (RMS) EMG of the extensor muscles. It was hoped that 
NMER could be used in clinical settings where generation of the maximum exertion are not indicated. 
However, the NMER proved to have a limited clinical utility because it was significantly affected by both 
exertion level and posture. The NMER of the extensor muscles increased at more flexed position. Studies 
that have combined the EMG activities and dynamometric evaluations have the potential of discovering 
the neuromuscular adaptation during different phases of injury and rehabilitation processes. 

Static and Dynamic Trunk Muscle Endurance 

The high percentage of type I fibers in the back muscles, in addition to the better vascularization of these 
muscle groups, contributes to their superior endurance. Physiologic studies indicate that at higher muscle 
utilization ratios (relative muscle loads), fatigue is detected earlier. Isometric endurance tests have been 
used to compute the median frequency (MF) of the myoelectrical activities of trunk muscles in both 

© 2000 by CRC Press LLC 



normal and LBP populations. The expected decline of the median frequency with fatigue is parameterized 
by the intercept (initial MF) and the slope of the fall. It has been shown that trunk range of motion 
(ROM) and isometric strength suffered from lower specificity and sensitivity than spectral parameters. 
Trunk muscle endurance does differ between healthy subjects and those reporting LBP. During isometric 
endurance testing, trunk flexors develop fatigue faster than extensors in symptom-free subjects. The 
flexor fatigability appeared significantly higher in patients with LBF as compared with controls. Chronicity 
also influences trunk muscle endurance. Chronic LBP patients showed reduced abdominal as well as 
back muscle endurance as compared with the healthy controls and lower back muscle endurance as 
compared with the intermittent LBP group. Individuals with a history of debilitating LBP demonstrated 
less isometric trunk extensor endurance than either normal individuals or patients with history of lesser 
LBP. 

Soft tissues subjected to repetitive loading, due to their viscoelastic properties, demonstrate creep and 
load relaxation. The loss of precision, speed, and control of the neuromuscular system induced by fatigue 
reduces the ability of muscles to protect the weakened passive structure, which may explain many 
industrial, clinical, and recreational injury mechanisms. These results further indicate the necessity of 
relating clinical protocols to the job and show how short-duration maximal isometric testing alone cannot 
provide the complex functional interaction of strength, endurance, control, and coordination. 

Parnianpour et al. [1988] studied the effect of isoinertial fatiguing of flexion and extension trunk 
movements on the movement pattern (angular position and velocity profile) and the motor output 
(torque) of the trunk. They showed that, with fatigue, there is a reduction of the functional capacity in 
the main sagittal plane. There is also a loss of motor control enabling a greater range of motion in the 
transverse and coronal planes while performing the primary sagittal task. Association of sagittal with 
coronal and transverse movements is considered more likely to induce back injuries; thus the effect of 
fatigue and reduction of motor control and coordination may be an important risk factor leading to 
injury-prone working postures. The endurance limit is a more useful predictor of incidence and recur- 
rence of low back disorders than the absolute strength values. Although physiologic criteria used in the 
National Institute for Occupational Safety and Health Lifting Guide [NIOSH, 1981] considered cardiovas- 
cular demands of dynamic repetitive lifting tasks, the limits of muscular endurance were not explicitly 
addressed. Future research should fill this gap, since the maximum strength measures should not guide 
the design decisions. Maximum level of performance can only be maintained for short periods of time, 
and muscular fatigue should be avoided to prevent the development of MSD. This caveat should be 
applied to all dimensions of performance capability [Kondraske, 1990]. 

A prospective, randomized study among employees in a geriatric hospital showed that exercising during 
work hours to improve back muscle strength, endurance, and coordination proved cost-effective in 
preventing back symptoms and absence from work [Gundewall et al., 1993]. Every hour spent by the 
physiotherapist on the exercise group reduced the work absence by 1.3 days. In this study, both training 
and testing equipment were very modest. Endurance training is based on exercises with high repetition 
and low resistance, while strength training requires exercise with high resistance and low repetition. 

Lifting Strength Testing 

The National Institute for Occupational Safety and Health [NIOSH, 1981] recommended static, i.e., 
isometric, strength measurements as its standard as its standard for lifting tasks. This was based on the 
evidence that associated LBP with inadequate isometric strength. The incidence of an individual's sus- 
taining an on-the-job back injury increases threefold when the task-lifting requirements approached or 
exceed the individual's strength capacity. However, lifting strength is not a true measure of trunk function 
but is a global measure taking into account arm, shoulder, and leg strength as well as the individual's 
lifting technique and overall fitness. It has been shown that strength tests were more valid and predictive 
of risk of low back disorders if they simulated the demands of the job. The clinicians must be aided with 
easy-to-use and validated instruments or questionnaires to gather information about the task demands 
in order to decide what testing protocol best simulates the applicant's spinal loading conditions. 
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Static strength measurements have been reported to underestimate significantly the loads on the spine 
during dynamic lifts. Comparing static and dynamic biomechanical models of the trunk, the predicted 
spinal loads under static conditions were 33% to 60% less than those under dynamic conditions, depend- 
ing on the lifting technique. The recruitment patterns of trunk muscles (and thus the internal loading 
of the spine) are significantly different under isometric and dynamic conditions. General manual mate- 
rials handling tasks require a coordinated multilink activity that can be simulated using classic psycho- 
physical techniques or the robotics-based lift task simulators. Various lifting tests, including static, 
dynamic, maximal, and submaximal, are currently available. The experimental results of correlational 
studies have confirmed the theoretical prediction that strength will be dependent on the measurement 
technique. Since muscle action requires external resistance, the effect of muscle action will depend on 
the nature of the resistance. These results refute the implicit assumption that a generic strength test exists 
that can be used for preplacing workers (preemployment) and predicting the risk of injury or future 
occurrence of LBP. The psychometric properties of isokinetic and isoresistive modes of strength testing 
were recently addressed. The quantification of the surface response of strength as a function of joint 
angle and velocity was only possible for isokinetic testing, while isoresistive tests yielded a very sparse 
data set. Figure 155.1 and 155.2 illustrate these points graphically. 

The wide conflicting results found in the literature regarding the relationship of an individual's strength 
to the risk of developing LBP may be due to inappropriate modes of strength measurements, i.e., lack 
of job specificity. Isometric strength testing of the trunk is still widely used, especially in large-scale 
industrial or epidemiologic studies, because it has been standardized and studied prospectively in indus- 
try. Compared with trunk dynamic strength testing protocols, the trunk isometric strength testing 
protocols are simpler and less expensive. 

One outstanding issue during dynamic testing is the unresolved problem of how the wealth of infor- 
mation can be presented in a succinct and informative fashion. One approach has been to compare the 
statistical features of the data with the existing normal databases. This is particularly crucial because one 
does not have the option of comparing the results to the "contralateral asymptomatic joint," as one has 




FIGURE 155.1 Bivariate distribution histogram of isokinetic trunk extension for 10 subjects. 
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FIGURE 155.2 Bivariate distribution histogram of isotonic trunk extension for 10 subjects. 

with lower or upper extremity joints. Given the large differences between individuals, I recommend 
comparison be made to job-specific databases. For example, it is more appropriate for the trunk strength 
of an injured construction worker to be compared with age- and gender- controlled healthy construction 
workers than with data from healthy college graduate students or office workers. However, given the 
scarcity of such data, I argue for comparison of performance capacity with job demand based on task 
analysis. The performance capacity evaluation is once again linked to task demand quantification. 

Inverse and Direct Dynamics 

A major task of biomechanics has been to estimate the internal loading of musculoskeletal structure and 
establish the physiologic loading during various daily activities. Kinematic studies deal with joint move- 
ment, with no emphasis on the forces involved. However, kinetic studies address the effect of forces that 
generate such movements. Using sophisticated experimental and theoretical stress/strain analyses, haz- 
ardous/failure levels of loads have been determined. The estimated forces and stresses are used to estimate 
the level of deformation in the tissues. This technique allows one to assess the risk of overexertion injury 
associated with any physical activity. Given repetitive motions and exertion levels much lower than the 
ultimate strength of the tissues, an alternative injury mechanism, the cumulative trauma model, has been 
used to describe much of the musculoskeletal disorders of the upper extremities. 

The experimental data on the joint trajectories are differentiated to obtain the angular velocity and 
acceleration. Appropriate inertial properties of the limb segments are used to compute the net external 
moments about each joint. This mapping from joint kinematics to net moments is called inverse dynamics. 
Direct dynamics refers to studies that simulate the motion based on known actuator torques at each joint. 
The key issue in these investigations is understanding the control strategies underlying the trajectory 
planning and performance of purposeful motion. A highly multidisciplinary field has emerged to address 
these unsolved questions (see Berme and Cappozzo [1990] for a comprehensive treatment of these issues.) 

It should be pointed out that determination of the external moments about different joints during 
manual materials-handling tasks is based on the well-established laws of physics (Fig. 155.3). However, 
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FIGURE 155.3 The dynamic analysis of a sagittal plane symmetrical isokinetic lift (load 21 kg, speed = 51 cm/s, 
mode = preferred method of lifting): (a) joint angle; (b) net muscular joint moment; and (c) the joint reaction forces 
at L5/S1. 
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the determination of human performance and assessment of functional capacity are based on other 
disciplines, e.g., psychophysics, that are not as exact or well developed. One can describe easily the job 
demand, in terms of the required moments about each joint, by analyzing the workers performing the 
tasks. However, one is unable to predict the ability to perform an arbitrary task based on the incomplete 
knowledge of functional capacities at the joint levels. A task is easily decomposed to its demands at the 
joint level; however, one cannot compose (construct) the set of feasible tasks based on one's functional 
capacity knowledge. The mapping from high-level task demands to the joint-level functional capacity 
for a given performance trial is unique. However, the mapping from joint-level functional capacity to 
the high-level task demand is one to many (not unique). The challenge to the human performance 
research community is to establish this missing link. Much of the integration of ergonomics and func- 
tional analysis depends on removal of this obstacle. The question of whether a subject can perform a 
task based on knowledge of his or her functional capacity at the joint level remains an area of open 
research. When ergonomists or occupational physicians evaluate the fitness of task demands and worker 
capability, the following clinical questions will be presented; (1) Which space should be explored for 
determining normalcy, fit, or equivalence? (2) Should we consider the performance of the multilink 
system in the joint space or end-effector (cartesian workspace)? These issues have profound effects on 
both the development of new technologies and the evaluation of trunk or lifting performance. 

The enormous degrees of freedom existing in the neuromusculoskeletal system provide the control 
centers both the kinematic and actuator redundancies. The redundancies provide optimization possibility. 
Since one can lift an object from point A to point B with infinite postural possibilities, it can be suggested 
that certain physical parameters maybe optimized for the learned movements. The possible candidates 
for objective function to be optimized are movement time, energy, smoothness, muscular activities, etc. 
This approach, though still in its early stage, may be very important for spine functional assessment. 
One could compare the given performance with the optimal performance that is predicted by the model. 
This approach provides specific goals and gives biofeedback with respect to the individual's performance. 

Comparison of Task Demands and Performance Capacity 

The regression analysis was used to model the dynamic torque, velocity, and power output as a function 
of resistance level during flexion and extension using the B-200 Isostation [Parnianpour et al., 1990]. 
Results indicated that the measured torque was not a good discriminator of the tenth, fiftieth, and 
ninetieth percentile population. However, velocity and power were shown to effectively discriminate the 
three populations. Based on these data, it was suggested that during clinical testing, sagittal plane 
resistance should not be set at higher than about 80 N-m in order to minimize the internal loading of 
spine while taxing trunk functional capacity. This presentation of data may be useful to the physician or 
ergonomist in evaluating the functional capacity requirements of workplace manual materials-handling 
tasks. For example, a manual material-handling task that requires about 80 N-m (61 ft-lb) of trunk 
extensor strength could be performed by 90% of the population in the normal database if the required 
average trunk velocity does not exceed 40 degrees per second, while only 50% could perform the task if 
the velocity requirement exceeds 70 degrees per second. More important, only the top tenth percentile 
population could perform the task if the velocity requirement approaches 105 degrees per second 
(Fig. 155.4). A few versions of lumbar motion monitors that can record the triaxial motion in the 
workplace have been used to provide the trunk movement requirements. The preceding example also 
illustrates the importance of having the same bases for evaluation of both task and the functional 
capability of the worker. 

155.3 Clinical Applications 

Clinical studies have utilized quantitative human performance, i.e., strength and endurance measures, 
to predict the first incidence or recurrence of LBP and disability outcome and also as a prognosis measure 
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FIGURE 155.4 The average extension velocity measured during maximum trunk extension against a set resistance 
for 10th (cross), 50th (triangle), and 90th (square) percentile distribution. 



during the rehabilitation process. Training programs to enhance the endurance and strength of workers 
have been implemented in some industries. More studies on the effectiveness of these programs are 
needed. It can be hypothesized that these programs complement the stress-management programs to 
enhance both worker satisfaction and coping strategies with regard to physical and nonphysical stressors 
at the workplace. 

Functional-based impairment evaluation schemes traditionally have used spinal mobility. Given the 
poor reliability of range of motion (ROM), its large variability among individuals, and the static psycho- 
metric nature of ROM, the use of continuous dynamic profiles of motion with the higher-order derivatives 
has been suggested. Dynamic performances of 281 consecutive patients from the Impairment Evaluation 
Center at the Mayo Clinic were used. As part of the comprehensive physical and psychological evaluation, 
281 consecutive LBP patients underwent isometric and dynamic trunk testing using the B200 Isostation. 
Feature extraction and cluster analysis techniques were used to find the main profiles in dynamic patient 
performances. The middle three cycles of movements were interpolated and averaged into 128 data points; 
thus the data were normalized with respect to cycle time. This allowed for comparison between individ- 
uals. Figure 155.5 presents the main profiles of sagittal trunk angular position. The number of patients 
in each group is also noted on the graph. Patients in the first (n = 48) and second (n = 55) groups had 
similar flexion mobility; however, those in the first group had more limited extension mobility. The time 
to peak sagittal position also varied among the five groups. Forty-seven patients in the fifth group showed 
extreme impairment in both flexion and extension. The third group (26 patients) showed differential 
impairments with respect to direction of motion. A marked improvement over the use of ROM has been 
achieved by preserving information in the continuous profiles. The LBP patients in this study are 
heterogeneous with respect to their movement profile. Uniform treatment of these patients is question- 
able, and rehabilitation programs should consider their specific impairments. Future research should 
incorporate the clinical profiles with these movement profiles to further delineate the heterogeneity of 
LBP patients. Marras et al. [1993] used similar feature-extraction techniques to characterize the move- 
ment profiles of 510 subjects belonging to normal (n = 339) and 10 LBP patient groups (n = 171). 
Subjects were asked to perform flexion/ extension trunk movement at five levels of asymmetry, while the 
three-dimensional movement of the spine was monitored by the Lumbar Motion Monitor (an exoskeleton 
goniometer developed at the Biodynamics Lab of The Ohio State University). Trunk motions were 
performed against no resistance, and no pelvic stabilization was required. The quadratic discriminant 
analysis was able to correctly classify over 80% of the subjects. The same technology was used to develop 
logistic regression models to identify the high-risk jobs in industrial workplaces. Hence principles of 
human performance can be applied successfully to the worker and the task to avoid the mismatch between 
performance capability and task demand. 
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FIGURE 155.5 The five principal profiles of trunk sagittal movement for 281 low back pain patients. 

The key limitations in the development of the discriminate functions for classification purposes, are 
the data-driven nature of the algorithms and the lack of theoretical orientation in the process of devel- 
opment and validation of these models. It is suggested that the mathematical simulation of flexion or 
extension trunk movement may identify an objective basis for the evaluation and assessment of trunk 
kinematic performance. A catalog of movement patterns that are optimal with respect to physical and 
biomechanical quantities may contribute to the emergence of a more theoretically based computational 
paradigm for the evaluation of kinematic performance of normal subjects and patients. It must be 
emphasized that in this paradigm one has no intention to claim that the central nervous system actually 
optimizes any single or composite cost function. 

To provide clinical insight for interpretation of the distinctive features in the movement profiles, 
Parnianpour et al. [1999] have suggested an optimization-based approach for simulation of dynamic 
point-to-point sagittal trunk movement. The effect of strength impairment on movement patterns was 
simulated based on minimizing different physical cost functions: Energy, Jerk, Peak Torque, Impulse, 
and Work. During unconstrained simulations, the velocity patterns of all models are predicted, while 
time to peak velocity is distinct for each cost function (Figure 155.6). Imposing an 80% reduction in 
extensor muscle strength diminished the significant differences between unimpaired optimal movement 
profiles (Figure 155.7). The results indicate that the search for finding the objective function being used 
by central nervous system is an ill-posed problem since we are sure if we have included all the active 
constraints in the simulation. The four application areas of these results are: (1) providing optimized 
trajectories for biofeedback to patients during the rehabilitation process; (2) training workers to lift safely; 
(3) estimating the task demand based on the global description of the job; and (4) aiding the engineering 
evaluation to develop ergonomic and workplace interventions which are needed to accommodate indi- 
viduals with prior disability. 



155.4 Conclusions 

The outcome of trunk performance is affected by the many neural, mechanical, and environmental factors 
that must be considered during quantitative assessment. The objective evaluation of the critical dimen- 
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FIGURE 155.6 The optimized unconstrained trunk flexion and extension trajectories for the five cost functions: 
Energy, Jerk, Peak Torque, Impulse, and Work. 

sions of functional capacity and its comparison with the task demands is crucial to the decision-making 
processes in the different stages of the ergonomic prevention and rehabilitation process. Knowing the 
tissue tolerance limits from biomechanical studies, task demands from ergonomic analysis, and function 
capacities from performance evaluation, the rehabilitation team will optimize the changes to the work- 
place or the task that will maximize the functional reserves (unutilized resources) to reduce the occurrence 
of fatigue and overexertion. This will enhance worker satisfaction and productivity while reducing the 
risk of the MSD. Based on ergonomic and motor control literature, the testing protocols that best simulate 
the loading conditions of the task will yield more valid results and better predictive ability. Ergonomic 
principles indicate that the ratio of the functional capacity to the task demand (utilization ratio) is critical 
to the development of muscular fatigue, which may lead to more injurious muscle recruitment patterns 
and movement profiles due to loss of motor control and coordination. However, large prospective studies 
are still needed to verify this. The most promising application for these quantitative measures is to be 
used as a benchmark for the safe return to work of injured workers, given the enormous variations within 
the normal population. With the advent of technologies to monitor trunk performance in the workplace, 
one can obtain estimates of the injurious levels of task demands (kinematic and kinetic parameters), 
which can be used to guide preplacement and rehabilitation strategies. The more functional the clinical 
tests become, the more clinicians need a complex interpretation scheme. An increasingly complex interpre- 
tation scheme opens the possibility of using mathematical modeling with intelligent computer interfaces. 
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FIGURE 155.7 The optimized trunk flexion trajectories for the five cost functions with the relative and absolute 
peak extensor strength impairments. The relative strength reduction meant that the extensor strength was reduced 
to 80% of the peak extension moment required during unconstrained simulations. The absolute strength reduction 
meant that the extensor strength was set to 200 Nm. 

The ability to identify subgroups of patients or high-risk individuals based on their functional perfor- 
mance will remain an open area of research to interested biomedical engineers within the multidisci- 
plinary group of experts addressing neuromusculoskeletal occupational disorders. 

Defining Terms 

Carpal tunnel syndrome (CTS): The result of compression of the median nerve in the carpal tunnel 

of the wrist. 
Cluster analysis: A statistical technique to identify natural groupings in data. 
DeQuervain's disease: A special case of tendosynovitis (swelling and irritation of the tendon sheath) 

which occurs in the abductor and extensor tendons of the thumb, where they share a common 

sheath. 
Electromyogram: Recordings of the electric potentials produced by muscle action. 
Feature extraction: A statistical technique to allow efficient representation of variability in the original 

signals while reducing the dimensions of the data. 
Lateral epicondylitis (tennis elbow): Tendons attaching to the epicondyle of the humerus bone become 

irritated. 
Rotator cuff tendinitis: The irritation and swelling of the tendon or the bursae of the shoulder that is 

caused by continuous muscle or tendon effort to keep the arm elevated. 



© 2000 by CRC Press LLC 



Tension neck syndrome: An irritation of the levator scapulae and trapezius group of muscles of the 

neck commonly occurring after repeated or sustained overhead work. 
Thoracic outlet syndrome: A disorder resulting from compression of nerves and blood vessels between 

the clavicle and the first and second ribs at the brachial plexus. 
Trigger finger: A special case of tendosynovitis where the tendon becomes nearly locked so that forced 

movement is not smooth. 
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Computer software applications have been implemented in virtually every aspect of our lives and the 
field of human performance has not proven to be an exception. Due to the growing recognition of the 
role of human performance engineering in many different areas (e.g., clinical medicine, industrial design, 
etc.) and the resulting increase in requirements of the methods involved, having the "right tool for the 
job" is becoming of vital importance. Software developers bear the brunt of the responsibility of deter- 
mining the qualities of a software application that define it as being "the right tool" for a specific job. In 
contrast, users of this class of tools must determine when their application extends a given tool beyond 
its intended scope. These abilities require a knowledge base spanning a number of different fundamental 
concepts and methods encompassing not only the obvious aspects of human performance and computer 
programing, but also many other less obvious issues related to database requirements, parameter stan- 
dards, systems engineering principles, and software architecture. In addition, foresight of how specific 
components can best be integrated to fit the needs of a particular usage is also necessary. 

This chapter addresses selected aspects of computer software tools specifically directed toward human 
performance design and analysis. The majority of tools currently available emphasize biomechanical 
models, and as such, this emphasis is reflected here. However, a much broader scope in terms of the 
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body systems incorporated is anticipated, and an effort is made to consider the evolution of more versatile 
and integrated packages. Selected key functional components of tools are described and a representative 
sample of currently emerging state-of-the-art packages is used to illustrate not only a snapshot of the 
capabilities now available, but also those which are needed and options that exist in terms of the 
fundamental approach taken to address similar problems. 

In general, the development of computer software applications, especially in maturing fields such as 
human performance engineering, serves multiple purposes. The most apparent is the relative speed and 
accuracy that can be achieved in computationally intensive tasks (e.g., dynamic analysis) compared to 
performing the processes by hand. In addition, computers can handle large amounts of data and help 
keep track of the multitude of parameters associated with the human architecture. This allows otherwise 
impossible procedures, such as the detailed analysis of a complex human-task interface, visualization of 
a human figure in a virtual workspace, or the computation of time-series multi-body joint torques, to 
be realized. Perhaps more important, though, are the indirect benefits provided. For example, relatively 
complex analytic methods utilized in research facilities can be directly implemented in the field by 
practitioners, with the need for only a complete knowledge of how to effectively use the capability available 
(which is substantially different than the knowledge required to create that ability). Also, because the 
software environment demands rigor (e.g., coding structure requirements, data handling and storage, 
etc.) and since the potential scope of end-user needs in terms of functional and parametric components 
encompassed is broad, decisions are forced and rules must be clearly delineated. These characteristics 
thus serve as a motivation to develop standards for (or at least decide upon the use of) many items 
including methods, parameters, parameter definition conventions, and units of measure. Such efforts 
can expose and correct inadequacies inherent in current standards while successful software tools (i.e., 
those which make their way into everyday use) facilitate dissemination of key knowledge and standards 
to both researchers and practitioners alike. 

It can be anticipated that most analysis or decision-making tasks required of practitioners will be made 
available in computer-based form, including some that are not currently feasible to perform at all except, 
perhaps, by using intuitive methods that are inconsistently applied and produce results of questionable 
validity. Given the extent of the possible list of tools within a given package, and considering the 
substantial overlap of various support functions, classification of packages is difficult. This can already 
be seen in the relatively modest number of those currently available. 

156.2 Selected Fundamentals 

It is typical to think of a software capability in terms of a high-level process, for example "gait analysis," 
where the end result is of primary interest since the underlying processes are performed "invisibly". Yet 
these supportive processes, many of which have common mathematical methodologies and modeling 
approaches, determine the ultimate usability and applicability of the software to a given problem and, 
therefore, deserve closer examination. It makes sense to consider these methods generically not only 
because they are common to several existing packages, but also because they will undoubtedly be 
important in others. 

Physics-based Models and Methods 

Physics-based methods are implemented when direct analysis of the system and/or its resources is feasible. 
With regard to the human system, such methods are commonly applied in biomechanical analysis where 
unknown parameters, like joint torque, are derived from known characteristics, like segment lengths and 
reaction forces, based on established physical laws and relationships. Given that relatively simple biome- 
chanical analyses typically include multiple segments and degrees of freedom, the feasibility of employing 
such methods depends not only on the availability of the required input data, but on the capability to 
process large amounts of information as well. As the analytical complexity of the task increases, e.g., with 
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the inclusion of motion, the significance of implementing such processes within a computer software 
application becomes readily apparent. 

Physical motion is common to most situations in which humans function and is therefore fundamental 
to the analysis of performance. Parameters such as segment position, orientation, velocity, and acceler- 
ation are derived using kinematic and/or dynamic analysis. This approach is equally appropriate for 
operations on a single joint system or linked multi-body systems, such as is typically required for human 
analysis. Depending on the desired output, forward (direct) or inverse analysis may be employed to 
obtain the parameters of interest. For example, inverse dynamic analysis can provide joint torque given 
motion and force data, while forward (direct) dynamic analysis uses joint torque to derive motion. 
Especially for three-dimensional analyses of multi-joint systems, the methods are quite complex and are 
presently a focal point for computer implementation [Allard, Stokes, and Blanchi, 1994]. 

Physics models used to address biomechanical aspects of human performance are relatively well 
established and utilized in both research and applied domains. In contrast, concepts and methods directed 
towards other aspects of human performance, such as information processing (in purely perceptual 
contexts as well as in neuromotor control contexts), are based on strong science and have been incor- 
porated into one-of-a-kind computer models but, to our knowledge have not been engineered into any 
general purpose computer tools. It can be argued that Shannon's information theory provides the basis 
for a similar set of "physics-based" cause-and-effect models to support the incorporation of this aspect 
of human performance into computer-based modeling and analysis tools. Considerable work has gone 
into the application of information theory to human information processing [e.g., Lachman, Lachman, 
and Butterfield, 1979] within research domains. The definition of neuromotor channel capacity [Fitts, 
1954] and metrics for the information content of a visual stimulus measured in bits [Hick, 1952; Hyman, 
1953] are some examples of the influence of information theory that have become well established. Given 
the considerable science that now exists, useful computer-based analytic tools that use these ideas as basic 
modeling constructs are likely to emerge. 

Inference-Based Models 

Often, direct measurement of a performance resource or structural element is not feasible or practical 
and estimates must be inferred from models based on representative populations. Inferential based 
methods, therefore, utilize derived relationships between parameters to provide an estimation (or pre- 
diction) of an unknown quantity based on other available measures. One type of modeling approach in 
such a situation is through statistical regression. This process utilizes data measured from a population 
of subjects with characteristics similar to the subject or population of interest, to derive a function which 
represents a "typical" relationship between the independent (measured) variables and the dependent 
(desired) variable. Specifically, this is achieved through the determination of the function that minimizes 
the error between the actual and the predicted value across all observations of the independent variable. 
This process results in an approximation of the desired parameter with a quantified standard error 
[Remington and Schork, 1985]. Applications of this method require that the dependent variable is 
distributed normally and with constant variance, and correspondingly, that an estimation of its typical 
value (for a given population) is desired. Applied to the human system, regression is often implemented 
to develop data models, which are then used to estimate unknown structural parameter values from 
known parameters, such as the estimation of body segment moments of inertia from stature and weight 
[McConville, Churchill, Kaleps, Clauser, and Cuzzi, 1980]. Regression has also been used to predict task 
performance from a wide variety of other variables (e.g., height, weight, age, other performance variables, 
etc.). A specific example is the determination of the maximum acceptable load during lifting tasks as a 
function of variables such as body- weight and arm strength [Jiang and Ayoub, 1987]. 

A conceptually different and relatively new example of an inferential model, motivated by human 
performance problems specifically, is Non-linear Causal Resource Analysis (NCRA) [Vasta and Kon- 
draske, 1994; Kondraske, 1988]. Quantitative task demands, in terms of performance variables that 
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characterize the involved subsystems, are inferred from a population data set that includes measures of 
subsystem performance resource availabilities (e.g., speed, accuracy, etc.) and overall performance on 
the task in question. This method is based on the following simple concept. Consider a sample of 
100 people, each with a known amount of cash (e.g., a fairly even distribution from $0 to $10,000). Each 
person is asked to try to purchase a specific computer, the "cost" of which is unknown. In the subgroup 
that was able to make the purchase (some won't have enough cash), the individual who had the least 
amount of cash provides the key clue. That "amount of cash availability" provides an estimate of the 
computer's cost (i.e., the unknown value). Thus, in human performance, demand is inferred from 
resource availabilities. 

Fidelity 

An important principle that has not often received the scrutiny it deserves is that of the level of fidelity 
desired, or perhaps more critically, necessary for a particular application. This includes anything that 
affects the range of applicability or quality of the results provided in a given application. Fidelity can be 
characterized in terms of three distinctly different components: (1) model scope, (2) computational 
quality (e.g., resolution limits, compounding of errors, convergence limits of numerical methods, etc.), 
and (3) data quality (e.g., noise, intrinsic resolution limits, etc.). Model scope considers the extent to 
which major systems are incorporated (e.g., neuromuscular, sensory, cognitive, etc.) and, as a separate 
issue, the extent to which these major subsystems are represented (e.g., peak torque and angle limits 
versus a three-dimensional torque-angle-speed envelope for neuromuscular systems of rotational joints). 

Parameter Conventions 

Often, the exact definition of parameters across (and even within) scientific disciplines varies, limiting 
communication of findings and inhibiting dissemination of knowledge. Careful selection and clear 
documentation of parameter conventions is an important principle in producing analytic software that 
can be understood, accepted, and used. 

Within the broad scope of parameters that could possibly be incorporated in analytic and other 
software tools, there are many parameter convention challenges that arise. Due to the fact that of the 
reported analyses in which various parameters appear to have been of restricted scope and largely special 
purpose, more generalized situations where convention is important have escaped standardization in 
terms broad enough to support all application needs. As one example, consider the description of relative 
orientation between two object-attached coordinate systems in three dimensions (in the context of the 
human system, this specifies joint angle). There are two basic forms of angle set representations, each 
derived in terms of the method of rotation of one coordinate system (attached to a moving object) about 
a specific axis: (1) Fixed angle representation, which involves referencing each rotation of the moving 
system to some fixed reference frame and, (2) Euler angle representation, indicating that each consecutive 
rotation of the moving system is referenced to the coordinate axes of its present orientation. Given 
multiple degrees of freedom, multiple angles result representing the amount of rotation about a specific 
axis and at a defined position in the order of rotations. The specification of these parameters define the 
associated angle set convention (i.e., fixed or Euler). Utilization of the terms "roll, pitch, and yaw" [e.g., 
Chao, 1980] in communicating this convention, originally used to describe ship and aircraft orientation, 
can also lead to confusion. This is not only due to the lack of similarity between the defined reference 
frame of an aircraft and that of a human segment, but also due to its altered definition within other 
disciplines (e.g., fixed angle representation [Spong and Vidyasagar, 1989] ). Thus, depending on the "type" 
of Euler angle used and the sequence of axes about which the rotations occur, two entirely different 
orientations are likely to result. This discussion does not even consider the clinical perspective on joint 
angles, where only angles measured in three orthogonal planes [Panjabi, White, and Brand, 1974] are 
considered. Despite the fact that the human architecture has remained constant (unlike many artificial 
systems), to our knowledge there is no standard convention that defines all angles in a total human link 
model for three-dimensional motion. 
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Data Formats 

The utilization of data, especially within a software environment, involves both communication and 
manipulation not only within a single stand-alone application, but often across facilities, databases, and 
platforms. Data formats, in this light, can be considered among the most important of the components 
fundamental to analytic software. Problematic effects may result from aspects including inconsistent 
adoption of terminology (e.g., endurance vs. fatigue), units of measure specifications, file structures, 
parameter coding, and database structure. CAD environments in traditional engineering disciplines (e.g., 
drafting, mechanical design, etc.) have confronted such issues with standardization such as the .DXF file 
formats (used for two and three-dimensional geometric drawings; even accepted by some numerical 
machining systems) and the Gerber format (used to communicate a printed circuit board specification 
to board manufacturers). Within the realm of human performance engineering, standards for data 
formats remain at the forefront of developmental needs. Some weak standards are emerging, such as the 
relatively consistent use of ASCII text files with either labeled or position-dependent parameters. However, 
there are currently no known agreed upon or d e facto standards for positions, labels, units of measure, etc. 

156.3 Scope, Functionality, and Performance 

As summarized in Fig. 156.1, CAD-like human performance software depends on conceptual issues as 
well as decisions regarding fidelity, functionality, and implementation. There is a also a great deal of 
interaction among these categories. Topics within each category were derived in part from a review of 
current packages (e.g., those described below) and an assessment of other issues and needs. These lists 
are intended to be illustrative and not exhaustive. While every major category applies to any given software 
tool, not all topics listed within a category will always be relevant. 

Conceptual issues include taxonomies on which a given package is based (e.g., basic approaches to 
motor control, system performance, task categorizations and analysis, data estimation, etc.), parameter 
choices and codification (e.g., hierarchical level of representation, identification in structured input/out- 
put file formats) and compliance with or deviation from accepted standards with the varied communities 
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FIGURE 156.1 Related aspects of human design and performance software describe the diversity and scope of 
underlying issues. 
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that deal with human performance. They may well deserve special recognition given the impressionable 
developmental stage of the class of software tools addressed. Perhaps because developers are so familiar 
with a given perspective or body of knowledge, key conceptual issues are often overlooked or incorporated 
de facto from previous work (e.g., research studies) that is similar (but perhaps not identical) to the 
intended purpose of a given package, which is frequently broader and more general than the research 
efforts or projects that inspired it. Software packages impose on users the constructs on which they are 
based and this may result in conflicts within an already structured environment. Conceptual foundations 
and approaches are not yet as cleanly defined in human performance as they are in disciplines such as 
electrical and mechanical engineering. This is further complicated by the wide variety of disciplines (and 
therefore educational backgrounds) represented by those with interest in participating in software devel- 
opment. Clear and complete disclosure of the conceptual foundation used by developers is thus helpful 
to both users (potential and actual) and developers. 

The concept and scope of fidelity in the present context has been delineated in the section titled 
"Fidelity" and is further represented by the subtopics in Fig. 156.1. We have chosen to apply it here in 
its broadest sense. In a fairly recent National Research Council panel on human performance modeling 
[Baron, Kruser, and Huey, 1990], recommendations were made toward problems regarding design and 
implementation issues. Though not specifically addressing computer software, the extension of the 
discussion is natural given present technology and implementation methods. While it was concluded 
that an all-inclusive model (i.e., high fidelity, in the sense that biomechanical, information processing, 
sensory and perceptual aspects, etc. are represented) might be desirable, it is highly unlikely that it could 
be achieved or would be useful, as the inherent complexity would impede effective usage. The basic 
recommendation made was to pursue more limited scope sub-models. The implication is that two or 
more computer-based versions of such sub-models could be integrated to achieve a wide range of fidelity, 
with the combination selected to meet the needs of particular situations. However, there is neither a 
general framework nor set of guidelines for developers of sub-models (i.e., an open-architecture concept) 
that would facilitate integration of relatively independent software development efforts. Thus, integration 
is left to the end-user and typically only cumbersome methods are available. 

Fidelity should be considered in combination with productivity and both must be carefully assessed 
against needs. For example, a package that provides the user with control over a large number of 
parameters (e.g., joint stiffness, balance control, etc.) that define the human system under analysis, 
typically allows for greater intrinsic fidelity and can be valuable if the ability to specify parameters at that 
level of detail is an absolute user requirement. If not, this level of specificity only increases the number 
of prerequisite steps to achieve a given analysis and leads to a more complex program in terms of operation 
and function (which is perhaps most evident in the user interface). Furthermore, more accurate results 
are not necessarily obtained. Programs that are structured to automatically rely on default parameter 
values that can be inspected or changed when desired serve both types of needs. However, concern is 
typically raised that "lazy" users will forego the entry of values more appropriate to the situation at hand. 
Thus, while software developers can provide features that greatly simplify procedures and recommenda- 
tions regarding their proper use, users bear the responsibility of choosing when to invoke such features. 
Fidelity issues are often viewed as "either"-"or" choices, when many times a "both" or "all" approach is 
quite feasible. This gives the user the decision-making power and responsibility for decisions that may 
affect quality of results. Such approaches also typically result in a greater user pool for a given package. 
How such flexibility is implemented, however, is critical. For example, if fidelity decisions are likely to 
be made once in a given installation, schemes that require decision-making with each analysis can prove 
to be cumbersome and ineffective. 

Functionality can be considered in terms of basic and special subcategories. Far too often, attention 
is given to special features while those which are more basic (such as import/export capabilities) are 
ignored. Most items listed in Fig. 156.1 under the "functionality" category are self explanatory, although 
it should be noted that features such as importing and exporting are more complex in the present context 
than one might anticipate due in part to the lack of standards for parameters representation (see above). 
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Developers must carefully select functionality and potential users must carefully evaluate the impact of 
those choices against their specific needs. For example, the processing power required to display and 
animate contoured multi-segment human figures in real-time and in three dimensions is substantial. 
Programs with this ability typically require high-end platforms and also require that a large fraction of 
the programming effort and user interface be used for this functionality. The addition of processes such 
as a dynamic analysis places even a greater stress on processing power and complexity. Visualization of 
human figures in three dimensions along with some environmental components is essential for some 
applications. For many applications, such as those in which numerical results are required and conveyed 
to others with simple printed reports, such visualizations are not required, or contribute little, to achieving 
the desired end purpose (although they are almost always viewed as "attractive" by potential users). 
Functionality options are further developed and exemplified in the discussion of actual software packages 
below (Section 156.4). 

The last of these four categories describes the support structure upon which the software packages are 
built. Software implementation in today's ever changing computer market requires careful thought. In 
addition to host platform and standard accessories, special input/output interfaces are often necessary 
(e.g., to allow direct access to laboratory acquired kinematic data). Support software for such special 
hardware is not always available on all platforms, preventing operation with analytic software in the same 
environment. Choice of programming language (supporting modules and libraries, platform diversity) 
and the operating system (Macintosh®, Windows™, Unix, etc.) are also critical. As exemplified in 
Section 156.4, no existing packages run on all platforms and under all operating systems, leaving users 
who wish to employ multiple packages with cumbersome and often costly solutions. 

As computing power at the single user level (i.e., personal computer) increases, there is little doubt 
that a full set of state-of-the-art features covering a broad scope of human performance and data needs 
could be integrated into a single package. At present, though, limitations are real and necessary. These 
are the result of specific trade-offs that must be considered during development when determining target 
users. Operational costs include software acquisition, maintenance, basic platform acquisition and out- 
fitting, and training costs. A large feature set and high performance are not always indicative of a software 
tool's value. Value is lost if a user's requirements are either unfulfilled or greatly exceed by the functionality 
and/or performance of a program. Unfortunately, general tradeoffs that are acceptable within different 
groups of users have not yet become evident. 

156.4 Functional Overview of Representative Packages 

In this section, a selected set of different types of software packages intended specifically for human 
design and performance analysis applications are described. At least one example of each type exists and 
is noted for reference and to illustrate the types of functionality developers are addressing in response 
to perceived needs. No endorsement of any example cited is implied. The packages included vary widely 
in function and performance and should not be considered to be directly comparable. Operational costs 
reflect this diversity, ranging from relatively inexpensive (e.g., approximately $100.00 plus the price of a 
moderately equipped DOS-based PC™) to the level defined by high-end workstations (e.g., in excess of 
$2,000 for software plus the cost of a Unix-based workstation). As with desktop computer applications, 
such as word processors and spreadsheets, the packages used to illustrate functionality are under constant 
development and changing feature-sets, performance, and cost are to be anticipated. 

The types of packages included illustrate fundamental options with regard to general or dedicated 
analysis (e.g., user-defined tasks vs. gait), different levels of analysis (e.g., "muscle" vs. "joint" levels in 
biomechanical analyses), body-function specific or general purpose analysis (e.g., gait vs. user-defined 
function), fidelity in terms of scope (e.g., biomechanical, sensory, neuromotor control, etc.), and target 
user orientation (medical vs. non-medical design; although overlap is possible). It should be noted that 
while particular packages are highlighted under certain categories, underlying features and functions are 
common across many applications. 
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Human Parameter Databases 

The need for data is evident in all aspects of human-related computer aided design and the analysis of 
human performance. In certain situations, requirements may be for a specific individual or a represen- 
tative population. In the former case, there are certain parameters that are not readily attainable (e.g., 
location of center of mass for a segment, inertia, etc.) and therefore must be estimated, as in the latter 
case, from normative values derived from studies. For a number of years, these estimates have been 
available in book form primarily as lookup tables. This format, however, does not take advantage of 
current technology and is not sufficient for use with software-based analytic tools. Some currently 
available packages, in addressing this need, have included data tables that are utilized exclusively within 
the program. Programs such as MannequinPro and SafeWork, (detailed below) for example, each include 
a significant database of anthropometric measures for various populations. The user specifies design 
parameter values by choosing characteristics such as gender, ethnic origin, and/or morphology repre- 
sentative of the desired population. Various other stand-alone database software packages, for example 
PeopleSize (targeted for clothing design), allow data to be exported, thereby providing at least indirect 
support for external applications. 

In addition to measures such as segment lengths, parameters including reach and range of motion for 
selected postures are frequently provided in databases (e.g., horizontal sitting reach, etc.). Because it is 
prohibitive (if not impossible) to measure and report all possible situations, this issue may be more 
adequately addressed through the use of parametric data models. Data derived from such models would 
provide estimates for all possible conditions as well as having the additional advantage of requiring lesser 
storage and management requirements. Caution is warranted with regard to issues such as model validity, 
statistical sample sizes and variations, combinatory effects of merging databases or data models, and 
traceability of data to original sources. 

Biomechanical, Muscle-Level Modeling and Analysis 

SIMM is a graphics-based tool for modeling and visualization of any human or animal musculoskeletal 
system and is directed at lower-level structures, i.e., individual muscles. The program allows users to 
model any type of musculoskeletal system using files which specify (1) the skeletal structure through 
polygonal surface objects, (2) the joint kinematics, and (3) the muscle architecture through specifications 
of a line of action for each muscle and isometric force data. SIMM contains various tools for observing 
and editing the model. The user can visualize and animate the model, display or hide the individual 
muscles (shown as line segments), manipulate joint angles, muscle forces, and moment arms. Joint 
torques (static) can be analyzed via plots and graphs and all of the muscle parameters may be edited 
including visual alteration of muscle lines of action. Finally, the user also has control over joint kinematics 
by altering the cubic splines used to control the animated joint movements. Developers claim that SIMM 
is aimed at biomechanics researchers, kinesiologists, workspace designers, and students who can benefit 
from visual feedback showing muscle locations and actions. 

Musculographics, the developers of SIMM, also produce an add-on package specifically directed toward 
gait analysis that provides ground reaction force vectors and color-highlighted muscle activity through 
animation of the model. The software is designed to input data files written by movement analysis systems 
and can display plots of data such as muscle lengths and ground reaction forces. Another application, 
the Dynamics Pipeline, is a general-purpose package that provides forward and inverse dynamics to 
calculate motions resulting from forces or torques required to generate a given motion, respectively. A 
recent version of SIMM was made available for Microsoft Windows, however the original Unix-based 
version is still available. 

Body-Function Specific Tools 

Gait Lab is a public domain software package that allows fundamental parameters of human gait to be 
derived, analyzed, and inspected. The program is divided into two main sections (i.e., sub-programs that 
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FIGURE 156.2 Muscle attachments and utilization (indicated by shading) for the lower extremities during gait, 
created and displayed using SIMM software. (Reprinted with permission from Musculo Graphics Inc.) 

run under the "shell" of the main application): (1) a mathematics section which provides processes for 
generating text files representing various gait parameters throughout a gait cycle and (2) a graphics section 
that creates plots of the parameters as well as a representation of gait through an animated stick figure. 
A manual [Vaughan, Davis, and O'Connor, 1992], available separately, covers the gait parameters and 
equations associated with the software package as well as the underlying principles for their use in gait 
analysis. The software accepts four different, custom-formatted data files (ASCII text) based on direct 
measurements of a human subject. These include anthropometric, kinematic (i.e., three-dimensional 
positions of anatomic landmarks), force plate, and EMG measures. Sample files are included for a healthy 
adult male, female, and an adult male with cerebral palsy. From this data, text files are created for lower 
extremity body segment parameters (mass, center of gravity position, and moment of inertia), linear 
kinematics (joint center, heel, and toe locations, segment reference frames), center of gravity (position, 
velocity, and acceleration), angular kinematics (joint angles, angular velocity and acceleration), and 
dynamics (joint forces and torques). The package can also display up to three simultaneous plots of these 
parameters in any combination. Time, percent of gait cycle, and marker position scales are also available 
for plotting against any of the variables. 

Gait Lab provides a useful function in its ability to generate the above mentioned parameters and is 
effective with its simplistic stick figure animation, allowing minimal processing power/speed requirements 
while providing the nuances of lower limb movement pertinent to gait analysis. The DOS-based software 
tested (no version number was indicated — copyright 1992), while certainly not published pre-Windows™, 
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forces the user into the key-command mode (no mouse interface is provided) of using the enter, arrow, 
and function keys to operate menus and functions. Though not difficult to become acquainted with, it 
is not intuitive in layout or operation relative to today's user interface standards. Also, with the exception 
of the anthropometry file which can be created from within the software, no means is provided for editing 
data files. If new files are to be utilized, they must be in the required format (which must be inferred by 
inspection of the example files provided with the package). 

Biomechanical, Joint-Level, Total Human Analysis 

ADAMS is a software package designed for mechanical design and system simulation in general. It is an 
extensive package incorporating specific components for both modeling and kinematic/dynamic analysis. 
What sets ADAMS apart from other non-performance based simulation/dynamics packages is the avail- 
ability of a module, AD AMSI Android, specifically designed to allow the user to create a human model. 
The human model may be edited as needed but is provided in a default configuration of a 15-joint body 
architecture with anthropometric parameters based on the 75th percentile male. Because the package 
focuses on mechanical systems in general, a great deal of control is provided to the user over the 
parameters of the model including contact forces, joint friction, damping, and non-linear stiffness as 
well as control parameters. Having this capability, the human model is able to affect control over objects 
in its environment as well as responding biomechanically to contact. Supplemental toolsets may be 
(custom) developed to provide proper control over model parameters such as motion resistance rela- 
tionships, joint strength and range of motion limits, and delays in neuromuscular response. For a given 
model, kinematic or dynamic simulations may be run and values for any parameter can be plotted on 
screen. Both forward and inverse dynamic analysis are available and graphically based facilities such as 
the specification of trajectories via mouse input are available. Modules for animation and optimization 
are being developed at present but are expected to be released by the time this handbook is published. 
Because ADAMS' capabilities are so extensive, it has become one of the most widely used mechanical 
system simulation packages. Outside of the realm of human performance analysis, the package is targeted 
at engineers and scientists who require complete observation and control of the model parameters across 
many hierarchical levels. While this format affords great fidelity and specificity in the analysis, there is 
also a great deal of complexity and detail that must be addressed in every new simulation run. Though 
performance assessment is not an inherent or recognized feature in the package, custom modules may 
be incorporated which would allow for direct comparison of system resource utilization and stress against 
the demands of the task. 

Visualization for Low-End Computer Platforms 

MannequinPro, while having inherent database functionality, is essentially a visualization tool that allows 
users to quickly create three-dimensional humanoid figures which may be represented in stick-figure 
form, as a wireframe "robot" with polygonal segments, or as its name implies, a mannequin with higher- 
resolution segments. These figures may be postured within preset (i.e., non-editable) range-of-motion 
limits, as well as viewed from many different perspectives. They are created automatically by the program 
according to user defined structural aspects chosen from a list of specifications for gender, population 
percentile, body type, age, and nationality. The package also provides a modest drawing (CAD-type) tool 
for creating various objects to be placed in the scene with the figure. Two additional features extend 
MannequinPro beyond the domain of a "simple" object rendering tool. The first is provided through 
animation effects, allowing the mannequin to walk via an internally generated gait sequence (a specific 
path may be defined through a virtual workspace), reach to a specific location, or move through any 
sequence of positions created on a frame-by-frame basis by the user. The second feature allows the user 
to specify forces acting on the mannequin's hands or feet from which static torques are calculated for 
both left and right wrists, elbows, shoulders, neck, back, hips, knees, and ankles. 
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As a visualization tool, MannequinPro provides a needed capability for a modest price and requires 
only a personal computer to run. Figures are easily produced to the anthropometric specifications avail- 
able and posing, reaching, and walking functions, though limited, are simple to control. The package is 
primarily operated from mouse input with standard point, click, and drag operations that should be 
familiar to most users of current commercial software. Once the scene is created with humans and objects 
in their workspace, it is up to the user to extract any information regarding function and performance 
from the visual images presented by the package. These features, as well as others, are also contained 
within the software package Jack and SafeWork (described below), which provide greater functionality 
but are targeted toward workstation environments. In summation, MannequinPro, with a good price to 
performance ratio, is an effective tool when human visualization with moderate graphics is required at 
the single-user platform level. 

Extended Visualization for the High-End Workstations 

Jack, developed within the University of Pennsylvania Center for Human Modeling and Simulation 
[Badler, Phillips, and Webber, 1993], and SafeWork are high-end, CAD-oriented design and visualization 
packages. Similar in function to MannequinPro (above), but with considerably more flexibility and detail, 
they are designed to allow human factors engineers to place a human in a virtual workstation environment 
while still in the design phase. Both extend far beyond the capabilities of MannequinPro, trading simplicity 
for specificity in that they provide users with control over a large range of variables regarding many 
aspects of the human figure and its virtual environment. Human models are fully articulated and have 
preset or user-definable anthropometric parameters including segment dimensions, joint limits, moments 
of inertia, and strength allow highly-specific representation. The total functionality of theses package are 
far too great to list the individual aspects here, but examples include reach, hand gripping, balance 
(including reorientation behavior), collision detection, light and camera (view) manipulation, animation, 
and kinematic/dynamic analysis. Jack also allows for real-time posture tracking of a human operator 
through position and orientation sensors. 

Similar to the functionality of ADAMS, these packages provide a great deal of control over the design 
and manipulation of the human figure. Because of the large number of controllable parameters and the 
difficulty involved in providing an interface allowing three-dimensional manipulation of high-end graph- 
ics in a two-dimensional environment, the majority of users will require training. Jack and SafeWork 
require a Silicon Graphics workstation to run. 




FIGURE 156.3 A human/workspace figure created and displayed using Jack software. (Reprinted with permission 
from the Center for Human Modeling and Simulation at the University of Pennsylvania.) 
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Total Human, Performance-based, Human-Task Interface Analysis 

While now under development, HMT-CAD represents yet a different perspective in that it is based 
primarily on performance modeling constructs (e.g., Kondraske, 1995) and is aimed at producing bot- 
tom-line assessments regarding human-to-task, human-to-machine, and eventually machine-to-task 
interfaces using logic similar to that used by systems-level designers. In a joint effort between Photon 
Research Associates, Inc. and the University of Texas at Arlington Human Performance Institute, perfor- 
mance models and decision-making strategies are combined with advanced multi-body dynamic analysis. 

HMT-CAD's initial development emphasized a top-down approach that set out with the goal of 
realizing a shell that would allow systematic incorporation of increasingly higher fidelity in a modular, 
step-wise fashion. The initial modules encompass biomechanical and neuromuscular aspects of perfor- 
mance that bases analyses on knowledge of range/extremes of motion, torque, and speed capacities. Gross 
total human (23 links and 41 degrees of freedom) and hand link models (16 links and 22 degrees of 
freedom each) are included. The Windows™ based tool provides a pre-defined framework for specifying 
human structure and performance parameters (e.g., anthropometry, strength, etc.) for individuals or 
populations. A task library is included based on an object-oriented approach to task analysis. HMT-CAD 
attempts to help communicate information about many parameters via a custom graphical-button 
interface. For example, one analysis result screen shows a human figure surrounded by buttons connected 
to body joints. Button color (e.g., red or green) communicates whether an analysis found limiting factors 
for the specified person in the specified task associated with that joint system. Using computer mouse 
input, clicking on red buttons produces a new window showing a list of performance capacities and stress 
levels associated with systems of that joint. Maximally stressed performance resources are tagged. HMT- 
CAD uses the same systems level modeling constructs for human and artificial systems. A novel technique 
that transparently draws upon a large set of data models as needed is used to provide "the best analysis 
possible" with the data provided by the user. This allows users to directly specify values for all parameters 
if desired, but does not require this for each analysis. 

In the present prototype, analysis scope is very limited (e.g., upper extremity, object moving tasks). 
Databases are being populated to allow a similar fidelity of analysis for trunk and lower extremity analyses. 
Development of an optimization engine based on minimizing stress across performance resources [Kon- 
draske and Khoury, 1992] to solve problems in which redundant approaches are possible is under way. 
No capability is currently provided for animation of human movement, as binary assessments (e.g., red 
and green buttons) and numerical results (stress levels on performance capacities) are emphasized. 
However, an interface to emerging general purpose animation tools is planned. Long-term development 
will proceed to increase fidelity in a stepwise fashion by the inclusion of performance capacities for other 
major systems (e.g., information processing associated with motor control) as well as additional perfor- 
mance capacities for major systems represented at any given time (e.g., the inclusion of neuromuscular 
endurance limits). 

156.5 Anticipated Development 

While it is relatively easy to outline the functionality and unique aspects of the software discussed above, 
it is a great deal harder to describe the level of difficulty required to achieve a desired end result and to 
characterize how well functions are performed. On-line "help" systems, multi-media components, asso- 
ciated software development tools, improved parameter measurement tools, and higher fidelity models 
will no doubt impact these aspects. 

As mundane as it may seem, perhaps nothing is more important to the overall advancement of this 
class of software tools than the development of standards for parameters used, parameter conventions, 
and data formats. It is likely that de facto standards will emerge from individual commercial endeavors. 
Due to the diversity of professionals involved, it is difficult to envision the evolution of "standards by 
committee". Observation of success stories in analogous efforts also supports this opinion. This, in turn, 
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FIGURE 156.4 A desktop environment of the HMT-CAD performance assessment package including system model 
development and analysis result reporting. 



will enable progress to be made in database and data model development, which is vital to many of the 
more routine functions such software will perform. Combined with modular object-oriented program- 
ming techniques, the emergence of true multi-tasking operating systems on desktop computers, and the 
support for inter-process communication (such as dynamic data exchange) will encourage developers. 
Many new tools, both similar to and completely different than those described here, can be anticipated. 
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Defining Terms 

Channel capacity: The maximum rate of information flow through a specific pathway from source to 
receiver. In the context of the human, a sensory-motor pathway (e.g., afferent sensory nerves, 
processors, descending cerebro-spinal nerve and a-motoneuron) is an example of a channel 
through which motor control information flows from sensors to actuators (muscle). 

Data model: A mathematical equation derived as a representation of the relationship between a depen- 
dent variable and one or more independent variables. Similar in function to a database, data models 
provide parameter values across conditions of other related parameters. 

Human-task interface: The common boundary between the human and a task specified by the demands 
of a task on the respective resources of the human and the capacities of the human's resources 
involved in performing the task. 

Linked multi-body system: A system of three or more individual segments joined (as an open or closed 
chain) in some manner with the degree of freedom between any two segments defined by the 
characteristics of the corresponding hinge. 

Open-architecture concept: A general methodology for the development of functional modules that 
may be readily integrated to form a higher-level system through well-defined design and interface 
constructs. 

Parameter conventions: Aspects and/or usage of a parameter that are specifically defined with respect 
to general implementation. 

Performance assessment: The process of determining the level or degree in which a system can perform 
a specific task given the demands required and the availabilities of the resources of the system 
which are involved in performing the task. 

Performance resource: Defined as a functional unit and an associated dimension of performance (e.g., 
knee extensor strength) that is available to a system. 

Virtual workspace: A representation of a three-dimensional physical workspace generated by computer 
software and displayed on a video monitor or similar device. This enables, for example, the 
inclusion and manipulation of computer generated objects within the virtual workspace such that 
designs can be tested and modified prior to manufacturing. 
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Further Information 

For explicit information regarding the software packages included in this chapter, the developers may be 
directly contacted as follows: 

ADAMS: Mechanical Dynamics, 2301 Commonwealth Blvd., Ann Arbor, MI 48105.USA, 800-88- 
ADAMS, http://www.adams.com 

Gait Lab: originally published by Human Kinetics, 1607 N. Market St., Box 5076, Champaign, IL 61825, 
(217) 351-5076, now available as public domain software on the International Society of Biome- 
chanics web page at http://isb.ri.ccf.org/isb/data/gaitlab/. 

HMT-CAD: Human Performance Institute, The University of Texas at Arlington, P.O. Box 19180, Arling- 
ton, TX 76019, (817) 272-2335 

Jack: Center for Human Modeling and Simulation, The University of Pennsylvania, 200 South 33rd St., 
Philadelphia, PA 19104-6389, (215) 898-1488, http://www.cis.upenn.edu/~hms/jack.html 

MannequinPro: HumanCAD, a division of Biomechanics Corporation of America, 1800 Walt Whitman 
Road, Melville, NY, 11747, (516) 752-3568, http://www.humancad.com 

PeopleSize: Open Ergonomics Ltd., Loughborough Technology Centre, Epinal Way, Loughborough, LEI 1 
3GE, UK, +44 (0) 1509 218 333, http://www.openerg.com/psz.htm 

SafeWork: Genicom Consultants Inc., 3400 de Maisonneuve Blvd. West, 1 Place Alexis Nihon, Suite 1430, 
Montreal, Quebec, CANADA, H3Z 3B8, (514) 931-3000, http://www.safework.com 

SIMM: Musculographics, Inc., 1840 Oak Ave., Evanston, IL 60201, (708) 866-1882, http://www.muscu- 
lographics.com 
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A broad perspective of issues related to those presented here may be found in: Matilla, M. and 
Karwowski, W., eds. 1992. Computer applications in ergonomics, occupational safety, and health, In: 
Proc. Int. Conf. Computer-Aided Ergonomics and Safety '92, Tampere, Finland, North-Holland Publishers, 
Amsterdam. 

Additionally, periodic discussions of related topics may be found in the following publications: CSE- 
RIAC Gateway, published by the Crew System Ergonomics Information Analysis Center. For subscription 
information contact AL/CFH/CSERIAC, Bldg. 248, 2255 H St., Wright-Patterson Air Force Base, Ohio, 
45433; IEEE Transactions on Biomedical Engineering or IEEE Engineering in Medicine & Biology Magazine. 
For further information contact: IEEE Service Center, 445 Hoes Lane, P.O. Box 1331, Piscataway, NJ, 
08855-1331. 

Numerous other related sites can be found on the Internet. Among those of interest are: 

Biomechanics World Wide: includes a very thorough listing of various topics related to biomechanics 

inlcuding links to educational, corporate, and related sites of interest, http://www.per.ual- 

berta.ca/biomechanics/b wwframe.htm 
Biomch-L: an internet list-server supporting discussions regarding topics relating to biomechanics. 

http://isb.ri.ccf.org/isb/biomch-l/. 
Motion Lab Systems, Inc.: a collection of information and links to a variety of Internet sites focused on 

Electromyography, Biomechanics, Gait Analysis, and Motion Capture. Includes both commercial 

and non-commercial sites, http://www.emgsrus.com/links.htm 
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Human performance engineering is a dynamic area with rapid changes in many facets and constantly 
emerging developments that affect application possibilities. Material presented in other chapters of this 
section illustrates the breadth and depth of effort required as well as the variety of applications in this 
field. At the same time, the field is relatively young, and as is often the case in such instances, some "old" 
issues remain unresolved and new issues emerge with new developments. With the other material in this 
section as background, this chapter focuses on selected issues of a general nature (i.e., cutting across 
various human subsystem types and applications) in three major areas vital to the future of the field: 
(1) human performance modeling, (2) measurement, and (3) data-related topics. In addition to the 
identification of some key issues, some speculation is presented with regard to the lines along which 
future work may develop. Awareness of the specific issues raised is anticipated to be helpful to practitioners 
who must operate within the constraints of the current state of the art. Moreover, the issues selected 
represent scientific and engineering challenges for the future that will require not only awareness but 
also the collaboration of researchers and practitioners from the varied segments of the community to 
resolve. 

157.1 Models 

It is important to distinguish various types of models encountered in human performance. They include 
conceptual, statistically based predictive models, predictive models based on cause and effect (e.g., 
[Kondraske et al., 1997]), and data models [e.g., Vasta and Kondraske, 2000]. Issues related to data models 
are described later in this chapter, while those related to the remaining types are discussed in this section. 
Traditionally, a tremendous amount of activity in various segments of the human performance com- 
munity has been directed toward development of statistically based predictive models (e.g., regression 
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TABLE 157.1 Representative Quotations Illustrating the Need for a Shift Toward the Development of Casual Models 
Quote Field Reference 

"... there has been little advancement of the science of ergonomics ... the degree of Human factors Smith, 1987 

advancement is so low that, in my opinion, ergonomics does not yet qualify as a science 

. . . There has been no progress in the accumulation of general knowledge . . . because 

nearly all studies have not been general studies." 
"... experiments are performed year after year to answer the same questions, those Human factors Simmon, 1987 

questions — often fundamental ones — remain unanswered . . . the methodology 

employed today is a hodgepodge of quick fixes that evolved over the years into a 

paradigm that is taught and employed as a sacrosanct, when in fact it is woefully 

inadequate and frequently incompetent ..." 
"... authors and users of these instruments lacked a conceptual model ... lackofawell- Rehabilitation Frey, 1987 

developed conceptual model is unfortunately characteristic of the entire history of the 

field ..." 

models). This remains the most popular approach. By their very nature, each of these is very application- 
specific. Even within the intended application, success of these models has ranged mostly from poor to 
marginal, with a few reports of models that look "very good" in the populations in which they were 
tested. However, the criteria for "good" have been skewed by the early performance achieved with these 
methods, and statistically significant rvalues of 0.6 have come to be considered "good." Even if predictive 
performance was excellent, the long-term merit of statistically based prediction methods can be ques- 
tioned on the basis that this approach is intrinsically inefficient. A unique and time-consuming modeling 
effort, requiring new data collection with human subjects, is required for every situation. At the same 
time, powerful computer-based tools are commonly available that facilitate the generation of what is 
estimated to be hundreds and perhaps thousands of these regression models on an annual basis. It appears 
that many researchers and practitioners have forgotten that one purpose of such statistical methods is 
to provide insight into the cause-and-effect principles at play and into generalizations that might be more 
broadly applied. Statistically based regression models will continue to serve useful purposes within the 
specific application contexts for which they are developed. However, the future of predictive modeling 
in human performance engineering must be based on cause-and-effect principles. The sentiment for the 
need to shift from statistical to more physically based models is present and growing in several of the 
related application disciplines, as evidenced by a sampling of quotations from the literature presented in 
Table 157.1. 

A National Research Council panel on human performance modeling [Baron et al., 1990] conducted 
one of the more broad-looking investigations regarding complex human performance models; i.e., those 
which consider one or more major attributes (e.g., biomechanical, sensory, cognitive, structural details, 
etc.) of a total or near-total human. The convening of this panel itself underscored the interest in and 
need for human performance models. As an interesting aside, while the panel's focus was on modeling, 
a large fraction of the efforts cited as background represented major software packages, most of which 
were developed to support defense industry needs. A major application for such models is in computer 
simulation of humans in various circumstances, which could be used to support prediction and other 
analysis needs. A series of useful recommendations was included in panel's report regarding the direction 
that should be taken in future work. However, recommendations were quite general in nature, and no 
specific plans were outlined regarding how some of these objectives might actually be achieved. None- 
theless, it was clear from the choice of previous work cited and from the recommendations that devel- 
opment of cause-and-effect that general-purpose models should receive priority. The working model 
presented for this section of this text [Kondraske, 2000] is one example of a causal, integrative model 
that reflects an initial attempt to incorporate many of this panel's recommendations. 

Whether primarily statistical or causal in nature, frustration with what has been characterized or 
perceived as "limited success" achieved to date in efforts to unlock and formalize the underpinnings of 
complex human functions such as speech, gait, lifting, and memory have led to attacks in recent years 
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on the basic reductionistic approach that has been pursued most aggressively. In fact, reductionism has 
been characterized as a "dead end" methodology — one that has been tried and has failed [Bunge, 1977; 
Gardner, 1985], as further evidenced by the following somewhat representative quote [Weismer & Liss, 
1991]: 

This view held the scientific community spellbound until people started looking past the technological 
issues, and asking how well the reductionist observations were doing at explaining behavior, and the 
answer was, miserably. Many microscopic facts had been accumulated, and incredible technological 
advances had been made, but the sum of all of these reductionist observations could not make good 
sense of macroscopic levels of movement behavior. 

Prior to total abandonment, it is perhaps wise not only to characterize the performance of reductionism 
but also to question why it might have failed thus far. For example, it may be prudent to entertain the 
proposition that reductionism may be "a correct approach" that has failed to date because of a problem 
with one or more components associated its implementation. Are we certain that this is not the case? 

One potential explanation for an apparent failure of reductionism is simply the manner in which 
manpower has been organized (i.e., the research infrastructure) to attack problems of the magnitude of 
those considered, for example, by the National Research Council panel noted above. With some excep- 
tions, the majority of funded research efforts are short term and involve very small teams. Is it possible 
that the amount of information "to reduce" is too much to produce results that would be characterized 
as "success" (i.e., does the equivalent of an undercapitalized business venture exist)? It is further observed 
that the value of reduction is often only demonstrable by the ability to assemble. As noted, the prime tool 
of assembly thus far has been statistical in nature, and relatively few causal models have been seriously 
entertained. With regard to human systems, it is clear that there are many such "items" to assemble and 
many details to consider if "assembly" — with high fidelity — is to result. Furthermore, it is clear that tools 
(i.e., special computer software) are needed to make such assembly efforts efficient enough to consider 
undertaking. Has the data management and analytical power of computers, which have only been readily 
available in convenient-to-use forms and with required capacities for less than a decade, been fully 
exploited in a "fair test" of reductionism? 

Reductionism is clearly a methodology for understanding "that which is." Drawing by inference from 
engineering design in general, a close relationship can be observed with systems engineering synthesis 
methods used to define "that which will be." In artificial systems (as opposed to those which occur 
naturally, such as humans), reverse engineering (basically a reductionistic method) has been used quite 
successfully to create functionally equivalent replicas of products. The methodology employed is guided 
by knowledge of the synthesis process. The implementation of reductionism in human performance 
modeling can possibly benefit from the reverse mental exercise of "building a human," which may not 
be so abstract given current efforts and achievements in rehabilitation engineering. In looking toward 
the future, it is fair to ask, "To what extent have those who have attempted to apply reductionism, or 
who have dismissed it, attempted to bring to bear methods of synthesis?" The real support for reduc- 
tionism is success obtained with methods based on this concept; new findings of an extremely encouraging 
nature are beginning to come forth (e.g., [Kondraske et al., 1997]. 

The issue of biologic variability is also frequently raised in the context of the desire to develop predictive 
models of human performance. It is perhaps noteworthy that variability is an issue with which one must 
deal in the manufacture of man-made products (i.e., particularly with regard to quality assurance in 
manufacturing), which are designed almost exclusively using causal principles. Taguchi methods [Bendell 
et al., 1988], a collection of mathematics and concepts, have been used in widespread fashion with 
remarkable success in modeling and controlling variability in the characteristics of a final product which 
is based on the combination of many components, each of which has multiple characteristics with values 
that also range within tolerance bands. Similarity of these circumstances suggests that Taguchi methods 
also may be valuable in human performance engineering. Investigations of this nature are likely to be a 
part of future work. 
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157.2 Measurements 

Standardization of Variables and Conditions 

Human performance literature is replete with different measures that characterize human performance. 
Despite the number and magnitude of efforts where human performance is of interest, a standard set of 
measurement variables has yet to emerge. Relatively loose, descriptive naming conventions are used to 
identify variables reported, leading to the perception of differences when, upon careful inspection, the 
variables used in two different studies or modeling efforts are the same. In other cases, the names of 
variables are the same, but conditions under which measures are acquired are different or not reported 
with enough detail to evaluate if numeric data are comparable (across the reports in question) or not. 

The needs of science and engineering are considerably different. Although standardization is generally 
considered to be desirable in both arenas, the former stresses only that methods used be accurately 
reported so that they may be replicated. There has been, unfortunately, little motivation to achieve 
standardization in any broad sense in the human performance research world. In traditional areas of 
engineering, it is the marketplace that has forced the development of standards for naming of variables 
such as performance characteristics of various components (e.g., sensors, actuators, etc.) and for condi- 
tions under which these variables are measured. In more recent years, product databases (e.g., for 
electronic, mechanical, and electromechanical components) and systems-level modeling software (i.e., 
computer-aided design and computer-aided manufacturing), along with the desire to reduce concept- 
to-production cycle times, have further increased the levels of standardization in these areas. 

Despite the somewhat gloomy circumstance at present, there are some signs of potential improvement. 
An increasing number of research groups are beginning to attack problems of larger scale, and interest 
is growing with regard to the ability to exchange data and models. Several small standards development 
efforts have surfaced recently in specific areas such as biomechanics to develop, for example, standards 
for kinematic representation of various body joints. The advent of an increasing number of commercially 
available measurement tools also has driven a trend to similarity in conventions used in some specific 
subareas. However, the quest for standardization in measurement that is often voiced is still a distant 
vision. A willingness to suffer the inconvenience of "standardizing" in the short run in exchange for a 
greater convenience and "power" over the long run must be recognized by a critical mass of researchers, 
instrument manufacturers, and practitioners before a significant breakthrough can occur. The large 
number of different professional bodies and societies that may potentially become involved in promul- 
gating an official set of standards will most likely contribute to slow progress in this area. 

New Instruments 

Instruments to acquire measures of both human structure and performance have been improving at a 
rapid pace, commensurate with the improvement of base technologies (e.g., sensors, signal conditioning, 
microprocessors, and desktop computer systems, etc.). Compared with one decade ago, a practitioner 
or researcher can today assemble a relatively sophisticated and broad-based measurement laboratory with 
commercially available devices instead of facing the burden of fabricating his or her own instruments. 
However, a considerable imbalance still exists across the profile of human performance with regard to 
commercial availability of necessary tools. This imbalance extends to the profile of tools seen in use in 
contexts such as rehabilitation and sports training. In particular, devices that measure sensory perfor- 
mance capacities are not nearly as commonplace or well developed as those which measure, for example, 
strength and range of motion. Likewise, there are few commercially available instruments that measure 
aspects of neuromotor control in any general sense other than, for example, in a selected task such as 
maintaining stable posture. The complexity and diversity of higher-level human cognitive processes also 
have hindered development of measurement instruments in this domain that possess any true degree of 
commonality in content across products. A wide variety of computer-based test batteries has been 
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proliferating that are more or less implementations of the great number of tests formerly administered 
in paper-pencil format. 

From a practitioner's perspective, prospects for the future are both good and bad. There is some 
evidence that suppliers of instruments that cover areas of the performance where there has been vigorous 
competition (e.g., dynamic strength) are experiencing market saturation and uncertainty on the part of 
consumers regarding what is "the best way" to acquire necessary information. This may force some groups 
to drop product lines, while others are expressing subtle interests in expanding measurement instrument 
product lines to fill empty niches. This latter behavior is good in that it will likely increase the scope of 
measurement tools that is commercially available. However, problems associated with standardization 
often confuse potential users, and this, in general, has slowed progress in the adoption of more objective 
measurement instruments (compared with subjective methods) in some application areas. This, in turn, 
has inhibited product providers from taking the risks associated with introduction of new products. 

Compared with the commercial availability of tools that characterize the performance of various 
human subsystems, instruments that quantify task attributes are much less prevelant. However, the 
perception that increased emphasis on this area will increase the utility of measures that characterize the 
human will likely motivate a substantial increase in the number and variety of products available for task 
characterization. In addition, factors such as the Americans with Disabilities Act (ADA) , which encourages 
work-site evaluations and modifications to facilitate employment of individuals with disabilities, and the 
increase in work-site related injuries such as carpal tunnel syndrome have led to an increased demand 
for such tools. 

There has been a subtle but noticeable developing sense of awareness in the various communities that 
there is room for more than one kind of instrument to measure a given variable. Thus debates regarding 
"the correct approach" which were commonplace are beginning to be replaced with debates concerned 
with determining "the best approach for a given situation." A prime example is in the area of strength 
measurement. Devices range from relatively inexpensive hand-held dynamometers that provide rapid 
measurements and meaningful results with neurologic patients to expensive devices with electric or 
hydraulic servomotor systems that are more suited, for example, for use in sports medicine contexts. 
This trend is quite healthy and is likely to spread. 

Measuring the Measurements 

Reliability and validity have been the key words associated with characterizing the quality of measures 
of human structure and performance [e.g., Potvin et al., 1985]. Often, these terms are used in a manner 
that implies that a given test "has it" or "does not have it" (i.e., reliability or validity) when in fact both 
reliability and validity should be recognized as continuums. 

There are traditional, well-known methods used in academic circles to quantify reliability. The result 
produced implies an interpretation of "how much" reliability a given test has, but there is no corresponding 
way to determine how much is needed (in the same terms that reliability of a test is measured) so that 
one could determine if one has "enough." Other aspects of the academic treatment of human performance 
measurement quality are similarly troublesome with regard to the implications for widespread, general 
use of measurements. For example, there is an inherent desire (and need) to generalize the results of a 
given reliability study for a given instrument and, at the same time, a reluctance on the part of those who 
conduct or consume such studies to do so in writing. The strictly academic position is that one can never 
generalize a reliability study; i.e., it applies only to a situation which is identical to that reported (i.e., that 
instrument, those subjects, that examiner, that room, that time of day, etc.). Thus one may ask, "What is 
the value of reporting any such study?" The purely academic view may be the most correct, but strict 
interpretation is completely useless to a practitioner who needs to reach a conclusion regarding the 
applicability of a given instrument in given application. The mere fact that reliability studies are reported 
implies that there is an expectation of generalization to some degree. The question is, "How much gener- 
alization can one make?" This issue has not been adequately addressed from a general methodological 
standpoint. However, awareness of this issue and the need for improved methods are growing. 
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What is the general "quality" of measurement available today? The following is offered as a reasonably 
"healthy" perspective on this complex issue: 

1. Technology has advanced to the point where it is now possible to measure many of the basic 
physical variables employing measurements such as time, force, torque, angles, linear distance, 
etc. very accurately, repeatably, and with high resolution without significant difficulty. 

2. Most of the differences on test-retest are due to limitations on how well one can reasonably control 
procedures and actual variability of the parameter measured — even in the most ideal test subjects. 

3. Many studies have been conducted on the reliability of a wide variety of human performance 
capacities. Some true generalization can be achieved by looking at this body of work as a whole 
and not from attempts to generalize from only single studies. 

4. Across the many types of variables investigated, results of such studies are amazingly quite similar. 
Basically, they say that if your instrumentation is good, and if you carefully follow established, 
optimal test administration procedures, then it should be possible to obtain results on repeat 
testing that are within a range of 5% to 20% of each other. The exact location achieved in this 
range depends in large part on the particular variable in question (e.g., repeated measures asso- 
ciated with complex tasks with many degrees of freedom such as lifting or gait will differ more 
than repeated measures of hinge joint range of motions, for example). In addition, the magnitude 
of the given metric will influence such percentage characterizations since errors are often fixed 
amounts and independent of the size of the quantity measured. Thus, when measuring quantities 
with small magnitudes, a larger variability (i.e., more like 20%) should be anticipated. One can 
usually determine an applicable working value in these straightforward, usable terms (e.g., 5% or 
20%) that allows direct comparison with an evaluation of needs by careful review of the relevant 
reliability studies. 

A major point of this discussion is to illustrate that traditional methods used to "measure the measure- 
ments" do not adequately communicate the information that current and potential users of measurements 
often need to know. Traditional methods were valuable to some degree in providing relative indicators 
within the context of academic group studies. However, methods that are more similar to those used to 
characterize measurement system performance in physics and other traditional engineering areas (e.g., 
accuracy, signal-noise-ratio concepts, etc.) are needed. This will help practitioners who must make single 
measurements on individual subjects (i.e., not populations) to support clinical decision making [Mayer 
et al., 1997]. As the manner in which measurements are used continues to change to include more of the 
latter, new interest and methods for quantifying the quality of measurements can be anticipated. 

157.3 Databases and Data Modeling 

Much research has been conducted to define measurements of human structure and performance and 
to collect and characterize data as a basis to increase understanding and make inferences in different 
contexts. A large number of problems of major medical and other societal significance require the use 
of such data (1) for diagnostic purposes (i.e., when assessing measures obtained from a subject under 
test, such as to determine the efficacy of an intervention or treatment in a medical context), (2) for 
modeling and simulation (e.g., analyses for return-to-work decisions of injured employees, to support 
job-site modifications for individuals with disabilities, to design specifications for virtual reality systems), 
(3) for status/capacity evaluations (e.g., disability determinations for insurance settlements, worker's 
compensation claims, etc.), (4) to obtain a better understanding of disease and aging processes (e.g., 
associated with such problems as falls in elderly populations, etc.), (5) to gain insight into the impact of 
environmental and occupational factors (e.g., as in epidemiological studies, such as in lead toxicity or 
carpal tunnel syndrome), and (6) to support ergonomic design of consumer products and living envi- 
ronments for use by the public in general. Despite such a diverse range in needs for basically the same 
information, there has been little — if any — attempt to organize, integrate, and represent available data 
in a compact, accessible form that can serve the cited needs. 
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FIGURE 157.1 Summary of the complexity of the problem associated with integrating human structural and 
performance data to realize compact representations (i.e., parametric data models) to support analyses in multiple 
application areas. 

Figure 157.1 illustrates the scope of the general problem and infers a potential approach to integration. 
At the most basic level, individual measurements for a given variable must be collected and databased. 
In addition, databases from published studies are required to identify specific gaps that limit either 
analytic functionality or fidelity in the types of tasks that data must support. Data from the literature 
are typically available only in summary form, i.e., in terms of means and standard deviations for defined 
populations. Future research is required to define and validate methods for integrating data (for example) 
from multiple studies to form a single, more robust data model for a given performance or structure 
parameter. This will involve testing data models developed from databases of individual measures and 
databases of studies against each other. Development of data warping methods (e.g., to adapt predictions 
of expected performance levels to subjects of different body sizes and/or ages) is required to provide 
models that would be transparent operationally and would allow analyses to proceed in data-limited 
situations with predictable tradeoffs in fidelity. In many areas, especially with regard to practitioners such 
as occupational and physical therapists, performance characterization has been viewed only unidimen- 
sionally. More multidimensional perspectives are needed for an adequate representation of fidelity [e.g., 
Kondraske, 2000]. Another major aspect of data modeling therefore involves the development of multi- 
variate models of performance envelopes [Vasta and Kondraske, 1997] for individual human subsystems 
(e.g., knee flexors, visual information processors, etc.). Such models would support, for example, pre- 
diction of available torque production capacity under different specified conditions (e.g., joint angle, 
speed, etc. ). In summary, a basic strategy is required for harvesting the vast amount of previously collected 
data to obtain compact, accessible representations. Means must be incorporated in this strategy to allow 
for integration of data from multiple sources and continuous update as new data as collected to enhance 
fidelity and range of applicability of composite models. 

As previously noted, individuals within the human performance community typically adopt a variety 
of terms or identification schemes for structures and parameters employed in both the execution and 
communication of their work. The sheer number of parameters and variety of combinations in which 
they may be required to meet analysis needs may be sufficient to justify a more formal approach [e.g., 
Kondraske, 1993], thereby facilitating the development of common data structures. The engineering 
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contexts in which data models can be envisioned to be used require more rigor and stability. Computer- 
based models and analyses beg for codification of terms and parameters (i.e., for databases, etc.). In this 
regard, these emerging applications bring to light the lack of precision in terminology and confusion in 
definitions. Moreover, the ability to integrate models and analysis modules developed independently 
within more narrow subsets of the field to address more complex problems requires standardized notation 
if for no other reason than for efficiency. Development of a standardized systematic notation is compel- 
ling; it is postulated that without such a notation, progress to the next level of sophistication cannot occur. 

157.4 Summary 

A relatively high degree of sophistication currently exists with regard to the science, engineering, and 
technology of human performance. The field is, nonetheless, relatively young and undergoing natural 
maturation processes. Remaining needs in terms of both conceptual underpinnings and tools (which are 
not independent) are enormous. Future progress, however, is likely to be more dependent on collaborative 
efforts of different types. The magnitude and nature of problems that are at the forefront demand the 
achievement of agreement among a critical mass of researchers and practitioners. Both the challenges 
and prospects for the future are significant. 
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Further Information 

This section of this Handbook entitled "Human Performance Engineering" contains chapters that address 
human performance modeling and measurement in considerably more detail. Manufacturers of instru- 
ments used to characterize different aspects of human performance often provide technical literature and 
bibliographies of studies related to measurement quality, technical specifications of instruments, and 
application examples. 
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The complexities of physiological systems often lead to the creation of models to aid in the 
understanding of measured experimental data and to predict new features of the system under 
study. Models may serve to compactly summarize known system properties, including bio- 
mechanical, chemical, and electrical phenomena. Experiments performed on the model yield new infor- 
mation that casts light on unknown mechanisms underlying these properties and stimulate the formu- 
lation of new questions and the design of new experiments. Further, such quantitative description often 
leads to a better understanding of the physical mechanisms underlying both normal and altered (diseased) 
system performance. Ultimately, modeling may facilitate clinical diagnosis of system failure at an early 
stage of disease development. 

Models begin conceptually; for example, the concept that a blood vessel behaves as a fluid-filled pipe. 
Concepts may be developed into physical models, e.g., using a latex tube to describe a blood vessel, upon 
which experiments are performed. Often, concepts are realized as mathematical models, whereby the 
concept is described by physical laws, transformed into a set of mathematical equations, and solved via 
computer. Simulations, distinct from models, are descriptions that mimic the physiological system. The 
quantitative nature of physiological models allows them to be employed as components of systems for 
the study of physiological control, illustrated in several of this section's chapters. 

Chapter 158 describes the iterative process of physiological modeling and presents three recent, widely 
different, modeling approaches in cardiovascular dynamics. The first, an arterial system model, combines 
features of two flawed classical methods into a hybrid model capable of interpreting a wide range of 
previously unexplained experimental observations. Next, a large-scale, distributed muscle model shows 
how mechanical description of muscle structure can predict a molecular mechanism responsible for heart 
muscle's complex mechanical function, i.e., structure predicts function. Finally, an approach for predict- 
ing ventricular hypertrophy based on cardiac function is presented, with function predicting structure. 
These latter two approaches serve to integrate molecular level and organ level function and structure. 

Chapter 159 shows how mass balance is used to study material kinetics, e.g., pharmacokinetics. The 
production, distribution, transport, and interaction of exogenous materials, such as drugs or tracers, and 
endogenous materials, such as hormones, are also described. Examples of both linear and nonlinear 
compartmental models are presented, as well as parameter estimation, optimal experiment design, and 
model validation. 

Chapter 160 demonstrates the importance of cardiovascular modeling in designing closed-loop drug 
delivery systems, such as intravenous infusion pumps for pharmacological agents. The chapter begins 
with an extensive review of segmental and transmission line models of the arterial system and time- 
varying elastance descriptions of the left ventricle. The author makes an argument for employing non- 
linear cardiovascular-linked pharmacological models to aid in controller design, since data from these 
models can improve, and reduce the number of, animal and human experiments, and can be used to 
simulate human conditions not easily produced in animals. 

Chapter 161 presents models used to study the continual interaction of the respiratory mechanical 
system and pulmonary gas exchange. Traditionally, the respiratory system is described as a chemo- 
stat — ventilation increases with increased chemical stimulation. Alternatively, the author proposes that 
quantitative description of the "respiratory central pattern generator", a network of neuronal clusters in 
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the brain, is a much more sophisticated and realistic approach. Study of this dynamic, optimized con- 
troller of a nonlinear plant is interesting from both physiological and engineering perspectives, the latter 
due to applications of these techniques for new, intelligent control system design. 

Chapter 162 introduces the use of neural network techniques and their application in modeling 
physiological control systems. As their name implies, neural networks are computational algorithms 
based upon the computational structure of the nervous system, and are characterized as distributed 
processing and adaptive. Neural networks have been used to describe the control of arm movements with 
electrical stimulation and the adaptive control of arterial blood pressure. 

Chapter 163 surveys methods of system identification in physiology, the process of extracting models 
or model components from experimental data. Identification typically refers to model specification or 
model estimation, where unknown parameters are estimated within the specified model using experi- 
mental data and advanced computational techniques. Estimation may be either parametric, where alge- 
braic or difference equations represent static or dynamic systems, or non-parametric, where analytical 
(convolution), computational (look-up tables), or graphical (phase-space) techniques characterize the 
system. 

Chapter 164 shows how modeling can propose mechanisms to explain experimentally observed oscil- 
lations in the cardiovascular system. Low frequency flow oscillations are measured in individual vascular 
beds as well as in the entire systemic circulation. The authors modified a common windkessel description 
of the heart's systemic arterial load to include autoregulation, whereby peripheral resistance changes 
continuously to maintain cardiac output. This control system is characterized by a slow and delayed 
change in resistance due to smooth muscle activity. Experiments on this new model show oscillations in 
the input impedance frequency spectrum, and flow and pressure transient responses to step inputs 
consistent with experimental observations. This autoregulation model supports the theory that low 
frequency oscillations in heart rate and blood pressure variability spectra (Mayer waves) find their origin 
in the intrinsic delay of flow regulation. 

Chapter 165 presents models to describe the control of human movement, primarily the dynamics of 
human extremities. Presented are the challenges associated with description of the extremities as rigid 
bodies using laws of mechanics such as D'Alembert's principle, the Newton-Euler method, and the 
LaGrange equations. Analytic limb models are presented first, described by sets of ordinary differential 
equations. Next, the complexities of limb movement control are described by nonanalytic models, for 
example an expert system denoted "artificial reflex control", learning algorithms and fuzzy logic. Finally, 
hybrid controller models, combining analytic and nonanalytic techniques, are discussed. 

Chapter 166 describes physiological models of the fast eye movement, or saccade, that enables the eye 
to move quickly from one image to another. This common eye movement is encountered when scanning 
from the end of one line of printed text to the start of another. Early quantitative models helped define 
important characteristics of saccades. Increasingly complex models of the oculomotor plant were derived 
using improved models of the rectus eye muscle. Finally, application of systems control theory and 
simulation methods led to a saccade generator model in close agreement with experimental data. 
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158.5 Summary 



In physics and chemistry a multitude of experimental observations can be captured in a small number 
of concepts, often referred to as laws. A law is a statement of the relation between quantities which, as 
far as is known, is invariable within a range of conditions. For example, Ohm's law expresses the ratio 
between the voltage difference across and the current through an object as a constant. In physiology, 
experimental observations can likewise be ordered by relating them to natural laws. In such complex 
systems several laws may play their part, and the process is then referred to as development of a model. 
Model design, therefore, must meet stringent requirements and serves to rein in undisciplined specula- 
tion. Modeling attempts to identify the mechanism(s) responsible for experimental observations and is 
fundamentally distinct from simulation, in which anything that reproduces the experimental data is 
acceptable. 

A successful model provides guidance to new experiments, which may modify or generalize the model, 
in turn suggesting new experiments. This iterative process is illustrated in Fig. 158.1. The quest for 
understanding of a body of experimental observations (block 1) leads to a tentative conceptual interpre- 
tation (block 2), which is transformed into quantitative statements by invoking established natural laws. 
The resulting mathematical model (block 3) consists of a set of equations that are solved generally with 
the aid of a computer. The equation solutions are quantitative statements about experimental observa- 
tions, some of which were not part of the original body of experiments, extending knowledge about the 
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FIGURE 158.1 Iterative inter- relationships between models and experiments. 

system under study. Consequently, new experiments become specified (block 4), adding to the store of 
experimental data. The enlarged body of experimental observations may cause modification in the 
investigator's conceptual interpretation, which in turn may give reason to modify the mathematical 
model. This iterative process should weed out flawed experiments and misconceptions in the mathemat- 
ical model. If the entire effort is successful, a comprehensive, quantitative, and verified interpretation of 
the relevant experimental observations will result. This procedure recently led to the discovery that 
William Harvey's view of the circulation is unduly restrictive [32]. 

The first case study of this chapter outlines the development of an arterial system model using a hybrid 
approach. Features of two separate, flawed classical methods are combined to interpret a wide range of 
previously unexplained experimental observations. In the course of interaction between model and 
experimentation, inclusion of greater detail may expand model complexity and size, rendering it too 
unwieldy as a component to a larger physiological system. Reduction in size, distinct from simplification, 
may be necessary. The former is based on understanding, allowing intelligent separation of major and 
minor effects. The latter implies arbitrary action based on intuition. This case study closes by demon- 
strating systematic reduction of the arterial model. 

A major challenge in the biosciences today is integration of a large body of molecular level information, 
with the ultimate goal of understanding organ function. Two different, novel modeling approaches to 
this goal are presented next. The first shows how a model based on muscle structure can predict a 
molecular mechanism responsible for a muscle's mechanical properties (function). The second presents 
an approach for predicting ventricular hypertrophy (structure) based on cardiac metabolic function. Due 
to space constraints, discussion is limited to the main points, however, details are available in the original 
research papers cited. 

158.2 Hybrid Approaches and Model Reduction: 
Arterial Dynamics 

Windkessel Model 

This early focus was to develop a non-invasive method to measure cardiac output from noninvasively 
obtained arterial pressure information with the aid of a model of the systemic arterial circulation. Hales 
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FIGURE 158.2 Windkessel description of arterial load. Adapted from [34]. 

(1733) believed that the aorta and major vessels act as a "compression chamber" which transforms 
pulsatile inflow from the heart into steady flow into the organs. Frank (1895) called this compression 
chamber a windkessel, after fire-fighting pumps that produce steady flow despite oscillating pumping. 

Cardiac output is the product of stroke volume, the volume pumped with each beat, and heart rate. 
Heart rate is readily obtainable from the palpated pulse. Stroke volume was to be estimated using the 
following windkessel model of the arterial load. Fig. 158.2 shows left ventricular inflow, Q(t) into a 
compliant chamber (windkessel), arterial pressure p, and volume V coupled to a single resistor, R s 
representing the peripheral resistance. Since all arterial properties are lumped, pulse wave velocity c is 
assumed infinite. The differential equation relating these variables is: 

i^ +— p=QLt) (158.1) 

E' dt R 

where E represents the windkessel elastance dp/dV . During diastole Q(t) = 0, leading to the solution: 



Pit) = P o e K ' 



.158.2) 



By measuring the exponential decay of an experimental pressure curve and the (finite) pulse wave 
velocity c, R s can be obtained, and thereby stroke volume V s from: 



V s = {dt)dt = —[p(.t)dt 



1158.3) 



The problem is that the pulse wave velocity is assumed to be both finite and infinite in the same model. 
This contradiction leads to errors in V s of up to a few hundred percent. A large number of efforts to 
improve accuracy by modifying this model has achieved little. 

Transmission Line Model 

The interpretation of pulse wave speed has fascinated many researchers. Initially considering the aorta 
as a simple uniform, infinitely long transmission line, the Webers [43], [44] proposed the second version 
of a finite wave propagation theory. Consider a uniform vessel segment (Fig. 158.3) with axial direction z, 
cross-sectional area S, and radius r. Shown are two points: z and z + dz from left to right. Application 
of Newton's second law gives: 
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FIGURE 158.3 Segment of blood vessel with length dz in the axial direction. Adapted from [34]. 
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where p is blood density, p is pressure, and v z is axial velocity. This expression may be rewritten as the 
equation of motion: 



dp dv 
— = p — z - 
dz dt 



The change in volume V with time is equal to the inflow minus the outflow: 



(158.5) 



S v{z\—v{z+dz\ 
which may be expressed as the equation of continuity: 



1 dV_ 

dt 



(158.6) 
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Relating S to p via LaPlace's law and Hooke's law, and combining the result with Eq. 158.5 gives 



where: 



dt 2 ~ ' dz 2 



hE 



(158.8) 
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(158.9) 



with E denoting Young's modulus of elasticity. This last expression, the Moens-Korteweg equation [30], 
[26], predicts a frequency independent wave velocity, c. Experimental measurements show c to be highly 
frequency dependent. In spite of many attempts, the conflict between theory and experiment proved 
unresolvable, even if the vessel was given a finite length [25]. 
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Combining the Windkessel and Transmission Line Theories 

Broadening the view from pulse wave velocity to include other phenomena as well, experimental studies 
show that the arterial flow pulse decreases moving away from the heart, as expected for a passive 
transmission line. The arterial pressure pulse, however, increases, seemingly violating energy conserva- 
tion. Further, input impedance Z in = P in /Q ln is frequency dependent, similar to wave velocity. An arterial 
model capable of describing all of these features results by combining the windkessel and transmission 
line ideas [33]. 

The actual arterial system has bifurcations which should produce reflected pressure and flow waves. 
It is unfeasible to describe each branching vessel due to their very large number (10 9 ). A compromise is 
to take a finite number of interconnected uniform (lumped) segments. Each of 125 segments is repre- 
sented by an electrical analog equivalent described by two partial differential equations in pressure and 
flow. The resulting transmission line, built of finite windkessel-like segments, was constructed as an 
electrical analog, well before the digital computer was sufficiently advanced, with voltage and current 
representing pressure and flow respectively. This model (Fig. 158.4) was the first to demonstrate c and 
Z in correctly. The increasing pressure paradox was shown to be the result of constructive wave summation. 
Thus, the modification of the model to include branching proved to be the crucial step that allowed 
interpretation of a wide range of puzzling experimental observations [45]. 

Model Reduction 

The level of insight with this large, distributed model permitted answering the question whether its size 
could be reduced, retaining the desired distributed system properties with a much reduced number of 
equations. A specific example, shown in Fig. 158.5, allows study of pulse wave transmission between the 
carotid and renal arteries [34]. This model reduces the number of equations from 250 to fewer than 40 
by lumping large areas of the system outside the area under study. 

If distributed system properties such as pulse wave reflection are not of interest, the arterial system 
load seen by the heart can be further reduced to a simple, three-element windkessel (Fig. 158.6). This 
concise model, denoted the Westkessel after its founder [46] is the most widely used arterial load 
representation today. It also allows accurate estimation of total arterial compliance, a quantity of clinical 
interest [40]. 

158.3 Predicting Function From Structure: 
Muscle Contraction 

Description of muscle contraction has essentially evolved into two separate approaches — lumped whole 
muscle models and specialized crossbridge models of the sarcomere. The former seeks to interpret 
muscle's complex mechanical properties with a single set of model elements. Muscle experiments measure 
muscle force and length subjected to isometric (fixed length) conditions, isotonic (fixed load) conditions, 
and transient analysis where either length or load is rapidly changed. 

Muscle force generation is believed to arise from the formation of crossbridge bonds between thick 
and thin myofilaments within the basic building block of muscle, the sarcomere. These structures, in the 
nm-jlm range, must be viewed by electron microscopy or X-ray diffraction, limiting study to fixed, dead 
material. Consequently, muscle contraction at the sacromere level must be described by models that 
integrate metabolic and structural information. 

Lumped Muscle Strip Models 

Mayow (1674) proposed that muscle contraction arises from changes in its constituent elastic fibers as 
a byproduct of respiration, introducing the concept of muscle as an elastic material that changes due to 
metabolic processes. E. Weber (1846) viewed muscle as an elastic material, or spring, whose stiffness 
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FIGURE 158.4 Electrical analog model of human systemic arterial circulation. Numbered boxes refer to windkessel- 
like segments, yielding 250 partial differential equations. This model combined features of the windkessel and 
transmission line schools of thought, producing a new approach capable of interpreting puzzling experimental 
observations. Adapted from [45]. 

depends on whether it is in a passive or active state. In 1890 Chauveau and Laulanie also considered 
muscle as an elastic spring with time dependent stiffness, and proposed that velocity of shortening plays 
a role in force production. Fick (1891) and Blix (1893) refuted a purely elastic description on thermo- 
dynamic grounds, finding that potential energy stored during stretching a strip of muscle was less than 
the sum of energy released during shortening as work and heat. Blix also noticed that at the same muscle 
length, muscle tension produced during stretch was greater than that during release. Tension would be 
the same during stretch and release for a simple spring. Further, Blix reported that he was unable to 
describe stress relaxation and creep in muscle with a visoelastic model. 
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FIGURE 158.5 Reduced model of the systemic arterial tree of Fig. 158.4. This particular example retains distributed 
model elements between the area of interest, the carotid artery to the renal artery, and reduces the total number of 
system equations from 250 to fewer than 40 by lumping the remaining arterial segments. Adapted from [34], 
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Hill (1922) coupled the spring with a viscous 
medium, thereby reintroducing viscoelastic muscle 
models, after noticing that external work done accel- 
erating an inertial load was inversely related to short- 
ening velocity. A two-element viscoelastic model is 
objectionable on mechanical grounds: in series (Max- 
well body), length is unbounded for a step change in 
force, in parallel (Voight body), force is infinite for a 
step change in length [37]. Further, Fenn (1923) 
reported that energy released by the muscle differs for 
isometric or shortening muscle. Subsequently, Hill 
(1939) developed the contractile element, embodied 
as an empirical hyperbolic relation between muscle 
force and shortening velocity. The contractile element 
was combined with two standard springs, denoted the 
parallel and series elastic elements. Current whole 

muscle models have, in general, incorporated Hill's contractile element, an empirically derived black- 
box force generator, in networks of traditional springs and dashpots (e.g. [31]). Since model elements 
are not directly correlated to muscle structure, there is no logical reason to stop at three elements — models 
with at least nine have been proposed [39]. 

This approach can be criticized on the following grounds. There exists little, if any, correlation between 
model elements and muscle anatomical structures. No single model has been shown to comprehensively 



FIGURE 158.6 Equivalent systemic arterial load 
reduced to one differential equation. This modified 
windkessel, the Westkessel, after [46], enjoys wide- 
spread use as a concise description of the load seen 
by the left ventricle. Z is characteristic impedance, C s 
is arterial compliance and R s is peripheral resistance. 
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describe different muscle loading conditions, e.g. isometric, isotonic, etc. Models tend to focus on 
description of a single type of experiment, and are typically based on a force-velocity relation measured 
under specific loading conditions. These models give little or no insight into the mechanism of contraction 
at the sarcomere level. Energy considerations, including the fundamental necessity of a biochemical 
energy source, have been ignored. 

Crossbridge Models 

Crossbridge models focus on mechanics at the sarcomere level. Prior to actual observation of crossbridge 
bonds, Huxley (1957) proposed a model whereby contractile force is generated by the formation of 
crossbridge bonds between myofilaments. This model was subsequently shown unable to describe mus- 
cle's transient mechanical properties. Extension of Huxley's ideas to multiple-state crossbridges has been 
very limited in scope, e.g. the Huxley and Simmons (1973) two-state model does not include crossbridge 
attachment, detachment or filament sliding, but rather describes only the mechanics of a single set of 
overlapping actin and myosin filaments. In general, crossbridge models dictate bond attachment and 
detachment by probability or rate functions. This approach has shifted emphasis away from muscle 
mechanics toward increasingly complex rate functions [12]. 

Distributed Muscle Model 

A large-scale, distributed muscle model was developed based on ultrastructural kinetics and possessing 
direct anatomic and physiologic relevance [36]. This model describes all of muscle's fundamental mechan- 
ical properties with a single set of assumptions: isometric and isotonic contractions, and force transients 
resulting from rapid changes in muscle length during contraction. Variations in muscle properties with 
current loading conditions arise from the dynamic model structure, rather than adaptation of a particular 
force-velocity curve to describe the contractile element, or of a particular bond attachment or detachment 
function as for previous models. The following assumptions define the model: 

• The sarcomere consists of overlapping thick and thin filaments connected by crossbridge bonds 
which form during activation and detach during relaxation. 

• The mass of the myofilaments is taken into account. 

• Crossbridge bonds have viscoelastic properties [22] and thus may exhibit stress relaxation and creep. 

■ In passive muscle, thick filaments exist in stable suspension at the center of the sarcomere, which 
contributes to the muscle's passive elastic properties [48]. 

■ Calcium ions serve as a trigger which allows bond formation between filaments. Bonds formed 
are stretched via biochemical energy, thereby converting chemical to potential energy and devel- 
oping force. This stretching results from crossbridge motion [22]. 

■ The number of bonds formed depends upon the degree of thick-thin filament overlap [16]. 

• Asynchrony in bond formation and unequal numbers of bonds formed in each half sarcomere, 
as well as mechanical disturbances such as shortening and imposed length transients, cause 
movement of thick with respect to thin filaments. Since myofilament masses are taken into account 
these movements take the form of damped vibrations. Such vibrations occur with a spectrum of 
frequencies due to the distributed system properties. 

• When the stress in a bond goes to zero the bond detaches [22]. 

• Myofilament motion and bond stress relaxation lead to bond detachment and therefore produce 
relaxation. 

Description of the Sarcomere 

Crossbridge bonds are described as linear viscoelastic material, each represented by a three-element 
spring-dashpot model (Fig. 158.7). During activation, the attached crossbridge head rotates, stretching 
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FIGURE 158.7 Schematic representation of a muscle fiber composed of N series sarcomeres each with M parallel 
pairs of active, viscoelastic bonds. Each bond is described by a three-element viscoelastic model. Adapted from [36]. 



the bond and generating force. A passive sarcomere is mechanically represented as myofilament masses 
interconnected by a matrix of permanently attached passive viscoelastic bonds. During activation, dynam- 
ically attached active bonds are formed as calcium ion and energy (ATP) allows. 

The sarcomere model is extended to a muscle fiber by taking N sarcomeres in series, each with M 
parallel pairs of active bonds. The model currently consists of 50 series sarcomeres, each with 50 parallel 
pairs of crossbridges. Bond formation is taken to be distributed in two dimensions due to muscle's finite 
electrical propagation speed and the finite diffusion rate of calcium ions from the sarcoplasmic reticulum. 
The large set of resulting differential equations (5300) is solved by digital computer. Since the instanta- 
neous number of crossbridge bonds, and therefore the model system equations, is continuously changing, 
the resulting model is strongly nonlinear and time-varying, despite its construction from linear, time- 
invariant components. Crossbridge bonds are added in a raster pattern due to the distributed system 
properties. After all bonds have been added, no further conditions are imposed. Crossbridge bonds 
subsequently detach simply from the small internal movements of myofilaments. Consequently, the model 
relaxes without separate assumptions. 

This model was subjected to a wide range of loading conditions. A single set of model parameters is 
sufficient to describe the main features of isometric, isotonic and quick release and stretch experiments. 
The model is sensitive to mechanical disturbrances, consistent with experimental evidence from muscle 
force curves [47], aequorin measurements of free calcium ion [1], and high speed X-ray diffraction 
studies [23]. The model is also consistent with sarcomere length feedback studies [22] where reduced 
internal motion delays relaxation. 

Muscle Contraction 

The outermost curve of Fig. 158.8 depicts an isometric contraction computed from the muscle fiber 
model. At each A t available bonds are attached, adding their equations to the model, and force is generated 
via bond stretching. For the twitch contractions in Figs. 158.8 and 158.9 bonds are attached from t = to 
0.5 sec. Since bond formation is not perfectly symmetrical, due to the raster attachment pattern, small 
force imbalances within thick-thin units develop, leading to small myofilament motions that tend to 
detach bonds as their force goes to zero. Consequently, at each At of solution bonds are added during 
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FIGURE 158.8 Outermost curve describes an isometric contraction computed from the distributed muscle fiber 
model. Also plotted is force computed during a quick release experiment, showing significant force deactivation. 
Adapted from [36]. 
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FIGURE 158.9 Quick stretches computed from the distributed muscle fiber model. Adapted from [36]. 
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activation and detach continuously. Despite this very dynamic state, Fig. 158.8 shows that the net iso- 
metric fiber force generated smoothly rises during muscle activation. Peak isometric force of 250 nN is 
in close agreement with recent experimental measurements on isolated guinea pig myocytes [47]. 

The time of peak force corresponds to the end of bond attachment. Physiologically, this point is 
associated with a lack of available bond attachment sites. This may be due to lack of calcium ion for 
release necessary inhibition of actin-myosin interaction by troponin/ tropomyosin, lack of myosin heads 
in a state of steric for attachment, or some other mechanism. Force relaxation then results simply from 
bond detachment due to the same internal myofilament motion mechanism. Net force smoothly recovers 
to the resting force level, in contrast to the dynamic motions of individual bonds and myofilamets. 
Surprisingly, this smooth relaxation occurs without the added feature of cross-bridge recycling, i.e. bond 
reattachment after detachment. 

Transient response to quick length change is a critical test of any muscle model. This distributed model 
predicts that rapid length changes enhance system vibration and thereby promote bond detachment and 
muscle relaxation, underlying muscle's force deactivation phenomenon. Figure 158.8 also shows force 
deactivation following quick release during an otherwise isometric twitch contraction. This particular 
release corresponds to 0.2% of sarcomere length over 10 msec. Quick release corresponding to 2% of 
sarcomere length applied during 10 msec was sufficient to bring fiber force to zero (not shown). Slower 
releases produce less force deactivation. These curves compare favorably with quick release transients 
measured on whole muscle strips and more recent experiments on isolated frog myocytes [4, 6], and for 
ferret papillary muscles [27]. 

Figure 158.9 shows quick stretches computed for the distributed fiber model. Stretches corresponding 
to 0.2% of sarcomere length were performed over 10 msec. Curves show rapid force overshoot, followed 
by rapid recovery and slow recovery back to the isometric force levels. Quick stretch response is in 
agreement with classic studies on muscle strip and recent studies on ferret papillary muscles [27]. 

The idea for this distributed muscle fiber model arose in 1990 [35]. At that time, muscle fibers were 
assumed to be functionally similar to muscle strips. Recently, experiments on isolated muscle fibers show 
this to be the case. Also, predictions from the model have recently been borne out, for example, peak 
isometric force [47]. Peak isometric stress measured on isolated rabbit myocytes (5.4 mN/mm 2 ) is very 
close to peak stress from rabbit papillary muscle strips (6.4 mN/mm 2 ) [3]. The distributed model 
generates peak isometric stress of 2.5 mN/mm 2 . Other muscle phenomena measured on the isolated fiber 
include: a quadratic force-length relation [3], inotropic changes in contractile state [4], quick release and 
stretch [27, 4] and isotonic contractions [38]. 

Physiological modeling strives to create a framework that integrates experimental observations. A great 
strength of modeling is its capacity for predicting new experiments and physiological phenomena 
(Fig. 158.1). The distributed muscle fiber model predicts that mechanical external vibration of muscle 
should enhance its internal vibration and promote bond detachment and therefore relaxation. To test 
this hypothesis ultrasound was used to vibrate muscle. A single millisecond pulse of ultrasound delivered 
to the in vivo frog heart during systole was observed to reduce developed aortic pressure [7], [8]. 
Experiments were designed to eliminate heating effects. Ultrasound pulses applied late in the twitch had 
particularly strong effects. After peak muscle force is generated, bond attachment ends so detachment 
via vibration has no mechanism for recovery. This force (pressure) deactivation was reproducible and 
not permanently damaging as beats without irradiation produced normal pressure magnitudes and time 
courses. A mechanical effect is likely since moving the heart slightly out of the ultrasound beam produced 
no deactivation effect, despite having the same electrical environment. Ultrasound delivered during 
diastole was found to produce a premature ventricular contraction, as depicted in Fig. 158.10, an unex- 
pected result. These experiments suggest that medical ultrasound may have strong effects on living tissue, 
and demonstrates how modeling can lead to the discovery of new physiological phenomena. 
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FIGURE 158.10 Premature ventricular contraction following ultrasound irradiation, during diastole, of the in vivo 
frog heart. The top trace is aortic pressure and below is the ECG. A lithotripter pulse of 20 MPa, 5 ms duration, is 
delivered at the vertical arrow. Adapted from [7]. 



158.4 Predicting Structure from Function: 
Cardiac Hypertrophy 

Integration of Micro- and Macro-Dynamics 

Modeling methodology often relies on the availability of structural parameters that have been determined 
from prior physical and quantitative experimentation. Alternatively, it is of vital interest to understand 
how a given structure arises, i.e., to understand nature's design. To accomplish this, it is necessary to 
allow the system to adapt to its external functional demands. Due to adaptive processes, a biological 
system may be undergoing synthesis and de-synthesis simultaneously. The balance of these opposing 
mechanisms results in a structure that meets the system's current functional demand. The advantage of 
this type of modeling approach is that it obviates measurement of every parameter. This enables the 
physiological modeler to provide theoretical insight outside the realm of conventional experimentation. 

Often models become increasingly complex in the attempt to study smaller levels of structural detail. 
Ultimately, there may be a minimum level of structure at which the function is known and relatively 
constant, for example, a cell or a cardiac sarcomere. This structure is denoted the micro-unit. The assembly 
of such units comprises the biological system and is denoted the macro-unit. The macro-unit function 
may also be known, characterized, for example, by ventricular pressure and flow. It is the arrangement 
and interaction of multiple micro-units that introduces complexity in modeling. To resolve this problem, 
the micro-unit function is integrated while structure is varied to identify structural rules. The resulting 
rules are usually simple and thereby offer a special advantage to the integrative approach. 

The concept of integrating micro-level function to solve for the macro-level of physiological function 
and overall geometry has been discussed earlier by Taylor and Weibel [41]. These researchers stated in 
qualitative terms that an organ system achieves "a state of structural design commensurate to functional 
needs resulting from regulated morphogenesis, whereby the formation of structural elements is regulated 
to satisfy but not exceed the requirements of the functional system." This process has been termed 
symorphosis. 
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Integrative Modeling 

In its simplest form, this method matches function at two different levels of structure. A match is achieved 
by varying the macro-structure until energy utilization of the system is minimized. Structural variation 
can be accomplished analytically, or in the case of discontinuous or nonlinear functions by means of a 
computer. Energy is then minimized in terms of the micro-unit. Since the time frame for the adaptive 
process tends to be long, it may be difficult to accumulate experimental results. Alternatively, it would 
be difficult to design an "integrative experiment" whereby the macro-structure is actually altered. For 
example, it is not easy to add or subtract selected cardiac myocytes to or from an intact cardiac ventricle 
without trauma to the animal. Usually, the same heart is observed over long time periods for its response 
to various external loading conditions. A theoretical approach can overcome some of these difficulties. 

Micro-unit 

Integrative modeling begins identifying a minimal level of structure at which function is known. For an 
organ, this could be at the cellular level or, even lower, at the molecular level. The function of interest 
must be defined in terms of a physical quantity, such as electrical, mechanical, chemical, etc. The physical 
quantity of interest may also be time dependent, but the function of the micro-unit must be independent 
of change in the geometry of the macro- structure. For normal physiological conditions this rule is valid, 
but it may break down in the case of a disease process. Pathological conditions further require an analysis 
of how the micro-unit function alters with time, e.g. how myocyte function deteriorates over time in 
heart failure. The dimensions of the micro-unit must be defined. Since the micro-unit is chosen to be 
the smallest structural element, its mean dimensions are often treated as constants. 

Macro-unit 

The macro-unit, for example the structure at the organ level, consists of a group of organized micro- 
dynamic units. Macro-unit function must be defined in terms of the physical quantities of interest. The 
physical quantities at the two structural levels must be of the same type, for example, myocyte force and 
ventricular wall tension define force at different structural levels. Once the macrofunction is defined it 
must be independent of variation in the macro-structure, i.e., unchanged by the adaptive process over 
long time periods. Since the macro-function is important to the survival of the animal it may be 
independent of a disease process as well. This is generally true until structural changes cannot accom- 
modate the deterioration of micro-function at a sufficiently rapid pace. 

Organization and Equilibrium 

The organization of the micro-units defines the macro-structure. If the structure is a simple geometry 
that models the anatomic shape, analytical expressions can be obtained. Use of the actual anatomical shape 
forces the modeling to be computational. Once the geometry is prescribed, the physical equilibrium 
equations can be derived that relate the micro-function to the macro-function. This is accomplished by 
mathematically integrating the micro-function over the entire geometry, hence the name integrative 
modeling. For example, the myocardial force is integrated over the ventricle in order to relate it to wall 
tension. The final equations contain the geometric dimensions of the model and are unknowns at the onset. 

Energy Considerations 

The micro-unit is the smallest building block employed to assemble the overall structure. The dimensions 
of the micro-unit therefore represent the limit of resolution for any physical quantity computed from 
the model. Of particular interest is the energy required by the micro-unit. When the micro-unit is fixed 
in dimensions, on average, it provides a gauge of energy consumption that is size independent. While 
the macro-unit is varied in size, the effect on the energy at the micro-unit level can then be observed. 
Energy considerations for biological systems are typically more complex than for standard engineering 
systems. For example, care must be taken to include the energy required for the basal tissue metabolism 
as well as the energy consumption necessary to accomplish functional tasks. This differs from a nonliving 
structure where only physical energy considerations suffice. 
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Macrostructure Dimensions 

The final analysis solves for the dimensions of the macro-structure, accomplished by examining the 
micro-unit energy consumption, while the dimensions are varied. A solution is then obtained by solving 
for a zero derivative of energy with respect to each dimension and minimum in energy. Specific problems 
may require variance from the above procedure. For example, additional information may be available 
to assist in finding the solution or in placement of physiological constraints. This is particularly useful 
when a completely analytical approach is not possible. Lastly, it is not always necessary to obtain the 
solution relative to the micro-unit energy. Energy was chosen here because of its generality to all systems, 
but other physical quantities may be used. Whichever quantity is chosen it must be evaluated for the 
smallest functional unit. Ultimately, the resulting dimensions represent the optimal solution for the 
structure in terms of its smallest functional building block. These integrative modeling concepts are 
illustrated below in a study of cardiac hypertrophy. 

Cardiac Hypertrophy 

Current thinking dictates that the left ventricle attempts to normalize its peak wall stress values during 
systolic contractions and diastole by altering its dimensions [17]. There are several reasons to doubt the 
validity of this concept. First, anatomical locations of systolic and diastolic stress transducers have not 
been identified. Second, the value of wall stress cannot be determined from basic physical knowledge. 
Instead, wall stress is assumed to be regulated by an unknown mechanism. Lastly, this theory cannot 
explain ventricular size alterations due to metabolic disturbance, as opposed to a mechanical disturbance. 
Alternatively, integrative modeling is applied to the problem of cardiac hypertrophy. 

It was hypothesized that cardiac size can be determined from energetics at the sarcomere level of structure, 
i.e., the micro-unit [ 10] . The cardiac sarcomere was redefined as a sarcounit (Fig. 158. 1 1), a constant volume 
structure capable of axial stress generation. The function of the sarcounit was defined by its active and 
passive stress-length curves (Fig. 158.12), with the active function corresponding to maximal activation. 
Maximal stress was modulated by a periodic time function which was zero during diastole. 

A cylindrical approximation to the geometry of the ventricle was chosen (Fig. 158.11). Thus, its 
structure was described by the length, diameter (D) and wall thickness (W) of the cylinder [9]. The 




FIGURE 158.11 Cardiac sarcounit (upper right) and structural arrangement to form a cylindrical left ventricle. 
Adapted from [10]. 
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FIGURE 158.12 (top) Active and passive stress-length curves describing the sarcounit. (bottom) Active and passive 
pressure-volume curves for the initial cylindrical left ventricle model. Adapted from [9]. 

sarcounits were arranged in series and parallel so that ventricular diameter is determined by the number 
of series units and ventricular wall thickness and length by the number of parallel units. 

Macro-function was denned by the mean aortic pressure and afterload impedance, using a modified 
windkessel arterial load. Preload was treated as a constant filling pressure in series with a constant flow 
resistance. Ventricular function was then equated with micro-function by integrating the stress and length 
of the sarcounits over the cylindrical structure. This yielded ventricular pressure and volume at any 
instant of time for given sarcounit stress and length at the corresponding time (Fig. 158.12). Heart rate 
was allowed to vary as necessary to maintain constant mean aortic pressure. The sarcounit energetics 
were evaluated from the rate of myocardial oxygen consumption (MV0 2 ). MV0 2 was computed by 
assuming a proportional relationship with the area of the stress — length loop of the sarcounit per unit 
time. All quantities were then computed numerically, allowing several cycles to achieve steady state. Each 
quantity was measured at the final steady-state beat. 

To solve for the dimensions of the ventricle, ventricular diameter and wall thickness were varied while 
evaluating all quantities, including MV0 2 . Results show that MV0 2 is minimized only at a specific shape 
of the cylinder, or W/D ratio (Fig. 158.13). Additionally, the level of MV0 2 varied with the overall size 
of the ventricle. When MVO, is also equated with the amount of oxygen supplied by the coronary 
circulation, a unique solution for left ventricular size and shape is obtained. 

The above model was further employed to study observed patterns of cardiac hypertrophy. First, the 
initial size of the ventricle was consistent with that of a normal dog. Second, a pressure overload was 
created by raising mean aortic pressure. The model was then used to recompute ventricular size for the 
new afterload condition. Results show the classic wall thickening and increased wall thickness-to-diameter 
ratio (W/D) is consistent with observed concentric hypertrophy. Third, volume overload was imposed by 
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FIGURE 158.13 Sarcounit myocardial oxygen consumption (MV0 2 ) versus the circumference (C) and wall thick- 
ness (W) of the ventricle model. Adapted from [10]. 



elevating cardiac output. In this case, the model predicted an increase in diameter with relatively little 
increase in the W/D ratio, referred to as eccentric hypertrophy. Fourth, the sarcomere function was 
impaired and its effect on the ventricle evaluated. An eccentric form of hypertrophy resulted. The diameter 
to length ratio was found to increase along with the degree of myocardial dysfunction [11]. This was 
consistent with observations that show the ventricle assuming a progressively more spherical shape with 
further myocardial failure [24] . 

In conclusion, modeling by means of integrating micro- and macro-function was applied to the study 
of cardiac hypertrophy. This approach matches the functional information available at two different levels 
of structure. The ability to predict macro-structure dimensions is a result of achieving this match. The 
cardiac growth model suggests the conclusion that ventricular shape can be determined by the regulation 
of a single quantity, MV0 2 , at the sarcomere level. This modeling approach is more general than the wall 
stress theory as it can predict growth response to metabolic anomaly as well as to mechanical overloads. 
As a primary benefit, this approach permits theoretical analysis that may be impractical to realize 
experimentally. 



158.5 Summary 



This chapter focuses on modeling as a tool to identify and quantify the mechanism(s) responsible for 
observed phenomena in physiology. The rather strict rules to which the modeling process is subject 
generally lead to the formulation of new, specific experiments, thereby broadening the pool of experi- 
mental data, and the data-base for modeling. On occasion, this process leads to areas of experimentation 
and the development of concepts originally judged to lie outside the scope of the original problem 
definition. Consequently, successful modeling aids in deepening a researcher's insight and breadth of 
view. Three examples taken from the cardiovascular field were summarized to illustrate procedures and 
techniques. A selection of benefits achieved was presented as well. 
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Compartmental models are a class of dynamic, i.e., differential equation, models derived from mass 
balance considerations which are widely used for quantitative study of the kinetics of materials in 
physiologic systems. Materials can be either exogenous, such as a drug or a tracer, or endogenous, such 
as a substrate or a hormone, and kinetics include processes such as production, distribution, transport, 
utilization, and substrate-hormone control interactions. 

Compartmental modeling was first formalized in the context of iso topic tracer kinetics. Over the years 
it has evolved and grown as a formal body of theory [Carson et al., 1983; Godfrey, 1983; Jacquez, 1996]. 

Compartmental models have been widely employed for solving a broad spectrum of physiologic 
problems related to the distribution of materials in living systems in research, diagnosis, and therapy at 
the whole-body, organ, and cellular level. Examples and references can be found in books [Carson et al., 
1983; Gibaldi and Perrier, 1982; Jacquez, 1996] and reviews [Carson and Jones, 1979; Cobelli, 1984; 
Cobelli and Caumo, 1998]. Purposes for which compartmental models have been developed include: 

■ identification of system structure, i.e., models to examine different hypotheses regarding the nature 
of specific physiologic mechanisms; 

■ estimation of unmeasurable quantities, i.e., estimating internal parameters and variables of phys- 
iologic interest; 

■ simulation of the intact system behavior where ethical or technical reasons do not allow direct 
experimentation on the system itself; 

■ prediction and control of physiologic variables by administration of therapeutic agents, i.e., models 
to predict an optimal administration of a drug in order to keep one or more physiologic variables 
within desirable limits; 

■ optimization of cost effectiveness of dynamic clinical tests, i.e., models to obtain maximum 
information from the minimum number of blood samples withdrawn from a patient; 

■ diagnosis, i.e., models to augment quantitative information from laboratory tests and clinical 
symptoms, thus improving the reliability of diagnosis; 

■ teaching, i.e., models to aid in the teaching of many aspects of physiology, clinical medicine, and 
pharmacokinetics. 
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The use of compartmental models of physiologic systems has been greatly facilitated by the availability 
of specific software packages for the PC [Saam, 1998; Adapt, 1997], which can be used both for simulation 
and identification. 



159.1 Definitions and Concepts 



Let's start with some definitions. A compartment is an amount of material that acts as though it is well- 
mixed and kinetically homogeneous. A compartmental model consists of a finite number of compartments 
with specified interconnections among them. The interconnections represent a flux of material which 
physiologically represents transport from one location to another or a chemical transformation or both. 
An example of a compartmental model is illustrated in Fig. 159.1, where the compartments are repre- 
sented by circles and the interconnections by arrows. 

Given the introductory definitions, it is useful before explaining well-mixed and kinetic homogeneity 
to consider possible candidates for compartments. Consider the notion of a compartment as a physical 
space. Plasma is a candidate for a compartment; a substance such as plasma glucose could be a compart- 
ment. Zinc in bone could be a compartment as well as thyroxine in the thyroid. In some experiments, 
different substances could be followed in plasma: plasma glucose, lactate, and alanine provide examples. 
Thus, in the same experiment, there can be more than one plasma compartment, one for each of the 
substances being studied. This notion extends beyond plasma. Glucose and glucose-6-phosphate could 
be two different compartments inside a liver cell. Thus, a physical space may actually represent more 
than one compartment. 

In addition, one must distinguish between compartments that are accessible and nonaccessible for 
measurement. Researchers often try to assign physical spaces to the nonaccessible compartments. This 
is a very difficult problem which is best addressed once one realizes that the definition of a compartment 
is actually a theoretical construct which may in fact lump material from several different physical spaces 
in a system; to equate a compartment with a physical space depends upon the system under study and 
assumptions about the model. 

With these notions of what might constitute a compartment, it is easier to define the concepts of well- 
mixed and kinetic homogeneity. Well-mixed means that any two samples taken from the compartment 
at the same time would have the same concentration of the substance being studied and therefore be 
equally representative. Thus, the concept of well-mixed relates to uniformity of information contained 
in a single compartment. 




FIGURE 159.1 A three-compartment model. The substance enters de novo compartment 3 (arrow entering a 
compartment from "outside") and irreversibly leaves from compartment 2 (arrow leaving a compartment to the 
"outside"). Material exchanges occur between compartments 3 and 1 and compartments 1 and 2 and are represented 
by arrows. Compartment 1 is the accessible compartment: Test input and measurement (output) are denoted by a 
large arrow and a dashed line with a bullet, respectively. 
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Kinetic homogeneity means that every particle in a compartment has the same probability of taking 
the pathways leaving the compartment. Since when a particle leaves a compartment it does so because 
of metabolic events related to transport and utilization, it means that all particles in the compartment 
have the same probability of leaving due to one of these events. 

The notion of a compartment, i.e., lumping material with similar characteristics into collections that 
are homogeneous and behave identically, is what allows one to reduce a complex physiologic system into 
a finite number of compartments and pathways. The number of compartments required depends both 
on the system being studied and on the richness of the experimental configuration. A compartmental 
model is clearly unique for each system studied, since it incorporates known and hypothesized physiology 
and biochemistry. It provides the investigator with insights into the system structure and is as good as 
the assumptions that are incorporated in the model structure. 

159.2 The Compartmental Model 



Theory 

Let Fig. 159.2 represent the rth compartment of an n-compartment 
model; q t > denotes the mass of the compartment. The arrows 
represent fluxes into and out of the compartment: the input flux into 
the compartment from outside the system, e.g., de novo synthesis of 
material and/or exogenous test input, is represented by u i > 0; the 
flux to the environment and therefore out of the system by F oi ; the 
flux from compartment i to j by i 7 ,, and the flux from compartment 
j to i by K. All F hk are > 0. The general equations for the compart- 
mental model are obtained by writing the mass balance equation for 
each compartment: 
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FIGURE 159.2 The rth compart- 
ment of an n-compartment model. 



The fluxes u ; are generally constant or functions of only time. The fluxes F ,, F„, and F oj can be functions 
of q v ..., q n (sometimes also of time, but, for the sake of simplicity, we will ignore this dependence). 
It is usually possible to write: 

^W 3 **(*)* (159 - 2) 

where q = [q v ..., q, J T is the vector of the compartmental masses. As a result, Eq. (159.1) can be written as: 
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The kjS are called fractional transfer coefficients. Equation (159.3) describes the nonlinear compartmental 
model. If the ks do not depend on the masses q ; s, the model becomes linear. 



Defining k u =-[ /k- t + k m 
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we can now write Eq. (159.3) as: 
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and the model of the whole system as 

q=K(q)q+U q(o)= q o (159.5) 

where Kis the compartmental matrix and u = [Wj...M„] T is the vector of input fluxes into the compartments 
from outside the system. 

For the linear case, K is constant and one has: 

q=Kq+u q[o\=q o (159.6) 

The entries of the compartmental matrix K, for both the nonlinear (159.5) and the linear (159.6) model, 
satisfy 

fc,.<0 for alii (159.7) 

k tj > for all i± j (159.8) 

n 

K is thus a (column) diagonally dominant matrix. This is a very important property, and in fact the 
stability properties of compartmental models are closely related to the diagonal dominance of the 
compartmental matrix. For instance, for the linear model Eq. (159.6) one can show that no eigenvalues 
can have positive real parts and that there are no purely imaginary eigenvalues. This means that all 
solutions are bounded and if there are oscillations they must be damped. The qualitative theory of linear 
and nonlinear compartmental models have been reviewed in Jacquez and Simon [1993], where some 
new stability results on nonlinear compartmental models are presented as well. 

The Linear Model 

The linear model Eq. (159.6) has become very useful in applications due to an important result: the 
kinetics of a tracer in a constant steady-state system, linear or nonlinear, are linear with constant 
coefficients. An example is shown in Fig. 159.3, where the three-compartment model by Cobelli and co- 
workers [1984b] for studying tracer glucose kinetics in steady state at the whole-body level is depicted. 
Linear compartmental models in conjunction with tracer experiments have been extensively used in 
studying distribution of materials in living systems at the whole-body, organ, and cellular level. Examples 
and references can be found in Carson and co-workers [1983] and Jacquez [1996]. 

An interesting application of linear compartmental models at the organ level is in describing the 
exchange of materials between blood, interstitial fluid, and cell of a tissue from multiple tracer indicator 
dilution data. Compartmental models provide a finite difference approximation in the space dimension 
of a system described by partial differential equations which may be easier to resolve from the data. These 
models are discussed in [Jacquez, 1985] and an example of a model describing glucose transport and 
metabolism in the human skeletal muscle can be found in Saccomani and colleagues [1996]. 

The Nonlinear Model 

The nonlinear model, Eq. (159.5), is frequently found in applications as well. For such models the entries 
of K are functions of q, most commonly k t : is a function of only a few components of q, often q t and q r 
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FIGURE 159.3 The linear compartmental model of glucose kinetics by Cobelli et al. [1984b]. 



Examples of L function of q, and q t only are the Hill and Langmuir nonlinearities described, respectively, 
by: 



/ v aq h r l 



(159.10) 
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where a, (3, y are positive reals and h is a positive integer. For h = 1, Eq. (159.10) turns into the widely 
used Michaelis-Menten equation. 

Other interesting examples arise in describing substrate-hormone control systems. For instance, the 
model of Fig. 159.4 has been proposed [Caumo and Cobelli, 1993; Vicini et al., 1997] to describe the 
control of insulin on glucose distribution and metabolism during a glucose perturbation, which brings 
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FIGURE 159.4 The nonlinear compartmental model of insulin control on glucose distribution and metabolism by 
Caumo and Cobelli [1993]. The dashed line denotes a control signal. 
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the system out of steady state. Since there is no direct interconversion, one has to have two separate 
compartmental models, one for glucose and one for insulin kinetics. The two models interact via a control 
signal which emanates from the remote insulin compartment and affects the transfer rate coefficient, fc 02 , 
responsible for insulin-dependent glucose utilization. In this case, one has: 

k 02 (q 3 ) = 8 + q } (159.12) 

where 8 is a constant. 

Additional examples and references on nonlinear compartmental models can be found in Carson and 
co-workers [1983], Godfrey [1993], and Jacquez [1996]. 

Use in Simulation and Indirect Measurement 

Models such as those described by Eq. (159.5) can be used for simulation. Assuming that both the matrix 
K(q) and initial state q(0) are known, it is possible to predict the behavior of the state vector q for any 
possible input function u. A major use of compartmental models is in the indirect measurement of 
physiologic parameters and variables. For instance, given data obtained from compartments that are 
accessible to measurement, the model is used to estimate not directly measurable parameters and predict 
quantities related to nonaccessible compartments. In this case, K(q) is unknown and input-output 
experiments are performed to generate data to determine K(q). These data usually can be described by 
a linear algebraic equation 

y=Cq (159.13) 

where y = [y, . . ,y m ] T is the vector of m model outputs and C is a constant matrix. Since model outputs 
are often concentrations, the entries of C are inverse of compartment volumes. Note that usually only a 
small number of compartments is accessible to measurement. 

Real data are noisy; thus, an error term e is usually added to y in Eq. (159.13) to describe the actual 
measurements 

z = y + e (159.14) 

e is usually given a probabilistic description, e.g., errors are assumed to be independent and often 
Gaussian. Data are usually collected at discrete time instants t l ...t N so that the measurements are not 
continuous functions of time z(t) (assuming that only one output variable is observed), as inEq. (159.14), 
but are denoted z(^) ... z{t N ). 

Equations (159.13) and (159.14) together with Eq. (159.5) or Eq. (159.6) define the compartmental 
model of the system, i.e., the model structure and the input-output experimental configuration. 

From the data of Eq. (159.14), the entries of K(q) can be determined by exploiting the identification 
techniques described below. 



159.3 A Priori Identifiability 



Let's assume that a compartmental model structure has been postulated to describe the physiologic 
system, i.e., the number of compartments and the connections among them have been specified. This is 
the most difficult step in compartmental model building. 

The structure can reflect a number of facts: there may be some a priori knowledge about the system 
which can be incorporated in the structure; one can make specific assumptions about the system which 
are reflected in the structure; one can arrive at a structure by testing via simulation what is needed to fit 
the data. The result at this stage is a nonlinear, Eq. (159.5), or linear, Eq. (159.6), model which has as 
unknowns the parameters of the compartmental matrix K and also often those of the observation 
matrix CinEq. (159.13). 
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Before performing the experiment to collect the data to be analyzed using the model or, if the 
experiment is already completed, before using the model to estimate the unknown parameters from the 
data, the following question arises: Do the data contain enough information to estimate all the unknown 
parameters of the postulated model structure? This question is usually referred to as the identifiability 
problem. It is set in the ideal context of error-free model structure and noise-free measurements, and is 
an obvious prerequisite to determine if parameter estimation from real data is well posed. In particular, 
if it turns out in such an ideal context that the postulated model structure is too complex for the particular 
set of ideal data, i.e., some model parameters are not identifiable from the data, there is no way in a real 
situation — where there is error in the model structure and noise in the data — that the parameters can 
be identified. To focus on the fact that the identifiability problem is worked out in the ideal context, one 
normally speaks of a priori identifiability. 

A priori identifiability thus examines whether, given the ideal noise-free data y, Eq. (159.13), and the 
error- free compartmental model structure, Eqs. (159.5) or (159.6), it is possible to make unique estimates 
of all the unknown model parameters. A model can be uniquely (globally) identifiable or nonuniquely 
(locally) identifiable — that is, one or more of the parameters has more than one but a finite number of 
possible values- or nonidentifiable — that is, one or more of the parameters has an infinite number of 
solutions. For instance, the model of Fig. 159.3 is nonidentifiable. 

The identifiability problem is, in general, a large nonlinear algebraic one and thus difficult to solve, 
since there is the need to solve a system of nonlinear algebraic equations which is increasing in the 
number of terms and nonlinearity degree with the model order, i.e., the number of compartments in 
the model. Various methods for testing identifiabilty of linear and nonlinear compartmental models are 
available [Carson et al., 1983; Cobelli and DiStefano, 1980; Godfrey, 1983; Jacquez, 1996]. For linear 
compartmental models, various specific methods based, e.g., on transfer function, similarity transfor- 
mation, and graph topology, have been developed. Explicit identifiability results on catenary and mamil- 
lary compartmental models [Cobelli et al., 1979] and on the three-compartmental model [Norton, 1982] 
are also available. A parameter-bound strategy for dealing with nonidentifiable compartmental models 
has been also developed [DiStefano, 1983]. For nonlinear compartmental models, the problem is more 
difficult and for small models the output series expansion method [Pohjanpalo, 1978] is usually employed. 

However, all the proposed methods apply to models of relatively low dimension; when applied to large 
models, the methods involve nonlinear algebraic equations too difficult to be solved even by resorting 
to symbolic algebraic manipulative languages (e.g., Reduce, Maple). These difficulties have stimulated 
new approaches to studying global identifiability based on computer algebra. In particular, for nonlinear 
models, approaches based on differential algebra have been investigated [Ljung and Glad, 1994; Sacco- 
mani et al., 1997], while for linear compartmental models a method based on the transfer function and 
the Groebner basis algorithm has been proposed and implemented in a software [Audoly et al., 1998]. 

If a model is a priori uniquely or nonuniquely identifiable, then identification techiques (e.g., least 
squares) can be used to estimate from the noisy data z, Eq. (159.14), the numerical values of the unknown 
parameters. If a model is a priori nonidentifiable, then not all its parameters can be estimated using 
identification techniques. Various strategies can be used, e.g., derivation of bounds for nonidentifiable 
parameters, model reparametrization (parameter aggregation), incorporation of additional knowledge, or 
design of a more informative experiment. An example of the use of some of these approaches for dealing 
with the nonidentifiable model of glucose kinetics of Fig. 159.3 is given in Cobelli and Toffolo [1987]. 

From the above considerations it follows that a priori unique identifiability is a prerequisite for well- 
posedness of parameter estimation and for reconstructability of state variables in nonaccessible compart- 
ments. It is a necessary step, but because of the ideal context where it is posed, it does not guarantee a 
successful estimation of model parameters from real input-output data. 

159.4 Parameter Estimation 

Given a uniquely identifiable parameterization of the compartmental model, one can proceed by esti- 
mating the values of the unknown parameters from the set of noisy real data. Usually the input-output 
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experiment provides a limited set of discrete-time measurements, Eq. (159.14), for estimating the 
unknown parameters of the nonlinear, Eq. (159.5), and linear, Eq. (159.6), compartmental model. Prob- 
lems arise in parameter estimation as a result of various sources of error, e.g., measurement error — e in 
Eq. (159.13) — and error in model structure — Kin Eq. (159.5) or Eq. (159.6). It is normally not possible 
to consider explicitly errors in model structure: alternative model structures can be analyzed in order to 
minimize this error. The simplest situation is to consider the noisy real data as reflecting model output 
corrupted by an additive measurement error, like in Eq. (159.14). 

Parameter estimates can be determined from Eq. (159.14) by several methods. These methods can be 
divided in two major categories. In the so-called Fisher estimation approach, only the data of Eq. (159.14) 
are supplied to the estimator in order to estimate the unknown model parameters. The second approach, 
known as the Bayes estimation approach, takes into account not only the data of Eq. (159.14), but also 
some statistical information which is a priori available on the unknown parameters of the model. 

In the Fisher approach, parameter estimates can be obtained by nonlinear least squares or maximum 
likelihood together with their precision, i.e., a measure of a posteriori or numerical identifiability. Details 
and references on parameter estimation of physiologic system models can be found in Carson and co- 
workers [1983] and Landaw and DiStefano [1984]. 

Weighted nonlinear least squares is mostly used, and both direct and gradient-type search methods are 
implemented in estimation schemes. A correct knowledge of the error structure is needed in order to have 
a correct summary of the statistical properties of the estimates. This is a difficult task. Measurement errors 
are usually independent, and often a known distribution, e.g., Gaussian, is assumed. However, many 
properties of least squares hold approximately for a wide class of distributions if weights are chosen 
optimally, i.e., inverse of known variances or of their relative values if variances are known up to a propor- 
tionality constant. Under these circumstances, an asymptotically correct approximation of the covariance 
matrix of parameter estimates can be used to evaluate precision of parameter estimates. The approximation 
becomes exact for a sufficiently large sample size and/or decreasing variances of the measurement error. If 
measurement errors are Gaussian, then this approximation is the Cramer-Rao lower bound (inverse of the 
Fisher information matrix), i.e., the optimally weighted least squares estimator is also the maximum 
likelihood estimator. Care must be taken in not using lower bound variances as true parameter variances. 
Several factors corrupt these variances, e.g., inaccurate knowledge of error structure, limited data set. Monte 
Carlo studies are needed to assess robustness of Cramer-Rao lower bound in specific practical applications. 

To examine the quality of model predictions to observed data, in addition to visual inspection, various 
statistical tests on residuals are available to check for the presence of systematic misfitting, nonrandomness 
of the errors, and accordance with assumed experimental noise. Model order estimation, i.e., number of 
compartments in the model, is also relevant here, and for linear compartmental models criteria such as 
the F-test, and those based on the parsimony principle like the Akaike and Schwarz criteria, can be used 
if measurement errors are Gaussian. 

Bayes estimators, e.g., maximum a posteriori (MAP) and mean square (MS), assume the parameters 
vector to be a random vector and determine an estimate of its current realization according to a criterion 
connected to its a posteriori distribution (i.e., the probability distribution of the parameters vector 
conditioned on the knowledge of the data vector), which also reflects the distribution of the noise. The 
a priori information required on the unknown parameters vector can often be obtained from population 
studies. The concepts previously considered in the Fisher approach, i.e., assessment of the parameters 
estimates precision, choice of model order, can be addressed in the Bayes approach as well. Bayes 
estimation can be of relevant interest, since when statistical information on the unknown parameters of 
the model is a priori available and exploited, a possibly significant improvement in the parameters 
estimates precision with respect to Fisher estimation can be obtained. 

159.5 Optimal Experiment Design 

At this point, one has a compartmental model structure, a description of the measurement error, and a 
numerical value of the parameters together with the precision with which they can be estimated. It is 
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now appropriate to address the optimal experiment design issue. The rationale of optimal experiment 
design is to act on design variables such as number of test input and outputs, form of test inputs, number 
of samples and sampling schedule, and measurement errors so as to maximize, according to some 
criterion, the precision with which the compartmental model parameters can be estimated [Carson et al., 
1983; DiStefano, 1981; Landaw and DiStefano, 1984; Walter and Pronzato, 1990]. 

In the Fisher approach, the Fisher information matrix /, which is the inverse of the lower bound of 
the covariance matrix, is treated as a function of the design variables and usually the determinant of / 
(this is called D-optimal design) is maximized in order to maximize precision of parameter estimates, 
and thus numerical identifiability. 

The optimal design of sampling schedules, i.e., the determination of the number and location of 
discrete-time points where samples are collected, has received much attention as it is the variable which 
is less constrained by the experimental situation. Theoretical and algorithmic aspects have been studied, 
and software is available, for both the single- and multi-output case [Cobelli et al., 1985; DiStefano, 1981; 
Landaw and DiStefano, 1984]. Optimal sampling schedules are usually obtained in an iterative manner: 
One starts with the model obtained from pilot experiments and the program computes optimal sampling 
schedules for subsequent experimentation. An important result for single-output linear compartmental 
models is that D-optimal design usually consists of independent replicates at P distinct time points, where 
P is the number of parameters to estimate. 

Optimal sampling schedule design has been shown, e.g., to improve precision of the estimates as 
compared to schedules designed by intuition or other conventions and to optimize the cost effectiveness 
of a dynamic clinical test by reducing the number of blood samples withdrawn from a patient without 
significantly deteriorating their precision [Cobelli and Ruggeri, 1989; 1991]. 

The optimal input design problem has been relatively less studied, but some results are available on 
optimal equidose rectangular inputs (including the impulse) for parameter estimation in compartmental 
models [Cobelli and Thomaseth, 1987, 1988a,b]. 

In a Bayes estimation context, optimal experiment design is a more difficult task (see [Walter and 
Pronzato, 1990] for a survey). Applications have thus been less than those with a Fisher approach because 
the numerical implementation of the theoretical results is much more demanding. 

159.6 Validation 

Validation involves assessing whether the compartmental model is adequate for its purpose. This is a 
difficult and highly subjective task in modeling of physiologic systems, because intuition, understanding 
of the system, and so on play an important role. It is also difficult to formalize related issues such as 
model credibility, i.e., the use of the model outside its established validity range. Some efforts have been 
made, however, to provide some formal aids for assessing the value of models of physiologic systems 
[Carson et al., 1983; Cobelli et al., 1984]. A set of validity criteria have been explicitely defined, i.e., 
empirical, theoretical, pragmatic, and heuristic validity, and validation strategies have been outlined for 
two classes of models, broadly corresponding to simple and complex models. This operational classifi- 
cation is based on a priori identifiability and leads to clearly defined strategies as both complexity of 
model structure and extent of available experimental data are taken into account. For simple models, 
quantitative criteria based on identification, e.g., a priori identifiability, precision of parameter estimates, 
residual errors, can be used in addition to physiologic plausibility. In contrast with simple models, 
complex simulation models are essentially incomplete, as there will naturally be a high degree of uncer- 
tainty with respect to both structure and parameters. Therefore, validation will necessarily be based on 
less solid grounds. The following aids have been suggested: increasing model testability through model 
simplification; improved experimental design and model decomposition; adaptive fitting based on qual- 
itative and/or quantitative feature comparison and time-course prediction; and model plausibility. 



© 2000 by CRC Press LLC 



References 

Adapt II User's Guide. 1997. Biomedical Simulation Resource, University of Southern California, Los 

Angeles. 
Audoly, S., DAngio', L., Saccomani, M.P., and Cobelli, C. 1998. Global identifiability of linear compart- 

mental models. A computer algebra algorithm. IEEE Trans. Biom. Eng., 45: 36-41. 
Carson, E.R. and Jones, E.A. 1979. The use of kinetic analysis and mathematical modeling in the 

quantitation of metabolic pathways in vivo: application to hepatic anion metabolism. New Engl. 

J. Med., 300:1016-1027 and 1078-1986. 
Carson, E.R., Cobelli, C, and Finkelstein, L. 1983. The Mathematical Modelling of Metabolic and 

Endocrine Systems. John Wiley, New York. 
Caumo, A. and Cobelli, C. 1993. Hepatic glucose production during the labelled IVGTT: estimation by 

deconvolution with a new minimal model. Am. J. Physiol. 264:E829-E841. 
Cobelli, C. and Caumo, A. 1998. Using what is accessible to measure that which is not: necessity of model 

of system. Metabolism, 47: 1009-1035. 
Cobelli, C, Lepschy, A., and Romanin Jacur G. 1979. Identifiability results on some constrained com- 

partmental systems. Math. Biosci. 47:173-196. 
Cobelli, C. and DiStefano III, J.J. 1980. Parameter and structural identifiability concepts and ambiguities: 

a critical review and analysis. Am. J. Physiol. 239: R7-R24. 
Cobelli, C. 1984. Modeling and identification of endocrine-metabolic systems. Theoretical aspects and 

their importance in practice. Math. Biosci. 72:263-289. 
Cobelli, C, Carson, E.R., Finkelstein, L., and Leaning M.S. 1984. Validation of simple and complex models 

in physiology and medicine. Am. J. Physiol. 246: R259-R266. 
Cobelli, C, Toffolo, G., and Ferrannini, E. 1984. A model of glucose kinetics and their control by insulin. 

Compartmental and noncompartmental approaches. Math. Biosci. 72: 291-315. 
Cobelli, C, Ruggeri, A., DiStefano III, J.J., and Landaw, E.M. 1985. Optimal design of multioutput 

sampling schedules: software and applications to endocrine-metabolic and pharmacokinetic mod- 
els. IEEE Trans. Biomed. Eng. 32: 249-256. 
Cobelli, C. and Thomaseth, K. 1987. The minimal model of glucose disappearance: optimal input studies. 

Math. Biosci. 83: 127-155. 
Cobelli, C. and Toffolo, G. 1987. Theoretical aspects and practical strategies for the identification of 

unidentifiable compartmental systems. In Identifiability of Parametric Models, Walter, E., Ed., 

Pergamon, Oxford, 85-91. 
Cobelli, C. and Thomaseth, K. 1988a. On optimality of the impulse input for linear system identification. 

Math. Biosci. 89: 127-133. 
Cobelli, C. and Thomaseth, K. 1988b. Optimal equidose inputs and role of measurement error for 

estimating the parameters of a compartmental model of glucose kinetics from continuous-and 

discrete-time optimal samples. Math. Biosci. 89: 135-147. 
Cobelli, C, Saccomani, M.P., Ferrannini, E., DeFronzo, R.A., Gelfand, R., and Bonadonna, R.C. 1989. A 

compartmental model to quantitate in vivo glucose transport in the human forearm. Am. J. Physiol. 

257: E943-E958. 
Cobelli, C. and Ruggeri, A. 1989. Optimal design of sampling schedules for studying glucose kinetics 

with tracers. Am. J. Physiol. 257: E444-E450. 
Cobelli, C. and Ruggeri, A. 1991. A reduced sampling schedule for estimating the parameters of the 

glucose minimal model from a labelled IVGTT. IEEE Trans. Biomed. Engng. 38:1023-1029. 
DiStefano III, J.J. 1981. Optimized blood sampling protocols and sequential design of kinetic experiments. 

Am. J. Physiol. 9:R259-R265. 
DiStefano III, J.J. 1983. Complete parameter bounds and quasi identifiability conditions for a class of 

unidentifiable linear systems. Math. Biosci. 65:51-68. 
Gibaldi, M. and Perrier, D. 1982. Pharmacokinetics. 2nd ed., Marcel Dekker, New York. 
Godfrey, K. 1983. Compartmental Models and Their Application. Academic Press, London. 



© 2000 by CRC Press LLC 



Jacquez, J.A. 1996. Compartmental Analysis in Biology and Medicine, 3rd ed., Biomedware, Ann Arbor, 

MI. 
Jacquez, J.A. and Simon, C.P. 1993. Qualitative theory of compartmental systems. Siam Review 35:43-79. 
Landaw, E.M. and DiStefano III, J.J. 1984. Multiexponential, multicompartmental, and noncompartmen- 

tal modeling. II. Data analysis and statistical considerations. Am. J. Physiol. 246: R665-R677. 
Ljung, L. and Glad, T. 1994. On global identifiability for arbitrary model parametrizations. Automatica 

30:265-276. 
Norton, J.P. 1982. An investigation of the sources of non-uniqueness in deterministic identifiability. Math. 

Biosci. 60:89-108. 
Pohjanpalo, H. 1978. System identifiability based on the power series expansion of the solution. Math. 

Biosci. 41:21-33. 
Saccomani, M.P., Audoly, S., Bellu, G., D'Angio', L., and Cobelli, C. 1997. Global identifiability of 

nonlinear model parameters. 1997. In Savaragi, Y. and Sagara, S., Eds., Proceedings of the SYSID 

'97, 11th IFAC Symposium on Systems Identification, pp. 219-224, Kitakyushu (Japan), Nichigen 

Co. Ltd. 
Saccomani, M.P., Bonadonna, R., Bier, D.M., De Fronzo, R.A., and Cobelli, C. 1996. A model to measure 

insulin effects on glucose transport and phosphorylation in muscle: a three-tracer study. Am J 

Physiol. 33: E170-E185. 
SAAM II User Guide 1998. SAAM Institute FL-20, University of Washington, Seattle, WA. 
Vicini, P., Sparacino, G., Caumo, A., and Cobelli, C. 1997. Estimation of hepatic glucose release after a 

glucose perturbation by nonparametric stochastic deconvolution. Comp. Meth. Progr. Biomed., 

52: 147-156. 
Walter, E. and Pronzato, L. 1990. Qualitative and quantitative experiment design for phenomenological 

models — a survey. Automatica 26: 195-213. 



© 2000 by CRC Press LLC 



Timmons, W. D. "Cardiovascular Models and Control." 
The Biomedical Engineering Handbook: Second Edition. 
Ed. Joseph D. Bronzino 
Boca Raton: CRC Press LLC, 2000 



160 



Cardiovascular Models 

and Control 



160.1 Advantages and Disadvantages of Desktop 
Patients 

160.2 History of Cardiovascular Modeling 

160.3 Cardiovascular Modeling 

An Idealized Segment of Artery • An Idealized Segment of 
Vein • Arterial and Venous Trees • Models of the Heart • 
Models of Combined Heart and Circulation ■ Neural and 
Humoral Control ■ Combined CV and Pharmacologic 

William D. Timmons Models 

University of Akron 160.4 Conclusions 



This chapter reviews cardiovascular (CV) modeling for use in controller design, especially for closed- 
loop drug delivery systems (IV infusions of vasodilators, inotropes, anesthetics, neoplastic agents, and 
so on) and other pharmacologic applications. This first section describes the advantages and disadvan- 
tages of employing CV models for design and control and presents a brief history of CV modeling. The 
next section describes the basic principles and techniques for modeling the CV system, as well as the 
uptake, distribution, and action of cardio- and vasoactive pharmaceuticals. A short, annotated bibliog- 
raphy follows the reference section for those interested in further reading. 

160.1 Advantages and Disadvantages of Desktop Patients 

More and more industries are coming to rely on computer simulations to increase productivity and 
decrease costs as well as to increase reliability and safety of resulting products. The defense, automotive, 
and aerospace industries come most quickly to mind, but the medical field is rapidly gaining ground in 
this area, too. Recent ARPA 1 initiatives for defense technology conversion are probably most responsible 
for the recent surge in computer-aided health care. Today, computerized simulators teach medical stu- 
dents everything from basic anatomy to techniques in radial keratotomy, and skilled surgeons design, 
evaluate, and practice new and risky procedures without ever cutting into a single patient. Thanks to the 
ARPA initiative, telesurgery will soon be viable, so that in the near future, surgeons may operate on 
patients from half a world away. From the calculation of simple-dosage regimens to the design of 
sophisticated blood pressure controllers, cardiovascular medicine also benefits from this technology. CV 
models can be used to prove initial feasibility, reduce development time, decrease costs, and increase 
reliability and safety of cardiovascular devices and therapeutics. 



■The Advanced Research Projects Agency of the U.S. Department of Defense, chair of the Defense Technology 
Conversion Council which administers the Technology Reinvestment Project. 
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The increasing speed and decreasing costs of computers are having an enormous impact on controller 
design. There is a veritable explosion in the number of control designs as well as the theory to go with 
them. The ready availability of powerful computer workstations and sophisticated engineering design 
packages (e.g., MATLAB, MATRIXx, CC) allow for the interactive design of high-performance, robust 
controllers. Engineers can tweak a controller or perturb a system, then immediately assess the effect. 
When combined with modern visualization techniques, these tools allow one to readily see problems 
and solutions that might otherwise have taken years to find and solve. Furthermore, such capabilities 
allow for exhaustive testing and debugging, so that the transition from the workbench to the field becomes 
smooth. Many companies and agencies have reported flawless transitions from the simulator to the field, 
most notably NASA, with its many satellites and probes that have had to work right the first time. Various 
companies are using simulations to improve or correct equipment and systems already in the field. A 
recent example is BAE Automated Systems Inc. of Dallas, Texas, which is using simulations to help debug 
and correct the troubled baggage system at the new Denver International Airport [Geppert, 1994]. 

The risk of failure or the fear of malpractice often precludes the design and evaluation of controllers 
in live humans, so that devices and treatments are often tested on computer and animal models first. In 
the past, animals have been the preferred choice, but this approach is continually being reevaluated. 
Whereas most researchers readily agree that animal experimentation has been and will continue to be 
necessary for the advancement of medical science, most would welcome methods and procedures that 
might reduce or even eliminate the use of animals. Furthermore, animals are rarely perfect models of 
human diseases and conditions. Thus, there has been and will continue to be a demand for alternative 
models and procedures. Tissue cultures are beginning to fill some of this need, and it is likely that 
computer modeling and simulation may also fill this need. 

Desktop patients, if used to complement animal studies, can actually increase the reliability and 
applicability of certain animal tests, while also reducing the number of required experiments. In certain 
instances, computer modeling may be the only way to test a device prior to human use, since some 
human conditions are extremely difficult if not impossible to reproduce in animals. Indeed, as computers 
become more powerful and models become better, it may be that some day computers will be used in 
place of certain animal and human trials. Meanwhile, computer models can be used to (1) demonstrate 
feasibility, (2) increase confidence in controller designs by complementing animal studies, (3) help design 
better animal and clinical experiments, and (4) reduce the number of required animal and human 
experiments. 

The disadvantage of using computer simulation is that available models will almost certainly need to 
be tailored to the application at hand. Since there is generally no predefined method for modeling or for 
modifying an existing model, the accuracy and reliability of the resulting model will depend heavily on 
the skill of the modeler, the appropriateness of the modifications, and the model's intended scope. 
Furthermore, modeling typically requires a good understanding of the physics of the problem, and this 
in turn requires supporting data and experimentation. Thus, modifications to include new pharmaceu- 
ticals may be problematic, since their mechanisms may only be poorly understood. And, even if sup- 
porting data are available, if a basic model does not yet exist, a significant time commitment may be 
required to develop one. Despite this up-front cost, model development may save tremendous time and 
costs later in the development process. 

Fortunately, there are a large number of CV models to pick from, and experienced modelers can 
determine which, if any, is best for a given application. Once determined, the model can usually be 
converted easily to a form compatible with the development platform. Once converted, model integrity 
and suitability must be checked. Modifications can then be made, after which the model integrity must 
again be checked to ensure that it has not been compromised. Once a model is obtained and suitably 
modified, adequate thought must be given to program flow. Ideally, program flow should be designed 
such that the controller code is the same for real time and for simulation. This may be facilitated by 
treating all facets of data acquisition as part of the controlled system, instead of as part of the controller. 
With this approach, all software for I/O and calibration would be bundled separately from the controller 
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software. Windows-based systems facilitate this separation even further by allowing the controller and 
plant software to be compiled and run as two separate programs. 

Other issues that must be considered involve the practical aspects of computer simulation. Even if a 
well-designed and validated model exists, many factors can affect the accuracy of the simulation. Stiffness, 
machine precision, program coding, integration method, step size, and even computer language can 
affect the results. Probably more insidious than all these factors is the operation of the model in regions 
outside its scope. This happens most often in large, complex programs when a variable takes on values 
outside its intended range. For example, a sudden increased cardiac preload may shift the end diastolic 
volume of the heart into a nonlinear region of the diastolic pressure-volume relationship. If the nonlinear 
behavior is not included in the model, misleading results might be produced. Finally, even in the best 
commercial code (let alone home-brewed code), bugs may affect simulation accuracy. For example, many 
commercial simulation packages are particularly poor at handling discrete events (such as a bolus infusion 
of a drug) when variable step-size algorithms are used to integrate continuous time models [Gustafsson, 
1993]. 

160.2 History of Cardiovascular Modeling 

William Harvey in the early 17th century was probably the first to clearly and convincingly demonstrate 
the role of the heart as a pump which caused the blood to flow in a unidirectional closed circuit through 
the systemic and pulmonary circulations. Prior to Harvey, the Galenic viewpoint from the 2d century ad 
predominated, 2 namely, that the blood ebbed to and fro in the veins to provide nutrition to the rest of 
the body, that the arteries (from the Greek arteria, meaning windpipe) carried air, that the lungs cooled 
the heart and removed sooty impurities from new blood as it entered from the intestines, and that the 
heart contained pores to allow spirit or gaseous exchange across the septum [Acierno, 1994]. 

It was Stephen Hales in his Statistical Essays nearly a century later who considered arterial elasticity 
and postulated its buffering effect on the pulsatile nature of blood flow [Hales, 1733]. He likened the 
depulsing effect to the fire engines of his day, in which a chamber with an air-filled dome ("inverted 
globe") acted to cushion the bolus from the inlet water pump so that "a more nearly equal spout" flowed 
out the nozzle. This analogy became the basis of the first modern cardiovascular models and antedated 
the ARPA movement by quite a few decades! In the translation from English to German, Hales's inverted 
globe became a windkessel (air kettle); his idea later became known as the Windkessel theory when it 
was more formally developed and propounded by the German physiologist Otto Frank (of the Frank- 
Starling law of the heart) near the beginning of our century [Noordergraaf, 1978]. 

Frank was initially a strong proponent of the Windkessel theory, and his work spawned the interest 
of many subsequent investigators and led to a proliferation of modified Windkessel-type models. A critical 
review of these early models can be found in Aperia [1940]. The development of the analog computer 
shortly after World War II lead to another burst in CV modeling, based on more sophisticated segmental 
and transmission line theories. Grodins [1959] was probably one of the first to use these new machines 
to simulate cardiovascular hemodynamics. Interestingly, another early analog computer model, PHYSBE 
[McLeod, 1966], became a popular benchmark that is still used today to evaluate computers and simu- 
lation languages. About the mid-1970s, the advent of the inexpensive digital computer led to another 
revolution in CV modeling, which has continued to the present with ever-increasingly detail and scope. 
As Rideout [1991] observed, model complexity seems to have been limited only by computer technology. 
If this trend continues, we can expect to see highly detailed models emerging within the next few years 
as parallel computers become economical. 

The major criticism of the pure Windkessel theory is that it does not allow for finite wave propagation 
and reflection in the arteries. Whereas blood flow in the arterial vasculature is more properly formulated 

2 Galen, in turn, based many of his beliefs on those of the hippocratic era scholars — Hippocrates, Aristotle, Polybus, 
and Diodes (4th century bc). 
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in terms of space and time, the Windkessel theory ignores spacial considerations and instead lumps all 
the vasculature into a single point, or compartment, in space. Attempts to correct this shortcoming 
include adding inertial and damping factors as well as collapsible components. In addition, the great 
arteries can be partitioned into sections to produce segmental models, or, when electrical network theory 
is invoked, transmission line models. Frank himself realized that the Windkessel theory was flawed, and 
so he too added modifications to include traveling waves and reflections [Milnor, 1989; Noordergraaf, 
1978; Rideout, 1991]. 

Despite its flaws, the pure Windkessel theory is still useful as a teaching tool because of its simplicity 
and clear, intuitive imagery. Furthermore, as mean pressure accounts for approximately 90% of the power 
in the arterial pressure waveform, it performs surprisingly well and displays many of the interesting 
phenomena seen in the vasculature. Add to this the fact that the equations are readily extended to include 
spacial effects (the segmental and transmission line models), and one can see why the theory has persisted 
into the late 20th century. Of course, one must keep in mind that any of these approaches is only an 
approximation; the segmental and transmission line approaches are in many ways only a mathematical 
convenience that allows one to fit an arbitrarily high-order model to observed data. These approaches 
do not necessarily lend themselves to extrapolations and predictions regarding the physiology. 

Even before Frank, there was great interest in the pure mathematics of blood flow in distensible vessels. 
Many a famous mathematician and scientist explored wave propagation, reflection, and pressure-flow 
relations in blood vessels, including: Bernoulli, Euler, Young (of Young's modulus), Poiseuille, Navier and 
Stokes, the Weber brothers, Resal, Moens and Korteweg, Lamb, and Witzig; more recent investigators 
including Morgan and Kiely, and Womersley and McDonald [Fung, 1984; Milnor, 1989; Noordergraaf, 
1978]. Womersley and McDonald's students continue to be influential in cardiovascular modeling today. 
Early studies of theoretical hemodynamics greatly influenced the field of fluid mechanics and paved the 
way for modern pulse-wave propagation and reflection models of the vasculature. These models are 
treated in the texts by Noordergraaf [1978], Fung [1985], Li [1987], and Milnor [1989]. 

The hydrodynamic pulse-wave CV models are not without criticisms either. For example, they are 
limited to short segments of the arterial or venous tree for which input and terminal impedances must 
be supplied. This restriction makes them currently impractical for the analysis of long-acting pharma- 
ceutical agents and their feedback effects on the cardiovascular system. Pharmaceuticals with vaso- and 
cardio-active effects usually require the tracking of drug concentrations throughout the body, which, for 
these models, would require an exceedingly large number of components and would result in huge 
programs requiring large computers, with the simulations invariably being slow and cumbersome. Fur- 
thermore, numerical techniques such as the finite-element method must be used to solve many of the 
hydrodynamic equations, which over extended periods might incur significant round-off and integration 
errors. Nevertheless, this approach will certainly form the basis of future pharmacologically linked CV 
models as faster and more sophisticated computers and algorithms are developed. 

Because of the current shortfall in computational technology, most pharmacologic simulations requir- 
ing a CV model employ simplified or reduced-order models mentioned earlier. Besides the segmental 
and transmission line approaches, purely black box models are also used. The reduced segmental and 
transmission line models are probably most appropriate for simulations of fast-acting CV pharmaceu- 
ticals, whereas compartmental models (see Chapter 157) are more appropriate for slower-acting drugs. 

The distinction between segmental and transmission line models has become increasingly blurred, 
especially as elements of both are often combined into the same model. Transmission line models were 
used as early as 1905 by Frank, although Westerhof and Noordergraaf [1969] were probably most 
influential in promoting the transmission line approach. As a result, these models are often called 
Westkessel models. They employ black box, two-terminal circuits (Westkessel terminations) to reproduce 
the input impedances of lumped vascular trees. Segmental models, however, are generally three terminal 
circuits, with two terminals representing inertial flows and viscous resistances and the third representing 
the compliance of the segment with respect to an external reference (typically atmospheric pressure). 
The segmental approach is described in more detail below. 
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As mentioned above, reduction in model order is typically achieved by combining segmental and 
transmission line models. The idea here is to represent critical, nonlinear components by segments, while 
the less critical, more linear components are lumped together and represented by Westkessel terminations. 
Unfortunately, the black box Westkessel sections lose their physiologic meaning, which may present a 
problem in certain applications. 

The highly simplified black box models are even worse in this regard. They are less concerned with 
the physiology than with the input-output relationships of the system and typically employ linear 
dynamics fit to observed patient responses. Though easy to implement, once defined, physiologically 
relevant perturbations may be difficult to produce. Furthermore, these models should not be used to 
infer properties or behaviors outside the range of the data sets. This is a potentially crippling drawback, 
especially when designing and evaluating CV controllers. However, if the goal is to generate clinically 
observed behaviors irrespective of the pathophysiology, then these models make an excellent choice. 

Another potential drawback, which applies to both simplified and reduced-order models, is the loss 
of phenomena associated with pulse propagation and its effects on cardiac-arterial coupling and ener- 
getics. These effects are usually considered minor from a pharmacologic point of view, but their omission 
may lead to potentially serious design flaws when the resulting controller is used in certain patient 
populations. For example, isolated systolic hypertension (ISH) significantly contributes to strokes and 
heart disease in the elderly, and O'Rourke [1993] suggests that the therapy of choice, as well as the goal 
of treatment, should be different for ISH than for diastolic and mean arterial hypertension. It may be 
prudent, then, for control applications aimed at this population to be evaluated on more detailed models. 

160.3 Cardiovascular Modeling 

In this section, the concepts behind the segmental CV approach are described. Neurohumoral controls 
are also briefly discussed, as is the inclusion of pharmacokinetics and pharmacodynamics for drug uptake, 
distribution, and action. 

An Idealized Segment of Artery 

A short segment of artery can be modeled by an elastic, isobaric chamber attached to a rigid inlet and 
outlet tube (Fig. 160.1). The ability of the chamber to store fluid depends on its compliance, C, which 
is defined as 

dV 
C= — c - (160.1) 

dP 

c 

where V c is total segment (chamber) volume and P c is chamber pressure. Many modelers prefer to use 
the reciprocal of compliance (termed stiffness, S, or elastance, E). Here, viscoelastic (stress relaxation) 
effects are assumed negligible, so that compliance is not an explicit function of time. Thus, compliance 
becomes an instantaneous variable which can be obtained from the experimentally determined, steady- 
state pressure-volume (P-V) relationship of the segment. Furthermore, model order can now be reduced, 
as pressure can be mathematically represented as an empirical function of volume, either by a piecewise 
linear approximation, a polynomial quotient, or some other function fit to the P-V relationship. A typical 
P-V curve is illustrated in Fig. 160.2, along with its piecewise liner approximation. 

In arteries, pressure is usually positive with small oscillations about a nominal operating point (point A 
in Fig. 160.2). Hence the piecewise linear approximation can be reduced to a single line with constant 
slope II C: 
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FIGURE 160.1 Idealized segment of artery approximated by an elastic, isobaric chamber attached to a rigid inlet 
and outlet tube. P,„, P ouP and P c are mlet, outlet, and chamber pressures; f in and /„„, are ^lows; L in and L aM are inertances; 
R ht and R allt are viscous resistances to flow; and V c is total segment (chamber) volume. Compliance C is defined in 
the text. 




FIGURE 160.2 Static pressure-volume (P-V) relationship of a segment of artery. 

where V cu is the unstressed volume (the idealized zero-pressure volume intercept). Of course, if pressure 
fluctuates outside this region, then additional straight-line sections should be included. 

Assuming that blood is incompressible and newtonian, that the flow profile is parabolic and unchang- 
ing with axial distance along a straight rigid tube, then flow (poisenillean) is linearly proportional to the 
pressure gradient across the ends of the tube. Hence, flow into and out of the chamber in Fig. 160.1 may 
be calculated given P in and P ollt (the inlet and outlet pressures); P c ; L in and L mt (the inertances due to fluid 
mass); and -R,„ and R out (the resistances due to viscous drag) 



L. -^+R. f. =P. -P 

m i , in Jin in c 

at 



J Oil 



(160.3) 
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where f in and f out are the flows into and out of the segment. Volume can now be calculated from the 
difference between the inlet and outlet flows: 
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FIGURE 160.3 (a) The idealized segment of artery from Fig. 160.1 as an RLC circuit, (b) The nonlinear residual 
charge capacitor in (a). After Tsidik et al. [1992]. 

P c , V c , f in , and f out can now be uniquely determined at any time given P in , P mlt , and the initial conditions 
of the segment. Determination of the parameters L and R are more problematic. They can be derived 
analytically, although probably because the modeling assumptions are not completely correct, an empir- 
ical fit generally produces better segmental properties. 

The simplified segmental structure can be represented as an RLC circuit as shown in Fig. 160.3a. Tsitlik 
and coworkers [1992] remind us that each capacitor should contain a residual charge to simulate the 
respective segment's unstressed volume ( V cu ). This can be achieved most easily by inserting a battery 
between a standard capacitor and ground (Fig. 160. 3b) [Tsitlik et al., 1992]. Or, if the unstressed volume 
for the segment remains constant (say, because the pharmacologic drug of interest has little effect on it), 
then the unstressed volume (and the battery) can be eliminated by subtracting it from the total vascular 
blood volume. The segment will then contain the stressed volume only (V c -V n ), so that an ordinary 
capacitor will suffice (again assuming operation near point A on the P-V curve). 

An Idealized Segment of Vein 

In most of the venous circulation, the pressure inside the vessels is greater than the external pressure, so 
that the pressure-flow relationship is like that in the arteries. However, in the vena cava, in certain organs 
such as the lungs and during certain procedures such as resuscitation or measurement of blood pressure 
using a cuff, venous collapse may occur [Fung, 1984]. Thus, in contrast to an artery in which only one 
condition normally exists, two additional conditions must be considered for a vein. 

The first condition occurs when the inlet and outlet transmural pressures are positive. [Transmural 
pressure is defined as the pressure gradient across the vessel wall (Pj m id e -P outside) -\ ^ n this instance, the vein 
can be treated as discussed above for an artery. A second condition occurs when the inlet and outlet 
transmural pressures are negative. In this instance, the vein will collapse, and flow will stop or be greatly 
reduced. 

The third condition occurs when the inlet transmural pressure is positive and the outlet transmural 
pressure is negative. In this instance, blood flows into the chamber but not out because the outlet will 
be collapsed. As blood flows in, the pressure inside rises until it exceeds the external pressure, forcing 
the outlet open and allowing blood to flow out. But, as blood flows out, the internal pressure decreases, 
so that the outlet transmural pressure may become negative again, collapsing the outlet, and commencing 
the next cycle. This cycling is called flutter, and the collapsing at the outlet, choking. Besides flutter, a 
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limited steady-state flow is also possible if the outlet is not completely choked off [Fung, 1984]. Limited 
flow is the predominant (and likely the only) effect occurring under physiologic conditions [Noorder- 
graaf, 1978]. 

Purmutt and colleagues [1962] described this effect as a vascular waterfall. Water flowing over a falls 
depend only on conditions at the top, not on the length of the drop. In 1994 Holt used the term flow 
regulator for this effect in his seminal work on the collapsible penrose tube, and Conrad in 1969 called 
it a negative impedance conduit [Noordergraaf, 1978]. Starling in 1915 also made use of this concept 
when he used a collapsible tube, now known as a Starling resistor, to control peripheral resistance in his 
heart-lung preparations. Once flutter was found to be absent in the human physiology, Brower in 1970, 
Brower and Noordergraaf in 1973, and Griffiths in 1975 developed simplified equations of the pressure- 
flow relationship [Noordergraaf, 1978]. Snyder and Rideout [1969] also developed a model of collapsible 
vein. They employed a two-line approximation of the P-V curve — one for the uncollapsed region and 
one for the collapsed region. Then, based on modifications of the Navier-Stokes equations for an elliptical 
tube (the cross-sectional shape of a collapsing vessel), and assuming a flat flow profile, they were able to 
relate flow to pressure and volume in terms of nonlinear inertances, resistances, and compliances in 
Eqs. (160.2) and (160.3). They also included the effects of gravity and external pressures in the model. 

Arterial and Venous Trees 

By using the output of one segment as the input to a second, and the output of the second as the input 
to a third, and so on, differential equations for arbitrary lengths of vessel can be constructed (Fig. 160.4a). 
This method also allows bifurcations to be added. Another approach can be used to derive these same 
equations. First, partial differential equations are formulated to define flow along the desired length of 
vessel. The spacial differentials are then changed to finite differences, leaving only time derivatives 
[Noordergraaf, 1978]. 

Similar to an arterial section, the resulting lumped model of the vascular tree can be represented as 
an electrical circuit if desired (Fig. 160. 4b), and nearby resistors and inductances can be combined to 
form r, T, or II sections. A similar approach can be used for the venous circulation. To prevent ringing 
and other problems due to sudden impedance changes, it is often useful to add intermediate segments 
to taper the impedances between the heart and the capillaries [Rideout, 1991]. That is, the impedances 
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FIGURE 160.4 (a) A segmented, arbitrary length of vessel. (6) The equivalent RLC circuit. 
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of the large arteries should be tapered to match the higher resistive impedances of the arterioles and 
capillary beds and then reduced back again from the capillary beds to the large veins. 

Arterial and venous branching can be roughly grouped by vessel size (large, medium, and small arteries, 
arterioles, capillaries, etc.) or by organ system (heart, head, legs, liver, fat, skin, muscle, etc.). Vessel 
grouping is useful when the model needs to track drug concentrations at whole-body effector sites, such 
as nitroprusside concentration in the systemic arteriolar and venous compartments. Organ grouping is 
useful when the model needs to track drug concentrations in certain organs, such as insulin in the liver, 
glucose in the pancreas, or antineoplastic agents in diseased tissues. Note that within each organ, the 
flows can be broken down further by vessel size or by regional or conceptual intraorgan blood flows. 
Once the desired organs and vessels are described, Westkessel-type impedances can be used for the rest 
of the vasculature. 

Models of the Heart 

The heart is a muscular pump with four chambers intimately linked into one organ (Fig. 160.5a). 
Nevertheless, in modeling the heart, each chamber is usually modeled independently of the others. Each 
is typically set up as an elastic compartment with inertance and resistance similar to an artery or vein 
(see above). The differences are (1) that valves are included to constrain flow to one direction, and (2) that 
elastance (the inverse of compliance) is treated as a time-varying parameter. 

The valves are straightforward and are often implemented as ideal diodes (for electrical circuit models) 
or as IF-THEN-ELSE statements (for algorithmic models) to keep all flows nonnegative. Defects in the 
valves can be added to simulate heart defects (e.g., leaky diodes for regurgitation). Other types of heart 
defects are just as easily simulated. For example, Blackstone and coworkers [1976] placed an impedance 
between the atrial chambers to simulate a septal opening. 

Mathematical representation of the time-varying elastance is more complicated and is generally based 
on the characterization of the cardiac work cycle in the pressure-volume plane. In the P-V diagram in 
Fig. 160. 5b, the time course of the left ventricular cardiac cycle proceeds as follows. 
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FIGURE 160.5 (a) The chambers and valves of the heart. LV, left ventricle; RV, right ventricle; LA, left atrium; RA, 
right atrium; M, mitral valve; A, aortic valve; T, tricuspid valve; P, pulmonary valve. ( b) The left ventricular cardiac 
cycle in the P-V plane. See text. 
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During diastolic filling (A to B), ventricular pressure is below atrial and aortic pressures, so the mitral 
valve (between the left atrium and the left ventricle) is open, and the aortic valve (between the left 
ventricle and the aorta) is closed. Blood therefore flows from the atrium into the ventricle, based on the 
pressure gradient and the inertances and resistances of the inlet. Note that the pressure gradient will 
decrease as the ventricle fills due to the passive compliance of the chamber and the drop in pressure in 
the atrium. 

After diastole, the ventricle is stimulated to contract so that pressure rises above the atrial pressure, 
closing the mitral valve. In this state, pressure is too low for the aortic valve to open, so that both valves 
are closed and no blood flow occurs. Meanwhile, the ventricle continues to contract, raising ventricular 
pressure. This period (B to C) is termed isovolumetric contraction. Here, the muscle contraction can be 
thought of as decreasing the compliance of the chamber. 

Once ventricular pressure exceeds the pressure in the aorta, the aortic valve opens and systolic ejection 
commences (C to D). Blood flows out from the ventricle into the aorta based on the pressure gradient, 
the inertances, and the resistances of the outlet. The muscle continues to contract until the cardiac action 
potentials have run their course. 

As the muscle begins to relax, pressure will drop until it falls below the aortic pressure. At this time, 
the aortic valve closes, and again blood flow ceases. Meanwhile, the ventricle continues to relax, decreasing 
the ventricular pressure. This period (D to A) is termed isovolumetric relaxation. Here, muscle relaxation 
can be thought of as increasing the compliance of the chamber. Finally, ventricular pressure falls below 
atrial pressure, the mitral valve opens, and the cycle begins anew. 

Suga and Sagawa [1972, 1974] identified and characterized several key properties of the time-varying 
compliance in the pressure-volume plane. In their experiments, they varied preload, afterload, stroke 
volume, cardiac contractility, and heart rate for both ejecting beats (heart valves patent) and isovolumic 
beats (heart valves sewn shut). Four observations should be noted. First, the P-V point at the end of 
systole (Fig. 160.6a, points Dj— D 4 ) nearly always lies along the static P-V curve for activated myocardium. 
Second, the end systolic P-V curve is unaffected by changes in preload and afterload (Fig. 160.6a); 
however, it is affected by changes in cardiac contractility (Fig. 160. 6b). Third, t max , the time to end systole 
(point D), is also unaffected by changes in preload and afterload; however, it is affected by changes in 
cardiac contractility and heart rate. And fourth, the ventricular filling phase allows the static P-V curve 
for relaxed myocardium, as expected. There are some deviations from these observations, but over normal 
ranges these behaviors are fairly reliable [Maughan et al., 1984; Sunagawa & Sagawa, 1982]. 

Based on these observations, a continuum of static P-V relationships can be visualized as spanning 
the space between the fully relaxed and the fully contracted curves, say, as a linear combination of the 
two. The active curve at any time could then be construed as a function of the activity level of the 
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FIGURE 160.6 (a) Heart cycles with varying preload and afterload. (b) The end-systolic P-V curve with increasing 
contractility of the heart. 
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myocardium, a, which itself could be a function of time and t max , and which would range from zero 
(inactive) to one (fully active): 

PVR.(a(t,t )) = a(t,t )■ PVR K + (l-a(t,t ))-PVR n (160.5) 

l ^ ' max J J \ ' maxj ES \ \ ' max J J D K ' 

Here, PVR (pressure- volume relationship) is the active P-V curve, and PVR ES and PVR D are the end 
systolic and diastolic P-V curves, respectively. Using this approach, the PVR ES curve would be parame- 
terized in terms of the inotropic state of the myocardium, and t max would be a function of inotropic state 
and heart rate. 

Suga and Sagawa [1972] originally approximated the two curves, PVR ES and PVR D , with two straight 
lines that intersected in a common point on the volume axis (V d , the unstressed, or dead, volume). This 
was a reasonable approximation, since, in their original experiments, the nonlinear portions of the P-V 
curves were not encountered. Based on this approach, the elastance £ as a function of time becomes 
greatly simplified: 

E[t)= M W (160.6) 

w v(t)-v d 

With this formulation, u{t,t max ) maps to E(t) by the relation 

E(t) = a(t,t )-E +(l-a(t,t ))-E . (160.7) 

\ I I ' max I max I \ ' max I j mm x ' 

where E min and E max are the minimum and maximum elastances. E max sets the inotropic state, and E min 
sets the diastolic filling curve. A later modification allowed each curve to have its own dead volume 
[Sunagawa & Sagawa, 1982]. 

A common definition of a{t,t max ) is a squared sine wave, time scaled and shifted to fit its first half- 
period into the systolic time interval, and zeroed elsewhere, e.g., see Martin and coworkers [1986]. 
Another common definition uses the first half-period of a sine wave, time-scaled and shifted to fit into 
the systolic interval. It is sometimes clipped and sometimes modified with a second harmonic to skew 
the waveform, e.g., see Rideout [1991]. Still others have approximated a{t,t max ) as a square wave [Warner, 
1959], a triangle wave [Katona et al„ 1967; McLeod, 1966], a sum of charging and discharging exponen- 
tials [Sun & Chiaramida, 1992], and even as a sum of gaussian (bell-shaped) exponentials [Chung et al., 
1994]. 

Pulsations can likewise be added to the other heart chambers, although the shape of the elastance 
curves are somewhat different [Sunagawa & Sagawa, 1982]. In the left ventricle, E(t) appears more like 
a skewed sine wave while V d (t) is fairly constant during ejection; in the right ventricle, E(t) appears more 
like a squared sine wave with a larger E max , while V d (t) varies continuously throughout the systolic period; 
in the right atrium, E(t) and V d (t) behave as in the right ventricle. A common simplification of this 
relationship was employed by Leaning and coworkers [1983]. They used the same function for each 
chamber's elastance (in this case a sine wave), but with a different £„„■„, E max , and t max for each, as well as 
a different time shift to contract them when appropriate, e.g., the atria prior to the ventricles. As is 
common, they also made V d constant, though each chamber was provided with a different value. 

Since the activation function is typically time-scaled to fit within the heart period, the effect of a 
changing heart rate, say, due to neural reflexes, can be explored. However, this requires that the heart 
period itself be partitioned into diastolic and systolic intervals. Various partitioning schemes exist. For 
the left ventricle, some fix the duration of systole [Grodins, 1959] or set it to a percentage of the heart 
period, with diastole taking up the rest of the period [Rideout, 1991]. A general clinic rule allocates one- 
third to systole and two-thirds to diastole. However, Beneken and DeWit [1967], after summarizing 
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FIGURE 160.7 The Frank-Starling law of the heart; 3-D of Sagawa's [1967] left-ventricle equation for a 10-kg dog; 
CO, cardiac output; MAP, mean arterial pressure; MLAP, mean left arterial pressure. 

several publications, suggest that ventricular systole is better approximated by one-fifth the heart period 
plus 0.16 s. Likewise, they determined the duration of atrial systole as 9/100 the heart period plus 0.1 s. 
These formulae have gained considerable popularity, e.g., see Leaning and coworkers [1983] and Rideout 
[1991]. 

In many drug-delivery applications, the time constants of interest are on the order of minutes to hours. 
In these situations, a pulsatile heart is not needed and can be traded advantageously for a mean flow 
model. Mean flow models are based either on the Frank-Starling law of the heart or on the pulsatile 
model above. In the Frank-Starling approach, a family of flow curves is constructed based on experimental 
observations of the heart under various conditions of preload and afterload (Fig. 160.7) [Sagawa, 1967]. 
These are then used to calculate stroke volume for each beat. In the other approach, the stroke volume 
is derived analytically from the equations of the pulsatile heart [Rideout, 1991]. When using the Frank- 
Starling law, attenuation factors are used to impose the effects of inotropic state and heart rate. Examples 
of nonpulsatile heat models include those by Sagawa [1967], Greenway [1982], Moller and coworkers 
[1983], and Tham and coworkers [1990]. The Guyton-Coleman model [Guyton et al., 1972, 1980] is also 
nonpulsatile, though it uses a very different approach (see below). 

Some models of the uptake and distribution of slow-acting drugs do not explicitly use a heart, or, for 
that matter, a circulation. These fall under the category of purely compartmental models. In these models, 
the heart and circulation are subsumed into an assumption that each compartment in the model is 
uniformly mixed (See Chapter 159). 

Models of Combined Heart and Circulation 

Whole-body CV models couple a model of the heart to models of the vasculature. The complexity of 
the coupling depends on the particular need and can give rise to nonpulsatile left side (left side meaning 
the systemic circulation) only, nonpulsatile left and right side (right side meaning the pulmonary circu- 
lation), and pulsatile left and right side models [Rideout, 1991]. 

Left-side-only models are justified when the pharmaceutical agents of interest have little effect on the 
pulmonary circulation. In these cases, the right side is considered a follower circuit and hence can be 
eliminated to provide computational savings. Even if the agents of interest to affect the pulmonary 
circulation, an occamistic left-side-only model may still suffice. This approach may work for two reasons: 
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Either the model order is high enough that the right-side effects are inadvertently captured in the left- 
side model, or the drug effects are essentially the same on both sides. Keep in mind that these models 
have limited scope and hence should be used with care. 

As an example of the modeling considerations for drug delivery, consider the following two drugs: 
propranolol, a beta blocker; and sodium nitroprusside (SNP), a vasodilator. Propranolol has a time 
constant on the order of 33 min, primarily affects the heart, and has only minor effects on the pulmonary 
circulation (although it may constrict the airways) [Ewy & Bressler, 1992; Gilman et al., 1993]. Hence, 
for propranolol, a nonpulsatile left-side-only model should be adequate for controller design. SNP, 
however, has a transport delay of 0.5 min and a time constant between 0.25 and 0.50 min. It affects the 
arterioles, the venous unstressed volume, and venous compliance. It also significantly decreases pulmo- 
nary wedge pressure [Ewy & Bressler, 1992; Gilman et al., 1993; Greenway, 1982]. Hence, for SNP, a 
nonpulsatile model is questionable, though it may be adequate. Furthermore, a two-sided model should 
be used, since SNP may have a significant effect on the pulmonary circulation. Greenway's [1982] two- 
sided nonpulsatile model seems to function adequately, although other investigators have switched to 
two-sided pulsatile models, e.g., see Yu et al., 1990. 

Interestingly, Martin et al. [1986] may have finessed the field of nonpulsatile models by overcoming 
the speed limitations associated with pulsatile models. They did this by solving the pulsatile model 
equations semi-analytically, and were thus able to significantly reduce simulation time. Their model 
currently makes an attractive choice for studying and designing automated drug-delivery systems. 

Despite the emphasis here on nonlinear CV-based modeling, probably the most used model for the 
design of blood pressure controllers is the model by Slate and Sheppard [1982]. It is based on a linear, 
first-order impulse response of the effects of nitroprusside on mean arterial blood pressure. The model 
includes a recirculation effect, as well as an occasionally observed nonlinear reflex (possibly due to the 
chemoreceptors). It has gained wide acceptance probably because of its simplicity. Nevertheless, it has 
severe limitations: It lumps the effects of the primary neural and humoral reflexes as well as the patient 
drug sensitivity into one gain; it does not account for the effect of changing cardiac output on drug 
transport; and it is not easily extended to include additional drugs or disease states. 

Neural and Humoral Control 

Once the heart and vasculature are linked, neural and hormonal controls need to be added. These include 
the baroreflex, the chemoreflex, the renin-angiotensin reflex, capillary fluid shift, autoregulation, stress 
relaxation, and renal-body- fluid balance (water intake and urine output). 

Baroreceptors monitor the pressure in the carotid sinuses, the aortic arch, and other large systemic 
arteries and increase their firing rate when the pressure increases. Their response is nonlinear and depends 
on whether they are exposed to mean pressure only, pulsatile pressure only, or a combination of both. 
Katona and colleagues [1967] developed a model of baroreceptor feedback that has become the basis for 
many CV neural control models. The output of the baroreceptor model is often passed through a low 
pass filter representing the CNS and then mapped back to changes in heart rate, contractility, vascular 
resistances, and vascular unstressed volumes through the sympathetic and parasympathetic nervous 
systems (Fig. 160.8), e.g., see Yu and coworkers [1990]. 

In addition to baroreceptors, chemoreceptors may also mediate a strong CV response, especially when 
systemic arterial pressure falls below 80 mmHg [Guyton et al., 1972; see also Dampney, 1994]. This reflex, 
through the CNS, increases cardiac activity and peripheral resistance. Strangely, few models include this 
effect, although Slate and Sheppard [1982] may have inadvertently included it in their simplified black 
box model. 

The renin-angiotensin reflex is mediated hormonally through the kidneys. When pressure falls below 
normal, renin is secreted into the blood stream by the kidneys, which then causes the release of free 
angiotensin. Angiotensin causes marked vasoconstriction and thus increases peripheral resistance and 
arterial pressure. This reflex is often included in models because it plays an important role in many forms 
of hypertension and heart failure. 
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FIGURE 160.8 Baroreceptor firing rate is passed through a low-pass filter representing the CNS and then mapped 
back to changes in heart rate, contractility, vascular resistances, and vascular unstressed volumes. 

The other CV control reflexes are only sometimes included, although these and others are elucidated 
and modeled by Guyton and his coworkers [1972, 1980]. In his elaborate model, there are five main 
empirically derived physiologic function blocks with many subcomponents. The model has nearly 400 
parameters and is quite remarkable in its scope. Given current computer technology, however, it is 
somewhat cumbersome for drug delivery applications, although it promises to have great utility in the 
future. 

Combined CV and Pharmacologic Models 

For pharmacologic control studies, CV models readily lend themselves to the calculation of drug uptake 
and distribution to the various parts of the body (pharmacokinetics). Once the transport model has been 
built, models of drug action (pharmacodynamics) then need to be linked and related back to the CV 
model. Other models must also be linked if necessary. For example, models of inhalational anesthetic 
uptake, distribution, and effect on the respiratory and CV systems require not only a CV and pharma- 
cologic model but also a lung model. Combined models such as these are known as multiple models, a 
term coined by Rideout's group [Beneken & Rideout, 1968; Rideout, 1991]. Again, the time course of 
the agents and their effects on the heart and circulation determine whether a pulsatile or nonpulsatile 
model is needed, as well as the requisite level of vascular detail. 

To calculate drug uptake and transport, mass balance equations can be set up for each segment and 
compartment in the CV model. Additional compartments, such as a tissue or an effects compartment, 
can also be added (see Chapter 159). For example, given a particular compartment (or segment), the 
mass of drug in the compartment is calculated as follows: 



dt 



' fin,, C ,-l- L,,, 1 



(160.8) 



where i identifies the compartment, q is the mass of drug, c is its concentration, f in is the rate of blood 
flow into the compartment (normally equal to the flow out of the previous compartment), and f out is the 
rate of blood flow out of the compartment. The instantaneous concentration in any compartment can 
then be calculated based on drug mass and total volume of the compartment (stressed plus unstressed 
volumes); 
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Once the concentration is known at the effector site, a pharmacodynamic model is used to describe 
its action. Saturation may need to be built into the effect, as well as a possible threshold concentration. 
Hill's sigmoidal equation can accommodate both: 



EFF = e - (160.10) 

50 e 

where EFF is the normalized pharmacologic effect (0 < EFF < 1), c e is the concentration at the effector 
site, c 50 is the half-matrix concentration, and n parameterizes the steepness of the sigmoid. Hill's equation 
becomes the Michaelis-Menten equation when n — 1. 

Once calculated, the drug effect can be used to modify the parameters in the CV model, much like 
the neural and humoral reflexes. Resistances, compliances (including the unstressed volumes) of each 
segment, neural and humoral feedback gains, heart rate, and contractility are all typically modified by 
drugs. For example, sodium nitroprusside causes vasodilation and blood pooling; therefore it would be 
made to primarily increase the compliance and unstressed volume of the veins, as well as to decrease 
arteriolar (peripheral) resistance [Greenway, 1982; Yu et al., 1990]. 

160.4 Conclusions 

For designing controllers to automate cardiovascular therapeutics, the nonlinear, cardiovascular-linked 
pharmacologic models are well worth the added effort and time needed to construct and interface them 
to the design platform. Not only can they help prove initial feasibility, but they can also speed up the 
overall design process. For example, they can be used for interactive controller design; they can be used 
to design improved animal and human experiments, which can potentially reduce the total number of 
experiments; and they can be used to simulate human conditions that are not easily produced in animals. 
The reduced compartmental and combined segmental and transmission line models are probably the 
most practical model form at this time given current computer technology. However, as computer 
technology grows, we can expect to see increasingly detailed pharmacologic models that include addi- 
tional cardiovascular features such as pulse wave propagation and reflection as well as multiple short- 
and long-term neurohumoral feedback loops. 
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Further Information 

The text by Rideout [1991] has a particularly good tutorial on cardiovascular modeling and even includes 
ACSL source code. McDonald's [1974] and Noordergraaf 's [ 1978] texts are classics, thoroughly reviewing 
cardiovascular physiology and the physical principles useful in modeling it. More recent texts on the 
cardiovascular system with a bioengineering slant include those by Milnor [1989], Li [1987], and Fung 
[ 1 984] . Also, Guyton's 1 980 text provides a detailed review of his model and its application for the analysis 
of hypertension and is well worth reading. Along this same line, Safar and O'Rourke's [1993] text, as 
well as O'Rourke's earlier work [1982], are also worth reading and provide many insights into hypertension 
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and cardiovascular modeling. Melchior, Srinivasan, and Charles [1992] also present an overview of 
cardiovascular modeling, though with an emphasis on orthostatic response. 

For historical interest, The History of Cardiology by Acierno [1994] provides fascinating reading, going 
back beyond even the early Greeks. In more recent history, a review of early CV models (early 1900s up 
to 1940) can be found in Aperia's monograph [1940], and a review of the post-World War II era models 
(late 1950s to early 1970s) that set the tone for almost all modern CV models can be found in Sagawa 
[1973]. 

For those interested in pharmacologic modeling, along with examples using the CV system, Carson, 
Cobelli, and Finkelstein's 1983 text is a good source. Middleman's 1972 text specifically targets in transport 
in the CV system, but it is somewhat dated. The current understanding of the anatomic and functional 
organization of the neurohumoral regulation pathways can be found in Dampney [1994]. 

For a survey of blood pressure controllers, the reader is referred to the recent article by Isaka and 
Sebald [1993]. Linkens and Hacisalihzade [1990], Katona [1988], and Westenskow [1986] also survey 
blood pressure controllers as well as other CV control applications. 
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The respiratory system is a complex neurodynamical system. It exhibits many interesting characteristics 
that are akin to other physiologic control systems. However, the respiratory system is much more 
amenable to modeling analysis than many other systems for two reasons. First, the respiratory system is 
dedicated to a highly specific physiologic function, namely, the exchange of 2 and C0 2 through the 
motor act of breathing. This physiologic function is readily distinguishable from extraneous disturbances 
arising from behavioral and other functions of the respiratory muscles. Second, the respiratory control 
system is structurally well organized, with well-defined afferent and efferent neural pathways, peripheral 
controlled processes, and a central controller. The functional and structural specificity of the respiratory 
system — and the diverse neurodynamic behaviors it represents — make it an ideal model system to 
illustrate the basic principles of physiologic control systems in general. 

Like any closed-loop system, the behavior of the respiratory control system is defined by the continual 
interaction of the controller and the peripheral processes being controlled. The latter include the respi- 
ratory mechanical system and the pulmonary gas exchange process. These peripheral processes have been 
extensively studied, and their quantitative relationships have been described in detail in previous reviews. 
Less well understood is the behavior of the respiratory controller and the way in which it processes 
afferent inputs. A confounding factor is that the controller may manifest itself in many different ways, 
depending on the modeling and experimental approaches being taken. Traditionally, the respiratory 
control system has been modeled as a closed-loop feedback/feedforward regulator whereby homeostasis 
of arterial blood gas and pH is maintained. Alternatively, the respiratory controller may be viewed as a 
central pattern generator in which rhythmic respiratory activity is produced in response to phasic afferent 
feedback. Finally, there is increasing evidence that the respiratory controller may function as an adaptive 
self-tuning regulator which optimizes breathing pattern and ventilation according to certain performance 



measure. 
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Each modeling approach reveals a separate "law" of a multifaceted controller. However, these control 
laws must be somehow related to one another because they are governed by the same neural network 
that forms the respiratory controller. It is therefore instructive to examine not only how the various 
models work but also how they might be fit together to encompass the myriad of response behaviors of 
the respiratory control system. 

161.1 Structure of the Respiratory Control System 

The respiratory control system is a nonlinear, multioutput, delayed-feedback dynamic system which is 
constantly being perturbed by unknown physiologic and pathologic disturbances (Fig. 161.1). Rhythmic 
respiratory activity is produced by a respiratory central pattern generator (RCPG) which consists of a 
network of neuronal clusters in the medulla oblongata and pons areas of the brain stem. The RCPG 
forms the kernel of the respiratory controller. The control problem is defined by the characteristics of 
the controlled processes (plants) and the control objective. 

Chemical Plant 
Pulmonary Gas Exchange 

The chemical plant is propelled by the ventilation of the lung, V E which determines the alveolar PC0 2 
and P0 2 according to the following mass-balance equations: 
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The input-output relationships of the chemical plants are subject to several endogenous and exogenous 
disturbances. For example, increases in the metabolic production of C0 2 during muscular exercise, PC0 2 
in the inspired air, and respiratory dead space during rebreathing — all contribute to an increase in C0 2 
load to the lung. The added C0 2 is eliminated by an increase in pulmonary ventilation, but the effec- 
tiveness of the control action in restoring -P A C0 2 varies considerably with the type of disturbance. The 
input-output sensitivity S 2 £ §P A CO^Vg of the plant is highest with metabolic C0 2 load and lowest 
with inhaled C0 2 load, whereas the plant dynamics is slowest with dead space load. 

In pulmonary disease the efficiency of the plant in removing C0 2 is further decreased because of 
pulmonary ventilation/perfusion maldistribution [Poon, 1987a]. Another source of disturbance is met- 
abolic acidosis and alkalosis which alter arterial blood gas tensions and pH through acid-base buffering 
in blood. All these disturbances interact nonlinearly with the plant and are not directly sensed. How does 
the controller cope with such variety of disturbances effectively if its only means is to alter V E 7 . Not 
surprisingly, the controller responds quite differently to the various forms of C0 2 disturbance and their 
combinations [Juratsch et al, 1982; Oren et al., 1981; Poon & Greene, 1985; Poon, 1989a, 1989b, 1992b; 
Sidney & Poon, 1995]. 

Chemosensory Feedback 

The peripheral (arterial) chemoreceptors response to changes in P CO 2 may be modeled by linear first- 
order dynamics [Bellville et al., 1979] 



p dt p p p 
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(161.3) 



where X , G„, L, are the time constant, gain, threshold of the peripheral chemoreceptor and T is the 
lung-to-chemoreceptor transit delay of blood flow. Similarly, the central (intracranial) chemoreceptor is 
responsive to brain tissue PC0 2 

A = G c {P b C0 2 -I c } (161.4) 

where the brain tissue PC0 2 is given by 

V.CO, ■— RCO = alpCoit-T)- P,C0\+ MhC ° 2 (161.5) 

h 2 dt h 2 H " A c > h 2 J KC0 2 

and the cerebral blood flow [Vis & Folgering, 1980] is given by 

Q b (t) = Q +AQ b (t) (161.6) 

V^AQ^-AQ^ + G.-PCqjf-r) (161.7) 

The central chemoreceptor normally has a greater sensitivity than the peripheral chemoreceptors (which 
may be silenced by hyperoxia). Their combined effects are presumably additive at low to moderate 
stimulation levels [Heeringaet al., 1979] but may become progressively saturated at higher levels [Eldridge 
et al., 1981]. The peripheral chemoreceptors have a shorter response time constant and delay than the 
central chemoreceptor, presumably due to their proximity to the lung and arterial blood gases. They are 
generally thought to be more responsive to rapid changes in blood chemistry; although the central 
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chemoreceptor may also contribute substantially to the transient respiratory response to breath-to-breath 
fluctuations in chemical input during eupneic breathing in wakefulness [Bruce et al., 1992]. 

The dynamic response of the peripheral chemoreceptors to P a 2 ma y De modeled in a similar fashion 
but the steady-state response is hyperbolic [Cunningham et al., 1986]. Furthermore, hypoxia and hyper- 
capnia have a multiplicative effect on carotid chemoreceptor discharge [Fitzgerald & Lahiri, 1986], 
although the effect of changes in pH is presumably additive [Cunningham et al., 1986] . Prolonged hypoxia 
has a depressant effect on central respiratory neurons [Neubauer et al., 1990] which may offset the 
stimulatory effect of peripheral chemosensitivity. 

Inputs from the peripheral chemoreceptors are temporally gated throughout the respiratory cycle. 
Stimuli are effective only if they are delivered during the second half of the inspiratory or expiratory 
cycle [Eldridge & Millhorn, 1986; Hildebrandt, 1977; Teppema et al., 1985]. The effect of such stimuli 
on medullary respiratory neurons is excitatory during the inspiratory phase and inhibitory during the 
expiratory phase [Lipski & Voss, 1990]. The gating effect may be modeled by a gated time function 



A «=s flfe K( f )} (l6L8) 



where gate {•} is a windowing function which reflects the phasic activity of peripheral chemoreceptor 
inputs. 

The intrinsic transient delay and nonlinearity in the transduction of chemical signals are undesirable 
from an automatic control perspective. On the other hand, they may represent neural preprocessing in 
the sensory nervous system. Such transformations in the feedback path may have important bearing on 
the controller's ability to achieve the control objective. 

Mechanical Plant 

Respiratory Mechanics 

Pulmonary ventilation results from tidal expansion and relaxation of the lung. The equation of motion is: 



dV 
dt 



V+P (t)-P (t) (161.9) 



Both R rs and E n may be nonlinear. The mechanical load of the respiratory muscles is increased by airway 
constriction or lung restriction in pulmonary disease, or by the imposition of external resistance and 
elastance which cause a back pressure P aw at the airway opening. Increases in ventilatory load may elicit 
a compensatory response in P mm so that V E is largely restored [Poon 1987fo, 1989a, 1989b] except under 
severe ventilatory loading. Reverse compensatory responses are observed during ventilatory unloading 
[Poon et al., 1987 a,b; Ward et al., 1982]. 

Respiratory Muscles 

The mechanical plant is propelled by the respiratory muscles which serve as a mechanical pump. Nor- 
mally, P mm is sustained by the inspiratory muscles (principally the diaphragm and, to a lesser extent, the 
external intercostal and parasternal muscles). During hyperpnea or with increased expiratory load the 
expiratory muscles may be recruited [Marroquin, 1991], as are accessory muscles which may contribute 
significantly to the generation of P mm in paraplegics. 

Paradoxically, the respiratory muscles may also act to reduce ventilation. During quiet breathing some 
postinspiratory inspiratory activity may persist during the early expiratory phase, retarding lung emptying 
[Poon et al., 1987fo]. In disease states inspiratory activity may also be recruited during the expiratory 
phase to prevent lung collapse [Stark et al., 1987] or overinfiation. The upper airways abductor muscles 
may be activated by the controller or higher brain centers to increase R n in disease states [Stark et al., 
1987] or in sleep [Phillipson & Bowes, 1986] . In awake states, respiration may be interrupted by behavioral, 
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emotional, or defensive activities which compete for the respiratory apparatus under cortical or somatic 
sensory commands. 

The mechanical efficiency of the respiratory muscles is limited by the force-velocity and force-length 
relationships which may be modeled as linear "active" resistance and elastance, respectively [Milic-Emili & 
Zin, 1986]. Neuromechanical efficiency may be impaired in respiratory muscle fatigue or muscle weak- 
ness, resulting in diminished P mm for any given neural drive. 

Thus the respiratory pump is equipped with various types of actuators to regulate V F in the face of 
many exogenous and endogenous mechanical disturbances. Such disturbances are detected by mecha- 
noafferents originating in the lungs, thorax, and airways. 

Mechanosensory Feedback 

Among the various types of mechanoafferents, vagal slowly adapting pulmonary stretch receptors (SAR) 
input exerts the greatest influence on respiration by inhibiting inspiratory activity and facilitating expi- 
ratory activity (Hering-Breurer reflex). Vagal SAR volume feedback is crucial for the compensatory 
response of the RCPG to mechanical disturbances. 

The SAR characteristic is liner at low lung volumes but may exhibit saturation nonlinearity at elevated 
volumes. Also, at any given lung volume SAR discharge increases with increasing rate of lung inflation 
[Pack et al., 1986]. To a first approximation vagal volume feedback may be modeled by 

vag(t\=sat\a Q +a l v(t) + a 2 v(t)\ (161.10) 

where sat{-} is a saturation function; a denotes tonic vagal activity [Phillipson, 1974]; and a x , o 2 are 
sensitivity parameters. 

Control Object 

It is generally assumed that the prime function of respiration is to meet the metabolic demand by the 
exchange of 2 and C0 2 . The control problem is to accomplish this objective within some prescribed 
tolerance, subject to the physical as well as environmental constraints of the respiratory apparatus. Various 
models of the controller have been proposed as possible solutions to the control problem [Petersen, 
1981], each representing a separate control strategy and degree of modeling abstraction. In what follows 
we provide a synopsis of the various modeling approaches and their physiologic significance. 

161.2 Chemoreflex Models 

The simplest controller model is a proportional controller [Cunningham et al., 1986; Grodins & Yamash- 
iro, 1978]. In this approach attention is focused on the control of the chemical plant; the RCPG and the 
mechanical plant are lumped together to form a constant-gain controller. The control signal is taken to 
be V E , and the effects of the breathing pattern and vagal feedback are often neglected. Thus the controller 
is assumed to be a fixed relay station that regulates V E via the chemoreflex loop. These simplification 
allow quantitative closed-loop analyses of the chemoreflex system. 

Feedback Control 

In the chemoreflex model the respiratory control system acts like a chemostat. This is consistent with 
the well-known experimental observation that ventilation increases with increasing chemical stimulation. 
The steady-state ventilatory response to C0 2 inhalation is given by: 

V £ = a(PflC0 2 -|3) (161.11) 
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This empirical relationship is in agreement with the chemoreflex model which assumes that ventilatory 
output is proportional to the sum of chemosensory feedback: 

V e = G (a+A p ) (161.12) 

Equation (161.11) follows from the steady-state solutions to the model Eqs. ( 161.3)— (161.7) and (161.12), 
provided that the effect of changes in cerebral blood flow may be neglected. The effects of hypoxia and 
hypoxic-hypercapnic interaction may be modeled in a similar fashion by a hyperbolic relation [Cun- 
ningham et al., 1986]. 

A comprehensive model of the dynamical chemoreflex system is due to the classic work of Grodins 
and coworkers [1967]. The dynamical response to C0 2 is given by the transient solutions of the system 
equations. Eqs. ( 161.3)— (161.7) and (161.12). Under closed-loop conditions system dynamics is also 
influenced by the nonlinear dynamical response of the chemical plant, Eq. (161.1), as well as C0 2 uptake 
in body tissues. Therefore, measurement of the C0 2 response curve, Eq. (161.8), is often a slow and 
tedious procedure. Open-loop conditions may be achieved by dynamic forcing of end-tidal PC0 2 with 
servo-control of P t C0 2 which obviates the nonlinearity and slow equilibration of the lung and other body 
tissue stores. This technique has been used to experimentally estimate the model parameters in normal 
and peripheral-chemoreceptor denervated subjects by means of system identification techniques [Bellville 
et al., 1979]. Another open-loop technique is the C0 2 rebreathing procedure [Read, 1967] which causes 
a metabolically induced ramp increase in P a C0 2 and, hence, a similar increase in V F by way of chemoreflex. 
The rebreathing technique is much simpler to use than dynamic end-tidal PC0 2 forcing and is therefore 
suitable for clinical applications. However, this technique may not accurately determine the steady-state 
C0 2 response, presumably because of the variability in cerebral blood flow induced by changing PJZ0 2 . 
More reliable techniques using pulse-step [Poon & Olson, 1985] or step-ramp [Dahan 1990] P,C0 2 
forcings have been proposed for rapid determination of the steady-state C0 2 response with minimal 
experimental requirements. 

Stability of the chemoreflex model may be studied by linearization of the system about some nominal 
state. Instability may occur if the loop gain and phase shift exceed unity and 180°, respectively. This may 
explain the periodic breathing phenomena found in hypoxia or high altitude where peripheral chemore- 
ceptor gain is increased and congestive heart failure in which transit delay is prolonged [Khoo et al., 
1982] . However, chemical instability alone does not account for the periodic episodes in obstructive sleep 
apnea which may also be influenced by fluctuation in arousal state [Khoo et al., 1991]. A technique for 
estimation of chemoreflex loop gain using pseudorandom binary C0 2 forcing has been proposed 
[Ghazanshahi & Khoo, 1997]. 

Feedforward Control 

The chemoreflex model provides a satisfactory explanation for the chemical regulation of ventilation as 
well as respiratory instability. However, it fails to explain a fundamental aspect of ventilatory control 
experienced by everyone in everyday life: the increase in ventilation during muscular exercise. Typically, V E 
increases in direct proportion to the metabolic demand (VC0 2 , VD 2 ) such that the outputs of the chemical 
plant, Eqs. (161.1) and (161.2), are well regulated at constant levels from rest to exercise. As a result, 
homeostasis of arterial blood chemistry is closely maintained over a wide range of work rates. The 
dilemma is: If increases in metabolic rate are not accompanied by corresponding increases in chemical 
feedback, then what causes exercise hyperpnea? 

One possible explanation of exercise hyperpnea is the so-called set-point hypothesis [Defares, 1964; 
Oren et al., 1981] which stipulates that P„C0 2 and PJD 2 are regulated at constant levels by the chemoreflex 
controller. However, a set point is not evident during hypercapnic and hypoxic challenges where the 
homeostasis of P CO 2 and PJD 2 is readily abolished. Furthermore, to establish a set point the controller 
gain must be exceedingly high, but this is not found experimentally in the hypercapnic and hypoxic 
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ventilatory sensitivities. Finally, from a control systems perspective a high gain controller is undesirable 
because it could drive the system into saturation or instability. 

Another possible explanation of exercise hyperpnea is that the proportional controller may be driven 
by two sets of inputs: a chemical feedback component via the chemorefiex loop and a feedforward 
component induced by some exercise stimulus [Grodins & Yamashiro, 1978]. The "feedforward-feedback 
control" hypothesis offers a simple remedy of the chemorefiex model, but its validity can be verified only 
if the postulated "exercise stimulus" is identified. Unfortunately, although many such signals have been 
proposed as possible candidates [Wasserman et al., 1986], none of them has so far been unequivocally 
shown to represent a sufficient stimulus for exercise. 

Among the variously proposed mechanisms of exercise hyperpnea, the "PC0 2 oscillation" hypothesis 
of Yamamoto [1962] has received widespread attention. According to this hypothesis, the controller may 
be responsive not only to the mean value of chemical feedback but also to its oscillatory waveform which 
is induced by the tidal rhythm of respiration. This hypothesis is supported by the experimental finding 
that alterations of the temporal relationship of the -P A C0 2 waveform could profoundly modulate the 
exercise hyperpnea response [Poon, 1992^]. 

Regardless of the origin (or even existence) of the exercise stimulus, there is evidence that exercise and 
chemical signals are processed by the controller in a multiplicative fashion [Poon & Greene, 1985; Poon, 
1989a, 1989b] . The input-output relationship of the controller may therefore be written as 



V=G n (A +a)a +G A (161.13) 

h 0\ c pi ex ex ex v ' 



Note that V E vanishes when A ex is reduced to zero, in agreement with the finding of Phillipson and 
colleagues [1981]. 

161.3 Models of Respiratory Central Pattern Generator 

Although the chemorefiex model is useful in studying the closed-loop control of ventilation and the 
chemical plant, it is lacking as to the control of breathing pattern and the mechanical plant. Models of 
the RCPG are often studied under open-loop conditions with chemical feedbacks being held constant 
while vagal volume feedback is experimentally manipulated. Modeling and analysis of the RCPG in 
closed-loop form are difficult because the input-output relationships of the RCPG and the mechanism 
of pattern generation are poorly understood. Several modeling approaches have been used to characterize 
its behavior under specific conditions. 

Phase-Switching Models 

The breathing cycle consists of two distinct phases: inspiration and expiration. It is therefore natural to 
model each phase separately and then combine them with a model of phase switching (for review, see 
Younes and Remmers [1981]). Each phase is controlled by a separate "center" which generates ramplike 
central inspiratory and expiratory activity, respectively. The resulting lung expansion and collapse are 
sensed by SAR, which causes the RCPG to switch from inspiration phase to expiration phase, or vice 
versa, when the volume-time relationship reaches some inspiratory or expiratory off-switch threshold. 
The off-switch mechanisms are dependent on both the threshold values and the corresponding volume- 
time histories. Also, inspiratory and expiratory off-switches may be mechanically linked, as suggested by 
the correlation between inspiratory and expiratory durations in consecutive breaths. Both the rate of rise 
of inspiratory activity and the inspiratory off-switch threshold increase with increasing chemical stimu- 
lation. Throughout the inspiratory phase, vagal volume feedback exerts graded and reversible inhibition 
on inspiratory activity until immediately before the inspiratory off-switch where inspiratory termination 
becomes irreversible [Younes et al., 1978]. Perturbations of the mechanical plant, such as increases in 
ventilatory load, alter respiratory rhythm by changing the volume-time profiles and off-switch thresholds 
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FIGURE 161.2 Stylized diagrams illustrating the Hey plots (left) and respiratory phase-switching curves (right) for 
subjects breathing normally (NL) or: (a) under an inspiratory resistive load (IRL): (b) under an inspiratory elastic 
load (IEL) at rest or during exercise. In (a), IRL causes the normal operating point (C) to move to a new point LE 
if the inspirate is free of C0 2 , or to a point LH if C0 2 is added. In (b), IEL moves the operating point to the same 
point L whether CO, is present in the inspirate. Thus, the consequence of load compensation is highly dependent 
on the background C0 2 and the type of load. Relative scales only are shown on all axes. Source: Poon [1989a, 1989b], 
with permission. 



(Fig. 161.2). These effects are abolished by vagotomy such that the RCPG always oscillates with a fixed 
rhythm. Various neuronal models of phase switching have been proposed [Cohen & Feldman, 1977], 
and putative phase-switching neurons in the medulla have been identified [Oku et al., 1992]. 

The two-phase respiratory cycle may be expanded into three phases if the expiratory phase is subdivided 
into an early postinspiratory inspiratory phase (stage I) and a late expiration phase (stage II) . Vagal feedback 
has differential effects on neural activities during the two stages of expiration [St. John & Zhou, 1990]. 

An important link between RCPG and chemoreflex models is the Hey plot [Hey et al., 1966] which 
suggests that V E and V T are linearly related except when V T approaches vital capacity. The slope of the 
Hey plot is altered by resistive loads but not elastic loads to ventilation (Fig. 161.2). 

Neural Network Models 

Respiratory rhythmicity is an emergent property of the RCPG resulting from mutual inhibition of 
inspiratory and expiratory related neurons. A minimal model due to Duffin [1991] postulated the early- 
burst inspiratory (I) neurons and Botzinger complex expiratory (£) neurons to be the mutually inhibiting 
pair. Adaptation of the I neurons (e.g., by calcium-activated potassium conductance) results in sustained 
relaxation oscillation in the network under constant chemical excitation. Both neuron groups are assumed 
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to have monosynaptic inhibitory projections to bulbospinal inspira- 
tory (I s ) output neurons (Fig. 161.3). The model equations are: 



T^—x. = R-E+'Ycy 



dx 
dt 

dx 



(161.14) 



T, — f -+x f = R f + C f x, 

1 dt I I ' ' 




where y i = g(x t — H,); g(X) = max(0, X); T t , R p E p H t are respectively 
the membrane time constant, resting activity, excitation, and 
threshold of the zth neuron; C* is the connection strength between 
the rth and;th neurons; and x v Xj, x 3 , x, correspond respectively to 
the activities of the J, E, I R neurons and a Active (F) neuron which 
represents the adaptation effect. 

The two-phase Duffin model may be extended to a three-phase 
pattern by incorporating other respiratory-related neurons. Richter 
and coworkers [Ogilvie et al., 1992; Richter et al., 1986] have pro- 
posed a three-phase RCPG model with mutual inhibition between 
I neurons and postinspiratory, late-inspiratory, and E neurons, all 
with adaptations. Botros and Bruce [1990] have described several 
variants of the Richter model with varying connectivity patterns, 
with and without adaptation. 

Balis and coworkers [1994] proposed a RCPG model comprised 
of distributed populations of spiking neurons which are described 

by the Hodgkin-Huxley equations. Based on extensive spike-train analysis of neural recordings in vivo, 
the network connectivity differs from the Richter model in several essential ways. The model has been 
shown to mimick the normal and abnormal respiratory neural waveforms including the inspiratory ramp, 
inspiratory and expiratory off-switches, as well as apnea and apneusis. Rybak and colleagues [\997a,b] 
proposed a similar RCPG model that is modulated by both the membrane properties of the unit neuron 
and afferent feedback of the respiratory system. 



FIGURE 161.3 A minimal neural 
network model of RCPG. I and E 
denote respectively the early-burst 
inspiratory neurons and Botzinger 
complex expiratory neurons; F is a Ac- 
tive adaptation neuron; S is an excita- 
tion neuron or pacemaker cell. Not 
shown is the bulbospinal output neu- 
ron (I ). Numbers denote connection 
strengths. Source: Matsugu and 
coworkers [1998], with permission. 



Pacemaker and Hybrid Models 

Respiratory rhythm may also originate from endogenous pacemaker cells. In neonatal rats, certain 
conditional bursting pacemaker neurons have recently been identified in a region of the ventrolateral 
medulla referred to as the pre-Botzinger complex [Smith et al., 1991]. Network oscillation by mutual 
inhibition is difficult in neonates because of immaturity of the GABAergic inhibitory system. During the 
postnatal developmental stage there may be progressive transformation from pacemaker to network 
oscillation with possibly some form of hybrid operation in the transition period [Smith, 1994]. A hybrid 
RCPG model composed of a neural network oscillator driven by a pacemaker cell (Fig. 161.3) has been 
proposed [Matsugu et al., 1998]. 

Nonlinear Dynamics 

An empirical (black-box) approach to studying rhythmic behavior is to model the oscillation as limit 
cycles of a nonlinear oscillator. A classic example of limit-cycle oscillation is the Van der Pol oscillator: 



V = e 



V 2 



V-V 



(160.15) 
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where £ and a are shape and amplitude parameters, respectively. This model has been shown to mimick 
the phase resetting and phase singularity properties of the RCPG [Eldridge et al., 1989] which are typical 
of any oscillator with isolated limit cycles. Similar properties are exhibited by a network oscillator model 
of RCPG [Ogilvie et al., 1992] which produces similar limit cycles. 

Another property of nonlinear oscillators is that they may be entrained by (or phase-locked to) other 
oscillators that are coupled to them. The respiratory rhythm has been shown to be entrained by a variety 
of oscillatory inputs including locomotion, mechanical ventilation, blood gas oscillation, and musical 
rhythm. 

A third property of nonlinear oscillators is that the limit-cycle trajectory may bifurcate and become 
quasiperiodic or chaotic when the system is perturbed by nonlinear feedback or other oscillatory sources. 
The resting respiratory rhythm is highly rhythmic in vagotomized animals but may become chaotic in 
normal humans [Donaldson, 1992] and vagi-intact animals [Sammon & Bruce, 1991] especially when 
lung volume is reduced [Sammon et al., 1993]. Both entrainment quasiperiodic and chaotic regimes have 
been demonstrated in a network oscillator model of the RCPG that is driven by periodic inputs [Matsugu 
et al., 1 998 ] . The significance of nonlinear variations in respiratory pattern is discussed in a recent review 
[Bruce, 1996]. 

161.4 Optimization Models 

All control systems are meant to accomplish certain (implicit or explicit) control objectives, it is perhaps 
too simplistic to assume that the sole objective of respiration is to meet the metabolic demand; a complex 
physiologic system may subserve multiple objectives that are vital to animal survival. One approach to 
understanding respiratory control is therefore to discover the innate control objective that fits the 
observed behavior of the controller. 

Optimization of Breathing Pattern 

Any given level of ventilation may be produced by a variety of breathing patterns ranging from deep- 
and-slow to shallow-and-rapid breathing. How is the breathing pattern set for each ventilatory level? 
Rohrer [1925] was among the first to recognize that respiratory frequency at rest may be chosen by the 
controller to minimize the work rate of breathing, a notion that was subsequently advanced by Otis and 
colleagues [1950]. Mead [1960] showed that the resting frequency may be determined more closely on 
the basis of optimal inspiratory pressure-time integral, a measure of the energy cost of breathing. Neither 
the work nor energy measure, however, correctly predicts the inspiratory flow pattern [Bretschger, 1925; 
Ruttimann &Yamamoto, 1972]; such discrepancy has led to the suggestion that the optimization criterion 
may consist of a weighted sum of work and energy expenditures [Bates, 1986] or some higher-order 
terms [Hamalainen and Sipila, 1984; Yamashiro & Grodins, 1971]. 

Similarly, the optimization principle has also been applied to the prediction of airway caliber and dead 
space volume [Widdicombe & Nadel, 1963] as well as end-expiratory lung volume and respiratory duty 
cycle [Yamashiro et al., 1975]. In most cases, the cost functions are generally found to be relatively flat 
in the resting state but may become much steeper during C0 2 or exercise stimulations [Yamashiro et al., 
1975] . This may explain the observed variability of breathing pattern which is generally more pronounced 
at rest than during hyperpnea or ventilatory loading. 

Optimization of Ventilation 

The classical optimization hypothesis of Rohrer [1925] and Bretschger [1925] suggests that conservation 
of energy may be an important factor in the genesis of breathing pattern. It thus appears that the controller 
is charged with two opposing objectives: to meet the metabolic demand by performing the work of 
breathing, and to conserve energy by minimizing the work. How does the controller reconcile such 
conflict? One possible solution is to establish priority. In a hierarchical model of respiratory control, 
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metabolic needs take priority over energetic needs. At the higher hierarchy (outer/chemical feedback 
loop) ventilatory output is set by feedforward/feedback proportional control of the chemical plant to 
meet the metabolic demand, whereas at the lower hierarchy (inner/vagal feedback loop) breathing pattern 
is optimized by the RCPG for efficient energy utilization by the mechanical plant at a ventilation set by 
the higher hierarchy. 

A potential drawback of such a hierarchical system is that it is nonrobust to perturbations. Changes 
in ventilatory load, for example, would disrupt the ventilatory command from the feedforward signal. 
This is at variance with the experimental observation of a load compensation response of the controller 
which protects ventilation against perturbations of the mechanical plant at rest and during exercise [Poon, 
1989a, 1989fr; Poon et al., 1987a, 1987&]. Furthermore, if the prime objective of the controller were 
indeed to meet the metabolic demand (i.e., to maintain chemical homeostasis), then the hierarchical 
control system seems to perform quite poorly; it is well-known that arterial chemical homeostasis is 
readily disrupted environmental changes. 

Another form of conflict resolution is compromise. Poon (1983a, 1987] proposed that an optimal 
controller might counterbalance the metabolic needs versus energetic needs of the body, and the resulting 
compromise would determine the ventilatory response. The tug-of-war between the two conflicting 
control objectives may be represented by a compound optimization criterion which reflects the balance 
between the chemical and mechanical costs of breathing: 

--L + L 

t , W 1 ■ (161.16) 

= |<x[P e C0 2 -P)j +lnW 

The power (quadratic) form and logarithmic form for J c and J m correspond, respectively, to the classical 
Steven's law and Fechner- Weber law of sensory perception [Milsum, 1966]. Assuming that the work rate 
of breathing W~ V 2 , Eq. (161.16) yields an optimal ventilatory response that conforms with the normal 
hypercapnic ventilatory response during C0 2 inhalation and the isocapnic ventilatory response during 
exercise [Poon, 1983a, 1987]. Furthermore, by generalizing Eq. (161.16) to include other chemical and 
work rate components, it is possible to predict the normal ventilatory responses to hypoxia and acidosis 
[Poon, 1983a], mechanical loading [Poon, 1987fo], as well as breathing pattern responses to chemical 
and exercise stimulation [Poon, 1983b]. 

The ventilatory optimization model [Poon, 1983a, 1983&, 1987&] has several interesting implications. 
First, it provides a unified and coherent framework for describing the control of ventilation and control 
of breathing pattern with a common optimization criterion. Second, it offers a parsimonious explanation 
of exercise hyperpnea and ventilatory load compensation responses, without the need to invoke any 
putative exercise stimulus and load compensation stimulus. Third, it suggests that disruption of chemical 
homeostasis (e.g., during C0 2 inhalation) may represent an optimal response as much as maintenance 
of homeostasis during exercise. 

Energetics of breathing is only one of many constraints that conflict with the metabolic cause of 
respiration. Another is the sensation of dyspnea which may be a limiting factor at high ventilatory levels 
[Oku et al., 1993]. A general optimization criterion may therefore include both energetic and dyspneic 
penalties as follows: 

7 = {a(pC0 2 -p)} 2 +2\nV E + kV E 2 (161.17) 

where k is a weighting factor. At low to moderate ventilatory levels the energetic component (logarithmic 
term) dominates, and at high ventilatory levels the dyspneic component (k-V E ) dominates in counter- 
balancing the chemical component. In addition, the ventilatory apparatus may also be constrained by 
other factors such as behavioral and postural interference, which may further tip the balance of the 
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optimization equation. It has been suggested that the periodic breathing pattern at extremely high 
altitudes may represent an optimal response for the conservation of chemical and mechanical costs of 
hypoxic ventilation [Ghazanshahi & Khoo, 1993]. 

Optimization of Neural Waveform 

The neural output of the controller has been traditionally described in terms of measurable quantities 
such as breathing pattern and ventilation. A more accurate representation of the control signal is P mus (t) 
which drives the respiratory pump. The resulting continuous inspiratory and expiratory airflow, 
Eq. (161.9), then determine V E and all other ventilatory patterns. The control problem the RCPG must 
solve is how to optimize P llms (t) in order to deliver adequate ventilation without incurring excessive energy 
(or other) losses. It has been shown by Poon and colleagues [1992] that many interesting characteristics 
of the RCPG conform to a general optimal control law that calls for dynamic optimization of P mui (i). 

The model of Poon and coworkers [1992] assumes a compound optimization criterion, Eq. (161.16), 
with a mechanical penalty given by a weighted sum of the work rate of inspiration and expiration 



w=w,+xw c 



(161.18) 
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(161.19) 



w E = ^£p ws ( t )v( t )dt 



The terms P max and P max denote the limiting capacities of the inspiratory muscles, and n is an efficiency 
index. The optimal P ma (t) output is found by minimization of / subjects to the constraints set by the 
chemical and mechanical plants, Eqs. (161.1) and (161.9). Because P mm (t) is generally a continuous time 
function with sharp phase transitions, this amounts to solving a difficult dynamic nonlinear optimal 
control problem with piecewise smooth trajectories. An alternative approach adopted by Poon and 
coworkers [1992] is to model P„ nu (t) as a biphasic function 
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(161.20) 



where a , a v a 2 and X are shape parameters; T l is the duration of the inspiratory phase of neural activity. 
The optimal P mm waveform is given by the set of optimal parameters a^*, a*, a*, 1*, T*, T T * that 
minimize Eq. (161.16). 

Poon and colleagues [1992] have shown that the dynamic optimization model predicts closely the 
P mm (t) trajectories under various conditions of ventilatory loading as well as respiratory muscle fatigue 
and weakness (Fig. 161.4). In addition, the model also accurately predicts the ventilatory and breathing 
pattern responses to combinations of chemical and exercise stimulation and ventilatory loading [Poon 
et al., 1992], all without invocation of any putative exercise stimulus or load compensation stimulus. The 
generality of model predictions strongly supports the hypothesis that neural processing in the RCPG is 
governed by an optimization law. 
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FIGURE 161.4 {a) Optimal waveforms for respiratory muscle driving pressure, P(i); respiratory airflow, V; and 
respired volume, V, during normal breathing (NL) or under various types of ventilatory loads; IRL and ERL, 
inspiratory and expiratory resistive load; IEL and CEL, inspiratory and continuous elastic load. ( b) Optimal waveforms 
for P(t) under increasing respiratory muscle fatigue (amplitude limited; upper panel) and muscle weakness (rate 
limited; lower panel). Source: Poon and coworkers [1992], with permission. 



161.5 Self -Tuning Regulator Models 

Optimization models suggest an empirical objective function that might be obeyed by the controller. 
They do not, however, reveal the mechanism of such neural optimization. Priban and Fincham [1965] 
suggested that an optimal operating for arterial pH, PC0 2 , and P0 2 might be achieved by a self-adaptive 
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controller which seeks the optimal solution by a process of hill climbing. There is increasing evidence 
that the respiratory system is an adaptive control system [Poon, 1992a]. 

There are two basic requirements in any adaptive system. The first is that in order to adapt to changes, 
the system signals must be constantly fluctuating or persistently exciting. This should be readily satisfied 
by the respiratory system which is inherently oscillatory [Yamamoto, 1962] and chaotic [Donaldson, 1992; 
Sammon & Bruce, 1992] . Another requirement is that the system must be able to learn and then memorize 
the changes in the environment. Learning and memory in neuronal circuits are generally believed to result 
from synaptic modifications in the form of long-term or short-term potentiation which are mediated by 
NMDA receptors [Bliss & Collingridge, 1993]. Evidence for learning and memory in the RCPG is provided 
by the recent discoveries of both short-term potentiation [Fregosi, 1991; Wagner & Eldridge, 1991] and 
long-term potentiation [Martin & Mitchell, 1993] in vivo. Similar short and long-term memories have been 
identified recently in brain stem cardiorespiratory-related region in vitro [Zhou et al., 1997] . Also, neonatal 
mice which lack functional NMDA receptors have been found to suffer severe respiratory depression [Poon 
et al., 1994]. It has been shown that learning and memory in the brain are sufficient to achieve an optimal 
behavior characterized by the chemoreflex response and isocapnic exercise response [Poon, 1991]. 

One form of synaptic modification — correlation Hebbian learning [Sejnowski & Tesauro, 1989] — has 
been suggested to be compatible with the ventilatory optimization model [Poon, 1993, 1996; Young & 
Poon, 1998]. According to this neuronal model, variations in the chemical feedback and mechanical 
feedback signals converging at a Hebbian synapse in the RCPG may induce short-term potentiation if 
they are negatively correlated, or short-term depression if they are positively correlated. In other words, 
the controller gain may be adaptively increased or decreased depending on the coupling between the 
cause and effect of respiration. During exercise, ventilatory neural output and chemical feedback are 
strongly negatively correlated (since S 2 has a large negative value) so that the controller learns to increase 
its gain, G , in proportion to metabolic load. During C0 2 inhalation, V E and P a C0 2 are only weakly 
correlated (i.e., S 2 is small), and the controller gain remains unchanged. Such a neural controller is 
analogous to a self-tuning regulator [Astrom & Wittenmark, 1975] which regulates the output by adaptive 
adjustment of system parameters. 

Implicit in such a self-tuning regulator model is the assumption that afferent inputs may up- or 
downregulate the RCPG by inducing synaptic potentiation or depression of neural transmission. Respi- 
ratory short- and long-term potentiation have been variously reported as indicated above. The possibility 
of synaptic depression was recently demonstrated in the nucleus tractus solitarius of the medulla [Zhou 
etal., 1997]. 

Similarly, it is possible that some form of synaptic learning might be at work in the RCPG to modulate 
pattern generation, thereby resulting in an optimal P mm (t). However, experimental and simulation data 
are presently lacking for verification of this conjecture. 

161.6 Conclusion 

Many empirical and functional models have been proposed to describe various aspects of the respiratory 
control system. The classical chemostat model is useful in describing chemoreflex responses but may be 
too simplistic to explain the variety of system responses to exercise input and mechanical disturbances. 
The recent discovery of various complex behaviors of the controller such as nonlinear dynamics, opti- 
mization, and learning suggest that the RCPG may be endowed with highly sophisticated computational 
characteristics. Such computational capability of the RCPG may be important for the maintenance of 
optimal physiologic conditions under changing environments. The computational problem — which 
amounts to dynamic optimal control of a nonlinear plant — is a very difficult one even by the standard 
of modern digital computers but seems to be solved by the RCPG from instant to instant with relative 
ease. This remarkable ability of the respiratory neural network is interesting from both biologic and 
engineering standpoints. Understanding how it works may shed light on not only the wisdom of the 
body [Cannon, 1932] but also on the design of novel intelligent control systems with improved speed, 
accuracy and economy. 
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Activity of central, peripheral chemoreceptor 

Slope and intercept of C0 2 response curve; L/min/mmHg, mmHg 

Chemical, mechanical, and total costs of breathing 

Solubility constant of C0 2 in blood; mmHg -1 

Metabolic C0 2 production in brain tissues; L/min STPD 

Alveolar partial pressure of C0 2 , 2 ; mmHg 

Arterial partial pressure of C0 2 , 2 ; mmHg 

Brain tissue PC0 2 ; mmHg 

Inspired partial pressure of C0 2 , 2 ; mmHg 

Airway pressure; cmH 2 

Respiratory muscle driving pressure; cmH 2 

Cerebral blood flow; L/min 

Respiratory central pattern generator 

Total respiratory resistance, elastance; cmH /L/s, cmH 2 0/L 

Input-output sensitivity of chemical plant, SP A C0 2 /SV F mmHg/L/min 

Inspiratory, expiratory, and total respiration duration; s 

Lung volume above relaxation volume; L 

Brain tissue store of C0 2 ; L 

Whole-body metabolic C0 2 production, 2 consumption; L/min STPD 

Ratio of respiratory dead space to tidal volume 

Lung tissue store of C0 2 , 2 ; L 

Total ventilation of the lung; L/min BTPS 

Work rate of breathing 
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This chapter is intended to provide a description of neural network control systems and of their potential 
for use in biomedical engineering control systems. Neural network techniques have been used by the 
engineering community in a variety of applications, with particular emphasis on solving pattern recog- 
nition and pattern classification problems [Carpenter 1989; Grossberg, 1988a,b; Hagan, 1996; Hecht- 
Nielsen, 1989; Nerrand, 1993; Pao, 1988; Sanchez-Sinencio 1992; Zurada, 1992]. More recently, there has 
been much research into the use of these techniques in control systems [Antsaklis, 1990; Miller, 1990b; 
White, 1992]. Much of this research has been directed at utilizing neural network techniques to solve 
problems that have been inadequately solved by other control systems techniques. For example, neural 
networks have been used in adaptive control of nonlinear systems for which good models do not exist. 
The success of neural network techniques on this class of problems suggest that they may be particularly 
well suited for use in a wide variety of biomedical engineering applications. It should be emphasized that 
the field of neural network control is a relatively new area of research and that it is not intended to replace 
traditional engineering control. Rather, the focus has been on the integration of neural network tech- 
niques into control systems for use when traditional control systems alone are insufficient. 

There are several textbooks available on neural networks that provide good presentations of the imple- 
mentation and applications of neural network techniques [Hagan, 1996; Sanchez-Sinencio, 1992; Simpson, 
1990; Zurada, 1992]. Recently, a few books have been published that review the application of neural 
network techniques to control systems problems [Miller, 1990b; White, 1992]. This chapter is intended to 
provide an introduction to neural network techniques and a guide to their application to biomedical 
control systems problems. The reader is referred to recently published textbooks and numerous journal 
articles for the specific information required to implement a given neural network control system. 

162.2 Neural Network Basics 

The term neural networks is used to refer to a class of computational algorithms that are loosely based 
upon the computational structure of the nervous system. There is a wide variety among various neural 
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network algorithms, but the key features are that they have a set of inputs and a set of outputs, they 
utilize distributed processing, and they are adaptive. The basic idea is that computation is collectively 
performed by a group of distinct units, or neurons (sometimes referred to as processing elements), each 
of which receives inputs and performs its own local calculation. These units interact by providing inputs 
to each other via synapses and some units interact with the environment via input/output signals. The 
connectivity of various units determines the structure, or architecture, of the network. Adjustments to 
the strengths of the synapses (i.e., adjustments to the synaptic weights) modify the overall input-output 
properties of the network and are adjusted via a learning algorithm. The design of a neural network 
includes the specification of the neuron, the architecture and the learning algorithm. 

Most neural networks are based on a model of a neuron that captures the most basic features of real 
neurons: A neuron receives inputs from other neurons via synapses and the output from a neuron is a 
nonlinear function of a weighted summation of its inputs. There are many different sets of equations 
that capture these basic features; the following set of equations is commonly used: 



7f 



where y, is the weighted summation of inputs to neuron j, x, is the input from neuron z, n is the number 
of neurons providing synaptic input to neuron ;', w« is the synaptic weight from neuron i to neuron j, z- 
is the output from neuron j, and m is the slope of the sigmoidal output function. Note that in this set 
of equations, the output of a neuron is a static nonlinear function of the weighted summation of its 
inputs. Some commonly used variations on this model include: the use of different nonlinear output 
functions [Lippman, 1987], the use of recurrent inputs (i.e., past values of a neurons output would be 
included in the input summation) [Pineda, 1989; Williams and Zipser, 1989], and the use of dynamic 
neurons (i.e., Eq. (162.1) would be a differential equation rather than an algebraic one) [Carpenter, 1989; 
Grossberg, 1988a; Hopfield, 1982]. The use of recurrent inputs and/or dynamic neuron models may be 
particularly important for applications in the control of dynamic systems because they provide the 
individual neurons with memory and temporal dynamics [Pineda, 1989]. 

Most applications use networks of neurons that are arranged in layers. The most commonly used 
architecture is a three layer network which has an input layer, a hidden layer and an output layer. A 
feedforward neural network architecture is one in which each neuron receives input from neurons in the 
previous layer. The number of neurons in each layer must be specified by the designer. There are some 
heuristic rules, but no solid theory, to guide the designer in specifying the number of neurons [Zurada, 
1992]. Some commonly used variations on this architecture include: the use of only a single layer of 
neurons [Kohonen, 1989; Widrow, 1962], the use of bidirectional connections between layers [Carpenter, 
1989], the use of intra-layer connections [Kohonen, 1989], the use of pruning techniques which cut some 
of the connections for greater efficiency [Zurada, 1992], and the use of heterogeneous networks in which 
more than one neuron model is used to describe the various neurons in the network [Hecht-Nielson, 
1989]. 

A general form of a neural network learning algorithm is given by: 



Aw tj = f(i}, Zi , Zj ,e^ 



This equation states that the change in synaptic weight is a function of the learning rate (r|), the activation 
of the pre-synaptic neuron (z,), the activation of the post-synaptic neuron (zj), and a training signal (e,^). 
Several different learning algorithms have been developed [Fogel, 1990; Grossberg, 1988a; Hecht-Nielsen, 
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1990; Hinton, 1989; Rumelhart, 1988], most of which fit into this general form, but may not use all of 
the terms. An example is Hebbian learning [Brown, 1990; Hebb, 1949] in which the change in weight is 
proportional to the product of the pre-synaptic activation and the post-synaptic activation. Often, 
gradient descent techniques are used to adjust the synaptic weights. The most commonly used learning 
algorithm, error backpropogation, uses an error gradient descent technique and passes the output error 
backwards through the network to determine the training signal for a given neuron [Pineda, 1989; 
Rumelhart, 1 988; Vogl 1 988 ] . Most of the commonly used learning algorithms are classified as supervised 
learning techniques because they use a specification of the desired output of the network to determine 
the output error of the network. Other learning algorithms, such as reinforcement learning (discussed 
below) can be used in situations when it is not possible to directly specify the desired output of the 
network. 

In most engineering applications, the neural network is used to perform a nonlinear mapping from 
the space of network inputs to the space of network outputs. In addition to the internal specifications 
of the network discussed above, the engineer using a neural network must select the signals to be used 
as input to the network and the signals to be used as output from the network. Although this decision 
appears to be trivial, in many applications it is not. For applications in control systems, the selection of 
input and output signals is dependent upon how the neural network fits into the overall structure of the 
system. Several different options are described below. 

162.3 Neural Network Control Systems 

Neural networks have often been applied in control systems where, at a block diagram level, the neural 
network replaces one component of the control system. Typically, the motivation for utilizing the neural 
network lies in the ability to perform nonlinear mappings, in the ability to handle a wide range of model 
structures, and/or in the ability of the network to adapt. While there has been activity in the development 
of a theoretical framework for neural network control systems, most applications to-date have been 
heuristically-based to one degree or another [Farotimi, 1991; Miller, 1990b; Nguyen, 1990; Quinn, 1993; 
Wang, 1992; White, 1992]. The field of neural network control has been described as bridging the gap 
between mathematically-based control systems engineering and heuristically-based artificial intelligence 
[Barto, 1990]. 

In designing a neural network control system, one must select the overall structure of the system and 
decide which components will utilize neural network algorithms. Several examples of control system 
structures are provided below, each of which utilizes one or more neural networks as described above. 
This section of the chapter provides a brief overview of some neural control systems that have potential 
for application in biomedical control systems. For excellent, thorough reviews of recent developments 
in neural network control systems, the reader is referred to Miller [1990b] and White [1992]. 

Supervised Control 

In this structure (Fig. 162.1), the neural network is used to mimic, and eventually replace, an existing 
control system. Here, the neural network is used in a traditional feedback arrangement where it performs 
a mapping from the system outputs to the system inputs. The network would be trained using data 
collected from the actual system or a computer simulated model with the original controller active. The 
training signal for the network would be the difference between the output of the original controller and 
the output of the network. After the network learns to adequately mimic the original controller, the 
training is completed. Thus, the synaptic weights would no longer be adapted, and the original control 
system would be replaced by the neural network. This type of an arrangement could be useful in situations 
where the neural network could perform the task less expensively or more efficiently than the original 
one. One situation would be when the original control system requires such heavy computation that it 
is impractical for real-time use on an affordable computer system. In this case, the neural network might 
be able to learn to perform the same (or functionally equivalent) operation more efficiently such that 
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implementation on an affordable general purpose 
computer or on specialized neural network hardware 
would be practical. A second situation would be 
when the original control system is a human opera- 
tor that might be either expensive or prone to error. 
In this case the neural network is acting as an adap- 
tive expert system that learns to mimic the human 
expert [Werbos, 1990]. A third situation would be 
when the neural network could learn to perform the 
control using a different, and more easily measured, 
set of output variables [Barto, 1990]. This type of 
application would be particularly well-suited for bio- 
medical applications in which an invasive measure- 
ment could be replaced by a non-invasive one. 
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FIGURE 162.1 Supervised control system structure. 



Direct Inverse Control 

In direct inverse control, (Fig. 162.2), the neural network is used to compute an inverse model of the 
system to be controlled [Levin, 1991; Nordgren, 1993]. In classical linear control techniques, one would 
find a linear model of the system then analytically compute the inverse model. Using neural networks, 
the network is trained to perform the inverse model calculations, i.e., to map system outputs to system 
inputs. Biomedical applications of this type of approach include the control of arm movements using 
electrical stimulation [Lan, 1994] and the adaptive control of arterial blood pressure [Chen, 1997]. 

A variety of neural network architectures can be used in this direct inverse control system structure. 
A multi-layer neural network architecture with error backpropagation learning could be used off-line to 
learn an inverse model using a set of system input/output data. If on-line learning is desired, a feedforward 
model of the system can be trained while a feedback controller is active [Miller, 1990a; Miyamoto, 1988] . 
This technique, referred to as feedback error learning [Miyamoto, 1988], uses the control signal coming 
from the feedback controller as the training signal for the feedforward controller update. The idea used 
here is that if the feedforward control is inadequate, the feedback controller would be active and it would 
lead to updates in the feedforward controller. To achieve the rapid learning rates that would be desirable 
in on-line feedback error learning, typically single-layer networks have been used or the learning has 
been constrained to occur at one layer. This type of an approach is particularly useful in applications 
where on-line fine tuning of feedforward controller parameters is desired. 

Model-based Approaches to Control 

Neural network approaches have been used as an alternative to other nonlinear techniques for modeling 
physiological systems [Chon, 1998]. Several neural network control systems have utilized model-based 
approaches in which the neural network is used to identify a forward nonlinear system model. Often, 
these systems utilize neural network components in control system architectures that have been developed 
using linear or nonlinear components. 

For many control applications, system output error can be readily measured but in order to adapt the 
controller parameters one needs to relate system output error to controller output (system input) error. 
One strategy would be to use the derivative of the system's output error with respect to its input to 
perform gradient descent adaptation of the controller parameters. However, there is no direct way of 
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FIGURE 162.3 Backpropagating system output error through the system model. 

calculating the derivative of the system's output error with respect to its input and therefore it must be 
approximated. To implement such an approximation, a pair of multi-layer neural networks (Fig. 162.3) 
have been used: one for identification of a forward model of the plant, and one for implementation of 
the controller [Barto, 1990]. The forward model is used to provide a means of computing the derivative 
of the model's output with respect to its input by backpropagating the error signal (the actual plant 
output tracking error) through the model of the plant to estimate the error in the control signal. The 
error in the control signal can then be backpropagated through the controller neural network. As 
described here, the backpropagation algorithm is used to minimize the output error of the system, which 
is a special case of maximization of utility. Although this is a very powerful technique with potential 
applications in a wide variety of nonlinear control problems, its primary limitation for many biomedical 
applications is the relatively slow training rate achieved by the backpropagation algorithm. 

Neural networks have also been used in model reference adaptive control (MRAC) structures 
(Fig. 162.4) [Naranedra and Parthasarathy, 1990; Narendra, 1992]. This approach builds upon established 
techniques for adaptive linear control and incorporates neural networks to address the problem of 
controlling nonlinear systems. The MRAC approach is directed at adapting the controlled system such 
that it behaves like a reference model, which is specified by the control system designer. In the structure 
shown in Fig. 162.4, the identification model is a model of plant dynamics, the parameters of which are 
identified online using the identification error (the difference between the actual and the estimated plant 
outputs) as the training signal. The controller uses the estimates of the plant parameters, reference inputs, 
and system outputs to determine inputs to the system. In the neural network version of the control 
system structure, neural networks can be used for the identification of the system model, for the controller, 
or for both. Multi-layer neural networks using error backpropagation or a modified version of error 
backpropagation, termed dynamic backpropagation, have been used for adaptive control of nonlinear 
systems [Narendra and Parthasarathy, 1990; Narendra, 1992]. 

A third model-based approach is neural predictive control, which is a neural network version of 
nonlinear model predictive control [Trajanoski and Wach, 1998]. In this approach, the neural network 
is used for off-line identification of a system model, which is then used to design a nonlinear model 
predictive controller. This design may provide suitable control of nonlinear systems with time-delays and 
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FIGURE 162.4 Indirect adaptive control system structure. 
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thus may be particularly useful in biomedical applications. Recent computer simulation studies have 
demonstrated positive results for control of insulin delivery [Trajanoski and Wach, 1998]. 
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Adaptive Critics 

All of the approaches to control discussed thus far have utilized 
supervised learning techniques. The training signal used for adap- 
tation in these techniques is an error vector that gives the magni- 
tude of the error in the system output signal and the direction in 
which it should change. For many complex, multi-variable control 
problems this type of system output error information is not 
readily available. The only measure available might be a scalar 
measure of system performance that is not directly related to the 
system outputs in a way that is understood by the control system 
designer. In such cases, a directed search of the control system 
parameter space could be performed in order to maximize (opti- 
mize) the scalar performance measure. Dynamic programming is 
a control systems engineering technique that has been used for 
optimal control of linear systems but these techniques are not well 
suited for large-scale systems or nonlinear systems. Neural net- 
works have been used in an approach that is similar to dynamic 

programming in order to optimize system performance of large-scale nonlinear systems. These tech- 
niques, termed "adaptive critic methods", utilize a structure (Fig. 162.5) in which a critic module provides 
an evaluation signal to the controller module. The controller module utilizes a reinforcement learning 
algorithm in order to optimize performance. The details of this class of neural network controllers are 
beyond the scope of this chapter, but descriptions and example applications are given in [Barto, 1990; 
Barto, 1992; Werbos, 1992]. 

Adaptive critic algorithms may be particularly useful in several biomedical engineering control systems 
where the relationship between system performance and measurable system outputs are ill-defined. While 
these methods that use reinforcement learning may be attractive because they are general and do not 
require detailed information about the system, this increased generality comes at a cost of increased 
training times [Barto, 1990]. Supervised learning methods, when they can be used, can be more efficient 
than reinforcement learning methods. 



FIGURE 162.5 Adaptive critic con- 
trol system structure. 



Neurophysiologically Based Approaches 

All neural network systems are based upon neurophysiological models to some degree, but in most cases 
this biological basis is very superficial, as is evident in the description given above. Computational 
neuroscience is a term used to describe the development of computational models of the nervous system 
[Bower, 1998; Koch and Segev, 1989; Schwartz, 1990]. Such models are intended to be used by neuro- 
scientists in their basic science efforts to understand the functioning of the nervous system. There are 
several examples of neural network control system designs that have utilized models developed in the 
field of computational neuroscience [Abbas, 1995b; Beer, 1993; Bullock, 1988; Houk, 1990; Taga, 1991]. 
These designs have extended the notion of "mimicking the nervous system" beyond what is used in most 
neural networks. The reasoning behind this approach is based upon a view that biological systems solve 
many problems that are similar to, or exactly the same as, those faced in many engineering applications 
and that by mimicking the biological system we may be able to design better engineering systems. This 
reasoning is similar to the motivation for much of the work in the neural network field and for the recent 
emphasis in biomimetic techniques in other engineering disciplines. 

One approach to incorporating a stronger neurophysiological foundation has been to utilize an archi- 
tecture for the overall control system that is based upon neurophysiological models, i.e., mimicking the 



» 2000 by CRC Press LLC 



neurophysiological system at a block diagram level. This type of approach has led to the development 
of hierarchical control systems that are based upon the hierarchical structure of the human motor control 
system [Kawato, 1987; Srinivasan, 1992]. 

A second approach to incorporating a stronger neurophysiological foundation has been to utilize more 
realistic models of neurons and of their interconnections in the design of the neural network [Beer, 1990; 
Houk, 1990; Missler and Kamangar, 1995]. An example of this approach is the design of a coupled 
oscillator neural network circuit that is based upon the locomotor control system of the cockroach [Beer, 
1992; Quinn, 1993] . In this neural network, some of the neurons are capable of endogenously oscillating 
due to intrinsic membrane currents and the network is heterogeneous, meaning that each neuron is not 
described by the same model. Most of the parameters of the models are fixed. This network has been 
shown to robustly generate patterns for statically-stable gaits at various speeds. 

A biomedical control system that utilizes a neurophysiologically-based approach has been developed 
for use in Functional Neuromuscular Stimulation (FNS) systems [Abbas, 1995; Abbas and Chizeck, 1995] . 
FNS is a rehabilitation engineering technique that uses computer-controlled electrical stimuli to activate 
paralyzed muscle. The task of a control system is to determine appropriate stimulation levels to generate 
a given movement or posture. The neural network control system utilizes a block diagram structure that 
is based on hierarchical models of the locomotor control system. It also utilizes a heterogenous network 
of neurons, some of which are capable of endogenous oscillation. This network has been shown to provide 
rapid adaptation of the control system parameters [Abbas and Chizeck, 1995; Abbas and Triolo, 1997] 
and has been shown to exhibit modulation of reflex responses [Abbas, 1995]. 

162.4 Summary 

This chapter presents an overview of the relatively new field of neural network control systems. A variety 
of techniques are described and some of the advantages and disadvantages of the various techniques are 
discussed. The techniques described here show great promise for use in biomedical engineering appli- 
cations in which other control systems techniques are inadequate. Currently, neural network control 
systems lack the type of theoretical foundation upon which linear control systems are based, but recently 
there have been some promising theoretical developments. In addition, there are numerous examples of 
successful engineering applications of neural networks to attest to the utility of these techniques. 
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Defining Terms 

Backpropagation: A technique used to determine the training signal used for adjusting the weights of 

a given neuron in a neural network. 
Learning algorithm: An algorithm used to update the synaptic weights in a neural network. 
Neural network: A term used to refer to a broad class of computational algorithms that are loosely 

based on models of the nervous system. 
Reinforcement learning algorithms: Learning algorithms that utilize a system performance measure 

(that may or may not have a direct, known relationship to output error of the neural network) as 

a training signal for the neural network. 
Supervised learning algorithms: Learning algorithms often used in neural networks that use the output 

error of the neural network as a training signal. 
Synaptic weight: A scaling factor on the signal from one neuron in a network to another. 
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The problem of system identification in physiology derives its importance from the need to acquire 
quantitative models of physiologic function (from the subcellular to the organ system level) by use of 
experimental observations (data). Quantitative models can be viewed as summaries of experimental 
observations that allow scientific inference and organize our knowledge regarding the functional prop- 
erties of physiologic systems. Selection of the proper (mathematical or computational) form of the model 
is based on existing knowledge of the system's functional organization. System identification is the process 
by which the system model is determined from data. This modeling and identification problem is rather 
challenging in the general use of a physiologic system, where insufficient knowledge about the internal 
workings of the system or its usually confounding complexity prevents the development of an explicit 
model. The models may assume diverse forms (requiring equally diverse approaches) depending on the 
specific characteristics of the physiologic system (e.g., static/dynamic, linear/nonlinear) and the prevailing 
experimental conditions (e.g., noise contamination of data, limitations on experimental duration). This 
chapter will not address the general modeling issue, but rather it will concentrate on specific methods 
and tools that can be employed in order to accomplish the system identification task in most cases 
encountered in practice. Because of space limitations, the treatment of these system identification meth- 
ods will be consistent with the style of a review article providing overall perspective and guidance while 
deferring details to cited references. 

Since the complexity of the physiologic system identification problem rivals its importance, we begin 
by demarcating those areas where effective methods and tools currently exist. The selection among 
candidate models is made on the basis of the following key functional characteristics: (1) static or 
dynamic; (2) linear or nonlinear; (3) stationary or nonstationary; (4) deterministic or stochastic; 
(5) single or multiple inputs and/or outputs; (6) lumped or distributed. These classification criteria do 
not constitute an exhaustive list but cover most cases of current interest. Furthermore, it is critical to 
remember that contaminating noise (be it systemic or measurement-related) is always present in an actual 
study, and experimental constraints often limit experimentation time and the type of data obtainable 
from the system. Finally, the computational requirements for a practicable identification method must 
not be extraordinary, and the obtained results (models) must be amenable to physiologic interpretation, 
in addition to their demonstrated predictive ability. 
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A critical factor in determining our approach to the system modeling and identification task is the 
availability and quality of prior knowledge about the system under study with respect to the mechanisms 
that subserve its function. It can be said in general that, if sufficient knowledge about the internal 
mechanisms subserving the function of a system is available, then the development of an explicit model 
from first (physical or chemical) principles is possible, and the system modeling and identification task 
is immensely simplified by reducing to an estimation problem of the unknown parameters contained in 
the explicit model. Although the ease of this latter task depends on the manner in which the unknown 
parameters enter in the aforementioned explicit model, as well as on the quality of the available data, it 
is typically feasible. Furthermore, the obtained model is amenable to direct and meaningful physiologic 
interpretation. Unfortunately, it is rare that such prior knowledge (of adequate quality and quantity) is 
available in systems physiology. It is far more common that only limited prior knowledge is available, 
relative to the customary complexity of physiologic systems, that prevents the development of an explicit 
model from first principles and necessitates the search for a model that is compatible with the available 
input-output data. 

Thus, the system identification problem is typically comprised of two tasks: 

1. Model Specification: the selection or postulation of a model form suitable for the system at hand 

2. Model Estimation: the estimation of the unknown parameters or functions contained within the 
specified model, using experimental data 

All prior knowledge about the system under study is utilized in the model specification task. This 
includes results from specially designed preliminary experiments, which can be used to establish, for 
instance, whether the system is static or dynamic, linear or nonlinear, and so on. The model estimation 
task, however, relies on the quality of the available data (e.g., spectral characteristics, noise conditions, 
data length) and may set specific data- collection requirements. 

This article will focus on practicable methods to perform both the model specification and model 
estimation tasks for systems/models that are static or dynamic and linear or nonlinear. Only the stationary 
case will be detailed here, although the potential use of nonstationary methods will be also discussed 
briefly when appropriate. In all cases, the models will take deterministic form, except for the presence 
of additive error terms (model residuals). Note that stochastic experimental inputs (and, consequently, 
outputs) may still be used in connection with deterministic models. The cases of multiple inputs and/or 
outputs (including multidimensional inputs/outputs, e.g., spatio-temporal) as well as lumped or distrib- 
uted systems, will not be addressed in the interest of brevity. It will also be assumed that the data (single 
input and single output) are in the form of evenly sampled time-series, and the employed models are in 
discrete-time form (e.g., difference equations instead of differential equations, discrete summations 
instead of integrals). 

In pursuing the model specification task, two general approaches have developed: parametric and 
nonparametric. In the parametric approach, algebraic or difference equation models are typically used 
to represent the input-output relation for static or dynamic systems, respectively. These models are 
accordingly linear or nonlinear, and contain a (typically small) number of unknown parameters. The 
latter may be constant or time-varying depending on whether the system or model is stationary or 
nonstationary. The precise form of these parametric models (e.g., degree/order of equation) must be 
determined in order to complete the model specification task. This precise form is either postulated a 
priori or guided by the data (in which case it is intertwined with the model parameter estimation task) 
as outlined in the following section. 

In the nonparametric approach, the input-output relation is represented either analytically (in con- 
volutional form through Volterra- Wiener expansions where the unknown quantities are kernel func- 
tions), computationally (i.e., by compiling all input-output mapping combinations to look-up tables) or 
graphically (in the form of operational surfaces/subspaces in phase-space). The graphical representation, 
of course, is subject to the three-dimensional limitation for the purpose of visual inspection. The model 
specification requirements for the Volterra- Wiener formulation consist of the order of system nonlinearity 
and system memory, and, for the computational or graphical approach, they consist of defining the 
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appropriate phase-space mapping dimensions, as discussed below. Of the nonparametric approaches, 
the Volterra- Wiener (kernel) formulation has been used more extensively in a nonlinear context and will 
be the focus of this review, since nonlinearities are ubiquitous in physiology. Note that the nonparametric 
model estimation task places certain requirements on the experimental input (i.e., sufficient coverage of 
the frequency bandwidth and amplitude range of interest in each application) in order to secure adequate 
probing of the system functional characteristics. 

An important hybrid approach has also developed in recent years that makes use of block-structured 
or modular models. These models are composed of parametric and/or nonparametric components 
properly connected to represent reliably the input-output relation. The model specification task for this 
class of models is more demanding and may utilize previous parametric and/or nonparametric modeling 
results. A promising variant of this approach, which derives from the general Volterra-Wiener formula- 
tion, employs principal dynamic modes as a canonical set of filters to represent a broad class of nonlinear 
dynamic systems. Another variant of the modular approach that has recently acquired considerable 
popularity but will not be covered in this review is the use of artificial neural networks to represent input- 
output nonlinear mappings in the form of connectionist models. These connectionist models are often 
fully parametrized, making this approach affine to parametric modeling, as well. 

The relations among these approaches (parametric, nonparametric, and modular) are of critical 
practical importance and the subject of several recent studies [Marmarelis, 1994, 1997]. Considerable 
benefits may accrue from the combined use of these approaches in a cooperative manner that aims at 
securing the full gamut of distinct advantages specific to each approach. 

In the following sections, an overview of these methodologies will be presented and the relative 
advantages and disadvantages in practical applications will be briefly outlined. The ultimate selection of 
a particular methodology hinges upon the specific characteristics of the application at hand and the 
prioritization of objectives by the individual investigator. Two general comments are in order: 

1. No single methodology is globally superior to all others, i.e., excelling with regard to all criteria 
and under all possible circumstances. Judgment must be exercised in each individual case. 

2. The general system identification problem is not solvable in all cases at present, and challenging 
problems remain for future research. 

163.1 Parametric Approach 

Consider the input and output time-series data, x(n), respectively. If the system is static and linear, then 
we can employ the simplest (and most widely used) model of linear regression 



v(n)=ax(n)+b+e(n) (163.1) 



to represent the input-output relation for every n, where s(n) represents the noise or error term (or 
model residual) at each n. The unknown parameters (a,b) can be easily estimated through least-squares 
fitting, using a well-developed set of linear regression methods (e.g., ordinary least-squares or generalized 
least-squares depending on whether e(n) is a white sequence). 

This model can be extended to multiple inputs {x lt x 2 , ..., x k ] and outputs {/,} as 



Wn)= a x x\ n\+ a 2 xA n\+ ... + a k xAn\+ b + ei n\ (163.2) 



and well-developed multiple linear regression techniques exist that can be used for estimation of the 
unknown parameters (a p fl 2 , ■ ■ ■ , a k , b) for each output y r Although these estimation methods are widely 
known and readily available in the literature — see, for instance, Eykhoff [1974] and Soderstrom and 
Stoica [1989] — they will be briefly reviewed at the end of this section. 
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In the event of nonstationarities, the regression coefficients (model parameters) will vary through time 
and can be estimated either in a piecewise stationary fashion over a sliding time window (batch process- 
ing) or in a recursive fashion using an adaptive estimation formula (recursive processing). The latter has 
been favored and extensively studied in recent years [Goodwin & Sin, 1984; Ljung, 1987; Ljung & 
Soderstrom, 1983], and it is briefly outlined at the end of this section. 

If the system is static and nonlinear, then a nonlinear input-output relation 



in) = l y\c j p\x(n\ +e(n) (163.3) 



can be used as a parametric model, where the {P ■} functions represent a set of selected nonlinear functions 
(e.g., powers, polynomials, sinusoids, sigmoids, exponentials or any other suitable of functions over the 
range of x), and {cj are the unknown parameters that can be easily estimated through linear regres- 
sion — provided that the {P } functions do not contain other unknown parameters in a nonlinear fashion. 
In the latter case, nonlinear regression methods must be used (e.g., the gradient steepest-descent method) 
that are well developed and readily available [Eykhoff, 1974; Ljung, 1987; Soderstrom & Stoica, 1989], 
although their use is far from problem-free or devoid of risk of misleading results (e.g., local minima or 
noise effects). Naturally, the choice of the {P:} functions is critical in this regard and may depend on the 
characteristics of the system or the type of available data. 

These cases of static systems have been extensively studied to date but have only limited interest or 
applicability to actual physiologic systems, since the latter are typically dynamic — i.e., the output value 
at time n depends also on input and/or output values at other previous times (lags). Note that the possible 
dependence of the present output value on previous output values can be also expressed as a dependence 
on previous input values. Thus, we now turn to the all-important case of dynamic systems. 

For linear (stationary) dynamic systems, the discrete-time parametric model takes the form of an auto- 
regressive moving overage with exogenous variable (ARMAX) equation: 

y[n) = a 1 y(n-l) + ...+a k y(n-k)+$ x(n)+$ 1 x(n-l) 

+ ... +^ l xln-l\ + win\ + y l win-l\ (163.4) 

+ ...+Y m w(«-mj 

where w(n) represents a white noise sequence. This ARMAX model is a difference equation that expresses 
the present value of the output, y(n), as a linear combination of k previous values of the output (AR 
part), /previous (and the present) values of the input (X part), and m previous (and the present) values 
of the white noise disturbance sequence (MA part) that compose the model residual (error). When y t = 
for all i = 1, ..., m, the residuals form a white sequence, and the coefficients (a p ..., a k , (3 , fi^ ..., 
P,) can be estimated through the ordinary least-squares procedure. However, if any y, is nonzero, then 
unbiased and consistent estimation requires generalized or extended least-squares procedures, similar to 
the one required for the multiple regression model of Eq. (163.2) when e(«) is a nonwhite error sequence 
(reviewed at the end of the section). 

Although the estimation of the ARMAX model parameters can be straightforward through multiple 
linear regression, the model specification task remains a challenge. The latter consists of determining the 
maximum lag values {k, I, m) in the difference equation (163.4) from given input-output data, x(«) and 
y(n). A number of statistical procedures have been devised for this purpose (e.g., weighted residual 
variance, Akaike information criterion), all of them based on the prediction error for given model order 
(k, I, m) compensated properly for the remaining degrees of freedom (i.e., the total number of data minus 
the number of parameters). It is critical that the prediction error (typically, in mean-square sense) be 
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evaluated on a segment of input-output data distinct from the one used for the estimation of the model 
parameters. Application of these criteria is repeated successively for ascending values of the model order 
(k, I, m), and the model specification process is completed when an extremum (minimum or maximum, 
depending on the criterion) is achieved [Soderstrom & Stoica, 1989]. 

For nonlinear (stationary) systems, the ARMAX model can be extended to the NARMAX model 
(nonlinear ARMAX) that includes nonlinear expressions of the variables on the right side of Eq. (163.4) 
[Billings & Voon, 1984]. For instances, a second-degree multinomial NARMAX model of order (k = 2, 
/ = 1, m = 0) with additive white-noise residuals takes the form 

y{n) = a l y[n-\\ + a 2 yln-2} + a 11 y 2 («-l) + a l2 y{n-l\y(n-2) 
+ a 22r 2 («-2) + f3 x(n)+p 1 x(«-l) + p 00 x 2 («) 
+ f3 01 x(«)x(n-l) + (3 llX 2 (n-l) + Y lor («-l)x(«) (163.5) 

+ T 2 ,o>'("- 2 ) x (") + Yi,ir("- 1 )^(«- 1 ) 
+ y 2l y( n — 2 \xln — 1 + wy n\ 

Clearly, the form of a NARMAX model may become rather unwieldy, and the model specification task 
(i.e., the form and degree of nonlinear terms, as well as the number of input, output, and noise lags 
involved in the model) is very challenging. Several approaches have been proposed for this purpose 
[Billings 8c Voon, 1984, 1986; Haber & Unbenhauen, 1990; Korenberg, 1988; Zhao 8c Marmarelis, 1994], 
and they are all rather involved computationally. However, if the structure of the NARMAX model is 
established, then the parameter estimation task is straightforward in a manner akin to multiple linear 
regression — since the unknown parameters enter linearly in the NARMAX model. 

It is evident that all these multiple linear regression problems defined by Eqs. (163.2)-(163.5) can be 
written in a vector form: 

yln) = ^(njQ + eln\ (163.6) 

where ())(w) represents the vector of all regression variables in each case, and denotes the unknown 
parameter vector. For a set of data points (w = 1, ..., N), Eq. (163.6) yields the matrix formulation 

y=<I>e + e (163.7) 

The ordinary least-squares (OLS) estimate of 



e ( 



<&> 



-i 



<& T y (163.8) 



yields unbiased and consistent estimates of minimum variance, if e(«) is a white sequence; otherwise, 
the generalized least-squares (GLS) estimator 



4> T £ _1 <& 



1-1 

<S> r T. l y (163.9) 



ought to be used to achieve minimum estimation variance, where Z denotes the covariance matrix of e. 
Practical complications arise from the fact that S is not a priori known and, therefore, must be either 
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postulated or estimated from the data. The latter case, which is more realistic in actual applications, leads 
to an iterative procedure that may not be convergent or yield satisfactory results (i.e., start with an OLS 
estimate of 0; obtain a first estimate of e and E; evaluate a first GLS estimate of 0; obtain a second estimate 
of e and Z; and iterate until the process converges to a final GLS estimate of 0). As an alternative to this 
iterative procedure, an estimate of the moving- average model of the residual term: 



Eln)= w\ nl + YjWl n— II- 



+y m M 



In— m\ 



(163.10) 



may be obtained from initial OLS estimation, and the covariance matrix S may be estimated from 
Eq. (163.10). Although this alternative approach avoids problems of convergence, it does not necessarily 
yield satisfactory results due to the dependence on the noise model estimation of Eq. (163.10). Equivalent 
to this latter procedure is the residual whitening method, which amounts to prefiltering the data with the 
inverse of the transfer function corresponding to Eq. (163.10) {prewhitening filter), prior to OLS estima- 
tion. Finally, the parameter vector may be augmented to include the coefficient {y J of the moving-average 
model of the residuals in Eq. (163.10), leading to a pseudo-linear regression problem [since the estimates 
of win) depend on 0] that is operationally implemented as an iterative extended least-squares (ELS) 
procedure [Billings & Voon, 1984, 1986; Ljung, 1987; Ljung & Soderstrom, 1983; Soderstrom & Stoica, 
1989]. 

In the presence of nonstationarities, the parameter vector may vary through time, and its estimation 
may be performed either in a piecewise stationary manner (with segment-to-segment updates that may 
be obtained either recursively or through batch processing) or by introducing a specific parametrized 
time-varying structure for the parameters and augmenting the unknown parameter vector to include 
the additional parameters of the time-varying structure (e.g., a pth degree polynomial structure of time- 
varying model parameters will introduce p additional parameters for each time-varying term into the 
vector 0, that need be estimated from the data via batch processing). 

Any batch processing approach folds back to the previously discussed least-squares estimation meth- 
ods. However, the recursive approach requires a new methodological framework that updates continu- 
ously the parameter estimates on the basis of new data. This adaptive or recursive approach has gained 
increasing popularity in recent years, although certain important practical issues (e.g., speed of algorith- 
mic convergence, effect of correlated noise) remain causes for concern in certain cases [Goodwin & Sin, 
1984; Ljung, 1987; Ljung & Soderstrom, 1983]. The basic formulae for the recursive least-squares (RLS) 
algorithm are 



e(«) = e(n-i)+«p(«) y(«)-(t> T (n)e(«-l) 

«p(n) = y(n)p( M -l)(|)(n) 

p(n) = An- 1)- jlnjpln- lW nWfnWn- 1) 



(163.11) 
(163.12) 
(163.13) 



/(«) = [<|> t (m)p(«- l)<|>(n) + a(n) 



(163.14) 



where the matrix P(«), the vector ¥(«) and the scalar y(w) are updating instruments of the algorithm 
computed at each step n. Note that {a(«)} denotes a selected sequence of weights for the squared 
prediction errors in the cost function and is often taken to be unity. This recursive algorithm can be used 
for on-line identification of stationary or nonstationary systems. A critical issue in the nonstationary 
case is the speed of algorithmic convergence relative to the time-variation of the system/model parameters. 
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Its initialization is commonly made as: 9(0) = and P(0) = c I, where c is a large positive constant (to 
suppress the effect of initial conditions) and I is the identity matrix. 

It is important to note that when output autoregressive terms exist in the model, the regression vector 
<(>(«) is correlated with the residual e(n), and, thus, none of the aforementioned least-squares estimates 
of the parameter vector will converge to the actual value. This undesirable correlation weakens when the 
predicted output values are used at each step for the autoregressive lagged terms (closed-loop mode or 
one-step predictive model) instead of the observed output values [open-loop mode or global predictive 
model). To remedy this problem, the instrumental variable (IV) method has been introduced that makes 
use of a selected IV that is uncorrelated with the residuals but strongly correlated with the regression 
vector <|)(n) in order to evaluate the least-squares estimate [Soderstrom & Stoica, 1989]. The IV estimates 
can be computed in batch or recursive fashion. 

In closing this section, we should note that a host of other estimation methods has been developed 
through the years and that these methods are operationally affme to the foregoing ones (e.g., prediction- 
error methods, stochastic approximation, gradient-based methods) or represent different statistical 
approaches to parameter estimation (e.g., maximum likelihood, bayesian approach) including the state- 
space formulation of the problem, that cannot be detailed here in the interest of space [for review, see 
Eykhoff, 1974; Goodwin & Payne, 1977; Goodwin & Sin, 1984; Ljung, 1987; Ljung & Soderstrom, 1983; 
Soderstrom & Stoica, 1989]. 

163.2 Nonparametric Approach 

In the linear stationary case, the discrete-time nonparametric model takes the convolutional form: 

M 

y\ n ) = / h\ m ) x \ n ~ WJj + efn) (163.15) 

M = 

where e(n) is an output-additive error/noise term and h{m) is the discrete impulse response of the linear 
time-invariant (stationary) system with memory extent M. For finite-memory systems (i.e., when h(m) 
becomes negligible for m > M and M is finite), the estimation of h(m) from input-output data can be 
accomplished by multiple regression, since the model of Eq. (163.15) retains the form of a parametric 
model with no autoregressive terms. Note that any stable ARMAX model can be put in the form of 
Eq. (163.15), where M — > <x> and h(m) is absolute-summable. The estimation of h(m) can be also 
accomplished in the frequency domain via discrete Fourier transforms (DFTs, implemented as FFTs), 
observing the fact that convolution turns into multiplication in the frequency domain. Particular atten- 
tion must be paid to cases where the input DFT attains very small values to avoid numerical or estimation 
problems during the necessary division. In actual physiologic practice, many investigators have chosen 
to perform experiments with specific input waveforms to facilitate this identification task, i.e., the use 
of an impulsive input of unit strength directly yields h(m), or the use of sinusoidal inputs of various 
frequencies yields directly the values of the DFT of h(m) at the corresponding frequencies (covering the 
entire frequency range of interest). 

In the nonlinear stationary case, the most widely used methodology for nonparametric modeling is 
based on the Volterra functional expansions and Wiener's theory that employs a gaussian white noise 
(GWN) test input in conjunction with a modified (orthogonalized) Volterra functional expansion. 
Wiener's critical contribution is in suggesting that GWN is an effective test input for identifying nonlinear 
systems of a very broad class and in proposing specific mathematical procedures for the estimation of 
the unknown system descriptors (kernels) from input-output data, as outlined below [for more details, 
see Marmarelis & Marmarelis, 1987; Rugh, 1981; Schetzen, 1980]. 

The input-output relation of a causal nonlinear stationary system in discrete time is seen as a mapping 
of a vector comprised of the input past (and present) values onto the (scalar) present value of the output: 
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y(n) = F xltt'J, ; 



■M<n <n 



(163.16) 



were M is the system memory and F is a fixed multivariate function representing this mapping — note 
that, in continuous time, F is a functional. If the system is nonstationary, the function F varies through 
time. If the function F is analytic and the system is stable, then a discrete-time Volterra series expansion 
exists of the form: 



y\n)= y y •••> k\ n\, ..., m\x\ n — mA...x\ n— m\ 



(163.17) 



This series converges for any stable system within the radius of convergence defined by the input ensemble. 
If the multivariate function F is not analytic but is continuous, then a finite-order Volterra model can 
be found that achieves any desirable degree of approximation (based on the Stone-Weierstrass theorem). 

The multiple convolutions of the Volterra model involve kernel functions \k i (m v ..., m)} which 
constitute the descriptors of the system nonlinear dynamics. Consequently, the system identification task 
is to obtain estimates of these kernels from input-output data. These kernel functions are symmetric 
with respect to their arguments. 

When a sinusoidal input is used, then the rth order Volterra functional (i.e., the i-tuple convolution 
in the Volterra model) gives rise to output harmonics of order i, (i- 2), ..., (i-2j) where j is the integer 
part i/2. When an impulse A8(n) is used as input, then the rth order Volterra functional is contributing 
the term A'k^n, ..., n) to the output. This suggests that the Volterra kernels of a general system cannot 
be separated from each other and directly determined from input-output data, unless the Volterra 
expansion is of finite order. For a finite-order Volterra expansion, kernel estimation can be achieved 
through least-squares fitting procedures [Korenberg, 1988; Stark, 1968; Watanabe & Stark, 1975] or by 
use of specialized test inputs such as multiple impulses or step function [Schetzen, 1965; Shi & Sun, 1990; 
Stark, 1968] and sums of sinusoids of incommensurate frequencies [Victor et al., 1977], although the 
testing and analysis procedures can be rather laborious in those cases. The latter method yields estimates 
of the system kernels in the frequency domain at the selected input frequencies (and their harmonic/inter- 
modulation frequencies) with increased accuracy but with limited frequency resolution. 

The inability to estimate the Volterra kernels in the general case of an infinite series prompted Wiener 
to suggest the orthogonalization of the Volterra series when a GWN test input is used. The functional 
terms of the Wiener series are constructed on the basis of a Gram-Schmidt orthogonalization procedure 
requiring that the covariance between any two Wiener functionals be zero. The resulting Wiener series 
expansion takes the form: 



p = y G\h { ; x\n'\, n <n 



YY (- 1 ' i[P 



L.L.I 7 .\ M9 7L-L fc K-' m «^' / »-'^ / J (163 - 18) 

x{n—mA...xln— m j _ 2 I 

where [i/2] is the integer part of i/2 and Pis the power level of the GWN input. The set of Wiener kernels 
{hj} is, in general, different from the set of Volterra kernels {fc,j, but specific relations exist between the 
two sets [Marmarelis & Marmarelis, 1978]. 
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The Wiener kernels depend on the GWN input power level P (because they correspond to an ortho- 
gonal expansion), whereas the Volterra kernels are independent of any input characteristics. This situation 
can be likened to the coefficients of an orthogonal expansion of an analytic function being dependent 
on the interval of expansion. It is therefore imperative that Wiener kernel estimates be reported in the 
literature with reference to the GWN input power level that they were estimated with. When a complete 
set of Wiener kernels is obtained, then the complete set of Volterra kernels can be evaluated. Approxi- 
mations of Volterra kernels can be obtained from Wiener kernels of the same order estimated with various 
input power levels. Complete Wiener or Volterra models can predict the system output to any given 
input. When the Wiener or Volterra model is incomplete, the accuracy of the predicted output will be, 
in general, different for the two models and will depend on each specific input. 

The orthogonality of the Wiener series allows decoupling of the various Wiener functionals and the 
estimation of the respective Wiener kernels from input-output data through cross-correlation [Lee & 
Schetzen, 1965] 



i\P' 
where y,(«) is the ith response residual 



h\m v ...,m j ) = — : E yin\x\n — m l )..,x\n— m\ 



(163.19) 
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4"H")"E G i") (163.20) 
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The simplicity and elegance of the cross-correlation technique led to its adoption by many investigators 
in modeling studies of nonlinear physiologic systems [see McCann & Marmarelis, 1975; Marmarelis, 
1987, 1989, 1994; Marmarelis & Marmarelis, 1978; Stark, 1968]. 

Since the ensemble average of Eq. (163.19) is implemented in practice by time-averaging over a finite 
data-record, and since GWN is not physically realizable, many importance practical issues had to be 
explored in actual applications of the cross-correlation technique. To name but a few: the generation of 
appropriate quasi-white test signals (that adequately approximate the ideal and not physically realizable 
GWN); the choice of input bandwidth relative to the system bandwidth; the accuracy of the obtained 
kernel estimates as a function of input bandwidth and record length; the effect of extraneous noise and 
experimental imperfections. An extensive study of some of these practical considerations can be found 
in Marmarelis and Marmarelis [1978] and in Marmarelis [1979]. 

A broad class of effective quasi-white test signals (CSRS) has been introduced that can be easily 
generated on the computer and may follow any zero-mean symmetric (discrete or continuous) amplitude 
probability density function [Marmarelis, 1977]. The use of this type of test signal allows through analysis 
of kernel estimation errors via cross-correlation and the optimization of the input bandwidth B (for 
given record length N) on the basis of a total mean-square error (TMSE) criterion for the ith-order kernel 
estimate given by: 



(tMSe)=^ + ^ B ^ (163.21) 



B 4 N 

where a ; and b t are positive constants characteristic of the rth-order kernel [Marmarelis, 1979]. 

The cross-correlation technique results in considerable variance in the kernel estimates, unless very 
long data-records are available, because of the stochastic nature of the input. This has prompted the use 
of pseudo-random m-sequences [Billings & Fakhouri, 1981; Moller, 1983; Ream, 1970; Sutter, 
1992] — which can reduce considerably the data-record requirements and yield improved estimation 
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accuracy, provided that proper attention is given to certain problems in their high-order autocorrelation 
functions. To the same end, frequency-domain methods have been proposed for kernel estimation 
[Brillinger, 1970; French, 1976; Victor et al., 1977] yielding computational savings in some cases and 
offering a different perspective for the interpretation of results. 

To reduce the requirements of long experimental data records and improve the kernel estimation 
accuracy, least-squares methods also can be used to solve the classical linear inverse problem described 
earlier in Eq. (163.6), where the parameter vector includes all discrete kernel values of the finite Volterra 
model of Eq. (163.17), which is linear in these unknown parameters (i.e., kernel values). Least-squares 
methods also can be used in connection with orthogonal expansions of the kernels to reduce the number 
of unknown parameters, as outlined below. Note that solution of this inverse problem via OLS requires 
inversion of a large square matrix with dimensions [(M + I + 1)!/((M + 1)!7!)], where M is the kernel 
memory and / is the (maximum) nonlinear order of the Volterra model. Since direct inversion of this 
matrix may require considerable computing effort and be subject to conditioning problems, it is often 
expedient to solve this inverse problem by use of QR decomposition that offers certain computational 
and numerical advantages. One such implementation is Korenberg's exact orthogonalization method 
[Korenberg, 1988], which estimates the discrete kernel values through least-squares fitting of a sequence 
of orthogonal vectors built from input values in accordance with the Volterra expansion (effectively 
forming the Q orthogonal matrix of the QR decomposition). Other numerical methods also can be used 
for matrix inversion, depending on the requirements of each application. Likewise, a variety of orthogonal 
bases can be used for kernel expansion in order to obtain a more concise representation of the kernels 
in each particular application. The use of the Laguerre orthogonal basis typically results in reduction of 
the number of unknown parameters for most physiologic system kernels (due to their built-in exponential 
structure), with consequent improvement in estimation accuracy [Marmarelis, 1993; Watanabe & Stark, 
1975]. Thus, if {bim)} is a complete orthonormal basis defined over the system memory [O, M], then 
the rth-order Volterra kernel can be expanded as 

k i (m 1 ,...,m i ) = y } J c i(ii'-->Ji) b A m i)-- b i[ m i] (163.22) 

k h 

Then the Volterra model of Eq. (163.17) takes the form: 



>W = L £••!>(*' ■-■fchM - v ,M < 163 - 23 ) 



where, 



! .[n\=J b.{m\x[n-tn\ (163.24) 



and the expansion coefficients {c i (j l , ...,_/',-)} of the unknown kernels can be estimated through multiple 
regression of y(n) on the multinomial terms composed of the known functions {v(n)}. The kernel 
estimates are then reconstructed on the basis of Eq. (163.22). This kernel estimation method has been 
shown to be far superior to the conventional cross-correlation technique in terms of estimation accuracy 
and robustness to noise or experimental imperfections [Marmarelis, 1993]. 

Based on this observation, a general model for the Volterra class of systems can be proposed that is 
comprised of a set of parallel linear filters with impulse response functions {blm)} whose outputs are 
fed into a multi-input static nonlinearity, y = f(v u v 2 , ...), to produce the system output. If was Wiener 



© 2000 by CRC Press LLC 



who first proposed the use of Laguerre functions (albeit in continuous-time form) for the set |Wm)[, 
since they are defined over the interval [0, <x>) compatible with causal systems and can be generated in 
analog form — a fashionable model at the time — by a simple RC ladder network. He also suggested that 
the multi-input static nonlinearity be expanded in terms of orthogonal hermite functions to yield a 
system characterization in terms of the resulting Hermite expansion coefficients that are estimated 
through covariance computations. For a review of this approach, see Schetzen [ 1980] . This approach has 
been viewed as rather unwieldy and has not found many applications to date. However, the use of Laguerre 
expansions of kernels (in discrete time) in conjunction with least-squares fitting has been shown to be 
rather promising for practical kernel estimation from short and noisy data records [Marmarelis, 1993] 
and consequently suitable for experimental studies of physiologic systems. 

The greatest obstacle to the broader use of the Volterra- Wiener approach has been the practical 
limitations in estimating high-order kernels due to two reasons: ( 1 ) The amount of required computations 
increases geometrically with the order of estimated kernel; (2) kernel functions of more than three 
dimensions are difficult to inspect or interpret meaningfully. As a result, application of this approach 
has been limited to weakly nonlinear systems (second or third order). 

A practical way of overcoming this limitation in the order of estimated kernels has been recently 
presented as a spin-off of studies on the relation between Volterra models and feedforward artificial 
neural networks [Marmarelis & Zhao, 1994]. According to this approach, a perceptronlike network with 
polynomial activation functions is trained with the experimental data using a modified version of the 
back-propagation algorithm. The parameters of the resulting network can be subsequently converted to 
kernel estimates of arbitrary high order (although the network itself is a nonlinear model of the system 
on its own right). This alternative to current kernel estimation algorithms seems to hold significant 
promise, since it relaxes many of the requirements of existing methods (e.g., input whiteness and low- 
order models) at the only apparent cost of greater computing effort. 

In the event of nonstationarities, the kernel become dependent on time, and their estimation can be 
accomplished either in a piecewise stationary manner (when the nonstationary is slow relative to the 
system dynamic bandwidth) or through truly nonstationary estimation methods. In the former case, the 
piecewise stationary estimates can be obtained over adjacent segments or using sliding (overlapping) 
windows along the time record. They can be subsequently displayed in a time-ordered sequence (revealing 
the time-varying pattern), under the assumption of system quasi- stationarity over the data segment or 
sliding window. Kernel estimation through truly nonstationary methods may be accomplished by: 
(1) adaptive methods, employing the recursive least-squares formulae previously reviewed, which track 
changes in least-squares kernel estimates by continuous updating based on the output prediction error 
at each discrete step [Goodwin & Sin, 1984]; (2) ensemble-averaging methods, employing direct averaging 
of results obtained from many repetitions of identical experiments, which are rarely used because the 
latter is seldom feasible or practical in a physiologic experimental setting; (3) temporal expansion methods, 
employing explicit kernel expansions over time to avoid the experimental burden of the ensemble- 
averaging approach and the methodological constraints of adaptive methods, which utilize a single input- 
output data record to obtain complete nonstationary model representations under mild constraints 
[Marmarelis, 1981]. 

We note that the Volterra- Wiener approach has been extended to the case of nonlinear systems with 
multiple inputs and multiple outputs [Marmarelis &McCann, 1973; Marmarelis &Naka, 1974;Westwick & 
Kearney, 1992] where functional terms are introduced, involving cross-kernels which measure the non- 
linear interactions of the inputs as reflected on the output. This extension has led to a generalization for 
nonlinear systems with spatio-temporal inputs that has found applications to the visual system [Citron 
et al., 1981; Yasui et al., 1979]. Extension of the Volterra- Wiener approach to systems with spike (action 
potential) inputs encountered in neurophysiology also has been made, where the GWN test input is 
replaced by a Poisson process of impulses [Krausz, 1975]; this approach has found many applications 
including the study of the hippocampal formation in the brain [Sclabassi et al., 1988]. Likewise, the case 
of neural systems with spike outputs has been explored in the context of the Volterra-Wiener approach, 
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leading to efficient modeling and identification methods [Marmarelis et al., 1986; Marmarelis & Orme, 
1993]. 

The mathematical relations between parametric (NARMAX) and nonparametric (Volterra) models 
have been explored, and significant benefits are shown to accrue from combined use [Zhao & Marmarelis, 
1998]. These studies follow on previous efforts to relate certain classes of nonlinear differential equations 
with Volterra functional expansions in continuous time [Marmarelis, 1989; Rugh, 1981]. 

Only brief mention will be made of the computational and graphical nonparametric methods of phase- 
space mappings. The computational method requires prior specification of the number of input and 
output lags that are needed to express the present output value as 



y[ n\ = G y\n — l), ..., y\n — k\, xl «), xl n— l), ..., x\n — \\ 



(163.25) 



where G[*] is an unknown function which maps the appropriate input-output vector z(«) = [y(n - 1), 
. . . , y{ n — k), x(ri), . . . , x( n — I)] onto the present value of the output y{ n) . Then, using experimental data, 
we compile on the computer a list of correspondences between z{n) and y{n) in the form of an empirical 
model that can predict the output for a given z by finding the closest correspondence within the compiled 
list (look-up table) or using a properly defined interpolation scheme. 

The graphical method is based on the notion that the mathematical model of a discrete-time finite- 
order (stationary) dynamic system is, in general, a multivariate function _/(•) of the appropriate lagged 
values of the input-output variables 



/ 



y\n\ y\n-\\, ..., yin-kj, xln-lj, ..., xin-l) 



(163.26) 



This function (model) is a "constraint" among these input-output lagged (and present) values at each 
time instant n. The number of variables that partake in this relation and the particular form of _/(•) are 
characteristic of the system under study. Therefore, in a geometric sense, the system (and its model) are 
represented by a subspace in a multidimensional space defined by the coordinate system of these variables. 
If one can define the appropriate space by choosing the relevant variables, then a model can be easily 
obtained by following the vector point corresponding to the data through time — provided that the input 
is such that the data cover the model subspace densely. If the input is ergodic, then sufficiently long 
observations will allow reliable estimation of the system model. 

Although this is an old and conceptually straightforward idea, it has not been widely used (except in 
some recent studies of chaotic dynamics of autonomous systems, where no input variable exists) because 
several important practical issues must be addressed in its actual implementation, e.g., the selection of 
the appropriate coordinate variables (embedding space) and the impracticality of representation in high- 
dimensional spaces. If a low-dimensional embedding space can be found for the system under study, this 
approach can be very powerful in yielding models of strongly nonlinear systems. Secondary practical 
issues are the choice of an effective test input and the accuracy of the obtained results in the presence of 
extraneous noise. 

163.3 Modular Approach 

The practical limitations in the use of Volterra-Wiener models for strongly nonlinear systems have led 
some investigators to explore the use of block-structured or modular models in the form of cascaded or 
parallel configurations of linear subsystems (L) and static nonlinearities (N). This model representation 
often provides greater insight into the functional organization of the system under study and facilitates 
that identification task by allowing separate estimation of the various component subsystems (L and N), 
thereby avoiding the computational burden associated with the dimensionality of high-order kernel 
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functions. The advantages afforded by this approach can be had only when prior specification of the 
structure of the modular model is possible. 

Simple cascade models (e.g., L-N, N-L, L-N-L) have been studied extensively, yielding estimates of the 
component subsystems with moderate computational effort [Billings & Fakhouri, 1982; Korenberg & 
Hunter, 1986; Marmarelis & Marmarelis, 1978; Shi & Sun, 1990]. Distinctive kernel relationships (e.g., 
between first-order and second-order kernels) exist for each type of cascade model, which can be used 
for validation of the chosen modular model on the basis of the kernel estimates obtained from the data — a 
task that is often referred to as structural identification. Thus, the combined use of the modular and 
nonparametric approaches may yield considerable benefits. This idea can be extended to more compli- 
cated modular structures entailing multiple parallel and cascaded branches [Chen et al., 1990; Korenberg, 
1991]. The case of nonlinear feedback in modular models attracts considerable interest, because of its 
frequent and critical role in physiologic control and autoregulation. A case of weak nonlinear feedback 
in sensory systems has received thorough treatment [Marmarelis, 1991]. Naturally, the analysis becomes 
more complicated as the complexity of the modular model increases. Particular mention should be made 
of a rather complex model of parallel L-N-L cascades (usually called the S m model) that covers a broad 
class of nonlinear systems [Billings & Fakhouri, 1981; Rugh, 1981]. 

A method for the development of a general model for the Volterra- Wiener class of systems, which 
assumes a modular form, was originally proposed by Wiener and his associates and is reviewed in Schetzen 
[1980]. A general modular model, comprised of parallel L-N cascades, also has been proposed by 
Korenberg [1991], and another, employing principal dynamic modes as a filterbank whose outputs feed 
into a multi-input static nonlinearity, has been recently proposed by Marmarelis [1997]. 

This latter general model evolved from the original Wiener modular model. It was first adapted to 
studies of neural systems that generate spikes (action potentials), whereby a threshold-trigger is placed 
at the output of the general modular model that obviates the use of high-order kernels and yields a 
parsimonious complete model [Marmarelis & Orme, 1993]. The importance of this development is found 
in that, ever since the Volterra- Wiener approach was applied to the study of spike-output neural systems, 
it had been assumed that a large number of kernels would be necessary to produce a satisfactory model 
prediction of the timing of output spikes, based on the rationale that the presence of a spike-generating 
mechanism constitutes a hard nonlinearity. Although this rationale is correct if we seek to reproduce the 
numerical binary values of the system output using a conventional Volterra-Wiener model, the inclusion 
of a threshold trigger in our modular model yields a compact and complete model (of possibly low order) 
that predicts precisely the timing of the output spikes [Marmarelis et al., 1986]. This also led to a search 
for the principal pathways of dynamic transformations of neural signals using eigen-decomposition of 
a matrix composed of the Laguerre expansion coefficients of the first- and second-order kernels [Mar- 
marelis & Orme, 1993]. The estimation of these principal dynamic modes is accomplished via the 
aforementioned eigen-decomposition [Marmarelis, 1994] or through the training of a specific artificial 
neural network (a modified perceptron with polynomial activation functions) using a modified back- 
propagation algorithm [Marmarelis & Zhao, 1997]. The latter method holds great promise in providing 
a practical solution to the ultimate problem of nonlinear system identification, because it is not limited 
to low-order nonlinearities and does not place stringent requirements on the necessary input-output 
data, but it does retain a remarkable degree of robustness in the presence of noise. 
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164.1 Introduction 







The control of blood flow is a process involving neural and metabolic mechanisms. It is widely accepted 
that neural processes accomplish short term control of blood pre-capillary flow resistance such that blood 
pressure and flow are regulated. While metabolic processes dominate in the long term (Guyton, 1963) 
they ultimately determine the regulated blood flow levels in the various tissues in accordance with 
metabolic demands. This is achieved locally by the precapillary sphincters that adjust the duration and 
number of capillaries open and which, in turn, determine the value of the peripheral resistance, R s . 
Another theory suggests that it is the mechanical effect of pressure and flow that stimulates vascular 
smooth muscle to react. However experiments have shown that tissue blood flow and systemic circulation 
is directly related to its metabolic demand (Berne and Levy, 1977). Experimental observations of low 
frequency flow oscillations can be found in studies of a single vascular bed as well as the entire systemic 
circulation. This differs from mechanical stimulation where researchers have employed a drug response. 
For example, Kenner and Ono (1972) have observed what they termed flow auto-oscillations in the 
carotid and femoral arteries of the dog, following intravenous administration of acetylcholine. Similar 
oscillations in flow have been observed following the infusion of adenosine into the coronary circulation 
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(Wong and Klassen, 1991). These researchers attributed this response to opposing vasodilator and 
vasoconstrictor processes. 

Other experimental evidence of low frequency resonance can be found in measurements of heart rate 
variability and blood pressure variability spectra (Chess et al., 1975; Akselrod et al., 1981; Pagani et al., 
1986). Experiments in animals have shown that low frequency oscillations persist following denervation 
(Rimoldi et al., 1990), during external constant cardiac pacing (Wang et al., 1995), and in animals and 
humans with an artificial heart (Yambe et al., 1993; Cooley et al., 1998). To our knowledge, the origin 
of the low frequency resonance in the variability spectrum, the so-called Mayer waves (Mayer, 1876) is 
currently unexplained, although vasoregulatory dynamics has been proposed. The study by Yambe et al. 
(1993) revealed that the long waves were nearly 100% correlated with the periodicity of peripheral 
resistance. 

Further autoregulatory instability has been observed in the pathological situation. For example, prom- 
inent low frequency oscillations have been observed during heart failure (Goldberger et al., 1984). While 
it has been proposed that these oscillations follow the respiratory rate, they have been observed to occur 
at even lower frequencies. 

With innervation intact, the circulation at rest maintains constant ventricular stroke volume and blood 
pressure, on average. In addition, the regulation of cardiac output is accomplished primarily through the 
control of peripheral resistance. Then, the slow changes in heart rate (f h ) are directly related to cardiac 
output (CO) and inversely to peripheral resistance (R s ), so that, f h a COa 1/R S (Berne and Levy, 1977). 
This relationship is fundamental to the vascular theory of heart rate variation (Hering, 1924) and 
underscores the role of a time varying peripheral resistance. In this article, the dynamics of peripheral 
resistance control is examined analytically as an explanation of the very low frequency variation in heart 
rate. 



164.2 Background 



The control of blood flow resistance has been analyzed theoretically at both the organ and the microcir- 
culatory level. Since metabolic demand of the tissue is related to oxygen uptake under aerobic conditions, 
Huntsman et al. (1978) developed a model for control of blood flow resistance for skeletal muscle based 
on this concept. In this study, it was found that the metabolic control considerably outweighed that of 
neural and myogenic. The time frame of observation was also important, where longer time periods 
emphasized the metabolic control factors. The model controlled flow by means of regulating the oxygen 
stored in the extra-vascular spaces. The control of peripheral resistance was affected by invoking the 
relationship between arterial diameter and oxygen tension. This model successfully predicted the large 
blood flow response following a brief occlusion. 

The autoregulation response should ultimately arise from the action of multiple precapillary sphincters 
and resistance arterioles in the microcirculation. The control of flow in a microvascular model was 
analyzed by Mayrovitz et al. (1978). This model included muscular arterial and venous vasomotion, 
capillary filtration and reabsorption, and lymph flow. Tissue pressure was assumed to be regulated and 
was used to provide the control pathway for the activation of the precapillary sphincter. Local flow was 
found to vary considerably with periodic sphincter activity. This model demonstrated that autoregulation 
of flow is likely to find its genesis at the microcirculatory level. 

Although our knowledge of pulse propagation in the arterial system is extensive, it has been useful to 
approximate the relationship between pressure and flow in a single artery or vascular bed. Such approx- 
imations have been referred to as reduced arterial system models of which the three element Windkessel 
is the most widely employed (Westerhof et al., 1971; Noordergraaf, 1978). This Windkessel model is 
modified here to study the effects of peripheral resistance. 

The three element Windkessel model approximates the pulse propagation quality of the arterial system 
as a combination of an infinitely long tube, which is represented by its characteristic impedance, Z , in 
series with a parallel arrangement of a peripheral flow resistance, R s , and a total arterial compliance, C s . 
Within a single cardiac cycle, these three parameters are generally assumed to be constants (Toorop et al., 
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1987). Recent studies have scrutinized this assumption. For example, the compliance has been shown to 
depend on the level of arterial pressure (Bergel, 1961). Other studies have called into question the accuracy 
of the various methods employed to experimentally determine their values (Stergiopulos et al., 1995a). 
This difficulty has been addressed by Quick et al. (1998) who showed that wave reflection imparts 
frequency dependence on the values of R s and C s . 

The Windkessel model is often employed as a simplified load for the left ventricle. Under steady 
conditions and a fixed heart rate, this model is appropriate. But, given the case of varying ventricular 
function, the model responds inaccurately. The difficulty lies in the regulation of cardiac output, i.e., 
autoregulation is not provided. To correct this problem, our group has employed an iterative procedure 
that permits R s to change such that cardiac output is maintained (Drzewiecki et al., 1992). This procedure, 
although providing autoregulation, does not account for the dynamics associated with this physiological 
mechanism. 

In this chapter, the dynamics of autoregulation are incorporated into the modified Windkessel model. 
The frequency response of this autoregulating Windkessel is then predicted and compared with that of 
the standard three element model. The time response is also determined and discussed relative to the 
experimental observations in the literature. The stability of the model is also examined with respect to 
very low frequency oscillation in peripheral resistance. 

The physical relationships that govern the Windkessel model are now reviewed. Time dependency is 
provided by the blood volume storage property of arterial compliance, C s . The time derivative of periph- 
eral arterial pressure, P s , is proportional to the difference in blood flow into and out of the compliance, 
according to, 



dP 

S_ 

dt 






v 



R. 



C 



1 (164.1) 



to 



where Q is the inflow. P accounts for the critical closing pressure (Alexander, 1977). The input pressure, 
P, is then determined from the sum of pressure due to flow through the characteristic impedance and 
the pressure, P s , 

P = QZ o + P (164.2) 

The arterial compliance may be treated as a function of pressure also, accounting for large changes in 
blood pressure. The pressure dependence of compliance is then modeled by an exponential function (Liu 
et al, 1989; Li et al, 1990), as, 

C s = ae- bp * (164.3) 

where a and b are empirical constants evaluated from experimental data. 

164.3 Modeling Methods 

Steady Regulation 

In this study, it will be assumed that the local tissue bed or whole animal is in a steady metabolic state. 
For this regulated condition, the level of blood flow is denoted Q the peripheral resistance, R so , and the 
pressure across it, P so . The relation between these quantities reads, 

P so = RsoQo (164.4) 

When the flow is disturbed, the value of the peripheral resistance is adjusted in an attempt to restore 
flow to Q . Since the control of flow is not ideal, the new steady flow condition is described by 

P s =R rEg Q (164.5) 
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where R is the new value of peripheral resistance. Applying the simplest form of control, which is also 
imperfect, it is assumed that the change in peripheral resistance is proportional to the change in flow, 
i.e., linear proportional control. The change in peripheral resistance is then determined from the control 
equation, 



(Q-Qo) 



G 

AR =R -R =— ^ - (164.6) 

s reg so „ ' 

where G is defined as the autoregulatory gain. 

Autoregulation Dynamics 

There are two dynamic effects that must be accounted for to accurately model the physiological control 
mechanism. First, the change in peripheral resistance is delayed. This represents a delay from the time 
that the flow changes to when the vascular smooth muscle activity actually initiates the process of 
correcting the flow. This time delay in the change of peripheral resistance is denoted x Q . To account for 
this effect, the change in peripheral resistance determined from the steady value was written as AR s (t - t q ). 
Physiologically, even when peripheral resistance adjusts, it cannot be changed instantaneously. To 
account for the slow change in resistance, a first-order rate equation was used to represent the slow action 
of vascular smooth muscle. Thus, the time dependent value of peripheral resistance was determined from 
the following, 

— ^ = — AR,(t-x Q ) (164.7) 

dt x s v «; 

sm 

where the time constant for smooth muscle action is x sm . This equation simply states that the rate of 
change of peripheral resistance with respect to time is proportional to desired change in resistance, AR S . 
The net effect of delay in autoregulation and smooth muscle response is to cause R s to slowly approach 
its new control value, as defined by Eq. (164.6). 

Regulatory Limitations 

If the actual blood flow differs significantly from Q , the regulatory error can be large. Since a linear 
model is being employed here, the value of R s predicted by Eqs. (164.4)-(164.7) may fall outside of the 
physiological range. Thus, the time dependent value of R s was limited to maximum and minimum values 
of 1.6 R so and 0.4 R so , respectively. These values correspond with maximal vasoconstriction and vasodi- 
lation. When the computations reveal that the limit has been reached, the limited value of resistance was 
then inserted into Eq. (164.1) of the Windkessel model. Otherwise, the time dependent value obtained 
from Eq. (164.7) was employed. 

The two differential equations (Eqs. 164.1 and 164.7) govern the time dependent characteristics of the 
autoregulating Windkessel model. They were integrated numerically by a computer. All parameters were 
evaluated for the canine systemic arterial system (Table 164.1). 

164.4 Results 

Autoregulation Curve 

The standard experimental approach to measuring the autoregulation curve of a vascular bed has been 
to slowly vary input flow while recording pressure. Alternatively, one may alter pressure while measuring 
flow. In either case, the dependent variable stabilizes after several minutes. The set of final values results 
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TABLE 164.1 Parameters 



Parameter 


Value 


Description 


z 


0.25 mmHg/mL/sec 


Characteristic impedance 


c s 


0.50 mL/mmHg 


Arterial compliance 


K, 


3.10 mmHg/mL/sec 


Basal peripheral resistance 


a 


3.0 mL/mmHg 


Nonlinear compliance constant 


b 


0.02 mm Hg 1 


Nonlinear compliance constant 


G 


5.0 


Autoregulation gain 


Q„ 


27.0 mL/sec 


Regulated flow level 


T Q 


5.0 sec 


Flow delay time 


T sm 


3.0 sec 


Smooth muscle time constant 



in the steady state autoregulation curve. This procedure was followed using the autoregulating Windkessel 
model (Fig. 164.1). To reduce computation time, the time constants of the model were shortened. This 
does not affect the result, but simply shortens the time to reach steady state. 

The model generated autoregulation curve shows three distinct segments. The low flow segment was 
found to be a line of slope 0.4R SO and corresponds with maximal vasodilation. There was no control of 
flow in this region either. The middle flow segment represents the autoregulatory range. The peripheral 
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FIGURE 164.1 Pressure and flow relationship of the canine aorta obtained from the autoregulating Windkessel in 
the steady state. Flow regulation occurs in the range where the slope is steepest. 
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resistance was found to alter continuously in accordance with the feedback control relationship of 
Eq. (164.6) in this region. The slope of this segment was also proportionate to the autoregulatory gain 
factor, G, of the model. A greater slope reflects tighter control of flow, but, as will be shown later, can 
lead to autoregulatory instability. 

Input Impedance 

The frequency response of the autoregulating Windkessel was determined by applying a small constant 
amplitude flow sinusoid as input while the mean flow is held fixed. The resulting sinusoidal pressure 
response was computed from the model. The response was evaluated for as many cycles as were needed 
to obtain a sinusoidal steady state. For frequencies much below the heart rate, this required up to 
30 seconds. The input impedance magnitude was then obtained by finding the ratio of the sinusoidal 
pressure and flow amplitudes over the range of frequencies. Phase was determined by finding the phase 
difference between the pressure and flow waves (Fig. 164.2a and b). In order to compare these results 
with the conventional modified Windkessel, the input impedance was re-evaluated when the autoregu- 
latory gain is zero (G = 0) and is shown on the same graphs. 



Input 

Impedance 4 

[mmHg/ mL/ sec] 



Modified Windkessel 
Frequency Response 




Autoregulation gain = 10 



Autoregulation gain = 



1 
Frequency [Hz] 



Phase 
[degrees] 




Frequency [Hz] 



FIGURE 164.2 Input impedance magnitude (A) and phase (B) frequency spectrum obtained from the model. The 
oscillating curve represents active autoregulation. The smooth curve is the standard three - element Windkessel with 
constant peripheral resistance. 
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The high frequency impedance was found to be identical for both models. This might be expected, 
since autoregulation is slow compared with the heart rate. An unexpected result was that the impedance 
and phase were found to oscillate at about that of the modified Windkessel for frequencies near the heart 
rate. This oscillation was traced to the autoregulation delay time, T Q . The magnitude of these oscillations 
was found to increase further if the autoregulatory gain is increased. If the gain or the delay time was 
reduced to zero, the oscillations are eliminated. These oscillations in the frequency domain could have 
been predicted by referring to the Fourier transform of a delayed time function. 

The very low frequency portion of the autoregulating impedance magnitude was higher in comparison 
with that of the unregulated Windkessel, where impedance approaches R s at zero frequency. This can be 
explained by referring again to Fig. 164.1. Instead of converging to R s , the autoregulating impedance 
approaches the slope of the pressure-flow curve. That is because the sinusoidal response measures the 
dynamic variation about the flow control point, Q . This is not contradictory, since the mean pressure 
divided by the mean flow is simultaneously R s = R so , at this point. And, the derivative of pressure, dP/dQ, 
for a given flow, must be the slope of the autoregulation curve in the steady state. 

Flow Step Response 

The pressure was computed from the model given a step increase in flow above the control level. A flow 
step was initiated 2.0 seconds from the beginning of the calculations (Fig. 164.3). An immediate increase 
in pressure occurs. This coincided with the time dependent increase in blood volume that shifts into the 
compliance, C s . The peripheral resistance did not change during this time due to flow delay. It remains 
at the control value of R so . Five seconds later, the resistance begins to increase towards the new value 
required by Eq. (164.4) and for autoregulation. Pressure increased nearly in proportion with resistance. 
The rate of change was determined by the time constant of smooth muscle response, T sm , and Eq. (164.5). 
An equilibrium value was reached about 20 seconds after the flow step occurred. 

The above flow step response was repeated with a constant amplitude flow pulse added to the mean 
flow (Fig. 164.4). A flow step was initiated 10 seconds from the beginning of computations. The envelope 
of the pressure response follows that shown in Fig. 164.3, but additionally permits the observation of 
systolic and diastolic pressures. It was found that the pulse pressure amplitude increases following the 
step. This was associated with the decrease in pressure dependent compliance due to elevated pressure 
(Eq. 164.3). 

Pressure Step Response 

The flow was computed from the model following a step change in input pressure. The step change was 
initiated immediately by altering the initial condition for pressure from the steady state value of 83 to 
90 mmHg (Fig. 164.5). 

The graphs for flow and resistance were found to be 180° out of phase. This was expected, since they 
are inversely proportional for constant pressure. Part of the initial increase in flow was due to transient 
filling of the compliance, C s . Flow was found to oscillate about its control value. This was reasonable 
since the control system consists of feedback with two time lags. Thus, the stability of control becomes 
a consideration. It was found that increasing t q or the gain factor, G, led to unstable low frequency 
pressure oscillations. The smooth muscle time constant, T sm , was found to possess a damping effect and 
could be increased to eliminate oscillations. Thus, too rapid smooth muscle activation and accurate 
autoregulation were predicted to be a destabilizing influence on the circulation. Since the flow resistance 
was always limited to either maximal vasoconstriction or vasodilation, the amplitude of the oscillations 
was also constrained. 

As another type of pressure step, an arterial occlusion was computed from the model. A two second 
occlusion was applied by rapidly reducing the input pressure to zero and then restoring it to the 
preocclusion level. The initial effect was to cause a rapid negative flow (Fig. 164.6). This represents blood 



O 2000 by CRC Press LLC 



1 1 1 1 1 r — ■ 

Flow 
[ml/sec] 


' 


1 1 In 1 1 



Pressure [mm Hg] 




-5 5 10 15 20 25 30 



Peripheral Resistance 
[mm Hg/ mU sec] 



10 15 20 25 30 

Time [sec] 

FIGURE 164.3 Flow step response of the model for steady levels of flow. The time responses of pressure and 
peripheral resistance are shown. 

volume decrease of the arterial compliance. The effect was brief since the outflow time constant is less 
than one second. Peripheral resistance was unchanged until the end of the occlusion due to autoregulation 
delay. Afterwards, it rapidly dropped to the minimal resistance level of complete vasodilation. When the 
occlusion was released, the flow rapidly increased. Initially, the flow was briefly high at positive values 
as the blood volume of the compliance was restored. Flow remained elevated to relieve the ischemia. 
Following a delay time, flow was then restored to its initial value prior to occlusion. This was done in 
an oscillatory manner. 

Minimal Circuit Model 

As in much of hemodynamic research, the Windkessel model, as well as others, have been portrayed in 
electrical circuit form. The autoregulating Windkessel described here cannot be simplified completely in 
circuit form. But, a useful approximation was designed. It was assumed that the time constants of 
autoregulation and smooth muscle activation are minimal. Then, autoregulation is immediate. Addi- 
tionally, it was assumed that vasoconstriction and vasodilation are not limited. Under these conditions, 
the circuit model of Fig. 164.7 can be used to approximate the autoregulating Windkessel. The constant 
current source represents the control value of autoregulated flow, Q„. The resistance, R auto , is the slope 
of the autoregulation line (Fig. 164.1). When the actual flow, Q = Q , the average blood pressure is equal 
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FIGURE 164.4 Flow step response as in Fig. 164.3, but with the addition of constant pulsatile flow. The effect of 
nonlinear compliance is evident as the pulse pressure increases following the increase in mean flow. 

MAP 
to the pressure source, MAP. Also, under this condition, R s = R so = -q — . Note, that these expressions 

assume that the pressure drop across the characteristic impedance, Z , is negligible. 

The impedance frequency spectrum was evaluated for the autoregulating circuit model. It was found 

to follow the result previously shown in Fig. 164.2, with the exception of impedance oscillations. The 

predicted impedance was in error by the amount of the oscillation amplitude. On average, over a range 

of frequencies, the error was negligible. Hence, for steady state conditions, the minimal circuit model 

offers a simple solution to the autoregulating Windkessel. The transient step responses, as predicted by 

the complete model, cannot be reproduced by this simplified circuit model. 

164.5 Discussion 

The autoregulation curve described in the current model differs from physiological data only in the sense 
that it is a segmented linear description. This was due to the fact that the model assumed linear feedback 
regulation. The actual flow regulation system possesses nonlinearities not accounted for in the current 
model. Most researchers have found that the steady state pressure flow curve is sigmoidal (Folkow, 1953; 
Granger and Guyton, 1969). Other researchers have expressed these nonlinearities as feedback gain that 
is flow dependent (Van Huis et al., 1985). Still others have found that a linear pressure-flow relationship 
is a good approximation within the normal physiological range of blood pressure (Sagawa and Eisner, 
1975). In this case, the current model would be most accurate when flow is near Q and blood pressure 
is near MAP. 

The autoregulation curve has been shown to be affected by the carotid baroreflex function (Shoukas 
et al., 1984). Increased carotid pressure leads to a decrease in peripheral resistance so that blood pressure 
can be maintained at a given cardiac output. In spite of the baroreceptor effect, these same researchers 
have shown that the steady state pressure-flow curve remains sigmoidal in shape. Thus, the concept of 
a steady state autoregulation curve applies just as well as when the baroreceptors are functional. The net 
effect of the baroreceptor response is to decrease the autoregulatory gain, as observed by a decrease in 
the slope of the steady state pressure-flow curve. This might be expected since pressure has the opposite 
effect on peripheral resistance to that of flow (Burattini et al., 1991). This would have the added benefit 
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FIGURE 164.5 Pressure step response computed from the model (x Q = 2, x sm = 5, and G = 10). The step was 
produced by altering the initial condition for pressure at t = 0. Note that pressure and resistance are 180° out of 
phase. A damped resonant response is evident with a resonant frequency of -0.1 Hz. 

of stabilizing the autoregulation system by preventing its gain from becoming too high (see stability 
below). Thus, since the autoregulating Windkessel does not incorporate this response, its primary value 
is in the analysis of peripheral vascular beds. And, provided that the autoregulatory gain is reduced to 
account for the baroreceptor effect, it can be applied to the systemic circulation. 

The autoregulating Windkessel incorporates the critical closing pressure phenomenon (Alexander, 
1977). Critical closing causes the flow to cease at a slightly positive pressure, P . Debate over the mech- 
anism that leads to P continues. Since its value is typically small compared with systemic blood pressure, 
its value was set to zero here. 

The low frequency impedance of the autoregulating Windkessel was found to be greater than that 
predicted by the modified Windkessel (Fig. 164.2). The sinusoidal input impedance of a vascular bed has 
not been explored extensively below the frequency of the heart rate. Sipkema and Westerhof (1980) 
examined the low frequency range experimentally using an external sinusoidal pump. Under these 
conditions, it was found that the low frequency impedance approaches the slope of the autoregulation 
curve, as predicted by the autoregulating Windkessel. These researchers distinguish the low frequency 
impedance from peripheral resistance by defining it as a dynamic resistance. 

Input impedance measurements have identified a variation in the frequency spectrum. While exper- 
imental studies have not provided an explanation for the origin of these variations, the current model 
ties them to autoregulation delay (t q ). The model predicts that the spectrum should follow an oscillatory 
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FIGURE 164.6 A vascular occlusion produced by reducing the input pressure to zero for two seconds. The flow 
response transients are due to compliance effects. Flow increases to maximum value corresponding with complete 
vasodilation. A damped oscillatory response returns the flow and resistance to control levels. 
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FIGURE 164.7 Simplified autoregulating Windkessel in electrical circuit form. 

pattern, particularly for frequencies at and below the heart rate. Moreover, the sinusoidal steady state 
may not be easily achieved experimentally. The model required over one minute duration to obtain the 
steady state response. The model further predicts an erratic or random variation in the impedance 
spectrum if the steady state was not obtained. Another explanation could be that the different vascular 
beds may possess slightly different delay times. When measured at the aorta, this may appear as impedance 
scatter. Other researchers have attributed the source of impedance variations to nonlinear vascular 
properties (Stergiopulos et al„ 1995b). 
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The flow step response of the model follows experimental studies quite precisely (Braakman et al., 
1983; Van Huis et al., 1985; Shoukas et al., 1984). Others have observed experimentally an initial auto- 
regulation delay and a slow adjustment of the vascular smooth muscle in response to a step in flow. The 
time constants of the model were determined in accordance with previous observations. The time course 
of the flow step response was similar for a single vascular bed and for the entire systemic circulation. 
Experimental measurements found no oscillations in response to a flow step for the systemic circulation 
(Shoukas et al., 1984). Hence, employing a higher order model than was used here would be of little 
value. But, in vivo measurements in organs have detected the presence of oscillations in the step flow 
response (Westerhof et al., 1983). The amplitude of these oscillations were small. Modifying the model 
to include these higher order dynamics may be at the sacrifice of simplicity and was not considered in 
this study. 

The flow step response examines the autoregulation control under open loop conditions. This proce- 
dure essentially forces the flow to a specific level independent of the tissue's metabolic demands. The 
results of this experiment provide information about the dynamics of the regulatory process. Alternatively, 
the pressure step response permits autoregulation to function under in vivo conditions and provides the 
closed loop response. Under these conditions, the autoregulating Windkessel shows a damped resonant 
flow response. This resonant behavior was linked to the time lag of flow regulation. Damping was provided 
by the slow smooth muscle function. Others have modeled the resonant behavior of autoregulation as a 
second-order control system that attempts to regulate blood pressure by adjusting the peripheral resis- 
tance (Burattini et al., 1987). While this approach may achieve a similar response, the physiological 
mechanism differs from that employed in our model. Moreover, data obtained from the open loop flow 
response, supports the use of a time lag theory. In general, oscillations have been modeled and observed 
in other biological systems that possess time lags in their feedback pathways (An der Heiden, 1979.). It 
may be reasonable to expect that biology accomplishes flow autoregulation in a way similar to other 
types of homeostasis. 

While the response to a vasoactive drug was not computed from the autoregulation model, it can be 
obtained by applying a step change in the time dependent value of R s (Kenner and Ono, 1972). In this 
case, the model also predicts that oscillatory flow will result. Hence, the model suggests, alternatively, 
that the origin of oscillations may be due to regulatory delays. This interpretation, though, does not rely 
on conventional control system analysis and does not involve the parasympathetic and sympathetic reflex 
pathways (Wong and Klassen, 1991). 

The model predicted that autoregulation can be unstable for some parameter values. When the 
parameters were calibrated to match flow step experiments, the model produced a stable result. But, the 
pressure step can best be characterized as a damped low frequency resonance. Experimental evidence of 
very low frequency resonance can be found in measurements of the heart rate variability and blood 
pressure variability spectra (Chess et al., 1975; Akselrod et al., 1981; Pagani et al., 1986), following den- 
ervation (Rimoldi et al., 1990), during external constant cardiac pacing (Wang et al., 1995), and in animals 
with total artificial heart (Yambe et al., 1993). This experimental observation that a nervous system is 
not necessary to produce such low frequencies in the cardiovascular system has been perplexing. So much 
so, that these oscillations have been referred to as the Mayer waves (Mayer, 1876), an observed but 
currently unexplained phenomenon. The autoregulation model provided here supports the theory that 
these low frequency oscillations find their origin in the intrinsic delay of flow regulation. 

Model parameter values that lie outside the normal physiological range can further enhance autoreg- 
ulatory instability. Abnormal values may occur in the pathological situation. Oscillations in pressure and 
flow were predicted by the model to lie well below heart rate, in the range below 0.1 Hz for the dog. For 
example, prominent low frequency oscillations have been observed during heart failure (Goldberger et al., 
1984). While it has been proposed that these oscillations follow the respiratory rate, they have been 
observed to occur at even lower frequencies. The autoregulating Windkessel model offers a possible 
explanation of the source of these oscillations given pathological conditions. It also suggests the possibility 
that they may be used to noninvasively monitor the condition of the autoregulatory system. 
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164.6 Conclusions 

The autoregulating Windkessel was found to accurately reproduce the experimental flow step response 
of both vascular beds and the systemic circulation. The model further provided the steady-state pressure- 
flow autoregulation curve, in linearized form. The impedance spectrum was predicted to differ from that 
of the three element Windkessel for frequencies below the heart rate. For frequencies near zero, the 
impedance approached the slope of the pressure-flow autoregulation curve, as opposed to peripheral 
resistance. But, simultaneously, the average pressure divided by the average flow of the model yielded 
the usual value of peripheral resistance. The low frequency impedance spectrum was also predicted to 
contain oscillations that were traced to autoregulatory delay. This result provides an explanation of the 
variability obtained experimentally during multi-beat examination of the impedance spectrum. The 
pressure step response of the autoregulating Windkessel revealed a damped low frequency resonant 
behavior. The model parameters that cause increased autoregulatory gain and decreased damping were 
found to lead to sustained flow oscillations at frequencies well below the heart rate. This resonance 
explains the origin of very low frequencies observed in the heart rate variability spectrum referred to as 
the Mayer waves. 
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The mechanical system being controlled is referred to as the plant. The configuration of the system at 
any instant in time comprises the plant states. The devices that power the system are called actuators. 
The signals driving the actuators are called controls. The controller is the process by which the controls 
are generated. The time histories of the plant states in response to the control signals are referred to as 
the system trajectory. 

Controllers can be designed to function without sensors and therefore without knowledge of the actual 
plant trajectory (open-loop control, Fig. 165.1a). Reference-based open-loop controllers precompute and 
store control signals, and execute the desired motor task in real-time. To correct for disturbances and 
modeling errors, a feedback controller with ongoing knowledge of the effects of the disturbances can be 
used. Closed-loop controllers (Fig. 165.1b), or so-called error driven controllers use feedback information 
from sensors measuring the states. Both controllers described require full knowledge of the system 
parameters, desired trajectories of the system components, and allowed tracking errors. Operation of an 
open-loop or closed-loop controller can be simulated using analytic tools derived from mechanics and 
mathematics. 

Nonanalytic control is the method developed following the biological control. The system operation, 
not the plant, is modeled as a set of mappings between inputs and outputs (Fig. 165.1c). 

Control of movements in humans is a very complex task dealing with a large scale, nonlinear, time- 
variable, dynamically non-determined, redundant system [1]. A dynamical system may change its states 
in a continuous way. Without an optimization criterion, all trajectories of a system are equivalent. The 
term trajectory has a broad meaning; it does refer to the way the system coordinates vary in the course 
of transition from the current to the next state. An optimization criterion assigns to each transition 
trajectory or a subset of trajectories, a value so that they can be arranged in the order of preference. In 
some cases, the trajectories of the dynamical system may be well ordered so that a single solution stands 
in front. Such a trajectory is given the name, the optimal solution. Instances when optimal solutions of 
control tasks exist, or can be analytically determined, are relatively rare although this term is used rather 
loosely in everyday life. 

165.1 Biological Control of Movements 

Control of goal directed movements. Arm movements to a spatial target utilize sensory information 
(kinesthetic and visual) that is initially represented in different frames of reference, and the sensory signals 
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FIGURE 165.1 Open (a) and closed-loop (b) control systems. Nonanalytic model (c) for control of movements. 

that specify target location need to be transformed into motor commands to arm muscles [2]. Arm 
movements illustrate an aspect of sensorimotor transformations. In arm movements the direction of 
force and movement do not coincide. The transformation between kinematics and kinetics is nontrivial 
in the case of arm motion. Not much is known about how this transformation might be implemented 
by neural circuits. Several investigators [3,4,5] have provided mathematical formulations of the problem. 
Other investigators have quantified biomechanical factors, such as muscle stiffness [6] and the changes 
in the muscle lever arms with posture, which also affect the relationship between force and movement. 
Arm muscle activation onset [7] and activation waveforms [8] have been empirically related to the 
direction of movement. 

The automatic control of the arm reaching motion to accomplish grasping has been discussed in great 
detail [e.g., 9,10]. Imperfect models, wrong initial conditions, unmodeled perturbations and computa- 
tional difficulties are serious problems in implementing analytical closed-loop control methods [11-14]. 

A simplified control method for reaching while grasping has been proposed [ 15] . This approach follows 
a so-called "extended physiological proprioception" control developed for bilateral amputees in Scotland 
[ 16] . The number of control variables can be reduced to a single input by establishing a synergy between 
joint angles. 

Information about the target is provided by the visual system, whereas proprioceptors are adequate 
to signal initial arm posture [17,18]. Because proprioceptors sense muscle length and joint angles [19], 
the initial frames of reference for kinesthesis are fixed to the limb segments, i.e., elbow joint angles are 
initially sensed in the frame of reference fixed to the upper arm [20-22]. 

Control of cyclic movements. Three major requirements for cyclic movements (e.g., locomotion, cycling) 
are the following: 1) production of a basic cyclic rhythm which can control extremities and the trunk 
against gravity and propel it in the intended direction; 2) postural control of the moving body; and 
3) adaptation of movements to meet environmental demands, tasks and preferred modalities of propulsion. 
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Because locomotion is achieved through the repetition of a well-defined movement, the emphasis in 
research to date has been on identifying principles and mechanisms that govern the generation of this 
basic rhythm. The idea of a central pattern generator for animal locomotion is generally attributed to 
G. Brown [23]. At that time, physiology was under the influence of strong individuals such as Sherrington 
[24] who had developed the concept of reflexes and favored the idea of locomotion being generated by 
a closed chain of such reflexes. The idea of autonomous central generators for a behavior such as 
locomotion was not fashionable and it took about 70 years for the concept to be generally accepted [25]. 

An important and relevant question is: Can the models and principles for control of locomotion 
developed from animal models be applied to humans? Animals afford far greater latitude in experimental 
paradigms than humans, and therefore are important for gathering physiological knowledge. The implicit 
assumption in using animal models is that knowledge gained can be applied to humans. Past experience 
has provided partial justification for this thinking. While walking, animals are mostly in a statically stable 
position, while in humans, the center of mass is outside the base of support for approximately 80% of 
a stride [26]. In quadrupeds, during locomotion, a tripodal support is ensured at all times, while in 
bipedal locomotion, a single support phase is interchanged with a double support phase. The contact 
surface of the foot to the ground is relatively large in humans compared to many animals. Traditionally 
human locomotor studies have concentrated mainly on a straight unobstructed gait. This has provided 
a large database for the characteristics of the basic locomotor patterns [e.g., 27,28]. 

The bipedal locomotion can be understood as a process whereby a pendulum like movement of the 
body about a point above the head (about 10 to 16%) is followed with an inverted pendulum-like 
movement about the leg in stance [29]. This behavior is repeated on the contralateral side of the body, 
and there is a temporal shift of 50% of the gait cycle. During the inverted pendulum phase of the 
locomotion the potential energy of the body is initially increased as the kinetic energy decreases and 
muscles push the body center of the mass against the gravitational force. This is followed by a decrease, 
when the center of the mass is falling down and moves faster. In the pendulum-like phase of the gait 
cycle the body swings with simultaneous shortening of the swinging leg to ensure ground clearance. 
Muscle activity varies with the speed of locomotion: for slow walking, muscle activity decelerates the 
body and diminishes the effects of gravity and inertial forces; for fast walking, muscles accelerate the 
body against gravity [30]. 

Locomotion is achieved by coordinated movements of the body segments employing an interplay of 
internal and external forces, e.g., muscular, inertial, gravitational and frictional. The muscular actions 
and their efficacy depend on the mastery of the usage of external forces. Analysis of pathological and 
normal gait patterns [31-33] suggests that it is impossible to define normal locomotion. However, there 
is a more or less well defined set of profiles for ground reaction forces and joint angles which can be 
recorded in most humans with no obvious musculo-skeletal and sensory injuries, and they highly 
correlate with patterns of muscle activities. 

165.2 Modeling of Human Extremities 

Modeling of a human body with the emphasis on the extremities is the prerequisite for the synthesis of 
control. Human extremities are unlike any other plant encountered in control engineering especially in 
terms of joints, actuators and sensors. This fact must be kept in mind when applying the general equations 
of mechanics to model the dynamics of functional motions. A simple extension of analytical tools used 
for the modeling of mechanical plants to the modeling of biomechanical systems may easily produce 
results in sharp discrepancy to reality. 

A basic problem in motor control of human extremities is in the planning of motions to solve a 
previously specified task, and, then, controlling the extremity as it implements the commands. In terms 
of mechanics, extremities are rigid mechanical links with muscles, tendons, ligaments, and soft tissues 
that can set the joint angle to any value in its range. Muscles and tendons can also set the desired angular 
velocity, acceleration, and joint stiffness. The motions of extremities are called trajectories and consist of 
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a sequence of positions, velocities, and accelerations of any part of the system (e.g., hand, fingertip, foot, 
etc.). It is anticipated that by using laws of mechanics one can determine necessary muscle forces to 
follow the given trajectory. 

A body segment, when exposed to physiological loading, can be modeled as a rigid body, even though 
it contains bones and soft tissue (e.g., muscles, tendons, ligaments, skin etc.). Inertial properties of a 
rigid body are its mass and the inertia tensor. The inertia tensor for a rigid body provides information 
on inertial properties of the body when rotating. The mass for a rigid body describes its inertia for 
translation type of movements. In reality, the inertial properties change: the center of mass is shifting 
because the masses move relatively during contractions and passive stretching of muscles, relative move- 
ments of bones and other soft tissues modify the geometry and polycentric joints virtually change the 
length of a body segment. 

The second issue in the modeling relates to joint structures. A joint is modeled as a kinematic pair. 
The theory of mechanisms defines connection of neighboring segments as kinematic pairs having up to 
five degrees of freedom (DOF). A typical human joint is a rotational kinematic pair having three DOF 
(ball joint) or one DOF (pin joint). However, many movements can not be simplified to a pure pin or 
ball joint rotation. Some human joints are actually double mechanical joints (e.g., ankle joint consists 
of one joint for internal-external rotation and inversion-eversion, and the other for movements in sagittal 
plane (flexion-extension). Sometimes the body segment rotation results from more complex structures 
such as two parallel bones connected between two neighboring joints (e.g., supination-pronation). 
Human joints are spatial and polycentric, and the displacement of their center of rotation results in the 
virtual change of the body segment. 

Speaking of mechanics, one needs to distinguish the forward kinematics from the inverse kinematics. 
The direct kinematics is concerned with the determination of the position of body parts in the absolute 
reference frame, given joint angles, whereas the inverse kinematics computes joint angles for a given 
position of the body part. Of these two, the direct kinematic analysis is the simplest by far, with a 
straightforward solution for the unique end point of the body part corresponding to the given joint 
angles. On the other hand, the feasibility of an inverse kinematics solution depends on the redundancy 
of the structure as well as offsets and constraints. 

The redundancy is an important feature in biological systems. The concept of redundancy requires a 
distinction between a degree of freedom and a permissible motion (e.g., rotation). The term DOF pertains 
exclusively to that permissible motion which is independent from other motions. A free rigid body has 
a maximum of six DOF, and a rigid body connected to another rigid body by a ball joint can have a 
maximum of three DOF; thus, any body segment has less than three DOF. Speaking of a structure as a 
whole, the total number of permissible rotations is termed system DOF. Some permissible rotations are 
not independent and they should not be called DOF (e.g., two parallel hinge joints). It is difficult to 
determine which permissible rotations should be controlled, and which will be constrained by themselves 
due to anatomical and physiological limitations. 

Decomposing the linked structure into a set of single links is essential for numerical simulation. Internal 
forces within the body generated by muscle activities become external forces and torques when the 
dynamics of a single link are considered. The muscle is a linear actuator, with a very complex behavior 
depending on its length, velocity of shortening, firing rate, recruitment, and type. Muscle models [34] 
can be grouped to three classes: 1) Cross-bridge models including sarcomere-microscopic, conventional 
cross bridge, and unconventional cross-bridge models; 2) Fiber models; and 3) Whole muscle-macro- 
scopic models (viscoelastic and black box models). The distinctions between these classes are not absolute 
(e.g., micro-macro). 

The modeling and control of movements in this chapter relates to external control of muscles via so- 
called functional electrical stimulation. Macroscopic viscoelastic models started from the observation 
that the process of electrical stimulation transforms the viscoelastic material from a compliant, fluent 
state into the stiff, viscous state. Levin and Wyman [35] proposed a three-element model-damped and 
undamped elastic element in series. Hill's work [36] demonstrated that the heat transfer depends upon 
the type of contraction (isometric, slow contracting, etc.). The model includes the force generator, 
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damping and elastic elements. Winters [37] generalized Hill's model in a simple enhancement of the 
original, which makes no claim to be based on the microstructure. Winters' model deals with joint 
moments: 



M=M -B®,M = AM j(@) 



M is the muscle moment exerted about the joint, M is the isometric moment (active state) with a 
maximum possible value M 0m , is the joint angle corresponding to the current contractile element 
length, A is a normalized activation variable (A = for relaxed muscle, A = 1 for a tetanized muscle), 
/(©) is the normalized tetanic length-tension relation, and B is a variable damping coefficient. For 
shortening muscle: 



whereas for lengthening muscle: 



B=M o (l+a)/(0 + a0 o ),0<O 



5=M o (l-p)/(0 + Y o ),0>O 



where a, (3 and y are constants. 

Zajac [38] proposed an appealing new version of the Hill model designed primarily for use in studies 
of mechanical interactions in multiple muscle systems. Muscles are connected to a skeleton with tendons, 
and they have spring like behavior when stretched. A variety of different non-rigid structures are included 
in the musculoskeletal system (e.g., ligaments) adding different elastic and damping effects. Nonlinear 
models of electrically stimulated muscle have been reviewed by Durfee [39]. Block models are only one 
of several possible structures which can be used in biological system identification. Where there are only 
two blocks, one of which is nonlinear, the nonlinearity may be placed before or after the dynamic block, 
in so called Hammerstein or Wiener forms. While techniques for identifying the parameters for these 
models have been investigated by Kearney and Hunter [40], those working in the field of functional 
electrical stimulation have usually adopted the Hammerstein structure. 

Dynamic analysis deals with two problems: determination of forces and torques when desired laws of 
motion are known {forward dynamics), and determination of trajectories and laws of motion when 
torques and forces are prescribed {inverse dynamics). Forward dynamics is a simple problem, and the 
only difficulty comes from errors generated through numerical derivation on noisy input data. Using 
laws of mechanics (D'Alembert principle [41,42], Newton-Euler Method [38, 43-47], Lagrange equations 
[48-51], Basic theorems of mechanics [52], etc.), in inverse dynamics, work only for open chain structures 
such as an arm when reaching, or a leg during the swinging, but it is not applicable for closed chain 
structures such as legs during the double support phase in walking, or the arm when contacting an 
immobile object, nor in redundant systems. Every closed and chained structure is dynamically undeter- 
mined and the only possibility to determine forces and torques is to use the theory of elasticity. The 
theory of elasticity assumes that the elements of the structure are solid (not rigid), and it is inappropriate 
for control design because of its extreme complexity. Approximate solution of the dynamics of a closed 
chain mechanism assumes how forces and torques are distributed in the structure. In the case of loco- 
motion, it is necessary to introduce at least three equations in order to be able to determine the dynamics 
of the process. In most cases the chain is limited to a small number of admissible rotations. The reason 
for simplification is twofold: the problem can be solved, and the simulation results can be interpreted. 

The complexity of a body segment dynamical model depends on a number and types of body segments, 
the joints connecting the segments, and the interaction among the segments and the environment. How 
complex a model is required for dynamic analysis of externally controlled walking, standing, manipulation, 
and grasping? Some decisions on how complex the model should be are simple while others are not. For 
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FIGURE 165.2 A sketch of a human body used for modeling of locomotion. Rigid body segments (left) are driven 
with two pairs of muscles (right). See the text for details. 

instance, lower limbs can be assumed rigid during standing and walking. More controversial is the decision 
on how many segments should be assumed, and how many DOF should be assumed for each joint. 

Several studies deal with strategies for prolonged standing [53-58] and functional electrical stimulation 
aided gait [59-64]. 

Recent studies are concerned with estimating parameters required for modeling and simulation 
[65-74]. While many of the results are very useful, they are incomplete and difficult to reproduce. 

165.3 Analytic Models for Control of Movements 

Attempts to analyze multi-joint structures are often characterized by reducing the number of DOF to a 
manageable level. Locomotion is often decomposed into a component in the sagittal plane, and smaller 
components in the frontal and horizontal planes. Using a smaller number of links is essential for 
simulation of gait. 

A double, planar, pendulum with a moving hanging point, representing a human leg, is used to 
illustrate the modeling. The remaining part of the body is represented as a single force acting toward the 
moving hanging point (hip joint), called the hip joint torque (Fig. 165.2). Two pairs of muscles acting 
around the hip and knee joints actuate the leg. Both monoarticular and biarticular muscles add to the 
joint torques. Estimating the relative contribution of monoarticular and biarticular muscles is very 
difficult but possible [e.g., 75]. 

The model does not include ankle and phalangeal joints. The following system of differential equations 
describes the dynamics: 

>y> s + A 2 (|) T cos((f> r -(j) s l + A$\ sin((f> r -<(> s )- A 4 x H sin§ s 

- A s\Yh + s) COS <t>S - X G L S Sin <j> s + Y G L S COS <t>S = M S 

BJty r + 5 2 (() s cos ( r - (j> s I + 5 3 § J sin ( <|> r - s J 
— B 4 x H sm§ T — BAy H + g\ cos§ T — x G L T sin(j> r + Y G L r cos(j) r = M t 
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A 2 = m s d s L T , B 2 = A 2 , A 3 = -A,, 5 3 = B 2 
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. d s , B 4 = m s L T + m T d T , A- = -A 4 , B 5 



(165.2) 



O 2000 by CRC Press LLC 



There are several torques contributing to the total torques acting at the links: 



M c 



-M K , M T = M K + M H 



M K = M f K -M e K -M' K ,M H = M f H -M e H -M' H 



(165.3) 



The following notations are used: S — the shank segment (including the foot); X — the thigh segment; 
CS, CT — the center of the mass of the shank and thigh segments; H — the hip joint; K — the knee joint; 
G — the point of ground contact; d s , d T — distances of the proximal joint to the centers of the masses; L s , 
Lj- — lengths of the shank and thigh; m s , m T — masses of the shank and thigh; J cs , J CT — moments of inertia 
of the shank and thigh about the central axes perpendicular to xOy plane; g — gravitational acceleration; 
F ti — force acting at the hip joint; Y G , Y G — horizontal and vertical components of the ground reaction 
force; M s , M T — total torques acting at the shank and thigh segments; M K , M H — joint torques at the knee 
and hip joints; x H , y H — horizontal and vertical components of the hip acceleration; (|) s , § T — angles of the 
shank and thigh vs. the horizontal axis (Ox); § K , § H — angles of the knee and hip joint; § TR — angle of the 
trunk vs. the horizontal axis (Ox). 

Flexion of a joint is defined to be the positive direction for angular changes; hence, the flexor torque 
is assumed to be positive. Index / is for the equivalent flexor muscle, and index e for the equivalent 
extensor muscle. The contribution of passive tissue crossing the joints is included by a 'resistive' torque 
(index r), which will be described below. 

A three multiplicative factor model used is a modified version of the Hill model (Fig. 165.3). Active 
muscle force depends on the neural activation, muscle length and velocity of shortening or lengthening 
[41, 42]. The model is formulated as a function of joint angle and angular velocity, rather than muscle 
length and velocity. The three-factor model is given by: 



M =A\ 



(«)/(♦)«(♦) 



(165.4) 



where M a is the active torque generated by a muscle contraction; A(u) is the dependence of a torque on 
the level of muscle activity, w, depending on the stimulus amplitude, pulse width, and stimulus frequency; 
/(())) is the dependence on the angle <|); and g{§ ) is the dependence on the angular velocity <j). The parameter 
u is normalized to the range < u < 1. According to the literature [76], the muscle model described by 
(1) can predict the muscle torque with 85-90% accuracy during simultaneous, independent, pseudo- 
random variations of recruitment, angle and angular velocity. 




FIGURE 165.3 A three factor multiplicative model of a muscle (active torque). The model includes antagonist 
muscle (passive torque). 
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Activation A(u). In accordance with the literature [76-79] the muscle response to electrical stimulation 
was approximated by a second-order, critically damped [76,77], low-pass filter with a delay. Thus, the 
activation dynamics were assumed to be expressed by where A(j(o) is the Fourier transform of the muscle's 
contractile activity: 

a(j'co) g) 2 

-7— ( = l e-> mt * (165.5) 

Unco I co 2 + 2 jcoco^ + co 2 



U(j(o) is the Fourier transform of the muscle's electrical activity, ©„ is the muscle's natural (pole) frequency 
(typically in the range of 1-3 Hz, t d is the excitation-contraction and other delays in the muscle. 

joint torque (/"(<(>)) dependence on a joint angle. The nonlinear function relating torque and joint angle 
is simulated by a quadratic curve, F=a + a l § + a 2 (|> 2 . The torque generated can not be negative for 
any angle, so: 

, > [i?ifF>0 
fU) = \ (165.6) 

[OifF<0 

Coefficients a, define the shape of the torque-angle curve and they are determined as the best fit through 
the experimental data recorded. The quadratic polynomial was selected as the simplest adequate fitting 
curve as described elsewhere [73,80]. Note that each of the quantities is non-negative. 

Normalized joint torque (g((j))) dependence on the angular velocity. The normalized joint torques vs. 
joint angular velocities can be modeled piecewise linearly: 



>W : 



c ip <j)<(l -c„)/ c „ 
-c i .<j) ) (l-c i .)/c,<<i)<l/c i . (165.7) 

0,l/c tj <^ K 



The coefficients c tj determine the slope and saturation level of the linearized torque vs. velocity of the 
muscle shortening. These coefficients can be determined using the method described in [81]. 

Resistive joint torque. Soft tissue, passive stretching of antagonistic muscles and ligaments introduce 
nonlinearities, which can be modeled as: 



Ml 



K 



■■ d ll U K -^> K o) + d u ^ K + d u e d '^ K - d l5 e dl ^ K (165.8) 



The presented form was developed heuristically, and it shows that the resistive torques depend on 
both the joint angle and its angular velocity. The first two terms are the contributions of passive tissues 
crossing the joints (dissipative properties of joints) reduced to first-order functions. The other terms are 
the nonlinear components of the resistive torques around the terminal positions, and are modeled as a 
double exponential curve [73]. 

165.4 Nonanalytic Modeling of Movements 

As stated earlier, the control of movements in humans is a highly complex task. The model of the body 
and extremities is complicated and not observable per se, the trajectories are not known in advance, the 
driving resources are user specific and time variable; and thus, real-time control using analytical tools is 
probably not feasible. 
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FIGURE 165.4 A scheme of the organization of a skill-based multilevel control for locomotion. Regular reflexes 
are sequences of system states connected with conditional If-Then expressions within a single mode of the gait, mode 
reflexes relate to the change of the mode of the gait (level, slope, stairs, etc.), and hazard reflexes are for situations 
where software or hardware cannot drive the system upon the preferred sequence of system states. 

An alternative is to model the process of locomotion as a black box with multiple inputs and outputs. 
Tomovic [82] proposed a non-numerical, control method, called Artificial Reflex Control (ARC). ARC 
(Fig. 165.4) refers to a skill-based expert using rules that have an If-Then structure [83]. The locomotion 
is presented as a sequence of discrete events. Each of these discrete events can be described in terms of 
a sensory pattern and a motor activity. A sensory pattern occurring during particular motor activity can 
be recognized with the use of human made (artificial or natural) sensors. The specific discrete event is 
called the state of the system by analogy to the state of finite-state automata [82,84]. This skill-based 
expert system is of the On-Off type and does not explicitly consider the system dynamics. 

A rule-base control system has a hierarchical structure. The highest level is under volitional control 
of the user. Automatic adaptation to environmental changes and modes of gait is realized using artificial 
reflex control. The execution of the artificial reflex has to be tuned for smooth functional movements. 
Advantages of this hierarchical control method are the following: (1) adaptivity, (2) modularity, (3) ease 
of application and possibility of integration into a man-machine system. Rule-based control was tested 
in hybrid assistive systems [85,86] and artificial legs [83]. 

Nonanalytic control can be described as cloning the biological control [87,88]. The basic mechanisms 
for implementation of such algorithms are rule-based controllers (RBC) [89]. RBC were applied for 
assistive systems to restore locomotion with handcrafted [82,85,90,91], and lately an automatically deter- 
mined knowledge base [92-97] with limited success. A RBC is a system implementing "If-Then" produc- 
tion rules, where the "If" part describes the sensory and motor state of the system, while the "Then" part 
of a rule defines the corresponding motor activity to follow. 

It was hypothesized that machine learning (ML) can acquire the needed knowledge for RBC (see the 
Chapter of Abbas, this book). Learning in general can be described as capturing and memorizing of 
connectivisms between facts (Fig. 165.5). ML is a computerized capturing and memorizing process. In 
the most recent study, [93] used simulation results of a fully customized biomechanical model as inputs 
and outputs required for a ML. The following MLs were compared: 1) a multilayer perceptron (MLP) 
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FIGURE 165.5 A mapping obtained as a result of the Radial basis function artificial neural network (RBF ANN) 
learning algorithm. The knee joint angle sensory information for first four consecutive strides (full line, bottom 
panel), the ground reactions and the hip accelerations were used as inputs for training. The activation of knee flexor 
muscles for the same four strides (full line, upper panel) was used for output during the training. The upper panel 
presents superimposed desired activation of muscles (full line) and a predicted activity of the muscle by a RBF ANN 
(dashed line) obtained from a trained network for seven consecutive steps. See text for details. 

with the Levenberg-Marquardt improvement of a backpropagation (BP) algorithm [98]; 2) an adaptive- 
network-based fuzzy inference system (ANFIS) [99]; and 3) combination of an entropy minimization 
type of inductive learning (IL) technique [92,97,100] and a radial basis function (RBF) type of artificial 
neural network (ANN) [101,102] with an orthogonal least squares (OLS) learning algorithm [101]. 

A comparison of IL with adaptive logic networks (ALN) using restriction rules [95,96] shows that 
ALNs have some advantages over IL. Heller et al. [92] compared an IL method based on an algorithm 
called "hierarchical mutual information classifier" [103] with a MLP with a BP algorithm in reconstruct- 
ing muscle activation from kinematic data during normal walking. The conclusion was that both tech- 
niques show comparable performance, although each technique has some advantages over the other one. 
A comparison of the IL method based on minimization of entropy and the RBF network in predicting 
muscle activation and sensory data from the history of sensory data for a human with spinal cord injury 
(SCI) [93], shows that the best generalization comes from combining both. The benefits of merging fuzzy 
logic and an ANN were explored extensively in the literature [99-104]. 

Investigations of multilayer perceptrons (MLPs) have been intensified since the formulation of the BP 
learning algorithm [98]. An MLP is a feed-forward network, typically consisting of several layers of 
nonlinear processing nodes called hidden layers with a linear output layer. Processing nodes take as input 
only the outputs of the previous layer, which are combined as a weighted sum and then passed through 
a nonlinear processing function known as the "activation function". This activation function is typically 
sigmoidal in shape. An MLP with three hidden layers can form arbitrarily complex decision regions and 
can separate classes that are meshed together. It can form regions as complex as those formed using 
mixture distributions and nearest neighbor classifiers [105]. 

The BP learning algorithm is a generalization of a gradient descent algorithm. The BP algorithm may 
lead to a local, rather than a global error minimum. If the local minimum found is not satisfactory, use 
of several different sets of initial conditions or a network with more neurons can be tried. A simple BP 
algorithm is very slow because it must use low learning rates for stable learning. There are ways to improve 
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the speed and general performance of a BP algorithm. It can be improved in two different ways: by 
heuristics and by using more powerful methods of optimization. Speed and reliability of BP can be 
increased by techniques called momentum and adaptive learning rates (e.g., Levenberg-Marquardt mod- 
ification of the BP algorithm). 

Fuzzy sets are a generalization of conventional set theory. They were introduced by Zadeh [106] as a 
mathematical way to represent vagueness in everyday life. A formal definition of fuzzy sets that has been 
presented by many researchers is the following: a fuzzy set A is a subset of the universe of discourse X 
that admits partial membership. The fuzzy set A is defined as the ordered pair A = {x, m A (x)}, where x e 
Xand < m A {x) < 1. The membership function m A (x) describes the degree to which the object x belongs 
to the set A, where m A {x) = represents no membership, and m A (x) = 1 represents full membership. 

One of the biggest differences between conventional (crisp) and fuzzy sets is that every crisp set always 
has a unique membership function, whereas every fuzzy set has an infinite number of membership 
functions that may represent it. This is at once both a drawback and advantage; uniqueness is sacrificed, 
but this gives a concomitant gain in terms of flexibility, enabling fuzzy models to be "adjusted" for 
maximum utility in a given situation. 

The typical steps of "fuzzy reasoning" consist of: (1) Fuzzification, that is comparing the input variables 
with the membership functions of the premise (IF) parts in order to obtain the membership values 
between and 1; (2) Weighing, that is applying specific fuzzy logic operators (e.g., AND operator, OR 
operator, etc.) on the premise parts membership values to get a single number between and 1; 
(3) Generation, that is creating the consequent (THEN) part of the rule; and (4) Defuzzification, that is 
aggregating the consequent to produce the output. 

There are several kinds of fuzzy rules used to construct fuzzy models. These fuzzy rules can be classified 
into the following three types according to their consequent form [107]: 

1. crisply defined constant in the consequent: 

R ; : IF Xj is A (1 and ...and x m is A im THEN y is c ; 

2. a linear combination of the system's input variables in the consequent: 

R;: IF Xj is A u and ...and x m is A im THEN y is g i (x 1 ,...,x m ) = b + b^ + ... + b m x m 

3. fuzzy set in the consequent: 

Rj: IF Xj is A n and ...and x m is A im THEN y is B; 

where R ; is the i-th rule of the fuzzy system, X: (j = 1, 2,.,.m) are the inputs to the fuzzy system, and y is 
the output from the fuzzy system. The linguistic terms, A s and B i; are fuzzy sets, q and bj denote crisp 
constants. 

The so-called zero-order Sugeno, or Takagi-Sugeno-Kang fuzzy model [108] has rules of the first type, 
whereas the first-order Sugeno fuzzy model has rules of the second type. The easiest way to visualize the 
first-order Sugeno fuzzy model is to think of each rule as defining the location of a "moving singleton" 
(a single spike from the consequent) depending on what the input is. Sugeno models are similar to the 
Mamdani model [109] which has rules of the third type, and which is more intuitive, but computationally 
less efficient. Fuzzification and weighing, are exactly the same, but generation and defuzzification are 
different [107]. 

For the type of fuzzy rules used in the Mamdani model various methods are available for defuzzification: 
the centroid of area, bisector of area, middle of maximum, largest of maximum etc. [110], but all of 
these methods are based on the calculation of the two-dimensional-shape surface, that is on the integra- 
tion. The Sugeno-style enhances the efficiency of the defuzzification process because it greatly simplifies 
the computation; i.e., it has to find just the weighted average of a few data points. The implication method 
(generation) is simply multiplication, and the aggregation operator just includes all of the singletons. 
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Membership functions are subjective and context-dependent, so there is no general method to deter- 
mine them. Currently, when fuzzy set theory is applied in control systems, the system designers are given 
enough freedom to choose membership functions and operators, usually in a trial and error way. After 
a hand-tuning process, the system can function effectively. However, the same methodology is hardly 
applicable when the system is a general purpose one, or when the context changes dynamically. This 
suggests an explanation why the most successful applications of fuzzy logic happen in control systems, 
rather than in natural language processing, knowledge base management, and general purposes reasoning. 

The inductive learning (IL) is a symbolic technique, which uses supervised learning and generates a 
set of "if-then-else" decision rules. A method [97] is based on an algorithm called a "hierarchical mutual 
information classifier" [92, 103]. This algorithm produces a decision tree by maximizing the average 
mutual information at each partitioning step. It uses Shannon's entropy as a measure of information. 
Mutual information is a measure of the amount of information that one random variable contains about 
another random variable. It is a reduction in the uncertainty of one random variable due to the knowledge 
of the other. An effective method of integrating results of a mutual information algorithm into a pro- 
duction rule formalism, following the original work of Pitas [ 1 00, 111] was done by Nikolic [112]. While 
generating the decision tree, the algorithm performs a hierarchical partitioning of the domain multidi- 
mensional space. Each new node of the decision tree contains a rule based on a threshold of one of the 
input signals. Each new rule further subdivides the example set. The training is finished when each 
terminal node contains members of only one class. An excellent feature of this algorithm is that it 
determines threshold automatically based on the minimum entropy. This minimum entropy method is 
equivalent to the determination of the maximum probability of recognizing a desired event (output) 
based on the information from the input. 

Radial basis function (RBF) network is a feed-forward network. The RBF used here has a single output 
node and a single hidden layer which contains as many neurons as are required to fit the function within 
the specifications of error goal. The transformation from the input space to the hidden-unit space is 
nonlinear, whereas the transformation from the hidden-unit space to the output space is linear. A 
common learning algorithm for RBF networks is based on first randomly choosing some data points as 
radial basis function centers and then using singular value decomposition to solve for the weights of the 
network. An arbitrary selection of centers may not satisfy the requirement that centers should suitably 
sample the input domain. Furthermore, in order to achieve a given performance, an unnecessarily large 
RBF network may be required. Since a performance of an RBF network critically depends upon the 
chosen centers, we used an alternative learning procedure based on the OLS learning algorithm [101]. 
By providing a set of inputs and corresponding outputs, the values of weights and bias, and radial basis 
function centers (parameters for the RBF network) can be determined using the OLS algorithm in one 
pass of the learning data so that a network of an adequate size can be constructed. 

When an input vector is presented to such a network, each neuron in the hidden layer will output a 
value according to how close the input vector is to the centers vector of each neuron. The result is that 
neurons with centers vector very different from the input vector will have outputs near zero. These small 
outputs will have a negligible effect on the linear output neurons. In contrast, any neuron whose centers 
vector is very close to the input vector will output a value near 1. If a neuron has an output of 1, its 
output weights in the second layer pass their values to the neuron in the second layer. The width of an 
area in the input space to which each radial basis neuron responds can be set by defining a spread constant 
for each neuron. This constant should be big enough to enable neurons to respond strongly to overlapping 
regions of the input space. The same spread constant is usually selected for each neuron. 

165.5 Hybrid Modeling of Controllers 

Feedback Error Learning (FEL) is a hybrid technique [113] using mapping to replace the estimation of 
parameters within the feedback loop in a closed-loop control scheme. FEL is a feed-forward neural 
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FIGURE 165.6 Model for the hybrid modeling of controllers using feedback error learning algorithm. 

network structure, under training, learning the inverse dynamics of the controlled object. This method 
is based on contemporary physiological studies of the human cortex [114], and is shown in Fig. 165.6. 

The total control effort, u, applied to the plant is the sum of the feedback control output and network 
control output. The ideal configuration of the neural network would correspond to the inverse mathe- 
matical model of the system's plant. The network is given information of the desired position and its 
derivatives, and it will calculate the control effort necessary to make the output of the system follow the 
desired trajectory. If there are no disturbances, the system error will be zero. 

The configuration of the neural network should represent the inverse dynamics of the system when 
training is completed. It was prudent to use a total energy approach as the basis for the neural network, 
because only the commanded input and its first derivative is required for the FEL controller. By com- 
parison, if the FEL were based on the mathematical model, the second derivative would be needed for 
the neural network to operate [115]. 

Figure 165.7 depicts a more in-depth explanation of the FEL strategy. The system input and output 
are labeled d and 8, respectively. The proportional-plus-derivative (PD) feedback controller is included 
to provide stability during training of the neural network [114,117,118]. Enclosed in the dashed rectangle 
is an FEL controller, which outputs the necessary control signal, based on the desired inputs. The total 
energy of the system (nn) is calculated through parallel processing within the neural network, consisting 
of the functionals (A,B,C. . . ) obtained from the total energy expression, and the synaptic weights w i5 wj, 
and w k . In addition to the above-mentioned synaptic weights, W[ is associated with damping losses. The 
training of the FEL controller is facilitated by changing the synaptic weights based on the output from 
a PD controller. The learning rule used, was proposed by Kawato, Furukawa, and Suzuki [119] and it is 
given as follows: 

w. =w. ,, +u D „ATl At 

mew tola PD I 

where w jnew is the new value of the synaptic weight, w jold is the old value, u PD is the output from the PD 
controller, A is the network functional associated with weight w ; , r| is the learning rate and At is the 
integration step used in the computer simulation. A learning rate is included to control the rate of growth 
of the synaptic weights. The learning rule, as proposed by Kawato et al. [ 1 13] , is based on the assumption 
of slow growth of the synaptic weights. The weights are initialized at zero, and the learning rates adjusted 
so the growths of the weights are uniform. This causes the weights to reach their final value at the same 
point in time, causing the error to approach zero. Subsequently, the learning as a function of error will 
level off, and the training of the neural network will be completed. However, if the growth of the weights 
is not homogeneous, it will result in an unbounded growth of the weights. The vertical line in Figure 165.7 
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Scheme of the feedback error learning controller used for the one degree of freedom system. See 



pointing upward through w„ Wj, w k and w, symbolizes the learning. After the total energy is calculated, 
the time derivative is taken and divided by the desired velocity. The losses are calculated by multiplying 
the desired velocity by the weight w b and are then added to the control signal. Finally, the control effort 
from the FEL controller is added to the PD control effort. 

In essence, the output of the feedback controller is an indication of the miss-match between the 
dynamics of the plant and the inverse-dynamics model obtained by the neural network. If the true inverse- 
dynamic model has been learned, the neural network alone will provide the necessary control signal to 
achieve the desired trajectory [118,120]. 

Defining Terms 

Actuators: Device that powers a system. 

Artificial reflex control: Expert system using rule-based control. 

Closed-loop system: Control system which uses information about the output to correct the control 

parameters to minimize the error between the desired and actual trajectory. 
Controller: Process by which the controls are generated. 
Controls: The signals driving the actuators. 
Degree of freedom: Independent variable defining the position. A free rigid body has six degrees of 

freedom, a ball joint three, and a hinge joint one. 
Dynamic analysis: Analytic simulation of movements of the system considering forces, torques and 

kinematics. Forward dynamics uses the geometry and kinematics as input, and provides forces and 

torques as outputs; inverse dynamics starts from forces and torques and determines kinematics 

and geometry of the system. 
Hierarchical control: Multilevel control allowing the vertical decomposition of the system. 
Kinematic analysis: Analytic simulation of movements of the system considering positions, velocities 

and accelerations. 
Kinematic pair: Connection of two neighboring segments. 
Nonanalytic control: Mappings between inputs and outputs to be used for control. 
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Open-loop control: Control method that uses a prestored trajectory and the model of the system to 
control the plant. 

Plant: The mechanical system being controlled. 

Production rule: "If-Then" conditional expression used in expert systems. 

Reference-based open loop control: Control which precomputes and stores control signals, and exe- 
cutes the desired motor task in real-time. 

Rigid body: Set of material points with the distances between points being fixed. 

State: The configuration of the system at any instant in time. 

Trajectory: The time histories of the plant states in response to the control signals. 
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This chapter presents a broad overview of the fast eye movement control system. A fast eye movement 
is usually referred to as a saccade, and involves quickly moving the eye from one image to another image. 
This type of eye movement is very common and is observed most easily while reading when the end of 
a line is reached, the eyes are moved quickly to the beginning of the next line. A qualitative description 
of the fast eye movement system is given first in the introduction and then followed by a brief description 
of saccade characteristics. Next, the earliest quantitative saccade model is presented, followed by more 
complex and physiologically accurate models. Finally, the saccade generator, or saccade controller, is then 
discussed on the basis of anatomic pathways and control theory. The purpose of this review is focused 
on mathematical models of the fast eye movement system and its control strategy, rather than on how 
visual information is processed. The literature on the fast eye movement system is vast, and thus this 
review is not exhaustive but rather a representative sample from the field. 

The oculomotor system responds to visual, auditory, and vestibular stimuli, which results in one of 
five types of eye movements: fast eye movements, smooth pursuit eye movements, vestibular ocular 
movements, vergence eye movements, and optokinetic eye movements. Each of these movements is 
controlled by a different neuronal system, and all these controllers share the same final common pathway 
to the muscles of the eye. In addition to the five types of eye movements, these stimuli also cause head 
and body movements. Thus, the visual system is part of a multiple input-multiple output system. 

Regardless of the input, the oculomotor system is responsible for movement of the eyes so that images 
are focused on the central 1/2° region of the retina, known as the fovea. Lining the retina are photore- 
ceptive cells which translate images into neural impulses. These impulses are then transmitted along the 
optic nerve to the central nervous system via parallel pathways to the superior colliculus and the cerebral 
cortex. The fovea is more densely packed with photoreceptive cells than the retinal periphery; thus a 
higher resolution image (or higher visual acuity) is generated in the fovea than the retinal periphery. The 
purpose of the fovea is to allow us to clearly see an object, and the purpose of the retinal peripheral is 
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to allow us to detect a new object of interest. Once a new object of interest is detected in the periphery, 
the saccade system redirects the eyes, as fast as possible, to the new object. This type of saccade is typically 
called a goal-directed saccade. 

During a saccade, the oculomotor system operates in an open-loop mode. After the saccade, the system 
operates in a closed-loop mode to ensure that the eyes reach the correct destination. The reason that the 
saccade system operates without feedback during a fast eye movement is simple — information from the 
retina and muscle proprioceptors is not transmitted quickly enough during the eye movement for use 
in altering the control signal. 

The oculomotor plant and saccade generator are the basic elements of the saccadic system. The 
oculomotor plant consists of three muscle pairs and the eyeball. These three muscle pairs contract and 
lengthen to move the eye in horizontal, vertical, and torsional directions. Each pair of muscles acts in 
antagonistic fashion due to reciprocal innervation by the saccade generator. For simplicity, the models 
described here involve only horizontal eye movements and one pair of muscles, the lateral and medial 
rectus muscle. 

166.1 Saccade Characteristics 

Saccadic eye movements, among the fastest voluntary muscle movements the human is capable of 
producing, are characterized by a rapid shift of gaze from one point of fixation to another. The usual 
experiment for recording saccades is for subjects to sit before a horizontal target display of small light 
emitting diodes (LEDs), with instructions to maintain their eyes on the lit LED by moving their eyes as 
fast as possible to avoid errors. A saccade is made by the subject when the active LED is switched off and 
another LED is switched on. Saccadic eye movements are conjugate and ballistic, with a typical duration 
of 20-100 ms and a latency of 150-300 ms. The latent period is thought to be the time interval during 
which the CNS determines whether to make a saccade, and if so, calculates the distance the eyeball is to 
be moved, transforming retinal error into transient muscle activity. 

Generally, saccades are extremely variable, with wide variations in the latent period, time to peak 
velocity, peak velocity, and saccade duration. Furthermore, variability is well coordinated for saccades of 
the same size; saccades with lower peak velocity are matched with longer saccade durations, and saccades 
with higher peak velocity are matched with shorter saccade durations. Thus, saccades driven to the same 
destination usually have different trajectories. 

To appreciate differences in saccade dynamics, it is often helpful to describe them with saccade main 
sequence diagrams [Bahill et al., 1975]. The main sequence diagrams plot saccade peak velocity-saccade 
magnitude, saccade duration-saccade magnitude, and saccade latent period-saccade magnitude. Shown 
in Fig. 166.1 are the main sequence characteristics for a subject executing 54 saccades. Notice that the 
peak velocity-saccade magnitude is basically a linear function until approximately 10°, after which it 
levels off to a constant for larger saccades. Many researchers have fit this relationship to an exponential 
function. The solid lines in Fig. 166.1a include an exponential fit to the data for positive and negative 
eye movements. The lines in the first graph are fitted to the equation 



V = a 



\-e* 
V 



M66.1) 



where V is the maximum velocity, x the saccade size, and the constants a ; and (3, evaluated to minimize 
the summed error squared between the model and the data. Note that a ; is to represent the steady state 
of the peak velocity-saccade magnitude curve and (3, is to represent the time constant for the peak 
velocity-saccade magnitude curve. For this data set, a, equals 825 and 637, and P, equals 9.3 and 6.9, for 
positive and negative movements, respectively. The exponential shape of the peak velocity-saccade 
amplitude relationship might suggest that the system is nonlinear, if one assumes a step input to the 
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FIGURE 166.1 Main sequence diagrams, {a) peak velocity-saccade magnitude, (b) saccade duration-saccade mag- 
nitude, and (c) latent period-saccade magnitude for 54 saccadic eye movement by a single subject. Source: Enderle 
JD. 1988. Observations on pilot neurosensory control performance during saccadic eye movements. Aviat Space 
Environ Med 59:309. With permission. 

system. A step input would provide a linear peak velocity-saccade amplitude relationship. In fact, the 
saccade system is not driven by a step input, but rather a more complex pulse-step waveform. Thus, one 
cannot conclude that the saccade system is nonlinear solely based on the peak velocity-saccade amplitude 
relationship. 

Shown in Fig. 166.1 fc are data depicting a linear relationship between saccade duration-saccade mag- 
nitude. The dependent between saccade duration and saccade magnitude also might suggest that the 
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system is nonlinear, if one assumes a step input. Since the input is not characterized by a step waveform, 
one cannot conclude that the saccade system is nonlinear solely based on the saccade duration-saccade 
magnitude relationship. 

Shown in Fig. 1 66. 1 c is the latent period-saccade magnitude data. It is quite clear that the latent period 
does not show any linear relationship with saccade size, i.e., the latent period's value is independent of 
saccade size. In the development of the oculomotor plant models, the latent period will be implicitly 
assumed within the model. 

166.2 Westheimer's Saccadic Eye Movement Model 

The first quantitative saccadic eye movement model, illustrated in Fig. 166.2, was published by Westhe- 
imer [ 1 954] . Based on visual inspection of a recorded 20° saccade, and the assumption of a step controller, 
Westheimer proposed the following second order model: 



J® + B® + K@ = X 



(') 



(166.2) 



To analyze the characteristics of this model and compare it to data, it is convenient to solve Eq. (166.2) 
for peak velocity and duration through Laplace analysis. The transfer function of Eq. (166.2), written in 
standard form, is given by 



<# 



x(s) 5 2 +2^C0 n S+C0^ 



or 
K 



(166.3) 



! K 



B 



where ©„ = J~J, and C, = 2 JKJ . Based on the saccade trajectory for a 20° saccade, Westheimer estimated 

K>„ = 120 radians per second, and C, = 0.7. With the input x(s) = J- , &{t) is determined as 

K 



v ; K 



1 + - 
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FIGURE 166.2 A diagram illustrating Westheimer's 
[1954] second-order model of the saccade system. The 
parameters J, B, and K are rotational elements for 
moment of inertia, friction, and stiffness, respectively, 
and represent the eyeball and its associated viscoelastic- 
ity. The torque applied to the eyeball by the lateral and 
medial rectus muscles is given by x(r), and is the angu- 
lar eye position. 
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where 



CO^ = C0 n \, 1 ~ C 2 an< i (|> = 31 + tan 



-» -c 



Duration, T p , is found by first calculating 



39 = ye^"' 

3f J^Vi-C 2 



-£(O n cosl(0/ + (|>)-a) d sinla)/ + (|>) 



(166.5) 



then determining X from = 0, yielding 

dt 



«„ x/i-C 2 



(166.6) 



With Westheimer's parameter values, T = 37 ms for saccades of all sizes, which is independent of saccade 
magnitude and not in agreement with the experimental data that have a duration which increases as a 
function of saccade magnitude. 

Predicted saccade peak velocity, ®(t m ), is found by first calculating 



d 2 ® _ -ye 

.2 



^ M „ f 



Ceo \C,& coslco ,f + <|)) + C0 .sinlco ,f + ())) 
^V 1- ^ (166.7) 
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and the determining time at peak velocity, t mv , from 



3^0 

dt 2 



0, yielding 



tan 



f I A 

A-c 2 



CO, 



(166.8) 



Substituting t mv into Eq. (166.5) gives the peak velocity 0(f, m ,). Using Westheimer's parameter values, and 
with the saccade magnitude given by A0 = -p [based on the steady-state value from Eq. (166.3)], we 
have from Eq. (166.5) 



I mv l~ 



55.O2A0 



(166.9) 



that is, peak velocity is directly proportional to saccade magnitude. As illustrated in the main sequence 
diagram shown in Fig. 166.1a, experimental peak velocity data have an exponential form and are not a 
linear function as predicted by the Westheimer model. 

Westheimer noted the differences between saccade duration-saccade magnitude and peak velocity-sac- 
cade magnitude in the model and the experimental data and inferred that the saccade system was not 
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linear because the peak velocity-saccade magnitude plot was nonlinear, and the input was not an abrupt 
step function. Overall, this model provided a satisfactory fit to the eye position data for a saccade of 20°, 
but not for saccades of other magnitudes. Interestingly, Westheimer's second-order model proves to be 
an adequate model for saccades of all sizes, if one assumes a different input function as described in the 
next section. Due to its simplicity, the Westheimer model of the oculomotor plant is still popular today. 

166.3 Robinson's Model of the Saccade Controller 

In 1964, Robinson performed an experiment to measure the input to the eyeballs during a saccade. To 
record the input, one eye was held fixed using a suction contact lens, while the other eye performed a 
saccade from target to target. Since the same innervation signal is sent to both eyes during a saccade, 
Robinson inferred that the input, recorded through the transducer attached to the fixed eyeball, was the 
same input driving the other eyeball. He estimated that the neural commands controlling the eyeballs 
during a saccade are a pulse plus a step, or simply, a pulse-step input. 

It is important to distinguish between the tension or force generated by a muscle, called muscle tension, 
and the force generator within the muscle, called the active-state tension generator. The active-state tension 
generator creates a force within the muscle that is transformed through the internal elements of the muscle 
into the muscle tension. Muscle tension is external and measurable, and the active-state tension is internal 
and unmeasurable. Moreover, Robinson [1981] reported that the active-state tensions are not identical to 
the neural controllers, but described by low-pass filtered pulse-step waveforms. The neural control and 
the active-state tension signals are illustrated in Fig. 166.3. The agonist pulse input is required to get the 
eye to the target as soon as possible, and the step is required to keep the eye at that location. 

Robinson [1964] also described a model for fast eye movements (constructed from empirical consid- 
erations) which simulated saccades over a range of 5-40° by changing the amplitude of the pulse-step 
input. Simulation results were adequate for the position-time relationship, but the velocity-time rela- 
tionship was inconsistent with physiologic evidence. To correct this deficiency of the model, physiologic 
studies of the oculomotor plant were carried out during the 1960s through the 1970s that allowed the 
development of a more homeomorphic oculomotor plant. Essential to this work was the construction 
of oculomotor muscle models. 

166.4 A Linear Homeomorphic Saccadic Eye 
Movement Model 

In 1980, Bahill and coworkers presented a linear fourth-order model of the oculomotor plant, based on 
physiologic evidence, that provides an excellent match between model predictions and eye movement 
data. This model eliminates the differences seen between velocity predictions of the model and the data 
and the acceleration predictions of the model and the data. For ease in presentation, the modification 
of this model by Enderle and coworkers [1984] will be used. 

Figure 166.4 illustrates the mechanical components of the oculomotor plant for horizontal eye move- 
ments, the lateral and medial rectus muscle, and the eyeball. The agonist muscle is modeled as a parallel 
combination of an active state tension generator F AG , viscosity element B AG , and elastic element K LT , 
connected to a series elastic element K SE . The antagonist muscle is similarly modeled as a parallel 
combination of an active-state tension generator F ANT , viscosity element B ANT , and elastic element K LT , 
connected to a series elastic element K SE . The eyeball is modeled as a sphere with moment of inertia J p , 
connected to viscosity element B p and elastic element K p . The passive elasticity of each muscle is included 
in spring K p for ease in analysis. Each of the elements defined in the oculomotor plant is ideal and linear. 

Physiologic support for this model is based on the muscle model by Wilkie [1968], and estimates for 
the extraocular muscle elasticities and the passive tissues of the eyeball are based on experiments by 
Robinson [1981], Robinson and coworkers [1969], and Collins [1975], and studies of extraocular muscle 
viscosity by Bahill and coworkers [1980]. 
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FIGURE 166.3 Agonist and antagonist neurologic control signals (solid lines) and the agonist and antagonist active- 
state tensions (dashed lines). Note that the time constant for activation is different from the time constant for 
deactivation. Source: Enderle JD, Wolfe JW. 1987. Time optimal control of saccadic eye movements. IEEE Trans 
Biomed Eng 34(1 ):43. With permission. 

By summing the forces at junctions 2 and 3 (the equilibrium positions for x 2 and x 3 ) and the torques 
acting on the eyeball, using Laplace variable analysis about the operating point, the linear homeomorphic 
model, as shown in Fig. 166.4, is derived as 



where 



8 ( K s4 K sr( F AG " F Aivr) + B Wag ~ B ag^ant) 



(166.10) 
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FIGURE 166.4 This diagram illustrates the mechanical components of the oculomotor plant. The muscles are 
shown to be extended from equilibrium, a position of rest, at the primary position (looking straight ahead) , consistent 
with physiologic evidence. The average length of the rectus muscle at the primary position is approximately 40 mm 
and at the equilibrium position is approximately 37 mm. 9 is the angle the eyeball is deviated from the primary 
position, and variable x is the length of arc traversed. When the eye is at the primary position, both 9 and x are equal 
to zero. Variables x Y through x t are the displacements from equilibrium for the stiffness elements in each muscle. 
Values x pl through x pt are the displacements from equilibrium for each of the variables x 1 through x t at the primary 
position. The total extension of the muscle from equilibrium at the primary position is x pl plus x p2 or x pi plus x pi , 
which equals approximately 3 mm. It is assumed that the lateral and medial rectus muscles are identical, such that 
x pl equals x pi and x p} and x p2 . The radius of the eyeball is r. Source: Enderle JD, Wolfe JW, Yates JT. 1984. The linear 
homeomorphic saccadic eye movement model — a modification. IEEE Trans Biomed Eng 31:717. With permission. 
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The agonist and antagonist active-state tensions are given by the following low-pass-filtered waveforms 



AG AG 



and F, 
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(166.11) 



where N AG and N ANT are the pulse-step waveforms shown in Fig. 166.3, and % = x ac [u(t) - u(t- fj)] + 
x de u(t— fj) and x ant = x de [u(t) — u(t) — u(t- fj)] + x ac u(t- fj) are the time-varying time constants [Bahill 
etal., 1980]. 

Based on an analysis of experimental evidence, Enderle and Wolfe [1988] determined parameter 
estimates for the oculomotor plant as: K SE = 125 Nnr 1 , K LT = 32 Ntrr 1 , K = 66.4 Nnr 1 , B = 3.1 Nsnr 1 , 
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FIGURE 166.5 Saccadic eye movement in response to a 15° target movement. Solid line is the prediction of the 
saccadic eye movement model with the final parameter estimates computed using the system identification technique. 
Dots are the data. Source: Enderle JD, Wolfe JW. 1987. Time-optimal control of saccadic eye movements. IEEE Trans 
Biomed Eng 34(1 ):43. With permission. 



/ = 2.2 x 10- 3 Ns 2 !^ 1 , B AG 
active-state tensions as 



3.4 Nsrrr 1 , B ANT = 1.2 Nsnr 1 , and 8 = 72.536 x 10 6 , and the steady-state 



1 0.14 + 0.01850 N for®< 14.23° 
| 0.02830 N for 0> 14.23° 



(166.12) 



1 0.14-0.009800 N for 0< 14.23° 
! N for 0> 14.23° 



(166.13) 



Since saccades are highly variable estimates of the dynamic active-state tensions are carried out on a 
saccade-by-saccade basis. One method to estimate the active-state tensions is using the system identifi- 
cation technique, a conjugate gradient search program carried out in the frequency domain [Enderle & 
Wolfe, 1988]. Figures 166.5-166.7 show the system identification technique results for an eye movement 
response to a 15° target movement. A close fit between the data and model prediction is seen in Fig. 166.5. 
Figures 166.6 and 166.7 further illustrate the accuracy of the final parameter estimates for velocity and 
acceleration. Estimates for agonist pulse magnitude are highly variable for saccade to saccade, even for 
the same size — see Fig. 9 of Enderle and Wolfe [1988]. Agonist pulse duration is closely coupled with 
pulse amplitude; as the pulse amplitude increases, the pulse duration decreases for saccades of the same 
magnitude. Reasonable values for the pulse amplitude for this model range from about 0.6N-1.4N. The 
larger the magnitude of the pulse, the larger the peak velocity of the eye movement. 

166.5 Another Linear Homeomorphic Saccadic Eye 
Movement Model 

The previous linear model of the oculomotor plant is derived from a nonlinear oculomotor plant model 
by Hsu and coworkers [1976] and based on a linearization of the force-velocity curve [Bahill et al., 1980]. 
Muscle viscosity traditionally has been modeled with a hyperbolic force-velocity relationship. Using the 
linear model of muscle reported by Enderle and coworkers [1991], it is possible to avoid the linearization 
and derive an updated linear homeomorphic saccadic eye movement model. 
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FIGURE 166.6 Velocity estimates for the saccadic eye movement illustrated in Fig. 166.5. Solid line is the saccadic 
eye movement model velocity prediction with the final parameter estimates computed using the system identification 
techniques. The dots are the two-point central difference estimates of velocity computed with a step size of 3 and a 
sampling interval of 1 ms. Source: Enderle JD, Wolfe JW. 1987. Time-optimal control of saccadic eye movements. 
IEEE Trans Biomed Eng 34(1):43. With permission. 
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FIGURE 166.7 Acceleration estimates for the saccadic eye movement illustrated in Fig. 166.5. Solid line is the 
saccadic eye movement model acceleration prediction with the final parameter estimates computed using the system 
identification techniques. The dots are the 2-point central difference estimates of velocity computed with a step size 
of 4 and a sampling interval of 1 ms. Source: Enderle JD, Wolfe JW. 1987. Time-optimal control of saccadic eye 
movements. IEEE Trans Biomed Eng 34(1):43. With permission. 

The linear muscle model has the static and dynamic properties of rectus eye muscle, a model without 
any nonlinear elements. The model has a nonlinear force-velocity relationship that matches muscle data 
using linear viscous elements, and the length tension characteristics are also in good agreement with 
muscle data within the operating range of the muscle. Some additional advantages of the linear muscle 
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FIGURE 166.8 This diagram illustrates the mechanical components of the updated oculomotor plant. The muscles 
are shown to he extended from equilibrium, a position of rest, at the primary position (looking straight ahead), 
consistent with physiologic evidence. The average length of the rectus muscle at the primary position is approximately 
40 mm and at the equilibrium position is approximately 37 mm. is the angle the eyeball is deviated from the 
primary position, and variable x is the length of arc traversed. When the eye is at the primary position, both and 
x are equal to zero. Variables x l through x 4 are the displacements from equilibrium for the stiffness elements in each 
muscle, and 9 5 is the rotational displacement for passive orbital tissues. Values x p] through x pi are the displacements 
from equilibrium for each of the variables x, through x t at the primary position. The total extension of the muscle 
from equilibrium at the primary position is x pl plus x p2 or x p} plus x pi , which equals approximately 3 mm. It is 
assumed that the lateral and medial rectus muscles are identical, such that x pl equals x pi and x pi equals x p2 . The radius 
of the eyeball is r. 

model are that a passive elasticity is not necessary if the equilibrium point x e = 19.3°, rather than 15°, 
and muscle viscosity is a constant that does not depend on the innervation stimulus level. 

Figure 166.8 illustrates the mechanical components of the updated oculomotor plant for horizontal 
eye movements, the lateral and medial rectus muscle, and the eyeball. The agonist muscle is modeled as 
a parallel combination of viscosity B 2 and series elasticity K SE , connected to the parallel combination of 
active state tension generator F AG , viscosity element B l and length tension elastic element K LT . Since 
viscosity does not change with innervation level, agonist viscosity is set equal to antagonist viscosity. The 
antagonist muscle is similarly modeled with a suitable change in active state tension to F ANT . The eyeball 
is modeled as a sphere with moment of inertia J p , connected to a pair of viscoelastic elements connected 
in series; the update of the eyeball model is based on observations by Robinson [1981]. Each of the 
elements defined in the oculomotor plant is ideal and linear. 

The differential equation describing oculomotor plant model shown in Fig. 166.8 is derived by sum- 
ming the forces acting at junctions 2 and 3 and the torques acting on the eyeball and junction 5, and 
using Laplace variable analysis about the operating point, and given by 



&[ K SE K 12\ F AG - F ANTJ + [ K SE B M + B 2 K u)\ F AG ~ F ANTJ 

■4' 



+ B 2 BJF AG -F ANT 



)) = + P 3 + P 2 + P,0 + P o 
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Based on an analysis of experimental data, suitable parameter estimates for the oculomotor plant are: 
K SE = 125 Nnr 1 , K LT = 60.7 Nm-\ B x = 4.6 Nsm" 1 , B 2 = 0.5 Nsm" 1 , / = 2.2 x lO" 3 Ns^- 1 , B 3 = 0.538 
Nsnr 1 , B 4 = 41.54 Nsirr 1 , iCj = 26.9 Nm 4 , and K 2 = 41.54 Nm 4 . Based on the updated model of muscle 
and length tension data [Collins, 1975], steady-state active-state tensions are determined as described in 
Enderle and coworkers [1991] as 



0.4 + 0.01750 N for® >0° 
F = \ ' (166.16) 

[0.4-0.01250 N for®<0° 

Saccadic eye movements simulated with this model have characteristics which are in good agreement 
with the data, including position, velocity and acceleration, and the main sequence diagrams. 

166.6 Saccade Pathways 

Clinical evidence, lesion and stimulation studies all point toward the participation of vitally important 
neural sites in the control of saccades, including the cerebellum, superior colliculus, thalamus, cortex, 
and other nuclei in the brain stem; evidence shows also that saccades are driven by two parallel neural 
networks. From each eye, the axons of retinal ganglion cells exit and join other neurons to form the optic 
nerve. The optic nerve from each eye then join at the optic chiasm, where fibers from the nasal half of 
each retina cross to the opposite side. Axons in the optic tract synapse in the lateral geniculate nucleus 
(a thalamic relay) and continue to the visual cortex. This portion of the saccade neural network is 
concerned with the recognition of visual stimuli. Axons in the optic tract also synapse in the superior 
colliculus. This second portion of the saccade neural network is concerned with the location of visual 
targets and is primarily responsible for goal-directed saccades. 

Saccadic neural activity of the superior colliculus and cerebellum, in particular, have been identified 
as the saccade initiator and terminator, respectively, for a goal-directed saccade. The impact of the frontal 
eye field and the thalamus, while very important, have less important roles in the generation of goal- 
directed saccades to visual stimuli. The frontal eye fields are primarily concerned with voluntary saccades, 
and the thalamus appears to be involved with corrective saccades. Shown in Fig. 166.9 is a diagram 
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FIGURE 166.9 Shown is a diagram illustrating important sites for the generation of a conjugate horizontal saccade 
in both eyes. It consists of the familiar premotor excitatory burst neurons (EBN), inhibitory burst neurons (IBN), 
long lead burst neurons (LLBN), omnipause neurons (OPN), tonic neurons (TN), and the vestibular nucleus, 
abducens nucleus, oculomotor nucleus, cerebellum, substantia nigra, nucleus reticularis tegmenti pontis (NRTP), 
the thalamus, the deep layers of the superior colliculus (SC), and the oculomotor plant for each eye. Excitatory inputs 
are shown with an arrow, inhibitory inputs are shown with a X. Consistent with current knowledge, the left and 
right structures of the neural circuit model are maintained. This circuit diagram was constructed after a careful 
review of the current literature. Each of the sites and connections is supported by firm physiologic evidence. Since 
interest is in goal-directed visual saccades, the cortex has not been partitioned into the frontal eye field and posterior 
eye field (striate, prestriate, and inferior parietal cortices). 



illustrating important sites for the generation of a conjugate goal-directed horizontal saccade in both 
eyes [Enderle, 1994]. Each of the sites and connections detailed in Fig. 166.9 are fully supported by 
physiologic evidence. Some of these neural sites will be briefly described. 
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FIGURE 166.10 Movement fields of the superior colliculus. 

The superior colliculus (SC) contains two major functional divisions: a superficial division and an 
intermediate or deep division. Inputs to the superficial division are almost exclusively visual and originate 
from the retina and the visual cortex. The deep layers provide a convergence site of convergence for 
sensory signals from several modalities and a source of efferent commands for initiating saccades. The 
SC is the initiator of the saccade and thought to translate visual information into motor commands. 

The deep layers of the superior colliculus initiate a saccade based on the distance between the current 
position of the eye and the desired target. The neural activity in the superior colliculus is organized into 
movement fields that are associated with the direction and saccade amplitude, and does not involve the 
initial position of the eyeball whatsoever. The movement field is shown in Fig. 166.10 for a 20° saccade. 
Neurons active during a particular saccade are shown as the dark circle, representing a desired 20° eye 
movement. Active neurons in the deep layers of the superior colliculus generate a high-frequency burst 
of activity beginning 18-20 ms before a saccade and ending sometime toward the end of the saccade; 
the exact timing for the end of the burst firing is quite random and can occur slightly before or slightly 
after the saccade ends. Each active bursting neuron discharges maximally, regardless of the initial position 
of the eye. Neurons discharging for small saccades have smaller movement fields, and those for larger 
saccades have larger movement fields. All the movement fields are connected to the same set of LLBN. 

The cerebellum is responsible for the coordination of movement and is composed of a cortex of gray 
matter, internal white matter and three pairs of deep nuclei: fastigial nucleus, the interposed and globose 
nucleus, and dentate nucleus. The deep cerebellar nuclei and the vestibular nuclei transmit the entire 
output of the cerebellum. Output of the cerebellar cortex is carried through Purkinje cells. Purkinje cells 
send their axons to the deep cerebellar nuclei and have an inhibitory effect on these nuclei. The cerebellum 
is involved with both eye and head movements, and both tonic and phasic activity are reported in the 
cerebellum. The cerebellum is not directly responsible for the initiation or execution of a saccade but 
contributes to saccade precision. Sites within the cerebellum important for the control of eye movements 
include the oculomotor vermis, fastigial nucleus, and the flocculus. Consistent with the operation of the 
cerebellum for other movement activities, the cerebellum is postulated here to act both as the coordinator 
for a saccade and as a precise gating mechanism. 

The cerebellum is included in the saccade generator as a time-optimal gating element, using three 
active sites during a saccade: the vermis, fastigial nucleus, and flocculus. The vermis is concerned with 
the absolute starting position of a saccade in the movement field and corrects control signals for initial 
eye position. Using proprioceptors in the oculomotor muscles and an internal eye position reference, the 
vermis is aware of the current position of the eye. The vermis is also aware of the signals (dynamic motor 
error) used to generate the saccade via the connection with the NRTP and the superior colliculus. 

With regard to the oculomotor system, the cerebellum has inputs from superior colliculus, lateral 
geniculate nucleus (LGN), oculomotor muscle proprioceptors, and striate cortex via NRTP. The cerebel- 
lum sends inputs to the NRTP, LLBN, EGN, VN, thalamus, and superior colliculus. The oculomotor 
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vermis and fastigial nuclei are important in the control of saccade amplitude, and the flocculus, perihy- 
poglossal nuclei of the rostral medulla, and possibly the pontine and mesencephalic reticular formation, 
are thought to form the integrator within the cerebellum. One important function of the flocculus may 
be to increase the time constant of the neural integrator for saccades starting at locations different from 
primary position. 

The fastigial nucleus receives input from the superior colliculus, as well as other sites. The output of 
the fastigial nucleus is excitatory and projects ipsilaterally and contralaterally as shown in Fig. 166.9. 
During fixation, the fastigial nucleus fires tonically at low rates; 20 ms prior to a saccade, the contralateral 
fastigial nucleus bursts, and the ipsilateral fastigial nucleus pauses and then discharges with a burst. The 
pause in ipsilateral firing is due to Purkinje cell input to the fastigial nucleus. The sequential organization 
of Purkinje cells along beams of parallel fibers suggests that the cerebellar cortex might function as a 
delay, producing a set of timed pulses which could be used to program the duration of the saccade. If 
one considers nonprimary position saccades, different temporal and spatial schemes, via cerebellar 
control, are necessary to produce the same size saccade. It is postulated here that the cerebellum acts as 
a gating device which precisely terminates a saccade based on the initial position of the eye in the orbit. 

The PPRF has neurons that burst at frequencies up to 1000 Hz during saccades and are silent during 
periods of fixation and neurons that fire tonically during periods of fixation. Neurons that fire at steady 
rates during fixation are called tonic neurons (TNs) and are responsible for holding the eye steady. The 
TN firing rate depends on the position of the eye (presumably through a local integrator type network). 
The TNs are thought to provide the step component to the motoneuron. There are two types of burst 
neurons in the PPRF called the long-lead burst neuron (LLBN) and a medium-lead burst neuron (MLBN); 
during periods of fixation, these neurons are silent. The LLBN burst at least 12 ms before a saccade and 
the MLBN burst less than 12 ms (typically 6-8 ms) before the saccade. The MLBN are connected 
monosynaptically with the abducens nucleus. 

There are two types of neurons within the MLBN, the excitatory burst neurons (EBN) and the 
inhibitory burst neurons (IBN). The EBN and IBN labels describe the synaptic activity upon the moto- 
neurons; the EBN excite and are responsible for the burst firing, and the IBN inhibit and are responsible 
for the pause. A mirror image of these neurons exists on both sides of the midline. The IBN inhibit the 
EBN on the contralateral side. 

Also within the brain stem is another type of saccade neuron called the omnipause neuron (OPN). 
The OPN fires tonically at approximately 200 Hz during periods of fixation and is silent during saccades. 
The OPN stops firing approximately 10-12 ms before a saccade and resumes tonic firing approximately 
10 ms before the end of the saccade. The OPN are known to inhibit the MLBN and are inhibited by the 
LLBN. The OPN activity is responsible for the precise timing between groups of neurons that causes a 
saccade. 

Qualitatively, a saccade occurs according to the following sequence of events within the PPRF. First, 
the ipsilateral LLBN are stimulated by the SC, initiating the saccade. The LLBN then inhibits the tonic 
firing of the OPN. When the OPN cease firing, the MLBN is released from inhibition and begins firing 
(these neurons fire spontaneously and also stimulated by the fastigial nucleus). The ipsilateral IBN are 
stimulated by the ipsilateral LLBN and the contralateral fastigial nucleus of the cerebellum. The ipsilateral 
EBN are stimulated by the contralateral fastigial nucleus of the cerebellum and, when released from 
inhibition fire spontaneously. Except for the fastigial nucleus, there are no other accepted excitatory 
inputs to the EBN. The burst firing in the ipsilateral IBN inhibits the contralateral EBN and abducens 
nucleus and the ipsilateral oculomotor nucleus. The burst firing in the ipsilateral EBN causes the burst 
in the ipsilateral abducens nucleus, which stimulates the ipsilateral lateral rectus muscle and the con- 
tralateral oculomotor nucleus. With the stimulation of the ipsilateral lateral rectus muscle by the ipsilateral 
abducens nucleus and the inhibition of the ipsilateral medial rectus muscle via the oculomotor nucleus, 
a saccade occurs in the right eye. 

Simultaneously, with the contralateral medial rectus muscle stimulated by the contralateral oculomotor 
nucleus, and with the inhibition of the contralateral lateral rectus muscle via the abducens nucleus, a 
saccade occurs in the left eye. A similar scenario is carried out in the right eye. Thus the eyes move 
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conjugately under the control of a single drive center. The saccade is terminated with the resumption of 
tonic firing in the OPN via the fastigial nucleus. 

166.7 Saccade Control Mechanism 

Although the purpose for a saccadic eye movement is clear, that is, to quickly redirect the eyeball to the 
destination, the neural control mechanism is not. Until quite recently, saccade generator models involved 
a ballistic or preprogrammed control to the desired eye position based on retinal error alone. Today, an 
increasing number of investigators are putting forth the idea that visual goal-directed saccades are 
controlled by a local feedback loop that continuously drives the eye to the desired eye position. This 
hypothesis first gained acceptance in 1975 when Robinson suggested that saccades originate from neural 
commands that specify the desired position of the eye rather than the preprogrammed distance the eye 
must be moved. The value of the actual eye position is subtracted from the desired position to create an 
error signal that completes the local feedback loop that drives a high-gain burst element to generate the 
neural pulse. This neural pulse continuously drives the eye until the error signal is zero. 

Subsequently, a number of other investigators have modified the local feedback mechanism proposed 
by Robinson [1975] to better describe the neural connections and firing patterns of brainstem neurons 
in the control of horizontal saccadic eye. In addition to the Robinson model, two other models describe 
a saccade generator [Enderle, 1994; Scudder, 1988]. All the models involve three types of premotor 
neurons; burst, tonic, and pause cells, as previously described, and involve a pulse-step change in firing 
rate at the motoneuron during a saccadic eye movement. 

Although the general pattern of motoneuron activity is qualitatively accepted during a saccadic eye 
movement, there is little agreement on a quantitative discharge description. The saccade generator models 
by Robinson and Scudder are structured to provide a control signal that is proportionally weighted (or 
dependent) to the desired saccade size, as opposed to the saccade generator model structured to provide 
a control signal that is independent of saccade amplitude. Using time-optimal control theory and the 
system identification technique, Enderle and Wolfe [1988] investigated the control of saccades and 
reported that the system operates under a first-order time-optimal control. The concepts underlying this 
hypothesis are that each muscle's active-state tension is described by a low-pass filtered pulse-step 
waveform in which the magnitude of the agonist pulse is a maximum regardless of the amplitude of the 
saccade and that only the duration of the agonist pulse affects the size of the saccade. The antagonist 
muscle is completely inhibited during the period of maximum agonist stimulation. The saccade generator 
illustrated in Fig. 166.9 operates under these principles and provides simulations which match the data 
very well. 

Saccadic eye movements were simulated using TUTSIM, a continuous time simulation program, for 
the saccade generator model presented in Fig. 166.9, and compared with experimental data. Neural sites 
(nucleus) are described via a functional block diagram description of the horizontal saccade generator 
model as shown in Fig. 166.11 and 166.12. Table 166.1 summarizes additional firing characteristics for 
the neural sites. The output of each block represents the firing pattern at each neural site observed during 
the saccade; time zero indicates the start of the saccade and T represents the end of the saccade. Naturally, 
the firing pattern observed for each block represents the firing pattern for a single neuron, as recorded 
in the literature, but the block represents the cumulative effect of all the neurons within that site. 
Consistent with a time optimal control theory, neural activity is represented within each of the blocks 
as pulses and/or steps to reflect their operation as timing gates. The superior colliculus fires maximally 
as long as the dynamic motor error is greater than zero, in agreement with the first-order time optimal 
controller and physiologic evidence. Notice that the LLBNs are driven by the superior colliculus as long 
as there is a feedback error maintained by the cerebellar vermis. In all likelihood, the maximal firing rate 
by the superior colliculus is stochastic, depending on a variety of physiologic factors such as the interest 
in tracking the target, anxiety, frustration, stress, and other factors. The actual firing patterns in the 
superior colliculus, the burst neurons in the PPRF (LLBN, EBN, and IBN) and abducens nucleus are 
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FIGURE 166.11 A functional block diagram of the saccade generator model. Solid lines are excitatory and dashed 
lines are inhibitory. This figure illustrates the first half of the network. 

simulated with filtered pulse signals, consistent with the physical limitations of neurons. For the superior 
colliculus and the LLBN, this involves a single pulse, for the EBN and IBN, this involves two pulses with 
different filters (the first pulse describes the brief rise and subsequent fall within the first 10 ms during 
a saccade, and the second pulse describes the steady state pulse during the saccade) to match the 
electrophysiologic data. 

Illustrated in Fig. 166.13 is an extracellular single unit recording for the EBN, eye position data, and 
a simulation for a 20° saccade. Details of the experimental are given in Enderle [1994]. A 20° saccade 
was simulated by using EBN data as input and the oculomotor plant in Fig. 166.8. Few differences between 
the data and the simulation results are observed for this movement, as well as other eye movements. The 
saccade generator model in Fig. 166.9 also provides an excellent description of the saccade system and 
matches the data very well for all naturally occurring saccades, including saccades with dynamic overshoot 
and glissadic behavior, without parametric changes [Enderle, 1994]. 



166.8 Conclusion 



This chapter has focused on quantitative models and control of the fast eye movement system. Each of 
the oculomotor plant models described here are linear. Beginning with the most simple quantitative 
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FIGURE 166.12 A functional block diagram of the saccade generator model. Solid lines are excitatory and dashed 
lines are inhibitory. This figure illustrates the second half of the network. 



model of saccades by Westheimer [1954], important characteristics of saccades were determined as a 
means of evaluating the quality of saccade models. Next, models of increasing complexity were presented 
with the end goal of constructing a homeomorphic saccade model of the oculomotor plant. These plant 
models were driven by improved models of muscle that ultimately provided an excellent match of the 
static and dynamic properties of the rectus eye muscle. Finally, the control of saccades was considered 
from the basis of systems control theory and anatomic considerations. Many nonlinear models of the 
oculomotor plant exist, and readers interested in learning about them should consult Robinson [1981]. 
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TABLE 166.1 Activity of Neural Sites During a Saccade 





Onset 






Before 




Neural Site 


Saccade 


Peak Firing Rate 


Abducens nucleus 


5 ms 


400-800 Hz 


Contralateral fastigial nucleus 


20 ms 


200 Hz 


Contralateral superior colliculus 


20-25 ms 


800-1000 Hz 


Ipsilateral cerebellar vermis 


20-25 ms 


600-800 Hz 


Ipsilateral EBN 


6-8 ms 


600-800 Hz 


Ipsilateral fastigial nucleus 


20 ms 


Pause during saccade, and a 
burst of 200 Hz toward the 
end of the saccade 


Ipsilateral FEF 


>30ms 


600-800 Hz 


Ipsilateral IBN 


6-8 ms 


600-800 Hz 


Ipsilateral LLBN 


20 ms 


800-1000 Hz 


Ipsilateral NRTP 


20-25 ms 


800-1000 Hz 


Ipsilateral substantia nigra 


40 ms 


40-100 Hz 



End Time 



OPN 



6-8 ms 



150-200 Hz (before and after) 



Ends approx. 5 ms before saccade ends 
Pulse ends with pause approx. 10 ms 
before saccade ends, resumes tonic 
firing approx. 10 ms after saccade ends 
Ends approx. when saccade ends 
Ends approx. 25 ms before saccade ends 
Ends approx. 10 ms before saccade ends 
Pause ends with burst approx. 10 ms 
before saccade ends, resumes tonic 
firing approx. 10 ms after saccade ends 
Ends approx. when saccade ends 
Ends approx. 10 ms before saccade ends 
Ends approx. when saccade ends 
Ends approx. when saccade ends 
Resumes firing approx. 40-150 ms after 

saccade ends 
Ends approx. when saccade ends 



FIGURE 166.13 Simulated eye position in solid 
line shown in top graph, generated with EBN sac- 
cade data (shown in lower graph) and the oculomo- 
tor plant in Fig. 166.8. Actual eye movement data 
recorded with the EBN data during a saccadic eye 
movement, shown as dashed line in a top graph. 
Data provided by Dr. David Sparks from his labo- 
ratory at the University of Alabama. 
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Defining Terms 

Active-state tension generator: The active-state tension generator describes the element within the 
muscle that creates a force. This force is different from muscle tension, which is the force due to 
the active-state tension generator and all the other elements within the muscle. 

Extraocular muscles: The six muscles attached directly to the outside of the eyeball, consisting of the 
medial, lateral, superior, inferior recti, and the superior and inferior oblique muscles. 

Homeomorphic: As close to reality as possible. 

Latent period: The latent period is thought to be the time interval during which the CNS determines 
whether to make a saccade, and, if so, calculates the distance the eyeball is to be moved, trans- 
forming retinal error into transient muscle activity. 

Main-sequence diagrams: Summary plots of the characteristics of saccades that allow one to compare 
inter- and intrasubject variations. Commonly used characteristics include: (1) peak velocity-saccade 
magnitude, (2) saccade duration-saccade magnitude, and (3) latent period-saccade magnitude. 

Oculomotor system: The oculomotor system consists of the eyeball and extraocular muscles (also called 
the oculomotor plant) and the neural sites responsible for the eye movement. 

Saccade: A fast eye movement. 
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VER THE PAST 100 YEARS, the health care system's dependence on medical technology for 
the delivery of its services has grown continuously. To some extent, all professional care providers 
depend on technology, be it in the area of preventive medicine, diagnosis, therapeutic care, 
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rehabilitation, administration, or health-related education and training. Medical technology enables 
practitioners to intervene through integrated interactions with their patients in a cost-effective, efficient, 
and safe manner. As a result, the field of clinical engineering has emerged as the discipline of biomedical 
engineering that fulfills the need to manage the deployment of medical technology and to integrate it 
appropriately with desired clinical practices. 

The health care delivery system presents a very complex environment where facilities, equipment, 
materials, and a full range of human interventions are involved. It is in this clinical environment that 
patients of various ages and conditions, trained staff, and the wide variety of medical technology converge. 
This complex mix of interactions may lead to unacceptable risk when programs for monitoring, con- 
trolling, improving, and educating all entities involved are not appropriately integrated by qualified 
professionals. 

This section of clinical engineering focuses on the methodology for administering critical engineering 
services that vary from facilitation of innovation and technology transfer to the performance of technol- 
ogy assessment and operations support and on the management tools with which today's clinical engineer 
needs to be familiar. With increased awareness of the value obtained by these services, new career 
opportunities are created for clinical engineers. 

In this section the authors have attempted to provide a description of the wide range of responsibilities 
clinical engineering professionals encounter. After presenting the evolution of the field of clinical engi- 
neering, Chapter 165 gives specific attention to the primary function of clinical engineers. Chapter 166 
describes technology management and assessment in considerable detail, using examples and case studies 
in a large medical center. Chapter 167 focuses on a particular technology management tool for assessing 
the medical equipment risk factor that can help clinical engineers effectively manage/prioritize the services 
to be provided to each piece of equipment under their control. Chapter 168 takes an expansive view of 
career opportunities for clinical engineers, while Chapter 169 focuses on one particular role, namely, that 
of innovator and product developer. To further assist clinical engineers in managing their equipment, 
Chapters 170 and 171 examine an establishment of program indicators that can lead to quality improve- 
ment. The clinical engineering program can be based in a single community hospital, in a teaching 
medical center, within a chain of hospitals, as part of a government agency, or as a shared service 
organization. Chapter 172 describes the use of coordinated services as they are offered by the West of 
Scotland Health Board, and Chapters 173 and 174 present a review of the standards and regulatory 
agencies of interest to clinical engineers. Global programs present different challenges to practitioners in 
developing countries because the resources and logistics are different than they are in the United States. 
Chapter 175 looks at specific clinical engineering issues from the perspective of developing countries. 

In addition to highlighting the important roles that clinical engineers serve in many areas, the section 
focuses on those areas of the clinical engineering field that enhance the understanding of the "bigger 
picture." With such an understanding, the participation in and contribution by clinical engineers to this 
enlarged scope can be fully realized. The adoption of the tools described here will enable clinical engineers 
to fulfill their new role in the evolving health care delivery system. 

All the authors in this section recognize this opportunity and here recognized for volunteering their 
talent and time so that others can excel as well. 
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167.1 What Is a Clinical Engineer? 

As discussed in the introduction to this Handbook, biomedical engineers apply the concepts, knowledge, 
and techniques of virtually all engineering disciplines to solve specific problems in the biosphere, i.e., 
the realm of biology and medicine. When biomedical engineers work within a hospital or clinic, they 
are more properly called clinical engineers. But what exactly is the definition of the term clinical engineer 7 . 
In recent years, a number of organizations, e.g., the American Heart Association [1986], the American 
Association of Medical Instrumentation [Goodman, 1989], the American College of Clinical Engineers 
[Bauld, 1991], and the Journal of Clinical Engineering [Pacela, 1991], have attempted to provide an 
appropriate definition for the term, clinical engineer. For the purposes of this handbook, a clinical engineer 
is an engineer who has graduated from an accredited academic program in engineering or who is licensed 
as a professional engineer or engineer-in-training and is engaged in the application of scientific and 
technological knowledge developed through engineering education and subsequent professional experi- 
ence within the health care environment in support of clinical activities. Furthermore, the clinical 
environment is defined as that portion of the health care system in which patient care is delivered, and 
clinical activities include direct patient care, research, teaching, and public service activities intended to 
enhance patient care. 

167.2 Evolution of Clinical Engineering 

Engineers were first encouraged to enter the clinical scene during the late 1960s in response to concerns 
about patient safety as well as the rapid proliferation of clinical equipment, especially in academic medical 
centers. In the process, a new engineering discipline — clinical engineering — evolved to provide the 
technological support necessary to meet these new needs. During the 1970s, a major expansion of clinical 
engineering occurred, primarily due to the following events: 

■ The Veterans' Administration (VA), convinced that clinical engineers were vital to the overall 
operation of the VA hospital system, divided the country into biomedical engineering districts, 
with a chief biomedical engineer overseeing all engineering activities in the hospitals in that district. 
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• Throughout the United States, clinical engineering departments were established in most large 
medical centers and hospitals and in some smaller clinical facilities with at least 300 beds. 

• Clinical engineers were hired in increasing numbers to help these facilities use existing technology 
and incorporate new technology. 

Having entered the hospital environment, routine electrical safety inspections exposed the clinical 
engineer to all types of patient equipment that was not being maintained properly. It soon became 
obvious that electrical safety failures represented only a small part of the overall problem posed by the 
presence of medical equipment in the clinical environment. The equipment was neither totally understood 
nor properly maintained. Simple visual inspections often revealed broken knobs, frayed wires, and even 
evidence of liquid spills. Investigating further, it was found that many devices did not perform in 
accordance with manufacturers' specifications and were not maintained in accordance with manufactur- 
ers' recommendations. In short, electrical safety problems were only the tip of the iceberg. The entrance 
of clinical engineers into the hospital environment changed these conditions for the better. By the mid- 
1970s, complete performance inspections before and after use became the norm, and sensible inspection 
procedures were developed [Newhouse et al., 1989]. In the process, clinical engineering departments 
became the logical support center for all medical technologies and became responsible for all the bio- 
medical instruments and systems used in hospitals, the training of medical personnel in equipment use 
and safety, and the design, selection, and use of technology to deliver safe and effective health care. 

With increased involvement in many facets of hospital/ clinic activities, clinical engineers now play a 
multifaceted role (Fig. 167.1). They must interface successfully with many "clients," including clinical 
staff, hospital administrators, regulatory agencies, etc., to ensure that the medical equipment within the 
hospital is used safely and effectively. 

Today, hospitals that have established centralized clinical engineering departments to meet these 
responsibilities use clinical engineers to provide the hospital administration with an objective option of 
equipment function, purchase, application, overall system analysis, and preventive maintenance policies. 
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FIGURE 167.1 Diagram illustrating the range of interactions of a clinical engineer. 
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Some hospital administrators have learned that with the in-house availability of such talent and expertise, 
the hospital is in a far better position to make more effective use of its technological resources [Bronzino, 
1986, 1992]. By providing health professionals with needed assurance of safety, reliability, and efficiency 
in using new and innovative equipment, clinical engineers can readily identify poor-quality and ineffective 
equipment, thereby resulting in faster, more appropriate utilization of new medical equipment. 
Typical pursuits of clinical engineers, therefore, include 

• Supervision of a hospital clinical engineering department that includes clinical engineers and 
biomedical equipment technicians (BMETs) 

• Prepurchase evaluation and planning for new medical technology 

• Design, modification , or repair of sophisticated medical instruments or systems 

• Cost-effective management of a medical equipment calibration and repair service 

• Supervision of the safety and performance testing of medical equipment performed by BMETs 

• Inspection of all incoming equipment (i.e., both new and returning repairs) 

• Establishment of performance benchmarks for all equipment 

• Medical equipment inventory control 

• Coordination of outside engineering and technical services performed by vendors 

• Training of medical personnel in the safe and effective use of medical devices and systems 

• Clinical applications engineering, such as custom modification of medical devices for clinical 
research, evaluation of new noninvasive monitoring systems, etc. 

• Biomedical computer support 

• Input to the design of clinical facilities where medical technology is used, e.g., operating rooms 
(ORs), intensive care units, etc. 

• Development and implementation of documentation protocols required by external accreditation 
and licensing agencies. 

Clinical engineers thus provide extensive engineering services for the clinical staff and, in recent years, 
have been increasingly accepted as valuable team members by physicians, nurses, and other clinical 
professionals. Furthermore, the acceptance of clinical engineers in the hospital setting has led to different 
types of engineering-medicine interactions, which in turn have improved health care delivery. 

167.3 Hospital Organization and the Role 
of Clinical Engineering 

In the hospital, management organization has evolved into a diffuse authority structure that is commonly 
referred to as the triad model. The three primary components are the governing board (trustees), hospital 
administration (CEO and administrative staff), and the medical staff organization [Bronzino and Hayes, 
1988]. The role of the governing board and the chief executive officer are briefly discussed below to 
provide some insight regarding their individual responsibilities and their interrelationship. 

Governing Board (Trustees) 

The Joint Commission on the Accreditation of Healthcare Organizations (JCAHO) summarizes the major 
duties of the governing board as "adopting by-laws in accordance with its legal accountability and its 
responsibility to the patient." The governing body, therefore, requires both medical and paramedical depart- 
ments to monitor and evaluate the quality of patient care, which is a critical success factor in hospitals today. 
To meet this goal, the governing board essentially is responsible for establishing the mission statement 
and defining the specific goals and objectives that the institution must satisfy. Therefore, the trustees are 
involved in the following functions: 
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• Establishing the policies of the institution 

• Providing equipment and facilities to conduct patient care 

■ Ensuring that proper professional standards are defined and maintained (i.e., providing quality 
assurance) 

• Coordinating professional interests with administrative, financial, and community needs 

• Providing adequate financing by securing sufficient income and managing the control of expenditures 

• Providing a safe environment 

• Selecting qualified administrators, medical staff, and other professionals to manage the hospital 

In practice, the trustees select a hospital chief administrator who develops a plan of action that is in 
concert with the overall goals of the institution. 

Hospital Administration 

The hospital administrator, the chief executive officer of the medical enterprise, has a function similar 
to that of the chief executive officer of any corporation. The administrator represents the governing board 
in carrying out the day-to-day operations to reflect the broad policy formulated by the trustees. The 
duties of the administrator are summarized as follows: 

• Preparing a plan for accomplishing the institutional objectives, as approved by the board 

• Selecting medical chiefs and department directors to set standards in their respective fields 

• Submitting for board approval an annual budget reflecting both expenditures and income projections 

■ Maintaining all physical properties (plant and equipment) in safe operating condition 

■ Representing the hospital in its relationships with the community and health agencies 

• Submitting to the board annual reports that describe the nature and volume of the services 
delivered during the past year, including appropriate financial data and any special reports that 
may be requested by the board 

In addition to these administrative responsibilities, the chief administrator is charged with controlling 
cost, complying with a multitude of governmental regulations, and ensuring that the hospital conforms 
to professional norms, which include guidelines for the care and safety of patients. 

167.4 Clinical Engineering Programs 

In many hospitals, administrators have established clinical engineering departments to manage effectively 
all the technological resources, especially those relating to medical equipment, that are necessary for 
providing patient care. The primary objective of these departments is to provide a broad-based engi- 
neering program that addresses all aspects of medical instrumentation and systems support. 

Figure 167.2 illustrates the organizational chart of the medical support services division of a typical 
major medical facility. Note that within this organizational structure, the director of clinical engineering 
reports directly to the vice-president of medical support services. This administrative relationship is 
extremely important because it recognizes the important role clinical engineering departments play in 
delivering quality care. It should be noted, however, that in other common organizational structures, 
clinical engineering services may fall under the category of "facilities," "materials management," or even 
just "support services." Clinical engineers also can work directly with clinical departments, thereby 
bypassing much of the hospital hierarchy. In this situation, clinical departments can offer the clinical 
engineer both the chance for intense specialization and, at the same time, the opportunity to develop 
personal relationships with specific clinicians based on mutual concerns and interests [Wald, 1989]. 

Once the hospital administration appoints a qualified individual as director of clinical engineering, 
the person usually functions at the department-head level in the organizational structure of the institution 
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FIGURE 167.2 Organizational chart of medical support services division for a typical major medical facility. This 
organizational structure points out the critical interrelationship between the clinical engineering department and the 
other primary services provided by the medical facility. 

and is provided with sufficient authority and resources to perform the duties efficiently and in accordance 
with professional norms. To understand the extent of these duties, consider the job title for "clinical 
engineering director" as defined by the World Health Organization [Issakov et al, 1990]. 

General Statement. The clinical engineering director, by his or her education and experience, acts as 
a manger and technical director of the clinical engineering department. The individual designs and directs 
the design of equipment modifications that may correct design deficiencies or enhance the clinical 
performance of medical equipment. The individual also may supervise the implementation of those 
design modifications. The education and experience that the director possesses enables him or her to 
analyze complex medical or laboratory equipment for purposes of defining corrective maintenance and 
developing appropriate preventing maintenance or performance assurance protocols. The clinical engi- 
neering director works with nursing and medical staff to analyze new medical equipment needs and 
participates in both the prepurchase planning process and the incoming testing process. The individual 
also participates in the equipment management process through involvement in the system development, 
implementation, maintenance, and modification processes. 

Duties and Responsibilities. The director of clinical engineering has a wide range of duties and respon- 
sibilities. For example, this individual 

• Works with medical and nursing staff in the development of technical and performance specifi- 
cations for equipment requirements in the medical mission. 

• Once equipment is specified and the purchase order developed, generates appropriate testing of 
the new equipment. 

• Does complete performance analysis on complex medical or laboratory equipment and summa- 
rizes results in brief, concise, easy-to-understand terms for the purposes of recommending cor- 
rective action or for developing appropriate preventive maintenance and performance assurance 
protocols. 
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• Designs and implements modifications that permit enhanced operational capability. May supervise 
the maintenance or modification as it is performed by others. 

• Must know the relevant codes and standards related to the hospital environment and the perfor- 
mance assurance activities. (Examples in the United States are NFPA 99, UL 544, and JCAHO, 
and internationally, IEC-TC 62.) 

■ Is responsible for obtaining the engineering specifications (systems definitions) for systems that 
are considered unusual or one-of-a-kind and are not commercially available. 

• Supervises in-service maintenance technicians as they work on codes and standards and on 
preventive maintenance, performance assurance, corrective maintenance, and modification of new 
and existing patient care and laboratory equipment. 

• Supervises parts and supply purchase activities and develops program policies and procedures for 
same. 

• Sets departmental goals, develops budgets and policy, prepares and analyzes management reports 
to monitor department activity, and manages and organizes the department to implement them. 

• Teaches measurement, calibration, and standardization techniques that promote optimal perfor- 
mance. 

• In equipment-related duties, works closely with maintenance and medical personnel. Communi- 
cates orally and in writing with medical, maintenance, and administrative professionals. Develops 
written procedures and recommendations for administrative and technical personnel. 

Minimum Qualifications. A bachelor's degree (4 years) in an electrical or electronics program or the 
equivalent is required (preferably with a clinical or biomedical adjunct). A master's degree is desirable. 
A minimum of 3 years' experience as a clinical engineer and 2 years in a progressively responsible 
supervisory capacity is needed. Additional qualifications are as follows: 

• Must have some business knowledge and management skills that enable him or her to participate 
in budgeting, cost accounting, personnel management, behavioral counseling, job description 
development, and interviewing for hiring or firing purposes. Knowledge and experience in the 
use of microcomputers are desirable. 

• Must be able to use conventional electronic trouble-shooting instruments such as multimeters, 
function generators, oscillators, and oscilloscopes. Should be able to use conventional machine 
shop equipment such as drill presses, grinders, belt sanders, brakes, and standard hand tools. 

• Must possess or be able to acquire knowledge of the techniques, theories, and characteristics of 
materials, drafting, and fabrication techniques in conjunction with chemistry, anatomy, physiol- 
ogy, optics, mechanics, and hospital procedures. 

• Clinical engineering certification or professional engineering registration is required. 

Major Functions of a Clinical Engineering Department 

It should be clear by the preceding job description that clinical engineers are first and foremost engi- 
neering professionals. However, as a result of the wide-ranging scope of interrelationships within the 
medical setting, the duties and responsibilities of clinical engineering directors are extremely diversified. 
Yet a common thread is provided by the very nature of the technology they manage. Directors of clinical 
engineering departments are usually involved in the following core functions: 

Technology Management. Developing, implementing, and directing equipment management pro- 
grams. Specific tasks include accepting and installing new equipment, establishing preventive mainte- 
nance and repair programs, and managing the inventory of medical instrumentation. Issues such as cost- 
effective use and quality assurance are integral parts of any technology management program. The 
director advises the administrator of the budgetary, personnel, space, and test equipment requirements 
necessary to support this equipment management program. 
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Risk Management. Evaluating and taking appropriate action on incidents attributed to equipment 
malfunctions or misuse. For example, the clinical engineering director is responsible for summarizing 
the technological significance of each incident and documenting the findings of the investigation. He or 
she then submits a report to the appropriate hospital authority and, according to the Safe Medical Devices 
Act of 1990, to the device manufacturer, the Food and Drug Administration (FDA), or both. 

Technology Assessment. Evaluating and selecting new equipment. The director must be proactive in the 
evaluation of new requests for capital equipment expenditures, providing hospital administrators and 
clinical staff with an in depth appraisal of the benefits/ advantages of candidate equipment. Furthermore, 
the process of technology assessment for all equipment used in the hospital should be an ongoing activity. 

Facilities Design and Project Management. Assisting in the design of new or renovated clinical facil- 
ities that house specific medical technologies. This includes operating rooms, imaging facilities, and 
radiology treatment centers. 

Training. Establish and deliver instructional modules for clinical engineering staff as well as clinical 
staff on the operation of medical equipment. 

In the future, it is anticipated that clinical engineering departments will provide assistance in the 
application and management of many other technologies that support patient care, including computer 
support, telecommunications, and facilities operations. 

Defining Terms 

JCAHO, Joint Commission on the Accreditation of Healthcare Organizations: Accrediting body 
responsible for checking hospital compliance with approved rules and regulations regarding the 
delivery of health care. 

Technology assessment: Involves an evaluation of the safety, efficiency, and cost-effectiveness, as well 
as consideration of the social, legal, and ethical effects, of medical technology. 
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As medical technology continues to evolve, so does its impact on patient outcome, hospital operations, 
and financial efficiency. The ability to plan for this evolution and its subsequent implications has become 
a major challenge in most decisions of health care organizations and their related industries. Therefore, 
there is a need to adequately plan for and apply those management tools which optimize the deployment 
of medical technology and the facilities that house it. Successful management of the technology and 
facilities will ensure a good match between the needs and the capabilities of staff and technology, 
respectively. While different types and sizes of hospitals will consider various strategies of actions, they 
all share the need to manage efficient utilization of their limited resources and its monitoring. Technology 
is one of these resources, and while it is frequently cited as the culprit behind cost increases, the well- 
managed technology program contribute to a significant containment of the cost of providing quality 
patient care. Clinical engineer's skills and expertise are needed to facilitate the adoption of an objective 
methodology for implantation of a program that will match the hospital's needs and operational conditions. 
Whereas both the knowledge and practice patterns of management in general are well organized in 
today's literature, the management of the health care delivery system and that of medical technology in 
the clinical environment has not yet reached that same high level. However, as we begin to understand 
the relationship between the methods and information that guide the decision-making processes regard- 
ing the management of medical technology that are being deployed in this highly complex environment, 
the role of the qualified clinical engineer becomes more valuable. This is achieved by reformulating the 
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technology management process, which starts with the strategic planning process, continues with the 
technology assessment process, leads to the equipment planning and procurement processes, and finally 
ends with the assets management process. Definition of terms used in this chapter are provided at the 
end of the chapter. 

168.1 The Health Care Delivery System 

Societal demands on the health care delivery system revolve around cost, technology, and expectations. To 
respond effectively, the delivery system must identify its goals, select and define its priorities, and then wisely 
allocate its limited resources. For most organizations, this means that they must acquire only appropriate 
technologies and manage what they have already more effectively. To improve performance and reduce 
costs, the delivery system must recognize and respond to the key dynamics in which it operates, must shape 
and mold its planing efforts around several existing health care trends and directions, and must respond 
proactively and positively to the pressures of its environment. These issues and the technology manager's 
response are outlined here: (1) technology's positive impact on care quality and effectiveness, (2) an 
unacceptable rise in national spending for health care services, (3) a changing mix of how Americans are 
insured for health care, (4) increases in health insurance premiums for which appropriate technology 
application is a strong limiting factor, (5) a changing mix of health care services and settings in which care 
is delivered, and (6) growing pressures related to technology for hospital capital spending and budgets. 

Major Health Care Trends and Directions 

The major trends and directions in health care include (1) changing location and design of treatment 
areas, (2) evolving benefits, coverages, and choices, (3) extreme pressures to manage costs, (4) treating 
of more acutely ill older patients and the prematurely born, (5) changing job structures and demand for 
skilled labor, (6) the need to maintain a strong cash flow to support construction, equipment, and 
information system developments, (7) increased competition on all sides, (8) requirement for informa- 
tion systems that effectively integrate clinical and business issues, (9) changing reimbursement policies 
that reduce new purchases and lead to the expectation for extended equipment life cycles, (10) internal 
technology planning and management programs to guide decision making, (11) technology planning 
teams to coordinate adsorption of new and replacement technologies, as well as to suggest delivery system 
changes, and (12) equipment maintenance costs that are emerging as a significant expense item under 
great administrative scrutiny. 

System Pressures 

System pressures include (1) society's expectations — highest quality care at the lowest reasonable price, 
where quality is a function of personnel, facilities, technology, and clinical procedures offered; 
(2) economic conditions — driven often by reimbursement criteria; (3) legal-pressures — resulting prima- 
rily from malpractice issues and dealing with rule-intensive "government" clients; (4) regulatory — mul- 
tistate delivery systems with increased management complexity, or heavily regulated medical device 
industries facing free-market competition, or hospitals facing the Safe Medical Devices Act reporting 
requirements and credentialling requirements; (5) ethics — deciding who gets care and when; and 
(6) technology pressures — organizations having enough capabilities to meet community needs and to 
compete successfully in their marketplaces. 

The Technology Manager's Responsibility 

Technology mangers should (1) become deeply involved and committed to technology planning and 
management programs in their system, often involving the need for greater personal responsibilities and 
expanded credentials; (2) understand how the factors above impact their organization and how technology 
can be used to improve outcomes, reduce costs, and improve quality of life for patients, (3) educate other 
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health care professionals about how to demonstrate the value of individual technologies through involving 
financial, engineering, quality of care, and management perspective, and (4) assemble a team of care- 
givers with sufficient broad clinical expertise and administrators with planning and financial expertise 
to contribute their knowledge to the assessment process [ 16 ]. 

168.2 Strategic Technology Planning 
Strategic Planning Process 

Leading health care organizations have begun to combine strategic technology planning with other tech- 
nology management activities in program that effectively integrate new technologies with their existingly 
technology base. This has resulted in high-quality care at a reasonable cost. Among those who have been 
its leading catalysts, ECRI (formerly the Emergency Care Research Institute) is known for articulating this 
program [4] and encouraging its proliferation initially among regional health care systems and now for 
single or multihospital systems as well [5]. Key components of the program include clinical strategic- 
planning, technology strategic planning, technology assessment, interaction with capital budgeting, acqui- 
sition and deployment, resource (or equipment assets) management, and monitoring and evaluation. A 
proper technology strategic plan is derived from and supports as well-defined clinical strategic plan [15]. 

Clinical and Technology Strategic Plan 

Usually considered long-range and continually evolving, a clinical strategic plan is updated annually. For 
a given year, the program begins when key hospital participants, through the strategic planning process, 
assess what clinical services the hospital should be offering in its referral area. They take into account 
health care trends, demographic and market share data, and space and facilities plans. They analyze their 
facility's strengths and weaknesses, goals and objectives, competition, and existing technology base. The 
outcome of this process is a clinical strategic plan that establishes the organization's vision for the year 
and referral area needs and the hospital's objectives in meeting them. 

It is not possible to adequately complete a clinical strategic plan without engaging in the process of 
strategic technology planning. A key role for technology managers is to assist their organizations through- 
out the combined clinical and technology strategic planning processes by matching available technical 
capabilities, both existing and new, with clinical requirements. To accomplish this, technology managers 
must understand why their institution's values and mission are set as they are, pursue their institution's 
strategic plans through that knowledge, and plan in a way that effectively allocates limited resources. 
Although a technology manager may not be assigned to develop an institution's overall strategic plan, 
he or she must understand and believe it in order to offer good input for hospital management. In 
providing this input, a technology manager should determine a plan for evaluating the present state of 
the hospital's technological deployment, assist in providing a review of emerging technological innova- 
tions and their possible impact on the hospital, articulate justifications and provisions for adoption of 
new technologies or enhancement of existing ones, visit research and laboratories and exhibit areas at 
major medical and scientific meetings to view new technologies, and be familiar with the institution and 
its equipment users' abilities to assimilate new technology. 

The past decade has shown a trend toward increased legislation in support of more federal regulations 
in health care. These and other pressures will require that additional or replacement medical technology 
be well anticipated and justified. As a rationale for technology adoption, the Texas Children's Hospital 
focuses on the issues of clinical necessity, management support, and market preference, Addressing the 
issue of clinical necessity, the hospital considers the technology's comparison against medical standard 
of care, its impact on the level of care and quality of life, its improvement on intervention's accuracy 
and/or safety, its impact on the rate of recovery, the needs or desires of the community, and the change 
in service volume or focus. On the issue of management support, the hospital estimates if the technology 
will create a more effective care plan and decision-making process, improve operational efficiency in the 
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current service programs, decrease liability exposure, increase compliance with regulations, reduce work- 
load and dependence on user skill level ameliorate departmental support, or enhance clinical proficiency. 
Weighting the issue of market preference, the hospital contemplate if it will improve access to care, 
increase customer convenience and/or satisfaction, enhance the organization's image and market share, 
decrease the cost of adoption and ownership, or provide a return on its investment. 

Technology Strategic Planning Process 

When the annual clinical strategic planning process has started and hospital leaders have begun to analyze 
or reaffirm what clinical services they want to offer to the community, the hospital can then conduct 
efficient technology strategic planning. Key elements of this planning involve (1) performing an initial 
audit of existing technologies, (2) conducting a technology assessment for new and emerging technologies 
for fit with current or desired clinical services, (3) planning for replacement and selection of new 
technologies, (4) setting priorities for technology acquisition, and (5) developing processes to implement 
equipment acquisition and monitor ongoing utilization. "Increasingly, hospitals are designating a senior 
manager (e.g., an administrator, the director of planning, the director of clinical engineering) to take the 
responsibility for technology assessment and planning. That person should have the primary responsi- 
bility for developing the strategic technology plan with the help of key physicians, department managers, 
and senior executives" [4]. 

Hospitals can form a medical technology advisory committee (MTAC), overseen by the designated 
senior manager and consisting of the types of members mentioned above, to conduct the strategic 
technology planning process and to annually recommend technology priorities to the hospital strategic 
planning committee and capital budget committee. It is especially important to involve physicians and 
nurses in this process. 

In the initial technology audit, each major clinical service or product line must be analyzed to determine 
how well the existing technology base supports it. The audit can be conducted along service lines 
(radiology, cardiology, surgery) or technology function (e.g., imaging, therapeutic, diagnostic) by a team 
of designated physicians, department heads, and technology managers. The team should begin by devel- 
oping a complete hospital-wide assets inventory, including the quantity and quality of equipment. The 
team should compare the existing technology base against known and evolving standards-of-care infor- 
mation, patient outcome data, and known equipment problems. Next, the team should collect and 
examine information on technology utilization to assess its appropriate use, the opportunities for 
improvement, and the risk level. After reviewing the technology users' education needs as they relate to 
the application and servicing of medical equipment, the team should credential users for competence in 
the application of new technologies. Also, the auditing team should keep up with published clinical 
protocols and practice guidelines using available health care standards directories and utilize clinical 
outcome data for quality-assurance and risk-management program feedback [6]. 

While it is not expected that every hospital has all the required expertise in-house to conduct the initial 
technology audit or ongoing technology assessment, the execution of this planning process is sufficiently 
critical for a hospital's success that outside expertise should be obtained when necessary. The audit allows 
for the gathering of information about the status of the existing technology base and enhances the 
capability of the medical technology advisory committee to assess the impact of new and emerging 
technologies on their major clinical services. 

All the information collected from the technology audit results and technology assessments is used in 
developing budget strategies. Budgeting is part of strategic technology planning in that a 2- to 5-year 
long-range capital spending plan should be created. This is in addition to the annual capital budget 
preparation that takes into account 1 year at a time. The MTAC, as able and appropriate, provides key 
information regarding capital budget requests and makes recommendations to the capital budget com- 
mittee each year. The MTAC recommends priorities for replacement as well as new and emerging 
technologies that over a period of several years guides that acquisition that provides the desired service 
developments or enhancements. Priorities are recommended on the basis of need, risk, cost (acquisition, 
operational and maintenance), utilization, and fit with the clinical strategic plan. 
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168.3 Technology Assessment 

As medical technology continues to evolve, so does its impact on patient outcome, hospital operations, 
and financial resources. The ability to manage this evolution and its subsequent implications has become 
a major challenge for all health care organizations. Successful management of technology will ensure a 
good match between needs and capabilities and between staff and technology. To be successful, an ongoing 
technology assessment process must b an integral part of an ongoing technology planning and manage- 
ment program at the hospital, addressing the needs of the patient, the user, and the support team. This 
facilitates better equipment planning and utilization of the hospital's resources. The manager who is 
knowledgeable about his or her organization's culture, equipment users' needs, the environment within 
which equipment will be applied, equipment engineering, and emerging technological capabilities will 
be successful in proficiently implementing and managing technological changes [7]. 

It is in the technology assessment process that the clinical engineering/technology manager professional 
needs to wear two hats: that of the manager and that of the engineer. This is a unique position, requiring 
expertise and detailed preparation, that allows one to be a key leader and contributor to the decision- 
making process of the medical technology advisory committee (MTAC). 

The MTAC uses an ad hoc team approach to conduct technology assessment of selected services and 
technologies throughout the year. The ad hoc teams may incorporate representatives of equipment users, 
equipment service providers, physicians, purchasing agents, reimbursement mangers, representatives of 
administration, and other members from the institution as applicable. 

Prerequisites for Technology Assessment 

Medical technology is a major strategic factor in positioning and creating a positive community percep- 
tion of the hospital. Exciting new biomedical devices and systems are continually being introduced. And 
they are introduced at a time when the pressure on hospitals to contain expenditures is mounting. 
Therefore, forecasting the deployment of medical technology and the capacity to continually evaluate its 
impact on the hospital require that the hospital be willing to provide the support for such a program. 
(Note: Many organizations are aware of the principle that an in-house "champion" is needed in order to 
provide for the leadership that continually and objectively plans ahead. The champion and the program 
being "championed" may use additional in-house or independent expertise as needed. To get focused 
attention on the technology assessment function and this program in larger, academically affiliated and 
government hospitals, the position of a chief technology officer is being created. ) Traditionally, executives 
rely on their staff to produce objective analyses of the hospital's technological needs. Without such 
analyses, executives may approve purchasing decisions of sophisticated biomedical equipment only to 
discover later that some needs or expected features were not included with this installation, that those 
features are not yet approved for delivery, or that the installation has not been adequately planned. 

Many hospitals perform technology assessment activities to project needs for new assets and to better 
manage existing assets. Because the task is complex, an interdisciplinary approach and a cooperative 
attitude among the assessment team leadership is required. The ability to integrate information from 
disciplines such as clinical, technical, financial, administrative, and facility in a timely and objective 
manner is critical to the success of the assessment. This chapter emphasizes how technology assessment 
fits within a technology planning and management program and recognizes the importance of corporate 
skills forecasting medical equipment changes and determining the impact of changes on the hospital's 
market position. Within the technology planning and management program, the focus on capital assets 
management of medical equipment should not lead to the exclusion of accessories, supplies, and the 
disposables also required. 

Medical equipment has a life cycle that can be identified as (1) the innovation phase, which includes 
the concept, basic and applies research, and development, and (2) the adoption phase, which begins with 
the clinical studies, through diffusion, and then widespread use. These phases are different from each 
other in the scope of professional skills involved, their impact on patient care, compliance with regulatory 
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requirements, and the extent of the required operational support. In evaluating the applicability of a 
device or a system for use in the hospital, it is important to note in which phase of its life cycle the 
equipment currently resides. 

Technology Assessment Process 

More and more hospitals are faced with the difficult phenomenon of a capital equipment requests list 
that is much larger than the capital budget allocation. The must difficult decision, then, is the one that 
matches clinical needs with the financial capability. In doing so, the following questions are often raised: 
How do we avoid costly technology mistakes? How do we wisely target capital dollars for technology? 
How do we avoid medical staff conflicts as they relate to technology? How do we control equipment- 
related risks? and How do we maximize the useful life of the equipment or systems while minimizing 
the cost ownership? A hospital's clinical engineering department can assist in providing the right answers 
to these questions. 

Technology assessment is a component of technology planning that begins with the analysis of the 
hospital's existing technology base. It is easy to perceive then that technology assessment, rather than an 
equipment comparison, is a new major function for a clinical engineering department [8] . It is important 
that clinical engineers be well prepared for the challenge. They must have a full understanding of the 
mission of their particular hospitals, a familiarity with the health care delivery system, and the cooperation 
of hospital administrators and the medical staff. To aid in the technology assessment process, clinical 
engineers need to utilize the following tools: (1) access to national database services, directories, and 
libraries, (2) visits to scientific and clinical exhibits, (3) a network with key industry contacts, and (4) a 
relationship with peers throughout the country [9]. 

The need for clinical engineering involvement in the technology assessment process becomes evident 
when recently purchased equipment or its functions are underutilized, users have ongoing problems with 
equipment, equipment maintenance costs become excessive, the hospital is unable to comply with 
standards or guidelines (i.e., JCAHO requirements) for equipment management, a high percentage of 
equipment is awaiting repair, or training for equipment operators is inefficient due to shortage of allied 
health professionals. A deeper look at the symptoms behind these problems would likely reveal a lack of 
a central clearinghouse to collect, index, and monitor all technology-related information for future 
planning purposes, the absence of procedures for identifying emerging technologies for potential acqui- 
sition, the lack of a systematic plan for conducting technology assessment, resulting in an ability to 
maximize the benefits from deployment of available technology, the inability to benefit from the orga- 
nization's own previous experience with a particular type of technology, the random replacement of 
medical technologies rather than a systematic plan based on a set of well-developed criteria, and/or the 
lack of integration of technology acquisition into the strategic and capital planning of the hospital. 

To address these issues, efforts to develop a technology microassessment process were initiated at one 
leading private hospital with the following objectives: ( 1 ) accumulate information on medical equipment, 
(2) facilitate systematic planning, (3) create an administrative structure supporting the assessment process 
and its methodology, (4) monitor the replacement of outdated technology, and (5) improve the capital 
budget process by focusing on long-term needs relative to the acquisition of medical equipment [10]. 

The process, in general, and the collection of up-to-date pertinent information, in particular, require 
the expenditure of certain resources and the active participation of designated hospital staff in networks 
providing technology assessment information. For example, corporate membership in organizations and 
societies that provide such information needs to be considered, as well as subscriptions to certain 
computerized database and printed sources [11]. 

At the example hospital, and MTAC was formed to conduct technology assessment. It was chaired by 
the director of clinical engineering. Other managers from equipment user departments usually serve as 
the MTAC's designated technical coordinators for specific task forces. Once the committee accepted a 
request from an individual user, it identified other users that might have an interest in that equipment 
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or system and authorized the technical coordinator to assemble a task force consisting of users identified 
by the MTAC. This task force then took responsibility for the establishment of performance criteria that 
would be used during this particular assessment. The task force also should answer the questions of 
effectiveness, safety, and cost-effectiveness as they relate to the particular assessment. During any specific 
period, there may be multiple task forces, each focusing on a specific equipment investigation. 

The task force technical coordinator cooperates with the material management department in con- 
ducting a market survey, in obtaining the specified equipment for evaluation purposes, and in scheduling 
vendor-provided in-service training. The coordinator also confers with clinical staff to determine if they 
have experience with the equipment and the maturity level of the equipment under assessment. After 
establishment of a task force, the MTACs technical coordinator is responsible for analyzing the clinical 
experiences associated with the use of this equipment, for setting evaluation objectives, and for devising 
appropriate technical tests in accord with recommendations from the task force. Only equipment that 
successfully passes the technical tests will proceed to a clinical trial. During the clinical trial, a task force- 
appointed clinical coordinator collects and reports a summary of experiences gained. The technical 
coordinator then combines the results from both the technical tests and the clinical trial into a summary 
report for MTAC review and approval. In this role, the clinical engineer/technical coordinator serves as 
a multidisciplinary professional, bridging the gap between the clinical and technical needs of the hospital. 
To complete the process, financial staff representatives review the protocol. 

The technology assessment process at this example hospital begins with a department or individual 
filling out two forms: (1) a request for review (RR) form and (2) a capital asset request (CAR) form. 
These forms are submitted to the hospital's product standards committee, which determines if an 
assessment process is to be initiated, and the priority for its completion. It also determines if a previously 
established standard for this equipment already exists (if the hospital is already using such a technol- 
ogy) — if so, an assessment is not needed. 

On the RR, the originator delineates the rationale for acquiring the medical device. For example, the 
originator must tell how the item will improve quality of patient care, who will be its primary user, and 
how it will improve ease of use. On the CAR, the originator describes the item, estimates its cost, and 
provides purchase justification. The CAR is then routed to the capital budget office for review, during 
this process, the optimal financing method for acquisition is determined. If funding is secured, the CAR 
is routed to the material management department, where, together with the RR, it will be processed. The 
rationale for having the RR accompany the CAR is to ensure that financial information is included as 
part of the assessment process. The CAR is the tool by which the purchasing department initiates a 
market survey and later sends product requests for bid. Any request for evaluation that is received without 
a CAR or any CAR involving medical equipment that is received without a request for evaluation is 
returned to the originator without action. Both forms are then sent to the clinical engineering department, 
where a designated technical coordinator will analyze the requested technology maturity level and results 
of clinical experience with its use, review trends, and prioritize various manufactures' presentations for 
MTAC review. 

Both forms must be sent to the MTAC if the item requested is not currently used by the hospital or 
if it does not conform to previously adopted hospital standards. The MTAC has the authority to recom- 
mend either acceptance or rejection of any request for review, based on a consensus of its members. A 
task force consisting of potential equipment users will determine the "must have" equipment functions, 
review the impact of the various equipment configurations, and plan technical and clinical evaluations. 

If the request is approved by the MTAC, the requested technology or equipment will be evaluated 
using technical and performance standards. Upon completion of the review, a recommendation is 
returned to the hospital's products standard committee, which reviews the results of the technology 
assessment, determines whether the particular product is suitable as a hospital standard, and decides if 
its should be purchased. If approved, the request to purchase will be reviewed by the capital budget 
committee (CBC) to determine if the required expenditure meets with available financial resources and 
if or when it may be feasible to make the purchase. To ensure coordination of the technology assessment 
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program, the chairman of the MTAC also serves as a permanent member of the hospital's CBC. In this 
way, there is a planned integration between technology assessment and budget decisions. 

168.4 Equipment Assets Management 

An accountable, systemic approach will ensure that cost-effective, efficacious, safe, and appropriate 
equipment is available to meet the demands of quality patient care. Such an approach requires that 
existing medical equipment resources be managed and that the resulting management strategies have 
measurable outputs that are monitored and evaluated. Technology managers/clinical engineers are well 
positioned to organize and lead this function. It is assumed that cost accounting is managed and 
monitored by the health care organization's financial group. 

Equipment Management Process 

Through traditional assets management strategies, medical equipment can be comprehensively managed 
by clinical engineering personnel. First, the management should consider a full range of strategies for 
equipment technical support. Plans may include use of a combination of equipment service providers 
such as manufacturers, third-party service groups, shared services, and hospital-based (in-house) engi- 
neers and biomedical equipment technicians (BMETs). All these service providers should be under the 
general responsibility of the technology manager to ensure optimal equipment performance through 
comprehensive and ongoing best-value equipment service. After obtaining a complete hospital medical 
equipment inventory (noting both original manufacturer and typical service provider), the management 
should conduct a thorough analysis of hospital accounts payable records for at least the past 2 years, 
compiling all service reports and preventative maintenance-related costs from all possible sources. The 
manager then should document in-house and external provider equipment service costs, extent of 
maintenance coverage for each inventory time, equipment-user operating schedule, quality of mainte- 
nance coverage for each item, appropriateness of the service provider, and reasonable maintenance costs. 
Next, he or she should establish an effective equipment technical support process. With an accurate 
inventory and best-value service providers identified, service agreements/contracts should be negotiated 
with external providers using prepared terms and conditions, including a log-in system. There should 
be an in-house clinical engineering staff ensuring ongoing external provider cost control utilizing several 
tools. By asking the right technical questions and establishing friendly relationships with staff, the 
manager will be able to handle service purchase orders (POs) by determining if equipment is worth 
repairing and obtaining exchange prices for parts. The staff should handle service reports to review them 
for accuracy and proper use of the log-in system. They also should match invoices with the service reports 
to verify opportunities and review service histories to look for symptoms such as need for user training, 
repeated problems, run-on calls billed months apart, or evidence of defective or worn-out equipment. 
The manager should take responsibility for emergency equipment rentals. Finally, the manager should 
develop, implement, and monitor all the service performance criteria. 

To optimize technology management programs, clinical engineers should be willing to assume respon- 
sibilities for technology planning and management in all related areas. They should develop policies and 
procedures for their hospital's management program. With life-cycle costs determined for key high-risk 
or high-cost devices, they should evaluate methods to provide additional cost savings in equipment 
operation and maintenance. They should be involved with computer networking systems within the 
hospital. As computer technology applications increase, the requirements to review technology-related 
information in a number of hospital locations will increase. They should determine what environmental 
conditions and facility changes are required to accommodate new technologies or changes in standards 
and guidelines. Lastly, they should use documentation of equipment performance and maintenance costs 
along with their knowledge of current clinical practices to assist other hospital personnel in determining 
the best time and process for planning equipment replacement [12]. 
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Technology Management Activities 

A clinical engineering department, through outstanding performance in traditional equipment manage- 
ment, will win its hospital's support and will be asked to be involved in a full range of technology 
management activities. The department should start an equipment control program that encompasses 
routine performance testing, inspection, periodic and preventive maintenance, on-demand repair ser- 
vices, incidents investigation, and actions on recalls and hazards. The department should have multidis- 
ciplinary involvement in equipment acquisition and replacement decisions, development of new services, 
and planning of new construction and major renovations, including intensive participation by clinical 
engineering, materials management, and finance. The department also should initiate programs for 
training all users of patient care equipment, quality improvement (QI), as it relates to technology use, 
and technology-related risk management [13]. 

Case Study: A Focus on Medical Imaging 

In the mide- 1980s, a large private multihospital system contemplated the startup of a corporate clinical 
engineering program. The directors recognized that involvement in a diagnostic imaging equipment 
service would be key to the economic success of the program. They further recognized that maintenance 
cost reductions would have to be balanced with achieving equal or increased quality of care in the 
utilization of that equipment. 

Programs startup was in the summer of 1987 in 3 hospitals that were geographically close. Within the 
first year, clinical engineering operations began in 1 1 hospitals in 3 regions over a two-state area. By the 
fall of 1990, the program included 7 regions and 21 hospitals in a five-state area. The regions were 
organized, typically, into teams including a regional manager and 10 service providers, serving 3 to 4 
hospitals, whose average size was 225 beds. Although the staffs were stationed at the hospitals, some 
specialists traveled between sites in the region to provide equipment service. Service providers included 
individuals specializing in the areas of diagnostic imaging [x-ray and computed tomography (CT)], 
clinical laboratory, general biomedical instrumentation, and respiratory therapy. 

At the end of the first 18 months, the program documented over $1 million in savings for the initial 
11 hospitals, a 23% reduction from the previous annual service costs. Over 63% of these savings were 
attributable to "in-house" service x-ray and CT scanner equipment. The mix of equipment maintained 
by 1 1 imagining service providers — from a total staff of 30 — included approximately 75% of the radiology 
systems of any kind found in the hospitals and 5 models of CT scanners from the three different 
manufacturers. 

At the end of 3 years in 1990, program-wide savings had exceeded 30% of previous costs for partici- 
pating hospitals. Within the imaging areas of the hospitals, savings approached and sometimes exceed 
50% of initial service costs. The 30 imaging service providers — out of a total staff of 62 — had increased 
their coverage of radiology equipment to over 95%, had increased involvement with CT to include nine 
models from five different manufacturers, and had begun in-house work in other key imaging modalities. 

Tracking the financial performance of the initial 1 1 hospitals over the first 3 years of the program 
yields of the following composite example: A hospital of 225 beds was found to have equipment service 
costs of $540,000 prior to program startup. Sixty-three percent of these initial costs (or $340,000) was 
for the maintenance of the hospital's x-ray and CT scanner systems. Three years later, annual service 
costs for this equipment were cut in half, to approximately $170,000. That represents a 31% reduction 
in hospital-wide costs due to the imaging service alone. 

This corporate clinical engineering operation is, in effect, a large in-house program serving many 
hospitals that all have common ownership. The multihospital corporation has significant purchasing 
power in the medical device marketplace and provides central oversight of the larger capital expenditures 
for its hospitals. The combination of the parent organization's leverage and the program's commitment 
to serve only hospitals in the corporation facilitated the development of positive relationships with 
medical device manufacturers. Most of the manufacturers did not see the program as competition but 
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rather as a potentially helpful ally in the future marketing and sales of their equipment and systems. 
What staff provided these results? All service providers were either medical imaging industry or military 
trained. All were experienced at troubleshooting electronic subsystems to component level, as necessary. 
Typically, these individuals had prior experience on the manufacture's models of equipment under their 
coverage. Most regional managers had prior industry, third party, or in-house imaging service manage- 
ment experience. Each service provider had the test equipment necessary for day-to-day duties. Each 
individual could expect at least 2 weeks of annual service training to keep appropriate skills current. 
Desired service training could be acquired in a timely manner from manufactures and/or third-party 
organizations. Spare or replacement parts inventory was minimal because of the program's ability to get 
parts from manufacturers and other sources either locally or shipped in overnight. 

As quality indicators for the program, the management measured user satisfaction, equipment down- 
time, documentation of technical staff service training, types of user equipment errors and their effect 
on patient outcomes, and regular attention to hospital technology problems. User satisfaction surveys 
indicated a high degree of confidence in the program service providers by imaging department mangers. 
Problems relating to technical, management, communication, and financial issues did occur regularly, 
but the regional manager ensured that they were resolved in a timely manner. Faster response to daily 
imaging equipment problems, typically by on-site service providers, coupled with regular preventive 
maintenance (PM) according to established procedures led to reduced equipment downtime. PM and 
repair service histories were captured in a computer documentation system that also tracked service 
times, costs, and user errors and their effects. Assisting the safety committee became easier with ability 
to draw a wide variety of information quickly from the program's documenting system. 

Early success in imaging equipment led to the opportunity to do some additional value-added projects 
such as the moving and reinstallation of x-ray rooms that preserved exiting assets and opened up valuable 
space for installation of newer equipment and upgrades of CT scanner systems. The parent organization 
came to realize that these technology management activities could potentially have a greater financial 
and quality impact on the hospital's health care delivery than equipment management. In the example 
of one CT upgrade (which was completed over two weekends with no downtime), there was a positive 
financial impact in excess of $600,000 and improved quality of care by allowing faster off-line diagnosis 
of patient scans. However, opportunity for this kind of contribution would never have occurred without 
the strong base of a successful equipment management program staffed with qualified individuals who 
receive ongoing training. 

168.5 Equipment Acquisition and Deployment 

Process of Acquiring Technology 

Typically, medical device systems will emerge from the strategic technology planning and technology 
assessment processes as required and budgeted needs. At acquisition time, a needs analysis should be 
conducted, reaffirming clinical needs and device intended applications. The "request for review" docu- 
mentation from the assessment process or capital budget request and incremental financial analysis from 
the planning process may provide appropriate justification information, and a capital asset request (CAR) 
form should be completed [14]. Materials management and clinical engineering personnel should ensure 
that this item is a candidate for centralized and coordinated acquisition of similar equipment with other 
hospital departments. Typical hospital prepurchase evaluation guidelines include an analysis of needs 
and development of a specification list, formation of a vendor list and requesting proposals, analyzing 
proposals and site planning, evaluating samples, selecting finalists, making the award, delivery and 
installation, and acceptance testing. Formal request for proposals (RFPs) from potential equipment 
vendors are required for intended acquisitions whose initial or life-cycle cost exceeds a certain threshold, 
i.e., $100,000. Finally, the purchase takes place, wherein final equipment negotiations are conducted and 
purchase documents are prepared, including a purchase order. 
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Acquisition Process Strategies 

The cost-of-ownership concept can be used when considering what factors to include in cost comparisons 
of competing medical devices. Cost of ownership encompasses all the direct and indirect expenses 
associated with medical equipment over its lifetime [15]. It expresses the cost factors of medical equip- 
ment for both the initial price of the equipment (which typically includes the equipment, its installation, 
and initial training cost) and over the long term. Long-term costs include ongoing training, equipment 
service, supplies, connectivity, upgrades, and other costs. Health care organizations are just beginning 
to account for a full range of cost-of-ownership factors in their technology assessment and acquisition 
processes, such as acquisition costs, operating costs, and maintenance costs (installation, supplies, down- 
time, training, spare parts, test equipment and tools, and depreciation). It is estimated that the purchase 
price represents only 20% of the life-cycle cost of ownership. 

When conducting needs analysis, actual utilization information form the organization's existing same 
or similar devices can be very helpful. One leading private multihospital system has implemented the 
following approach to measuring and developing relevant management feedback concerning equipment 
utilization. It is conducting equipment utilization review for replacement planning, for ongoing account- 
ability of equipment use, and to provide input before more equipment is purchased. This private system 
attempts to match product to its intended function and to measure daily (if necessary) the equipment's 
actual utilization. The tools they use include knowing their hospital's entire installed base of certain kinds 
of equipment, i.e., imaging systems. Utilization assumptions for each hospital and its clinical procedural 
mix are made. Equipment functional requirements to meet the demands of the clinical procedures are 
also taken into account. 

Life-cycle cost analysis is a tool used during technology planning, assessment, or acquisition "either 
to compare high-cost, alternative means for providing a service or to determine whether a single project 
or technology has a positive or negative economic value. The strength of the life-cycle cost analysis is 
that it examines the cash flow impact of an alternative over its entire life, instead of focusing solely on 
initial capital investments" [15]. 

"Life-cycle cost analysis facilitates comparisons between projects or technologies with large initial cash 
outlays and those with level outlays and inflows over time. It is most applicable to complex, high-cost 
choices among alternative technologies, new service, and different means for providing a given service. 
Life-cycle cost analysis is particularly useful for decisions that are too complex and ambiguous for 
experience and subjective judgment alone. It also helps decision makers perceive and include costs that 
often are hidden or ignored, and that may otherwise invalidate results" [12]. 

"Perhaps the most powerful life-cycle cost technique is net present value (NPV) analysis, which 
explicitly accounts for inflation and foregone investment opportunities by expressing future cash flows 
in present dollars" [12]. 

Examples where LCC and NPV analysis prove very helpful are in deciding whether to replace/rebuild 
or buy/lease medical imaging equipment. The kinds of costs captured in life-cycle cost analysis, include 
decision-making costs, planning agency/certificate of need costs (if applicable), financing, initial capital 
investment costs including facility changes, life-cycle maintenance and repairs costs, personnel costs, and 
other (reimbursement consequences, resale, etc.). 

One of the best strategies to ensure that a desired technology is truly of value to the hospital is to 
conduct a careful analysis in preparation for its assimilation into hospital operations. The process of 
equipment prepurchase evaluation provides information that can be used to screen unacceptable per- 
formance by either the vendor or the equipment before it becomes a hospital problem. 

Once the vendor has responded to informal requests or formal RFPs, the clinical engineering depart- 
ment should be responsible for evaluating the technical response, while the materials management 
department should devaluate the financial responses. 

In translating clinical needs into a specification list, key features or "must have" attributes of the desired 
device are identified. In practice, clinical engineering and materials management should develop a "must 
have" list and an extras list. The extras list contains features that may tip the decision in favor of one 
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vendor, all other factors being even. These specification lists are sent to the vendor and are effective in 
a self-elimination process that results in a time savings for the hospital. Once the "must have" attributes 
have been satisfied, the remaining candidate devices are evaluated technically, and the extras are consid- 
ered. This is accomplished by assigning a weighting factor (i.e., to 5) to denote the relative importance 
of each of the desired attributes. The relative ability of each device to meet the defined requirements is 
then rated [15]. 

One strategy that strengthens the acquisition process is the conditions-of-sale document. This multi- 
faceted document integrates equipment specifications, performance, installation requirements, and fol- 
low-up services. The conditions-of-sale document ensures that negotiations are completed before a 
purchase order is delivered and each participant is in agreement about the product to be delivered. As a 
document of compliance, the conditions-of-sale document specifies the codes and standards having 
jurisdiction over that equipment. This may include provisions for future modification of the equipment, 
compliance with standards under development, compliance with national codes, and provision for 
software upgrades. 

Standard purchase orders that include the conditions of sale for medical equipment are usually used 
to initiate the order. At the time the order is placed, clinical engineering is notified of the order. In 
addition to current facility conditions, the management must address installation and approval require- 
ments, responsibilities, and timetable; payment, assignment, and cancellation; software requirements and 
updates; documentation; clinical and technical training; acceptance testing (hospital facility and vendor); 
warranty, spare parts, and service; and price protection. 

All medical equipment must be inspected and tested before it is placed into service regardless of 
whether it is purchased, leased, rented, or borrowed by the hospital. In any hospital, clinical engineering 
should receive immediate notification if a very large device or system is delivered directly into another 
department (e.g., imaging or cardiology) for installation. Clinical engineering should be required to sign 
off on all purchase orders for devices after installation and validation of satisfactory operation. Ideally, 
the warranty period on new equipment should not begin until installation and acceptance testing are 
completed. It is not uncommon for a hospital to lose several months of free parts and service by the 
manufacturer when new equipment is, for some reason, not installed immediately after delivery. 

Clinical Team Requirements 

During the technology assessment and acquisition processes, clinical decision makers analyze the follow- 
ing criteria concerning proposed technology acquisitions, specifically as they relate to clinical team 
requirements: ability of staff to assimilate the technology, medical staff satisfaction (short term and long 
term), impact on staffing (numbers, functions), projected utilization, ongoing related supplies required, 
effect on delivery of care and outcomes (convenience, safety, or standard of care), result of what is written 
in the clinical practice guidelines, credentialling of staff required, clinical staff initial and ongoing training 
required, and the effect on existing technology in the department or on other services/departments. 

Defining Terms 

Appropriate technology [ 1 ] : A term used initially in developing countries, referring to selecting medical 
equipment that can "appropriately" satisfy the following constraints: funding shortages, insufficient 
numbers of trained personnel, lack of technical support, inadequate supplies of consumables/acces- 
sories, unreliable water an power utilities/supplies, and lack of operating and maintenance manuals. 
In the context of this chapter, appropriate technology selection must taken into consideration local 
health needs and disease prevalence, the need for local capability of equipment maintenance, and 
availability of resources for ongoing operational and technical support. 

Clinical engineers/biomedical engineers: As we began describing the issues with the management of 
medical technology; it became obvious that some of the terms are being used interchangeably in 
the literature. For example, the terms engineers, clinical engineers, biomedical equipment technicians, 
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equipment managers, and health care engineers are frequently used. For clarification, in this chapter 
we will refer to clinical engineers and the clinical engineering department as a representative group 
for all these terms. 

Cost-effectiveness [1]: A mixture of quantitative and qualitative considerations. It includes the health 
priorities of the country or region at the macro assessment level and the community needs at the 
institution micro assessment level. Product life-cycle cost analysis (which, in turn, includes initial 
purchase price, shipping, renovations, installation, supplies, associated disposables, cost per use, 
and similar quantitative measures) is a critical analysis measure. Life-cycle cost also takes into 
account staff training, ease of use, service, and many other cost factors. But experience and 
judgement about the relative importance of features and the ability to fulfill the intended purpose 
also contribute critical information to the cost-effectiveness equation. 

Equipment acquisition and deployment: Medical device systems and products typically emerge from 
the strategic technology planning process as "required and budgeted" needs. The process that 
follows, which ends with equipment acceptance testing and placement into general use, is known 
as the equipment acquisition and deployment process. 

Health care technology: Health care technology includes the devices, equipment, systems, software, 
supplies, pharmaceuticals, biotechnologies, and medical and surgical procedures used in the pre- 
vention, diagnosis, and treatment of disease in humans, for their rehabilitation, and for assistive 
purposes. In short, technology is broadly denned as encompassing virtually all the human inter- 
ventions intended to cope with disease and disabilities, short of spiritual alternatives. This chapter 
focuses on medical equipment products (devices, systems, and software) rather than pharmaceu- 
ticals, biotechnologies, or procedures [ 1 ] . The concept of technology also encompasses the facilities 
that house both patients and products. Facilities cover a wide spectrum — from the modern hospital 
on one end to the mobile imaging trailer on the other. 

Quality of care (QA) and quality of improvement (QI): Quality assurance (QA) and Quality improve- 
ment (QI) are formal sets of activities to measure the quality of care provided; these usually include 
a process for selecting, monitoring, and applying corrective measures. The 1994 Joint Commission 
on the Accreditation of Healthcare Organizations (JCAHO) standards require hospital QA, pro- 
grams to focus on patient outcomes as a primary reference. JCAHO standards for plant, technology, 
and safety management (PTSM), in turn, require certain equipment management practices and 
QA or QI activities. Identified QI deficiencies may influence equipment planning, and QI audits 
may increase awareness of technology overuse or under utilization. 

Risk management: Risk management is a program that helps the hospital avoid the possibility of risks, 
minimize liability exposure, and stay compliant with regulatory reporting requirements. JCAHO 
PTSM standards require minimum technology-based risk-management activities. These include 
clinical engineering's determination of technology- related incidents with follow-up steps to prevent 
recurrences and evaluation and documentation of the effectiveness of these steps. 

Safety: Safety is the condition of being safe from danger, injury, or damage. It is judgment about the 
acceptability of risk in a specified situation (e.g., for a given medical problem) by a provider with 
specified training at a specified type of facility equipment. 

Standards [ 1 ] : A wide variety of formal standards and guidelines related to health care technology now 
exists. Some standards apply to design, development, and manufacturing practices for devices, 
software, and pharmaceuticals; some are related to the construction and operation of a health care 
facility; some are safety and performance requirements for certain classes of technologies, such as 
standards related to radiation or electrical safety; and others relate to performance, or even con- 
struction specifications, for specific types of technologies. Other standards and guidelines deal with 
administrative, medical, and surgical procedures and the training of clinical personnel. Standards 
and guidelines are produced and/or adopted by government agencies, international organizations, 
and professional and specialty organizations and societies. ECRI's Healthcare Standards Directory 
lists over 20,000 individual standards and guidelines produced by over 600 organizations and 
agencies from North America alone. 
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Strategic technology planning: Strategic technology planning encompasses both technologies new to 
the hospital and replacements for existing equipment that are to be acquired over several quarters. 
Acquisitions can be proposed for reasons related to safety, standard-of-care issues, and age or 
obsolescence of existing equipment. Acquisitions also can proposed to consolidate several service 
areas, expand a service areas to reduce cost of service, or add a new service area. 

Strategic technology panning optimizes the way the hospital's capital resources contribute to its 
mission. It encourages choosing new technologies that are cost-effective, and it also allows the 
hospital to be competitive in offering state-of-the-art services. Strategic technology planning works 
for a single department, product line, or clinical service. It can be limited to one or several high- 
priority areas. It also can be used for an entire mulihospital system or geographic region [4], 

Technology assessment: Assessment of medical technology is any process used for examining and 
reporting properties of medical technology used in health care, such as safety, efficacy, feasibility, 
and indications for use, cost, and cost-effectiveness, as well as social, economic, and ethical con- 
sequences, whether intended or unintended [2] . A primary technology assessment is one that seeks 
new, previously nonexistent data through research, typically employing long-term clinical studies 
of the type described below. A secondary technology assessment is usually based on published 
data, interviews, questionnaires, and other information-gathering methods rather than original 
research that creates new, basic data. 

In technology assessment, there are six basic objectives that the clinical engineering department 
should have in mind. First, there should be ongoing monitoring of developments concerning new 
and emerging technologies. For new technologies, there should be an assessment of the clinical 
efficacy, safety, and cost/benefit ratio, including their effects on established technologies. There 
should be an evaluation of the short- and long-term costs and benefits of alternate approaches to 
managing specific clinical conditions. The appropriateness of existing technologies and their clin- 
ical uses should be estimated, while outmoded technologies should be identified and eliminated 
from their duplicative uses. The department should rate specific technology-based interventions 
in terms of improved overall value (quality and outcomes) to patients, providers, and payers. 
Finally, the department should facilitate a continuous uniformity between needs, offerings, and 
capabilities [3]. 

The locally based (hospital or hospital group) technology assessment described in this chapter 
is a process of secondary assessment that attempts to judge whether a certain medical equip- 
ment/product can be assimilated into the local operational environment. 

Technology diffusion [ 1 ] : The process by which a technology is spread over time in a social system. 
The progression of technology diffusion can be described in four stages. The emerging or applied 
research stage occurs around the time of initial clinical testing. In the new stage, the technology 
has passed the phase of clinical trials but is not yet in widespread use. During the established stage, 
the technology is considered by providers to be a standard approach to a particular condition and 
diffuses into general use. Finally, in the obsolete/outmoded stage, the technology is superseded by 
another and/or is demonstrated to be ineffective or harmful. 

Technology life cycle: Technology has a life cycle — a process by which technology is crated, tested, 
applied, and replaced or abandoned. Since the life cycle varies from basic research and innovation 
to obsolescence and abatement, it is critical to know the maturity of a technology prior to making 
decisions regarding its adoption. Technology forecast assessment of pending technological changes 
are the investigative tools that support systematic and rational decisions about the utilization of a 
given institution's technological capabilities. 

Technology planning and management [ 3 ] : Technology planning and management are an accountable, 
systematic approach to ensuring that cost-effective, efficacious, appropriate, and safe equipment 
is available to meet the demands of quality patient care and allow an institution to remain com- 
petitive. Elements include in-house service management, management and analysis of equipment 
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external service providers, involvement in the equipment acquisition process, involvement of 
appropriate hospital personnel in facility planning and design, involvement in reducing technol- 
ogy-related patient and staff incidents, training equipment users, reviewing equipment replacement 
needs, and ongoing assessment of emerging technologies [4] . 
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169.1 Risk Management: A Definition 

Inherent in the definition of risk management is the implication that the hospital environment cannot 
be made risk-free. In fact, the nature of medical equipment — to invasively or noninvasively perform 
diagnostic, therapeutic, corrective, or monitoring intervention on behalf of the patient — implies that risk 
is present. Therefore, a standard of acceptable risk must be established that defines manageable risk in a 
real-time economic environment. 

Unfortunately, a preexistent, quantitative standard does not exist in terms of, for instance, mean time 
before failure (MTBF), number of repairs or repair redos per equipment item, or cost of maintenance 
that provides a universal yardstick for risk management of medical equipment. Sufficient clinical man- 
agement of risk must be in place the can utilize safeguards, preventive maintenance, and failure analysis 
information to minimize the occurrence of injury or death to patient or employee or property damage. 
Therefore, a process must be put in place that will permit analysis of information and modification of 
the preceding factors to continuously move the medical equipment program to a more stable level of 
manageable risk. 

Risk factors that require management can be illustrated by the example of the "double-edge" sword 
concept of technology (see Fig. 169.1). The front edge of the sword represents the cutting edge of 
technology and its beneficial characteristics: increased quality, greater availability of technology, timeliness 
of test results and treatment, and so on. The back edge of the sword represents those liabilities which 
must be addressed to effectively manage risk: the hidden costs discussed in the next paragraph, our 
dependence on technology, incompatibility of equipment, and so on [1]. 

For example, the purchase and installation of a major medical equipment item may only represent 
20% of the lifetime cost of the equipment [2]. If the operational budget of a nursing floor does not 
include the other 80% of the equipment costs, the budget constraints may require cutbacks where they 
appear to minimally affect direct patient care. Preventive maintenance, software upgrades that address 
"glitches," or overhaul requirements may be seen as unaffordable luxuries. Gradual equipment deterio- 
ration without maintenance may bring the safety level below an acceptable level of manageable risk. 
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FIGURE 169.1 Double-edged sword concept of risk management. 

Since economic factors as well as those of safety must be considered, a balanced approach to risk 
management that incorporates all aspects of the medical equipment lifecycle must be considered. 

The operational flowchart in Fig. 169.2 describe the concept of medical equipment life-cycle manage- 
ment from the clinical engineering department viewpoint. The flowchart includes planning, evaluation, 
and initial purchase documentation requirements. The condition of sale, for example, ensures that 
technical manuals, training, replacement parts, etc. are received so that all medical equipment might be 
fully supported in-house after the warranty period. Introduction to the preventive maintenance program, 
unscheduled maintenance procedures, and retirement justification must be part of the process. Institu- 
tional-wide cooperation with the life-cycle concept requires education and patience to convince health 
care providers of the team approach to managing medical equipment technology. 

This balanced approach requires communication and comprehensive planning by a health care team 
responsible for evaluation of new and shared technology within the organization. A medical technology 
evaluation committee (see Fig. 169.3), composed of representatives from administration, medical staff, 
nursing, safety department, biomedical engineering, and various services, can be an effective platform 
for the integration of technology and health care. Risk containment is practiced as the committee reviews 
not only the benefits of new technology but also the technical and clinical liabilities and provides a 
6-month followup study to measure the effectiveness of the selection process. The history of risk man- 
agement in medical equipment management provides helpful insight into its current status and future 
direction. 

169.2 Risk Management: Historical Perspective 

Historically, risk management of medical equipment was the responsibility of the clinical engineer 
(Fig. 169.4). The engineer selected medical equipment based on individual clinical department consul- 
tations and established preventive maintenance (PM) programs based on manufacturer's recommenda- 
tion and clinical experience. The clinical engineer reviewed the documentation and "spot-checked" 
equipment used in the hospital. The clinical engineer met with biomedical supervisors and technicians 
to discuss PM completion and to resolve repair problems. The clinical engineer then attempted to analyze 
failure information to avoid repeat failure. 
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FIGURE 169.3 Medical technology evaluation committee. 
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FIGURE 169.4 Operational flowchart. 

However, greater public awareness of safety issues, increasing equipment density at the bed-side, more 
sophisticated software-driven medical equipment, and financial considerations have made it more diffi- 
cult for the clinical engineer to singularly handle risk issues. In addition, the synergistic interactions of 
various medical systems operating in proximity to one another have added another dimension to the 
risk formula. It is not only necessary for health care institutions to manage risk using a team approach, 
but it is also becoming apparent that the clinical engineer requires more technology-intensive tools to 
effectively contribute to the team effort [3]. 
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1 00 Medical Equipment Operator Error 

101 Medical Equipment Failure 

102 Medical Equipment Physical Damage 

103 Reported Patient Injury 

104 Reported Employee Injury 

1 05 Medical Equipment Failed PM 
1 08 Medical Equipment MBA 

FIGURE 169.5 Failure codes. 



169.3 Risk Management: Strategies 



Reactive risk management is an outgrowth of the historical attitude in medical equipment management 
that risk is an anomaly that surfaces in the form of a failure. If the failure is analyzed and proper 
operational procedures, user in-services, and increased maintenance are supplied, the problem will 
disappear and personnel can return to their normal work. When the next failure occurs, the algorithm 
is repeated. If the same equipment fails, the algorithm is applied more intensely. This is a useful but not 
comprehensive component of risk management in the hospital. In fact, the traditional methods of 
predicting the reliability of electronic equipment from field failure data have not been very effective [4]. 
The health care environment, as previously mentioned, inherently contains risk that must be maintained 
at a manageable level. A reactive tool cannot provide direction to a risk-management program, but it 
can provide feedback as to its efficiency. 

The engine of the reactive risk-management tool is a set of failure codes (see Fig. 169.5) that flag 
certain anomalous conditions in the medical equipment management program. If operator training needs 
are able to be identified, then codes 100, 102, 104, and 108 (MBA equipment returned within 9 days for 
a subsequent repair) may be useful. If technician difficulties in handling equipment problems are of 
concern, then 108 may be of interest. The key is to develop failure codes not in an attempt to define all 
possible anomaly modalities but for those which can clearly be defined and provide unambiguous direction 
for the correction process. Also, the failure codes should be linked to equipment type, manufacturer/model, 
technician service group, hospital, and clinical department. Again, when the data are analyzed, will the 
result be provided to an administrator, engineer, clinical departmental director, or safety department? 
This should determine the format in which the failure codes are presented. 

A report intended for the clinical engineer might be formatted as in Fig. 169.6. It would consist of 
two part, sorted by equipment type and clinical department (not shown). The engineer's report shows 
the failure code activity for various types of equipment and the distribution of those failure codes in 
clinical departments. 

Additionally, fast data-analysis techniques introduced by NASA permit the survey of large quantities 
of information in a three-dimensional display [5] (Fig. 169.7). This approach permits viewing time- 
variable changes from month to month and failure concentration in specific departments and equipment 
types. 

The importance of the format for failure modality presentation is critical to its usefulness and accep- 
tance by health care professionals. For instance, a safety director requests the clinical engineer to provide 
a list of equipment that, having failed, could have potentially harmed a patient or employee. The safety 
director is asking the clinical engineer for a clinical judgment based on clinical as well as technical factors. 
This is beyond the scope of responsibility and expertise of the clinical engineer. However, the request 
can be addressed indirectly. The safety director's request can be addressed in two steps: first, providing 
a list of high-risk equipment (assessed when the medical equipment is entered into the equipment 
inventory) and, second, a clinical judgment based on equipment failure mode, patient condition, and so 
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FIGURE 169.7 Failure code analysis using a 3-D display. 
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on. The flowchart in Fig. 169.8 provides the safety director with useful information but does not require 
the clinical engineer to make an unqualified clinical judgment. If the "failed PM" failure code were selected 
from the list of high-risk medical equipment requiring repair, the failure would be identified by the 
technician during routine preventive maintenance and most likely the clinician still would find the 
equipment clinically efficacious. This condition is a "high risk, soft failure" or a high-risk equipment 
item whose failure is least likely to cause injury. If the "failed PM" code were not used, the clinician would 
question the clinical efficacy of the medical equipment item, and the greater potential for injury would be 
identified by "high risk, hard failure." Monitoring the distribution of high-risk equipment in these two 
categories assists the safety director in managing risk. 

Obviously, a more forward-looking tool is needed to take advantage of the failure codes and the 
plethora of equipment information available in a clinical engineering department. This proactive tool 
should use failure codes, historical information, the "expert" knowledge of the clinical engineer, and the 
baseline of an established "manageable risk" environment (perhaps not optimal but stable). 

The overall components and process flow for a proactive risk-management tool [6] are presented in 
Fig. 169.9. It consists of a two-component static risk factor, a two-component dynamic risk factor, and 
two "shaping" or feedback loops. 

The static risk factor classifies new equipment by a generic equipment type: defibrilator, electrocar- 
diograph, pulse oximeter, etc. When equipment is introduced into the equipment database, it is assigned 
to two different static risk (Fig. 169.10) categories [7]. The first is the equipment function that defines 
the application and environment in which the equipment item will operate. The degree of interaction 
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FIGURE 169.8 High-risk medical equipment failures. 

with the patient is also taken into account. For example, a therapeutic device would have a higher risk 
assignment than a monitoring or diagnostic device. The second component of the static risk factor is 
the physical risk category. It defines the worst-cases scenario in the event of equipment malfunction. The 
correlation between equipment function and physical risk on many items might make the two categories 
appear redundant. However, there are sufficient equipment types where there is not the case. A scale of 
1 to 25 is assigned to each risk category. The larger number is assigned to devices demonstrating greater 
risk because of their function or the consequences of device failure. The 1 to 25 scale is an arbitrary 
assignment, since a validated scale of risk factors for medical equipment, as previously described, is 
nonexistent. The risk points assigned to the equipment from these two categories are algebraically 
summed and designated the static risk factor. This value remains with the equipment type and the 
individual items within that equipment type permanently. Only if the equipment is used in a clinically 
variant way or relocated to a functionally different environment would this assignment be reviewed and 
changed. 

The dynamic component (Fig. 169.11) of the risk-management tool consists or two parts. The first is 
a maintenance requirement category that is divided into 25 equally spaced divisions, ranked by least 
(1) to greatest (25) average manhours per device per year. These divisions are scaled by the maintenance 
hours for the equipment type requiring the greatest amount of maintenance attention. The amount of 
nonplanned (repair) manhours from the previous 12 months of service reports is totaled for each 
equipment type. Since this is maintenance work on failed equipment items, it correlates with the risk 
associated with that equipment type. 

If the maintenance hours of an equipment type are observed to change to the point of placing it in a 
different maintenance category, a flag notifies the clinical engineer to review the equipment-type category. 
The engineer may increase the PM schedule to compensate for the higher unplanned maintenance hours. 
If the engineer believes the system "overacted," a "no" decision adjusts a scaling factor by a -5%. Progres- 
sively, the algorithm is "shaped" for the equipment maintenance program in that particular institution. 
However, to ensure that critical changes in the average manhours per device for each equipment type is 
not missed during the shaping period, the system is initialized. This is accomplished by increasing the 
average manhours per device for each equipment type to within 5% of the next higher maintenance 
requirement division. Thus the system is sensitized to variations in maintenance requirements. 
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EQUIPMENT 
FUNCTION 



25 Life Support 
23 Surgical & IC 
20 Physical Therapy & 
Treatment 

18 Surgical & IC 

Monitoring 
1 5 Additional Monitoring 

& Diagnostic 

1 3 Analytical Laboratory 

10 Laboratory Accessories 

8 Computer & Related 

5 Patient Related & Other 
Non-Patient 



PHYSICAL 
RISK 



25 Patient or Operator Death 
20 Patient or Operator Injury 
15 Inappropriate Therapy or 

Misdiagnosis 
10 Patient Discomfort 
5 No Significant Risks 




FIGURE 169.10 Static risk components. 



MAINTENANCE 
REQUIREMENT 



25 Divisions based on least to 
greatest Equipment Type annual 
unplanned maintenance requirements. 



Initialize Equipment Type 
categories by 5% adjustment 
within next higher category. 



Equipment Type 
Scaling Factor 



*1 



4> 



RISK 
POINTS 



Annual Risk Points assigned based 
on 9 risk categories. 



Points 


Risk 


+ 1 


Exceeds AHA Useful Life 


+ 2/2 


Patient/Employee Injury 


+ I 


Equipment Failure 


+ 1 


Exceeds MTBF 


+ 1 


Repair Redo (<7 day turnaround) 


+ 1 


User Operational Error 


+ 1 


PM Inspection Failure 


+ I 


Physical Damage 


-t 1 


PM Overdue 


Total Risk Points For Unit Devices 


Minus 


Equipment Type Annual Risk Points. 



Unit Scaling Factor 




FIGURE 169.11 Dynamic risk components. 



The baseline is now established for evaluating individual device risk. Variations in the maintenance 
requirement hours for any particular equipment type will, for the most part, only occur over a substantial 
period of time. For this reason, the maintenance requirement category is designated a "slow" dynamic 
risk element. 
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The second dynamic element assigns weighted risk points to individual equipment items for each 
unique risk occurrence. An occurrence is defined as when the device 

■ Exceeds the American Hospital Association Useful Life Table for Medical Equipment or exceeds 
the historical MTBF for that manufacturer and model 

■ Injures a patient or employee 

• Functionally fails or fails to pass a PM inspection 

• Is returned for repair or returned for rerepair within 9 days of a previous repair occurrence 

• Misses a planned maintenance inspection 

■ Is subjected to physical damage 

■ Was reported to have failed but the problem was determined to be a user operational error 

There risk occurrences include the failure codes previously described. Although many other risk occur- 
rences could be defined, these nine occurrences have been historically effective in managing equipment 
risk. The risk points for each piece or equipment are algebraically summed over the previous year. Since 
the yearly total is a moving window, the risk points will not continue to accumulate but will reflect a 
recent historical average risk. The risk points for each equipment type are also calculated. This provides 
a baseline to measure the relative risk of devices within an equipment type. The average risk points for 
the equipment type are subtracted from those for each piece of equipment within the equipment type. 
If the device has a negative risk point value, the device's risk is less than the average device in the equipment 
type. If positive, then the device has higher risk than the average device. This positive or negative factor 
is algebraically summed to the risk values from the equipment function, physical risk, and maintenance 
requirements. The annual risk points for an individual piece of equipment might change quickly over 
several months. For this reason, this is the "fast" component of the dynamic risk factor. 

The concept of risk has now been quantified in term of equipment function, physical risk, maintenance 
requirements, and risk occurrences. The total risk count for each device then places it in one of five risk 
priority groups that are based on the sum of risk points. These groups are then applied in various ways 
to determine repair triage, PM triage, educational and in-service requirements and test equipment/parts, 
etc. in the equipment management program. 

Correlation between the placement of individual devices in each risk priority group and the levels of 
planned maintenance previously assigned by the clinical engineer have shown that the proactive risk- 
management tool calculates a similar maintenance schedule as manually planned by the clinical engineer. 
In other words, the proactive risk-management tool algorithm places equipment items in a risk priority 
group commensurate with the greater or lesser maintenance as currently applied in the equipment 
maintenance program. 

As previously mentioned, the four categories and the 1 to 25 risk levels within each category are 
arbitrary because a "gold standard" for risk management is nonexistent. Therefore, the clinical engineer 
is given input into the dynamic components making up the risk factor to "shape the system" based on 
the equipment's maintenance history and the clinical engineer's experience. Since the idea of a safe medical 
equipment program involves "judgment about the acceptability of risk in a specified situation" [8], this 
experience is a necessary component of the risk-assessment tool for a specific health care setting. 

In the same manner, the system tracks the unit device's assigned risk priority group. If the risk points 
for a device change sufficiently to place it in a different group, it is flagged for review. Again, the clinical 
engineer reviews the particular equipment item and decides if corrective action is prudent. Otherwise, 
the system reduces the scaling factor by 5%. Over a period of time, the system will be "formed" to what 
is acceptable risk and what deserves closer scrutiny. 



169.4 Risk Management: Application 



The information can be made available to the clinical engineer in the form of a risk assessment report 
(see Fig. 169.12). The report lists individual devices by property tag number (equipment control number), 
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manufacturer, model, and equipment type. Assigned values for equipment function and physical risk are 
constant for each equipment type. The maintenance sensitizing factor enables the clinical engineer to 
control the algorithm's response to the maintenance level of an entire equipment type. These factors 
combine to produce the slow risk factor (equipment function + physical risk + maintenance require- 
ments). The unit risk points are multiplied for the unit scaling factor, which allows the clinical engineer 
to control the algorithm's response to static and dynamic risk components on individual pieces of 
equipment. This number is then added to the slow risk factor to determine the risk factor for each item. 
The last two columns are the risk priority that the automated system has assigned and the PM level set 
by the clinical engineer. This report provides the clinical engineer with information about medical 
equipment that reflects a higher than normal risk factor for the equipment type to which it belongs. 

The proactive risk management tool can be used to individually schedule medical equipment devices 
for PM based on risk assessment. For example, why should newer patient monitors be maintained at the 
same maintenance level as older units if the risk can be demonstrated to be less? The tool is used as well 
to prioritize the planned maintenance program. For instance, assume a PM cycle every 17 weeks is started 
on January 1 for a duration of 1 week. Equipment not currently available for PM can be inspected at a 
later time as a function of the risk priority group for that device. In other words, an equipment item with 
a risk priority of 2, which is moderately low, would not be overdue for 2/5 of the time between the current 
and the next PM start date or until the thirteenth week after the start of a PM cycle of 17 weeks. The 
technicians can complete more critical overdue equipment first and move on to less critical equipment later. 

Additionally, since PM is performed with every equipment repair, is it always necessary to perform 
the following planned PM? Assume for a moment that unscheduled maintenance was performed 10 weeks 
into the 17 weeks between the two PM periods discussed above. IF the equipment has a higher risk 
priority of the three, four, or five, the equipment is PMed as scheduled in April. However, if a lower 
equipment risk priority of one or two is indicated, the planned maintenance is skipped in April and 
resumed in July. The intent of this application is to reduce maintenance costs, preserve departmental 
resources, and minimize the war and tear on equipment during testing. 

Historically, equipment awaiting service has been placed in the equipment holding area and inspected 
on a first in, first out (FIFO) basis when a technician is available. A client's request to expedite the 
equipment repair was the singular reason for changing the work priority schedule. The proactive risk- 
management tool can prioritize the equipment awaiting repair, putting the critical equipment back into 
service more quickly, subject to the clinical engineer's review. 

169.5 Case Studies 

Several examples are presented of the proactive risk-assessment tool used to evaluate the performance 
of medical equipment within a program. 

The ventilators in Fig. 169.13 show a decreasing unit risk factor for higher equipment tag numbers. 
Since devices are put into service with ascending tag numbers and these devices are known to have been 
purchased over a period of time, the X axis represents a chronological progression. The ventilator risk 
factor is decreasing for newer units and could be attributable to better maintenance technique or man- 
ufacturer design improvements. This device is said to have a time-dependent risk factor. 

A final illustration uses two generations of infusion pumps from the same manufacturer. Figure 169.14 
shows the older vintage pump as Infusion Pump 1 and the newer version as Infusion Pump 2. A linear 
regression line for the first pump establishes the average risk factor as 53 with a standard deviation of 
2.02 for the 93 pumps in the analysis. The second pump, a newer version of the first, had an average risk 
factor of 50 with a standard deviation of 1.38 for 261 pumps. Both pumps have relatively time-indepen- 
dent risk factors. The proactive risk-management tool reveals that this particular brand of infusion pump 
in the present maintenance program is stable over time and the newer pump has reduced risk and 
variability of risk between individual units. Again, this could be attributable to tighter manufacturing 
control or improvements in the maintenance program. 
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UNITS IN CHRONOLOGICALLY ASCENDING ORDER 

FIGURE 169.13 Ventilator with time-dependent risk characteristics. 



Infusion Pump #1 
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FIGURE 169.14 Time-independent risk characteristics infusion pump #1. 
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169.6 Conclusions 

In summary, superior risk assessment within a medical equipment management program requires better 
communication, teamwork, and information analysis and distribution among all health care providers. 
Individually, the clinical engineer cannot provide all the necessary components for managing risk in the 
health care environment. Using historical information to only address equipment-related problems, after 
an incident, is not sufficient. The use of a proactive risk-management tool is necessary. 

The clinical engineer can use this tool to deploy technical resources in a cost-effective manner. In 
addition to the direct economic benefits, safety is enhanced as problem equipment is identified and 
monitored more frequently. The integration of a proactive risk-assessment tool into the equipment man- 
agement program can more accurately bring to focus technical resources in the health care environment. 
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The role, organization, and structure of clinical engineering departments in the modern health care 
environment continue to evolve. During the past 10 years, the rate of change has increased considerably 
faster than mere evolution due to fundamental changes in the management and organization of health 
care. Rapid, significant changes in the health care sector are occurring in the United States and in nearly 
every country. The underlying drive is primarily economic, the recognition that resources are finite. 

Indicators are essential for survival of organizations and are absolutely necessary for effective man- 
agement of change. Clinical engineering departments are not exceptions to this rule. In the past, most 
clinical engineering departments were task-driven and their existence justified by the tasks performed. 
Perhaps the most significant change occurring in clinical engineering practice today is the philosophical 
shift to a more business-oriented, cost-justified, bottom-line-focused approach than has been generally 
the case in the past. 

Changes in the health care delivery system will dictate that clinical engineering departments justify 
their performance and existence on the same basis as any business, the performance of specific functions 
at a high quality level and at a competitive cost. Clinical engineering management philosophy must 
change from a purely task-driven methodology to one that includes the economics of department 
performance. Indicators need to be developed to measure this performance. Indicator data will need to 
be collected and analyzed. The data and indicators must be objective and defensible. If it cannot be 
measured, it cannot be managed effectively. 

Indicators are used to measure performance and function in three major areas. Indicators should be 
used as internal measurements and monitors of the performance provided by individuals, teams, and 
the department. These essentially measure what was done and how it was done. Indicators are essential 
during quality improvement and are used to monitor and improve a process. A third important type of 
program indicator is the benchmark. It is common knowledge that successful businesses will continue 
to use benchmarks, even though differing terminology will be used. A business cannot improve its 
competitive position unless it knows where it stands compared with similar organizations and businesses. 
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Different indicators may be necessary depending on the end purpose. Some indicators may be able to 
measure internal operations, quality improvement, and external benchmarks. Others will have a more 
restricted application. 

It is important to realize that a single indicator is insufficient to provide the information on which to base 
significant decisions. Multiple indicators are necessary to provide cross-checks and verification. An example 
might be to look at the profit margin of a business. Even if the profit margin per sale is 100%, the business 
will not be successful if there are few sales. Looking at single indicators of gross or net profit will correct this 
deficiency but will not provide sufficient information to point the way to improvements in operations. 

170.1 Department Philosophy 

A successful clinical engineering department must define its mission, vision, and goals as related to the 
facility's mission. A mission statement should identify what the clinical engineering department does for 
the organization. A vision statement identifies the direction and future of the department and must 
incorporate the vision statement of the parent organization. Department goals are then identified and 
developed to meet the mission and vision statements for the department and organization. The goals 
must be specific and attainable. The identification of goals will be incomplete without at least implied 
indicators. Integrating the mission statement, vision statement, and goals together provides the clinical 
engineering department management with the direction and constraints necessary for effective planning. 
Clinical engineering managers must carefully integrate mission, vision, and goal information to 
develop a strategic plan for the department. Since available means are always limited, the manager must 
carefully assess the needs of the organization and available resources, set appropriate priorities, and 
determine available options. The scope of specific clinical engineering services to be provided can include 
maintenance, equipment management, and technology management activities. Once the scope of services 
is defined, strategies can be developed for implementation. Appropriate program indicators must then 
be developed to document, monitor, and manage the services to be provided. Once effective indicators 
are implemented, they can be used to monitor internal operations and quality-improvement processes 
and complete comparisons with external organizations. 

Monitoring Internal Operations 

Indicators may be used to provide an objective, accurate measurement of the different services provided 
in the department. These can measure specific individual, team, and departmental performance param- 
eters. Typical indicators might include simple tallies of the quantity or level of effort for each activity, 
productivity (quantify/effort), percentage of time spent performing each activity, percentage of scheduled 
IPMs (inspection and preventive maintenance procedures) completed within the scheduled period, mean 
time per job by activity, repair jobs not completed within 30 days, parts order for greater than 60 days, etc. 

Process for Quality Improvement 

When program indicators are used in a quality-improvement process, an additional step is required. 
Expectations must be quantified in terms of the indicators used. Quantified expectations result in the 
establishment of a threshold value for the indicator that will precipitate further analysis of the process. 
Indicators combined with expectations (threshold values of the indicators) identify the opportunities for 
program improvement. Periodic monitoring to determine if a program indicator is below (or above, 
depending on whether you are measuring successes or failures) the established threshold will provide a 
flag to whether the process or performance is within acceptable limits. If it is outside acceptable limits 
for the indicator, a problem has been identified. Further analysis may be required to better define the 
problem. Possible program indicators for quality improvement might include the number of repairs 
completed within 24 or 48 hours, the number of callbacks for repairs, the number of repair problems 
caused by user error, the percentage of hazard notifications reviewed and acted on within a given time 
frame, meeting time targets for generating specification, evaluation or acceptance of new equipment, etc. 
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An example might be a weekly status update of the percentage of scheduled IPMs completed. Assume 
that the department has implemented a process in which a group of scheduled IPMs must be completed 
within 8 weeks. The expectation is that 12% of the scheduled IPMs will be completed each week. The 
indicator is the percentage of IPMs completed. The threshold value of the indicator is 12% per week 
increase in the percentage of IPMs completed. To monitor this, the number of IPMs that were completed 
must be tallied, divided by the total number scheduled, and multiplied by 100 to determine the percentage 
completed. If the number of completed IPMs is less than projected, then further analysis would be 
required to identify the source of the problem and determine solutions to correct it. If the percentage of 
completed IPMs were equal to or greater than the threshold or target, then no action would be required. 

External Comparisons 

Much important and useful information can be obtained be carefully comparing one clinical engineering 
program with others. This type of comparison is highly valued by most hospital administrators. It can 
be helpful in determining performance relative to competitors. External indicators or benchmarks can 
identify specific areas of activity in need of improvement. They offer insights when consideration is being 
given to expanding into new areas of support. Great care must be taken when comparing services provided 
by clinical engineering departments located in different facilities. There are number of factors that must 
be included in making such comparisons; otherwise, the results can be misleading or misinterpreted. It 
is important that the definition of the specific indicators used be well understood, and great care must 
be taken to ensure that the comparison utilizes comparable information before interpreting the compar- 
isons. Failure to understand the details and nature of the comparison and just using the numbers directly 
will likely result in inappropriate actions by managers and administrators. The process of analysis and 
explanation of differences in benchmark values between a clinical engineering department and a com- 
petitor (often referred to as gap analysis) can lead to increased insight into department operations and 
target areas for improvements. 

Possible external indicators could be the labor cost per hour, the labor cost per repair, the total cost 
per repair, the cost per bed supported, the number of devices per bed supported, percentage of time 
devoted to repairs versus IPMs versus consultation, cost of support as a percentage of the acquisition 
value of capital inventory, etc. 

170.2 Standard Database 

In God we trust... all others bring data! 

Florida Power and Light 

Evaluation of indicators requires the collection, storage, and analysis of data from which the indicators 
can be derived. A standard set of data elements must be defined. Fortunately, one only has to look at 
commercially available equipment management systems to determine the most common data elements 
used. Indeed, most of the high-end software systems have more data elements than many clinical 
engineering departments are willing to collect. These standard data elements must be carefully defined 
and understood. This is especially important if the data will later be used for comparisons with other 
organizations. Different departments often have different definitions for the same data element. It is 
crucial that the data collected be accurate and complete. The members of the clinical engineering 
department must be trained to properly gather, document, and enter the data into the database. It makes 
no conceptual difference if the database is maintained on paper or using computers. Computers and 
their databases are ubiquitous and so much easier to use that usually more data elements are collected 
when computerized systems are used. The effort required for analysis is less and the level of sophistication 
of the analytical tools that can be used is higher with computerized systems. 

The clinical engineering department must consistently gather and enter data into the database. The 
database becomes the practical definition of the services and work performed by the department. This 
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standardized database allows rapid, retrospective analysis of the data to determine specific indicators 
identifying problems and assist in developing solutions for implementation. A minimum database should 
allow the gathering and storage of the following data: 

In-House Labor. This consists of three elements: the number of hours spent providing a particular service, 
the associated labor rate, and the identity of the individual providing the service. The labor cost is not 
the hourly rate the technician is paid multiplied by the number of hours spent performing the service. 
It should include the associated indirect costs, such as benefits, space, utilities, test equipment, and tools, 
along with training, administrative overhead, and many other hidden costs. A simple, straightforward 
approach to determine an hourly labor rate for a department is to take the total budget of the department 
and subtract parts' costs, service contract costs, and amounts paid to outside vendors. Divide the resulting 
amount by the total hours spent providing services as determined from the database. This will provide 
an average hourly rate for the department. 

Vendor Labor. This should include hours spent and rate, travel, and zone charges and any perdiem costs 
associated with the vendor supplied service. 

Parts. Complete information on parts is important for any retrospective study of services provided. This 
information is similar for both in-house and vendor-provided service. It should include the part number, 
a description of the part, and its cost, including any shipping. 

Timeless. It is important to include a number of time stamps in the data. These should include the date 
the request was received, data assigned, and date completed. 

Problem Identification. Both a code for rapid computer searching and classification and a free text 
comment identifying the nature of the problem and description of service provided are important. The 
number of codes should be kept to as few as possible. Detailed classification schemes usually end up with 
significant inaccuracies due to differing interpretations of the fine gradations in classifications. 

Equipment Identification. Developing an accurate equipment history depends on reliable means of 
identifying the equipment. This usually includes a department- and/or facility- assigned unique identifi- 
cation number as well as the manufacturer, vendor, model, and serial number. Identification numbers 
of provided by asset management are often inadequate to allow tracking of interchangeable modules or 
important items with a value less than a given amount. Acquisition cost is a useful data element. 

Service Requester. The database should include elements allowing identification of the department, 
person, telephone number, cost center, and location of the service requester. 

170.3 Measurement Indicators 

Clinical engineering departments must gather objective, quantifiable data in order to assess ongoing 
performance, identify new quality-improvement opportunities, and monitor the effect of improvement 
action plans. Since resources are limited and everything cannot be measured, certain selection criteria 
must be implemented to identify the most significant opportunities for indicators. High-volume, high- 
risk, or problem-prone processes require frequent monitoring of indicators. A new indicator may be 
developed after analysis of ongoing measurements or feedback from other processes. Customer feedback 
and surveys often can provide information leading to the development of new indicators. Department 
management, in consultation with the quality-management department, typically determines what indica- 
tors will be monitored on an ongoing basis. The indicators and resulting analysis are fed back to individuals 
and work teams for review and improvement of their daily work activities. Teams may develop new 
indicators during their analysis and implementation of solutions to quality-improvement opportunities. 

An indicator is an objective, quantitative measurement of an outcome or process that relates to 
performance quality. The event being assessed can be either desirable or undesirable. It is objective in 
that the same measurement can be obtained by different observers. This indicator represents quantitative, 
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measured data that are gathered for further analysis. Indicators can assess many different aspects of 
quality, including accessibility, appropriateness, continuity, customer satisfaction, effectiveness, efficacy, 
efficiency, safety, and timeliness. 

A program indicator has attributes that determine its utility as a performance measure. The reliability 
and variability of the indicator are distinct but related characteristics. An indicator is reliable if the same 
measurement can be obtained by different observers. A valid indicator is one that can identify opportunities 
for quality improvement. As indicators evolve, their reliability and validity should improve to the highest 
level possible. 

An indicator can specify a part of a process to be measured or the outcome of that process. An outcome 
indicator assesses the results of a process. Examples include the percentage of uncompleted, scheduled 
IPMs, or the number of uncompleted equipment repairs not completed within 30 days. A process 
indicator assesses an important and discrete activity that is carried out during the process. An example 
would be the number of anesthesia machines in which the scheduled IPM failed or the number of 
equipment repairs awaiting parts that are uncompleted within 30 days. 

Indicators also can be classified as sentinel event indicators and aggregate data indicators. A perfor- 
mance measurement of an individual event that triggers further analysis is called a sentinel-event indicator. 
These are often undesirable events that do not occur often. These are often related to safety issues and 
do not lend themselves easily to quality-improvement opportunities. An example may include equipment 
failures that result in a patient injury. 

An aggregate data indicator is a performance measurement based on collecting data involving many 
events. These events occur frequently and can be presented as a continuous variable indicator or as rate- 
based indicators. A continuous variable indicator is a measurement where the value can fall anywhere 
along a continuous scale. Examples could be the number of IPMs scheduled during a particular month 
or the number of repair requests received during a week. A rate-based variable indicator is the value of 
a measurement that is expressed as a proportion or a ratio. Examples could be the percentage of IPMs 
completed each month or the percentage of repairs completed within one workday. 

General indicators should be developed to provide a baseline monitoring of the department's perfor- 
mance. They also should provide a cross-check for other indicators. These indicators can be developed 
to respond to a perceived need within a department or to solve a specific problem. 

170.4 Indicator Management Process 

The process to develop, monitor, analyze and manage indicators is shown in Fig. 170.1. The different 
steps in this process include defining the indicator, establishing the threshold, monitoring the indicator, 
evaluating the indicator, identifying quality-improvement opportunities, and implementing action plans. 

Define Indicator. The definition of the indicator to be monitored must be carefully developed. This 
process includes at least five steps. The event or outcome to be measured must be described. Define any 
specific terms that are used. Categorize the indicator (sentinel event or rate-based, process or outcome, 
desirable or undesirable). The purpose for this indicator must be defined, as well as how it is used in 
specifying and assessing the particular process or outcome. 

Establish Threshold. A threshold is a specific data point that identifies the need for the department to 
respond to the indicator to determine why the threshold was reached. Sentinel-event indicator thresholds 
are set at zero. Rate indicator thresholds are more complex to define because they may require expert 
consensus or definition of the department's objectives. Thresholds must be identified, including the 
process used to set the specific level. 

Monitor Indicator. Once the indicator is defined, the data-acquisition process identifies the data sources 
and data elements. As these data are gathered, they must be validated for accuracy and completeness. 
Multiple indicators can be used for data validation and cross-checking. The use of a computerized 
database allows rapid access to the data. A database management tool allows quick sorting and organization 
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of the data. Once gathered, the data must be presented in 
a format suitable for evaluation. Graphic presentation of 
data allows rapid visual analysis for thresholds, trends, 
and patterns. 

Evaluate Indicator. The evaluation process analyze and 
reports the information. This process includes comparing 
the information with established thresholds and analyzing 
for any trends or patterns. A trend is the general direction 
the indicator measurement takes over a period of time 
and may be desirable or undesirable. A pattern is a group- 
ing or distribution of indicator measurements. A pattern 
analysis is often triggered when thresholds are crossed or 
trends identified. Additional indicator information is 
often required. If an indictor threshold has not been 
reached, no further action may be necessary, other than 
continuing to monitor this indicator. The department 
also may decide to improve its performance level by 
changing the threshold. 

Factors may be present leading to variation of the indi- 
cator data. These factors may include failure of the tech- 
nology to perform properly, failure of the operators to use 
the technology properly, and failure of the organization 
to provide the necessary resources to implement this tech- 
nology properly. Further analysis of these factors may lead 
to quality-improvement activities later. 



DEFINE INDICATOR 



ESTABLISH THRESHOLD 



MONITOR INDICATOR 



EVALUATE INDICATOR 




No 



IDENTIFY QUALITY 
IMPROVEMENT 
OPPORTUNITY 



IMPLEMENT ACTION PLAN 




FIGURE 170.1 Indicator management process. 



Identify Quality-Improvement Opportunity. A quality- 
improvement opportunity may present itself if an indica- 
tor threshold is reached, a trend is identified, or a pattern 
is recognized. Additional information is then needed to 
further define the process and improvement opportuni- 
ties. The first step in the process is to identify a team. This 

team must be given the necessary resources to complete this project, a timetable to be followed, and an 
opportunity to periodically update management on the status of the project. The initial phase of the 
project will analyze the process and establish the scope and definition of the problem. Once the problem 
is defined, possible solutions can be identified and analyzed for potential implementation. A specific 
solution to the problem is then selected. The solution may include modifying existing indictors or 
thresholds to more appropriate values, modifying steps to improve existing processes, or establishing 
new goals for the department. 

Implement Action Plan. An action plan is necessary to identify how the quality- improvement solution 
will be implemented. This includes defining the different tasks to be performed, the order in which they 
will be addressed, who will perform each task, and how this improvement will be monitored. Appropriate 
resources must again be identified and a timetable developed prior to implementation. Once the action 
plan is implemented, the indicators are monitored and evaluated to verify appropriate changes in the 
process. New indicators and thresholds may need to be developed to monitor the solution. 

170.5 Indicator Example 1: Productivity Monitors 

Defines Indicators. Monitor the productivity of technical personnel, teams, and the department. Pro- 
ductivity is defined as the total number of documented service support hours compared with the total 
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number of hours available. This is a desirable rate-based outcome indicator. Provide feedback to technical 
staff and hospital administration regarding utilization of available time for department support activities. 

Establish Thresholds. At least 50% of available technician time will be spent providing equipment 
maintenance support services (revolving equipment problems and scheduled IPMs). At least 25% of 
available technician time will be spent providing equipment management support services (installations, 
acceptance testing, incoming inspections, equipment inventory database management, hazard notifica- 
tion review). 

Monitor Indicator. Data will be gathered every 4 weeks from the equipment work order history database. 
A trend analysis will be performed with data available from previously monitored 4-week intervals. These 
data will consist of hours worked on completed and uncompleted jobs during the past 4-week interval. 
Technical staff available hours is calculated for the 4-week interval. The base time available is 160 hours 
(40 hours/week x 4 week) per individual. Add to this any overtime worked during the interval. Then 
subtract any holidays, sick days, and vacation days within the interval. 

CJHOURS: Hours worked on completed jobs during the interval 
UJHOURS: Hours worked on uncompleted jobs during the interval 
AHOURS: Total hours available during the 4-week interval 
Productivity = (CJHOURS + UJHOURS)/AHOURS 

Evaluate Indicator. The indicator will be compared with the threshold, and the information will be 
provided to the individual. The individual team member data can be summed for team review. The data 
from multiple teams can be summed and reviewed by the department. Historical indicator information 
will be utilized to determine trends and patterns. 

Quality-Improvement Process. If the threshold is not met, a trend is identified, or a pattern is observed, 
a quality-improvement opportunity exists. A team could be formed to review the indicator, examine the 
process that the indicator measured, define the problem encountered, identify ways to solve the problem, 
and select a solution. An action plan will then be developed to implement this solution. 

Implement Action Plan. During implementation of the action plan, appropriate indicators will be used 
to monitored the effectiveness of the action plan. 

170.6 Indicator Example 2: Patient Monitors IPM 
Completion Time 

Define Indicator. Compare the mean to complete an IPM for different models of patient monitors. 
Different manufacturers of patient monitors have different IPM requirements. Identify the most timely 
process to support this equipment. 

Establish Threshold. The difference between the mean time to complete an IMP for different models of 
patient monitors will not be greater than 30% of the lessor time. 

Monitor Indicator. Determine the mean time to complete an IPM for each model of patient monitor. 
Calculate the percentage difference between the mean time for each model and the model with the least 
mean time. 

Evaluate Indicator. The mean time to complete IPMs was compared between the patient monitors, and 
the maximum difference noted was 46%. A pattern also was identified in which all IPMs for that one 
particular monitor averaged 15 minutes longer than those of other vendors. 

Quality-Improvement Process. A team was formed to address this problem. Analysis of individual IPM 
procedures revealed that manufacturer X requires the case to be removed to access internal filters. 
Performing an IPM for each monitor required moving and replacing 15 screws for each of the 46 monitors. 
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The team evaluated this process and identified that 5 minutes could be saved from each IPM if an electric 
screwdriver was utilized. 

Implement Action Plan. Electric screwdrivers were purchased and provided for use by the technician. 
The completion of one IPM cycle for the 46 monitors would pay for two electric screwdrivers and provide 
4 hours of productive time for additional work. Actual savings were greater because this equipment could 
be used in the course of daily work. 

170.7 Summary 

In the ever-changing world of health care, clinical engineering departments are frequently being evaluated 
based on their contribution to the corporate bottom line. For many departments, this will require difficult 
and painful changes in management philosophy. Administrators are demanding quantitative measures 
of performance and value. To provide the appropriate quantitative documentation required by corporate 
managers, a clinical engineering a manager must collect available data that are reliable and accurate. 
Without such data, analysis is valueless. Indicators are the first step in reducing the data to meaningful 
information that can be easily monitored and analyzed. The indicators can then be used to determine 
department performance and identify opportunities for quality improvement. 

Program indicators have been used for many years. What must change for clinical engineering depart- 
ments is a conscious evaluation and systematic use of indicators. One traditional indicator of clinical 
engineering department success is whether the department's budget is approved or not. Unfortunately, 
approval of the budge as an indicator, while valuable, does not address the issue of predicting long-term 
survival, measuring program and quality improvements, or allowing frequent evaluation and changes. 

There should be monitored indicators for every significant operational aspect of the department. 
Common areas where program indicators can be applied include monitoring interval department activ- 
ities, quality-improvement processes, and benchmarking. Initially, simple indicators should be developed. 
The complexity and number of indicators should change as experience and needs demand. 

The use of program indicators is absolutely essential if a clinical engineering departments is to survive. 
Program and survival are now determined by the contribution of the department to the bottom line of 
the parent organization. Indicators must be developed and utilized to determine the current contribution 
of the clinical engineering department to the organization. Effective utilization and management of 
program indicators will ensure future department contributions. 
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In today's complex health care environment, quality improvement and team building must go hand in 
hand. This is especially true for Clinical Engineers and Biomedical Equipment Technicians as the diversity 
of the field increases and technology moves so rapidly that no one can know all that needs to be known 
without the help of others. Therefore, it is important that we work together to ensure quality improve- 
ment. Ken Blachard, the author of the One Minute Manager series, has made the statement that "all of 
us are smarter than any one of us" — a synergy that evolves from working together. Throughout this 
chapter we will look closely at defining quality and the methods for continuously improving quality, such 
as collecting data, interpreting indicators, and team building. All this will be put together, enabling us 
to make decisions based on scientific deciphering of indicators. 

Quality is defined as conformance to customer or user requirements. If a product or service does what 
it is supposed to do, it is said to have high quality. If the product or service fails its mission, it is said to 
be low quality. Dr. W. Edward Demings, who is known to many as the "father of quality," defined it as 
surpassing customer needs and expectations throughout the life of the product or service. 

Dr. Demings, a trained statistician by profession, formed his theories on quality during World War II 
while teaching industry how to use statistical methods to improve the quality of military production. 
After the war, he focused on meeting customer or consumer needs and acted as a consultant to lapanese 
organizations to change consumers' perceptions that "Made in Japan" meant junk. Dr. Demings predicted 
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that people would be demanding Japanese products in just 5 years, if they used his methods. However, 
it only took 4, and the rest is history. 

171.1 Dealing's 14 Points 

1. Create constancy of purpose toward improvement of product and service, with an aim to become 
competitive and to stay in business and provide jobs. 

2. Adopt the new philosophy. We are in a new economic age. Western management must awaken 
and lead for change. 

3. Cease dependence on inspection to achieve quality. Eliminate the needs for mass inspection by 
first building in quality. 

4. Improve constantly and forever the system of production and service to improve quality and 
productivity and thus constantly decrease costs. 

5. Institute training on the job. 

6. Institute leadership: The goal is to help people, machines, and gadgets to do a better job. 

7. Drive out fear so that everyone may work effectively for the organization. 

8. Break down barriers between departments 

9. Eliminate slogans, exhortations, and targets for the workforce. 

10. Eliminate work standards (quota) on the factory floor. 

11. Substitute leadership: Eliminate management by objective, by numbers, and numerical goals. 

12. Remove barriers that rob the hourly worker of the right to pride of workmanship. 

13. Institute a vigorous program of education and self-improvement. 

14. Encourage everyone in the company to work toward accomplishing transformation. Transforma- 
tion is everyone's job. 

171.2 Zero Defects 

Another well-known quality theory, called zero defects {ZD), was establishing by Philip Crosby. It got 
results for a variety of reasons. The main reasons are as follows: 

1. A strict and specific management standard. Management, including the supervisory staff, do not 
use vague phrases to explain what it wants. It made the quality standard very clear: Do it the right 
way from the start. As Philip Crosby said, "What standard would you set on how many babies 
nurses are allowed to drop?" 

2. Complete commitment everyone. Interestingly, Crosby denies that ZD was a motivational pro- 
gram. But ZD worked because everyone got deeply into the act. Everyone was encouraged to spot 
problems, detect errors, and prescribe ways and means for their removal. This commitment is 
best illustrated by the ZD pledge: "I freely pledge myself to make a constant, conscious effort to 
do my job right the first time, recognizing that my individual contribution is a vital part of the 
overall effort." 

3. Removal of actions and conditions that cause errors. Philip Crosby claimed that at ITT, where he 
was vice-president for quality, 90% of all error causes could be acted on and fully removed by 
first-line supervision. In other words, top management must do its part to improve conditions, 
but supervisors and employees should handle problems directly. Errors, malfunctions, and/or 
variances can best be corrected where the rubber hits the road — at the source. 

171.3 TQM (Total Quality Management) 

The most recent quality theory that has found fame is called TQM (Total Quality Management). It is a 
strategic, integrated management system for achieving customer satisfaction which involves all managers 
and employees and uses quantitative methods to continuously improve an organization's processes. Total 
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Quality Management is a term coined in 1985 by the Naval Air Systems Command to describe its 
management approach to quality improvement. Simply put, Total Quality Management is a management 
approach to long-term success through customer satisfaction. Total Quality Management includes the 
following three principles: (1) achieving customer satisfaction, (2) making continuous improvement, 
and (3) giving everyone responsibility. TQM includes eight practices. These practices are (1) focus on 
the customer, (2) effective and renewed communications, (3) reliance on standards and measures, (4) 
commitment to training, (5) top management support and direction, (6) employee involvement, (7) 
rewards and recognition, and (8) long-term commitment. 

171.4 CQI (Continuous Quality Improvement) 

Step 8 of the total quality management practices leads us to the quality concept coined by the Joint 
Commission On Accreditation of Healthcare Organizations and widely used by most health care agencies. 
It is called CQI (Continuous Quality Management). The principles of CQI are as follows: 

Unity of Purpose 

• Unity is established throughout the organization with a clear and widely understood vision. 

• Environment nurtures total commitment from all employees. 

• Rewards go beyond benefits and salaries to the belief that "We are family" and "We do excellent 
work'' 

Looking for Faults in the Systems 

• Eighty percent of an organization's failures are the fault of management-controlled systems. 

• Workers can control fewer than 20% of the problems. (£\ 

• Focus on rigorous improvement of every system, and cease blaming individuals for problems (the 
80/20 rule of J.M. Juran and the nineteenth-century economist Vilfredo Pareto). 

Customer Focus 

• Start with the customer. 

• The goal is to meet or exceed customer needs and give lasting value to the customer. 

• Positive returns will follow as customers boast of the company's quality and service. 

Obsession with Quality 

• Everyone's job. 

• Quality is relentlessly pursued through products and services that delight the customer. 

• Efficient and effective methods of execution. 

Recognizing the Structure in Work 

• All work has structure. 

• Structure may be hidden behind workflow inefficiency. 

• Structure can be studied, measure, analyzed, and improved. 

Freedom Through Control 

• There is control, yet freedom exists by eliminating micromanagement. 

• Employees standardize processes and communicate the benefits of standardization. 

• Employees reduce variation in the way work is done. 

• Freedom comes as changes occur resulting in time to spend on developing improved processes, 
discovering new markets, and adding other methods to increase productivity. 
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Continued Education and Training 

• Everyone is constantly learning. 

■ Educational opportunities are made available to employees. 

• Greater job mastery is gained and capabilities are broadened. 

Philosophical Issues on Training 

• Training must stay tuned to current technology. 

• Funding must be made available to ensure that proper training can be attained. 

• Test, measurement, and diagnostic equipment germane to the mission must be procured and 
technicians trained on its proper use, calibration, and service. 

• Creativity must be used to obtain training when funding is scarce. 
— Include training in equipment procurement process. 

— Contact manufacturer or education facility to bring training to the institution. 

— Use local facilities to acquire training, thus eliminating travel cost. 

— Allow employees to attend professional seminars where a multitude of training is available. 

Teamwork 

• Old rivalries and distrust are eliminated. 

• Barriers are overcome. 

• Teamwork, commitment to the team concept and partnerships are the focus. 

• Employee empowerment is critical in the CQI philosophy and means that employees have the 
authority to make well-reasoned, data-based decisions. In essence, they are entrusted with the 
legal power to change processes through a rational, scientific approach. 

Continuous quality improvement is a means for tapping knowledge and creativity, applying partici- 
pative problem solving, finding and eliminating problems that prevent quality, eliminating waste, instill- 
ing pride, and increasing teamwork. Further it is a means for creating an atmosphere of innovation for 
continued and permanent quality improvement. Continuous quality improvement as outlined by the 
Joint Commission on Accreditation of Healthcare Organizations is designed to improve the work pro- 
cesses within and across organizations. 

171.5 Tools Used for Quality Improvement 

The tools listed on the following pages will assist in developing quality programs, collecting data, and 
assessing performance indicators within the organization. These tools include several of the most fre- 
quently used and most of the seven tools of quality. The seven tools of quality are tools that help health 
care organizations understand their processes in order to improve them. The tools are the cause-and- 
effect diagram, check sheet, control chart, flowchart, histogram, Pareto chart, and scatter diagram. 
Additional tools shown are the Shewhart cycle (PDCA process) and the bar chart. The Clinical Engi- 
neering Manager must access the situation and determine which tool will work best for his/her situational 
needs. 

Two of the seven tools of quality discussed above are not illustrated. These are the scatter diagram and 
the check sheet. The scatter diagram is a graphic technique to analyze the relationship between two 
variations and the check sheet is simple data-recording device. The check sheet is custom designed by 
the user, which facilitates interpretation of the results. Most Biomedical Equipment Technicians use the 
check sheet on a daily basis when performing preventive maintenance, calibration, or electrical safety 
checks. 
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FIGURE 171.1 Cause-and-effect or Ishikawa chart. 

Cause-and-Effect or Ishikawa Chart 

This is a tool for analyzing process dispersion (Fig. 171.1). The process was developed by Dr. Karou 
Ishikawa and is also known as the fishbone diagram because the diagram resembles a fish skeleton. The 
diagram illustrates the main causes and subcauses leading to an effect. The cause-and-effect diagram is 
one of the seven tools of quality. 

The following is an overview of the process: 

• Used in group problem solving as a brainstorming tool to explore and display the possible causes 
of a particular problem. 

• The effect (problem, concern, or opportunity) that is being investigated is stated on the right side, 
while the contributing causes are grouped in component categories through group brainstorming 
on the left side. 

■ This is an extremely effective tool for focusing a group brainstorming session. 

■ Basic components include environment, methods (measurement), people, money information, 
materials, supplies, capital equipment, and intangibles. 

Control Chart 

A control chart is a graphic representation of a characteristic of a process showing plotted values of some 
statistic gathered from that characteristic and one or two control limits (Fig. 171.2). It has two basic uses: 
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FIGURE 171.2 Control chart. 
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• as a judgment to determine if the process is in control 

• as an aid in achieving and maintaining statistical control 

(This chart was used by Dr. W. A. Shewhart for a continuing test of statistical significance.) A control 
chart is a chart with a baseline, frequently in time order, on which measurement or counts are represented 
by points that are connected by a straight line with an upper and lower limit. The control chart is one 
of the seven tools of quality. 

Flowchart 

A flowchart is a pictorial representation showing all the steps 
of a process (Fig. 171.3). Flowcharts provide excellent doc- 
umentation of a program and can be a useful tool for exam- 
ining how various steps in a process are related to each other. 
Flowcharting uses easily recognizable symbols to represent 
the type of processing performed. The flowchart is one of 
the seven tools of quality. 
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Histogram 

A graphic summary of variation in a set of data is a histogram 
(Fig. 171.4). The pictorial nature of the histogram lets people 
see patterns that are difficult to see in a simple table of 
numbers. The histogram is one of the seven tools of quality. 

Pareto Chart 

A Pareto chart is a special form of vertical bar graph that 
helps us to determine which problems to solve and in what 
order (Fig. 171.5). It is based on the Pareto principle, which 
was first developed by J. M. Juran in 1950. The principle, 
named after the nineteenth-century economist Vilfredo 
Pareto, suggests that most effects come from relatively few 
causes; i.e., 80% of the effects come from 20% of the possible causes. 

Doing a Pareto chart, based on either check sheets or other forms of data collection, helps us direct 
our attention and efforts to truly important problems. We will generally gain more by working on the 
tallest bar than tackling the smaller bars. The Pareto chart is one of the seven tools of quality. 
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FIGURE 171.3 Flowchart. 



Biomedical JEquipment Technicians 
Productivity Chart 



I C antral Linrnt ' : 




nssa 



Jin Feb Mar Apr May Jm*e Jul* A* I! Srp Ocl Nov 



FIGURE 171.4 Histogram. 
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Preventive Maintenance 

Notice that the \ot Located bar clearly indicates a 
serious Preventive Maintenance service problem. 
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FIGURE 171.5 Pareto chart. 



i Step 1 - Plan (P): Collect data upon which a plan can be 
constructed. 

Step 2 - Do (D): Take the necessary actions that further 
the developed plan. 

Step 3 - Check (C): Check the results of our actions by 
collecting data to make sure that we have achieved what 
we planned. 

Step 4 - Act (A): Act by making necessary changes to 
achieve Customer Satisfaction. 
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FIGURE 171.6 The Shewhart cycle. 



The Plan-Do-Check-Act or Shewhart Cycle 

This is a four-step process for quality improvement that is sometimes referred to as the Deming cycle 
(Fig. 171.6). One of the consistent requirements of the cycle is the long-term commitment required. The 
Shewart cycle or PDCA cycle is outlined here and has had overwhelming success when used properly. It 
is also a very handy tool to use in understanding the quality cycle process. The results of the cycle are 
studied to determine what was learned, what can be predicted, and appropriate changes to be implemented. 

171.6 Quality Performance Indicators (QPI) 

An indicator is something that suggests the existence of a fact, condition, or quality — an omen (a sign 
of future good or evil.) It can be considered as evidence of a manifestation or symptom of an incipient 
failure or problem. Therefore, quality performance indicators are measurements that can be used to 
ensure that quality performance is continuous and will allow us to know when incipient failures are 
starting so that we may take corrective and preventive actions. 
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QPI analysis is a five-step process: 

Step 1: Decide what performance we need to track. 

Step 2: Decide the data that need to be collected to track this performance. 

Step 3: Collect the data. 

Step 4: Establish limits, a parameter, or control points. 

Step 5: Utilize BME (management by exception) — where a performance exceeds the established con- 
trol limits, it is indicating a quality performance failure, and corrective action must be taken 
to correct the problem. 

In the preceding section, there were several examples of QPIs. In the Pareto chart, the NL = not located, 
IU = in use, IR = in repair. The chart indicates that during the year 1994, 35% of the equipment could 
not be located to perform preventive maintenance services. This indicator tells us that we could eventually 
have a serious safety problem that could impact on patient care, and if not corrected, it could prevent 
the health care facility from meeting accreditation requirements. In the control chart example, an upper 
control limit of 6% "not located equipment" is established as acceptable in any one month. However, 
this upper control limit is exceeded during the months of April and October. This QPI could assist the 
clinical and/or Biomedical Equipment Manager in narrowing the problem down to a 2-month period. 
The histogram example established a lower control limit for productivity at 93%. However, productivity 
started to drop off in May, June, and July. This QPI tells the manager that something has happened that 
is jeopardizing the performance of his or her organization. Other performance indicators have been 
established graphically in Figs 171.7 and 171.8. See if you can determine what the indicators are and 
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FIGURE 171.7 Sample repair service report. 
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FIGURE 171.8 Customer satisfaction survey. August-September 1994. 
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what the possible cause might be. You may wish to use these tools to establish QPI tracking germane to 
your own organization. 

171.7 Teams 

A team is a formal group of persons organized by the company to work together to accomplish certain 
goals and objectives. Normally, when teams are used in quality improvement programs, they are designed 
to achieve the organization's vision. The organization's vision is a statement of the desired end state of 
the organization articulated and deployed by the executive leadership. Organizational visions are inspir- 
ing, clear, challenging, reasonable, and empowering. Effective visions honor the past, while they prepare 
for the future. The following are types of teams that are being used in health care facilities today. Some 
of the names may not be common, but their definitions are very similar if not commensurate. 

Process Action Teams (PAT) 

Process action teams are composed of those who are involved in the process being investigated. The 
members of a PAT are often chosen by their respective managers. The primary consideration for PAT 
membership is knowledge about the operations of the organization and consequently the process being 
studied. The main function of a PAT is the performance of an improvement project. Hence customers 
are often invited to participate on the team. PATs use basic statistical and other tools to analyze a process 
and identify potential areas for improvement. PATs report their findings to an Executive Steering Com- 
mittee or some other type of quality management improving group ("A problem well defined is half solved" 
John Dewey, American philosopher and educator; 1859-1952.) 

Transition Management Team (TMT) 

The transition management team (see Harvard Business Review, Nov-Dec 1993, pp 109-1 18) is normally 
used for a major organizational change such as restructuring or reengineering. The TMT can be initiated 
due to the findings of a PAT, where it has been indicated that the process is severely broken and 
unsalvageable. The TMT is not a new layer of bureaucracy or a job for fading executives. The TMT 
oversees the large-scale corporate change effort. It makes sure that all change initiatives fit together. It is 
made up of 8 to 12 highly talented leaders who commit all their time making the transition a reality. 
The team members and what they are trying to accomplish must be accepted by the power structure of 
the organization. For the duration of the change process, they are the CEO's version of the National 
Guard. The CEO should be able to say, "I can sleep well tonight, the TMT is managing this." In setting 
up a TMT, organizations should adopt a fail-safe approach: Create a position to oversee the emotional 
and behavioral issues unless you can prove with confidence that you do not need one. 

Quality Improvement Project Team (QIPT) 

A quality improvement project team can be initiated due to the findings of a PAT, where it has been 
indicated that the process is broken. The main agenda of the QIPT is to improve the work process that 
managers have identified as important to change. The team studies this process methodically to find 
permanent solutions to problems. To do this, members can use many of the tools described in this chapter 
and in many other publications on quality and quality improvement available from schools, bookstores, 
and private organizations. 

Executive Steering Committee (ESC) 

This is an executive-level team composed of the Chief Executive Officer (CEO) of the organization and 
the executive staff that reports directly to the CEO. Whereas an organization may have numerous QMBs, 
PATs, and QIPTs, it has only one ESC. The ESC identifies strategic goals for organizational quality 
improvement efforts. It obtains information from customers to identify major product and service 
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requirements. It is through the identification of these major requirements that quality goals for the 
organization are defined. Using this information, the ESC lists, prioritizes, and determines how to measure 
the organization's goals for quality improvement. The ESC develops the organization's improvement plan 
and manages the execution of that plan to ensure that improvement goals are achieved. 

Quality Management Board (QMB) 

This is a permanent cross-functional team made up of top and midlevel managers who are jointly responsible 
for a specific product, service or process. The structure of the board intended to improve communication 
and cooperation by providing vertical and horizontal "links" throughout the organization. 

171.8 Process Improvement Model 

This process is following the Joint Commission on the Accreditation of Healthcare Organizations' Quality 
Cube, a method of assessing the quality of the organization. 

PLAN: 

a. Identify the process to be monitored. 

b. Select important functions and dimensions of performance applicable to the process identified. 

c. Design a tool for collection of data. 

MEASURE (Under this heading, you will document how, when, and where data was collected): 

a. Collect data 

b. Select the appropriate tool to deliver your data (charts, graphs, tables, etc.) 

ASSESS (Document findings under this heading): 

a. Interpret data collected. 

b. Design and implement change. 

(1) Redesign the process or tool if necessary. 

(2) If no changes are necessary, then you have successfully used the Process Improvement pathway. 

IMPROVEMENT (Document details here): 

a. Set in place the process to gain and continue the improvement. 
OUTCOME (Document all changes here): 

a. Positive changes made to improve quality of care based on Performance Improvement Activity. 

Problem-Solving Model 

The FOCUS-PDCA/PMAIO Process Improvement Model is a statistical-based quality control method 
for improving processes. This approach to problem-solving could be used by all Process Action Teams 
to ensure uniformity within an organization. FOCUS-PDCA/PMAIO is as follows: 

F — Find a process to improve. 
O — Organize a team that knows the process. 
C — Clarify current knowledge of the process. 
U — Understand the cause or variations. 
S — Select the process to improve 

P — Plan the improvement. P — Plan 

D — Do the improvement (pilot test). M — Measure 

C — Check the results of the improvement. A — Assess 
A — Act to hold the gain. I — Improve 

O — Outcome 
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171.9 Summary 

Although quality can be simply defined as conformity to customer or user requirements, it has many 
dimensions. Seven of them are described here: (1) performance, (2) aesthetics, (3) reliability (how 
dependably it performs), (4) availability (there when you need it), (5) durability (how long it lasts), (6) 
extras or features (supplementary items), and (7) serviceability (how easy it is to get serviced). The word 
PARADES can help you remember this: 

PARADES Seven Dimensions of Quality 

1. Performance: A product or service that performs its intended function well scores high on this 
dimension of quality. 

2. Aesthetics: A product or service that has a favorable appearance, sound, taste, or smell is perceived 
to be of good quality. 

3. Reliability: Reliability, or dependability, is such an important part of product quality that quality- 
control engineers are sometimes referred to as reliability engineers. 

4. Availability: A product or service that is there when you need it. 

5. Durability: Durability can be defined as the amount of use one gets from a product before it no 
longer functions properly and replacement seems more feasible than constant repair. 

6. Extras: Feature or characteristics about a product or service that supplements its basic functioning 
(i.e., remote control dialing on a television). 

7. Serviceability: Speed, courtesy, competence, and ease of repair are all important quality factors. 

Quality Has a Monetary Value! 

Good quality often pays for itself, while poor quality is expensive in both measurable costs and hidden 
costs. The hidden costs include loss of goodwill, including loss of repeat business and badmouthing of 
the firm. High quality goods and services often carry a higher selling price than do those of low quality. 
This information is evidenced by several reports in the Wall Street Journal, Forbes Magazine, Money 
Magazine, Business Week Magazine, etc. A good example is the turnaround of Japanese product sales 
using quality methodologies outlined in The Deming Guide to Quality and Competitive Position, by 
Howard S. and Shelly J. Gitlow. As Dr. Demings has stated quality improvement must be continuousl 



Quality is never an accident; it has always the result of intelligent energy. 



John Ruskins, 1819-1900 
English art critic and historian 
Seven Lamps of Architecture 
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172.1 Introduction 



The development, understanding, and use of standards is an important component of a clinical engineer's 
activities. Whether involved in industry, a health care facility, governmental affairs, or commercial 
enterprise, one way or another, the clinical engineer will find that standards are a significant aspect of 
professional activities. With the increasing emphasis on health care cost containment and efficiency, 
coupled with the continued emphasis on patient outcome, standards must be viewed both as a mechanism 
to reduce expenses and as another mechanism to provide quality patient care. In any case, standards 
must be addressed in their own right, in terms of technical, economic, and legal implications. 

It is important for the clinical engineer to understand fully how standards are developed, how they 
are used, and most importantly, how they affect the entire spectrum of health related matters. Standards 
exist that address systems (protection of the electrical power distribution system from faults), individuals 
(means to reduce potential electric shock hazards), and protection of the environment (disposal of 
deleterious waste substances). 

From a larger perspective, standards have existed since biblical times. In the Book of Genesis [Chap. 6, 
ver. 14], Noah is given a construction standard by God, "Make thee an ark of gopher wood; rooms shalt 
thou make in the ark, and shalt pitch it within and without with pitch." Standards for weights and measures 
have played an important role in bringing together human societies through trade and commerce. The 
earliest record of a standard for length comes from ancient Egypt, in Dynasty IV (circa 3000 B.C.). This 
length was the royal cubit, 20.620 inches (52.379 cm), as used in construction of the Great Pyramid. 

The importance of standards to society is illustrated in the Magna Carta, presented by the English 
barons to King John in 1215 on the field at Runnymede. Article 35 states: 

"There shall be standard measures of wine, beer, and corn — the London quarter — throughout the 
whole of our kingdom, and a standard width of dyed, russet and halberject cloth — two ells within the 
selvedges; and there shall be standard weights also." 
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The principles of this article appear in the English Tower system for weight and capacity, set in 1266 
by the assize of Bread and Ale Act: 

"An English penny called a sterling, round and without any clipping, shall weigh thirty- two wheatcorns 
in the midst of the ear; and twenty ounces a pound: and eight pounds do make a gallon of wine, and 
eight gallons of wine do make a bushell, which is the eighth part of a quarter." 

In the United States, a noteworthy use of standards occurred after the Boston fire of 1689. With the 
aim of rapid rebuilding of the city, the town fathers specified that all bricks used in construction were 
to be 9 x 4 x 4 inches. An example of standardization to promote uniformity in manufacturing practices 
was the contract for 10,000 muskets awarded to Eli Whitney by President Thomas Jefferson in 1800. The 
apocryphal story is that Eli Whitney (better known to generations of grammar school children for his 
invention of the cotton gin) assembled a large number of each musket part, had one of each part randomly 
selected, and then assembled a complete working musket. This method of production, the complete 
interchangeability of assembly parts, came to be known as the "armory method," replacing hand crafting, 
which at that time had been the prevailing method of manufacturing throughout the world. 

172.2 Definitions 

A most general definition of a standard is given by Rowe [1983]. 

"A standard is a multi-party agreement for establishing an arbitrary criterion for reference." 

Each word used in the definition by Rowe corresponds to a specific characteristic that helps to define 
the concept of a standard. 

Multi means more than one party, organization, group, government, agency, or individual. 

Agreement means that the concerned parties have come to some mutually agreed upon understanding 
of the issues involved and of ways to resolve them. This understanding has been confirmed via some 
mechanism such as unanimity, consensus, ballot, or other means that has been specified. 

Establishing defines the purpose of the agreement — to create the standard and carry forth its provisions. 

Arbitrary emphasizes an understanding by the parties that there are no absolute criteria in creating 
the standard. Rather, the conditions and values chosen are based on the most appropriate knowledge 
and conditions available at the time the standard was established. 

Criterions are those features and conditions that the parties to the agreement have chosen as the basis 
for the standard. Not all issues may be addressed, but only those deemed, for whatever reasons, suitable 
for inclusion. 

A different type of definition of a standard is given in The United States Office of Management and 
Budget Circular A-l 19: 

"... a prescribed set of rules, conditions, or requirements concerned with the definition of terms; 
classification of components; delineation of procedures; specifications of materials, performance, 
design, or operations; or measurement of quality and quantity in describing materials, products, 
systems, services, or practices." 

A code is a compilation of standards relating to a particular area of concern, i.e., a collection of 
standards. For example, local government health codes contain standards relating to providing of health 
care to members of the community. A regulation is an organization's way of specifying that some 
particular standard must be adhered to. Standards, codes, and regulations may or may not have legal 
implications, depending on whether the promulgating organization is governmental or private. 

172.3 Standards for Clinical Engineering 

There is a continually growing body of standards that affect health care facilities, and hence clinical 
engineering. The practitioner of health care technology must constantly search out, evaluate, and apply 
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appropriate standards. The means to reconcile the conflicts of technology, cost considerations, the 
different jurisdictions involved, and the implementation of the various standards is not necessarily 
apparent. One technique that addresses these concerns and has proven to yield a consistent practical 
approach is a structured framework of the various levels of standards. This hierarchy of standards is a 
conceptual model that the clinical engineer can use to evaluate and apply to the various requirements 
that exist in the procurement and use of health care technology. 

Standards have different purposes, depending on their particular applications. A hierarchy of standards 
can be used to delineate those conditions for which a particular standard applies. There are four basic 
categories, any one or all of which may be in simultaneous operation. (1) Local or proprietary standards 
(perhaps more properly called regulations) are developed to meet the internal needs of a particular 
organization. (2) Common interest standards serve to provide uniformity of product or service through- 
out an industry or profession. (3) Consensus standards are agreements amongst interested participants 
to address an area of mutual concern. (4) Regulatory standards are mandated by an authority having 
jurisdiction to define a particular aspect of concern. In addition, there are two categories of standards 
adherence: (1) voluntary standards, which carry no inherent power of enforcement, but provide a refer- 
ence point of mutual understanding, and (2) mandatory standards, which are incumbent upon those to 
whom the standard is addressed, and enforceable by the authority having jurisdiction. 

The hierarchy of standards model can aid the clinical engineer in the efficient and proper use of 
standards. More importantly, it can provide standards developers, users, and the authorities having 
jurisdiction in these matters with a structure by which standards can be effectively developed, recognized, 
and used to the mutual benefit of all. 

172.4 A Hierarchy of Standards 

Local, or proprietary standards, are developed for what might be called internal use. An organization that 
wishes to regulate and control certain of its own activities issues its own standards. Thus, the standard 
is local in the sense that it is applied in a specific venue, and it is proprietary in that it is the creation of a 
completely independent administration. For example, an organization may standardize on a single type of 
an electrocardiograph monitor. This standardization can refer to a specific brand or model, or to specific 
functional or operational features. In a more formal sense, a local standard may often be referred to as an 
institutional Policy and Procedure. The policy portion is the why of it; the procedure portion is the how. 
It must be kept in mind that standards of this type that are too restrictive will limit innovation and progress, 
in that they cannot readily adapt to novel conditions. On the other hand, good local standards contribute 
to lower costs, operational efficiency, and a sense of coherence within the organization. 

Sometimes, local standards may originate from requirements of a higher level of regulation. For 
example, the Joint Commission for Accreditation of Healthcare Organizations [JCAHO] (formerly the 
Joint Commission for Hospital Accreditation (JCAH), a voluntary organization, (but an organization 
that hospitals belong to for various reasons, e.g., accreditation, reimbursement, approval of training 
programs) does not set standards for what or how equipment should be used. Rather, the JCAHO requires 
that each hospital set its own standards on how equipment is selected, used, and maintained. To monitor 
compliance with this requirement, the JCAHO inspects whether the hospital follows its own standards. 
In one sense, the most damaging evidence that can be adduced against an organization (or an individual) 
is that it (he) did not follow its (his) own standards. 

Common interest standards are based on a need recognized by a group of interested parties, which will 
further their own interests, individually or collectively. Such standards are generally accepted by affected 
interests without being made mandatory by an authority; hence they are one type of voluntary standard. 
These standards are often developed by trade or professional organizations to promote uniformity in a 
product or process. This type of standard may have no inducement to adherence except for the benefits 
to the individual participants. For example, if you manufacture a kitchen cabinet that is not of standard 
size, it will not fit into the majority of kitchens and thus it will not sell. Uniformity of screw threads is 
another example of how a product can be manufactured and used by diverse parties, and yet be absolutely 
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interchangeable. More recently, various information transfer standards allow the interchange of com- 
puter-based information amongst different types of instruments and computers. 

Consensus standards are those that have been developed and accepted in accordance with certain well 
defined criteria so as to assure that all points of view have been considered. Sometimes, the adjective 
"consensus" is used as a modifier for a "voluntary standard." Used in this context, consensus implies that 
all interested parties have been consulted and have come to a general agreement on the provisions of the 
standard. The development of a consensus standard follows an almost ritualistic procedure to insure that 
fairness and due process are maintained. There are various independent voluntary and professional 
organizations that sponsor and develop standards on a consensus basis (see below). Each such organi- 
zation has its own particular rules and procedures to make sure that there is a true consensus in developing 
a standard. 

In the medical products field, standards are sometimes difficult to implement because of the indepen- 
dent nature of manufacturers and their high level of competition. A somewhat successful standards story 
is the adoption of the DIN configuration for ECG lead-cable connection by the Association for the 
Advancement of Medical Instrumentation [AAMI]. The impetus for this standard was the accidental 
electrocution of several children brought about by use of the previous industry standard lead connection 
(a bare metal pin, as opposed to the new recessed socket). Most (but not all) manufacturers of ECG leads 
and cables now adhere to this standard. Agreement on this matter is in sharp contrast to the inability of 
the health care manufacturing industry to implement a standard for ECG cable connectors. Even though 
a standard was written, the physical configuration of the connector is not necessarily used by manufac- 
turers in production, nor is it demanded by medical users in purchasing. Each manufacturer uses a 
different connector, leading to numerous problems in supply and incompatibility for users. This is an 
example of a voluntary standard, which for whatever reasons, is effectively ignored by all interested parties. 

However, even though there have been some failures in standardization of product features, there has 
also been significant progress in generating performance and test standards for medical devices. A number 
of independent organizations sponsor development of standards for medical devices. For example, the 
American Society for Testing and Materials [ASTM] has developed, "Standard Specification for Minimum 
Performance and Safety Requirements for Components and Systems of Anesthesia Gas Machines 
(Fl 161-88)." Even though there is no statutory law that requires it, manufacturers no longer produce, 
and thus hospitals can no longer purchase anesthesia machines without the built-in safety features 
specified in this standard. The Association for the Advancement of Medical Instrumentation has spon- 
sored numerous standards that relate to performance of specific medical devices., such as defibrillators, 
electrosurgical instruments, and electronic sphygmomanometers. These standards are compiled in the 
AAMI publication, "Essential Standards for Biomedical Equipment Safety and Performance." The 
National Fire Protection Association [NFPA] publishes "Standard for Health Care Facilities (NFPA 99)," 
which covers a wide range of safety issues relating to facilities. Included are sections that deal with 
electricity and electrical systems, central gas and vacuum supplies, and environmental conditions. Special 
areas such as anesthetizing locations, laboratories, and hyperbaric facilities are addressed separately. 

Mandatory standards have the force of law or other authority having jurisdiction. Mandatory standards 
imply that some authority has made them obligatory. Mandatory standards can be written by the 
authority having jurisdiction, or they can be adapted from documents prepared by others as proprietary 
or consensus standards. The authority having jurisdiction can be a local hospital or even a department 
within the hospital, a professional society, a municipal or state government, or an agency of the federal 
government that has regulatory powers. 

In the United States, hospitals are generally regulated by a local city or county authority, and/or by 
the state. These authorities set standards in the form of health codes or regulations, which have the force 
of law. Often, these local bodies consider the requirements of a voluntary group, the Joint Commission 
for Accreditation of Healthcare Organizations, in their accreditation and regulatory processes. 

American National Standards. The tradition in the United States is that of voluntary standards. How- 
ever, once a standard is adopted by an organization, it can be taken one step further. The American 
National Standards Institute (ANSI) is a private, non-government, voluntary organization that acts as a 
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coordinating body for standards development and recognition in the United States. If the development 
process for a standard meets the ANSI criteria of open deliberation of legitimate concerns, with all 
interested parties coming to a voluntary consensus, then the developers can apply (but are not required) 
to have their standard designated as an American National Standard. Such a designation does not make 
a standard any more legitimate, but it does offer some recognition as to the process by which it has been 
developed. ANSI also acts as a clearing house for standards development, so as to avoid duplication of 
effort by various groups that might be concerned with the same issues. ANSI is also involved as a U.S. 
coordinating body for many international standards activities. 

An excellent source that lists existing standards and standards generating organizations, both nationally 
and internationally, along with some of the workings of the FDA (see below), is the "Medical Device 
Industry Fact Book" [Allen, 1996]. 

172.5 Medical Devices 

On the national level, oversight is generally restricted to medical devices, and not on operational matters. 
Federal jurisdiction of medical devices falls under the purview of the Department of Health and Human 
Services, Public Health Service, Food and Drug Administration (FDA), Center for Devices and Radio- 
logical Health. Under federal law, medical devices are regulated under the "Medical Device Amendments 
of 1976" and the "Radiation Control for Health and Safety Act of 1968." Additional regulatory authori- 
zation is provided by the "Safe Medical Devices Act of 1990", the "Medical Device Amendments of 1992," 
the "FDA Reform and Enhancement Act of 1996," and the "Food and Drug Administration Modernization 
Act of 1997." 

A medical device is defined by Section 201 of the Federal Food, Drug, and Cosmetic Act (as amended), 
as an: 

instrument, apparatus, implement, machine, contrivance, implant, in vitro reagent, or other similar 
or related article including any component, part, or accessory which is: 

recognized in the official National Formulary, or the United States Pharmacopeia, or any supplement 
to them; 

intended for use in the diagnosis of disease or other conditions, or in the care, mitigation, treatment, 
or prevention of disease, in man or other animals, or 

intended to affect the structure of any function of the body of man or other animals; and which does 
not achieve its primary intended purposes through chemical action within or on the body of man . . . and 
which is not dependent upon being metabolized for the achievement of its primary intended purposes. 
The major thrust of the FDA has been in the oversight of the manufacture of medical devices, with 
specific requirements based on categories of perceived risks. The 1976 Act (Section 513) establishes three 
classes of medical devices intended for human use: 

Class I. General controls regulate devices for which controls other than performance standards or 
premarket approvals are sufficient to assure safety and effectiveness. Such controls include regulations 
that (1) prohibit adulterated or misbranded devices; (2) require domestic device manufacturers and 
initial distributors to register their establishments and list their devices; (3) grant FDA authority to 
ban certain devices; (4) provide for notification of risks and of repair, replacement, or refund; 
(5) restrict the sale, distribution, or use of certain devices; and (6) govern Good Manufacturing 
Practices, records, and reports, and inspections. These minimum requirements apply also to Class II 
and Class III devices. 

Class II. Performance Standards apply to devices for which general controls alone do not provide 
reasonable assurance of safety and efficacy, and for which existing information is sufficient to establish 
a performance standard that provides this assurance. Class II devices must comply not only with 
general controls, but also with an applicable standard developed under Section 514 of the Act. Until 
performance standards are developed by regulation, only general controls apply. 
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Class III. Premarket Approval applies to devices for which general controls do not suffice or for which 
insufficient information is available to write a performance standard to provide reasonable assurance 
of safety and effectiveness. Also, devices which are used to support or sustain human life or to prevent 
impairment of human health, devices implanted in the body, and devices which present a potentially 
unreasonable risk of illness or injury. New Class III devices, those not "substantially equivalent" to a 
device on the market prior to enactment (May 28, 1976), must have approved Premarket Approval 
Applications (Section 510 k). 

Exact specifications for General Controls and Good Manufacturing Practices (GMP) are defined in 
various FDA documents. Aspects of General Controls include yearly manufacturer registration, device 
listing, and premarket approval. General Controls are also used to regulate adulteration, misbranding 
and labeling, banned devices, and restricted devices. Good Manufacturing Practices include concerns of 
organization and personnel; buildings and equipment; controls for components, processes, packaging, 
and labeling; device holding, distribution, and installation; manufacturing records; product evaluation; 
complaint handling; and a quality assurance program. Design controls for GMP were introduced in 1996. 
They were motivated by the FDA's desire to harmonize its requirements with those of a proposed 
international standard (ISO 13485). Factors that need to be addressed include planning, input and output 
requirements, review, verification and validation, transfer to production, and change procedures, all 
contained in a history file for each device. Device tracking is typically required for Class III life-sustaining 
and implant devices, as well as postmarket surveillance for products introduced starting in 1991. 

Other categories of medical devices include combination devices, in which a device may incorporate drugs 
or biologicals. Combination devices are controlled via intercenter arrangements implemented by the FDA. 

Transitional devices refer to devices that were regulated as drugs, prior to the enactment of the Medical 
Device Amendments Act of 1976. These devices were automatically placed into Class III, but may be 
transferred to Class I or II. 

A custom device may be ordered by a physician for his/her own use or for a specific patient. These 
devices are not generally available, and cannot be labeled or advertised for commercial distribution. 

An investigational device is one that is undergoing clinical trials prior to premarket clearance. If the 
device presents a significant risk to the patient, an Investigational Device Exemption must be approved 
by the FDA. Information must be provided regarding the device description and intended use, the origins 
of the device, the investigational protocol, and proof of oversight by an Institutional Review Board to 
insure informed patient consent. Special compassionate or emergency use for a nonapproved device or 
for a nonapproved use can be obtained from the FDA under special circumstances, such as when there 
is no other hope for the patient. 

Adverse Events. The Safe Medical Devices Act of 1990 included a provision by which both users and 
manufacturers (and distributors) of medical devices are required to report adverse patient events that 
may be related to a medical device. Manufacturers must report to the FDA if a device (a) may have caused 
or contributed to a death or serious injury, or (b) malfunctioned in such a way as would be likely to 
cause or contribute to a death or serious injury if the malfunction were to reoccur. Device users are 
required to notify the device manufacturer of reportable incidents, and must also notify the FDA in case 
of a device-related death. In addition, the FDA established a voluntary program for reporting device 
problems that may not have caused an untoward patient event, but which may have the potential for 
such an occurrence under altered circumstances. 

New devices. As part of the General Controls requirements, the FDA must be notified prior to marketing 
any new (or modifying an existing) device for patient use. This premarket notification, called the 510(k) 
process after the relevant section in the Medical Device Amendments Act, allows the FDA to review the 
device for safety and efficacy. 

There are two broad categories that a device can fall into. A device that was marketed prior to May 
28, 1976 (the date that the Medical Device Amendments became effective) can continue to be sold. Also, 
a product that is "substantially equivalent" to a preamendment device can likewise be marketed. However, 
the FDA may require a premarket approval application for any Class III device (see below). Thus, these 
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preamendment devices and their equivalents are approved by "grandfathering." (Premarket notification 
to the FDA is still required to assure safety and efficacy). Of course, the question of substantial equivalency 
is open to an infinite number of interpretations. From the manufacturer's perspective, such a designation 
allows marketing the device without a much more laborious and expensive premarket approval process. 

A new device that the FDA finds is not substantially equivalent to a premarket device is automatically 
placed into Class III. This category includes devices that provide functions or work through principles 
not present in preamendment devices. Before marketing, this type of device requires a Premarket 
Approval Application by the manufacturer, followed by an extensive review by the FDA. (However, the 
FDA can reclassify such devices into Class I or II, obviating the need for premarket approval). The review 
includes scientific and clinical evaluation of the application by the FDA and by a Medical Advisory 
Committee (composed of outside consultants). In addition, the FDA looks at the manufacturing and 
control processes to assure that all appropriate regulatory requirements are being adhered to. Clinical 
(use of real patients) trials are often required for Class III devices in order to provide evidence of safety 
and efficacy. To carry out such trials, an Investigational Device Exemption must be issued by the FDA. 

The Food and Drug Administration Modernization Act of 1997, which amends section 514 of the Food, 
Drug, and Cosmetic Act, has made significant changes in the above regulations. These changes greatly 
simplify and accelerate the entire regulatory process. For example, the law exempts from premarket 
notification Class I devices that are not intended for a use that is of substantial importance in preventing 
impairment of human health, or that do not present a potential unreasonable risk of illness or injury. 
Almost 600 Class I generic devices have been so classified by the agency. In addition, the FDA will specify 
those Class II devices for which a 510(k) submission will also not be required. 

Several other regulatory changes have been introduced by the FDA to simplify and speed up the 
approval process. So-called "third party" experts will be allowed to conduct the initial review of all Class I 
and low-to-intermediate risk Class II devices. Previously, the FDA was authorized to create standards for 
medical devices. The new legislation allows the FDA to recognize and use all or parts of various appro- 
priate domestic and internationally recognized consensus standards that address aspects of safety and/ or 
effectiveness relevant to medical devices. 

172.6 International Standards 

Most sovereign nations have their own internal agencies to establish and enforce standards. However, in 
our present world of international cooperation and trade, standards are tending towards uniformity 
across national boundaries. This internationalization of standards is especially true since formation of 
the European Common Market. The aim here is to harmonize the standards of individual nations by 
promulgating directives for medical devices that address "Essential Requirements" [Freeman, 1993] (see 
below). Standards in other areas of the world (Asia, Eastern Europe) are much more fragmented, with 
each country specifying regulations for its own manufactured and imported medical devices. 

There are two major international standards generating organizations, both based in Europe, the 
International Electrotechnical Commission (IEC) and the International Organization for Standardization 
(ISO). Nations throughout the world participate in the activities of these organizations. 

The International Electrotechnical Commission [IEC], founded in 1906, oversees, on an international 
level, all matters relating to standards for electrical and electronic items. Membership in the IEC is held 
by a National Committee for each nation. The United States National Committee (USNC) for IEC was 
founded in 1907, and since 1931 has been affiliated with ANSI. USNC has as its members representatives 
from professional societies, trade associations, testing laboratories, government entities, other organiza- 
tions, and individual experts. The USNC appoints a technical advisor and a technical advisory group for 
each IEC Committee and Subcommittee to help develop a unified United States position. These advisory 
groups are drawn from groups that are involved in the development of related U.S. national standards. 

Standards are developed by Technical Committees (TC), Subcommittees (SC), and Working Groups 
(WG). IEC TC 62, "Electrical Equipment in Medical Practice," is of particular interest here. One of the 
basic standards of this Technical Committee is document 60 1 - 1 , "Safety of Medical Electrical Equipment, 
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Part 1: General Requirements for Safety," 2nd Edition (1988) and its Amendment 1 (1991), along with 
Document 601-1-1, "Safety Requirements for Medical Electrical Systems" (1992). 

The International Organization for Standardization [ISO], oversees aspects of device standards other 
than those related to electrotechnology. This organization was formed in 1946 with a membership 
comprised of the national standards organizations of 26 countries. There are currently some 90 nations 
as members. The purpose of the ISO is to "facilitate international exchange of goods and services and to 
develop mutual cooperation in intellectual, scientific, technological, and economic ability." ISO addresses 
all aspects of standards except for electrical and electronic issues, which are the purview of the Interna- 
tional Electrotechnical Commission. ANSI has been the official United States representative to ISO since 
its inception. For each Committee or Subcommittee of the ISO in which ANSI participates, a U.S. 
Technical Advisory Group (TAG) is formed. The administrator of the TAG is, typically, that same U.S. 
organization that is developing the parallel U.S. standard. 

Technical Committees (TC) of the ISO concentrate on specific areas of interest. There are Technical 
Committees, Subcommittees, Working Groups and Study Groups. One of the member national standards 
organizations serves as the Secretariat for each of these technical bodies. 

One standard of particular relevancy to manufacturers throughout the world is ISO 9000. This standard 
was specifically developed to assure a total quality management program that can be both universally 
recognized and applied to any manufacturing process. It does not address any particular product or 
process, but is concerned with structure and oversight of how processes are developed, implemented, 
monitored, and documented. An independent audit must be passed by any organization to obtain ISO 
9000 registration. Many individual nations and manufacturers have adopted this standard and require 
that any product that they purchase be from a source that is ISO 9000 compliant. 

The European Union was, in effect, created by the Single Europe Act, (EC-92) as a region "without 
internal frontiers in which the free movement of goods, persons, and capital is ensured." For various 
products and classes of products, the European Commission issues directives with regard to safety and 
other requirements, along with the means for assessing conformity to these directives. Products that 
comply with the appropriate directives can then carry the CE mark. EU member states ratify these 
directives into national law. 

Two directives related to medical devices are the Medical Devices Directive (MDD), enacted in 1993 
(mandatory as of June 15,1998), and the Active Implanted Medical Devices Directive (AIMDD), effective 
since 1995. Safety is the primary concern of this system, and as in the U.S., there are three classes of risk. 
These risks are based on what and for how long the device touches, and its effects. Safety issues include 
electrical, mechanical, thermal, radiation, and labeling. Voluntary standards that address these issues are 
formulated by the European Committee for Standardization (CEN) and the European Committee for 
Electrotechnical Standardization (CENELEC). 

172.7 Compliance with Standards 

Standards that were originally developed on a voluntary basis may take on mandatory aspects. Standards 
that were developed to meet one particular need may be used to satisfy other needs as well. Standards 
will be enforced and adhered to if they meet the needs of those who are affected by them. For example, 
consider a standard for safety and performance for a defibrillator. For the manufacturer, acceptance and 
sales are a major consideration in both the domestic and international markets. People responsible for 
specifying, selecting, and purchasing equipment may insist on adherence to the standard so as to guar- 
antee safety and performance. The user, physician or other health care professional, will expect the 
instrument to have certain operational and performance characteristics to meet medical needs. Hospital 
personnel want a certain minimum degree of equipment uniformity for ease of training and maintenance. 
The hospital's insurance company and risk manager want equipment that meets or exceeds recognized 
safety standards. Third party payers, that is private insurance companies or government agencies, insist 
on equipment that is safe, efficacious, and cost effective. Accreditation agencies, such as local health 
agencies or professional societies, often require equipment to meet certain standards. More basically, 
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patients, workers, and society as a whole have an inherent right to fundamental safety. Finally, in our 
litigatious society, there is always the threat of civil action in the case of an untoward event in which a 
"non-standard", albeit "safe", instrument was involved. Thus, even though no one has stated "this standard 
must be followed", it is highly unlikely that any person or organization will have the temerity to manu- 
facture, specify, or buy an instrument that does not "meet the standard." 

Another example of how standards become compulsory is via accreditation organizations. The Joint 
Commission for Accreditation of Healthcare Organizations has various standards (requirements). This 
organization is a private body that hospitals voluntarily accept as an accrediting agent. However, various 
health insurance organizations, governmental organizations, and physician specialty boards for resident 
education use accreditation by the JCAHO as a touchstone for quality of activities. Thus, an insurance 
company might not pay for care in a hospital that is not accredited, or a specialty board might not 
recognize resident training in such an institution. Thus, the requirements of the JCAHO, in effect, become 
mandatory standards for health care organizations. 

A third means by which voluntary standards can become mandatory is by incorporation. Existing 
standards can be incorporated into a higher level of standards or codes. For example, various state and 
local governments incorporate standards developed by voluntary organizations, such as the National Fire 
Protection Association, into their own building and health codes. These standards then become, in effect, 
mandatory government regulations, and have the force of (civil) law. In addition, as discussed above, the 
FDA will now recognize voluntary standards developed by recognized organizations. 

172.8 Limitations of Standards 

Standards are generated to meet the expectations of society. They are developed by organizations and 
individuals to meet a variety of specific needs, with the general goals of promoting safety and efficiency. 
However, as with all human activities, problems with the interpretation and use of standards do occur. 
Engineering judgment is often required to help provide answers. Thus, the clinical engineer must consider 
the limits of standards, a boundary that is not clear and is constantly shifting. Yet clinical engineers must 
always employ the highest levels of engineering principles and practices. Some of the limitations and 
questions of standards and their use will be discussed below. 

Non-Compliance with a Standard. Sooner or later, it is likely that a clinical engineer will either be 
directly involved with or become aware of deviation from an accepted standard. The violation may be 
trivial, with no noticeable effect, or there may be serious consequences. In the former case, either the 
whole incident may be ignored, or nothing more may be necessary than a report that is filed away, or 
the incident can trigger some sort of corrective action. In the latter case, there may be major repercussions 
involving investigation, censure, tort issues, or legal actions. In any event, lack of knowledge about the 
standard is not a convincing defense. Anyone who is in a position that requires knowledge about a 
standard should be fully cognizant of all aspects of that standard. In particular, one should know the 
provisions of the standard, how they are to be enforced, and the potential risks of non-compliance. 
Nonetheless, non-compliance with a standard, in whole or in part, may be necessary to prevent a greater 
risk or to increase a potential benefit to the patient. For example, when no other recourse is available, it 
would be defensible to use an electromagnet condemned for irreparable excessive leakage current to 
locate a foreign body in the eye of an injured person, and thus save the patient's vision. Even if the use 
of this device resulted in a physical injury or equipment damage, the potential benefit to the patient is 
a compelling argument for use of the non-compliant device. In such a case, one should be aware of and 
prepared to act on the possible hazard (excessive electrical current, here). A general disclaimer making 
allowance for emergency situations is often included in policy statements relating to use of a standard. 
Drastic conditions require drastic methods. 

Standards and the Law. Standards mandated by a government body are not what is called "black letter 
law", that is a law actually entered into a criminal or civil code. Standards are typically not adopted in 
the same manner as laws, i.e., they are not approved by a legislative body, ratified by an elected executive, 
and sanctioned by the courts. The usual course for a mandated standard is via a legislative body enacting 
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a law that establishes or assigns to an executive agency the authority to regulate the concerned activities. 
This agency, under the control of the executive branch of government, then issues standards that follow 
the mandate of its enabling legislation. If conflicts arise, in addition to purely legal considerations, the 
judiciary must interpret the intent of the legislation in comparison with its execution. This type of law 
falls under civil rather than criminal application. 

The penalty for non-compliance with a standard may not be criminal or even civil prosecution. Instead, 
there are administrative methods of enforcement, as well as more subtle yet powerful methods of coercion. 
The state has the power (and the duty) to regulate matters of public interest. Thus, the state can withhold 
or withdraw permits for construction, occupancy, or use. Possibly more effective, the state can withhold 
means of finance or payments to violators of its regulations. Individuals injured by failure to abide by a 
standard may sue for damages in civil proceedings. However, it must be recognized that criminal pros- 
ecution is possible when the violations are most egregious, leading to human injury or large financial 
losses. 

Incorporation and Revision. Because of advances in technology and increases in societal expectations, 
standards are typically revised periodically. For example, the National Fire Protection Association revises 
and reissues its "Standard for Health Care Facilities" (NFPA 99) every three years. Other organizations 
follow a five year cycle of review, revision, and reissue of standards. These voluntary standards, developed 
in good faith, may be adapted by governmental agencies and made mandatory, as discussed above. When 
a standard is incorporated into a legislative code, it is generally referenced as to a particular version and 
date. It is not always the case that a newer version of the standard is more restrictive. For example, ever 
since 1984, the National Fire Protection Association "Standard for Health Care Facilities" (NFPA 99) 
does not require the installation of isolated power systems (isolation transformers and line isolation 
monitors) in anesthetizing locations that do not use flammable anesthetic agents, or in areas that are 
not classified as wet locations. A previous version of this standard, "Standard for the Use of Inhalation 
Anesthetics, (Flammable and Nonflammable)" (NFPA 56A-1978) did require isolated power. However, 
many State Hospital Codes have incorporated, by name and date, the provisions of the older standard, 
NFPA 56A. Thus, isolated power may still be required, by code, in new construction of all anesthetizing 
locations, despite the absence of this requirement in the latest version of the standard that addresses this 
issue. In such a case, the organization having jurisdiction in the matter must be petitioned to remedy 
this conflict between new and old versions of the standard. 

Safety. The primary purpose of standards in clinical practice is to assure the safety of patient, operator 
and bystanders. However, it must be fully appreciated that there is no such thing as absolute safety. The 
more safety features and regulations attached to a device, the less useful and the more cumbersome and 
costly may be its actual use. In the development, interpretation, and use of a standard, there are questions 
that must be asked: What is possible? What is acceptable? What is reasonable? Who will benefit? What 
is the cost? Who will pay? 

No one can deny that medical devices should be made as safe as possible, but some risk will always 
remain. In our practical world, absolute safety is a myth. Many medical procedures involve risk to the 
patient. The prudent physician or medical technologist will recognize the possible dangers of the equip- 
ment and take appropriate measures to reduce the risk to a minimum. Some instruments and procedures 
are inherently more dangerous than others. The physician must make a judgment, based on his/her own 
professional knowledge and experience, as well as on the expectations of society, whether using a par- 
ticular device is less of a risk than using an alternative device or doing nothing. Standards will help — but 
they do not guarantee complete safety, a cure, or legal and societal approval. 

Liability. Individuals who serve on committees that develop standards, as well as organizations 
involved in such activities, are justifiably concerned with their legal position in the event that a lawsuit 
is instituted as a result of a standard that they helped to bring forth. Issues involved in such a suit may 
include restraint of trade, in case of commercial matters, or to liability for injury due to acts of commission 
or of omission. Organizations that sponsor standards or that appoint representatives to standards devel- 
oping groups often have insurance for such activities. Independent standards committees and individual 
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members of any standards committees may or may not be covered by insurance for participation in these 
activities. Although in recent times only one organization and no individual has been found liable for 
damages caused by improper use of standards (see following paragraph), even the possibility of being 
named in a lawsuit can intimidate even the most self-confident "expert". Thus, it is not at all unusual for 
an individual who is asked to serve on a standards development committee first to inquire as to liability 
insurance coverage. Organizations that develop standards or appoint representatives also take pains to 
insure that all of their procedures are carefully followed and documented so as to demonstrate fairness 
and prudence. 

The dark side of standards is the implication that individuals or groups may unduly influence a 
standard to meet a personal objective, e.g., to dominate sales in a particular market. If standards are 
developed or interpreted unfairly, or if they give an unfair advantage to one segment, then restraint of 
trade charges can be made. This is why standards to be deemed consensus must be developed in a 
completely open and fair manner. Organizations that sponsor standards that violate this precept can be 
held responsible. In 1982, the United States Supreme Court, in the Hydrolevel Case [Perry, 1982], ruled 
that the American Society of Mechanical Engineers was guilty of antitrust activities because of the way 
some of its members, acting as a committee to interpret one of its standards, issued an opinion that 
limited competition in sales so as to unfairly benefit their own employers. This case remains a singular 
reminder that standards development and use must be inherently fair. 

Inhibition. Another charge against standards is that they inhibit innovation and limit progress [Flink, 
1984]. Ideally, standards should be written to satisfy minimum, yet sufficient, requirements for safety, 
performance, and efficacy. Improvements or innovations would still be permitted so long as the basic 
standard is followed. From a device users point of view, a standard that is excessively restrictive may limit 
the scope of permissible professional activities. If it is necessary to abrogate a standard in order to 
accommodate a new idea or to extend an existing situation, then the choice is to try to have the standard 
changed, which may be very time consuming, or to act in violation of the standard and accept the 
accompanying risks and censure. 

Ex Post Facto. A question continually arises as to what to do about old equipment (procedures, policies, 
facilities, etc.) when a new standard is issued or an old standard is revised so that existing items become 
obsolete. One approach, perhaps the simplest, is to do nothing. The philosophy here being that the old 
equipment was acquired in good faith and conformed to the then existing standards. As long as that 
equipment is usable and safe, there is no necessity to replace it. Another approach is to upgrade the 
existing equipment to meet the new standard. However, such modification may be technically impractical 
or financially prohibitive. Finally, one can simply throw out all of the existing equipment (or sell it to a 
second-hand dealer, or use the parts for maintenance) and buy everything new. This approach would 
bring a smile of delight from the manufacturer and a scream of outrage from the hospital administrator. 
Usually what is done is a compromise, incorporating various aspects of these different approaches. 

Costs. Standards cost both time and money to propose, develop, promulgate, and maintain. Perhaps 
the greatest hindrance to more participation in standards activities by interested individuals is the lack 
of funds to attend meetings where the issues are discussed and decisions are made. Unfortunately, but 
nonetheless true, organizations that can afford to sponsor individuals to attend such meetings have 
considerable influence in the development of that standard. On the other hand, those organizations that 
do have a vital interest in a standard should have an appropriate say in its development. A consensus of 
all interested parties tempers the undue influence of any single participant. 

From another viewpoint, standards increase the costs of manufacturing devices, carrying out proce- 
dures, and administering policies. This incremental cost is, in turn, passed on to the purchaser of the 
goods or services. Whether or not the increased cost justifies the benefits of the standard is not always 
apparent. It is impossible to realistically quantify the costs of accidents that did not happen or the 
confusion that was avoided by adhering to a particular standard. However, it cannot be denied that 
standards have made a valuable contribution to progress, in the broadest sense of that word. 
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172.9 Conclusions 

Standards are just like any other human activity, they can be well used or a burden. The danger of 
standards is that they will take on a life of their own; and rather than serve a genuine need will exist only 
as a justification of their own importance. This view is expressed in the provocative and iconoclastic 
book by Bruner and Leonard [1989], and in particular in their Chapter 9, 'Codes and Standards: Who 
Makes the Rules?' However, the raison d'etre of standards is to do good. It is incumbent upon clinical 
engineers, not only to understand how to apply standards properly, but also how to introduce, modify, 
and retire standards as conditions change. Furthermore, the limitations of standards must be recognized 
in order to realize their maximum benefit. No standard can replace diligence, knowledge, and a genuine 
concern for doing the right thing. 
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ECRI 173.2 Technology Assessment Agencies 



Effective management and development of clinical and biomedical engineering departments (hereafter 
called clinical engineering departments) in hospitals requires a basic knowledge of relevant regulatory 
and technology assessment agencies. Regulatory agencies set standards of performance and record keeping 
for the departments and the technology for which they are responsible. Technology assessment agencies 
are information resources for what should be an ever expanding role of the clinical engineer in the 
technology decision-making processes of the hospital's administration. 

This chapter presents an overview of regulatory and technology assessment agencies in the U.S., 
Canada, Europe, and Australia that are germane to clinical engineering. Due to the extremely large 
number of such agencies and information resources, we have chosen to focus on those of greatest relevance 
and/or informational value. The reader is directed to the references and sources of further information 
presented at the end of the chapter. 



173.1 Regulatory Agencies 



Within the healthcare field, there are over 38,000 applicable standards, clinical practice guidelines, laws, 
and regulations [ECRI, 1999]. Voluntary standards are promulgated by more than 800 organizations; 
mandatory standards by more than 300 state and federal agencies. Many of these organizations and 
agencies issue guidelines that are relevant to the vast range of healthcare technologies within the respon- 
sibility of clinical engineering departments. Although many of these agencies also regulate the manufac- 
ture and clinical use of healthcare technology, such regulations are not directly germane to the 
management of a clinical department and are not presented. 

For the clinical engineer, many agencies promulgate regulations and standards in the areas of, for 
example, electrical safety, fire safety, technology management, occupational safety, radiology and nuclear 
medicine, clinical laboratories, infection control, anesthesia and respiratory equipment, power distribu- 
tion, and medical gas systems. In the U.S., medical device problem reporting is also regulated by many 
state agencies and by the U.S. Food and Drug Administration (FDA) via its MEDWATCH program. It is 
important to note that, at present, the only direct regulatory authority that the FDA has over U.S. hospitals 
is in the reporting of medical device related accidents that result in serious injury or death. Chapter 172 
discusses in detail many of the specific agency citations. Presented below are the names and addresses of 
the primary agencies whose codes, standards, and regulations have the most direct bearing on clinical 
engineering and technology management. 
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American Hospital Association 
One North Franklin 
Chicago, IL 60606 
(312) 422-3000 
Website: www.aha.org 

American College of Radiology 
1891 Preston White Drive 
Reston.VA 22091 
(703) 648-8900 
Website: www.acr.org 

American National Standards Institute 

1 1 West 42nd Street 

13th Floor 

New York, NY 10036 

(212) 642-4900 

Website: www.ansi.org 

American Society for Hospital Engineering 
840 North Lake Shore Drive 
Chicago, IL 60611 

(312) 280 5223 
Website: www.ashe.org 

American Society for Testing and Materials 
1916 Race Street 
Philadelphia, PA 19103 
(215) 299-5400 
Website: www.astm.org 

Association for the Advancement of 

Medical Instrumentation 
3330 Washington Boulevard 
Suite 400 

Arlington, VA 22201 
(703) 525-4890 
Website: www.aami.org 

Australian Institute of Health and Welfare 

GPO Box 570 

Canberra, ACT 2601 

Australia 

(61) 06-243-5092 

Website: www.aihw.gov.au 

British Standards Institution 
2 Park Street 
London, W1A 2BS 
United Kingdom 
(44) 071-629-9000 
Website: www.bsi.org.uk 



Canadian Healthcare Association 

17 York Street 

Ottawa, ON KIN 9J6 

Canada 

(613) 241-8005 

Website: www.canadian-healthcare.org 

CSA International 

178 Rexdale Boulevard 

Etobicoke, ON M9W 1R3 

Canada 

(416) 747-4000 

Website: www.csa-international.org 

Center for Devices and Radiological Health 

Food and Drug Administration 

9200 Corporate Boulevard 

Rockville, MD 20850 

(301) 443-4690 

Website: www.fda.gov/cdrh 

Compressed Gas Association, Inc. 
1725 Jefferson Davis Highway 
Suite 1004 
Arlington, VA 22202 
(703) 412-0900 

ECRI 

5200 Butler Pike 

Plymouth Meeting, PA 19462 

(610) 825-6000 

(610) 834-1275 fax 

Websites: www.ecri.org 

www.ecriy2k.org 

www.mdsr.ecri.org 

Environmental Health Directorate 

Health Protection Branch 

Health Canada 

Environmental Health Centre 

19th Floor, Jeanne Mance Building 

Tunney's Pasture 

Ottawa, ON K1A 0L2 Canada 

(613) 957-3143 

Website: www.hc-sc.gc.ca/hpb/index_e.html 

Therapeutic Products Programme 

Health Canada 

Holland Cross, Tower B 

2nd Floor 

1600 Scott Street 
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Address Locator #3102D1 

Ottawa, ON K1A 1B6 

(613) 954-0288 

Website: www.hc-sc.gc.ca/hpb-dgps/therapeut 

Food and Drug Administration 
MEDWATCH, FDA Medical Products 

Reporting Program 
5600 Fishers Lane 
Rockville, MD 20857-9787 
(800) 332-1088 
Website: www.fda.gov/cdrh/mdr.html 

Institute of Electrical and Electronics Engineers 

445 Hoes Lane 

P.O. Box 1331 

Piscataway, NJ 08850-1331 

(732) 562-3800 

Website: www.standards.ieee.org 

International Electrotechnical Commission 

Box 131 

3 rue de Varembe, CH 1211 

Geneva 20 

Switzerland 

(41) 022-919-0211 

Website: www.iec.ch 

International Organization for Standardization 

1 rue de Varembe 

Case postale 56, CH 1211 

Geneva 20 

Switzerland 

(41) 022-749-0111 

Website: www.iso.ch 

Joint Commission on Accreditation 

of Healthcare Organizations 
One Renaissance Boulevard 
Oakbrook Terrace, IL 60181 
(630) 792-5600 
Website: www.jcaho.org 

Medical Devices Agency 

Department of Health 

Room 1209 

Hannibal House 

Elephant and Castle 

London, SE1 6TQ 

United Kingdom 

(44) 171-972-8143 

Website: www.medical-devices.gov.uk 



National Council on Radiation 
Protection and Measurements 
7910 Woodmont Avenue, Suite 800 
Bethesda, MD 20814 
(310) 657-2652 
Website: www.ncrp.com 

National Fire Protection Association 

1 Batterymarch Park 

PO Box 9101 

Quincy, MA 02269-9101 

(617) 770-3000 

Website: www.nfpa.org 

Nuclear Regulatory Commission 
11555 Rockville Pike 
Rockville, MD 20852 
(301) 492-7000 
Website: www.nrc.gov 

Occupational Safety and Health Administration 

US Department of Labor 

Office of Information and Consumer Affairs 

200 Constitution Avenue, NW 

Room N3647 

Washington, DC 20210 

(202) 219-8151 

Website: www.osha.gov 

ORKI 

National Institute for Hospital and Medical 

Engineering 
Budapest dios arok 3, H-1125 
Hungary 
(33) 1-156-1522 

Radiation Protection Branch 

Environmental Health Directorate 

Health Canada 

775 Brookfield Road 

Ottawa, ON K1A 1C1 

Website: www.hc-sc.gc. ca/ehp/ehd/rpb 

Russian Scientific and Research Institute 

Russian Public Health Ministry 

EKRAN, 3 Kasatkina Street 

Moscow 

Russia 129301 

(44) 071-405-3474 
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Society of Nuclear Medicine, Inc. 
1850 Samuel Morse Drive 
Reston.VA 20190-5316 
(703) 708-9000 
Website: www.snm.org 

Standards Association of Australia 

PO Box 1055 

Strathfield, NSW 2135 

Australia 

(61) 02-9746-4700 

Website: www.standards.org.au 



Therapeutic Goods Administration 

PO Box 100 

Wooden, ACT 2606 

Australia 

(61) 2-6232-8610 

Website: www.health.gov.au/tga 

Underwriters Laboratories, Inc. 
333 Pfingsten Road 
Northbrook, IL 60062-2096 
(847) 272-8800 
Website: www.ul.com 



VTT 

Technical Research Center of Finland 

Postbox316 

SF-33101 Tampere 10 

Finland 

(358) 31-163300 

Website: www.vti.fi 



173.2 Technology Assessment Agencies 



Technology assessment is the practical process of determining the value of a new or emerging technology 
in and of itself or against existing or competing technologies using safety, efficacy, effectiveness, outcome, 
risk management, strategic, financial, and competitive criteria. Technology assessment also considers 
ethics and law as well as health priorities and cost-effectiveness compared to competing technologies. A 
"technology" is defined as devices, equipment, related software, drugs, biotechnologies, procedures, and 
therapies; and systems used to diagnose or treat patients. The processes of technology assessment are 
discussed in detail in Chapter 168. 

Technology assessment is not the same as technology acquisition/procurement or technology planning. 
The latter two are processes for determining equipment vendors, soliciting bids, and systematically 
determining a hospital's technology related needs based on strategic, financial, risk management, and 
clinical criteria. The informational needs differ greatly between technology assessment and the acquisi- 
tion/procurement or planning processes. This section focuses on the resources applicable to technology 
assessment. 

Worldwide, there are nearly 400 organizations (private, academic, and governmental), providing 
technology assessment information, databases, or consulting services. Some are strictly information 
clearing houses, some perform technology assessment, and some do both. For those that perform 
assessments, the quality of the information generated varies greatly from superficial studies to in-depth, 
well referenced analytical reports. In 1997, the U.S. Agency for Health Care Policy and Research (AHCPR) 
designated 12 "Evidence-Based Practice Centers" (EPC) to undertake major technology assessment stud- 
ies on a contract basis. Each of these EPCs are noted in the list below and general descriptions of each 
center may be viewed on the internet at the AHCPR Website http://www.ahcpr.gov/clinic/epc/. 

Language limitations are a significant issue. In the ultimate analysis, the ability to undertake technology 
assessment requires assimilating vast amounts of information, most of which exists only in the English 
language. Technology assessment studies published by the International Society for Technology Assess- 
ment in Health Care (ISTAHC), by the World Health Organization, and other umbrella organizations 
are generally in English. The new International Health Technology Assessment database being developed 
by ECRI in conjunction with the U.S. National Library of Medicine contains more than 30,000 citations 
to technology assessments and related documents. 
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Below are the names, mailing addresses, and Internet Website addresses of some of the most prominent 
organizations undertaking technology assessment studies: 



Agence Nationale pour le Develeppement 
de l'Evaluation Medicale 
159 Rue Nationale 
Paris 75013 
France 

(33) 42-16-7272 

Website: www.upml.fr/andem/andem.htm 

Agencia de Evaluacion de Technologias Sanitarias 

Ministerio de Sanidad y Consumo 

Instituto de Salud Carlos III, AETS 

Sinesio Delgado 6, 28029 Madrid 

Spain 

(34) 1-323-4359 
Website: www.isciii.es/aets 

Alberta Heritage Foundation for 

Medical Research 
125 Manulife Place 
10180-101 Street 
Edmonton, AB T5J 345 
(403) 423-5727 
Website: www.ahfmr.ab.ca 

American Association of Preferred 

Provider Organizations 
601 13th Street, NW 
Suite 370 South 
Washington, DC 20005 
(202) 347-7600 

American Academy of Neurology 
1080 Montreal Avenue 
St. Paul, MN 55116-2791 
(612) 695-2716 
Website: www.aan.com 

American College of Obstetricians 

and Gynecologists 
409 12th Street, SW 
Washington, DC 20024 
(202) 863-2518 
Website: www.acog.org 

Australian Institute of Health and Welfare 

GPO Box 570 

Canberra, ACT 2601 

Australia 

(61) 06-243-5092 

Website: www.aihw.gov.au 



Battelle Medical Technology Assessment 
and Policy Research Center (MEDTAP) 
901 D Street, SW 
Washington, DC 20024 
(202) 479-0500 
Website: www.battelle.org 

Blue Cross and Blue Shield Association 

Technology Evaluation Center 

225 N Michigan Avenue 

Chicago, IL 60601-7680 

(312) 297-5530 

(312) 297-6080 (publications) 

Website: www.bluecares.com/new/clinical 

(An EPC ofAHCPK) 

British Columbia Office of Health 

Technology Assessment 
Centre for Health Services & Policy Research, 

University of British Columbia 
429-2194 Health Sciences Mall 
Vancouver, BC V6T 1Z3 
CANADA 
(604) 822-7049 
Website: www.chspr.ubc.ca 

British Institute of Radiology 
36 Portland Place 
London, WIN 4 AT 
United Kingdom 
(44) 171-580-4085 
Website: www.bir.org.uk 

Canadian Coordinating Office for Health 
Technology Assessment 
110-955 Green Valley Crescent 
Ottawa ON K2C 3V4 
CANADA 
(613) 226-2553 
Website: www.ccohta.ca 

Canadian Healthcare Association 

17 York Street 

Ottawa, ON KIN 9J6 

Canada 

(613) 241-8005 

Website: www.canadian-healthcare.org 
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Catalan Agency for Health 

Technology Assessment 
Travessera de les Corts 131-159 
Pavello Avenue 
Maria, 08028 Barcelona 
Spain 

(34) 93-227-29-00 
Website: www.aatm.es 

Centre for Health Economics 
University of York 
YorkYOl 5DD 
United Kingdom 

(44) 01904-433718 
Website: www.york.ac.uk 

Center for Medical Technology Assessment 

Linkoping University 

5183 Linkoping, Box 1026 (551-11) 

Sweden 

(46) 13-281-000 

Center for Practice and Technology Assessment 
Agency for Health Care Policy 

and Research (AHCPR) 
6010 Executive Boulevard, Suite 300 
Rockville, MD 20852 
(301) 594-4015 
Website: www.ahcpr.gov 

Committee for Evaluation and Diffusion 

of Innovative Technologies 
3 Avenue Victoria 
Paris 75004 
France 
(33) 1-40-273-109 

Conseil d'evaluation des technologies 

de la sante du Quebec 
201 Cremazie Boulevard East 
Bur 1.01 

Montreal, PQ H2M 1L2 
Canada 

(514) 873-2563 
Website: www.msss.gouv.qc.ca 

Danish Hospital Institute 
Landermaerket 10 
Copenhagen K 
Denmark DK1 119 

(45) 33-11-5777 



Danish Medical Research Council 
Bredgade 43 
1260 Copenhagen 
Denmark 

(45) 33-92-9700 

Danish National Board of Health 

Amaliegade 13 

PO Box 2020 

Copenhagen K 

Denmark DK1012 

(45) 35-26-5400 

Duke Center for Clinical Health Policy Research 

Duke University Medical Center 

2200 West Main Street, Suite 320 

Durham, NC 27705 

(919) 286-3399 

Website: www.clinipol.mc.duke.edu 

(An EPC of AHCPR) 

ECRI 

5200 Butler Pike 

Plymouth Meeting, PA 19462 

(610) 825-6000 

(610) 834-1275 fax 

Websites:www.ecri.org 

www.ecriy2k.org 

www.mdsr.ecri.org 

[An EPC of AHCPR) 

Finnish Office for Health Care 

Technology Assessment 
PO Box 220 
FIN-00531 Helsinki 
Finland 

(35) 89-3967-2296 
Website: www.stakes.fi/finohta 

Frost and Sullivan, Inc. 
106 Fulton Street 
New York, NY 10038-2786 
(212) 233-1080 
Website: www.frost.com 

Health Council of the Netherlands 

PO Box 1236 

2280 CE 

Rijswijk 

The Netherlands 

(31) 70-340-7520 
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Health Services Directorate Strategies 

and Systems for Health 
Health Promotion 

Health Promotion and Programs Branch 
Health Canada 
1915B Tunney's Pasture 
Ottawa, ON K1A 1B4 
Canada 

(613) 954-8629 
Website: www.hc-sc.gc.ca/hppb/hpol 

Health Technology Advisory Committee 

121 East 7th Place, Suite 400 

PO Box 64975 

St. Paul, MN 55164-6358 

(612) 282-6358 

Hong Kong Institute of Engineers 

9/F Island Centre 

No. 1 Great George Street 

Causeway Bay 

Hong Kong 

Institute for Clinical PET 
7100-A Manchester Boulevard 
Suite 300 

Alexandria, VA 22310 
(703) 924-6650 
Website: www.icpet.org 

Institute for Clinical Systems Integration 
8009 34th Avenue South 
Minneapolis, MN 55425 
(612) 883-7999 
Website: www.icsi.org 

Institute for Health Policy Analysis 
8401 Colesville Road, Suite 500 
Silver Spring, MD 20910 
(301) 565-4216 

Institute of Medicine (U.S.) 
National Academy of Sciences 
2101 Constitution Avenue, NW 
Washington, DC 20418 
(202) 334-2352 
Website: www.nas.edu/iom 

International Network of Agencies for 

Health Technology Assessment 
c/o SBU, Box 16158 



S-103 24 Stockholm 

Sweden 

(46) 08-611-1913 

Website: www.sbu.se/sbu-site/links/inahta 

Johns Hopkins Evidence-based Practice Center 

The Johns Hopkins Medical Institutions 

2020 E Monument Street, Suite 2-600 

Baltimore, MD 21205-2223 

(410) 955-6953 

Website: www.jhsph.edu/Departments/Epi/ 

(An EPC ofAHCPK) 

McMaster University Evidence-based 

Practice Center 
1200 Main Street West, Room 3H7 
Hamilton, ON L8N 3Z5 
Canada 

(905) 525-9140 ext. 22520 
Website: http://hiru.mcmaster.ca.epc/ 
(An EPC ofAHCPR) 

Medical Alley 

1550 Utica Avenue, South 

Suite 725 

Minneapolis, MN 55416 

(612) 542-3077 

Website: www.medicalalley.org 

Medical Devices Agency 

Department of Health 

Room 1209 

Hannibal House 

Elephant and Castle 

London, SE1 6TQ 

United Kingdom 

(44) 171-972-8143 

Website: www.medical-devices.gov.uk 

Medical Technology Practice Patterns Institute 
4733 Bethesda Avenue, Suite 510 
Bethesda, MD 20814 
(301) 652-4005 
Website: www.mtppi.org 

MEDTAP International 

7101 Wisconsin Avenue, Suite 600 

Bethesda MD 20814 

(301) 654-9729 

Website: www.medtap.com 
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MetaWorks, Inc. 
470 Atlantic Avenue 
Boston, MA 02210 
(617) 368-3573 ext. 206 
Website: www.metawork.com 
(An EPC ofAHCPR) 

National Institute of Nursing Research, NIH 

31 Center Drive, Room 5B10. MSC 2178 

Bethesda, MD 20892-2178 

(301) 496-0207 

Website: www.nih.gov/ninr 

National Commission on Quality Assurance 
2000 L Street NW, Suite 500 
Washington. DC 20036 
(202) 955-3500 
Website: www.ncqa.org 

National Committee of Clinical Laboratory Stan- 
dards (NCCLS) 
940 West Valley Road, Suite 1400 
Wayne, PA 19087-1898 
(610) 688-0100 
Website: www.nccls.org 

National Coordinating Center for 

Health Technology Assessment 
Boldrewood (Mailpoint 728) 
Univ of Southampton SO 16 7PX 
United Kingdom 
(44) 170-359-5642 
Website: www.soton.ac.uk/~hta/address.htm 

National Health and Medical Research Council 

GPO Box 9848 

Canberra, ACT 

Australia 

(61) 06-289-7019 

New England Medical Center 

Center for Clinical Evidence Synthesis 

Division of Clinical Research 

750 Washington Street, Box 63 

Boston, MA 02111 

(617) 636-5133 

Website: www.nemc.org/medicine/ccr/cces.htm 

(An EPC ofAHCPR) 

New York State Department of Health 
Tower Building 
Empire State Plaza 



Albany, NY 12237 

(518) 474-7354 

Website: www.health.state.ny.us 

NHS Centre for Reviews and Dissemination 

University of York 

YorkYOl 5DD 

United Kingdom 

(44) 01-904-433634 

Website: www.york.ac.uk 

Office of Medical Applications of Research 

NIH Consensus Program Information Service 

PO Box 2577 

Kensington, MD 20891 

(301) 231-8083 

Website: odp.od.nih.gov/consensus 

Ontario Ministry of Health 

Hepburn Block 

80 Grosvenor Street 

10th Floor 

Toronto, ON M7A 2C4 

(416) 327-4377 

Oregon Health Sciences University 
Division of Medical Informatics 

and Outcomes Research 
3181 SW Sam Jackson Park Road 
Portland, OR 97201-3098 
(503) 494-4277 
Website: www.ohsu.edu/epc 
(An EPC ofAHCPR) 

Pan American Health Organization 
525 23rd Street NW 
Washington, DC 20037-2895 
(202) 974-3222 
Website: www.paho.org 

Physician Payment Review Commission (PPRC) 
2120 L Street NW 
Suite 510 

Washington, DC 20037 
(202) 653-7220 

Prudential Insurance Company of America 

Health Care Operations and Research 

Division 

56 N Livingston Avenue 

Roseland, NJ 07068 

(201) 716-3870 
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Research Triangle Institute 

3040 Cornwallis Road 

PO Box 12194 

Research Triangle Park, NC 27709-2194 

(919) 541-6512 

(919) 541-7480 

Website: www.rti.org/epc/ 

(An EPC ofAHCPR) 

San Antonio Evidence-based Practice Center 

University of Texas Health Sciences Center 

Department of Medicine 

7703 Floyd Curl Drive 

San Antonio, TX 78284-7879 

(210) 617-5190 

Website: www.uthscsa.edu/ 

(An EPC ofAHCPR) 

Saskatchewan Health 

Acute and Emergency Services Branch 

3475 Albert Street 

Regina, SK S4S 6X6 

(306) 787-3656 

Scottish Health Purchasing Information Centre 

Summerfield House 

2 Eday Road 

Aberdeen AB15 6RE 

Scotland 

United Kingdom 

(44) 0-1224-663-456, ext. 75246 

Website: www.nahat.net/shpic 

Servicio de Evaluacion de Technologias Sanitarias 

Duque de Wellington 2 

E01010 Vitoria-Gasteiz 

Spain 

(94) 518-9250 

E-mail: osteba-san@ej-gv.es 

Society of Critical Care Medicine 

8101 E Kaiser Boulevard 

Suite 300 

Anaheim, CA 92808-2259 

(714) 282-6000 

Website: www.sccm.org 

Swedish Council on Technology Assessment 

in Health Care 
Box 16158 



S-103 24 Stockholm 

Sweden 

(46) 08-611-1913 

Website: www.sbu.se 

Southern California EPC-RAND 

1700 Main Street 

Santa Monica, CA 90401 

(310) 393-0411 ext. 6669 

Website: www.rand.org/organization/health/epc/ 

(An EPC ofAHCPR) 

Swiss Institute for Public Health 

Health Technology Programme 

Pfrundweg 14 

CH-5001 Aarau 

Switzerland 

(41) 064-247-161 

TNO Prevention and Health 

PO Box 2215 

2301 CE Leiden 

The Netherlands 

(31) 71-518-1818 

Website: www.tno.nl/instit/pg/index.html 

University HealthSystem Consortium 
2001 Spring Road, Suite 700 
Oak Brook, IL 60523 
(630) 954-1700 
Website: www.uhc.edu 

University of Leeds 
School of Public Health 
30 Hyde Terrace 
Leeds L52 9LN 
United Kingdom 
Website: www.leeds.ac.uk 

USCF-Stanford University EPC 

University of California, San Francisco 

505 Parnassus Avenue, Room M-1490 

Box 0132 

San Francisco, CA 94143-0132 

(415) 476-2564 

Website: www.stanford.edu/group/epc/ 

(An EPC ofAHCPR) 
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U.S. Office of Technology Assessment 

(former address) 600 Pennsylvania Avenue SE 

Washington, DC 20003 

Note: OTA closed on 29 Sep, 1995. However, 
documents can be accessed via the internet at 
www.wws.princeton.edu/5ota/html2/cong.html. 
Also a complete set of OTA publications is available 
on CD-ROM; contact the U.S. Government Print- 
ing Office (www.gpo.gov) for more information. 

Veterans Administration 

Technology Assessment Program 
VA Medical Center (152M) 
150 S Huntington Avenue 
Building 4 
Boston, MA 02130 
(617) 278-4469 
Website: www.va.gov/resdev 

Voluntary Hospitals of America, Inc. 
220 East Boulevard 
Irving, TX 75014 
(214) 830-0000 



Wessex Institute of Health Research 

and Development 
Boldrewood Medical School 
Bassett Crescent East 
Highfield, Southampton SO 16 7PX 
United Kingdom 
(44) 01-703-595-661 
Website: www.soton.ac.uk/~wi/index.html 

World Health Organization 

Distribution Sales 

CH 1211 

Geneva 27 

Switzerland 2476 

(41) 22-791-2111 

Website: www.who.ch 

Note: Publications are also available from the 

WHO Publications Center, USA, 

at (518) 436-9686. 
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174.1 Applications of Virtual Instruments in Health 
Care 
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Trending, Relationships, and Interactive Alarms • Data 
Modeling • Medical Equipment Risk Criteria • Peer 
Performance Reviews 



174.1 Applications of Virtual Instruments in Health Care 

Virtual Instrumentation (which was previously defined in Chapter 88, "Virtual Instrumentation: Appli- 
cations in Biomedical Engineering") allows organizations to effectively harness the power of the PC to 
access, analyze, and share information throughout the organization. With vast amounts of data available 
from increasingly sophisticated enterprise-level data sources, potentially useful information is often left 
hidden due to a lack of useful tools. Virtual instruments can employ a wide array of technologies such 
as multidimensional analyses and Statistical Process Control (SPC) tools to detect patterns, trends, 
causalities, and discontinuities to derive knowledge and make informed decisions. 

Today's enterprises create vast amounts of raw data and recent advances in storage technology, coupled 
with the desire to use this data competitively, has caused a data glut in many organizations. The healthcare 
industry in particular is one that generates a tremendous amount of data. Tools such as databases and 
spreadsheets certainly help manage and analyze this data; however databases, while ideal for extracting 
data are generally not suited for graphing and analysis. Spreadsheets, on the other hand, are ideal for 
analyzing and graphing data, but this can often be a cumbersome process when working with multiple 
data files. Virtual instruments empower the user to leverage the best of both worlds by creating a suite 
of user-defined applications which allow the end-user to convert vast amounts of data into information 
which is ultimately transformed into knowledge to enable better decision making. 

This chapter will discuss several virtual instrument applications and tools that have been developed 
to meet the specific needs of healthcare organizations. Particular attention will be placed on the use of 
quality control and "performance indicators" which provide the ability to trend and forecast various 
metrics. The use of SPC within virtual instruments will also be demonstrated. Finally, a non-traditional 
application of virtual instrumentation will be presented in which a "peer review" application has been 
developed to allow members of an organization to actively participate in the Employee Performance 
Review process. 
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Example Application #1: The EndoTester™— A Virtual 
Instrument-Based Quality Control and Technology 
Assessment System for Surgical Video Systems 

The use of endoscopic surgery is growing, in large part because it is generally safer and less expensive 
than conventional surgery, and patients tend to require less time in a hospital after endoscopic surgery. 
Industry experts conservatively estimate that about 4 million minimally invasive procedures were per- 
formed in 1996. As endoscopic surgery becomes more common, there is an increasing need to accurately 
evaluate the performance characteristics of endoscopes and their peripheral components. 

The assessment of the optical performance of laparoscopes and video systems is often difficult in the 
clinical setting. The surgeon depends on a high quality image to perform minimally invasive surgery, yet 
assurance of proper function of the equipment by biomedical engineering staff is not always straightfor- 
ward. Many variables in both patient and equipment may result in a poor image. Equipment variables, 
which may degrade image quality, include problems with the endoscope, either with optics or light 
transmission. The light cable is another source of uncertainty as a result of optical loss from damaged 
fibers. Malfunctions of the charge coupled device (CCD) video camera are yet another source of poor 
image quality. Cleanliness of the equipment, especially lens surfaces on the endoscope (both proximal 
and distal ends) are particularly common problems. Patient variables make the objective assessment of 
image quality more difficult. Large operative fields and bleeding at the operative site are just two examples 
of patient factors that may affect image quality. 

The evaluation of new video endoscopic equipment is also difficult because of the lack of objective 
standards for performance. Purchasers of equipment are forced to make an essentially subjective decision 
about image quality. By employing virtual instrumentation, a collaborative team of biomedical engineers, 
software engineers, physicians, nurses, and technicians at Hartford Hospital (Hartford, CT) and Premise 
Development Corporation (Avon, CT) have developed an instrument, the EndoTester™, with integrated 
software to quantify the optical properties of both rigid and flexible fiberoptic endoscopes. This easy-to- 
use optical evaluation system allows objective measurement of endoscopic performance prior to equip- 
ment purchase and in routine clinical use as part of a program of prospective maintenance. 

The EndoTester™ was designed and fabricated to perform a wide array of quantitative tests and 
measurements. Some of these tests include: (1) Relative Light Loss, (2) Reflective Symmetry, (3) Lighted 
(Good) Fibers, (4) Geometric Distortion, and (5) Modulation Transfer Function (MTF). Each series of 
tests is associated with a specific endoscope to allow for trending and easy comparison of successive 
measurements. 

Specific information about each endoscope (i.e., manufacturer, diameter, length, tip angle, depart- 
ment/unit, control number, and operator), the reason for the test (i.e., quality control, pre/post repair, 
etc.), and any problems associated with the scope are also documented through the electronic record. In 
addition, all the quantitative measurements from each test are automatically appended to the electronic 
record for life-cycle performance analysis. 

Figures 174.1 and 174.2 illustrate how information about the fiberoptic bundle of an endoscope can 
be displayed and measured. This provides a record of the pattern of lighted optical fibers for the endoscope 
under test. The number of lighted pixels will depend on the endoscope's dimensions, the distal end 
geometry, and the number of failed optical fibers. New fiber damage to an endoscope will be apparent 
by comparison of the lighted fiber pictures (and histogram profiles) from successive tests. Statistical data 
is also available to calculate the percentage of working fibers in a given endoscope. 

In addition to the two-dimensional profile of lighted fibers, this pattern (and all other image patterns) 
can also be displayed in the form of a three-dimensional contour plot. This interactive graph maybe viewed 
from a variety of viewpoints in that the user can vary the elevation, rotation, size, and perspective controls. 

Figure 174.2 illustrates how test images for a specific scope can be profiled over time (i.e., days, months, 
years) to identify degrading performance. This profile is also useful to validate repair procedures by 
comparing test images before and after the repair. 
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The EndoTester™ has many applications. In general, the most useful application is the ability to 
objectively measure an endoscope's performance prior to purchase, and in routine clinical use as part of 
a program of prospective maintenance. Measuring parameters of scope performance can facilitate equip- 
ment purchase. Vendor claims of instrument capabilities can be validated as a part of the negotiation 
process. Commercially available evaluation systems (for original equipment manufacturers) can cost 
upwards of $50,000, yet by employing the benefits of virtual instrumentation and a standard PC, an 
affordable, yet highly accurate test system for rigid and flexible fiberoptic endoscopes can now be obtained 
by clinical institutions. 

In addition to technology assessment applications, the adoption of disposable endoscopes raises 
another potential use for the EndoTester™. Disposable scopes are estimated to have a life of 20 to 
30 procedures. However, there is no easy way to determine exactly when a scope should be "thrown away." 
The EndoTester™ could be used to define this end-point. 

The greatest potential for this system is as part of a program of preventive maintenance. Currently, in 
most operating rooms, endoscopes are removed from service and sent for repair when they fail in clinical 
use. This causes operative delay with attendant risk to the patient and an increase in cost to the institution. 
The problem is difficult because an endoscope may be adequate in one procedure but fail in the next 
which is more exacting due to clinical variables such as large patient size or bleeding. Objective assessment 
of endoscope function with the EndoTester™ may eliminate some of these problems. 

Equally as important, an endoscope evaluation system will also allow institutions to ensure value from 
providers of repair services. The need for repair can be better defined and the adequacy of the repair 
verified when service is completed. This ability becomes especially important as the explosive growth of 
minimally invasive surgery has resulted in the creation of a significant market for endoscope repairs and 
service. Endoscope repair costs vary widely throughout the industry with costs ranging from $500 to 
$1500 or more per repair. Inappropriate or incomplete repairs can result in extending surgical time by 
requiring the surgeon to "switch scopes" (in some cases several times) during a surgical procedure. 

Given these applications, we believe that the EndoTester™ can play an important role in reducing 
unnecessary costs, while at the same time improving the quality of the endoscopic equipment and the 
outcome of its utilization. It is the sincere hope of the authors that this technology will help to provide 
accurate, affordable and easy-to-acquire data on endoscope performance characteristics which clearly are 
to the benefit of the healthcare provider, the ethical service providers, manufacturers of quality products, 
the payers, and, of course, the patient. 

Example Application #2: PIVIT™ — Performance Indicator 
Virtual Instrument Toolkit 

Most of the information management examples presented in this chapter are part of an application suite 
called PIVIT™. PIVIT is an acronym for "Performance Indicator Virtual Instrument Toolkit" and is an 
easy-to-use data acquisition and analysis product. PIVIT was developed specifically in response to the 
wide array of information and analysis needs throughout the healthcare setting. 

PIVIT applies virtual instrument technology to assess, analyze, and forecast clinical, operational, and 
financial performance indicators. Some examples include applications which profile institutional indi- 
cators (i.e., patient days, discharges, percent occupancy, ALOS, revenues, expenses, etc.), and departmen- 
tal indicators (i.e., salary, non-salary, total expenses, expense per equivalent discharge, DRGs, etc.). Other 
applications of PIVIT include 360° Peer Review, Customer Satisfaction Profiling, and Medical Equipment 
Risk Assessment. 

PIVIT can access data from multiple data sources. Virtually any parameter can be easily accessed and 
displayed from standard spreadsheet and database applications (i.e., Microsoft Access, Excel, Sybase, 
Oracle, etc.) using Microsoft's Open Database Connectivity (ODBC) technology. Furthermore, multiple 
parameters can be profiled and compared in real-time with any other parameter via interactive polar 
plots and three-dimensional displays. In addition to real-time profiling, other analyses such as SPC can 
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FIGURE 174.3 PIVIT™ — Performance Indicator Wizard displays institutional and departmental indicators. 



be employed to view large data sets in a graphical format. SPC has been applied successfully for decades 
to help companies reduce variability in manufacturing processes. These SPC tools range from Pareto 
graphs to Run and Control charts. Although it will not be possible to describe all of these applications, 
several examples are provided below to illustrate the power of PIVIT. 

Trending, Relationships, and Interactive Alarms 

Figure 174.3 illustrates a virtual instrument that interactively accesses institutional and department 
specific indicators and profiles them for comparison. Data sets can be acquired directly from standard 
spreadsheet and database applications (i.e., Microsoft Access®, Excel®, Sybase®, Oracle®, etc.). This 
capability has proven to be quite valuable with respect to quickly accessing and viewing large sets of data. 
Typically, multiple data sets contained within a spreadsheet or database had to be selected and then a 
new chart of this data had to be created. Using PIVIT, the user simply selects the desired parameter from 
any one of the pull-down menus and this data set is instantly graphed and compared to any other data set. 

Interactive "threshold cursors" dynamically highlight when a parameter is over and/or under a specific 
target. Displayed parameters can also be ratios of any measured value, for example, "Expense per Equiv- 
alent Discharge" or "Revenue to Expense Ratio". The indicator color will change based on how far the 
data value exceeds the threshold value (i.e., from green to yellow to red). If multiple thresholds are 
exceeded, then the entire background of the screen (normally gray) will change to red to alert the user 
of an extreme condition. 

Finally, multimedia has been employed by PIVIT to alert designated personnel with an audio message 
from the personal computer or by sending an automated message via e-mail, fax, pager, or mobile phone. 

PIVIT also has the ability to profile historical trends and project future values. Forecasts can be based 
on user-defined history (i.e., "Months for Regression"), the type of regression (i.e., linear, exponential, 
or polynomial), the number of days, months, or years to forecast, and if any offset should be applied to 
the forecast. These features allow the user to create an unlimited number of "what if" scenarios and allow 
only the desired range of data to be applied to a forecast. In addition to the graphical display of data 
values, historical and projected tables are also provided. These embedded tables look and function very 
much like a standard spreadsheet. 
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FIGURE 174.4 Injury epidemiology and public policy knowledgebase. 

Data Modeling 

Figure 174.4 illustrates another example of how virtual instrumentation can be applied to financial 
modeling and forecasting. This example graphically profiles the annual morbidity, mortality, and cost 
associated with falls within the state of Connecticut. Such an instrument has proved to be an extremely 
effective modeling tool due to its ability to interactively highlight relationships and assumptions, and to 
project the cost and/or savings of employing educational and other interventional programs. 

Virtual instruments such as these are not only useful with respect to modeling and forecasting, but 
perhaps more importantly, they become a "knowledgebase" in which interventions and the efficacy of 
these interventions can be statistically proven. In addition, virtual instruments can employ standard 
technologies such as Dynamic Data Exchange (DDE), ActiveX, or TCP/IP to transfer data to commonly 
used software applications such as Microsoft Access® or Microsoft Excel®. In this way, virtual instruments 
can measure and graph multiple signals while at the same time send this data to another application 
which could reside on the network or across the Internet. 

Another module of the PIVIT application is called the "Communications Center." This module can 
be used to simply create and print a report or it can be used to send e-mail, faxes, messages to a pager, 
or even leave voice-mail messages. This is a powerful feature in that information can be easily and 
efficiently distributed to both individuals and groups in real-time. 

Additionally, Microsoft Agent® technology can be used to pop-up an animated help tool to commu- 
nicate a message, indicate an alarm condition, or can be used to help the user solve a problem or point 
out a discrepancy that may have otherwise gone unnoticed. Agents employ a "text-to-speech" algorithm 
to actually "speak" an analysis or alarm directly to the user or recipient of the message. In this way, on- 
line help and user support can also be provided in multiple languages. 

In addition to real-time profiling of various parameters, more advanced analyses such as SPC can be 
employed to view large data sets in a graphical format. SPC has been applied successfully for decades to 
help companies reduce variability in manufacturing processes. It is the opinion of this author that SPC 
has enormous applications throughout healthcare. For example, Fig. 174.5 shows how Pareto analysis 
can be applied to a sample trauma database of over 12,000 records. The Pareto chart may be frequency 
or percentage depending on front panel selection and the user can select from a variety of different 
parameters by clicking on the "pull-down" menu. This menu can be configured to automatically display 
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FIGURE 174.5 Statistical process control — Pareto analysis of a sample trauma registry. 

each database field directly from the database. In this example, various database fields (i.e., DRG, Principal 
Diagnosis, Town, Payer, etc.) can be selected for Pareto analysis. Other SPC tools include run charts, 
control charts, and process capability distributions. 

Medical Equipment Risk Criteria 

Figure 174.6 illustrates a virtual instrument application which demonstrates how four "static" risk 
categories (and their corresponding values) are used to determine the inclusion of clinical equipment in 
the Medical Equipment Management Program at Hartford Hospital. Each risk category includes specific 
sub-categories that are assigned points, which when added together according to the formula listed below, 
yield a total score which ranges from 4 to 25. 

Considering these scores, the equipment is categorized into five priority levels (High, Medium, Low, 
Grey List, and Non-Inclusion into the Medical Equipment Management Program). The four static risk 
categories are: 

Equipment Function (EF): Stratifies the various functional categories (i.e., therapeutic, diagnostic, 
analytical, and miscellaneous) of equipment. This category has "point scores" which range from 1 
(miscellaneous, non-patient related devices) to 10 (therapeutic, life support devices). 

Physical Risk (PR): Lists the "worst case scenario" of physical risk potential to either the patient or 
the operator of the equipment. This category has "point scores" which range from 1 (no significant 
identified risk) to 5 (potential for patient and/or operator death). 

Environmental Use Classification (EC): Lists the primary equipment area in which the equipment is 
used and has "point scores" which range from 1 (non-patient care areas) to 5 (anesthetizing 
locations). 

Preventive Maintenance Requirements (MR): Describes the level and frequency of required mainte- 
nance and has "point scores" which range from 1 (not required) to 5 (monthly maintenance). 
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FIGURE 174.6 Medical equipment risk classification profiler. 

The Aggregate Static Risk Score is calculated as follows: 

Aggregate Risk Score = EF + PR + EC + MR 

Using the criteria's system described above, clinical equipment is categorized according to the following 
priority of testing and degree of risk: 

High Risk: Equipment that scores between and including 18 to 25 points on the criteria's evaluation 

system. This equipment is assigned the highest risk for testing, calibration, and repair. 
Medium Risk: Equipment that scores between and including 15 to 17 points on the criteria's evaluation 

system. 
Low Risk: Equipment that scores between and including 12 to 14 points on the criteria's evaluation 

system. 
Hazard Surveillance (Gray): Equipment that scores between and including 6 and 11 points on the 

criteria's evaluation system is visually inspected on an annual basis during the hospital hazard 

surveillance rounds. 
Medical Equipment Management Program Deletion: Medical equipment and devices that pose little 

risk and scores less than 6 points may be deleted from the management program as well as the 

clinical equipment inventory. 

Future versions of this application will also consider "dynamic" risk factors such as: user error, mean- 
time-between failure (MTBF), device failure within 30 days of a preventive maintenance or repair, and 
the number of years beyond the American Hospital Association's recommended useful life. 

Peer Performance Reviews 

The virtual instrument shown in Fig. 174.7 has been designed to easily acquire and compile performance 
information with respect to institution-wide competencies. It has been created to allow every member 
of a team or department to participate in the evaluation of a co-worker (360° peer review). Upon running 
the application, the user is presented with a "Sign-In" screen where he or she enters their username and 
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FIGURE 174.7 Performance reviews using virtual instrumentation. 

password. The application is divided into three components. The first (top section) profiles the employee 
and relevant service information. The second (middle section) indicates each competency as defined for 
employees, managers, and senior managers. The last (bottom) section allows the reviewer to evaluate 
performance by selecting one of four "radio buttons" and also provide specific comments related to each 
competency. This information is then compiled (with other reviewers) as real-time feedback. 
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IN THE LAST 20 YEARS the field of medical informatics has grown tremendously both in its 
complexity and in content. As a result, two sections will be written in this Handbook. The first one, 
represented in the chapters, will be devoted to areas that form a key "core" of computer technologies. 
These include: hospital information systems (HIS), computer-based patient records (CBPR or CPR), 
imaging, communications, standards, and other related areas. The second section includes the following 
topics: artificial intelligence, expert systems, knowledge-based systems neural networks, and robotics. Most 
of the techniques describe in the second section will require the implementation of systems explained in 
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this first section. We could call most of these chapters the information infrastructure required to apply 
medical informatics' techniques to medical data. These topics are crucial because they not only lay the 
foundation required to treat a patient within the walls of an institution but they provide the roadmap 
required to deal with the patient's lifetime record while allowing selected groups of researchers and 
clinicians to analyze the information and generate outcomes research and practice guidelines information. 

As an example a network of associated hospitals in the East Coast (a health care provider network) 
may want to utilize an expert system that was created and maintained at Stanford University. This group 
of hospitals, HMOs, clinics, physician's offices, and the like would need a "standard" computer-based 
patient record (CPR) that can be used by the clinicians from any of the physical locations. In addition 
in order to access the information all these institutions require telecommunications and networks that 
will allow for the electronic "dialogue." The different forms of the data, particularly clinical images, will 
require devices for displaying purposes, and the information stored in the different HIS, Clinical Infor- 
mation Systems (CIS), and departmental systems needs to be integrated. The multimedia type of record 
would become the data input for the expert system which could be accessed remotely (or locally) from 
any of the enterprise's locations. On the application side, the expert system could provide these institu- 
tions which techniques that can help in areas such as diagnosis and patient treatment. However, several 
new trends such as: total quality management (TQM), outcome research and practice guidelines could 
be followed. It should be obvious to the reader that to have the ability to compare information obtained 
in different parts of the world by dissimilar and heterogeneous systems, certain standards need to be 
followed (or created) so that when analyzing the data the information obtained will make sense. 

Many information systems issues described in this introduction will be addressed in this section. The 
artificial intelligence chapters which follow should be synergistic with these concepts. A good under- 
standing of the issues in this section is required prior to the utilization of the actual expert system. These 
issues are part of this section of medical informatics, other ones, however, e.g., systems integration and 
process reengineering, will not be addressed here in detail but will be mentioned by the different authors. 
I encourage the reader to follow up on the referenced material at the end of each chapter, since the 
citations contain very valuable information. 

Several current perspectives in information technologies need to be taken in consideration when 
reading this section. One of them is described very accurately in the book entitled Globalization, Tech- 
nology and Competition (The Fusion of Computers and Telecommunications in the 1990s) by Bradley, 
Hausman and Nolan (1993). The first chapter of this book talks about new services being demanded by 
end users which include the integration of computers and telecommunications. From their stages theory 
point of view, the authors describe that we are currently nearing the end of the micro era and the 
beginning of the network era. From an economy point of view, the industrial economy (1960s and 1970s) 
and the transitional economy (1970s and 1980s) is moving into an information economy (1990s and 
beyond). Also a leadership survey done in 1994 by the Healthcare Information and Management System 
Society (HIMSS) on trends in health care computing to mainly chief information officers, directors, and 
the likes of health care providers showed the following results: 

1. In a market driven by cost containment, the most important forces driving increased computer- 
ization in health care were: (a) movement to manage care (25%), (b) outcomes data requests 
(24%), and (c) movement to health care networks (17%). 

2. The most important systems priority for the next 2 years: (a) integrating across separate facilities 
(31%), (b) implementing a computer-based patient record (CPR) (19%), and (c) integrating 
departmental systems (13%) and reengineering to patient focused care (13%). 

3. 56% felt that the information superhighway was essential for health care. 

4. In the next 3 years the most significant health-care related computer development affecting the 
average consumer would be (a) more streamlined health care encounters (49%), (b) access to 
health information/services from home (20%), and (c) health care "smartcards" (17%). 
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5. Although 49% claimed to use the Internet, their health care facilities are using it for: (a) point- 
to-point E-mail (81%), (b) clinicians querying research databases (69%), (c) consumer-provider 
exchange (31%), and (d) two-way medical consultations (22%). 

6. Clinicians will share computerized patient information in a nationwide system: (a) by the year 
2000 (39%), (b) not happen for at least ten years (38%), and (c) within 1 to 3 years (14%). Many 
other questions and answers reflected some of the current technological barriers and users needs. 
Because of these trends it was essential to include in this Handbook technologies that today may 
be considered state of the art but when read about 10 years from now will appear to be transitional 
only. Information technologies are moving into a multimedia environment which will require 
special techniques for acquiring, displaying, storing, retrieving, and communicating the informa- 
tion. We are in the process of defining some of these mechanisms. 

In some instances such as imaging, this Handbook contains a full section dedicated to the subject. That 
section contains the principles, the associated math algorithms, and the physics related to all medical 
imaging modalities. The intention in this section is to address issues related to imagining as a form of 
medical information. These concepts include issues related then to acquisition, storage/retrieval, display 
and communications of document and clinical images, e.g., picture archival and communications systems 
(PACS). From a CPR point of view, clinical and document images will become part of this electronic 
chart, therefore many of the associated issues will be discussed in this section more extensively. 

The state of the telecommunications has been described as a revolution; data and voice communica- 
tions as well s full-motion video have come together as a new dynamic field. Much of what is happening 
today is a result of technology evolution and need. The connecting thread between evolutionary needs 
and revolutionary ideas is an integrated perspective of both sides of multiple industries. This topic will 
also be described in more detail in this section. 

In the first chapter Allan Pryor provides us with a tutorial on hospital information systems (HIS). He 
describes not only the evolution of HIS and departmental systems and clinical information systems (CIS), 
but also their differences. Within the evolution he follows these concepts with the need for the longitudinal 
patient record and the integration of patient data. This chapter includes patient database strategies for 
the HIS, data acquisition, patient admission, transfer and discharge functions. Also discussed are patient 
evaluation and patient management issues. From an end-user point of view, a terrific description on the 
evolution of data-driven and time-driven systems is included, culminating with some critical concepts 
on HIS requirements for decision support and knowledge base functionality. His conclusions are good 
indication of his vision. 

Michael Fitzmaurice follows with "Computer-Based Patient Records" (CBPR or CPR). In the intro- 
duction, it is explained what is the CPR and way it is a necessary tool for supporting clinical decision 
making and how it is enhanced when it interacts with medial knowledge sources. This is followed by 
clinical decision support systems (CDSS): knowledge server, knowledge sources, medical logic modules 
(MLM), and nomenclature. This last issue in particular is one which needs to be well understood. The 
nomenclature used by physicians and by the CPRs differ among institutions. Applying logic to the wrong 
concepts can produce misinterpretations. The scientific evidence in this chapter includes patient care 
process, CDSS hurdles, CPR implementation, research data bases, telemedicine, hospital and ambulatory 
care systems. A table of hospital and ambulatory care computer-based patient records systems concludes 
this chapter. 

Today it is impossible to separate computers and telecommunications (communications and net- 
works). Both are part of information systems. Soumitra Sengupta provides us n this chapter with a 
tutorial-like presentation which includes an introduction and history, impact of clinical data, information 
types, and platforms. The importance of this section is reflected both in the contents reviewed under 
current technologies — LANs, WANs, middleware, medical domain middleware; integrated patient data 
base, and medical vocabulary — as well as in the directions and challenges section which includes improved 
bandwidth, telemedicine, and security management. In the conclusions the clear vision is that networks 
will become the de facto fourth utility after electricity, water, and heat. 
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"Non-AI Decision Making" is covered by Ron Summers and Ewart Carson. This chapter includes an 
introduction which explains the techniques of procedural or declarative knowledge. The topics covered 
in this section include: analytical models, and decision theoretic models, including clinical algorithms and 
decision trees. The section that follows cover a number of key topics which appear while querying large 
clinical databases to yield evidence of either diagnostic/treatment or research value; statistical models, 
database search, regression analysis, statistical pattern analysis, bayesian analysis, Depster-Shafer theory, 
syntactic pattern analysis, causal modeling, artificial neural networks. In the summary the authors clearly 
advise the reader to read this section in conjunction with the expert systems chapters that follow. 

The standards section is closely associated with the CPR chapter of this section. Jeff Blair does a terrific 
job with his overview of standards related to the emerging health care information infrastructure. This 
chapter should give the reader not only an overview of the major existing and emerging health care 
information standards but an understanding of all current efforts, national and international, to coordi- 
nate, harmonize, and accelerate these activities. The introduction summarizes how this section is orga- 
nized. It includes identifier standards (patient's, site of care, product, and supply labeling), communications 
(message format) standards, content and structure standards. This section is followed by a summary of 
clinical data representations, guidelines for confidentiality, data, security, and authentication. After that 
quality indicators and data sets are described along with international standards. Coordinating and 
promotion organizations are listed at the end of this chapter including points of contact which will prove 
very beneficial for those who need to follow up. 

Design issues in developing clinical decision support and monitoring systems by John Goethe and 
Joseph Bronzino provide insight for the development of clinical decision support systems. In their intro- 
duction and throughout this chapter the authors provide a step-by-step tutorial with practical advice 
and make recommendations on design of the systems to achieve end-user acceptance. After that a 
description of a clinical monitoring system, developed and implemented by them for a psychiatric 
practice, is presented in detail. In their conclusions the human engineering issue is discussed. 

The authors of this section represent industry, academia and government. Their expertise in many 
instances is multiple from developing to actual implementing these technical ideas. I am very grateful 
for all our discussions and their contributions. 
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The definition of a hospital information system (HIS) is unfortunately not unique. The literature of both 
the informatics community and health care data processing world is filled with descriptions of many 
differing computer systems defined as an HIS. In this literature, the systems are sometimes characterized 
into varying level of HISs according to the functionally present within the system. With this confusion 
from the literature, it is necessary to begin this chapter with a definition of an HIS. To begin this definition, 
I must first describe what it is not. The HIS will incorporate information from the several departments 
within the hospital, but an HIS is not a departmental system. Departmental systems such as a pharmacy 
or a radiology system are limited in their scope. They are designed to manage only the department that 
they serve and rarely contain patient data captured from other departments. Their function should be 
to interface with the HIS and provide portions of the patient medical/administrative record that the HIS 
uses to manage the global needs of the hospital and patient. 

A clinical information system is likewise not an HIS. Again, although the HIS needs clinical information 
to meets its complete functionality, it is not exclusively restricted to the clinical information supported 
by the clinical information systems. Examples of clinical information systems are ICU systems, respiratory 
care systems, nursing systems. Similar to the departmental systems, these clinical systems tend to be one- 
dimensional with a total focus on one aspect of the clinical needs of the patient. They provide little 
support for the administrative requirements of the hospital. 

If we look at the functional capabilities of both the clinical and departmental systems, we see many 
common features of the HIS. They all require a database for recording patient information. Both types 
of systems must be able to support data acquisition and reporting of patient data. Communication of 
information to other clinical or administrative departments is required. Some form of management 
support can be found in all the systems. Thus, again looking at the basic functions of the system one 
cannot differentiate the clinical/departmental systems from the HIS. It is this confusion that makes 
defining the HIS difficult and explains why the literature is ambiguous in this matter. 
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The concept of the HIS appears to be, therefore, one of integration and breadth across the patient or 
hospital information needs. That is, to be called an HIS the system must meet the global needs of those 
it is to serve. In the context, if we look at the hospital as the customer of the HIS, then the HIS must be 
able to provide global and departmental information on the state of the hospital. For example, if we 
consider the capturing of charges within the hospital to be an HIS function, then the system must capture 
all patient charges no matter which departmental originated those charges. Likewise all clinical informa- 
tion about the patient must reside within the database of the HIS and make possible the reporting and 
management of patient data across all clinical departments and data sources. It is totality of function 
that differentiates the HIS from the departmental or restricted clinical system, not the functions provided 
to a department or clinical support incorporated within the system. 

The development of an HIS can take many architectural forms. It can be accomplished through 
interfacing of a central system to multiple departmental or clinical information systems. A second 
approach which has been developed is to have, in addition to a set of global applications, departmental 
or clinical system applications. Because of the limitation of all existing systems, any existing comprehen- 
sive HIS will in fact be a combination of interfaces to departmental/clinical systems and the applica- 
tions/database of the HIS purchased by the hospital. 

The remainder of this chapter will describe key features that must be included in today's HIS. The 
features discussed below are patient databases, patient data acquisition, patient admission/bed control, 
patient management and evaluation applications, and computer- assisted decision support. This chapter 
will not discuss the financial/administrative applications of an HIS, since those applications for the 
purposes of this chapter are seen as applications existing on a financial system that may not be integral 
application of the HIS. 

175.1 Patient Database Strategies for the HIS 

The first HISs were considered only an extension of the financial and administrative systems in place in 
the hospital. With this simplistic view many early systems developed database strategies that were limited 
in their growth potential. Their databases mimicked closely the design of the financial systems that 
presented a structure that was basically a "flat file" with well-defined fields. Although those fields were 
adequate for capturing the financial information used by administration to track the patient's charges, 
they were unable to adapt easily to the requirement to capture the clinical information being requested 
by health care providers. Today's HIS database should be designed to support a longitudinal patient 
record (the entire clinical record of the patient spanning multiple inpatient, outpatient encounters), 
integration of all clinical and financial data, and support of decision support functions. 

The creation of a longitudinal patient record is now a requirement of the HIS. Traditionally the 
databases of the HISs were encounter-based. That is, they were designed to manage a single patient visit 
to the hospital to create a financial record of the visit and make available to the care provider data recorded 
during the visit. Unfortunately, with those systems the care providers were unable to view the progress 
of the patient across encounters, even to the point that in some HISs critical information such as patient 
allergies needed to be entered with each new encounter. From the clinical perspective, the management 
of a patient must at least be considered in the context of a single episode of care. This episode might 
include one or more visits to the hospital's outpatient clinics, the emergency department, and multiple 
inpatient stays. The care provider to manage properly the patient, must have access to all the information 
recorded from those multiple encounters. The need for a longitudinal view dictates that the HIS database 
structure must both allow for access to the patient's data independent of an encounter and still provide 
for encounter-based access to adapt to the financial and billing requirements of the hospital. 

The need for integration of the patient data is as important as the longitudinal requirement. Tradi- 
tionally the clinical information tended to be stored in separate departmental files. With this structure 
it was easy to report from each department, but the creation of reports combining data from the different 
proved difficult if not impossible. In particular in those systems where access to the departmental data 
was provided only though interfaces with no central database, it was impossible to create an integrated 
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patient evaluation report. Using those systems the care providers would view data from different screens 
at their terminal and extract with pencil onto paper the results from each departmental (clinical labo- 
ratory, radiology, pharmacy, and so on) the information they needed to properly evaluate the patient. 
With the integrated clinical database the care provider can view directly on a single screen the information 
from all departments formatted in ways that facilitate the evaluation of the patient. 

Today's HIS is no longer merely a database and communication system but is an assistant in the 
management of the patient. That is, clinical knowledge bases are an integral part of the HIS. These 
knowledge bases contain rules and/or statistics with which the system can provide alerts or reminders 
or implement clinical protocols. The execution of the knowledge is highly dependent on the structure 
of the clinical database. For example, a rule might be present in the knowledge base to evaluate the use 
of narcotics by the patient. Depending on the structure of the database, this may require a complex set 
of rules looking at every possible narcotic available in the hospital's formulary or a single rule that checks 
the presence of the class narcotics in the patient's medical record. If the search requires multiple rules, 
it is probably because the medical vocabulary has been coded without any structure. With this lack of 
structure there needs to be a specific rule to evaluate every possible narcotic code in the hospital's 
formulary against the patient's computer medication record. With a more structured data model a single 
rule could suffice. With this model the drug codes have been assigned to include a hierarchical structure 
where all narcotics would fall into the same hierarchical class. Thus, a single rule specific only to the class 
"narcotics" is all that is needed to compare against the patient's record. 

These enhanced features of the HIS database are necessary if the HIS is going to serve the needs of 
today's modern hospital. Beyond these inpatient needs, the database of the HIS will become part of an 
enterprise clinical database that will include not only the clinical information for the inpatient encounters 
but also the clinical information recorded in the physician's office or the patient's home during outpatient 
encounters. Subsets of these records will become part of state and national health care databases. In 
selecting, therefore, and HIS, the most critical factor is understanding the structure and functionality of 
its database. 

175.2 Data Acquisition 

The acquisition of clinical data is key to the other functions of the HIS. If the HIS is to support an 
integrated patient record, then its ability to acquire clinical data from a variety of sources directly affect 
its ability to support the patient evaluation and management functions described below. All HIS systems 
provide for direct terminal entry of data. Depending on the system this entry may use only the keyboard 
or other "point and click" devices together with the keyboard. 

Interfaces to other systems will be necessary to compute a complete patient record. The physical 
interface to those systems is straightforward with today's technology. The difficulty comes in understand- 
ing the data that are being transmitted between systems. It is easy to communicate and understand ASCII 
textual information, but coded information from different systems is generally difficult for sharing 
between systems. This difficulty results because there are no medical standards for either medical vocab- 
ulary or the coding systems. Thus, each system may have chose an entirely different terminology or 
coding system to describe similar medical concepts. In building the interface, therefore, it may be 
necessary to build unique translation tables to store the information from one system into the databases 
of the HIS. This requirement has limited the building of truly integrated patient records. 

Acquisition of data from patient monitors used in the hospital can either be directly interfaced to the 
HIS or captured through an interface to an ICU system. Without these interfaces the acquisition of the 
monitoring data must be entered manually by the nursing personnel. It should be noted that whenever 
possible automated acquisition of data is preferable to manual entry. The automated acquisition is more 
accurate and reliable and less resource intensive. With those HISs which do not have interfaces to patient 
monitors, the frequency of data entry into the system is much less. The frequency of data acquisition 
affects the ability of the HIS to implement real-time medical decision logic to monitor the status of the 
patient. That is, in the ICU where decisions need to be made on a very timely manner, the information 
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on which the decision is based must be entered as the critical event is taking place. If there is no automatic 
entry of the data, then the critical data needed for decision making may not be present, thus preventing 
the computer from assisting in the management of the patient. 

175.3 Patient Admission, Transfer, 
and Discharge Functions 

The admission application has three primary functions. The first is to capture for the patient's computer 
record pertinent demographic and financial/insurance information. A second function is to communicate 
that information to all systems existing on the hospital network. The third is to link the patient to previous 
encounters to ensure that the patient's longitudinal record is not compromised. This linkage also assists 
in capturing the demographic and financial data needed for the current encounter, since that information 
captured during a previous encounter may need not to be reentered as part of this admission. Unfortu- 
nately in many HISs the linkage process is not as accurate as needed. Several reasons explain this 
inaccuracy. The first is the motivation of the admitting personnel. In some hospitals they perceive their 
task as a business function responsible only for ensuring that the patient will be properly billed for his 
or her hospital stay. Therefore, since the admission program always allows them to create a new record 
and enter the necessary insurance/billing information, their effort to link the patient to his previous 
record may not be as exhaustive as needed. 

Although the admitting program may interact with many financial and insurance files, there normally 
exists two key patient files that allow the HIS to meet its critical clinical functions. One is a master patient 
index (MPI) and the second is the longitudinal clinical file. The MPI contains the unique identifier for 
the patient. The other fields of this file are those necessary for the admitting clerk to identify the patient. 
During the admitting process the admitting process the admitting clerk will enter identifying information 
such as name, sex, birth date, social security number. This information will be used by the program to 
select potential patient matches in the MPI from which the admitting clerk can link to the current 
admission. If no matches are detected by the program, the clerk creates a new record in the MPI. It is 
this process that all too frequently fails. That is, the clerk either enters erroneous data and finds no match 
or for some reason does not select as a match one of the records displayed. Occasionally the clerk selects 
the wrong match causing the data from this admission to be posted to the wrong patient. In the earlier 
HISs where no longitudinal record existed, this problem was not critical, but in today's system, errors in 
matching can have serious clinical consequences. Many techniques are being implemented to eliminate 
this problem including probabilistic matching, auditing processes, postadmission consolidation. 

The longitudinal record may contain either a complete clinical record of the patient or only those 
variables that are most critical in subsequent admissions. Among the data that have been determined as 
most critical are key demographic data, allergies, surgical procedures, discharge diagnoses, and radiology 
reports. Beyond these key data elements more systems are beginning to store the complete clinical record. 
In those systems the structure of the records of the longitudinal file contain information regarding the 
encounter, admitting physician, and any other information that may be necessary to view the record 
from an encounter view or as a complete clinical history of the patient. 

175.4 Patient Evaluation 

The second major focus of application development for the HIS is creation of patient evaluation appli- 
cations. The purpose of these evaluation programs is to provide to the care giver information about the 
patient which assists in evaluating the medical status of the patient. Depending on the level of data 
integration in the HIS, the evaluation applications will be either quite rudimentary or highly complex. 
In the simplest form these applications are departmentally oriented. With this departmental orientation 
the care giver can access through terminals in the hospital departmental reports. Thus, laboratory reports, 
radiology reports, pharmacy reports, nursing records, and the like can be displayed or printed at the 
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hospital terminals. This form of evaluation functionality is commonly called results review, since it only 
allows the results of tests from the departments to be displayed with no attempt to integrate the data 
from those departments into an integrated patient evaluation report. 

The more clinical HISs as mentioned above include a central integrated patient database. With those 
systems patient reports can be much more sophisticated. A simple example of an integrated patient 
evaluation report is a diabetic flowsheet. In this flowsheet the caregiver can view the time and amount 
of insulin given, which may have been recorded by the pharmacy or nursing application, the patient's 
blood glucose level recorded in the clinical laboratory or again by the nursing application. In this form 
the caregiver has within single report, correlated by the computer, the clinical information necessary to 
evaluate the patient's diabetic status rather than looking for data on reports from the laboratory system, 
the pharmacy system, and the nursing application. As the amount and type of data captured by the HIS 
increases, the system can produce ever-more-useful patient evaluation reports. There exist HISs which 
provide complete rounds reports the summarize on one to two screens all the patient's clinical record 
captured by the system. These reports not only shorten the time need by the caregiver to locate the 
information, but because of the format of the report, can present the data in a more intuitive and clinically 
useful form. 

175.5 Patient Management 

Once the caregiver has properly evaluated the state of the patient, the next task is to initiate therapy that 
ensures an optimal outcome for the patient. The sophistication of the management applications is again 
a key differentiation of HISs. At the simplest level management applications consist of order-entry 
applications. The order-entry application is normally executed by a paramedical personnel. That is, the 
physician writes the order in the patient's chart, and another person reviews from the chart the written 
order and enters it into the computer. For example, if the order is for a medication, then it will probably 
be a pharmacist who actually enters the order into the computer. For most of the other orders a nurse 
or ward clerk is normally assigned this task. The HIS records the order in the patient's computerized 
medical record and transmits the order to the appropriate department for execution. In those hospitals 
where the departmental systems are interfaced to the HIS, the electronic transmission of the order to the 
departmental system is a natural part of the order entry system. In many systems the transmission of 
the order is merely a printout of the order in the appropriate department. 

The goal of most HISs is to have the physician responsible for management of the patient enter the 
orders into the computer. The problem that has troubled most of the HISs in achieving this goal has 
been the inefficiency of the current order-entry programs. For these programs to be successful they have 
to complete favorably with the traditional manner in which the physician writes the order. Unfortunately, 
most of the current order-entry applications are too cumbersome to be readily accepted by the physician. 
Generally they have been written to assist the paramedic in entering the order resulting with far too 
many screens or fields that need to be reviewed by the physician to complete the order. One approach 
that has been tried with limited success is the use of order sets. The order sets have been designed to 
allow the physician to easily from a single screen enter multiple orders. The use of order sets has improved 
the acceptability of the order-entry application to the physician, but several problems remain preventing 
universal acceptance by the physicians. One problem is that the order set will never be sufficiently 
complete to contain all orders that the physician would want to order. Therefore, there is some subset 
of patients orders that will have to be entered using the general ordering mechanisms of the program. 
Depending on the frequency of those orders, the acceptability of the program changes. Maintenance 
issues also arise with order sets, since it may be necessary to formulate order sets for each of the active 
physicians. Maintaining of the physician-specific order sets soon becomes a major problem for the data 
processing department. It becomes more problematic if the HIS to increase the frequency of a given 
order being present on an order set allows the order sets to be not only physician-defined but problem- 
oriented as well. Here it is necessary to again increase the number of order sets or have the physicians 
all agree on those orders to be included in an order set for a given problem. 
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Another problem, which makes use of order entry by the physician difficult, is the lack of integration 
of the application into the intellectual tasks of the physician. That is, in most of the systems the physicians 
are asked to do all the intellectual work in evaluating and managing the care of the patient in the 
traditional manner and then, as an added task, enter the results of that intellectual effort into the 
computer. It is at this last step that is perceived by the physician as a clerical task at which the physician 
rebels. Newer systems are beginning to incorporate more efficiently the ordering task into other appli- 
cations. These applications assist the physical throughout the entire intellectual effort of patient evaluation 
and management of the patient. An example of such integration would be the building of evaluation and 
order sets in the problem list management application. Here when the care provider looks at the patient 
problem list he or she accesses problem-specific evaluation and ordering screens built into the application, 
perhaps shortening the time necessary for the physician to make rounds on the patient. 

Beyond simple test ordering, many newer HISs are implementing decision support packages. With 
these packages the system can incorporate medical knowledge usually as rule sets to assist the care provider 
in the management of patients. Execution of the rule sets can be performed in the foreground through 
direct calls from an executing application or in the background with the storing of clinical data in the 
patient's computerized medical record. This latter mode is called data-driven execution and provides an 
extremely powerful method of knowledge execution and alerting, that is, after execution of the rule sets, 
the HIS will "alert" the care provider of any outstanding information that may be important regarding 
the status of the patient or suggestions on the management of the patient. Several mechanisms have been 
implemented to direct the alerts to the care provider. In the simplest form notification is merely a process 
of storing the alert in the patient's medical record to be reviewed the next time the care provider accesses 
that patient's record. More sophisticated notification methods have included directed printouts to indi- 
viduals whose job it is to monitor the alerts, electronic messages sent directly to terminals notifying the 
users that there are alerts which need to be viewed, and interfacing to the paging system of the hospital 
to direct alert pages to the appropriate personnel. 

Execution of the rule sets are sometimes, time-driven. This mode results in sets of rules being executed 
at a particular point in time. The typical scenario for time-driven execution is to set a time of day for 
selected rule set execution. At that time each day the system executes the given set of rules for a selected 
population in the hospital. Time drive has proven to be a particularly useful mechanism of decision 
support for those applications that require hospitalwide patient monitoring. 

The use of decision support has ranged from simple laboratory alerts to complex patient protocols. 
The responsibility of the HIS is to provide the tools for creation and execution of the knowledge base. 
The hospitals and their designated "experts" are responsible for the actual logic that is entered into the 
rule sets. Many studies are appearing in the literature suggesting that the addition of knowledge base 
execution tot he HIS is the next major advancement to e delivered with the HIS. This addition will 
become a tool to better manage the hospital in the world of managed care. 

The inclusion of decision support functionality in the HIS requires that the HIS be designed to support 
a set of knowledge tools. In general a knowledge bases system will consist of a knowledge base and an 
inference engine. The knowledge base will contain the rules, frames, and statistics that are used by the 
inference applications to substantiate a decision. We have found that in the health care area the knowledge 
base should be sufficiently flexible to support multiple forms of knowledge. That is, no single knowledge 
representation sufficiently powerful to provide a method to cover all decisions necessary in the hospital 
setting. For example, some diagnostic decisions may well be best suited for bayesian methods, whereas 
other management decisions may follow simple rules. In the context of the HIS, I prefer the term 
application manager to inference engine. The former is intended to imply that different applications may 
require different knowledge representations as well as different inferencing strategies to traverse the 
knowledge base. Thus, when the user selects the application, he or she is selecting a particular inference 
engine that may be unique to that application. The tasks, therefore, of the application manager are to 
provide the "look and feel" of the application, control the functional capabilities of the application, and 
invoke the appropriate inference engine for support of any "artificial intelligence" functionality. 
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175.6 Conclusion 

Today's HIS is no longer the financial/administrative system that first appeared in the hospital. It has 
extended beyond that role to become an adjunct to the care of the patient. With this extension into 
clinical care the HIS has not only added new functionality to its design but has enhanced its ability to 
serve the traditional administrative and financial needs of the hospital as well. The creation of these 
global applications which go well beyond those of the departmental/ clinical systems is now making the 
HIS the patient -focused system. With this global information the administrators and clinical staff together 
can accurately access where there are inefficiencies in the operation of the hospital from the delivery of 
both the administrative and medical care. This knowledge allows changes in the operation of the hospital 
that will ensure that optimal care continues to be provided to the patient at the least cost to the hospital. 
These studies and operation changes will continue to grow as the use of an integrated database and 
implementation of medical knowledge bases become increasingly routine in the functionality of the HIS. 
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176.1 Introduction 



The objective of this section is to present the computer-based patient record (CPR) as a powerful tool 
for organizing patient care data to improve patient care and strengthen communication of patient care 
data among health care providers. The CPR is even more powerful when used in a system that retrieves 
applicable medical knowledge to support clinical decision making. Evidence exists that the use of CPR 
systems (CPRS) can change both physician behavior and patient outcomes of care. As the speed and cost 
efficiency of computers rise, the cost of information storage and retrieval falls, and the breadth of 
ubiquitous networks becomes broader, it is essential that CPRs and systems that use them be evaluated 
for the improvements in health care that they can bring. 

The primary role of the CPR is to support the delivery of medical care to a particular patient. Serving 
this purpose, ideally the CPR brings past and current information about a particular patient to the 
physician, promotes communication among health care givers about that patient's care, and documents 
the process of care and the reasoning behind the choices that are made. Thus, the data in a CPR should 
be acquired as part of the normal process of health care delivery, by the providers of care and their 
institutions to improve data accuracy and timeliness of decision support. 

The CPR can also be an instrument for building a clinical data repository that is useful for collecting 
information about which medical treatments are effective in the practice of medicine in the community 
and for improving population-based health care. Additional applications of CPR data beyond direct patient 
care can improve population-based care. These applications bring personal and public benefits, but also 
raise issues that must be addressed by health care policy makers. Clinical data standards and communication 
networks, critical for using CPRs effectively, are treated separately in other sections in this chapter. 

176.2 Computer-Based Patient Record 

A CPR is a collection of data about a patient's health care in electronic form. The CPR is part of a system (a 
CPRS, usually maintained in a hospital or physician's office) that encompasses data entry and presentation, 
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storage, and access to the clinical decision maker — usually a physician or nurse. The data is entered by 
keyboard, dictation and transcription, voice recognition and interpretation, light pen, touch screen, hand- 
held computerized notepad (perhaps wireless) with gesture and character recognition and grouping 
capabilities, and other means. Entry may also be by direct instrumentation from electronic patient 
monitors and bedside terminals, nursing stations, analysis by other computer systems such as laboratory 
autoanalyzers and magnetic resonance imagers, or another provider's CPRS. 

Patient care data collected by a CPRS may be stored centrally or it may be stored in many places 
(distributed) for retrieval at the request of an authorized user through a database management system. 
The CPR may present data to the physician as text, tables, graphs, sound, images, full-motion video, and 
signals. The CPR may also point to the location of additional patient data that cannot be easily incor- 
porated into the CPR. 

In many current clinical settings (hospitals, physicians' offices, and ambulatory care centers), data 
pertaining to a patient's medical care are recorded and stored in a paper medical record. If the paper 
record is out of its normal location, or accompanying the patient during an off-site study or procedure, 
it is not available to the nurse, the attending physician, or the expert consultant. In paper form, data 
entries are often illegible and not easily retrieved and read by multiple users. On the other hand, an 
electronic form provides legible, clinical information which can be available to all users simultaneously, 
thus improving timely access to patient care data and communication among care providers. 

Individual hospital departments (laboratory or pharmacy, for example) often lose the advantages of 
automated data when their own computer systems print the computerized results onto paper. The pages 
are then sent to the patient's hospital floor and assembled into a paper record. The lack of standards for 
the electronic exchange of this data hinders the integration of computerized departmental systems. 
Searching electronic files is often, but not always, easier than searching through paper. Weaknesses of 
paper medical record systems for supporting patient care and health care providers have been detailed 
[Korpman, 1990 and 1991]. 

Many of the functions of a CPR and how it operates within a health-care information system to satisfy 
user demands are explained in the Institute of Medicine's report, The Computer-Based Patient Record: An 
Essential Technology for Health Care [1991]. 

176.3 Clinical Decision Support Systems 

Another role of the CPR is to enable a clinical decision support system (CDSS) — computer software 
designed to aid clinical decision making — to provide the physician with medical knowledge that is 
pertinent to the care of the patient. Diagnostic suggestions, testing prompts, therapeutic protocols, 
practice guidelines, alerts of potential drug-drug and drug-food reactions, treatment suggestions, and 
other decision support services can be obtained through the interaction of the CPR with a CDSS. 

Knowledge Server. Existing knowledge about potential diagnoses and treatments, practice guidelines, 
and complicating factors pertinent to the patient's diagnosis and care is needed at the time treatment 
decisions are made. The go-between that makes this link is a "knowledge server," which acquires the 
necessary information for the decision maker from the knowledge server's information sources. The CPR 
can provide the knowledge server with proper context, i.e., specific data and information about the 
patient's identification and condition(s) [Tuttle, et al., 1994]. 

Knowledge Sources. Knowledge sources include a range of options, from internal development and 
approval by a hospital's staff, for example, to sources outside the hospital, such as the National Guidelines 
Clearinghouse [www.guideline.gov] sponsored by the Agency for Health Care Policy and Research, the 
American Medical Association, and the Association of American Health Plans; the Physicians Data Query 
program at the National Cancer Institute; other consensus panel guidelines sponsored by the National 
Institutes of Health; and guidelines developed by medical and other specialty societies. Additional sources 
of knowledge include the medical literature, which can be searched for high-quality, comprehensive 
review articles and for particular subjects using the "Grateful Med" program to explore the MEDLINE 
literature database available through the National Library of Medicine. Evidence-based Practice Center 
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reports summarizing scientific evidence on specific topics of medical interest are available to support 
guideline development [www.guideline.gov]. 

Medical Logic Modules. If medical knowledge needs are anticipated, acquired beforehand, and put into 
a medical logic module (MLM), software can provide rule-based alerts, reminders and suggestions for 
the care provider at the point (time and place) of health service delivery. One format for MLMs is the 
Arden Syntax, which standardizes the logical statements [ASTM, 1992]. For example, an MLM might be 
interpreted as, "If sex = female, and age is greater than 50 years, and no Pap smear test result appears in 
the CPR, then recommend a Pap smear test to the patient." If MLMs are to have a positive impact on 
physician behavior and the patient care process, then physicians using MLMs must agree on the rules in 
the logical statements or conditions and recommended actions that are based on interactions with patient 
care data in the CPR. 

Nomenclature. Because MLMs are independent, the presence or absence of one MLM does not affect 
the operation of other MLMs in the system. If done carefully and well, MLMs developed in one health- 
care organization can be incorporated in the CPRSs of other health-care organizations. However, this 
requires much more than using accepted medical content and logical structure. If the medical concept 
terminology (the nomenclature used by physicians and by the CPR) differs among organizations, the 
knowledge server may misinterpret what is in the CPR, apply logic to the wrong concept, or select the 
wrong MLM. Further, the physician receiving its message may misinterpret the MLM. [Pryor and 
Hripcsak, 1994] 

For widespread use of CDSSs, a uniform medical nomenclature, consistent with the scientific literature 
is necessary. Medical knowledge is information that has been evaluated by experts and converted into 
useful medical concepts and options. For CDSSs to search through a patient's CPR, identify the medical 
concepts, retrieve appropriate patient data and information, and provide a link to the relevant knowledge, 
the CDSS has to recognize the names used in the CPR for the concepts [Cimino, 1993]. Providing 
direction for coupling terms and codes found in patient records to medical knowledge is the goal of the 
Unified Medical Language System (UMLS) project of the National Library of Medicine. 

176.4 Scientific Evidence 

Patient Care Processes. Controlled trials have shown the effectiveness of CDSS for modifying physician 
behavior using preventive care reminders. In a review of the scientific literature up to February 1992, 
Johnston, et al., [1994] reported that controlled trials of CDSSs have shown significant, favorable effects 
on care processes from (1) providing preventive care information to physicians and patients [McDonald, 
et al., 1984; Tierney, et al., 1986], (2) supporting diagnosis of high-risk patients [Chase, et al., 1983], 

(3) determining the toxic drug dose for obtaining the desired therapeutic levels [White, et al., 1987], and 

(4) aiding active medical care decisions [Tierney, 1988]. Johnston found clinician performance was 
generally improved when a CDSS was used and, in a small number of cases (3 of 10 trials), significant 
improvements in patient outcomes. 

In a randomized, controlled clinical trial, one which randomly assigned some teams of physicians to 
computer workstations with screens designed to promote cost-effective ordering (for example, of drugs 
and laboratory tests), Tierney, et al., [1993] reported patient lengths of stay were .89 days shorter, and 
charges generated by the intervention teams were $887 lower, than for the control teams of physicians. 
These gains were not without an offset. Time and motion studies showed that intervention physician 
teams spent 5.5 minutes longer per patient during 10-hour observation periods. This study is a rare 
controlled trial that sheds light on the resource impact of a CDSS. 

In this setting, physician behavior was changed and resources were reduced by the application of logical 
algorithms to computer-based patient record information. Nevertheless, a different hospital striving to 
attain the same results would have to factor in the cost of the equipment, installation, maintenance, and 
software development plus the need to provide staff training in the use of a CDSS. 

Malpractice. In some emergency rooms, CDSS are used to prompt physicians to document the patient's 
record and to suggest the possibility of missed diagnoses — ones that could lead to serious patient harm 
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and malpractice liability — such as myocardial infarctions. In 1990, all emergency room physicians in 
Massachusetts were offered lower malpractice premiums if they regularly used a particular CDSS [Institute 
of Medicine, 1991]. An additional benefit of a CDSS in ambulatory settings can be faster decisions and 
reduced time for the patient in the emergency room, (for example, for triage of patients with symptoms 
of acute cardiac ischemia) [Sarasin, et al., 1994]. 

Evaluation. CDSSs should be evaluated according to how well they enhance the user's performance 
in the user's environment [Nykanen, et al., 1992]. Extending this concept, society should judge CDSSs 
not only on enhanced physician performance, but also on whether patient outcomes are improved and 
system-wide health care costs are contained. 

CDSS Hurdles. In a review of medical diagnostic decision support systems, Miller [1994] examines 
the development of CDSSs over the past 40 years, and identifies several hurdles to be overcome before 
large-scale, generic CDSSs grow to widespread use. These hurdles include determining: (l)how to 
support medical knowledge base construction and maintenance over time, (2) the amount of reasoning 
power and detailed representation of medical knowledge required (e.g., how strong a match of medical 
terms is needed to join medical concepts with appropriate information), (3) how to integrate CDSSs into 
the clinical environment to reduce the costs of patient data capture, and (4) how to provide flexible user 
interfaces and environments that adjust to the abilities and desires of the user (e.g., with regard to typing 
expertise and pointing devices). 

CPR Implementation. The real and perceived barriers to implementing a CPR include physicians' 
reluctance to enter data. Yet direct physician interaction with the system is important because it reduces 
transcription errors and allows the system to respond to the physician with preventive medicine prompts, 
drug contraindication alerts, and other reminders during routine patient care. During Beth Israel Hos- 
pital's transition to a CPR, data entry may not have been as large a barrier as the burden of recording 
clinical data on both paper and a CPR. Moreover, there is reason to believe that physicians have a greater 
concern about the confidentiality and privacy of their text notes, than about other clinical data in the 
CPR [Rind and Safran, 1994]. Older physicians may make significantly fewer CPR system inquiries per 
case than younger physicians, but Clayton, et al., [1994] reported that physician age explained only a 
small amount of variation (3 percent) in CPR use. In contrast, he found the average use by physicians 
categorized by hospital department (medical specialty) to vary by a factor of 10 between high- and low- 
utilization departments. 

Although Massaro [1993a, 1993b] and Dambro, et al., [1988] report on the problems faced and lessons 
learned during the installation period of their information technology systems, there is generally insufficient 
documentation in the medical literature about identifying and overcoming the barriers to successfully 
installing comprehensive, integrated clinical information systems in hospitals' and physicians' offices. 

Research Databases. CPRs can have great value for developing research databases, medical knowledge, 
and quality assurance information that would otherwise require an inordinate amount of manual 
resources to obtain in their absence. An example of CPR use in research is found in a study undertaken 
at Latter Day Saints (LDS) Hospital. Using the HELP CPR system to gather data on 2,847 surgical patients, 
this study found that administering antibiotics prophylactically during the 2-hour window before surgery 
(as opposed to earlier or later within a 48-hour window) minimized the chance of surgical-wound 
infection. It also reduced the surgical infection rate for this time category to 0.59 percent, compared to 
the 1.5 percent overall infection rate for all the surgical patients under study [Classen, et al., 1992]. 

The same system was used at LDS Hospital to link the clinical information system data (including a 
measure of nursing acuity) with the financial systems' data. Using clinical data to adjust for the severity 
of patient illness, Evans, et al., [1994] measured the effect of adverse drug events due to hospital drug 
administration on hospital length of stay and cost. The attributable difference due to adverse drug events 
among similar patients was estimated to be an extra 1.94 patient days and $1,939 in costs. 

Telemedicine. A CPR may hold and exchange radiological and pathological images of the patient taken 
or scanned in digital form. The advantage is that digital images may be transferred long distances without 
a reduction in quality of appearance. This allows patients to receive proficient medical advice even when 
they and their local family practitioner are far from their consulting physicians. 
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However, interpreting digital representations (images that are scanned, not originally digital) of radio- 
graphs, or X-ray images, while feasible, is inferior in some applications to reading conventional radio- 
graphs. Radiologists prefer customary X-ray films and view boxes to their digitized images, and would 
require significant training and specialized experience to achieve optimal performance with digitized 
images. Digital images, such as magnetic resonance images (MRIs), should appear no different whether 
viewed at the patient's site of care or by a radiologist hundreds of miles away from the patient — if the 
algorithms for compressing (to reduce the costs of data transmission) and decompressing the digital data 
do not result in a discernable loss of quality. This feature lends itself to the development of expert systems 
for reading MRIs of difficult cases, for example, particularly when a patient's medical facility is isolated 
from academic medical centers. For the most part, only radiology and pathology applications of tele- 
medicine have been extensively evaluated and found medically feasible. The cost effectiveness of these 
applications has not been rigorously evaluated [Grigsby and Kaehny, 1993]. 

Telemedicine is, of course, more than teleradiology and telepathology. Much can be accomplished with 
telephone, facsimile machines, and teleconferencing that may not require substantial technical support 
or training. The financial concern is that payment for telemedicine services may increase the cost of care 
without improving patient outcomes. Research and project evaluation findings are needed to show where 
telemedicine applications improve the quality of care and are cost effective. 

Additionally, when personally identifiable health care data are transported electronically across state 
borders for telemedicine uses, the applicability of state laws regarding confidentiality and privacy of this 
data is not often obvious. This raises legal questions for organizations that wish to move this data over 
national networks for patient management, business, or analytical reasons. A Federal privacy protection 
law that applies generally to medical data about patients does not currently exist [Waller, 1991], nor is 
there a Federal law or licensing system to protect physicians and their patients against the consequences 
of telemedicine-failure malpractice. 

176.5 Hospital and Ambulatory Care Systems 

A review of the literature to identify CPRS studies (with a focus on ambulatory care and on the research 
use of their data) revealed the systems shown in Table 176.1 [Agency for Health Care Policy and Research, 
1993]. Table 176.1 presents the system acronym, its meaning, its location, and a brief description. Some 
systems have been in use for over 25 years and, for the most part, were developed in academic medical 
centers. In some cases, they are actually data registry systems. Six of these well-studied systems — COSTAR, 
TMR, HELP, STOR, BIHS, and RMRS — have built vast patient databases derived from the regular delivery 
of patient care [Tierney and McDonald, 1991]. Two of these systems, COSTAR and HELP, are also 
commercial systems. 

There are numerous commercial computerized hospital information systems (HISs). Most of them 
focus primarily on administrative functions (admission, discharge, transfer, scheduling, and billing) and 
secondarily on order entry and results reporting. Although no one commercial HIS is operating today 
that integrates all the patient's clinical data from all sources and makes the data easily accessible to 
clinicians, nine commercial HIS vendors offering systems were identified in a study by Abt Associates, 
Inc., that 

"are designed to be used by clinicians (especially physicians), are comprehensive in functionality and 
well integrated or take a unique approach to integration, and that meet moderately rigorous standards 
for integration of clinical, financial, and analytic information are: 3M (HELP), Bell Atlantic (StatLAN 
Oacis), Cerner (CareNet), Health Data Sciences (Ulticare), IDX (OCM), Meditech (HIS), Phamis 
(LastWord), SMS (Invision), and TDS (7000)" [1993, 2-2]. 

Systems of the future may rely on intranet networks within the health enterprise and on web-based 
exchanges of patient information using web browsers, object-oriented technology, and document for- 
matting languages, e.g., hypertext markup language (http) and extensible markup language (XML). 
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TABLE 176.1 Hospital and Ambulatory Care Computer-Based Patient Record Systems 



Private Sector Systems 
ARAMIS 

ATHOS 

BICS 

BIHS 

CIS 

COSTAR 

DIOGENE 

HCHP/AMRS 

HELP 

MARS 

PIMS/PIPS 

RMRS 

RPMS 

STOR 

THERESA 
TMR 

VIPOR 

Federal CPR systems 



Arthritis, Rheumatism, and Aging Medical Information System [TOR (Time Oriented Record) 

is the collection instrument]; Stanford University Medical Center (MA); National chronic 

disease data bank. 
AIDS Time-Oriented Health Outcome Study; Stanford University Medical Center (CA); 

National chronic disease data bank. 
Brigham and Women's Hospital Integrated Computing System; Brigham and Women's Hospital 

(MA); Hospital and ambulatory care patient record. 
Beth Israel Hospital System; Beth Israel Hospital (MA); Hospital and ambulatory care patient 

record. 
Clinical Information System; Columbia-Presbyterian Medical Center (NY); Hospital and 

ambulatory care patient record. 
Computer-Stored Ambulatory Record; Massachusetts General Hospital (MA); Ambulatory care 

patient record (also commercial). 
University Hospital of Geneva (Switzerland); Hospital and ambulatory care patient record [non- 

U.S. but identified in AHCPR, 1993]. 
Harvard Community Health Plan Ambulatory Medical Record; HCHP (MA); Ambulatory care 

patient record. 
Health Evaluation Through Logical Processing; LDS Hospital (UT); Hospital and ambulatory 

care patient record (also commercial). 
Medical Archival System; University of Pittsburgh (PA); Hospital and ambulatory care patient 

record (under development). 
Patient Information Management System/Patient Information Protocol System; Loyola 

University Medical Center (IL); Hospital and ambulatory care patient record. 
Regenstrief Medical Records System; Indiana University Medical Center (IN); Hospital and 

ambulatory care patient record. 
Resource and Patient Management System; Indian Health Service (AZ); Hospital and 

ambulatory care patient record. 
Summary Time-Oriented Record; University of California at San Francisco (CA); Ambulatory 

care patient record. 
Grady Memorial Hospital (GA); Hospital and ambulatory care patient record. 
The Medical Record; Duke University Medical Center (NC); Hospital and ambulatory care 

patient record. 
Vermont Integrated Problem-Oriented Record; University of Vermont Health Center (VT); 

Ambulatory care patient record. 



DCHP Decentralized Hospital Computer Program; Department of Veterans Affairs; Hospital patient 

record. 
CHCS Composite Health Care System; Department of Defense; Hospital patient record (6 modules). 

IHS Indian Health Service, Department of Health and Human Services; Hospital and ambulatory 

care patient record. 



176.6 Extended Uses of CPR Data 



Data produced by such systems have additional value beyond supporting the care of specific patients. 
For example, subsets of individual patient care data from CPRs can be used for research purposes, quality 
assurance purposes, developing and assessing patient care treatment paths (planned sequences of medical 
services to be implemented after the diagnoses and treatment choices have been made), assessments of 
treatment strategies across a range of choices, and assessments of medical technologies in use in the 
community after their approval by the Food and Drug Administration. When linked with data measuring 
patient outcomes, CPR data may be used to help model the results achieved by different treatments, 
providers, sites of care, and organizations of care. 
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If these data were uniformly defined, accurately linked, and collected into databases pertaining to 
particular geographical areas, they would be useful for research into the patient outcomes of alternative 
treatments for specific conditions treated in the general practice of medicine and for developing infor- 
mation to assist consumers, health care providers, health plans, payers, public health officials, and others 
in making choices about treatments, technologies, sites and providers of care, health plans, and commu- 
nity health needs. This is currently an ambitious vision considering the presently limited use of CPRs 
and insufficient incentives for validating and storing electronic patient record data. Many such decisions 
are now based on data of inferior quality; but the importance of these decisions to the health care market 
is driving higher the demand for uniform, accurate clinical data. 

Federal Programs 

Uniform, electronic clinical patient data could be useful to many Federal programs that have responsibility 
for improving, safeguarding, and financing America's health. For example, the Agency for Health Care 
Policy and Research (AHCPR) is charged with undertaking research to "identify the most effective 
methods for preventing, diagnosing, treating, and managing various health conditions" in community 
settings. [PL 101-239, 1989]. The research should improve patient outcomes of care, quality measurement, 
address cost, and access problems. To examine the influence on patient outcomes of alternative treatments 
for specific conditions, this research needs to account for the simultaneous effects of many patient risk 
factors, such as diabetes and hypertension. Health insurance claims data do not have sufficient clinical detail 
for many research, quality assurance, and evaluation purposes. Therefore, claims data must be supplemented 
with data abstracted form the patients' medical records. In many cases, the data must be identified and 
collected prospectively from patients (with their permission) to ensure availability and uniformity. The use 
of a CPR could reduce the burden of this data collection, support practice guideline development in the 
private sector, and support the development, testing, and use of quality performance measures. 

Medical effectiveness research could benefit from access to uniform, accurate, nationwide data from 
CPRs — for example, on the last 10,000 cases of a given condition such as benign prostatic hyperplasia, 
low back pain, or cataracts. Because CPRs do not typically contain information on patient outcomes 
after discharge from a hospital (or a physician's office), CPR data would have to be supplemented with 
patient outcome measures. 

Other Federal, State, and local health agencies also could benefit from CPR-based data collections. For 
example, the Food and Drug Administration, which conducts postmarketing monitoring to learn the 
incidence of unwanted effects of drugs after they are approved, could benefit from analyzing the next 
20,000 cases in which a particular pharmaceutical is prescribed, using data collected in a CPR. Greater 
confidence in postmarket surveillance could speed approval of new drug applications. The Health Care 
Financing Administration could provide guidance and information to its Professional Review Organiza- 
tions (PROs) about local and nationwide medical practice patterns founded on analysis of national and 
regional CPR data about Medicare beneficiaries; the state-based, Medicare PROs could analyze data from 
CPRs in their own states to provide constructive, quality-enhancing feedback to hospitals and physicians. 
As a further example, the Centers for Disease Control and Prevention could more quickly and completely 
monitor the incidence and prevalence of communicable diseases with access to locally available CPR data 
on patient care. State and local public health departments could allocate resources more quickly to address 
changing health needs with early recognition of a community health problem. Many of these uses require 
linked data networks and data repositories (that the communities trust with their health data). 

176.7 Selected Issues 

While there are personal and public benefits to be gained from extended use of CPR data beyond direct 
patient care, the use of personal medical information for these uses, particularly if it contains personal 
identification, brings with it some requirements and issues that must be faced. Some of the issues that 
must be addressed by health care policy makers, as well as by private markets, are as follows: 
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Standards are needed for the nomenclature, coding, and structure of clinical patient care data; the 
content of data sets for specific purposes; and the electronic transmission of such data to integrate data 
efficiently across departmental systems within a hospital and data from the systems of other hospitals 
and health care providers. If benefits are to be realized from rapidly accessing and transmitting patient 
care data for managing patient care, consulting with experts across long distances, linking physician 
offices and hospitals, undertaking research, and other applications, data standards are essential [Fitz- 
maurice, 1994a]. The Computer-based Patient Record Institute has taken an active interest in the devel- 
opment of such standards, and the Healthcare Informatics Standards Board of the American National 
Standards Institute has coordinated the standard developing organizations that work on such standards 
in the United States and provided an official link to standards organizations in other countries, such as 
the European Standards Committee (CEN). The Organization of International Standards (ISO) Technical 
Committee 215, Health Informatics, was formed in 1998 to provide a forum for international coordina- 
tion of health informatics standards. The Board of Directors of the American Medical Informatics 
Association [1994] has recommended specific approaches for many patient data standards' areas — adopt 
the best of what exists and move quickly toward continued refinement. 

Administrative health data standards are mandated in the Health Insurance Portability and Account- 
ability Act (HIPAA) of 1996 (Public Law 104-191). In this law, the Secretary of Health and Human 
Services (HHS) is directed to adopt standards for nine common health transactions (enrollment, claims, 
payment, and others) that must be used if those transactions are conducted electronically. Penalties are 
capped at $25,000 per year for each transaction violated. Digital signatures, when adopted by the 
Secretary, may be deemed to satisfy federal and state statutory requirements for written signatures for 
HIPAA transactions. The provisions of HIPAA are expected to take effect in 2001. 

Confidentiality and privacy of individually identifiable patient care and provider data are very likely 
the most important issues. For most purposes, including much research, the user does not need to know 
patient identities if the data have been accurately linked by a trusted party. The benefits from extended 
uses of this data must be balanced against the opportunities for personal harm from unwanted disclosure 
(which must be avoided) . A Federal privacy law that spells out the allowable uses of personally identifiable 
health care data and the conditions under which it can be used, with civil and criminal punishments for 
violating disclosure or redisclosure prohibitions, might appropriately address this issue. It should establish 
safeguards for privacy based on uniform and fair information practices [Gostin, etal., 1993; Privacy 
Protection Commission, 1977; Secretary's Advisory Committee on Automated Personal Data Systems, 
1973]. HIPAA requires the Secretary of HHS to promulgate final regulations containing standards with 
respect to the privacy of individually identifiable health information transmitted in HIPAA transactions 
in 2000, if privacy legislation is not enacted in 1999. 

The quality of stored clinical data may be questioned in the absence of organized programs and criteria 
to assess the reliability, validity, and sufficiency of this data for undertaking research and for providing 
useful information to consumers, medical care organizers, and payers. For proper analysis, the data should 
be sufficient to measure and assess the relevant risk factors influencing patient outcomes. 

Electronically stored records in one state may be considered to be legally the same as paper records, but 
not in another state. In law, regulation, and practice, many states require pen and ink recording and 
signatures, apparently ruling out electronic records and signatures. This inconsistency, and uncertainty, 
creates a barrier for the electronic communication and legal acceptance of patient care data, images, medical 
advice, and claims across state boundaries for medical and other publicly acceptable purposes [Waller, 1991]. 

Standard unique identifiers for patients, health care providers, institutions and payers are needed to 
obtain economies and accuracy when linking patient care data at different locations, and patient care 
data with other relevant data. Under HIPAA, the Secretary of HHS must adopt standards for uniquely 
identifying providers, health plans, employers, and individuals. Because of concerns about the confiden- 
tiality of personal health information that may be linked using the unique individual health identifier, 
final implementation of that identifier will await explicit approval by the administration and Congress 
[Public Law 105-277]. 
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Malpractice concerns arise as telemedicine technology allows physician specialists to give medical advice 
across state borders electronically to other physicians, other healthcare providers, and patients. Physicians 
are licensed by a state to practice within its own state borders. Does a physician who is active in 
telemedicine need to obtain a license from each state in which he or she practices medicine from outside 
the state? If the expert physician outside the patient's state gives bad advice, which state's legal system 
has jurisdiction for malpractice considerations? 

System security and integrity become improtant as more and more data and information for patient 
treatment and other uses are exchanged through national networks. Not only does this issue relate to 
purposeful violations of privacy, but also to the accuracy of medical knowledge for patient benefit. If the 
system fails to transmit accurately what was sent to a physician — for example, an MRI, a practice 
guideline, or a clinical research finding — and if a physician's judgment and recommendation is based on 
a flawed image or other misreported medical knowledge, who bears the legal responsibility for a resulting 
inappropriate patient outcome? National standards for the security of health information are mandated 
by HIPAA. 

Benefit-cost analysis methodologies must be developed and applied to inform investment decision 
makers about the most productive applications of CPR systems. There is a need for a common approach 
to measuring the benefits and the costs to be able to compare alternative applications. Accurate research 
findings applied to business risk and benefit assessments can advance commercial progress for developing 
and implementing CPR applications. 

Regional health data repositories for the benefit of employers, hospital groups, consumers, and state 
health and service delivery programs raise issues about the ownership of patient care data, the use of 
identifiable patient care data, and the governance of the data repository. A study by the Institute of 
Medicine [1994] examines the power of regional health data repositories for improving public health, 
supporting better private health decisions, recognizing medically and cost-effective health care providers 
and health plans, and generally providing the information necessary to improve the quality of health 
care delivery in all settings. Some of the data in these repositories may be based on CPR data. This data 
may include personally identifiable data and move outside the environment in which it was created. 

Patient medical record information is the subject of study by the National Committee on Vital and 
Health Statistics (NCVHS). The function of this expert public advisory is to advise the Secretary of HHS 
on issues relating to health data, statistics, privacy, and national health information policy 
[http://aspe.dhhs.gov/ncvhs/]. Mandated by HIPAA, NCVHS must "study the issues related to the adop- 
tion of uniform data standards for patient medical record information and the electronic exchange of 
such information" and "shall report to the Secretary not later than [in 2000] ... recommendations and 
legislative proposals for such standards and electronic exchange." Current issues dealing with computer- 
based patient record data-formatting, definitions, codes, confidentiality, and technology-are frequently 
the subject of public NCVHS hearings. 
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Computer technology plays a prominent role today in health care delivery centers throughout the world. 
Information technology is gradually transforming the basic practice of medicine by improving the quality 
and quantity of information used by clinicians and administrators. The fundamental compute technology 
that accomplishes transparent and efficient flow of information from the creators of the information to 
the ultimate end users is Computer Networking [Tanenbaum 1988]. 

In a medical center, and to a lesser degree, in a health care practice center, computers are used for 
many purposes: in ancillary, departmental settings such as laboratory, pharmacy, radiology, pathology; 
in administrative contexts such as billing, patient management, transportation, payroll; in clinical and 
scholarly purposes such as electronic medical records, imaging, searchings for medical references; and 
in basic research functions such as molecular modeling, genetics, robotics surgery [Rennels 1989; Duis- 
terhout et al., 1991]. The concept of distributed computing is fast becoming a reality in medical domains, 
and it deals with the collection, integration and presentation of data distributed over several computers. 
Seamless information exchange requires the ubiquitous presence of a network, which, comprised of 
hardware and software components or layers, interconnects these computers. Well-defined methods and 
rules to communicate between computers are called communication protocols, and a collection of specific 
layers and protocols with its implementation is called a communication standard. Several communication 
standards, proprietary and open, exist today. 

177.1 History 

During the 1970s and the early 1980s, computing at hospitals was targeted to administrative needs. 
Central mainframe computers were employed for financial data processing. Even then, laboratory func- 
tions were starting to be automated due to their sheer volume and importance to the clinical community. 
Advent of personal computers (PCs) truly opened up automation opportunities for the ancillary depart- 
ments as it became cost-effective to collect both departmental data for billing purposes and clinical data 
for clinical purposes. During the late 1980s, these stand-alone systems became permanent fixtures. 
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The networking technology also matured in the latter part of the 1980s. Networks' initial deployment 
within health care practice, however, was haphazard at best because of the large capital cost of laying 
cables and little understanding of the benefits at that time. Stand-alone systems started to need data from 
other systems: A single bill has to be sent to a patient for all services even though pharmacy and laboratory 
are different departments; ancillaries want to use the same demographic information that was entered 
in the hospital patient management system, and so on. Individual departments experimented with 
networking and succeeded in creating local solutions. This eventually became a problem when networks 
came to be appreciated as institutional resources and the local islands of information had to be rearranged 
for the larger institutional goals o networking efficiency and reliability. 

177.2 Impact of Clinical Data 

Clinical computing addresses the need for data related to patient care. These data must be delivered to 
the providers (physicians, nurses, technicians, therapists, social workers) in a timely fashion. The data 
must be accurate and be delivered in patient care settings such as bedside or nursing stations, intensive 
care units (ICUs), emergency rooms, operating theaters, clinics, and physicians' offices with varying degree 
of delivery efficiency. The needs for data in ICUs, for example, are more stringent than in a clinic. The 
primary data of importance in an inpatient setting are laboratory results. Other important sources of data 
include pharmacy, radiology, pathology, cardiology, vital signs, neurology, nursing documentation, obstet- 
rics/gynecology, operative reports, discharge summaries, and visit notes. Whereas the paper-based patient 
chart has traditionally served the need for such information, it is clear that computers are far more efficient 
and accurate in organizing, maintaining, and disseminating these records. The ability to use clinical 
information to support automated decision-making applications does not exist in paper-based systems. 
Hence, the concept of an all electronic medical record has become popular in the past few years. 

The delivery of clinical data from many different ancillary systems to diverse end-user settings requires 
a reliable networking infrastructure to transport the data. Clinical data may be collected from different 
sources and stored in a (conceptually) central clinical database. Here, a network is necessary to intercon- 
nect ancillary systems to a back-end central repository. In another scenario, a front-end workstation may 
query individual ancillaries and may dynamically construct an integrated view for a provider. Here, a 
network is necessary to connect ancillary systems to a front-end workstation. In reality, a comprehensive 
network that connects all computing entities within an institution enables all scenarios. 

177.3 Information Types 

Health care data have tremendous variety, all of which are important in patient care and have different 
implications for the performance required of a network. Most of the clinical data available today are in 
textual form as narrative reports. Radiology, pathology, and so on generate reports about specific tests; 
discharge summaries explain the entire episode of the patient's stay in the hospital; operative reports 
explain the procedure followed during operations. These reports range anywhere from a few sentences 
(in a specific type of pathology report) to several pages (autopsy reports). 

Laboratory data tend to be in coded form, where each data item is represented as a code (alphanumeric, 
integers). This is very useful as providers need to look at trends in data over time (examples include 
temperature readings and anion count in blood). Furthermore, for computer-based clinical decision 
support, more data need to be in coded form so that a computer-based expert system can understand 
the values and act upon them to send alerts to providers, if necessary. Coded data, in general, are small 
in size, but in some cases such as an EKG tracing, each point may be a code, and thus the report may 
be of substantial size. 

Hospitals also use a large amount of multimedia data. All radiologic tests are performed as visual 
interpretation of images and video sequences (X-ray images, CAT scans, PET scans, SPECT scans). To a 
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lesser degree, ultrasound sonograms, pathology slides, gated blood pool studies and neurology are other 
sources of these data. A network that can support these kinds of data requires special consideration and 
design and is also more expensive. 

In academic institutions, a large amount of computer-based educational information is graphical and 
video-based. Many stand-alone educational products are being actively used today. Looking toward the 
future, these data (depicting topics such as how to conduct a "physical and history" examination on a 
patient and how to conduct an arthroscopic surgery of the knee) will be available over the same network 
and on demand to all students, faculty, and practitioners. 

177.4 Platforms 

Individuals departments have typically purchased turnkey systems (or have purchased hardware and then 
developed the necessary software themselves) that are best-of-breed variety for their applications. Once the 
benefits of data exchange were understood, networking has become one of the key considerations for these 
systems. Since there are several standards of networking, and not all are supported by all systems, hetero- 
geneity of computing platforms has become an issue. The solutions include application gateways, computers 
that translate from one protocol to another while passing the data between them transparently. Gateways 
are acceptable as an exception, but too many gateways cause severe performance and reliability problems 
in the network. The current norm is to choose a restricted set of protocol standards that are fully supported 
within an institution. A strict control on resources helps in addressing the network needs of the majority. 

177.5 Current Technologies 

Networks are usually denned as a set of layers, each layer providing service to the layer above it. The 
networking infrastructure suitable in a campus environment today is called a local area network (LAN). 
An organization as full control over its LANs. In contrast, when communication is required over long 
distances (e.g., and organization which has offices all over the world), services from a long distance carrier 
company are required to establish what is called a wide area network (WAN). The protocols at higher 
layers work transparently over LANs and WANs, thus providing transparency to applications. Currently, 
software solutions are appearing that use distributed programs in order to achieve total network trans- 
parency and data integration from the end-user viewpoint. These are popularly termed middleware. 
Finally, there are specific solutions and standards pertinent to the health care environment which address 
medical domain problems which maybe called medical domain middleware. In the following, we elaborate 
further on each of these concepts. 

Local Area Networks 

The lowermost layer in networking is the physical medium, which can be copper cables (with different 
degrees of insulation, twists, and impedance), fiber cables (with different diameter and refraction indices), 
or even the atmosphere in the case of wireless, communication based on radiofrequency transmissions. 
Examples of physical medium standards include lOBaseT, 10Base5, and STP. The next higher layer, the 
link layer consists of electronics or optics to drive signals on the physical layer. The most popular LAN 
standards in this layer today are Ethernet (or IEEE 802.3) at a rated lOMbits/s speed (or bandwidth) and 
Token Ring (IEEE 802.5) at 4 or 16Mbits/speed. Above these layers, several proprietary protocols and 
layer stacks may coreside in typical medical practice center LANs. These include IBM SNA and NetBIOS, 
Novell SPX/IPX, DEC DECNet and LAT, and Apple AppleTalk. Among open protocols, the TCP/IP stack 
is the most popular standard. 

The topology of a LAN is the layout of networking segments (e.g., Ethernet, Token Ring) and their 
interconnections by devices such as repeaters, bridges, and routers. Some of the factors that influence 



e 2000 by CRC Press LLC 



topology design are traffic estimates and patterns, redundancy, isolation and security, physical plant 
including building and cabling layout, and organizational structure. A practical topology is that of 
redundant campus backbone segments that then connect to building segments, thus forming a hierarchy 
of segments. A medical center LAN needs to accommodate heterogeneous sets of computer and conse- 
quently needs to support several higher-layer protocols. The choice of higher-layer protocols also influ- 
ences the choice of bridges and router devices. Although initially driven by the integration of existing 
systems, a medical center LAN settles on a small subset of protocols for most of its mainstream commu- 
nication which is practically motivated by the support and maintenance issues. 

In addition to medical domain applications, users require the usual file and print services for day-to- 
day applications such as spreadsheets, word processors, presentation programs, and electronic mail. As 
networked PCs have become popular as user workstations, Novell SPX/IPX in the IBM PC world and 
Apple AppleTalk in the Apple Macintosh world protocols are increasingly used to provide these services. 
A widely used application on the PCs today is terminal emulation by which a PC behaves as a terminal 
to an ancillary or central computer on the network, and thus subsequently has access to all medical 
applications on that application server computer. LANs, instead of the traditional point-to-point con- 
nections, have become the primary vehicle for computer communication at health care practice centers 
to the extent that all new computers are expected to be LAN-compatible. A special group of LAN 
specialists is required to support the LANs. Since LAN technology is constantly evolving toward better 
and faster communications, it is important to maintain up-to-date knowledge in this field. 

Wide Area Networks 

Health care centers employ WANs mainly for three purposes: to connect networks at physically distant 
buildings, offices, or clinics, to connect networks to different organizations (affiliated hospitals, insurance 
agencies, state or federal government regulatory bodies) or services (Internet, disaster recovery backup 
locations); and to provide dial-in-and dial-out capabilities for employees. The difference between the 
first two purposes is that of control and security: access to medical information requires better control 
when connected to an external organization. The concept of a firewall as part of networking topology 
has become popular when connecting to the Internet in order to deter unwarranted access. A physician's 
office may connect to insurance networks for billing, as well as to on-line services to connect to the large 
Internet. In all cases, an external telecommunications carrier's service is used to provide underlying long- 
distance, point-to-point communications capability. 

The aggregate bandwidth in a WAN connection is typically much smaller than the LANs bandwidth 
because of the higher costs of WAN connections. Popular leased line connections have speeds ranging 
from 56 Kbits/s to 1.56 Mbits/s (T-l circuits) to 45 Mbits/s (T-3 circuits), and fractions thereof. Other 
modes of such connections are accomplished by dedicated microwave or laser communications. In all 
cases, end devices such as modems are required to drive these connections. Additionally, multiplexers 
are employed to effectively use the available bandwidth. It is common to extend the LAN protocols to 
run over the WAN connections, so that applications run transparently regardless of location, but perhaps 
at a slower speed due to the restricted bandwidth. 

Dial-in and dial-out access is provided over asynchronous phone lines with speeds ranging from 
1200 bits to 28.8 Kbits/s. Integrated Services Digital Network (ISDN) solutions (at 64 Kbits/s) are 
becoming available and cost-effective. With implementations of the point-to-point protocol (PPP) stan- 
dard, the dialing-in PC will become a full peer node on the organizational network, instead of becoming 
a terminal to some computer or going through a surrogate PC on the network. 

Middleware 

At higher layers of networking, the client-server computing paradigm is becoming an alternative to central 
host-based computing. By distributing the work over several nodes on the network, more efficient and 
flexible solutions are created. These solutions integrate data for the end-users at the front end while 
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hiding the multiplicity of origins of the data at the back end. A reliable, distributed file system, distributed 
printing services, consolidated directory services, network-based security service, and transparent invo- 
cation of applications residing elsewhere are examples of middleware. In the medical center context, it 
is extremely important to establish standards at this level to effectively tame the heterogeneity. Specific 
examples of middleware include Open Software Foundation's Distributed Computing Environment 
(DCE), SUN's Open Network Computing (ONC), OSI X.500 directory services, MIT kerberos security 
service, and OMG Common Object Request Broker Architecture (CORBA). 

Medical Domain Middleware 

With the availability of standard networks, applications, and enabling middleware technologies, clinical 
computing concentrates on the collection, integration, and dissemination of clinical data. In addition, 
the major focus is to convert data into knowledge in order to help the provider make better clinical 
decisions. In order to accomplish these transformations, several medical domain middleware tools and 
concepts need to be constructed and understood to operate in a distributed computing environment. 

Integrated Patient Database 

A comprehensive clinical information system requires construction of a logically integrated repository 
of all clinical information for each patient. As stated previously, clinical data originate at many ancillary 
systems. All data may be collected at a single, physically central repository, using the network as and 
when data are produced at the ancillaries. In this case, a data review application needs only to query 
(over the network) the central repository for all data, which reduces the complexity at the front end. 
Alternatively, data review applications may query several ancillaries dynamically at the time specific 
patient's data are being reviewed, thus creating a virtual patient repository. In either case, data integration 
is transparent to the end-use. 

Networks permit very efficient distributions of the patient database. Since, from a clinical viewpoint 
one patient's data are completely independent of another patient's data, each patient's data constitutes 
a separate database. Therefore, it is possible to isolate a specific patient database (or a set of patient 
databases) on one host. One logical distribution strategy is that of dedicating a fully functional database 
for each nursing station within a hospital with a well-defined strategy of migrating patients from one 
database to another. Once again, however, the operations must be transparent to the end-user, and the 
explicit reference to hosts and patient information must be hidden within the database functions. 

The need for clinical and hospital data exchange among heterogeneous systems has required the 
creation of common standards for the definition of medical events, the collection of data associated with 
these events, and the recognition of events that may be useful to participating systems [McDonald & 
Hammond, 1989]. The only practical standard today is Health Level 7 (HL-7), commonly followed by 
many vendors today. Examples of events defined in HL-7 include admit, discharge, transfer, order, and 
order results. The use of the standard avoids the need to make unnecessary translations. 

The success of these early databases has raised the interesting question: How can a patient's information 
be collected from many institutions (doctor's office, clinics, hospitals) to create a comprehensive record 
that is readily available over the network regardless of the patient's geographic location? Many efforts in 
electronic medical records at the national level include discussion of the pros and cons of merging of 
information for clinical care of the patient as well as for epidemiological research for the society at large. 

Medical Vocabulary 

A common medical vocabulary is fast becoming a necessity within clinical information systems. The 
vocabulary addresses the problem of understanding the semantics of medical concepts, answering ques- 
tions about whether two concepts are the same or different, and if the latter, how so [Cimino et al., 1994] . 
For example, cardiomegaly and enlargement of heart are the same concepts, a fact that cannot be derived 
syntactically. A medical vocabulary defines medical concepts as entities and supports links between 
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entities that collectively define that concept. In the exchange of information between systems, the vocab- 
ulary serves as the semantic standard of the meaning of the data items exchanged, much as HL-7 serves 
as the syntactic standard. Without the vocabulary, the end-user must know how each medical concept 
is described in each of the ancillaries to formulate his queries correctly. With the vocabulary, the concepts 
are semantically translated to either a canonical code or to the native ancillary code for that concept. 
The vocabulary makes the heterogeneity of terminology introduced by several ancillary systems trans- 
parent to the user. 

Directions and Challenges 

Medical computing will compute to take advantage of the ever-increasing expansion of the computing 
technology. Options not viable in the past are quickly becoming mainstream in the areas of improved 
networking bandwidth. In addition, the current changes in the health care business dictate more efficient 
forms of car delivery, which may be facilitated only by adopting information technologies. In the following 
sections, we discuss a few directions and challenges in this field. 

Improved Bandwidth 

The currently prevalent networks with 4, 10, and 16 Mbits. a bandwidth are inadequate for pervasive 
imaging and multimedia applications. Due to the richness of medical information, these applications 
are extremely important and have a tremendous impact on the required network bandwidth. Existing 
high-speed networks at 100 Mbits/s, primarily Fiber Distributed Data Interface (FDDI), have been used 
as the campus backbones but have failed to capture the marketplace due to the high cost of fiber cable 
deployment and adapter cards. Instead, new standards oflOO Mbits/s communication (100Base-T or Fast 
Ethernet, lOOVG-AnyLAN) are fast becoming reality. These technologies use the same cables (UTP 
grade 5, STP) as for the slower networks, but at a higher cost of equipment. If fiber optic cables are used, 
then communication can reach Gigabits/s (1000 Mbits/s) range, although most implementations today 
support 155Mbits/s. This is being accomplished by the new standard of asynchronous transfer mode (ATM) 
communication and corresponding devices called ATM switches. The long-distance carriers are stan- 
dardizing on interfaces called synchronous optical network (SONET) or synchronous digital hierarchy 
(SDH) that support variable speeds ranging from 1.56 Mbits/s (T-l circuits) to 155 Mbits/s (OC-3c) to 
655 Mbits/s (OC-7c) over fiber optic cable. Since ATM switches can use SONET connections, ATM may 
be the enabling technology that allows LANs and WANs to work together seamlessly at high speeds 
without any performance penalties. 

ATM technology is available in various forms. It can be used as a campus LAN backbone switch that 
connects routers, servers, and other switches. The second form is that of LAN extenders, whereby an 
Ethernet or Token Ring is extended to another location over high-speed ATM connections; in this form, 
ATM switches work as virtual LAN repeaters. ATM adapters are available for many workstations such as 
SUN SparcStations, IBM RS/6000, HP 9000, and DEC AlphaStations. The technology itself is flexible in 
that it can be supported over any type of cable (copper and fiber). The communication occurs over fixed 
length cells of 53 bytes (as opposed to variable length packet in X.25), which makes it possible to guarantee 
service to time-dependent (or isochronous) applications and allows very fast hardware-based cell switch- 
ing to switching route the cells through the switches. ATM networking also capitalizes on today's high 
speed, reliable communication links by forgoing error/detection/correction at each intermediate node 
within the network. 

The ATM standard, set by the International Telegraph and Telephone Union Telecommunications 
Sector (ITU-T), is promoted by an ATM Forum consisting of over 500 members. The Forum has also 
set standards for interfaces required to communicate between various ATM nodes, and between ATM 
layers and other protocols. Residing within the traditional link layer, several parallel stacks called ATM 
adaptation layers (AAL) define protocols and quality of services for voice circuit emulation (AAL 1) 
timing-sensitive video/audio transfer (AAL 2), connection-oriented and connectionless data transfer for 
bursty data (AAL 3/4), and simple and efficient adaptation layer for local, high speed LAN installations 
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(AAL 5). Although the ATM technology has great potential, the ATM solutions must integrate with the 
legacy networks in an orderly fashion to achieve a smooth transition. The accelerated integration of ATM 
technology into the desktop, hubs, bridges, and routers indicates that ATM has a high probability of 
dominating the marketplace. 

At many institutions today, campus backbones are composed of fiber optic cables, and some institu- 
tions even have fiber optic cables at the desktops. At a few well-known radiology centers, picture archiving 
communications systems (PACS), a technology for creating, storing, and disseminating radiologic images, 
have been installed using networks exclusively dedicated to PACS information exchange. The PACS 
systems may be considered one of the ancillary systems serving one department's needs. A faster net- 
working technologies become universally available and as workstations become powerful enough to 
provide on-the-fly decompression, a true multimedia view of patient information may be constructed 
for the first time at the provider's desktop. 

Another example of applications requiring higher bandwidth is telemedicine. It is a concept whereby 
health care may be delivered remotely over computer networks by providers who are physically far away 
from the patient [Halpern et al., 1992]. This technology is being actively pursued by the federal govern- 
ment and by several state governments through the National Information Infrastructure initiative and 
through statewide information highway initiatives. Several tertiary-care institutions are arranging affili- 
ation with primary and secondary care institutions to provide expert referral services. The concept is 
also popular in rural settings, where more sophisticated care may be available at a limited number of 
locations. 

One of the fundamental requirements for telemedicine is a networking bandwidth that is capable of 
supporting simultaneous video, audio, image and data transfer in real time. Technologies such as ATM 
and SONet promise to be the solutions to the problem. Today, telemedicine is restricted to asynchronous 
referral services for medical images, remote review of clinical information, and occasional video confer- 
ences. In prototype situations, however, complicated scenarios involving telesurgery have been shown to 
be possible using remote robotic hands and head-mounted displays in an augmented reality setting. 

Security 

Networks have made flexible data access possible. Recently data interconnections have exploded among 
care facilities, private physicians' offices, nursing homes, insurance agencies, health maintenance orga- 
nizations, research institutions, and state and federal regulatory agencies. Internet is the precursor to the 
eventual information super-highway. The problem of security has become ever more important in this 
new world of accessible yet confidential patient information — including issues of who owns the data, 
how to protect the data, how to convert it securely to meet societal needs for statistical information, and 
so on. Smart cards, encryption digital signatures, guaranteed authentication and authorization, and 
constant accounting and auditing are several techniques being considered in intense discussions address- 
ing the security aspects of medical information networking [US Congress 1993]. 

Management 

Managing networks and distributed applications is orders-of-magnitude harder than managing a single, 
monolithic system. It is, however, extremely important to invest in and learn how to manage the hundreds 
of networking and computing components that bring value to the networks. Although there are some 
good tools and standards available to monitor the lower layers of networks and devices such as bridges 
and routers, there is a total lack of management capabilities for middleware and nascent medical domain 
middleware. Monitoring of applications helps determine future demands for resources, performance 
guarantees, and opportunities to make a system more reliable. The primary management communication 
standard is simple network management protocol (SNMP), which is not yet at the advanced stage of 
managing higher-layer protocols and applications. 
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177.6 Conclusions 

The use of networks at health care practice centers has provided value by increasing data availability, 
flexibility, and efficiency of operations. The graphical-user-interface-based applications, along with client- 
server paradigm of computing have begun to appear on these networks. The next generation of multi- 
media applications will require higher bandwidth networks, which have already begun to appear. In 
addition, the national efforts to link health care industries and providers in order to improve efficiency 
will now be possible due to the networks that have become the fourth utility — after electricity, water, 
and heat — in many medical centers. Individual doctor's offices, clinics, and practice settings are connect- 
ing to computer networks that accomplish the communication functions of the larger health care business 
networks. 

Defining Terms 

Communication protocols: A set of rules and methods by which computers communicate with each 
other. 

Ethernet, Token Ring: Examples of specific kind of communication protocols suitable for local area 
networks. 

Fiber distributed data interface (FDDI), asynchronous transfer mode (ATM): High-speed commu- 
nication protocols suitable for both local and wide area networks. 

Local area network: Computer networking infrastructure suitable for a campus (or smaller) environ- 
ment where the owning organization controls connection and protocols. 

Medical domain middleware: Middleware addressed specifically to transparently access health care 
information on the network. 

Middleware: Software components that help create a transparent access to resources regardless of their 
physical location in the network. 

Network topology: A detailed map of physical and conceptual connections among networking devices. 

Unshielded or shielded twisted pair (UTP, STP): Types of cables with different electrical characteristics 
and prices. 

Wide area network: Computer networking infrastructure that connects several local area networks 
distributed over great geographical distances using external services. 
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As the cost of health care has become a larger percentage of the gross domestic product of many developed 
notions, the focus on methods to improve health care productivity and quality has increased. To address 
this need, the concept of a health care information infrastructure has emerged. Major elements of this 
concept include patient-centered care facilitated by computer-based patient record systems, continuity 
of care enabled by the sharing of patient information across information networks, and outcomes 
measurement aided by the greater availability and specificity of health care information. 

The creation of this health care information infrastructure will require the integration of existing and 
new architectures, products and services. To make these diverse components work together, health care 
information standards (classifications, guides, practices, terminology) will be required [ASTM, 1994]. 
This chapter will give you an overview of the major existing and emerging health care information 
standards, and the efforts to coordinate, harmonize, and accelerate these activities. It is organized into 
the major topic areas of: 

■ Identifier standards 

■ Communications (message format) standards 

■ Content and structure standards 
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• Clinical data representations (codes) 

• Confidentiality, data security, and authentication 

• Quality indicators and data sets 

• International Standards 

• Coordinating and promotion organizations 

• Summary 

178.1 Identifier Standards 

There is a universal need for health care identifiers to uniquely specify each patient, provider, site of care 
and product; however, there is not universal acceptance and/or satisfaction with these systems. 

Patient Identifiers 

The social security number (SSN) is widely used as a patient identifier in the United States today. However, 
critics point out that it is not an ideal identifier. They say that not everyone has an SSN; several individuals 
may use the same SSN; and the SSN is so widely used for other purposes that it presents an exposure to 
violations of confidentiality. These criticisms raise issues that are not unique to the SSN. A draft document 
has been developed by the American Society for Testing and Materials (ASTM) E31.12. Subcommittee 
to address these issues. It is called the "Guide for the Properties of a Universal Health Care Identifier" 
(UHID). It presents a set of requirements outlining the properties of a national system creating a UHIC, 
includes critiques of the SSN, and creates a sample UHD (ASTM E31.12, 1994). Despite the advantages 
of a modified/new patient identifier, there is not yet a consensus as to who would bear the cost of adopting 
a new patient identifier system. 

Provider Identifiers 

The Health Care Financing Administration (HCFA) has created a widely used provider identifier known 
as the Universal Physician Identifier Number (UPIN) [Terrell et al., 1991]. The UPIN is assigned to 
physicians who handle Medicare patients, but it does not include nonphysician care givers. The National 
Council of Prescription Drug Programs (NCPDP) has developed the standard prescriber identification 
number (SPIN) to be used by pharmacists in retail settings. A proposal to develop a new national provider 
identifier number has been set forth by HCFA [HCFA, 1994]. If this proposal is accepted, then HCFA 
would develop a national provider identifier number which would cover all caregivers an sites of care, 
including Medicare, Medicaid, and private care. This proposal is being reviewed by various state and 
federal agencies. It has also been sent to the American National Standards, Institute's Health Care 
Informatics Standards Planning Panel (ANSI HISPP) Task Force on Provider Identifiers for review. 

Site-of-Care Identifiers 

Two site-of-care identifier systems are widely used. One is the health industry number (HIN) issued by 
the Health Industry Business Communications Council (HIBCC). The HIN is an identifier for health 
care facilities, practitioners, and retail pharmacies. HCFA has also defined provider of service identifiers 
for Medicare usage. 

Product and Supply Labeling Identifiers 

Three identifiers are widely accepted. The labeler identification code (LIC) identifies the manufacturer or 
distributor and is issued by HIBCC [HIBCC, 1994]. The LIC is used both with and without bar codes for 
products and supplies distributed within a health care facility. The universal product code (UPC) is main- 
tained by the Uniform Code Council and is typically used to label products that are sold in retail settings. 
The national drug code is maintained by the Food and Drug Administration and is required for reimburse- 
ment by Medicare, Medicaid, and insurance companies. It is sometimes included within the UPC format. 
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178.2 Communications (Message Format) Standards 

Although the standards in this topic area are still in various stages of development, they are generally 
more mature than those in most of the other topic areas. They are typically developed by committees 
within standards organizations and have generally been accepted by users and vendors. The overviews 
of these standards below were derived from many sources, but considerable content came from the 
Computer-based Patient Record Institute's (CPRI) "Position Paper on Computer-based Patient Record 
Standards" [CPRI, 1994] and the Agency for Health Care Policy and Research's (AHCPR) "Current 
Activities of Selected Health Care Informatics Standards Organizations" [Moshman Associates, 1994]. 

ASC X12N 

This committee is developing message format standards for transactions between payors and providers. 
It is rapidly being accepted by both users and vendors. It defines the message formats for the following 
transaction types [Moshman Associates, 1994]: 

• 834 — enrollment 

• 270 — eligibility request 

• 271 — eligibility response 

• 837 — health care claim submission 

• 835 — health care claim payment remittance 

• 276 — claims status request 

• 277 — claims status response 

• 148 — report of injury or illness 

ASC X12N is also working on the following standards to be published in the near future: 

• 257, 258 — Interactive eligibility response and request. These transactions are an abbreviated form 
of the 270/271. 

• 274, 275 — patient record data response and request. These transactions will be used to request 
and send patient data (tests, procedures, surgeries, allergies, etc.) between a requesting party and 
the party maintaining the database. 

• 278, 279 — health care services (utilization review) response and request. These transactions will 
be used to initiate and respond to a utilization review request. 

ASC XI 2N is recognized as an accredited standards committee (ASC) by the American National Standards 
Institute (ANSI). 

American Society for Testing and Materials (ASTM) 
Message Format Standards 

The following standards were developed within ASTM Committee E3 1 . This committee has applied for 
recognition as an ASC by ANSI. 

• ASTM E1238 standard specification for transferring clinical observations between independent 
systems. El 238 was developed by ASTM Subcommittee E31.ll. This standard is being used by 
most of the largest commercial laboratory vendors in the United States to transmit laboratory 
results. It has also been adopted by a consortium of 25 French laboratory system vendors. Health 
Level Seven (HL7), which is described later in this topic area, has incorporated E1238 as a subset 
within its laboratory results message format [CPRI, 1994]. 

• ASTM El 394 standard specification for transferring information between clinical instruments. 
E1394 was developed by ASTM Subcommittee E31.14. This standard is being used for commu- 
nication of information from laboratory instruments to computer systems. This standard has been 
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developed by a consortium consisting of most U.S. manufacturers of clinical laboratory instru- 
ments [CPRI, 1994]. 

• ASTM 1460 specification for denning and sharing modular health knowledge bases (Arden Syn- 
tax). E1460 was developed by ASTM Subcommittee E31.15. The Arden Syntax provides a standard 
format and syntax for representing medical logic and for writing rules and guidelines that can be 
automatically executed by computer systems. Medical logic modules produced in one site of care 
system can be sent to a different system within another site of care and then customized to reflect 
local usage [CPRI, 1994]. 

• ASTM E1467 specification for transferring digital neurophysical data between independent com- 
puter systems. E1467 was developed by ASTM Subcommittee E31.17. This standard defines codes 
and structures needed to transmit electrophysiologic signals and results produced by electroen- 
cephalograms and electromyograms. The standard is similar in structure to ASTM El 238 and 
HL7; and it is being adopted by all the EEG systems manufacturers [CPRI, 1994]. 

Digital Imaging and Communications [DICOM] 

This standard is developed by the American College of Radiology — National Electronic Manufacturers' 
Association (ACR-NEMA). It defines the message formats and communications standards for radiologic 
images. DICOM is supported by most radiology Picture Archiving and Communications Systems (PACS) 
vendors and has been incorporated into the Japanese Image Store and Carry (ISAC) optical disk system 
as well as Kodak's PhotoCD. ACR-NEMA is applying to be recognized as an accredited organizations by 
ANSI [CPRI, 1994]. 

Health Level Seven [HL7] 

HL7 is used for intrainstitution transmission of orders; clinical observations and clinical data, including 
test results; admission, transfer, and discharge records; and charge and billing information. HL7 is being 
used in more than 300 U.S. health care institutions including most leading university hospitals and has 
been adopted by Australia and New Zealand as their national standard. HL7 is recognized as an accredited 
organization by ANSI [Hammond, 1993; CPRI, 1994]. 

Institute of Electrical and Electronic Engineers, Inc. (IEEE) PI 157 
Medical Data Interchange Standard (MEDIX) 

IEEE Engineering in Medicine and Biology Society (EMB) is developing the MEDIX standards for the 
exchange of data between hospital computer systems [Harrington, 1993; CPRI, 1994]. Based on the 
International Standards Organization (ISO) standards for all seven layers of the OSI reference model, 
MEDIX is working on a framework model to guide the development and evolution of a compatible set 
of standards. This activity is being carried forward as a joint working group under ANSI HISPP's Message 
Standards Developers Subcommittee (MSDS). IEEE is recognized as an accredited organization by ANSI. 

IEEE P1073 Medical Information Bus (MIB) 

This standard defines the linkages of medical instrumentation (e.g., critical care instruments) to point- 
of-care information systems [CPRI, 1994]. 

National Council for Prescription Drug Programs (NCPDP) 

These standards developed by NCPDP are used for communication of billing and eligibility information 
between community pharmacies and third-party payers. They have been is use since 1985 and now serve 
almost 60% of the nation's community pharmacies. NCPDP has applied for recognition as an accredited 
organization by ANSI [CPRI, 1994]. 
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178.3 Content and Structure Standards 

Guidelines and standards for the content and structure of computer-based patient record (CPR) systems 
are being developed within ASTM Subcommittees E31.12 and E31.19. They have been recognized by 
other standards organizations (e.g., HL7); however, they have not matured to the point where they are 
generally accepted or implemented by users and vendors. 

A major revision to E1384, now called a standard description for content and structure of the computer- 
based patient record, has been made within Subcommittee E31.19 [ASTM, 1994]. This revision includes 
work from HISPP on data modeling and an expanded framework that includes master tables and data 
views by user. 

Companion standards have been developed within E31.19. They are E1633, A Standard Specification 
for the Coded Values Used in the Automated Primary Record of Care (ASTM, 1994), and E1239-94, A 
Standard Guide for Description of Reservation/Registration-A/D/T Systems for Automated Patient Care 
Information Systems (ASTM, 1994). A draft standard is also being developed for object-oriented models 
for R-A/D/T functions in CPR systems. Within the E31.12 Subcommittee, domain specific guidelines for 
nursing, anesthesiology, and emergency room data within the CPR are being developed [Moshman 
Associates, 1994; Waegemann, 1994]. 

178.4 Clinical Data Representations (Codes) 

Clinical data representations have been widely used to document diagnoses and procedures. There are 
over 150 known code systems. The codes with the widest acceptance in the United States include: 

• International Classification of Diseases (ICD) codes, now in the ninth edition (ICD-9), are main- 
tained by the World Health Organization (WHO) and are accepted worldwide. In the United States, 
HCFA and the National Center for Health Statistics (NCHS) have supported the development of 
a clinical modification of the ICD codes (ICD-9-CM). WHO has been developing ICD- 10; however, 
HCFA projects that it will not be available for use within the United States for several years. Payers 
require the use of ICD-9-CM codes for reimbursement purposes, but they have limited valued for 
clinical and research purposes due to their lack of clinical specificity [Chute, 1991]. 

• Current Procedural Terminology (CPT) codes are maintained by the American Medical Associa- 
tion (AMA) and are widely used in the United States for reimbursement and utilization review 
purposes. The codes are derived from medical specialty nomenclatures and are updated annually 
[Chute 1991]. 

• The Systematized Nomenclature of Medicine (SNOMED) is maintained by the College of Amer- 
ican Pathologists and is widely accepted for describing pathologic test results. It has a multiaxial 
(11 fields) coding structure that gives it greater clinical specificity than the ICD and CPT codes, 
and it has considerable value for clinical purposes. SNOMED has been proposed as a candidate 
to become the standardized vocabulary for computer-based patient record systems (Rothwell et al., 
1993). 

• Digital Imaging and Communications (DICOM) is maintained by the American College of Radi- 
ology — National Electronic Manufacturers' Association (ACR-NEMA). It sets forth standards for 
indicies of radiologic diagnoses as well as for image storage and communications [ Cannavo, 1993]. 

• Diagnostic and Statistical Manual of Mental Disorders (DSM), now in its fourth edition (DSM- 
IV), is maintained by the American Psychiatric Association. It sets forth a standard set of codes 
and descriptions for use in diagnoses, prescriptions, research, education, and administration 
[Chute, 1991]. 

• Diagnostic Related Groups (DRGs) are maintained by HCFA. They are derivatives of ICD-9-CM 
codes and are used to facilitate reimbursement and case-mix analysis. They lack the clinical 
specificity to be of value in direct patient care or clinical research [Chute, 1991]. 
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• Unified Medical Language System (UMLS) is maintained by the National Library of Medicine 
(NLM). It contains a metathesaurus that links clinical terminology, semantics, and formats of the 
major clinical coding and reference systems. It links medical terms (e.g., ICD, CPT, SNOMED, 
DSM, CO-STAR, and D-XPLAIN) to the NLM's medical index subject headings (MeSH codes) 
and to each other [Humphreys, 1991; Cimino et al., 1993]. 

• The Canon Group has not developed a clinical data representation, but it is addressing two 
important problems: clinical data representations typically lack clinical specificity and/or are 
incapable of being generalized or extended beyond a specific application. "The Group proposes 
to focus on the design of a general schema for medical-language representation including the 
specification of the resources and associated procedures required to map language (including 
standard terminologies) into representations that make all implicit relations 'visible,' reveal 'hidden 
attributes', and generally resolve... ambiguous references" [Evans et al., 1994]. 

178.5 Confidentiality, Data Security, and Authentication 

The development of compute-based patient record systems and health care information networks have 
created the opportunity to address the need for more definitive confidentiality, data security, and authen- 
tication guidelines and standards. The following activities address this need: 

• During 1994, several bills were drafted in Congress to address health care privacy and confiden- 
tiality. They included the Fair Health Information Practices Act of 1994 (H.R. 4077), the Health 
Care Privacy Protection Act (S. 2129), and others. Although these bills were not passed as drafted, 
their essential content is expected to be included as part of subsequent health care reform legis- 
lation. They address the need for uniform comprehensive federal rules governing the use and 
disclosure of identifiable health, information about individuals. They specify the responsibilities 
of those who collect, use, and maintain health information about patients. They also define the 
rights of patients and provide a variety of mechanisms that will allow patients to enforce their 
rights. 

• ASTM Subcommittee E31.12 on Computer-based Patient Records is developing Guidelines for 
Minimal Data Security Measures for the Protection of Computer-based patient Records [Moshman 
Associates, 1994]. 

• ASTM Subcommittee E3 1 . 1 7 on Access, Privacy, and Confidentiality of Medical Records is working 
on standards to address these issues [Moshman Associates, 1994]. 

• ASTM Subcommittee E31.20 is developing standard specifications for authentication of health 
information [Moshman Associates, 1994]. 

• The Committee on Regional Health Data Networks convened by the Institute of Medicine (IOM) 
has completed a definitive study and published its findings in a book entitled Health Data in the 
Information Age: Use, Disclosure, and Privacy [Donaldson & Lohr, 1994]. 

• The Computer-based Patient Record Institute's (CPRI) Work Group on Confidentiality, Privacy 
and Legislation has completed white papers on "Access to Patient Data" and on "Authentication," 
and a publication entitled "Guidelines for Establishing Information Security: Policies at Organi- 
zations using Computer-based Patient Records" [CPRI, 1994]. 

• The Office of Technology Assessment has completed a two-year study resulting in a document 
entitled "Protecting Privacy in Computerized Medical Information." It includes a comprehensive 
review of system/data security issues, privacy information, current laws, technologies used for 
protection, and models. 

• The U.S. Food and Drug Administration (FDA) has created a task force on Electronic/Identification 
Signatures to study authentication issues as they relate to the pharmaceutical industry. 
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178.6 Quality Indicators and Data Sets 

The Joint Commission on Accreditation of Health Care Organizations (JCAHO) has been developing 
and testing obstetrics, oncology, trauma, and cardiovascular clinical indicators. These indicators are 
intended to facilitate provider performance measurement. Several vendors are planning to include 
JCAHO clinical indicators in their performance measurement systems [JCAHO, 1994]. 

The Health Employers Data and Information Set (HEDIS) version 2.0 has been developed with the 
support of the National Committee for Quality Assurance (NCQA). It identifies data to support perfor- 
mance measurement in the areas of quality (e.g., preventive medicine, prenatal care, acute and chronic 
disease, and mental health), access and patient satisfaction, membership and utilization, and finance. 
The development of HEDIS has been supported by several large employers and managed care organiza- 
tions [NCQA, 1993]. 

178.7 International Standards 

The International Organization for Standardization (ISO) is a worldwide federation of national standards 
organizations. It has 90 member countries. The purpose of ISO is to promote the development of 
standardization and related activities in the world. ANSI was one of the founding members of ISO and 
is representative for the United States [Waegemann, 1994]. 

ISO has established a communications model for Open Systems Interconnection (OSI) IEEE/MEDIX 
and HL7 have recognized and built upon the ISO/OSI framework. Further, ANSI HISPP has a stated 
objective of encouraging compatibility of U.S. health care standards with ISO/OSI. The ISO activities 
related to information technology take place within the Joint Technical Committee (JTC) 1. 

The Comite Europeen de Noramalisation (CEN) is a European standards organization with 16 tech- 
nical committees (TCs). Two TCs are specifically involved in health care: TC 251 (Medical Informatics) 
and TC 224 WG12 (Patient Data Cards) [Waegemann, 1994]. 

The CEN TC 251 on Medical Informatics includes work groups on: Modeling of Medical Records; 
Terminology, Coding, Semantics, and Knowledge Bases; Communications and Messages; Imaging and 
Multimedia; Medical Devices; and Security, Privacy, Quality and Safety. The CEN TC 251has established 
coordination with health care standards development in the United States through ANSI/HISPP. 

In addition to standards developed by ISO and CEN, there are two other standards of importance. 
United Nations (U.N.) EDIFACT is a generic messaging-based communications standard with health- 
specific subsets. It parallels X12 and HL7, which are transaction-based standards. It is widely used in 
Europe and in several Latin America countries. The READ Classification System (RCS) is a multiaxial 
medical nomenclature used in the United Kingdom. It is sponsored by the National Health Service and 
has been integrated into computer-based ambulatory patient record systems in the United Kingdom 
[CAMS, 1994]. 

178.8 Standards Coordination and Promotion Organizations 

In the United States, two organizations have emerged to assume responsibility for the coordination and 
promotion of health care standards development: the ANSI Health Care Informatics Standards Planning 
Panel (HISPP) and the Computer-based Patient Record Institute (CPRI).The major missions of an ANSI 
HISPP are: 

1. To coordinate the work of the standards groups for health care data interchange and health care 
informatics (e.g., ACR/NEMA, ASTM, HL7, IEEE/MEDIX) and other relevant standards groups (e.g., 
X3, X12) toward achieving the evolution of a unified set of nonredundant, non-conflicting standards. 

2. To interact with and provide input to CEN TC 251 (Medical Informatics) in a coordinated fashion 
and explore avenues of international standards development. 
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The first mission of coordinating standards is performed by the Message Standards Developers Subcom- 
mittee (MSDS). The second mission is performed by the International and Regional Standards Subcom- 
mittee. HISPP also has four task groups: (1) Codes and Vocabulary, (2) Privacy, Security and 
Confidentiality, (3) Provider Identification Numbering Systems, and (4) Operations. Its principle mem- 
bership is comprised of representatives of the major health care standards development organizations 
(SDOs), government agencies, vendors, and other interested parties. ANSI HISPP is by definition a 
planning panel, not an SDO (Hammond, 1994; ANSI HISPP, 1994). 

The CPRI's mission is to promote acceptance of the vision set forth in the Institute of Medicine Study 
report "The Computer-based Patient Record: An Essential Technology for Health Care." CRPI is a 
nonprofit organization committed to initiating and coordinating activities to facilitate and promote the 
routine use of computer-based patient records. The CPRI takes initiatives to promote the development 
of CPR standards, but it is not an SDO itself. CPRI members represent the entire range of stakeholders 
in the health care delivery system. Its major work groups are the: (1) Codes and Structures Work Group; 
(2) CPR Description Work Group; (3) CPR Systems Evaluation Work Group; (4) Confidentiality, Privacy, 
and Legislation Work Group; and (5) Professional and Public Education Work Group [CPRI, 1994]. 

Two work efforts have been initiated to establish models for principle components of the emerging 
health care information infrastructure. The CPR Description Work Group of the CPRI is defining a 
consensus-based model of the computer-based patient record system. A joint working group to create a 
common data description has been formed by the MSDS Subcommittee of ANSI HISPP and 
IEEE/MEDIX. The joint working group is an open standards effort to support the development of a 
common data model tat can be shared by developers of health care informatics standards [IEEE, 1994]. 

The CPRI has introduced a proposal defining a public/private effort to accelerate standards develop- 
ment for computer-based patient record systems [CPRI, 1994]. If funding becomes available, the project 
will focus on obtaining consensus for a conceptual description of a computer-based patient record system; 
addressing the need for universal patient identifiers; developing standard provider and sites-of-care 
identifiers; developing confidentiality and security standards; establishing a structure for and developing 
key vocabulary and code standards; completing health data interchange standards; developing imple- 
mentation tools; and demonstrating adoptability of standards in actual settings. This project proposes 
that the CPRI and ANSI HISPP work together to lead, promote, coordinate and accelerate the work of 
SDOs to develop health care information standards. 

The Workgroup on Electronic Data Interchange ( WEDI) is a voluntary, public/private task force which 
was formed in 1991 as a result of the call for health care administrative simplification by the director of 
the Department of Health and Human Services, Dr. Louis Sullivan. They have developed an action plan 
to promote health care EDI which includes: promotion of EDI standards, architectures, confidentiality, 
identifiers, health cards, legislation, and publicity [WEDI, 1993]. 

178.9 Summary 

This chapter has presented an overview of major existing and emerging health care information infra- 
structure standards and the efforts to coordinate, harmonize, and accelerate these activities. Health care 
informatics is a dynamic area characterized by changing business and clinical processes, functions, and 
technologies. The effort to create health care informatics standards is therefore also dynamic. For the 
most current information on standards, refer to the "For More Information" section at the back of this 
chapter. 
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Further Information 

For copies of standards accredited by ANSI, you can contact the American National Standards Institute, 
11 West 42d St., NY, NY 10036, (212) 642-4900. For information on ANSI Health Care Informatics 
Standards Planning Panel (HISPP), contact Steven Cornish, (212) 642-4900. 

For copies of individual ASTM standards, you can contact the American Society for Testing and 
Materials, 1916 Race Street, Philadelphia, PA 19103-1187, (215) 299-5400. 

For copies of the "Proposal to Accelerate Standards Development for Computer-based Patient Record 
Systems," contact the Computer-based Patient Record Institute (CPRI), Margaret Amatayakul, 1000 E. 
Woodfield Road, Suite 102, Schaumburg, IL 60173, (708) 706-6746. 

For information on provider identifier standards and proposals, contact the Health Care Financing 
Administration (HCFA), Bureau Program Operations, 6325 Security Blvd., Baltimore, MD 21207, (410) 
966-5798. For information on ICD-9-CM codes, contact HCFA, Medical Coding, 401 East Highrise Bldg. 
6325 Security Blvd., Baltimore, MD 21207, (410) 966-5318. 

For information on site-of-care and supplier labeling identifiers, contact the Health Industry Business 
Communications Council (HIBCC), 5110 N. 40th Street, Suite 250, Phoenix, AZ 85018, (602) 381-1091. 

For copies of standards developed by Health Level 7, you can contact HL7, 3300 Washtenaw Avenue, 
Suite 227, Ann Arbor, MI 48104, (313) 665-0007. 

For copies of standards developed by the Institute of Electrical and Electronic Engineers/ Engineering 
in Medicine and Biology Society, in New York City, call (212) 705-7900. For information on IEEE/MEDIX 
meetings, contact Jack Harrington, Hewlett-Packard, 3000 Minuteman Rd., Andover, MA 01810, (508) 
681-3517. 

For more information on clinical indicators, contact the Joint Commission on Accreditation of Health 
Care Organizations (JCAHO), Department of Indicator Measurement, One Renaissance Blvd., Oakbrook 
Terrace, IL 60181, (708) 916-5600. 

For information on pharmaceutical billing transactions, contact the National Council for Prescription 
Drug Programs (NCPDP), 2401 N. 24th Street, Suite 365, Phoenix, AZ 85016, (602) 957-9105. 

For information on HEDIS, contact the National Committee for Quality Assurance (NCQA), Planning 
and Development, 1350 New York Avenue, Suite 700, Washington, DC. 20005, (202) 628-5788. 

For copies of ACR/NEMA DICOM standards, contact David Snavely, National Equipment Manufac- 
turers Association (NEMA), 2101 L. Street N.W, Suite 300, Washington, D.C. 20037, (202) 457-8400. 

For information on standards development in the areas of computer-based patient record concept 
models, confidentiality, data security, authentication, and patient cards, and for information on standards 
activities in Europe, contact Peter Waegemann, Medical Records Institute (MRI), 567 Walnut, P.O. Box 
289, Newton, MA 02160. (617) 964-3923. 
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179.4 Summary 



Non-AI decision making can be defined as those methods and tools used to increase information content 
in the context of some specific clinical situation without having cause to refer to knowledge embodied 
in a computer program. Theoretical advances in the 1950s added rigor to this domain when Meehl argued 
that many clinical decisions could be made by statistical rather than intuitive means [ 1 ] . Evidence of this 
view was supported by Savage [2], whose theory of choice under uncertainty is still the classical and 
most elegant formulation of subjective Bayesian decision theory, and was much responsible for reintro- 
ducing Bayesian decision analysis to clinical medicine. Ledley and Lusted provided further evidence that 
medical reasoning could be made explicit and represented in decision theoretic ways [3] . Decision theory 
also provided the means for Nash to develop a "Logoscope" which might be considered as the first 
mechanical diagnostic aid [4]. 

An information system developed using non-AI decision-making techniques may comprise procedural 
or declarative knowledge. Procedural knowledge maps the decision-making process into the methods by 
which the clinical problems are solved or clinical decisions made. Examples of techniques which form a 
procedural knowledge base are those which are based on algorithmic analytical models, clinical algo- 
rithms, or decision trees. Information systems based on declarative knowledge comprise what can essen- 
tially be termed a database of facts about different aspects of a clinical problem; the causal relationships 
between these facts form a rich network from which explicit (say) cause-effect pathways can be deter- 
mined. Semantic networks and causal probabilistic networks are perhaps the best examples of information 
systems based on declarative knowledge. There are other types of clinical decision aids, based purely on 
statistical methods applied to patient data, for example, classification analyses based on logistic regres- 
sion, relative frequencies of occurrence, pattern-matching algorithms, or neural networks. 

The structure of this chapter mirrors to some extent the different methods and techniques of non-AI 
decision making mentioned above. It is important to distinguish between analytical models based on 
quantitative or qualitative mathematical representations and decision theoretic methods typified by the 
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use of clinical algorithms, decision trees, and set theory. Most of the latter techniques add to an infor- 
mation base by way of procedural knowledge. It is then, that advantage can be taken of the many 
techniques which have statistical decision theoretic principles as their underpinning. 

This section begins with a discussion of simple linear regression models and pattern recognition, but 
then more complex statistical techniques are introduced, for example, the use of Bayesian decision analysis 
which leads to the introduction of causal probabilistic networks. The majority of these techniques add 
information by use of declarative knowledge. Particular applications are used throughout to illustrate 
the extent to which non-AI decision making is used in clinical practice. 

179.1 Analytical Models 

In the context of this chapter, the analytical models considered are qualitative and quantitative mathe- 
matical models that are used to predict future patient state based on present state and a historical 
representation of what has passed. Such models could be representations of system behavior that allow 
test signals to be used so that response of the system to various disturbances can be studied, thus making 
predictions of future patient state. 

For example, Leaning and co-workers [5,6] produced a 19-segment quantitative mathematical model 
of the blood circulation to study the short-term effects of drugs on the cardiovascular system of normal, 
resting patients. The model represented entities such as the compliance, flow, and volume of model 
segments in what was considered a closed system. In total, the quantitative mathematical model com- 
prised 61 differential equations and 159 algebraic equations. Evaluation of the model revealed that it was 
fit for its purpose in the sense of heuristic validity, that is it could be used as a tool for developing 
explanations for cardiovascular control, particularly in relation to the CNS. 

Qualitative models investigate time-dependent behavior by representing patient state trajectory in the 
form of a set of connected nodes, the links between the nodes reflecting transitional constraints placed 
on the system [7]. The types of decision making supported by this type of model are assessment and 
therapy planning. In diagnostic assessment, the precursor nodes and the pathway to the node (decision) 
of interest define the causal mechanisms of the disease process. Similarly, for therapy planning, the optimal 
plan can be set by investigation of the utility values associated with each link in the disease-therapy 
relationship. These utility values refer to a cost function, where cost can be defined as the monetary cost 
of providing the treatment and cost benefit to the patient in terms of efficiency, efficacy, and effectiveness 
of alternative treatment options. Both quantitative [8] and qualitative [9] analytical models can be realized 
in other ways to form the basis of rule-based systems; however that excludes their analysis in this chapter. 

179.2 Decision Theoretic Models 
Clinical Algorithms 

The clinical algorithm is a procedural device that mimics clinical decision making by structuring the 
diagnostic or therapeutic decision processes in the form of a classification tree. The root of the tree 
represents some initial state, and the branches yield the different options available. For the operation of 
the clinical algorithm, the choice points are assumed to follow branching logic with the decision function 
being a yes/no (or similar) binary choice. Thus, the clinical algorithm comprises a set of questions that 
must be collectively exhaustive for the chosen domain, and the responses available to the clinician at 
each branch point must be mutually exclusive. These decision criteria pose rigid constraints on the type 
of medical problem that can be represented by this method, as the lack of flexibility is appropriate only 
for a certain set of well-defined clinical domains. Nevertheless, there is a rich literature available; examples 
include the use of the clinical algorithm for acid-base disorders [10] and diagnosis of mental disorders 
[111. 
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Decision Trees 

A more rigorous use of classification tree representations than the clinical algorithm can be found in 
decision tree analysis. Although from a structural perspective, decision trees and clinical algorithms are 
similar in appearance, for decision tree analysis the likelihood and cost-benefit for each choice are also 
calculated in order to provide a quantitative measure for each option available. This allows the use of 
optimization procedures to gauge the probability of success for the correct diagnosis being made or for 
a beneficial outcome from therapeutic action being taken. A further difference between the clinical 
algorithm and decision tree analysis is that the latter has more than one type of decision node (branch 
point): at decision nodes the clinician must decide on which choice (branch) is appropriate for the given 
clinical scenario; at chance nodes the responses available have no clinician control, for example, the 
response may be due to patient specific data; and outcome nodes define the chance nodes at the "leaves" 
of the decision tree. That is, they summarize a set of all possible clinical outcomes for the chosen domain. 

The possible outcomes from each chance node must obey the rules of probability and sum to unity; 
the probability assigned to each branch reflects the frequency of that event occurring in a general patient 
population. It follows that these probabilities are dynamic, with accuracy increasing, as more evidence 
becomes available. A utility value can be added to each of the outcome scenarios. These utility measures 
reflect a trade-off between competing concerns, for example, survivability and quality of life, and may 
be assigned heuristically. 

When the first edition of this chapter was written in 1995 it was noted that although a rich literature 
describing potential applications existed [12], the number of practical applications described was limited. 
The situation has changed and there has been an explosion of interest in applying decision analysis to 
clinical problems. Not only is decision analysis methodology well described [13-16] but there are numer- 
ous articles appearing in mainstream medical journals, particularly Medical Decision Making. An impor- 
tant driver for this acceleration of interest has been the desire to contain costs of medical care, while 
maintaining clinical effectiveness and quality of care. Cost-effectiveness analysis is an extension of decision 
analysis and compares the outcome of decision options in terms of the monetary cost per unit of 
effectiveness. Thus, it can be used to set priorities for the allocation of resources and to decide between 
one or more treatment or intervention options. It is most useful when comparing treatments for the 
same clinical condition. Cost-effectiveness analysis and its implications are described very well elsewhere 
[17,18]. 

Influence Diagrams 

In the 1960s researchers at Stanford Research Institute (SRI) proposed the use of influence diagrams as 
representational models when developing computer programs to solve decision problems. However, it 
was recognized somewhat later by decision analysts at SRI [19] that such diagrams could be used to 
facilitate communication with domain experts when eliciting information about complex decision prob- 
lems. Influence diagrams are a powerful mode of graphic representation for decision modeling. They do 
not replace but complement decision trees, and it should be noted that both are different graphical 
representations of the same mathematical model and operations. Recently, two exciting papers have been 
published that make use of influence diagrams accessible to those interested in medical decision making 
[20,21]. 

179.3 Statistical Models 

Database Search 

Interrogation of large clinical databases yields statistical evidence of diagnostic value and in some rep- 
resentations form the basis of rule induction used to build expert systems [22]. These systems will not 
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be discussed here. However the most direct approach for clinical decision making is to determine the 
relative frequency of occurrence of an entity, or more likely a group of entities, in the database of past 
cases. This enables a prior probability measure to be estimated [23]. A drawback of this simple, direct 
approach to problem solving is the apparent tautology of the more evidence available leading to fewer 
matches in the database being found; this runs against common wisdom that more evidence leads to an 
increase in probability of a diagnosis being found. Further, the method does not provide a weight for 
each item of evidence to gauge those that are more significant for patient outcome. 

Regression Analysis 

Logistic regression analysis is used to model the relationship between a response variable of interest and 
a set of explanatory variables. This is achieved by adjusting the regression coefficients, the parameters of 
the model, until a 'best fit' to the data set is achieved. This type of model improves upon the use of 
relative frequencies, as logistic regression explicitly represents the extent to which elements of evidence 
are important in the value of the regression coefficients. An example of clinical use can be found in the 
domain of gastroenterology [24]. 

Statistical Pattern Analysis 

The recognition of patterns in data can be formulated as a statistical problem of classifying the results 
of clinical findings into mutually exclusive but collectively exhaustive decision regions. In this way, not 
only can physiologic data be classified but also the pathology that they give rise to and the therapy options 
available to treat the disease. Titterington [25] describes an application in which patterns in a complex 
data set are recognized to enhance the care of patients with head injuries. Pattern recognition is also the 
cornerstone of computerized methods for cardiac rhythm analysis [26]. The methods used to distinguish 
patterns in data rely on discriminant analysis. In simple terms, this refers to a measure of separability 
between class populations. 

In general, pattern recognition is a two stage process as shown in Fig. 179.1. The pattern vector, P, is 
an n-dimensional vector derived from the data set used. Let D. be the pattern space, which is the set of 
all possible values P may assume, then the pattern recognition problem is formulated as finding a way 
of dividing D. into mutually exclusive and collectively exhaustive regions. For example, in the analysis 
of the electrocardiogram the complete waveform may be used to perform classifications of diagnostic 
value. A complex decision function would probably be required in such cases. Alternatively (and if 
appropriate), the pattern vector can be simplified to investigation of sub features within a pattern. For 
cardiac arrhythmia analysis, only the R-R interval of the electrocardiogram is required which allows a 
much simpler decision function to be used. This may be a linear or non-linear transformation process: 

X = xP 

where X is termed the feature vector and x is the transformation process. 
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Just as the pattern vector P belongs to a pattern space Q , so the feature vector X belongs to a feature 
space Q x . As the function of feature extraction is to reduce the dimensionality of the input vector to the 
classifier, some information is lost. 

Classification of Q x can be achieved using numerous statistical methods including: discriminant func- 
tions (linear and polynomial); kernel estimation; k-nearest neighbor; cluster analysis; and Bayesian 
analysis. 

Bayesian Analysis 

Ever since their reinvestigation by Savage in 1954 [2], Bayesian methods of classification have provided 
one of the most popular approaches used to assist in clinical decision making. Bayesian classification is 
an example of a parametric method of estimating class conditional probability density functions. Clinical 
knowledge is represented as a set of prior probabilities of diseases to be matched with conditional 
probabilities of clinical findings in a patient population with each disease. The classification problem 
becomes one of a choice of decision levels, which minimizes the average rate of misclassification or to 
minimise the maximum of the conditional average loss function (the so-called minimax criterion) when 
information about prior probabilities is not available. Formally, the optimal decision rule which mini- 
mizes the average rate of misclassification is called the Bayes rule; this serves as the inference mechanism 
that allows the probabilities of competing diagnoses to be calculated when patient specific clinical findings 
become available. 

The great advantage of Bayesian classification is that a large clinical database of past cases is not 
required, thus allowing the time taken to reach a decision to be faster compared with other database 
search techniques; furthermore, classification errors due to the use of inappropriate clinical inferences 
are quantifiable. However, a drawback of this approach to clinical decision making is that the disease 
states are considered as complete and mutually exclusive, whereas in real life neither assumption may be 
true. 

Nevertheless, Bayesian decision functions as a basis for differential diagnosis have been used success- 
fully, for example, in the diagnosis of acute abdominal pain [27]. De Dombal first described this system 
in 1972, but it took another 20 years or so for it to be accepted via a multi-center multinational trial. 
The approach has been exploited in ILIAD; this is a commercially available [28] computerized diagnostic 
decision support system with some 850-990 frames in its knowledge base. As it is a Bayesian system, 
each frame has the prevalence of a disease for its prior probability. There is the possibility however, that 
the prevalence rates may not have general applicability. This highlights a very real problem, namely the 
validity of relating causal pathways in clinical thinking and connecting such pathways to a body of 
validated (true) evidence. Ideally, such evidence will come from randomized controlled clinical or epi- 
demiological trials. However, such studies may be subject to bias. To overcome this, Eddy and co-workers 
devised the Confidence Profile Method [29]. This is a set of quantitative techniques for interpreting and 
displaying the results of individual studies (trials); exploring the effects of any biases that might affect 
the internal validity of the study; adjusting for external validity; and, finally, combining evidence from 
several sources. This meta-analytical approach can formally incorporate experimental evidence and, in 
a Bayesian fashion, also the results of previous analytical studies or subjective judgments about specific 
factors that might arise when interpreting evidence. Influence diagram representations play an important 
role in linking results in published studies and estimates of probabilities and statements about causality. 

Currently, much interest is being generated as to how, what is perceived as the Bayesian action-oriented 
approach can be used in determining health policy, where the problem is perceived to be a decision 
problem rather than a statistical problem, (see for instance Lilford and Braunholz [30]). 

Dempster- Shafer Theory 

One way to overcome the problem of mutually exclusive disease states is to use an extension to Bayesian 
classification put forward by Dempster [31] and Shafer [32] . Here, instead of focusing on a single disorder, 
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the method can deal with combinations of several diseases. The key concept used is that the set of all 
possible diseases is partitioned into n-tuples of possible disease state combinations. A simple example 
will illustrate this concept. Suppose there is a clinical scenario in which four disease states describe the 
whole hypothesis space. Each new item of evidence will impact on all the possible subsets of the hypothesis 
space and is represented by a function, the basic probability assignment. This measure is a belief function 
that must obey the laws of probability and sum to unity across the subsets impacted upon. In the example, 
all possible subsets comprise: one which has all four disease states in it; four which have three of the four 
diseases as members; six which have two diseases as members; and finally, four subsets which have a 
single disease as a member. Thus, when new evidence becomes available in the form of a clinical finding, 
only certain hypotheses, represented by individual subsets, may be favored. 

Syntactic Pattern Analysis 

As demonstrated above, a large class of clinical problem solving using statistical methods involves 
classification or diagnosis of disease states, selection of optimal therapy regimes, and prediction of patient 
outcome. However, in some cases the purpose of modelling is to reconstruct the input signal from the 
data available. This cannot be done by methods discussed thus far. The syntactic approach to pattern 
recognition uses a hierarchical decomposition of information and draws upon an analogy to the syntax 
of language. Each input pattern is described in terms of more simple subpatterns which themselves are 
decomposed into simpler subunits, until the most elementary subpatterns, termed the pattern primitives, 
are reached. The pattern primitives should be selected so that they are easy to recognize with respect to 
the input signal. Rules which govern the transformation of pattern primitives back (ultimately) to the 
input signal are termed the grammar. 

In this way a string grammar, G, which is easily representable in computer-based applications, can be 
defined: 

G = {V T ,V N , S,P} 

where, V T are the terminal variables (pattern primitives); V N are the nonterminal variables; S is the start 
symbol; and P is the set of production rules which specify the transformation between each level of the 
hierarchy. It is an important assumption that in set theoretic terms, the union of V T and V N is the total 
vocabulary of G, and the intersection of V T and V N is the null (empty) set. 

A syntactic pattern recognition system therefore comprises three functional subunits (Fig. 179.2): a pre- 
processor — this manipulates the input signal, P, into a form that can be presented to the pattern descriptor; 
the pattern descriptor which assigns a vocabulary to the signal; and the syntax analyzer which classifies the 
signal accordingly. This type of system has been used successfully to represent the electrocardiogram [33, 34] 
and the electroencephalogram [35] and for representation of the carotid pulse wave [36]. 

Causal Modeling 

A causal probabilistic network (CPN) is an acyclic multiply-connected graph which at a qualitative level 
comprises nodes and arcs [37]. Nodes are the domain objects and may represent, for example, clinical 
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findings, pathophysiologic states, diseases, or therapies. Arcs are the causal relationships between succes- 
sive nodes and are directed links. In this way the node and arc structure represent a model of the domain. 
Quantification is expressed in the model by a conditional probability table being associated with each 
arc, allowing the state of each node to be represented as a binary value, or more frequently as a continuous 
probability distribution. In root nodes the conditional probability table reduces to a probability distri- 
bution of all its possible states. 

A key concept of CPNs is that computation is reduced to a series of local calculations, using only one 
node and those that are linked to it in the network. Any node can be instantiated with an observed value; 
this evidence is then propagated through the CPN via a series of local computations. Thus, CPNs can 
be used in two ways: to instantiate the leaf nodes of the network with known patterns for given disorders 
to investigate expected causal pathways; or to instantiate the root nodes or nodes in the graphical hierarchy 
with, for example, test results to obtain a differential diagnosis. The former method has been used to 
investigate respiratory pathology [38], and the latter method has been used to obtain pathologic infor- 
mation for electromyography [39]. 

Artificial Neural Networks 

Artificial neural networks (ANNs) mimic their biologic counterparts, although at the present time on a 
much smaller scale. The fundamental unit in the biological system is the neuron. This is a specialized 
cell which, when activated, transmits a signal to its connected neighbors. Both activation and transmission 
involve chemical transmitters which cross the synaptic gap between neurons. Activation of the neuron 
takes place only when a certain threshold is reached. This biologic system is modelled in the representation 
of an artificial neural network. It is possible to identify three basic elements of the neuron model: a set 
of weighted connecting links that form the input to the neuron (analogous to neurotransmission across 
the synaptic gap); an adder for summing the input signals; and an activation function which limits the 
amplitude of the output of the neuron to the range (typically) -1 to +1. This activation function also 
has a threshold term that can be applied externally and forms one of the parameters of the neuron model. 
Many books are available which provide a comprehensive introduction to this class of model [e.g., 40]. 
ANNs can be applied to two categories of problems: prediction and classification. It is the latter which 
has caught the imagination of biomedical engineers for its similarity to diagnostic problem solving. For 
instance, the conventional management of patients with septicemia requires a diagnostic strategy that 
takes up to 18 to 24 h before initial identification of the causal microorganism. This can be compared 
to a method in which an ANN is applied to a large clinical database of past cases; the quest becomes one 
of seeking an optimal match between present clinical findings and patterns present in the recorded data. 
In this application pattern matching is a non-trivial problem as each of the 5000 past cases has 51 data 
fields. It has been shown that for this problem the ANN method outperforms other statistical methods 
such as k-nearest neighbor [41]. 

179.4 Summary 

This chapter has reviewed what are normally considered to be the major categories of approach available 
to support clinical decision making which do not rely on what is classically termed artificial intelligence 
(AI). They have been considered under the headings of analytical, decision theoretic, and statistical 
models, together with their corresponding subdivisions. It should be noted, however, that the division 
into non-AI approaches and AI approaches that is adopted in this volume (see Chapter 184) is not totally 
clear-cut. In essence, the range of approaches can in many ways be regarded as a continuum. There is 
no unanimity as to where the division should be placed and the separation adopted; here is but one of 
a number that are feasible. It is therefore desirable that the reader should consider these two chapters 
together and choose an approach that is relevant to their particular clinical context. 
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As discussed in previous chapters, health care facilities presently use computers to support a wide variety 
of administrative, laboratory, and pharmacy activities. However, few institutions have installed computer 
systems that provide ongoing monitoring of care and assist in clinical decision making. Despite some 
promising examples of the use of artificial intelligence to enhance patient care, very few products are 
routinely used in clinical settings [Morelli et al., 1987; Shortliffe & Duda, 1983]. One of the important 
factors that that has limited the acceptance of decision support tools for clinicians is the lack of medical 
staff input in system development. Thus, a major task for designers of computer systems that are to be 
incorporated into the daily clinical routine is to involve practitioners in the design process. This chapter 
presents a set of design recommendations, the goal of which is to ensure end-user acceptance, and 
describes a comprehensive clinical system that was designed using this approach and is now in routine 
use in a large psychiatric hospital. 

180.1 Design Recommendations 

The design phase of any initiative requires attention to the social and organizational context in which 
the new product or program will exist. Although a full discussion of the issue is beyond the scope of this 
chapter, it has been extensively covered in a number of texts. (See especially Peters and Tseng [1983] for 
a theoretical overview of the theory and illustrative case studies.) In addition to these general principles 
of organizational change, several specific design steps are critical to end-user acceptance. 

Establish a Project Team. The composition of the project team should reflect the scope of the activities 
within the facility and the organizational structure (hierarchy). It should include at least one outside 
consultant and at least one member of the medical/professional staff who will serve as the liaison to a 
clinician task force representing all practitioners. 
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Establish a Clinician Task Force. In addition to the project team, which represents the entire institu- 
tion, it is important to form a clinician task force of five to seven members of the medical/professional 
staff. In some facilities all care is provided by physicians while in other settings psychologists, social 
workers, nurses, and other professionals function as primary clinicians. The task force must represent 
all these professionals. 

Meetings of the task force must begin well before the hardware and software decisions have been made 
so that there will be adequate time to incorporate suggestions from the clinical staff. The implementation 
schedule must allow time for resolution of the issues raised by the end-users. (Resolution here does not 
imply that all parties will be pleased with all decisions, but the design team should ensure adequate time 
for discussion of each issue raised by the clinical staff and for reaching closure.) 

Know the Limits within Which the Project Team Must Work. Part of the design task is to determine 
the limitations present and the level of expectation for system use. For example, is the number of terminals 
and their location sufficient for all clinicians to have easy access? Does the administration expect clinicians 
to type rather than dictate patient summaries? If an administrator is not on the project team, interviews 
with the key executives should be held as soon as possible, with follow-up meetings regularly scheduled. 

Identify All Institutional Initiatives and Needs. It is important for system designers to be aware of all 
institutional plans. For example, the facility may have recently modified its clinical documentation 
procedures in response to a change in reimbursement regulations or as part of a new managed care 
contact. Many of these changes will have an impact on the system design, and some may create oppor- 
tunities for cost or time savings through automation. 

Identify the Types of Patient Care Activities Typical for the Institution. Using the language/action 
approach of Winograd [1987] one can classify all clinical care activities within an institution. As discussed 
by Morelli, and coworkers [1993] all actions in a psychiatric hospital can be grouped into six categories: 
assessment notation, medication order, nonmedication order, notification of a critical event, request by 
the clinician for additional information (consultation), and request of the clinician for additional infor- 
mation (e.g., to justify a treatment or hospital admission). This list, while not necessarily applicable to 
all clinical settings, is useful as a template. (See Lyytinen [1987] for a review of other approaches to 
understanding the component activities within an organization.) Once all possible clinical actions are 
categorized, list those that the system is intended to support. Such detailed assessment of the structure 
and nature of clinician actions allows designers to determine prior to implementation how the system 
will affect existing practices. For example, will clinicians be expected to use the computer to enter orders, 
eliminating the need for nurse transcription of handwritten orders? Will clinicians have to respond to 
computer notices and document why a treatment standard was not followed? 

Illustrate How the New System Will Be Integrated into the Existing Environment. Computerization 
of one aspect of clinical care may have an unexpected impact on other areas of practice. Thus, a detailed 
map of the interactions involved in formulating and executing medical orders is a useful step [Morelli 
et al., 1993]. Even desirable changes in the clinical routine, such as reducing handwritten progress notes, 
must be examined since the existing nonautomated methods may have served some additional (e.g., 
educational) purpose not evident without careful mapping of the interaction. Both automated and 
manual procedures should appear logical and relevant to the staff. Screen formats and input/retrieval 
features, for example, must follow practices acceptable to the facility. Interactions with the computer 
must be viewed as nonintrusive and must not require extensive data entry by clinicians. 

Clinician Interaction with the Terminal Should Be Part of Daily Clinical Activity. A computer sys- 
tem should not be a stand-alone tool; rather, it should be fully integrated into the clinical routine. For 
example, clinicians should have direct and easy access to on-line displays of current medications and 
results of laboratory tests. Furthermore, clinicians should be able to query the system for additional 
information when laboratory or other data indicate a potential problem. 

The System Should Provide Real-time Feedback to Clinicians. The timeliness of an information sys- 
tem is a critical factor in health care applications. Although all functions do not have to operate literally 
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in real time, they do have to operate in a manner consistent with clinician practice. For example, the 
system should notify clinicians immediately of events such as drug-drug interactions, but in other situa- 
tions (e.g., a reminder that a follow-up test is due) a notification within 2 days maybe sufficient. Laboratory 
results and treatment data must reflect the orders entered as of the day the system is being queried. 

Conduct Site Visits to Assess Computer Use in the Delivery of Clinical Service. Important aspects of 
system design may be overlooked if unique characteristics of a given site are not taken into account. For 
example, the system designers may plan to install a computer terminal in the area from which medications 
are dispensed in each nursing station. If on one unit, because of size or location, this area cannot 
accommodate a standard terminal, an alternative device may have to be purchased or architectural 
changes made. 

Include Support for Key Administrative as well as Clinical Care Tasks. Administrative activities directly 
related to clinician practice are quality assurance, peer review, continuing medical education (including 
information dissemination and needs assessment), drug utilization evaluations, patient outcomes assess- 
ment, and continuous quality improvement programs. Since both administrative and clinical tasks are 
intended to serve a common purpose, the delivery of the best available services to the patient, they should 
be integrated whenever possible. A detailed listing of all such tasks, specifying the structure of each and 
the individuals involved, may identify important areas in which the clinical information system can be 
usefully applied. 

Evaluate the System Prior to Implementation. Once the design is complete, an evaluation step is nec- 
essary to test the utility of the system in an actual clinical practice. 

The System Must Be Adaptable to Individual Clinician and Program Needs. Since rapid changes are 
commonplace in health care, the system must be able to be modified and updated without extensive 
additional programming. 

180.2 Description of a Clinical Monitoring System 

The Institute of Living's Clinical Evaluation and Monitoring System (CEMS) was developed to provide 
comprehensive support for clinical services in a psychiatric hospital. It represents an extension of earlier 
work that resulted in a prototype psychopharmacology monitoring system described elsewhere [Bronzino 
et al., 1989] The current system provides decision support and/or automated monitoring for each key 
component of care (assessment, diagnosis, and treatment) and consists of four modules: treatment 
standards ("pharmacotherapy guideline"), diagnostic checklists (DCLs), information alerts, and outcome 
assessment. Table 180.1 shows the components of care that are supported by each of the four modules 
of the CEMS. 

The treatment standards are accessed from the menu available to all clinical staff. This document 
summarizes key information about the use of selected psychiatric medications (dosages, therapeutic 
serum levels, indications, and side effects) and presents decision trees to guide drug selection. The manual 
is continuously updated and serves as both a reference and a vehicle for dissemination of new information. 

TABLE 180.1 Clinical Evaluation and Monitoring System (CEMS) 







Components of Care 






CEMS 


Assessment 
















Treatmen 


Modules 


Pt. History/Symptoms 


Labs 


Outcome 


Diagnosis 
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Treatment Standards 




X 




X 


X 


DCLs* 


X 




X 


X 




Information alerts 


X 


X 




X 


X 


Outcome assessment 






X 




X 



*DCLs = diagnostic checklists 
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It is not linked electronically to the other modules, but the content is the database for the decision rules 
in the information alerts (described below). 

The diagnostic checklists (DCLs) provide an automated method for assuring documentation of the 
key symptoms and behavioral issues that support the assigned diagnosis and for noting at subsequent 
evaluation points (e.g., at discharge) the degree of change in each symptom/behavior. In contrast to 
diagnostic assessments in other specialties, there are few procedures or laboratory tests that definitively 
establish a psychiatric diagnosis. The Diagnostic and Statistical Manual (DSM) of the American Psychi- 
atric Association [1987] and the International Classification of Diseases (ICD) published by the World 
Health Organization [1978] provide standard criteria on which to base diagnoses. In both DSM and ICD 
there is a specific algorithm that includes the type, number, and intensity of symptoms required for each 
diagnosis. The algorithm for all DSM diagnoses, modified to provide additional data, are used in the 
CEMS, and there is a separate 1-page checklist for each disorder. The clinician's admission diagnosis 
determines which form is presented. The form can be completed by the clinician at the terminal or a 
paper version can be used with subsequent input by clerical staff. The clinician indicates if each symp- 
tom/behavior is present or absent, thereby ensuring complete documentation of the findings relevant to 
that diagnosis. If the symptoms specified fail to meet DSM criteria for the disorder selected, a message 
is generated along with an explanation. The DCL is also completed at the time of discharge, at which 
time the system generates a new version of the form that contains only those items that the clinician 
indicated were present on admission. (For implementation in settings with longer periods of treatment, 
the DCLs could also be used for serial assessments prior to discharge.) 

The information alerts are computer-generated notifications that assist in ensuring compliance with 
practice guidelines and medication protocols. Clinicians may respond to these notices by changing the 
treatment plan or diagnosis or by following up on missing or abnormal laboratory orders as directed by 
the alert (i.e., the clinician is then in compliance with the treatment standard). The system also allows 
clinicians to document the reason for deviating from the standard. This step is initiated via a function key 
(see Screen 1); an item is then selected from a list of reasons for "nonstandard" practice or a free-text 
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disturbance, and is receiving a trial of antipsychotic medication 


07 


Patient is receiving an antidepressant from an outside pharmacy 


Selection 


Type: 


Fl: SELECT 


F5: RESTART F7: NXPG 




F10: RETURN F14: GUIDE 



Screen 2 

explanation is entered (see Screen 2). The alert statement with the attached explanation is printed on 
the next treatment plan review, a medical records form summarizing treatments and patient progress that 
the clinician must sign. Thus, the information alerts provide ongoing monitoring and critical event 
notification and ensure documentation of the rationale for any nonstandard practice. (All notifications 
and the clinician responses are reviewed by the medical director or designee daily.) The direct feedback 
provided by the alerts also serves an educational and information dissemination function. New infor- 
mation about medications, for example, can be incorporated into one or more alerts as well as added to 
the treatment guidelines described above. 

The system currently has 130 alerts and can be expanded to search as many as 2000 records for up to 
200 critical events each day. Additional alerts can be added without additional programming, and changes 
in the specifications for an existing alert (e.g., dosage parameters) can be made by accessing the appro- 
priate table from the maintenance screen. Alerts can include data elements on any medication prescribed, 
any laboratory value, any psychiatric diagnosis or a variety of historical/demographic items (e.g., gender, 
age). The limitation in the last of these four categories is the amount of information from the medical 
record that is currently on-line. With a completely electronic medical chart, this system could generate 
an alert for almost any clinical event. 

180.3 Outcome Assessment 

At admission, discharge, and regular intervals postdischarge nursing staff complete a functional assess- 
ment survey on all patients. These data are entered into the system using an optical scanner. Regular 
feedback is provided to hospital staff and used to evaluate clinical services and practice. 

In a typical scenario a clinician, after logging on, would select "Show My Alerts" from the menu 
(Screen 3). Alerts for all of the patients assigned to that clinician are displayed as shown on Screen 4. 
(The medical director or other supervising clinician can select all active alerts by type, by practitioner, 
or by patient.) Patient name and alert message are given, but no further detail can be obtained nor can 
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a direct response to the alert be made without selecting a specific patient and a specific alert for that 
patient. When the clinician goes to a specific alert, there is the option (Screen 1) to suspend the alert as 
described above or to query the system about the patient's diagnosis, current medication, laboratory 
values, or the "history" of that alert (i.e., if the alert has previously been issued, a log of all activity on 
that alert is given). From the current medication and laboratory screens there is an additional option 
that allows access to all historical lab values/medications. This historical feature is especially valuable for 
tracking patients who require frequent drug serum level determinations (e.g., with lithium carbonate) 
or who have medical disorders that necessitate periodic laboratory tests (e.g., chronic kidney or liver 
disease). 

180.4 Conclusion 

Computers are increasingly user friendly, but the clinical environment may often not be computer friendly. 
In addition to the obvious computer engineering tasks, there is a sizable human engineering issue that 
must be addressed if automated tools are to be accepted and fully utilized in clinical settings. Thus, 
designers must take into account the social and organizational structure of the health care delivery 
network the system is intended to serve, involve clinician staff in its development, and adhere to a design 
strategy that will ensure end-user acceptance. Although the system outlined in this chapter was developed 
for psychiatric practice, the components of care described are common to all branches of medicine, and 
the design steps recommended are applicable for a wide range of clinical settings. 
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Defining Terms 

Decision trees: Decision trees or flow charts are a common way to present medical information that 
is intended to support the decision-making process. Many decisions are hierarchical in nature, but 
there may be more than one acceptable option at each nodal point on the decision tree. 

Language/action model: The language/action model of human-computer communication is a term 
taken from the work of Winograd, who introduced the term "conversation for action" to describe 
how human behavior is coordinated within an organization. To quote Winograd "We work together 
by making commitments so that we can successfully anticipate the actions of others and coordinate 
them with our own." According to this model, computer systems are part of the conversational 
structure of the organization and, along with the human employees, engage in a variety of well- 
defined interactions (communications). Winograd's approach is based on earlier work by J. L. 
Austin and J. R. Searle. 

Map: A map of the interactions is a detailed assessment, often diagrammatically expressed, of how the 
work of the organization is accomplished. In a medical facility, the steps involved include the 
physical assessment of the patient by the clinician, various procedures performed on the patient, 
and the recording, storage, retrieval, and analysis of data. Such maps can be informed by but do 
not necessarily have to follow Winograd's language/ action approach (described above) or any other 
formal model of analysis. 
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THE FOCUS OF THIS SECTION is on artificial intelligence and its use in the development of 
medical decision systems with clinical application. The methodologic basis for expert systems and 
artificial neural networks in the medical/clinical domain will be discussed. This is one aspect of 
the growing field of medical or health informatics, the application of engineering, computer, and infor- 
mation sciences to problems in health and life sciences. Other informatics applications are addressed in 
Section XVII. This section contains eight original contributions that cover the history, methods, and 
future directions in the field. It intentionally omits specific computer hardware and software details 
related to currently available systems because they continue to change as technology changes and are 
likely to be outdated even as this Handbook goes to press. Furthermore, chapters representing complete 
examples of specific decision systems are not included in this section. Numerous example can be found 
in the current literature, and may have been cited in the chapters of this section to illustrate particular 
aspects of decision system structure, knowledge acquisition and representation, the user interface, and 
system evaluation and testing. 

In Chapter 181, Dr. Kulikowski presents the history of the development of artificial intelligence (AI) 
methodology for medical decision making, from the early statistical and pattern-recognition methods 
of the 1960s to the continuing development of knowledge-based systems of the 1990s. Dr. Kulikowski 
has described the evolution of these AI applications. The early application explored various approaches 
to handling knowledge within the specific domains of interest, utilizing either causal networks, modular 
rule-based reasoning, or frame/template representation of the knowledge describing the clinical domain. 
These varied approaches pointed out the difficulties involved in the actual acquisition of the domain 
expertise needed for each application and generated research investigations into acquisition strategies 
ranging from literature review, case study evaluation, and detailed interviews with the experts. Alternative 
approaches to knowledge representation, the development and assessment of decision system applica- 
tions, efforts to test and evaluate decision system performance, and the problems associated with rea- 
soning in uncertain environments have become important areas of investigation. Currently, there is active 
interest in qualitative reasoning representation, the importance of the temporal framework of the decision 
process, and the effort to move toward more practical systems that embody decision support for diag- 
nostic or treatment protocols rather than the fully automated decision system. Dr. Kulikowski points out 
that expert clinical systems have not become the indispensable clinical tool that many had predicted in 
the early days of AI in medicine, describes reasons for the shortfall, and looks forward to advanced 
software environments and medical devices that are beginning to routinely incorporate these ideas and 
methods in their development. 

In Chapter 182, Drs. Micheli-Tzanakou and Zahner focus on artificial neural network (ANN) method- 
ologies and their applications in the medical and health care arena. ANNs consist of a large number of 
interconnected neurons or nodes that can process information in a highly parallel manner. They are specified 
by their processing-element characteristics, their network topology, and the training rules they use to learn 
how to achieve correct pattern classification from an array of multiple inputs. Drs. Micheli-Tzanakou and 
Zahner provide the mathematical details for the back-propagation and ALOPEX training algorithms and 
discuss the benefits and deficiencies of these approaches in teaching the ANN to correctly classify input 
patterns presented to the system. Finally, the performance of several ANN approaches in mammography 
and chromosome and genetic sequence classification applications are reviewed and compared. 

The contribution by Nykanen and Saranummi (Chapter 183) also provides a history of AI applications 
in medical decision systems but focuses on the clinical reasoning process, systems development within 
the clinical environment, and the critical issues for system acceptance. The early introduction of expert 
system shells provided a strong impetus for system development and indeed resulted in many published 
applications. The availability of inexpensive and powerful microcomputers and dedicated workstations 
for knowledge engineering also has contributed to the development of medical expert systems. The 
authors point out that data and knowledge are often less quantitative and consistent in clinical medicine 
than in the physical sciences and that moving from the knowledge level to the knowledge use level posed 
significant problems. Appropriate, objective, and standardized evaluation testing, and validation are often 
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lacking as a part of the system development process, as discussed in this chapter, and this contributes to 
the lack of widespread acceptance of these systems. The authors conclude with a list of challenges for 
future clinical decision systems posed by the need for integration with new information technologies 
and the developing health care information infrastructure. 

The chapter by Summers, Carson, and Cramp (Chapter 184) discusses specific methodologies used 
in expert system development and provides some general examples of rule-based and semantic network 
approaches to knowledge representation in medical applications. The authors use MYCIN and CASNET, 
two of the earliest medical expert system applications to achieve widespread recognition and serve as 
models for subsequent developments, to demonstrate and compare these two approaches for knowledge 
representation. MYCIN uses production rules to represent causal relationships between knowledge items 
to diagnose and treat microbial infections. CASNET uses causal or semantic networks to represent 
knowledge relationships in the diagnosis and treatment of glaucoma. 

Dr. Garbay's contribution (Chapter 187) continues along these directions by providing a somewhat 
more conceptual approach to the question of knowledge domains, acquisition, and representation. 
Knowledge-acquisition techniques are reviewed, and several knowledge-acquisition tools designed to 
assist in the process are described. Dr. Garbay also presents fundamental ideas related to rule-based, case- 
based, and causal reasoning. She discusses models for the uncertainty and imprecision that is central to 
the clinical environment. The use of temporal knowledge and its incorporation into clinical decision 
systems is also discussed. The ideas of shallow and deep knowledge systems are introduced from the 
perspective of the system's explanation facilities. Shallow knowledge systems rely primarily on reasoning 
based on experience and experimental results, while deep knowledge systems are based on detailed 
knowledge regarding the structure and function of the underlying system such as those employing 
physiologic models within the knowledge base. Dr. Garbay, as have many contributors to this section, 
also introduces the new challenges for clinical decision systems with regard to the rapid developments 
in networking, communications, and information sciences. 

A new and expanding application area involving intelligent patient monitoring and management in 
critical care environments is discussed in Chapter 186, by Drs. Dawant and Norris. Such systems involve 
the context-dependent acquisition, processing, analysis, and interpretation of large amounts of possibly 
noisy and incomplete data. These systems can be viewed within four distinct functional levels. At the 
signal level, the system acquires raw data from patient monitors, which include analog-to-digital con- 
version and some low-level signal processing. Data validity checks and artifact removal occur at the 
validation level. The transformation from numerical features that characterize the signals to a symbolic 
representation, such as normal or abnormal, is performed at the signal-to-symbol level. Finally, the 
inference level consists of the reasoning elements described in previous chapters to arrive at diagnoses, 
explanations, prediction, or initiation of control actions. Examples of both shallow and deep knowledge 
systems developed for patient monitoring are presented 

Systems development and widespread dissemination often have been stymied by the lack of standard- 
ization of knowledge representation, as described in all the preceding contributions. The last two chapters 
look at this question from the perspective of medical terminology and the status of natural language 
processing in biomedicine. 

In discussing applications of medical knowledge bases, Dr. Kerkhof (Chapter 187) states that while 
technological advances in computer processors and storage media permit virtually unlimited storage and 
fast retrieval capability, the interpretation of natural language constitutes a major obstacle in the advance- 
ment of knowledge-based systems in medicine. In this chapter, classification and coding systems are 
identified, and their content is described. The richness and depth of natural language are often the focus 
of difficulties when attempting to develop such systems, encountering differences in such elementary 
concerns as definitions, spelling, usage, and precision. Dr. Kerkhof offers several approaches to handling 
linguistic problems associated with such systems, but all have their own inherent limitations. 

In Chapter 188, Dr. Johnson describes the techniques of natural language processing as a means to 
bridge the gap between textual and structured data so that users can interact with the system using 
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familiar natural language, while computer applications can effectively process the resulting data. Appli- 
cations are classified according to the levels of language competence embodied in their design. Speech- 
recognition and -synthesis systems deal with basic data representation, wile lexical, syntactic, and dis- 
course systems function at increasing levels of complexity from single words to entire discourses. 

While the specific applications described in this section relate to biomedical concerns, the definitions 
and methods overview relate to the development and utilization of expert systems and artificial neural 
networks for a wide variety of decision systems. Detailed implementation protocols for such systems are 
beyond the scope of this section but can be found in the references cited at the conclusion of each chapter. 
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Artificial intelligence (AI) for medical decision making and decision support had its origin in the 
knowledge-intensive expert consultation systems that were introduced in the early 1970s. These differed 
considerably from the data-intensive statistical and pattern-recognition methods that had been applied 
to medical reasoning problems since the 1960s and which saw a resurgence in the 1980s as new types of 
computationally more powerful artificial neural networks (ANNs) were developed, and sophisticated 
models of Bayesian and other belief networks were designed to capture the nuances of clinical reasoning. 
For the past decade it has become incresingly clear that a variety of approaches is needed to make 
computer-based medical decision systems useful and effective, preferably embedding them within systems 
that are based on an electronic medical record (EMR). Meanwhile, advanced software environments 
increasingly and routinely incorporate AI ideas and methods as they seek to facilitate the tasks of building, 
validating, and testing medical knowledge bases. 
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181.1 Early Models of Medical Decision Making 

The earliest efforts to formalize medical decision making involved the application of statistical decision 
methods (ROC curves) in radiographic interpretation. Precursor attempts at automating the logic of 
diagnostic reasoning by sorting symptoms and selecting diagnoses that matched a particular combination 
involved multiple slide-rule and early card and computer sorting techniques. 

As computers became more powerful and easily programmable, they were the natural tool for repre- 
senting diagnostic and treatment decision making. A watershed article by Ledley and Lusted [1959], in 
Science, described the reasoning bases for medical decision making in both logical and statistical terms. 
The main paradigm for representing medical decisions in the 1960s was statistical, whether Bayesian, 
hypothesis testing, or discriminant function analysis. The growth of standardized laboratory test panels 
and multiphasic screening methods had the effect of popularizing statistical techniques as an objective 
approach to the selection of decision thresholds. Bayesian methods were predominant and allowed 
incorporation of subjective estimates of prior probability into the calculation of diagnostic probabilities. 
Likelihood-ratio (hypothesis-testing) methods did not, but instead required a determination of sensitiv- 
ity/specificity trade-offs with threshold selection dependent on a choice of clinically tolerable levels for 
them. There also were heuristic pattern-recognition methods being developed if more flexibility was 
required, but all these approaches at that time suffered from a common drawback. While they might 
perform very well for circumscribed problems with well-defined statistics or adequate samples from 
which to estimate the statistics, they were rarely acceptable to practicing physicians beyond their site of 
origin. A reason frequently given for this was the difficulty of explaining decisions based on strictly 
computational theories of probability in terms of the qualitative language and arguments familiar to 
physicians. 

During the 1960s, the alternative to formal statistical methods was the coding of the sequence of expert 
decisions in a branching logic diagram or flowchart, often described as a clinical decision algorithm. This 
approach had the advantages of clarity and ease of explanation but was usually too rigid to capture the 
nuances of context without becoming very large, complex, and computationally expensive. During this 
period, most mainstream AI research appeared to have little to offer medical decision making, since it 
was concerned primarily with the computer representation and reasoning for general problem solving 
without considering explicit representations of uncertainty. 

181.2 Emergence of the Knowledge-Based AI Methods 
for Medical Consultation 

In the early 1970s, several groups involving both AI and biomedical researchers decided to explore 
different, more knowledge-intensive approaches to interpretive problem solving, including medical deci- 
sion making. Several common themes emerged from the work of the four groups that initiated these 
efforts at Rutgers University, Stanford University, the University of Pittsburgh, and the MIT/Tufts col- 
laborative group. All of them, at about the same time, recognized some of the shortcomings of existing 
methods and proposed a set of alternative representation and inferencing approaches that involved: 

• Representing uncertainty more flexibly and qualitatively than appeared possible with probabilities. 

• Representing more of the medical knowledge that motivated and justified a diagnostic, prognostic, 
or therapeutic decision. 

• Developing a descriptive component of medical knowledge to which some general problem solving 
or inferencing strategy could be applied. 

Early prototypes of AI consultation systems that incorporated distinct approaches to medical decision 
making were CASNET [Kulikowski and Weiss, 1972], MYCIN [Shortliffe et al, 1974], DIALOG/INTER- 
NIST-1 (Pople etal, 1975], the Present Illness Program (PIP) [Pauker et al, 1976], and the Digitalis 
Advisor Program [Gorry et al., 1978]. 
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That a domain-specific model could be used as the basis for interpretive decision making was first 
demonstrated in the DENDRAL system [Buchanan et al., 1970]. DENDRAL interpreted mass spectra by 
using rules that were constrained by a model of the possible chemical structures that could have given 
rise to the observed spectral data. This was quite different from the domain-independent, pattern- 
recognition approaches previously applied to the problem. 

The first AI approach to decision making in medicine itself evolved within the Rutgers Research 
Resource on Computers in Biomedicine, which was established in 1972. One of the goals of our research 
group was to find general methods of representing medical reasoning that would take advantage of the 
knowledge of specific diseases in ways similar to those used by medical specialists. In a collaboration 
with Dr. Aran Safir, an ophthalmologist at the Mt. Sinai School of Medicine (New York), we explored 
methods for describing and applying medical knowledge for computer-based diagnosis and management. 
With Sholom Weiss, who worked on his doctoral dissertation on this project, we found that a very natural 
way for characterizing the underlying mechanisms of disease was through cause-and-effect relations. A 
network of causal links among pathophysiologic conditions (or abnormal physiologic states) could then 
be used to describe the many pathways through which a disease might manifest itself. Each pathophys- 
iologic state could be inferred independently from the patient's condition and asserted with some degree 
of confidence depending on its pattern of supportive evidence. A prototype system for testing these ideas 
was developed in the area of glaucoma diagnosis and management. This proved to be a good problem, 
because glaucoma is a major cause of loss of vision and blindness with mechanisms that are sufficiently 
well understood to determine the course of management; yet at the same time sufficiently complex that 
advice on difficult cases (particularly those resistant to conventional treatment) must frequently be sought 
from specialist consultants. 

A variety of reasoning strategies seemed to apply to different stages of the consultation. For example, 
while starting to gather data on a patient, the major problem for a consultant is to elucidate the patient's 
problem and develop diagnostic leads by asking the right questions and acquiring the relevant data. 
When enough data have accumulated, a different mode of reasoning is characteristically needed. The 
findings must be attributed to specific causes, whether single or multiple. Finally, all these explanations 
must be combined into a differential diagnosis. This serves to predict what will happen if the patient 
remains untreated (prognosis) and suggests possible ways of managing the disease (treatment or thera- 
peutic planning). As we gained experience in extracting the knowledge from the expert practitioners, it 
became clear that different management strategies relied on different types of medical knowledge, and 
these strategies had to be sufficiently general and independent of the particular disease being modeled. 
Weiss built a prototype system called CASNET (for casual associational network) based on these ideas 
[Weiss, 1974; Weiss et al., 1978]. We were fortunate in being able to draw on the expertise of leading 
glaucoma specialists, who provided the in-depth knowledge needed to describe the disease and tested 
the system independently in their own laboratories. After two years of improving the initial prototype, 
the system was given a blind test by presenting it with new, previously unseen cases during a panel 
discussion at the major national meeting of the Academy of Ophthalmology in 1976. CASNET made no 
errors in its recommendations, though it was unable to answer some questions for which the knowledge 
was absent in the computer model [Lichter and Anderson, 1977]. 

During the early 1970s, another kind of biomedical AI resource was established at Stanford as the 
result of the DENDRAL project collaboration that had developed between AI researchers (E. Feigenbaum 
and B. Buchanan) and biomedical researchers (J. Lederberg and S. Cohen). This SUMEX-AIM Resource, 
established in 1974, provided advanced time sharing on a PDP-10 computer to a national network of 
investigators of AI in medicine, with appropriate systems and AI language support for what was then a 
very novel computational mode, serving an initial nucleus of investigators, including those at Rutgers 
and the University of Pittsburgh [Freiherr, 1979]. 

The medical component of the SUMEX-AIM research centered around the consultation system for 
infectious disease treatment developed by Shortliffe [Shortliffe et al., 1974; Shortliffe, 1976]. Its repre- 
sentation of medical knowledge was in the form of heuristic rules with a new certainty factor formalism, 
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which decoupled the way in which positive and negative evidence was credited toward a hypothesis, thus 
providing more heuristic flexibility than previous probabilistic frameworks. The system's strategy was 
goal driven for gathering the data needed to reach the diagnostic conclusions necessary to choose the 
appropriate covering therapies. The modularity of MYCIN's rules turned out to be a critical design choice, 
since it was soon found that it was easy to modify the rules independently of the reasoning strategy and 
its inference engine, giving rise to a very flexible and updatable knowledge base. The certainty- factor 
model turned out to be less long-lived, since it was soon found to implicitly involve assumptions and 
constraints that did not adequately represent the complex dependencies that often exist among data and 
hypotheses. However, MYCIN, by providing an easy means of directly and modularly encoding the rules 
of medical reasoning (without an underlying disease model as in CASNET), gave rise to the most powerful 
representation for the first generation of knowledge-based AI systems — the rule-based paradigm [Bucha- 
nan and Shortliffe, 1984]. 

Another group to work intensely on AI approaches to medical decision making was that of Gorry at 
MIT and Pauker and Schwartz at Tufts, who led the development of the Digitalis Advisor Program (Gorry 
et al., 1978]. This system introduced two interesting new concepts: a patient-specific model (PSM) and 
a mathematical model for describing the mechanism being regulated and interpreted (digitalis uptake). 
The PSM grouped together all the information known about a patient during a consultation session (the 
findings, history, test results, etc.), as well as the hypotheses about the patient (diagnostic, prognostic, 
and therapeutic) produced by the computer diagnostic/therapeutic model. The mathematical model was 
a compartmental model of digitalis uptake, which was able to relate dosages to their expected therapeutic 
and toxic effects. 

In a related AIM project, Pauker and Kassirer joined Gorry and Schwartz in formulating the Present 
Illness Program (PIP) [Pauker et al., 1976], which was designed to focus in on a medical problem based 
on a brief description of the patient's complaints (the present illness). Here, Minsky's frame formalism 
[Minsky, 1975], which had been developed to describe a template of expected objects in computer vision 
recognition tasks, was used as a template for grouping typical or characteristic information about a 
disease. Each frame contained slots that had to be filled with descriptors of the disease, with some of 
these slots containing rules for reasoning about the disease. In this way, diseases were related through 
rules, but these rules were grouped by their respective frames. An interesting observation that came from 
this work was that the probabilistic scoring of hypotheses, while important in diagnostic reasoning, is 
often overshadowed by the use of categorical or deterministic reasoning, which has to be flexible and 
suitably richly represented to be useful [Szolovitz and Pauker, 1978]. Subsequently, the ABEL system for 
modeling acid-base balance problems [Patil et al., 1981] took advantage of many of the ideas from PIP 
and combined them with a causal description of disease at multiple levels of abstraction and aggregation, 
which helped provide a more detailed description of pathophysiologic processes. 

Contemporaneously, Dr. Harry Pople at the University of Pittsburgh had been investigating biomedical 
applications of AI — specifically abductive reasoning with causal graphs for scientific theory formation 
[Pople, 1973]. In a collaboration with Dr. Jack Myers, an eminent internist, Pople began a very broad 
and ambitious project: to capture all of Dr. Myers' diagnostic knowledge of internal medicine into a 
system that could reason automatically from the facts of a case. Over the next few years, they developed 
a taxonomic and causal representation for characterizing diseases and their manifestations and proceeded 
to develop a set of alternatives for describing the various strategies of diagnostic reasoning: the preliminary 
assessment of findings, the attribution of causes and explanations for each of the findings, the compu- 
tation of confidences and resulting ranking of diagnostic hypotheses, the development of a differential 
diagnosis, and the confirmation of a diagnosis once it is already strongly indicated (through specialized 
tests). The DIALOG system was developed to test these ideas and codify Dr. Myers' knowledge [Pople 
etal, 1975]. Its later version, called INTERNIST-1 [Pople, 1977; Miller, 1984], demonstrated how the 
very large domain-specific knowledge base of internal medicine could be assembled and validated with 
complex cases of disease, both from the clinic and journal CPC reports. 
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Research in AI in medicine (AIM) was initially presented in a series of AIM workshops, starting at 
Rutgers University [Ciesielski, 1978], subsequently rotating among the AIM community. Later workshops 
were organized in conduction with the American Association for Artificial Intelligence (AAAI) as part of 
its spring symposia, while sessions on medical expert systems were to be increasingly found at most 
relevant scientific, engineering, medical, and informatics conferences (AAAS, IEEE, ACM, AAAI, 
SCAMC, MEDINFO, and AMIA). 

181.3 The Transition to Expert Systems and the Ascendancy 
of Rule-Based Systems— 1976-1982 

The experience in developing, testing, and disseminating the prototypes of the first-generation medical 
consultation systems, combined with similar experiences of AI researchers outside medicine, led to a 
shift in AI research away from general problem solving to more domain-specific knowledge-intensive 
problems. Feigenbaum (1978) defined generalizations of the MYCIN system for dealing with any problem 
where advice-giving knowledge could be captured in the form of rules. He emphasized the separation 
of rule-based systems from the inference engine that reasoned with them and coined the term knowledge 
engineering to describe the art of building a knowledge-based system. This process was centered around 
the interviewing of domain specialists by knowledge engineers (typically computer scientists or engi- 
neers), who would attempt to understand the problem being modeled, learn about the expertise that the 
specialist applied in solving it, and finally formalize all this into a computer representation of the problem- 
solving or consultative knowledge. The early stages of the process, usually called knowledge acquisition, 
frequently became the major bottleneck in developing an expert system, particularly if the knowledge 
engineer had difficulties in understanding the field of expertise or came with preconceived notions about 
it that did not match the expert's own. Another difficulty with first-generation expert system shells was 
that the knowledge had to be fitted to a predetermined computer representation, which might or might 
not match that of the domain problem. Most shells worked well with advice-giving reasoning that involved 
classification-type problems that could be reduced to the selection of one or several alternatives from a 
large set of candidate hypotheses. In this sense, their overall problem-solving paradigm did not differ 
much from the statistical and pattern-recognition methods that preceded them. Their major departure 
was in the architectural modularity of the rule base, flexible choice of reasoning strategies, and the 
representation of intermediate hypotheses and reasoning constructs with which to support and assemble 
a final conclusion. Researchers interested in the cognitive processes of human expert diagnosticians 
analyzed the problem-solving behavior of experts confronted with real and simulated cases to discern 
their reasoning strategies [Elstein et al., 1978; Kassirer and Gorry, 1978]. These were usually found to be 
sufficiently complex that they did not have an effect on the design of the rapidly spreading rule-based 
systems of the time. Rather, they influenced the development of more sophisticated systems based on 
deeper medical knowledge in the following decade. 

The years 1976-1978 saw the testing, critiquing, and elaboration of the prototype systems. This resulted 
in changes and generalizations of the initial designs, leading to the first general system building frame- 
works (EXPERT [Weiss and Kulikowski, 1979], EMYCIN [van Melle, 1979], AGE [Nii and Aiello, 1979]). 
These came about because in the course of the research it became clear that incorporating expert 
knowledge about advice-giving consultation could be carried out with various kinds of reasoning: either 
or both hypothesis- and data-driven strategies, with corresponding backward or forward chaining of 
rules. Furthermore, inferences incorporated into the rules could be either interpreted (as in EMYCIN) 
or compiled (as in EXPERT), and knowledge could be chunked or clustered by subtasks of the problem 
solving with the aid of various other representational devices (context trees in EMYCIN, knowledge 
sources in AGE, rule clusters in EXPERT, and frames in PIP). 

Building various expert systems capitalized on the increasing experience with alternative ways of 
representing knowledge. For example, the IRIS system [Trigoboff, 1976] applied a semantic network 
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model (as was being developed in PROSPECTOR [Hart and Duda, 1977]) to represent glaucoma con- 
sultation knowledge with multiple strategies of inference; the CRYSALIS system [Englemore and Terry, 
1979] which explicated the structure of chemical substances from X-ray diffraction data, used the 
blackboard model from speech understanding; the VM system developed a real-time variant of a rule- 
based system for ventilation management [Fagan et al., 1979]; and the MDX system pursued a distributed 
conceptual model for diagnosis [Chandrasekaran et al., 1979]. The CENTAUR system [Aikins, 1979] 
showed that both rules and frames could be usefully combined to represent expert reasoning, and 
AI/RHEUM [Lindberg, 1980] demonstrated how rule-based systems with specialized medical semantics 
could be adapted to represent already formalized knowledge found in rheumatologic diagnostic criteria 
tables. Meanwhile, many other systems were developed showing that rule-based methods could be widely 
and systematically applied [Reggia et al., 1980; Horn et al., 1980; Speedie et al., 1981; Buchanan and 
Shortliffe, 1984; Weiss and Kulikowski, 1984]. 

The evaluation of the first-generation systems was pursued for CASNET [Lichter and Anderson, 1977], 
MYCIN [YuetaL, 1979], the Digitalis Advisor [Swartout, 1981], and INTERNIST- 1 [Pople, 1977]. Critical 
examination of knowledge structures in INTERNIST- 1 led to their being rerepresented in INTERNIST-2 
[Pople, 1977]. General methodologies for evaluating clinical predictions were also being developed 
[Shapiro, 1977]. This testing and evaluation led to the first efforts at automating knowledge acquisition 
and maintenance of the rule bases. For instance, the TEIRESIAS system was designed to improve the 
performance of MYCIN through heuristic analysis of individual cases leading to changes in rules that 
were incorrectly invoked or designed [Davis, 1979]. 

Likewise, Politakis and Weiss [1980] developed methods for testing and improving EXPERT rule bases 
by providing logical and statistical performance evaluation and rule-modification tools in an interactive 
environment. In 1980, a first workshop on general expert systems was held to compare the different 
techniques for knowledge representation and the systems that implemented them. A common problem 
was selected in advance without the knowledge of the different research groups and presented to them 
at the workshop for implementation in their system. The results of this experiment are summarized in 
Hayes-Roth et al. [1983]. While all the existing shells were able to represent a complex advice-giving 
problem (the diagnosis and tracking of a chemical spill), those with fixed inference engines proved the 
easiest to use for rapid prototyping. As might have been expected, the more general Al languages designed 
to capture expertise did permit more flexible, expressive, and detailed descriptions of the problem but 
at the cost of a much greater investment of time and effort in building both knowledge bases and 
specialized inference procedures. 

The development and assessment of rule-based and frame-based systems demonstrated that while they 
could effectively capture large quantities of domain-specific expertise, there were still many unanswered 
questions about how the expertise should be best applied to specific problems while at the same time 
abstracting out general representations of knowledge. The complexity of human problem-solving pro- 
cesses revealed by cognitive analyses and simulations of clinical reasoning [Elstein et al., 1978] pointed 
to the difficulties of reconciling these goals. 

181.4 Exploration of Alternative Representations for Medical 
Al and the Search for Performance — 1983-1987 

Two almost opposite trends in medical Al emerged from the generalization of expert systems, which 
continue to the present. On the one hand, driven by the goal of practical clinical applications, researchers 
tried to adapt existing representations and obtain high performance from them. This led to various 
systems with very specific goals, such as the interpretation of laboratory tests, which even reached 
commercial implementation and dissemination [Weiss et al., 1983]. At the opposite end of the spectrum, 
dissatisfaction with the oversimplified cognitive style of rule-based expert systems and the inadequate 
coverage and performance of large knowledge bases led to research into alternative representations of 
deeper medical knowledge [Chandrasekaran et al., 1979]. 
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Explanation and Early Knowledge Level Work in AI Systems 

In an attempt to reuse the MYCIN knowledge base for tutorial purposes in the GUIDON system [Clancey, 
1989] it was found that generating explanations from the original rule base was not only difficult but 
also revealed that all kinds of implementational details for consultative inferencing were mixed in with 
the more abstract, descriptive medical knowledge that supported the inferences. Besides adding metarules 
for guiding the discourse involved in tutoring, the experience in building GUIDON led Clancey to 
reconsider the generality of the MYCIN rule base and its representation. The NEOMYCIN system 
[Clancey and Letsinger, 1981] completely separated descriptive medical knowledge from details of rea- 
soning task implementation so that it would better match the problem-solving tasks of human experts. 
This coincided with Newell's more general observation across AI systems: that they needed to describe 
problem solving at the knowledge level, free of the encumberances of specific programming details 
[Newell, 1982]. In medicine, a similar experience had been reported in a follow-up to the digitalis project 
by Swartout who designed the XPLAIN system [Swartout, 1981] as an automatic programming approach 
for generating a consultation model from the specification of abstract system goals. Explanations and 
justifications would then be automatically built into the performance system. 

The role of explanatory reasoning became a major focus of attention for many medical AI groups 
during this period. One of the earliest and major motivations for the AI approaches to reasoning was 
that they were able to explain their logic in terms that were much closer to the arguments of physicians. 
However, by the end of the first decade, most AIM researchers found that the explanatory capabilities of 
their systems, while useful for tracing logical connections through their knowledge, left much to be 
desired in terms of naturalness and flexibility. Traditional statistical decision systems could produce an 
explanation by accounting for each manifestation's contribution to the total probability (or weight) of a 
diagnosis. In CASNET, an explanation could be generated by tracing the pathway of confirmed causal 
states that led to the final diagnosis. The confidence in the confirmation of each state could, in turn, be 
explained by the pattern of observed evidence that supported it. In MYCIN and other derived rule-based 
systems, explanations could be easily generated by tracing the rules involved in producing a decision. In 
the Digitalis Advisor, an explanation could be generated in terms of the causal underpinnings of the 
patient-specific model, while in INTERNIST- 1, the diagnostic strategy could be traced in terms of the 
sequence of manifestations activated or covered by a particular set of hypotheses. All these approaches, 
while initially satisfying to the developers, upon further consideration proved to be inadequate. Tracing 
every detail of logic by which a system arrived at a diagnostic or therapeutic conclusion could be very 
useful for debugging the knowledge base during system developments but it rapidly became tedious to 
the expert reviewer or the practitioner using the system. What was missing was a clear summary of the 
underlying justification of the reasoning. A major reason for this was that the first-generation represen- 
tations, in their emphasis on symbolic and qualitative descriptions of knowledge, as opposed to the earlier 
numerical representations, failed to separate or abstract out information at the true knowledge level from 
the finer-grained symbolic details of the implementation. 

NEOMYCIN [Clancey and Letsinger, 1981] demonstrated the need to reorganize the knowledge in a 
rule base in order to be able to produce "intelligently studied reasons" for a decision. The ABEL system 
showed the critical importance of hierarchical causal descriptions using composition and decomposition 
operators to explain interactions among subprocesses contributing to an overall pathophysiologic disease 
process [Patil, 1983]. In a restructuring of the INTERNIST-1 knowledge base, Pople [1982] showed how 
combinations of very elementary operators (causal, hierarchical) could be used to construct and explain 
alternative complexes of hypotheses with different interpretations for the same set of confirmed states. 
MDX stressed the importance of describing problem-solving tasks in diagnosis and using procedural 
knowledge to capture the knowledge of expert practitioners [Chandrasekaran et al., 1979] . This research, 
applied to several liver diagnostic problems and a red cell identification problem [Smith et al., 1985], 
among others, led Chandrasekaran [1986] to develop a theory of generic problem-solving tasks in order 
to advance our understanding of diagnostic and other expert reasoning processes. 
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A completely different form of reasoning was implemented in the ATTENDING system [Miller, 1983] . 
Rather than modeling consultative reasoning directly, Miller analyzed and critiqued the plans that phy- 
sicians had developed for anesthesia administration, using an augmented transition network (ATN) of 
states. This novel departure helped define a new type of critiquing system that was very useful for 
instructional purposes and therefore more likely to be used by physicians who either felt threatened by 
automated consultative systems or else found them lacking and not essential to their practice. 

Performance of Expert Medical Systems 

In trying to deal with practical clinical problems, many AIM researchers found that the first general 
expert systems shells did not satisfy their requirements. The shells frequently had many options that 
would never be used in a particular application while being difficult to adapt to problem-specific rea- 
soning methods and domain-specific knowledge structures. Furthermore, for an expert system to be 
really useful, they found that design of easy-to-use human interfaces, while not scientifically rewarding 
at first glance, was actually quite essential. In this regard, the next major medical project of the Heuristic 
Programming Project at Stanford, the ONCOCIN system [Shortliffe et al., 1981], demonstrated that well- 
engineered interfaces mimicking on a screen, the input of charts, images, and other data for cancer 
treatment protocols could directly improve data acquisition and were essential in guiding the application 
of the protocols. Knowledge acquisition for ONCOCIN was provided by a very flexible graphics-oriented 
system, called OPAL [Musen et al., 1986]. An important representational issue also was settled in the 
ONCOCIN experiments: the need for event-driven reasoning as well as goal-driven reasoning in rule- 
based systems, as had been earlier advocated in the EXPERT scheme. 

Exploiting other, natural knowledge representations became a theme in an application of the EXPERT 
system for building a rheumatology knowledge base [Lindberg et al., 1983]. Here it was found that 
domain-specific decision rules had already been defined by the American Rheumatological Association 
in the form of diagnostic criteria tables. These, however, needed more detailed elaboration in terms of 
observational criteria for findings, as well as customization of the diagnostic criteria within a working 
expert system — AI/RHEUM [Kingsland et al, 1983]. 

The goal of producing high-performance systems led researchers during this period to experiment 
with technology transfer, disseminating the knowledge of experts within the medical community. The 
first expert system on a chip was pioneered at Rutgers with the SPE/EXPERT system for serum protein 
electrophoresis analysis [Weiss et al., 1983]. In this system, a knowledge base was developed by a leading 
specialist in the field, using the high-level EXPERT language, and then automatically translated into 
algorithmic form and compiled into the assembly language of the ROM that existed to process signals 
from the scanning densitometer of the Cliniscan instrument (TM-Helena Laboratories, Beaumont, 
Texas). Similar technology was used to develop an advice-giving system for primary eye care on one of 
the earliest hand-held computers [Kastner et al., 1984]. 

In a very large undertaking, the INTERNIST- 1 knowledge base was recast into a form that would make 
it more easily accessible by flexible querying in the QMR system [Miller, 1984] , rather than being restricted 
to the consultative mode it had been before. The intelligent retrieval facilities of QMR have made it one 
of the most widely disseminated results of expert system research [Miller, 1994] . In a similar trend, other 
earlier non-AI systems were reengineered to have more modular architectures, as with the HELP system, 
which provided advice as part of the medical information system at the Latter Day Saints Hospital. Its 
successor, the ILLIAD system, has subsequently become an integral part of quality assurance functions 
in the hospital network [Lau and Warner, 1992]. Likewise, the DXplain general decision support system 
was made widely available to physicians through the AMA/NET in the 1980s [Barnett et al., 1987]. 

With the increasing dissemination of AIM systems came parallel efforts to test and evaluate their 
performance while exploring new ways of dealing with the difficult issues of reasoning in uncertain 
environments. 

An empirical approach to performance evaluation was taken with the AI/RHEUM system testing and 
resulted in the first system for refining a rule base using the accumulated experience of expertly solved 
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cases [Politakis and Weiss, 1984]. This system, called SEEK, employed performance heuristics to suggest 
potential improvements in the rules (generalizations and specializations), which could be tested by 
incorporation into the model and evaluation over the entire database of cases. In SEEK 2, this work was 
extended to generate automatically and test the potential improvements, selecting the most successful 
improvements for inclusion in an updated knowledge base [Ginsberg, 1986]. Related to these approaches 
is the basic problem of uncovering underlying causal relations through the analysis of time-oriented data, 
such as was carried out in the RX system [Blum, 1982]. During this period, causality became a major 
concern of researchers seeking strong theoretical underpinnings that would explain reasoning in complex 
diagnostic consultation models [Patil, 1986]. 

Formal Theories of Reasoning Under Uncertainty in Medicine 

Many different formal methods of inference were explored and applied in knowledge-based medical 
decision making. These included the application of multihypothesis Bayesian methods (Ben-Bassat et al., 
1980], of fuzzy logic as in the CADIAG family of systems [Addlassnig and Kolarz, 1982] and the SPHINX 
system [Fieschi et al., 1983] , or of the Dempster- Shaffer theory of evidence to structured medical prob- 
lems [Gordon and Shortliffe, 1985]. Reggia et al. [1983] devised set covering strategies to parsimoniously 
explain diagnostic hypotheses within a probabilistic framework [Peng and Reggia, 1987]. 

Two major interrelated problems afflicting the handling of uncertainty in clinical decision making 
were how to combine evidence from multiple related sources in a non-mutually exclusive multihypothesis 
inference problem and how to represent the flow of uncertainty among intermediate causal and hierar- 
chically related hypotheses. These problems had existed from the very begining of medical Al, and their 
formal treatment was avoided through the use of empirical, heuristic methods in the first generation of 
systems. Now researchers embarked on a more principled search for solutions. A probabilistic approach 
to propagating the effect of local changes in probabilistic networks was developed by Lauritzen and 
Spiegelhalter [1987]. The Dempster-Shaffer theory provided an alternative formalism for incorporating 
partial belief information into a complex network of hypotheses. However, in practice, it often proved 
computationally expensive or might produce very broad and uninformative confidence bounds on 
hypotheses at the end of long reasoning chains. In an attempt to provide a link between the semantics 
of hypotheses related at different levels of causation and abstraction, Pearl [1987] developed methods 
for propagating uncertainty within a structured Bayesian framework of statistical decision making. The 
more systematic application of these techniques and their implications for specific medical reasoning 
problems continues [Pearl, 1995]. 

The revival of interest in connectionist, neural network methods for learning and decision making 
was strongly stimulated around this time by the introduction of backpropagation methods [Rumelhard 
and McClelland, 1986]. These used differentiable nonlinear threshold functions that enabled the learning 
of much more complex decision rules from data than had been possible with the simpler perceptron 
methods of the 1960s. As researchers and practitioners came to realize that the expert systems and 
knowledge-engineering approaches required large amounts of investment by the most costly and indis- 
pensable consultants in the specialty being modeled, efforts again began to emphasize methods that 
learned directly from data through artificial neural networks (ANNs), as well as simpler but more 
explainable structures, such as decision trees [Breiman et al., 1984]. 

181.5 The Past Decade — Structure, Formalism, 
and Empiricism 

Deep Medical Knowledge for Representation and Problem Solving 

The trends described above have continued since 1987 and throughout the present decade as researchers 
continued to work on representations for the deeper medical knowledge that could help explain and 
justify decision making. One approach has continued to emphasize the centrality of causal reasoning and 
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simulation in providing knowledge-based explanations for the physiologic underpinnings of medical 
reasoning [Kuipers, 1985; Long, 1987, 1991; Hunter and Kirby, 1991; Widman, 1992]. 

More generally, Newell's knowledge-level approach [Newell, 1982] became the inspiration for the 
second-generation of expert systems, which emphasized the multilevel nature of modeling problem- 
solving and related interacting knowledge structures [Chandrasekaran and Johnson, 1993]. 

Applications of Newell's SOAR architecture [Smith and Johnson, 1993] enabled a very general type of 
problem-solving mechanism (chunking) to roam over flexible spaces of goals, tasks, and specific knowl- 
edge structures, thereby providing a common framework for reasoning about decisions in individual 
cases or with groups of cases, as for learning rules or other changes to the knowledge base. In this, as in 
the KADS approach [Wielinga et al., 1992], expertise is viewed as a dynamic process that cannot be 
captured in strictly static knowledge structures but rather evolves as part of an ongoing interactive 
modeling of systems, agents, problems, and environments [Clancey, 1989]. A discussion of how this 
problem-solving cognitive approach affects medical decision making can be found in Kleinmuntz [1991]. 
Meanwhile, many of the lessons learned from earlier prototypes were applied to more sophisticated 
knowledge-based explanation and system-building schemes [Bylander et al., 1993; Koton, 1993; Swartout 
and Moore, 1993; Lanzola et al., 1995]. Relationships between abductive reasoning and temporal reason- 
ing were investigated [Console and Torasso et al., 1991], and an epistemologic framework for structuring 
medical problem solving stated [Ramoni et al., 1992]. It is interesting to note that a related cognitive 
approach to eliciting and structuring medical knowledge grew up independently in the former Soviet 
Union through the work of Gelfand and collaborators [Gelfand et al., 1987; Gelfand, 1989]. 

Formal Decision-Making Methods and Empirical Systems 

Formal decision-making approaches to medical reasoning took off in the 1980s with the founding of the 
journal, Medical Decision Making, which involved physicians as well as decision scientists in the process 
of understanding how formal statistical and logical methods can apply to the modeling and interpretation 
of medical problems. This renewed interest in the modeling of reasoning under risk and uncertainty was 
motivated by the twin needs of evaluating decision-support systems and of modeling more general 
epidemiologic effects of medical interventions, with and without computer-based systems being involved. 
While not directly related to knowledge-based Al approaches, this trend nevertheless interacted with 
developments in operations research (OR) and Al through the confluence of the influence diagram 
representation of abstract states for describing processes [Shachter, 1986] and belief network represen- 
tations for reasoning with causal structured hypothesis networks [Pearl, 1987]. Together, they helped 
make Bayesian belief networks a practical tool for modeling many of the dynamic aspects of clinical 
cognition [Cowell et al., 1991] and provided a combined statistical- structural representation for decision- 
making preferences in computer-based systems [Lehman and Shortliffe, 1990; Farr and Shachter, 1992; 
Heckerman and Shortliffe, 1992; Jimison et al., 1992; Neapolitan, 1993]. The related use of the decision 
theoretic approach to integrate knowledge-based and judgmental elements has also been an important 
development in assisting the evaluation of systems [Cooper, 1993] and knowledge-based tools for spec- 
ifying large expert system models [Olesen and Andreassen, 1993, Lazola et al., 1995] and building decision 
models are also becoming more prevalent [Sonneberg et al., 1994]. In addition, tools from meta-analysis 
have begun to be applied to assess conflicting results from related though different clinical studies 
[Littenberg and Moses, 1993] and merge diagnostic accuracy results [Midgette et al., 1993]. The temporal 
nature of medical processes and data has also led to some initial models for decision making that make 
temporal dependencies explicit in their representation [Kahn, 1991; VanBeek, 1991; Hazen, 1992; Cousins 
et al., 1993, Shuval and Musen, 1993]. Practical systems in the ICU or other monitoring situations 
typically have to use a temporal representation in the context of online expert systems [Schecke et al., 
1991; Mora et al, 1993; Rutledge et al., 1993; Lehmann et al, 1994, Becker et al, 1997]. Generalized 
representations of temporal knowledge that enable reasoning with abstractions of event sequences linked 
to ontologies of problem solving have been developed for a wide variety of medical problems where the 
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time course of an illness is critical [Haimowitz and Kohane, 1995, Long, 1996, Keravnou, 1996, Shahar 
and Mussen, 1996]. 

With more sophisticated methods for learning in artificial neural networks (ANNs), the last decade 
has seen a concomitant increase in their application to the modeling and learning of medical decisions 
[Barreto and DeAzevedo, 1993; Cherkassy and Lari-Najafi, 1992; Cho and Reggia, 1993; Forsstrom et al., 
1991a; Hudson et al, 1991; Sittig and Orr, 1992, Dorfnerr and Porenta, 1994, Stevens et al., 1996]. Logic- 
based methods have also seen some popularity in structuring and building knowledge bases in medicine, 
particularly in Europe and Japan [Fox et al., 1990; Krause et al., 1993; Lucas, 1993]. Fuzzy logic methods 
have also shown their usefulness in a number of monitoring and image interpretation tasks [Becker et al., 
1997, Park et al., 1998]. 

The evaluation of expert systems and decision support systems remains an important ongoing research 
and practical problem [Miller, 1986; Potoff et al, 1988; Willems et al, 1991; Bankowitz et al, 1992; Nohr, 
1994] to which a variety of machine learning techniques can be applied for the performance assessment 
[Weiss and Kulikowski, 1991], knowledge base refinement [Widmer et al., 1993], and the retrospective 
analysis of results [Forsstrom et al., 1991b] 

181.6 Prospects for Al in Medicine — Problems 
and Challenges 

Usefulness of Al Decision Support Systems 

As we approach the end of the 1990s, methodological developments in both the representation of 
knowledge and inferencing methods in medicine, however impressive in terms of their improvements 
over those of 20 or 30 years ago, still have not resulted in systems that are routinely used in the clinic. 
Reasons given for this usually include limitations in scope or ability to customize advice, lack of trust in 
automated advising systems, inadequate timeliness in critical situations, perception of the computer as 
a competitor, and most importantly, that decision support systems have yet to become indispensable 
adjuncts in medical practice situations. Problems and opportunities for medical Al in the 1990s have 
been identified and discussed with varying degrees of optimism or pessimism, depending on the author 
[Shortliffe, 1993; Uckun, 1993; Coeira, 1996; Kulikowski, 1996]. It has been suggested that the emphasis 
on decision support is misguided, and should be redirected towards medical education which would 
empower health care workers [Lillehaug and Lajoie, 1997]. This would tap into the developing theories 
of clinical cognition and the use of clinical simulations to train and enhance learning at various levels 
of expertise [Evans and Patel, 1989; Patel et al., 1995]. Some support for such a position comes from a 
comparative study of four of the most widely disseminated diagnostic decision support systems [Berner 
et al., 1994], which resulted in an editorial comment to the effect that their overall rating left much to 
be desired — a grade of C [Kassirer, 1994]. However, the systems reviewed were general diagnostic aids, 
which most physicians find less than indispensible given their own training and expertise. Another 
comparative study [Johnston et al., 1994] reported positive contributions to health care in three out of 
ten decision support systems reviewed. Most of the shortcomings came in systems that did not have to 
perform complex modeling or numerical computations, where computer-based assistance is often indis- 
pensable. Fields such as specialized laboratory test interpretation, treatment dosage planning and review, 
instrumentation monitoring and control, and multimodality imaging all suggest more essential domains 
for the application of Al methods [Adlassnig and Horak, 1995; Barahona, 1994; DeMelo et al., 1992; 
Kaye et al., 1995; Kulikowski et al., 1995; Menzies and Compton, 1997, Mora et al., 1993; Siregar and 
Sinteff, 1996, Taylor et al, 1997]. 

Opportunities: Knowledge-Bases, Systems Integration and the Web 

The great computing success story of the 1990s has been the World Wide Web, and the proliferation of 
web-based information services. The ease of designing good graphical user interfaces (GUIs) has resulted 
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in a concomitant ease of interaction with our networked machines. E-mail has become ubiquitous, and 
various standard data interchange formats across platforms have been developed in medicine as in other 
fields. This presents AI in medical decision-making with a number of challenges and opportunities. The 
first is to characterize and build the knowledge-bases that can help improve medical decision-making; 
the second is to integrate them into useful, working, indispensable clinical systems; the third is to validate, 
standardize, and share medical knowledge as widely and economically as is possible. Development of 
standard medical vocabularies and nomenclatures; coding schemes, and the Unified Medical Language 
System (UMLS) [Humphreys et al., 1995], are all essential to the standarization of medical knowledge 
representation. The availability of large digital image datasets has resulted in the creation of systems and 
knowledge bases for handling spatial information in anatomy [Brinkley, 1995; Hohne et al., 1995; Rosse 
et al., 1998], and has been accelerated by the widely disseminated multimodality image data from the 
Visible Human Project [Ackerman, 1995]. 

The re-usability of software is greatly facililtated by the development of ontologies for different types 
of knowledge and problem solving — a classical type of AI research which has received considerable new 
impetus from the needs of rapid knowledge transfer and scalability of web-based systems. In medicine, 
the PROTEGE system has become a model for such software re-use experiments [Tu et al., 1995], while 
other systems based on ontologies have also been developed [Smith et al., 1995, Muller et al., 1996]. 

Software integration for multiple uses is increasingly spurred by initiatives such as IAIMS [Stead, 
1997], and inclusion of useful decision support modules may evolve naturally as productive computing 
environments become more commonplace. One method for connecting clinical databases to knowledge- 
bases used in decision support that is gaining acceptance is the Arden syntax [Hripcsak et al., 1990]. 
Automatic extraction of clinical data from narrative reports is also progressing due to a combination of 
statistical and linguistic methods [Chute and Yang, 1995; Hripcsak et al., 1996]. Meanwhile, AI systems are 
being designed to provide patients with intelligent access to medical information [Buchanan et al., 1995]. 

The present situation, then, can be characterized as one of consolidation in integrating a variety of 
methods and techniques for different types of medical decision tasks while at the same time involving 
considerable reassessment and change in terms of expectations about the imminent application of 
decision-making systems within health care environments. On the one hand, the forthcoming increase 
in networking of various knowledge sources, digital libraries, and multimedia information opens up a 
whole new set of opportunities for filtering knowledge to the overwhelmed practitioner, whether in 
medicine, law, engineering, or other professions. On the other hand, the hard issues of understanding 
the role of human judgment and responsibility for clinical decisions and actions when using increasingly 
complex computerized instrumentation and systems presents an ongoing challenge to researchers in 
medicine, biomedical engineering and computer science, medical informatics, and all other fields essential 
to the solution of these problems. 
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The potential of achieving a great deal of processing power by wiring together a large number of very 
simple and somewhat primitive devices has captured the imagination of scientists and engineers for many 
years. In recent years, the possibility of implementing such systems by means of electro-optical devices 
and in very large scale integrations has resulted in increased research activities. 

Artificial neural networks (ANNs) or simply neural networks (NNs) are made of interconnected devices 
called neurons (also called neurodes, nodes, neural units, or simply units). Loosely inspired by the makeup 
of the nervous system, these interconnected devices look at patterns of data and learn to classify them. 
NNs have been used in a wide variety of signal-processing and pattern-recognition applications and have 
been applied successfully in such diverse fields as speech processing [1-4], handwritten character recog- 
nition [5-7], time-series prediction [8-9], data compression [10], feature extraction [11], and pattern 
recognition in general [12]. Their attractiveness lies in the relative simplicity with which the networks can 
be designed for a specific problem along with their ability to perform nonlinear data processing. 

As the neuron is the building block of a brain, a neural unit is the building block of a neural network. 
Although the two are far from being the same, or from performing the same functions, they still possess 
similarities that are remarkably important. NNs consist of a large number of interconnected units that 
give them the ability to process information in a highly parallel way. The brain as well is a massively 
parallel machine, as it has been long recognized. AseachofthelO u neuronsof the human brain integrates 
incoming information from all other neurons directly or indirectly connected to it, an artificial neuron 
sums all inputs to it an creates an output that is carrying information to other neurons. The connection 
from one neuron's dendrites or cell body to another neuron's processes is called a synapse. The strength 
by which two neurons are influencing each other is called a synaptic weight. In an NN, all neurons are 
connected to all other neurons by synaptic weights that can have seemingly arbitrary values, but in reality, 
these weights show the effect of a stimulus on the neural network and the ability or lack of it to recognize 
that stimulus. 
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In the biological brain, two types of processes exist: static and dynamic. Static brain conditions are 
those which do not involve any memory processing, while dynamic processes involve memory processing 
and changes through time. Similarly, NNs can be distinguished into static and dynamic: the former being 
such that do not involve any previous memory and only depend on current inputs and the latter having 
memory and being able to be described by differential equations that express changes in the dynamics 
of the system through time. 

All NNs have certain architectures, and all consist of several layers of neuronal arrangements. The 
most widely used architecture is that of the perceptron first described in 1958 by Rosenblatt [13]. In the 
sections that follow we will build on this architecture, but not necessarily on the original assumptions 
of Rosenblatt. 

Since there are many names in the literature that express the same thing and usually create a lot of 
confusion for the reader, we will define the terms to be used and use them throughout the chapter. 
Terminology is a big concern for those involved in the field and for organizations such as IEEE. A standards 
committee has been formed to address issues such as nomenclature and paradigms. In this chapter, 
whenever possible, we will try to conform to the terms and definitions already in existence. 

Some methods for training and testing of NNs will be described in detail, although many others will 
be left out due to lack of space, but references will be provided for the interested reader. A small number 
of applications will be given as examples, since many more are discussed in another chapter of this 
Handbook and it would be redundant to repeat them here. 

182.1 Definitions 

Neural nets (NNs) go by many other names, such as connectionist models, neuromorphic systems, and 
parallel distributed systems, as well as artificial NNs to distinguish them from the biologic ones. They 
contain many densely interconnected elements called neurons or nodes, which are nothing more than 
computational elements nonlinear in nature. A single node acts like an integrator of its weighted inputs. 
Once the result is found, it is passed to other nodes via connections that are called synapses. Each node 
is characterized by a parameter that is call threshold or offset and by the kind of nonlinearity through 
which the sum of all the inputs is passed. Typical nonlinearities are the hardlimiter, the ramp (threshold 
logic element), and the widely used sigmoid. 

The simplest NN is the single-layer perceptron [13, 14], and it is a simple net that can decide whether 
an input belongs to one of two possible classes. Figure 182.1 is a schematic representation of a perceptron 
the output of which is passed through a nonlinearity called an activation function. This activation function 
is of different types, the most popular being a sigmoidal logistic function. 




activation function 
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FIGURE 182.1 Artificial neuron. 
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FIGURE 182.2 Typical activation functions. 

Figure 182.2 is a schematic representation of some activation functions such as the handlimiter (or 
step), the threshold logic (or ramp), a linear, and a sigmoid. The neuron of Fig. 182.1 receives many 
inputs I { each weighted by a weight W, (z = 1,2,..., N). These inputs are then summed. The sum is then 
passed through the activation function f, and an output y is calculated only if a certain threshold is 
exceeded. Complex artificial neurons may include temporal dependencies and more complex mathemat- 
ical operations than summation [15]. While each node has a simple function, their combined behavior 
becomes remarkably complex when organized in a highly parallel manner. 

NNs are specified by their processing element characteristics, the network topology, and the training 
or learning rules they follow in order to adapt the weights W ; . Network topology falls into two broad 
classes: feedforward (nonrecursive) and feedback (recursive) [16]. Nonrecursive NNs offer the advantage 
of simplicity of implementation and analysis. For static mappings, a nonrecursive network is all one 
needs to specify any static condition. Adding feedback expands the network's range of behavior, since 
now its output depends on both the current input and network states. But one has to pay a price: longer 
times for teaching the NN to recognize its inputs. 

Obviously, the NN of Fig. 182.1 is quite simple and inadequate is solving real problems. A multilayer 
perceptron (MLP) is the next choice. A number of inputs are now connected to a number of nodes at a 
second layer called the hidden layer. The outputs of the second layer may connect to a third layer and so 
on, until they connect to the output layer. In this representation, every input is connected to every node 
in the next layer, and the outputs of one hidden layer are connected to the nodes of the next hidden 
layer, and so on. Figure 182.3 depicts a three-layer feedforward network, the simplest MLP. 

The optimal number of hidden neurons needed to perform arbitrary mapping is yet to be determined. 
Methods used in practice are mainly intuitive determination or are found by trial and error. Recent work 
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FIGURE 182.3 Three-layer feedforward network. 
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has attempted to formulate bounds for the number of hidden nodes needed [ 17] . Mathematical derivation 
proves that a bound exists on the number of hidden nodes m needed to map a fc-element input set. The 
formulation is that m = k - 1 is an upper bound. These results are consistent with the optimal number 
of hidden neurons, determined empirically in Kung et al. [18]. In other works, the number of hidden 
nodes necessary has been proposed to be a function of the number of the separable regions needed as 
well as the dimension of the input vector [19]. 

Artificial NNs usually operate in one of two modes. Initially, there exists a training phase where the 
interconnection strengths are adjusted until the network has a desired output. Only after training does 
the network become operational, i.e., capable of performing the task it was designed and trained to do. 
The training phase can be either supervised or unsupervised. In supervised learning, there exists infor- 
mation about the correct or desired output for each input training pattern presented [20]. The original 
perceptron and backpropagation are examples of supervised learning. In this type of learning, the NN 
is trained on a training set consisting of vector pairs. One of these vectors is used as input to the network; 
the other is used as the desired or target output. During training, the weights of the NN are adjusted in 
such a way as to minimize the error between the target and the computed output of the network. This 
process might take a large number of iterations to converge, especially because some training algorithms 
(such as backpropagation) might converge to local minima instead of the global one. If the training 
process is successful, the network is capable of performing the desired mapping. 

In unsupervised learning, no a priori information exists, and training is based only on the properties 
of the patterns. Sometimes it is also called self- organization [20]. Training depends on statistical regular- 
ities that the network extracts from the training set and represents them as weight values. Applications 
of unsupervised learning have been limited. However, hybrid systems of unsupervised learning combined 
with other techniques produce useful results [21-23]. Unsupervised learning is highly dependent on the 
training data, and information about the proper classification is often lacking [21]. For this reason, most 
NN training is supervised. 

182.2 Training Algorithms 

After McCulloch and Pitts [24] demosntrated, in 1943, the computational power of neuron-like networks, 
much effort was given to developing networks that could learn. In 1949, Donald Hebb proposed the 
strengthening of connections between presynaptic and postsynaptic units when both were active simul- 
taneously [25]. This idea of modifying the connection weights, as a method of learning, is present in 
most learning models used today. The next major advancement in neural networks was by Frank Rosen- 
blatt [13, 14]. In 1960, Windrow and Hoff proposed a model, called the adaptive linear element (ADA- 
LINE), which learns by modifying variable connection strengths, minimizing the square of the error in 
successive iterations [26]. This error-connection scheme is now known as the least mean square (LMS) 
algorithm, and it has found widespread use in digital signal processing. 

There was great interest in NN computation until Minsky and Papert published a book in 1969 
criticizing the perceptron. This book contained a mathematical analysis of perceptron-like networks, 
pointing out many of their limitations. It was shown that the single-layer perceptron was incapable of 
performing the XOR mapping. The single-layer perceptron was severely limited in its capabilities. For 
linear activation functions, multilayer networks were no different from single-layer models. Minsky and 
Papert pointed out that multilayer networks with nonlinear activation functions could perform complex 
mappings. However, the lack of any training algorithms for multiple-layer networks made their use 
impossible. It was not until the discovery of multilayer learning algorithms that interest in NNs resurfaced. 
The most widely used training algorithm is the back-propagation algorithm, as already mentioned. 

Another algorithm used for multilayer perceptron training is the ALOPEX algorithm. ALOPEX was 
originally used for visual receptive field mapping by Tzanakou and Harth in 1973 [28-30] and has since 
been applied to a wide variety of optimization problems. These two algorithms are described in detail 
in the next subsections. 
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Backpropagation Algorithm 

The backpropagation algorithm is a learning scheme where the error is backpropagated layer by layer 
and used to update the weights. The algorithm is a gradient-descent method that minimizes the error 
between the desired outputs and the actual outputs calculated by the MLR Let 



N 

-V(r-Y;) 2 (182.1) 



E=- 



be the error associated with template p. N is the number of output neurons in the MLP, T { is the target 
or desired output for neuron i, and Y, is the output of neuron i calculated by the MLP. Let E = Z£ be 
the total measure of error. The gradient-descent method updates an arbitrary weight w in the network 
by the following rule: 



w(n+l) = w(n) + Aw(n) (182.2) 



-\ j-, 

where AJnU-i] — ^r (182.3) 

3 win I 

where n denotes the iteration number, and r| is a scaling constant. Thus the gradient-descent method 
requires the calculation of the derivatives dE/[dw{n)] for each weight w in the network. For an arbitrary 
hidden-layer neuron, its output H is a nonlinear function /of the weighted sum of all its inputs (net): 



, = f( net ,) ( 182 - 4 ) 



H 

where /is the activation function. The most commonly used activation function is the sigmoid function 
given by 



v ' 1 + e 



(182.5) 



Using the chain rule, we can write 



dE _ dE dnetj 
dw.. dnet. dw- 



(182.6) 



and since 



we have 
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and thus Eq. (182.6) becomes 

dE dE 



I. (182.9) 

dw.. dnet. 
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dE _y^ dE dnet k dH j 
dnet T-^dnet, dH. dnet. 



(182.10) 



Recalling that 
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it follows the 
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Also, } -= f'lnet.). (182.13) 



Therefore, =f'[net ! )J w jk . (182.14) 



Assuming /to be the sigmoid function of Eq. (182.5), then 

/'(«rf.)=7.(l-F). (182.15) 

Equation (182.14) gives the unique relation that allows the backpropagation of the error to all hidden 
layers. For the output layer, 
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f'lnet) (182.16) 
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-(t-7.). (182.17) 



In summary, then, first, the output Y, for all the neurons in the network is calculated. The error 
derivative needed for the gradient-descent update rule of Eq. (182.2) is calculated from 

te = d^dne L _ (18218) 

dw dnet dw 
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If j is an output neuron, then 

-(t.-Y^Y.Ii-y). (182.19) 



dE 

dnet ■ 




If j is a hidden neuron, then the error derivative is backpropagated by using Eqs. (182.14) and (182.15). 
Substituting, we get 



(182.20) 



Finally, the weights are updated as in Eq. (182.2). 

There are many modifications to the basic algorithm that have been proposed to speed the convergence 
of the system. Convergence is defined as a reduction in the overall error between a minimum threshold. 
It is the point at which the network is said to be fully trained. One method [31] used is the inclusion of 
a momentum term in the updated equation such that 

dE 



v(n+l) = w(n)-r\ — j-^ + aAwin) (182.21) 

dwin\ 



r\ is the learning rate and is taken to be 0.25. a is a constant momentum term that determines the effect 
of past weight changes on the direction of current weight movements. 

Another approach used to speed the convergence of backpropagation is the introduction of random 
noise [32]. It has been shown that while inaccuracies resulting from digital quantization are detrimental 
to the algorithm's convergence, analog perturbations actually help improve convergence time. 

One of these variations is the modification by Fahlman [33], called the quickprop, that uses second- 
derivative information without calculating the hessian needed in the straight backpropagation algorithm. 
It requires saving a copy of the previous gradient vector, as well as the previous weight change. Compu- 
tation of the weight changes use only information associated with the weight being updated: 

S7w i,( n ) 

Vtc. in— II — Vw.i n) 

Aw = ^ / ^^ (182.22) 

AwAn-l) 

where Vw-iri) is the gradient vector component associated with the weight w t j at iteration n. This 
algorithm assumes that the error surface is parabolic, concave upward around the minimum, and that 
the slope change of the weight VwJn) is independent of all other changes in weights. There are obviously 
problems with these assumptions, but Fahlman suggests a "maximum growth factor" u. in order to limit 
the rate of increase of the step size, namely, that if AwJn) > [lAwJn - 1), then AwJri) = \xAwJn - 1). 
Fahlman also used a hyperbolic arctangent function to the output error associated with each neuron in 
the output layer. This function is almost linear for small errors, but it blows up for large positive or large 
negative errors. Quickprop is an attempt to reduce the number of iterations needed by straight back- 
propagation, and it succeeded doing so by a factor of 5, but this factor is problem-dependent. This 
method also requires several trials before the parameters are set to acceptable values. 

Backpropagation has achieved widespread use as a training algorithm for NNs. Its ability to train 
multilayer networks has led to resurgence of interest in the field. Backpropagation has been used 
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successfully in applications as adaptive control of dynamic systems and in many general NN applications. 
Dynamic systems require monitoring of time in ways that monitor the past. In fact, the biological brain 
performs in an admirable way just because it has access to and uses values of different variables from 
previous instances. Backpropagation through time is another extension of the original algorithm pro- 
posed by Werbos in 1990 [34] and has been applied previously in the "truck backer-upper" by Nguyen 
and Widrow [35] . In this problem, a sequence of decisions must be made without an immediate indication 
of how effective these steps are. No indication of performance exists until the truck hits the wall. 
Backpropagation through time solves the problem, but it has its own inadequacies and performance 
difficulties. Despite its tremendous effect on NNs, the algorithm is not without its problems. Some of 
the problems have been discussed above. In addition, the complexity of the algorithm makes hardware 
implementations of it very difficult. 

The ALOPEX Algorithm 

The ALOPEX process is an optimization procedure that has been demonstrated successfully in a wide 
variety of applications. Originally developed for receptive field mapping in the visual pathway of frogs, 
ALOPEX's usefulness and flexible form have increased the scope of its applications to a wide range of 
optimization problems. Since its development by Tzanakou and Harth in 1973 [28], ALOPEX has been 
applied to real-time noise reduction [36], pattern recognition [37], adaptive control systems [38], and 
multilayer NN training, to name a few. 

Optimization procedures, in general, attempt to maximize or 
minimize a function F(). The function F() is called the cost func- 
tion, and its value depends on many parameters or variables. 
When the number of parameters is large, finding the set (x l x 2 . . . 
x N ) that corresponds to the optimal (maximal or minimal) solu- 
tion is exceedingly difficult. If N were small, then one could 
perform an exhaustive search of the entire parameter space in 

order to find the "best" solution. As N increases, intelligent algorithms are needed to quickly locate the 
solution. Only an exhaustive search can guarantee that a global optimum will be found; however, near- 
optimal solutions are acceptable because of the tremendous speed improvement over exhaustive search 
methods. ALOPEX iteratively updates all parameters simultaneously based on the cross-correlation of 
local changes AX, and on the global response change AR, plus an additive noise. The cross-correlation 
term AX ; A_R helps the process move in a direction that improves the response. Table 182.1 shows how 
this can be used to find a global maximum of R. 

All parameters X, are changed simultaneously at each iteration according to 
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W„) = x(n-l) + YAx(n)A£(n)+rln) (182.23) 



The basic concept is that this cross-correlation provides a direction of movement for the next iteration. 
For example, take the case where X, i and _RT. This means that the parameter X, decreased in the previous 
iteration, and the response increased for that iteration. The product AX,A_R is a negative number, and 
thus X, would be decreased again in the next iteration. This makes perfect sense because a decrease in 
X, produced a higher response; if you are looking for the global maximum, then X, should be decreased 
again. Once X, is decreased and _R also decreases, then AX,A_R is now positive, and X, increases. 

These movements are only tendencies, since the process includes a random component that will act 
to move the weights unpredictably, avoiding local extrema of the response. The stochastic element of the 
algorithm helps it to avoid local extrema at the expense of a slightly longer convergence or learning period. 

The general ALOPEX updating equation (Eq. 182.23) is explained as follows: X,(n) are the parameters 
to be updated, n is the iteration number, and R() is the cost function, of which the "best" solution in 
terms of X, is sought. Gamma (y) is a scaling constant, and r,{n) is a random number from a Gaussian 
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distribution whose mean and standard deviation are to find the "best" solution. As N increases, intelligent 
algorithms are needed to quickly locate the solution. Only an exhaustive search can guarantee that a 
global optimum is found; however, near-optimal solutions are acceptable because of the tremendous 
speed improvement over exhaustive search methods. 

Backpropagation described earlier, being a gradient-descent method, often gets stuck in local extrema 
of the cost function. The local stopping points often represent unsatisfactory convergence points. Tech- 
niques have been developed to avoid the problem of local extrema, with simulated annealing [39] being 
the most common. Simulated annealing incorporates random noise, which acts to dislodge the process 
from local extrema. Crucial to the convergence of the process is that the random noise be reduced as the 
system approaches the global optimum. If the noise is too large, the system will never converge and can 
be mistakenly dislodged from the global solution. ALOPEX is another process that incorporates a sto- 
chastic element to avoid local extremes in search of the global optimum of the cost function. The cost 
function or response is problem-dependent and is generally a function of a large number of parameters. 
ALOPEX iteratively updates all parameters simultaneously based varied, and AX i (ri) and AR{n) are found 
by 

AX.ln\= xin-l)- X.(n-2\ (182.24) 

As(n)= Rin-l)- dn-2\ (182.25) 

The calculation of R() is problem-dependent and can be easily modified to fit many applications. This 
flexibility was demonstrated in the early studies of Harth and Tzanakou [29] . In mapping receptive fields, 
no a priori knowledge or assumptions were made about the calculation of the cost function; instead, a 
"response" was measured. By using action potentials as a measure of the response [28, 29, 40, 41], 
receptive fields could be determined by using the ALOPEX process to iteratively modify the stimulus 
pattern until it produced the largest response. 

It should be stated that due to its stochastic nature, efficient convergence depends on the proper control 
of both the additive noise and the gain factor y. Initially, all parameters X t are random, and the additive 
noise has a gaussian distribution with mean and standard deviation a initially large. The standard 
deviation a decreases as the process converges to ensure a stable stopping point. Conversely, y increases 
with iterations. As the process converges A_R becomes smaller and smaller, and an increase in y is needed 
to compensate for this. 

Additional constraints include a maximal change permitted for X, for one iteration. This bounded 
step size prevents the algorithm from drastic changes from one iteration tot he next. These drastic changes 
often lead to long periods of oscillation, during which the algorithm fails to converge. 

Two-Dimensional Template Matching with ALOPEX 

In this application, ALOPEX is used in template matching. A pattern is compared with a list of j templates, 
and for each template j, a response R: is calculated. ALOPEX then iteratively modifies the initial pattern 
according to the global response calculation, local changes in the pattern, and an additive noise element. 
The updating equations are as described in the preceding subsection (Eqs. 182.23 to 182.25). An expla- 
nation of the response calculation used in template matching follows X ; (w) are the parameters to be 
optimized, and cp ; are the stored templates; j is the number of the templates. 
The response, in this case, is given by 



R h n )=Y, x { n h (182 - 26) 
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R^ = CR'(n) (182.27) 

where R'(«) = Max R.(n) . (182.28) 

Other methods [40] of calculating R(n) include making it equal to a weighted sum of all the _R ( : 

M 

R'(n)=y\Rln)w{n) (182.29) 

,(„) = ^A^. (182 . 30 ) 



where W. 
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The weighting of R: by W, and summing tends to slow the convergence of the algorithm, and hence 
Eq. (182.28) is used to select R{n) from all Rj{n). 

Using the simple response calculation in Eq. (182.26) for a pixel-by-pixel multiplication between the 
pattern and template ;', good results were obtained for binary (two-valued) images. The algorithm has 
certain constraints that dictate how it finds the maximum response. However, when multivalued templates 
were used, there was no longer proper convergence. 

Using a response calculation based on pixel-by-pixel correlation is not suited for intermediate-valued 
optimization. In binary-valued optimization, the tendency to lump the maximum allowed luminance 
into a single pixel does not exist; therefore, the problem can go undetected. The basic problem is that in 
supervised learning the ideal image or solution should have maximal response, and for the example 
shown above, this was not the case. 

To solve this problem, a new response calculation had to be implemented. Understanding the previous 
problem and goal of any optimal response calculation, a new function was used. The new response 
function given by 



R J (n) = Y^-( X ,- ( ?, J f+ ( ?i J (182.31) 



defines a downward-facing parabola with its apex (or maximum) occurring only when X = (p. This is 
basically a least square error calculation that has been inverted so as to produce a maximum instead of 
a minimum. Also, the maximum contribution for each pixel is exactly cp; this is important because now 
the maximal response of R(ri) is simply equal to the total luminance. R(n) = CR (n) and R'(n) = 
Max[i? («)] are the same as before. Using the new response calculation, much better convergence was 
achieved. 

Multilayer Perceptron Network Training by ALOPEX 

An MLP also can be trained for pattern recognition using ALOPEX. A response is calculated for the jth 
input pattern based on the observed and desired output 



R ,( n )= & "-h b i n )-°t es 



(182.32) 
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where O k s and Of es are vectors corresponding to O k for all k. The total response for iteration n is the 
sum of all the individual template responses R(n): 



m 

R { n ) = Yi R i{ n ) (182.33) 



1=1 



In Eq. (182.33), m is the number of templates used as inputs. ALOPEX iteratively updates the weights 
using both the global response information and local weight histories, according to the following: 



W^nj^nj + yAW^nJAi^nj+W^n-l) (182.34) 

wjn) = r{n) + yAW jk U)AR(n) + W ik (n-l) (182.35) 

where y is an arbitrary scaling factor, r,(«) is an additive gaussian noise, AW represents the local weight 
change, and A_R represents the global response information. These values are calculated by 

Awin)= wln-\\-wln-2\ (182.36) 

AW jk (n) = W jk (n-l)-W jk (n-2) (182.37) 

AR.ln)=ldn-l\-ldn-2\. (182.38) 



and 



Besides its universality to a wide variety of optimization procedures, the nature of the ALOPEX algorithm 
makes it suitable for VLSI implementation. ALOPEX is a biologically influenced optimization procedure 
that uses a single-value global response feedback to guide weight movements toward their optimum. 
This single-value feedback, as opposed to the extensive error-propagation schemes of other NN training 
algorithms, makes ALOPEX suitable for fast VLSI implementation. 

182.3 VLSI Applications of Neural Networks 

Much debate exists as to whether digital or analog VLSI design is better suited for NN applications. In 
general, digital designs are an easier to implement and better-understood methodology. In digital designs, 
computational accuracy is only limited by the chosen word length. While analog VLSI circuits are less 
accurate, they are smaller, faster, and consume less power than digital circuits [42]. For these reasons, 
applications that do not require great computational accuracy are dominated by analog designs. 

Learning algorithms, especially backpropagation, require high precision and accuracy in modifying 
the weights of the network. This has led some to believe that analog circuits are not well suited for 
implementing learning algorithms [43 ] . Analog circuits can achieve high precision, at the cost of increas- 
ing the circuit size. Analog circuits with high precision (8 bits) tend to be equally large as their digital 
counterpart [44]. Thus high-precision analog circuits lose their size advantage over digital circuits. Analog 
circuits are of greater interest in applications requiring only moderate precision. 

Early studies show that analog circuits can realize learning algorithms, provided that the algorithm is 
tolerant to hardware imperfections such as low precision and inherent noise. In a paper by Macq et al. 
[45], a full analog implementation of a Kohonen map [20], one type of neural network, with on-chip 
learning is presented. With analog circuits having been shown capable of the computational accuracy 
necessary for weight modification, they should continue to be the choice of NN research. 
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Size, speed, and power consumption are areas where analog circuits are far superior to digital circuits, 
and it is these areas that constrain most NN applications. To achieve greater network performance, the 
size of the network must be increased. The ability to implement larger, faster networks is the major 
motivation for hardware implementation, and analog circuits are superior in these areas. Power con- 
sumption is also of major concern as networks become larger [46]. As the number of transistors per chip 
increases, power consumption becomes a major limitation. Analog circuits dissipate less power than 
digital circuits, thus permitting larger implementations. 

One of the leaders in the field of analog VLSI neural systems is Carver Mead, who has presented a 
methodology for implementing biologically inspired networks in analog VLSI [47]. A silicon retina 
modeled after the biological retina is constructed from simple analog circuits. These simple analog circuits 
function as building blocks and have been used in many other designs. Some of Mead's other work 
includes the design of a tracker [48], a sensorimotor integration system, capable of tracking a bright spot 
of light. An artificial cochlea for use in auditory localization also has been designed in analog VLSI [49] . 

Other methodologies of VLSI design of NNs include an analog/digital hybrid approach using pulse- 
stream networks. Pulse-stream encoding has been used in communication systems for many years and 
was first reported in the context of NNs by Murray and Smith in 1987 [50] . NN pulse-stream techniques 
attempt to combine the size and speed efficiency of analog circuits with the accuracy of digital circuits [51]. 

Recently, a digital VLSI approach to implementing the ALOPEX algorithm was undertaken by Pandya 
et al. [52] . Results of their study indicated that ALOPEX could be implemented using a single-instruction 
multiple-data (SIMD) architecture. A simulation of the design was carried out, in software, and good 
convergence for a 4 x 4 processor array was demonstrated. 

In our laboratory, an analog VLSI chip has been designed to implement the ALOPEX algorithm by 
Zanher et al. [53]. By making full use of the algorithm's tolerance to noise, an analog design was chosen. 
As discussed earlier, analog designs offer larger and faster implementations than do digital designs. 

182.4 Applications in Biomedical Engineering 
Expert Systems and Neural Networks 

Computer-based diagnosis is an increasingly used method that tries to improve the quality of health 
care. Systems that depend on artificial intelligence (Al), such as knowledge-based systems or expert 
systems, as well as hybrid systems, such as the above combined with other techniques, like NNs, are 
coming into play. Systems of this sort have been developed extensively in the last 10 years with the hope 
that medical diagnosis and therefore medical care would improve dramatically. Hatziglygeroudis et al. 
[54] are developing such a system with three main components: a user interface, a database management 
system, and an expert system for the diagnosis of bone diseases. Each rule of the knowledge representation 
part is an ADALINE unit that has as inputs the conditions of the rule. Each condition is assigned a 
significance factor corresponding to the weight of the input to the Adaline unit, and each rule is assigned 
a number, called a bias factor, that corresponds to the weight of the bias input of the unit. The output 
is calculated as the weighted sum of the inputs filtered by a threshold function. 

Hudson et al. [55] developed an NN for symbolic processing. The network has four layers. A separate 
decision function is used for layer three and a threshold for each node in the same layer. If the value of 
the decision function exceeds the corresponding threshold value, a certain symbol is produced. If the 
value of the decision function does not exceed the threshold, then a different symbol is produced. The 
so generated symbols of adjacent nodes are combined at layer four according to a well-structured 
grammar. A grammar provides the rules by which these symbols are combined [56]. The addition of a 
symbolic processing layer enhances the NN in a number of ways. It is, for instance, possible to supplement 
a network that is purely diagnostic with a level that recommends further actions or to add additional 
connections or nodes in order to more closely simulate the nervous system. 
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With increasing network complexity, parameter variance increases, and the network prediction becomes 
less reliable. This difficulty can be overcome if some prior knowledge can be incorporated into the NN to 
bias it [57]. In medical applications in particular, rules can either be given by experts or can be extracted 
from existing solutions to the problems. In many cases the network is required to make reasonable 
predictions before it has gone through any sufficient training data, relying only on a priori knowledge. 
The better this knowledge is initially, the better is the performance and the shorter is the training [58, 59]. 

Applications in Mammography 

One of the leading causes of death of women in the United States is breast cancer. Mammography has 
been proved to be an effective diagnostic procedure for early detection of breast cancer. An important 
sign in its detection is the identification of microcalcifications of mammograms, especially when they 
form clusters. Chan et al. [60] have developed a computer-aided diagnosis (CAD) scheme based on 
filtering and feature-extracting methods. In order to improve on the false-positive results, Zhang et al. 
[61] applied an artificial NN that is shift-invariant. They evaluated the performance of the NN by the 
"jack-knife" method [62] and receiver operating characteristic analysis [63, 64]. A shift-invariant NN is 
a feedforward NN with local, spatially invariant interconnections similar to those of the neocognition 
[65] but without the lateral interconnections. Backpropagation also was used for training for individual 
microcalcifications, and a cross-validation technique also was used in order to avoid overtraining. In this 
technique, the data set is divided into two sets, one used for training and the other for validating the 
predetermined intervals. The training of the network is terminated just before the performance of the 
network for the validating set decreases. The shift-invariant NN was proven to be much better in dropping 
the false-positive classifications by almost 55% over previously used NNs. 

In another study, Zheng et al. [66] used a multistage NN for detection of microcalcification clusters 
with almost 100% success and only one false-positive result per image. The multistate NN consists of 
more than one NN connected in series. The first stage is called the detail network, with inputs the pixel 
values of the original image, while the second network, the feature network, gets as inputs the output 
from the first stage and a set of features extracted from the original image. This approach has higher 
sensitivity of classifications and a lower false-positive detection than the previous reports. 

Another approach was used by Floyd et al. [67], where radiologists read the mammograms and come 
up with a list of eight findings, which were used as features for an NN. The results from biopsies were 
taken as the truth of diagnosis. For indeterminate cases, as classified by radiologists, the NN had a 
performance index of 0.86, which is quite high. 

Downes [68] used similar techniques to identify stellate lesions. He used texture quantification via 
fractal- analysis methods instead of using the raw data. In mammograms, specific textures are usually 
indicative of malignancy. The method used for calculating the fractal dimension of digitized images was 
based on the relationship between the fractal dimension and the power spectral density. 

Giger et al. [69] aligned the mammograms of left and right breasts and used a subtraction technique 
to find initial candidate masses. Various features were then extracted and used in conjunction with NNs 
in order to reduce false-positive assessments resulting from bilateral subtraction. Receiver operating 
characteristic (ROC) analysis was applied to evaluate the output of the NN. The methods used were 
evaluated using pathologically confirmed cases. This scheme yielded a sensitivity of 95% at an average 
of 2.5 false-positive detections per image. 

Chromosome and Genetic Sequences Classification 

Several clinical disorders are related to chromosome abnormalities that are difficult to identify accurately 
and also classify the individual chromosome. Automated systems can greatly help the human capabilities 
in dealing with some of the problems involved. One way to deal with this problem is the use of NNs. 
Several studies have already been done toward enhancing the ability of an automated computerized 
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system to analyze chromosome identification [70]. One such study by Sweeney and Musavi [71] analyzes 
the metaphase of chromosome spreads employing probablistic NNs (PNNs), which have been used as 
alternatives to various classification problems. First introduced by Specht [72, 73], PNNs are combina- 
tions of a kernel-based estimator for estimation of probability densities and the Bayes rule for classification 
decision. The estimation with the highest value specifies the correct class. Thus training of PNNs means 
to find appropriate kernel functions, usually taken to be gaussian densities, and therefore, the problem 
is reduced to the selection of a scalar parameter, namely, the standard deviation, of the gaussian. A way 
to improve the accuracy of a PNN for chromosome classification is to use the knowledge that there can 
be a maximum of only two chromosomes assigned to each class. This knowledge can be easily incorpo- 
rated into the NN. Similar or better results were obtained to the classic backpropagation-trained NN. 

A hybrid symbolic/NN machine learning algorithm was introduced by Noordewier et al. [74] for the 
recognition of genetic sequences. The system uses a knowledge base of hierarchically structured rules to 
form an artificial NN in order to improve the knowledge base. They used this system in recognizing 
genes in DNA sequences. The learning curve of this system was compared with that of a randomly 
initialized, fully connected two-layer NN. The knowledge-based NN learned much faster than the other 
one, but the error of the randomly initialized NN was slightly lower (5.5% versus 6.4%). Methods also 
have been devised to investigate what the NN has learned by an automatic translation into symbolic rules 
of trained NN initialized by the knowledge-based method [75]. 

Medial axis transform (MAT) -based features as inputs to an NN have been used in studying human 
chromosome classification [76]. Prenatal analysis, genetic syndrome diagnosis, and others make this 
research very important. Human chromosome classification based on NN requires no a priori knowledge 
or even assumptions on the data. MAT is a widely used method for transformations of elongated objects 
and requires less storage and time while preserving the topologic properties of the object. MAT also 
allows for a transformation from a 2D image to a ID representation of it. The so obtained features are 
then fed as inputs to a two-layer feedforward NN trained by back-propagation, with almost perfect results 
in classifying chromosomes. An optimization on an MLP also was done [77]. 
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Clinical decision systems are information systems that support and assist health care professionals in 
clinical decision making tasks like diagnosis, therapy planning and monitoring. Their purpose is not to 
replace or substitute for human abilities or skills. The value of clinical decision systems depends on their 
role and contribution to the patient care process. Their role may be to act as a reference book of relevant 
medical knowledge or similar patient cases, or as a watchdog to screen routines or detect deviations, or 
they may remind the user of extreme possibilities or forecast outcomes of different alternatives. Addi- 
tionally, these systems may produce information from data by interpretation, work as tutoring or guidance 
systems, and represent a more experienced consultant for a less experienced user. 

183.2 Clinical Decision Making 

The objective of health care is to understand, prevent and treat human diseases. In medical practice a 
clinician is involved in a complex cognitive task where he applies different kinds of knowledge on several 
cognitive and epistemic levels. A clinician develops, through education and experience, an understanding 
of fundamental anatomic, physiologic and pathologic processes, an understanding of how to apply 
diagnostic methods and procedures, and an understanding of the effects of drugs and treatments on 
patients. In a decision making situation a clinician has to understand the clinical state of the patient, the 
relevant processes going on in the patient, and he must have an understanding of the purpose of his 
intervention and of the possible actions he can initiate. 

The central goal of medical research is to improve clinical practice. This can be achieved in many 
different ways. The focus may be on improving the theoretical understanding of the physiologic and 
pathologic processes on a high level of abstraction, or on concentrating in the study of diagnostic methods, 
or on evaluating new therapies, or on developing preventive measures [Pedersen et al. 1993]. In some 
cases medical research is close to basic natural science, in other situations it is close to applied engineering 
research. 
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A medical decision-maker acquires and manages several different forms of knowledge. Basically two 
types of knowledge are involved in decision-making [van Bemmel and Musen 1997, Pedersen et al. 1993]: 
scientific or formal knowledge and experiential knowledge. Scientific knowledge deals with understanding 
the scientific principles and relationships between pathophysiological conditions and disease symptoms. 
This type of knowledge is found in scientific literature and articles. Scientific knowledge helps to explain 
and justify empirical phenomena and it tells to what extent simplifications and approximations make 
sense in real life situations. Experiential knowledge originates from well-documented patient cases and 
validated guidelines. Experiential knowledge helps the clinician to recognize a disease based on his 
experience. Experiential knowledge in decision making needs to be justified and validated within the 
framework of scientific knowledge, and when this is done, the most efficient computational models can 
be achieved. Scientific and experiential knowledge can be characterized as deep and shallow, or surface 
knowledge, respectively. The terms tacit and explicit knowledge provide another viewpoint on knowledge. 
Tacit knowledge in a sense is a descriptor for the skills of a clinician. Often during the course of his 
professional life, he may have operationalized his experiential and scientific knowledge to a degree that 
he can no longer explain "what he knows" explicitly. 

Medicine is a complex area for clinical decision systems. The complexity arises from many sources. 
Medicine is not a formal science; it has an empirical basis. We have shallow or limited understanding of 
many of the disease processes and also of the medical decision making. The challenge of a physician is 
to be up-to-date on current "best practices" in medicine. Unfortunately a large part of medical knowledge 
is based on experience rather than on hard facts. "Evidence based medicine" as a large-scale initiative 
aims to improve that ratio [Cochrane]. 

In medical decision making, scientific and experiential knowledge are interwoven. A normal situation 
is that a clinician knows enough and has enough reliable data available to make a decision in that situation. 
Medical expertise, or medical competence, is situational and it is used in a data-intensive environment. 
The kind of knowledge which is relevant for decision making is huge, even in restricted subspecialities of 
medicine [Miller 1988]. Diagnosis is often said to be the main task of the physician and a lot of attempts 
have been made to model the diagnostic process. However, diagnosis is not the only decision making task 
of a medical professional. He is also involved in decisions that concern selection of treatments, monitoring 
and follow up, interpretation of signs and symptoms, prediction of consequences and effects, selection of 
drugs and procedures, etc. Each clinician applies his own pathway to decision making depending on the 
case at hand, taking into account the patient and external factors like the urgency of the situation, difficulty 
of the case, role that the physician assumes and possible choices and actions that can be taken. 

In clinical decision making, medical knowledge is used to interpret patient data. This raises the issue 
of what patient data is available. The problem with patient data is that it is highly context sensitive. A 
major part of patient data collected during clinical episodes is unstructured. Referrals and discharge 
letters are mainly unstructured summaries accompanied by structured elements like laboratory findings. 
Each clinician has his own way of embedding the context into these; and other clinicians need to know 
this before they can truly interpret each other's notes. Several ways have been developed to summarize 
this information. These include classifiers like ICD-10, SNOMED, Read codes and DRGs. Much more 
ambitious are the two major projects which were launched on both sides of the Atlantic some years ago 
to develop "translators" for communication between clinicians, the Unified Medical Language System 
(UMLS) and Generalized Architecture for Languages, Encyclopedias and Nomenclatures in Medicine 
(GALEN). 

183.3 Clinical Decision Systems History 

The first approaches to providing computer support for medical decision making were based on Bayesian 
statistics. Programs for diagnostic problems showed that impressive diagnostic accuracy could be achieved 
if the computer programs were supported by reliable data [Shortliffe, 1988]. However, the demand for 
independence between variables in these approaches was found to be too restrictive in medicine and a 
lot of problems were related to the accessibility of reliable data. It was also noticed that to be able to 

© 2000 by CRC Press LLC 



tackle the information problem, simultaneous perspectives were needed from other disciplines such as, 
decision theory, linguistics, mathematical modeling, and cognitive psychology [Gremy, 1988]. 

Research with artificial intelligence in medicine started in the 1960s based on the understanding that 
when computer systems are used to emulate medical decision making steps, they need relevant medical 
knowledge and this knowledge has to be represented in a context-sensitive way. Much of this knowledge 
must be extracted from the experts in the field; the statistical data approach is not enough. Consequently, 
symbolic reasoning methods are needed. The early work resulted in knowledge-based consultations 
systems like MYCIN [Shortliffe, 1976] and INTERNIST [Miller et al., 1982] among others. 

The focus of this early work was in the development of methodologies and tools to support modeling 
of a domain, knowledge acquisition including machine learning approaches, modeling of reasoning 
strategies, and knowledge representations. Interests in reasoning strategies were focused on causal, tem- 
poral, heuristic, strategic, and anatomic reasoning and reasoning from multiple sources of knowledge. 
Issues of validation of knowledge and evaluation of prototype systems were also raised. The term expert 
system was introduced and much of the research was focused on how to represent expert knowledge, 
how to acquire it and how to use it in problem solving. Certainty factors and other heuristic scoring 
measures were developed to manage probabilities and weights of evidence. Also issues related to users 
and their environments were tackled, like human-computer interaction and interfaces to databases and 
database management systems, integration of natural language processing with knowledge-based systems, 
and integration of medical decision analysis with artificial intelligence techniques [Buchanan and Short- 
liffe, 1984, Miller, 1988, Shortliffe et al, 1990]. 

In the late 1980s the rapid evolution both in Al hardware and software caused a big change. The expert 
system shells (i.e., tools to develop expert systems where the inference engine is separated from the 
knowledge base) were introduced and widely used. Cheap and effective microcomputers as well as 
dedicated professional workstations for knowledge engineering offered integrated and interactive envi- 
ronments for expert systems development. Progress in computer graphics, multimedia, and imaging 
provided further support for this. 

The focus shifted to attempts to find general solutions to problems through modeling approaches. The 
final result of the development of a knowledge-based system was seen to be a model of expertise, which in 
turn is implemented in the knowledge base. These models of expertise incorporated assumptions about 
knowledge representations, knowledge base contents, problem-solving methods and application tasks 
[Musen, 1989]. Knowledge level modeling was widely recognized to be the major way to generalize and 
structure domain and task knowledge. The knowledge level model describes the problem-solving process 
in terms of general abstract conceptualizations without reference to the implementations [Newell, 1982]. 

Moving from the knowledge level to the knowledge-use level, however, posed a problem [Newell, 1982, 
Steels 1990]. Several frameworks were suggested for this: Heuristic classification with the focus on the 
inference structures [Clancey, 1985], distinctions between deep and shallow knowledge at the knowledge 
level [Chandrasekaran and Mittal, 1982], a problem-solving method approach focusing on how the 
problem might be solved at the knowledge-use level [Bennet, 1985] and finally the generic task approach 
where the analysis of expertise is performed in terms of the tasks and ordering of the tasks [Chandraseka- 
ran, 1986]. 

From the process point of view, medical reasoning may be modeled by fundamental medical tasks, 
e.g., diagnosis, therapy planning and monitoring. An ATbased clinical decision system requires formu- 
lation of an inference model (i.e., the reasoning process) which exploits an appropriate ontology (i.e., 
the medical process). An architecture then provides a conceptual description of these components at the 
knowledge and computational level [Stefanelli, 1993]. 

In spite of that progress, Al based systems did not gain widespread use. What are the reasons for this? 
Most Al systems were developed within a narrow framework for a few dedicated users and for well- 
defined tasks and restricted domains. Also, development projects were mostly very technology- or meth- 
odology-oriented. Knowledge acquisition was also a major barrier. It has been realized that expertise 
cannot be elicited by interviews and cannot be completely expressed as formal representations as experts 
cannot explicate their tacit knowledge. 
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Today, decision support systems are no longer knowledge-based systems, but rather, knowledge-based 
approaches have been integrated with traditional information technologies resulting in decision support 
systems. Also the connectionist approaches of classical AI, e.g., artificial neural networks, have gained 
more use and there exist many well-functioning neural network applications in restricted, data-rich 
problem areas. The Internet is a potential platform for the future of ATbased approaches because the 
flexibility for representations and programs which was typical for AI approaches can be found on the 
Internet. The Internet also allows new possibilities for distribution and access to clinical decision support. 

183.4 Clinical Application Areas and Types of Systems 

The first application areas were diagnosis and treatment planning. The scope broadened soon to systems 
addressing tasks of monitoring and follow-up of patients, diseases, and treatments, especially in the 
domains of chronic diseases and intensive care. Applications have tackled medical, managerial, admin- 
istrative, and financial decisions in health care. Most clinical decision systems were planned for highly 
specialized hospital environments, and very rarely were problems of community care, primary care, or 
occupational health examined. Most successful clinical decision systems exist in the following situations 
(van Bemmel and Musen 1997): 

• When systems offer support for data validation like supporting systems, which are interfaced with 
laboratory information systems. These kinds of systems help to increase data reliability, to admin- 
ister data processing and data flow activities, to transmit data, and to manage quality control. 

• When systems support data reduction, i.e., they reduce, transform, and present data in such a way 
that it is easier for the health care professional to interpret data and screen the massive data flow. 
Typical areas for this type of system are intensive care and operating rooms. 

• When systems support data acquisition like in medical imaging where data is processed to give 
the medical professional the best possible view. 

Most stand-alone systems work as consulting systems in such a way that they require direct interaction 
from the user to start the dialogue. Interfaced systems are connected with information systems like 
laboratory systems or electronic patient records and they retrieve data from that system for their input. 
Integrated systems are directly connected or embedded into medical devices like diagnostic ECG recorders. 

183.5 Requirements and Critical Issues for Clinical 
Decision Systems Development 

Clinical decision systems can be successful if they are planned to support human professionals in tasks 
where support is needed and where support can be offered with information technology. For instance, 
information enhancement is needed in many decision making situations as a precursor for the next 
decision step, data reduction is necessary to interpret or identify significant findings, and data validation 
is needed to have reliable data. 

Representational issues, both semantic and syntactic, have to be solved in such a way that no misun- 
derstandings or misinterpretations are met. We have to be conscious also of the system's brittleness, 
because the model is an abstraction and approximation of the reality, and transferring a model from one 
environment to another may result in an invalid model and implementation. 

In developing models and specifying the system, users need to be involved. On the other hand, they 
need tools which enable them to analyze, structure, and visualize domain knowledge without strong 
support by technology personnel, i.e., tools that support modeling and reasoning strategies and system 
architectures for different tasks, domains, and environments [Brender et al., 1993]. 

Clinical decision systems must be acceptable to the users, serve real users' needs, be cost-effective and 
their effects and impacts must be known. In the early expert systems, explanation facilities were important 
for acceptability by users. These explanation facilities served both educational purposes and through 
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them the system's output was made clear for the user. Critiquing was more acceptable to the users than 
the consulting mode [Miller 1988], because the critiquing system does not give advice to the user, but 
takes the user's proposal and discusses its pros and cons in relation to other possibilities. 

Medical textbooks, journals, and other reference materials are the stores of experiential and scientific 
knowledge. Accessing the right source at the right time is the question. Care guidelines and protocols 
are an approach to compiling medical knowledge into an operational form leading to protocol-based 
care. The acts needed in handling the problem of a patient can be seen as an instantiation of a template 
containing that "care package". Clinical protocols are used routinely in clinical research to determine the 
efficacy and effectiveness of new drugs and medical procedures. They are also commonly used in cancer 
therapy and in other medical specialities. 

However, while medical knowledge is universal, clinical practice is local. Therefore guidelines and 
protocols need to be locally adapted. Furthermore, as each patient is an individual, protocols cannot be 
applied in every detail. Instead the user must have the freedom to apply a template according to the needs 
of that particular case. Consequently what started as protocols and decision trees in the knowledge-based 
system era of the 1980s has gradually changed so that today they are called clinical guidelines and clinical 
pathways [Prestige]. Consequently, decision support is gradually becoming embedded into clinical infor- 
mation systems compared to the stand alone decision systems of the past. 

183.6 Evaluation of Clinical Decision Systems 

Evaluation and validation are necessary tasks during development and before accepting the systems into 
use [Clarke et al., 1994]. They are means to control the development and guarantee the quality of the 
results; thus it supports development of good clinical practice. The users need guidance and support to 
perform evaluations and the developer or the vendor of the system must provide the user with the 
necessary material needed for evaluation [Nykanen, 1990]. The evaluations of clinical decision systems 
performed and reported in the literature show that a global, generally accepted methodology for evalu- 
ation does not exist. The reported results are subjective and evaluations have been performed without 
generally accepted objectives and standards. 

In a study [Wyatt, 1987] 14 medical decision support systems were evaluated according to a three- 
stage procedure where in stage one, the main consideration was the system's clinical role, stage two 
considered if the system had been tested with a proper sample with predefined goals for accuracy and 
utility, and stage three considered if the performance of the doctor was tested when he was aided or not 
aided by the system. The result was that only three systems had their clinical role defined, seven systems 
had paper tests, and only one system had been subjected to a field test. Medical experts were used as the 
"Gold Standard," in most cases only one expert had been used. From these 14 systems, only two have 
ever been used in practice. Another study [Lundsgaarde, 1987] concluded that only about 10% of the 
medical decision support systems had reached laboratory testing. A further study of 64 medical decision 
support systems [Pothoff et al., 1988] concluded that most systems were still in their developmental stage 
and only eight systems were developed to the stage to enter routine clinical use. 

Most of the decision support systems are planned to work as supporting devices for the clinicians and 
thus they have to be integrated into the user's environment. Evaluation of their fitness for that has been 
neglected in reported evaluations with some exceptions. General practitioners used the SPHINX system 
[Botti et al., 1989] for six months. Though this system was used by the practitioners via the French 
MINITEL-network, the legal issues were not considered in the studies. It should, however, be of particular 
importance for a system operating on a network [Clarke et al., 1994]. 

In a study by [Nykanen et al. 1998], some health telematics projects were studied from the validation 
methodology perspective. Results show that importance on evaluation and validation has been under- 
stood only very recently. Too little resources are usually reserved for evaluation and usable evaluation 
methodologies still do not exist as well as experience and skills on how to apply them. Validation and 
evaluation are not yet fully integrated with the development process so that evaluation results could be 
used as feedback for the systems' development. 
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In summary, organizational and human aspects in a health care environment require more attention. 
In the user environment, evaluation should at least consider if the system helps the user to improve the 
outcome of his work in the organizational setting and if it is usable and cost-effective [Friedman and 
Wyatt 1997, Brender 1997]. 

It is difficult to foresee all the legal implications of decision support systems on clinical practice. Health 
professionals may even think that using computer applications is less harmful than not using them 
[Hafner et al. 1989]. Scientists have tried to stimulate a discussion of legal responsibilities of clinicians 
that have not been giving state-of-the-art treatment because they are not able to find relevant information. 
This is becoming one of the leading arguments in favor of decision support systems in health care in the 
future [Forstrom and Rigby 1998]. A general interpretation of the liability principle has also been that 
decisions suggested or supported by a computer system are ultimately the responsibility of the doctor 
who puts them into effect. 

A project in the Telematics Applications Programme of the European Union (VATAM, Validation of 
Telematic Applications in Medicine) develops guidelines for validation. These guidelines will include a 
repository of existing methodologies, a glossary of terms, and a library of tools and checklists [Talmon 
et al. 1998]. The purpose of the guidelines is to report on experiences, to inform on possible approaches, 
and to assist in selecting an experimental study design, methodology, methods, and metrics for evaluation. 
The guidelines are targeted for the various stakeholders, the different types of health telematic systems 
and they cover the system life cycle. Information and descriptions of various evaluation approaches and 
methodologies can also be found in [Friedman and Wyatt 1997, Brender 1997]. 

183.7 Summary 

First, clinical decision systems applied statistical methods and these systems were developed for restricted 
data reduction problems in data rich domains. Most problems at this phase were related to acquisition 
of required valid data. Next, a generation of clinical support systems were developed using artificial 
intelligence approaches and methods. These systems aimed at emulating human intelligent behavior. 
However, modeling and representation of human intelligence and medical reasoning proved to be more 
difficult than expected and the resulting systems were restricted in performance and brittle for their 
knowledge and domain. The biggest problems with artificial intelligence-based systems were related to 
knowledge acquisition because medical expertise could not be elicited through interviews as experts 
cannot explicate their tacit knowledge. Current clinical decision systems have been integrated with 
traditional information technologies. 

Integration of decision support systems with information technologies and with the developing health 
care information infrastructure offers real challenges for clinical decision systems in the future. Systems 
can be integrated, interfaced or embedded with other information technology systems and products and 
data acquisition can be performed via networks and support with the best modality offered via networks, 
where and when needed. The future users of decision support systems include not only health care 
professionals and physicians, but also patients and citizens at home and in need of support. 
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Biomedical engineering is a fertile domain for the application of computer-based techniques that have 
the potential to enhance patient care. These techniques come in many forms and can be classified 
according to the level of processing required. Low-level processing includes techniques such as signal 
interpretation, where, for example, a threshold value has to be reached before an appropriate action is 
taken. This type of system uses a shallow data model to transform data into a more useful form. However, 
in this chapter, the systems of interest are computer programs which provide a further transformation, 
that of information into knowledge, employing symbolic logic rather than numerical calculations alone 
to provide appropriate explanation and justification of concepts used and conclusions reached. These 
so-called expert systems have various synonyms, including intelligent systems, knowledge-based systems, 
and decision-support systems. 

Expert systems have an interesting historical development. The 1970s saw much of the work concen- 
trated on the problem of knowledge representation. As the introduction of the ubiquitous personal 
computer aided dissemination of the technology on smaller hardware platforms, the work in the 1980s 
looked much more deeply into the issues involved in knowledge acquisition. By the 1990s, even with 
commercial exploitation, the number of expert systems in everyday use is disappointingly small. Research 
into their evaluation may be one way in which their use will eventually be accepted. 

There are no methods and tools that cover every phase of expert system development (Fig. 184.1), so 
to aid the reader this chapter is subdivided into sections for knowledge acquisition and knowledge 
representation, respectively; a final catch-all section is used for those methods and tools which did not 
fit into either section. 

184.1 Expert System Process Model 

Before embarking on an investigation of the tools and methods available for expert system design and 
implementation, it is important to identify the distinct phases in their development (see Fig. 184.1). The 
problem is one in which knowledge from a human expert (or a set of experts) must be transferred to a 
computer program for dissemination. Domain-specific knowledge is elicited from the human expert by 
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FIGURE 184.1 Knowledge engineering design. 

knowledge-acquisition methods. The output of this phase of expert system design is a "paper-based" 
system. To be of any use, this system must be represented in a form that can be implemented as a computer 
program. A number of methods and tools are available to perform such tasks; these are detailed below. 

A different type of knowledge is required for the control strategy, i.e., decisions on when to use or 
"fire," specific rules in the knowledge base. In practice, these procedures are equated with different types 
of search strategies, such as depth-first, breadth-first, or various implementations of optimized search 
strategies. 

What is not captured in Fig. 184.1 are issues dealing with evaluation of the system of human-computer 
interactions. These are interesting topics worthy of investigation but are outside the scope of this chapter. 

184.2 Knowledge Acquisition 

The classic method of knowledge acquisition is the interview. Both experts and knowledge engineers are 
very familiar with the method because it is a part of everyday routine. However, it is not very efficient 
in capturing and formalizing the knowledge required for an expert system. To try and preserve the 
interview situation, a more formal method termed a structured interview is employed. In this method, 
the expert is given a list of variables used to describe the domain (e.g., clinical findings) and a second 
list that contains all possible outcomes of the system (e.g., disease states). The expert is asked to link the 
two lists, from which a set of production rules can be generated (see "Rule-Based Systems" below for a 
description of a production rule). It is crucial that all rules are formalized; although a rule of thumb 
suggests that 90% of the rules are captured in 10% of the development time, it is important that the 
remaining 10% of the knowledge base is captured. To aid this process, the second stage of the structured 
interview involves the expert looking at the paper-based rule set generated thus far. The knowledge 
engineer goes through each rule to test for conditions that affect their appropriateness. From the use of 
this procedure new rules may be generated. 

Advantages of this method are that the experts enjoy the process, which is not too demanding of their 
time (experts tend to be busy), and it can be used for any type of domain. Disadvantages of the technique, 
as of any interview, are that it is sometimes difficult to get experts to obey instructions; experts are prone 
to give anecdotes, and although these may be useful, efficiency of the method deteriorates. The knowledge 
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engineer also requires a certain amount of prior knowledge about the domain to be able to remain astute 
and alert to changes in theme during the interview. 

Protocol analysis is a third method of knowledge acquisition. The knowledge engineer records routine 
activity by "shadowing" the expert during daily activity. This is a time-consuming method that generally 
elicits knowledge by the appropriate use of "How?" and "When?" type queries. 

Graphic-based methods of knowledge acquisition also have been tried, such as the laddered grid 
technique, which investigates domain knowledge as a hierarchy. This is also a favored method for 
automatic elicitation techniques, such as inferring rules from training examples using the ID 3 algorithm 
(see "Rule-Induction"). 

184.3 Knowledge Representation 

Knowledge representation has two main formats: rule-based systems, which take advantage of production 
rule technology, and semantic networks, which provide a graphic representation of the knowledge base. 
Below are case studies in both areas, and although dated, it can be argued that they still convey the best 
exemplar material of their genre. The DENDRAL system was certainly one of the first functioning expert 
systems, which provided a landmark for further research. MYCIN is one of the most cited expert systems 
in the literature and one can infer the most well known. CASNET, with its semantic net representation, 
provides a link with Chapter 181. Further development of this type of model provided the basis for causal 
probablistic networks and other statistical network approaches to problem solving. 

Rule-Based Systems 

DENDRAL interpreted mass spectrographic data and worked at a high intellectual level. Although the 
scope of the system was very narrow, it assumed expert knowledge in what was a difficult task domain 
[1]. At the time of its first implementation, DENDRAL was a precursor to expert systems and was 
responsible for the foundation of many of the techniques used in knowledge engineering. These include 
both knowledge elicitation and knowledge representation. The former was identified to be a "bottleneck" 
in the design and implementation of expert systems, because it needed prior knowledge of the expertise 
to be embodied. This expertise could only be elicited in discrete stages due to the time demands made 
on the human experts involved. 

MYCIN was developed by Shortliffe and others at Stanford University [2]. The domain of interest was 
aiding the identification and recognizing the significance of organisms causing microbial infection and 
then recommending an optimal treatment protocol. 

MYCIN is an example of an expert system that uses "production rules" to represent the causal 
relationships between individual items of factual knowledge held on the knowledge base. The form of a 
production rule is as follows: 

IF premise assertions are true 

THEN consequent assertions are true 

with confidence weight X. 

Notice that each production rule has associated with it a measure of certainty. This aids in the reasoning 
strategy of the system, where a value of -1 represents complete disbelief and a value of +1 represents 
complete belief in the consequent assertion(s). The assertions can be boolean combinations of clauses, 
each of which consists of a predicate statement triple: (attribute, object, value). For example, (Gram 
stain, E. coli, gramneg), indicates that the Gram stain of the E. coli organism is gram-negative. 

The uniformity of representation for both domain-specific inferences and reasoning goals makes it 
possible for MYCIN to use a very general and simple control strategy, i.e., a goal-directed backward- 
chaining of the production rules. This approach can be described in the following way: the first rule to 
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be evaluated is the one that contains the highest-level goal, which for MYCIN is "To determine if there 
are any organisms, or classes of organisms, that require therapy." To deduce the need for therapy requires 
knowledge of the infections, which is usually unknown in the first instance. Therefore, the system tries 
to satisfy subgoals that originate in the premise of the top goal that will allow the infections to be inferred. 
Rule chaining is the name given to the process in which the production rule hierarchy is linked together; 
the premise portion of each subgoal rule fires a new set of subgoals. This process is repeated until the 
most fundamental level of the hierarchy is reached, where the rules become assertions that can only be 
confirmed or denied by directly questioning the user for the appropriate information. 

After MYCIN determines the significant infections, found by assessing the overall certainty factor that 
combines the individual degrees of confidence associated with each production rule, the organisms that 
account for the infections are found deterministically. Then, if appropriate, the system proceeds to 
recommend an antimicrobial therapy. To reach its decision, the MYCIN therapy selector uses a description 
of the infection(s) present, the causal organisms together with a ranking of drugs by their sensitivity, 
and a set of drug-preference categories. The algorithm used within the therapy selector also calculates 
the drug dose required and contains knowledge to modify the value if, for example, the patient is in renal 
failure. An advantage of this therapy selection system is that it can accept and critique a treatment protocol 
proposed by the user. The therapy selector has an appropriate facility to generate explanation and 
justification for its choice of action. "Why?" queries are dealt with by displaying the rule it is trying to 
imply, and if "Why?" is asked again, the query is answered by ascending the goal-tree hierarchy. "How?" 
queries are interpreted as the chain of rules that are fired to get to that particular conclusion, and if 
"How?" is repeated, the query is answer by descending the goal-tree hierarchy. 

An advantage of using a production rule system is that each rule is a small "packet" of knowledge, 
each one being independent of all the others. This has two consequences: (1) changing or adding 
knowledge to the MYCIN knowledge base is relatively easy, and (2) addition of new rules is facilitated 
by having a modular data structure. 

The MYCIN system can deal with both inexact and incomplete information. Inexact information is 
inherent in this domain because many test results are qualitative in nature, and relatively few statements 
can be made with absolute certainty. Incomplete information may arise from the time constraints involved 
in the identification of an organism, i.e., the time taken for an identifying laboratory test to be completed 
and the result of it made known to the clinician, which in some cases can be the order of 24 to 48 hours. 

A disadvantage of the system is that disease states cannot always be adequately described by a rule. 
Also, it may not always be possible to map a series of desired actions into a set of production rules. 
Another disadvantage is that although new knowledge can be added by inserting a new rule, this may 
not interact with the existing rules in the anticipated way. 

The MYCIN system is encoded in LISP, and a normal consultation takes on average 20 minutes, which 
includes time allowed for the optional use of the explanation facility. MYCIN has been evaluated as 
having a diagnostic success rate of 72%. This relatively poor performance (for an expert system) combined 
with clinician suspicion and resistance to new technology contributed to the fact that the system was not 
used in a clinical setting. However, as a research tool it was the subject of extensive interest in the AI 
fraternity, being responsible for a number of successful descendants. One of these, the PUFF system, 
developed to interpret pulmonary function test results, is one of the few medical expert systems in daily 
clinical use [3]. 

Network-Based Systems 

CASNET is an expert system for consultation in the diagnosis and therapy of glaucoma [4]. A feature of 
this system is that the medical knowledge used in the patient-specific reasoning process is encompassed 
in a causal-associational network model of the specific disease process. Such a network, termed a semantic 
net, allows the structure of the medical knowledge covered by the system to be more coherent. A semantic 
network comprises nodes connected by links, where nodes correspond to either the condition or action 
part of the rules and the links are the inferences between the two. The model of disease is separate from 
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the decision-making strategy, which allows the updating of both data structures to be facilitated more 
easily (compare with "Causal Probablistic Modeling" in Chapter 181). 

The CASNET model has a descriptive component that comprises four sets of elements as follows: 
observations (these consist of symptoms and laboratory test results, etc. and form the direct evidence 
that a disease is present), pathophysiologic states (these describe internal abnormal conditions or mech- 
anisms that can directly cause the observed findings; the causal relations between states are of the form 



n. !—>n 

• j 

where w,and « are pathophysiologic states, and a t - is the causal frequency with which state n it when present 
in a patient, leads to state n disease categories (each category consists of a pattern of states and obser- 
vations and is therefore conceptually at the highest level of abstraction), and treatment plans and therapies 
(composed of sets of related treatments or treatment plans). 

Other components of the CASNET model are the decision rules, which state the degree of confidence 
with which an inference of a pathophysiologic state can be made from an observed pattern of findings. 
In rule form, this translates to 
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where t i is a finding (or observation) or boolean combination of findings, n, is pathophysiologic state as 
before, and Q tj is a number in the range -1 to +1 representing the confidence with which t i is believed 
to be associated with n-. The value of Q is a certainty factor that indicates the belief that the patient is 
in state n . A threshold function is then used to determine whether or not the certainty factor confirms, 
denies, or leaves undetermined a particular state. Rules for associating disease categories and pathophys- 
iologic states to treatment protocols are in the form of a classification table that consists of ordered triples, 
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where n i is the pathophysiologic state, D, is the disease process arising from it, and T t are the preferred 
treatment regimes for disease D t , 

The pathogenesis and mechanisms of a disease process are described in terms of cause-and-defect 
relationships between pathophysiologic states. In this way, complete or partial disease processes can be 
characterized by pathways through the network. When a set of a cause-and-effect relationships is specified, 
the resulting network can be described as an acyclic graph of states. This state network is defined by a 
four-tuple (S, F, N, X), where S is the set of starting states (i.e., those states which have no antecedent 
causes), F is the set of final states, N are the number of states visited between S and F, and X are the 
causal relationships between the states visited (in the form of a list). 

An important test for any expert is how it copes with contradictory information. This can occur in 
the CASNET system, e.g., when a particular state is confirmed even though all its potential causes in the 
network are denied. When such a situation occurs, the CASNET system advises the user that this has 
happened. There are two explanations for the origin of this contradictory information: the model of the 
disease process may be incomplete (i.e., one or more nodes together with their respective causal rela- 
tionships are missing from the knowledge base), or it may be that the threshold function associated with 
one of the existing causal links has been set at an inappropriate level. The modular structure of the 
knowledge base enables easy access to the underlying reason for the contradiction, so updating of the 
knowledge base can be facilitated. 

An advantage of the CASNET system is that it can present alternative expertise derived from different 
consultants. It is this, coupled with the fact that glaucoma is a well-defined clinical problem, that results 
in an accuracy of diagnosis, which contributed to its relative success. The diagnostic accuracy of CASNET 
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has been evaluated at greater than 75% for particularly difficult cases and greater than 90% for cases that 
constitute a broad clinical spectrum. Unfortunately, the clinical utility of the system was not assessed as 
high, which resulted in CASNET never being used in a routine clinical environment. However, the system 
was a very useful tool for research into AI in medicine and spawned further interest in other approaches 
to causal modeling. 

184.4 Other Methods and Tools 

Blackboard Architectures 

The HEARSAY-II speech understanding system is a research program of Carnegie-Mellon University [5]. 
A feature of this system is its general problem-solving framework, termed the blackboard architecture, 
which has become the basic control model for real-time intelligent signal understanding systems [6]. 
The blackboard architecture contains four elements: entries (which is the name given to intermediate by- 
products of the problem-solving strategy), knowledge sources (these are diverse and independent subpro- 
grams that are capable of solving a specific problem efficiently; they can be though of as a condition- 
action duplex, where the condition premise describes the situations to which each knowledge source can 
contribute, and the action component specifies the interaction of that particular knowledge source in 
the overall problem-solving strategy; it is the interaction of that particular knowledge source in the overall 
problem-solving strategy; it is the activated knowledge sources that produce the entries), the blackboard 
itself (which can be considered as a structured database; it serves two roles; first, it represents intermediate 
states of problem-solving activity, and second, since all communication between the individual knowledge 
sources is carried out via the blackboard, it can accommodate hypotheses from one particular knowledge 
source that may activate a number of others), and an intelligent control mechanism (which has sufficient 
knowledge embodied within it to decide if and when any particular knowledge source should be activated, 
thus generating entries that are recorded on the blackboard). 

The blackboard control structure supports a hierarchical arrangement of knowledge sources. For 
instance, in the HEARSAY-II system, the top level of the hierarchy is the spoken sentence, followed by 
phrase, word sequence, word, syllable segment, and phoneme, respectively. The control postulate is that 
of "establish-and-refine," i.e., each activated knowledge source in the hierarchy tries to confirm or reject 
itself. If it is confirmed, then it refines itself by calling on the next immediate layer in the hierarchy to 
enter the same cycle. This procedure continues until (if successful) the bottom layer of the hierarchy is 
reached. Once the phonemes have established themselves, the spoken sentence has been modeled com- 
pletely. The spoken sentence is then recognized as syntactically valid, which is one of the definitions used 
for speed "understanding." Further signal-processing techniques can be applied to obtain a more com- 
prehensive understanding, such as translation into another language or paraphrasing an argument. 

The blackboard architecture as been used successfully in providing interpretation of the results of 
blood-gas analysis [7]. 

Rule Induction 

An expert system shell based on the rule-induction method of knowledge acquisition and representation 
is available and is based on the ID3 induction algorithm [8]. The original goal of the ID3 algorithm was 
to seek out the valuable information from large volumes of data by means of a classification rule. ID3 
uses known results described in terms of a fixed set of attributes and produces a decision tree that is able 
to classify all possible results correctly. To optimize the decision function, the concept of "information 
entropy" is used. 

These features are faithfully reproduced in the 1st Class Expert System shell (registered trademark of 
1st Class Expert Systems, Inc.). A hypertext facility is added to provide explanation or justification as 
well as further information for any attribute or result obtained. Using optimization of the decision 
function ensures that the minimum set of questions are asked of the user to obtain the required result. 
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As a consequence, this method is often more efficient and thorough than the human expert. The system 
based on the 1st Class Expert System shell is robust to missing and conflicting data. When classification 
is not possible, explanations can be provided by the hypertext facility. This system has been used in 
biomedical engineering applications such as machine malfunction, where alarms generated by the expert 
system are quicker and more informative than those set by the manufacturer. 

184.5 Summary 

The process of knowledge engineering design has been discussed in some detail. The phases of knowledge 
acquisition and knowledge representation have been presented, and the methods used in each phase have 
been discussed. The advantages of the structured interview method of knowledge acquisition over other 
available methods can be deduced. Knowledge representation is presented as a choice between rule-based 
systems and semantic networks. 

MYCIN is an example of a procedural expert system, to be compared with CASNET, which is an 
example of a declarative expert system. The distinction between the two types hinges on the relationship 
between the encoded knowledge representation and the control algorithm. In systems where the knowl- 
edge representation and control algorithm are combined, then a procedural definition applies. This is 
exemplified by the production rule methodology, where causality is implicit in the IF.. .THEN rules. The 
ability to provide explanation of action, one of the prerequisites of an expert system, cannot naturally 
be accommodated. However, to overcome this problem, context sensitive explanations can be used. These 
are a set of fixed files, each one specific to a particular production rule, which can be fired whenever the 
production rule they represent is used. A disadvantage of this approach is that the explanation can appear 
"stale" to an experienced user of the system. In systems where the knowledge representation and control 
algorithm are separate, then a declarative definition applies. The knowledge representation is explicitly 
defined, allowing a set of algorithms to act on it. This can include the test and hypothesis diagnostic 
cycle, an explanation-generating algorithm, and an algorithm that produces text which constitutes com- 
puter-based advice. 
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The scope of designing knowledge-based systems in medicine has evolved during these last few years, 
from pure decision making automata to open environments able to acquire, formalize, validate, as well 
as handle and disseminate elements of the human expertise. The application fields have expanded in a 
correlated manner, from the sole field of symbolic processing to data analysis, information management, 
simulation, and planning. 

Therefore, the scope of knowledge acquisition and representation has been shifted from the mere 
transcription of expert knowledge to the modeling of pathophysiological phenomena of increasing 
complexity [Stefanelli, 1993]. To reflect such evolution, the notion of models is made central in this 
chapter and the nature and specificity of medical expertise is addressed in the first section. Knowledge 
acquisition is considered afterwards, by successively considering human-centered and data-driven 
approaches to knowledge extraction. Knowledge representation issues are finally discussed in the frame- 
work of their evolution from basic representation schemes to second generation systems. 

185.2 Medical Expertise: Domain and Control Knowledge 

Expert knowledge is usually described as involving domain knowledge and control knowledge [Schreiber, 
1993]. Domain knowledge is meant to represent the various elements, e.g., facts, features, relations, and 
concepts, that are used to represent a case, while control knowledge is meant to represent the expert 
know-how, that is, the way to reason about a case. 

Domain Knowledge 

Domain knowledge may be seen as a kind of formal knowledge tying names, properties and relations to 
the whole empirium of data, facts, observations, hypotheses or results that are extracted from a case. 



) 2000 by CRC Press LLC 



Vertigo 



Sudden loss of 
vestibular function 



Paroxismal 
vertigo 
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Benign paroxismal 
vertigo of childhood 



Benign paroxismal Paroxismal vertigo due 
positional vertigo to vascular disorder 



FIGURE 185.1 Vertigo taxonomy (partial view). (Source: Schmid [1987].) 

Mammary gland 
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basement epithelial myoepithelial 

membrane cells cells 



lumen 



FIGURE 185.2 A "part of" hierarchy example (partial view). (Source: Garbay and Pesty [1988].) 

Domain ontologies are used to structure and organize domain knowledge according to well-identified 
knowledge types or relations. 

Taxonomic ontologies have been widely used in medicine because of their capacity to classify and 
organize diseases into hierarchical structures. As may be seen in Fig. 185.1, paroxismal vertigo is classified 
as a particular kind of vertigo, and may be further characterized as benign paroxismal vertigo of child- 
hood, benign paroxismal positional vertigo, or paroxismal vertigo due to vascular disorder [Schmid, 
1987]. 

"Part of" hierarchies make it possible to describe the compositional characteristics of concepts, and 
are particularly useful for describing anatomical properties. As may be seen in Fig. 185.2, a mammary 
gland lobule may be described as composed of stroma cells and acini, which in turn are composed of a 
basement membrane, epithelial and myoepithelial cells, and a lumen [Garbay and Pesty, 1988]. 

Knowledge may also be organized according to its role in the reasoning process. According to Patel 
et al. [1989], this process is decomposed into several levels, from the empirium (the whole universe of 
medical problem solving data) to the global complex level (complete understanding of the patient case). 
Several intermediate levels are distinguished in terms of observations, findings, facets (complex gathering 
of findings understood as the consequence of a pathophysiologic process) and diagnostic, as summarized 
in Fig. 185.3. 

Current research emphasizes the notion of causal ontologies 
(see Fig. 185.11), where a deep functional understanding of the 
process leading to a disease is sought [Jensen et al., 1987; Ste- 
fanelli, 1993]. 



Control Knowledge 

Reasoning may be seen as the process of handling facts, observations 
and hypotheses, to reach a final understanding of a situation. Var- 
ious mental abstraction levels are traversed in this process, which 




Empirium 
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Findings 

Facets 

Diagnostic 

Global complex 



FIGURE 185.3 Organizing knowledge 
according to its role in the reasoning 
process. (Source. Patel et al. [1989].) 
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FIGURE 185.4 An epistemological model of diagnostic reasoning. (Source: Lanzola and Stefanelli [1992].) 

may involve different inference schemes, depending on the type of data to be processed (e.g., uncertain, 
geometric or temporal), and on the type of inferencing to be applied (e.g., deductive, inductive, or non- 
monotonic). 

An epistemological model of diagnostic reasoning is proposed by Lanzola and Stefanelli [1992] in 
terms of abstraction, abduction, deduction, and eliminative induction (Fig. 185.4). The abstraction 
corresponds to identification of the clinical features that diagnosis should be able to explain. The abduc- 
tion corresponds to formulation of a set of hypotheses from which expected manifestations are deduced. 
Eliminative induction is finally used to reduce uncertainty of inferred hypotheses by matching their 
consequences against observed data. 

The global approach to a case nowadays is often formalized in terms of tasks [Bylander and Chan- 
drasekaran, 1987; Steels, 1990; Schreiber et al., 1993], which makes it possible to structure the overall 
reasoning process in terms of goals and sub-goals, and thus to organize knowledge and know-how 
according to a problem-solving rationale [Newell, 1982]. 

185.3 Knowledge Acquisition 

Knowledge acquisition has been traditionally considered as the process of extracting and transcribing the 
expert knowledge into a computer-usable form. This complex process has been shown to critically depend 
on the knowledge engineering tool at hand, since most representation formalisms convey restrictive and 
rigid semantics, which do not refer to a clear ontology [Schreiber, 1993]. Current trends are to consider 
knowledge acquisition as an integral part of the designing process, centered on the notion of a model. 

Knowledge discovery, data mining, and machine learning techniques have recently attracted consid- 
erable attention, due to the growing amount of data available, and to the growing necessity to base the 
reasoning on evidence taken from physical measurements. Data-driven approaches to knowledge extrac- 
tion have been developed as a consequence, complementing the more traditional human-centered 
approaches, by enabling systems to create new knowledge, update existing knowledge, and improve their 
performance without intervention and reprogramming. 

Human-Centered Approaches 
Interviewing Techniques 

Knowledge is usually acquired through direct interaction with a human expert, but may also be obtained 
from many other sources, such as textbooks, reports, databases, or case studies [Buchanan, 1983]. A 
systematic and structured approach is necessary to ensure the completion and consistency of the inter- 
viewing process, which is usually driven by a knowledge engineer. Other techniques are often used to 
complement the mere interviewing approach, such as observing the expert in daily working situations, 
writing scenarios and reports, or filling out questionnaires. 

Knowledge Acquisition Tools 

Several computerized tools have been designed in the past to assist the knowledge acquisition/transcrip- 
tion process. The reference work done in TEIRESIAS [Davis and Lenat, 1982] and ETS [Boose, 1985] 
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appears to bring complementary solutions to this difficult problem. TEIRESIAS is rather meant to support 
the testing and refinement of a "MYCIN like" system. It proceeds by searching for missing or inconsistent 
information and may suggest the generation of new inference rules. ETS, on the contrary, is meant to 
support direct elicitation of expert knowledge. It is based on bipolar constructs structuring the perception 
of the world (e.g., the grade of a tumor allows the organization of diseases according to their prognostic 
significance). The acquired knowledge is then represented by means of so-called repertory grids tying 
concepts and their properties. Distances between concepts may then be computed, as well as the dis- 
criminatory power of a given characteristic. Production rules are then generated and submitted to the 
expert to be modified or tested. 

Knowledge Acquisition Environments 

Knowledge acquisition environments have been designed to integrate several acquisition techniques. The 
expert is usually assisted by a cognitive engineer to use the software environment. AQUINAS [Boose and 
Bradshaw, 1987] and MORE [Kahn et al, 1985] are particularly representative of this approach. 
AQUINAS is based on the cooperation between several interviewing methods and has been designed as 
an extension of ETS. MORE may interview the expert according to eight different strategies and integrate 
several facilities such as a rule generator, an inference engine to test the rules, and online assistance to 
the user. 

Knowledge Acquisition Methodologies 

The term "knowledge acquisition methodologies" refers to approaches aiming at a close integration 
between knowledge acquisition and system design, in which knowledge acquisition is considered as a 
modeling process rather than as an extraction/transcription problem. The modeling approach permits 
the investigator to work at the "right" abstraction level, often called the knowledge level [Newell, 1982], 
and to refer a level at which knowledge may be expressed without referring to any implementation 
constraint [Schreiber et al., 1993]. It is formulated in opposition to symbol level approaches, which 
support implementation level modeling. 

KADS (KBS Analysis and Design Structured Methodology) is among the best known methodologies 
[Breuker and Wielinga, 1987], and has been conceived to guide the whole process of expert system design. 
It permits the progressive evolution from real world to implementation by the building of four successive 
models that organize the domain ontology into four knowledge levels: domain level (description of 
concepts and the relationship between them), inference level (reasoning primitives), task level (reasoning 
steps, goals and ways to reach them, structured task) and strategy level (control and goal scheduling). 

Data-Driven Approaches 
Knowledge Discovery and Data Mining 

Knowledge discovery may be defined as "the non-trivial extraction of implicit, previously unknown, and 
potentially useful information from given data" [W.J. Frawley et al., 1991]. Such a process makes use of 
various techniques, involving segmentation, filtering, statistical as well as model-based analysis, or super- 
vised induction. 

It is most often considered as a pre-processing step in the design of decision-support systems, by 
providing computerized support to the identification of salient attributes, to the computation of repre- 
sentative values for these attributes, or to the structurization of the database into semantically meaningful 
clusters, thus improving performance. Factor analysis may, for example, be used in this respect. Such an 
approach has been used in a system devoted to the development of in vitro fertilization treatment plans, 
where a case-based reasoning module has been extended by a knowledge mining component [Juristica 
etal., 1998]. 

Learning 

Three main approaches to machine learning are usually distinguished, i.e., inductive learning, analogical 
reasoning and explanation-based learning. Inductive learning techniques have been widely used in medicine, 
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in the framework of decision-trees or version space/conceptual clustering methods, and more recently 
by the introduction of the evolutionary and connectionist paradigms. 

A typical example of inductive learning is ID3, a decision-tree learning technique developed by Quinlan 
[1986]. ID3 constructs a classification tree in a recursive, top-down, divide and conquer fashion, using 
the concept of entropy as a quantitative measure of the "information" conveyed by each attribute. 
Conversely to ID3 which proceeds by division, the approach of version space/conceptual clustering 
proceeds in a bottom-up way, by grouping sets of domain concepts to form more general ones. 

Genetic-based learning has been conceived as an extension of the evolutionary programming para- 
digm: it has been designed as a generate-and-test process that maintains a pool of competing problem- 
solving procedures (most often decision rules), tests them to a limited extent, creates new ones from 
those that performed well in the past, and prunes poor ones from the pool. Connectionist approaches 
are revealed to be particularly useful in domains where it is impossible to structure and represent the 
domain knowledge in logic form. 

Machine learning is now introduced in an increasing number of systems and application domains in 
medicine, especially in domains where the knowledge is difficult to acquire by hand. Current approaches 
make use of real data and domain models to drive the knowledge acquisition process. 

Hau and Coiera [1997] for example, describe a learning system that takes real-time patient data 
obtained during cardiac bypass surgery to create models of normal and abnormal cardiac physiology. 
Conversely, qualitative models of heart disease were used early in Kardio to learn diagnostic rules [Bratko 
et al., 1989] . Examples of "faulty" and "non-faulty" behaviors are generated by the model and transmitted 
to a learning program, which can then learn diagnostic rules for each particular fault. 

Machine learning may also be used to refine existing knowledge bases in cases where there is a 
disagreement between the expert-provided domain knowledge and actual cases. An application to acute 
abdominal pain has been experimented with for effective performance improvement [Dzeroski et al., 
1997], using the theory revision system NEITHER. As a consequence of this process, rule Rl, for example, 
has been specialized into rules R6 and R7: 

rule Rl: OPERATE <- (RIGIDITY YES) 

rules R6: OPERATE <- (RIGIDITY YES) (TYPE STEADY) (SITE-RUQ NO) 

rules R7: OPERATE ^- (RIGIDITY YES) (TENDER-RLQ SURFACE) 

185.4 Knowledge Representation 

The representation of medical knowledge is a very active research field, characterized by a wide range of tools, 
models, and languages, which, together with the presence of increasing computer capabilities, allows one to 
specify and emulate systems of a growing complexity. Knowledge representation schemes indeed have known 
an important evolution, from basic schemes supporting a rather heuristic approach, to advanced schemes 
involving a deeper consideration of the various dependencies between knowledge elements. Second genera- 
tion systems have finally been introduced to cope with complex integration and modeling constraints. 

Basic Representation Schemes 

Basic schemes for the representation of domain and control knowledge are considered in this section. 
The representation of concepts, their structure and properties is considered first. How to model the expert 
way of reasoning is examined afterwards. It is examined successively as the ability to apply well-experi- 
mented inference schemes (usually represented by rules) and as the capacity to compare and retrieve 
learned situations (so-called case-based reasoning). One main feature of human reasoning, e.g., its 
capacity to handle uncertain and imprecise information, is finally considered. 

Frame Representations and Object-Oriented Languages 

Frame-based representations have been introduced as a way to organize prototypical knowledge about 
the world, and are currently among the most widely used tools for the construction of Al systems. A 
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Frame Cytomegalovirus 

• sort-of intestitial pneumonia 

• epidemiological data 

• other kind of pneumonia yes 

• CD4 less than 200/mmc 

• clinical picture 

• symptoms fever, non-productive cough, dyspnea 

• signs cyanosis tachypnea 

• laboratory tests 

• P02 lowering 

• LDH growing 

FIGURE 185.5 Using frames to represent diagnostic knowledge. (Source. Fiore et al. [1993]). 

frame defines the prototypical description of concepts sharing similar properties and behavior. It is defined 
as a set of slots describing the concept attributes and their values. A variety of other slots may be 
introduced, including procedures, default and current values, or type/value constraints. The frames are 
organized in hierarchies, tying classes and instances. The basic inference mechanism is instantiation, in 
which the attribute values of the new instance are obtained either by inheritance, by computation (using 
the procedural attachments), or by default. 

In a knowledge-based system to diagnose HIV-pneumonias [Fiore et al., 1993], frames are used to 
describe the physiopathological states of the disease for each particular pneumonia, in terms of several 
slots describing the epidemiological data, clinical picture, laboratory tests, and diagnosis (Fig. 185.5). 

The object-oriented paradigm is nowadays increasingly considered as a way to cope simultaneously 
with knowledge modeling and software engineering issues, as in HELIOS [Coignard et al., 1992]. HELIOS 
has been conceived as a software engineering environment to facilitate the development of medical 
applications. It provides a core set of medical classes, from which to build a new application. Several 
services are also provided, which include the acquisition and management of medical information, the 
processing of natural language queries, as well as data driven decision support. 

Semantic Networks and Conceptual Graphs 

Semantic Networks [Quillian, 1968] have been proposed as a flexible formalism to represent the semantics 
of concepts (the nodes in the network) and their relations (the arcs) in the framework of a graph-like 
structure. The reasoning proceeds by unification between an unknown fact and a known concept or 
subgraph in the network. 

The approach has rapidly been extended to represent causal relationhips as well as hierarchical tax- 
onomies and to include frames. It has been applied successfully to numerous applications in medicine, 
as in CASNET [Weiss et al., 1978], a consultation system for glaucoma, INTERNIST [Pople, 1977], a 
diagnostic consultant in internal medicine, and PIP [Pauker et al., 1976] a system devoted to renal disease. 
The reasoning strategy in these systems is of the event- driven type: initial data triggers a number of 
hypotheses, which are then to be confirmed [Kulikowski, 1984]. The subgraph of confirmed or undeter- 
mined hypotheses constitute a patient-specific model. Various weights and scores are usually introduced 
to render the reasoning strategy more flexible. 

Conceptual graphs [Sowa, 1984] have been designed as an extension of the previous formalism in 
which an explicit representation of the links between concepts is sought. A conceptual graph is a directed 
graph comprising two kinds of nodes: concept nodes and conceptual relationship nodes. The represen- 
tation is grounded on first-order logic and is currently being applied in a number of projects for the 
modeling and representation of medical terminology, as in GALEN [Alpay et al., 1993]. One major goal 
of GALEN (Generalized Architecture for Languages Encyclopedias and Nomenclatures in Medicine) is 
to form a general Terminology Server able to conciliate a number of classifications and nomenclatures. 
It is based on a Semantical Encyclopedia of Terminology connected to a Multilingual Information Module. 
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FIGURE 185.6 Conceptual graph representing the concept "simple oblique diaphyseal fracture of the right femur." 
{Source. Bernauer and Goldberg [1993]). 

Conceptual graphs are used in this project to generate natural language expressions from information 
coded according to the GALEN Master Notation. 

Conceptual graphs may also be used as a formalism that supports both expressiveness and classification 
purposes, i.e., as a knowledge representation formalism able to integrate both terminological and domain 
knowledge, as shown in Fig. 185.6 [Bernauer and Goldberg, 1993]. 

In the example in Fig. 185.6, the composite concept "simple oblique diaphyseal fracture of the right 
femur" is described as a nested set of 2-tuples each tying a relation (:loc, for example) and a composite 
concept (diaphyis, for example). Taxonomic as well as compositional relations are used to support 
classification. The equivalent nested notation for the graph in Fig. 185.6 is the following: 

(fracture 

(:loc (diaphysis (:part (femur (:side (right)))))) 

(:morph (oblique)) 

(xompl (simple))) 

Rule-Based Reasoning 

The use of if-then rules (also called production rules or condition-action pairs) is a straightforward and 
popular way to represent the expert know-how. A rule example is given in Fig. 185.7, in which a deduction 
is made about the presence of bacteroids based on a composite observation. 

This type of knowledge is often said to be shallow or heuristic, since it is a largely empirical and 
nonformalized representation mechanism. Given a set of facts, the reasoning process is then modeled as 
the successive activation of rules, which in turn produce new facts to be considered, until no applicable 
rule can be found. 

The rules are handled by means of an inference engine, which may work under two inferencing modes, 
the data driven or goal driven mode. In the data-driven mode, the reasoning is modeled as a deductive 
process proceeding from a given premise to some conclusion. In the goal-driven mode, on the contrary, 



IF the infection is primary-bacteremia 

and the site of the culture is one of the sterile sites 

and the suspected portal of entry of the organism is the gastro-intestinal tract 

THEN there is a suggestive evidence (0.7) that the identity of the organism is 

bacteroids 

FIGURE 185.7 A production rule example from MYCIN. (Source. Shortlife [1976].) 
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IF the site of the culture is one of the nonsterile sites 

and there are rules which mention in their premise a previous organism 

which may be the same as the current organism 

THEN it is definite ( 1 .0) that each of them is not going to be useful 

FIGURE 185.8 A meta-rule example from MYCIN. (Source. Shortlife [1976].) 

the reasoning is modeled as an inductive process proceeding backwards from some hypothesized con- 
clusion to the conditions to be verified. Both inference schemes are usually combined to implement 
complex hypotheses and test strategies. In addition, metarules are often used to structure the reasoning 
process by constraining the application of the rules. A metarule example is given in Fig. 185.8, which 
permits one to predict that some rules will be poorly informative, given the non-sterility of the culture site. 
MYCIN, a system for diagnosing and treating infectious blood diseases, is one of the earliest and most 
widely known expert system in medicine [Shortliffe, 1976]. 

Uncertain and Imprecise Reasoning 

How to model the uncertainty and imprecision that is central to the human reasoning style has raised a 
number of research activities during the last decades [van Ginneken and Smeulders, 1988]. The Bayesian 
model has been for a long time the primary numerical approach to uncertain reasoning. It is based on 
assigning a probability distribution to each of the variables representing the problem at hand. These 
probabilities express the uncertainty and likelihood of occurrence of symptoms as well as diseases. Given 
P(Di), the a priori probability of disease Di, given P(SIDz'), the probability that symptom S may occur in 
the context of disease Di, Bayes' rule (1) allows one to compute P(Di\S), the probability that disease Di 
occurs when S is observed, according to the following formula: 



, > p(sbi)*p(Di) 

PDiS = X ' V '- (185.1) 

v ' ; EjPlSDj *P[Dj] 

The most important assumption underlying the use of Bayes' rule is that all disease hypotheses must 
be mutually exclusive and exhaustive. It means that only one disease is assumed to be present. If this 
condition is met, Bayes has a completely predictable performance and guarantees a conclusion with 
minimum overall error [van Ginneken and Smeulders, 1988]. 

Some intuitive methods for the combination and propagation of these numbers have been suggested 
and used, as in MYCIN (see the rules in Figs. 185.7 and 185.8, where coefficients are used to weight their 
conclusions). The empirical character of the approach has often been criticized. However, it is consistent 
with the heuristic style that is inherent to rule-based modeling. 

The theory of possibility [Zadeh, 1978] has been used in medicine to represent the vagueness of clinical 
predicates. These vague predicates are represented by means of a fuzzy set and a possibility distribution 
(or membership function). Fuzzy reasoning may then be implemented by means of rules handling fuzzy 
facts. 

Examples of membership function modeling criteria for the classification of rheumatoid arthritis are 
given in Fig. 185.9 [Leitich et al., 1996]. Seven criteria are defined in total, from which at least four must 
be present in conjunction to establish a diagnosis. 

Case-Based Reasoning 

CBR (Case-based reasoning) emerged in recent years as a powerful problem-solving technique applicable 
to a wide range of tasks in Artificial Intelligence. It is based on the hypothesis that new situations are 
analyzed by reference to past experience, i.e., by considering their similarity to well-experimented situ- 
ations, a notion that is made central in this kind of reasoning [Campbell and Wolstencroft, 1990]. New 
problem-solution pairs may then be learned as a consequence: this inherent combination of problem 
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Criterion Si 



si: morning stiffness 
lasting at least one hour 



S2: arthritis of 3 
or more joint areas 



Membership function |i s j(x) 




ift < 15 
if 15 < t < 60 
ift >60 
v if t is unknown 

where t is the duration of morning stiffness in minutes 




if n < 1 
0.5 ifn = 2 

1 if n > 3 

v if n is unknown 

where n is the number of involved joint areas 



FIGURE 185.9 Membership functions examples in the field of rheumatoid arthritis. (Source: [Leitich et al., 1996].) 
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FIGURE 185.10 The Case-based reasoning cycle, a general view. (Source: [Aamodt and Plaza, 1994].) 



solving with learning through problem solving experience gives a particular strength to CBR over most 
other methods [Aamodt and Plaza, 1994]. 

Case-based reasoning finally appears as a cycle comprising four stages, as illustrated in Fig. 185.10: 
retrieve the most similar case, reuse the retrieved information to solve the problem by analogical rea- 
soning, revise the proposed solution, and finally retain the part of the experience likely to be useful for 
future problem solving. 

The advantage of the approach is two-fold: first of all, it is based on the description of experiences, 
rather than on the modeling of generic or abstract knowledge, second, such a system may evolve in an 
incremental way, as cases grow [Goos and Schewe, 1993]. However, the choice of the similarity measure 
turns out to be critical with respect to the reasoning accuracy. The structurization of the case base also 
has to be carefully considered in order to avoid combinatorial search [Gierl et al., 1993]. 

Current trends [Reateguiet al., 1997] are to combine CBR with complementary forms of reasoning 
such as rule-based, model-based, or neural networks (NN) to solve diagnosis problems. The NN here 
are used during the consultation process to make hypotheses of possible diagnosis solutions and to guide 
the search of similar cases, restricting the research to cases with similar interpretations (e.g., diagnoses). 
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Advanced Representation Schemes 

Recent approaches to knowledge representation consider as central the capacity to better situate the 
reasoning with respect to contextual, causal, and temporal modeling of knowledge elements. A number 
of knowledge domains are concerned, involving therapy and monitoring, pathophysiology, or planning. 

Context-Based Reasoning 

It is now well recognized that advanced medical decision systems should situate their reasoning with 
respect to the context in which a problem is considered, including the patient clinical context, but also 
the chronology of events together with their causal relationships. 

In connection with the development of the knowledge-based system TheMPO (Therapy Management 
in Pediatric Oncology), which supports therapy and monitoring in pediatric oncology, a graph-grammar 
approach has been used to design and implement a graph-oriented patient model which allows the 
representation of non-trivial (causal, temporal etc.) clinical contexts [Miiller et al., 1996]. A graphical 
interface has been designed to facilitate the specification and retrieval of contexts by the physician. 

Causal Reasoning 

The last decade has known the development of solid foundations for diagnostic systems employing logic, 
probability theory, and set theory, thus providing rich and robust modeling tools to the developer, and 
allowing him to cope with the increasing necessity to base the diagnosis on models of the disease process 
and models of the structure and function of the human body [Lucas, 1997]. 

There are at least two alternative ways to represent causal knowledge [Ramoni et al., 1992]. The first 
method is based on a network representation, the second implies the representation of both the structure 
and behavior of a physiological system. 

Network representations, called CPN (causal probabilistic netwoks) or BBN (Bayesian belief networks), 
are in the form of a direct acyclic graph in which nodes represent stochastic variables and arcs represent 
conditional dependencies among the variables. These representations make explicit the dependency and 
independency assumptions among variables. They are based on sound semantics and easily extend to 
compact representations called influence diagrams [Ramoni et al., 1995]. 

MUNIN [Jensen et al., 1987], an expert system in electromyography, is representative of this approach. 
The domain knowledge is embedded in a causal probabilistic network and is further divided into three 
levels, representing diseases, pathophysiological features, and findings. These levels are linked by causal 
relations: diseases cause certain affections in muscles. These affections, in turn, cause expectations for 
certain findings. A partial view of the causal network is given in Fig. 185.11. Diseases are grouped into 
a single node (on the left), and characterized by their grades. The disease node is connected to several 
pathophysiologic nodes describing the pathophysiologic status of a muscle in terms of discrete properties 
(the muscle structure is rather normal, there is no myotonia, and no muscle loss). Some expectations 
regarding, for example, the muscle force and atrophy or the presence of spontaneous myotonic distrophy 
are finally obtained. Probabilities are used to characterize each state in the network (the horizontal bars 
in the figure) and propagated through the causal links. 

Despite its effective representation power, the approach still bears strong limitations, due to the large 
amount of information required: current propagation algorithms require that all the conditional prob- 
abilities defining a conditional dependency be known, together with all the a priori probabilities attached 
to the root variables. MUNIN, for example, consists of about 1100 discrete random variables linked 
together, thus rendering its managing hardly tractable by current computer technology. Several methods 
have been investigated by the author of MUNIN to reduce the memory space and calculation time 
[Suojanen et al., 1997] . A methodology for automatically inducing Bayesian network has been introduced 
by Larranaga et al. [1997], based on genetic algorithms, and applied to the prediction of survival in 
malignant skin melanoma. 

An alternative way is to represent both the structure and behavior of a physiological system [Ironi 
et al., 1993]. The system structure is simply given in terms of variables and relations. The system behavior 
is then modeled by a set of mathematical equations tying related variables, which represent potential 
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FIGURE 185.11 The domain knowledge in MUNIN (partial view). (Source: [Jensen etal., 1987].) 

perturbations of the system state. Compartmental system theory is often used in this respect because it 
provides a robust modeling framework based on differential calculus. It implies, however, that a precise 
quantitative modeling of the pathophysiologic phenomena is possible. Qualitative models, on the con- 
trary, allow one to cope with the fuzziness and incompleteness of pathophysiological knowledge. Among 
these, QSIM is the most widely applied formalism in medicine [Stefanelli, 1993] . This formalism provides 
a descriptive language to represent the structure of a physiologic system and a simulation algorithm to 
infer its qualitative behavior. The language consists in qualitative constraints that abstract the relationships 
in a differential equation. This approach has been applied to a variety of fields in medicine [Ironi et al., 
1990] 

It should be noted that representing pathophysiologic knowledge requires a special emphasis on the 
notion of time [Stefanelli, 1993], a problem early addressed by Fagan [1979]. In addition, specific 
knowledge acquisition issues are raised that have been only recently addressed [Ironi et al., 1993]. 

Temporal Knowledge 

Time constitutes an integral and important aspect of medical concepts, and its explicit modeling is 
increasingly considered as central to the design of advanced medical systems. However, such modeling 
still remains challenging, due to the necessity to consider compound objects (e.g., disorders, treatments, 
or patient states) exhibiting different temporal existences and complex interactions, through mechanisms 
that are not completely understood. The definition of adapted temporal ontologies conveying a clear 
semantic is currently an important subject of debate among the community [Keravnou, 1995; Shahar 
andMusen, 1998]. 

The ability to reason about time is known to depend on the ability to provide short, informative, and 
context-sensitive summaries of time-oriented clinical data, in the form of temporal abstractions able to 
summarize clinical features over a period of time (Fig. 185.12). Such an approach has been considered 
in RESUME [Shahar and Musen, 1998], a system for forming high-level concepts from raw time-oriented 
clinical data which has been applied in three domains: monitoring of children's growth, care of diabetes 
patients, and protocol-based care. 

In RESUME, a temporal abstraction is characterized as a quadruplet: {(parameter, value, context), 
interval}, meaning that a given parameter has exhibited a certain value in a certain context for a certain 
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FIGURE 185.12 A temporal-abstraction task example: an evolution of the platelet and granulocyte counts is observed 
for a patient suffering from chronic graft-versus-host disease (CGVHD). The observation starts with a bone marrow 
transplantation (BMT), and is performed in a certain open context interval (shaded arrow). The solid bars represent 
abstracted intervals figuring the evolution of the myelotoxicity grade M(n). (Source. Shahar and Musen, 1998].) 
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FIGURE 185.13 The temporal abstraction task. Grey arrow: decomposed into, thin arrow: solved by, bold arrow: 
used-by (Source: Shahar and Musen, 1998].) 

period of time. The temporal abstraction task is modeled as a complex task decomposed into specific 
subtasks and problem-solving mechanisms, themselves depending on various knowledge types 
(Fig. 185.13). Such decomposition defines a general problem solving method (as defined in the section 
titled "Task Oriented Modeling") for interpreting data in time-oriented domains. 

Current efforts concern the integration of temporal and causal modeling, thus allowing the modeling 
of multiple interacting mechanisms operating over a variety of time periods. Such models have been 
applied to the domain of heart disease [Long, 1996] and to the interpretation of time-series of blood 
glucose measurements coming from home monitoring [Riva and Bellazzi, 1996]. However, severe prob- 
lems come from the computational burden induced by the learning and the managing of these models. 

Protocols & Guidelines 

Various attempts have been made to design knowledge representation systems which are capable of 
capturing and handling clinical procedures, from the early work on ONCOCYN [Tu et al., 1989]. The 
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FIGURE 185.14 Model of a prescription activity in primary care (partial view). {Source. Gordon et al. [1993].) 

objectives are to guide the user in the accurate and correct application of protocols but also to disseminate 
shared consensus guidelines among the community. 

A generic modeling of clinical protocol has been developed in DILEMMA [Gordon et al., 1993], a 
system delivering guideline-based decision support for shared care in oncology, cardiology, and primary 
care. A protocol is a formal description of a way to achieve an objective or goal, given a class of problems. 
It is used as a template from which to derive the specific activity to be performed for a given clinical 
context. Figure 185.14 depicts a data model of a prescription activity, in which precise information is 
given about the patient, the agent responsible for the drug prescription and administration, together 
with administration guidelines, and a description of potential side effects and expected consequences. 

The development of standard languages for guideline modeling is witnessing a growing interest, from 
mere procedural approaches like the Arden syntax [Hripsack et al., 1994] to high-level representation 
languages based on task ontologies like in the Proforma approach [Fox et al., 1997]. 

Current developments concern the integration of time-dependent patient information like in EON 
(Musen et al., 1995), as well as the development of web-based environments, the aim being to produce 
clinical guidelines which can be widely shared between humans from different institutions, thus dealing 
with patient and organization preferences [Quaglini et al., 1997]. GLIF (GuideLine Interchange Format), 
for example, has been introduced to allow the sharing of representations among different sites [Patel 
etal., 1997]. 

Second Generation Systems 

Second generation systems organize their knowledge into modules and multilevel structures to solve 
problems of increasing complexity. The objectives are (i) to better structure the knowledge, facilitate its 
management and allow its reuse, and (ii) to allow the design of hybrid systems combining heterogeneous 
knowledge and reasoning schemes. 

Task-Oriented Modeling 

Task-oriented modeling has been proposed to model general problem solving methods, based on two 
classes of reusable components [Bylander and Chandrasekaran, 1987]: (i) domain-independent problem- 
solving methods, e.g., standard methods to perform prototypical tasks (see Fig. 185.13) and 
(ii) domain ontologies, e.g., description of the main concepts and their relations (see Fig. 185.15). These 
generic components are then instantiated to cope with the specificity of given expertise domains. This 
notion may be used as a model to drive knowledge acquisition and system design, since knowledge and 
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FIGURE 185.15 A domain ontology example, from FreeCall, a system for emergency-call-handling support. (Source: 
Post et al. [1996].) Note that the knowledge is organized according to the classification proposed by Patel et al. [1989] 
(see Fig. 185.3). 

know-how specific to a task are specified at the right abstraction level [Steels, 1990]. Several knowledge 
acquisition tools specific to different task types may indeed be used to guide the expert through a 
structured elicitation protocol, for that specific problem type. Second generation expert systems also 
allow the reuse of knowledge, since some knowledge is generic and can be applied across several domains. 

Hybrid Systems 

There is a growing need for the integration of multimodal knowledge representation and reasoning 
schemes. Such integration is needed for at least two complementary reasons, i.e., the need to analyze 
data from different modalities and abstraction levels and the need to dynamically adapt the knowledge 
representation formalism and inference scheme to the situation at hand and to the current problem 
solving state. 

The first viewpoint is adopted in AMNESIA [O'Hare et al., 1992], a system devoted to the diagnosis 
of memory related disorder illnesses. In such systems, the knowledge and reasoning skills are distributed 
among several specialists that cooperate towards the solving of the problem, each specialist being devoted 
to handling a precise knowledge domain or task. 

The second viewpoint has been emphasized in NEOANEMIA, a system able to recognize disorders 
causing anemia [Ramoni et al., 1992]. An abductive inference scheme is first of all applied, to generate 
an initial set of hypotheses. A deductive inference scheme is then applied to compute the expected 
manifestations, which is based on a separate and explicit representation of taxonomic and causal ontology. 
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Inductive reasoning is finally used to prune the set of hypotheses, by matching their expected manifes- 
tations against the available data. 

Structural, functional, and causal knowledge have been combined at various abstraction levels in a 
system dedicated to EMG diagnosis [Cruz et al., 1997]. Various anatomical structures like the nervous 
fiber and the nervous bundle are described in terms of their state and function, as illustrated in Fig. 185.16. 
Abstraction and refinement operations are associated with these attributes and defined in a probabilistic 
way to cope with the non-deterministic character of this knowledge. 

Changes caused by normal or abnormal functioning of structures, as well as those caused by external 
agents, are modeled by means of processes expressed at the appropriate level of abstraction. A local 
neuropathy for example is a lesion that causes deep pathologic processes of local demyelination and axon 
loss, as modeled below: 



process: local neuropathy 
causes: neuropathy 

(pointj of nerve N , severity) 
constraints: transversal dispersion 

longitudinal dispersion 



effects: segment^ of fiber bundle FB 
normal = N 
demyelination = D 
demyelination without cond. block : 
axonal loss = A 



DB 



To characterize a neuropathy depends on the proportion of damaged fibers, on one hand, and on the 
extension of the damage along the nerve fibers, on the other hand. Transversal and longitudinal dispersion 
functions are moreover used to compute the lesion severity. The diagnosis finally proceeds through 
successive hypothesis refinement steps, progressively exploiting the various knowledge elements at hand. 

185.5 Conclusion 

The acquisition and representation of knowledge has been presented as an actively evolving research field 
characterized by modeling and software engineering issues of increasing complexity. Representing domain 
ontologies has been shown to raise a variety of issues, such as using conceptual graphs to represent the 
semantics of medical terms, using frames to describe domain taxonomies or modeling pathophysiological 
processes through causal networks. Modeling the expert control knowledge has been considered in terms 
of reasoning as well as planning abilities, which in turn leads to specific modeling issues. 

In addition, the scope of designing knowledge-based systems in medicine currently evolves from the 
mere diagnostic task to the broader issue of patient management, which implies a better integration in 
existing hospital information systems. 
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New challenges are now to be faced, due to the rapid development of the internet communication 
facilities, which increase the possibility of communication and cooperation among health care profes- 
sionals. It is clear, however, that the lack of a shared vocabulary and a shared understanding of medical 
terminology is currently impairing such dissemination [Lenat, 1995]. A stable ontological foundation is 
needed to allow the sharing of knowledge between healthcare professionals [Rossi Mori et al., 1997]. 

Defining Terms 

Control knowledge: The know-how by which to handle domain knowledge and to reason about a case. 

Domain knowledge: Formal knowledge about the application domain theory as well as actual knowl- 
edge about the data, facts, observations, hypotheses, or results that are attached to a given case. 

Generic modeling, Generic task: A way to describe a concept, or a task, that is domain dependent in 
the sense that it is related to a medical domain, but that is generic in the sense that it may apply 
to several application areas in medicine. 

Heuristic knowledge: Knowledge that reflects an empirical and non-formalized way to reason about 
a situation or solve a problem. 

Knowledge level: A notion introduced by Newell, to express the rationale behind the exploitation of 
knowledge, in terms that are free from any implementation constraint. 

Model, Modelling approach: A formal representation of the concepts, relationships, and reasoning 
schemes that constitue the domain and control knowledge of the application domain at hand; a 
designing approach that is driven by a formal representation of the domain and control knowledge, 
in which a close integration of the knowledge acquisition process is performed. This approach 
allows one to work at a level that is free from any implementation constraints, the knowledge level. 

Ontology (domain, taxonomic, causal): The domain ontology characterizes the structure of the 
domain knowledge, in terms of concepts and relationships. Taxonomic and causal ontologies are 
two major methods for organizing domain knowledge; taxonomic ontologies emphasize the hier- 
archical relations between concepts, while causal ontologies emphasize their deep causal relations. 

Shallow (vs deep) knowledge: Shallow knowledge is often used in opposition to deep knowledge, a 
knowledge based on a formal theory of the domain of interest (knowledge of a pathophysiological 
process for example). The term shallow refers to shortened reasoning schemes, that are often gained 
by experience. 
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Artificial Intelligence in Medicine is a bimonthly journal published by Elsevier Science Publishers B.V. 
This international journal publishes articles concerning the theory and practice of medical artificial 
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Studies in Health Technology and Informatics is a series published by IOS Press, in which two volumes 
are of particular interest to the reader interested in the new trends in health telematics: 

• Volume 16, edited by C. Gordon and J.P. Christensen, entitled Health Telematics for Clinical 
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• Volume 36, edited by M. Sosa-Iudicissa et al., entitled Internet, Telematics and Health (1997). 
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Medicine Europe, edited by P. Barahona et al. in 1995, and E. Keravnou et al. in 1997, gives an excellent 
outlook of the most recent advances of the field throughout Europe. 
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186.1 Introduction 



Patient monitoring and management in critical care environments such as intensive care units (ICUs) 
and operating rooms (ORs) involves estimating the status of the patient, reacting to events that may be 
life-threatening, and taking actions to bring the patient to a desired state. This complex process includes 
the interaction of physicians and nurses with diverse data (ranging from clinical observations to labora- 
tory results to online data) provided by bedside medical equipment. New monitoring devices provide 
health care professionals with unsurpassed amounts of information to support decision making. Ironi- 
cally, rather than helping these professionals, the amount of information generated and the way the data 
is presented may overload their cognitive skills and lead to erroneous conclusions and inadequate actions. 
New solutions are needed to manage and process the continuous flow of information and provide efficient 
and reliable decision support tools. 

Patient monitoring can be conceptually organized in four layers [Coiera, 1993]: (1) the signal level, 
which acquires and performs low-level processing of raw data; (2) the validation level, which removes 
data artifacts; (3) the signal-to-symbol transform level, which maps detected features to symbols such as 
normal, low, or high; and (4) the inference level, which relies on a computer representation of medical 
knowledge to derive possible diagnoses, explanations of events, predictions about future physiologic 
states, or to control actions. In addition to these four layers, medical decision support systems need data 
interfaces to other clinical information systems as well as carefully designed user interfaces to facilitate 
rapid and accurate situation assessment by care providers. 
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A number of systems have been developed to address problems faced by clinicians in critical care 
environments. These range from low-level signal analysis applications for detecting specific features in 
monitored signals to complete architectures for signal acquisition, processing, interpretation, and decision 
support. As always, specificity and generality are conflicting requirements. Systems developed for specific 
applications are usually successful in their limited domain of expertise. However, the lessons and problem- 
solving strategies learned in one domain are often difficult to generalize. Conversely, generic architectures 
aim at providing support for modeling and developing a wide range of applications. They also strive for 
flexibility, modularity, and ease of expansion. This generality is often at the expense of expertise and 
performance in specific domains [Uckun et al., 1993]. This chapter reviews some of the problems and 
proposed solutions associated with the four layers of intelligent monitoring systems. 

186.2 Intelligent Patient Monitoring and Management 
Signal Acquisition and Low-Level Processing 

Typical signals found in critical care environments include vital signs (ECG, EEG, arterial pressure, 
intracranial pressure, etc.) and information provided by therapeutic devices, infusion pumps, and drain- 
age devices. Modern monitoring devices are also capable of providing derived and computed information 
in addition to raw data. Heavily instrumented patients frequently have up to 20 medical devices moni- 
toring them, producing up to 100 pieces of clinically relevant information. These instruments are often 
stand-alone, and interconnection requires developing dedicated software in-house, usually with substan- 
tial effort. To address this problem, the IEEE standards committee, IEEE 1073, has been working since 
1984 on a standard for medical device data communication in critical care environments called the 
Medical Information Bus (MIB). Although much of the standard has been adopted by ANSI, and despite 
long-term efforts by individual institutions [Gardner et al., 1992], a standard adhered to by major medical 
device manufacturers is still lacking. Today, this absence of interconnectivity remains a major obstacle 
to the development and implementation of intelligent monitoring systems. 

Current bedside monitors typically provide instantaneous values for the monitored variables. To 
complement this information, numerous algorithms have been proposed to detect features in the signals. 
In particular, detection of significant trends has received much attention [Avent and Charlton, 1990, 
Haimowitz and Kohane, 1996]. Methods based on median filters [Makivirta et al., 1991] and fuzzy logic 
[Sittig et al., 1992; Steinman and Adlassnig, 1994] have also been proposed. More recently, techniques 
such as sub-band adaptive filtering, chaos analysis [Karrakchou et al. 1996], and wavelet transform [Unser 
and Aldroubi, 1996] have been investigated. Multivariate and data fusion methods have been used to 
reveal interactions between signals and applied to problems such as ventricular rhythm tracking [Thoraval 
and Carrault, 1997]. Traditional waveform analysis also continues to be applied to signals such as 
intracranial pressure [Czosnyka et al., 1996] and arterial blood pressure [Karamanoglu, 1997]. Interpret- 
ing the output of these algorithms and assessing their impact on patient care is an ongoing task. 

Data Validation 

The risk of noise contamination, inadequate wiring, or instrument failure is significant, especially as the 
amount of sampled data increases. Unreliable information can drastically reduce the practitioner's ability 
to rapidly assess and act on monitor data. In addition, false alarms due to erroneous or incomplete data 
reduce clinicians' confidence in the instrumentation resulting in alarms being disabled or ignored. Tsien 
and Fackler [1997] report that after 298 monitored hours in a pediatric ICU, 86% of the 2,942 alarms 
were found to be false-positive alarms while an additional 6% were deemed irrelevant true alarms. 

Methods relying on data redundancy and correlation (thus assuming some level of interconnectivity 
between bedside instruments) as well as contextual information to eliminate false alarms, validate data, 
and diagnose malfunctions in the monitoring equipment have been proposed. Rule-based systems [Jiang, 
1991; van der Aa, 1990] and neural networks [Orr, 1991] have been developed for detecting faults and 
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malfunctions in the breathing circuit during anesthesia. Other intelligent alarming systems focusing on 
data validation have been implemented in the operating room, and include rule-based systems for 
respiratory-circulatory management [Garfinkel et al., 1988] and for patient state-dependent data collec- 
tion and processing [Arcay-Varela and Hernandez- Sande, 1993]. Rule-based systems have been combined 
with fuzzy logic [Schecke et al., 1992] and neural networks for false alarm reduction [Navabi et al., 1991]. 
Laursen [1994] investigated causal probabilistic networks to reduce false alarms in the ICU. In addition 
to systems specifically designed for data validation and false-alarm rejection, diagnostic and advice-giving 
systems often include a data-validation step in their overall strategy. 

Signal-to-Symbol Transformation and Giving Advice 

The next layer in information processing involves transforming numerical values into symbolic infor- 
mation and using these symbols for diagnosis and giving advice. A representative sample of these systems 
is discussed below. For a more complete compilation the reader is referred to Uckun [1994]. Although 
a strict taxonomy of diagnostic and advice-giving systems is difficult to create, systems with so-called 
shallow and deep knowledge will be distinguished. Shallow knowledge expresses empirically based, 
experiential knowledge [Mora et al., 1993] generally called heuristics. Reasoning in these systems is mostly 
associative, and knowledge is usually represented in terms of an IF (antecedent) THEN (action) form. 
Shallow knowledge systems exhibit a high degree of competence in their limited domain of application, 
but performance usually degrades quickly near the edge of the domain of expertise. At the other end of 
the spectrum, deep knowledge refers to knowledge about the structure, behavior, or function of a system. 
Model-based reasoning involves using this knowledge to diagnose, predict, or explain the behavior of 
the system over time. 

Shallow Knowledge Systems 

Typically, these systems perform a specific task and have similar architectures: (1) a module responsible 
for transforming quantitative data into qualitative or symbolic information, (2) a module capable of 
translating symbolic information into patient state, and (3) a module designed to provide therapeutic 
advice based on the estimated patient state. 

Ventilator management has received much attention in intelligent monitoring research, following the 
pioneering efforts of Fagan et al. [1984] on the VM project. VM was developed in the late 1970s as an 
extension of the rule based MYCIN formalism to interpret online quantitative data in the ICU setting 
and to provide assistance for postoperative patients undergoing mechanical ventilation. VM addressed 
the importance of interpreting data differently depending on patient status. 

COMPAS [Sittig et al., 1989] was designed to assist in management of patients with adult respiratory 
distress syndrome (ARDS). COMPAS is a rule-based system incorporating a blackboard architecture with 
three main types of knowledge sources (KSs): (1) the data-processing KSs compute derived information 
from the primary data; (2) the data-classification KSs transform numerical information into symbolic 
information; and (3) the protocol-instruction KSs suggest therapeutic actions. These knowledge sources 
roughly correspond to the upper three layers of intelligent monitoring systems (see above). The raw data 
level consists of information provided by clinical or blood gas laboratories, x-ray departments, and 
respiratory therapists. Although COMPAS is no longer used the knowledge it contains has served as a 
basis for a system used clinically [Henderson et al., 1991]. 

The AIRS system provides three levels of decision support to match three phases of ventilation 
[Summers et al., 1993]: (1) the startup phase matches the patient state to one of 14 predetermined states 
and suggests initial ventilator settings; (2) the maintaining phase provides context-dependent values for 
set point and trend alarms; and (3) the weaning phase contains knowledge represented as PROLOG 
premise-actions rules to provide advice in the weaning process. 

Vie- Vent [Miksch et al., 1993] is an advice-giving system for the monitoring and management of 
mechanically ventilated neonates. It consists of three modules: (1) the data-acquisition and validation 
module performs simple range checking and measurement correlation; (2) the data-abstraction module 
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translates numerical values into normal, low, and high ranges based on predetermined rules; and (3) the 
rule-based therapy-planning module proposes ventilator settings based on the qualitative values for pH, 
pC0 2 , and P0 2 provided by the data abstraction module. These rules are adapted to reflect the site of 
measurement and the mode of ventilation. Vie-Vent is designed to operate in real-time and it includes 
a data validation component using time-point, time-interval, and trend-based methods [Horn et al., 
1997]. 

PATRICIA [Moret-Bonillo et al., 1993] is a system designed to assist in monitoring and managing 
mechanically ventilated patients in ICUs. It consists of two main components: the deterministic module, 
which transforms incoming information into symbolic ranges, and the heuristic module, which interprets 
these symbolic ranges using rules to infer patient status and prescribe therapeutic advice. It uses so-called 
natural contexts, which include demographic information, patient history, and diagnosis information, 
to provide patient-dependent ranges for the symbolic variables. Based on these symbolic values, the rule- 
based system performs data validation both in the deterministic and the heuristic modules. It is also 
capable of detecting "prealarm" situations by predicting a temporal evolution of monitored parameters. 
PATRICIA is currently working off-line and underwent an extensive retrospective validation [Moret- 
Bonillo etal, 1997] 

Other systems developed for ventilator management include ESTER [Hernandez-Sande etal., 1989], 
KUSIVAR (developed for the management of patients with ARDS) [Rudowski et al, 1989], and WEAN- 
PRO (designed for assisting physicians in weaning postoperative cardiovascular patients) [Tong, 1991]. 
Expert-system methodology also has been applied to the monitoring and management of patients in the 
cardiac care unit (CCU) [Mora et al, 1993, Sukuvaara et al, 1993b]. 

As opposed to the systems discussed before, GANESH [Dojat et al, 1992] has been designed for use 
as a closed-loop controller. Its domain of application is the weaning of patients under pressure-supported 
ventilation (PSV). Based on the respiratory rate, tidal volume, and end-tidal partial pressure of carbon 
dioxide, a rule-based controller determines the patient status and action to be taken and then acts on 
the ventilator. Preliminary clinical evaluations of the system have shown its ability to maintain patients 
in the region of comfort and to progressively reduce the pressure of weanable patients. Recently, the 
theory of Event Calculus has been proposed to capture and use temporal knowledge for this application 
[Chittaro and Dojat, 1997]. 

Blom [1991] also proposes an expert-system approach for closed-loop control of arterial pressure 
during surgery. In this approach, heuristic rules supervise a PID controller and adjust its parameters. 
The result is a robust controller capable of coping with artifacts in the signal, low signal-to-noise ratio, 
and patient sensitivity to nitroprusside, a drug used to achieve controlled hypotension during surgery. 
Huang and Rozear [1998] also describe a fuzzy logic-based system to control administration of multiple 
IV drugs for hemodynamic regulation. It consists of: (1) a fuzzy decision analysis module to assign the 
patient to one of six possible states using information derived from monitor data; (2) a hemodynamic 
management module to determine the dosage changes of each drug based on the current state; and (3) a 
therapeutic assessment module for scheduling drug administration based on drug properties such as 
pharmacological delay. 

Deep Knowledge Systems 

Despite its advantage over associative reasoning for explanation, robustness, and prediction, model-based 
reasoning has been used in few medical systems. Uckun [1992] suggests this may be attributed to the 
relatively shallow level of knowledge regarding most disease processes and the complexity and inherent 
variability of human anatomy, physiology, and pathophysiology. These difficulties still challenge our 
ability to develop accurate quantitative, qualitative, or hybrid models. Despite the difficulties involved, 
several model-based reasoning systems have been proposed for patient monitoring. 

Rutledge and colleagues' VentPlan [1993] recommends ventilator settings for ICU patients. It has three 
main components: a belief network, a mathematical modeling module, and a plan evaluator. The belief 
network computes probability distributions for the physiologic model parameters from quantitative and 
qualitative information. The model predicts the distribution of future values for the dependent variables 
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based on this data and current ventilator settings, and can simulate the evolution of important variables 
under various ventilator settings to evaluate proposed therapy plans. The plan evaluator predicts p a C0 2 
and 2 delivery to rank therapy plans using a multiattribute scoring system and to propose ventilator 
settings. 

The patient model component of SIMON [Uckun et al., 1993; Dawant et al., 1993] was built on an 
extension of the qualitative process theory (QPT) [Forbus, 1984] to diagnose neonates suffering from 
respiratory distress syndrome (RDS), to predict the patient's evolution, and to dynamically adjust alarm 
parameters based on patient state. As opposed to QPT, this approach does not exhaustively predict all 
distinct states the system may reach, since this is of little use when external variables (i.e., ventilator 
settings) are not constant. It determines patient status based on the temporal evolution of clinical events, 
and can modify the model based on discrepancies between observed and predicted values. SIMON 
underwent a limited retrospective evaluation using neonatal ICU data. 

KARDIO [Lavrac et al, 1985] and CARDIOLAB [Siregar et al., 1993] are examples of models developed 
for model-based ECG simulation, prediction, and interpretation. KARDIO generates qualitative descrip- 
tions of the ECG signals corresponding to various arrhythmia combinations, based on the relation 
between the ECG and 30 elementary arrhythmias. Observed ECG can then be compared with the 
generated descriptions, and the associated arrhythmias can be retrieved from the database. Without a 
model permitting its automatic generation, the creation of an exhaustive database associating arrhythmias 
with ECG description would be a daunting, if not impossible, task. 

186.3 Moving Toward Computer Architectures 
for Real-Time Intelligent Monitoring 

Previously highlighted is the fact that patient monitoring is a complex, multifaceted task, and that much 
research has been done to facilitate the monitoring tasks of clinicians. But despite such efforts, few of 
the systems described earlier are in clinical use. None of them have been fielded or tested in other 
institutions. This may be due to the specificity of these systems and to the difficulty of modifying their 
structure for slightly different applications. Rather than focusing on specific tasks, other researchers are 
attempting to develop generic architectures that can support the development and implementation of a 
wider range of applications. These architectures are designed to support diverse reasoning schemes, 
guarantee real-time response, and permit the integration of the entire spectrum of tasks required for 
intelligent patient monitoring. This section describes a few such systems. 

GUARDIAN [Hayes-Roth et al., 1992] is a blackboard system. Based on a general architecture for 
intelligent agents, it has been applied in respiratory and cardiovascular monitoring domains. GUARDIAN 
emerges from a long history of research on reasoning and problem solving, and much emphasis is put 
on the generic problem of coordinating and selecting one of possible perceptual, cognitive, or action 
operations under real-time constraints. It integrates heuristic, structural, or functional knowledge of 
(1) common respiratory problems, (2) respiratory, circulatory, metabolic, and mechanical ventilator 
systems, and (3) generic flow, diffusion, and metabolic systems. A system capable of instantiating treat- 
ment protocols based on (1) current patient context, (2) executing plans and close-loop control actions, 
(3) monitoring the execution of these plans and actions, and (4) modifying plan execution as necessitated 
by patient response has also been designed to work with GUARDIAN. 

The Intelligent Cardiovascular Monitor (ICM) [Factor et al., 1991] is an application of the multitrellis 
software architecture, which supports modeling of real-time tasks in terms of an acyclic hierarchical 
network of decision processes. In ICM, the lowest level corresponds to raw data, and processes at this 
level implement signal processing algorithms to extract features from the signal. Information is abstracted 
along the hierarchy in a way reminiscent of information flow in a blackboard architecture, until reaching 
the highest level corresponding to physiologic processes or events. As opposed to GUARDIAN, the process 
trellis architecture assumes sufficient resources to execute the entire program and meet real-time con- 
straints. This architecture has also been used for the development of DYNASCENE [Cohn et al., 1990], 
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which models hemodynamic abnormalities in terms of "scenes" and temporal relationships between 
scenes. Each scene corresponds to one physiologic process (e.g., increased pericardial pressure) and is 
related to modules for computing specific information from the incoming data. DYNASCENE and ICM 
do not work in conjunction with patient models and hence are not capable of predicting patient behavior. 

SIMON [Dawant et al„ 1993] is a framework designed to support the development of real-time 
intelligent patient monitoring systems. It is organized around three main modules: the patient model, 
the data-acquisition and abstraction module, and a flexible user interface. The patient model is respon- 
sible for (1) estimating the physiologic state of the patient, (2) predicting the temporal evolution of the 
monitored variables, and (3) defining a dynamic monitoring context. The context includes normal and 
abnormal ranges for monitored variables, expected temporal evolutions of these variables, significant 
values and patterns to be detected in the incoming signals (events), and display configuration information 
(i.e., critical information to be displayed). The data abstraction module adjusts alarm limits and thresh- 
olds based on the current context and implements strategies to detect complex events. These may involve 
multiple variables and temporal relationships between events detected on each variable. SIMON has been 
applied off-line to the problem of monitoring neonates in the intensive care unit. 

Following the recommendations of the INFORM project [Hunter et al, 1991], Sukuvaara et al. [1994] 
proposed an object-oriented framework to implement systems based on the blackboard approach to 
problem solving. Data on the blackboard are separated into four main categories: (1) source data (raw 
data acquired from monitors, hospital systems, or users), (2) preprocessed variables, (3) physiologically 
interpreted variables, and (4) patient status. Preprocessing knowledge sources (KSs) span the first two 
levels and extract features from the incoming data. Transformation KSs span the second and third levels 
in the blackboard hierarchy, and map values of preprocessed variables onto (low, high) intervals. Patient 
status assessment KSs get input from physiologically interpreted data and determine the patient state. 
The system also assumes sufficient resources to execute all its knowledge sources and meet real-time 
constraints. 

186.4 Discussion and Conclusions 

Intelligent patient monitoring and management are complex tasks involving all aspects of information 
processing. They require context-dependent acquisition, processing, analysis, and interpretation of large 
amounts of possibly noisy and incomplete data. These systems also need to interface to hospital-wide 
patient data management systems (PDMS) to gather data from the laboratories, the pharmacy, or 
radiology. Finally, these systems should include efficient mechanisms to present relevant and timely 
information to clinical users. 

The systems described here illustrate that it is not easy to choose between specificity and generality. 
Systems that undergo clinical evaluation are typically developed for a very narrow, specific application 
which lends itself to rule-based approaches. However, these systems are often of limited clinical utility 
due to their narrow scope. Research aimed at developing deeper models to infer a wider range of behaviors, 
on the other hand, has failed to produce overwhelming results. Models for predicting patient behavior or 
response to therapeutic actions are clearly desirable, but currently are only available in very limited or 
constrained situations. Original modeling methods and approaches have been proposed, but require 
assumptions that either drastically reduce the scope of the model or make it unrealistic for clinical use. 

Complete architectures aim at identifying the essence of patient monitoring tasks and at providing 
support for the development of a class of systems. These systems should permit experimentation with a 
number of problem-solving strategies, the integration of diverse reasoning mechanisms, and should 
ultimately span the entire spectrum of tasks required for efficient and reliable patient monitoring. 
However, to move from the research laboratory to the clinical arena, these systems will have to be fitted 
with rich and flexible modeling tools capable of overcoming the complexity of generic architectures. In 
addition, they will have to interface with a variety of devices and systems, efficiently manage vast amounts 
of diverse data, and successfully interact with clinical users. 
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To address the issue of information presentation, web-based technology is being integrated with clinical 
information systems, and prototypes have been developed to provide integrated clinical user interfaces 
to ICU patient data [Norris 1997, Nenov 1995] at or close to real time, via a web browser. 

However, the fact remains that despite years of research and development and despite the potential of 
techniques being developed, intelligent monitoring systems have failed to make a significant impact on 
the quality of patient care. Saranummi [1997] contends that the main reason for this phenomenon is a 
lack of interaction between clinicians, industry, and academic researchers. Clinicians and academic 
researchers must interact closely to define the problem, develop solutions, and evaluate these solutions. 
When this is done, industry needs to be involved to assess the commercial viability of the project. To 
foster interaction Saranummi advocates the use of new technical platforms that support "plug-in" mod- 
ules to existing monitoring and PDMS systems. These modules extend the functionality of existing 
monitoring systems without interfering with routine operation, permitting the in situ evaluation of new 
technical solutions. Efforts such as the IMPROVE project sponsored by the European Union, aimed at 
gathering large annotated data sets of ICU patients, could also facilitate the evaluation of novel infor- 
mation processing methods in close to real world situations [Nieminen et al., 1997]. 

Coiera [1994] also suggests that current systems are not adapted to their task. He notes that a major 
effort has been put into developing systems to assist the physician in making diagnoses and taking 
therapeutic actions. He suggests, however, that the main problem facing health care professionals is not 
choosing a diagnosis or therapeutic action, but rather establishing a clear picture of the world leading 
to these choices. Following this observation, Coiera advocates the creation of a "task layer" designed to 
provide a good match between displayed information and the cognitive processes of physicians and 
nurses. More research on the way clinicians interact with intelligent monitoring devices is necessary, 
though, to support the design of these interfaces. 

The design of intelligent monitoring systems is thus an ongoing task. Context-based acquisition, 
processing, analysis, and display of the information are an essential concept in these systems. Elements 
defining this context are, however, complex and diverse. On the one hand, patient status and history 
define an interpretation and monitoring context that can be used to determine normal and abnormal 
limit values, trends, normal relations between observed variables, and significant events to be detected 
in the signals. Models could automatically establish these contexts, which, in turn, could be used to 
configure the monitoring system. On the other hand, cognitive tasks faced by the clinician, such as organ 
system function, treatment course, or resource utilization define the type and form of information 
presented. This information-processing context may be more difficult to automate and will require 
development of better interaction mechanisms between the system and its users. 
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", . . this man with two blue hands and abnormal blood gases and a negative chest X-ray. And high blood 
pressure. . . . (one physician) said he didn't know what the hell was going on. Would I take a look at him? 
Just a curbside consultation — that's all. Just an opinion. I said O.K. — be glad to. It was entirely appropriate. 
I was a physician of record." 



From Two Blue Hands, by Berton Roueche 



187.1 Introduction 



Computers in medicine assist the process of communication and support the integration of information. 
In the above example given by Roueche, the consulting physician could be replaced by a computer system 
that stores and handles information that was previously obtained from a number of experts in various 
medical specialties. Ideally, the victim described here carries a patient data card which, in an emergency 
case, presents valuable information to the physician. The Medical Records Institute [1981] is actively 
engaged in the development of an electronic patient record. 

Computer-mediated reasoning systems have been described by Williams [1982]: Knowledge Base (KB) 
systems are used for interpretation of data about a specific problem, in the light of knowledge represented 
in the KB, to develop a problem specific model and then to construct plans for problem solution. The 
KB contains the descriptive or factual knowledge pertaining to the domain of interest. Candidate hypoth- 
eses are then derived through some pattern matching system. Next, a reasoning "engine" (also termed 
inference machine) carries out the manipulation specified to reach a decision. Knowledge obviously can 
be represented in several ways: it can be characterized by symbols, plain words, and definitions along 
with their interrelations. Knowledge may be expressed by spoken or written words, flow charts, (math- 
ematical) equations, tables, figures and so forth, depending on the level of abstraction. 
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Aspects of language and text interpretation are central issues in artificial intelligence. Language is 
regarded as a metaphor of thought. When it provides the primary abstraction of knowledge about reality, 
it seems reasonable to expect that a powerful abstraction of language should also provide a powerful 
representation of knowledge. Various strategies have been explored: semantic networks offer a versatile 
tool for representing knowledge of virtually any type that can be captured in words by employing nodes 
(representing things) and links (referring to meaningful relationships between things), thus expressing 
causal, temporal, taxonomic and associational connections. Other approaches (such as frame-systems 
and production systems) have also been investigated. Conceptual Graphs (ANSI X3H4) [Sowa, 1984] are 
an emerging standard for knowledge representation, and the method is particularly suited to the repre- 
sentation of natural language semantics. Weed [1991] has advanced the idea of "knowledge coupling" 
i.e., a combination of a computerized medical record and decision-support software. 

An ideal medical KB must be comprehensive (while integrating text, graphics, video, and sound), 
accurate and verifiable, easily accessible where doctors see patients, and the system should be adaptable 
to their own preferred terms or abbreviations [Wyatt, 1991]. Future developments will certainly include 
the use of artificial neural networks [Baxt, 1991], and examples realized thus far include myocardial 
infarction, diabetes mellitus, epilepsy, bone fracture healing, appendicitis, dermatology diagnosis, and 
EEG topography recognition. 

Currently, most internationally available textbooks of (internal) medicine and medical dictionaries 
are retrievable on CD-ROM. Lexi-Comp's Clinical Reference Library is an example, featuring user-defined 
hyperlinks. A variety of titles are available as digital books on data cards suitable for palm computers, 
such as the Pocket PDR. In addition, the Internet offers numerous sites with relevant information [Smith 
and Edwards, 1997]. The September 1997 issue of the journal, MD Computing, focuses on medical 
resources on the Web. 

This chapter addresses several approaches employed to represent medical definitions and knowledge. 
First, a survey will be presented on available coding systems, and next an overview is given of current 
KB systems which primarily refer to the process of establishing a diagnosis. Finally, an anthology con- 
cerning problems related to medical terminology is presented, along with directions for obtaining a 
solution. 

187.2 Classification and Coding Systems 

With the exception of one British system, all classification or coding systems have been developed in the 
USA. This survey lists all projects along with some of their characteristics. 

The International Classification of Diseases (ICD) [International Classification of Diseases, 1994] 
system entered its tenth version, succeeding the ninth edition released in 1978. With more than 8000 
codes, it is applied worldwide for classifying diagnoses and also permits diagnosis related group (DRG) 
assignment employed for billing and reimbursement purposes. 

Systematized Nomenclature of Medicine (SNOMED) [Systematized Nomenclature of Medicine, 1993] 
offers a structured nomenclature and classification for use in human as well as in veterinary medicine. 
It consists of eleven modules with multiple hierarchies covering about 132,600 records, with a printed 
as well as a CD-ROM version available. 

Physicians' Current Procedural Terminology (CPT) [Physicians' Current Procedural Terminology, 
1992] provides uniform language for diagnostic as well as surgical and other interventional services. This 
system is currently in its fourth edition which is distributed by the American Medical Association (AMA), 
and has been incorporated in the Medicare program. 

Medical Subject Headings (MeSH) [1993] is a systematic terminology hierarchy which is used to index 
the MEDLINE medical publications system. It forms the standard for coding keywords related to the 
contents of articles in the field of medicine. The system is updated yearly. 

The National Library of Medicine (NLM) in 1986 started a project called Unified Medical Language 
System (UMLS) [Lindberg et al., 1993]. The project aims to address the fundamental information access 
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problem caused by the variety of independently constructed vocabularies and classifications used in 
different sources of machine-readable biomedical information. The UMLS approach will be to compen- 
sate for differences in the terminologies or coding schemes used in different systems, as well as for 
differences in the language employed by system users, rather than to impose a single standard vocabulary 
on the biomedical community. The Metathesaurus contains 31 1,000 terms and maintains cross reference 
to CPT [Physicians' CPT, 1992], ICD, SNOMED [SNOMED, 1993] and other indexes. NLM encourages 
broad experimentation with the fourth edition, and the tools are (with certain provisions) available on 
the basis of a one-year agreement. The complete files are distributed on four CD-ROMs, while the 
application programs are limited to browsing tools for the metathesaurus. 

Gabrieli [1992] constructed a computer-oriented medical nomenclature based on taxonomic princi- 
ples. His system covers 150,000 preferred terms and a similar number of synonyms. The partitioning 
method employed for medical classification readily permits replacement of English names with terms of 
any other language, thus creating the perspective of a world-wide standard. 

Read from the U.K. designed a classification for various computer applications [Chisholm, 1990]. It 
is designed in accordance with six key criteria: to be comprehensive, hierarchical, coded, computerized, 
cross-referenced, and dynamic. The system is closely connected with the British National Health Service 
(NHS), and includes 100,000 preferred terms, 250,000 codes, and 150,000 synonyms. 

187.3 An Electronic Medical Encyclopedia at Your 
Fingertips: Knowledge Bases and Diagnosis 

In this era, most information (as stored in written documents or computer records) consists of natural 
language text. Traditional retrieval systems use sets of keywords (index terms). It is called a full text 
retrieval system, when all (nontrivial) words of that text are indexed for future identification during search 
procedures. Information requests can then be formulated with the aid of Boolean operators (namely: and, 
or, not). Synonymous terms can be specified by employing the or-operator. The fuzzy-set retrieval model 
still uses Boolean logic but refines the query process by assigning different weight factors to the individual 
index terms. As an attractive alternative, the vector space processing system evaluates the similarity where 
both the index terms and the stored texts are represented by weighted term vectors [Salton, 1991]. The 
assumption that index terms are independent (i.e., orthogonal vector space) obviously implies a short- 
coming of this retrieval method. 

Within the domain of medicine, various KB systems have been developed and current systems will be 
briefly characterized here. Again, all projects in this area originate in the USA, with the exception of two 
(namely OSM and Medwise). One approach (namely CONSULTANT) addresses the field of veterinary 
medicine. A tabulated survey of the situation created in 1987 has been published [Kerkhof, 1987]. An 
occasional system is in the public domain, whereas others sell at a single-user price anywhere between 
$125 and $2000 depending on the configuration, discount, and other factors. 

CMIT developed by the AMA [Finkel, 1981] forms a reference for the selection of preferred medical 
terms including certain synonyms, and generic terms with built-in arrangements to provide maximum 
convenience in usage, currency, and timely publication. Thus, CMIT is both a system of reference and a 
"distillate of a vast amount of medical knowledge." The system is available in electronic form. 

Blois was the first physician to apply CMIT as a diagnostic tool in his RECONSIDER project [Blois, 
1984]. The application was released in 1981 and covered 3,262 disease entities, while 21,415 search terms 
were listed in a directory along with their frequency of occurrence (serving as an indicator of impact for 
each search term). 

DXplain [Barnett et al., 1987] is also based on CMIT [Finkel, 1981] and was released in 1986. The 
project has close connections with AMA, and information is distributed using the World Wide Web. The 
KB contains information on 2,000 diseases and understands over 4,700 terms, with 65,000 disease-term 
relationships. 
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QMR patient diagnostic software [Middleton, 1991] covers 660 disease profiles and over 5,000 findings. 
It is the personal computer version of the INTERNIST-I prototype, sold by First DataBank in San Bruno 
at $495. The program is versatile, easy to use, and offers an attractive user-interface, but the size of its 
KB remained remarkably constant over the years. 

The program MEDITEL [Waxman and Worley, 1990] addresses the issue of diagnosis in adults, and was 
marketed by Elsevier Publishers. Over the last few years, not much news has been reported in the literature. 

ILIAD version 4.0 [Bergeron, 1991] is a software package designed to aid students and residents in 
their clinical decision logic. The ILIAD project is an outgrowth of the Health Evaluation through Logical 
Processes (HELP) system developed at LDS hospital in Salt Lake City, UT The KB includes 1,300 diseases 
and 5,600 manifestations, and is built upon the experience of experts in nine subspecialties of internal 
medicine. The KB is updated semi-annually for revision and expansion. Future versions will support 
access to video laser discs, thereby providing an actual view of X-rays, echocardiograms, skin lesions, 
and other physical findings. 

The Oxford System of Medicine (OSM) project was initiated by the Imperial Cancer Research Fund for 
use in primary care, and to help general practitioners during routine clinical work to support decision- 
making tasks. Such tasks include (1) diagnosing illnesses, (2) planning investigations, and patient treat- 
ment schedules, (3) prescribing drugs, (4) screening for disease, (5) assessing the risk of a particular 
disease, and (6) determining a referral to a specialist [Krause et al., 1993]. 

Medwise is a medical KB founded in 1983 and now covers some 3,900 disease entities, with 29,000 
different keywords [Kerkhof et al., 1993]. It includes a separate KB with almost 500 equivalent terms, 
each referring to an average of three related terms e.g., extremity, leg, and limb. Equivalents are auto- 
matically generated to assist the user during the process of data entry. The matrix structure of the Medwise 
KB permits semantic differentiation of particular observations, e.g., lymphadenopathy as detected during 
palpation versus lymphadenopathy as interpreted from an X-ray picture. In the Medwise system both refer 
to different elements of the matrix, and their corresponding weight factors are individually incorporated 
when calculating the score for disease profile matching. For obvious reasons the language employed is 
English, but a complete dictionary that offers translations into Dutch is available [Kerkhof, 1996]. 

The Framemed system [Bishop and Ewing, 1993] divides medical information into 26 domains (com- 
parable to the matrix in Medwise) [Kerkhof et al., 1993] , and arranges the items in a hierarchical sequence; 
formation of the hierarchies yields a logical framework for a standardized terminology that is in the 
public domain. The objective is to achieve a standard coded terminology to which all existing systems 
can relate, with obvious use as an electronic encyclopedia and for differential diagnosis. Framemed offers 
synonyms, and claims as an advantage a simple overall structure that facilitates updating. 

STAr/-_Re/[STAT, 1994] offers the contents of a first choice medical library (including several standard 
textbooks, e.g., on Primary Care) as well as Medline (either primary care, cardiology, or oncology) on 
CD-ROM. 

MD-Challenger [MD-Challenger, 1994] offers a clinical reference and educational software for acute 
care and emergency medicine (everything from abdominal pain to zygoapophyseal joint arthritis), with 
nearly 4,000 annotated questions and literature references. The system also includes Continuing Medical 
Education (CME) credits. 

Labsearch/286 is a differential diagnosis program allowing input of up to two abnormal laboratory 
findings plus information on symptoms and signs. Laboratory data concentrate on body fluids (blood, 
urine, cerebrospinal, ascitic, synovial, and pleural fluid) entered as high or low [Labsearch, 1994]. The 
system includes 6,500 diseases and 9,800 different findings. No details are available on its diagnostic 
performance, although the restriction of only two laboratory data certainly will limit its potential. 

CONSULTANT is a KB for veterinary medicine developed by White [White and Lewkowicz, 1987]. 
This database for computer-assisted diagnosis and information management is available on a fee-for- 
service basis and is actually used in hundreds of private practices and institutions in North America. 

Dambro [1998] compiled a book (approx. $50) that is updated annually, Griffith's 5 Minute Clinical 
Consult. The first edition appeared in 1993 and indeed contains a realm of chart-like presented practical 
information on a thousand topics with reference to their ICD-code. A CD-ROM version is also on the 
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market. Similar to the design of Framemed [Bishop and Ewing, 1993], the information is compiled by 
a group of contributing authors whose names are listed in conjunction with each disease profile. Tilly 
and Smith [1997] presented a comparable book for the field of veterinary medicine along with a CD- 
ROM version: The 5 Minute Veterinary Consult. 

The Birth Defects Encyclopedia edited by Buijse [1992] has the significant subtitle "The comprehensive, 
systematic, illustrative reference source for the diagnosis, delineation, etiology, biodynamics, occurrence, 
prevention and treatment of human anomalies of clinical relevance." A unique feature of this printed 
encyclopedia is the BDFax-service which allows anyone in the world to call the Center for Birth Defects 
Information Services [Buyse, 1992] and request a current, daily updated version of any article in the KB. 
The information will then be faxed. Furthermore, BRS Information Technologies makes the full text 
available for online search and retrieval. Finally, the related Birth Defects Information System (BDIS) is 
a sophisticated computer-based profile matching system that reduces the research and diagnostic assis- 
tance tasks associated with complex syndromes. 

187.4 Problems Related to Medical Terminology 

kiotomy (ki-ot'o-me) excision of the vulva (in Dorland's pocket Medical Dictionary, 21st ed., p. 342; of 
vulvectomy on p. 684). 

Inspection of other sources suggests that in the word vulva the first letter v should he replaced by the letter 
u, followed by rearrangement of letters to obtain an anagram, namely uvula (being a portion of the soft 
palate). 

Medical language forms one of the greatest obstacles for the practical use of any type of KB designed 
for application in the field of medicine [Kerkhof, 1992]. Just imagine the confusion introduced when a 
visiting surgeon from an exotic country is requested to perform a kiotomy, and consults the above 
mentioned popular dictionary to look up what is supposed to be excised. 

Natural language often has remote roots e.g., adrenaline and epinephrine are the same chemical sub- 
stances [Kerkhof, 1996]. Various words can be used in two senses: as noun and as adjective, e.g., antipyretic 
and adrenal. The Latin word "os" means both "mouth" and "bone". And what's in a phrase? "... a neurologist 
found a nonfluent dysphasia, without circumlocution or paraphasias" (Is there a single term for computer 
entry?). Or: "The CT scan showed accentuation of the peripheral margins of the bilateral parieto-occipital 
forceps major, and splenium low-absorptive abnormalities? What does it mean, anyway? These examples 
illustrate the problem of translating medical phrases into concise "computer-storable language". 

Besides problems inherent in the understanding of natural language, additional difficulties pertaining 
to medical terminology can be indicated: 

American vs. British spelling. Two standard differences are evident, namely the use of the digraph in 
British spelling (e.g., anaemia vs. anemia) and preference for using c (e.g., in leucocyte) rather 
than k (as in the American word leukocyte). However, combination of both rules does not apply 
in the British equivalent of the American spelling of the word leukemia, where the anglicized 
version is spelled as "leukaemia" [Kerkhof, 1996]. 

Synonyms. For some reason "epistaxis" is identical to "nosebleed", and "pruritus" equals "itching"; it 
is not all that difficult, but the major problem is that you have to recall this every time you use 
either of them. Thrombocytosis and thrombocythemia are two words to indicate that the number 
of platelets in the peripheral circulation is in excess of 350,000 per microliter. 

Eponyms. Many disease names refer to the first author (e.g., Boeck's disease for sarcoidosis) who 
described the particular disorder, to the first patient analyzed in detail (e.g., Mortimer's disease, 
again for sarcoidosis), or to the area (e.g., Lyme disease) where the illness was first detected. An 
epidemiologist ever attempted to replace the term Bornholm disease just to honor a friend 
(Sylvest). Geographical variations also occur: Grave's disease (or Parry's disease) as it is known in 
the U.S., U.K., and Australia (referring to goiter), is termed Basedow's disease on the European 
continent, but Flajani's disease in Italy. 
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Preferred terminology. In radiology "air" means gas within the body, regardless of its composition or 
site, but the term should be reserved for inspired atmospheric gas. With reference to pneumotho- 
rax, subcutaneous emphysema, or the contents of the gastrointestinal tract, the preferred term is 
"gas". Sometimes the preferred terminology refers to simplicity; there is no virtue in talking about 
"male siblings" if we mean "brothers". On other occasions the preferred terminology pertains to 
technical vocabulary which permits high precision and resolution descriptions if the available 
information is extremely exact. In these circumstances a valuable tool is blunted, if carelessness 
creeps in. For example the word "clumsiness" describes defective coordination of movement, 
whereas the term "dysdiadokokinesis" refers to the well-defined phenomenon of a defect in the 
ability to perform rapid movements of both hands in unison [Murphy, 1976]. 

Different expressions and their exact meaning. A straightforward example is: tympanism, tympanites, 
tympanitis and tympany, particularly in relation to: tympanal, tympanic, tympanous, and tym- 
panitic [Kerkhof, 1996]. One annotated example is presented: (1) heterogenous (not originating 
in the body); (2) heterogeneous (not of uniform composition, quality or structure); (3) hetero- 
genetic (pertaining to asexual generation); and (4) heterogenic (the same as heterogeneous). 

Implicit information. A particular statement may imply a multitude of information components, e.g., 
if urinalysis is found to be normal, then the preceding examination implies (at least) the absence 
of the quadruplet proteinuria, hematuria, glucosuria, and casts. Also mirror-terms may apply: "dry 
cough" implies "no productive cough" (which can be entered as a negative finding). Likewise, 
leukopenia in particular implies "no leukocytosis". This mutual exclusion principle applies to all 
antonyms, especially all terms beginning with hypo- or hyper-. 

Imprecise terminology [Yu, 1983]. Some terms may carry a vague meaning, e.g., tumor, swelling, mass, 
and lump. To a large extent, however, the use of such terms reflects the uncertainty around an 
observation. In that respect it refers to a justifiable "law of preservation of uncertainty". In other 
words, it would be incorrect to specify an observation in greater detail than the facts permit. This 
notion has consequences for the selection of equivalents. 

Certainty vs. uncertainty. Decision analysis itself does not reduce the uncertainty about the true state 
of nature, but as long as some choice needs to be made it does enable one to make rational decisions 
in the light of uncertainty [Weinstein and Fineberg, 1980]. Yet another aspect of certainty vs. 
uncertainty deserves mentioning, namely where percent-wise figures about prognosis or outlook 
are subjectively interpreted [Eraker and Politser, 1982]. Reduction of the probabilities by a factor 
of one-tenth completely reversed the preferences of a set of respondents. This "certainty effect" 
shows that outcomes perceived with certainty are overweighted relative to uncertain outcomes. 
Thus, the formulation of information affects its interpretation by humans. 

Knowledge engineering implies various levels of translation [Adeli, 1990]. The expert formulates as 
precisely as possible his thoughts, the engineer provides feedback using his/her own phrases to 
ensure an exact match between both minds, and subsequently the resulting expression is translated 
to a format usable for the computer program. These three steps involve transformations of 
language, while assuming that the occasional user of the program appreciates the full scope of the 
original thoughts of the expert. 

Information source vs. actual patient [Kerkhof, 1987]. Current medical information sources tend to 
adhere to preference terminology to promote the use of uniform medical language. However, such 
standard vocabulary is not used by the average patient to describe their individual health problems 
[Hurst, 1971]. Then it is left to the clinician to transpose, e.g., "puffy face" and "moon face" if 
appropriate. Indeed, better health care can be realized by educating the patient about the value 
of structured communication with the physician [Verby and Verby, 1977]. 

Subspecialty interpretation and jargon. In chest radiology "type I bulla" means the same as "bleb" or 
"air cyst", while pneumatocele is not regarded as a proper synonym [Glossary, 1989]. When naming 
a "hollow space" you may choose anything out of the following set: ( 1 ) cavity, (2) crypt, (3) pouch, 
(4) gap, (5) indentation, (6) dell, (7) burrow, (8) crater, (9) concavity, (10) excavation, (11) gorge, 
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(12) pocket, (13) cave, (14) cavern, (15) cistern, or (16) lacuna. However, every expression may 
exhibit a nuance within a certain context. Then there is a jargon: the terms (1) show, (2) engage- 
ment, (3) lightening, and (4) station, for example, have a particular meaning within the field of 
obstetrics [Kerkhof, 1996] . The term "streaking" has a meaning which differs for the microbiologist 
and the radiologist. 

Multilingual approaches. The relation between a concept and the various corresponding terms in 
different languages is, in general, not unique. This implies that a multitude of different words 
from different syntactical categories may represent a single concept. Particularly the European 
countries are confronted with additional natural language problems. The Commission of the 
European Communities supports research activities in this area such as EPILEX (a multilingual 
lexicon of epidemiological terms containing Catalan, Dutch, English, French, German, Italian, 
Portuguese, and Spanish) [Du V Florey, 1993] and the development of a multilingual natural 
language system [Baud et al., 1994]. 

Frequency of occurrence. The meaning of semi-quantitative indicators like "always, often, etc." is not 
transparent when screening a medical text. The intuitive interpretation of some quasi-numerical 
determinants is summarized elsewhere [Kong et al., 1986]. 

Noise terms. A study [Kerkhof, 1993] revealed that input information consisted on average of 75 terms 
per patient case; required for establishing the primary diagnosis were only 15 terms. This implies 
that 80% of the input data consisted of "noise terms", which do not directly contribute to the 
confirmation of the correct diagnosis. Rather such overwhelming portions of information may 
blur the process of hypothesis formation, and in particular when the diagnostic task is solely left 
to the human being who is prone to confusion if the number of contributing elements is more 
than about ten. 

187.5 Solution for Discrepancies 

"Representation of knowledge is an art, not a science" [D.B. Lenat, 1989]. 

Various scientists have described techniques to help and solve language-related problems in the commu- 
nication area of medicine. An available online medical dictionary can be an important tool for research 
and application in natural language processing. Dorland's Illustrated Medical Dictionary has been con- 
verted to an online interactive computer-based version. To assure exact description of definitions, a coding 
system has been advocated because of: (1) ease of handling, (2) redundancy to avoid errors, (3) specificity, 
(4) indication of relationships (e.g., by using common code groups for similar terms), (5) equating 
equivalent terms and linking them in different languages, and (6) the feasibility to cross-link terms on 
the basis of common portions of their codes [Bishop and Dombrowski, 1990]. 

Attempts have also been made to automatically translate existing medical terminology systems [Cimino 
and Barnett, 1990]. For example, using a semantic network for mapping, the closest match between the 
MeSH term "portography" was "portal contrast phlebogram" in the ICD-procedures directory. 

Two independent routes are available to attack the majority of linguistic problems related to a medical 
KB: (1) Use uniform input data, i.e., force the user of the KB to enter only controlled terms as they 
already exist in the KB (terms allowed by the editorial committee responsible for the KB). This route 
implies tedious work for the user, and the job of selecting recognizable entries has to be carried out every 
time the KB is consulted. (2) Apply an additional KB with equivalent expressions. Once the user enters 
a particular term, the KB automatically generates a list of equivalent expressions to choose from. The 
construction of such an auxiliary KB has to be realized only once, apart from the obvious updating 
process desired as the system evolves. Furthermore, a matrix structure as applied in the Medwise KB 
proves to be helpful in organizing information, and incorporating a substantial portion of semantic 
interpretation when dealing with medical knowledge. Inclusion of the auxiliary KB also enhances user- 
friendliness, and accelerates the process of data entry [Kerkhof et al., 1993]. 
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Discussion 

While advances in computer technology permit virtually unlimited storage and fast retrieval capabilities, 
interpretation of natural language still constitutes a major obstacle for the application of medical KBs. 
Even common words often have highly ambiguous meanings [Murphy, 1976], e.g., "weakness" can mean 
lethargy or paralysis. Therefore, the user of any KB should be aware of the fact that justifiable adoption 
of a particular equivalent expression may depend on the precise interpretation of the complete case report. 

On the other hand, the use of uniform input data on the basis of a thesaurus would imply a substantial 
loss of refinement with respect to terminology. Ideally, there is a maximum degree of freedom in the 
selection of words and their nuances, both for the KB and for the description of the individual case study. 

An auxiliary KB with synonyms [SNOMED, 1993; Gabrieli, 1992; Chisholm, 1990; Kerkhof et al, 1993; 
Lindberg et al., 1993] indeed may offer considerable help during the process of matching input terms 
with information sources [Kerkhof, 1993]. An adequate solution to (at least a part of the) linguistic 
problems, will substantially enhance the performance of diagnostic computer programs. 

Summary 

A multitude of projects address the issue of how to handle an ever increasing amount of medical 
information. Coding schemes have been designed and are regularly refined, while other approaches aim 
to collect, structure, and disclose this gigantic corpus of information that features an unconfined potential 
to thrive. 

The lack of standardized medical language limits the optimal use of computers in medicine. Major 
obstacles concern: (1) imprecise terminology, (2) the limited scope of a thesaurus, (3) multilingual 
conversion problems, (4) discrepancies between information sources vs. actual clinical cases, as well as 
(5) the ever expanding range of synonyms, acronyms, antonyms and eponyms. Incorporation of a separate 
knowledge base containing equivalent expressions appears to be indicated for the practical use of medical 
information systems. 

Defining Terms 

AMA: American Medical Association, with headquarters in Chicago. This organization is actively 

involved in new developments concerning medical informatics. CMIT [Finkel, 1981] and CPT 

[Physicians' Current Procedural Terminology, 1992] as well as the cooperation with DXplain 

[Barnett et al., 1987] are prototypes of their activities. 
ICD: International Classification of Diseases, a widely accepted system to organize all possible medical 

diagnoses. The tenth version is translated for worldwide application. 
IMIA: International Medical Informatics Association, consisting of the national societies from about 

40 countries. 
NLM: National Library of Medicine, in Bethesda (Maryland), USA. 
UMLS: Unified Medical Language System, a project initiated by the NLM and distributed on CD-ROM. 

Cooperation with parties to implement the system within their own environment is encouraged, 

but requires a contract [Lindberg et al., 1993]. 
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Further Information 

The fields of medical knowledge bases and medical terminology are rapidly developing. There is no single 
comprehensive source of information available, but the reader is advised to scrutinize the following 
journals for new information: 

M.D. Computing, published by Springer Verlag (New York and Berlin), reports on research in the field 
of medical informatics. Of special interest to the clinician. 

IEEE Expert appears four times a year and presents the latest on artificial intelligence and expert 
systems. Contact P.O. Box 3014, Los Alamitos, CA 90720. For those interested in technical details on 
intelligent systems and their applications. IEEE (P.O. Box 1331, Piscataway NJ 08855-1331, USA) pub- 
lishes a number of related journals e.g., on knowledge engineering and on multimedia. 

The National Library of Medicine (NLM, 8600 Rockville Pike, Bethesda MD 20894, USA) releases 
news bulletins, and provides information on UMLS and contracts for cooperation. 

Ongoing projects on medical nomenclature are surveyed elsewhere in this chapter. 

Pertinent Internet Web Sites: 

British Medical Journal: http://www.bmj.com 

JAMA: http://www.ama-assn.org/public/journals 

The Lancet: http://www.thelancet.com 

The New England Journal of Medicine: http://www.nejm.org 

Grateful Med: http://igm.nlm.nih.gov/ 

Physician's Desk Reference: http://www.pdrnet.com/ 

About ICD-9-CM: http:\www.icd-9-cm.org/ 

About ICD-10: http:\www.cdc. gov.nchswww/ about/major/ dvs/icd 10des.htm 

Medical guide: http://www.medmatrix.org 

Search engine for Health on the Net: http://www.hon.ch/ 

Alternative Medicine: http://www.yahoo.com/health/alternative_medicine 

Dermatology images: http://tray.dermatology.uiowa.edu/dermimag.htm 

Focus on molecular medicine: http://www.aston.it/icamm 

Obviously, annual meetings form the forum for presentation of the latest developments: IMIA con- 
ferences are well known besides the world congress organized every four years. IMIA publishes a Yearbook 
of Medical Informatics; contact Schattauer Publishers, Subscriptions Dept, P.O. Box 104545, W-7000 
Stuttgart 10, Germany. 
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188.1 Introduction 

Natural language is the primary means of communication in all complex social interactions. In biomedical 
areas, knowledge and data are disseminated in written form, through articles in the scientific literature, 
technical and administrative reports, and hospital charts used in patient care. Much vital information is 
exchanged verbally, in interactions among scientists, clinical consultations, lectures, and in conference 
presentations. Increasingly, computers are being employed to facilitate the process of collecting, storing, 
and distributing biomedical information. Textual data is now widely available in an electronic format, 
through the use of transcription services, and word processing. Important examples include articles 
published in the medical literature and reports dictated during the process of patient care (e.g., radiology 
reports and discharge summaries). 

While the ability to access and review narrative data is highly beneficial to researchers, clinicians, and 
administrators, the information is not in a form amenable to further computer processing, for example, 
storage in a database to enable subsequent retrievals. At present, the most significant impact of the 
computer in medicine is seen in processing structured data, information represented in a regular, pre- 
dictable form. This information is often numeric in nature, e.g., measurements recorded in a scientific 
study, or made up of discrete data elements, e.g., elements selected from a predefined list of diseases. 

The techniques of natural language processing provide a means to bridge the gap between textual and 
structured data, allowing humans to interact using familiar natural language, while enabling computer 
applications to process data effectively. 

188.2 Linguistic Principles 

Natural language processing (or computational linguistics) is a branch of computer science concerned 
with the relationship between information as expressed in natural language (sound or text) and repre- 
sented in formalisms that facilitate computer processing. Natural language analysis studies how to convert 
natural language input into a structured form, while natural language generation investigates how to 
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produce natural language output from structured representations. Natural language processing investi- 
gates and applies scientific principles of linguistics through development of computer systems. One of 
the most important principles is that natural language is built up from several layers of structure, each 
layer defined as a set of restrictions on the previous layer [Harris 1991]: 

TABLE 188.1 Layers of Linguistic Structure 

Linguistic Layer Description 

Phonological Mapping of sounds to phonemes (letters) 

Morphological Grouping of phonemes into morphemes (roots and inflections) 

Lexical Combining of morphemes into words 

Syntactic Combining of words into sentence structure 

Semantic Mapping of sentence structure into literal meanings 

Pragmatic Combining of sentence meanings into discourse meaning 

These layers correspond roughly to the subfields of linguistics, and to the areas of study in natural 
language processing, which seeks to develop them into computer models. Knowledge about the rules of 
language structure at any or all of its levels is called competence. Most natural language processing 
applications in biomedicine do not even begin to approach the language competence of humans. For 
example, a transcription system may have knowledge about the sounds of a language but know nothing 
about the syntax of sentences. Similarly, a program that indexes scientific articles may know which terms 
to look for in the text, but have no ability to express how these terms are related to one another. However, 
it is important to emphasize that such applications may perform extremely useful tasks, even with very 
limited competence. 

Many natural language processing applications exploit the fact that biomedical fields are restricted 
semantic domains, which means that natural language information associated with that field is focused 
on a narrow range of topics. For example, an article about cell biology is limited to discussions of cells 
and tissues, and is unlikely to mention political or literary issues. The natural language of a restricted 
semantic domain is called a sublanguage. Sublanguages tend to have specialized vocabularies and spe- 
cialized ways of structuring sentences (e.g., the "telegraphic style" of notes written about patients in 
hospital charts) and ways of organizing larger units of discourse (e.g., the format of a technical report) 
[Grishman and Kittredge 1986, Kittredge and Lehrberger 1982]. 

These properties of sublanguages allow the use of methods of analysis and processing that would not 
be possible when processing the language of newspaper articles or novels. For example, a program that 
indexes medical articles can select index terms from a list of terminology known to be of interest to 
researchers; a speech recognition system can exploit the fact that only certain words can be uttered by a 
user in response to a given prompt; a system that analyzes clinical reports can look for predictable semantic 
patterns that are characteristic of the given domain. 



188.3 Applications in Biomedicine 



There are four forms of natural language exchange possible between human and the computer: (1) the 
human can supply information to the computer using natural language; (2) the human can retrieve 
information from the computer through natural language; (3) the computer can supply information to 
the human in the form of natural language output; and (4) the computer can request information from 
the human using natural language questions. Applications of natural language processing may support 
one or several of these modes of exchange. For example, an automated questionnaire for patients may 
generate multiple choice questions but be able to receive only numbered answers, while a database 
interface may be able to accept natural language questions submitted by a researcher and return natural 
language replies. Applications can be classified according to the levels of language competence embodied 
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in their design: (1) speech systems, (2) lexical systems, (3) syntactic and semantic systems, and 
(4) discourse systems. 

Speech Systems 

Speech recognition systems [Grasso 1995] process voice data into phonemic representations (such as 
text), while speech synthesis systems generate spoken output from such representations. Speech process- 
ing applications may work only at this level of language competence, e.g., software that transcribes spoken 
input into text, or software that generates speech when visual information cannot be employed, such as 
on the telephone. Applications in this category include systems that capture structured endoscopic reports 
[Johannes and Carr-Locke 1992], and emergency room notes [Linn et al., 1992]. Applications may also 
employ speech technology as part of a larger system. Examples include interfaces to diagnostic systems 
[Landau et al., 1989], [Shiffman etal., 1992], and systems for taking patient histories [Johnson et al., 
1992]. 

Lexical Systems 

Lexical systems work with language at the level of words and terms (short sequences of words). A lexicon 
is a special type of database that provides information about words and terms which may include 
pronunciation, morphology (roots and affixes), and syntactic function (noun, verb, etc.). The Specialist 
Lexicon provides information about medical terms and general English words [National Library of 
Medicine 1993]. A thesaurus groups synonymous terms into semantic classes, which are frequently 
organized into a hierarchical classification scheme. The Systematized Nomenclature of Medicine 
(SNOMED) classifies medical terms in several hierarchies, such as (1) topography, (2) morphology, 
(3) etiology, (4) function, (5) disease, (6) occupation, and (7) procedure [Rothwell etal, 1993]. Medical 
Subject Headings (MeSH) classifies terms used in medical literature [National Library of Medicine 1990]. 
The Metathesaurus of the Unified Medical Language System (UMLS) combines SNOMED, MeSH, and 
other thesauri [Lindberg 1993]. 

Semantic classes are used to index natural language documents to facilitate their retrieval from a 
database. For example, MeSH is used to index the medical literature in the MEDLINE database [Bachrach 
1978]. Semantic classes are also used to define the set of data elements {controlled vocabulary) that can 
be processed by a computer application. Controlled vocabularies are used by diagnostic systems and 
clinical information systems — by the programs that collect data, store it in databases, and display it 
[Linarrson and Wigertz 1989]. 

Syntactic and Semantic Systems 

Lexical techniques can only approximate the meaning of an article or some text entered by the user of a 
computer application. One approach to obtaining a deeper understanding is to produce a representation 
of the syntactic structure of a sentence (e.g., determining the subject, the verb, and the object), and then 
map this structure into a representation of its meaning (such as predicate calculus). A computer program 
that analyzes the structure of sentences is called a parser, and uses a lexicon and a grammar (a formal 
representation of the syntactic rules of a language). 

Representations of semantic content (or knowledge representation) vary, but most representations are 
variations of a structure called a frame, in which predefined slots are filled with information from natural 
language sentences. A number of systems process clinical reports into their own specialized frame 
representations, for example the Linguistic String Project [Sager et al., 1986], MedLEE [Friedman 1994], 
and SymText [Haug 1997]. The representation of "conceptual graphs" [Sowa 1984] has emerged as a 
potential standard for semantics and is used for medical language processing by a number of systems 
which include METEXA [Schroder 1992], MENELAS [Bouaud etal, 1997], and RECIT [Baud etal., 
1992]. 
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Discourse Systems 

Information from the sentences of a discourse (e.g., an article or clinical report) combine in complex 
ways; for example, cross-references are made (e.g., using pronouns), and events are related temporally 
or causally. Understanding a discourse fully may require information about the context in which the 
natural language exchange occurs, or the background knowledge of a technical field. Systems based on 
syntax and semantics usually carry out an analysis of the discourse and context information as a subse- 
quent stage. Other systems attempt to approximate the semantic analysis of sentences, using medical 
knowledge to guide the overall processing of a text by defining the possible sequences of topics and 
subtopics characteristic of the domain. Domains in which these methods have been attempted include: 
(1) the history and physical sections of a patient chart [Archbold and Evans 1989]; (2) echocardiography 
reports [Canfield et al., 1989]; (3) discharge summaries [Gabrieli and Speth 1987]; and (4) radiology 
reports [Ranum 1988]. 

Natural language is also a convenient mode for the output of applications, for example in producing 
clinical reports [Kuzmak and Miller 1983]. Some expert systems have used natural language to express 
their recommendations [Rennels et al., 1989], and to provide explanations of their reasoning process 
[Lewin 1991]. 

188.4 Challenges 

The design of controlled vocabularies is an important area of research [Cimino et al., 1989]. As yet, no 
comprehensive vocabulary for clinical medicine exists, and the effort to create one is beyond the means 
of any single institution. Indexing of articles and reports is currently a labor-intensive process, and quality 
control is a significant problem. Index terms are a poor approximation of the meaning of a text. Similarly, 
systems that capture clinical information through the entry of individual findings fall short of capturing 
the patient history. Natural language processing systems with greater understanding of syntax and seman- 
tics may help by providing richer information representations. 

No natural language processing system currently covers the complete range of language interaction 
with significant competence at all levels — from speech to discourse. In the near future, the systems with 
the greatest practical success will specialize in performing selected language processing tasks in well- 
defined domains. 

Defining Terms 

Controlled vocabulary: A set of well-defined data elements intended for use in a computer application. 

Frame representation: A structure for representing complex information, consisting of fixed, named 
"slots" which may contain data values or frames. 

Grammar: A formal representation of language structure, usually syntactic structure. 

Language competence: Knowledge about the rules of a language. 

Lexicon: A formal compilation of the words of a language, with information about phonetics, mor- 
phology, syntax, and semantics (depending on requirements of computer applications). 

Morphology: Field studying how sounds combine into morphemes, and how morphemes combine to 
form words. 

Parser: A computer program that uses a grammar and lexicon to analyze sentences of a language, 
producing a syntactic or semantic representation. 

Phonology: Field studying the rules that govern the sounds used in languages. 

Pragmatics: Field studying how sentences are combined to form larger units of discourse, and how to 
represent meaning in the context of a situation. 

Semantics: Field studying the relation between natural language sentences and formal representations 
of meaning. 
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Structured data: Data that has a explicit, unambiguous, and regular structure, making it amenable to 

computer processing. 
Sublanguage: The natural language of a restricted semantic domain that is focused on a specific set of 

tasks or purposes, e.g., interpreting X-ray images. 
Syntax: Field studying how words combine to form sentences. 

References 

Allen, J. 1995. Natural Language Understanding, Second Edition. Redwood City, CA: The Benjamin 

Cummings Publishing Company, Inc. 
Archbold, A. and Evans, D. 1989. On the Topical Structure of Medical Charts. In: Proceedings of the 13th 

Annual SCAMC, p. 543-547. IEEE Computer Society Press, Washington, D.C. 
Bachrach, C.A. and Chaen, T 1978. Selection of MEDLINE contents, the development of its thesaurus, 

and the indexing process, Medical Informatics. 3(3): 237-254. 
Baud, R.H., Rassinoux, A.M., and Scherrer, J.R. 1992. Natural Language Processing and semantical 

representation of medical texts. Meth. Inform. Med. 31(2):1 17—125. 
Bouaud, J., Zweigenbaum, P., et al., 1997. A Semantic Composition Method Driven by Domain Knowledge 

Models. Twenty-First Annual Symposium of the American Medical Informatics Association. 
Canfield, K., Bray, B., Huff, S., and Warner, H. 1989. Database capture of natural language echocardio- 
graphy reports: A Unified Medical Language System approach. In: Proceedings of the 13th Annual 

SCAMC, p. 559-563. IEEE Computer Society Press, Washington, D.C. 
Cimino, J.J., Hripcsak, C, Johnson, S.B., Friedman, C, Fink, D.J., and Clayton, P.D. 1989. Designing an 

Introspective, Multi-Purpose Controlled Medical Vocabulary. Proceedings of the 13th Annual 

SCAMC, p. 513-518. IEEE Computer Society Press, Washington, D.C. 
Friedman, C, Alderson, P.O., Austin, H.M., Cimino, J.J., and Johnson, S.B. 1994. A general natural 

language text processor for clinical radiology. JAMIA 1(2): 161-174. 
Gabrieli, E. and Speth, D. 1987. Computer Processing of Discharge Summaries. In: Proceedings of the 11th 

Annual SCAMC, p. 137-140. IEEE Computer Society Press, Washington, D.C. 
Haug, P.J. et al., 1994. A Natural Language Understanding System Combining Syntactic and Semantic 

Techniques. Eighteenth Annual Symposium of the American Medical Informatics Association. 
Grasso, M.A. Automated speech recognition in medical applications. MD Computing, 1995;12(l):16-23. 
Grishman, R. and Kittredge, R., Eds. 1986. Analyzing Language in Restricted Domains: Sublanguage 

Description and Processing. Erlbaum Associates, Hillsdale, New Jersey. 
Harris, Z. 1991. A Theory of Language and Information — A Mathematic Approach. Clarendon Press, 

Oxford. 
Johannes, R.S. and Carr- Locke, D.L. 1992. The Role of Automated Speech Recognition in Endoscopic Data 

Collection. Endoscopy. 24(Suppl 2): 493-498. 
Johnson, K., Poon, A., Shiffman, S., Lin, R., and Fagan, L. 1992. A history taking system that uses 

continuous speech recognition. In: Proceedings of the 16th Annual SCAMC, p. 757-761. IEEE 

Computer Society Press, Washington, D.C. 
Kittredge, R. and Lehrberger, J., Eds. 1982. Sublanguage — Studies of Language in Restricted Semantic 

Domains. De Gruyter, New York. 
Kuzmak, P.M. and Miller, RA. 1983. Computer-aided generation of result text for clinical laboratory 

tests. In: Proceedings of the 7th Annual SCAMC, p. 275-278. IEEE Computer Society Press, Wash- 
ington, D.C. 
Landau, J.A., Norwich, K.N., and Evans, S.J. 1989. Automatic Speech Recognition — Can it Improve the 

Man-Machine Interface in Medical Expert Systems? Int. J. Biomed. Comp.. 24(2): 111-117. 
Lewin, H.C. 1991. HF-Explain: a natural language generation system for explaining a medical expert 

system. In: Proceedings of The 15th Annual SCAMC, p. 644-648. IEEE Computer Society Press, 

Washington, D.C. 



© 2000 by CRC Press LLC 



Lindberg, D.A.B., Humphreys, B.L., and McCray, A.T. 1993. The Unified Medical Language System. In: 

Yearbook of Medical Informatics, van Bemmel, J.H., McCray, A.T., Eds., p. 41-51. International 

Medical Informatics Association, Amsterdam. 
Linarrson, R. and Wigertz, O. 1989. The data dictionary — a controlled vocabulary for integrating clinical 

databases and medical knowledge bases. Meth. Inform. Med. 28(2): 78-85. 
Linn, N.A., Rubenstein, R.M., Bowler A.E., and Dixon, J.L. 1992. Improving the Quality of Emergency 

Room Documentation Using the Voice-Activated Word Processor: Interim Results. In: Proceedings of 

the 16th annual SCAMC, p. 772-776. McGraw Hill, New York. 
National Library of Medicine. 1990. Medical Subject Headings (NTIS NLM-MED-90-01). National 

Library of Medicine. Bethesda, Maryland. 
National Library of Medicine. 1993. The Specialist Lexicon. Natural language systems group, National 

Library of Medicine, Bethesda, MD. 
Ranum, D. 1988. Knowledge Based Understanding of Radiology Text. In: Proceedings of the 12th Annual 

SCAMC, p. 141-145. IEEE Computer Society Press, Washington, D.C. 
Rennels, G., Shortliffe, E., Stockdale, F„ and Miller, P. 1989. A computational model of reasoning from 

the clinical literature. AI Magazine. 10(1): 49-57. 
Rothwell, D.J., Palotay, J.L., Beckett, R.S., and Brochu, L., Eds. 1993. The Systematized Nomenclature of 

Medicine. SNOMED International. College of American Pathologists, Northfield, Illinois. 
Sager, N., Friedman, C, and Lyman, M. 1987. Medical Language Processing — Computer Management of 

Narrative Data. Addison-Wesley, Reading, Mass. 
Scherrer, J.R., Cote, R.A., and Mandil, S.H. 1989. Computerized natural language medical processing for 

knowledge representation. North Holland, Amsterdam. 
Schroder, M. 1992. Knowledge-Based Processing of Medical Language: A Language Engineering Approach. 

In: Advances in Artificial Intelligence, 16th German conference on AI, p. 221-234. Springer Verlag, 

Berlin. 
Shiffman, S., Lane, CD., Johnson, K.B., and Fagan, L.M. 1992. The integration of a continuous speech 

recognition system with the QMR diagnostic program. In: Proceedings of the 16th Annual SCAMC, 

p. 767-771. IEEE Computer Society Press, Washington, D.C. 
Spyns, P. 1996. Natural Language Processing in Medicine. Meth. Inform. Med. 35:285-301. 
Sowa, J.F. 1984. Conceptual Graphs: Information Processing in Mind and Machine. Addison-Wesley, Read- 
ing MA. 
Van Bemmel, J.H. (Ed.). Meth. Inform. Med., 1998:4(5). 

Further Information 

For general information about the field of natural language processing, see [Allen 1987], and [Covington 
1994]. Surveys of research in sublanguage can be found in [Kittredge and Lehrberger 1982] and [Grish- 
man and Kittredge 1986]. A variety of papers on medical language processing are collected in [Scherrer 
etal., 1989 and Van Bemmel 1998]. 



(0 2000 by CRC Press LLC 



Geddes, L. A. "Historical Perspectives 5 - Electroencephalography ." 

The Biomedical Engineering Handbook: Second Edition. 

Ed. Joseph D. Bronzino 

Boca Raton: CRC Press LLC, 2000 



Historical Perspectives 5 

Electroencephalography 



Leslie A. Geddes Historical Background 

Purdue Univeisity Commercial Production of EEG Machines 

Historical Background 

Hans Berger (1929) was the first to record electroencephalograms from human subjects. However, before 
then, it was well known that the brain produced electrical signals. In fact, in Berger 's first paper there is 
a short history of prior studies in animals. Interestingly, the first person to demonstrate the electrical 
activity of the brain did not make recordings. In 1875, Richard Caton in the United Kingdom used the 
Thomson (Kelvin) sensitive and rapidly responding reflecting telegraphic galvanometer to display the 
electrical activity of exposed rabbit and monkey brains. His report [Caton, 1875], which appeared in the 
British Medical Association Journal, occupied only 2 1 lines of a half-page column. In part, the report stated: 

In every brain hitherto examined, the galvanometer has indicated the existence of electric currents. 
The external surface of the grey matter is usually positive in relation to the surface of a section through 
it. Feeble currents of varying direction pass through the multiplier [galvanometer] when the electrodes 
are placed on two points of the external surface, or one electrode on the grey matter, and one on the 
surface of the skull. The electric currents of the grey matter appear to have a relation to its function. 
When any part of the grey matter is in a state of functional activity, its electric current usually exhibits 
negative variation. For example, on the areas shown by Dr. Ferrier to be related to rotation of the head 
and to mastication, negative variation of the current was observed to occur whenever those two acts 
respectively were performed. Impressions through the senses were found to influence the currents of 
certain areas; e.g., the currents of that part of the rabbit's brain which Dr. Ferrier has shown to be 
related to movements of the eyelids, were found to be markedly influenced by stimulation of the 
opposite retina by light. 

No recordings of the movement of the spot of light on the scale of the Kelvin galvanometer have been 
found, perhaps because, at that time, telegraphic operators used to read the dots and dashes of the Morse 
code by watching the movements of the spot of light on the galvanometer scale. Nonetheless, Caton's 
description clearly shows than he witnessed the fluctuating potentials that we now know exist. Also 
important is the fact that Caton was the first to report visual-evoked potentials. 

Berger, a psychiatrist in Jena, Germany, was aware of the several prior electroencephalographic animal 
studies and had conducted experiments using dogs. The only recording devices available to him were 
the string galvanometer, developed by Einthoven [1903] for electrocardiography and the capillary elec- 
trometer developed by Marey [1876]. Although there were a few mirror-type oscillographs available for 
recording waveforms from alternating current (50 to 60 Hz) generators and transformers, the sensitivity 
of such devices was very low, and they could not be used for bioelectric recording without a vacuum- 
tube amplifier. 

The voltage appearing on the scalp produced by the brain is only about one-tenth that of the ECG 
detected with limb leads. To enable recording brain activity with the string galvanometer, the tension in 
the string was reduced, which increased the sensitivity but reduced the speed of response. This was the 
method used by Berger when he found that the capillary electrometer was unsatisfactory. 
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FIGURE HP5.1 (a) The Edelmann double-string galvanometer of the type used by Berger. On the left it is a funnel 
for detecting the arterial or venous pulse and on the right is a device for detecting the heart sounds, all of which can 
be recorded along with the two channels of ECG, and a time signal on 12-cm-wide photographic paper, (b) The 
phonocardiagram, venous-pulse record, and two recordings of the ECG. (From Zusatz apparate zum Elektrokar- 
diographen, Siemens-Reiniger-Veifa Berlin ca. 1926.) 

Apart from problems of the lack of sensitivity and speed of response in the recording apparatus, Berger 
faced severe electrode problems. Because of the low amplitude of the cortical signals, the electrodes had 
to be very stable, producing no voltages that could be seen on the electrocortical recordings. In other 
words, the electrode noise and stability had to be in the low-microvolt range. Zinc electrodes were popular 
at that time, and Berger used them in his first dog studies just after the turn of this century. For human 
use, he used zinc-plated needles (insulated down to the tip) and inserted them through existing trephine 
holes of skull defects so that the tip was epidural. The electrodes were sterilized with 10% formalin 
solution. Berger first used the single-string Edelmann galvanometer designed for electrocardiography. 
Later he used a two-string unit of the type shown in Fig. HP5.1 so that he could record the ECG along 
with the EEC Such units also were equipped with devices to record the venous or arterial pulses and 
heart sounds. The photographic recording paper was 12 cm wide and 50 m in length. 

The string galvanometer is a current-drawing device; therefore, a low electrode-subject resistance was 
necessary for adequate sensitivity. Berger stated that it was difficult to obtain a low resistance and reported 
a value of 1600 Q for his needle electrodes when placed 5 to 6 cm apart with the tips in the epidural 
space. A high-resistance electrode pair would reduce the amplitude of the recorded activity. Later Berger 
used chlorided silver needle electrodes to help solve this problem. 
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Difficulties with the zinc-plated needle electrodes led Berger to develop very thin lead-foil electrodes, 
wrapped in flannel, and soaked in 20% NaCl solution. The combination of saline-soaked flannel and the 
use of a rubber bandage to hold the electrodes on the scalp permitted recording the EEG for hours 
without the saline evaporating. The resistance measured between a pair of such electrodes ranged from 
500 to 7600 Q depending on the size of the electrodes. 

Berger's final improvement to his recording equipment consisted of placing a capacitor in series with 
the electrodes and string galvanometer to block the steady potential difference due to slight electrochem- 
ical differences in the two lead electrodes. Recall that the amplitudes being recorded were in the range 
of tens of microvolts, and any steady difference in electrode potential would cause a steady deflection of 
the baseline of the recording. With the capacitor, this steady offset potential did not deflect the galva- 
nometer baseline. 

Describing his first studies with the zinc-plated needle electrodes, Berger stated [translation by Gloor, 
1969]: 

As a general result of these recordings with epidural needle electrodes I would consequently like to 
state that it is possible to record continuous current oscillations, among which two kinds of waves can 
be distinguished, one with an average duration of 90 a, the other with one of 35 a. The longer waves 
of 90 a are the ones of larger amplitude, the shorter, 35 a waves are of smaller amplitude. According 
to my observations there are 10 to 11 of the larger waves in one second, of the smaller ones, 20 to 30. 
The magnitude of the deflections of the larger 90 a waves can be calculated to be about 0.0007 to 
0.00015 V, that of the smaller 35 a waves, 0.00002 to 0.00003 V. [The symbol o was used for millisec- 
onds.] 

In his first paper, Berger called the dominant low-frequency first-order and the higher-frequency waves 
second order. In his second paper he stated: 

For the sake of brevity I shall subsequently designate the waves of first order as alpha waves = a— w, 
the waves of second order as beta waves = P - w, just as I shall use "E. E. G." as the abbreviation for the 
electroencephalogram and "E. C. G." for the electrocardiogram. 

The average duration of the waves reproduced in [his] Figure 1 is for the a - w = 120 a and for the 
P -w = 30 to 40 rj. 

Berger objected to the term electrocerebrogram to designate a record of the electrical activity of the 
brain. He stated: 

Because for linguistic reasons, I hold the word "electrocerebrogram" to be a barbarism, compounded 
as it is of Greek and Latin components, I would like to propose, in analogy to the name "electrocar- 
diogram" the name "electroencephalogram" for which here for the first time was demonstrated by me 
in man. 

Berger's papers contain many EEGs from patients, but those from his son Klaus are discussed fre- 
quently. In fact, Klaus was used as a subject for electrode testing. For example, Berger wrote: 

Klaus' records were taken with every other possible type of electrodes: silver, platinum, lead electrodes, 
etc; also, different arrangements of these on the skin surface of the head were used. However, time 
and again it was found that the best arrangement was that with electrodes placed on the forehead and 
occiput. Of Klaus' many records, I only want to show another small segment of a curve obtained in 
this manner [Fig. HP5.2]. In this instance head-band electrodes were applied to the forehead and 
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FIGURE HP5.2 Klaus at the age of 15. Double-coil galvanometer. Condenser inserted. Recording from forehead 
and occiput with head-band electrodes. (Top) The record obtained from the scalp; (bottom) time in 1/10 second. 
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occiput and were fixed with rubber bandages. From these head-band electrodes, records were taken 
with galvanometer 1 of the double-coil galvanometer; galvanometer 2 was set at its maximum sensi- 
tivity and was used as a control to make sure that no outside currents were entering the galvanometer 
circuit to disturb the examination. At that time I was still very distrustful of the findings I obtained 
and time and again I applied such precautionary measures. The record of galvanometer 2 ran as 
completely straight line, without my oscillation. 

From the lengthy discussions in Berger's papers, it is easy to see that few believed that his recordings 
originated in the brain. To dispel some of the uncertainty, Berger usually recorded the ECG along with 
the EEC Occasionally, he recorded heart sounds and the arterial pulse. In addition in the electrical 
activity of the heart, various critics proposed that Berger's recorded activity was due to friction of blood 
in the cerebral arteries, pulsations of the brain and/or scalp, respiration, contraction of piloerector or 
skeletal muscle, and glandular activity. Berger dealt with all the potential artifacts, pointing out that their 
time course and frequency were different from those of the EEG and showed that the EEG continued 
with transient slowing of the heart rate. Finally, he stated (perhaps in exasperation): 

I therefore believe I have discussed all the principal arguments against the cerebral origin of the curves 
reported here which in all their details have time and again preoccupied me, and in doing so I have 
laid to rest my own numerous misgivings. 

Among the first to publish an English-language verification of Berger's observations were Jasper and 
Carmichael [ 1935] , then at Brown University in the United States. Silver electrodes ( 1 to 2 cm in diameter), 
covered with flannel and soaked in saline, were connected to an amplifier/mirror-oscillograph system. 
They were able to confirm Berger's findings and extend them, showing that with a two-channel system 
used to record the EEG of a girl with a convulsive disorder, the alpha wave frequency was 10 per second 
on the left side of the head and 6 to 8 per second on the right side, one of the early indications that the 
EEG was altered by brain pathology. Figure HP5.3a shows one of the records obtained by Jasper and 
Carmichael. 

Jasper later came to the Montreal Neurological Institute (McGill University) and created the Electro- 
pysiological Laboratory, which was formally opened with a celebration meeting held on February 24-26, 
1939. In attendance were the world leaders in electrophysiology. 

Clinical EEG at the Montreal Neurological Institute was inaugurated using a machine built by Andrew 
Cipriani. The inkwriter was of unique design and featured a strong magnetic field produced by an 
electromagnet. In this field was a circular coil coupled to an inkwriting pen. It is interesting to observe 
that this principle had been used by d'Arsonval [1891] with pneumatic coupling to a tambour that caused 
a pen to write on a rotating drum, as shown in Fig. HP5.4«. Later in the 1930s, this coil design was coupled 
to a conical diaphragm and became the first dynamic loudspeaker. The pen motor devised by Cipriani is 
sketched in Fig. HP5.4fe A small vane, affixed to the writing stylus, dipped into an oil chamber (dashpot) 
to provided damping. This four-channel instrument was in routine use when the author first came in 
contact with Jasper in the early 1940s; it was replaced in 1946 by a six channel model III Grass EEG. 

Meanwhile, at Harvard University (Boston, Mass), Gibbs, Davis, and Lennox [1935], were pursuing 
their interest in epilepsy. Recognizing the potential of the EEG for the diagnosis of epilepsy, they initiated 
a series of studies that would occupy the next many decades. Their recorder consisted of an ink-writing 
telegraphic recorder called the Undulator. In December of 1935, they published their first paper on EEG 
which carried a footnote that read: "This paper is no, XVII in a series entitled Studies in Epilepsy." Citing 
the Berger papers and that by Jasper and Carmichael, Gibbs et al. stressed the importance of direct-inking 
pens for immediate viewing of the EEG so that the effect of environmental factors could be identified 
immediately. They reported: 

The method is exceedingly simple. Electrical contact is made to two points on the subject's head. 
Except for the study of grand mal epileptic seizures we regularly employ as electrodes two hypodermic 
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FIGURE HP5.3 The first U.S. records of the EEG to confirm Berger's report, (a) The records obtained by Jasper 
and Carmichael (1935). The first channel shows the alpha waves, and the second shows the electrical activity detected 
by electrodes on the leg above the knee. The second record shows alpha inhibition by illumination of the retina, and 
the third record shows return of the alpha waves when the light was extinguished. ( b) The first records published by 
Gibbs et al. (1935) showing alterations in normal subjects by various types of sensory stimulation eyes open and 
closed, problem solving, noise (rattle), and smelling ether. (Both by permission.) 



needles inserted one into the scalp at the vertex of the skull and the other into the lobe of the left ear. 
Enough procaine hydrochloride is injected previously to insure the continued comfort of the subject. 

Figure 3b is a reproduction of the first EEG obtained by Gibbs et al. Soon Gibbs produced his well- 
known Atlas of Electroencephalography, which first appeared in 1941 and became the "bible" for training 
electroencephalographers. 

The first recording equipment used by Gibbs et al. [1935] was built by Lovett Garceau. It consisted of 
a four-stage, singled-sided, resistance-capacity-coupled amplifier made with high-grain, screen-grid tubes 
driving a distinct-linking telegraphic recorder called the Undulator (U in Fig. 5) obtained from the 
Western Union Telegraph Company, Fig. 5 shows the circuit diagram, the overall high-frequency response 
extended to 25 Hz [Grass, 1984]. 

Slightly earlier, in Germany, Toennies [1932] had developed a direct-linking recorder that he called 
the Neurograph. Fig. HP5.6 shows a picture of the instrument and a record of the human electrocardio- 
gram and the response of canine eyes to light. Note that the chart is running in the opposite direction 
to conventional recordings. The time marks are 1/5 second. 
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FIGURE HP5.4 Electropenumatic inkwriter, described by d'Arsonval ( 1891) (a) and sketch of moving-coil inkwriter 
devised by Cipriani in the late 1930s for EEC at the Montreal Neurological Institute (b). 
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FIGURE HP5.5 The single-sided, five-stage, resistance-capacitor-coupled amplifier developed by Gareau to drive 
the Western Union inkwriting telegraphic recorder (Undulator) used by Gibbs et al. to record EEGs (From Garceau 
et al. 1935. Arch Neurol Psychiatry. With permission.) 
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FIGURE HP5.6 Toennies Neurograph and recordings showing a 1-mV calibration, the human ECG, and electrical 
activity from the canine eye, time marks 1/5 s. (From Toennies [1932].) 



Commercial Production of EEG Machines 

The excellent collaboration between Gibbs and Grass in 1935 resulted in replacement of the Undulator 
telegraphic inkwriter with a robust, d'Arsonval-type inkwriter in early 1936. Meanwhile, the Grass 
Instrument Company had been founded (1935) to produce electrophysiologic equipment at the well- 
known address, 101 Old Colony Avenue, Quincy, Mass; the address in the same today. 

In 1937, Grass adopted folding chart paper and by 1939 was providing three-, four-, and six-channel 
EEGs. The Model III shown in Fig. HP5.7 is the machine recognized by all and founded many EEG 
laboratories. It featured a knee-hole console with the pens at the right and ample viewing space for the 
record as it evolved. The chart speed was 30 mm/s, which ultimately became the standard. 

At about the same time Grass was building EEG machines in Quincy, Mass., Franklin Offner, a research 
associate of Ralph Gerard at the University of Chicago, started his own company at 5320 North Kedzie 
Avenue, Chicago, to produce an EEG using the Crystograph, a high-speed, piezoelectric inkwriter that was 
described by Offner and Gerard [1936]. It consisted of two slabs of rochelle salt crystals, three corners of 
each were clamped, and the fourth was free to move when a voltage was applied to electrodes on the crystals. 
The moving corners were mechanically linked by a slender brass belt that caused motion of the rod that 
carried the inkwriting stylus. The sinusoidal frequency response extended uniformly to 100 Hz; the chart 
speed 25 mm/s, the same as for ECG. However, a gear shift provided chart speeds above and below 25 mm/s. 

The Crystograph was used with the first Offner EEG machines, and it was ideally suited for high 
efficiency energy transfer from vacuum-tube amplifiers because of its high impedance. The differential 
amplifiers were housed in two 19-in relay tracks, as shown in Fig. HP5.8o; the Crystograph rested on an 
adjacent table; a six-channel Crystograph is illustrated. Damping was adjusted by a series variable resistor 
to achieve an excellent response to a step function. The record shown in Fig. HP5.8a was produced by 
a 3-uV (peak-peak) square wave, showing the excellent transient response and the remarkably low internal 
noise level of the amplifiers. 
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FIGURE HP5.7 The Grass Model III EEG, circa late 1940s. (Courtesy Grass Instrument Co., Quincy, Mass.) 

Although the Cyrstograph EEG produced elegant records, temperature and humidity played havoc 
with the rochelle salt crystals, and Offner replaced it with the Dynagraph, shown in Fig. HP5.8b, the 
recorder being a low-impedance, robust d'Arsonval-type moving-coil inkwriter. 

Efficient coupling between the output amplifier stage and the moving-coil inkwriter was always a 
problem that was solved by Offner in his Dynagraph. A synchronous vibrator sampled the signal from 
the preamplifier, amplified the pulses, and recombined them after passing through a step-down trans- 
former, thereby providing an efficient impedance match between the amplifier output stage and penwriter. 
Fig. HP5.8fo is an illustration of the Offner Dynagraph that featured a sit-down console for viewing the 
record. The antiblocking feature of this design is shown at the bottom of Fig. HP5.8fo by a sine wave 
record in which a transient, 100 times larger than the recording, was presented and the recording was 
restored a fraction of a second after the transient. Offner later replaced the vacuum tubes with transistors, 
bringing out the first transistorized EEG. 

In the late 1940s, Warren Gilson, who was a pioneer in devising physiologic instruments, started 
production of EEGs in Madison, Wise; Fig. HP5.9 is a photograph of one of his later instruments (circa 
1959). 

It is noteworthy that following the Grass Model III EEG, all subsequent instruments were of the console 
type with the penwriter at the right end of the desktop of the console. Not easily identified in any of the 
foregoing figures are the two rotary switches for each channel. Each switch could connect each side of 
the differential amplifier input to any of the 21 electrodes of the 10-20 system. Switching also was provided 
to apply a step-function calibrating signal to all channels simultaneously, the step function typically being 
produced by depressing a pushbutton or rotating a knob. In addition, a low-current ohmmeter was 
provided to permit measurement of the resistance of any pair of electrodes. Some EEGs included a high- 
frequency filter for each channel to exclude muscle artifacts often seen when patients clenched their jaws. 

Electroencephalographs have changed little since their development in the post-World War II days. 
Transistors have replaced the vacuum tubes, but the chart speed and frequency response are the same as 
those established by the first manufacturers of EEG machines, although many new recording techniques 
have been introduced. 
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FIGURE HP5.8 The Offner EEG which used the Crystograph recorder (a) and the Offner Dynagraph EEG (b), 
circa late 1940s (Courtesy of Franklin Offner.) 
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FIGURE HP5.9 The Gilson EEG, circa 1950 (Courtesy ofWarren Gilson). 
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Biomedical engineering is responsible for many of the recent advances in modern medicine. These 
developments have led to new treatment modalities that have significantly improved not only 
medical care, but the quality of life for many patients in our society. However, along with such 
positive outcomes new ethical dilemmas and challenges have also emerged. These include: ( 1 ) involvement 
of humans in clinical research, (2) definition of death and the issue of euthanasia, (3) animal experi- 
mentation and human trials for new medical devices, (4) patient access to sophisticated and high cost 
medical technology, (5) regulation of new biomaterials and devices. With these issues in mind, this section 
discusses some of these topics. The first chapter focuses on the concept of professional ethics and its 
importance to the practicing biomedical engineer. The second chapter deals with the role medical 
technology has played in the definition of death and the dilemmas posed by advocates of euthanasia. 
The third chapter focuses on the use of animals and humans in research and clinical experimentation. 
The final chapter addresses the issue of regulating the use of devices, materials, etc. in the care of patients. 
Since the space allocated in this Handbook is limited, a complete discussion of the many ethical 
dilemmas encountered by practicing biomedical engineers is beyond the scope of this section. Therefore, 
it is our sincere hope that the readers of this Handbook will further explore these ideas from other texts 
and articles, some of which are referenced at the end of the chapter. Clearly, a course on biomedical 
ethics should be an essential component of any bioengineering curriculum. 

With new developments in biotechnology and genetic engineering, we need to ask ourselves not only 
if we can do it, but also "should it be done?" As professional engineers we also have an obligation to 
educate the public and other regulatory agencies regarding the social implications of such new develop- 
ments. It is our hope that the topics covered in this section can provide an impetus for further discussion 
of the ethical issues and challenges faced by the bioengineer during the course of his/her professional life. 
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189.1 A Variety of Norms Govern Human Conduct 

Various norms or principles govern our activities. They guide us and provide standards for the evaluation 
of conduct. For example, our conduct is governed by legal and moral norms. These two sets of norms 
overlap, but it is clear that the overlap is not complete. Morality requires some acts that are not legally 
required of us and vice versa. Not surprisingly, then, we speak of legality and morality, of legal obligations 
and moral obligations. This is appropriate primarily because (1) law and morality are distinct sources 
of obligation and (2) it is possible for legal and moral obligations to conflict. Recognizing that moral 
and legal obligations have distinct sources, it is easier to appreciate the nature of a conflict between them 
and work toward its resolution. The same thing can be said within the sphere of morality. Moral rules 
and principles can and do give rise to conflicting obligations. Here too, the way people speak is revealing. 
People speak of professional obligations and social responsibilities, as well as the duties of "ordinary 
morality." Such talk is increasingly heard in the high-tech fields of biomedicine and biomedical engi- 
neering, for here the development and use of sophisticated technology intersect with the rights and 
interests of human beings in an especially profound way. 

189.2 Professional Ethics and Ethics Plain and Simple 

Talk of professional ethics presupposes a distinction between the constraints that arise "from what it 
means to be a decent human being" (Camenisch, 1981) and those that come with one's role or attach 
to the enterprise in which one is engaged. Paul Camenisch calls the former "ethics plain and simple." 
The latter are the elements of an occupational or role morality. They only apply to persons who occupy 
a specific role. The idea here is simple enough. A father, for example, has responsibilities that a man who 
is not a father does not have. So too, a college teacher, a cleric, or a police officer has responsibilities that 
persons not occupying these roles do not have. 
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A professional ethic is an occupational or role morality. It is like the law in several ways. For example, 
it is not a mere restatement of the norms of ordinary morality. Another point of similarity is that like 
the law, its scope is limited; it has a jurisdiction in the sense that what it requires or permits is role 
specific. Thus, for example, the norms of the lawyer's professional ethic do not impose obligations on 
people who are not lawyers. Still another similarity is that a professional ethic may allow (or even require) 
acts that ordinary morality disallows or condemns. An explanation of this fact begins with the observation 
that, in general, as professionals do their work they are allowed to put to one side considerations that 
would be relevant and perhaps decisive in the ethical deliberations of nonprofessionals. So, for instance, 
an attorney is free to plead the statute of limitations as a bar to a just claim against his/her client or to 
block the introduction of illegally seized evidence in a criminal trial even though his/her client committed 
the alleged offense. Now one thing that might be said here is that the attorney is simply doing his/her 
job. The point would be well taken. However, it leaves something rather important unsaid, namely, that 
the conduct in question is obligatory for the attorney. 

Thinking along lines such as these is not confined to the paradigm professions of law and medicine. 
For example, scientists have been known to claim that as scientists they are free to (indeed must) put to 
one side social, political, and moral concerns about the uses to which their discoveries may be put. Indeed, 
some have claimed that scientists are morally obligated not to forgo inquiry even if what emerges from 
it can be put to immoral or horrendous uses. Plainly, such an appeal to a professional ethic can excite 
controversy. And this reveals another respect in which a professional ethic (role morality) is like law: it 
is subject to moral critique. Indeed, it is not only subject to moral criticism, its validity depends on its 
being morally justified (i.e., justified in terms of "ethics plain and simple"). 

One important way in which a professional ethic differs from law is that it is widely held that its 
standards are in some sense higher than those of ordinary morality. More is expected of professionals 
than nonprofessionals. They are expected to act on the basis of the knowledge that sets them apart and 
in doing so they are expected to put the interests of clients or patients ahead of their own interests. 
Clearly, if professionals are governed by special, higher standards, if the rights and duties of professionals 
differ from the rights and duties of nonprofessionals, then it makes a difference whether one's occupation 
counts as a profession. 

189.3 Professions 

As it happens, there is no generally accepted definition of the term "profession". However, several writers 
have suggested that some characteristics common to recognized professions are necessary or essential. 
The idea that these writers share is that with these characteristics in mind one can mark a serviceable 
distinction between professional and nonprofessional occupations. 

Bayles [1989] maintains that three features of a profession are necessary: (1) extensive training, (2) that 
involves a significant intellectual component, and (3) puts one in a position to provide an important 
service to society. To be sure, other features are common. For example: (1) the existence of a process of 
certification or licensing, (2) the existence of a professional organization, (3) monopoly control of tasks, 
(4) self-regulation, and (5) autonomy in work. But, according to Bayles, they are not essential. The crucial 
point in the argument that they are not essential is that a large (and growing) number of professionals 
work in organizations (e.g., HMOs or institutions such as hospitals) where tasks are shared and activities 
are directed and controlled by superiors. Two things are noteworthy here. First, Bayles's analysis does 
not include normative features among those that distinguish professions from non professions. Second, 
in his analysis an occupation may count as a profession though it lacks certain features common to most 
professions. While Bayles's decision not to include normative features among the distinguishing features 
of professions has been criticized by several writers who insist that such features are indeed essential, the 
second point has met with widespread agreement. 

It will be well to avoid the controversy respecting what is and what is not essential to a profession. 
Happily, a survey of the substantial literature on professions reveals agreement on several points that, 
taken together, provide a helpful picture of professions and professional activity. This picture is rather 
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like a sketch made by a police artist who works with descriptions provided by various witnesses. It has 
five elements. The first is the centrality of abstract, generalized, and systematic knowledge in the perfor- 
mance of occupational tasks. The second element is the social significance of the tasks the professional 
performs; professional activity promotes basic social values. The third element is the claim to be better 
situated/qualified than others to pronounce and act on certain matters. This claim reaches beyond the 
interests and affairs of clients. Professionals (experts) believe that they should define various aspects of 
society, life, and nature and we generally agree. For example, we defer to them (or at least our elected 
representatives do) in matters of public policy and national defense. Moreover, in certain settings, a 
hospital for example, it is simply impossible not to defer to the judgment of experts/professionals. The 
crucial premise can scarcely be doubted: in the contemporary world there is more to know (much of it 
having immediate practical application) than any one person is capable of knowing. The fourth point is 
that, on the basis of their expertise and the importance of the work that requires it, professionals claim 
that as practitioners they are governed by role-specific norms — a professional ethic — rather than the 
norms that govern human conduct in general. Now, it cannot be denied that particular applications of 
the relevant norms have been a source of controversy. However, since controversy of this sort generally 
presupposes the applicability and validity of the role-specific norms, it only serves to confirm the general 
point. The final element of the composite picture is that now most professionals work in bureaucratic 
organizations/institutions. The romantic appeal of the model of the professional as solo-practitioner may 
incline one to balk at this. But romance must not be allowed to prevail over evidence. And in this case 
the evidence is substantial. In fact, some of it is arresting. For example, it was recently reported that 
eighty percent of recent medical school graduates are salaried employees of HMOs, clinics, or hospitals 
(US News and World Report, 1999). 

189.4 The Profession of Biomedical Engineering 

As we have seen, it is not an idle question whether biomedical engineering is a profession. If professionals 
are governed by special, higher standards, if the rights and duties of professionals differ from the rights 
and duties of nonprofessionals, then it makes a difference whether one's occupation counts as a profession. 
Since there is no standard definition, no set of necessary and sufficient conditions the satisfaction of 
which would be decisive, in answering the question we will have to proceed in a different way. It is 
suggested that we think in terms of characteristics shared by recognized professions that, taken together, 
constitute a composite picture akin to what a police sketch artist might draw. What is seen when one 
looks at this picture? In particular, does the picture match the reality of biomedical engineering practice? 
Looking at this composite picture of a profession it is seen that (1) abstract, generalized, and systematic 
knowledge is crucial to the performance of occupational tasks (2) these tasks promote basic social values, 

(3) practitioners claim to be better situated/qualified than others to pronounce and act on certain matters, 

(4) the conduct of practitioners is governed by role-specific norms, and (5) most of the work done by 
practitioners is done within bureaucratic institutions. The fit between the reality of biomedical engineer- 
ing practice and this composite picture of professions is tight. Indeed, looking to this picture to answer 
the question whether biomedical engineering is a profession, it can scarcely be doubted that the answer 
is yes. 

One possible objection to this answer might be that there is no code of ethics for bioengineers, and 
thus bioengineering does not count as a profession because the fourth requirement is not satisfied. The 
objection could be met by pointing to a code for bioengineers, if there was one. But a better response is 
that the objection itself is misplaced. It is based on the mistaken assumption that there is a set of conditions 
the satisfaction of which is necessary and sufficient for the ascription of the term "profession" to an 
occupation. But the existence of a code of ethics is neither necessary nor sufficient for an occupation to 
count as a profession; what we are working with is a composite picture, not a definition. The objection 
is misplaced for a second reason as well. Even if there is no code of ethics for bioengineers, codes of 
ethics are well known in engineering. For example, the National Society of Professional Engineers and 
the Institute of Electrical and Electronic Engineers have codes of ethics. In addition, the National 
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Committee on Biomedical Engineering (Australia) has produced a set of professional standards. More- 
over, a professional ethic may develop by other than quasi-legislative means; in this respect it is like law 
which has both legislative and customary forms. Thus, there is nothing here to impugn the claim that 
bioengineering is a profession. 

189.5 Two Sources of Professional Ethics 

It is important to be clear about the fact that a professional ethic may develop in more than one way. A 
professional ethic is a role morality. The norms of a profession's role morality need not be expressly 
"legislated" by, for example, a professional organization, because most of them are implicitly legislated 
in practice. Thus, one way to identify the norms of a profession's role morality is to reflect on the 
expectations one has respecting the conduct of one's peers. The stable interactional expectancies of 
practice can constitute what amounts to a customary morality of a profession, an uncodified professional 
ethic. To be sure, these customary norms may be codified (and often are). However, just as in the case 
of customary law, codification is not necessary for their validity. Indeed, it is entirely appropriate to say 
that codification of some such norms (like some laws) is the result of the codifier's recognition of their 
independent validity. 

It should be noted here that the norms of a customary morality are valid only if they are accepted in 
practice. It is important, however, that while acceptance is necessary, it is not sufficient. It cannot be 
sufficient, because if it were, the idea that something is right simply because someone or some group 
believes it is right would have to be granted. And that, of course, is patently false. What, then, are the 
additional conditions for the validity of such norms? This is surely a fair question. However, answering 
it completely would lead us far afield. Consequently, a short answer will have to suffice. The case for the 
validity of a norm of customary morality (in other words, for its status as a moral norm) turns on whether, 
in addition to being accepted in practice, compliance with it has good consequences and does not infringe 
upon the rights of other persons. It will be noticed that these are precisely the sorts of considerations 
that persons charged with the task of rule making do, or at any rate should, regard as decisive in doing 
their work. In any case, whether the norms of a professional ethic arise in practice or are expressly 
"legislated" by a group, they are "dual aspect norms" (Wellman, 1985). They are in play for role agents 
who are trying to decide what action to take; they are also in play for others within and outside of the 
profession who observe or by other means become aware of deviation from (or conformity to) them and 
react accordingly. 

189.6 Professional Ethics in Biomedical Engineering 

It is clear that biomedical engineering has an ethical dimension. After all, human well being is at stake 
in much if not most of what a biomedical engineer does. Indeed, error or negligence on the part of a 
biomedical engineer can result in unnecessary suffering or death. Of course, much the same can be said 
of other engineering fields. Yet, there is something distinctive here. The National Committee on Biomed- 
ical Engineering has identified three ways in which biomedical engineering differs from other branches 
of engineering. First, biomedical engineers work with biological materials that behave differently from 
and have different properties than the materials that most engineers work with. Second, preparation for 
a career as a biomedical engineer involves study of both engineering principles and the life sciences. 
Third, and most important for present purposes, is "the indirect and very often direct responsibility of 
biomedical engineers for their work with patients." Such responsibility for the well-being of others is a 
clear indicator that a role has an ethical dimension. 

Many things in biomedical engineering that fall under the rubric of professional ethics have to do with 
policies or procedures. For example, the development of 

1. methods for obtaining informed consent and criteria for justified departure from these methods, 

2. means for identifying subjects for clinical trials, 
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3. criteria of thorough testing, 

4. standards to obviate or mitigate conflicts of interest, as well as mechanisms for their application 
and enforcement, and 

5. criteria for just distribution of scarce biomedical resources (expertise and technology). 

All of these things (in this far from exhaustive list) fall under the rubric of professional ethics. However, 
they are institutional in the sense that they call for decisions about policies or procedures rather than 
individual action. Decisions in these areas are not decisions to be made by individual practitioners nor 
are they decisions to be taken case by case. They are decisions about structures of practice that require 
quasi-legislative activity. Relying on others with requisite expertise, and soliciting input from persons 
whose interests are at stake in biomedical engineering practices, biomedical engineers should work to 
develop structures of practice that satisfy legal requirements and ensure, as far as possible, that their 
professional practice manifests a commitment to safety and the promotion of human well-being. These 
carefully designed structures of practice should be part of the explicitly quasi-legislated portion of the 
professional ethic of biomedical engineering. 

The question that arises naturally here is how to proceed in this undertaking. Precisely which of the 
dominant approaches in ethics — utilitarian, deontological, or aretaic — is best is a subject of vigorous 
debate among philosophers. But this issue will not be debated here. Instead, a sketch will be presented 
of an approach that can be employed in designing ethical structures of practice, and then, with one 
difference to be explained, used in making individual ethical decisions in one's capacity as a professional. 

189.7 Tools for Design and Decision in Professional Ethics 

Multiple analyses or several independent judges are often relied on in making decisions. In general, this 
is done when something significant turns on the final decision. For example, physicians frequently call 
for a consultation and patients are encouraged to seek a second opinion before an invasive procedure is 
performed. Similarly, hospitals and universities rely on panels or commissions — an Institutional Review 
Board, for example — to make decisions about proposed research or other pressing issues. In such cases 
it is assumed, rightly, that relying on multiple modes of analysis or several judges is wise even though 
(1) they produce the same judgment in many — hopefully most — cases and, thus, appear to involve 
redundancy; and (2) in some cases they produce conflict that could have been avoided by relying on a 
single mode of analysis or single judge. Why is this a wise course? One part of the answer is that our 
confidence is bolstered when the same conclusion is reached by different trustworthy means or judges. 
Here redundancy is a value. The second part of the answer is that being open to conflicting opinions 
and analyses can help us to avoid errors that would occur otherwise. This will happen when the conflict 
prompts reexamination of the question that reveals facts previously overlooked or undervalued or 
mistaken analyses. It should be noted that the approach recommended here is not political; logical and 
evidentiary considerations rather than simple consensus justify the judgments it produces. Randy Barnett 
sums up the case for such an approach: 

The virtue of adopting multiple or redundant modes of analysis is... twofold: (a) convergence (or 
agreement) among them supports greater confidence in our conclusions; and (b) divergence (or 
conflict) signals the need to critically reexamine the issue in a search for reconciliation. In sum, 
convergence begets confidence, divergence stimulates discovery. [Barnett, 1990] 

In the context of professional ethics an approach of this sort would involve reliance on three modes 
of analysis: (1) utilitarian, (2) rights-based deontological, and (3) role-based institutional. A brief descrip- 
tion of each mode of analysis is presented in the following paragraphs. 

A utilitarian analysis begins with the assumption that Tightness is a function of value and tells us that 
what is morally required of us is the production of the greatest amount of good possible in a situation 
for all of the affected parties. Utilitarian thinking leads quite directly to an embrace of the familiar 
principles of nonmaleficence and beneficence. Deontological analysis denies the essential connection 
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between Tightness and goodness asserted by utilitarianism. Unlike utilitarians, deontologists hold that 
some actions are intrinsically right and some actions are intrinsically wrong. More particularly, they 
insist that the fact that the consequences of an action are the best possible in a given situation does not 
show that the action is right. The most famous of deontological theories, that of Immanuel Kant, teaches 
that what is morally required of us is that wherever found, in ourselves or others, humanity is always 
treated as an end and never as a mere means to an end. In other words, persons have intrinsic worth (as 
Kant said, they have a dignity rather than a price) and must never be treated as if their value were merely 
instrumental (as if they were things). That is our duty; the other side of that coin is the right others have 
to receive that sort of treatment from us. What matters on this approach, then, is whether one is treating 
people as they deserve to be treated. Thus, deontological thinking leads directly to an embrace of the 
familiar principles of autonomy and justice. These two modes of analysis are alike in this: both proclaim 
independence from what the commonly accepted ideas regarding right and wrong are in a community; 
on both views morality is not something that is instituted (made); rather, it is discovered (not by empirical 
investigation, but by ratiocination). Thus, these modes of analysis contrast sharply with the third mode 
of analysis to be discussed here, namely, the role-based institutional mode. 

Thinking in terms of role morality Tightness is a function of conformity with the stable interactional 
expectations — accepted norms of conduct — associated with a role. Responsibilities and rights are tied to 
the function of a role agent within an institution. A role morality is institutional in the quite basic sense 
that it is instituted, that is, brought into existence by human beings. Sometimes, of course, this is 
accomplished by quasi-legislative means. But such activity is neither necessary nor the most common 
means of creation. A role morality is implicit in practice, it is established through mutually beneficial 
interaction over time. It is a customary or conventional morality. (There is an analogy here to the law 
in its customary and legislative forms.) One final point, noted earlier, is that considerations of non- 
institutional morality play a critical role in the validation of the norms of a role morality. The familiar 
maxim in medicine, primum non nocere, as well as the implicit rule of lawmaking that lawmakers must 
promulgate the laws they make (no secret laws), are examples of principles that would be readily embraced 
by those thinking along these lines and validated by considerations of non-institutional morality. 

We can summarize this brief description of the three modes of analysis in the following way. The key 
question for the utilitarian is one of maximal value; for the rights-based deontological thinker it is one 
of deserved or rightful treatment; with role morality it is one of conformity to established custom or 
practice. 

The recommended approach for the design of ethical structures of practice is to use all three modes 
of analysis. The hope is that they will converge on the same result. When they do we can be confident 
that implementation of the principle or policy is justifiable. That is the case, for example, with the rules 
of professional practice requiring fidelity, confidentiality, privacy, and veracity. 

Using this approach one hopes for convergence on the same result. That is to be expected in easy cases. 
But not all cases are easy cases. Our analyses may diverge rather than converge. What then? Barnett 
suggests that divergence "stimulates discovery." The idea is that achieving convergence may be difficult, 
but at least sometimes it may be achieved on a second pass by retracing one's steps — going through the 
analyses again, paying special attention to the input — or rethinking the analyses themselves, paying special 
attention to previously identified sources of difficulty. For example, utilitarian and deontological analyses 
may diverge because they deal with individual rights in different ways. Deontologists treat rights as 
trumps; utilitarians simply include them among the considerations that count in their calculations 
respecting likely consequences. It may be that the two modes of analysis diverge because of an erroneous 
assignment of weight to the rights that are in play. If so, convergence can be achieved by rethinking the 
weight assigned to the rights in the utilitarian calculations. It must be admitted, however, that divergence 
may not be eliminated by such means. What then? In anticipation of such cases, a presumption should 
be made in favor of one of the modes of analysis. 

When one's project is the design of ethical structures of practice, the presumption should be made in 
favor of rights-based deontological analysis, it being understood that utilitarian considerations may rebut 
the presumption favoring rights in some circumstances. When one's project is not the design of decision 
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devices (structures of practice), but deciding what one ought to do as a professional, a presumption 
should be made in favor of institutional responsibilities, i.e., professional ethics, it being understood that 
utilitarian or deontological considerations may rebut the presumption in favor of role responsibilities 
under some circumstances. The point of this presumption is that the burden of justification is properly 
placed on those who would depart from valid norms. Two things argue in favor of this: ( 1 ) a presumption 
in favor of institutional responsibilities (professional ethics) presupposes a justification of the sort pro- 
vided by the convergence of the three modes of analysis and (2) making this presumption guards against 
the dangers of failing to take the professional ethic seriously and robbing the earlier work — constructing 
ethical structures of practice — of its point. 

189.8 Professional Integrity, Responsibility, and Codes 

Professional ethics involves more than merely complying with the norms of a code of ethics. This is true 
for several reasons, not the least of which is that there may not be a code. But even if there is a code, for 
example, the NSPE code, or the code of the AMA or ABA, there is still much more to professional ethics 
than compliance with the norms of that code. The rules and principles of a code set out the criteria for 
distinguishing malpractice from minimally acceptable practice. They do not reach to, nor do they define 
responsible practice. Indeed, they cannot do this, for responsible practice is more than doing one's duty 
(thus we speak of responsibilities rather than duties); responsible practice involves discretion and judg- 
ment in an essential way. Moreover, it involves the integration of professional judgment (expertise) and 
moral judgment [Whitbeck, 1998]. Here the boundaries between fact and value are fluid or at any rate 
they vary much as boundaries marked by a river, the course of which changes over time. 

It is perhaps best to conceive professional ethics as a call for responsible conduct on the part of 
practitioners. The call is justified because the integrity of individual practitioners is required for the 
integrity of a profession, which, in turn, is necessary to justify the trust of others essential to the success 
of professional practice in any area. There is much work to be done in making clear what the demands 
of responsible practice are and in maintaining integrity in practice (which is produced by adherence to 
the standards of responsible practice). There is no ethical algorithm; responsible judgment and action 
are essential in the development, interpretation, and application of the normative principles governing 
the profession of biomedical engineering. 
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190.1 Introduction 

Two moral norms have remained relatively constant across the various moral codes and oaths that have 
been formulated for health-care deliverers since the beginnings of Western medicine in classical Greek 
civilization, namely beneficence — the provision of benefits — and nonmaleficence — the avoidance of 
doing harm. These norms are traced back to a body of writings from classical antiquity known as the 
Hippocratic Corpus. Although these writings are associated with the name of Hippocrates, the acknowl- 
edged founder of Western medicine, medical historians remain uncertain whether any, including the 
Hippocratic Oath, were actually his work. Although portions of the Corpus are believed to have been 
authored during the sixth century BC, other portions are believed to have been written as late as the 
beginning of the Christian Era. Medical historians agree, though, that many of the specific moral directives 
of the Corpus represent neither the actual practices nor the moral ideals of the majority of physicians of 
ancient Greece and Rome. 

Nonetheless, the general injunction, "As to disease, make a habit of two things — to help or, at least, to 
do no harm," was accepted as a fundamental medical ethical norm by at least some ancient physicians. 
With the decline of Hellenistic civilization and the rise of Christianity, beneficence and nonmaleficence 
became increasingly accepted as the fundamental principles of morally sound medical practice. Although 
beneficence and nomaleficence were regarded merely as concomitant to the craft of medicine in classical 
Greece and Rome, the emphasis upon compassion and the brotherhood of humankind, central to 
Christianity, increasingly made these norms the only acceptable motives for medical practice. Even today 
the provision of benefits and the avoidance of doing harm are stressed just as much in virtually all 
contemporary Western codes of conduct for health professionals as they were in the oaths and codes that 
guided the health-care providers of past centuries. 

Traditionally, the ethics of medical care have given greater prominence to nomaleficence than to 
beneficence. This priority was grounded in the fact that, historically, medicine's capacity to do harm far 
exceeded its capacity to protect and restore health. Providers of health care possessed many treatments 
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that posed clear and genuine risks to patients but that offered little prospect of benefit. Truly effective 
therapies were all too rare. In this context, it is surely rational to give substantially higher priority to 
avoiding harm than to providing benefits. 

The advent of modern science changed matters dramatically. Knowledge acquired in laboratories, 
tested in clinics, and verified by statistical methods has increasingly dictated the practices of medicine. 
This ongoing alliance between medicine and science became a critical source of the plethora of technol- 
ogies that now pervades medical care. The impressive increases in therapeutic, preventive, and rehabil- 
itative capabilities that these technologies have provided have pushed beneficence to the forefront of 
medical morality. Some have even gone so far as to hold that the old medical ethic of "Above all, do no 
harm" should be superseded by the new ethic that "The patient deserves the best" However, the rapid 
advances in medical technology capabilities have also produced great uncertainty as to what is most 
beneficial or least harmful for the patient. In other words, along with increases in ability to be beneficent, 
medicine's technology has generated much debate about what actually counts as beneficent or nonma- 
leficent treatment. To illustrate this point, let us turn to several specific moral issues posed by the use of 
medical technology (Bronzino, 1992; 1999). 

190.2 Defining Death: A Moral Dilemma Posed 
by Medical Technology 

Supportive and resuscitative devices, such as the respirator, found in the typical modern intensive care 
unit provide a useful starting point for illustrating how technology has rendered medical morality more 
complex and problematic. Devices of this kind allow clinicians to sustain respiration and circulation in 
patients who have suffered massive brain damage and total permanent loss of brain function. These 
technologies force us to ask: precisely when does a human life end? When is a human being indeed dead? 
This is not the straightforward factual matter it may appear to be. All of the relevant facts may show that 
the patient's brain has suffered injury grave enough to destroy its functioning forever. The facts may 
show that such an individual's circulation and respiration would permanently cease without artificial 
support. Yet these facts do not determine whether treating such an individual as a corpse is morally 
appropriate. To know this, it is necessary to know or perhaps to decide on those features of living persons 
that are essential to their status as "living persons." It is necessary to know or decide which human 
qualities, if irreparably lost, make an individual identical in all morally relevant respects to a corpse. 
Once those qualities have been specified, deciding whether total and irreparable loss of brain function 
constitutes death becomes a straightforward factual matter. Then, it would simply have to be determined 
if such loss itself deprives the individual of those qualities. If it does, the individual is morally identical 
to a corpse. If not, then the individual must be regarded and treated as a living person. 

The traditional criterion of death has been irreparable cessation of heart beat, respiration, and blood 
pressure. This criterion would have been quickly met by anyone suffering massive trauma to the brain 
prior to the development of modem supportive technology. Such technology allows indefinite artificial 
maintenance of circulation and respiration and, thus, forestalls what once was an inevitable consequence 
of severe brain injury. The existence and use of such technology therefore challenges the traditional 
criterion of death and forces us to consider whether continued respiration and circulation are in them- 
selves sufficient to distinguish a living individual from a corpse. Indeed, total and irreparable loss of brain 
function, referred to as "brainstem death," "whole brain death," and, simply, "brain death," has been 
widely accepted as the legal standard for death. By this standard, an individual in a state of brain death 
is legally indistinguishable from a corpse and may be legally treated as one even though respiratory and 
circulatory functions may be sustained through the intervention of technology. Many take this legal 
standard to be the morally appropriate one, noting that once destruction of the brain stem has occurred, 
the brain cannot function at all, and the body's regulatory mechanisms will fail unless artificially sustained. 
Thus mechanical sustenance of an individual in a state of brain death is merely postponement of the 
inevitable and sustains nothing of the personality, character, or consciousness of the individual. It is 
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merely the mechanical intervention that differentiates such an individual from a corpse and a mechan- 
ically ventilated corpse is a corpse nonetheless. 

Even with a consensus that brainstem death is death and thus that an individual in such a state is 
indeed a corpse, hard cases remain. Consider the case of an individual in a persistent vegetative state, 
the condition known as "neocortical death." Although severe brain injury has been suffered, enough 
brain function remains to make mechanical sustenance of respiration and circulation unnecessary. In a 
persistent vegetative state, an individual exhibits no purposeful response to external stimuli and no 
evidence of self-awareness. The eyes may open periodically and the individual may exhibit sleep-wake 
cycles. Some patients even yawn, make chewing motions, or swallow spontaneously. Unlike the complete 
unresponsiveness of individuals in a state of brainstem death, a variety of simple and complex responses 
can be elicited from an individual in a persistent vegetative state. Nonetheless, the chances that such an 
individual will regain consciousness virtually do not exist. Artificial feeding, kidney dialysis, and the like 
make it possible to sustain an individual in a state of neocortical death for decades. This sort of condition 
and the issues it raises were exemplified by the famous case of Karen Ann Quinlan. James Rachels (1986) 
provided the following description of the situation created by Quinlan's condition: 

In April 1975, this young woman ceased breathing for at least two 15-minute periods, for reasons that 
were never made clear. As a result, she suffered severe brain damage, and, in the words of the attending 
physicians, was reduced to a "chronic vegetative state" in which she "no longer had any cognitive 
function." Accepting the doctors' judgment that there was no hope of recovery, her parents sought 
permission from the courts to disconnect the respirator that was keeping her alive in the intensive care 
unit of a New Jersey hospital. 

The trial court, and then the Supreme Court of New Jersey, agreed that Karen's respirator could be 
removed. So it was disconnected. However, the nurse in charge of her care in the Catholic hospital 
opposed this decision and, anticipating it, had begun to wean her from the respirator so that by the 
time it was disconnected she could remain alive without it. So Karen did not die. Karen remained 
alive for ten additional years. In June 1985, she finally died of acute pneumonia. Antibiotics, which 
would have fought the pneumonia, were not given. 

If brainstem death is death, is neocortical death also death? Again, the issue is not a straightforward 
factual matter. For, it too, is a matter of specifying which features of living individuals distinguish them 
from corpses and so make treatment of them as corpses morally impermissible. Irreparable cessation of 
respiration and circulation, the classical criterion for death, would entail that an individual in a persistent 
vegetative state is not a corpse and so, morally speaking, must not be treated as one. The brainstem death 
criterion for death would also entail that a person in a state of neocortical death is not yet a corpse. On 
this criterion, what is crucial is that brain damage be severe enough to cause failure of the body's regulatory 
mechanisms. 

Is an individual in a state of neocortical death any less in possession of the characteristics that 
distinguish the living from cadavers than one whose respiration and circulation are mechanically main- 
tained? Of course, it is a matter of what the relevant characteristics are, and it is a matter that society 
must decide. It is not one that can be settled by greater medical information or more powerful medical 
devices. Until society decides, it will not be clear what would count as beneficent or nonmaleficent 
treatment of an individual in a state of neocortical death. 

190.3 Euthanasia 

A long-standing issue in medical ethics, which has been made more pressing by medical technology, is 
euthanasia, the deliberate termination of an individual's life for the individual's own good. Is such an 
act ever a permissible use of medical resources? Consider an individual in a persistent vegetative state. 
On the assumption that such a state is not death, withdrawing life support would be a deliberate 
termination of a human life. Here a critical issue is whether the quality of a human life can be so low or 
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so great a liability to the individual that deliberately taking action to hasten death or at least not to 
postpone death is morally defensible. Can the quality of a human life be so low that the value of extending 
its quantity is totally negated? If so, then Western medicine's traditional commitment to providing benefits 
and avoiding harm would seem to make cessation of life support a moral requirement in such a case. 

Consider the following hypothetical version of the kind of case that actually confronts contemporary 
patients, their families, health-care workers, and society as a whole. Suppose a middle-aged man suffers a 
brain hemorrhage and loses consciousness as a result of a ruptured aneurysm. Suppose that he never regains 
consciousness and is hospitalized in a state of neocortical death, a chronic vegetative state. He is maintained 
by a surgically implanted gastronomy tube that drips liquid nourishment from a plastic bag directly into 
his stomach. The care of this individual takes seven and one-half hours of nursing time daily and includes 
(1) shaving, (2) oral hygiene, (3) grooming, (4) attending to his bowels and bladder, and so forth. 

Suppose further that his wife undertakes legal action to force his care givers to end all medical 
treatment, including nutrition and hydration, so that complete bodily death of her husband will occur. 
She presents a preponderance of evidence to the court to show that her husband would have wanted this 
result in these circumstances. 

The central moral issue raised by this sort of case is whether the quality of the individual's life is 
sufficiently compromised by neocortical death to make intentioned termination of that life morally 
permissible. While alive, he made it clear to both family and friends that he would prefer to be allowed 
to die rather than be mechanically maintained in a condition of irretrievable loss of consciousness. 
Deciding whether the judgment in such a case should be allowed requires deciding which capacities and 
qualities make life worth living, which qualities are sufficient to endow it with value worth sustaining, 
and whether their absence justifies deliberate termination of a life, at least when this would be the wish 
of the individual in question. Without this decision, the traditional norms of medical ethics, beneficence 
and nonmaleficence, provide no guidance. Without this decision, it cannot be determined whether 
termination of life support is a benefit or a harm to the patient. 

An even more difficult type of case was provided by the case of Elizabeth Bouvia. Bouvia, who had 
been a lifelong quadriplegic sufferer of cerebral palsy, was often in pain, completely dependent upon 
others, and spent all of her time bedridden. Bouvia, after deciding that she did not wish to continue such 
a life, entered Riverside General Hospital in California. She desired to be kept comfortable while starving 
to death. Although she remained adamant during her hospitalization, Bouvia's requests were denied by 
hospital officials with the legal sanction of the courts. 

Many who might believe that neocortical death renders the quality of life sufficiently low to justify 
termination of life support, especially when this agrees with the individual's desires, would not arrive at 
this conclusion in a case like Bouvia's. Whereas neocortical death completely destroys consciousness and 
makes purposive interaction with the individual's environment impossible, Bouvia was fully aware and 
mentally alert. She had previously been married and had even acquired a college education. Televised 
interviews with her portrayed a very intelligent person who had great skill in presenting persuasive 
arguments to support her wish not to have her life continued by artificial means of nutrition. Nonetheless, 
she judged her life to be of such low quality that she should be allowed to choose to deliberately starve 
to death. Before the existence of life support technology, maintenance of her life against her will might 
not have been possible at all and at least would have been far more difficult. 

Should Elizabeth Bouvia's judgment have been accepted? Her case is more difficult than the care of a 
patient in a chronic vegetative state because, unlike such an individual, she was able to engage in 
meaningful interaction with her environment. Regarding an individual who cannot speak or otherwise 
meaningfully interact with others as nothing more than living matter, as a "human vegetable," is not 
especially difficult. Seeing Bouvia this way is not easy. Her awareness, intelligence, mental acuity, and 
ability to interact with others means that although her life is one of discomfort, indignity, and complete 
dependence, she is not a mere "human vegetable." 

Despite the differences between Bouvia's situation and that of someone in a state of neocortical death, 
the same issue is posed. Can the quality of an individual's life be so low that deliberate termination is 
morally justifiable? How that question is answered is a matter of what level of quality of life, if any, is 
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taken to be sufficiently low to justify deliberately acting to end it or deliberately failing to extend it. If 
there is such a level, the conclusion that it is not always beneficent or even nonmaleficent to use 
life-support technology must be accepted. 

Another important issue here is respect for individual autonomy. For the cases of Bouvia and the 
hypothetical instance of neocortical death discussed above, both concern voluntary euthanasia, that is, 
euthanasia voluntarily requested by the patient. A long-standing commitment, vigorously defended by 
various schools of thought in Western moral philosophy, is the notion that competent adults should be 
free to conduct their lives as they please as long as they do not impose undeserved harm on others. Does 
this commitment entail a right to die? Some clearly believe that it does. If one owns anything at all, surely 
one owns one's life. In the two cases discussed above, neither individual sought to impose undeserved 
harm on anyone else, nor would satisfaction of their wish to die do so. What justification can there be 
then for not allowing their desires to be fulfilled? 

One plausible answer is based upon the very respect of individual autonomy at issue here. A necessary 
condition, in some views, of respect for autonomy is the willingness to take whatever measures are 
necessary to protect it, including measures that restrict autonomy. An autonomy- respecting reason offered 
against laws that prevent even competent adults from voluntarily entering lifelong slavery is that such 
an exercise of autonomy is self-defeating and has the consequence of undermining autonomy altogether. 
By the same token, an individual who acts to end his own life thereby exercises his autonomy in a manner 
that places it in jeopardy of permanent loss. Many would regard this as justification for using the coercive 
force of the law to prevent suicide. This line of thought does not fit the case of an individual in a persistent 
vegetative state because his/her autonomy has been destroyed by the circumstances that rendered him/her 
neocortically dead. It does fit Bouvia's case though. Her actions indicate that she is fully competent and 
her efforts to use medical care to prevent the otherwise inevitable pain of starvation is itself an exercise 
of her autonomy. Yet, if allowed to succeed, those very efforts would destroy her autonomy as they destroy 
her. On this reasoning, her case is a perfect instance of limitation of autonomy being justified by respect 
for autonomy and of one where, even against the wishes of a competent patient, the life-saving power 
of medical technology should be used. 

Active Versus Passive Euthanasia 

Discussions of the morality of euthanasia often distinguish active from passive euthanasia in light of the 
distinction made between killing a person and letting a person die, a distinction that rests upon the 
difference between an act of commission and an act of omission. When failure to take steps that could 
effectively forestall death results in an individual's demise, the resultant death is an act of omission and 
a case of letting a person die. When a death is the result of doing something to hasten the end of a 
person's life (giving a lethal injection, for example), that death is caused by an act of commission and is 
a case of killing a person. When a person is allowed to die, death is a result of an act of omission, and 
the motive is the person's own good, the omission is an instance of passive euthanasia. When a person 
is killed, death is the result of an act of commission, and the motive is the person's own good, the 
commission is an instance of active euthanasia. 

Does the difference between passive and active euthanasia, which reduces to a difference in how death 
comes about, make any moral difference? It does in the view of the American Medical Association. In a 
statement adopted on December 4, 1973, the House of Delegates of the American Medical Association 
asserted the following (Rachels, 1978): 

The intentional termination of the life of one human being by another — mercy killing — is contrary 
to that for which the medical profession stands and is contrary to the policy of the American Medical 
Association (AMA). 

The cessation of extraordinary means to prolong the life of the body where there is irrefutable evidence 
that biological death is imminent is the decision of the patient and immediate family. The advice of 
the physician would be freely available to the patient and immediate family. 
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In response to this position, Rachels (1978) answered with the following: 

The AMA policy statement isolates the crucial issue very well, the crucial issue is "intentional termi- 
nation of the life of one human being by another." But after identifying this issue and forbidding 
"mercy killing," the statement goes on to deny that the cessation of treatment is the intentional 
termination of a life. This is where the mistake comes in, for what is the cessation of treatment in 
those circumstances (where the intention is to release the patient from continued suffering), if it is 
not "the intentional termination of the life of one human being by another?" 

As Rachels correctly argues, when steps that could keep an individual alive are omitted for the person's 
own good, this omission is as much the intentional termination of life as taking active measures to cause 
death. Not placing a patient on a respirator due to a desire not to prolong suffering is an act intended 
to end life as much as the administration of a lethal injection. In many instances the main difference 
between the two cases is that the latter would release the individual from his pain and suffering more 
quickly than the former. Dying can take time and involve considerable pain even if nothing is done to 
prolong life. Active killing can be done in a manner that causes death painlessly and instantly. This 
difference certainly does not render killing, in this context, morally worse than letting a person die. 
Insofar as the motivation is merciful (as it must be if the case is to be a genuine instance of euthanasia) 
because the individual is released more quickly from a life that is disvalued than otherwise, the difference 
between killing and letting one die may provide support for active euthanasia. According to Rachels 
(1978), the common rejoinder to this argument is the following: 

The important difference between active and passive euthanasia is that in passive euthanasia the doctor 
does not do anything to bring about the patient's death. The doctor does nothing and the patient dies 
of whatever ills already afflict him. In active euthanasia, however, the doctor does something to bring 
about the patient's death: he kills the person. The doctor who gives the patient with cancer a lethal 
injection has himself caused his patient's death; whereas if he merely ceases treatment, the cancer is 
the cause of death. 

According to this rejoinder, in active euthanasia someone must do something to bring about the 
patient's death, and in passive euthanasia the patient's death is caused by illness rather than by anyone's 
conduct. Surely this is mistaken. Suppose a physician deliberately decides not to treat a patient who has 
a routinely curable ailment and the patient dies. Suppose further that the physician were to attempt to 
exonerate himself by saying, "I did nothing. The patient's death was the result of illness. I was not the 
cause of death." Under current legal and moral norms, such a response would have no credibility. As 
Rachels (1978) notes, "it would be no defense at all for him to insist that he didn't do anything. He would 
have done something very serious indeed, for he let his patient die." 

The physician would be blameworthy for the patient's death as surely as if he had actively killed him. 
If causing death is justifiable under a given set of circumstances, whether it is done by allowing death to 
occur or by actively causing death is morally irrelevant. If causing someone to die is not justifiable under 
a given set of circumstances, whether it is done by allowing death to occur or by actively causing death 
is also morally irrelevant. Accordingly, if voluntary passive euthanasia is morally justifiable in the light 
of the duty of beneficence, so is voluntary active euthanasia. Indeed, given that the benefit to be achieved 
is more quickly realized by means of active euthanasia, it may be preferable to passive euthanasia in some 
cases. 

Involuntary and Non- Voluntary Euthanasia 

An act of euthanasia is involuntary if it hastens the individual's death for his own good but against his 
wishes. To take such a course would be to destroy a life that is valued by its possessor. Therefore, it is no 
different in any morally relevant way from unjustifiable homicide. There are only two legitimate reasons 
for hastening an innocent person's death against his will: self-defense and saving the lives of a larger number 
of other innocent persons. Involuntary euthanasia does not fit either of these justifications. By definition, 
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it is done for the good of the person who is euthanized and for self-defense or saving innocent others. 
No act that qualifies as involuntary euthanasia can be morally justifiable. 

Hastening a person's death for his own good is an instance of non-voluntary euthanasia when the 
individual is incapable of agreeing or disagreeing. Suppose it is clear that a particular person is sufficiently 
self-conscious to be regarded a person but cannot make his wishes known. Suppose also that he is suffering 
from the kind of ailment that, in the eyes of many persons, makes one's life unendurable. Would hastening 
his death be permissible? It would be if there were substantial evidence that he has given prior consent. 
This person may have told friends and relatives that under certain circumstances efforts to prolong his 
life should not be undertaken or continued. He might have recorded his wishes in the form of a Living 
Will (below) or on audio- or videotape. Where this kind of substantial evidence of prior consent exists, 
the decision to hasten death would be morally justified. A case of this scenario would be virtually a case 
of voluntary euthanasia. 

But what about an instance in which such evidence is not available? Suppose the person at issue has 
never had the capacity for competent consent or dissent from decisions concerning his life. It simply 
cannot be known what value the individual would place on his life in his present condition of illness. 
What should be done is a matter of what is taken to be the greater evil — mistakenly ending the life of 
an innocent person for whom that life has value or mistakenly forcing him to endure a life that he 
radically disvalues. 

To My Family, My Physician, My Clergyman, and My Lawyer: 

If the time comes when I can no longer take part in decisions about my own future, let this statement stand as 
testament of my wishes: If there is no reasonable expectation of my recovery from physical or mental disability, 

I, , request that I be allowed to die and not be kept alive by artificial means 

or heroic measures. Death is as much a reality as birth, growth, maturity, and old age — it is the one certainty. I 
do not fear death as much as I fear the indignity of deterioration, dependence, and hopeless pain. I ask that 
drugs be mercifully administered to me for the terminal suffering even if they hasten the moment of death. 

This request is made after careful consideration. Although this document is not legally binding, you who care 
for me will, I hope, feel morally bound to follow its mandate. I recognize that it places a heavy burden of 
responsibility upon you, and it is with the intention of sharing that responsibility and of mitigating any feelings 
of guilt that this statement is made. 

Signed: 

Date: 



Witnessed by: 



Living Will statutes have been passed in at least 35 states and the District of Columbia. For a Living 
Will to be a legally binding document, the person signing it must be of sound mind at the time the will 
is made and shown not to have altered his opinion in the interim between the signing and his illness. 
The witnesses must not be able to benefit from the individual's death. 

Should Voluntary Euthanasia be Legalized? 

The recent actions of Dr. Kavorkian have raised the question: "Should voluntary euthanasia be legalized?" 
Some argue that even if voluntary euthanasia is morally justifiable, it should be prohibited by social 
policy nonetheless. According to this position, the problem with voluntary euthanasia is its impact on 
society as a whole. In other words, the overall disutility of allowing voluntary euthanasia outweighs the 
good it could do for its beneficiaries. The central moral concern is that legalized euthanasia would 
eventually erode respect for human life and ultimately become a policy under which "socially undesirable" 
persons would have their deaths hastened (by omission or commission). The experience of Nazi Germany 
is often cited in support of this fear. What began there as a policy of euthanasia soon became one of 
eliminating individuals deemed racially inferior or otherwise undesirable. The worry, of course, is that 
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what happened there can happen here as well. If social policy encompasses efforts to hasten the deaths 
of people, respect for human life in general is eroded and all sorts of abuses become socially acceptable, 
or so the argument goes. 

No one can provide an absolute guarantee that the experience of Nazi Germany would not be repeated, 
but there is reason to believe that its likelihood is negligible. The medical moral duty of beneficence 
justifies only voluntary euthanasia. It justifies hastening an individual's death only for the individual's 
benefit and only with the individual's consent. To kill or refuse to save people judged socially undesirable 
is not to engage in euthanasia at all and violates the medical moral duty of nomaleficence. As long as 
only voluntary euthanasia is legalized, and it is clear that involuntary euthanasia is not and should never 
be, no degeneration of the policy need occur. Furthermore, such degeneration is not likely to occur if 
the beneficent nature of voluntary euthanasia is clearly distinguished from the maleficent nature of 
involuntary euthanasia and any policy of exterminating the socially undesirable. Euthanasia decisions 
must be scrutinized carefully and regulated strictly to ensure that only voluntary cases occur, and severe 
penalties must be established to deter abuse. 
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191.1 Introduction 

The number and sophistication of new medical devices is transforming modern medicine at a rate never 
experienced before. These developments have saved the lives of many more patients and improved their 
quality of life drastically. Artificial joint replacements alone have transformed the field of orthopaedic 
surgery, and over 250,000 total hip and total knees are implanted annually in the United States. Ventricular 
assistive devices (VASs) have extended lives by years, lives that before would have certainly ended at the 
point the device would have been needed. An estimated two to three million artificial or prosthetic parts, 
manufactured by hundreds of different companies, are implanted in America each year. The massive 
production of such devices is not only big science but big business. 

New medical devices, however, require thorough testing for safety and efficacy as well as submission 
for approval by the federal Food and Drug Administration (FDA) before being put on the market for 
public use. Testing of a new product takes considerable time and expense and is not without problems 
beyond those of a technical nature. This paper considers the ethical questions that arise when the demands 
of science, economics, and progress are not entirely compatible with issues raised about the rights and 
obligations toward human beings and animals. 

One of the first levels of testing where ethical debate is more prominent is at the stage when the 
biomedical scientist is faced with the need to use animal subjects. 1 Over the last 30 years there has been 
a growing debate over whether or not the use of experimental animals is even appropriate. Some animal 
rights activists have made violent protests and vandalized research facilities where animal experimentation 
takes place. Increased public sensitivity did help promote an effort by the government, as well as the 
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scientific community, to regulate the use of animal subjects and to educate the public as to the importance 
of such research to the health and well being of both animals and humans. As a result efforts to promote 
the humane use of experimental animals that are used to advance knowledge of biomedical sciences are 
to be supported. 

Inevitably human beings are involved during the final stages of testing medical devices or systems. 
While this imposes numerous ethical concerns and has stirred much discussion, clinical trials are nec- 
essary as the alternative would mean an end of learning anything new for the betterment of medical 
science and continued use of unsupported practices based on conjecture. 2 

Consequently, engineers involved with the development and design of medical technology need to 
become familiar with various aspects of clinical trials and animal research as well as the ethical issues 
that they raise. Normally, conduction of human experimentation is not a part of the training of an 
engineer, 3 nor are complications presented by ethical concerns a traditional part of engineering educa- 
tion. 4 In this chapter various ethical concerns are examined that are essential elements in the animal 
testing and the clinical trial of any new treatment modality. 

191.2 Clinical Trials 

The Reason for Clinical Trials 

Clinical trials are designed to ascertain the effectiveness and safety of a new medical device as compared 
with established medical practice. This form of rigorous scientific investigation in a controlled environ- 
ment for the assessment of new treatment modalities is superior to the is forum of private opinion and 
individual chance taking. Holding the practice of medicine at the status quo and discontinuing all 
innovative work would be the only way to prevent exposing patients to some unforeseen risks in new 
treatments that come with the promise of improved care. Yet, even this does not protect patients from 
uncontrolled experimentation as conditions change even if the practice of medicine stands still. The 
changing effectiveness of antibiotics as well as the increase in antibiotic resistant strains through unin- 
formed overuse of such drugs is a perfect example of how even standard care held stationary can lead 
to a decline rather than just a status quo in level of medical care. If we wish to continue to seek out new 
ways to conquer disease or even maintain a current level of care, then we are essentially forced to decide 
on a manner in which human beings are to shoulder the risks involved. 

Total Mastectomies for Stage I and II breast tumors and extracranial-intracranial anastomosis for 
internal carotid atherosclerosis were once routinely practiced. However, randomized trials have brought 
the necessity of these surgical procedures under closer scrutiny. Certain procedures which are continually 
used, such as the now twenty-year-old use of obstetric ultrasonography, are losing support in the battle 
to cut medical costs as these do not have the benefit of randomized clinical trials to prove their value. 
Answers are needed to questions concerning health and economic risks and benefits, and the burden of 
false positive and negative results, especially in this time of multiple options in a profit-driven atmosphere, 
along with dwindling resources and government aid to the sick, old, and poor. 5 

A clinical trial is the most reasonable means to test a device as well as control risks and prevent abuse 
of human subjects because of the following six major factors: ( 1 ) A limited number of closely monitored 
subjects in a controlled environment are in a safer situation than are the same subjects unobserved in 
relation to one another within the larger population, where the same kinds of risks are imposed by 
uncertainty about the efficacy of new devices. (2) Clinical trials give conclusive answers to important 
medical inquiries that otherwise could only be answered by guessing. Medical decisions based on proven 
results are certainly superior to those dependent on untested clinical opinion. One author has stated that 
"trials were introduced because personal opinion was so notoriously fragile, biased, and unreliable." 6 
Risks would indeed be greater and potential for harm magnified if doctors are made to make decisions 
in an environment of general uncertainty and when medical products available are unsupervised by 
review boards. In an uncontrolled situation, the individual practitioner can be influenced by motives of 
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economic profit, the need to appear knowledgeable and abreast of the new, or by high-pressure sales- 
manship. In the zeal to argue for individual rights in clinical trials, worry should be placed on how those 
same rights are threatened in an environment without objective controls. (3) A society that restricts and 
oversees the advancement of medicine through regulated human experimentation will prevent the sub- 
jection of people to needless risks brought on by a plurality of devices that may cause harm and offer 
only a maintenance of the status quo as a benefit. There are a plentitude of types of redundant consumer 
goods on the market today, e.g., soft drinks that offer multiple ways to relieve one condition — thirst. 
However, in medicine the concerns are different. The onslaught of numerous types of drugs to relieve 
nausea during pregnancy should be limited to the testing of a few medicines that offer the greatest benefit. 
Control of the market place in medicine through government or private testing of the few most promising 
forms of treatment is a much safer environment than an extensive supermarket full of many possibilities 
and many potential risks. (4) Clinical trials advance expedient corroboration of medical theories so that 
research can be channeled in directions that show meaningful results. Promising research is rapidly 
pinpointed, and harmful products are removed from the hiding place of private opinion and promotional 
tactics. (5) Clinical trials are the answer to the moral imperative to thoroughly test all new medical 
devices. Such testing is deficient without a controlled study on human subjects. No researcher can state 
confidently that a product is safe and effective for human use without such a test. (6) All clinical trials 
are evaluated by specialized committees formed for the objective of supervising the ethical conduct of 
investigators using human beings as subjects. In this manner, individual rights are protected and ethical 
guidelines effectively imposed in a manner superior to what could be expected in the isolated world of 
private clinical practice, with its competition for patients and pressures from manufacturers. 

Dilemmas Presented by Clinical Trials 

The Problem of Informed Consent 

One of the most controversial issues generated by clinical trials is that of informed consent. 7 Informed 
consent protects certain human rights, such as the patient's freedom to decide what risks to take with 
his/her own body, the right to the truth from the doctor in the doctor-patient relationship, and a just 
distribution of goods in accordance with a standard of equity and access to redress for undeserved harm. 
These values cannot be sacrificed for any sort of anticipated benefit from research. "The loss of such 
values is so harmful that benefits become meaningless." 8 

Clinical Trials and the Doctor-Patient Relationship 

Another issue currently debated is the conflict of clinical trials with the therapeutic obligation. Some 
authors argue that we must face the fact that if we are to expand the knowledge needed for obtaining 
high-quality treatment, we must sacrifice our therapeutic obligation. There are others who take a more 
apprehensive view of clinical trials, stating that trials on healthy subjects are condemned by the Nurem- 
burg Code, the Tokyo Declaration, and the Helsinki Declaration of the World Medical Association. 9 

However, outright condemnation of experimentation on healthy human subjects ignores the vital need 
for progress in preventive as well as remedial medical treatment. For example, the development and use 
of vaccines carry minimal but real risks to their recipients. Yet few dispute that vaccine research using 
human subjects is morally justified and may even be compulsory, despite the reality that persons can 
and do die from experimental as well as FDA approved vaccines. The control of crippling and deadly 
diseases, and their eventual elimination (i.e., small pox), is due to the study and implementation of 
vaccines. Those who support clinical trials say that a validated medical practice is a far better alternative 
for both the individual and society as a whole than subjection to treatments whose effectiveness are not 
validated by controlled trials. 

One article offers several ideas for the elicitation of informed consent and assignment to randomized 
groups. The best model suggested for general use begins with selection of eligible patients, who are pre- 
randomized and given the entire protocol with an explanation of benefits and risks of all the options; 
consent is then sought, the patient knowing his or her group assignment. 10 
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Proposed deviations from the above standard could be defended before review boards; for example, 
investigations that would be impeded by the patient knowing his or her group's assignment but that 
promise the patient major benefits. However, the patient should be informed of any such stipulations 
and of the relative risks and benefits. 

Standards of research require further consideration as a consensus needs to be achieved. For example, 
in order to test the effectiveness of transplanted embryonic pig cells into the brain for the treatment of 
Parkinson's disease, a control group is required in which surgical boring of a hole into the skull occurs 
without the addition of any cells. This is to remove any placebo effect or any other unforeseeable effect 
of a mock surgical procedure on Parkinson's disease. However, the question of simply boring a hole does 
subject a human being to a procedure with significant risk with little bases for suspecting a beneficial 
outcome. The design of scientific studies needs to weigh the risks to the human subjects against the need 
for scientific purity. Such opinions have been voiced by Arthur Caplan, director of the University of 
Pennsylvania's Center for Biomedical Ethics who recently told the Boston Herald that, "striving for 
scientific accuracy is a commendable goal, but asking someone to have a hole drilled in their head for 
no purpose is putting science ahead of the subject's interest." 

There has been much effort particularly in recent years toward the protection of individual rights. 
This is not surprising in this age of rising autonomy and declining paternalism. This concern has led to 
the consideration of whether or not treatment should be changed mid-course in an experiment due to 
the "appearance" of a tendency toward one result as opposed to another. At some point in collecting 
data, the experimenter may begin to surmise a particular outcome or even come to expect it (although 
he/she may not be certain). In these circumstances, the question arises as to whether the individual 
subject should be granted the hypothetical benefit of the "good guess," or should the guess be treated as 
merely a guess and, hence, a gamble. To do so would mean that the best treatment for the individual 
might not be one that has been proven valid. In fact, such a choice implies that the best treatment is a 
perpetual gamble. Nonetheless, treatment methods that maximize the good guess have been presented. 11 

Another issue that has surfaced from the debate on informed consent is the idea of the "therapeutic 
misconception," which refers to an unyielding expectation of personalized care on the part of patients 
during a clinical trial. This finding shows a need for better communication between the patient and the 
doctor. However, such "false expectations" could be due to a fundamental trust on the patient's part 
believing that through cooperation with the physician in research, personal health needs will eventually 
be met according to the best available data at any given time. Perhaps, this notion should be addressed 
during a clinical trial. 12 

A local newspaper reported that some companies are enticing private physicians to register patients 
for their studies that have significant fees. What used to be the domain of academic researchers motivated 
by the drive for new discoveries, fame, and career advancement is now a multibillion dollar industry 
with numerous companies working with thousands of doctors in private practice having a profound 
impact on the doctor-patient relationship. Often the patient is unaware that significant amounts of money 
are involved in their recruitment for a study. 13 In addition to the expected problems of human research 
that have been debated in the past, the influence of a growing industry invites the need for continued 
inspection and control not only for the prevention of abuse of human subjects but for even the possible 
effect such methods of recruiting could have on scientific integrity. 

The Need for Double-Blind Trials 

The double-blind study has magnified the issues concerning clinical trials already mentioned. Obviously, 
this type of trial means that both the physicians and the patients involved will not be given full information 
about the experiment. The double-blind study is the best safeguard against biased results. Indeed, the 
purpose of the double-blind trial is to bring about a treatment plan based on objective fact rather than 
on biased personal belief or guess work. Even well intentioned and honest researchers can fall victim to 
seeing their data too subjectively. For instance, an investigator may have a vested interest in the study 
because of personal prestige, financial gain, or merely normal enthusiasm and faith in his own work. 14 
These represent obstacles to objective results in research that are not blind. The originator and/or sponsor 
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of a project may suffer from a conflict of interest when either becomes directly involved in a study. The 
double-blind approach protects against such conflict. Although some investigators have argued that it is 
unethical for them to deprive control subjects of their product because they believe so strongly in its 
efficacy, personal conviction of the usefulness of one's own work is not a good ethical reason for failing 
to thoroughly test a new device for safety and effectiveness. The moral imperative to thoroughly test is 
a basic concept to be followed to achieve a standing of good biomedical engineering. 15 

The biomedical engineer must face the current debate over the use of medical technology in accordance 
with the ethical, professional, and scientific imperative to thoroughly test his or her innovation. It is now 
time for biomedical engineers to stand up and support the sound moral use of clinical trials for the 
purposes of (1) scientific credibility of the biomedical engineering field, (2) protection of the individual 
from improper medical care due to unsupervised, unethical market-place forms of human research, and 
(3) the promotion of medical care based on sound reasoning and scientific fact. These principles are at 
the heart of the professionalism required of biomedical engineering research. 16 

191.3 Animal Experimentation 
Animal Testing 

Animal experimentation is an important step in the development of new implants and devices to 
determine their safety and effectiveness prior to their use in humans. There is no current alternative to 
the use of animal models to evaluate the biocompatibility of new materials and the response by a host. 17 
However, animal rights groups have raised important ethical considerations and have challenged the 
scientific community to justify the use of animals in research. The scientific community should respond 
and debate these issues. They should help to educate the public and increase understanding of the need 
for such research. A more proactive approach in this matter is imperative. It is also important that efforts 
be made by scientists to demonstrate to the public that animal research is done humanely, sparingly, and 
only when alternatives do not exist. 18 

The Need for Animal Research 

Biomedical engineering has brought remarkable advancement to the field of medicine in a very short 
time frame. Just in the last quarter of this century we have seen achievements such as (1) pacemakers, 
(2) total joint replacement, (3) artificial hearts, (4) CAT scan machines, and (5) improved surgical 
techniques made possible through the use of fiber optics, lasers, and ultrasonic devices. Along with these 
remarkable advances came the need for closer monitoring of the safety and effectiveness of inventions 
before their release for public use. More regulations, standards, and testing protocols were devised to 
ensure that each new product is subjected to a uniform system of scrutiny and procedure for approval. 
As part of that process, the testing of a product in animals is a likely necessary step. For example, NIH 
Guidelines for the Physicochemical Characterization of Biomaterials outlines a stepwise process for testing 
blood-contacting devices progressively in animals. A possible hierarchy of testing starting from in vitro 
to in vivo systems may be as follows: 19 

Cell culture cytotoxicity (mouse L929 cell line) 

Hemolysis (rabbit or human blood) 

Mutagenicity [human or other mammalian cells or Ames test (bacterial)] 

Systemic injection acute toxicity (mouse) 

Sensitization (guinea pig) 

Pyrogenicity (limulus amebocyte lysate [LAL] or rabbit) 

Intracutaneous irritation (rat, rabbit) 

Intramuscular implant (rat, rabbit) 

Blood compatibility (rat, dog) 

Long-term implant (rat, rabbit, dog, primate) 
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Animal testing is a necessary means of evaluating a device in the internal environment of the living 
system. The suggestion that this can be replaced by computer modeling can only be made by one unaware 
of the lack of knowledge about in vivo conditions. 20 The nearly inscrutable chemical pathways, the 
complex milieu of high chemical, mechanical and electrical activity, and the incalculable number of 
interactions inside the living organism cannot possibly be modeled theoretically today. Add to this the 
fact that normal conditions are different from disease states such as thrombosis or inflammation, and 
we find ourselves still further removed from the prospects of using simulation methods. Yet to thoroughly 
rule out both long and short-term failure of these products, they must be tested sufficiently to rule out 
risks of toxicity and even possible carcinogenic effects over many years. In response to questions raised 
about animal research, the American Association for the Advancement of Science issued a resolution in 
1990 that supports the use of animals in research while emphasizing that experimental animals be treated 
humanely. 

Regulations/Guidelines Related to Animal Research 

A biomedical engineer must become familiar with regulations with regard to animal research. The 
National Research Council published a Guide for the Care and Use of Laboratory Animals initially as early 
as 1963 and most recently in 1996. 

In addition to becoming familiar with this Guide, it is important to be aware of all applicable federal, 
state, and local laws, regulations, and policies such as the Animal Welfare Regulations and the Public 
Health Service Policy on Humane Care and Use of Laboratory Animals. 22 

The general objective is to have a cogent reason for conducting a particular experiment. It must be 
important to the health and well-being of humans or animals. 23 It must be demonstrated that alternative 
means would not achieve the necessary goal. The choice of species must be shown to be essential and 
that species lower on the evolutionary ladder would not be suitable. An effort must be made to minimize 
the number of animals required and to prevent unnecessary duplication of tests. Pain and discomfort 
must be minimized unless important to the conduction of the experiment. Standards must be followed 
concerning (1) space allotment, (2) type of confinement, (3) availability of food and water, (4) care in 
transit, and (5) periods of exercise. The experiment must have clear limits and be performed under the 
close supervision of individuals appropriately trained. Provisions for veterinary care, environmental 
conditions, and euthanasia must also be made. If palliation of discomfort or pain must be withheld, a 
detailed explanation must be made available. 

In 1985 congress passed The Food Security Act which amended the Animal Welfare Act of 1966. This 
required that research institutions have a committee of no less than three persons who are qualified to 
monitor animal care and practices in experimentation. 17 These persons are to represent society's interest in 
the proper treatment of animal subjects. The committee requires one veterinarian and one other person 
not connected with the institution. This committee must inspect all animal research and care facilities at 
the institution twice a year. The institution should in turn keep a report of each inspection on file for 3 years 
and report any information that notes violation of legal standards on the treatment of animals to federal 
officials. The committee must review specific areas of treatment of experimental animals, including pain 
management, veterinary care, and pre- and post-surgical care. A single animal must not be subjected to 
major survival surgery more than once unless justified scientifically. Dogs must be exercised and envi- 
ronmental conditions must be conducive to psychological well-being of primates. Note that policies by 
the National Science Foundation now extend to all vertebrate animals which is broader than the Animal 
Welfare Act that did not include (1) rats, (2) mice, (3) farm animals, and (4) birds. 

The Public Debate 

The debate about animal experimentation is marred by extremes on both sides of the issue. 24 Scientists 
have been physically threatened and assaulted and their laboratories vandalized by extremist animal rights 
groups. 25 Unwarranted charges that scientists only conduct animal research for personal gain and that 
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these experiments are cruel and unnecessary demonstrate the extent of misinformation that exists for 
the public. 

The actions of irresponsible activists must not obscure the serious philosophical issues raised by animal 
rights advocates. Some of them argue that there exists no justification for the claim that animals can be 
exploited for human benefit. This line of reasoning is that animals have rights equal to that of humans 
and as animals cannot give informed consent for an experiment they should be precluded from use. This 
represents a radical change that would have consequences beyond the use of animals in science. When 
animals are used as beasts of burden for example, should this be considered slavery? What of the use of 
animals for food, clothing, or even simply in sport. One author noted that a 1990 study showed that 
although 63% of literature advocating animal rights addresses only their use in research, such use annually 
is only 0.003% of the number of animals used as food. 26 What about protection against pests in the home 
as well as in agriculture? Even the ownership of pets may be called into question, as a case of imprison- 
ment. Of all possible targets for basing a case against human exploitation of animals, the use of them in 
medical experiments is the least worthy. Scientists not only make the least use of animals but do so under 
strict regulation and with all attempts being made to minimize their use. If we are seriously going to 
elevate animals to a moral standing equal to human beings, then the research laboratory is not the 
appropriate place to start. The burden of explaining how all future interaction should take place with 
the rest of the animal kingdom lies with those who make such a claim. 

The other extreme however, is the claim by some scientists that animals have no moral standing or 
intrinsic value. A report by the National Research Council in 1988 on the Use of Laboratory Animals in 
Biomedical and Behavioral Research says "Our society does, however, acknowledge that living things have 
inherent value." 27 This is consistent with society's tendency to treat animals differently than inanimate 
objects. Even a mouse merits higher consideration than a stone. That animals have value, moral standing, 
or even rights is not equivalent to the suggestion that they are equal to human beings in their value, 
standing, and rights. 

Humans are not the only beneficiaries of research on animals. Veterinary medicine would suffer 
without such research and the care of animals has significantly improved as a result of responsible animal 
experimentation. Today, human psychoactive drugs are being used for pets with behavioral problems. 28 

Conclusion 

While there is continued consensus within the scientific community on the need for animal research, 
there is also increased sensitivity and awareness of the need for humane treatment of animals and the 
intrinsic value and moral standing of non-human animals. This has lead to improved guidelines and 
regulation on such use as well as much needed discussion on the purpose and ethics of animal research. 
While the debate will certainly continue, the use of violence, slander, and media sensationalism should 
be discouraged. This will only lead to further polarization of extreme views. Constructive and responsible 
discussion will help bring about a consensus on this very important matter that concerns the health and 
welfare of animals and humans alike. 
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Responsibility for regulating medical devices falls to the Food and Drug Administration (FDA) under 
the Medical Device Amendment of 1976. This statute requires approval from the FDA before new devices 
are marketed and imposes requirements for the clinical investigation of new medical devices on human 
subjects. Although the statute makes interstate commerce of an unapproved new medical device generally 
unlawful, it provides an exception to allow interstate distribution of unapproved devices in order to 
conduct clinical research on human subjects. This investigational device exemption (IDE) can be obtained 
by submitting to the FDA "a protocol for the proposed clinical testing of the device, reports of prior 
investigations of the device, certification that the study has been approved by a local institutional review 
board, and an assurance that informed consent will be obtained from each human subject" (Bronzino et al., 
1990a, b). 

With respect to clinical research on humans, the FDA distinguishes devices into two categories: devices 
that pose significant risk and those that involve insignificant risk. Examples of the former included 
orthopedic implants, artificial hearts, and infusion pumps. Examples of the latter include various dental 
devices and daily-wear contact lenses. Clinical research involving a significant risk device cannot begin 
until an institutional review board (IRB) has approved both the protocol and the informed consent form 
and the FDA itself has given permission. This requirement to submit an IDE application to the FDA is 
waived in the case of clinical research where the risk posed is insignificant. In this case, the FDA requires 
only that approval from an IRB be obtained certifying that the device in question poses only insignificant 
risk. In deciding whether to approve a proposed clinical investigation of a new device, the IRB and the 
FDA must determine the following (Bronzino et al., 1990a, b): 

1. Risks to subjects are minimized. 

2. Risks to subjects are reasonable in relation to the anticipated benefit and knowledge to be gained. 

3. Subject selection is equitable. 

4. Informed consent materials and procedures are adequate. 

5. Provisions for monitoring the study and protecting patient information are acceptable. 

The FDA allows unapproved medical devices to be used without an IDE in three types of situations: 
emergency use, treatment use, and feasibility studies. 
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192.2 Ethical Issues in Feasibility Studies 

Manufacturers seeking more flexibility in conducting investigations in the early developmental stages of a 
device have submitted a petition to the FDA, requesting that certain limited investigations of significant risk 
devices be subject to abbreviated IDE requirements (Bronzino et al., 1990a, b). In a feasibility study, or 
"limited investigation," human research on a new device would take place at a single institution and 
involve no more than ten human subjects. The sponsor of a limited investigation would be required to 
submit to the FDA a "Notice of Limited Investigation," which would include a description of the device, a 
summary of the purpose of the investigation, the protocol, a sample of the informed consent form, and a 
certification of approval by the responsible IRB. In certain circumstances, the FDA could require additional 
information, or require the submission of a full IDE application, or suspend the investigation (Bronzino 
etal., 1990a, b). 

Investigations of this kind would be limited to certain circumstances: (1) investigations of new uses of 
existing devices, (2) investigations involving temporary or permanent implants during the early developmen- 
tal stages, and (3) investigations involving modification of an existing device (Bronzino et al., 1990a). 

To comprehend adequately the ethical issues posed by clinical use of unapproved medical devices 
outside the context of an IDE, it is necessary to utilize the distinctions between practice, nonvalidated 
practice, and research elaborated in the previous pages. How do those definitions apply to feasibility 
studies? 

Clearly, the goal of this sort of study, i.e., generalizable knowledge, makes it an issue of research rather 
than practice. Manufacturers seek to determine the performance of a device with respect to a particular 
patient population in an effort to gain information about its efficacy and safety. Such information would 
be important in determining whether further studies (animal or human) need to be conducted, whether 
the device needs modification before further use, and the like. The main difference between use of an 
unapproved device in a feasibility study and use under the terms of an IDE is that the former would be 
subject to significantly less intensive FDA review than the latter. This, in turn, means that the responsibility 
for ensuring that use of the device is ethically sound would fall primarily to the IRB of the institution 
conducting the study. 

The ethical concerns posed here are best comprehended with a clear understanding of what justifies 
research. Ultimately, no matter how much basic research and animal experimentation has been conducted 
on a given device, the risks and benefits it poses for humans cannot be adequately determined until it is 
actually used on humans. 

The benefits of research on humans lie primarily in the knowledge that is yielded and the generalizable 
information that is provided. This information is crucial to medical science's ability to generate new 
modes and instrumentalities of medical treatment that are both efficacious and safe. Accordingly, for 
necessary but insufficient condition for experimentation to be ethically sound, it must be scientifically 
sound (Capron, 1978; 1986). 

Although scientific soundness is a necessary condition of ethically acceptable research on humans, it 
is not of and by itself sufficient. Indeed, it is widely recognized that the primary ethical concern posed 
by such investigation is the use of one person by another to gather knowledge or other benefits where 
these benefits may only partly or not at all accrue to the first person. In other words, the human subjects 
of such research are at risk of being mere research resources, as having value only for the ends of the 
research. Research upon human beings runs the risk of failing to respect them as people. The notion that 
human beings are not mere things but entities whose value is inherent rather than wholly instrumental 
is one of the most widely held norms of contemporary Western society. That is, human beings are not 
valuable wholly or solely for the uses to which they can be put. They are valuable simply by being the 
kinds of entities they are. To treat them as such is to respect them as people. 

Respecting individuals as people is generally agreed to entail two requirements in the context of 
biomedical experimentation. First, since what is most generally taken to make human beings people is 
their autonomy — their ability to make rational choices for themselves — treating individuals as people 
means respecting that autonomy. This requirement is met by ensuring that no competent person is 
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subjected to any clinical intervention without first giving voluntary and informed consent. Second, respect 
for people means that the physician will not subject a human to unnecessary risks and will minimize the 
risks to patients in required procedures. 

Much of the ethical importance of the scrutiny that the FDA imposes upon use of unapproved medical 
devices in the context of an IDE derives from these two conditions of ethically sound research. The central 
ethical concern posed by use of medical devices in a feasibility study is that the decreased degree of FDA 
scrutiny will increase the likelihood that either or both of these conditions will not be met. This possibility 
may be especially great because many manufacturers of medical devices are, after all, commercial enter- 
prises, companies that are motivated to generate profit and thus to get their devices to market as soon 
as possible with as little delay and cost as possible. These self-interested motives are likely, at times, to 
conflict with the requirements of ethically sound research and thus to induce manufacturers to fail (often 
unwittingly) to meet these requirements. Note that profit is not the only motive that might induce 
manufacturers to contravene the requirements of ethically sound research on humans. A manufacturer 
may sincerely believe that its product offers great benefit to many people or to a population of especially 
needy people and so from this utterly altruistic motive may be prompted to take shortcuts that compro- 
mise the quality of the research. Whether the consequences being sought by the research are desired for 
reasons of self-interest, altruism, or both, the ethical issue is the same. Research subjects may be placed 
at risk of being treated as mere objects rather than as people. 

What about the circumstances under which feasibility studies would take place? Are these not suffi- 
ciently different from the "normal" circumstances of research to warrant reduced FDA scrutiny? As noted 
above, manufacturers seek to be allowed to engage in feasibility studies in order to investigate new uses 
of existing devices, to investigate temporary or permanent implants during the early developmental stages, 
and to investigate modifications to an existing device. As also noted above, a feasibility study would take 
place at only one institution and would involve no more than ten human subjects. Given these circum- 
stances, is the sort of research that is likely to occur in a feasibility study less likely to be scientifically 
unsound or to fail to respect people in the way that normal research upon humans does in "normal" 
circumstances? 

Such research would be done on a very small subject pool, and the harm of any ethical lapses would 
likely affect fewer people than if such lapses occurred under more usual research circumstances. Yet even if 
the harm done is limited to a failure to respect the ten or fewer subjects in a single feasibility study, the 
harm would still be ethically wrong. To wrong ten or fewer people is not as bad as to wrong in the same 
way more than ten people but it is to engage in wrongdoing nonetheless. In either case, individuals are 
reduced to the status of mere research resources and their dignity as people is not properly respected. 

Are ethical lapses more likely to occur in feasibility studies than in studies that take place within the 
requirements of an IDE? Although nothing in the preceding discussion provides a definitive answer to 
this question, it is a question to which the FDA should give high priority in deciding whether to allow 
this type of exception to IDE use of unapproved medical devices. The answer to this question might be 
quite different when the device at issue is a temporary or permanent implant than when it is an already 
approved device being put to new uses or modified in some way. Whatever the contemplated use under 
the feasibility studies mechanism, the FDA would be ethically advised not to allow this kind of exception 
to IDE use of an unapproved device without a reasonably high level of certainty that research subjects 
would not be placed in greater jeopardy than in "normal" research circumstances. 

192.3 Ethical Issues in Emergency Use 

What about the mechanism for avoiding the rigors of an IDE for emergency use? 

"The FDA has authorized emergency use where an unapproved device offers the only alternative for saving 
the life of a dying patient, hut an IDE has not yet been approved for the device or its use, or an IDE has 
been approved but the physician who wishes to use the device is not an investigator under the IDE 
(Bronzino et al., 1990a, b). 
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Because the purpose of emergency use of an unapproved device is to attempt to save a dying patient's 
life under circumstances where no other alternative is at hand, this sort of use constitutes practice rather 
than research. Its aim is primarily benefit to the patient rather than provision of new and generalizable 
information. Because this sort of use occurs prior to the completion of clinical investigation of the device, 
it constitutes a nonvalidated practice. What does this mean? 

First, it means that while the aim of the use is to save the life of the patient, the nature and likelihood 
of the potential benefits and risks engendered by use of the device are far more speculative than in the 
sort of clinical intervention that constitutes validated practice. In validated practice, thorough investiga- 
tion, including preclinical studies, animals studies, and studies on human subjects of a device has 
established its efficacy and safety. The clinician thus has a well-founded basis upon which to judge the 
benefits and risks such an intervention poses for his patients. 

It is precisely this basis that is lacking in the case of a nonvalidated practice. Does this mean that 
emergency use of an unapproved device should be regarded as immoral? This conclusion would follow 
only if there were no basis upon which to make an assessment of the risks and benefits of the use of the 
device. The FDA requires that a physician who engages in emergency use of an unapproved device must 
"have substantial reason to believe that benefits will exist. This means that there should be a body of pre- 
clinical and animal tests allowing a prediction of the benefit to a human patient" (Bronzino et al., 1990a, b). 

Thus, although the benefits and risks posed by use of the device are highly speculative, they are not 
entirely speculative. Although the only way to validate a new technology is to engage in research on 
humans at some point, not all nonvalidated technologies are equal. Some will be largely uninvestigated, 
and assessment of their risks and benefits will be wholly or almost wholly speculative. Others will at least 
have the support of preclinical and animal tests. Although this is not sufficient support for incorporating 
use of a device into regular clinical practice, it may however represent sufficient support to justify use in 
the desperate circumstances at issue in emergency situations. Desperate circumstances can justify des- 
perate actions, but desperate actions are not the same as reckless actions, hence the ethical soundness of 
the FDA's requirement that emergency use be supported by solid results from preclinical and animal tests 
of the unapproved device. 

A second requirement that the FDA imposes on emergency use of unapproved devices is the expectation 
that physicians "exercise reasonable foresight with respect to potential emergencies and make appropriate 
arrangements under the IDE procedures. Thus, a physician should not "create" an emergency in order to 
circumvent IRB review and avoid requesting the sponsor's authorization of the unapproved use of a device" 
(Bronzino et al, 1990a, b). 

From a Kantian point of view, which is concerned with protecting the dignity of people, it is a 
particularly important requirement to create an emergency in order to avoid FDA regulations which 
prevent the patient being treated as a mere resource whose value is reducible to a service of the clinician's 
goals. Hence, the FDA is quite correct to insist that emergencies are circumstances that reasonable 
foresight would not anticipate. 

Also especially important here is the nature of the patient's consent. Individuals facing death are 
especially vulnerable to exploitation and deserve greater measures for their protection than might oth- 
erwise be necessary. One such measure would be to ensure that the patient, or his legitimate proxy, knows 
the highly speculative nature of the intervention being offered. That is, to ensure that it is clearly 
understood that the clinician's estimation of the intervention's risks and benefits is far less solidly 
grounded than in the case of validated practices. The patient's consent must be based upon an awareness 
that the particular device has not undergone complete and rigorous testing on humans and that estima- 
tions of its potential are based wholly upon preclinical and animal studies. Above all the patient must 
not be lead to believe that there is complete understanding of the risks and benefits of the intervention. 
Another important point here is to ensure that the patient is aware that the options he is facing are not 
simply life or death but may include life of a severely impaired quality, and therefore that even if his life 
is saved, it may be a life of significant impairment. Although desperate circumstance may legitimize 
desperate actions, the decision to take such actions must rest upon the informed and voluntary consent 
of the patient, especially when he/she is an especially vulnerable patient. 
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It is important here for a clinician involved in emergency use of an unapproved device to recognize 
that these activities constitute a form of nonvalidated practice and not research. Hence, the primary 
obligation is to the well-being of the patient. The patient enters into the relationship with the clinician 
with the same trust that accompanies any normal clinical situation. To treat this sort of intervention as 
if it were an instance of research and hence justified by its benefits to science and society would be to 
abuse this trust. 

192.4 Ethical Issues in Treatment Use 

The FDA has adopted regulations authorizing the use of investigational new drugs in certain circumstances 
where a patient has not responded to approved therapies. This "treatment use" of unapproved new drugs 
is not limited to life-threatening emergency situations, but rather is also available to treat "serious" diseases 
or conditions (Bronzino et al., 1990a, b). 

The FDA has not approved treatment use of unapproved medical devices, but it is possible that a 
manufacturer could obtain such approval by establishing a specific protocol for this kind of use within 
the context of an IDE. 

The criteria for treatment use of unapproved medical devices would be similar to criteria for treatment 
use of investigational drugs: (1) the device is intended to treat a serious or life-threatening disease or 
condition, (2) there is no comparable or satisfactory alternative product available to treat that condition, 
(3) the device is under an IDE, or has received an IDE exemption, or all clinical trials have been completed 
and the device is awaiting approval, and (4) the sponsor is actively pursuing marketing approval of the 
investigational device. The treatment use protocol would be submitted as part of the IDE, and would 
describe the intended use of the device, the rationale for use of the device, the available alternatives and 
why the investigational product is preferred, the criteria for patient selection, the measures to monitor the 
use of the device and to minimize risk, and technical information that is relevant to the safety and 
effectiveness of the device for the intended treatment purpose (Bronzino et al., 1990a, b). 

Were the FDA to approve treatment use of unapproved medical devices, what ethical issues would be 
posed? First, because such use is premised on the failure of validated interventions to improve the patient's 
condition adequately, it is a form of practice rather than research. Second, since the device involved in 
an instance of treatment use is unapproved, such use would constitute nonvalidated practice. As such, 
like emergency use, it should be subject to the FDA's requirement that prior preclinical tests and animal 
studies have been conducted that provide substantial reason to believe that patient benefit will result. As 
with emergency use, although this does not prevent assessment of the intervention's benefits and risks 
from being highly speculative, it does prevent assessment from being totally speculative. Here too, 
although desperate circumstances can justify desperate action, they do not justify reckless action. Unlike 
emergency use, the circumstances of treatment use involve serious impairment of health rather than the 
threat of premature death. Hence, an issue that must be considered is how serious such impairment must 
be to justify resorting to an intervention whose risks and benefits have not been solidly established. 

In cases of emergency use, the FDA requires that physicians not use this exception to an IDE to avoid 
requirements that would otherwise be in place. This particular requirement would be obviated in 
instances of treatment use by the requirement that a protocol for such use be previously addressed within 
an IDE. 

As with emergency use of unapproved devices, the patients involved in treatment use would be 
particularly vulnerable patients. Although they are not dying, they are facing serious medical conditions 
and are thereby likely to be less able to avoid exploitation than patients under less desperate circumstances. 
Consequently, it is especially important that patients be informed of the speculative nature of the 
intervention and of the possibility that treatment may result in little or no benefit to them. 
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192.5 The Safe Medical Devices Act 

On November 28, 1991, the Safe Medical Devices Act of 1990 (Public Law 101-629) went into effect. 
This regulation requires a wide range of healthcare institutions, including hospitals, ambulatory-surgical 
facilities, nursing homes, and outpatient treatment facilities, to report information that "reasonably 
suggests" the likelihood that the death, serious injury, or serious illness of a patient at that facility has 
been caused or contributed to by a medical device. When a death is device-related, a report must be 
made directly to the FDA and to the manufacturer of the device. When a serious illness or injury is 
device-related, a report must be made to the manufacturer or to the FDA in cases where the manufacturer 
is not known. In addition, summaries of previously submitted reports must be submitted to the FDA on 
a semiannual basis. Prior to this regulation, such reporting was voluntary. This new regulation was 
designed to enhance the FDA's ability to quickly learn about problems related to medical devices. It also 
supplements the medical device reporting (MDR) regulations promulgated in 1984. MDR regulations 
require that reports of device-related deaths and serious injuries be submitted to the FDA by manufac- 
turers and importers. The new law extends this requirement to users of medical devices along with 
manufacturers and importers. This act represents a significant step forward in protecting patients exposed 
to medical devices. 
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(ICSU) • Biomedical Engineering Societies in Japan ■ BME 
Activities in Australia and New Zealand • Bioengineering in 
Latin America 

A.2 Summary 



Professionals have been defined as an aggregate of people finding identity in sharing values and skills 
absorbed during a common course of intensive training. Parsons [1954] stated that one determines 
whether or not individuals are professionals by examining whether or not they have internalized certain 
given professional values. Friedson [1971] redefined Parson's definition by noting that a professional is 
someone who has internalized professional values and is to be recruited and licensed on the basis of his 
or her technical competence. Furthermore, he pointed out that professionals generally accept scientific 
standards in their work, restrict their work activities to areas in which they are technically competent, 
avoid emotional involvement, cultivate objectivity in their work, and put their clients' interests before 
their own. 

The concept of a profession that manages technology encompasses three occupational models: science, 
business, and profession. Of particular interest in the contrast between science and profession. Science 
is seen as the pursuit of knowledge, its value hinging on providing evidence and communicating with 
colleagues. Profession, on the other hand, is viewed as providing a service to clients who have problems 
they cannot handle themselves. Science and profession have in common the exercise of some knowledge, 
skill, or expertise. However, while scientists practice their skills and report their results to knowledgeable 
colleagues, professionals — such as lawyers, physicians, and engineers — serve lay clients. To protect both 
the professional and the client from the consequences of the layperson's lack of knowledge, the practice 
of the profession is regulated through such formal institutions as state licensing. Both professionals and 
scientists must persuade their clients to accept their findings. Professionals endorse and follow a specific 
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code of ethics to serve society. On the other hand, scientists move their colleagues to accept their findings 
through persuasion [Goodman, 1989]. 

Consider, for example, the medical profession. Its members are trained in caring for the sick, with the 
primary goal of healing them. These professionals not only have a responsibility of the creation, devel- 
opment, and implementation of that tradition, they also are expected to provide a service to the public, 
within limits, without regard of self-interest. To ensure proper service, the profession itself closely 
monitors licensing and certification. Thus medical professionals themselves may be regarded as a mech- 
anism of social control. However, this does not mean that other facets of society are not involved in 
exercising oversight and control over physicians in their practice of medicine. 

Professional Development. One can determine the status of professionalization by noting the occur- 
rence of six crucial events: (1) the first training school, (2) the first university school, (3) the first local 
professional association, (4) the first national professional association, (5) the first state license law, and 
(6) the first formal code of ethics [Wilensky, 1964; Goodman, 1989; Bronzino, 1992]. 

The early appearances of training school and the university affiliation underscore the importance of 
the cultivation of a knowledge base. The strategic innovative role of the universities and early teachers 
lies in linking knowledge to practice and creating a rational for exclusive jurisdiction. Those practitioners 
pushing for prescribed training then form a professional association. The association defines the task of 
the profession: raising the quality of recruits, redefining their function to permit the use of less technically 
skilled people to perform the more routine, less involved tasks, and managing internal and external 
conflicts. In the process, internal conflict may arise between those committed to established procedures 
and newcomers committed to change and innovation. At this stage, some form of professional regulation, 
such as licensing or certification, surfaces because of a belief that it will ensure minimum standards for 
the profession, enhance status, and protect the layperson in the process. 

The latest area of professional development is the establishment of a formal code of ethics, which 
usually includes rules to exclude the unqualified and unscrupulous practitioners, rules to reduce internal 
competition, and rules to protect clients and emphasize the ideal service to society. A code of ethics 
usually comes at the end of the professionalization process. 

In biomedical engineering, all six critical steps mentioned above have been clearly taken. Therefore, 
biomedical engineering is definitely a profession. It is important here to note the professional associations 
across the globe that represent the interest of professionals in the field. 

A.l Biomedical Engineering Societies in the World 

Globalization of biomedical engineering (BME) activities is underscored by the fact that there are several 
major professional BME societies currently operational throughout the world. The various countries and 
continents to have provided concerted "action" groups in biomedical engineering are Europe, the Amer- 
icas, Canada, and the Far East, including Japan and Australia, while all these organizations share in the 
common pursuit of promoting biomedical engineering, all national societies are geared to serving the 
needs of their "local" memberships. The activities of some of the major professional organizations are 
described below. 

American Institute for Medical and Biological Engineering (AIMBE) 

The United States has the largest biomedical engineering community in the world. Major professional 
organizations that address various cross sections of the field and serve over 20,000 biomedical engineers 
include (1) the American College of Clinical Engineering, (2) the American Institute of Chemical Engi- 
neers, (3) The American Medical Informatics Association, (4) the American Society of Agricultural 
Engineers, (5) the American Society for Artificial Internal Organs, (6) the American Society of Mechanical 
Engineers, (7) the Association for the Advancement of Medical Instrumentation, (8) the Biomedical 
Engineering Society, (9) the IEEE Engineering in Medicine and Biology Society, (10) an interdisciplinary 
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Association for the Advancement of Rehabilitation and Assistive Technologies, (11) the Society for 
Biomaterials, (12) Orthopedic Research Society, (13) American Society of Biomechanics, and (14) Amer- 
ican Association of Physicist in Medicine. In an effort to unify all the disparate components of the 
biomedical engineering community in the United States as represented by these various societies, the 
American Institute for Medical and Biological Engineers (AIMBE) was created in 1992. The AIMBE is the 
result of a 3-year effort funded by the National Science Foundation and led by a joint steering committee 
established by the Alliance of Engineering in Medicine and Biology and the U.S. National Committee on 
Biomechanics. The primary goal of AIMBE is to serve as an umbrella organization "for the purpose of 
unifying the bioengineering community, addressing public policy issues, identifying common themes of 
reflection and proposals for action, and promoting the engineering approach in society's effort to enhance 
health and quality of life through the judicious use of technology" [Galletti, 1994]. 

AIMBE serves its role through four working divisions: (1) the Council of Societies, consisting of the 
11 constituent organizations mentioned above, (2) the Academic Programs Council, currently consisting 
of 46 institutional charter members, (3) the Industry Council, and (4) the College Fellows. In addition 
to these councils, there are four commissions, Education, Public Awareness, Public Policy, and Liaisons. 
With its inception in 1992, AIMBE is a relatively young institution trying to establish its identity as an 
umbrella organization for medical and biologic engineering in the United States. As summarized by two 
of the founding officials of the AIMBE, Profs Nerem and Galletti: 

What we are all doing, collectively, is defining a focus for biological and medical engineering. In a 
society often confused by technophobic tendencies, we will try to assert what engineering can do for 
biology, for medicine, for health care and for industrial development, We should be neither shy, nor 
arrogant, nor self-centered. The public has great expectations from engineering and technology in 
terms of their own health and welfare. They are also concerned about side effects, unpredictable 
consequences and the economic costs. Many object to science for the sake of science, resent exaggerated 
or empty promises of benefit to society, and are shocked by sluggish or misdirected flow from basic 
research to useful applications. These issues must be addressed by the engineering and medical com- 
munities. For more information, contact the Executive Office, AIMBE, 1901 Pennsylvania Avenue, 
N.W., Suite 401, Washington DC 20006-3405 (Tel: 202-496-9660; fax: 202-466-8489; email: 
AIMBE@aol.com). 

IEEE Engineering in Medicine and Biology Society (EMBS) 

The Institute of Electrical and Electronic Engineers (IEEE) is the largest international professional organi- 
zation in the world and accommodates 37 different societies under its umbrella structure. Of these 37, the 
Engineering in Medicine and Biology Society represents the foremost international organization serving 
the needs of nearly 8000 biomedical engineering members around the world. The field of interest of the 
EMB Society is application of the concepts and methods of the physical and engineering sciences in biology 
and medicine. Each year, the society sponsors a major international conference while cosponsoring a number 
of theme-oriented regional conferences throughout the world. A growing number of EMBS chapters and 
student clubs across the major cities of the world have provided the forum for enhancing local activities 
through special seminars, symposia, and summer schools on biomedical engineering topics. These are 
supplemented by EMBS's special initiatives that provide faculty and financial subsidies to such programs 
through the society's distinguished lecturer program as well as the society's Regional Conference Committee. 
Other feature achievements of the society include its premier publications in the form of three monthly 
journals ( Transactions on Biomedical Engineering, Transactions on Rehabilitation Engineering, and Transac- 
tions on Information Technology in Biomedicine) and a bi-monthly EMB Magazine (the IEEE Engineering in 
Medicine and Biology Magazine). EMBS is a transnational voting member society of the International 
Federation for Medical and Biological Engineering. For more information, contact the Secretariat, IEEE 
EMBS, National Research Council of Canada, Room 393, Building M-55, Ottawa, Ontario K1A OR8, 
Canada. (Tel: 613-993-4005; fax: 613-954-2216; email: soc.emb@ieee.org). 



© 2000 by CRC Press LLC 



Canadian Medical and Biological Engineering Society 

The Canadian Medical and Biological Engineering Society (CMBES) is an association covering the fields 
of biomedical engineering, clinical engineering, rehabilitation engineering, and biomechanics and biom- 
aterials applications. CMBES is affiliated with the International Federation for Medical and Biological 
Engineering and currently has 272 full members. The society organizes national medical and biological 
engineering conferences annually in various cities across Canada. In addition, CMBES has sponsored 
seminars and symposia on specialized topics such as communication aids, computers, and the handi- 
capped, as well as instructional courses on topics of interest to the membership. To promote the professional 
development of its members, the society as drafted guidelines on education and certification for clinical 
engineers and biomedical engineering technologists and technicians. CMBES is committed to bringing 
together all individuals in Canada who are engaged in interdisciplinary work involving engineering, the 
life sciences, and medicine. The society communicates to its membership through the publication of a 
newsletter as well as recently launched academic series to help nonengineering hospital personnel to gain 
better understanding of biomedical technology. For more information, contact the Secretariat, The Cana- 
dian Medical and Biological Engineering Society, National Research Council of Canada, Room 393, 
Building M-55, Ottawa, Ontario K1A OR8, Canada (Tel: 613-993-1686; fax: 613-954-2216). 

European Society for Engineering in Medicine (ESEM) 

Most European countries are affiliated organizations of the International Federation for Medical and 
Biological Engineering (IFMBE). The IFMBE activities are described in another section of this chapter. 
In 1992, a separate organization called the European Society for Engineering in Medicine (ESEM) was 
created with the objective of providing opportunities for academic centers, research institutes, industry, 
hospitals and other health care organizations, and various national and international societies to interact 
and jointly explore BME issues of European significance. These include (1) research and development, 
(2) education and training, (3) communication between and among industry, health care providers, and 
policymakers, (4) European policy on technology and health care, and (5) collaboration between eastern 
European countries in transition and the western European countries on health care technology, delivery, 
and management. To reflect this goal the ESEM membership constitutes representation of all relevant 
disciplines from all European countries while maintaining active relations with the Commission of the 
European Community and other supranational bodies and organizations. 

The major promotional strategies of the ESEM's scientific contributions include its quarterly journal 
Technology and Health Care, ESEM News, the Society's Newsletter, a biennial European Conference on 
Engineering and Medicine, and various topic-oriented workshops and courses. ESEM offers two classes 
of membership: the regular individual (active or student) membership and an associate grade. The latter 
is granted to those scientific and industrial organizations which satisfy the society guidelines and subject 
to approval by the Membership and Industrial Committees. The society is administered by an Admin- 
istrative Council consisting of 13 members elected by the general membership. For more information, 
contact the Secretary General, European Society for Engineering in Medicine, Institut fur Biomedizinische 
Technik, Seidenstrasse 36, D-70174 Stuttgart, Germany. (Fax: 711-121-2371 

French Groups for Medical and Biological Engineering 

The French National Federation of Bioengineering (Genie Biologique et Medical, GMB) is a multidisci- 
plinary body aimed at developing methods and processes and new biomedical materials in various fields 
covering prognosis, diagnosis, therapeutics, and rehabilitation. These goals are achieved through the 
creation of 10 regional centers of bioengineering, called the poles. The poles are directly involved at all 
levels, from applied research through the industrialization to the marketing of the product. Some of the 
actions pursued by these poles include providing financial seed support for innovative biomedical engi- 
neering projects, providing technological help, advice, and assistance, developing partnerships among 
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universities and industries, and organizing special seminars and conferences. The information dissemi- 
nation of all scientific progress is done through the Journal of Innovation and Technology in Biology and 
Medicine. For more information, contact the French National Federation of Bioengineering, Coordinateur 
de la Federation Francaise des Poles GBM, Pole GBM Aquitaine-Site Bordeaux-Montesquieu, Centre de 
Resources, 33651 Martillac Cedex, France. 

International Federation for Medical and Biological Engineering (IFMBE) 

Established in 1959, the International Federation for Medical and Biological Engineering (IFMBE) is an 
organization made up from an affiliation of national societies including membership of transnational 
organizations. The current national affiliates are Argentina, Australia, Austria, Belgium, Brazil, Bulgaria, 
Canada, China, Cuba, Cyprus, Slovakia, Denmark, Finland, France, Germany, Greece, Hungary, Israel, 
Italy, Japan, Mexico, Netherlands, Norway, Poland, South Africa, South Korea, Spain, Sweden, Thailand, 
United Kingdom, and the United States. The first transnational organization to become a member of the 
federation is the IEEE Engineering in Medicine and Biology Society. At the present time, the federation 
has an estimated 25,000 members from all of its constituent societies. 

The primary goal of the IFMBE is to recognize the interests and initiatives of its affiliated member 
organizations and to provide an international forum for the exchange of ideas and dissemination of 
information. The major IFMBE activities include the publication of the federation's bimonthly journal, 
the Journal of Medical and Biological Engineering and Computing, the MSEC News, establishment of close 
liaisons with developing countries to encourage and promote BME activities, and the organization of a 
major world conference every 3 years in collaboration with the International Organization for Medical 
Physics and the International Union for Physical and Engineering Sciences in Medicine. The IFMBE also 
serves as a consultant to the United Nations Industrial Development Organization and has nongovern- 
mental organization status with the World Health Organization, the United Nations, and the Economic 
Commission for Europe. For more information, contact the Secretary General, International Federation 
for Medical and Biological Engineering, AMC, University of Amsterdam, Meibergdreef 15, 1105 AZ 
Amsterdam, the Netherlands. (Tel: 20-566-5200, ext. 5179; fax 20-691-7233; email: ifmbe@amc.uva.nl) 

International Union for Physics and Engineering Sciences 
in Medicine (IUPESM) 

The IUPESM resulted from the IFMBE's collaboration with the International Organization of Medical 
Physics (IOMP), culminating into the joint organization of the triennial World Congress on Medical 
Physics and Biomedical Engineering. Traditionally, these two organizations held their conferences back 
to back from each other for a number of years. Since both organizations were involved in the research, 
development, and utilization of medical devices, they were combined to form IUPESM. Consequently, 
all members of the IFMBE's national and transnational societies are also automatically members of the 
IUPESM. The statutes of the IUPESM have been recently changed to allow other organizations to become 
members in addition to the founding members, the IOMP and the IFMBE. 

International Council of Scientific Unions (ICSU) 

The International Council of Scientific Unions is nongovernmental organization created to promote inter- 
national scientific activity in the various scientific branches and their applications for the benefit of human- 
ity. ICSU has two categories of membership: scientific academies or research councils, which are national, 
multidisciplinary bodies, and scientific unions, which are international disciplinary organizations. Currently, 
there are 92 members in the first category and 23 in the second. ICSU maintains close working relations 
with a number of intergovernmental and nongovernmental organizations, in particular with UNESCO. In 
the past, a number of international programs have been launched and are being run in cooperation with 
UNESCO. ICSU is particularly involved in serving the interests of developing countries. 
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Membership in the ICSU implies recognition of the particular field of activity as a field of science. 
Although ICSU is heralded as a body of pure scientific unions to the exclusion of cross and multidisci- 
plinary organizations and those of an engineering nature, IUPESM, attained its associate membership 
in the ICSU in the mid-1980s. The various other international scientific unions that are members of the 
ICSU include the International Union of Biochemistry and Molecular Biology (IUBMB), the Interna- 
tional Union of Biological Sciences (IUBS), the International Brain Research Organization (IBRO), and 
the International Union of Pure and Applied Biophysics (IUPAB). The IEEE is an affiliated commission 
of the IUPAB and is represented through the Engineering in Medicine and Biology Society [ICSU Year 
Book, 1994]. For more information, contact the Secretariat, International Council of Scientific Unions, 
51 Boulevard de Montmorency, 75016 Paris, France. (Tel: 1-4525-0329; fax: 1-4288-9431; email: 
icsu@paris7.jussieu.fr) 

Biomedical Engineering Societies in Japan 

The biomedical engineering activities in Japan are promoted through several major organizations: (1) the 
Japan Society of Medical Electronics and Biological Engineering (JSMEBE), (2) the Institute of Electron- 
ics, Information and Communication Engineering (IEICE), (3) the Institute of Electrical Engineers of 
Japan (IEEJ), (4) the Society of Instrument and Control Engineers (SICE), (5) the Society of Biomech- 
anisms of Japan (SBJ), (6) the Japanese Neural network Society (JNNS), (7) Japan Ergonomics Research 
Society (JERS), and (8) the Japan Society of Ultrasonics in Medicine (JSUM). The various special sister 
societies that are affiliated under the auspicies of these organizations mainly focus on medical electronics, 
biocybernetics, neurocomputing, medical and biologic engineering, color media and vision system, and 
biologic and physiologic engineering. The JSMEBE has the most BME concentration, with two confer- 
ences held each year in biomedic engineering and three international journals that publish original peer- 
reviewed papers. The IEICE, which is now 77 years old, is one of the largest international societies, 
constituting about 40,000 members. The aim of the society is to provide a forum for the exchange of 
knowledge on the science and technology of electronics, information, and communications and the 
development of appropriate industry in these fields. 

BME Activities in Australia and New Zealand 

The BME activities in Australia and New Zealand are served by one transnational organization called 
the Australasian College of Physical Scientists and Engineers in Medicine (ACPSEM) covering Australia 
and New Zealand and a national organization called the College of Biomedical Engineers of the Institute 
of Engineers, (CBEIE) serving the Australian member segment. 

The Australasian College of Physical Scientists and Engineers in Medicine was founded in 1977 and 
comprises 6 branches and 339 members. The membership is made up of 76% from Australia, 17% from 
New Zealand, and the rest from overseas. A majority of members are employed in pubic hospitals, with 
most of these in departments of medical physics and clinical engineering. The primary objectives of the 
college are (1) to promote and further the development of the physical sciences and engineering in 
medicine and to facilitate the exchange of information and ideas among members of the college and 
others concerned with medicine and related subjects and (2) to promote and encourage education and 
training in the physical sciences and engineering in medicine. Entry to ordinary membership of the 
college requires applicants to have an appropriate 4-year bachelor's degree and at least 5 years of expe- 
rience as a physical scientist or engineer in a hospital or other approved institution. 

Bioengineering in Latin America 

Latin American countries have demonstrated in the past decade significant growth in their bioengineering 
activities. In terms of IEEE statistics, Latin America has the fastest growing membership rate. Currently, 
there are over 10,000 IEEE members alone in this region, of which about 300 are members of the 
Engineering in Medicine and Biology Society. In an effort to stimulate this growth and promote active 
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international interactions, the presidents of the IEEE, EMBS, and the IFMBE met with representatives 
of biomedical engineering societies from Argentina, Brazil, Chile, Columbia, and Mexico in 1991 [Rob- 
inson, 1991]. This meeting resulted in the formation of an independent Latin American Regional Council 
of Biomedical Engineering, known by its Spanish and portugese acronym as CORAL (Consejo Regional 
de Ingenieria Biomedica para Americana Latino). Both the EMBS and the IFMBE are the founding 
sponsoring members of the CORAL. The main objectives of CORAL are (1) to foster, promote and 
encourage the development of research, student programs, publications, professional activities, and joint 
efforts and (2) to act as a communication channel for national societies within Latin American region 
and to improve communication between societies, laboratories, hospitals, industries, universities, and 
other groups in Latin America and the Caribbean. Since its inception, CORAL has already provided the 
centerpiece for bioengineering activities in Latin America through special concerted scientific meetings 
and closer society interactions both in a national and international sense. For more information, contact 
the Secretary General, CORAL, Centro Investigacion y de Estudios, Avanzados Duel Ipn, Departamento 
Ingenieria Electrica, Seccion Bioelectronica, Av. Instituto Politecnico Nacional 2508, Esg. Av. Ticoman 
07000, Mexico Apartado Postal 14-740, Mexico. 

A.2 Summary 

The field of biomedical engineering, which originated as a professional group on medical electronics in 
the late fifties, has grown from a few scattered individuals to very well-established organization. There 
are approximately 50 national societies throughout the world serving an increasingly growing community 
of biomedical engineers. The scope of biomedical engineering today is enormously diverse. Over the 
years, many new disciplines such as molecular biology, genetic engineering, computer-aided drug design, 
nanotechnology, and so on, which were once considered alien to the field, are now new challenges a 
biomedical engineer faces. Professional societies play a major role in bringing together members of this 
diverse community in pursuit of technology applications for improving the health and quality of life of 
human beings. Intersocietal cooperations and collaborations, both at national and international levels, 
are more actively fostered today through professional organizations such as the IFMBE, AIMBE, CORAL, 
and the IEEE. These developments are strategic to the advancement of the professional status of bio- 
medical engineers. Some of the self-imposed mandates the professional societies should continue to 
pursue include promoting public awareness, addressing public policy issues that impact research and 
development of biologic and medical products, establishing close liaisons with developing countries, 
encouraging educational programs for developing scientific and technical expertise in medical and 
biologic engineering, providing a management paradigm that ensures efficiency and economy of health 
care technology [Wald, 1993], and participating in the development of new job opportunities for bio- 
medical engineers. 
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